Skip to content

models Models documentation

github-actions[bot] edited this page Dec 19, 2023 · 36 revisions

Models

Models in this category


It can be used for multi-class and multi-label multimodal classification tasks, and is capable of handling datasets with features from diverse modes, including ...

  • ocsort_yolox_x_crowdhuman_mot17-private-half

    ocsort_yolox_x_crowdhuman_mot17-private-half model is from OpenMMLab's MMTracking library. This model is <a href="https://github.com/open-mmlab/mmtracking/blob/master/configs/mot/ocsort/metafile.yml#L24" target=...

  • OpenAI-CLIP-Image-Text-Embeddings-ViT-Base-Patch32

    The CLIP model was developed by OpenAI researchers to learn about what contributes to robustness in computer vision tasks and to test the ability of models to generalize to arbitrary image classification tasks in a zero-shot manner. The model uses a ViT-B/32 Transformer architecture as an image...

  • OpenAI-CLIP-ViT-Base-Patch32

    The CLIP model was developed by OpenAI researchers to learn about what contributes to robustness in computer vision tasks and to test the ability of models to generalize to arbitrary image classification tasks in a zero-shot manner. The model uses a ViT-B/32 Transformer architecture as an image...

  • OpenAI-CLIP-ViT-Large-Patch14

    The CLIP model was developed by OpenAI researchers to learn about what contributes to robustness in computer vision tasks and to test the ability of models to generalize to arbitrary image classification tasks in a zero-shot manner. The model uses a ViT-L/14 Transformer architecture as an image...

  • openai-whisper-large

    Whisper is an OpenAI pre-trained speech recognition model with potential applications for ASR solutions for developers. However, due to weak supervision and large-scale noisy data, it should be used with caution in high-risk domains. The model has been trained on 680k hours of audio data represen...

  • openai-whisper-large-v3

    Whisper is a model that can recognize and translate speech using deep learning. It was trained on a large amount of data from different sources and languages. Whisper models can handle various tasks and domains without needing to adjust the model.

Whisper large-v3 is similar to the previous larg...

Clone this wiki locally