NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 69136936 of 9,588 models

    Question Answering

    mradermacher/Llama-3.1-8B-Instuct-Uz-GGUF

    236
    2
    transformers
    Object Detection

    RoyRud1902/yolo11n-text

    235
    3
    ultralytics
    Image To Text

    tatsumoto/manga-ocr-base

    235
    transformers
    Audio Classification

    DBD-research-group/AudioProtoPNet-1-BirdSet-XCL

    235
    transformers
    Object Detection

    Genereux-akotenou/yolos-headwear

    235
    transformers
    Image To Text

    PaddlePaddle/el_PP-OCRv5_mobile_rec

    234
    PaddleOCR
    Text To Video

    maxin-cn/Latte-1

    234
    22
    diffusers
    Question Answering

    batterydata/batteryscibert-cased-squad-v1

    234
    transformers
    Image Segmentation

    Ricky06662/Seg-Zero-7B

    233
    4
    transformers
    Audio Classification

    jeromesky/pronunciation_accuracy_v1.0.3

    233
    7
    transformers
    Object Detection

    Ooredoo-Group/ooredoo-stamp-detection

    232
    2
    Zero Shot Image Classification

    facebook/metaclip-2-worldwide-giant-378

    232
    13
    transformers
    Text To Video

    Mark111111111/b4ddie4i-lora

    232
    1
    diffusers
    Audio Classification

    litagin/anime_speech_emotion_classification

    232
    6
    transformers
    Audio Classification

    mispeech/dasheng-0.6B

    232
    4
    transformers
    Reinforcement Learning

    mradermacher/IntelliAsk-Qwen3-32B-450-Merged-GGUF

    232
    transformers
    Object Detection

    DanielArgaiz/yolo_finetuned_fruits

    232
    transformers
    Object Detection

    Adit-jain/soccana

    231
    1
    ultralytics
    Image To Text

    AdithyaSK/Florence-2-large-ft-v

    231
    transformers
    Question Answering

    knowledgator/Llama-encoder-1.0B

    231
    3
    transformers
    Question Answering

    ncduy/xlm-roberta-base-squad2-distilled-finetuned-chaii

    230
    1
    transformers
    Question Answering

    FreedomIntelligence/Apollo-MoE-7B

    230
    10
    Image Segmentation

    ghazishazan/VideoMolmo

    230
    1
    transformers
    Image To Text

    WafaaFraih/blip-roco-radiology-captioning

    230
    3
    transformers
    289 / 400