NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 69857008 of 9,588 models

    Text To Video

    vrgamedevgirl84/LTX_2.3_Fantasy_Painterly_Style_LoRa

    218
    diffusers
    Zero Shot Classification

    MoritzLaurer/bge-m3-zeroshot-v2.0-c

    217
    14
    transformers
    Image To Text

    Xenova/trocr-small-handwritten

    217
    8
    transformers.js
    Question Answering

    datarpit/distilbert-base-uncased-finetuned-natural-questions

    217
    3
    transformers
    Voice Activity Detection

    videosdk-live/Namo-Turn-Detector-v1-Hindi

    216
    onnxruntime
    Object Detection

    FeurCoubeh/detr-fashionpedia

    216
    transformers
    Zero Shot Image Classification

    AnasMohamed/video-llava

    216
    transformers
    Image To Text

    Darayut/khmer-text-recognition

    216
    transformers
    Image To Text

    mradermacher/Hulu-Med-30A3-i1-GGUF

    216
    transformers
    Audio Classification

    bookbot/distil-ast-audioset

    216
    24
    transformers
    Question Answering

    mradermacher/OpenCerebrum-1.0-7b-SFT-i1-GGUF

    216
    transformers
    Object Detection

    onnx-community/rfdetr_nano-ONNX

    215
    1
    transformers.js
    Image To Text

    phxember/Uni-MuMER-Qwen3-VL-2B

    215
    transformers
    Text To Video

    mradermacher/zen-voyager-GGUF

    215
    transformers
    Text To Video

    Brian9999/game-editing

    215
    4
    diffusers
    Question Answering

    enfantdupeuple/Llama-Open-Finance-8B-Q4_K_M-GGUF

    215
    transformers
    Question Answering

    QuantFactory/Apollo2-9B-GGUF

    215
    2
    Audio To Audio

    nvidia/bigvgan_base_22khz_80band

    214
    PyTorch
    Image Segmentation

    smp-hub/dpt-large-ade20k

    214
    1
    segmentation-models-pytorch
    Image Segmentation

    facebook/sapiens-seg-foreground-1b-torchscript

    214
    3
    sapiens
    Zero Shot Image Classification

    apple/TiC-CLIP-basic-sequential

    214
    2
    tic-clip
    Question Answering

    rajeshthangaraj1/uae_rule_book_QA_assistant

    214
    1
    transformers
    Text To Audio

    sil-ai/dgo-tts-training-data-speecht5-a

    213
    transformers
    Object Detection

    nsugianto/tblstructrecog_finetuned_tbltransstrucrecog_v2_s1_370s

    213
    transformers
    292 / 400