    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 8,857–8,880 of 9,588 models

    Video Classification

    anirudhmu/videomae-base-finetuned-soccer-action-recognition2

    23
    1
    transformers
    Video Classification

    TanAlexanderlz/ALL_NoCrop_Aug16F-8B16F-GWlr-cosine

    23
    transformers
    Video Classification

    facebook/vjepa2-vitg-fpc32-384-diving48

    23
    7
    transformers
    Video Classification

    TanAlexanderlz/RALL_RGBCROP_5e6-poly_test_eval

    23
    transformers
    Zero Shot Classification

    deepanwa/NuNerZero_onnx

    23
    2
    Zero Shot Classification

    mjwong/mcontriever-xnli

    23
    transformers
    Visual Question Answering

    DeclanBracken/MiniCPM-Llama3-V-2.5-Transcriptor

    23
    transformers
    Visual Question Answering

    Nhaass/Qwen3-VL-2B-ChartQA

    23
    2
    transformers
    Visual Question Answering

    Cran-May/Shi-Ci-Vision

    23
    Visual Question Answering

    GeorgyGUF/INFRL-Qwen2.5-VL-72B-Preview-q8-with-bf16-output-and-bf16-embedding.gguf

    23
    transformers
    Video Classification

    RodrigoFardin/videomae-base-finetuned-dd

    23
    transformers
    Video Classification

    Chloepv/Angelcare-Cosmos-Reason2-8B

    23
    cosmos
    Unconditional Image Generation

    aareblau/diffusers-tutorial-butterflies-32

    23
    diffusers
    Unconditional Image Generation

    heisenberg3376/animal-diffusion-128

    23
    diffusers
    Voice Activity Detection

    kamilakesbi/speaker-segmentation-fine-tuned-callhome-jpn

    22
    transformers
    Document Question Answering

    Nobilis/layoutlmv2-base-uncased_finetuned_docvqa

    22
    transformers
    Unconditional Image Generation

    WiNE-iNEFF/Mineskin-Diffusion-v1.0

    22
    5
    diffusers
    Table Question Answering

    neulab/omnitab-large-finetuned-wtq

    22
    7
    transformers
    Video Classification

    PergaZuZ/videomae-base-finetuned-ucf101-subset

    22
    transformers
    Zero Shot Classification

    labrat-aiko/nli-popia-v1

    22
    transformers
    Zero Shot Classification

    knowledgator/gliclass-base-v2.0-rac-init

    22
    11
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA3-7B-Image

    22
    10
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2d5-ol-7b

    22
    50
    Visual Question Answering

    Ngoac/EraX-VL-2B-V1.5-Q4_K_M-GGUF

    22
    transformers
    Page 370 of 400