    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 9385–9408 of 10,221 models

    Visual Question Answering

    hop1um/blip-vqa-rad

    24
    transformers
    Unconditional Image Generation

    keras-io/VGG19

    24
    1
    tf-keras
    Video Classification

    OckerGui/videomae-base-finetuned-ASBD_v2

    24
    transformers
    Video Classification

    Awais1718/videomae-base-finetuned-kinetics-finetuned-shoplifting-dataset-2

    24
    transformers
    Visual Question Answering

    MohishKhadse55/majorProject

    24
    transformers
    Unconditional Image Generation

    nroggendorff/nebulae

    24
    diffusers
    Voice Activity Detection

    aitytech/Pyannote-Segmentation-MLX

    23
    mlx
    Document Question Answering

    PrimWong/layout_qa_hparam_tuning

    23
    transformers
    Unconditional Image Generation

    WiNE-iNEFF/Mineskin-Diffusion-v1.0

    23
    5
    diffusers
    Tabular Regression

    ryukkt62/Suncast

    23
    17
    Video Classification

    Abs6187/isl-models

    23
    pytorch
    Video Classification

    anirudhmu/videomae-base-finetuned-soccer-action-recognition2

    23
    1
    transformers
    Video Classification

    TanAlexanderlz/ALL_NoCrop_Aug16F-8B16F-GWlr-cosine

    23
    transformers
    Video Classification

    TanAlexanderlz/RALL_RGBCROP_5e6-poly_test_eval

    23
    transformers
    Zero Shot Classification

    deepanwa/NuNerZero_onnx

    23
    2
    Zero Shot Classification

    mjwong/mcontriever-xnli

    23
    transformers
    Visual Question Answering

    Kevin0217/vilt_finetuned_200

    23
    transformers
    Visual Question Answering

    DeclanBracken/MiniCPM-Llama3-V-2.5-Transcriptor

    23
    transformers
    Visual Question Answering

    Punthon/ic-luvkka

    23
    transformers
    Visual Question Answering

    Nhaass/Qwen3-VL-2B-ChartQA

    23
    2
    transformers
    Visual Question Answering

    Cran-May/Shi-Ci-Vision

    23
    Visual Question Answering

    hf-tiny-model-private/tiny-random-Blip2ForConditionalGeneration

    23
    transformers
    Zero Shot Classification

    AntoineBlanot/flan-t5-xxl-classif-3way

    23
    3
    transformers
    Video Classification

    Peregalli/videomae-base-finetuned-ucf101-subset

    23
    transformers
    Page 392 of 426