NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 83298352 of 9,588 models

    Visual Question Answering

    Joe99/visionlanguageTransformer

    56
    transformers
    Visual Question Answering

    Jeney/vilt-b32-finetuned-vqa

    56
    1
    transformers
    Visual Question Answering

    Jayanth9533/YOUR-REPO

    56
    transformers
    Visual Question Answering

    JHhan/vilt_finetuned_200

    56
    transformers
    Unconditional Image Generation

    ZhangJiaHui0122/sd-class-butterflies-32

    55
    diffusers
    Tabular Classification

    Kluuking/autotrain-flight-delay-3621096840

    55
    transformers
    Image Feature Extraction

    kiennt120/dinov3-vitl16-pretrain-lvd1689m

    55
    transformers
    Video Classification

    Jeyseb/videomae-base-finetuned-rwf2000-subset___v1

    55
    transformers
    Text To Audio

    Bagus/speecht5_finetuned_voxpopuli_nl

    55
    transformers
    Zero Shot Classification

    nahiar/zero-shot-classification

    55
    2
    transformers
    Zero Shot Classification

    Xenova/nli-deberta-v3-large

    55
    transformers.js
    Zero Shot Classification

    sileod/mdeberta-v3-base-tasksource-nli

    55
    18
    transformers
    Visual Question Answering

    Jaguar7788/vilt_finetuned_200

    55
    transformers
    Visual Question Answering

    GeorgyGUF/INFRL-Qwen2.5-VL-72B-Preview-ggufs-fully-quantized

    55
    transformers
    Visual Question Answering

    nectec/Pathumma-llm-vision-2.0.0-preview

    54
    Table Question Answering

    vahrush/NTB_probe_sec

    54
    transformers
    Image Feature Extraction

    dflorea/andina-dinov3-vits-triplet

    54
    transformers
    Image Feature Extraction

    timm/vit_so400m_patch16_siglip_gap_512.v2_webli

    54
    1
    timm
    Video Classification

    Sathwik-kom/anomaly-detector-videomae10

    54
    transformers
    Video Classification

    Hemgg/deepfake_model_Video-MAE

    54
    transformers
    Video Classification

    CondadosAI/uniformer_s_k400

    54
    acaua
    Text To Audio

    KHooshanfar/speecht5_tts_fa_kh

    54
    transformers
    Zero Shot Classification

    Xerv-AI/RainDrop-Demo-1

    54
    3
    transformers
    Visual Question Answering

    KFrimps/vilt_finetuned_200

    54
    transformers
    348 / 400