NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 90259048 of 9,588 models

    Visual Question Answering

    google/pix2struct-ai2d-large

    17
    4
    transformers
    Visual Question Answering

    azwierzc/vilt-b32-finetuned-vqa-pl

    17
    transformers
    Visual Question Answering

    unum-cloud/uform-gen-chat

    17
    18
    transformers
    Visual Question Answering

    LeroyDyer/_Spydaz_Web_AI_LlavaNext

    17
    1
    transformers
    Visual Question Answering

    Foreshhh/Qwen2-VL-7B-SafeRLHF

    17
    3
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-8x7B

    17
    3
    transformers
    Visual Question Answering

    mattia-re-learn/llava-v1.5-13b

    17
    transformers
    Visual Question Answering

    TIGER-Lab/VL-Reasoner-7B

    17
    1
    transformers
    Visual Question Answering

    mncai/hunmin_vlm_235b_v0.11_merged_cua

    17
    3
    transformers
    Visual Question Answering

    OpenDataArena/MMFineReason-2B

    17
    8
    Zero Shot Classification

    knowledgator/gliclass-large-v1.0-lw

    17
    3
    transformers
    Video Classification

    MCG-NJU/videomae-base-short-finetuned-ssv2

    17
    1
    transformers
    Video Classification

    AndresB00157/videomae-base-finetuned-ucf101-subset

    17
    transformers
    Video Classification

    LinStevenn/videomae-base-readminds-assignment

    17
    transformers
    Zero Shot Classification

    mjwong/e5-large-mnli-anli

    17
    transformers
    Document Question Answering

    PrplHrt/LayoutLMv2_hub_500

    16
    transformers
    Tabular Classification

    tiiip/indian-stock-market-analyzer

    16
    Tabular Regression

    pcoloc/autotrain-dragino-7-7-1860763606

    16
    transformers
    Tabular Regression

    jwan2021/autotrain-us-housing-prices-1771761511

    16
    1
    transformers
    Depth Estimation

    KeighBee/coreml-DepthPro

    16
    9
    coreml
    Video Classification

    dsuhcs/video-mae-ollie-kickflip-1

    16
    transformers
    Zero Shot Classification

    mjwong/mcontriever-msmarco-xnli

    16
    transformers
    Zero Shot Classification

    HiTZ/A2T_RoBERTa_SMFA_ACE-arg

    16
    transformers
    Zero Shot Classification

    HiTZ/A2T_RoBERTa_SMFA_WikiEvents-arg_ACE-arg

    16
    1
    transformers
    377 / 400