
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 9,577–9,600 of 10,221 models

    Zero Shot Classification

    gritli/BioMed-left

    19
    transformers
    Zero Shot Classification

    mjwong/contriever-msmarco-mnli

    19
    transformers
    Zero Shot Classification

    knowledgator/gliclass-base-v1.0-lw

    19
    2
    transformers
    Visual Question Answering

    SergioAnaut/vilt-finetuned-fashion-vqa

    19
    transformers
    Visual Question Answering

    justinj92/phi-35-vision-burberry

    19
    transformers
    Visual Question Answering

    SpringWang08/medical-vqa-soup

    19
    peft
    Visual Question Answering

    BUAADreamer/Yi-VL-6B-hf

    18
    2
    transformers
    Document Question Answering

    AlexSaz/layoutlmv2-base-uncased_finetuned_docvqa

    18
    transformers
    Document Question Answering

    Sharka/CIVQA_DVQA_LayoutXLM

    18
    transformers
    Document Question Answering

    Oksana76B/document-question-answering

    18
    1
    transformers
    Unconditional Image Generation

    commaai/commavq-gpt2m

    18
    9
    transformers
    Table Question Answering

    vahrush/NTB_probe_sec

    18
    transformers
    Tabular Classification

    imodels/figs-compas-recidivism

    18
    1
    sklearn
    Tabular Classification

    matth/flowformer

    18
    7
    transformers
    Tabular Regression

    pcoloc/autotrain-dragino-7-7-max_300m-1861063640

    18
    1
    transformers
    Tabular Regression

    Fatihrizkia/autotrain-xauusdh4timestamp-100145147571

    18
    1
    transformers
    Depth Estimation

    qqceqqq/DepthCrafter

    18
    DepthCrafter
    Video Classification

    Dinh/videomae-base-finetuned-ucf101-subset

    18
    transformers
    Video Classification

    Dijaaa/videomae-base-finetuned-kinetics-finetuned-ucf-crime-subset

    18
    transformers
    Video Classification

    Natali12/videomae-base-finetuned-opportunity-locomotion

    18
    transformers
    Video Classification

    Ptisni/videomae-base-finetuned-ucf101-subset

    18
    transformers
    Video Classification

    ratchy-oak/vivit-b-16x2-kinetics400-finetuned-cctv-surveillance

    18
    1
    transformers
    Video Classification

    Shawon16/ViViT_wlasl_100_200ep_coR_

    18
    transformers
    Video Classification

    Fulwa/videomae-base1-finetuned-ucf101-subset

    18
    transformers
    Page 400 of 426