    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 9289-9312 of 10,221 models

    Video Classification

    dvs/videomae-base-finetuned-kinetics-finetuned-movienet-2

    27
    transformers
    Video Classification

    anirudhmu/videomae-base-finetuned-soccer-action-recognitionx2

    27
    transformers
    Video Classification

    codircodir/videomae-base-finetuned-N

    27
    transformers
    Video Classification

    duyngocadn/videomae-base-finetuned-ucf101-subset

    27
    transformers
    Video Classification

    dvs/videomae-base-finetuned-kinetics-finetuned-movienet

    27
    transformers
    Visual Question Answering

    google/pix2struct-infographics-vqa-large

    27
    12
    transformers
    Visual Question Answering

    PengxiangLi/MAT

    27
    2
    Video Classification

    ManuD/videomae-base-finetuned-ucf101-subset

    27
    transformers
    Video Classification

    Chloepv/Angelcare-Cosmos-Reason2-8B

    27
    cosmos
    Video Classification

    Ldrago116/videomae-base-finetuned-ucf101-subset

    27
    transformers
    Unconditional Image Generation

    aareblau/diffusers-tutorial-butterflies-64

    27
    diffusers
    Video Classification

    NiiCole/vivit-b-16x2-kinetics400-finetuned-ucf101-subset

    27
    transformers
    Video Classification

    Prabesh06/vivit-b-16x2-kinetics400-finetuned-NORMALLLLAbnormalVideosOnly

    27
    transformers
    Visual Question Answering

    Ornelas/vilt_finetuned_fashion

    27
    transformers
    Unconditional Image Generation

    ZhangJiaHui0122/ddpm-celebahq-finetuned-butterflies-2epochs

    27
    diffusers
    Visual Question Answering

    nectec/Pathumma-llm-vision-1.0.0

    26
    11
    Visual Question Answering

    Duckq/blip2-opt-2.7b-emotion-llm

    26
    transformers
    Document Question Answering

    jinhybr/OCR-DocVQA-Donut

    26
    13
    transformers
    Unconditional Image Generation

    tensorkelechi/sky_diffuse

    26
    diffusers
    Unconditional Image Generation

    google/ncsnpp-church-256

    26
    3
    diffusers
    Unconditional Image Generation

    hp-l33/ARPG

    26
    2
    diffusers
    Tabular Regression

    jwan2021/autotrain-us-housing-prices-1771761511

    26
    1
    transformers
    Tabular Regression

    prabinpanta0/celsius-to-fahrenheit

    26
    3
    tf-keras
    Depth Estimation

    Onegafer/glpn-nyu-finetuned

    26
    transformers
    388 / 426