NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 92179240 of 10,221 models

    Unconditional Image Generation

    blurbnation/custom-pureland-MT-pipeline

    31
    diffusers
    Document Question Answering

    Nobilis/pokemon-lora

    30
    transformers
    Unconditional Image Generation

    huggingnft/azuki

    30
    2
    transformers
    Unconditional Image Generation

    lilili696969/ddpm-celebahq-finetuned-Vintage-Faces-FFHQAligned-2epochs

    30
    diffusers
    Unconditional Image Generation

    huggingnft/mini-mutants

    30
    1
    transformers
    Table Question Answering

    Yale-LILY/reastap-large-finetuned-wikisql

    30
    1
    transformers
    Table Question Answering

    google/tapas-medium-finetuned-sqa

    30
    1
    transformers
    Depth Estimation

    Onegafer/glpn-nyu-finetuned-diode-230530-193901

    30
    transformers
    Video Classification

    CAIR-HKISI/SurgMotion-vitg-xformer

    30
    3
    pytorch
    Video Classification

    VINAY-UMRETHE/SigMamba-V1-Small

    30
    transformers
    Video Classification

    marekk/video_soccer_goal_detection

    30
    transformers
    Video Classification

    TanAlexanderlz/RALL_RGBCROP_ori16F-8B16F-GACWDlr

    30
    transformers
    Video Classification

    gautamtata/videomae-base-finetuned-kinetics-finetuned-lipsync-subset-1

    30
    3
    transformers
    Text To Audio

    andre-coy/speecht5_tts_tandt

    30
    transformers
    Zero Shot Classification

    emrecan/bert-base-multilingual-cased-snli_tr

    30
    transformers
    Zero Shot Classification

    DAMO-NLP-SG/zero-shot-classify-SSTuning-XLM-R

    30
    10
    transformers
    Zero Shot Classification

    labrat-aiko/nli-popia-v1

    30
    transformers
    Visual Question Answering

    SimulaMet/MedGemma-KvasirVQA-x1-ft

    30
    peft
    Visual Question Answering

    internlm/internlm-xcomposer2d5-ol-7b

    30
    50
    Video Classification

    L1mbo/videomae-base-finetuned-kinetics-finetuned-ucf101-subset-finetuned-ucf101-subset

    30
    transformers
    Video Classification

    microsoft/xclip-base-patch16-hmdb-16-shot

    30
    transformers
    Video Classification

    Ponleur/vivit-MSASL-dataset

    30
    transformers
    Video Classification

    archit11/videomae-base-finetuned-ucfcrime-full

    29
    transformers
    Document Question Answering

    vkrnsn/layoutlmv2-base-uncased_finetuned_docvqa

    29
    transformers
    385 / 426