NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 69616984 of 9,588 models

    Zero Shot Classification

    Xenova/DeBERTa-v3-base-mnli

    223
    transformers.js
    Zero Shot Classification

    Sahajtomar/German_Zeroshot

    222
    26
    transformers
    Audio Classification

    MIT/ast-finetuned-audioset-12-12-0.447

    222
    transformers
    Audio To Audio

    speechbrain/sepformer-whamr-enhancement

    221
    14
    speechbrain
    Zero Shot Classification

    mlburnham/Political_DEBATE_base_v1.0

    221
    7
    transformers
    Visual Question Answering

    mradermacher/TreeVGR-7B-CI-GGUF

    221
    1
    transformers
    Object Detection

    nsugianto/detr-resnet50_finetuned_lstabledetv1s9_lsdocelementdetv1type3_session6

    221
    transformers
    Object Detection

    nsugianto/detr-resnet50_finetuned_detrresnet50_lsdocelementdetv1type7_v2_s2_2359s

    221
    transformers
    Question Answering

    tyqiangz/xlm-roberta-base-finetuned-chaii

    221
    transformers
    Question Answering

    twmkn9/distilbert-base-uncased-squad2

    221
    4
    transformers
    Image Feature Extraction

    onnx-community/dinov3-vits16-pretrain-lvd1689m-ONNX-MHA-scores

    220
    4
    transformers.js
    Audio Classification

    bookbot/distil-wav2vec2-adult-child-cls-52m

    220
    transformers
    Zero Shot Classification

    KBLab/megatron-bert-large-swedish-cased-165-zero-shot

    219
    5
    transformers
    Zero Shot Classification

    lighteternal/nli-xlm-r-greek

    219
    2
    transformers
    Image Segmentation

    canvit/probe-ade20k-40k-s512-c32-in21k

    219
    canvit-pytorch
    Image To Text

    MohamedRashad/arabic-small-nougat

    219
    26
    transformers
    Text To Video

    ussoewwin/Wan2.2_T2V_A14B_VACE-test_fp16_GGUF

    218
    comfyui
    Visual Question Answering

    internlm/internlm-xcomposer2d5-7b-4bit

    218
    13
    transformers
    Object Detection

    autolane/rfdetr-alpr

    218
    tensorrt
    Object Detection

    anyformat/doclayout-yolo-docstructbench

    218
    ultralytics
    Image Feature Extraction

    timm/convnext_large_mlp.clip_laion2b_ft_320

    218
    timm
    Audio Classification

    alkiskoudounas/voc2vec

    218
    4
    transformers
    Question Answering

    FreedomIntelligence/Apollo-MoE-0.5B

    218
    3
    Text To Video

    vrgamedevgirl84/LTX_2.3_Clay_Mation_Style_LoRa

    218
    4
    diffusers
    291 / 400