NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 76577680 of 9,588 models

    Text To Audio

    forkjoin-ai/qwen3-tts-12hz-0.6b-base

    122
    llama-cpp
    Zero Shot Classification

    mjwong/multilingual-e5-large-instruct-xnli-anli

    122
    1
    transformers
    Zero Shot Image Classification

    UCSC-VLAA/ViT-bigG-14-CLIPA-336-datacomp1B

    122
    4
    open_clip
    Text To Audio

    sil-ai/senga-NT-mmsnya-audio-aligned-speecht5

    122
    transformers
    Video Classification

    AdrienB134/videomae-base-finetuned-ucf101-subset

    121
    transformers
    Video Classification

    microsoft/xclip-large-patch14-16-frames

    121
    3
    transformers
    Video Classification

    BlackB/videomae-base-finetuned

    121
    transformers
    Video Classification

    Joy28/videomae-base-finetuned-subset-200epochs

    121
    transformers
    Visual Question Answering

    mradermacher/MemOCR-7B-i1-GGUF

    121
    1
    transformers
    Tabular Classification

    julien-c/wine-quality

    121
    20
    sklearn
    Object Detection

    keremberke/yolov5m-nfl

    121
    3
    yolov5
    Object Detection

    keremberke/yolov5n-garbage

    121
    5
    yolov5
    Object Detection

    keremberke/yolov5n-construction-safety

    121
    18
    yolov5
    Text To Video

    alibaba-pai/Wan2.1-Fun-14B-InP

    121
    43
    videox_fun
    Text To Video

    alibaba-pai/Wan2.1-Fun-V1.1-14B-Control

    121
    25
    videox_fun
    Text To Video

    alibaba-pai/Wan2.1-Fun-V1.1-14B-Control-Camera

    121
    8
    videox_fun
    Audio Classification

    AdonaiHS/distilhubert-finetuned-gtzan

    121
    transformers
    Text To Audio

    AEmotionStudio/audiox-models

    121
    1
    Text To Audio

    Joserzapata/speecht5_finetuned_voxpopuli_nl

    121
    transformers
    Zero Shot Classification

    emrecan/bert-base-turkish-cased-allnli_tr

    121
    1
    transformers
    Object Detection

    mradermacher/Polaris-VGA-0.8B-Post1.0-GGUF

    120
    transformers
    Object Detection

    qualcomm/DETR-ResNet101-DC5

    120
    pytorch
    Object Detection

    goodcasper/kvasir_capsule_only_bbox_itri_official_split

    120
    transformers
    Image Segmentation

    tomascanivari/mask2former-swin-large-coco-instance-finetuned-buildings

    120
    transformers
    320 / 400