
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 8305–8328 of 9,588 models

    Image Feature Extraction

    timm/samvit_large_patch16.sa1b

    58
    timm
    Image Feature Extraction

    timm/convnext_xxlarge.clip_laion2b_rewind

    58
    timm
    Text To Audio

    KMCan/speecht5_finetuned_voxpopuli_nl

    58
    transformers
    Text To Audio

    alakxender/mms-tts-div-finetuned-md-m01

    58
    1
    transformers
    Text To Audio

    JBJoyce/speecht5_finetuned_voxpopuli_sl

    58
    transformers
    Zero Shot Classification

    XManFromXlab/haystack-yaml-RCE

    58
    sentence-transformers
    Zero Shot Classification

    ClaudeYang/awesome_fb_model

    58
    1
    transformers
    Visual Question Answering

    mradermacher/NayanaVQA-GGUF

    57
    transformers
    Image Feature Extraction

    facebook/sapiens-pretrain-1b-torchscript

    57
    sapiens
    Video Classification

    microsoft/xclip-base-patch16-hmdb-4-shot

    57
    1
    transformers
    Video Classification

    KingTechnician/videomae-small-finetuned-kinetics-finetuned-xd-violence

    57
    transformers
    Text To Audio

    Marvis-AI/marvis-tts-250m-v0.1-MLX-4bit

    57
    8
    transformers
    Text To Audio

    mariammohamed00/speecht5_finetuned_arabic_fp16

    57
    transformers
    Text To Audio

    Sonl/speecht5_amharic_addis_explainer

    57
    transformers
    Image Feature Extraction

    SixAILab/nepa-base-patch14-224

    56
    1
    transformers
    Image Feature Extraction

    nvidia/PS3-1.5K-SigLIP

    56
    5
    Video Classification

    Hinata197/videomae-base-finetuned-ucf101-subset

    56
    transformers
    Text To Audio

    Somali-tts/diirow_tts

    56
    transformers
    Text To Audio

    Cyb3rguru/speecht5_finetuned_on_cleaned_female_hausa_001

    56
    transformers
    Text To Audio

    Jawad320/urduttsbyjawad

    56
    transformers
    Text To Audio

    JDhillon/speecht5_tts_lj_speech2

    56
    transformers
    Text To Audio

    Marvis-AI/marvis-tts-250m-v0.1-MLX-8bit

    56
    6
    transformers
    Zero Shot Classification

    MoritzLaurer/xtremedistil-l6-h256-mnli-fever-anli-ling-binary

    56
    3
    transformers
    Zero Shot Classification

    navteca/nli-deberta-v3-xsmall

    56
    1
    transformers
    Page 347 of 400