NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 84018424 of 9,588 models

    Text To Audio

    FarmerlineML/yoruba_tts-2025

    49
    transformers
    Text To Audio

    sil-ai/khw-folkstories-audio-aligned-eng-ipa-speecht5

    49
    transformers
    Image Feature Extraction

    birder-project/rope_i_vit_reg1_t16_pn_npn_avg_c1_pe-spatial

    48
    birder
    Image Feature Extraction

    timm/vit_intern300m_patch14_448.ogvl_dist

    48
    timm
    Video Classification

    shazab/videomae-base-finetuned-ucf_crime

    48
    transformers
    Video Classification

    sirishgam001/videomae-base-finetuned-ucf101-subset

    48
    transformers
    Video Classification

    HimanshuGoyal2004/videomae-base-finetuned-ucf101-subset

    48
    transformers
    Video Classification

    LushoAp/videomae-base-finetuned-ucf101-subset

    48
    transformers
    Text To Audio

    Makaveliai/acestep-v15-sft-turbo_0.5

    48
    1
    transformers
    Zero Shot Classification

    emrecan/convbert-base-turkish-mc4-cased-allnli_tr

    48
    1
    transformers
    Zero Shot Classification

    NDugar/deberta-v2-xlarge-mnli

    48
    transformers
    Zero Shot Classification

    NDugar/v3-Large-mnli

    48
    1
    transformers
    Visual Question Answering

    Swicked86/phi4-mm-gptq

    48
    transformers
    Visual Question Answering

    MariaK/vilt_finetuned_200

    48
    transformers
    Visual Question Answering

    OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B

    48
    8
    transformers
    Unconditional Image Generation

    mahir123456/ddpm-celebahq-finetuned-butterflies-2epochs

    47
    diffusers
    Image Feature Extraction

    SliMM-X/CoMP-SigLIP-So400M

    47
    1
    slimm
    Image Feature Extraction

    p1atdev/siglip2-base-patch16-384-vision

    47
    transformers
    Text To Audio

    AbrorBalxiyev/uzbek-tts-model

    47
    1
    transformers
    Zero Shot Classification

    HeTree/HeCross

    47
    1
    transformers
    Zero Shot Classification

    akiFQC/bert-base-japanese-v3_nli-jsnli

    47
    sentence-transformers
    Image Feature Extraction

    timm/vit_giantopt_patch16_siglip_gap_256.v2_webli

    47
    timm
    Image Feature Extraction

    timm/vit_giantopt_patch16_siglip_gap_384.v2_webli

    47
    timm
    Image Feature Extraction

    mlx-vision/vit_small_patch16_224.dinov3-mlxim

    47
    mlx-image
    351 / 400