NEWVectors or files. Pick a path.Start →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    12,152 models available

    Showing 361384 of 12,152 models

    Text To Speech

    ai4bharat/indic-parler-tts

    881K
    234
    transformers
    Audio Classification

    audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim

    880K
    167
    transformers
    Automatic Speech Recognition

    Qwen/Qwen3-ASR-0.6B

    872K
    298
    Sentence Similarity

    sentence-transformers/paraphrase-MiniLM-L3-v2

    863K
    29
    sentence-transformers
    Text Generation

    nvidia/Qwen3.5-397B-A17B-NVFP4

    859K
    100
    Model Optimizer
    Image Segmentation

    ZhengPeng7/BiRefNet

    851K
    588
    birefnet
    Image Text To Text

    deepseek-ai/deepseek-vl2-tiny

    843K
    248
    transformers
    Zero Shot Image Classification

    google/siglip2-base-patch16-naflex

    843K
    30
    transformers
    Image Classification

    timm/tf_efficientnetv2_s.in21k_ft_in1k

    842K
    2
    timm
    Text Generation

    Qwen/Qwen3-4B-Instruct-2507-FP8

    840K
    79
    transformers
    Text Generation

    microsoft/phi-4

    836K
    2,251
    transformers
    Text Generation

    facebook/opt-1.3b

    832K
    184
    transformers
    Text Generation

    nvidia/Kimi-K2.6-NVFP4

    830K
    32
    Model Optimizer
    Image Text To Text

    AxionML/Qwen3.5-9B-NVFP4

    829K
    17
    transformers
    Text Generation

    GSAI-ML/LLaDA-8B-Instruct

    828K
    358
    transformers
    Translation

    Helsinki-NLP/opus-mt-fr-en

    827K
    53
    transformers
    Sentence Similarity

    sentence-transformers/distiluse-base-multilingual-cased-v2

    825K
    209
    sentence-transformers
    Text To Image

    stabilityai/sdxl-turbo

    824K
    2,563
    diffusers
    Text To Speech

    microsoft/VibeVoice-Realtime-0.5B

    823K
    1,232
    transformers
    Fill Mask

    microsoft/deberta-v3-small

    822K
    77
    transformers
    Automatic Speech Recognition

    pyannote/speaker-diarization

    821K
    1,279
    pyannote-audio
    Image Text To Text

    Qwen/Qwen3-VL-8B-Instruct-FP8

    821K
    71
    transformers
    Text Generation

    Qwen/Qwen2.5-32B-Instruct-AWQ

    819K
    101
    transformers
    Text Generation

    prefeitura-rio/Rio-3.0-Open

    818K
    5
    transformers
    16 / 507