NEWVectors or files. Pick a path.Start →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    13,252 models available

    Showing 36733696 of 13,252 models

    Automatic Speech Recognition

    optimum-internal-testing/tiny-random-whisper

    15K
    transformers
    Text To Speech

    utrobinmv/tts_ru_free_hf_vits_low_multispeaker

    15K
    25
    transformers
    Image Classification

    jedzqg/is_anime_or_real

    15K
    Sentence Similarity

    antoinelouis/colbert-xm

    15K
    70
    colbert-ai
    Translation

    ai4bharat/indictrans2-en-indic-1B

    15K
    56
    transformers
    Text To Video

    zai-org/CogVideoX-5b

    15K
    675
    diffusers
    Zero Shot Image Classification

    Xenova/clip-vit-large-patch14

    15K
    1
    transformers.js
    Image To Image

    starsfriday/Qwen-Image-Edit-2511-Upscale2K

    15K
    19
    diffusers
    Text To Speech

    wasmdashai/vits-ar-sa-A

    15K
    2
    transformers
    Image Classification

    google/vit-base-patch16-384

    15K
    50
    transformers
    Text Classification

    rogue-security/prompt-injection-jailbreak-sentinel-v2

    15K
    36
    transformers
    Text To Speech

    mlx-community/Kokoro-82M-bf16

    15K
    52
    mlx
    Fill Mask

    kuleshov-group/caduceus-ps_seqlen-131k_d_model-256_n_layer-16

    15K
    14
    transformers
    Automatic Speech Recognition

    KBLab/kb-whisper-large

    15K
    62
    transformers
    Feature Extraction

    OpenSearch-AI/Ops-MM-embedding-v1-7B

    15K
    14
    Image To Text

    nvidia/nemotron-ocr-v2

    15K
    206
    Zero Shot Image Classification

    google/siglip2-large-patch16-512

    15K
    22
    transformers
    Image Segmentation

    nvidia/segformer-b0-finetuned-cityscapes-512-1024

    15K
    1
    transformers
    Automatic Speech Recognition

    nguyenvulebinh/wav2vec2-base-vietnamese-250h

    15K
    46
    transformers
    Image Segmentation

    ZhengPeng7/BiRefNet_dynamic

    15K
    10
    birefnet
    Image Classification

    timm/tf_efficientnetv2_l.in21k_ft_in1k

    15K
    2
    timm
    Image Feature Extraction

    py-feat/img2pose

    15K
    1
    py-feat
    Video Classification

    google/videoprism-lvt-base-f16r288

    15K
    15
    videoprism
    Sentence Similarity

    jhgan/ko-sbert-multitask

    15K
    23
    sentence-transformers
    154 / 553