NEWVectors or files. Pick a path.Start →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    12,807 models available

    Showing 124 of 12,807 models

    Featured Models

    Benchmarked
    HFVisual Embeddings

    openai/clip-vit-large-patch14

    Contrastive Language-Image Pre-Training for zero-shot visual understanding

    13.9M
    3 benchmarks
    HFVisual Embeddings

    google/siglip-base-patch16-224

    Sigmoid Loss for Language Image Pre-Training, efficient contrastive learning

    1.2M
    3 benchmarks
    HFVisual Embeddings

    google/siglip2-giant-opt-patch16-384

    Multilingual vision-language encoder with dense features and localization

    309K
    2 benchmarks
    HFVisual Embeddings

    facebook/dinov2-large

    Self-supervised vision foundation model producing all-purpose visual features

    2.8M
    2 benchmarks
    PyTorchVisual Embeddings

    facebook/dinov3-large

    Next-generation self-supervised vision model with Gram anchoring and 6.7B scaling

    450K
    1 benchmarks
    HFVisual Embeddings

    laion/CLIP-ViT-bigG-14-laion2B-39B-b160k

    Open-source CLIP trained on 2B image-text pairs at giant scale

    62K
    2 benchmarks
    Sentence Similarity

    sentence-transformers/all-MiniLM-L6-v2

    164.3M
    4,946
    sentence-transformers
    Fill Mask

    google-bert/bert-base-uncased

    41.3M
    2,684
    transformers
    Feature Extraction

    BAAI/bge-small-en-v1.5

    38.7M
    488
    sentence-transformers
    Sentence Similarity

    sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2

    33.6M
    1,267
    sentence-transformers
    Sentence Similarity

    sentence-transformers/all-mpnet-base-v2

    24.0M
    1,306
    sentence-transformers
    Sentence Similarity

    BAAI/bge-m3

    21.7M
    3,111
    sentence-transformers
    Text Generation

    Qwen/Qwen3-0.6B

    17.4M
    1,325
    transformers
    Zero Shot Image Classification

    openai/clip-vit-base-patch32

    15.3M
    959
    transformers
    Fill Mask

    FacebookAI/xlm-roberta-base

    14.6M
    844
    transformers
    Audio Classification

    laion/clap-htsat-fused

    12.6M
    104
    transformers
    Sentence Similarity

    nomic-ai/nomic-embed-text-v1.5

    12.3M
    846
    sentence-transformers
    Text To Speech

    hexgrad/Kokoro-82M

    11.2M
    6,329
    Fill Mask

    FacebookAI/roberta-base

    10.8M
    614
    transformers
    Text Generation

    Qwen/Qwen3-4B

    10.5M
    636
    transformers
    Text Classification

    BAAI/bge-reranker-v2-m3

    10.3M
    1,035
    sentence-transformers
    Text Generation

    openai-community/gpt2

    10.2M
    3,298
    transformers
    Feature Extraction

    BAAI/bge-large-en-v1.5

    10.1M
    683
    sentence-transformers
    Text Generation

    Qwen/Qwen2.5-3B-Instruct

    9.1M
    499
    transformers
    Zero Shot Image Classification

    openai/clip-vit-large-patch14

    8.9M
    2,036
    transformers
    Text Generation

    Qwen/Qwen3-8B

    8.6M
    1,140
    transformers
    Fill Mask

    FacebookAI/roberta-large

    8.6M
    300
    transformers
    Text Generation

    Qwen/Qwen2.5-7B-Instruct

    8.4M
    1,360
    transformers
    Image Text To Text

    google/gemma-4-26B-A4B-it

    8.4M
    1,135
    transformers
    Text Generation

    facebook/opt-125m

    7.5M
    266
    transformers
    1 / 534