    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    12,807 models available

    Showing 913–936 of 12,807 models

    Text Generation

    Qwen/Qwen2.5-1.5B-Instruct-GGUF

    197K
    116
    Automatic Speech Recognition

    facebook/wav2vec2-conformer-rope-large-960h-ft

    197K
    10
    transformers
    Fill Mask

    microsoft/BiomedNLP-BiomedBERT-base-uncased-abstract-fulltext

    196K
    327
    transformers
    Fill Mask

    distilbert/distilbert-base-cased

    196K
    66
    transformers
    Text Generation

    poolside/Laguna-XS.2

    195K
    294
    transformers
    Automatic Speech Recognition

    jimregan/wav2vec2-large-xlsr-latvian-cv

    195K
    3
    transformers
    Image Text To Text

    google/gemma-4-26B-A4B-it-qat-q4_0-gguf

    195K
    66
    transformers
    Text Generation

    Qwen/Qwen2.5-3B-Instruct-GGUF

    195K
    134
    Object Detection

    facebook/detr-resnet-50

    194K
    954
    transformers
    Text Classification

    ElKulako/cryptobert

    194K
    189
    transformers
    Feature Extraction

    allegro/herbert-base-cased

    194K
    22
    transformers
    Automatic Speech Recognition

    pyannote/speaker-diarization-3.0

    194K
    218
    pyannote-audio
    Sentence Similarity

    dunzhang/stella-mrl-large-zh-v3.5-1792d

    194K
    50
    sentence-transformers
    Text Generation

    entropy/gpt2_zinc_87m

    194K
    4
    transformers
    Automatic Speech Recognition

    facebook/mms-1b-all

    193K
    200
    transformers
    Image Text To Text

    moonshotai/Kimi-VL-A3B-Instruct

    193K
    268
    transformers
    Text To Image

    h94/IP-Adapter-FaceID

    192K
    1,841
    diffusers
    Text Generation

    HuggingFaceTB/SmolLM3-3B-Base

    192K
    159
    transformers
    Feature Extraction

    google/canine-c

    192K
    35
    transformers
    Text To Image

    John6666/nova-furry-xl-il-v120-sdxl

    191K
    5
    diffusers
    Image Classification

    microsoft/resnet-50

    191K
    493
    transformers
    Text To Speech

    sesame/csm-1b

    191K
    2,392
    transformers
    Image Text To Text

    unsloth/Qwen2.5-VL-7B-Instruct-GGUF

    191K
    184
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-4B-Thinking

    191K
    111
    transformers
    Page 39 of 534