    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 2,785-2,808 of 9,002 models

    Feature Extraction

    optimum-intel-internal-testing/bge-small-en-v1.5

    21K
    sentence-transformers
    Fill Mask

    flaubert/flaubert_base_uncased

    21K
    3
    transformers
    Any To Any

    marksverdhei/Qwen3-Omni-30B-A3B-FP8

    21K
    2
    transformers
    Tabular Classification

    Prior-Labs/TabPFN-v2-clf

    21K
    81
    tabpfn
    Image Segmentation

    nvidia/segformer-b4-finetuned-cityscapes-1024-1024

    21K
    7
    transformers
    Text To Audio

    stabilityai/stable-audio-open-1.0

    21K
    1,450
    stable-audio-tools
    Text Classification

    pysentimiento/bert-it-sentiment

    21K
    1
    transformers
    Image Classification

    timm/convnext_large_mlp.clip_laion2b_soup_ft_in12k_in1k_320

    21K
    4
    timm
    Sentence Similarity

    sdadas/mmlw-retrieval-roberta-large-v2

    21K
    2
    sentence-transformers
    Image Classification

    google/vit-large-patch16-224

    21K
    46
    transformers
    Sentence Similarity

    FremyCompany/BioLORD-2023-C

    21K
    8
    sentence-transformers
    Feature Extraction

    Xenova/jina-embeddings-v2-base-en

    21K
    8
    transformers.js
    Image Classification

    prithivMLmods/Deep-Fake-Detector-v2-Model

    21K
    37
    transformers
    Robotics

    2toINF/X-VLA-Pt

    21K
    11
    Automatic Speech Recognition

    biodatlab/whisper-th-large-v3-combined

    21K
    10
    transformers
    Image Feature Extraction

    PIA-SPACE-LAB/dinov3-vitl-pretrain-lvd1689m

    21K
    2
    transformers
    Feature Extraction

    perplexity-ai/pplx-embed-v1-4b

    21K
    57
    sentence-transformers
    Image Segmentation

    facebook/mask2former-swin-small-ade-semantic

    20K
    8
    transformers
    Text To Speech

    nari-labs/Dia-1.6B-0626

    20K
    129
    Zero Shot Image Classification

    LanguageBind/LanguageBind_Image

    20K
    11
    transformers
    Fill Mask

    nlpaueb/bert-base-greek-uncased-v1

    20K
    38
    transformers
    Audio Classification

    MelodyMachine/Deepfake-audio-detection-V2

    20K
    18
    transformers
    Image Classification

    timm/vit_base_patch16_224.augreg_in21k_ft_in1k

    20K
    timm
    Text Classification

    Hello-SimpleAI/chatgpt-detector-roberta

    20K
    62
    transformers
    Page 117 of 376