NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 29052928 of 9,002 models

    Text To Speech

    IndexTeam/IndexTTS-2

    18K
    691
    Automatic Speech Recognition

    mlx-community/whisper-large-v3-mlx

    18K
    76
    mlx
    Image Segmentation

    nvidia/segformer-b3-finetuned-ade-512-512

    18K
    14
    transformers
    Any To Any

    Qwen/Qwen3-Omni-30B-A3B-Captioner

    18K
    224
    transformers
    Feature Extraction

    Xenova/gte-small

    18K
    23
    transformers.js
    Image Classification

    timm/regnety_008.pycls_in1k

    18K
    timm
    Visual Question Answering

    Salesforce/blip-vqa-capfilt-large

    18K
    53
    transformers
    Sentence Similarity

    Octen/Octen-Embedding-0.6B

    17K
    34
    sentence-transformers
    Feature Extraction

    unslothai/azure

    17K
    transformers
    Fill Mask

    mental/mental-bert-base-uncased

    17K
    55
    transformers
    Audio Classification

    Jzuluaga/accent-id-commonaccent_ecapa

    17K
    17
    speechbrain
    Image Classification

    Hemg/AI-VS-REAL-IMAGE-DETECTION

    17K
    3
    transformers
    Image To Image

    lllyasviel/control_v11p_sd15_inpaint

    17K
    132
    diffusers
    Zero Shot Image Classification

    facebook/metaclip-2-worldwide-huge-quickgelu

    17K
    18
    transformers
    Text Classification

    alunadiderot/setfit-e5-base-category-classifier_v2

    17K
    1
    setfit
    Text Classification

    leolee99/PIGuard

    17K
    7
    transformers
    Image Classification

    timm/convnextv2_tiny.fcmae_ft_in22k_in1k_384

    17K
    4
    timm
    Fill Mask

    Rostlab/prot_bert_bfd

    17K
    17
    transformers
    Image Classification

    microsoft/dit-base-finetuned-rvlcdip

    17K
    37
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv5_mobile_rec

    17K
    11
    PaddleOCR
    Image Classification

    timm/convnext_base.fb_in22k_ft_in1k_384

    17K
    timm
    Sentence Similarity

    hkunlp/instructor-base

    17K
    120
    sentence-transformers
    Feature Extraction

    rrivera1849/LUAR-CRUD

    17K
    4
    transformers
    Image Segmentation

    ZhengPeng7/BiRefNet-portrait

    17K
    14
    birefnet
    122 / 376