    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 3097-3120 of 9,002 models

    Feature Extraction

    histai/hibou-b

    14K
    17
    transformers
    Zero Shot Classification

    cross-encoder/nli-roberta-base

    14K
    15
    sentence-transformers
    Zero Shot Image Classification

    timm/vit_base_patch16_clip_224.laion400m_e32

    14K
    open_clip
    Image Feature Extraction

    timm/convnext_large_mlp.clip_laion2b_ft_soup_320

    14K
    timm
    Sentence Similarity

    deepvk/USER2-small

    14K
    10
    sentence-transformers
    Image Segmentation

    ZhengPeng7/BiRefNet_lite

    14K
    16
    birefnet
    Automatic Speech Recognition

    optimum-internal-testing/tiny-random-whisper

    14K
    transformers
    Image Classification

    timm/maxvit_tiny_tf_512.in1k

    14K
    timm
    Image To Text

    PaddlePaddle/PP-OCRv4_mobile_det

    14K
    3
    PaddleOCR
    Automatic Speech Recognition

    ivrit-ai/whisper-large-v3-turbo-ct2

    14K
    16
    ctranslate2
    Audio Classification

    rmarcosg/bark-detection-model

    14K
    transformers
    Voice Activity Detection

    nvidia/Frame_VAD_Multilingual_MarbleNet_v2.0

    13K
    40
    nemo
    Sentence Similarity

    sentence-transformers/nli-distilroberta-base-v2

    13K
    1
    sentence-transformers
    Fill Mask

    ehsanaghaei/SecureBERT

    13K
    65
    transformers
    Sentence Similarity

    Alibaba-NLP/gme-Qwen2-VL-2B-Instruct

    13K
    133
    sentence-transformers
    Text To Speech

    pnnbao-ump/VieNeu-TTS-0.3B

    13K
    21
    Sentence Similarity

    embaas/sentence-transformers-multilingual-e5-base

    13K
    8
    sentence-transformers
    Automatic Speech Recognition

    Cnam-LMSSC/wav2vec2-french-phonemizer

    13K
    8
    transformers
    Feature Extraction

    hyp1231/blair-roberta-large

    13K
    2
    transformers
    Text To Speech

    unsloth/orpheus-3b-0.1-ft

    13K
    14
    transformers
    Any To Any

    prithivMLmods/gemma-4-E4B-it-FP8

    13K
    5
    transformers
    Automatic Speech Recognition

    argmaxinc/speakerkit-coreml

    13K
    2
    whisperkit
    Feature Extraction

    skt/kobert-base-v1

    13K
    39
    transformers
    Zero Shot Image Classification

    kakaobrain/align-base

    13K
    31
    transformers
    Page 130 of 376