NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 40814104 of 9,002 models

    Zero Shot Image Classification

    apple/MobileCLIP-B-OpenCLIP

    5K
    4
    open_clip
    Feature Extraction

    unum-cloud/uform3-image-text-multilingual-base

    5K
    15
    transformers
    Object Detection

    Xenova/yolos-tiny

    5K
    6
    transformers.js
    Automatic Speech Recognition

    facebook/wav2vec2-large-robust-ft-swbd-300h

    5K
    20
    transformers
    Sentence Similarity

    sentence-transformers/bert-large-nli-stsb-mean-tokens

    5K
    3
    sentence-transformers
    Automatic Speech Recognition

    gchhablani/wav2vec2-large-xlsr-gu

    5K
    transformers
    Automatic Speech Recognition

    nvidia/diar_sortformer_4spk-v1

    5K
    137
    nemo
    Image Feature Extraction

    facebook/webssl-dino3b-full2b-224

    5K
    transformers
    Image Classification

    google/mobilenet_v1_0.75_192

    5K
    2
    transformers
    Sentence Similarity

    AITeamVN/Vietnamese_Reranker

    5K
    3
    sentence-transformers
    Image Segmentation

    MykolaL/DelineateAnything

    5K
    4
    ultralytics
    Text To Speech

    onnx-community/chatterbox-multilingual-ONNX

    5K
    48
    chatterbox
    Image Classification

    1aurent/vit_small_patch8_224.lunit_dino

    5K
    3
    timm
    Text To Audio

    ACE-Step/acestep-captioner

    5K
    46
    transformers
    Image To Image

    peteromallet/Qwen-Image-Edit-InSubject

    5K
    86
    diffusers
    Automatic Speech Recognition

    BELLE-2/Belle-whisper-large-v3-zh-punct

    5K
    47
    transformers
    Image To Image

    dx8152/Qwen-Image-Edit-2509-Relight

    5K
    216
    diffusers
    Translation

    Helsinki-NLP/opus-mt-en-ro

    5K
    7
    transformers
    Image Feature Extraction

    facebook/webssl-dino300m-light2b-224

    5K
    3
    transformers
    Image Classification

    facebook/convnextv2-base-1k-224

    5K
    4
    transformers
    Text To Speech

    facebook/mms-tts-spa

    5K
    23
    transformers
    Automatic Speech Recognition

    TalTechNLP/whisper-large-v3-turbo-et-verbatim

    5K
    3
    transformers
    Text To Speech

    parler-tts/parler_tts_mini_v0.1

    5K
    358
    transformers
    Image To Text

    microsoft/git-base-coco

    5K
    21
    transformers
    171 / 376