NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 33373360 of 9,002 models

    Automatic Speech Recognition

    nvidia/parakeet-tdt-1.1b

    11K
    117
    nemo
    Question Answering

    deepset/xlm-roberta-base-squad2

    11K
    25
    transformers
    Any To Any

    coder3101/gemma-4-E4B-it-heretic

    10K
    23
    transformers
    Sentence Similarity

    ENOSYS/Octen-Embedding-0.6B-750-v1-GGUF

    10K
    2
    sentence-transformers
    Feature Extraction

    tanganke/clip-vit-base-patch32_stanford-cars

    10K
    2
    transformers
    Feature Extraction

    tanganke/clip-vit-base-patch32_eurosat

    10K
    transformers
    Feature Extraction

    tanganke/clip-vit-base-patch32_sun397

    10K
    1
    transformers
    Translation

    Helsinki-NLP/opus-mt-tc-big-sh-en

    10K
    transformers
    Robotics

    lerobot/pi05_base

    10K
    66
    lerobot
    Automatic Speech Recognition

    NbAiLab/wav2vec2-large-danish-npsc-nst

    10K
    2
    transformers
    Question Answering

    armageddon/roberta-large-squad2-covid-qa-deepset

    10K
    transformers
    Feature Extraction

    tanganke/clip-vit-base-patch32_svhn

    10K
    transformers
    Feature Extraction

    tanganke/clip-vit-base-patch32_resisc45

    10K
    transformers
    Image To Text

    PaddlePaddle/korean_PP-OCRv5_mobile_rec

    10K
    13
    PaddleOCR
    Text Classification

    optimum-intel-internal-testing/ov-tiny-random-distilbert

    10K
    transformers
    Image To Image

    1038lab/Qwen-Image-Edit-2511-FP8

    10K
    43
    diffusers
    Image Classification

    google/efficientnet-b2

    10K
    5
    transformers
    Image Classification

    timm/fastvit_t8.apple_in1k

    10K
    2
    timm
    Image Classification

    facebook/convnextv2-atto-1k-224

    10K
    3
    transformers
    Feature Extraction

    tanganke/clip-vit-base-patch32_mnist

    10K
    1
    transformers
    Image Classification

    prithivMLmods/AI-vs-Deepfake-vs-Real-Siglip2

    10K
    2
    transformers
    Automatic Speech Recognition

    MohamedRashad/Arabic-Whisper-CodeSwitching-Edition

    10K
    30
    transformers
    Automatic Speech Recognition

    NbAiLab/nb-wav2vec2-300m-nynorsk

    10K
    transformers
    Video Classification

    MCG-NJU/videomae-small-finetuned-kinetics

    10K
    1
    transformers
    140 / 376