NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 61456168 of 9,588 models

    Image To Text

    PaddlePaddle/devanagari_PP-OCRv3_mobile_rec

    582
    PaddleOCR
    Image Feature Extraction

    timm/convnextv2_large.fcmae

    581
    timm
    Audio Classification

    onnx-community/wav2vec2-base-Speech_Emotion_Recognition-ONNX

    581
    transformers.js
    Depth Estimation

    jingheya/lotus-depth-g-v2-1-disparity

    578
    16
    diffusers
    Object Detection

    guillherms/vision-architecture-analyzer-yolo11-detect

    578
    1
    ultralytics
    Question Answering

    mradermacher/VideoThinker-R1-Bias-3B-GGUF

    577
    transformers
    Any To Any

    OrchardPair/gemma-4-e4b-it-4bit

    577
    mlx
    Summarization

    mradermacher/Medra4b-vis-ep0.4-i1-GGUF

    576
    transformers
    Question Answering

    jamarju/roberta-large-bne-squad-2.0-es

    575
    transformers
    Depth Estimation

    Intel/dpt-beit-base-384

    574
    1
    transformers
    Image Segmentation

    michaelyuanqwq/roboengine-sam

    574
    2
    Image Feature Extraction

    timm/vit_giantopt_patch16_siglip_384.v2_webli

    574
    timm
    Audio To Audio

    aufklarer/DeepFilterNet3-CoreML

    573
    1
    coreml
    Robotics

    yinchenghust/deepthinkvla_base

    573
    transformers
    Image Feature Extraction

    tiiuae/siglino-0.6B

    573
    13
    transformers
    Question Answering

    SmallDoge/Doge-120M-MoE-Instruct

    572
    1
    transformers
    Reinforcement Learning

    mradermacher/DeepHermes-Egregore-8B-131K-i1-GGUF

    572
    1
    transformers
    Any To Any

    openbmb/MiniCPM-o-2_6-gguf

    572
    116
    Image To Text

    PaddlePaddle/PP-OCRv4_mobile_seal_det

    570
    PaddleOCR
    Question Answering

    SmallDoge/Doge-20M-MoE-Instruct

    569
    1
    transformers
    Zero Shot Classification

    cmarkea/distilcamembert-base-nli

    569
    13
    transformers
    Audio Classification

    tiantiaf/whisper-large-v3-speech-flow

    569
    2
    Summarization

    nandakishormpai/t5-small-machine-articles-tag-generation

    568
    transformers
    Image To Text

    microsoft/trocr-large-str

    568
    17
    transformers
    257 / 400