    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 6,337 to 6,360 of 9,588 models

    Image To Text

    DunnBC22/trocr-base-printed_captcha_ocr

    469
    9
    transformers
    Text To Audio

    forkjoin-ai/vibevoice-realtime-0.5b

    468
    llama-cpp
    Robotics

    liorbenhorin-nv/groot-libero_spatial-128_20000

    468
    lerobot
    Image Feature Extraction

    facebook/ijepa_vith16_1k

    468
    transformers
    Summarization

    google/pegasus-pubmed

    467
    9
    transformers
    Image To Text

    mradermacher/Perseus-Doc-vl-071225-i1-GGUF

    467
    1
    transformers
    Image Feature Extraction

    timm/vit_giant_patch14_clip_224.laion2b

    467
    timm
    Image Feature Extraction

    OpenGVLab/InternViT-6B-448px-V2_5

    466
    48
    Object Detection

    foduucom/thermal-image-object-detection

    465
    22
    ultralytics
    Image To Text

    mradermacher/Qwen3-VL-8B-Abliterated-Caption-it-i1-GGUF

    465
    5
    transformers
    Depth Estimation

    qualcomm/Midas-V2

    464
    10
    pytorch
    Video Classification

    MCG-NJU/videomae-base-short

    464
    4
    transformers
    Text To Audio

    ylacombe/musicgen-stereo-melody

    463
    transformers
    Robotics

    2toINF/X-VLA-RoboTwin2

    463
    1
    Image Segmentation

    facebook/sapiens-seg-0.3b-torchscript

    463
    sapiens
    Image To Text

    PaddlePaddle/PP-DocLayout-S

    463
    PaddleOCR
    Reinforcement Learning

    mradermacher/P1-30B-A3B-GGUF

    462
    1
    transformers
    Audio Classification

    awsaf49/sonics-spectttra-beta-120s

    462
    Summarization

    IDEA-CCNL/Randeng-Pegasus-238M-Summary-Chinese

    460
    49
    transformers
    Zero Shot Image Classification

    Xenova/siglip-base-patch16-384

    460
    1
    transformers.js
    Image Feature Extraction

    FM4CS/THOR-1.0-base

    460
    terratorch
    Audio Classification

    bookbot/wav2vec2-xls-r-adult-child-cls

    460
    transformers
    Text To Audio

    facebook/magnet-medium-10secs

    459
    9
    audiocraft
    Image To Text

    manu02/LAnA

    459
    1
    transformers
    Page 265 of 400