NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 65776600 of 9,588 models

    Reinforcement Learning

    voyzan/poca-SoccerTwos

    343
    ml-agents
    Image Feature Extraction

    DeepGlint-AI/rice-vit-large-patch14-560

    343
    11
    Text To Video

    Skywork/SkyReels-V2-DF-1.3B-540P

    343
    45
    Question Answering

    mfeb/albert-xxlarge-v2-squad2

    342
    2
    transformers
    Zero Shot Image Classification

    timm/resnet50x16_clip.openai

    341
    open_clip
    Zero Shot Image Classification

    qihoo360/fg-clip-large

    341
    13
    transformers
    Image To Text

    manu02/LAnA-MIMIC

    341
    transformers
    Image Feature Extraction

    MiniMaxAI/VTP-Small-f16d64

    341
    14
    transformers
    Question Answering

    mradermacher/Qwen-encoder-0.5B-GGUF

    341
    1
    transformers
    Image To Text

    mradermacher/Hulu-Med-30A3-GGUF

    340
    transformers
    Image Feature Extraction

    nvidia/C-RADIOv2-B

    340
    10
    transformers
    Question Answering

    Trina-QwQ/Trama

    340
    17
    Robotics

    robotics-diffusion-transformer/RDT2-FM

    339
    7
    transformers
    Image To Text

    MohamedRashad/arabic-large-nougat

    339
    17
    transformers
    Text To Video

    zenlm/zen-voyager

    338
    2
    diffusers
    Robotics

    lamborghini3030/my_policy

    338
    lerobot
    Reinforcement Learning

    mradermacher/Tifa-Deepsex-14b-CoT-i1-GGUF

    338
    13
    transformers
    Object Detection

    macpaw-research/yolov11l-ui-elements-detection

    337
    8
    ultralytics
    Audio To Audio

    speechbrain/mtl-mimic-voicebank

    336
    35
    speechbrain
    Question Answering

    fanwu103/distilbert-base-uncased-finetuned-squad

    336
    transformers
    Text To Audio

    ford442/stable-audio-open-1.0

    334
    stable-audio-tools
    Voice Activity Detection

    videosdk-live/Namo-Turn-Detector-v1-Arabic

    334
    onnxruntime
    Image To Text

    fridaycandour/PaddleOCR-VL-1.5-GGUF

    334
    Zero Shot Image Classification

    visheratin/nllb-siglip-mrl-large

    333
    14
    open_clip
    275 / 400