
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 6313–6336 of 9,588 models

    Visual Question Answering

    soorism/Qwen3-VL-2B-instruct-SFT-FakeClues

    481
    transformers
    Reinforcement Learning

    Ding-Qiang/ppo-CarRacing-v2

    481
    stable-baselines3
    Image To Text

    EZCon/GLM-OCR-mlx

    481
    1
    mlx
    Robotics

    lerobot-data-collection/folding_final

    480
    4
    lerobot
    Image Segmentation

    facebook/sapiens-seg-1b-torchscript

    480
    5
    sapiens
    Image To Text

    agomberto/trocr-large-handwritten-fr

    479
    2
    transformers
    Image To Text

    badianeai/AnandaSky

    479
    2
    transformers
    Image Feature Extraction

    fushh7/ObjEmbed-2B

    478
    transformers
    Image Feature Extraction

    timm/vit_large_patch16_siglip_384.v2_webli

    478
    timm
    Object Detection

    mosesb/best-comic-panel-detection

    477
    10
    ultralytics
    Zero Shot Image Classification

    timm/ViT-L-16-SigLIP2-512

    476
    3
    open_clip
    Image Feature Extraction

    nvidia/MambaVision-S-1K

    476
    11
    transformers
    Image To Text

    Kansallisarkisto/multicentury-htr-model

    475
    1
    Image Segmentation

    facebook/maskformer-swin-large-ade

    474
    58
    transformers
    Text To Video

    Isi99999/Wan2.1-T2V-14B

    473
    1
    diffusers
    Image To Text

    Xenova/trocr-small-printed

    473
    5
    transformers.js
    Zero Shot Image Classification

    timm/vit_base_patch16_plus_clip_240.laion400m_e32

    472
    open_clip
    Object Detection

    aliciapiedrafita/yolo_finetuned_fruits

    471
    transformers
    Object Detection

    javillo-ur/yolo_finetuned_raccoon

    471
    1
    transformers
    Zero Shot Image Classification

    laion/CLIP-convnext_large_d_320.laion2B-s29B-b131K-ft

    471
    3
    open_clip
    Zero Shot Image Classification

    jrheiner/thesis-clip-geoloc-country

    471
    1
    transformers
    Image To Text

    sashakunitsyn/vlrm-blip2-opt-2.7b

    471
    19
    transformers
    Image Segmentation

    Xenova/segformer-b2-finetuned-ade-512-512

    469
    transformers.js
    Zero Shot Image Classification

    woweenie/open-clip-vit-h-nsfw-finetune

    469
    27
    open_clip
    Page 264 of 400