NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 67696792 of 9,588 models

    Text To Audio

    sil-ai/swh-bible-audio-speecht5

    276
    1
    transformers
    Zero Shot Image Classification

    yyupenn/whyxrayclip

    276
    2
    open_clip
    Text To Video

    vrgamedevgirl84/LTX_2.3_Fantasy_Anime_Style_LoRa

    276
    diffusers
    Object Detection

    melihuzunoglu/human-fall-detection

    275
    3
    ultralytics
    Image To Text

    rajofearth/lfm-ucf-gguf

    275
    1
    Object Detection

    advecino/yolo_finetuned_fruits

    274
    transformers
    Object Detection

    miguelcale04/yolo_finetuned_kangaroo

    274
    transformers
    Image To Text

    PaddlePaddle/RT-DETR-H_layout_17cls

    274
    2
    PaddleOCR
    Audio To Audio

    JusperLee/TIGER-speech-tiny

    273
    1
    Reinforcement Learning

    ZhenghaiXue/Qwen2.5-7B-SimpleTIR

    273
    1
    Image To Text

    mradermacher/Perseus-Doc-vl-0712-i1-GGUF

    272
    1
    transformers
    Audio Classification

    superb/hubert-base-superb-ic

    272
    transformers
    Object Detection

    mradermacher/Polaris-VGA-2B-Post1.0-i1-GGUF

    271
    transformers
    Object Detection

    keremberke/yolov5n-smoke

    271
    2
    yolov5
    Image To Text

    mobilint/blip-image-captioning-large

    271
    Zero Shot Image Classification

    facebook/metaclip-2-worldwide-s16-384

    270
    3
    transformers
    Question Answering

    PraneetNS/codesentinel-full

    270
    1
    transformers
    Robotics

    CursedRock17/so101_block_grab_smolvla_0

    270
    lerobot
    Video Classification

    Nikeytas/videomae-crime-detector-production-v1

    269
    Object Detection

    jadechoghari/RT-DETRv2

    269
    4
    transformers
    Image To Text

    PaddlePaddle/UVDoc_safetensors

    269
    1
    PaddleOCR
    Image Feature Extraction

    apple/aimv2-huge-patch14-448

    269
    6
    transformers
    Image Feature Extraction

    nvidia/RADIO

    269
    42
    transformers
    Depth Estimation

    onnx-community/depth-anything-v2-base

    268
    transformers.js
    283 / 400