NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 64096432 of 9,588 models

    Image Feature Extraction

    bytedance-research/LVFace

    425
    28
    lvface
    Robotics

    allenai/GraspMolmo

    424
    10
    Question Answering

    gaotianyu1350/roberta-large-squad

    424
    transformers
    Voice Activity Detection

    funasr/fsmn-vad

    422
    22
    Image To Text

    Hyphonical/Pixtral-12B-Captioner-Relaxed-Q4_K_M-GGUF

    422
    1
    transformers
    Text To Video

    guoyww/animatediff-motion-adapter-v1-5

    421
    5
    diffusers
    Image To Text

    kkatiz/THAI-BLIP-2

    421
    8
    transformers
    Text To Video

    ZuluVision/MoviiGen1.1

    421
    104
    diffusers
    Reinforcement Learning

    mradermacher/ATLAS-8B-Thinking-i1-GGUF

    420
    1
    transformers
    Image To Text

    mradermacher/old-church-slavonic-dots-ocr-GGUF

    419
    transformers
    Image Feature Extraction

    nvidia/RADIO-B

    419
    3
    transformers
    Image To Text

    chatpig/llava-llama3

    415
    4
    Zero Shot Image Classification

    laion/CLIP-ViT-B-16-DataComp.L-s1B-b8K

    413
    1
    open_clip
    Text To Video

    obsxrver/wan2.2-t2v-scat

    412
    26
    diffusers
    Visual Question Answering

    openbmb/MiniCPM-Llama3-V-2_5-int4

    412
    79
    transformers
    Image Segmentation

    ianpan/chest-x-ray-basic

    412
    9
    transformers
    Summarization

    google/bigbird-pegasus-large-bigpatent

    410
    41
    transformers
    Object Detection

    apple/coreml-YOLOv3

    410
    18
    coreml
    Audio Classification

    MIT/ast-finetuned-audioset-10-10-0.448

    410
    1
    transformers
    Zero Shot Image Classification

    LanguageBind/LanguageBind_Video_Huge_V1.5_FT

    409
    6
    transformers
    Question Answering

    Shushant/biobert-v1.1-biomedicalQuestionAnswering

    409
    9
    transformers
    Audio To Audio

    mpariente/ConvTasNet_WHAM_sepclean

    408
    asteroid
    Image To Text

    DunnBC22/trocr-base-printed_license_plates_ocr

    408
    11
    transformers
    Image To Text

    team-lucid/trocr-small-korean

    408
    18
    transformers
    268 / 400