NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 47054728 of 9,588 models

    Text To Video

    samuelchristlie/Wan2.1-T2V-1.3B-GGUF

    3K
    17
    diffusers
    Image To Image

    TencentARC/t2i-adapter-depth-zoe-sdxl-1.0

    3K
    27
    diffusers
    Image Segmentation

    facebook/mask2former-swin-large-mapillary-vistas-panoptic

    3K
    2
    transformers
    Image Classification

    microsoft/swin-small-patch4-window7-224

    3K
    2
    transformers
    Image To Text

    KuroTo4ka/Qwen3-VL-8B-Instruct-Unredacted-MAX-Quants-GGUF

    3K
    3
    gguf
    Image To Image

    lambda/sd-image-variations-diffusers

    3K
    461
    diffusers
    Robotics

    lerobot/pi0fast-base

    3K
    24
    lerobot
    Text To Speech

    g-group-ai-lab/gwen-tts-0.6B

    3K
    13
    transformers
    Feature Extraction

    infgrad/stella-base-en-v2

    3K
    16
    sentence-transformers
    Image Feature Extraction

    google/vit-large-patch32-224-in21k

    3K
    1
    transformers
    Object Detection

    IDEA-Research/dab-detr-resnet-50

    3K
    2
    transformers
    Image Classification

    haywoodsloan/ai-image-detector-deploy

    3K
    21
    transformers
    Text To Audio

    ACE-Step/acestep-v15-xl-base

    3K
    73
    transformers
    Sentence Similarity

    lightonai/LateOn

    3K
    37
    PyLate
    Image Classification

    histai/SPIDER-breast-model

    3K
    6
    transformers
    Image To Image

    prithivMLmods/Qwen-Image-Edit-Rapid-AIO-V4

    3K
    4
    diffusers
    Sentence Similarity

    NeuML/bioclinical-modernbert-base-embeddings

    3K
    11
    sentence-transformers
    Text To Speech

    OuteAI/OuteTTS-0.2-500M-GGUF

    3K
    85
    Image Classification

    timm/repvit_m1_5.dist_300e_in1k

    3K
    timm
    Sentence Similarity

    BlueAvenir/sti_cyber_security_model_updated

    3K
    1
    sentence-transformers
    Text To Audio

    riffusion/riffusion-model-v1

    3K
    649
    diffusers
    Text To Audio

    ACE-Step/acestep-v15-base

    3K
    60
    transformers
    Image To Image

    unsloth/FLUX.2-klein-base-4B-GGUF

    3K
    19
    ggml
    Translation

    Helsinki-NLP/opus-mt-ga-en

    3K
    transformers
    197 / 400