NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 54255448 of 9,588 models

    Text To Speech

    neuphonic/neutts-nano-french-q4-gguf

    1K
    6
    Image Segmentation

    facebook/mask2former-swin-small-coco-panoptic

    1K
    1
    transformers
    Image Segmentation

    keremberke/yolov8m-building-segmentation

    1K
    10
    ultralytics
    Text To Speech

    pnnbao-ump/VieNeu-TTS-0.3B-lora-ngoc-huyen

    1K
    3
    peft
    Object Detection

    yihong1120/Construction-Hazard-Detection

    1K
    11
    ultralytics
    Zero Shot Image Classification

    timm/ViT-B-16-SigLIP2-384

    1K
    open_clip
    Image To Text

    EZCon/GLM-OCR-4bit-g32-mxfp4-mixed_4_8-mlx

    1K
    5
    mlx
    Feature Extraction

    DeepSoftwareAnalytics/CoCoSoDa

    1K
    3
    transformers
    Visual Question Answering

    google/pix2struct-docvqa-base

    1K
    44
    transformers
    Translation

    Unbabel/TowerInstruct-7B-v0.2

    1K
    40
    transformers
    Table Question Answering

    google/tapas-base-finetuned-sqa

    1K
    7
    transformers
    Image To Image

    dx8152/Qwen-Image-Edit-2511-Gaussian-Splash

    1K
    181
    diffusers
    Text To Speech

    Cseti/VibeVoice_7B_hun_v2

    1K
    15
    vibevoice
    Feature Extraction

    michaelfeil/ct2fast-LaBSE

    1K
    2
    transformers
    Text To Speech

    Aratako/MioTTS-0.1B

    1K
    20
    transformers
    Translation

    Helsinki-NLP/opus-mt-lv-en

    1K
    transformers
    Any To Any

    Tinman-Lab/Tinman-gemma4-companion-gguf

    1K
    Feature Extraction

    opensearch-project/opensearch-neural-sparse-encoding-doc-v1

    1K
    3
    sentence-transformers
    Unconditional Image Generation

    CompVis/ldm-celebahq-256

    1K
    51
    diffusers
    Image Segmentation

    tue-mps/eomt-dinov3-coco-instance-large-640

    1K
    transformers
    Image To Image

    TencentARC/t2iadapter_depth_sd15v2

    1K
    3
    diffusers
    Audio To Audio

    NandemoGHS/Anime-XCodec2

    1K
    17
    Audio Classification

    aufklarer/Sortformer-Diarization-CoreML

    1K
    Text To Speech

    zai-org/GLM-TTS

    1K
    335
    glm-tts
    227 / 400