    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 5617–5640 of 9,588 models

    Translation

    staka/fugumt-en-ja

    1K
    54
    transformers
    Audio Classification

    tiantiaf/wavlm-large-categorical-emotion

    1K
    4
    Feature Extraction

    ggml-org/embeddinggemma-300M-qat-q4_0-GGUF

    1K
    5
    sentence-transformers
    Translation

    mradermacher/HY-MT1.5-7B-GGUF

    1K
    4
    transformers
    Audio To Audio

    aufklarer/PersonaPlex-7B-MLX-8bit

    1K
    6
    mlx
    Text To Video

    hunyuanvideo-community/HunyuanVideo-1.5-Diffusers-480p_t2v

    1K
    diffusers
    Image To Text

    mradermacher/WR30a-Deep-7B-0711-i1-GGUF

    1K
    1
    transformers
    Text To Audio

    HeartMuLa/HeartMuLa-oss-3B

    1K
    255
    Audio Classification

    mispeech/dasheng-base

    1K
    9
    transformers
    Audio Classification

    orcasound/orcahello-srkw-detector-v1

    1K
    1
    orcahello
    Summarization

    nsi319/legal-led-base-16384

    1K
    17
    transformers
    Text To Video

    Lightricks/LTX-Video-0.9.7-dev

    1K
    21
    diffusers
    Image Segmentation

    openmmlab/upernet-convnext-large

    1K
    1
    transformers
    Any To Any

    FakeRockert543/gemma-4-e4b-it-MLX-4bit

    1K
    2
    mlx
    Text To Video

    calcuis/hyvid

    1K
    25
    Summarization

    pszemraj/pegasus-x-large-book-summary

    1K
    37
    transformers
    Image To Image

    latentcat/control_v1p_sd15_brightness

    1K
    193
    diffusers
    Robotics

    nvidia/GR00T-N1.7-DROID

    1K
    3
    Image To Image

    Scotttttt111/Qwen-Image-Edit-2511-Lightning

    1K
    1
    diffusers
    Image To Text

    PaddlePaddle/cyrillic_PP-OCRv5_mobile_rec

    1K
    PaddleOCR
    Feature Extraction

    cstr/octen-0.6b-GGUF

    1K
    Feature Extraction

    SaeedLab/MolDeBERTa-base-123M-mtr

    1K
    transformers
    Image To Text

    nyu-visionx/Cambrian-S-7B

    1K
    5
    transformers
    Audio To Audio

    speechbrain/sepformer-whamr16k

    1K
    12
    speechbrain
    Page 235 of 400