NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 26892712 of 9,002 models

    Text To Image

    stabilityai/stable-diffusion-3.5-large-controlnet-canny

    25K
    15
    diffusers
    Image Classification

    timm/vit_base_patch16_384.augreg_in21k_ft_in1k

    25K
    timm
    Image To Image

    meituan-longcat/LongCat-Image-Edit

    25K
    169
    transformers
    Feature Extraction

    shunk031/aesthetics-predictor-v1-vit-large-patch14

    25K
    2
    transformers
    Image Feature Extraction

    timm/convnext_tiny.dinov3_lvd1689m

    25K
    1
    timm
    Feature Extraction

    indobenchmark/indobert-base-p2

    25K
    7
    transformers
    Audio Classification

    facebook/mms-lid-4017

    24K
    14
    transformers
    Image To Image

    QuantStack/Qwen-Image-Edit-GGUF

    24K
    284
    gguf
    Zero Shot Classification

    MoritzLaurer/deberta-v3-base-zeroshot-v2.0

    24K
    12
    transformers
    Text Classification

    weqweasdas/RM-Gemma-7B

    24K
    8
    transformers
    Automatic Speech Recognition

    deepdml/faster-distil-whisper-large-v3.5

    24K
    7
    ctranslate2
    Feature Extraction

    CofeAI/FLM-2-52B-Instruct-2407

    24K
    12
    transformers
    Feature Extraction

    BAAI/llm-embedder

    24K
    128
    transformers
    Feature Extraction

    unslothai/gcp

    24K
    transformers
    Fill Mask

    microsoft/BiomedNLP-BiomedBERT-large-uncased-abstract

    24K
    21
    transformers
    Sentence Similarity

    sentence-transformers/distiluse-base-multilingual-cased

    24K
    18
    sentence-transformers
    Zero Shot Image Classification

    Salesforce/blip2-itm-vit-g

    24K
    3
    transformers
    Zero Shot Image Classification

    google/siglip-base-patch16-256-multilingual

    24K
    53
    transformers
    Translation

    Helsinki-NLP/opus-mt-tc-big-en-bg

    24K
    1
    transformers
    Zero Shot Image Classification

    Marqo/marqo-fashionCLIP

    24K
    28
    open_clip
    Translation

    Rostlab/ProstT5

    24K
    34
    transformers
    Image Classification

    prithivMLmods/Watermark-Detection-SigLIP2

    24K
    30
    transformers
    Automatic Speech Recognition

    bijaykumarsingh/whisper-large-v3-bn-cv17

    24K
    transformers
    Text To Speech

    nari-labs/Dia2-2B

    24K
    165
    dia2
    113 / 376