    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 3,313–3,336 of 9,002 models

    Summarization

    cahya/t5-base-indonesian-summarization-cased

    11K
    6
    transformers
    Depth Estimation

    depth-anything/Depth-Anything-V2-Metric-Indoor-Base-hf

    11K
    2
    transformers
    Depth Estimation

    depth-anything/Depth-Anything-V2-Small

    11K
    77
    depth-anything-v2
    Image Feature Extraction

    py-feat/img2pose

    11K
    1
    py-feat
    Text To Video

    calcuis/wan-gguf

    11K
    182
    Text Classification

    philomath-1209/programming-language-identification

    11K
    11
    transformers
    Fill Mask

    aaronfeller/PeptideCLM-23M-all

    11K
    1
    transformers
    Image Feature Extraction

    google/vit-base-patch32-224-in21k

    11K
    19
    transformers
    Automatic Speech Recognition

    nvidia/nemotron-speech-streaming-en-0.6b

    11K
    530
    nemo
    Text To Video

    ByteDance/AnimateDiff-Lightning

    11K
    980
    diffusers
    Zero Shot Image Classification

    timm/ViT-L-16-SigLIP-256

    11K
    1
    open_clip
    Sentence Similarity

    dariolopez/roberta-base-bne-finetuned-msmarco-qa-es-mnrl-mn

    11K
    7
    sentence-transformers
    Translation

    Helsinki-NLP/opus-mt-cs-en

    11K
    3
    transformers
    Feature Extraction

    LSX-UniWue/LLaMmlein2Vec_120M

    11K
    llm2vec
    Depth Estimation

    depth-anything/Depth-Anything-V2-Metric-Indoor-Large-hf

    11K
    15
    transformers
    Fill Mask

    GroNLP/hateBERT

    11K
    42
    transformers
    Image Feature Extraction

    py-feat/resmasknet

    11K
    py-feat
    Fill Mask

    jjzha/jobbert-base-cased

    11K
    21
    transformers
    Image Classification

    timm/fastvit_t12.apple_in1k

    11K
    timm
    Object Detection

    yainage90/fashion-object-detection

    11K
    38
    transformers
    Image To Image

    unsloth/FLUX.2-dev-GGUF

    11K
    44
    ggml
    Text To Speech

    HKUSTAudio/Llasa-1B

    11K
    102
    Sentence Similarity

    sentence-transformers/all-mpnet-base-v1

    11K
    12
    sentence-transformers
    Fill Mask

    jackaduma/SecBERT

    11K
    61
    transformers
    Page 139 of 376