NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 41534176 of 9,002 models

    Video Classification

    qubvel-hf/vjepa2-vitl-fpc16-256-ssv2

    4K
    7
    transformers
    Translation

    facebook/wmt19-de-en

    4K
    20
    transformers
    Sentence Similarity

    michaelfeil/embeddinggemma-300m

    4K
    sentence-transformers
    Translation

    tencent/HY-MT1.5-7B-GGUF

    4K
    47
    transformers
    Feature Extraction

    castorini/wiki-all-8-4-multi-dpr2-passage-encoder

    4K
    transformers
    Video Classification

    microsoft/xclip-large-patch14

    4K
    14
    transformers
    Audio Classification

    DavidCombei/wavLM-base-Deepfake_V2

    4K
    transformers
    Zero Shot Image Classification

    timm/ViT-L-16-SigLIP-384

    4K
    29
    open_clip
    Automatic Speech Recognition

    Oriserve/Whisper-Hindi2Hinglish-Apex

    4K
    7
    transformers
    Text To Speech

    facebook/mms-tts-tur

    4K
    26
    transformers
    Sentence Similarity

    lokeshch19/ModernPubMedBERT

    4K
    24
    sentence-transformers
    Reinforcement Learning

    mradermacher/Pluto-i1-GGUF

    4K
    transformers
    Sentence Similarity

    avsolatorio/NoInstruct-small-Embedding-v0

    4K
    24
    sentence-transformers
    Text To Speech

    bluryar/VoxCPM-GGUF

    4K
    12
    Text To Speech

    cartesia/azzurra-voice

    4K
    16
    transformers
    Image Feature Extraction

    timm/vit_base_patch16_clip_224.laion2b

    4K
    1
    timm
    Video Classification

    microsoft/xclip-base-patch16-zero-shot

    4K
    27
    transformers
    Image Classification

    timm/vit_base_patch16_clip_224.laion2b_ft_in12k_in1k

    4K
    2
    timm
    Text To Speech

    saheedniyi/YarnGPT

    4K
    47
    transformers
    Any To Any

    mlx-community/gemma-4-e4b-mxfp8

    4K
    1
    mlx
    Image To Image

    dx8152/Qwen-Edit-2509-Multi-Angle-Lighting

    4K
    165
    diffusers
    Text To Speech

    canopylabs/orpheus-3b-0.1-pretrained

    4K
    167
    transformers
    Image Segmentation

    pamixsun/segformer_for_optic_disc_cup_segmentation

    4K
    6
    transformers
    Automatic Speech Recognition

    nguyenvulebinh/wav2vec2-base-vietnamese-250h

    4K
    45
    transformers
    174 / 376