
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines, and plug any model into your extractors.

    9,002 models available

    Showing 3385-3408 of 9,002 models

    Automatic Speech Recognition
    onnx-community/whisper-large-v3-turbo
    10K
    73
    transformers.js

    Image Feature Extraction
    NorskRegnesentralSTI/NCS-v1-2.5d-base
    10K
    1
    transformers

    Fill Mask
    arampacha/roberta-tiny
    10K
    2
    transformers

    Fill Mask
    tbs17/MathBERT
    10K
    27
    transformers

    Text Classification
    agentlans/multilingual-e5-small-aligned-sentiment
    10K
    4

    Fill Mask
    thomas-sounack/BioClinical-ModernBERT-base
    10K
    35
    transformers

    Sentence Similarity
    clips/e5-small-trm-nl
    10K
    1
    sentence-transformers

    Sentence Similarity
    pfnet/plamo-embedding-1b
    10K
    44
    transformers

    Sentence Similarity
    sentence-transformers/msmarco-MiniLM-L6-cos-v5
    10K
    11
    sentence-transformers

    Image Classification
    timm/mobilevitv2_100.cvnets_in1k
    10K
    1
    timm

    Image Classification
    microsoft/swinv2-base-patch4-window16-256
    10K
    5
    transformers

    Any To Any
    KRAFTON/Raon-Speech-9B
    10K
    38
    transformers

    Image Classification
    microsoft/beit-base-patch16-224-pt22k-ft22k
    10K
    82
    transformers

    Image Segmentation
    openmmlab/upernet-convnext-small
    10K
    34
    transformers

    Image Classification
    timm/convnextv2_large.fcmae_ft_in22k_in1k
    10K
    timm

    Text Classification
    maidalun1020/bce-reranker-base_v1
    10K
    199
    sentence-transformers

    Fill Mask
    facebook/xmod-base
    10K
    17
    transformers

    Video Classification
    facebook/vjepa2-vitg-fpc64-384
    10K
    41
    transformers

    Automatic Speech Recognition
    waveletdeboshir/gigaam-rnnt
    10K
    9
    transformers

    Text To Video
    wangfuyun/AnimateLCM
    10K
    345
    diffusers

    Automatic Speech Recognition
    islomov/rubaistt_v2_medium
    10K
    21

    Zero Shot Image Classification
    timm/PE-Core-bigG-14-448
    10K
    6
    open_clip

    Automatic Speech Recognition
    distil-whisper/distil-large-v2
    10K
    516
    transformers

    Text Classification
    gpustack/bge-reranker-v2-m3-GGUF
    10K
    30
    sentence-transformers

    Page 142 of 376