NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 47294752 of 9,588 models

    Audio To Audio

    kyutai/hibiki-zero-3b-pytorch-bf16

    3K
    51
    Sentence Similarity

    infgrad/stella-large-zh-v2

    3K
    32
    sentence-transformers
    Any To Any

    llmfan46/gemma-4-E4B-it-ultra-uncensored-heretic

    3K
    13
    transformers
    Text To Speech

    FluidInference/kokoro-82m-coreml

    3K
    8
    Image To Text

    google/pix2struct-base

    3K
    79
    transformers
    Sentence Similarity

    sentence-transformers/msmarco-distilbert-base-v3

    3K
    4
    sentence-transformers
    Feature Extraction

    helboukkouri/character-bert

    3K
    2
    transformers
    Image To Text

    noctrex/PaddleOCR-VL-1.5-GGUF

    3K
    7
    Any To Any

    nightmedia/gemma-4-E4B-it-mxfp8-mlx

    3K
    2
    mlx
    Object Detection

    foduucom/table-detection-and-extraction

    3K
    106
    ultralytics
    Feature Extraction

    openbmb/MiniCPM-Embedding

    3K
    250
    transformers
    Question Answering

    mrm8488/bert-base-spanish-wwm-cased-finetuned-spa-squad2-es

    3K
    13
    transformers
    Translation

    Helsinki-NLP/opus-mt-tc-big-en-fr

    3K
    8
    transformers
    Sentence Similarity

    tomaarsen/mpnet-base-nli

    3K
    1
    sentence-transformers
    Video Classification

    MCG-NJU/videomae-large-finetuned-kinetics

    3K
    14
    transformers
    Zero Shot Image Classification

    timm/resnet50x64_clip.openai

    3K
    open_clip
    Image Segmentation

    cmarkea/dit-base-layout-detection

    3K
    6
    transformers
    Any To Any

    SassyDiffusion/gemma-4-E4B-it-heretic-GGUF

    3K
    gguf
    Translation

    ByteDance-Seed/Seed-X-PPO-7B

    3K
    302
    Audio Classification

    hzhongresearch/yamnetp_ahead_ds

    3K
    keras
    Feature Extraction

    tencent/Penguin-Encoder

    3K
    22
    transformers
    Feature Extraction

    unsloth/Qwen3-Embedding-4B

    3K
    1
    sentence-transformers
    Image Classification

    timm/tresnet_m.miil_in21k

    3K
    1
    timm
    Translation

    ai4bharat/indictrans2-indic-indic-dist-320M

    3K
    6
    transformers
    198 / 400