
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    8,900 models available

    Showing 169-192 of 8,900 models

    Task                            Model                                                     Downloads  Likes  Library
    Zero Shot Image Classification  openai/clip-vit-base-patch16                              1.7M       158    transformers
    Image Text To Text              Qwen/Qwen2-VL-7B-Instruct-AWQ                             1.7M       49     transformers
    Translation                     google-t5/t5-base                                         1.7M       774    transformers
    Zero Shot Image Classification  google/siglip-base-patch16-224                            1.7M       81     transformers
    Image Text To Text              unsloth/gemma-4-E4B-it-GGUF                               1.6M       331
    Text To Speech                  Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice                      1.6M       1,438
    Fill Mask                       facebook/esm2_t33_650M_UR50D                              1.6M       77     transformers
    Image Text To Text              Qwen/Qwen3.5-35B-A3B-FP8                                  1.6M       146    transformers
    Text Generation                 Qwen/Qwen2.5-7B                                           1.6M       279    transformers
    Text Generation                 Qwen/Qwen2.5-Coder-7B                                     1.6M       141    transformers
    Image Text To Text              opendatalab/MinerU2.5-2509-1.2B                           1.6M       355    transformers
    Image Text To Text              Qwen/Qwen3-VL-32B-Instruct                                1.6M       197    transformers
    Text Generation                 apple/OpenELM-1_1B-Instruct                               1.5M       75     transformers
    Feature Extraction              Xenova/bge-base-en-v1.5                                   1.5M       9      transformers.js
    Sentence Similarity             sentence-transformers-testing/stsb-bert-tiny-safetensors  1.5M       4      sentence-transformers
    Image Text To Text              microsoft/Phi-3.5-vision-instruct                         1.5M       732    transformers
    Sentence Similarity             Qwen/Qwen3-VL-Embedding-8B                                1.5M       389    sentence-transformers
    Image Classification            timm/resnet50.a1_in1k                                     1.5M       41     timm
    Video Classification            ai-forever/kandinsky-videomae-large-camera-motion         1.5M       5      transformers
    Image Classification            timm/resnet18.a1_in1k                                     1.5M       14     timm
    Image Text To Text              deepseek-ai/DeepSeek-OCR-2                                1.5M       926    transformers
    Image Text To Text              unsloth/Qwen3.6-35B-A3B-GGUF                              1.5M       754    transformers
    Text To Speech                  k2-fsa/OmniVoice                                          1.5M       699    omnivoice
    Fill Mask                       neuralmind/bert-large-portuguese-cased                    1.5M       72     transformers
    Page 8 of 371