NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    8,900 models available

    Showing 7396 of 8,900 models

    Image Text To Text

    Qwen/Qwen2-VL-2B-Instruct

    3.8M
    499
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

    3.8M
    53
    transformers
    Automatic Speech Recognition

    MahmoudAshraf/mms-300m-1130-forced-aligner

    3.7M
    84
    transformers
    Text Generation

    openai/gpt-oss-120b

    3.6M
    4,733
    transformers
    Image Text To Text

    Qwen/Qwen3.5-4B

    3.6M
    491
    transformers
    Text Generation

    Qwen/Qwen2-1.5B-Instruct

    3.5M
    162
    transformers
    Any To Any

    google/gemma-4-E4B-it

    3.5M
    820
    transformers
    Object Detection

    microsoft/table-transformer-detection

    3.4M
    415
    transformers
    Sentence Similarity

    sentence-transformers/paraphrase-MiniLM-L6-v2

    3.4M
    146
    sentence-transformers
    Image Text To Text

    Qwen/Qwen3.5-27B

    3.4M
    961
    transformers
    Text Classification

    cardiffnlp/twitter-roberta-base-sentiment-latest

    3.4M
    789
    transformers
    Feature Extraction

    facebook/w2v-bert-2.0

    3.4M
    213
    transformers
    Text Generation

    meta-llama/Meta-Llama-3-8B

    3.3M
    6,522
    transformers
    Text Classification

    distilbert/distilbert-base-uncased-finetuned-sst-2-english

    3.2M
    891
    transformers
    Text Generation

    TinyLlama/TinyLlama-1.1B-Chat-v1.0

    3.1M
    1,569
    transformers
    Text Generation

    EleutherAI/pythia-160m

    3.0M
    39
    transformers
    Sentence Similarity

    intfloat/multilingual-e5-base

    3.0M
    352
    sentence-transformers
    Fill Mask

    google-bert/bert-base-multilingual-cased

    3.0M
    586
    transformers
    Image Text To Text

    unsloth/gemma-4-26B-A4B-it-GGUF

    2.9M
    599
    Image Text To Text

    Qwen/Qwen3.5-0.8B

    2.9M
    511
    transformers
    Text Generation

    meta-llama/Llama-3.2-3B-Instruct

    2.9M
    2,112
    transformers
    Text Generation

    Qwen/Qwen3-14B

    2.8M
    388
    transformers
    Sentence Similarity

    sentence-transformers/all-MiniLM-L12-v2

    2.8M
    304
    sentence-transformers
    Zero Shot Image Classification

    laion/CLIP-ViT-B-32-laion2B-s34B-b79K

    2.7M
    139
    open_clip
    4 / 371