    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 1081–1104 of 9,002 models

    Text Classification

    Arunavaonly/Bangla-twoclass-Sentiment-Analyzer

    177K
    1
    transformers
    Image Text To Text

    rednote-hilab/dots.ocr

    177K
    1,296
    dots_ocr
    Token Classification

    OpenMed/OpenMed-NER-OncologyDetect-MultiMed-568M

    177K
    1
    transformers
    Visual Question Answering

    dandelin/vilt-b32-finetuned-vqa

    177K
    421
    transformers
    Zero Shot Image Classification

    timm/ViT-B-16-SigLIP-i18n-256

    177K
    5
    open_clip
    Depth Estimation

    depth-anything/DA3NESTED-GIANT-LARGE-1.1

    176K
    17
    depth-anything-3
    Text Generation

    unsloth/gpt-oss-120b-BF16

    176K
    8
    transformers
    Sentence Similarity

    littlejohn-ai/bge-m3-spa-law-qa

    176K
    19
    sentence-transformers
    Image Text To Text

    Qwen/Qwen2.5-VL-72B-Instruct-AWQ

    175K
    71
    transformers
    Image Text To Text

    trl-internal-testing/tiny-Gemma3ForConditionalGeneration

    175K
    transformers
    Image Text To Text

    cyankiwi/Qwen3.6-35B-A3B-AWQ-4bit

    174K
    38
    transformers
    Text Generation

    trl-internal-testing/tiny-CohereForCausalLM

    174K
    transformers
    Feature Extraction

    unslothai/vram-24

    173K
    transformers
    Image Text To Text

    nvidia/Cosmos-Reason2-8B

    173K
    170
    cosmos
    Zero Shot Image Classification

    google/siglip2-so400m-patch14-224

    173K
    3
    transformers
    Text Generation

    Qwen/Qwen3-8B-FP8

    173K
    59
    transformers
    Summarization

    philschmid/bart-large-cnn-samsum

    173K
    267
    transformers
    Image Text To Text

    vikp/texify

    172K
    15
    transformers
    Text Generation

    hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4

    172K
    109
    transformers
    Image Segmentation

    facebook/mask2former-swin-large-cityscapes-semantic

    172K
    37
    transformers
    Automatic Speech Recognition

    nvidia/parakeet-tdt-0.6b-v2

    172K
    1,466
    nemo
    Automatic Speech Recognition

    comodoro/wav2vec2-xls-r-300m-cs-250

    172K
    3
    transformers
    Automatic Speech Recognition

    nvidia/canary-1b-v2

    172K
    378
    nemo
    Text Generation

    mudler/Qwen3.5-35B-A3B-APEX-GGUF

    171K
    89
    Page 46 of 376