    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 2113–2136 of 9,002 models

    Image Classification

    facebook/convnextv2-tiny-22k-224

    57K
    4
    transformers
    Image To Image

    black-forest-labs/FLUX.2-klein-9b-kv-fp8

    57K
    64
    diffusers
    Feature Extraction

    facebook/encodec_32khz

    56K
    19
    transformers
    Image Classification

    timm/hrnet_w18.ms_aug_in1k

    56K
    3
    timm
    Image Classification

    timm/efficientnetv2_rw_m.agc_in1k

    56K
    1
    timm
    Image Text To Text

    kristaller486/dots.ocr-1.5

    56K
    22
    dots_ocr_1_5
    Object Detection

    facebook/detr-resnet-101

    56K
    129
    transformers
    Image Text To Text

    florence-community/Florence-2-large

    56K
    5
    transformers
    Image Text To Text

    bartowski/Qwen_Qwen3.5-27B-GGUF

    56K
    69
    Automatic Speech Recognition

    distil-whisper/distil-large-v3.5

    56K
    89
    transformers
    Image Feature Extraction

    timm/vit_large_patch16_siglip_256.v2_webli

    56K
    2
    timm
    Table Question Answering

    google/tapas-large-finetuned-sqa

    56K
    7
    transformers
    Image Text To Text

    RedHatAI/gemma-3-27b-it-FP8-dynamic

    56K
    13
    transformers
    Sentence Similarity

    jinaai/jina-embeddings-v5-text-nano-retrieval

    56K
    12
    llama.cpp
    Feature Extraction

    nvidia/NV-Embed-v2

    56K
    509
    transformers
    Text To Video

    ali-vilab/text-to-video-ms-1.7b

    56K
    657
    diffusers
    Fill Mask

    kuleshov-group/mdlm-owt

    56K
    22
    transformers
    Question Answering

    philschmid/distilbert-onnx

    55K
    3
    transformers
    Depth Estimation

    depth-anything/DA3-GIANT-1.1

    55K
    8
    depth-anything-3
    Image Classification

    LukeJacob2023/nsfw-image-detector

    55K
    23
    transformers
    Text To Image

    unsloth/Qwen-Image-2512-GGUF

    55K
    343
    Text To Speech

    kenpath/svara-tts-v1

    55K
    37
    transformers
    Feature Extraction

    prithivida/Splade_PP_en_v1

    55K
    30
    sentence-transformers
    Image Classification

    timm/resnet50_gn.a1h_in1k

    55K
    1
    timm
    Page 89 of 376