NEWAgents can now see video via MCP.Try it now →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 36253648 of 9,002 models

    Feature Extraction

    hf-tiny-model-private/tiny-random-MCTCTModel

    8K
    transformers
    Robotics

    moojink/openvla-7b-oft-finetuned-libero-spatial

    8K
    14
    transformers
    Image To Text

    PaddlePaddle/en_PP-OCRv4_mobile_rec

    8K
    3
    PaddleOCR
    Image To Image

    unsloth/FLUX.2-klein-base-9B-GGUF

    8K
    32
    ggml
    Video Classification

    microsoft/xclip-base-patch32-16-frames

    8K
    6
    transformers
    Object Detection

    facebook/detr-resnet-50-dc5

    8K
    6
    transformers
    Zero Shot Image Classification

    imageomics/bioclip-2.5-vith14

    8K
    9
    open_clip
    Zero Shot Image Classification

    timm/MobileCLIP2-S3-OpenCLIP

    8K
    3
    open_clip
    Text To Audio

    ACE-Step/acestep-v15-xl-turbo

    8K
    138
    transformers
    Feature Extraction

    Snarcy/RadioDino-s8

    8K
    timm
    Image Classification

    timm/seresnext50_32x4d.racm_in1k

    8K
    timm
    Feature Extraction

    Xenova/clap-htsat-unfused

    8K
    transformers.js
    Image Classification

    timm/convnext_tiny.fb_in1k

    8K
    1
    timm
    Sentence Similarity

    sentence-transformers/stsb-roberta-large

    8K
    4
    sentence-transformers
    Feature Extraction

    llmrails/ember-v1

    8K
    63
    sentence-transformers
    Object Detection

    microsoft/conditional-detr-resnet-50

    8K
    13
    transformers
    Image To Image

    lllyasviel/control_v11p_sd15_seg

    8K
    15
    diffusers
    Image To Image

    lllyasviel/control_v11p_sd15_normalbae

    8K
    18
    diffusers
    Text To Video

    QuantStack/Wan2.2-Fun-A14B-Control-Camera-GGUF

    8K
    11
    gguf
    Automatic Speech Recognition

    onnx-community/cohere-transcribe-03-2026-ONNX

    7K
    14
    transformers.js
    Video Classification

    microsoft/xclip-base-patch16-16-frames

    7K
    1
    transformers
    Text To Audio

    facebook/musicgen-melody-large

    7K
    32
    transformers
    Summarization

    facebook/bart-large-xsum

    7K
    36
    transformers
    Automatic Speech Recognition

    kotoba-tech/kotoba-whisper-bilingual-v1.0

    7K
    19
    transformers
    152 / 376