    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,002 models available

    Showing 2017–2040 of 9,002 models

    Audio Classification

    superb/hubert-large-superb-er

    67K
    25
    transformers
    Text To Image

    John6666/hassaku-xl-illustrious-v31-sdxl

    67K
    1
    diffusers
    Automatic Speech Recognition

    imvladikon/wav2vec2-xls-r-300m-hebrew

    67K
    6
    transformers
    Text Classification

    PirateXX/AI-Content-Detector

    67K
    8
    transformers
    Zero Shot Classification

    cross-encoder/nli-deberta-v3-large

    67K
    39
    sentence-transformers
    Sentence Similarity

    emrecan/bert-base-turkish-cased-mean-nli-stsb-tr

    66K
    49
    sentence-transformers
    Automatic Speech Recognition

    bond005/wav2vec2-large-ru-golos

    66K
    17
    transformers
    Depth Estimation

    Intel/dpt-large

    66K
    204
    transformers
    Image Text To Text

    nvidia/Eagle2.5-8B

    66K
    39
    transformers
    Image Classification

    timm/swin_base_patch4_window7_224.ms_in22k_ft_in1k

    66K
    7
    timm
    Fill Mask

    facebook/esm1v_t33_650M_UR90S_1

    66K
    5
    transformers
    Image Text To Text

    unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit

    66K
    51
    transformers
    Zero Shot Image Classification

    google/siglip2-so400m-patch16-256

    66K
    1
    transformers
    Feature Extraction

    farbodtavakkoli/OTel-Embedding-8B

    66K
    Audio To Audio

    JacobLinCool/MP-SENet-DNS

    65K
    2
    Image Text To Text

    openvla/openvla-7b-finetuned-libero-object

    65K
    1
    transformers
    Image Text To Text

    lmstudio-community/Qwen3.5-397B-A17B-MLX-8bit

    65K
    1
    transformers
    Image To Image

    unsloth/FLUX.2-klein-4B-GGUF

    65K
    133
    ggml
    Depth Estimation

    LiheYoung/depth-anything-base-hf

    65K
    12
    transformers
    Image Text To Text

    trl-internal-testing/tiny-Qwen3VLForConditionalGeneration

    64K
    transformers
    Image Segmentation

    shi-labs/oneformer_coco_swin_large

    64K
    8
    transformers
    Text To Image

    kpsss34/FHDR_Uncensored

    64K
    434
    diffusers
    Feature Extraction

    ibm-granite/granite-embedding-30m-sparse

    64K
    25
    sentence-transformers
    Automatic Speech Recognition

    AbelZimba/whisper-bemba-stt

    63K
    transformers
    Page 85 of 376