    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    13,634 models available

    Showing 6,985–7,008 of 13,634 models

    Video Classification

    facebook/vjepa2-vitl-fpc32-256-diving48

    1K
    5
    transformers
    Object Detection

    cmarkea/detr-layout-detection

    1K
    28
    transformers
    Visual Question Answering

    microsoft/git-base-vqav2

    1K
    21
    transformers
    Question Answering

    NastasiaM/mbert-loraxs-qa-LTr64-qkvd-ABohneLT-ABfr

    1K
    peft
    Feature Extraction

    mlx-community/Qwen3-Embedding-8B-mxfp8

    1K
    3
    sentence-transformers
    Zero Shot Image Classification

    kwanY/styleid

    1K
    5
    transformers
    Audio Classification

    aufklarer/Sortformer-Diarization-CoreML

    1K
    1
    Robotics

    lerobot/pi0fast-libero-v044

    1K
    10
    lerobot
    Translation

    tencent/Hy-MT1.5-1.8B-2bit-GGUF

    1K
    28
    Zero Shot Image Classification

    OFA-Sys/chinese-clip-vit-large-patch14-336px

    1K
    26
    transformers
    Any To Any

    mradermacher/Darkidol-Gemma-4-E4B-it-GGUF

    1K
    4
    transformers
    Any To Any

    Bingsu/gemma-4-E2B-it-GGUF

    1K
    gguf
    Visual Question Answering

    google/pix2struct-ai2d-base

    1K
    43
    transformers
    Summarization

    QyrouNnet-AI/QNS-2-ReLearn-Preview

    1K
    1
    Any To Any

    mlx-community/gemma-4-e4b-it-6bit

    1K
    2
    mlx
    Feature Extraction

    alexliap/Qwen3-Embedding-8B-FP8-DYNAMIC

    1K
    1
    sentence-transformers
    Feature Extraction

    Tevatron/AgentIR-4B

    1K
    9
    transformers
    Object Detection

    keremberke/yolov8m-protective-equipment-detection

    1K
    18
    ultralytics
    Zero Shot Image Classification

    DatologyAI/retr-opt-vit-b-32

    1K
    9
    open_clip
    Feature Extraction

    BASF-AI/ChEmbed-full

    1K
    1
    transformers
    Image Feature Extraction

    CompVis/cleandift

    1K
    8
    diffusion-single-file
    Image Segmentation

    nvidia/segformer-b0-finetuned-cityscapes-768-768

    1K
    transformers
    Any To Any

    EZCon/Huihui-gemma-4-E2B-it-abliterated-4bit-g32-mxfp4-mixed_4_8-mlx

    1K
    mlx
    Audio To Audio

    LiquidAI/LFM2.5-Audio-1.5B-JP

    1K
    65
    liquid-audio
    292 / 569