    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    13,634 models available

    Showing 7225–7248 of 13,634 models

    Any To Any

    zecanard/gemma-4-E2B-it-ultra-uncensored-heretic-MLX-2bit-mixed_2_6

    941
    1
    mlx
    Image To Text

    PaddlePaddle/PP-OCRv6_small_det_safetensors

    941
    21
    PaddleOCR
    Zero Shot Image Classification

    BAAI/AltCLIP-m18

    940
    5
    transformers
    Reinforcement Learning

    mradermacher/SocialR1-8B-GGUF

    940
    1
    transformers
    Depth Estimation

    facebook/dpt-dinov2-base-nyu

    939
    transformers
    Image To Text

    JEILDLWLRMA/Qwen3-VL-8B-Instruct-NVFP4

    939
    1
    Object Detection

    keremberke/yolov8m-blood-cell-detection

    935
    11
    ultralytics
    Text To Audio

    OzzyGT/LTX-2.3-Distilled-1.1-sdnq-dynamic-int8

    935
    1
    diffusers
    Image Segmentation

    facebook/sapiens2-seg-5b

    935
    5
    sapiens2
    Reinforcement Learning

    mradermacher/GCIRS-Reasoning-1.5B-R1-i1-GGUF

    934
    transformers
    Feature Extraction

    cl-nagoya/sup-simcse-ja-large

    932
    15
    sentence-transformers
    Feature Extraction

    fxmarty/clip-vision-model-tiny

    931
    transformers
    Zero Shot Image Classification

    visheratin/mexma-siglip2

    930
    14
    Feature Extraction

    cstr/jina-v5-small-GGUF

    930
    1
    Image Segmentation

    tue-mps/videomt-dinov2-small-ytvis2019

    929
    transformers
    Audio Classification

    lab260/AASIST3

    928
    3
    Object Detection

    NAKSTStudio/yolov8m-chess-piece-detection

    926
    1
    ultralytics
    Object Detection

    keremberke/yolov8m-forklift-detection

    925
    8
    ultralytics
    Reinforcement Learning

    Malgesw/ppo-Huggy

    925
    ml-agents
    Object Detection

    ustc-community/dfine-medium-obj365

    924
    2
    transformers
    Image Feature Extraction

    nvidia/MambaVision-S-1K

    924
    11
    transformers
    Image To Text

    mradermacher/Rax-4.5-GGUF

    923
    2
    transformers
    Voice Activity Detection

    pyannote/speaker-diarization-precision-2

    922
    23
    pyannote-audio
    Text To Video

    vrgamedevgirl84/LTX_2.3_Clay_Mation_Style_LoRa

    922
    13
    diffusers