NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 74417464 of 9,588 models

    Image Feature Extraction

    timm/resnet50_clip_gap.yfcc15m

    151
    timm
    Audio Classification

    ciao1122/results

    151
    transformers
    Object Detection

    aesat/detr-finetuned-chess

    150
    1
    transformers
    Object Detection

    TanTan2025/archivision-yolo

    150
    1
    ultralytics
    Image Segmentation

    Pranilllllll/segformer-satellite-segementation

    150
    transformers
    Image To Text

    PaddlePaddle/ka_PP-OCRv3_mobile_rec

    150
    PaddleOCR
    Image To Text

    mlx-community/GLM-OCR-5bit

    150
    transformers
    Text To Video

    ai-forever/Wan2.1-T2V-14B-NABLA-0.5-STA-11-5-5

    150
    diffusers
    Audio Classification

    DBD-research-group/Bird-MAE-Large

    150
    1
    transformers
    Text To Audio

    sil-ai/senga-LUK-ACT-MRK-1TI-2TI-aligned-speecht5

    150
    transformers
    Audio To Audio

    line-corporation/open-universe

    149
    3
    Visual Question Answering

    0xDing/yuren-baichuan-7b

    149
    27
    transformers
    Image Segmentation

    finloop/yolov8s-seg-solar-panels

    149
    7
    ultralytics
    Image To Text

    microsoft/git-large-r-coco

    149
    11
    transformers
    Audio Classification

    tiantiaf/voxlect-spanish-dialect-whisper-large-v3

    149
    5
    transformers
    Text To Audio

    marcorez8/acestep-v15-xl-base-bf16

    149
    1
    transformers
    Text To Audio

    EvgenyShivchenkoUIT/speecht5_finetuned_haitian_creole-without-spec-token

    149
    transformers
    Audio Classification

    ALM/wav2vec2-large-audioset

    149
    1
    transformers
    Object Detection

    mradermacher/Polaris-VGA-4B-Post1.0e-GGUF

    148
    transformers
    Visual Question Answering

    AI-Safeguard/Ivy-VL-llava

    148
    72
    transformers
    Object Detection

    keremberke/yolov5s-nfl

    148
    2
    yolov5
    Image Feature Extraction

    tiiuae/siglino-moe-0.15-0.6B

    148
    7
    transformers
    Audio Classification

    preszzz/drone-audio-detection-05-17-trial-0

    148
    4
    transformers
    Text To Audio

    mingyi456/Ace-Step1.5-DF11-ComfyUI

    148
    13
    diffusion-single-file
    311 / 400