    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 8,089–8,112 of 9,588 models

    Object Detection

    QuincySorrentino/AeroYOLO

    86
    ultralytics
    Voice Activity Detection

    BUT-FIT/diarizen-wavlm-large-s80-mlc

    85
    8
    transformers
    Visual Question Answering

    mPLUG/mPLUG-Owl3-1B-241014

    85
    2
    Object Detection

    Xenova/yolos-small

    85
    transformers.js
    Video Classification

    Dinh/videomae-base-finetuned-kinetics-finetuned-ucf101-subset

    85
    transformers
    Object Detection

    LouLeol/my_full_finetuning

    85
    transformers
    Visual Question Answering

    OpenMed/Qwen2.5-3B-MedVL

    84
    2
    Unconditional Image Generation

    commaai/commavq-gpt2m

    84
    9
    transformers
    Object Detection

    okhytrov/detr-finetuned-visdrone

    84
    transformers
    Object Detection

    esapzoi/litter-detection-yolov8

    84
    ultralytics
    Image Feature Extraction

    apple/aimv2-1B-patch14-224

    84
    8
    transformers
    Video Classification

    Dinh/videomae-base-finetuned-ucf101-subset

    84
    transformers
    Text To Video

    BAAI/nova-d48w1024-osp480

    84
    8
    diffusers
    Zero Shot Classification

    projecte-aina/roberta-base-ca-v2-cawikitc

    84
    1
    transformers
    Object Detection

    HrutikAdsare/waste-detection-yolov8

    84
    1
    ultralytics
    Visual Question Answering

    google/pix2struct-widget-captioning-large

    83
    20
    transformers
    Unconditional Image Generation

    nappa114514/Flux_Klein_illustration_style_transfer

    83
    10
    diffusers
    Object Detection

    akridge/yolo11-fish-detector-grayscale

    83
    3
    ultralytics
    Image Feature Extraction

    facebook/webssl-dino5b-full2b-224

    83
    transformers
    Video Classification

    Dinh/videomae-small-finetuned-kinetics-finetuned-action-finetuned-ucf101-subset

    83
    transformers
    Text To Audio

    mradermacher/CiSiMi-v0.1-GGUF

    83
    transformers
    Object Detection

    Dilipan/detr-finetuned-edzola-form-section

    83
    transformers
    Object Detection

    lifedebugger/table-transformer-finetuned-doclaynet

    83
    transformers
    Object Detection

    chunchun9999/pcb-ai-doctor

    83
    ultralytics
    Page 338 / 400