
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 7753–7776 of 9,588 models

    Object Detection

    BjngChjjljng/DETR-fold0-50epoch

    114
    transformers
    Object Detection

    nsugianto/detr-resnet50_finetuned_detrresnet50_lsdocelementdetv1type7_s1_1158s

    114
    transformers
    Image Segmentation

    nickmuchi/segformer-b4-finetuned-segments-sidewalk

    114
    6
    transformers
    Image Feature Extraction

    StreamFormer/OmniStream

    114
    3
    transformers
    Image Feature Extraction

    apple/aimv2-3B-patch14-336

    114
    6
    transformers
    Image Feature Extraction

    xjh19972/QAA-1024

    114
    2
    Text To Video

    naomiKenKorem/LTXV_13B_LoRA_pose_tmp

    114
    diffusers
    Audio Classification

    GradientDescent2718/LS-EEND-ONNX

    114
    onnx
    Text To Audio

    facebook/musicgen-stereo-melody

    114
    11
    transformers
    Audio Classification

    Joserzapata/distilhubert-finetuned-gtzan

    114
    transformers
    Object Detection

    Xenova/gelan-e

    113
    transformers.js
    Object Detection

    Xenova/gelan-e_all

    113
    transformers.js
    Object Detection

    Yvonne511/yolov8_journals

    113
    transformers
    Object Detection

    BjngChjjljng/detr-finetuned

    113
    transformers
    Zero Shot Image Classification

    facebook/metaclip-2-worldwide-m16

    113
    4
    transformers
    Image Feature Extraction

    timm/vit_base_patch32_clip_256.datacompxl

    113
    timm
    Audio Classification

    DenBor/distilhubert-finetuned-gtzan

    113
    transformers
    Object Detection

    Arjun9350/ai24x7-cctv-qwen3-vl-8b-gguf

    113
    Video Classification

    Afaan97/videomae-base-finetuned-myvideos-subset-v2

    112
    transformers
    Object Detection

    keremberke/yolov5s-construction-safety

    112
    3
    yolov5
    Object Detection

    Charliesgt/pollencounter_detr_resnet50-dc5

    112
    transformers
    Zero Shot Image Classification

    visheratin/nllb-clip-base-siglip

    112
    1
    open_clip
    Audio Classification

    der02/sinama-translator

    112
    1
    keras
    Image Segmentation

    qualcomm/DeepLabV3-ResNet50

    112
    pytorch
    Page 324 of 400