
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 7,513 to 7,536 of 9,588 models

    Object Detection

    EFFGRP/yolov11n-warehouse-pallets-1280

    139
    ultralytics
    Image Segmentation

    SpotLab/MobileViT_DeepLabv3

    139
    transformers
    Zero Shot Image Classification

    pansonga/CLIP-ViT-H-14-laion2B-s32B-b79K

    139
    open_clip
    Image Feature Extraction

    timm/aimv2_1b_patch14_224.apple_pt

    139
    timm
    Image Feature Extraction

    timm/vit_large_patch16_siglip_256.webli

    139
    timm
    Text To Video

    oxide-lab/LTX-Video-0.9.5-diffusers

    139
    diffusers
    Audio Classification

    0xmagical/wavlm-both

    139
    transformers
    Object Detection

    olvallej/yolo_finetuned_fruits

    139
    transformers
    Visual Question Answering

    Bingsu/temp_vilt_vqa

    138
    transformers
    Object Detection

    keremberke/yolov5m-aerial-sheep

    138
    2
    yolov5
    Object Detection

    lewiswatson/yolov8x-tuned-hand-gestures

    138
    9
    ultralytics
    Image Segmentation

    skytnt/anime-seg

    138
    52
    anime_segmentation
    Image Segmentation

    camilletyriard/glacier-segmentation-attention-unet

    138
    keras
    Image Feature Extraction

    timm/vit_so400m_patch14_siglip_224.webli

    138
    1
    timm
    Text To Video

    Warvito/animatediff-motion-adapter-sdxl-v1-0-beta

    138
    4
    diffusers
    Audio Classification

    ahmmedasaad2772/wav2vec2-base-arabic_speech_emotion_recognition

    138
    transformers
    Audio Classification

    0xmagical/wavelm-study

    138
    transformers
    Audio To Audio

    AEmotionStudio/sam-audio-models

    137
    5
    Audio To Audio

    MansfieldPlumbing/Demucs_v4_TRT

    137
    2
    tensorrt
    Document Question Answering

    YuukiAsuna/Vintern-1B-v2-ViTable-docvqa

    137
    2
    transformers
    Object Detection

    chayuto/au-fuel-sign-finder-yolo26n

    137
    ultralytics
    Object Detection

    atalaydenknalbant/Yolov13

    137
    19
    ultralytics
    Image Segmentation

    Xenova/clipseg-rd64

    137
    1
    transformers.js
    Image Feature Extraction

    timm/vit_huge_patch14_224.mae

    137
    timm
    Page 314 of 400