NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 77777800 of 9,588 models

    Audio Classification

    JoshEe00/distilhubert-finetuned-gtzan

    112
    transformers
    Video Classification

    Joy28/videomae-base-finetuned-ucf101-subset-finetuned-subset-0401

    111
    transformers
    Video Classification

    Joy28/videomae-base-finetuned-subset-check10

    111
    transformers
    Audio To Audio

    YatharthS/NovaSR

    111
    83
    Visual Question Answering

    gaianet/MiniCPM-V-4-GGUF

    111
    Object Detection

    nsugianto/tblstructrecog_finetuned_detresnet_v1_s1_311s

    111
    transformers
    Object Detection

    DeltaSatellite1/CoinYOLO

    111
    Object Detection

    Blueccc/furniture_use_data_finetuning

    111
    transformers
    Image Feature Extraction

    JH-C-k/clipL336_TTR

    111
    transformers
    Text To Video

    finetrainers/3dgs-v0

    111
    3
    diffusers
    Text To Audio

    Xasan01/speecht5_finetuned_voxpopuli_fr

    111
    transformers
    Image Segmentation

    BVRA/TurtleDetector

    111
    1
    ultralytics
    Zero Shot Image Classification

    Azazelle/LongClip-L-diffusers

    111
    transformers
    Text To Video

    DFloat11/Wan2.2-T2V-A14B-2-DF11

    111
    4
    diffusers
    Audio Classification

    ChengzhiMu/whisper-base-finetuned-gtzan

    111
    transformers
    Video Classification

    Blueberry2018/SOCAL-cls

    110
    transformers
    Audio To Audio

    speechbrain/sepformer-wham-enhancement

    110
    33
    speechbrain
    Document Question Answering

    DenisAleksandrovich/layoutlmv2-base-uncased_finetuned_docvqa

    110
    transformers
    Object Detection

    keremberke/yolov5n-csgo

    110
    3
    yolov5
    Object Detection

    atalaydenknalbant/asl-yolo-models

    110
    5
    ultralytics
    Object Detection

    kiselyovd/vehicle-keypoints

    110
    ultralytics
    Object Detection

    Joshhhhhhhhhh/detr-resnet-50-finetuned-10-epochs-boat-dataset

    110
    transformers
    Image Segmentation

    qualcomm/FastSam-X

    110
    10
    pytorch
    Image Feature Extraction

    timm/vit_large_patch16_siglip_512.v2_webli

    110
    timm
    325 / 400