NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 76817704 of 9,588 models

    Image Segmentation

    Patnev71/segformer-b0-finetuned-floorplan

    120
    2
    transformers
    Audio Classification

    Sayantan090/wav2vec2-deepfake-voice-detector

    120
    transformers
    Audio Classification

    vocametrix/wav2vec2-xlsr-53-stuttering-classification

    120
    transformers
    Video Classification

    Blessing988/videomae-base-finetuned-ucf101-subset

    119
    transformers
    Video Classification

    Joy28/videomae-base-finetuned-subset-0401

    119
    transformers
    Unconditional Image Generation

    hajar001/stylegan2-ffhq-128

    119
    stylegan-pytorch
    Object Detection

    keremberke/yolov5m-forklift

    119
    3
    yolov5
    Object Detection

    cj94/detr-resnet-50-finetuned-real-boat-dataset

    119
    transformers
    Object Detection

    BjngChjjljng/DETR-fisheye-combine-40epoch

    119
    transformers
    Image Segmentation

    kumuji/Sa2VA-i-8B

    119
    transformers
    Image Feature Extraction

    facebook/webssl-dino7b-full8b-224

    119
    3
    transformers
    Text To Audio

    tuskbyte/vits_welsh_female_monospeaker_dutch_female

    119
    transformers
    Zero Shot Image Classification

    chs20/FARE4-ViT-B-32-laion2B-s34B-b79K

    119
    open_clip
    Audio To Audio

    mlx-community/mel-roformer-kim-vocal-2-mlx

    119
    4
    mlx
    Voice Activity Detection

    cstr/whisper-vad-encdec-asmr-GGUF

    119
    ggml
    Zero Shot Classification

    alexneakameni/gliznet-deberta-v3-base

    119
    transformers
    Video Classification

    Joy28/videomae-base-finetuned-subset

    118
    transformers
    Visual Question Answering

    introvoyz041/OpenMed-SynthVision-MedVL-AIO-GGUF

    118
    transformers
    Visual Question Answering

    BlackB/blip2-pokemon-pokemon

    118
    transformers
    Object Detection

    BjngChjjljng/DETR-fisheye-combine-10epoch

    118
    transformers
    Object Detection

    BjngChjjljng/detr-finetuned_v2

    118
    transformers
    Image Segmentation

    kitsumed/yolov8m_seg-speech-bubble

    118
    27
    Image Feature Extraction

    timm/vit_pe_core_small_patch16_384.fb

    118
    timm
    Image Feature Extraction

    timm/vit_base_patch16_siglip_384.webli

    118
    1
    timm
    321 / 400