NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 69376960 of 9,588 models

    Object Detection

    silvinadatz/detr-resnet-50-hardhat-finetuned

    229
    transformers
    Object Detection

    M1ngiii/detr-fashionpedia

    229
    transformers
    Question Answering

    yrshi/AutoRefine-Qwen2.5-3B-Base

    229
    2
    transformers
    Text To Audio

    PHVK1611/speecht5-tts-hindi-finetuned

    228
    transformers
    Voice Activity Detection

    onnx-community/smart-turn-v3-ONNX

    228
    1
    transformers.js
    Image Segmentation

    facebook/mask2former-swin-base-IN21k-cityscapes-instance

    228
    transformers
    Image Segmentation

    AEmotionStudio/sam3

    227
    1
    Image To Text

    mradermacher/Perseus-Doc-vl-071225-GGUF

    227
    1
    transformers
    Image Feature Extraction

    Manojb/dinov2-with-registers-base

    227
    transformers
    Audio Classification

    BKat/Musical-genres-Classification-Hubert-V1-finetuned-gtzan

    227
    1
    transformers
    Object Detection

    Jesse020202/detr_finetuned_cppe5_poisoned

    226
    transformers
    Image Segmentation

    qualcomm/FCN-ResNet50

    226
    pytorch
    Image Feature Extraction

    timm/sam2_hiera_small.fb_r896

    226
    timm
    Question Answering

    mradermacher/meditron-7b-CPT-SFT-i1-GGUF

    226
    transformers
    Object Detection

    joomatos/yolo_finetuned_raccoon

    225
    transformers
    Image To Text

    mradermacher/DREX-062225-exp-i1-GGUF

    225
    1
    transformers
    Audio Classification

    Simon-Kotchou/ssast-base-patch-audioset-16-16

    225
    transformers
    Image To Text

    cnmoro/tiny-image-captioning

    224
    3
    transformers
    Image Feature Extraction

    timm/vit_base_patch32_clip_224.laion2b

    224
    timm
    Audio Classification

    penkichiai/Riku_Binary_Wav2Vec

    224
    transformers
    Question Answering

    FlagAlpha/Llama2-Chinese-13b-Chat

    224
    274
    transformers
    Question Answering

    mradermacher/Llama-3.1-8B-Instruct-Uz-GGUF

    224
    1
    transformers
    Image Segmentation

    Marco333/segformer-b0-road-scene-7class

    224
    transformers
    Audio Classification

    sanchit-gandhi/whisper-medium-fleurs-lang-id

    223
    16
    transformers
    290 / 400