NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 75617584 of 9,588 models

    Zero Shot Classification

    protectai/deberta-v3-base-zeroshot-v1-onnx

    133
    4
    transformers
    Object Detection

    cindyxun/detr-finetuned-tesla

    133
    transformers
    Audio Classification

    lugan/SynTTS-Commands-Media-Benchmarks

    133
    1
    keras
    Zero Shot Image Classification

    arampacha/clip-rsicd-v5

    133
    1
    transformers
    Audio To Audio

    maitrix-org/Voila-autonomous-preview

    132
    17
    transformers
    Object Detection

    keremberke/yolov5n-aerial-sheep

    132
    2
    yolov5
    Object Detection

    Adeptschneider/detr-finetuned-arm-unicef-vulnerability-challenge-v1.0

    132
    transformers
    Object Detection

    Biswajit010/crack-transformer

    132
    transformers
    Object Detection

    crumeike/tornadonet-checkpoints

    132
    1
    ultralytics
    Image Segmentation

    qualcomm/FFNet-122NS-LowRes

    132
    pytorch
    Audio Classification

    cstr/ecapa-lid-107-GGUF

    132
    speechbrain
    Image Segmentation

    canvit/probe-ade20k-40k-s512-c9-in21k

    132
    canvit-pytorch
    Object Detection

    keremberke/yolov5s-valorant

    131
    5
    yolov5
    Object Detection

    onnx-community/rfdetr_base-ONNX

    131
    4
    transformers.js
    Image Segmentation

    Xenova/deeplabv3-mobilevit-xx-small

    131
    transformers.js
    Image Feature Extraction

    nvidia/PS3_Lang-1.5K-SigLIP2

    131
    1
    Object Detection

    TheCluster/YOLOv8-CoreML

    131
    6
    Object Detection

    keremberke/yolov5n-football

    130
    8
    yolov5
    Object Detection

    keremberke/yolov5s-smoke

    130
    3
    yolov5
    Object Detection

    lakibrankovic/womrs-yolo

    130
    1
    ultralytics
    Zero Shot Image Classification

    intfloat/mmE5-mllama-11b-instruct

    130
    20
    transformers
    Video Classification

    aap9002/RGB_Optic_Flow_Bend_Classification

    130
    keras
    Audio Classification

    forwarder1121/voice-based-stress-recognition

    130
    1
    Image Segmentation

    as-cle-bert/segformer-v1-breastcancer

    130
    transformers
    316 / 400