NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 71297152 of 9,588 models

    Text To Video

    alibaba-pai/CogVideoX-Fun-V1.1-Reward-LoRAs

    188
    60
    videox_fun
    Text To Video

    jasbloom/Wan2.2-T2V-A14B-Diffusers-bf16-mmxxii-rank256-lora

    188
    diffusers
    Visual Question Answering

    mradermacher/TreeVGR-7B-CI-i1-GGUF

    187
    1
    transformers
    Unconditional Image Generation

    nvidia/NV-Generate-MR-Brain

    187
    17
    Object Detection

    keremberke/yolov5m-garbage

    187
    11
    yolov5
    Object Detection

    0xnu/european-license-plate-recognition

    187
    Object Detection

    nsugianto/detr-resnet50_finetuned_detrresnet50_lsdocelementdetv1type7_s1_2359s_adjparam

    187
    transformers
    Image To Text

    topdu/unirec-0.1b

    187
    6
    Image Feature Extraction

    Ramos-Ramos/dino-resnet-50

    187
    1
    transformers
    Image To Text

    PaddlePaddle/ch_SVTRv2_rec

    187
    PaddleOCR
    Image To Text

    frankmorales2020/gemma-4-e4b-unesco-optimized

    187
    transformers
    Question Answering

    NoCanGo/WaifuGPT

    187
    1
    Object Detection

    stevenbucaille/rf-detr-base

    186
    transformers
    Object Detection

    ciasimbaya/ObjectDetection

    186
    9
    transformers
    Object Detection

    hustvl/yolos-small-dwr

    186
    4
    transformers
    Question Answering

    Alexa067/my_awesome_qa_model

    186
    transformers
    Depth Estimation

    onnx-community/depth-anything-v2-small-ONNX

    185
    transformers.js
    Zero Shot Image Classification

    apple/TiC-CLIP-bestpool-cumulative

    185
    4
    tic-clip
    Image Feature Extraction

    timm/resnet50x4_clip_gap.openai

    185
    timm
    Image Feature Extraction

    StratifAI/PolarisFMv2-huge

    185
    Question Answering

    atharvamundada99/bert-large-question-answering-finetuned-legal

    185
    17
    transformers
    Audio To Audio

    mispeech/dasheng-denoiser

    184
    7
    transformers
    Image To Text

    PaddlePaddle/te_PP-OCRv5_mobile_rec

    184
    PaddleOCR
    Image To Text

    MAGAer13/mplug-owl-llama-7b

    184
    16
    transformers
    298 / 400