NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 96739696 of 10,221 models

    Video Classification

    AC-1ML/videomae-base-finetuned-ucf101-subset

    17
    transformers
    Video Classification

    Leotrim/videomae-base-finetuned-ucf101-subset

    17
    transformers
    Voice Activity Detection

    aoiandroid/speaker-diarization-coreml

    17
    Voice Activity Detection

    aoiandroid/silero-vad-coreml

    17
    coreml
    Visual Question Answering

    IDEA-CCNL/Ziya-Visual-14B-Chat

    17
    7
    transformers
    Visual Question Answering

    SakanaAI/TAID-VLM-2B

    17
    5
    transformers
    Voice Activity Detection

    BUT-FIT/diarizen-wavlm-large-s80-md-origin

    16
    transformers
    Voice Activity Detection

    philschmid/pyannote-speaker-diarization-endpoint

    16
    20
    pyannote-audio
    Document Question Answering

    YuukiAsuna/VieTable-donut-docvqa-demo

    16
    1
    transformers
    Document Question Answering

    hf-tiny-model-private/tiny-random-LayoutLMv3ForQuestionAnswering

    16
    transformers
    Table Question Answering

    vaishali/multitabqa-base-atis

    16
    1
    transformers
    Tabular Classification

    adgrowr/autotrain-negative-keywords-classifier-61622134846

    16
    transformers
    Tabular Regression

    arviszeile/autotrain-golf-winner-2-87274143425

    16
    transformers
    Tabular Regression

    pcoloc/autotrain-mikrotik-7-7-1860563588

    16
    transformers
    Depth Estimation

    a6047425318/room-3d-scene-estimation

    16
    3
    transformers
    Depth Estimation

    ZEDXULTRA/lotus-depth-d-v1-1

    16
    diffusers
    Depth Estimation

    nielsr/dpt-large-redesign

    16
    transformers
    Depth Estimation

    hf-tiny-model-private/tiny-random-DPTForDepthEstimation

    16
    transformers
    Video Classification

    Dinh/videomae-base-finetuned-kinetics-finetuned-ucf101-subset

    16
    transformers
    Video Classification

    Dinh/videomae-small-finetuned-kinetics-finetuned-action-finetuned-ucf101-subset

    16
    transformers
    Video Classification

    JaehwiJeon/videomae-base-finetuned-ucf101-subset

    16
    transformers
    Video Classification

    gullalc/videomae-base-finetuned-kinetics-movieshots-scale

    16
    transformers
    Video Classification

    Naveengo/videomae-base-finetuned-kinetics-finetuned-ucf101-subset

    16
    transformers
    Video Classification

    mitegvg/videomae-base-finetuned-xd-violence-binary

    16
    transformers
    404 / 426