NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 90979120 of 9,588 models

    Voice Activity Detection

    BricksDisplay/ten-vad

    14
    Voice Activity Detection

    philschmid/pyannote-speaker-diarization-endpoint

    14
    20
    pyannote-audio
    Document Question Answering

    AprilLim/layoutlmv2-base-uncased-finetuned-test

    14
    transformers
    Document Question Answering

    KonstantinosKakkavas/layoutlmv2-base-uncased_finetuned_docvqa

    14
    transformers
    Table Question Answering

    am5uc/ServiceNow_Table_Question_Answering

    14
    transformers
    Table Question Answering

    google/tapas-medium-finetuned-sqa

    14
    1
    transformers
    Tabular Classification

    dodysuss/autotrain-planes-1918465011

    14
    Tabular Regression

    bibekbehera/autotrain-numeric_prediction-40376105019

    14
    transformers
    Tabular Regression

    jwan2021/autotrain-us-housing-prices-1771761510

    14
    1
    transformers
    Depth Estimation

    a6047425318/room-3d-scene-estimation

    14
    3
    transformers
    Depth Estimation

    facebook/sapiens-depth-1b

    14
    1
    sapiens
    Depth Estimation

    a414166402/DepthSmall

    14
    3
    transformers.js
    Depth Estimation

    Onegafer/glpn-nyu-finetuned-diode-230530-204740

    14
    transformers
    Depth Estimation

    facebook/sapiens-depth-0.6b-bfloat16

    14
    sapiens
    Depth Estimation

    yanjiusheng/marigold-depth-v1-0

    14
    diffusers
    Depth Estimation

    PComConfig/DA3NESTED-GIANT-LARGE-1.1

    14
    depth-anything-3
    Depth Estimation

    facebook/sapiens-depth-2b-bfloat16

    14
    sapiens
    Visual Question Answering

    kimdesok/vilt_finetuned_200

    14
    transformers
    Visual Question Answering

    OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B-448px

    14
    4
    transformers
    Visual Question Answering

    hf-tiny-model-private/tiny-random-ViltForQuestionAnswering

    14
    transformers
    Visual Question Answering

    TeeA/MATCHA-ViChart

    14
    transformers
    Visual Question Answering

    amitha/mllava-llama2-zh

    14
    transformers
    Visual Question Answering

    Keetawan/BLIP2SeaLLMs-1.5B_COCO

    14
    transformers
    Visual Question Answering

    Foreshhh/Qwen2-VL-7B-VLGuard

    14
    1
    380 / 400