NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 93379360 of 10,221 models

    Tabular Classification

    keras-io/imbalanced_classification

    25
    9
    tf-keras
    Tabular Classification

    keras-io/tab_transformer

    25
    43
    tf-keras
    Depth Estimation

    Onegafer/glpn-nyu-finetuned-diode-230603-091354

    25
    transformers
    Depth Estimation

    pearsonkyle/DepthAnyPanorama-coreml

    25
    Video Classification

    sano90/videomae-base-finetuned-bimanual-subset

    25
    transformers
    Video Classification

    Shawon16/VideoMAE_kinetics_wlasl100_20epoch_Signers

    25
    transformers
    Video Classification

    dmsrud/anomalous_behavior_video_cls_model

    25
    transformers
    Video Classification

    TanAlexanderlz/RALL_RGBCROP_Aug16F-16B16F

    25
    transformers
    Video Classification

    dvs/videomae-base-finetuned-movienet-take2-finetuned-movienet-take3

    25
    transformers
    Video Classification

    dvs/videomae-base-finetuned-movienet

    25
    transformers
    Video Classification

    dvs/videomae-base-finetuned-kinetics-finetuned-movienet-2-2

    25
    transformers
    Zero Shot Classification

    cublya/deberta-v3-large-zeroshot-v2.0

    25
    1
    transformers
    Visual Question Answering

    TIGER-Lab/VL-Reasoner-72B

    25
    3
    transformers
    Visual Question Answering

    unum-cloud/uform-gen-chat

    25
    18
    transformers
    Visual Question Answering

    OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B-448px

    25
    4
    transformers
    Unconditional Image Generation

    diffusers/ddpm-cifar10-32-demo

    25
    1
    diffusers
    Unconditional Image Generation

    Pie31415/dm_anime

    25
    diffusers
    Tabular Classification

    EnumaInc/LKT-lm-based-candidate-2-gbdt-tbkt2601

    25
    lightgbm
    Video Classification

    vishnushenoy09/videomae-tennis-shottype

    24
    Visual Question Answering

    google/pix2struct-ocrvqa-base

    24
    5
    transformers
    Visual Question Answering

    nhattan9999t/blip-kvasir-vqa

    24
    1
    transformers
    Unconditional Image Generation

    huggingnft/cyberkongz

    24
    5
    transformers
    Table Question Answering

    google/tapas-small-finetuned-wikisql-supervised

    24
    7
    transformers
    Tabular Classification

    YuchenShen/FoMo-0D

    24
    390 / 426