    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 8,377–8,400 of 9,588 models

    Zero Shot Classification

    pongjin/roberta_with_kornli

    52
    7
    transformers
    Depth Estimation

    Xenova/depth-anything-large-hf

    51
    4
    transformers.js
    Image Feature Extraction

    almogtavor/vestir-clothing-similarity

    51
    onnxruntime
    Depth Estimation

    MackinationsAi/depth-anything-v2-base-hf

    51
    transformers
    Video Classification

    KG5KEY/videomae-base-finetuned-gemep-epochs25

    51
    transformers
    Text To Audio

    M7Mardani/speecht5_tts_fa

    51
    transformers
    Unconditional Image Generation

    marcelo-victor/sd-class-butterflies-32

    50
    diffusers
    Table Question Answering

    DablSi/tatr-financial-fine-tune

    50
    1
    Table Question Answering

    JT-LM/JT-DA-8B

    50
    2
    Image Feature Extraction

    r3gm/controlnet-union-sdxl-1.0-fp16

    50
    1
    diffusers
    Image Feature Extraction

    timm/sam2_hiera_large.fb_r1024_2pt1

    50
    timm
    Image Feature Extraction

    timm/aimv2_huge_patch14_448.apple_pt

    50
    timm
    Image Feature Extraction

    gaunernst/vit_tiny_patch8_112.adaface_ms1mv3

    50
    2
    timm
    Video Classification

    T-5ive/videomae-base-finetuned-deception-dataset

    50
    transformers
    Text To Audio

    kaab4321/speecht5_finetuned_kaab_tts_full

    50
    transformers
    Zero Shot Classification

    AmelieSchreiber/esm2_t6_8M_UR50D_sequence_classifier_v1

    50
    transformers
    Image Feature Extraction

    AvitoTech/SigLIP2-Base-for-animal-identification

    50
    2
    transformers
    Document Question Answering

    am-infoweb/layoutlmv3-finetuned_docvqa

    49
    3
    transformers
    Table Question Answering

    google/tapas-large-finetuned-wikisql-supervised

    49
    6
    transformers
    Image Feature Extraction

    timm/aimv2_3b_patch14_224.apple_pt

    49
    timm
    Video Classification

    billskar23/videomae-base-videoMAE

    49
    transformers
    Video Classification

    Hemgg/deepfake_model_Video-MAE-1

    49
    transformers
    Video Classification

    mitegvg/videomae-small-kinetics-binary-finetuned-xd-violence

    49
    transformers
    Video Classification

    BBGAME605065444/videomae-base-finetuned-camera_move-subset

    49
    transformers
    Page 350 of 400