    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 8,569–8,592 of 9,588 models

    Image Feature Extraction

    JWonderLand/StainNet-Base

    38
    timm
    Image Feature Extraction

    gray-apple/dino3

    38
    transformers
    Image Feature Extraction

    ly-corporation/favnavi-vision-ecommerce-v1-base

    38
    1
    transformers
    Text To Audio

    SaberMolaei/speecht5_tts_ckb0

    38
    1
    transformers
    Text To Audio

    awkyu/audiogen-medium

    38
    2
    transformers
    Text To Audio

    Dalision/Omni2Sound

    38
    4
    Video Classification

    CAIR-HKISI/SurgMotion-vitl

    37
    4
    pytorch
    Voice Activity Detection

    videosdk-live/Namo-Turn-Detector-v1-Japanese

    37
    onnxruntime
    Document Question Answering

    HEN10/layoutlmv2_Kb_qa04

    37
    transformers
    Unconditional Image Generation

    pratiktechie22/sd-class-butterflies-32-copy-1

    37
    diffusers
    Image Feature Extraction

    timm/sam2_hiera_small.fb_r896_2pt1

    37
    timm
    Image Feature Extraction

    timm/vit_large_patch16_siglip_gap_384.v2_webli

    37
    timm
    Image Feature Extraction

    timm/vit_base_patch16_siglip_256.webli_i18n

    37
    timm
    Image Feature Extraction

    timm/sam2_hiera_base_plus.fb_r896_2pt1

    37
    timm
    Video Classification

    qualcomm/ResNet-2Plus1D

    37
    pytorch
    Video Classification

    Naman712/Deep-fake-detection

    37
    5
    Video Classification

    d2o2ji/videomae-base-finetuned-kinetics-0409_final_5sec_org_ab7_val_inside_train

    37
    transformers
    Video Classification

    d2o2ji/videomae-base-finetuned-kinetics-0416_final_5sec_org_balanced_replacecroppretest

    37
    transformers
    Video Classification

    d2o2ji/videomae-base-finetuned-kinetics-0410_final_5sec_org_ab7_val_inside_train_04

    37
    transformers
    Text To Audio

    alakxender/mms-tts-div-ft-spk01-m01

    37
    transformers
    Text To Audio

    piyazon/TTS-Roman-Girl-Ug

    37
    transformers
    Zero Shot Classification

    asosoft/KuBERT-Central-Kurdish-BERT-Model

    37
    6
    transformers
    Zero Shot Classification

    takehika/mdeberta-v3-wanli-ja-nli

    37
    transformers
    Zero Shot Classification

    r-f/ModernBERT-large-zeroshot-v1

    37
    2
    transformers