NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 71057128 of 9,588 models

    Tabular Classification

    Alexei1/imdb

    192
    1
    transformers
    Object Detection

    projectsidewalk/rampnet-model

    192
    1
    Image Feature Extraction

    OpenGVLab/InternViT-6B-448px-V1-0

    192
    9
    transformers
    Text To Audio

    AddisuSeteye/speecht5_tts_amharic

    191
    transformers
    Image To Text

    qualcomm/EasyOCR

    191
    40
    pytorch
    Image Feature Extraction

    gsumbul/SMARTIES-v1-ViT-B

    191
    2
    transformers
    Image To Text

    Rattatammanoon/hurricane-ocr-tlpr-v1-LoRA

    191
    1
    peft
    Depth Estimation

    aarondevstack/DepthPro-512x512-coreml

    191
    coreml
    Text To Audio

    nikolab/speecht5_tts_hr

    190
    5
    transformers
    Object Detection

    AnnaZhang/lwdetr_medium_60e_coco

    190
    transformers
    Zero Shot Image Classification

    laion/CLIP-ViT-B-16-CommonPool.L-s1B-b8K

    190
    open_clip
    Image To Text

    Xenova/trocr-base-printed

    190
    1
    transformers.js
    Image To Text

    noctrex/LightOnOCR-2-1B-bbox-soup-GGUF

    190
    1
    Object Detection

    mradermacher/Polaris-VGA-2B-Post1.0-GGUF

    189
    transformers
    Object Detection

    allenai/WildDet3D

    189
    37
    wilddet3d
    Image Segmentation

    waterfall109/FRTSearch

    189
    2
    pytorch
    Image Feature Extraction

    apple/aimv2-large-patch14-336

    189
    6
    transformers
    Image Feature Extraction

    timm/aimv2_large_patch14_224.apple_pt

    189
    timm
    Text To Video

    AlekseyCalvin/HSToric_Color_Wan2.2_5B_LoRA_BySilverAgePoets

    189
    2
    diffusers
    Audio Classification

    Mrsmetamorphosis/dementia-wav2vec-scientific-specaugment-V2

    189
    transformers
    Audio Classification

    speechbrain/google_speech_command_xvector

    189
    7
    speechbrain
    Question Answering

    Meshwa/llama3.2-3b-Reflection-v1

    189
    Object Detection

    Enos-123/traffic-accident-detection-yolo11x

    188
    13
    ultralytics
    Image Feature Extraction

    timm/resnet50x16_clip_gap.openai

    188
    timm
    297 / 400