    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 7033-7056 of 9,588 models

    Zero Shot Classification

    pitangent-ds/deberta-v3-nli-onnx-quantized

    207
    transformers
    Object Detection

    EsraaFouad/detr_fine_tune_face_detection_final

    207
    transformers
    Image Segmentation

    j-morano/rrwnet-rite

    207
    Zero Shot Image Classification

    fancyfeast/so400m-long

    207
    11
    transformers
    Image To Text

    calcuis/sd3.5-large-turbo

    207
    6
    Image Feature Extraction

    timm/vit_7b_patch16_dinov3.sat493m

    207
    1
    timm
    Image Segmentation

    prem-timsina/segformer-b0-finetuned-food

    206
    6
    transformers
    Question Answering

    varun-v-rao/gpt2-squad-model2

    206
    transformers
    Audio Classification

    mtg-upf/discogs-maest-10s-fs-129e

    206
    transformers
    Object Detection

    vineetsarpal/yolov11n-car-damage

    205
    2
    ultralytics
    Image Feature Extraction

    fushh7/ObjEmbed-4B

    205
    Image Feature Extraction

    OpenGVLab/InternVL-14B-224px

    205
    35
    transformers
    Question Answering

    sinequa/answer-finder.yuzu

    205
    transformers
    Question Answering

    FreedomIntelligence/Apollo-MoE-1.5B

    205
    1
    Question Answering

    HomayounSadri/bert-base-uncased-finetuned-squad-v2

    205
    transformers
    Question Answering

    MilyaShams/rubert-russian-qa-sberquad

    205
    Image To Text

    microsoft/git-base-textcaps

    204
    9
    transformers
    Question Answering

    ancs21/xlm-roberta-large-vi-qa

    203
    4
    transformers
    Depth Estimation

    justinsoberano/depth-ai

    202
    transformers
    Zero Shot Classification

    knowledgator/gliclass-modern-base-v2.0-init

    202
    25
    Image Segmentation

    lygitdata/BiRefNet_garmentiq_backup

    202
    birefnet
    Audio Classification

    ernensbjorn/perch-v2-int8-tflite

    202
    3
    tflite
    Visual Question Answering

    erax-ai/EraX-VL-7B-V1.5

    201
    9
    transformers
    Zero Shot Image Classification

    timm/MobileCLIP2-L-14-OpenCLIP

    201
    2
    open_clip
    Page 294 of 400