NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 70097032 of 9,588 models

    Image Segmentation

    stevenbucaille/rf-detr-seg-xlarge

    213
    transformers
    Object Detection

    yasirfaizahmed/android_ui_detection_yolov8

    212
    4
    ultralytics
    Object Detection

    mipedro1/yolo_raccoon_detector

    212
    transformers
    Question Answering

    potsawee/longformer-large-4096-answering-race

    212
    16
    transformers
    Text To Video

    beishan2024/clip_vision_h.safetensors

    212
    Audio To Audio

    speechbrain/sepformer-wsj03mix

    211
    7
    speechbrain
    Object Detection

    lucasaf04/yolo_finetuned_fruits

    211
    transformers
    Image To Text

    StanfordAIMI/CheXagent-2-3b-srrg-findings

    211
    1
    transformers
    Image To Text

    mweinbach/nemotron-ocr-v2-coreml

    211
    coreml
    Text To Audio

    sil-ai/dgo-tts-training-data-speecht5-b

    210
    transformers
    Object Detection

    mshamrai/yolov8l-visdrone

    210
    5
    ultralytics
    Object Detection

    is36e/detr-resnet-101-dc5-sku110k

    210
    transformers
    Image Segmentation

    ZhengPeng7/BiRefNet-DIS5K-TR_TEs

    210
    birefnet
    Zero Shot Image Classification

    facebook/metaclip-2-mt5-worldwide-s16

    210
    4
    transformers
    Zero Shot Image Classification

    laion/CLIP-convnext_base_w-laion_aesthetic-s13B-b82K

    210
    5
    open_clip
    Question Answering

    akthegr8/excelpdf

    210
    transformers
    Image To Text

    fhswf/TrOCR_Math_handwritten

    209
    8
    transformers
    Image To Text

    docling-project/ChemicalOCR

    209
    1
    transformers
    Image Feature Extraction

    iszt/RETFound_mae_natureOCT

    209
    transformers
    Audio Classification

    mtg-upf/discogs-maest-30s-pw-129e-519l

    209
    transformers
    Question Answering

    sjrhuschlee/bart-base-squad2

    209
    transformers
    Image To Text

    Xenova/texify2

    208
    3
    transformers.js
    Image Feature Extraction

    MahmoodLab/KRONOS

    208
    28
    kronos
    Question Answering

    johnjose223/xlnet_squad

    208
    transformers
    293 / 400