
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 6,865–6,888 of 9,588 models

    Object Detection

    miguelcale04/yolo_finetuned_fruits

    247
    transformers
    Zero Shot Image Classification

    anton96vice/mobileclip2_tflite

    247
    1
    Image To Text

    DnaRnaProteins/qwen2.5-vl-7b-cells-cap

    247
    Zero Shot Image Classification

    ModelsLab/CLIP-ViT-H-14-laion2B-s32B-b79K

    246
    open_clip
    Image Feature Extraction

    timm/vit_so400m_patch16_siglip_384.v2_webli

    245
    timm
    Question Answering

    AnonymousSub/rule_based_roberta_hier_triplet_0.1_epochs_1_shard_1_squad2.0

    245
    transformers
    Reinforcement Learning

    mradermacher/inframind-0.5b-dapo-GGUF

    244
    transformers
    Image Segmentation

    smp-hub/segformer-b2-1024x1024-city-160k

    244
    segmentation-models-pytorch
    Zero Shot Image Classification

    MVRL/taxabind-vit-b-16

    244
    open_clip
    Question Answering

    csarron/roberta-base-squad-v1

    244
    transformers
    Reinforcement Learning

    mradermacher/eubiota-planner-8b-i1-GGUF

    244
    transformers
    Depth Estimation

    google/tipsv2-so400m14-dpt

    243
    3
    transformers
    Image Feature Extraction

    Xenova/dinov2-base

    243
    transformers.js
    Text To Video

    suyehb/Wan2.2-TI2V-5B-GGUF

    243
    gguf
    Question Answering

    PremalMatalia/roberta-base-best-squad2

    243
    1
    transformers
    Question Answering

    mradermacher/Llama-3.1-8B-Instruct-Uz-i1-GGUF

    243
    1
    transformers
    Object Detection

    qualcomm/3D-Deep-BOX

    242
    3
    pytorch
    Object Detection

    deepdoctection/tatr_tab_struct_v2

    242
    3
    transformers
    Image Segmentation

    tue-mps/eomt-dinov3-coco-panoptic-small-640

    242
    transformers
    Zero Shot Image Classification

    UCSC-VLAA/ViT-L-14-CLIPA-336-datacomp1B

    242
    2
    open_clip
    Question Answering

    sinequa/answer-finder-v1-S-en

    242
    transformers
    Question Answering

    hf-tiny-model-private/tiny-random-DistilBertForQuestionAnswering

    242
    transformers
    Image Segmentation

    facebook/mask2former-swin-base-IN21k-cityscapes-semantic

    241
    transformers
    Image To Text

    Flova/omr_transformer

    241
    12
    transformers
    Page 287 of 400