    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 8521-8544 of 9,588 models

    Video Classification

    Q-Wind/videomae-base-finetuned-ucf101-newone

    40
    transformers
    Text To Audio

    arham061/speecht5_finetuned_voxpopuli_nl

    40
    1
    transformers
    Text To Audio

    rhymeswithlion/MIDI-LLM_Llama-3.2-1B-Q8_0-GGUF

    40
    1
    transformers
    Zero Shot Classification

    mjwong/multilingual-e5-large-xnli-anli

    40
    2
    transformers
    Zero Shot Classification

    smartytrios/docintel_ocr_llama_3_2_gguf

    40
    1
    Visual Question Answering

    Kevin0217/vilt_finetuned_200

    40
    transformers
    Visual Question Answering

    BAAI/Aquila-VL-2B-llava-qwen

    40
    62
    transformers
    Visual Question Answering

    YifanQiao/qwen3vl4b-hist-qa-checkpoint-723

    40
    peft
    Visual Question Answering

    MahimaNR/vilt_finetuned_200

    40
    transformers
    Visual Question Answering

    OpenDataArena/MMFineReason-8B

    39
    10
    Unconditional Image Generation

    satishpaib/sd-class-butterflies-32-copy-5

    39
    diffusers
    Unconditional Image Generation

    lilili696969/sd-class-butterflies-32

    39
    diffusers
    Unconditional Image Generation

    nassim-walha/sd-class-butterflies-32

    39
    diffusers
    Unconditional Image Generation

    ziyizhou/sd-class-butterflies-32

    39
    diffusers
    Unconditional Image Generation

    ym999ai/sd-class-butterflies-32

    39
    diffusers
    Unconditional Image Generation

    Sangeegk/sd-class-butterflies

    39
    diffusers
    Unconditional Image Generation

    Ron936/ddpm-celebahq-finetuned-butterflies-2epochs

    39
    diffusers
    Unconditional Image Generation

    ektagala24/sd-class-butterflies-32-copy-5

    39
    diffusers
    Image Feature Extraction

    AvitoTech/SigLIP-Base-for-animal-identification

    39
    2
    transformers
    Depth Estimation

    Manojb/dpt-large

    39
    Video Classification

    d2o2ji/videomae-base-finetuned-kinetics-0313-clip_duration-abnormal09_shortsidescale

    39
    transformers
    Text To Audio

    alakxender/mms-tts-div-finetuned-md-f01

    39
    transformers
    Zero Shot Classification

    NDugar/3epoch-3large

    39
    2
    transformers
    Visual Question Answering

    ayushk4/smol-gpt4

    39
    1
    transformers