NEWWhy single embeddings fail for video.Read the post →
    Models/Visual Question Answering/SakanaAI/TAID-VLM-2B

    TAID-VLM-2B

    by SakanaAI

    Identifier
    Model ID
    SakanaAI/TAID-VLM-2B

    Tags

    transformerssafetensorsinternvl_chatfeature-extractionvisual-question-answeringcustom_codeendataset:TIGER-Lab/Mantis-Instructarxiv:2501.16937base_model:OpenGVLab/InternVL2-2Bbase_model:finetune:OpenGVLab/InternVL2-2Blicense:mitregion:us

    Use TAID-VLM-2B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder