NEWAgents can now see video via MCP.Try it now →
    Models/Visual Question Answering/DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base
    Visual Question Answeringtransformersapache-2.0

    VideoLLaMA2.1-7B-16F-Base

    by DAMO-NLP-SG

    Identifier
    Model ID
    DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base

    Tags

    transformersvideollama2_qwen2text-generationmultimodal large language modellarge video-language modelvisual-question-answeringendataset:OpenGVLab/VideoChat2-ITdataset:Lin-Chen/ShareGPT4Vdataset:liuhaotian/LLaVA-Instruct-150Karxiv:2406.07476arxiv:2306.02858license:apache-2.0endpoints_compatibleregion:us

    Use VideoLLaMA2.1-7B-16F-Base on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder