16dl/month
1likes
Identifier
Model ID
DAMO-NLP-SG/VideoLLaMA2-72B-BaseTags
transformersqwen2text-generationmultimodal large language modellarge video-language modelvisual-question-answeringendataset:OpenGVLab/VideoChat2-ITdataset:Lin-Chen/ShareGPT4Vdataset:liuhaotian/LLaVA-Instruct-150Karxiv:2406.07476arxiv:2306.02858license:apache-2.0endpoints_compatibleregion:us
Use VideoLLaMA2-72B-Base on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
OrganizationDAMO-NLP-SG
TaskVisual Question Answering
Librarytransformers
Licenseapache-2.0
Downloads/mo16
Likes1
View on HuggingFace
See model card, files, and community discussion