11dl/month
5likes
Identifier
Model ID
DAMO-NLP-SG/VideoRefer-7BTags
transformerssafetensorsvideorefer_qwen2text-generationmultimodal large language modellarge video-language modelvisual-question-answeringenarxiv:2406.07476license:apache-2.0endpoints_compatibleregion:us
Use VideoRefer-7B on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
OrganizationDAMO-NLP-SG
TaskVisual Question Answering
Librarytransformers
Licenseapache-2.0
Downloads/mo11
Likes5
View on HuggingFace
See model card, files, and community discussion