37dl/month
5likes
Identifier
Model ID
DAMO-NLP-SG/VideoRefer-7BTags
transformerssafetensorsvideorefer_qwen2text-generationmultimodal large language modellarge video-language modelvisual-question-answeringenarxiv:2406.07476license:apache-2.0endpoints_compatibleregion:us
Use VideoRefer-7B on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
OrganizationDAMO-NLP-SG
TaskVisual Question Answering
Librarytransformers
Licenseapache-2.0
Downloads/mo37
Likes5
View on HuggingFace
See model card, files, and community discussion