Identifier
Model ID: DAMO-NLP-SG/VideoRefer-7B-stage2.5
Tags
transformers, safetensors, videorefer_qwen2, text-generation, multimodal large language model, large video-language model, visual-question-answering, en, arxiv:2406.07476, license:apache-2.0, endpoints_compatible, region:us
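The tags above mix plain labels (library, task, language) with `key:value` entries such as `arxiv:2406.07476` and `license:apache-2.0`. As a minimal sketch of how the two kinds might be separated, the helper below splits each tag on its first colon. The function name `split_tags` and the `TAGS` list are illustrative assumptions, not part of any Mixpeek or Hugging Face API.

```python
# Hypothetical helper: separates plain tags from "key:value" tags.
# TAGS is copied from the listing above; nothing here calls a real API.
TAGS = [
    "transformers", "safetensors", "videorefer_qwen2", "text-generation",
    "multimodal large language model", "large video-language model",
    "visual-question-answering", "en", "arxiv:2406.07476",
    "license:apache-2.0", "endpoints_compatible", "region:us",
]


def split_tags(tags):
    """Return (plain_tags, keyed_tags) where keyed_tags maps key -> list of values."""
    plain, keyed = [], {}
    for tag in tags:
        key, sep, value = tag.partition(":")
        if sep:
            # "license:apache-2.0" -> keyed["license"] == ["apache-2.0"]
            keyed.setdefault(key, []).append(value)
        else:
            plain.append(tag)
    return plain, keyed


plain, keyed = split_tags(TAGS)
print(keyed["license"])  # the license value listed under Specification
```

Values are kept in lists because a model can carry several tags with the same key, for example more than one `arxiv:` entry.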
Use VideoRefer-7B-stage2.5 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Specification
Organization: DAMO-NLP-SG
Task: Visual Question Answering
Library: transformers
License: apache-2.0
Downloads/mo: 15
Likes: 2