24 downloads/month
Identifier
Model ID
ZGZzz/SAME
Tags
same, vision-language, navigation, embodied-ai, visual-navigation, mixture-of-experts, multimodal, pytorch, visual-question-answering, en, dataset:R2R, dataset:REVERIE, dataset:RXR, dataset:CVDN, dataset:SOON, dataset:ObjectNav-MP3D, arxiv:2412.05552, license:mit, model-index, region:us
Use SAME on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline Builder
Specification
Organization: ZGZzz
Task: Visual Question Answering
Library: same
License: mit
Downloads/mo: 24
View on HuggingFace
See model card, files, and community discussion