6dl/month
Identifier
Model ID
Joe99/visionlanguageTransformerTags
transformerspytorchviltvisual-question-answeringenarxiv:2102.03334license:apache-2.0endpoints_compatibleregion:us
Use visionlanguageTransformer on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
OrganizationJoe99
TaskVisual Question Answering
Librarytransformers
Licenseapache-2.0
Downloads/mo6
View on HuggingFace
See model card, files, and community discussion