Image
Visual Question Answering
Answer natural language questions about image content
Note: This playground provides simulated output to showcase functionality. No input data is processed or stored on our servers. Use this demo to explore the VQA model's capabilities before integrating it into your application.
Input
Enter a URL to an image file
Drag and drop an image file here, or click to browse
The VQA model to use. Default: blip-vqa
Maximum answer length. Default: 20
Number of alternative answers to generate. Default: 3
Whether to include context in the answer. Default: true
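The options above could be collected into a request payload along the following lines. This is only a sketch: the page does not give the parameter names, so `image_url`, `model`, `max_answer_length`, `num_alternatives`, and `include_context` are hypothetical names chosen for illustration, and `build_vqa_request` is not part of any documented API. Only the default values come from the list above.

```python
from typing import Any, Dict


def build_vqa_request(
    image_url: str,
    question: str,
    model: str = "blip-vqa",        # default taken from the option list
    max_answer_length: int = 20,    # default taken from the option list
    num_alternatives: int = 3,      # default taken from the option list
    include_context: bool = True,   # default taken from the option list
) -> Dict[str, Any]:
    """Assemble a VQA request payload. All key names are hypothetical."""
    if not image_url.strip():
        raise ValueError("image_url must not be empty")
    if not question.strip():
        raise ValueError("question must not be empty")
    if max_answer_length < 1:
        raise ValueError("max_answer_length must be at least 1")
    if num_alternatives < 0:
        raise ValueError("num_alternatives must not be negative")

    return {
        "image_url": image_url,
        "question": question,
        "model": model,
        "max_answer_length": max_answer_length,
        "num_alternatives": num_alternatives,
        "include_context": include_context,
    }


# Example: rely on the documented defaults for everything but the inputs.
payload = build_vqa_request(
    "https://example.com/car.jpg",  # placeholder URL
    "What color is the car?",
)
```

Validating the values before sending keeps malformed requests from reaching the service; the actual limits the service enforces are not stated on this page.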
Output
{
  "question": "What color is the car?",
  "answer": "red",
  "confidence": 0.96,
  "alternative_answers": ["crimson", "maroon"],
  "context": "image contains a red sports car parked on a street"
}
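A response shaped like the sample above could be read into a typed structure as sketched below. The field names come from the sample output; the `VqaAnswer` class and `parse_vqa_response` helper are hypothetical names introduced for illustration. The sketch assumes `alternative_answers` and `context` may be absent, for example when context is disabled, which this page does not confirm.

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class VqaAnswer:
    question: str
    answer: str
    confidence: float
    alternative_answers: List[str] = field(default_factory=list)
    context: Optional[str] = None


def parse_vqa_response(raw: str) -> VqaAnswer:
    """Parse a JSON response string into a VqaAnswer."""
    data = json.loads(raw)
    return VqaAnswer(
        question=data["question"],
        answer=data["answer"],
        confidence=float(data["confidence"]),
        # Treated as optional here; that is an assumption, not documented.
        alternative_answers=list(data.get("alternative_answers", [])),
        context=data.get("context"),
    )


# The sample output shown on this page.
SAMPLE = (
    '{"question": "What color is the car?", "answer": "red", '
    '"confidence": 0.96, "alternative_answers": ["crimson", "maroon"], '
    '"context": "image contains a red sports car parked on a street"}'
)

result = parse_vqa_response(SAMPLE)
```

Keeping the required fields as plain key lookups means a response missing `answer` or `confidence` fails loudly with a `KeyError` instead of producing a half-filled result.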