← all datasets

A-OKVQA

Emerging
1papers using it
2026first seen

A-OKVQA is a dataset that contains visual questions based on real-world photographs, used to evaluate the compositional visual reasoning capabilities of Vision-Language Models.

A-OKVQA β€” datasets β€” computer-vision