A-OKVQA
Emerging1papers using it
2026first seen
A-OKVQA is a dataset that contains visual questions based on real-world photographs, used to evaluate the compositional visual reasoning capabilities of Vision-Language Models.
A-OKVQA is a dataset that contains visual questions based on real-world photographs, used to evaluate the compositional visual reasoning capabilities of Vision-Language Models.