← all datasets

Visual Question Answering

Emerging
6 papers using it
2024 first seen

Visual Question Answering (VQA) is a benchmark that evaluates the ability of models to answer questions about images, using both multiple-choice and caption-based tasks.

Papers using Visual Question Answering (6)

Visual Question Answering - datasets - multimodal