← all datasets

3D Visual Question Answering (VQA)

Emerging
6 papers using it
2022 first seen

3D Visual Question Answering (VQA) is a benchmark that evaluates how well models understand and reason about 3D scenes. Models answer questions based on visual inputs, and the questions typically involve spatial and contextual information.

Papers using 3D Visual Question Answering (VQA) (6)
