3D-VQA
Emerging4papers using it
2023first seen
The '3D-VQA' dataset/benchmark contains 3D scenes and is used to evaluate models on the task of visual question answering in three-dimensional environments.
Papers using 3D-VQA (4)
- Computed Tomography Visual Question Answering with Cross-modal Feature GraphingMulti-CLIP: Contrastive Vision-Language Pre-training for Question
Answering tasks in 3D ScenesEvaluating Zero-Shot GPT-4V Performance on 3D Visual Question Answering
Benchmarks3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding