3D-VQA

Emerging

4papers using it

2023first seen

The '3D-VQA' dataset/benchmark contains 3D scenes and is used to evaluate models on the task of visual question answering in three-dimensional environments.

🔎 Find this dataset

Papers using 3D-VQA (4)

Computed Tomography Visual Question Answering with Cross-modal Feature Graphing2025

Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes2023 · 7 cites

Evaluating Zero-Shot GPT-4V Performance on 3D Visual Question Answering Benchmarks2024 · 1 cites

3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding2024