Visual Question Answering (VQA)
Emerging9papers using it
2023first seen
Papers using Visual Question Answering (VQA) (9)
- Cross-Modal Attention Guided Unlearning in Vision-Language ModelsDo LVLMs Know What They Know? A Systematic Study of Knowledge Boundary Perception in LVLMsProvoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context LearningTowards Resource-efficient Multimodal Intelligence: Learned Routing Among Specialized Expert ModelsDo Large Vision-language Models Distinguish Between The Actual And Apparent Features Of Illusions?Adaptive Token Boundaries: Integrating Human Chunking Mechanisms into
Multimodal LLMsLarge Language Models are Visual Reasoning CoordinatorsUncertainty-Aware Evaluation for Vision-Language ModelsBoth Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM