R-Bench
Emerging6papers using it
180HF downloads
2HF likes
2024first seen
'R-Bench' is a benchmark dataset used to evaluate the performance of models in detecting hallucinations in outputs generated by large Vision-Language Models (LVLMs).
Papers using R-Bench (6)
- SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and PlanningNative Visual Understanding: Resolving Resolution Dilemmas In Vision-language ModelsMore Thinking, Less Seeing? Assessing Amplified Hallucination In Multimodal Reasoning ModelsRo-bench: Large-scale Robustness Evaluation Of Mllms With Text-driven Counterfactual VideosCutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge BaseEvaluating and Analyzing Relationship Hallucinations in Large
Vision-Language Models