11 datasets
Emerging4papers using it
2022first seen
The '11 datasets' benchmark contains a variety of datasets used to evaluate the effectiveness of Multimodal Prompt Learning methods, specifically in assessing their performance in robust generalization and cross-modal alignment.
Papers using 11 datasets (4)
- CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language MisalignmentTABED: Test-Time Adaptive Ensemble Drafting for Robust Speculative Decoding in LVLMsFrom Points to Clouds: Learning Robust Semantic Distributions for Multi-modal PromptsLearning Domain Invariant Prompt for Vision-Language Models