M-3CoT
Emerging3papers using it
2024first seen
The 'M-3CoT' dataset/benchmark is used to evaluate multimodal reasoning strategies in vision-language models by providing a collection of tasks that require complex visual and language interactions.
The 'M-3CoT' dataset/benchmark is used to evaluate multimodal reasoning strategies in vision-language models by providing a collection of tasks that require complex visual and language interactions.