← all datasets

M-3CoT

Emerging
3papers using it
2024first seen

The 'M-3CoT' dataset/benchmark is used to evaluate multimodal reasoning strategies in vision-language models by providing a collection of tasks that require complex visual and language interactions.

Papers using M-3CoT (3)

M-3CoT β€” datasets β€” multimodal