GenEval geneval Leaderboard

#	Model	score	Paper
1	Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training	0.89	—
2	MammothModa2: A Unified AR-Diffusion Framework for Multimodal Understanding and Generation	0.87	—
3	UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer	0.87	—

GenEval geneval Leaderboard