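| # | Model | FID | Paper |
|---|-------|-----|-------|
| 1 | REPA (SiT-XL/2 + REPA) | 1.42 | link |
| 2 | TexTok (DiT + text-conditioned tokenizer) | 1.46 | - |
| 3 | MAR-H (Diffusion Loss) | 1.55 | link |
| 4 | VAR-d30 (Visual AutoRegressive) | 1.73 | link |
| 5 | M-VAR-d32 | 1.78 | - |
| 6 | SiT-XL/2 | 2.06 | link |
| 7 | DiT-XL/2 | 2.27 | link |
| 8 | LDM-4 (Latent Diffusion) | 3.60 | link |
| 9 | ADM-G (Guided Diffusion) | 4.59 | link |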
FID on ImageNet 256x256 (class-conditional) leaderboard (fid-imagenet-256)