Awesome Diffusion Models
Diffusion Models is one of the most active areas in Awesome Generative Models β 2,467 papers in this collection, evaluated on datasets like CIFAR-10, ImageNet, COCO. A strong starting point is "Diffusion Posterior Sampling for General Noisy Inverse Problems".
Datasets & benchmarks
Key papers
- Diffusion Posterior Sampling for General Noisy Inverse Problems (2022)Hyungjin Chung et al.11.59
- Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis (2025)Bingxin Ke et al.10.47
- Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models (2025)Jinjin Zhang et al.10.03
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion
Transformers via In-Context Reflection (2025)Shufan Li et al.9.29
- RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space (2026)Xichen Pan et al.8.86
- Learning Few-Step Diffusion Models by Trajectory Distribution Matching (2025)Yihong Luo et al.8.84
- DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models (2023)Ximing Xing et al.8.82
- One-step Diffusion Models with $f$-Divergence Distribution Matching (2025)Yilun Xu et al.8.70
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer (2025)Haoxuan Wang et al.8.49
- Contrastive Flow Matching (2025)George Stoica et al.8.23
- Diffusion models for inverse problems (2025)Hyungjin Chung et al.7.83
- Latent Space Super-Resolution for Higher-Resolution Image Generation
with Diffusion Models (2025)Jinho Jeong et al.7.82
- A Review on Generative AI For Text-To-Image and Image-To-Image
Generation and Implications To Scientific Images (2025)Zineb Sordo and Eric Chagnon and Daniela Ushizima7.64
- Generative Modeling via Drifting (2026)Mingyang Deng et al.7.47
- CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer (2024)Zhuoyi Yang et al.7.30
- Attention Distillation: A Unified Approach to Visual Characteristics
Transfer (2025)Yang Zhou et al.7.19
- Bolt3D: Generating 3D Scenes in Seconds (2025)Stanislaw Szymanowicz and Jason Y. Zhang and Pratul Srinivasan and Ruiqi Gao and Arthur Brussee and Aleksander Holynski and Ricardo Martin-Brualla and Jonathan T. Barron and Philipp Henzler6.92
- Glance: Accelerating Diffusion Models with 1 Sample (2025)Zhuobai Dong et al.6.89
- Arbitrary-steps Image Super-resolution via Diffusion Inversion (2024)Zongsheng Yue et al.6.61
- Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation (2023)David Junhao Zhang et al.6.55
- GrainPaint: A multi-scale diffusion-based generative model for
microstructure reconstruction of large-scale objects (2025)Nathan Hoffman and Cashen Diniz and Dehao Liu and Theron Rodgers and Anh Tran and Mark Fuge6.23
- Latte: Latent Diffusion Transformer for Video Generation (2024)Xin Ma et al.6.11
- Automated Tuning for Diffusion Inverse Problem Solvers without Generative Prior Retraining (2025)Ya\c{s}ar Utku Al\c{c}alar et al.5.98
- Diffusion Model-Based Image Editing: A Survey (2024)Yi Huang et al.5.97
- Modular MeanFlow: Towards Stable and Scalable One-Step Generative Modeling (2025)Haochen You et al.5.93
- Holding the FP8 Quality Ceiling at 8-Bit Weights and Activations: INT8 and GGUF Post-Training Quantization of Ideogram 4.0 for Consumer GPUs (2026)Deep Gandhi et al.5.49
- Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation (2024)Hongxu Jiang et al.5.40
- A Wavelet Diffusion GAN for Image Super-Resolution (2024)Lorenzo Aloisi and Luigi Sigillo and Aurelio Uncini and Danilo Comminiello5.37
- Removing Structured Noise with Diffusion Models (2023)Tristan S.W. Stevens et al.5.30
- Generative modelling with jump-diffusions (2025)Adrian Baule5.29
- Diffusion Image Prior (2025)Hamadi Chihaoui and Paolo Favaro5.29
- Diffusion Models and Representation Learning: A Survey (2024)Michael Fuest et al.5.20
- Projected Coupled Diffusion for Test-Time Constrained Joint Generation (2025)Hao Luan et al.5.03
- Efficient On-Device Diffusion LLM Inference with Mobile NPU (2026)Tuowei Wang et al.5.01
- XRDiff: Crystal Structure Prediction from Powder X-Ray Diffraction Data Using Diffusion Models (2026)Nofit Segal et al.5.01
- Local Coverage Governs Memorization in Diffusion Models (2026)Claudia Merger et al.5.01
- Studying Image Diffusion Features for Zero-Shot Video Object
Segmentation (2025)Thanos Delatolas et al.4.93
- SupResDiffGAN a new approach for the Super-Resolution task (2025)Dawid Kope\'c et al.4.93
- Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers (2024)Katherine Crowson and Stefan Andreas Baumann and Alex Birch and Tanishq Mathew Abraham and Daniel Z. Kaplan and Enrico Shippole4.90
- GenDR: Lighten Generative Detail Restoration (2025)Yan Wang et al.4.87
- Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video
Generation Control (2025)Zekai Gu et al.4.76
- Enhancing Fetal Plane Classification Accuracy with Data Augmentation Using Diffusion Models (2025)Yueying Tian et al.4.76
- Learning Diffusion Priors from Observations by Expectation Maximization (2024)Fran\c{c}ois Rozet et al.4.74
- CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching (2025)Chen Chen et al.4.66
- Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (2024)Sihyun Yu et al.4.60
- Zero Shot Molecular Generation via Similarity Kernels (2024)Rokas Elijo\v{s}ius et al.4.57
- SINE: SINgle Image Editing with Text-to-Image Diffusion Models (2022)Zhixing Zhang et al.4.48
- PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach (2025)Nitin Rai et al.4.47
- MCCD: Multi-Agent Collaboration-based Compositional Diffusion for
Complex Text-to-Image Generation (2025)Mingcheng Li et al.4.47
- Image-to-Image Translation with Diffusion Transformers and CLIP-Based Image Conditioning (2025)Qiang Zhu et al.4.47
- Diffusion models applied to skin and oral cancer classification (2025)Jos\'e J. M. Uliana et al.4.42
- LoMC: Localized Multidirectional Correction for Refusal Suppression in Routed Foundation Models (2026)Yan Hong et al.4.39
- CineOrchestra: Unified Entity-Centric Conditioning for Cinematic Video Generation (2026)Sharath Girish et al.4.39
- Diffusion Policy Optimization without Drifting Apart (2026)Haozhe Jiang et al.4.39
- Recursively Trained Diffusion Models: Limiting Collapse Distribution and Spectral Characterization (2026)Na\"il B. Khelifa et al.4.39
- Compressing Image Style Training into a Single Model Forward (2026)Zhongjie Duan et al.4.39
- Temporal Backtracking Search for Test-time Generative Video Reasoning (2026)Sejoon Jun et al.4.39
- Mirage Probes: How Vision Models Fake Visual Understanding (2026)Daniel Ben-Levi et al.4.39
- How do Self-Supervised Remote Sensing Vision Models Transfer to Downstream Tasks? (2026)Julia Romero et al.4.39
- Smoothing Dark Areas in Molecular Latent Diffusion (2026)Xi Wang et al.4.39