Awesome Conditioning & Control
Conditioning & Control is one of the most active areas in Awesome Generative Models β 1,347 papers in this collection, evaluated on datasets like ImageNet, CIFAR-10, MNIST. A strong starting point is "Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis".
Datasets & benchmarks
Key papers
- Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis (2025)Bingxin Ke et al.10.47
- tempoGAN: A Temporally Coherent, Volumetric GAN for Super-resolution
Fluid Flow (2018)You Xie et al.10.12
- DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models (2023)Ximing Xing et al.8.82
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer (2025)Haoxuan Wang et al.8.49
- Contrastive Flow Matching (2025)George Stoica et al.8.23
- Attention Distillation: A Unified Approach to Visual Characteristics
Transfer (2025)Yang Zhou et al.7.19
- Diffusion Model-Based Image Editing: A Survey (2024)Yi Huang et al.5.97
- A Wavelet Diffusion GAN for Image Super-Resolution (2024)Lorenzo Aloisi and Luigi Sigillo and Aurelio Uncini and Danilo Comminiello5.37
- Projected Coupled Diffusion for Test-Time Constrained Joint Generation (2025)Hao Luan et al.5.03
- CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching (2025)Chen Chen et al.4.66
- Image-to-Image Translation with Diffusion Transformers and CLIP-Based Image Conditioning (2025)Qiang Zhu et al.4.47
- PepALD: Macrocyclic Peptide Generation via Autoregressive Latent Diffusion (2026)Junming Zhang et al.4.39
- Towards Controllable Image Generation through Representation-Conditioned Diffusion Models (2026)Nithesh Chandher Karthikeyan et al.4.33
- Diffusion-Based Ukrainian Handwritten Text Generation with Cross-Domain Style Transfer (2026)Andrii Ahitoliev et al.4.33
- Unsupervised Diffusion Solver for Combinatorial Optimization via Combinatorial Adjoint Matching (2026)Shengyu Feng et al.4.33
- Guidance for Low-Level Perceptual Editing in Unconditional Diffusion Models (2026)Shreyansh Modi et al.4.33
- DiffusionRenderer: Neural Inverse and Forward Rendering with Video
Diffusion Models (2025)Ruofan Liang and Zan Gojcic and Huan Ling and Jacob Munkberg and Jon Hasselgren and Zhi-Hao Lin and Jun Gao and Alexander Keller and Nandita Vijaykumar and Sanja Fidler and Zian Wang4.25
- SummDiff: Generative Modeling of Video Summarization with Diffusion (2025)Kwanseok Kim et al.4.09
- Diffusion Models Are Real-Time Game Engines (2024)Dani Valevski et al.3.97
- Seeing It Before It Happens: In-Generation NSFW Detection for Diffusion-Based Text-to-Image Models (2025)Fan Yang et al.3.97
- StorySync: Training-Free Subject Consistency in Text-to-Image Generation via Region Harmonization (2025)Gopalji Gaur et al.3.97
- Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation (2025)Maximilian Ulmer et al.3.97
- DreamComposer++: Empowering Diffusion Models with Multi-View Conditions for 3D Content Generation (2025)Yunhan Yang et al.3.92
- Conditional Variational Diffusion Models (2023)Gabriel della Maggiora et al.3.91
- Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic
Music Generation (2025)Jincheng Zhang et al.3.81
- Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model (2025)Jonas Brenig et al.3.81
- TurboFill: Adapting Few-step Text-to-image Model for Fast Image
Inpainting (2025)Liangbin Xie et al.3.75
- Memory-Efficient 3D High-Resolution Medical Image Synthesis Using
CRF-Guided GANs (2025)Mahshid Shiri et al.3.70
- Language-Guided Trajectory Traversal in Disentangled Stable Diffusion
Latent Space for Factorized Medical Image Generation (2025)Zahra TehraniNasab et al.3.70
- LS-GAN: Human Motion Synthesis with Latent-space GANs (2025)Avinash Amballa et al.3.59
- Making Time Editable in Video Diffusion Transformers (2026)Konstantin Kuklev et al.3.51
- It\^o maps for any-step SDEs (2026)Zhengkai Pan et al.3.51
- CaricHarmony: Contrastive Diffusion Paths for Identity-Preserving Caricature Synthesis (2026)Dongyu Wang et al.3.51
- Toward 360-Degree Indoor Panorama Editing via Tuning-Free Diffusion Model with Refocusing Cross-Attention (2026)Dinh-Khoi Vo et al.3.51
- Conditioning Matters: Stabilizing Inversion and Attention in Diffusion Image Editing (2026)Zheyuan Zhan et al.3.51
- VideoWeave: Unlocking Geometric Consistency in Video Generation via Joint Geometry-Video Modeling (2026)Xunzhi Xiang et al.3.51
- D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models (2026)Dengyang Jiang et al.3.45
- {\Phi}-Noise: Training-Free Temporal Video Conditioning via Phase-Based Noise Manipulation (2026)Ofir Abramovich et al.3.45
- Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with
Memoryless Stochastic Optimal Control (2024)Carles Domingo-Enrich et al.3.36
- RAD: Region-Aware Diffusion Models for Image Inpainting (2024)Sora Kim et al.3.26
- DEFT: Efficient Fine-Tuning of Diffusion Models by Learning the Generalised $h$-transform (2024)Alexander Denker et al.3.20
- Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models (2024)Sooyeon Go and Kyungmook Choi and Minjung Shin and Youngjung Uh3.20
- Generation of non-stationary stochastic fields using Generative
Adversarial Networks (2022)Alhasan Abdellatif et al.3.19
- Zero-shot Face Editing via ID-Attribute Decoupled Inversion (2025)Yang Hou et al.3.15
- Conditioning diffusion models by explicit forward-backward bridging (2024)Adrien Corenflos et al.3.14
- Faster Diffusion via Temporal Attention Decomposition (2024)Haozhe Liu et al.3.09
- Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model (2024)Yuxuan Zhang et al.3.03
- Generative Latent Diffusion for Efficient Spatiotemporal Data Reduction (2025)Xiao Li and Liangji Zhu and Anand Rangarajan and Sanjay Ranka2.99
- Diffusion models for multivariate subsurface generation and efficient probabilistic inversion (2025)Roberto Miele et al.2.99
- Adaptive Multimodal Protein Plug-and-Play with Diffusion-Based Priors (2025)Amartya Banerjee et al.2.99
- Gramsr: Visual Feature Conditioning For Diffusion-based Super-resolution (2026)Fabio D'Oronzio, Federico Putamorsi, Leonardo Zini, et al.2.95
- PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples (2025)Junyu Liu et al.2.93
- Motion-Zero: Zero-Shot Moving Object Control Framework for
Diffusion-Based Video Generation (2024)Changgu Chen et al.2.92
- LightLab: Controlling Light Sources in Images with Diffusion Models (2025)Nadav Magar et al.2.87
- Stable Diffusion for Data Augmentation in COCO and Weed Datasets (2023)Boyang Deng2.86
- Conditional Stochastic Interpolation for Generative Learning (2023)Ding Huang et al.2.86
- A Diffusion-Based Framework for Occluded Object Movement (2025)Zheng-Peng Duan et al.2.82
- Deep Generative Model-Based Generation of Synthetic Individual-Specific
Brain MRI Segmentations (2025)Ruijie Wang et al.2.82
- Deep Bootstrap (2026)Jinyuan Chang et al.2.77
- Color Alignment in Diffusion (2025)Ka Chun Shum et al.2.76