Controlling Diffusion Models Through Blockwise Gradient Guidance and Sampling

Master Thesis (2025)
Author(s)

M. Goyal (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Anuj Singh – Mentor (Shell Global Solutions International B.V.)

H. Jamali-Rad – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Elvin Isufi – Mentor (TU Delft - Multimedia Computing)

Petr Kellnhofer – Graduation committee member (TU Delft - Computer Graphics and Visualisation)

Faculty
Electrical Engineering, Mathematics and Computer Science
Publication Year
2025
Language
English
Graduation Date
26-06-2025
Awarding Institution
Delft University of Technology
Programme
Computer Science | Data Science and Technology
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Aligning diffusion model outputs with downstream objectives is essential for improving task-specific performance. Broadly, inference-time, training-free approaches for aligning diffusion models can be categorized into two main strategies: sampling-based methods, which explore multiple candidate outputs and select those with higher reward signals, and gradient-guided methods, which use differentiable reward approximations to directly steer the generation process. In this work, we propose a universal algorithm, CoDeX, which brings together the strengths of blockwise sampling and gradient-based guidance in a unified framework. Building on the blockwise sampling paradigm of CoDe, CoDeX integrates local gradient signals during sampling, thereby addressing the sampling inefficiency inherent in complex reward-based sampling approaches such as CoDe. At the same time, it overcomes the limited applicability of traditional gradient-guided methods, which often struggle with non-differentiable rewards. By cohesively combining these two paradigms, CoDeX enables more efficient sampling while offering better trade-offs between reward alignment and divergence from the unconditional diffusion prior. Empirical results demonstrate that CoDeX consistently outperforms CoDe and remains competitive with state-of-the-art baselines across a range of tasks.
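
To make the combination of blockwise sampling and gradient guidance concrete, the sketch below illustrates the general idea in PyTorch: within each block, several candidate continuations are nudged toward higher values of a differentiable surrogate reward, and at the block boundary the candidate with the highest (possibly non-differentiable) reward is kept. The `denoiser`, `reward`, and `surrogate_reward` functions and all hyperparameters are placeholder assumptions for illustration only; this is not the CoDeX implementation described in the thesis.

```python
import torch


def denoiser(x, t):
    """Placeholder reverse-diffusion update (assumption, not a trained model)."""
    return x - 0.01 * x + 0.01 * torch.randn_like(x)


def reward(x):
    """Placeholder reward used for blockwise selection (may be non-differentiable)."""
    return -x.pow(2).mean(dim=(1, 2, 3))


def surrogate_reward(x):
    """Placeholder differentiable reward approximation used for gradient guidance."""
    return -x.pow(2).mean(dim=(1, 2, 3))


def blockwise_gradient_guided_sampling(
    x, num_blocks=10, block_size=5, num_candidates=4, guidance_scale=0.1
):
    for _ in range(num_blocks):
        # Expand the current state into several candidate continuations.
        candidates = x.repeat(num_candidates, 1, 1, 1)
        for t in range(block_size):
            # Local gradient guidance: push each candidate toward higher surrogate reward.
            candidates = candidates.detach().requires_grad_(True)
            grad = torch.autograd.grad(surrogate_reward(candidates).sum(), candidates)[0]
            candidates = (candidates + guidance_scale * grad).detach()
            # One reverse-diffusion step per candidate.
            candidates = denoiser(candidates, t)
        # Blockwise selection: keep only the candidate with the highest reward.
        best = int(reward(candidates).argmax())
        x = candidates[best : best + 1]
    return x


x0 = torch.randn(1, 3, 8, 8)  # start from Gaussian noise
sample = blockwise_gradient_guided_sampling(x0)
```

In this sketch, gradient guidance steers every denoising step locally, while reward-based selection at block boundaries handles objectives that the surrogate cannot differentiate through; increasing `num_candidates` trades compute for reward alignment.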
