A. Heinlein | TU Delft Repository

Sharpened CG Iteration Bound for High-contrast Heterogeneous Scalar Elliptic PDEs

Going Beyond Condition Number

Master thesis (2025) - P.M. Soliman (author) , A. Heinlein (mentor) , H.M. Schuttelaars (graduation committee member) , F.A. Cumaru Silva Alves (mentor)

This thesis addresses the limitations of the classical condition number-based Conjugate Gradient (CG) iteration bound in solving high-contrast heterogeneous scalar elliptic problems, particularly when employing two-level Schwarz preconditioners. The classical bound, which relies ...

This thesis addresses the limitations of the classical condition number-based Conjugate Gradient (CG) iteration bound in solving high-contrast heterogeneous scalar elliptic problems, particularly when employing two-level Schwarz preconditioners. The classical bound, which relies solely on the condition number of the system matrix, fails to accurately predict the convergence behavior in scenarios where the eigenspectrum of the preconditioned system exhibits pronounced clustering and spectral gaps. Motivated by this observation, the thesis develops and analyzes sharpened CG iteration bounds that incorporate detailed spectral information, offering a more nuanced and descriptive understanding of convergence.

Building on foundational work in spectral analysis and iterative solvers, the thesis introduces novel multi-cluster and tail-cluster bounds for the CG method. These bounds are derived through a combination of theoretical analysis and practical algorithms for partitioning eigenspectra, and are validated both analytically and numerically. The new bounds utilize key spectral characteristics, such as cluster condition numbers and spectral width, to more accurately estimate the number of iterations required for convergence. Numerical experiments demonstrate that the sharpened bounds can be up to 1000 times tighter than the classical bound and are effective in distinguishing the robustness of different Schwarz preconditioners.

Despite their improved accuracy, the practical application of these bounds for a priori iteration estimation is challenged by the need for detailed spectral information, which is often unavailable in the early stages of iterative solvers. The thesis discusses heuristic approaches for leveraging partial spectral data and highlights the dependency of bound accuracy on the choice of coefficient functions and preconditioners.

In conclusion, the sharpened CG iteration bounds developed in this work provide a significant advancement in predictive performance analysis for high-contrast elliptic problems. Future research directions include refining cluster partitioning algorithms, improving a priori spectral estimation, and extending the applicability of these bounds to more complex problems and preconditioners.

Domain decomposition-based neural networks for complex shaped domains

Master thesis (2025) - W. Chen (author) , A. Heinlein (mentor) , A. Papapantoleon (graduation committee member) , Amanda Howard (mentor)

Physics-informed neural networks (PINNs) provide a powerful framework for solving differential equations but often encounter difficulties when addressing high-frequency solutions. Finite basis physics-informed neural networks (FBPINNs) improve PINN performance through uniform ove ...

A Hybrid Framework for Accelerating Linear Solvers for Partial Differential Equations

Master thesis (2025) - Y. Wu (author) , A. Heinlein (mentor) , V. Dolean (mentor)

Solving large-scale linear systems derived from partial differential equations (PDEs) is an important problem in the field of scientific computing. Classical stationary iterative methods are effective at eliminating high-frequency components of the error, but struggle with low-fr ...

Solving large-scale linear systems derived from partial differential equations (PDEs) is an important problem in the field of scientific computing. Classical stationary iterative methods are effective at eliminating high-frequency components of the error, but struggle with low-frequency components. Deep learning-based solvers like the Deep Operator Network (DeepONet) are excellent at learning low-frequency functions but suffer from the issue of spectral bias. The Hybrid Iterative Numerical Transferable Solver (HINTS) framework was recently proposed to combine these complementary strengths. However, the original HINTS framework has a significant convergence slowdown in later iterations. This thesis reveals that this problem is primarily caused by two limitations: (1) the accumulation of mid-frequency components in the error due to the different spectral preferences between classical stationary methods and the DeepONet, and (2) a distribution shift between the low-frequency-dominated training data and the mid-frequency-dominated residuals encountered during the the iterative process in the HINTS framework.

To address these limitations, this thesis introduces two enhancement strategies. First, we propose Gradient-Enhanced HINTS (GE-HINTS), a method that incorporates first-order derivative information into the DeepONet's loss function. Motivated by the anti-frequency principle, this approach mitigates the model's spectral bias, and thus improve the performance of HINTS. Second, we develop "HINTS-in-the-loop" training strategies, which makes the DeepONet model aware of the true residual distributions it will encounter during inference. This is achieved through both an offline data augmentation strategy and an online, end-to-end differentiable training loop that optimizes the solver's multi-step performance.

Numerical experiments on benchmark problems demonstrated the effectiveness of our proposed methods. Both GE-HINTS and the HINTS-in-the-loop strategies significantly accelerate the convergence of the single-level HINTS solver. Overall, this thesis provides
both mechanistic understanding and practical strategies for accelerating the HINTS framework. We hope these insights will aid researchers seeking effective hybrid iterative solvers and will contribute to further progress in this area.

Adapting Contrastive Learning Methods for Few-Shot Imputation in High-Dimensional Data

Master thesis (2025) - M.H. Aldorf (author) , A. Heinlein (mentor) , S. Wiarda (mentor) , G.N.J.C. Bierkens (graduation committee member) , Cornelis Vuik (graduation committee member)

High-dimensional data imputation is a critical challenge in semiconductor metrology, where secondary measurements are often purposely omitted to optimize throughput. This thesis examines the Missing By Design (MBD) framework—an industrially motivated scenario in which data are sy ...

High-dimensional data imputation is a critical challenge in semiconductor metrology, where secondary measurements are often purposely omitted to optimize throughput. This thesis examines the Missing By Design (MBD) framework—an industrially motivated scenario in which data are systematically uncollected to reduce measurement overhead—and investigates a range of imputation solutions tailored to the particular complexities of wafer reflectivity and overlay. After establishing the physical, rank-deficient nature of wafer metrology data through singular-value decompositions and principal component analyses, we explore several classes of methods: linear regressions and matrix-completion techniques for baseline comparisons; deep neural-network regression (MLP) to capture nonlinearities; a contrastive-learning adaptation of CLIP for pairwise matching of primary–secondary measurements; and novel Bridge Models that refine coarse CLIP estimates with localized residual translations. Additionally, we integrate overlay-based domain constraints into CLIP via domain-guided neural network regularization (DG), ensuring physically coherent tool-to-tool (T2T) predictions. Comprehensive experiments on proprietary wafer datasets confirm that linear approaches including regressions and matrix completion methods, despite capturing the low rank structure of the data, underperform in downstream overlay and T2T prediction due to subtle nonlinear relationships. Deep neural networks offer strong reconstruction accuracy, yet demand extensive hyperparameter tuning and deeper network structures than contrastive alternatives such as CLIP-like approaches, which yield architecturally efficient, instance-based retrievals, but can lack the precision needed for rigorous overlay alignment. DG regularization, as an extension of the CLIP framework, considerably enhances T2T consistency and reduces raw reconstruction error. Meanwhile, the Bridge Model combines a CLIP-derived coarse imputation with a smaller learnable residual map between encoder domains, bridging global pairwise alignment and localized corrections for improved reconstruction and downstream tasks. Overall, this thesis presents a flexible suite of tools that advance high-dimensional MBD imputation in wafer metrology, offering valuable insights and a robust methodological foundation for future industrial applications.

Preconditioned Krylov Solvers under Shared-Memory Parallelism

Evaluating Convergence, Scalability, and Parallel Overhead

Bachelor thesis (2025) - H.J.G. Reijersen van Buuren (author) , A. Heinlein (mentor) , D.J.P. Lahaye (graduation committee member)

This thesis investigates how preconditioned Krylov subspace methods perform and scale under shared-memory parallelism. The focus is on the Conjugate Gradient (CG) method for symmetric positive definite systems and the Generalized Minimal Residual (GMRES) method for non-symmetric ...

This thesis investigates how preconditioned Krylov subspace methods perform and scale under shared-memory parallelism. The focus is on the Conjugate Gradient (CG) method for symmetric positive definite systems and the Generalized Minimal Residual (GMRES) method for non-symmetric systems. Both solvers are implemented in PyKokkos and applied to finite element discretisation generated with NGSolve. We look at a scalar Laplace problem, a Stokes-like vector problem, and a steady Stokes flow around a NACA,2412 airfoil.For CG, we study Jacobi and symmetric Gauss–Seidel (SGS) preconditioning, strong and weak scaling up to 16 threads, and kernel-level timings. As the mesh is refined, the iteration count grows in line with the increasing condition number of the stiffness matrix. Jacobi reduces the iteration count only slightly but is cheap and fully parallel, leading to runtimes similar to, and sometimes slightly better than, unpreconditioned CG. SGS roughly halves the iteration count, but its forward/backward sweeps are largely sequential, which limits speed-up on many cores and can make SGS slower overall despite faster convergence.For GMRES we analyse the influence of the restart parameter, preconditioning, and polynomial order. Higher-order vector elements lead to more off-diagonal entries in the system matrix, here scalar Jacobi becomes too weak and can even make restarted GMRES slower than using no preconditioner. SGS remains effective in terms of iterations, but this comes with the same parallel limitations as in CG. And overall GMRES shows poor strong and weak scaling on the tested CPUs. For the NACA Stokes system, Jacobi and SGS preconditioning fails, whereas a block preconditioner that respects the velocity–pressure structure shows rapid convergence.Overall, the results show that good performance on shared-memory architectures requires preconditioners that both respect the block structure of the PDE and are highly parallelizable.
Github: https://github.com/Hugoreijersen/Krylov-Subspace-Methods.git

On the Effectiveness of Modeling Uncertain Constraint-Based Utility Functions with Quadratic Polynomials

With Applications in Autonomous Negotiations

Master thesis (2025) - V.M. Þórsson (author) , A. Heinlein (mentor) , C.M. Jonker (graduation committee member) , R.J. Fokkink (graduation committee member) , W.P. Brinkman (graduation committee member)

The curse of dimensionality poses a fundamental challenge in autonomous negotiations: as the number of issues and their interdependencies increase, exhaustive evaluation of the outcome space quickly becomes infeasible. This thesis addresses this problem by introducing a surrogate ...

The curse of dimensionality poses a fundamental challenge in autonomous negotiations: as the number of issues and their interdependencies increase, exhaustive evaluation of the outcome space quickly becomes infeasible. This thesis addresses this problem by introducing a surrogate-based method that approximates uncertain hypercubic constraint-based utility functions with quadratic polynomials. An autonomous negotiation agent can then search for high-utility outcomes in this surrogate model. The research objective was to investigate how efficiently an autonomous negotiation agent can identify high-utility bids with this approach, and how this approach compares to linear approximations and established benchmark agents.

The main contributions of this thesis are threefold. First, it introduces a probabilistic complexity measure for these hypercubic functions, capturing how parameters such as dimensionality, constraint width, the number of constraints, and the number of issues interact to shape the function's complexity. Second, it develops a novel agent that leverages a regression model with quadratic basis functions to construct a surrogate model of a hypercubic constraint-based utility function. Third, it evaluates the agent through extensive experiments, demonstrating how performance scales with complexity. Following the steps outlined in this thesis, the performance of surrogate models can be directly compared.

The results demonstrate that the surrogate-based method is a promising approach, as the agent constructed in this thesis outperforms the agents from the 2014 Automated Negotiating Agent Competition which used similar scenarios as those considered in this thesis. These agents all have in common that they directly search the utility function as opposed to a surrogate model of it. Furthermore, the results indicate that simple basis functions, such as quadratic ones, enable the agent to reach the global maximum of its utility function in low-complexity hypercubic cases, with performance scaling reasonably well up to medium complexity. Beyond this point, however, performance deteriorates rapidly, clearly signaling the need for more expressive surrogate models.

Operator Learning for Loss Parameter Estimation in Dredging Operations

To optimize the suction production on Trailing Suction Hopper Dredgers

Master thesis (2025) - M. Kielhöfer (author) , A. Heinlein (mentor) , G. Jongbloed (graduation committee member) , M.B. van Gijzen (graduation committee member)

Accurate modeling of vacuum dynamics in Trailing Suction Hopper Dredgers (TSHDs) is critical for optimizing suction production and mitigating sensor anomalies. This study proposes a data-driven, physics-guided operator learning framework to estimate the vacuum pressure loss param ...

Activation function trade-offs for training efficiency of Physics-Informed Neural Networks used in solving 1D Burgers’ Equation

Analyzing the impact of the choice of adaptive activation function on the speed and accuracy of generating PDE solutions using PINNs

Bachelor thesis (2025) - R. Mihail (author) , J. Sun (mentor) , A. Heinlein (mentor) , Tie-xing Wang (mentor) , H.S. Hung (graduation committee member)

Physics-Informed Neural Networks(PINNs) have emerged as a potent, versatile solution to solving both forward and inverse problems regarding partial differential equations(PDEs), accomplished through integrating laws of physics into the learning process. The applications of this n ...

Physics-Informed Neural Networks with Adaptive Sampling for Option Pricing

Bachelor thesis (2025) - H.M. Agterberg (author) , J. Sun (mentor) , A. Heinlein (mentor) , T. Wang (mentor) , H.S. Hung (graduation committee member)

Today, machine learning has an accelerated impact in quantitative finance. Current models require large amounts of data, which can be expensive. A notable area of research, physics-informed neural networks (PINNs), has proven to be effective in approximating problems that are des ...

The impact of different methods of gradient descent on the spectral bias of physics-informed neural networks

Bachelor thesis (2025) - A.F. van den Arend Schmidt (author) , J. Sun (mentor) , A. Heinlein (mentor) , Tao Wang (mentor) , H.S. Hung (graduation committee member)

Physics-Informed Neural Networks (PINNs) are intended to solve complex problems that obey physical rules or laws but have noisy or little data. These problems are encountered in a wide range of fields including for instance bioengineering, fluid mechanics, meta-material design an ...

Analyzing the Impact of Adaptive Weighting in Self-Adaptive Physics-Informed Neural Networks for Solving PDEs

Bachelor thesis (2025) - J.P. Mańkowski (author) , J. Sun (mentor) , A. Heinlein (mentor) , T. Wang (mentor) , H.S. Hung (graduation committee member)

Self-Adaptive Physics-Informed Neural Networks
(SA-PINNs) are a variation of traditional Physics-Informed Neural Networks (PINNs) designed to
solve the challenges of solving ”stiff” partial differential equations (PDEs). By using adaptive weighting, SA-PINNs are able to f ...

Leveraging Parallel Schwarz Domain Decomposition

Using node level parallelism for the implementation of the parallel Schwarz method

Bachelor thesis (2024) - K.J. Gimbergh (author) , A. Heinlein (mentor) , B. van den Dries (graduation committee member)

This thesis concerns the implementation of parallel Schwarz domain decomposition using node-level parallelism, focusing on the parallel Schwarz method in comparison with the Jacobi iterative method. The study goes into the complexities of domain decomposition methods for solving ...

The Application of Neural Operators to Predict Skin Evolution After Burn Trauma

Master thesis (2024) - S. Husanović (author) , A. Heinlein (mentor) , F.J. Vermolen (mentor) , M.B. van Gijzen (graduation committee member) , E.G. Rens (graduation committee member)

Burn injuries present a significant global health challenge. Among the most severe long-term consequences are contractures, which can lead to functional impairments and disfigurement. Understanding and predicting the evolution of post-burn wounds is crucial for developing effecti ...

Data-driven turbulence modeling of two-phase flows in nuclear reactors

Master thesis (2024) - G. Bonilla (author) , A. Heinlein (mentor) , D. Toshniwal (mentor) , Cornelis Vuik (mentor) , Edo M.A. Frederix (mentor)

Understanding multiphase flows is critical in nuclear engineering, particularly for processes such as coolant dynamics in nuclear reactors and safety scenario analyses involving different fluid phases. Numerical simulations are a valuable tool for studying these phenomena, especi ...

Understanding multiphase flows is critical in nuclear engineering, particularly for processes such as coolant dynamics in nuclear reactors and safety scenario analyses involving different fluid phases. Numerical simulations are a valuable tool for studying these phenomena, especially when experimental approaches are impractical due to cost or safety concerns. While direct numerical simulations (DNS) offer detailed insights, their computational expense makes them impractical for turbulent flows, necessitating the use of turbulence models for efficiency.

This thesis introduces a novel machine learning framework designed to improve Reynolds-averaged Navier-Stokes models in turbulent stratified gas-liquid flows while employing the Boussinesq approximation. The framework encompasses two methods for turbulent viscosity field inversion and introduces correction terms in the turbulence model equations to ensure an accurate prediction of the turbulent viscosity field. Through sparse symbolic regression, the framework consistently discovers models that improve the accuracy of the baseline RANS model, even in untrained flow scenarios, though further testing is needed for varied flow regimes.

Key findings include the superior performance of sparse symbolic regression models over neural network (NN) models in improving the baseline RANS model accuracy. Notably, LASSO and elastic net techniques yielded the most successful models, significantly reducing baseline errors. However, these models did not surpass the Egorov damping approach in terms of accuracy, indicating the need for further refinement.

The developed models were numerically stable and robust, which is important for practical use. However, a main limitation is that the models' accuracy during training did not always correlate with the results when coupled with the RANS equations. Moreover, data from more varied flow conditions is needed to properly assess the generalizability of the models.

Overall, this research highlights the potential of data-driven turbulence modelling to enhance two-phase flow simulations, marking a significant step forward while also identifying areas for future improvement and exploration.

Creating a machine learning-based outlier removal algorithm that incorporates a priori knowledge of the physics

Master thesis (2024) - T.M. Kamminga (author) , A. Heinlein (mentor) , M.B. van Gijzen (coach) , G.N.J.C. Bierkens (coach) , L. Bekker (coach)

On Whole-Graph Embeddings from Node Feature Distributions

Triangle Count reveals Communities and improves Graph Neural Networks

Master thesis (2024) - L.E. Touwen (author) , J. Komjáthy (mentor) , Pawel Pralat (mentor) , A. Heinlein (coach)

We consider three topics motivated by the Network Exploration Toolkit (NEExT) for building unsupervised graph embeddings. NEExT vectorizes the graphs in a graph collection using the Wasserstein (optimal transport) distance between the distributions of node fe ...

A Domain Decompositionbased CNN Architecture for High-Resolution Image Segmentation

Master thesis (2024) - C. Verburg (author) , A. Heinlein (mentor) , D.J.P. Lahaye (graduation committee member) , Cornelis Vuik (graduation committee member) , Eric Cyr (mentor)

This thesis addresses the challenge of segmenting ultra-high-resolution images. Limitations of current approaches to segment these are that either detailed spatial contextual information is lost or many redundant computations are necessary. To overcome these issues, we propose a ...

Deep learning-based sea ice dynamics modelling

Master thesis (2024) - B.O. Analikwu (author) , A. Heinlein (mentor) , Carolin Mehlmann (mentor) , M. Möller (graduation committee member) , B.J. Meulenbroek (graduation committee member)

In this thesis, a deep learning-based surrogate model for predicting sea ice dynamics is developed that is capable of predicting linear kinematic features in a high-resolution setting. Predicting sea ice dynamics at high resolutions is critical for understanding climate patterns ...

In this thesis, a deep learning-based surrogate model for predicting sea ice dynamics is developed that is capable of predicting linear kinematic features in a high-resolution setting. Predicting sea ice dynamics at high resolutions is critical for understanding climate patterns and enabling safe navigation in Arctic regions. Traditional continuum models based on the viscous-plastic rheology are computationally intensive when employed at high resolutions to capture linear kinematic features (LKFs), which are narrow zones of deformation in sea ice.

A supervised learning approach was adopted, focusing on convolutional neural network architectures, specifically the U-Net and a classic bottleneck model. Various loss functions were explored, including traditional metrics like the MSE and novel domain-specific functions such as the strain rate error (SRE), which incorporates physical knowledge of sea ice behaviour. The models were trained on a dataset generated from numerical simulations of sea ice dynamics on a 2 km grid.

The key findings indicate that the U-Net architecture combined with the MSE+SRE loss function outperforms other models. This architecture's depth and use of skip connections allow it to capture complex, multi-scale patterns inherent in sea ice dynamics. The integration of the SRE into the loss function significantly enhances the model's ability to predict LKFs, demonstrating the benefit of incorporating domain knowledge into machine learning models.

While the surrogate model effectively predicts LKFs for short-term forecasts up to approximately 5.5 hours (10 time steps), errors accumulate over longer periods, leading to significant errors after about 11 hours (20 time steps). However, the efficiency gain of this surrogate model is striking: the computation of 10 time steps can be done within a second, compared to the 30 minutes that traditional numerical methods need. The study also underscores the critical importance of training data selection and preparation in influencing model performance and generalisation capabilities.

In conclusion, this work demonstrates that a deep learning-based surrogate model, particularly utilising the U-Net architecture and the MSE+SRE loss function, can effectively predict sea ice dynamics at high resolutions with significant computational efficiency. The model generates forecasts within seconds, offering a viable alternative to traditional numerical simulations that require hours of computation. Future work should focus on mitigating error accumulation to extend the forecasting horizon and exploring advanced learning methods to further integrate physical insights into the modelling process.

Machine learning for post-storm profile predictions

Using XBeach and convolutional neural network structure U-Net to predict 1D dune erosion profile shapes at the Holland Coast

Master thesis (2023) - P.A.K. van Asselt (author) , José A. Á. Antolínez (mentor) , A.J.H.M. Reniers (mentor) , A. Heinlein (mentor) , Panagiotis Athanasiou (graduation committee member) , R. McCall (graduation committee member)

To reduce computational efforts, surrogate models have been developed for dune erosion prediction. Current surrogate models can describe the relationship between the XBeach input and output (Athanasiou, 2022) and provides a prediction of a morphological indicator based on a param ...

Deep Learning the Dynamics of Mechanical Systems

Bachelor thesis (2023) - A.R. Wigmans (author) , A. Heinlein (mentor) , S. Jain (mentor)

This paper examines whether complex high-dimensional data that describes the dynamics of a cantilever beam can be transformed into a less complex system. In particular, the transformation that is examined is the reduction of the dimension. An essential aspect of this study involv ...