Performance and interaction assessment of neural network architectures and bivariate smart predict-then-optimize
J. Wen (TU Delft - Algorithmics)
T.E.P.M.F. Abeel (TU Delft - Pattern Recognition and Bioinformatics)
M.M. de Weerdt (TU Delft - Algorithmics)
Abstract
Smart “predict, then optimize” (SPO) (Elmachtoub in Manag Sci 68(1): 9–26, 2022) is an end-to-end learning strategy for models that predict parameters of optimization problems. Unlike minimizing mean squared error (MSE), which targets prediction accuracy alone, SPO aims to ensure that predictions lead to the best possible decisions. The associated loss function, termed SPO loss, measures a decision’s regret relative to the optimal outcome under the realized parameters. Existing literature has demonstrated the viability of SPO; however, these studies often focus on classical optimization problems and benchmark against a limited set of models. In this study, we tackle a decision-making task inspired by real-world challenges, evaluating a wide range of neural network models. Unlike classical problems, our task requires a distinctive setup: two models must be trained collaboratively to predict different variables. Moreover, one of the predicted variables also affects the feasibility of the decisions, further increasing the complexity. While our implementation validates the benefits of SPO, we were surprised to find that models trained exclusively on SPO loss do not consistently attain the minimum regret. A further investigation into hyperparameters shows that well-tuned models learned very similar patterns from the feature set, irrespective of whether MSE or SPO loss was used; in other words, switching from MSE to SPO loss during training primarily affected the layer biases. To improve learning efficacy with SPO loss, we therefore propose prioritizing the learning of feature patterns as the fundamental first step. Possible strategies include using specialized neural network layers to capture deeper patterns more effectively, or simply warming up by training with MSE. A warm-up phase is particularly advantageous for models whose outputs are closely tied to constraints, since their prediction accuracy strongly impacts decision feasibility.
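The notion of decision regret that the SPO loss is built on can be illustrated with a minimal sketch. This is our own toy illustration, not the paper's implementation: the function names and the one-hot "pick the cheapest option" oracle (a stand-in for any linear optimization solver) are assumptions for exposition.

```python
import numpy as np

def spo_regret(c_true, c_pred, solve):
    """Decision regret: the cost, under the true parameters, of the decision
    induced by the predicted parameters, minus the true optimal cost."""
    w_pred = solve(c_pred)   # decision made from predictions
    w_star = solve(c_true)   # hindsight-optimal decision
    return c_true @ w_pred - c_true @ w_star

def solve_one_hot(c):
    """Toy linear oracle: minimize c @ w over one-hot vectors w,
    i.e. select exactly one of the available options."""
    w = np.zeros_like(c, dtype=float)
    w[np.argmin(c)] = 1.0
    return w

c_true = np.array([3.0, 1.0, 2.0])
c_pred = np.array([1.5, 2.0, 2.5])  # misprediction: option 0 looks cheapest
print(spo_regret(c_true, c_pred, solve_one_hot))  # 3.0 - 1.0 = 2.0
```

Note that regret is zero whenever the predictions induce the same decision as the true parameters, even if the predictions themselves are inaccurate; this is the sense in which SPO loss differs from MSE.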
These insights are investigated empirically through two real-world trading scenarios. By leveraging datasets with diverse properties, we demonstrate the novelty and generalizability of our findings.