Generating Globally Optimized Multivariate-Split Survival Trees using GP-GOMEA

Master Thesis (2025)
Author(s)

K.T.C. Boudier (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Peter A.N. Bosman – Mentor (Centrum Wiskunde & Informatica (CWI))

T. Schlender – Mentor

Tanja Alderliesten – Mentor (Leiden University Medical Center)

M.M. de Weerdt – Graduation committee member (TU Delft - Algorithmics)

D.M.J. Tax – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
26-11-2025
Awarding Institution
Delft University of Technology
Programme
['Computer Science']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Survival trees are a statistical modeling technique used to predict the time until an event occurs. They are widely valued for their interpretability, as they allow practitioners to understand how different variables influence outcomes. However, traditional survival trees struggle to capture nonlinear relationships and rely on greedy splitting strategies, which limit their performance in complex settings.

This thesis proposes a novel approach that addresses these limitations by generating globally optimized survival trees using Genetic Programming Gene-pool Optimal Mixing Evolutionary Algorithm (GP-GOMEA). By integrating a state-of-the-art evolutionary algorithm into the tree construction process, the resulting survival trees optimize both the structure and the decision nodes at a global level.

The method was evaluated on a synthetic dataset designed to require nonlinear decision boundaries—the XOR problem. Our approach consistently outperformed traditional survival trees, achieving optimal or near-optimal results in the noise-free setting. Moreover, the results show that GP-GOMEA survival trees can maintain a high performance even with a smaller population size and limited data, demonstrating the method’s suitability for problems involving nonlinear interactions.

These findings suggest that GP-GOMEA survival tree is a promising direction for advancing survival tree methodology. Future work should include evaluating the method on real-world survival datasets and further tuning key hyperparameters, such as the number of decision nodes.

Files

License info not available