Generating Globally Optimized Multivariate-Split Survival Trees using GP-GOMEA

Master Thesis (2025)
Author(s)

K.T.C. Boudier (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Peter A.N. Bosman – Mentor (Centrum Wiskunde & Informatica (CWI))

Thalea Schlender – Mentor

Tanja Alderliesten – Mentor (Leiden University Medical Center)

M.M. de Weerdt – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

D.M.J. Tax – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
26-11-2025
Awarding Institution
Delft University of Technology
Programme
Computer Science
Faculty
Electrical Engineering, Mathematics and Computer Science
Downloads counter
46

Abstract

Survival trees are a statistical modeling technique used to predict the time until an event occurs. They are widely valued for their interpretability, as they allow practitioners to understand how different variables influence outcomes. However, traditional survival trees struggle to capture nonlinear relationships and rely on greedy splitting strategies, which limit their performance in complex settings.

This thesis proposes a novel approach that addresses these limitations by generating globally optimized survival trees using Genetic Programming Gene-pool Optimal Mixing Evolutionary Algorithm (GP-GOMEA). By integrating a state-of-the-art evolutionary algorithm into the tree construction process, the resulting survival trees optimize both the structure and the decision nodes at a global level.

The method was evaluated on a synthetic dataset designed to require nonlinear decision boundaries—the XOR problem. Our approach consistently outperformed traditional survival trees, achieving optimal or near-optimal results in the noise-free setting. Moreover, the results show that GP-GOMEA survival trees can maintain a high performance even with a smaller population size and limited data, demonstrating the method’s suitability for problems involving nonlinear interactions.

These findings suggest that GP-GOMEA survival tree is a promising direction for advancing survival tree methodology. Future work should include evaluating the method on real-world survival datasets and further tuning key hyperparameters, such as the number of decision nodes.

No files available

Metadata only record. There are no files for this record.