Inferring phylogenetic networks from multifurcating trees via cherry picking and machine learning

Journal Article (2024)
Author(s)

Giulia Bernardini (University of Trieste)

Leo Van Iersel (TU Delft - Discrete Mathematics and Optimization)

E.A.T. Julien (TU Delft - Discrete Mathematics and Optimization)

Leen Stougie (Centrum Wiskunde & Informatica (CWI), INRIA-Erable, Vrije Universiteit Amsterdam)

Research Group
Discrete Mathematics and Optimization
DOI related publication
https://doi.org/10.1016/j.ympev.2024.108137
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Discrete Mathematics and Optimization
Volume number
199
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The Hybridization problem asks to reconcile a set of conflicting phylogenetic trees into a single phylogenetic network with the smallest possible number of reticulation nodes. This problem is computationally hard and previous solutions are limited to small and/or severely restricted data sets, for example, a set of binary trees with the same taxon set or only two non-binary trees with non-equal taxon sets. Building on our previous work on binary trees, we present FHyNCH, the first algorithmic framework to heuristically solve the Hybridization problem for large sets of multifurcating trees whose sets of taxa may differ. Our heuristics combine the cherry-picking technique, recently proposed to solve the same problem for binary trees, with two carefully designed machine-learning models. We demonstrate that our methods are practical and produce qualitatively good solutions through experiments on both synthetic and real data sets.