Reconstructing Phylogenetic Networks via Cherry Picking and Machine Learning

None, None; None, None; None, None; None, None

Reconstructing Phylogenetic Networks via Cherry Picking and Machine Learning

Conference Paper (2022)

Author(s)

Giulia Bernardini (University of Trieste, Centrum Wiskunde & Informatica (CWI))

Leo van Iersel (TU Delft - Discrete Mathematics and Optimization)

Esther Julien (TU Delft - Discrete Mathematics and Optimization)

Leen Stougie (Vrije Universiteit Amsterdam, Centrum Wiskunde & Informatica (CWI), Erable)

Research Group

Discrete Mathematics and Optimization

DOI related publication

https://doi.org/10.4230/LIPIcs.WABI.2022.16

Machine Learning Heuristic Phylogenetics Hybridization Cherry Picking

To reference this document use:

https://resolver.tudelft.nl/uuid:29df3e7d-126e-4418-ae2e-3912af9ffc2b

More Info

expand_more

Publication Year

2022

Language

English

Research Group

Discrete Mathematics and Optimization

ISBN (electronic)

9783959772433

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Combining a set of phylogenetic trees into a single phylogenetic network that explains all of them is a fundamental challenge in evolutionary studies. In this paper, we apply the recently-introduced theoretical framework of cherry picking to design a class of heuristics that are guaranteed to produce a network containing each of the input trees, for practical-size datasets. The main contribution of this paper is the design and training of a machine learning model that captures essential information on the structure of the input trees and guides the algorithms towards better solutions. This is one of the first applications of machine learning to phylogenetic studies, and we show its promise with a proof-of-concept experimental study conducted on both simulated and real data consisting of binary trees with no missing taxa.

Files

LIPIcs_WABI_2022_16.pdf

(pdf | 1.25 Mb)