HAT

None, None; None, None; None, None; None, None; None, None

HAT

haplotype assembly tool using short and error-prone long reads

Journal Article (2022)

Author(s)

R. Shirali Hossein Zade (TU Delft - Pattern Recognition and Bioinformatics)

A. Urhan (Broad Institute of MIT and Harvard, TU Delft - Pattern Recognition and Bioinformatics)

A. Assis de Souza (TU Delft - Pattern Recognition and Bioinformatics)

A. Singh (TU Delft - Pattern Recognition and Bioinformatics)

T.E.P.M.F. Abeel (TU Delft - Pattern Recognition and Bioinformatics, Broad Institute of MIT and Harvard)

Research Group

Pattern Recognition and Bioinformatics

Copyright

DOI related publication

https://doi.org/10.1093/bioinformatics/btac702

To reference this document use:

https://resolver.tudelft.nl/uuid:96076582-041e-4d64-9d90-c68cc404af6e

More Info

expand_more

Publication Year

2022

Language

English

Copyright

Research Group

Pattern Recognition and Bioinformatics

Issue number

24

Volume number

38

Pages (from-to)

5352-5359

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Motivation: Haplotypes are the set of alleles co-occurring on a single chromosome and inherited together to the next generation. Because a monoploid reference genome loses this co-occurrence information, it has limited use in associating phenotypes with allelic combinations of genotypes. Therefore, methods to reconstruct the complete haplotypes from DNA sequencing data are crucial. Recently, several attempts have been made at haplotype reconstructions, but significant limitations remain. High-quality continuous haplotypes cannot be created reliably, particularly when there are few differences between the homologous chromosomes. Results: Here, we introduce HAT, a haplotype assembly tool that exploits short and long reads along with a reference genome to reconstruct haplotypes. HAT tries to take advantage of the accuracy of short reads and the length of the long reads to reconstruct haplotypes. We tested HAT on the aneuploid yeast strain Saccharomyces pastorianus CBS1483 and multiple simulated polyploid datasets of the same strain, showing that it outperforms existing tools.

Files

Btac702.pdf

(pdf | 0.775 Mb)