A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins

None, None; None, None; None, None

A thorough analysis of the contribution of experimental, derived and sequence-based predicted protein-protein interactions for functional annotation of proteins

Journal Article (2020)

Author(s)

Stavros Makrodimitris (TU Delft - Pattern Recognition and Bioinformatics, Keygene N.V.)

Marcel Reinders (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

R.C.H.J. van Ham (Keygene N.V., TU Delft - Pattern Recognition and Bioinformatics)

Research Group

Pattern Recognition and Bioinformatics

Copyright

DOI related publication

https://doi.org/10.1371/journal.pone.0242723

To reference this document use:

https://resolver.tudelft.nl/uuid:d32806a6-e184-4de5-877d-7a8a26ce801d

More Info

expand_more

Publication Year

2020

Language

English

Copyright

Research Group

Pattern Recognition and Bioinformatics

Issue number

11

Volume number

15

Pages (from-to)

1-18

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Physical interaction between two proteins is strong evidence that the proteins are involved in the same biological process, making Protein-Protein Interaction (PPI) networks a valuable data resource for predicting the cellular functions of proteins. However, PPI networks are largely incomplete for non-model species. Here, we tested to what extent these incomplete networks are still useful for genome-wide function prediction. We used two network-based classifiers to predict Biological Process Gene Ontology terms from protein interaction data in four species: Saccharomyces cerevisiae, Escherichia coli, Arabidopsis thaliana and Solanum lycopersicum (tomato). The classifiers had reasonable performance in the well-studied yeast, but performed poorly in the other species. We showed that this poor performance can be considerably improved by adding edges predicted from various data sources, such as text mining, and that associations from the STRING database are more useful than interactions predicted by a neural network from sequence-based features.

Files

Journal.pone.0242723.pdf

(pdf | 2.06 Mb)