A dissimilarity-based multiple instance learning approach for protein remote homology detection

Journal Article (2019)
Authors

Antonella Mensi (University of Verona)

Manuele Bicego (University of Verona)

Pietro Lovato (University of Verona)

Marco Loog (TU Delft - Pattern Recognition and Bioinformatics)

David M.J. Tax (TU Delft - Pattern Recognition and Bioinformatics)

Research Group
Pattern Recognition and Bioinformatics
To reference this document use:
https://doi.org/10.1016/j.patrec.2019.08.027
More Info
expand_more
Publication Year
2019
Language
English
Research Group
Pattern Recognition and Bioinformatics
Volume number
128
Pages (from-to)
231-236
DOI:
https://doi.org/10.1016/j.patrec.2019.08.027

Abstract

We study the problem of Protein Remote Homology Detection, which assesses the functional similarity of two proteins. We approach this as a problem of binary multiple-instance learning (MIL) that aims to distinguish between homologous and non-homologous proteins. The particular MIL approach employed is based on the dissimilarity representation in which various schemes of combining N-gram representations are considered. This approach allows us to cope with longer N-grams, capturing a richer biological context, and results in versatile framework offering competitive performance compared to state of the art.

No files available

Metadata only record. There are no files for this record.