Metabolomic predictors of phenotypic traits can replace and complement measured clinical variables in population-scale expression profiling studies

Journal Article (2022)
Author(s)

Anna Niehues (Radboud University Medical Center)

Daniele Bizzarri (Leiden University Medical Center)

Marcel J.T. Reinders (Leiden University Medical Center, TU Delft - Pattern Recognition and Bioinformatics)

P. Eline Slagboom (Max Planck Institute for Biology of Ageing, Leiden University Medical Center)

Alain J. van Gool (Radboud University Medical Center)

Erik B. van den Akker (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

Peter A.C. 't Hoen (Radboud University Medical Center)

Contributor(s)

DOI related publication
https://doi.org/10.1186/s12864-022-08771-7 Final published version
More Info
expand_more
Publication Year
2022
Language
English
Journal title
BMC Genomics
Issue number
1
Volume number
23
Article number
546
Downloads counter
255
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Population-scale expression profiling studies can provide valuable insights into biological and disease-underlying mechanisms. The availability of phenotypic traits is essential for studying clinical effects. Therefore, missing, incomplete, or inaccurate phenotypic information can make analyses challenging and prevent RNA-seq or other omics data to be reused. A possible solution are predictors that infer clinical or behavioral phenotypic traits from molecular data. While such predictors have been developed based on different omics data types and are being applied in various studies, metabolomics-based surrogates are less commonly used than predictors based on DNA methylation profiles.In this study, we inferred 17 traits, including diabetes status and exposure to lipid medication, using previously trained metabolomic predictors. We evaluated whether these metabolomic surrogates can be used as an alternative to reported information for studying the respective phenotypes using expression profiling data of four population cohorts. For the majority of the 17 traits, the metabolomic surrogates performed similarly to the reported phenotypes in terms of effect sizes, number of significant associations, replication rates, and significantly enriched pathways.The application of metabolomics-derived surrogate outcomes opens new possibilities for reuse of multi-omics data sets. In studies where availability of clinical metadata is limited, missing or incomplete information can be complemented by these surrogates, thereby increasing the size of available data sets. Additionally, the availability of such surrogates could be used to correct for potential biological confounding. In the future, it would be interesting to further investigate the use of molecular predictors across different omics types and cohorts.