HASE

Framework for efficient high-dimensional association analyses

Journal Article (2016)
Author(s)

Gennady Roshchupkin (Erasmus MC)

H Adams (Erasmus MC)

Meike Vernooij (Erasmus MC)

Albert Hofman (Erasmus MC)

Cornelia M. van Duijn (Erasmus MC)

Mohammad A. Ikram (Erasmus MC)

Wiro Niessen (TU Delft - ImPhys/Quantitative Imaging, Erasmus MC)

Research Group
ImPhys/Quantitative Imaging
Copyright
© 2016 G. V. Roshchupkin, H. H H Adams, M. W. Vernooij, A. Hofman, C. M. Van Duijn, M. A. Ikram, W.J. Niessen
DOI related publication
https://doi.org/10.1038/srep36076
More Info
expand_more
Publication Year
2016
Language
English
Copyright
© 2016 G. V. Roshchupkin, H. H H Adams, M. W. Vernooij, A. Hofman, C. M. Van Duijn, M. A. Ikram, W.J. Niessen
Research Group
ImPhys/Quantitative Imaging
Volume number
6
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

High-throughput technology can now provide rich information on a person's biological makeup and environmental surroundings. Important discoveries have been made by relating these data to various health outcomes in fields such as genomics, proteomics, and medical imaging. However, cross-investigations between several high-throughput technologies remain impractical due to demanding computational requirements (hundreds of years of computing resources) and unsuitability for collaborative settings (terabytes of data to share). Here we introduce the HASE framework that overcomes both of these issues. Our approach dramatically reduces computational time from years to only hours and also requires several gigabytes to be exchanged between collaborators. We implemented a novel meta-analytical method that yields identical power as pooled analyses without the need of sharing individual participant data. The efficiency of the framework is illustrated by associating 9 million genetic variants with 1.5 million brain imaging voxels in three cohorts (total N = 4,034) followed by meta-analysis, on a standard computational infrastructure. These experiments indicate that HASE facilitates high-dimensional association studies enabling large multicenter association studies for future discoveries.