Finding Biomarkers for Type 2 Diabetes
A. Das (TU Delft - Electrical Engineering, Mathematics and Computer Science)
T.E.P.M.F. Abeel – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
E.A. van der Toorn – Mentor (TU Delft - Pattern Recognition and Bioinformatics)
D. Calderon Franco – Mentor (TU Delft - BT/Environmental Biotechnology)
T. Höllt – Graduation committee member (TU Delft - Computer Graphics and Visualisation)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Type 2 Diabetes is a very prevalent disease in current times and leads to significant adverse effects. Recently, there has been a growing interest in the association of the human gut microbiome with respect to chronic diseases like Type 2 Diabetes with the aim to identify biomarkers. In this study, we researched the effect of different machine learning and feature selection techniques to identify biomarkers for Type 2 Diabetes that can later be used for diagnosis and prediction. The main methods that we explored were Random Forests,Linear Regression, Support Vector Machines andXGBoost along with mRMR and CMIM as feature selection techniques. These methods were applied to data taken from Europe and China. We found that mRMR improved the performance of the Random Forest classifier compared to CMIM.Apart from finding biomarkers specific to one location, we found that Clostridiales, Clostridium, Roseburia and Lactobacillus could be of interestin the prediction of Type 2 Diabetes irrespective of location. This study verified biomarkers found in previous literature and evaluated several techniquesfor the prediction of the disease across different regions.