M.J.T. Reinders | TU Delft Repository

Rheumatic Digital Twin

Proposed Machine Learning–Based Multimodal Framework to Inform Clinical Decision-Making

Review (2026) - Daniyal Selani, Rachel Knevel, Marcel Reinders, Erik B. van den Akker

Rheumatic diseases are chronic, immune-mediated conditions characterized by significant heterogeneity in presentation and disease course. However, current clinical approaches often rely on snapshot-based assessments that fail to capture the complex longitudinal evolution of these conditions. To address these limitations and support the implementation of precision medicine, we present the design for the Rheumatic Digital Twin, a novel, modular conceptual framework intended to integrate heterogeneous multimodal data, ranging from electronic health records and clinical notes to imaging and omics, into a dynamic, computational representation of the patient journey. Our theoretical architecture addresses challenges related to data silos and variable availability of data modalities through a multistage approach that envisions the use of domain-specific foundation models to independently process distinct data modalities. To effectively model the temporal progression inherent in chronic diseases, the proposed design utilizes Transformer architectures, leveraging self-attention mechanisms to treat patient events, such as lab results or medication changes, as sequential data tokens. We describe how these unimodal representations would subsequently be fused via joint embedding techniques to construct a shared, multimodal representational space. Envisioned to function analogously to a recommender system, the Rheumatic Digital Twin framework is modeled to map patients into a latent space where proximity reflects clinical and biological similarity. By identifying “nearest neighbors,” historical patients with comparable trajectories, the system aims to enable in silico cohorting, theoretically allowing clinicians to forecast key clinical events, predict treatment responses, and identify likely disease courses based on the outcomes of similar peers. ...

ProtFI, an efficient frailty-trained proteomics-based biomarker of aging, robustly predicts age-related decline

Journal article (2026) - Swier Garst, Lieke Kuiper, Erik van den Akker, Niels van den Berg, Mohsen Ghanbari, Simon Mooijaart, Marian Beekman, Marcel Reinders, P. Eline Slagboom, Joyce van Meurs

Many molecular aging biomarkers have been developed to capture heterogeneity in individual aging rates. Yet, systematic comparison of the modeling choices underlying these biomarkers has been limited. In this study, we trained aging biomarkers on the Rockwood frailty index (FI) and all-cause mortality using UK Biobank Olink proteomics and metabolomics (¹H-NMR) data (n = 40,696). We systematically established the impact of model choice, target outcome, and molecular data source on several age-related outcomes. From this, we developed two aging biomarkers, ProteinFrailty (ProtFI) and ProteinMortality (ProtMort), which are both ElasticNet models that use a minimal set of proteins to predict FI and mortality, respectively. In particular, ProtFI outperformed established aging biomarkers in relation to diverse outcomes, including incident cardiovascular disease, handgrip strength, and self-rated health, both in internal validation and two Dutch external cohorts (n = 995, n = 500). Our findings show that an efficient frailty-trained proteomic biomarker robustly predicts age-related decline. ...

Effects of a Combined Dietary and Physical Activity Intervention on Bone Density, Lean Mass and Fat Mass in Adults: The GOTO Trial

Journal article (2026) - F.A. Bogaards, I. Groenendijk, T. Gehrmann, M. Beekman, N. Lakenberg, H. Eka D. Suchiman, L.P.G.M. de Groot, M.J.T. Reinders, P. E. Slagboom

Background: Nutritional weight-loss interventions are known to reduce bone mineral density (BMD), which can be prevented by adding (resistance) exercise training. However, this combined effect is not well studied in non-obese adults. In addition, the association between biomarkers and metabolite-based composite health markers with changes in BMD in such an intervention has not been studied as thoroughly. Objective: The aims of the current study were to investigate the effect of a combined nutritional and activity lifestyle intervention on lumbar spine and total body BMD in healthy middle-aged to older adults, and to relate these effects to a selection of immune-metabolic biomarkers, muscle mass and fat mass measurements, and two composite metabolite-based health scores. Methods: In this ancillary study of the single-arm Growing Old TOgether (GOTO) trial (trial registration number GOTNL3301 [https://onderzoekmetmensen.nl/nl/trial/27183], NL-OMON27183), 134 participants (mean age 62.9 years, 49% female) undertook a 13-week lifestyle modification, incorporating 12.5% caloric restriction and 12.5% increase in physical activity. The impact on lumbar spine and total body BMD was evaluated using dual-energy X-ray absorptiometry (DEXA). The intervention effect on BMD was related to changes in immune-metabolic biomarkers and two metabolite-based immune-metabolic health scores. Results: The trial significantly reduced bodyweight with 3.3 and 3.4 kg, consisting of 1.4 and 1.1 kg lean mass, in males (fdr < 0.001) and females (fdr < 0.001), respectively. Lean mass reduced by 1.4 kg in males (fdr < 0.001) and 1.1 kg in females (fdr < 0.001), whereas total body fat% reduced significantly with −1.5% (fdr < 0.001) in males and −1.5% (fdr < 0.001) in females. In males, lumbar spine BMD increased with 3.0% (fdr < 0.001) and total body BMD with 0.7% (fdr = 0.002). In females, the lumbar spine BMD had a trend in the upwards direction (1.2%, fdr = 0.09) and the total body BMD remained stable (0.4%, fdr = 0.07). In males, the increase in lumbar spine BMD was significantly associated with decreased weight (fdr = 0.001) and with decreased body and trunk fat% (fdr = 0.001, fdr = 0.001) and improved immune-metabolic health (fdr = 0.02). Males with higher BMD but a poor metabolite-based health score at baseline had a stronger increase in lumbar spine BMD (fdr = 0.03). Conclusions: A combined nutritional and activity lifestyle intervention significantly improved BMD of males with good bone health at baseline while at the same time improving metabolic health. Nutritional weight-loss interventions may not harm BMD when combined with exercise. ...

Background: Nutritional weight-loss interventions are known to reduce bone mineral density (BMD), which can be prevented by adding (resistance) exercise training. However, this combined effect is not well studied in non-obese adults. In addition, the association between biomarkers and metabolite-based composite health markers with changes in BMD in such an intervention has not been studied as thoroughly. Objective: The aims of the current study were to investigate the effect of a combined nutritional and activity lifestyle intervention on lumbar spine and total body BMD in healthy middle-aged to older adults, and to relate these effects to a selection of immune-metabolic biomarkers, muscle mass and fat mass measurements, and two composite metabolite-based health scores. Methods: In this ancillary study of the single-arm Growing Old TOgether (GOTO) trial (trial registration number GOTNL3301 [https://onderzoekmetmensen.nl/nl/trial/27183], NL-OMON27183), 134 participants (mean age 62.9 years, 49% female) undertook a 13-week lifestyle modification, incorporating 12.5% caloric restriction and 12.5% increase in physical activity. The impact on lumbar spine and total body BMD was evaluated using dual-energy X-ray absorptiometry (DEXA). The intervention effect on BMD was related to changes in immune-metabolic biomarkers and two metabolite-based immune-metabolic health scores. Results: The trial significantly reduced bodyweight with 3.3 and 3.4 kg, consisting of 1.4 and 1.1 kg lean mass, in males (fdr < 0.001) and females (fdr < 0.001), respectively. Lean mass reduced by 1.4 kg in males (fdr < 0.001) and 1.1 kg in females (fdr < 0.001), whereas total body fat% reduced significantly with −1.5% (fdr < 0.001) in males and −1.5% (fdr < 0.001) in females. In males, lumbar spine BMD increased with 3.0% (fdr < 0.001) and total body BMD with 0.7% (fdr = 0.002). In females, the lumbar spine BMD had a trend in the upwards direction (1.2%, fdr = 0.09) and the total body BMD remained stable (0.4%, fdr = 0.07). In males, the increase in lumbar spine BMD was significantly associated with decreased weight (fdr = 0.001) and with decreased body and trunk fat% (fdr = 0.001, fdr = 0.001) and improved immune-metabolic health (fdr = 0.02). Males with higher BMD but a poor metabolite-based health score at baseline had a stronger increase in lumbar spine BMD (fdr = 0.03). Conclusions: A combined nutritional and activity lifestyle intervention significantly improved BMD of males with good bone health at baseline while at the same time improving metabolic health. Nutritional weight-loss interventions may not harm BMD when combined with exercise.

Precision-guided immunomodulatory therapy in sepsis

Journal article (2026) - Jim M. Smit, Marcel J.T. Reinders, Philip A. van der Zee, Jesse H. Krijthe

Brain Connectivity and Machine Learning Approaches to assess the underlying neurobiology and prediction accuracy of anorexia nervosa

A replication study

Journal article (2026) - Laura Monteiro Rente Dias, Hugo Schnack, Daniel Geisler, Marcel Reinders, Tonya White, Gwen Dieleman, Xucong Zhang

Resting-state fMRI has been used to study aberrant functional connectivity properties in patients with anorexia nervosa (AN) at several stages of the illness. One popular way to extract these metrics is to use graph theory to showcase aberrant brain connectivity between patients with AN versus controls. However, most studies use classic analyses to investigate these differences, which could limit the number and choices of features used in one model. Instead, machine learning models have proven to be a promising tool in studying the functional connectivity of various disorders. In this study, we employ a combination of local graph metrics and a support vector machine to distinguish between first-onset AN (N = 56) cases and controls (N = 64). We replicate and extend prior work evaluating the predictive value of an existing machine learning approaches in detecting functional connectivity differences in patients with AN. Our method achieves an average classification accuracy of 65% with cross-validation evaluation. We further demonstrate that the results are driven mainly by the participation index of the nodes that are implicated in distinguishing the two groups. Our findings contribute to the growing body of evidence supporting the predictive value of resting-state fMRI in the study of anorexia nervosa. ...

Decoding exon inclusion in the human brain reveals more divergent splicing mechanisms in neurons than glia

Journal article (2026) - Lieke Michielsen, Justine Hsu, Anoushka Joglekar, Natan Belchikov, Marcel J.T. Reinders, Hagen U. Tilgner, Ahmed Mahfouz

BACKGROUND: Alternative splicing contributes to molecular diversity across brain cell types. RNA-binding proteins (RBPs) regulate splicing, but the genome-wide mechanisms underlying cell-type-specific splicing remain poorly understood. RESULTS: Here, we want to unravel cell-type-specific splicing mechanisms by using RBP binding sites and/or the genomic sequence to predict exon inclusion in neurons and glia as measured by long-read single-cell data in the human hippocampus and frontal cortex. We found that exon inclusion of variable exons is harder to predict in neurons compared to glia in both brain regions. Comparing neurons and glia, the position of RBP binding sites in alternatively spliced exons in neurons differ more from non-variable exons indicating distinct splicing mechanisms. Model interpretation pinpointed RBPs, including QKI, potentially regulating alternative splicing between neurons and glia. Finally, we accurately predict and prioritize the effect of splicing QTLs. CONCLUSIONS: Our results indicate that the splicing mechanisms in variable exons in neurons diverged more from the standard mechanisms. Splicing in neurons might be less sequence-dependent and influenced more by, for instance, chromatin accessibility or methylation. Taken together, these results highlight new insights into the mechanisms regulating cell-type-specific alternative splicing in the brain. ...

Attrition and representativeness in development and validation of online symptom checkers—a case study on the Rheumatic? Questionnaire

Journal article (2026) - F. Dijkstra Zegers, L. Qin, D. Selani, G. Gomon, T. Maarseveen, K. Glas, M. Reinders, Erik van den Akker, Rachel Knevel, More Authors

BackgroundOnline symptom checkers are often developed and validated on data subject to self-selection and selective attrition, potentially introducing biases in prediction models.ObjectivesTo assess recruitment, selection, and attrition patterns in a large Dutch online symptom checker for musculoskeletal complaints and to evaluate potential biases by comparing participant characteristics across recruitment sources and with external target populations.MethodsUsing data from the online Dutch Rheumatic? Questionnaire on musculoskeletal complaints, we compared baseline characteristics and key self-reported symptoms between responders to the follow-up survey and nonresponders. The survey responders were furthermore compared according to source of recruitment to the questionnaire, i.e., via primary care clinics, secondary care clinics, or via different online sources. Sex, age and BMI distributions from the total study group were compared to external data of potential target populations of primary and secondary care patients within the Netherlands.ResultsThe total study group of answers to the questionnaire comprised 31,457 responders, of which 50% (n = 15,591) responded to the follow-up survey. Study participants were predominantly female (76%), middle-aged (one-third 50–60 years), never-smokers (66%), and overweight. While participants recruited through healthcare settings resembled target populations, follow-up survey responders were older, had more rheumatic diagnoses (49% vs. 32%), and reported more symptoms than non-responders. Participant characteristics varied by recruitment source, with social media attracting younger females while healthcare routes reached more diverse populations with varying symptom presentations.ConclusionPatterns of recruitment and attrition produced differences in participant characteristics. Healthcare-based recruitment yielded participants resembling intended target populations, and follow-up survey responders differed on some points from nonresponders. Awareness of these selection processes is essential when using real-world symptom checker data for model development. ...

BackgroundOnline symptom checkers are often developed and validated on data subject to self-selection and selective attrition, potentially introducing biases in prediction models.ObjectivesTo assess recruitment, selection, and attrition patterns in a large Dutch online symptom checker for musculoskeletal complaints and to evaluate potential biases by comparing participant characteristics across recruitment sources and with external target populations.MethodsUsing data from the online Dutch Rheumatic? Questionnaire on musculoskeletal complaints, we compared baseline characteristics and key self-reported symptoms between responders to the follow-up survey and nonresponders. The survey responders were furthermore compared according to source of recruitment to the questionnaire, i.e., via primary care clinics, secondary care clinics, or via different online sources. Sex, age and BMI distributions from the total study group were compared to external data of potential target populations of primary and secondary care patients within the Netherlands.ResultsThe total study group of answers to the questionnaire comprised 31,457 responders, of which 50% (n = 15,591) responded to the follow-up survey. Study participants were predominantly female (76%), middle-aged (one-third 50–60 years), never-smokers (66%), and overweight. While participants recruited through healthcare settings resembled target populations, follow-up survey responders were older, had more rheumatic diagnoses (49% vs. 32%), and reported more symptoms than non-responders. Participant characteristics varied by recruitment source, with social media attracting younger females while healthcare routes reached more diverse populations with varying symptom presentations.ConclusionPatterns of recruitment and attrition produced differences in participant characteristics. Healthcare-based recruitment yielded participants resembling intended target populations, and follow-up survey responders differed on some points from nonresponders. Awareness of these selection processes is essential when using real-world symptom checker data for model development.

All-atom protein sequence design using discrete diffusion models

Journal article (2026) - Amelia Villegas-Morcillo, Gijs J. Admiraal, Marcel J.T. Reinders, Jana M. Weber

Advancing protein design is crucial for breakthroughs in medicine and biotechnology. Traditional approaches for protein sequence representation often rely solely on the 20 canonical amino acids, limiting the representation of non-canonical amino acids and residues that undergo post-translational modifications. This work explores discrete diffusion models for generating novel protein sequences using the all-atom chemical representation SELFIES. By encoding the atomic composition of each amino acid in the protein, this approach expands the design possibilities beyond standard sequence representations. Using a modified ByteNet architecture within the discrete diffusion D3PM framework, we evaluate the impact of this all-atom representation on protein quality, diversity, and novelty, compared to conventional amino acid-based models. To this end, we develop a comprehensive assessment pipeline to determine whether generated SELFIES sequences translate into valid proteins containing both canonical and non-canonical amino acids. Additionally, we examine the influence of two noise schedules within the diffusion process—uniform (random replacement of tokens) and absorbing (progressive masking)—on generation performance. While models trained on the all-atom representation struggle to consistently generate fully valid proteins, the successfully generated proteins show improved novelty and diversity compared to their amino acid-based model counterparts. Furthermore, the all-atom representation achieves structural foldability results comparable to those of amino acid-based models. Lastly, our results highlight the absorbing noise schedule as the most effective for both representations. Data and code are available at https://github.com/Intelligent-molecular-systems/All-Atom-Protein-Sequence-Generation. ...

Comparative validation of handheld fractional exhaled nitric oxide measurements

Journal article (2025) - Sanne van Deelen, Gerdien A. Tramper-Stranders, Rudi W. Hendriks, Marcel J.T. Reinders, Gert Jan Braunstahl

Background: Fractional exhaled nitric oxide (FeNO) is a noninvasive method to determine the degree of airway inflammation. Handheld devices such as the Vivatmo Me are used for home monitoring. Differences were found between the Vivatmo Me and standard measurements with the NIOX VERO. Therefore, we aimed to determine the accuracy of the Vivatmo Me for FeNO measurements. Methods: Adult patients with an appointment for FeNO-measurement according to regular care, were invited to perform the FeNO measurement with both devices. From these measurements the FeNO values were compared, and the device user-friendliness was determined. Results: One hundred and sixty-four patients were included. The number of attempts needed for a successful measurement and the failure rate were higher with the Vivatmo Me. Although the measurements were highly correlated, a significant difference (p < 0.001) was found between FeNO values measured with both devices. From the Vivatmo measurements, 32% did not fall within the claimed accuracy ranges. A linear correction on the FeNO values reduced this number. Conclusion: Our findings indicate that the Vivatmo Me does not comply with the claimed accuracy of clinical FeNO measurements and the measurement is challenging to perform. By applying the proposed correction, the comparative validity of the FeNO measurement improves and therefore its clinical usefulness. ...

Location and amount of joint involvement differentiates rheumatoid arthritis into different clinical subsets

Journal article (2025) - Tjardo D. Maarseveen, Marc P. Maurits, Lavinia Agra Coletto, Simone Perniola, Stefan Böhringer, Nils Steinz, Marcel J.T. Reinders, Erik B. van den Akker, Rachel Knevel, More authors...

Rheumatoid arthritis (RA) is a heterogeneous disease with variable symptoms, prognosis, and treatment response, necessitating refined patient classification. We applied multimodal deep learning and clustering to identify distinct RA phenotypes using baseline clinical data from 1,387 patients in the Leiden Rheumatology clinic. Four Joint Involvement Patterns (JIP) emerged: foot-predominant arthritis, seropositive oligoarticular disease, seronegative hand arthritis, and polyarthritis. Findings were validated in clinical trial data (n = 307) and an independent secondary care cohort (n = 515). Clusters showed high stability and significant differences in remission rates (P = 0.007) and methotrexate failure (P < 0.001). JIP-hand patients had superior outcomes (particularly in ACPA-positive patients) versus JIP-foot (HR:0.37, P < 0.001) and JIP-poly (HR:0.33, P = 0.005), independent of baseline disease activity and clinical markers. Synovial histology analysis (n = 194) revealed distinct inflammatory patterns across clusters, hinting at different underlying biological mechanisms. These validated RA phenotypes based on joint involvement patterns may enable targeted research into disease mechanisms and personalized treatment strategies. ...

Two-Step Transfer Learning Improves Deep Learning–Based Drug Response Prediction in Small Datasets

A Case Study of Glioblastoma

Journal article (2025) - Jie Ju, Ioannis Ntafoulis, Michelle Klein, Marcel J.T. Reinders, Martine Lamfers, Andrew P. Stubbs, Yunlei Li

While deep learning (DL) is used in patients’ outcome predictions, the insufficiency of patient samples limits the accuracy. In this study, we investigated how transfer learning (TL) alleviates the small sample size problem. A 2-step TL framework was constructed for a difficult task: predicting the response of the drug temozolomide (TMZ) in glioblastoma (GBM) cell cultures. The GBM is aggressive, and most patients do not benefit from the only approved chemotherapeutic agent TMZ. O6-methylguanine-DNA methyltransferase (MGMT) promoter methylation status is the only biomarker for TMZ responsiveness but has shown limited predictive power. The 2-step TL framework was built on 3 datasets: (1) the subset of the Genomics of Drug Sensitivity in Cancer (GDSC) dataset, including miscellaneous cell cultures treated by TMZ, cyclophosphamide, bortezomib, and oxaliplatin, as the source dataset; (2) the Human Glioblastoma Cell Culture (HGCC) dataset, for fine-tuning; and (3) a small target dataset GSE232173, for validation. The latter two included specifically TMZ-treated GBM cell cultures. The DL models were pretrained on the cell cultures treated by each of the 4 drugs from GDSC, respectively. Then, the DL models were refined on HGCC, where the best source drug was identified. Finally, the DL model was validated on GSE232173. Using 2-step TL with pretraining on oxaliplatin was not only superior to those without TL and with 1-step TL but also better than 3 benchmark methods, including MGMT. The oxaliplatin-based TL improved the performance probably by increasing the weights of cell cycle-related genes, which relates to the TMZ response processes. Our findings support the potential of oxaliplatin being an alternative therapy for patients with GBM and TL facilitating drug repurposing research. We recommend that following our methodology, using mixed cancers and a related drug as the source and then fine-tuning the model with the target cancer and the target drug will enhance drug response prediction. ...

While deep learning (DL) is used in patients’ outcome predictions, the insufficiency of patient samples limits the accuracy. In this study, we investigated how transfer learning (TL) alleviates the small sample size problem. A 2-step TL framework was constructed for a difficult task: predicting the response of the drug temozolomide (TMZ) in glioblastoma (GBM) cell cultures. The GBM is aggressive, and most patients do not benefit from the only approved chemotherapeutic agent TMZ. O6-methylguanine-DNA methyltransferase (MGMT) promoter methylation status is the only biomarker for TMZ responsiveness but has shown limited predictive power. The 2-step TL framework was built on 3 datasets: (1) the subset of the Genomics of Drug Sensitivity in Cancer (GDSC) dataset, including miscellaneous cell cultures treated by TMZ, cyclophosphamide, bortezomib, and oxaliplatin, as the source dataset; (2) the Human Glioblastoma Cell Culture (HGCC) dataset, for fine-tuning; and (3) a small target dataset GSE232173, for validation. The latter two included specifically TMZ-treated GBM cell cultures. The DL models were pretrained on the cell cultures treated by each of the 4 drugs from GDSC, respectively. Then, the DL models were refined on HGCC, where the best source drug was identified. Finally, the DL model was validated on GSE232173. Using 2-step TL with pretraining on oxaliplatin was not only superior to those without TL and with 1-step TL but also better than 3 benchmark methods, including MGMT. The oxaliplatin-based TL improved the performance probably by increasing the weights of cell cycle-related genes, which relates to the TMZ response processes. Our findings support the potential of oxaliplatin being an alternative therapy for patients with GBM and TL facilitating drug repurposing research. We recommend that following our methodology, using mixed cancers and a related drug as the source and then fine-tuning the model with the target cancer and the target drug will enhance drug response prediction.

Tackling inter-subject variability in smartwatch data using factorization models

Journal article (2025) - Arman Naseri, David M.J. Tax, Ivo van der Bilt, Marcel Reinders

Smartwatches enable longitudinal and continuous data acquisition. This has the potential to remotely monitor (changes) of the health of users. However, differences among subjects (inter-subject variability) limit a model to generalize to unseen subjects. This study focused on binary classification tasks using heart rate and step counter from smartwatches, including night/day and inactive/active classification, as well as sleep and SpO2-related (oxygen saturation) tasks. To address inter-subject variability, we explored different transforming and normalization regimes for time series including per-subject and population-based strategies. We propose a modified factorized autoencoder, which separates the data into two latent spaces capturing class-specific and subject-specific information. Our proposed generalized factorized autoencoder and triplet factorized autoencoder improved classification accuracy over the baseline from 74.8 (± 10.5) to 83.1 (± 5.1) and 83.4 (± 5.3), respectively, for night/day classification, gains for inactive/active classification were modest, improving from 84.3 (± 9.4) to 86.9 (± 4.4) and 86.6 (± 4.3), respectively. Our study highlights challenges of handling inter-subject variability in smartwatch data and how factorization models can be used to enable more robust and personalized health monitoring solutions for diverse populations. ...

BADDADAN

Mechanistic modelling of time-series gene module expression

Journal article (2025) - Ben Noordijk, Marcel Reinders, Aalt D.J. van Dijk, Dick de Ridder

Plants respond to stresses like drought and heat through complex gene regulatory networks (GRNs). To improve resilience, understanding these is crucial, but large-scale GRNs (>100 genes) are difficult to model using ordinary differential equations (ODEs) due to the high number of parameters that have to be estimated. Here we solve this problem by introducing BADDADAN, which uses machine learning to identify gene modules—groups of co-expressed and/or co-regulated genes—and constructs an ODE model that predicts gene module dynamics under stress. By integrating time-series gene expression data with prior co-expression data it finds modules that are both coherent and interpretable. We demonstrate BADDADAN on heat and drought datasets of A. thaliana, modelling over 1,000 genes, recovering known mechanistic insights, and proposing new hypotheses. By combining machine learning with mechanistic modelling, BADDADAN deepens our understanding of stress-related GRNs in plants and potentially other organisms. ...

Fine-mapping of the TMEM106B locus reveals four haplotypes that are differentially associated with risk for neurodegeneration

Journal article (2025) - Henne Holstege, Alex N. Salazar, Lydian Knoop, Yolande A.L. Pijnenburg, Sven J. van der Lee, Sanduni Wijesekera, Jana Krizova, Mikko Hiltunen, Marcel JT Reinders, More Authors...

Background
Genome-wide association studies (GWAS) linked TMEM106B variants to susceptibility for neurodegenerative diseases, but the causal genetic elements remain unclear.

Method
We used genotyping data from 5,792 Alzheimer disease cases and controls, and applied COJO to identify haplotypes in the TMEM106B locus that independently associated with AD. Then, we used long-read sequencing data from 513 individuals to annotate these haplotypes with structural variations that map into them.

Results
Analysis of the genotyping data revealed that the TMEM106B locus consists of four major haplotypes: HA/Ha (covering the coding region), and HB/Hb (covering the upstream regulatory region). These combine into four combinations with varying population-frequencies: HAB (57%), HaB (34%), Hab (9%), and HAb (<1%). Long-read sequencing of 513 individuals showed that HA haplotypes (marked by 185-Threonine) carry unique methylated CpG sites and an AluYb8-retrotransposon in the 3' UTR, while the Ha haplotypes are marked by the 185-Serine allele. Hb haplotypes carry several structural variants (SVs) in nearby distal enhancers, including a 19 Kbp rearrangement, absent in all other haplotypes. Joint association models revealed that the HAB combination (AluYb8+185-Threonine) is risk-increasing, while Hab (SVs+185-Serine) confers the protective effect. HaB (185-Serine only) is neutral, while HAb was too rare to assess. Relative to middle-aged non-demented controls, cognitively healthy centenarians were more enriched with Hab (OR=1.49, padj=2.18×10-2) than with HaB (OR=1.23, padj=5.06×10-2). Proteomic analysis of temporal cortex tissues (n = 182) indicated that relative to the neutral HaB combination, the protective Hab is associated with 1.1-fold lower TMEM106B C-terminal peptide abundance, while the risk-increasing HAB is associated with 1.16-fold higher abundance.

Conclusion
Our data indicates that the genetic structure underlying the association of the TMEM106B locus with neurodegenerative diseases is driven by the effect of multiple haplotypes. ...

Background
Genome-wide association studies (GWAS) linked TMEM106B variants to susceptibility for neurodegenerative diseases, but the causal genetic elements remain unclear.

Method
We used genotyping data from 5,792 Alzheimer disease cases and controls, and applied COJO to identify haplotypes in the TMEM106B locus that independently associated with AD. Then, we used long-read sequencing data from 513 individuals to annotate these haplotypes with structural variations that map into them.

Results
Analysis of the genotyping data revealed that the TMEM106B locus consists of four major haplotypes: HA/Ha (covering the coding region), and HB/Hb (covering the upstream regulatory region). These combine into four combinations with varying population-frequencies: HAB (57%), HaB (34%), Hab (9%), and HAb (<1%). Long-read sequencing of 513 individuals showed that HA haplotypes (marked by 185-Threonine) carry unique methylated CpG sites and an AluYb8-retrotransposon in the 3' UTR, while the Ha haplotypes are marked by the 185-Serine allele. Hb haplotypes carry several structural variants (SVs) in nearby distal enhancers, including a 19 Kbp rearrangement, absent in all other haplotypes. Joint association models revealed that the HAB combination (AluYb8+185-Threonine) is risk-increasing, while Hab (SVs+185-Serine) confers the protective effect. HaB (185-Serine only) is neutral, while HAb was too rare to assess. Relative to middle-aged non-demented controls, cognitively healthy centenarians were more enriched with Hab (OR=1.49, padj=2.18×10-2) than with HaB (OR=1.23, padj=5.06×10-2). Proteomic analysis of temporal cortex tissues (n = 182) indicated that relative to the neutral HaB combination, the protective Hab is associated with 1.1-fold lower TMEM106B C-terminal peptide abundance, while the risk-increasing HAB is associated with 1.16-fold higher abundance.

Conclusion
Our data indicates that the genetic structure underlying the association of the TMEM106B locus with neurodegenerative diseases is driven by the effect of multiple haplotypes.

Switching from controlled to assisted mechanical ventilation

A multi-center retrospective study (SWITCH)

Journal article (2025) - Jim M. Smit, Jasper Van Bommel, Diederik A.M.P.J. Gommers, Marcel J.T. Reinders, Michel E. Van Genderen, Jesse H. Krijthe, Annemijn H. Jonkman

Background
Switching from controlled to assisted ventilation is crucial in the trajectory of intensive care unit (ICU) stay, but no guidelines exist. We described current practices, analyzed patient characteristics associated with switch success or failure, and explored the feasibility to predict switch failure.

Methods
In this retrospective study, we obtained highly granular longitudinal ICU data sets from three medical centers, covering demographics, severity scores, vital signs, ventilation, and laboratory parameters. The primary endpoint was switch success, considering a switch attempt to be successful if a patient did not return to controlled ventilation for the next 72 h while alive, and to be failed otherwise. We compared the characteristics of patients with successful vs. failed first switch attempts at ICU admission, immediately before, and 3 h after the attempt. We trained LASSO logistic regression models to predict switch failure.

Results
In 4524/6715 (67%) patients attempting a switch, the first attempt failed. The first switch attempt, regardless of success or failure, was generally made at normalized PaCO2 and pH levels, with PEEP < 10 cmH2O and PaO2/FiO2 indicating mild injury. Despite very similar baseline disease severity, switch failure was associated with significantly worse outcomes, including a 28-day mortality of 27% vs. 16% and median ventilator-free days of 16 vs. 22 (p < 0.001). Failed attempts were initiated significantly earlier than successful ones (median 1.8 vs. 1.3 days, p < 0.001). Before the switch, PaO2/FiO2, if measured at PEEP > 10 cmH2O, and respiratory system compliance was lower in patients with switch failure (median 185 vs. 205 mmHg, p < 0.001; 39 vs. 41 mL/cmH2O, P = 0.001), and post-switch, patients with switch failure experienced greater deterioration in gas exchange and minimal improvement in ventilatory parameters post-switch. Contrary to our hypotheses, patient characteristics for failed vs. successful switches were surprisingly similar, resulting in prediction models with limited discriminative performance.

Conclusions
Approximately two-thirds of attempts to switch patients to assisted ventilation fail, which are associated with significantly worse clinical outcomes, despite similar baseline disease severity. Contrary to our hypotheses, patients with successful and failed attempts showed similar characteristics, making switch failure difficult to predict. These findings underscore the importance of preventing switch failures and, given the retrospective nature of this study, highlight the need for prospective studies to better understand the reasons for switch failure and when spontaneous breathing can be safely initiated. ...

Background
Switching from controlled to assisted ventilation is crucial in the trajectory of intensive care unit (ICU) stay, but no guidelines exist. We described current practices, analyzed patient characteristics associated with switch success or failure, and explored the feasibility to predict switch failure.

Methods
In this retrospective study, we obtained highly granular longitudinal ICU data sets from three medical centers, covering demographics, severity scores, vital signs, ventilation, and laboratory parameters. The primary endpoint was switch success, considering a switch attempt to be successful if a patient did not return to controlled ventilation for the next 72 h while alive, and to be failed otherwise. We compared the characteristics of patients with successful vs. failed first switch attempts at ICU admission, immediately before, and 3 h after the attempt. We trained LASSO logistic regression models to predict switch failure.

Results
In 4524/6715 (67%) patients attempting a switch, the first attempt failed. The first switch attempt, regardless of success or failure, was generally made at normalized PaCO2 and pH levels, with PEEP < 10 cmH2O and PaO2/FiO2 indicating mild injury. Despite very similar baseline disease severity, switch failure was associated with significantly worse outcomes, including a 28-day mortality of 27% vs. 16% and median ventilator-free days of 16 vs. 22 (p < 0.001). Failed attempts were initiated significantly earlier than successful ones (median 1.8 vs. 1.3 days, p < 0.001). Before the switch, PaO2/FiO2, if measured at PEEP > 10 cmH2O, and respiratory system compliance was lower in patients with switch failure (median 185 vs. 205 mmHg, p < 0.001; 39 vs. 41 mL/cmH2O, P = 0.001), and post-switch, patients with switch failure experienced greater deterioration in gas exchange and minimal improvement in ventilatory parameters post-switch. Contrary to our hypotheses, patient characteristics for failed vs. successful switches were surprisingly similar, resulting in prediction models with limited discriminative performance.

Conclusions
Approximately two-thirds of attempts to switch patients to assisted ventilation fail, which are associated with significantly worse clinical outcomes, despite similar baseline disease severity. Contrary to our hypotheses, patients with successful and failed attempts showed similar characteristics, making switch failure difficult to predict. These findings underscore the importance of preventing switch failures and, given the retrospective nature of this study, highlight the need for prospective studies to better understand the reasons for switch failure and when spontaneous breathing can be safely initiated.

Analyzing PaO₂/FiO₂?

Mind the interaction with PEEP!

Journal article (2025) - J. M. Smit, J. H. Krijthe, J. Van Bommel, M. E. Van Genderen, M. J.T. Reinders, A. H. Jonkman

Transferability of European-derived Alzheimer’s disease polygenic risk scores across multiancestry populations

Journal article (2025) - Aude Nicolas, Richard Sherva, Benjamin Grenier-Boley, Yoontae Kim, Masataka Kikuchi, Jigyasha Timsina, Itziar de Rojas, Marcel J.T. Reinders, Jean-Charles Lambert, More authors...

A polygenic score (PGS) for Alzheimer’s disease (AD) was derived recently from data on genome-wide significant loci in European ancestry populations. We applied this PGS to populations in 17 European countries and observed a consistent association with the AD risk, age at onset and cerebrospinal fluid levels of AD biomarkers, independently of apolipoprotein E locus (APOE). This PGS was also associated with the AD risk in many other populations of diverse ancestries. A cross-ancestry polygenic risk score improved the association with the AD risk in most of the multiancestry populations tested when the APOE region was included. Finally, we found that the PGS/polygenic risk score captured AD-specific information because the association weakened as the diagnosis was broadened. In conclusion, a simple PGS captures the AD-specific genetic information that is common to populations of different ancestries, although studies of more diverse populations are still needed to better characterize the genetics of AD. ...

Exploring nanopore direct sequencing performance of forensic STRs, SNPs, InDels, and DNA methylation markers in a single assay

Journal article (2025) - Desiree D.S.H. de Bruin, Martin A. Haagmans, Kristiaan J. van der Gaag, Jerry Hoogenboom, Natalie E.C. Weiler, Niccoló Tesi, Henne Holstege, Marcel Reinders, Peter Henneman, More authors...

Introduction
The field of forensic DNA analysis has undergone rapid advancements in recent decades. The integration of massively parallel sequencing (MPS) has notably expanded the forensic toolkit, moving beyond identity matching to predicting phenotypic traits and biogeographical ancestry. This shift is of particular significance in cases where conventional DNA profiling fails to identify a single suspect. Supplementing forensic analyses with estimated biological age may be valuable but involves a complex and time-consuming DNA methylation analysis. This study explores and validates the performance of a comprehensive forensic third-generation sequencing assay utilizing Oxford Nanopore Technologies (ONT) in an adaptive and direct sequencing approach. We incorporated the most widely used forensic markers, i.e., STRs, SNPs, InDels, mitochondrial DNA (mtDNA), and two methylation-based clock classifiers, thereby combining forensic genetic and epigenetic analysis in one single workflow.

Methods and results
In our investigation, DNA from six anonymous individuals was sequenced using the ONT standard adaptive direct sequencing approach, reaching a mean percentage of on-target reads ranging from 6.6 % to 7.7 % per sample. ONT data was compared to standard MPS data and Illumina EPIC DNA methylation profiles. Basecalling employed recommended ONT software packages. TREAT was used for ONT-based analysis of autosomal and Y-chromosome STRs, achieving 90–92 % correct calls depending on allelic read depth thresholds. InDel analyses for two lower-quality samples proved challenging due to inadequate read depth, while the remaining four samples significantly contributed to the observed percentage markers (60.9 %) and correct calls (97.8 %). SNP analysis achieved a 98 % call rate, with only two mismatches and two missed alleles. ONT-generated DNA methylation data demonstrated Pearson’s correlation coefficients with EPIC data ranging from 0.67 to 0.97 for Horvath’s clock. Additional age-associated markers exhibited Pearson’s correlation coefficients with chronological age between 0.14 (ELOVL2) and 0.96 (FHL2) at read depths of <30 and <20, respectively. Despite excluding mtDNA from our targeted sequencing approach, adaptive proof-reading fragments covered the complete mtDNA with an average read depth of 21–72, showing 100 % concordance with reference data.

Discussion
Our exploratory study using ONT adaptive sequencing for conventional forensic and age associated DNA methylation markers showed high sequencing accuracy for a significant number of markers, showcasing ONT as a promising (epi)genetic forensic method. Future studies must address three critical aspects: determining clear quantity and quality measures and detection thresholds for accuracy, optimizing input DNA quantity for forensic casework expectations, and addressing ethical considerations associated with phenotype and ancestry analysis to prevent ethnic biases. ...

Introduction
The field of forensic DNA analysis has undergone rapid advancements in recent decades. The integration of massively parallel sequencing (MPS) has notably expanded the forensic toolkit, moving beyond identity matching to predicting phenotypic traits and biogeographical ancestry. This shift is of particular significance in cases where conventional DNA profiling fails to identify a single suspect. Supplementing forensic analyses with estimated biological age may be valuable but involves a complex and time-consuming DNA methylation analysis. This study explores and validates the performance of a comprehensive forensic third-generation sequencing assay utilizing Oxford Nanopore Technologies (ONT) in an adaptive and direct sequencing approach. We incorporated the most widely used forensic markers, i.e., STRs, SNPs, InDels, mitochondrial DNA (mtDNA), and two methylation-based clock classifiers, thereby combining forensic genetic and epigenetic analysis in one single workflow.

Methods and results
In our investigation, DNA from six anonymous individuals was sequenced using the ONT standard adaptive direct sequencing approach, reaching a mean percentage of on-target reads ranging from 6.6 % to 7.7 % per sample. ONT data was compared to standard MPS data and Illumina EPIC DNA methylation profiles. Basecalling employed recommended ONT software packages. TREAT was used for ONT-based analysis of autosomal and Y-chromosome STRs, achieving 90–92 % correct calls depending on allelic read depth thresholds. InDel analyses for two lower-quality samples proved challenging due to inadequate read depth, while the remaining four samples significantly contributed to the observed percentage markers (60.9 %) and correct calls (97.8 %). SNP analysis achieved a 98 % call rate, with only two mismatches and two missed alleles. ONT-generated DNA methylation data demonstrated Pearson’s correlation coefficients with EPIC data ranging from 0.67 to 0.97 for Horvath’s clock. Additional age-associated markers exhibited Pearson’s correlation coefficients with chronological age between 0.14 (ELOVL2) and 0.96 (FHL2) at read depths of <30 and <20, respectively. Despite excluding mtDNA from our targeted sequencing approach, adaptive proof-reading fragments covered the complete mtDNA with an average read depth of 21–72, showing 100 % concordance with reference data.

Discussion
Our exploratory study using ONT adaptive sequencing for conventional forensic and age associated DNA methylation markers showed high sequencing accuracy for a significant number of markers, showcasing ONT as a promising (epi)genetic forensic method. Future studies must address three critical aspects: determining clear quantity and quality measures and detection thresholds for accuracy, optimizing input DNA quantity for forensic casework expectations, and addressing ethical considerations associated with phenotype and ancestry analysis to prevent ethnic biases.

PredLyP

A computational tool for predicting tissue-specific (phago-)lysosomal post-digestion peptides

Journal article (2025) - Mattijn Wagt, Cristina Teodosio, Anniek L. de Jager, Jacques J.M. van Dongen, Marcel J.T. Reinders, Paula Díez, Indu Khatri

Peptides are versatile tools in immunotherapy, serving as vaccines and targets for specific immunotherapeutic strategies. Peptides engage immune cells like macrophages and T cells, enabling precise modulation of immune responses. In this context, we highlight the utility of macrophages, innate immune cells involved in constant surveillance, for detecting their phagolysosomal content as a minimally-invasive biomarker strategy. Analyzing proteolytic patterns in phagolysosomes offers a high-sensitivity approach to assess tissue homeostasis and tissue disruption, such as in cancer. Despite their potential, a major challenge lies in the lack of comprehensive tools for predicting cutting sites across phagolysosomal proteases. Therefore, we developed the computational tool PredLyP (abbreviation for “prediction of lysosomal proteases”) to identify cutting sites of phagolysosomal proteases, which are essential enzymes involved in protein degradation within (phago)lysosomes, to predict the potential peptides generated from the input proteins. Unlike existing tools, PredLyP utilizes Position Specific Scoring Matrices derived from amino acid sequences, physical (charge and hydropathy) and structural (secondary structure and solvent accessibility) features. Moreover, it incorporates a sequential cutter functionality that mimics the ordered action of proteases, providing predictive insights into substrate fragment generation. Comparisons with other tools demonstrate the superior sensitivity of PredLyP, enabling accurate prediction of complete and partial digestion fragments, a critical requirement for real-world applications in proteomics, antibody development, and immune system research. Overall, PredLyP represents a robust tool for advancing our understanding of proteolytic processes in phagolysosomes and their implications in health and disease. ...

Polygenic pathways shape white matter vulnerability to Alzheimer’s disease-related pathophysiological changes

Journal article (2025) - Mario Tranfa, Leonard Pieperhoff, Giuseppe Pontillo, Emma S. Luckett, Lyduine E. Collij, Tiago Gil Oliveira, Niccoló Tesi, Natalia Vilor-Tejedor, M.J.T. Reinders, More authors...

Background: The accumulation of amyloid-β1−42 (Aβ1−42) peptides and phosphorylated-Tau181 (p-Tau181) tangles from the preclinical stages of Alzheimer’s disease (AD) has led to a biological definition of the disease. However, among Aβ1−42-positive individuals, cognitive decline onset varies, and some never develop symptoms. Genetic influences on molecular pathways and their interactions with proteinopathy may underlie this heterogeneity. Leveraging data from a large sample of cognitively intact older adults in the European Prevention of Alzheimer Dementia (EPAD) cohort, we examined how AD-related pathophysiological changes (i.e., Aβ1−42 and p-Tau181), polygenic pathways and their interaction are associated with WM micro- and macrostructural properties. Methods: We selected 803 individuals (mean age = 64.7 ± 7.3 years, 458 [57.0%] females, 275 [34.2%] APOE-ε4 carriers) with CSF-Aβ1−42 and p-Tau181 measurements available, full genotyping, and structural and diffusion MRI. Polygenic risk scores (PRSs) were computed using 85 AD-related genetic variants. These were mapped to their corresponding genes and, after excluding those belonging to the APOE locus, clustered by function into six pathway-specific PRSs (i.e., immune activation, signal transduction, inflammation, lipid, amyloid, and clearance pathways). Diffusion MRIs were processed through the fixel-based analysis framework to derive fiber density (FD) and fiber cross-section (FC) metrics, which were averaged within WM tracts. Linear models assessed the effects of AD-related pathophysiological changes, global and pathway-specific PRSs, and their interactions on FD and FC at both the tract and fixel levels. Models were corrected for multiple comparisons. Results: P-Tau181 was primarily associated with greater FD. The lipid pathway was associated with greater FD and FC, with these effects predominantly occurring in the left hemisphere, consistent with evidence of hemispheric dominance. The clearance pathway moderated the effect of Aβ1−42 on FD, with a positive slope in A + compared to A- individuals. The immune activation pathway moderated the effect of p-Tau181 on FD, with a negative slope in T + compared to T- individuals. Conclusions: Pathway-specific genetic vulnerability to AD is associated with alterations in WM tracts both directly and by moderating the effects of AD-related pathophysiological changes. AD-associated genetic risk should be integrated into the AD diagnostic framework to enable targeted screening and intervention for future preclinical trials aimed at specific biological pathways. ...

Background: The accumulation of amyloid-β1−42 (Aβ1−42) peptides and phosphorylated-Tau181 (p-Tau181) tangles from the preclinical stages of Alzheimer’s disease (AD) has led to a biological definition of the disease. However, among Aβ1−42-positive individuals, cognitive decline onset varies, and some never develop symptoms. Genetic influences on molecular pathways and their interactions with proteinopathy may underlie this heterogeneity. Leveraging data from a large sample of cognitively intact older adults in the European Prevention of Alzheimer Dementia (EPAD) cohort, we examined how AD-related pathophysiological changes (i.e., Aβ1−42 and p-Tau181), polygenic pathways and their interaction are associated with WM micro- and macrostructural properties. Methods: We selected 803 individuals (mean age = 64.7 ± 7.3 years, 458 [57.0%] females, 275 [34.2%] APOE-ε4 carriers) with CSF-Aβ1−42 and p-Tau181 measurements available, full genotyping, and structural and diffusion MRI. Polygenic risk scores (PRSs) were computed using 85 AD-related genetic variants. These were mapped to their corresponding genes and, after excluding those belonging to the APOE locus, clustered by function into six pathway-specific PRSs (i.e., immune activation, signal transduction, inflammation, lipid, amyloid, and clearance pathways). Diffusion MRIs were processed through the fixel-based analysis framework to derive fiber density (FD) and fiber cross-section (FC) metrics, which were averaged within WM tracts. Linear models assessed the effects of AD-related pathophysiological changes, global and pathway-specific PRSs, and their interactions on FD and FC at both the tract and fixel levels. Models were corrected for multiple comparisons. Results: P-Tau181 was primarily associated with greater FD. The lipid pathway was associated with greater FD and FC, with these effects predominantly occurring in the left hemisphere, consistent with evidence of hemispheric dominance. The clearance pathway moderated the effect of Aβ1−42 on FD, with a positive slope in A + compared to A- individuals. The immune activation pathway moderated the effect of p-Tau181 on FD, with a negative slope in T + compared to T- individuals. Conclusions: Pathway-specific genetic vulnerability to AD is associated with alterations in WM tracts both directly and by moderating the effects of AD-related pathophysiological changes. AD-associated genetic risk should be integrated into the AD diagnostic framework to enable targeted screening and intervention for future preclinical trials aimed at specific biological pathways.