Machine learning in Alzheimer’s disease genetics

  • EADB

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Traditional statistical approaches have advanced our understanding of the genetics of complex diseases, yet are limited to linear additive models. Here we applied machine learning (ML) to genome-wide data from 41,686 individuals in the largest European consortium on Alzheimer’s disease (AD) to investigate the effectiveness of various ML algorithms in replicating known findings, discovering novel loci, and predicting individuals at risk. We utilised Gradient Boosting Machines (GBMs), biological pathway-informed Neural Networks (NNs), and Model-based Multifactor Dimensionality Reduction (MB-MDR) models. ML approaches successfully captured all genome-wide significant genetic variants identified in the training set and 22% of associations from larger meta-analyses. They highlight 6 novel loci which replicate in an external dataset, including variants which map to ARHGAP25, LY6H, COG7, SOD1 and ZNF597. They further identify novel association in AP4E1, refining the genetic landscape of the known SPPL2A locus. Our results demonstrate that machine learning methods can achieve predictive performance comparable to classical approaches in genetic epidemiology and have the potential to uncover novel loci that remain undetected by traditional GWAS. These insights provide a complementary avenue for advancing the understanding of AD genetics.
Original languageEnglish
Article number6726
Pages (from-to)1-16
Number of pages16
JournalNature Communications
Volume16
Early online date22 Jul 2025
DOIs
Publication statusPublished - 2025

Funding

We thank the University of Lille’s Intensive Scientific Computing Mesocentre. M.B.S. and V.E.P. are supported by the UK Dementia Research Institute [UK DRI-3206] through UK DRI Ltd, principally funded by the Medical Research Council and by the UK Medical Research Council: MR/T04604X/1, MR/P005748/1. K.V.S. has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreements No 813533 (MLFPM) and No 860895 (TranSYS), and FNRS convention PDR T.0294.24 “Expanded PRS embracing pathways and interactions for increased clinical utility”. F.M. (Federico Melograna) was also funded by the Marie Sklodowska-Curie grant agreement 860895 (TranSYS). J.W. (Julie Williams) is supported by Moondance Dementia Research Laboratory. B.M.T. and G.R. (Gennady Roshchupkin) are funded by ZonMw VIDI (#09150171910068) and ZonMw Veni grant (1936320) respectively. N.A. is funded by GSK. RG and AD were supported by the Italian Ministry of Health (Ricerca Corrente). Outside the submitted work, T.G. received consulting fees from AbbVie, Alector, Anavex, Biogen, BMS; Cogthera, Eli Lilly, Functional Neuromodulation, Grifols, Iqvia, Janssen, Noselab, Novo Nordisk, NuiCare, Orphanzyme, Roche Diagnostics, Roche Pharma, UCB, and Vivoryon; lecture fees from Biogen, Eisai, Grifols, Medical Tribune, Novo Nordisk, Roche Pharma, Schwabe, and Synlab; and has received grants to his institution from Biogen, Eisai, and Roche Diagnostics. N.A. received funding from GSK. All other authors declare no competing interests.

FundersFunder number
FNRS
Horizon 2020 Framework Programme
Ministero della Salute
Université de Lille
Moondance Dementia Research Laboratory
GlaxoSmithKline
Medical Research CouncilMR/T04604X/1, MR/P005748/1
H2020 Marie Skłodowska-Curie Actions813533, 860895
ZonMw09150171910068, 1936320
UK Dementia Research InstituteDRI-3206

    Fingerprint

    Dive into the research topics of 'Machine learning in Alzheimer’s disease genetics'. Together they form a unique fingerprint.

    Cite this