Abstract
Cerebrospinal fluid (CSF) is an essential matrix for the discovery of neurological disease biomarkers. However, the high dynamic range of protein concentrations in CSF hinders the detection of the least abundant protein biomarkers by untargeted mass spectrometry. It is thus beneficial to gain a deeper understanding of the secretion processes within the brain. Here, we aim to explore if and how the secretion of brain proteins to the CSF can be predicted. By combining a curated CSF proteome and the brain elevated proteome of the Human Protein Atlas, brain proteins were classified as CSF or non-CSF secreted. A machine learning model was trained on a range of sequence-based features to differentiate between CSF and non-CSF groups and effectively predict the brain origin of proteins. The classification model achieves an area under the curve of 0.89 if using high confidence CSF proteins. The most important prediction features include the subcellular localization, signal peptides, and transmembrane regions. The classifier generalized well to the larger brain detected proteome and is able to correctly predict novel CSF proteins identified by affinity proteomics. In addition to elucidating the underlying mechanisms of protein secretion, the trained classification model can support biomarker candidate selection.
Original language | English |
---|---|
Pages (from-to) | 3068-3080 |
Number of pages | 13 |
Journal | Journal of Proteome Research |
Volume | 22 |
Issue number | 9 |
DOIs | |
Publication status | Published - 1 Sept 2023 |
Funding
The authors declare the following competing financial interest(s): C.E.T. has a collaboration contract with ADx Neurosciences, Quanterix and Eli Lilly, performed contract research or received grants from AC-Immune, Axon Neurosciences, Biogen, Brainstorm Therapeutics, Celgene, EIP Pharma, Eisai, PeopleBio Inc., Roche, Toyama, Vivoryon, and has a speaker contract with Roche. S.A. reports grants and nonfinancial support from Cergentis BV and a patent pending outside the submitted work. The MIRIADE project includes the following commercial beneficiaries and partners: ADx Neuroscience, ENPICOM, LGC Limited, PeopleBio Inc., Olink, Quanterix, and Roche. Acknowledgments K.W., C.E.T., and S.A. received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie grant agreement no. 860197, the MIRIADE project. R.d.W. was supported by a PPP Allowance (grant no. LSHM21018) made available by Health ∼ Holland, Top Sector Life Sciences & Health, to stimulate public–private partnerships. I.M.W.V. is supported by research grants of Alzheimer Nederland and Alzheimer’s Association. C.E.T. is supported by JPND (bPRIDE), the Dutch Research Council (ZonMw), Alzheimer Drug Discovery Foundation, the Self-ridges Group Foundation, Alzheimer Netherlands, and Alzheimer’s Association. C.E.T. is recipient of ABOARD, which is a public-private partnership receiving funding from ZonMW (no. 73305095007) and Health ∼ Holland, Top Sector Life Sciences & Health (grant no. LSHM20106). We would like to acknowledge BAZIS, the Supercomputing cluster of the Vrije Universiteit Amsterdam. K.W., C.E.T., and S.A. received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie grant agreement no. 860197, the MIRIADE project. R.d.W. was supported by a PPP Allowance (grant no. LSHM21018) made available by Health ∼ Holland, Top Sector Life Sciences & Health, to stimulate public-private partnerships. I.M.W.V. is supported by research grants of Alzheimer Nederland and Alzheimer’s Association. C.E.T. is supported by JPND (bPRIDE), the Dutch Research Council (ZonMw), Alzheimer Drug Discovery Foundation, the Self-ridges Group Foundation, Alzheimer Netherlands, and Alzheimer’s Association. C.E.T. is recipient of ABOARD, which is a public-private partnership receiving funding from ZonMW (no. 73305095007) and Health ∼ Holland, Top Sector Life Sciences & Health (grant no. LSHM20106). We would like to acknowledge BAZIS, the Supercomputing cluster of the Vrije Universiteit Amsterdam.
Funders | Funder number |
---|---|
AC-Immune | |
BAZIS | |
Cergentis BV | |
Alzheimer's Association | |
Alzheimer's Drug Discovery Foundation | LSHM20106, 73305095007 |
Eli Lilly and Company | |
Roche | |
Horizon 2020 Framework Programme | 860197, LSHM21018 |
EU Joint Programme – Neurodegenerative Disease Research | |
ZonMw | |
Nederlandse Organisatie voor Wetenschappelijk Onderzoek | |
Alzheimer Nederland |
Keywords
- Humans
- Proteome
- Brain
- Protein Transport
- Biological Transport
- Biomedical Research
- Cerebrospinal Fluid Proteins