Abstract
Background: Information on long-term alcohol consumption is relevant for medical and public health research, disease therapy, and other areas. Recently, DNA methylation-based inference of alcohol consumption from blood was reported with high accuracy, but these results were based on employing the same dataset for model training and testing, which can lead to accuracy overestimation. Moreover, only subsets of alcohol consumption categories were used, which makes it impossible to extrapolate such models to the general population. By using data from eight population-based European cohorts (N = 4677), we internally and externally validated the previously reported biomarkers and models for epigenetic inference of alcohol consumption from blood and developed new models comprising all data from all categories. Results: By employing data from six European cohorts (N = 2883), we empirically tested the reproducibility of the previously suggested biomarkers and prediction models via ten-fold internal cross-validation. In contrast to previous findings, all seven models based on 144-CpGs yielded lower mean AUCs compared to the models with less CpGs. For instance, the 144-CpG heavy versus non-drinkers model gave an AUC of 0.78 ± 0.06, while the 5 and 23 CpG models achieved 0.83 ± 0.05, respectively. The transportability of the models was empirically tested via external validation in three independent European cohorts (N = 1794), revealing high AUC variance between datasets within models. For instance, the 144-CpG heavy versus non-drinkers model yielded AUCs ranging from 0.60 to 0.84 between datasets. The newly developed models that considered data from all categories showed low AUCs but gave low AUC variation in the external validation. For instance, the 144-CpG heavy and at-risk versus light and non-drinkers model achieved AUCs of 0.67 ± 0.02 in the internal cross-validation and 0.61–0.66 in the external validation datasets. Conclusions: The outcomes of our internal and external validation demonstrate that the previously reported prediction models suffer from both overfitting and accuracy overestimation. Our results show that the previously proposed biomarkers are not yet sufficient for accurate and robust inference of alcohol consumption from blood. Overall, our findings imply that DNA methylation prediction biomarkers and models need to be improved considerably before epigenetic inference of alcohol consumption from blood can be considered for practical applications.
Original language | English |
---|---|
Article number | 198 |
Pages (from-to) | 1-13 |
Number of pages | 13 |
Journal | Clinical epigenetics |
Volume | 13 |
Early online date | 26 Oct 2021 |
DOIs | |
Publication status | Published - Dec 2021 |
Bibliographical note
Funding Information:This work was performed within the framework of the BBMRI Metabolomics Consortium funded by BBMRI-NL, a research infrastructure financed by the Dutch government (NWO 184.021.007 and 184.033.111). A full list of the BIOS consortium investigators is available in Additional file . SCEM, AV, MG, and MK were supported by the Erasmus MC University Medical Center Rotterdam. AV was additionally supported with an EUR Fellowship by Erasmus University Rotterdam. Detailed cohort specific funding are included in the Supplementary Methods (Additional file ). The researchers are independent from the funders. The study sponsors had no role in the study design, data collection, data analysis, interpretation of data and preparation, review or approval of the manuscript.
Funding Information:
HJG has received travel grants and speakers honoraria from Fresenius Medical Care, Neuraxpharm, Servier and Janssen Cilag as well as research funding from Fresenius Medical Care.
Funding Information:
The authors are grateful to the participants of the included cohorts; the Rotterdam Study (http://www.erasmus-epidemiology.nl/research/ergo.htm), the CODAM study (http://www.carimmaastricht.nl/), the Netherlands Twin Registry (http://www.tweelingenregister.org), the Leiden Longevity Study (http://www.leidenlangleven.nl), the PAN study (http://www.alsonderzoek.nl/), the KORA Study (https://www.helmholtzmuenchen.de/en/kora/index.html), SHIP-Trend (https://ship.community-medicine.de/) and TwinsUK (https://twinsuk.ac.uk/). Detailed cohort specific acknowledgments are included in Additional file 1 : Supplementary Methods.
Publisher Copyright:
© 2021, The Author(s).
Funding
This work was performed within the framework of the BBMRI Metabolomics Consortium funded by BBMRI-NL, a research infrastructure financed by the Dutch government (NWO 184.021.007 and 184.033.111). A full list of the BIOS consortium investigators is available in Additional file . SCEM, AV, MG, and MK were supported by the Erasmus MC University Medical Center Rotterdam. AV was additionally supported with an EUR Fellowship by Erasmus University Rotterdam. Detailed cohort specific funding are included in the Supplementary Methods (Additional file ). The researchers are independent from the funders. The study sponsors had no role in the study design, data collection, data analysis, interpretation of data and preparation, review or approval of the manuscript. HJG has received travel grants and speakers honoraria from Fresenius Medical Care, Neuraxpharm, Servier and Janssen Cilag as well as research funding from Fresenius Medical Care. The authors are grateful to the participants of the included cohorts; the Rotterdam Study (http://www.erasmus-epidemiology.nl/research/ergo.htm), the CODAM study (http://www.carimmaastricht.nl/), the Netherlands Twin Registry (http://www.tweelingenregister.org), the Leiden Longevity Study (http://www.leidenlangleven.nl), the PAN study (http://www.alsonderzoek.nl/), the KORA Study (https://www.helmholtzmuenchen.de/en/kora/index.html), SHIP-Trend (https://ship.community-medicine.de/) and TwinsUK (https://twinsuk.ac.uk/). Detailed cohort specific acknowledgments are included in Additional file 1 : Supplementary Methods.
Funders | Funder number |
---|---|
BBMRI-NL | |
Dutch Government | |
Erasmus MC University Medical Center Rotterdam | |
Leiden Longevity Study | |
TwinsUK | |
Fresenius Medical Care North America | |
Nederlandse Organisatie voor Wetenschappelijk Onderzoek | 184.033.111, 184.021.007 |
Nederlandse Organisatie voor Wetenschappelijk Onderzoek |
Keywords
- Alcohol inference
- Blood
- DNA methylation
- Epigenetics
- Inference
- Prediction
Fingerprint
Dive into the research topics of 'Validating biomarkers and models for epigenetic inference of alcohol consumption from blood'. Together they form a unique fingerprint.Datasets
-
Additional file 7 of Validating biomarkers and models for epigenetic inference of alcohol consumption from blood
Costeira, R. (Contributor), van der Kallen, C. J. H. (Contributor), van Meurs, J. B. J. (Contributor), Grabe, H. J. (Contributor), Beekman, M. (Contributor), Waldenberger, M. (Contributor), Wilson, R. (Contributor), van Heemst, D. (Contributor), Vidaki, A. (Contributor), V?lker, U. (Contributor), van Dongen, J. (Contributor), Slagboom, P. E. (Contributor), Ikram, M. A. (Contributor), Kunze, S. (Contributor), Ladwig, K. (Contributor), van den Berg, L. H. (Contributor), Kayser, M. (Contributor), Ghanbari, M. (Contributor), Uitterlinden, A. G. (Contributor), Teumer, A. (Contributor), Voortman, T. (Contributor), Bell, J. T. (Contributor), Peters, A. (Contributor), Boomsma, D. (Contributor), Maas, S. C. E. (Contributor) & V?lzke, H. (Contributor), figshare Academic Research System, 1 Jan 2021
DOI: 10.6084/m9.figshare.16880266.v1, https://springernature.figshare.com/articles/dataset/Additional_file_7_of_Validating_biomarkers_and_models_for_epigenetic_inference_of_alcohol_consumption_from_blood/16880266/1
Dataset
-
Additional file 3 of Validating biomarkers and models for epigenetic inference of alcohol consumption from blood
Vidaki, A. (Contributor), van Dongen, J. (Contributor), Maas, S. C. E. (Contributor), van Heemst, D. (Contributor), van Meurs, J. B. J. (Contributor), Wilson, R. (Contributor), Costeira, R. (Contributor), Boomsma, D. (Contributor), Grabe, H. J. (Contributor), van der Kallen, C. J. H. (Contributor), Uitterlinden, A. G. (Contributor), Ladwig, K. (Contributor), Teumer, A. (Contributor), Waldenberger, M. (Contributor), Kayser, M. (Contributor), Ghanbari, M. (Contributor), Slagboom, P. E. (Contributor), Voortman, T. (Contributor), van den Berg, L. H. (Contributor), Peters, A. (Contributor), V?lzke, H. (Contributor), Ikram, M. A. (Contributor), Bell, J. T. (Contributor), Beekman, M. (Contributor), Kunze, S. (Contributor) & V?lker, U. (Contributor), figshare Academic Research System, 1 Jan 2021
DOI: 10.6084/m9.figshare.16880254.v1, https://springernature.figshare.com/articles/dataset/Additional_file_3_of_Validating_biomarkers_and_models_for_epigenetic_inference_of_alcohol_consumption_from_blood/16880254/1
Dataset
-
Additional file 2 of Validating biomarkers and models for epigenetic inference of alcohol consumption from blood
Uitterlinden, A. G. (Contributor), Slagboom, P. E. (Contributor), Kayser, M. (Contributor), Boomsma, D. (Contributor), van Dongen, J. (Contributor), Maas, S. C. E. (Contributor), Ladwig, K. (Contributor), Kunze, S. (Contributor), Wilson, R. (Contributor), Costeira, R. (Contributor), van Meurs, J. B. J. (Contributor), Teumer, A. (Contributor), Bell, J. T. (Contributor), Voortman, T. (Contributor), Waldenberger, M. (Contributor), Ikram, M. A. (Contributor), van den Berg, L. H. (Contributor), V?lker, U. (Contributor), Peters, A. (Contributor), van Heemst, D. (Contributor), van der Kallen, C. J. H. (Contributor), V?lzke, H. (Contributor), Vidaki, A. (Contributor), Grabe, H. J. (Contributor), Ghanbari, M. (Contributor) & Beekman, M. (Contributor), figshare Academic Research System, 1 Jan 2021
DOI: 10.6084/m9.figshare.16880251.v1, https://springernature.figshare.com/articles/dataset/Additional_file_2_of_Validating_biomarkers_and_models_for_epigenetic_inference_of_alcohol_consumption_from_blood/16880251/1
Dataset