Skip to main navigation Skip to search Skip to main content

Genome-wide analyses of vocabulary size in infancy and toddlerhood: Associations With Attention-Deficit/Hyperactivity Disorder, Literacy, and Cognition-Related Traits

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

BACKGROUND: The number of words children produce (expressive vocabulary) and understand (receptive vocabulary) changes rapidly during early development, partially due to genetic factors. Here, we performed a meta-genome-wide association study of vocabulary acquisition and investigated polygenic overlap with literacy, cognition, developmental phenotypes and neurodevelopmental conditions, including Attention-Deficit/Hyperactivity Disorder (ADHD).

METHODS: We studied 37,913 parent-reported vocabulary size measures (English, Dutch, Danish) for 17,298 European descent children. Meta-analyses were performed for early-phase expressive (infancy, 15-18 months), late-phase expressive (toddlerhood, 24-38 months) and late-phase receptive (toddlerhood, 24-38 months) vocabulary. Subsequently, we estimated Single-Nucleotide Polymorphism heritability (SNP-h 2) and genetic correlations (r g), and modelled underlying factor structures with multivariate models.

RESULTS: Early-life vocabulary size was modestly heritable (SNP-h 2: 0.08(SE=0.01) to 0.24(SE=0.03)). Genetic overlap between infant expressive and toddler receptive vocabulary was negligible (r g=0.07(SE=0.10)), although each measure was moderately related to toddler expressive vocabulary (r g=0.69(SE=0.14) and r g=0.67(SE=0.16), respectively), suggesting a multi-factorial genetic architecture. Both infant and toddler expressive vocabulary were genetically linked to literacy (e.g. spelling: r g=0.58(SE=0.20) and r g=0.79(SE=0.25), respectively), underlining genetic similarity. However, genetic association of early-life vocabulary with educational attainment and intelligence emerged in toddlerhood only (e.g. receptive vocabulary and intelligence: r g=0.36(SE=0.12)). Increased ADHD risk was genetically associated with larger infant expressive vocabulary (r g=0.23(SE=0.08)). Multivariate genetic models in the ALSPAC cohort confirmed this finding for ADHD symptoms (r g=0.54(SE=0.26)), but showed that the association effect reversed for toddler receptive vocabulary (r g=-0.74(SE=0.23)), highlighting developmental heterogeneity.

CONCLUSIONS: The genetic architecture of early-life vocabulary changes during development, shaping polygenic association patterns with later-life ADHD, literacy and cognition-related traits.

Original languageEnglish
Pages (from-to)859-869
Number of pages11
JournalBiological psychiatry
Volume95
Issue number9
Early online date7 Dec 2023
DOIs
Publication statusPublished - 1 May 2024

Bibliographical note

Copyright © 2023. Published by Elsevier Inc.

Funding

EVe, EEi, FS, SEF and BSTP were funded by the Max Planck Society. TSA was supported by the Novo Nordisk Foundation Grant NNF18OC0052457. ATM is supported by the National Health and Medical Research Council. CS was supported by an Australian Government NHMRC Postgraduate Research Scholarship. EH receives funding from the National Health and Medical Research Council Australia, Australian Research Council, Medical Research Future Fund, and Tour de Cure. MG is funded by a Miguel Servet II fellowship (CPII18/00018) awarded by the Spanish Institute of Health Carlos III. We acknowledge support from the Spanish Ministry of Science and Innovation and the State Research Agency through the “Centro de Excelencia Severo Ochoa 2019-2023” Program (CEX2018-000806-S), and support from the Generalitat de Catalunya through the CERCA Program. SH receives support from the UK National Institute for Health Research through the academic clinical fellowship scheme. KR is supported by a Sir Henry Wellcome Postdoctoral Fellowship. CYS and JLM are supported by the UK Medical Research Council (MRC) Integrative Epidemiology Unit at the University of Bristol (MC_UU_00032/02). OAA is supported by KG Jebsen Stiftelsen, Research Council of Norway (#223273, 273291, 324252). EY receives support from the Research Council of Norway (grant numbers 262177; 288083). PRN was supported by grants from the European Research Council (AdG SELECTionPREDISPOSED #293574), the Bergen Research Foundation (“Utilizing the Mother and Child Cohort and the Medical Birth Registry for Better Health”), Stiftelsen Kristian Gerhard Jebsen (Translational Medical Center), the University of Bergen, the Research Council of Norway (FRIPRO grant #240413), the Western Norway Regional Health Authority (Strategic Fund “Personalized Medicine for Children and Adults”), the Novo Nordisk Foundation (grant #54741), and the Norwegian Diabetes Association. CAMC receives support from the European Union’s Horizon 2020 Research and Innovation Programme under grant agreement No 848158 (EarlyCause Project). EV, EE, FS, SEF, and BSP were funded by the Max Planck Society. TSA was supported by the Novo Nordisk Foundation (Grant No. NNF18OC0052457). ATM is supported by the National Health and Medical Research Council. CS was supported by an Australian Government National Health and Medical Research Council Postgraduate Research Scholarship. EH receives funding from the National Health and Medical Research Council Australia, Australian Research Council, Medical Research Future Fund, and Tour de Cure. MB is funded by an NWO VICI grant (Grant No. VI.C.211.054) and an ERC consolidation grant (WELL-BEING 771057). MG is funded by a Miguel Servet II fellowship (CPII18/00018) awarded by the Spanish Institute of Health Carlos III. We acknowledge support from the grant CEX2018-000806-S funded by MCIN/AEI/10.13039/501100011033 and support from the Generalitat de Catalunya through the CERCA (Centres de Recerca de Catalunya) Program. SH receives support from the UK National Institute for Health Research through the academic clinical fellowship scheme. KR is supported by a Sir Henry Wellcome Postdoctoral Fellowship. CYS and JLM are supported by the UK Medical Research Council Integrative Epidemiology Unit at the University of Bristol (MC_UU_00032/02). OAA is supported by KG Jebsen Stiftelsen, Research Council of Norway (Grant No. 223273, 273291, 324252). EY receives support from Research Council of Norway (Grant Nos. 262177 and 288083). PRN was supported by grants from the European Research Council (AdG SELECTionPREDISPOSED #293574), the Bergen Research Foundation (“Utilizing the Mother and Child Cohort and the Medical Birth Registry for Better Health”), Stiftelsen Kristian Gerhard Jebsen (Translational Medical Center), the University of Bergen, the Western Norway Regional Health Authority (Strategic Fund “Personalized Medicine for Children and Adults”), Novo Nordisk Foundation Grant No. 54741, and the Norwegian Diabetes Association. CAMC receives support from the European Union's Horizon 2020 Research and Innovation Programme (Grant agreement No. 848158) (EarlyCause Project). BSP, EV, and the EAGLE Working Group report that the EAGLE Working Group is one of the named authors and that all members qualify for authorship. We thank all children, parents, and caregivers for making this study possible. The project was embedded within the EAGLE Consortium. Cohort-specific acknowledgments and funding information can be found in the Supplemental Note. In addition, we thank all cohorts and researchers that made their summary statistics available to us. This includes the GenLang Consortium, Social Science Genetic Association Consortium, Early Growth Genetics Consortium, EAGLE Consortium, Psychiatric Genomics Consortium, and the Danish Lundbeck Foundation Initiative for Integrative Psychiatric Research. A previous version of this article was published as a preprint on bioRxiv: https://www.biorxiv.org/content/10.1101/2022.06.01.494306. Derived single-trait (stage I) and multitrait (stage II) vocabulary summary statistics will be made available upon publication of the article via a data repository. EV and BSP had full access to all summary statistic-level data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. OAA is a consultant to HealthLytix. All other authors report no biomedical financial interests or potential conflicts of interest.

FundersFunder number
Australian Research Council
Sir Henry Wellcome Postdoctoral Fellowship
Nederlandse Organisatie voor Wetenschappelijk Onderzoek
Ministerio de Ciencia e Innovación
Generalitat de Catalunya
Diabetesforbundet
Miguel Servet II fellowship
Stiftelsen Kristian Gerhard Jebsen
National Institute for Health and Care Research
State Research Agency
Max-Planck-Gesellschaft
European Commission
Medical Research Future Fund
Australian Government National Health and Medical Research Council
National Health and Medical Research Council
Universitetet i Bergen
Helse Vest Regionalt Helseføretak
Bergens Forskningsstiftelse
Horizon 2020
Centro de Excelencia Severo Ochoa 2019-2023CEX2018-000806-S
Medical Research CouncilMR/M021475/1
Norges Forskningsråd262177, 288083
Tour de CureCPII18/00018
Horizon 2020 Framework Programme771057, 848158
KG Jebsen Stiftelsen, Research Council of Norway223273, 324252, 273291
FRIPRO240413
Personalized Medicine for Children and Adults54741
European Research Council293574, CPII18/00018
Novo Nordisk FondenNNF18OC0052457
University of BristolMC_UU_00032/02
Spanish Institute of Health Carlos IIIMCIN/AEI/10.13039/501100011033

    Cohort Studies

    • Netherlands Twin Register (NTR)

    Fingerprint

    Dive into the research topics of 'Genome-wide analyses of vocabulary size in infancy and toddlerhood: Associations With Attention-Deficit/Hyperactivity Disorder, Literacy, and Cognition-Related Traits'. Together they form a unique fingerprint.

    Cite this