The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics

R. de Vlaming, P.J.F. Groenen

Research output: Contribution to JournalReview articleAcademicpeer-review

Abstract

In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i) the theoretical foundations of ridge regression, (ii) its link to commonly used methods in animal breeding, (iii) the computational feasibility, and (iv) the scope for constructing prediction models with nonlinear effects (e.g., dominance and epistasis). Based on a simulation study we gauge the current and future potential of ridge regression for prediction of human traits using genome-wide SNP data. We conclude that, for outcomes with a relatively simple genetic architecture, given current sample sizes in most cohorts (i.e., 푁 < 10,000) the predictive accuracy of ridge regression is slightly higher than the classical genomewide association study approach of repeated simple regression (i.e., one regression per SNP). However, both capture only a small proportion of the heritability. Nevertheless, we find evidence that for large-scale initiatives, such as biobanks, sample sizes can be achieved where ridge regression compared to the classical approach improves predictive accuracy substantially.
Original languageEnglish
Article number143712
Number of pages18
JournalBiomed research international
Volume2015
DOIs
Publication statusPublished - 2015

Fingerprint Dive into the research topics of 'The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics'. Together they form a unique fingerprint.

Cite this