TY - JOUR
T1 - Reflections on univariate and multivariate analysis of metabolomics data
AU - Saccenti, Edoardo
AU - Hoefsloot, Huub C.J.
AU - Smilde, Age K.
AU - Westerhuis, Johan A.
AU - Hendriks, Margriet M.W.B.
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Metabolomics experiments usually result in a large quantity of data. Univariate and multivariate analysis techniques are routinely used to extract relevant information from the data with the aim of providing biological knowledge on the problem studied. Despite the fact that statistical tools like the t test, analysis of variance, principal component analysis, and partial least squares discriminant analysis constitute the backbone of the statistical part of the vast majority of metabolomics papers, it seems that many basic but rather fundamental questions are still often asked, like: Why do the results of univariate and multivariate analyses differ? Why apply univariate methods if you have already applied a multivariate method? Why if I do not see something univariately I see something multivariately? In the present paper we address some aspects of univariate and multivariate analysis, with the scope of clarifying in simple terms the main differences between the two approaches. Applications of the t test, analysis of variance, principal component analysis and partial least squares discriminant analysis will be shown on both real and simulated metabolomics data examples to provide an overview on fundamental aspects of univariate and multivariate methods.
AB - Metabolomics experiments usually result in a large quantity of data. Univariate and multivariate analysis techniques are routinely used to extract relevant information from the data with the aim of providing biological knowledge on the problem studied. Despite the fact that statistical tools like the t test, analysis of variance, principal component analysis, and partial least squares discriminant analysis constitute the backbone of the statistical part of the vast majority of metabolomics papers, it seems that many basic but rather fundamental questions are still often asked, like: Why do the results of univariate and multivariate analyses differ? Why apply univariate methods if you have already applied a multivariate method? Why if I do not see something univariately I see something multivariately? In the present paper we address some aspects of univariate and multivariate analysis, with the scope of clarifying in simple terms the main differences between the two approaches. Applications of the t test, analysis of variance, principal component analysis and partial least squares discriminant analysis will be shown on both real and simulated metabolomics data examples to provide an overview on fundamental aspects of univariate and multivariate methods.
KW - Consistency at large
KW - Hypothesis testing
KW - Multiple test correction
KW - Multivariate analysis
KW - Overfitting
KW - Univariate analysis
UR - http://www.scopus.com/inward/record.url?scp=84898548748&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84898548748&partnerID=8YFLogxK
U2 - 10.1007/s11306-013-0598-6
DO - 10.1007/s11306-013-0598-6
M3 - Review article
AN - SCOPUS:84898548748
VL - 10
SP - 361
EP - 374
JO - Metabolomics
JF - Metabolomics
SN - 1573-3882
IS - 3
ER -