TY - JOUR
T1 - Equating, or Correction for Between-Block Effects with Application to Body Fluid LC-MS and NMR Metabolomics Data Sets
AU - Draisma, H.H.M.
AU - Reijmers, Th.H.
AU - van der Kloet, F.
AU - Bobeldijk-Pastorova, I
AU - Spies-Faber, E.
AU - Vogels, J.
AU - Meulman, J.J.
AU - Boomsma, D.I.
AU - van der Greef, J.
AU - Hankemeier, T.
PY - 2010
Y1 - 2010
N2 - Combination of data sets from different objects (for example, from two groups of healthy volunteers from the same population) that were measured on a common set of variables (for example, metabolites or peptides) is desirable for statistical analysis in "omics" studies because it increases power. However, this type of combination is not directly possible if nonbiological systematic differences exist among the individual data sets, or "blocks". Such differences can, for example, be due to small analytical changes that are likely to accumulate over large time intervals between blocks of measurements. In this article we present a data transformation method, that we will refer to as "quantile equating", which per variable corrects for linear and nonlinear differences in distribution among blocks of semiquantitative data obtained with the same analytical method. We demonstrate the successful application of the quantile equating method to data obtained on two typical metabolomics platforms, i.e., liquid chromatography-mass spectrometry and nuclear magnetic resonance spectroscopy. We suggest uni- and multivariate methods to evaluate similarities and differences among data blocks before and after quantile equating. In conclusion, we have developed a method to correct for nonbiological systematic differences among semiquantitative data blocks and have demonstrated its successful application to metabolomics data sets. © 2010 American Chemical Society.
AB - Combination of data sets from different objects (for example, from two groups of healthy volunteers from the same population) that were measured on a common set of variables (for example, metabolites or peptides) is desirable for statistical analysis in "omics" studies because it increases power. However, this type of combination is not directly possible if nonbiological systematic differences exist among the individual data sets, or "blocks". Such differences can, for example, be due to small analytical changes that are likely to accumulate over large time intervals between blocks of measurements. In this article we present a data transformation method, that we will refer to as "quantile equating", which per variable corrects for linear and nonlinear differences in distribution among blocks of semiquantitative data obtained with the same analytical method. We demonstrate the successful application of the quantile equating method to data obtained on two typical metabolomics platforms, i.e., liquid chromatography-mass spectrometry and nuclear magnetic resonance spectroscopy. We suggest uni- and multivariate methods to evaluate similarities and differences among data blocks before and after quantile equating. In conclusion, we have developed a method to correct for nonbiological systematic differences among semiquantitative data blocks and have demonstrated its successful application to metabolomics data sets. © 2010 American Chemical Society.
U2 - 10.1021/ac902346a
DO - 10.1021/ac902346a
M3 - Article
SN - 0003-2700
VL - 82
SP - 1039
EP - 1046
JO - Analytical Chemistry
JF - Analytical Chemistry
IS - 3
ER -