Principal Covariates Clusterwise Regression (PCCR): Accounting for Multicollinearity and Population Heterogeneity in Hierarchically Organized Data

Tom Frans Wilderjans*, Eva Vande Gaer, Henk A.L. Kiers, Iven Van Mechelen, Eva Ceulemans

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

In the behavioral sciences, many research questions pertain to a regression problem in that one wants to predict a criterion on the basis of a number of predictors. Although in many cases, ordinary least squares regression will suffice, sometimes the prediction problem is more challenging, for three reasons: first, multiple highly collinear predictors can be available, making it difficult to grasp their mutual relations as well as their relations to the criterion. In that case, it may be very useful to reduce the predictors to a few summary variables, on which one regresses the criterion and which at the same time yields insight into the predictor structure. Second, the population under study may consist of a few unknown subgroups that are characterized by different regression models. Third, the obtained data are often hierarchically structured, with for instance, observations being nested into persons or participants within groups or countries. Although some methods have been developed that partially meet these challenges (i.e., principal covariates regression (PCovR), clusterwise regression (CR), and structural equation models), none of these methods adequately deals with all of them simultaneously. To fill this gap, we propose the principal covariates clusterwise regression (PCCR) method, which combines the key idea’s behind PCovR (de Jong & Kiers in Chemom Intell Lab Syst 14(1–3):155–164, 1992) and CR (Späth in Computing 22(4):367–373, 1979). The PCCR method is validated by means of a simulation study and by applying it to cross-cultural data regarding satisfaction with life.

Original languageEnglish
Pages (from-to)86-111
Number of pages26
JournalPsychometrika
Volume82
Issue number1
DOIs
Publication statusPublished - 1 Mar 2017
Externally publishedYes

Keywords

  • clusterwise regression
  • component analysis
  • hierarchically organized data
  • multicollinearity
  • population heterogeneity

Fingerprint Dive into the research topics of 'Principal Covariates Clusterwise Regression (PCCR): Accounting for Multicollinearity and Population Heterogeneity in Hierarchically Organized Data'. Together they form a unique fingerprint.

  • Cite this