Frontiers in Psychology (May 2014)

Group Membership Prediction when Known Groups Consist of Unknown Subgroups:A Monte Carlo Comparison of Methods

  • Holmes eFinch,
  • Jocelyn E Bolin,
  • Ken eKelley

DOI
https://doi.org/10.3389/fpsyg.2014.00337
Journal volume & issue
Vol. 5

Abstract

Read online

Classification using standard statistical methods such as linear discriminant analysis (LDA) or logistic regression (LR) presume knowledge of group membership prior to the development of an algorithm for prediction. However, in many real world applications members of the same nominal group, might in fact come from different subpopulations on the underlying construct. For example, individuals diagnosed with depression will not all have the same levels of this disorder, though for the purposes of LDA or LR they will be treated in the same manner. The goal of this simulation study was to examine the performance of several methods for group classification in the case where within group membership was not homogeneous. For example, suppose there are 3 known groups but within each group two unknown classes. Several approaches were compared, including LDA, LR, Classification and Regression Trees (CART), Generalized Additive Models (GAM), and mixture discriminant analysis (MIXDA). Results of the study indicated that CART and mixture discriminant analysis were the most effective tools for situations in which known groups were not homogeneous, whereas LDA, LR and GAM had the highest rates of misclassification. Implications of these results for theory and practice are discussed.

Keywords