BMC Medical Research Methodology (Mar 2022)

The effect of missing data and imputation on the detection of bias in cognitive testing using differential item functioning methods

  • E. Nichols,
  • J. A. Deal,
  • B. K. Swenor,
  • A. G. Abraham,
  • N. M. Armstrong,
  • K. Bandeen-Roche,
  • M. C. Carlson,
  • M. Griswold,
  • F. R. Lin,
  • T. H. Mosley,
  • P. Y. Ramulu,
  • N. S. Reed,
  • A. R. Sharrett,
  • A. L. Gross

DOI
https://doi.org/10.1186/s12874-022-01572-2
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Item response theory (IRT) methods for addressing differential item functioning (DIF) can detect group differences in responses to individual items (e.g., bias). IRT and DIF-detection methods have been used increasingly often to identify bias in cognitive test performance by characteristics (DIF grouping variables) such as hearing impairment, race, and educational attainment. Previous analyses have not considered the effect of missing data on inferences, although levels of missing cognitive data can be substantial in epidemiologic studies. Methods We used data from Visit 6 (2016–2017) of the Atherosclerosis Risk in Communities Neurocognitive Study (N = 3,580) to explicate the effect of artificially imposed missing data patterns and imputation on DIF detection. Results When missing data was imposed among individuals in a specific DIF group but was unrelated to cognitive test performance, there was no systematic error. However, when missing data was related to cognitive test performance and DIF group membership, there was systematic error in DIF detection. Given this missing data pattern, the median DIF detection error associated with 10%, 30%, and 50% missingness was -0.03, -0.08, and -0.14 standard deviation (SD) units without imputation, but this decreased to -0.02, -0.04, and -0.08 SD units with multiple imputation. Conclusions Incorrect inferences in DIF testing have downstream consequences for the use of cognitive tests in research. It is therefore crucial to consider the effect and reasons behind missing data when evaluating bias in cognitive testing.

Keywords