BMC Medical Research Methodology (Nov 2024)

Evidence pointing toward invalidity of the SF-8 physical and mental scales: a fusion validity assessment

  • Leslie A. Hayduk,
  • Matthias Hoben,
  • Carole Estabrooks

DOI
https://doi.org/10.1186/s12874-024-02387-z
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 14

Abstract

Read online

Abstract Background The SF-8™ Short Form Health Survey creates physical and mental health scale scores from responses to eight survey questions. These widely used scales demonstrate reasonable reliablity, and some forms of validity but have not been assessed for fusion validity. We assess the fusion validity of the SF-8 physical and mental health scales, and provide comments assisting fusion validity assessment of other scales. Methods Checking the fusion validity of a scale requires including the scale and its constituent indicators in a structural equation model that has at least one variable causally downstream from the scale. We assessed fusion validity of the SF-8 physical and mental health scales in the context of work-related variables for care aides working in Canadian long-term care homes. Variables causally downstream from physical and mental health, such as work burnout, permit checking whether the SF-8 indicator items fuse to form cogent physical and mental scales, irrespective of whether those indicators share common-factor foundations. Results We found that the SF-8 physical and mental health scales did not function appropriately. The scales inappropriately claimed effects for several items that had no effects and provided biased estimates of other effects. These deficiencies seem grounded in the scales’ developmental history, which implicitly bolstered selection of some causally ambiguous items and paid insufficient attention to component factor model testing. Conclusion Our observations of causal incongruities question whether the SF-8 can provide valid assessments of physical and mental health. However, it would be imprudent to discontinue SF-8 use on the basis of a single study suggesting invalidity. This uncomfortable conclusion can be rechecked by re-analyzing data from any project that employed the SF-8 and recorded even one causal consequence of physical or mental health. The power of fusion validity assessment comes from connecting the recorded consequences simultaneously to both the scale and the items from which that scale is calculated.

Keywords