BMC Medical Education (May 2020)

Standard setting made easy: validating the Equal Z-score (EZ) method for setting cut-score for clinical examinations

  • Boaz Shulruf,
  • Ying-Ying Yang,
  • Pin-Hsiang Huang,
  • Ling-Yu Yang,
  • Chin-Chou Huang,
  • Chia-Chang Huang,
  • Chih-Wei Liu,
  • Shiau-Shian Huang,
  • Chen-Huan Chen,
  • Fa-Yauh Lee,
  • Shou-Yen Kao

DOI
https://doi.org/10.1186/s12909-020-02080-x
Journal volume & issue
Vol. 20, no. 1
pp. 1 – 9

Abstract

Read online

Abstracts Background This study aims to assess the feasibility, reliability and validity of the panel-based Equal Z-score (EZ) method applied to objective structural clinical examination (OSCE) of Chinese medical students and undertaking a comparison with the statistical techniques-based Borderline Regression Method (BRM). Methods Data received from two cohorts of 6th and 7th year medical students in Taiwan who set the mock OSCE as a formative assessment. Traditionally this medical school uses BRM to set the pass/fail cut-score. For the current study, 31 OSCE panellists volunteered to participate in the EZ method in parallel to the BRM. Results In the conduct of this study, each panel completed this task for an OSCE exam comprising 12 stations within less than 60 min. Moreover, none of the 31 panellists, whose are busy clinicians, had indicated that the task was too difficult or too time-consuming. Although EZ method yielded higher cut-scores than the BRM it was found reliable. Intraclass correlation (ICC) measuring absolute agreement, across the three groups of panellists was .893 and .937 for the first and second rounds respectively, demonstrating high level of agreement across groups with the EZ method and the alignment between the BRM and the EZ method was visually observed. The paired t-test results identified smaller differences between the cut-scores within methods than across methods. Conclusions Overall this study suggests that the EZ method is a feasible, reliable and valid standard setting method. The EZ method requires relatively little resources (takes about an hour to assess a 12 station OSCE); the calculation of the cut-score is simple and requires basic statistical skills; it is highly reliable even when only 10 panellists participate in the process; and its validity is supported by comparison to BRM. This study suggests that the EZ method is a feasible, reliable and valid standard setting method.

Keywords