Evaluating Inter-Rater Reliability: Transitioning to a Single Rater for Marking Modified Essay Questions in Undergraduate Medical Education

Shahid Hassan; Malanashita Ganeson; Ismail Abdul Sattar Burud

doi:10.18502/acta.v62i2.17040

Acta Medica Iranica (Nov 2024)

Evaluating Inter-Rater Reliability: Transitioning to a Single Rater for Marking Modified Essay Questions in Undergraduate Medical Education

Shahid Hassan,
Malanashita Ganeson,
Ismail Abdul Sattar Burud

Affiliations

Shahid Hassan: School of Medicine, American University of Barbados, Bridgetown, Barbados.
Malanashita Ganeson: Department of Family Medicine, Kualalumpur, Malaysia.
Ismail Abdul Sattar Burud: Department of Surgery, School of Medicine, International Medical University, Kuala Lumpur, Malaysia

DOI: https://doi.org/10.18502/acta.v62i2.17040
Journal volume & issue: Vol. 62, no. 2

Abstract

Read online

Modified Essay Questions (MEQs) are often included in high-stakes examinations to assess higher-order cognitive skills. If the marking guides for MEQs are inadequate, this can lead to inconsistencies in marking. To ensure the reliability of MEQs as a subjective assessment tool, candidates’ responses are typically evaluated by two or more assessors. Previous studies have examined the impact of marker variance. Current study explores the possibility of assigning a single assessor to mark the students' performances in MEQ based on statistically drawn evidence in the clinical phase of the MBBS program at a private medical school in Malaysia. A robust evaluation method was employed to determine whether to continue with two raters or shift to a single-rater scheme for MEQs, using the Discrepancy-Agreement Grading (DAG) System for evaluation. A low standard deviation was observed across all 11 pairs of scores, with insignificant t-statistics (P>0.05) in 2 pairs (18.18%) and significant t-statistics (P0.8), 7 pairs (63.63%) as having a moderate effect size (0.5-<0.8), and 3 pairs (27.27%) as having a weak effect size (0.2-<0.5). The data analysis suggests that it is feasible to consider marking MEQ items by a single assessor without negatively impacting the reliability of the MEQ as an assessment tool.

Published in Acta Medica Iranica

ISSN: 0044-6025 (Print); 1735-9694 (Online)
Publisher: Tehran University of Medical Sciences
Country of publisher: Iran, Islamic Republic of
LCC subjects: Medicine: Medicine (General)
Website: http://acta.tums.ac.ir/

About the journal

Abstract

Keywords