GMS Journal for Medical Education (Jun 2024)

Measuring personal characteristics in applicants to German medical schools: Piloting an online Situational Judgement Test with an open-ended response format

  • Knorr, Mirjana,
  • Mielke, Ina,
  • Amelung, Dorothee,
  • Safari, Mahla,
  • Gröne, Oana R.,
  • Breil, Simon M.,
  • MacIntosh, Alexander

DOI
https://doi.org/10.3205/zma001685
Journal volume & issue
Vol. 41, no. 3
p. Doc30

Abstract

Read online

Objectives: Situational Judgement Tests (SJT) are a cost-efficient method for the assessment of personal characteristics (e.g., empathy, professionalism, ethical thinking) in medical school admission. Recently, complex open-ended response format SJTs have become more feasible to conduct. However, research on their applicability to a German context is missing. This pilot study tests the acceptability, reliability, subgroup differences, and validity of an online SJT with open-ended response format developed in Canada (“Casper”).Methods: German medical school applicants and students from Hamburg were invited to take Casper in 2020 and 2021. The test consisted of 12 video- and text-based scenarios, each followed by three open-ended questions. Participants subsequently evaluated their test experience in an online survey. Data on sociodemographic characteristics, other admission criteria (Abitur, TMS, HAM-Nat, HAM-SJT) and study success (OSCE) was available in a central research database (stav). Results: The full sample consisted of 582 participants. Test-takers’ global perception of Casper was positive. Internal consistency was satisfactory in both years (=0.73; 0.82) while interrater agreement was moderate (ICC(1,2)=0.54). Participants who were female (=0.37) or did not have a migration background (=0.40) received higher scores. Casper scores correlated with HAM-SJT (=.18) but not with OSCE communication stations performance. The test was also related to Abitur grades (=-.15), the TMS (=.18), and HAM-Nat logical reasoning scores (=.23). Conclusion: This study provides positive evidence for the acceptability, internal consistency, and convergent validity of Casper. The selection and training of raters as well as the scenario content require further observation and adjustments to a German context to improve interrater reliability and predictive validity.

Keywords