GMS Journal for Medical Education (May 2016)

An algorithm for calculating exam quality as a basis for performance-based allocation of funds at medical schools

  • Kirschstein, Timo,
  • Wolters, Alexander,
  • Lenz, Jan-Hendrik,
  • Fröhlich, Susanne,
  • Hakenberg, Oliver,
  • Kundt, Günther,
  • Darmüntzel, Martin,
  • Hecker, Michael,
  • Altiner, Attila,
  • Müller-Hilke, Brigitte

DOI
https://doi.org/10.3205/zma001043
Journal volume & issue
Vol. 33, no. 3
p. Doc44

Abstract

Read online

Objective: The amendment of the Medical Licensing Act (ÄAppO) in Germany in 2002 led to the introduction of graded assessments in the clinical part of medical studies. This, in turn, lent new weight to the importance of written tests, even though the minimum requirements for exam quality are sometimes difficult to reach. Introducing exam quality as a criterion for the award of performance-based allocation of funds is expected to steer the attention of faculty members towards more quality and perpetuate higher standards. However, at present there is a lack of suitable algorithms for calculating exam quality.Methods: In the spring of 2014, the students‘ dean commissioned the „core group“ for curricular improvement at the University Medical Center in Rostock to revise the criteria for the allocation of performance-based funds for teaching. In a first approach, we developed an algorithm that was based on the results of the most common type of exam in medical education, multiple choice tests. It included item difficulty and discrimination, reliability as well as the distribution of grades achieved. Results: This algorithm quantitatively describes exam quality of multiple choice exams. However, it can also be applied to exams involving short assay questions and the OSCE. It thus allows for the quantitation of exam quality in the various subjects and – in analogy to impact factors and third party grants – a ranking among faculty. Conclusion: Our algorithm can be applied to all test formats in which item difficulty, the discriminatory power of the individual items, reliability of the exam and the distribution of grades are measured. Even though the content validity of an exam is not considered here, we believe that our algorithm is suitable as a general basis for performance-based allocation of funds.

Keywords