São Paulo Medical Journal (Jul 2024)

Reliability across content areas in progress tests assessing medical knowledge: a Brazilian cross-sectional study with implications for medical education assessments

  • Pedro Tadao Hamamoto Filho,
  • Miriam Hashimoto,
  • Alba Regina de Abreu Lima,
  • Leandro Arthur Diehl,
  • Neide Tomimura Costa,
  • Patrícia Moretti Rehder,
  • Samira Yarak,
  • Maria Cristina de Andrade,
  • Maria de Lourdes Marmorato Botta Hafner,
  • Zilda Maria Tosta Ribeiro,
  • Júlio César Moriguti,
  • Angélica Maria Bicudo

DOI
https://doi.org/10.1590/1516-3180.2023.0291.r1.13052024
Journal volume & issue
Vol. 142, no. 6

Abstract

Read online Read online

ABSTRACT BACKGROUND: Brazilian medical schools equitably divide their medical education assessments into five content areas: internal medicine, surgery, pediatrics, obstetrics and gynecology, and public health. However, this division does not follow international patterns and may threaten the examinations’ reliability and validity. OBJECTIVE: To assess the reliability indices of the content areas of serial, cross-institutional progress test examinations. DESIGN AND SETTINGS: This was an analytical, observational, and cross-sectional study conducted at nine public medical schools (mainly from the state of São Paulo) with progress test examinations conducted between 2017 and 2023. METHODS: The examinations covered the areas of basic sciences, internal medicine, surgery, pediatrics, obstetrics and gynecology, and public health. We calculated reliability indices using Cronbach’s α, which indicates the internal consistency of a test. We used simple linear regressions to analyze temporal trends. RESULTS: The results showed that the Cronbach’s α for basic sciences and internal medicine presented lower values, whereas gynecology, obstetrics, and public health presented higher values. After changes in the number of items and the exclusion of basic sciences as a separate content area, internal medicine ranked highest in 2023. Individually, all content areas except pediatrics remained stable over time. CONCLUSIONS: Maintaining an equitable division in assessment content may lead to suboptimal results in terms of assessment reliability, especially for internal medicine. Therefore, content sampling of medical knowledge for general assessments should be reappraised.

Keywords