The Comparison of Two Models for Evaluation of Pre-internship Comprehensive Test: Classical and Latent Trait

Hadi Abbasi; Mohammad Reza Falsafinejad; Ali Delavar; Noor Ali Farrokhi; Mohammad Ali Mohagheghi

مجله ایرانی آموزش در علوم پزشکی (Jun 2013)

The Comparison of Two Models for Evaluation of Pre-internship Comprehensive Test: Classical and Latent Trait

Hadi Abbasi,
Mohammad Reza Falsafinejad,
Ali Delavar,
Noor Ali Farrokhi,
Mohammad Ali Mohagheghi

Affiliations

Hadi Abbasi
Mohammad Reza Falsafinejad
Ali Delavar
Noor Ali Farrokhi
Mohammad Ali Mohagheghi

Journal volume & issue: Vol. 13, no. 3
pp. 167 – 178

Abstract

Read online

Introduction: Despite the widespread use of pre-internship comprehensive test and its importance in medical students’ assessment, there is a paucity of the studies that can provide a systematic psychometric analysis of the items of this test. Thus, the present study sought to assess March 2011 pre-internship test using classical and latent trait models and compare their results. Methods: In this cross-sectional descriptive research, item analysis based on classical model was conducted by calculation of difficulty and discrimination coefficients, estimation of reliability by Cronbach’s alpha coefficient method and distracters analysis by comparing distracter proportions using EXCEL software. NOHARM4 software was also used to assess test dimensionality. To assess items parameters (difficulty, discrimination, guessing, information functions, and standard error of measurement) latent trait models and BILOG-MG3 software were used. Results: According to the classical test theory, 30 items (15%) were in the acceptable range of difficulty and discrimination coefficients. Test reliability coefficient was 0.913. Seventy three items (36.5%) had problems with the distracters. The analysis of test dimensionality indicated that the test was unidimensional. Using Three-Parameter Logistic Model, we obtained the mean and standard deviation of items parameters, i.e. item difficulty (0.321, 1.874), item discrimination (1.021, 0.666), and item guessing (0.209, 0.082). Maximum test information function was between 1.0 to 2.8 ability levels, and the test had lower information compared to the higher levels of ability at cutting scores. Conclusion: The findings indicate that analyses of latent trait models can be used to overcome the limitations of the classical test theory.

Published in مجله ایرانی آموزش در علوم پزشکی

ISSN: 1608-9359 (Print); 1735-8892 (Online)
Publisher: Isfahan University of Medical Sciences, Medical Education Development Center
Country of publisher: Iran, Islamic Republic of
LCC subjects: Medicine: Medicine (General)
Website: http://ijme.mui.ac.ir/index.php?&slct_pg_id=10&sid=1&slc_lang=en

About the journal

Abstract

Keywords