Raters’ perceptions of rating scales criteria and its effect on the process and outcome of their rating

Nasim Heidari; Nasim Ghanbari; Abbas Abbasi

doi:10.1186/s40468-022-00168-3

Language Testing in Asia (Aug 2022)

Raters’ perceptions of rating scales criteria and its effect on the process and outcome of their rating

Nasim Heidari,
Nasim Ghanbari,
Abbas Abbasi

Affiliations

Nasim Heidari: Department of English Language and Literature, Faculty of Humanities, Persian Gulf University
Nasim Ghanbari: Department of English Language and Literature, Faculty of Humanities, Persian Gulf University
Abbas Abbasi: Department of English Language and Literature, Faculty of Humanities, Persian Gulf University

DOI: https://doi.org/10.1186/s40468-022-00168-3
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 19

Abstract

Read online

Abstract It is widely believed that human rating performance is influenced by an array of different factors. Among these, rater-related variables such as experience, language background, perceptions, and attitudes have been mentioned. One of the important rater-related factors is the way the raters interact with the rating scales. In particular, how raters perceive the components of the scales to further plan their scoring seems important. For this aim, the present study investigated the raters’ perceptions of the rating scales and their subsequent rating behaviors for two analytic and holistic rating scales. Hence, nine highly experienced raters were asked to verbalize their thoughts while rating student essays using IELTS holistic scale and the analytic scale of ESL Composition Profile. Upon analyzing the think-aloud protocols, four themes emerged. The findings showed that when rating holistically, the raters either referred to the holistic scale components to validate their ratings (validation) or had a pre-evaluation reading to rate in a more reliable way (dominancy). In analytic rating, on the other hand, the raters used a pre-evaluation scale reading in order to keep the components and their criteria to memory to evaluate the text more accurately (dominancy) or regularly moved between the text and the scale components to assign a score (oscillation). Furthermore, the results of a Wilcoxon signed-rank test showed that when using the holistic and analytic rating scales, the raters assigned significantly different scores to the texts. On the whole, the results revealed that the way the raters perceived the scale components will affect their judgement of the texts. The study also provides several implications for rater training programs and EFL writing assessment.

Published in Language Testing in Asia

ISSN: 2229-0443 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Language and Literature
Website: https://languagetestingasia.springeropen.com/

About the journal

Abstract

Keywords