Machine Learning–Driven Language Assessment

Settles, Burr; T. LaFlair, Geoffrey; Hagiwara, Masato

doi:10.1162/tacl_a_00310

Transactions of the Association for Computational Linguistics (Jul 2020)

Machine Learning–Driven Language Assessment

Settles, Burr,
T. LaFlair, Geoffrey,
Hagiwara, Masato

Affiliations

Settles, Burr
T. LaFlair, Geoffrey
Hagiwara, Masato

DOI: https://doi.org/10.1162/tacl_a_00310
Journal volume & issue: Vol. 8
pp. 247 – 263

Abstract

Read online

We describe a method for rapidly creating language proficiency assessments, and provide experimental evidence that such tests can be valid, reliable, and secure. Our approach is the first to use machine learning and natural language processing to induce proficiency scales based on a given standard, and then use linguistic models to estimate item difficulty directly for computer-adaptive testing. This alleviates the need for expensive pilot testing with human subjects. We used these methods to develop an online proficiency exam called the Duolingo English Test, and demonstrate that its scores align significantly with other high-stakes English assessments. Furthermore, our approach produces test scores that are highly reliable, while generating item banks large enough to satisfy security requirements.

Published in Transactions of the Association for Computational Linguistics

ISSN: 2307-387X (Online)
Publisher: The MIT Press
Country of publisher: United States
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing
Website: https://direct.mit.edu/tacl

About the journal