Transactions of the Association for Computational Linguistics (Jul 2020)

Machine Learning–Driven Language Assessment

  • Settles, Burr,
  • T. LaFlair, Geoffrey,
  • Hagiwara, Masato

DOI
https://doi.org/10.1162/tacl_a_00310
Journal volume & issue
Vol. 8
pp. 247 – 263

Abstract

Read online

We describe a method for rapidly creating language proficiency assessments, and provide experimental evidence that such tests can be valid, reliable, and secure. Our approach is the first to use machine learning and natural language processing to induce proficiency scales based on a given standard, and then use linguistic models to estimate item difficulty directly for computer-adaptive testing. This alleviates the need for expensive pilot testing with human subjects. We used these methods to develop an online proficiency exam called the Duolingo English Test, and demonstrate that its scores align significantly with other high-stakes English assessments. Furthermore, our approach produces test scores that are highly reliable, while generating item banks large enough to satisfy security requirements.