Languages (Jan 2024)

The Processing of Multiword Units by Learners of English: Evidence from Pause Placement in Writing Process Data

  • Gaëtanelle Gilquin

DOI
https://doi.org/10.3390/languages9020051
Journal volume & issue
Vol. 9, no. 2
p. 51

Abstract

Read online

Different methods and sources of information have been proposed in the literature to study the processing of language and, in particular, instances of formulaic language such as multiword units. This article explores the possibility of using pause placement in writing process data to determine the likelihood that a multiword unit is processed as a whole in the mind. The data are texts produced by learners of English and corresponding keylog files from the Process Corpus of English in Education (PROCEED). N-grams are selected on the basis of the finished texts and retrieved from the keylogging data. The pause placement patterns of these n-grams are coded and serve as a basis to compute the Pause Placement and Processing (PPP) score. This score relies on the assumption that n-grams which are delineated but not interrupted by pauses (hence taking the form of ‘bursts of writing’) are more likely to be processed holistically. The PPP score points to structurally complete n-grams such as in fact and first of all as being more likely to be processed holistically than structurally incomplete n-grams such as that we and to the. While the results are plausible and can be further substantiated by characteristics of specific n-grams, it is acknowledged that additional effects might also be at work to explain the results obtained.

Keywords