Testing a computational model of causative overgeneralizations: Child judgment and production data from English, Hebrew, Hindi, Japanese and K’iche’ [version 2; peer review: 2 approved, 1 approved with reservations]

Stewart McCauley; Seth Campbell; Dipti Misra Sharma; Ruth Berman; Kumiko Fukumura; Rukmini Bhaya Nair; Margarita Julajuj Mendoza; Ben Ambridge; Laura Doherty; Tomoko Tatsumi; Ramya Maitreyee; Pedro Mateo Pedro; Shira Zicherman; Amy Bidgood; Ayuno Kawakami; Bhuvana Narasimhan; Clifton Pye; Dani Bekman; Inbal Arnon; Sindy Fabiola Can Pixabaj; Amir Efrati; Soumitra Samanta; Mario Marroquín Pelíz

Open Research Europe (Jan 2022)

Affiliations

Stewart McCauley: University of Iowa, Iowa City, Iowa, USA
Seth Campbell: University of Calgary, Calgary, Canada
Dipti Misra Sharma: Indian Institute of Information Technology, Hyderabad, India
Ruth Berman: Tel Aviv University, Tel Aviv, Israel
Kumiko Fukumura: University of Stirling, Stirling, UK
Rukmini Bhaya Nair: Indian Institute of Technology, Delhi, India
Margarita Julajuj Mendoza: Universidad del Valle de Guatemala, Guatemala City, Guatemala
Ben Ambridge: University of Liverpool, Liverpool, UK
Laura Doherty: University of Liverpool, Liverpool, UK
Tomoko Tatsumi: Kobe University, Kobe, Japan
Ramya Maitreyee: University of Liverpool, Liverpool, UK
Pedro Mateo Pedro: Universidad del Valle de Guatemala, Guatemala City, Guatemala
Shira Zicherman: Hebrew University of Jerusalem, Jerusalem, Israel
Amy Bidgood: University of Salford, Salford, UK
Ayuno Kawakami: University of Liverpool, Liverpool, UK
Bhuvana Narasimhan: University of Colorado Boulder, Boulder, Colorado, USA
Clifton Pye: University of Kansas, Lawrence, Kansas, USA
Dani Bekman: Hebrew University of Jerusalem, Jerusalem, Israel
Inbal Arnon: Hebrew University of Jerusalem, Jerusalem, Israel
Sindy Fabiola Can Pixabaj: Universidad del Valle de Guatemala, Guatemala City, Guatemala
Amir Efrati: Hebrew University of Jerusalem, Jerusalem, Israel
Soumitra Samanta: University of Liverpool, Liverpool, UK
Mario Marroquín Pelíz: Universidad del Valle de Guatemala, Guatemala City, Guatemala

Journal volume & issue: Vol. 1

Abstract

How do language learners avoid the production of verb argument structure overgeneralization errors (e.g., *The clown laughed the man; cf. The clown made the man laugh), while retaining the ability to apply such generalizations productively when appropriate? This question has long been seen as one that is both particularly central to acquisition research and particularly challenging. Focussing on causative overgeneralization errors of this type, a previous study reported a computational model that learns, on the basis of corpus data and human-derived verb-semantic-feature ratings, to predict adults’ by-verb preferences for less- versus more-transparent causative forms (e.g., *The clown laughed the man vs. The clown made the man laugh) across English, Hebrew, Hindi, Japanese and K’iche’ Mayan. Here, we tested the ability of this model (and an expanded version with multiple hidden layers) to explain binary grammaticality judgment data from children aged 4;0-5;0, and elicited-production data from children aged 4;0-5;0 and 5;6-6;6 (N=48 per language). In general, the model successfully simulated both children’s judgment and production data, with correlations of r=0.5-0.6 and r=0.75-0.85, respectively, and also generalized to unseen verbs. Importantly, learners of all five languages showed some evidence of making, in both judgments and production, the types of overgeneralization errors previously observed in naturalistic studies of English (e.g., *I’m dancing it). Together with previous findings, the present study demonstrates that a simple learning model can explain (a) adults’ continuous judgment data, (b) children’s binary judgment data and (c) children’s production data (with no training on these datasets), and therefore constitutes a plausible mechanistic account of the acquisition of verbs’ argument structure restrictions.

Published in Open Research Europe

ISSN: 2732-5121 (Online)
Publisher: F1000 Research Ltd
Country of publisher: United Kingdom
LCC subjects: Social Sciences
Website: https://open-research-europe.ec.europa.eu/
