Referential choice: Predictability and its limits

Andrej A Kibrik; Andrej A Kibrik; Mariya V. Khudyakova; Grigory B. Dobrov; Anastasia Linnik; Dmitrij A. Zalmanov

doi:10.3389/fpsyg.2016.01429

Frontiers in Psychology (Sep 2016)

Referential choice: Predictability and its limits

Andrej A Kibrik,
Andrej A Kibrik,
Mariya V. Khudyakova,
Grigory B. Dobrov,
Anastasia Linnik,
Dmitrij A. Zalmanov

Affiliations

Andrej A Kibrik: Russian Academy of Sciences
Andrej A Kibrik: Moscow State University
Mariya V. Khudyakova: National Research University Higher School of Economics
Grigory B. Dobrov: Consultant Plus
Anastasia Linnik: University of Potsdam
Dmitrij A. Zalmanov: Moscow State University

DOI: https://doi.org/10.3389/fpsyg.2016.01429
Journal volume & issue: Vol. 7

Abstract

Read online

We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, corpus analysis, machine learning methods and experimentation with human participants. Machine learning algorithms make use of 25 factors, including referent’s properties (such as animacy and protagonism), the distance between a referential expression and its antecedent, the antecedent’s syntactic role, and so on. Having found the predictions of our algorithm to coincide with the original almost 90% of the time, we hypothesized that fully accurate prediction is not possible because, in many situations, more than one referential option is available. This hypothesis was supported by an experimental study, in which participants answered questions about either the original text in the corpus, or about a text modified in accordance with the algorithm’s prediction. Proportions of correct answers to these questions, as well as participants’ rating of the questions’ difficulty, suggested that divergences between the algorithm’s prediction and the original referential device in the corpus occur overwhelmingly in situations where the referential choice is not categorical.

Published in Frontiers in Psychology

ISSN: 1664-1078 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Philosophy. Psychology. Religion: Psychology
Website: https://www.frontiersin.org/journals/psychology

About the journal

Abstract

Keywords