Proceedings of the XXth Conference of Open Innovations Association FRUCT (Apr 2020)

Russian Pragmatic Markers Database: Developing Speech Technologies for Everyday Spoken Discourse

  • Natalia Bogdanova-Beglarian,
  • Olga Blinova,
  • Tatiana Sherstinova,
  • Ekaterina Troshchenkova

DOI
https://doi.org/10.23919/FRUCT48808.2020.9087473
Journal volume & issue
Vol. 26, no. 1
pp. 60 – 66

Abstract

Read online

The paper presents recent results obtained within the ongoing project dedicated to the study of Russian pragmatic markers. Pragmatic markers are obligatory elements of natural speech in any language; moreover, they are considered to be functionally important for speech production and overcoming inevitable speech difficulties. A correct understanding of use and functions of pragmatic markers is a prerequisite for solution of many applied tasks related to speech technologies. The research is carried out on the data of two speech corpora ORD corpus of Russian Everyday Speech known as One Day of Speech corpus and SAT corpus Balanced Annotated Collection of Texts, which consists primarily of monologues. The article describes the database of Russian pragmatic markers designed to support both linguistic and pragmatic studies of spoken Russian and the development of speech technologies for everyday discourse. Besides, it presents actual statistical data on pragmatic markers distribution in natural speech depending on different factors.

Keywords