Russian Linguistic Bulletin (Sep 2016)
АВТОМАТИЗАЦИЯ ЛЕКСИЧЕСКОГО ПОИСКА В НАЦИОНАЛЬНОМ КОРПУСЕ ЧУВАШСКОГО ЯЗЫКА: МЕТОДЫ ИССЛЕДОВАНИЯ ПРОСТРАНСТВА ХУДОЖЕСТВЕННЫХ ТЕКСТОВ
Abstract
In this paper we consider automation of lexical search in national corpora of Chuvash language. Conceptual model of literary texts space is determined and there is a set of methods of literary texts analysis, which is based on it. Authors consider the following methods of exploration of literary texts space: method of text tokenization, method of text normalization, method of morphologic analysis, method of named entities recognition, method of text classification, method of texts search, method of determination of text’s topic.
Keywords