Современные информационные технологии и IT-образование (Dec 2018)

CLOUD SERVICES FOR NATURAL LANGUAGE PROCESSING

  • Ravil I. Mukhamediev,
  • Adilkhan Symagulov,
  • Yan I. Kuchin,
  • Sabina Abdullayeva,
  • Farida N. Abdoldina

DOI
https://doi.org/10.25559/SITITO.14.201804.872-880
Journal volume & issue
Vol. 14, no. 4
pp. 872 – 880

Abstract

Read online

The paper presents the results of experiments conducted with the aim of a comparative analysis of the performance of the existing cloud services for natural language processing in Russian. The article provides an overview of 10 cloud services: TextRazor, RosetteTextAnalytics, EurekaEngine, CloudNaturalLanguage, Texterra, Pullenti, NER-ru, UDPipe, AOT, DeepPavlov. Quantitative studies of their performance were made for 6 of them. In the process of evaluating services, the execution of such functions as the part of speech tagging, sentiment analysis, named entity recognition and the categorization of texts were analyzed. For a comparative assessment of the quality of the services, the following competition materials were used: factRuEval-2016 (named entities), AlemResearch (sentiment) and the corpora, Taiga and OpenCorpora (part of speech). The named entities recognition quality was evaluated by calculating Accuracy, Precision, Recall, and F1 parameters. As a result of the study, it was shown that when solving natural language text processing tasks in Russian, the best result is shown by the EurekaEngine service for recognizing named entities and sentiment analysis of the text, RosetteTextAnalytics service proved best in part of speech tagging the and TextRazor service in text categorization.

Keywords