Восточная Азия: факты и аналитика (Mar 2020)

Language policy and language resources on the Chinese Internet

  • Zavyalova O.I.

DOI
https://doi.org/10.24411/2686-7702-2020-10002
Journal volume & issue
no. 2020/1

Abstract

Language policy and planning in the PRC are presented on various official central and local websites. Annual statistical analysis of the Chinese words and characters used online is among the tasks that the authorities have set for linguists. Technologies that identify word boundaries within character texts and tag each word with its part of speech in Chinese, an isolating syllabic language, are applied in the creation of numerous text corpora, many of which are accessible on the Web. AI-based speech recognition and speech synthesis technologies are used in machine translation, on official websites for learners of Standard Mandarin, and in spoken corpora. Among the latter are dialect corpora created in Taiwan and Hong Kong and available online.
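As an illustration of the word recognition and part-of-speech marking mentioned above, the following is a minimal sketch of forward maximum matching, a simple dictionary-based segmentation technique, together with a character frequency count. It is not the method used by any corpus the article describes. The lexicon, its tags, and the function names are hypothetical and chosen only for this example.

```python
from collections import Counter

# Hypothetical toy lexicon mapping words to part-of-speech tags (an assumption,
# not taken from the article or from any real corpus).
TOY_LEXICON = {"中国": "NOUN", "语言": "NOUN", "政策": "NOUN", "研究": "VERB"}


def segment(text, lexicon, max_len=4):
    """Split unspaced text into words by forward maximum matching."""
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink toward one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in lexicon:
                words.append(candidate)
                i += size
                break
    return words


def tag(words, lexicon):
    """Attach a part-of-speech tag to each word, or "UNK" if it is not listed."""
    return [(word, lexicon.get(word, "UNK")) for word in words]


def char_frequencies(text):
    """Count how often each non-whitespace character occurs."""
    return Counter(ch for ch in text if not ch.isspace())


words = segment("研究中国语言政策", TOY_LEXICON)
tagged = tag(words, TOY_LEXICON)
freqs = char_frequencies("中国中文")
```

Greedy matching of this kind fails on ambiguous character sequences, which is one reason practical segmenters generally rely on statistical or neural models instead.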

Keywords