Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model

Si Li; Jianbo Zhao; Guirong Shi; Yuanpeng Tan; Huifang Xu; Guang Chen; Haibo Lan; Zhiqing Lin

doi:10.1109/ACCESS.2019.2917631

IEEE Access (Jan 2019)

Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model

Si Li,
Jianbo Zhao,
Guirong Shi,
Yuanpeng Tan,
Huifang Xu,
Guang Chen,
Haibo Lan,
Zhiqing Lin

Affiliations

Si Li: School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China
Jianbo Zhao: ORCiD; School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China
Guirong Shi: State Grid Jibei Electric Power Company Limited, Beijing, China
Yuanpeng Tan: China Electric Power Research Institute, Beijing, China
Huifang Xu: China Electric Power Research Institute, Beijing, China
Guang Chen: School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China
Haibo Lan: State Grid Jibei Electric Power Company Limited, Beijing, China
Zhiqing Lin: School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China

DOI: https://doi.org/10.1109/ACCESS.2019.2917631
Journal volume & issue: Vol. 7
pp. 72905 – 72913

Abstract

Read online

Chinese grammatical error correction (CGEC) is practically useful for learners of Chinese as a second language, but it is a rather challenging task due to the complex and flexible nature of Chinese language so that existing methods for English cannot be directly applied. In this paper, we introduce a convolutional sequence to sequence model into the CGEC task for the first time, since many Chinese grammatical errors are concentrated between three and four words and convolutional neural network can better capture the local context. A convolution-based model can obtain the representations of the context by fixed size kernel. By stacking convolution layers, long-term dependences can be obtained. We also propose two optimization methods, shared embedding and policy gradient, to optimize the convolutional sequence to sequence model through sharing parameters and reconstructing loss function. Besides, we collate the existing Chinese grammatical correction corpus in detail. The results show that the models we proposed two different optimization methods both achieve large improvement compared with the natural machine translation model based on a recurrent neural network.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords