Part-of-Speech tagging enhancement to natural language processing for Thai wh-question classification with deep learning

Saranlita Chotirat; Phayung Meesad

Heliyon (Oct 2021)

Part-of-Speech tagging enhancement to natural language processing for Thai wh-question classification with deep learning

Saranlita Chotirat,
Phayung Meesad

Affiliations

Saranlita Chotirat: Department of Information Technology, Faculty of Information Technology and Digital Innovation, King Mongkut's University of Technology North Bangkok, Thailand
Phayung Meesad: Department of Information Technology Management, Faculty of Information Technology and Digital Innovation, King Mongkut's University of Technology North Bangkok, Thailand; Corresponding author.

Journal volume & issue: Vol. 7, no. 10
p. e08216

Abstract

Read online

Question classification is a crucial task for answer selection. Question classification could help define the structure of question sentences generated by features extraction from a sentence, such as who, when, where, and how. In this paper, we proposed a methodology to improve question classification from texts by using feature selection and word embedding techniques. We conducted several experiments to evaluate the performance of the proposed methodology using two different datasets (TREC-6 dataset and Thai sentence dataset) with term frequency and combined term frequency-inverse document frequency including Unigram, Unigram+Bigram, and Unigram + Trigram as features. Machine learning models based on traditional and deep learning classifiers were used. The traditional classification models were Multinomial Naïve Bayes, Logistic Regression, and Support Vector Machine. The deep learning techniques were Bidirectional Long Short-Term Memory (BiLSTM), Convolutional Neural Networks (CNN), and Hybrid model, which combined CNN and BiLSTM model. The experiment results showed that our methodology based on Part-of-Speech (POS) tagging was the best to improve question classification accuracy. The classifying question categories achieved with average micro F1-score of 0.98 when applied SVM model on adding all POS tags in the TREC-6 dataset. The highest average micro F1-score achieved 0.8 when applied GloVe by using CNN model on adding focusing tags in the Thai sentences dataset.

Published in Heliyon

ISSN: 2405-8440 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Science: Science (General); Social Sciences: Social sciences (General)
Website: https://www.cell.com/heliyon/home

About the journal

Abstract

Keywords