Enhancing the Performance of Telugu Named Entity Recognition Using Gazetteer Features

SaiKiranmai Gorla; Lalita Bhanu Murthy   Neti; Aruna Malapati

doi:10.3390/info11020082

Information (Feb 2020)

Enhancing the Performance of Telugu Named Entity Recognition Using Gazetteer Features

SaiKiranmai Gorla,
Lalita Bhanu Murthy Neti,
Aruna Malapati

Affiliations

SaiKiranmai Gorla: Department of Computer Science and Information Systems, Birla Institute of Technology and Science Pilani, Hyderabad Campus, Telangana 500078, India
Lalita Bhanu Murthy Neti: Department of Computer Science and Information Systems, Birla Institute of Technology and Science Pilani, Hyderabad Campus, Telangana 500078, India
Aruna Malapati: Department of Computer Science and Information Systems, Birla Institute of Technology and Science Pilani, Hyderabad Campus, Telangana 500078, India

DOI: https://doi.org/10.3390/info11020082
Journal volume & issue: Vol. 11, no. 2
p. 82

Abstract

Read online

Named entity recognition (NER) is a fundamental step for many natural language processing tasks and hence enhancing the performance of NER models is always appreciated. With limited resources being available, NER for South-East Asian languages like Telugu is quite a challenging problem. This paper attempts to improve the NER performance for Telugu using gazetteer-related features, which are automatically generated using Wikipedia pages. We make use of these gazetteer features along with other well-known features like contextual, word-level, and corpus features to build NER models. NER models are developed using three well-known classifiers—conditional random field (CRF), support vector machine (SVM), and margin infused relaxed algorithms (MIRA). The gazetteer features are shown to improve the performance, and theMIRA-based NER model fared better than its counterparts SVM and CRF.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords