Tulu Language Text Recognition and Translation

Prathwini; Anisha P. Rodrigues; P. Vijaya; Roshan Fernandes

doi:10.1109/ACCESS.2024.3355470

IEEE Access (Jan 2024)

Affiliations

Prathwini: Department of Master of Computer Applications, NMAM Institute of Technology, NITTE (Deemed to be University), Nitte, India
Anisha P. Rodrigues: Department of Computer Science and Engineering, NMAM Institute of Technology, NITTE (Deemed to be University), Nitte, India
P. Vijaya: Department of Mathematics and Computer Science, Modern College of Business and Sciences, Muscat, Oman
Roshan Fernandes: Department of Computer Science and Engineering, NMAM Institute of Technology, NITTE (Deemed to be University), Nitte, India

DOI: https://doi.org/10.1109/ACCESS.2024.3355470
Journal volume & issue: Vol. 12
pp. 12734 – 12744

Abstract

Language is a primary means of communication, though not the only one; knowing a language does, however, help speed up the process. Many distinct languages are spoken worldwide, and people use them to communicate. This is only one of the many reasons why language is so crucial. Based on the literature survey, it is evident that there is a lack of available translators for the Tulu language. Although Tulu is spoken predominantly in Karnataka, it has not been as widely spoken as other Indian languages until recently, even though it has gained enough recognition to become the second language in Karnataka. Our research aims to translate the English language into the Tulu language. During evaluation, the system was tested on a dataset of handwritten characters, and the Convolutional Neural Network used achieved an accuracy of 92%. To translate English into Tulu, we employed a parallel sentence dataset for the neural approach and a parallel word dataset for the rule-based approach. The rule-based approach achieved 89% accuracy for word-based analysis and 81% accuracy for sentence-based analysis in English-to-Tulu translation. A neural machine translation approach based on an Encoder-Decoder model with LSTM was used to translate from English to Tulu with a BLEU score of 0.83 and from Tulu to English with a BLEU score of 0.65. The model also employed hybrid machine translation to enhance the translation.
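For readers unfamiliar with the BLEU metric reported in the abstract, the following is a minimal sketch of a sentence-level BLEU computation (clipped n-gram precision up to 4-grams, combined by a geometric mean and multiplied by a brevity penalty). It is an illustration only: the abstract does not state which BLEU implementation, tokenization, or smoothing the authors used, and the function names `ngrams` and `sentence_bleu` as well as the English example sentences are hypothetical.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(reference, candidate, max_n=4):
    """Unsmoothed sentence-level BLEU against a single reference.

    Assumption: uniform weights over 1..max_n grams and no smoothing,
    so any n-gram order with zero matches yields a score of 0.0.
    """
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        ref_counts = ngrams(reference, n)
        total = sum(cand_counts.values())
        if total == 0:
            return 0.0  # candidate is shorter than n tokens
        # Clip each candidate n-gram count by its count in the reference.
        clipped = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        if clipped == 0:
            return 0.0
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    c, r = len(candidate), len(reference)
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)


# Hypothetical example sentences (not taken from the paper's dataset).
reference = "the cat sat on the mat".split()
candidate = "the cat sat on a mat".split()
score = sentence_bleu(reference, candidate)
```

A score of 1.0 means the candidate matches the reference exactly, so the reported values of 0.83 and 0.65 indicate closer agreement with the reference translations in the English-to-Tulu direction than in the reverse direction. Corpus-level BLEU, which is what papers usually report, aggregates n-gram counts over all sentences before computing the precisions rather than averaging per-sentence scores.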

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
