IEEE Access (Jan 2024)

Tulu Language Text Recognition and Translation

  • Prathwini,
  • Anisha P. Rodrigues,
  • P. Vijaya,
  • Roshan Fernandes

DOI
https://doi.org/10.1109/ACCESS.2024.3355470
Journal volume & issue
Vol. 12
pp. 12734 – 12744

Abstract

Read online

Language is a primary means of communication, but it is not the only means; knowing a language does, however, assist speed up the process. Many distinct languages are spoken worldwide, and people use them to communicate. This is only one of the many reasons why language is so crucial. Based on the literature survey, it is evident that there is a lack of available translators for the Tulu language. Despite being prevalent predominantly in Karnataka, the Tulu language has not been as widely spoken as other Indian languages until recently, although it gained enough recognition to become the second language in Karnataka. The purpose of our research work aims at translating the English language into the Tulu language. During the evaluation the system was tested on a dataset consisting of handwritten characters during the evaluation process Convolutional Neural Networks used achieved an accuracy rate of 92%. To translate English to the Tulu language, we employed a parallel sentence dataset for the neural approach and a parallel word dataset for the rule-based approach. The rule-based approach resulted in an 89% accuracy rate for word-based analysis and an 81% accuracy rate for sentence-based analysis for the English-to-Tulu language translation. The neural machine translation approach of the Encoder-Decoder model with LSTM is been used to accomplish translation from English to Tulu with a BLEU score of 0.83 and Tulu to English with a BLUE score of 0.65. The model also employed hybrid machine translation to enhance the translation.

Keywords