IEEE Access (Jan 2023)

Efficient Assamese Word Recognition for Societal Empowerment: A Comparative Feature-Based Analysis

  • Naiwrita Borah,
  • Udayan Baruah,
  • Mahesh Thylore Ramakrishna,
  • V. Vinoth Kumar,
  • D. Ramya Dorai,
  • Jonnakuti Rajkumar Annad

DOI
https://doi.org/10.1109/ACCESS.2023.3301564
Journal volume & issue
Vol. 11
pp. 82302 – 82326

Abstract

The preservation and digitization of historical data are crucial for ensuring the continuity and accessibility of information over successive generations. The present study investigates the use of machine learning methods for recognizing Assamese words, focusing specifically on their distinctive visual characteristics. The main aim is to improve word recognition technologies for Indic languages, Assamese in particular, in order to preserve Assamese literature and keep it accessible to future generations. The classification procedure examines 19 shape-related features using a range of machine learning algorithms, such as Logistic Regression, Decision Trees, Random Forest, Support Vector Machine (SVM) with various kernels, K-Nearest Neighbors, and Gradient Boosting. The models are assessed with metrics such as Accuracy, Precision, Kappa, and F1-score, together with Model Build Time and Model Run Time to evaluate computational efficiency. The Receiver Operating Characteristic (ROC) curve and the Area Under the Curve (AUC) are also considered in the evaluation. Out of the four datasets analyzed, Dataset 3 yields the highest performance. Among the conventional machine learning approaches, Gradient Boosting achieves the highest accuracy, reaching 96.03%. Logistic Regression and SVM with an RBF kernel follow closely, with accuracies of 95.64% and 95.60% respectively. The study also employs a multi-layer Convolutional Neural Network (CNN), which reaches a recognition accuracy of 97.3%. This finding indicates that the proposed feature set performs close to the CNN model in terms of the evaluation metrics.
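
For readers unfamiliar with the metrics named in the abstract, the following is a minimal sketch of how Accuracy, Precision, F1-score, and Cohen's Kappa can be computed from true and predicted word labels. It is not the authors' code: the function name `evaluate`, the macro-averaging of Precision and F1 across classes, and the toy labels are all assumptions made for illustration, since the abstract does not state how the per-class scores were aggregated.

```python
from collections import Counter


def evaluate(y_true, y_pred):
    """Return accuracy, macro precision, macro F1, and Cohen's kappa.

    Hypothetical helper for illustration; macro-averaging is an assumption.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    n = len(y_true)
    labels = sorted(set(y_true) | set(y_pred))
    pairs = Counter(zip(y_true, y_pred))   # (true, predicted) -> count
    true_counts = Counter(y_true)          # samples per true class
    pred_counts = Counter(y_pred)          # samples per predicted class

    # Accuracy: fraction of samples on the confusion-matrix diagonal.
    accuracy = sum(pairs[(c, c)] for c in labels) / n

    precisions, f1_scores = [], []
    for c in labels:
        tp = pairs[(c, c)]
        precision = tp / pred_counts[c] if pred_counts[c] else 0.0
        recall = tp / true_counts[c] if true_counts[c] else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        f1_scores.append(f1)

    # Cohen's kappa: observed agreement corrected for chance agreement.
    expected = sum(true_counts[c] * pred_counts[c] for c in labels) / (n * n)
    # Kappa is undefined when chance agreement is 1; this sketch reports 0.0.
    kappa = (accuracy - expected) / (1 - expected) if expected < 1 else 0.0

    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / len(labels),
        "f1": sum(f1_scores) / len(labels),
        "kappa": kappa,
    }


# Toy example with two hypothetical word classes.
scores = evaluate(["word_a", "word_a", "word_b", "word_b"],
                  ["word_a", "word_b", "word_b", "word_b"])
print(scores)
```

Kappa is useful alongside accuracy because it discounts the agreement a classifier would reach by chance, which matters when word classes are unevenly represented.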

Keywords