Cross-Lingual Short-Text Semantic Similarity for Kannada–English Language Pair

Muralikrishna S N; Raghurama Holla; Harivinod N; Raghavendra Ganiga

doi:10.3390/computers13090236

Computers (Sep 2024)

Cross-Lingual Short-Text Semantic Similarity for Kannada–English Language Pair

Muralikrishna S N,
Raghurama Holla,
Harivinod N,
Raghavendra Ganiga

Affiliations

Muralikrishna S N: Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India
Raghurama Holla: Department of Data Science and Computer Applications, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India
Harivinod N: Department of Computer Science and Engineering, St Joseph Engineering College, Mangaluru 575028, India
Raghavendra Ganiga: Department of Information & Communication Technology, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India

DOI: https://doi.org/10.3390/computers13090236
Journal volume & issue: Vol. 13, no. 9
p. 236

Abstract

Read online

Analyzing the semantic similarity of cross-lingual texts is a crucial part of natural language processing (NLP). The computation of semantic similarity is essential for a variety of tasks such as evaluating machine translation systems, quality checking human translation, information retrieval, plagiarism checks, etc. In this paper, we propose a method for measuring the semantic similarity of Kannada–English sentence pairs that uses embedding space alignment, lexical decomposition, word order, and a convolutional neural network. The proposed method achieves a maximum correlation of 83% with human annotations. Experiments on semantic matching and retrieval tasks resulted in promising results in terms of precision and recall.

Published in Computers

ISSN: 2073-431X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/computers

About the journal

Abstract

Keywords