Applied Sciences (Jun 2024)

Transferring Sentiment Cross-Lingually within and across Same-Family Languages

  • Gaurish Thakkar,
  • Nives Mikelić Preradović,
  • Marko Tadić

DOI
https://doi.org/10.3390/app14135652
Journal volume & issue
Vol. 14, no. 13
p. 5652

Abstract

Read online

Natural language processing for languages with limited resources is hampered by a lack of data. Using English as a hub language for such languages, cross-lingual sentiment analysis has been developed. The sheer quantity of English language resources raises questions about its status as the primary resource. This research aims to examine the impact on sentiment analysis of adding data from same-family versus distant-family languages. We analyze the performance using low-resource and high-resource data from the same language family (Slavic), investigate the effect of using a distant-family language (English) and report the results for both settings. Quantitative experiments using multi-task learning demonstrate that adding a large quantity of data from related and distant-family languages is advantageous for cross-lingual sentiment transfer.

Keywords