Investigating translation for Indic languages with BLOOMZ-3b through prompting and LoRA fine-tuning

Aarathi Rajagopalan Nair; Deepa Gupta; B. Premjith

doi:10.1038/s41598-024-74617-9

Scientific Reports (Oct 2024)

Investigating translation for Indic languages with BLOOMZ-3b through prompting and LoRA fine-tuning

Aarathi Rajagopalan Nair,
Deepa Gupta,
B. Premjith

Affiliations

Aarathi Rajagopalan Nair: Department of Computer Science and Engineering, Amrita School of Computing, Amrita Vishwa Vidyapeetham
Deepa Gupta: Department of Computer Science and Engineering, Amrita School of Computing, Amrita Vishwa Vidyapeetham
B. Premjith: Amrita School of Artificial Intelligence, Amrita Vishwa Vidyapeetham

DOI: https://doi.org/10.1038/s41598-024-74617-9
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 22

Abstract

Read online

Abstract In the domain of natural language processing, the rise of Large Language Models and Generative AI represents a noteworthy transition, enabling machines to understand and generate text resembling that produced by humans. This research conducts a thorough examination of this transformative technology, with a focus on its influence on machine translation. The study explores the translation landscape between English and Indic languages, which include Hindi, Kannada, Malayalam, Tamil, and Telugu. To address this, the Large Language Model, BLOOMZ-3b, is utilized, which has been primarily developed for a text generation task. Multiple prompting engineering techniques for machine translation are prominently explored. The study further traverse fine-tuning the BLOOMZ-3b model using a Parameter Efficient Fine-Tuning technique called Low Rank Adaptation, aiming to reduce computational complexity. Hence, by combining innovative prompting approaches using BLOOMZ-3b model and fine-tuning the model, it contributes to continuous development of machine translation technologies beyond traditional borders of what can be done with respect to language processing. In this regard, not only does this research shed light on the intricacy of translation problems but it also sets a precedence for optimizing or adapting big language models to various languages which end up advancing Artificial Intelligence and Natural Language Processing at large.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords