Tokenization and Memory Optimization for Reducing GPU Load in NLP Deep Learning Models

Dejan Dodić; Dušan Regodić

doi:10.17559/TV-20231218001216

Tehnički Vjesnik (Jan 2024)

Tokenization and Memory Optimization for Reducing GPU Load in NLP Deep Learning Models

Dejan Dodić,
Dušan Regodić

Affiliations

Dejan Dodić: The Academy of Applied Technical and Preschool Studies, Department of Information - communication technologies, Beogradska 18, Niš, Serbia
Dušan Regodić: MB University, Faculty of Business and Law, Department of Advanced information technologies, Teodora Drajzera 27, Belgrade, Serbia

DOI: https://doi.org/10.17559/TV-20231218001216
Journal volume & issue: Vol. 31, no. 6
pp. 1995 – 2002

Abstract

Read online

In the current landscape of advanced natural language processing (NLP), managing GPU memory effectively is crucial. This paper delves into new tokenization methods and data handling to enhance NLP model efficiency, focusing on avoiding "CUDA out of memory" errors. It examines how sophisticated tokenization and managing text lengths in large datasets can boost model performance. These insights are vital for optimizing resources and scaling NLP models, especially with limited GPU memory. The paper also contextualizes NLP challenges, underlining the significance of memory optimization amidst growing language model complexities. It reviews key NLP technologies, including transformer models, and addresses their memory optimization challenges. Moreover, it underscores the paper's role in developing innovative techniques for more effective memory optimization, linking it to ongoing research and trends in NLP. This work aims to progress natural language processing methods and make AI technologies more accessible.

Published in Tehnički Vjesnik

ISSN: 1330-3651 (Print); 1848-6339 (Online)
Publisher: Faculty of Mechanical Engineering in Slavonski Brod, Faculty of Electrical Engineering in Osijek, Faculty of Civil Engineering in Osijek
Country of publisher: Croatia
LCC subjects: Technology: Engineering (General). Civil engineering (General)
Website: http://hrcak.srce.hr/tehnicki-vjesnik

About the journal

Abstract

Keywords