Generation of simulated data for Bengali text localization in natural images

Sourav Saha; Md. Easin Arafat; Md Aminul Haque Palash; Dewan Md Farid; M. Shamim Kaiser

Data in Brief (Dec 2023)

Generation of simulated data for Bengali text localization in natural images

Sourav Saha,
Md. Easin Arafat,
Md Aminul Haque Palash,
Dewan Md Farid,
M. Shamim Kaiser

Affiliations

Sourav Saha: Department of Computer Science and Engineering, United International University, United City, Madani Avenue, Badda, Dhaka 1212, Bangladesh
Md. Easin Arafat: Institute of Information Technology, Jahangirnagar University, Savar, Dhaka 1342, Bangladesh; Corresponding author at: Institute of Information Technology, Jahangirnagar University, Savar, Dhaka 1342, Bangladesh.
Md Aminul Haque Palash: Department of Research and Development, Pioneer Alpha, Dhaka 1205, Bangladesh; Department of Computer Science and Engineering, Chittagong University of Engineering & Technology, Rawjan, Chittagong 4349, Bangladesh
Dewan Md Farid: Department of Computer Science and Engineering, United International University, United City, Madani Avenue, Badda, Dhaka 1212, Bangladesh
M. Shamim Kaiser: Institute of Information Technology, Jahangirnagar University, Savar, Dhaka 1342, Bangladesh

Journal volume & issue: Vol. 51
p. 109568

Abstract

Read online

In the domain of vision-based applications, the importance of text cannot be underestimated due to its natural capacity to provide accurate and comprehensive information. The application of scene text editing systems enables the modification and enhancement of textual material included in natural images while maintaining the integrity of the overall visual layout. The complexity of keeping the original background context and font styles when altering, however, is an extremely difficult challenge considering the changed image must perfectly blend with the original without being altered. This article contains significant simulated data on the dynamic features of digital image editing, advertising, content development, and related fields. The system comprises key components such as 2D simulated text on the styled image (is), text image (it), masking of text (maskt), real background image (tb), real sample image (tf), text skeleton (tsk), and text styled image (tt). The source dataset contains diverse components such as background images, color variations, fonts, and text content, while the synthetic dataset consists of 49,000 randomly generated images. The dataset provides both researchers and practitioners with a rich resource for identifying and evaluating these dynamic features. The dataset is publicly accessible via the link: https://data.mendeley.com/datasets/h9kry9y46s/3

Published in Data in Brief

ISSN: 2352-3409 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Science (General)
Website: http://www.journals.elsevier.com/data-in-brief/

About the journal

Abstract

Keywords