Kurdish standard EMNIST-like character dataset

Hamsa D. Majeed; Goran Saman Nariman; Renas Sardar Azeez; Bawar Bilal Abdulqadir

Data in Brief (Feb 2024)

Kurdish standard EMNIST-like character dataset

Hamsa D. Majeed,
Goran Saman Nariman,
Renas Sardar Azeez,
Bawar Bilal Abdulqadir

Affiliations

Hamsa D. Majeed: Corresponding author.; Department of Information Technology, College of Science and Technology, University of Human Development, Kurdistan Region, Iraq
Goran Saman Nariman: Department of Information Technology, College of Science and Technology, University of Human Development, Kurdistan Region, Iraq
Renas Sardar Azeez: Department of Information Technology, College of Science and Technology, University of Human Development, Kurdistan Region, Iraq
Bawar Bilal Abdulqadir: Department of Information Technology, College of Science and Technology, University of Human Development, Kurdistan Region, Iraq

Journal volume & issue: Vol. 52
p. 110038

Abstract

Read online

A dataset was created by collecting handwritten samples of distinct Kurdish characters. The dataset consists primarily of 58 characters, and approximately 3800 adult volunteers who are native Kurdish speakers participated in the collection process. Each participant was requested to fill two rows in a character form printed on A4 landscape papers. These papers were divided into sets of four pages, with 18 columns and 10 rows of characters on each page, except for the fourth page in each set, which had 40 cells. To ensure a comprehensive dataset, over 760 sets were prepared and distributed across various universities and institutions. The collected samples underwent scanning, cropping, and preprocessing procedures following the characteristics established by the EMNIST project. The purpose of these procedures was to standardize the dataset and ensure uniformity in the representation of all characters.

Published in Data in Brief

ISSN: 2352-3409 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Science (General)
Website: http://www.journals.elsevier.com/data-in-brief/

About the journal

Abstract

Keywords