Intelligence System for Multi-Language Recognition

Fawziya Ramo; Mohammed Kannah

doi:10.33899/edusj.2022.132223.1200

مجلة التربية والعلم (Mar 2022)

Intelligence System for Multi-Language Recognition

Fawziya Ramo,
Mohammed Kannah

Affiliations

Fawziya Ramo: Department of Computer Sciences, College of Computer Sciences and Mathematics, University of Mosul, Mosul, IRAQ
Mohammed Kannah: Department of Computer Science, Collage of Computer Science and Mathematics, University of Mosul, Mosul, Iraq

DOI: https://doi.org/10.33899/edusj.2022.132223.1200
Journal volume & issue: Vol. 31, no. 1
pp. 93 – 110

Abstract

Read online

Language classification systems are used to classify spoken language from a particular phoneme sample and are usually the first step of many spoken language processing tasks, such as automatic speech recognition (ASR) systems Without automatic language detection, spoken speech cannot be properly analyzed and grammar rules cannot be applied, causing failures Subsequent speech recognition steps. We propose a language classification system that solves the problem in the image field, rather than the sound field. This research identified and implemented several low-level features using Mel Frequency Cepstral Coefficients, which extract traits from speech files of four languages (Arabic, English, French, Kurdish) from the database (M2L_Dataset) as the data source used in this research. A Convolutional Neuron Network is used to operate on spectrogram images of the available audio snippets. In extensive experiments, we showed that our model is applicable to a range of noisy scenarios and can easily be extended to previously unknown languages, while maintaining classification accuracy. We released our own code and extensive training package for language classification systems for the community. CNN algorithm was applied in this research to classify and the result was perfect, as the classification accuracy reached 97% between two languages if the sample length was only one second, but if the sample length was two seconds, the classification accuracy reached 98%. While the classification among three languages, the classification accuracy reached 95% if the sample length was only one second, but if the sample length was two seconds, the classification accuracy reached 96%.

Published in مجلة التربية والعلم

ISSN: 1812-125X (Print); 2664-2530 (Online)
Publisher: College of Education for Pure Sciences
Country of publisher: Iraq
LCC subjects: Education; Science: Science (General)
Website: https://edusj.mosuljournals.com

About the journal

Abstract

Keywords