IEEE Access (Jan 2024)

Arabic Speech Recognition: Advancement and Challenges

  • Ashifur Rahman,
  • Md. Mohsin Kabir,
  • M. F. Mridha,
  • Mohammed Alatiyyah,
  • Haifa F. Alhasson,
  • Shuaa S. Alharbi

DOI
https://doi.org/10.1109/ACCESS.2024.3376237
Journal volume & issue
Vol. 12
pp. 39689 – 39716

Abstract

Read online

Speech recognition is a captivating process that revolutionizes human-computer interactions, allowing us to interact and control machines through spoken commands. The foundation of speech recognition lies in understanding a given language’s linguistic and textual characteristics. Although automatic speech recognition (ASR) systems flawlessly convert speech into text for various international languages, their implementation for Arabic remains inadequate. In this research, we diligently explore the current state of Arabic ASR systems and unveil the challenges encountered during their development. We categorize these challenges into two groups: those specific to the Arabic language and those more general. We propose strategies to overcome these obstacles and emphasize the need for ASR architectures tailored to the Arabic language’s unique grammatical and phonetic structure. In addition, we provide a comprehensive and explicit description of various feature extraction methods, language models, and acoustic models utilized in the Arabic ASR system.

Keywords