Disfluency Assessment Using Deep Super Learners

Sheena Christabel Pravin; Susan Elias; Vishal Balaji Sivaraman; G. Rohith; Y. Asnath Victy Phamila

doi:10.1109/ACCESS.2024.3356350

IEEE Access (Jan 2024)

Disfluency Assessment Using Deep Super Learners

Sheena Christabel Pravin,
Susan Elias,
Vishal Balaji Sivaraman,
G. Rohith,
Y. Asnath Victy Phamila

Affiliations

Sheena Christabel Pravin: ORCiD; School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
Susan Elias: ORCiD; School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
Vishal Balaji Sivaraman: Electrical and Computer Engineering, University of Florida, Gainesville, FL, USA
G. Rohith: School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
Y. Asnath Victy Phamila: ORCiD; School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India

DOI: https://doi.org/10.1109/ACCESS.2024.3356350
Journal volume & issue: Vol. 12
pp. 24079 – 24089

Abstract

Read online

The use of machine learning algorithms for the assessment of speech fluency is increasingly becoming recognized globally due to their ability to quickly identify speech impairments. This approach is preferred over manual diagnosis, as it reduces the likelihood of human error and minimizes the delay in commencing the therapy. A pipelined deep learner-dual classifier (PDL-DC) is proposed for the automated detection of speech impairment. The assessment of individuals’ speech fluency consisted of two distinct phases: the classification of speech disfluencies and the categorization of fluency disorders. Speech disfluencies, including revisions, prolongations, whole-word repetitions, word-medial repetitions, and filled pauses, were categorized into distinct groupings. The second aspect of classification pertains to the assessment of fluency levels, wherein speakers are classified into three categories: healthy individuals, individuals with stuttering, and individuals with Specific Language Impairment (SLI). The proposed model’s implementation of a pipelined design enables the dual validation of a subject’s fluency. The proposed model demonstrates an average classification accuracy, precision, and recall of 97%.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords