Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition

Zhong Zheng; Zhong Zheng; Keyi Li; Gang Feng; Yang Guo; Yinan Li; Yinan Li; Lili Xiao; Lili Xiao; Chengqi Liu; Chengqi Liu; Shouhuan He; Zhen Zhang; Zhen Zhang; Di Qian; Yanmei Feng; Yanmei Feng

doi:10.3389/fnins.2021.744959

Frontiers in Neuroscience (Dec 2021)

Relative Weights of Temporal Envelope Cues in Different Frequency Regions for Mandarin Vowel, Consonant, and Lexical Tone Recognition

Zhong Zheng,
Zhong Zheng,
Keyi Li,
Gang Feng,
Yang Guo,
Yinan Li,
Yinan Li,
Lili Xiao,
Lili Xiao,
Chengqi Liu,
Chengqi Liu,
Shouhuan He,
Zhen Zhang,
Zhen Zhang,
Di Qian,
Yanmei Feng,
Yanmei Feng

Affiliations

Zhong Zheng: Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China
Zhong Zheng: Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
Keyi Li: Sydney Institute of Language and Commerce, Shanghai University, Shanghai, China
Gang Feng: Department of Graduate, The First Affiliated Hospital of Jinzhou Medical University, Jinzhou, China
Yang Guo: Ear, Nose, and Throat Institute and Otorhinolaryngology Department, Eye and ENT Hospital of Fudan University, Shanghai, China
Yinan Li: Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China
Yinan Li: Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
Lili Xiao: Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China
Lili Xiao: Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
Chengqi Liu: Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China
Chengqi Liu: Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
Shouhuan He: Department of Otolaryngology, Qingpu Branch of Zhongshan Hospital Affiliated to Fudan University, Shanghai, China
Zhen Zhang: Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China
Zhen Zhang: Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China
Di Qian: Department of Otolaryngology, Shenzhen Longhua District People’s Hospital, Shenzhen, China
Yanmei Feng: Department of Otolaryngology-Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China
Yanmei Feng: Shanghai Key Laboratory of Sleep Disordered Breathing, Shanghai, China

DOI: https://doi.org/10.3389/fnins.2021.744959
Journal volume & issue: Vol. 15

Abstract

Read online

Objectives: Mandarin-speaking users of cochlear implants (CI) perform poorer than their English counterpart. This may be because present CI speech coding schemes are largely based on English. This study aims to evaluate the relative contributions of temporal envelope (E) cues to Mandarin phoneme (including vowel, and consonant) and lexical tone recognition to provide information for speech coding schemes specific to Mandarin.Design: Eleven normal hearing subjects were studied using acoustic temporal E cues that were extracted from 30 continuous frequency bands between 80 and 7,562 Hz using the Hilbert transform and divided into five frequency regions. Percent-correct recognition scores were obtained with acoustic E cues presented in three, four, and five frequency regions and their relative weights calculated using the least-square approach.Results: For stimuli with three, four, and five frequency regions, percent-correct scores for vowel recognition using E cues were 50.43–84.82%, 76.27–95.24%, and 96.58%, respectively; for consonant recognition 35.49–63.77%, 67.75–78.87%, and 87.87%; for lexical tone recognition 60.80–97.15%, 73.16–96.87%, and 96.73%. For frequency region 1 to frequency region 5, the mean weights in vowel recognition were 0.17, 0.31, 0.22, 0.18, and 0.12, respectively; in consonant recognition 0.10, 0.16, 0.18, 0.23, and 0.33; in lexical tone recognition 0.38, 0.18, 0.14, 0.16, and 0.14.Conclusion: Regions that contributed most for vowel recognition was Region 2 (502–1,022 Hz) that contains first formant (F1) information; Region 5 (3,856–7,562 Hz) contributed most to consonant recognition; Region 1 (80–502 Hz) that contains fundamental frequency (F0) information contributed most to lexical tone recognition.

Published in Frontiers in Neuroscience

ISSN: 1662-4548 (Print); 1662-453X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/neuroscience

About the journal

Abstract

Keywords