Adaptive Speech Streaming Based on Speech Quality Estimation and Artificial Bandwidth Extension for Voice over Wireless Multimedia Sensor Networks

Jin Ah Kang; Nam In Park; Hong Kook Kim; Seong Ro Lee

doi:10.1155/2015/395752

International Journal of Distributed Sensor Networks (Jun 2015)

Adaptive Speech Streaming Based on Speech Quality Estimation and Artificial Bandwidth Extension for Voice over Wireless Multimedia Sensor Networks

Jin Ah Kang,
Nam In Park,
Hong Kook Kim,
Seong Ro Lee

Affiliations

Jin Ah Kang: Smart-Work Research Section, Wired & Wireless Convergence Research Department, Electronics and Telecommunications Research Institute (ETRI), 218 Gajeong-ro, Yuseong-gu, Daejeon 305-700, Republic of Korea
Nam In Park: Digital Technology and Biometry Division, National Forensic Service (NFS), 10 Ipchun-ro, Wonju 220-170, Republic of Korea
Hong Kook Kim: School of Information and Communications, Gwangju Institute of Science and Technology (GIST), 123 Cheomdangwagi-ro, Buk-gu, Gwangju 500-712, Republic of Korea
Seong Ro Lee: Department of Information & Electronics Engineering, Mokpo National University, 1666 Youngsan-ro, Cheonggye-myeon, Muan-gun, Jeonnam 534-729, Republic of Korea

DOI: https://doi.org/10.1155/2015/395752
Journal volume & issue: Vol. 11

Abstract

Read online

In this paper, an adaptive speech streaming method is proposed to improve the perceived speech quality (PSQ) of voice over wireless multimedia sensor network (WMSNs). First of all, the proposed method estimates the PSQ of the received speech data under different network conditions that are represented by the packet loss rates (PLRs). Simultaneously, the proposed method classifies the speech signal as either an onset or a nononset frame. Based on the estimated PSQ and the speech class, it determines an appropriate bit rate for the redundant speech data (RSD) that are transmitted with the primary speech data (PSD) to help reconstruct the speech signals of any lost frames. In particular, when the estimated PLR is high, the bit rate of the RSD should be increased by decreasing that of the PSD. Thus, the bandwidth of the PSD is changed from wideband to narrowband, and an artificial bandwidth extension technique is applied to the decoded narrowband speech. It is shown from the simulation that the proposed method significantly improves the decoded speech quality under packet loss conditions in a WMSN, compared to a decoder-based packet loss concealment method and a conventional redundant speech transmission method.

Published in International Journal of Distributed Sensor Networks

ISSN: 1550-1329 (Print); 1550-1477 (Online)
Publisher: Hindawi - SAGE Publishing
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://onlinelibrary.wiley.com/journal/dsn

About the journal