An Encrypted Speech Retrieval Method Based on Deep Perceptual Hashing and CNN-BiLSTM

Qiuyu Zhang; Yuzhou Li; Yingjie Hu; Xuejiao Zhao

doi:10.1109/ACCESS.2020.3015876

IEEE Access (Jan 2020)


Affiliations

Qiuyu Zhang: School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China
Yuzhou Li: School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China
Yingjie Hu: School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China
Xuejiao Zhao: School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China

DOI: https://doi.org/10.1109/ACCESS.2020.3015876
Journal volume & issue: Vol. 8, pp. 148556–148569

Abstract


A convolutional neural network (CNN) can extract only local features, while a long short-term memory (LSTM) network requires a large amount of computation, takes a long time to process, and loses more information as the length of the speech increases. To address these limitations, this paper exploits the autonomous feature extraction of deep learning and combines a CNN with a bidirectional long short-term memory (BiLSTM) network to present an encrypted speech retrieval method based on deep perceptual hashing and CNN-BiLSTM. First, the proposed method extracts the Log-Mel spectrogram/MFCC features of the original speech and feeds them into the CNN and BiLSTM networks in turn for model training. Second, the trained fusion network model is used to learn deep perceptual features and generate deep perceptual hashing sequences. Finally, the normalized Hamming distance is used for matching retrieval. To protect speech stored in the cloud, a speech encryption algorithm based on a 4D hyperchaotic system is proposed. Experimental results show that, compared with existing methods, the proposed method achieves good discrimination, robustness, recall, and precision, and that it maintains good retrieval efficiency and accuracy for longer speech. The proposed speech encryption algorithm also has a large key space, which allows it to resist exhaustive attacks.
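
The matching step named in the abstract can be illustrated with a minimal sketch. It assumes that hashes are equal-length binary sequences and that a query matches the stored entry with the smallest normalized Hamming distance, provided that distance does not exceed a threshold. The function names, the index layout, and the threshold value are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Optional, Tuple


def normalized_hamming(a: List[int], b: List[int]) -> float:
    """Fraction of positions at which two equal-length binary hashes differ."""
    if len(a) != len(b):
        raise ValueError("hash sequences must have the same length")
    if not a:
        raise ValueError("hash sequences must be non-empty")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def retrieve(
    query: List[int],
    index: Dict[str, List[int]],
    threshold: float,
) -> Optional[Tuple[str, float]]:
    """Return (key, distance) of the closest stored hash within the threshold.

    `index` maps a hypothetical speech identifier to its hash sequence.
    Returns None when no stored hash is close enough to the query.
    """
    best: Optional[Tuple[str, float]] = None
    for key, stored in index.items():
        d = normalized_hamming(query, stored)
        if d <= threshold and (best is None or d < best[1]):
            best = (key, d)
    return best


if __name__ == "__main__":
    # Toy 8-bit hashes; real deep perceptual hashes would be much longer.
    index = {
        "speech_a": [0, 1, 1, 0, 1, 0, 0, 1],
        "speech_b": [1, 1, 0, 0, 1, 1, 0, 0],
    }
    query = [0, 1, 1, 0, 1, 0, 1, 1]  # differs from speech_a in one bit
    print(retrieve(query, index, threshold=0.25))
```

In this toy index the query differs from `speech_a` in one of eight bits (distance 0.125) and from `speech_b` in five of eight (distance 0.625), so only `speech_a` falls within the assumed threshold of 0.25.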

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
