IEEE Access (Jan 2020)

A Classification Retrieval Method for Encrypted Speech Based on Deep Neural Network and Deep Hashing

  • Qiuyu Zhang,
  • Xuejiao Zhao,
  • Yingjie Hu

DOI
https://doi.org/10.1109/ACCESS.2020.3036048
Journal volume & issue
Vol. 8
pp. 202469 – 202482

Abstract

Read online

In order to improve the retrieval efficiency and accuracy of the existing encrypted speech retrieval methods, and improve the semantic representation of speech features and classification performance, a classification retrieval method for encrypted speech based on deep neural network (DNN) and deep hashing is proposed. Firstly, the speech files are classified according to the category tags, and the speech files are encrypted by Rossler chaotic map method and uploaded to the cloud encrypted speech library. Secondly, the Log-Mel spectrogram features of the original speech are extracted, and extract deep semantic features and generate classification results through the trained convolutional neural network (CNN) and convolutional recurrent neural network (CRNN). Finally, the semantic feature hash code is obtained through the constructed hash function, combined with the category hash code encoded by One Hot coding to obtain the final deep hashing binary code, and uploaded to the deep hashing index table. When retrieval, the deep hashing binary code of the query speech is obtained, and the “two-stage” classification retrieval strategy and the normalized Hamming distance algorithm are used to match the semantic feature hash. Experimental results show that the proposed two DNN coding models have excellent feature learning performance, and has better recall rate, precision rate and retrieval efficiency.

Keywords