Improvement of image analysis/synthesis technologies of acoustic (speech) information for the control, safety and communication systems

V. Dvoryankin; Nikita S. Dvoryankin; Roman A. Ustinov

doi:10.26583/bit.2019.1.07

Безопасность информационных технологий (Mar 2019)

Affiliations

V. Dvoryankin: Financial University under Government of the Russian Federation
Nikita S. Dvoryankin: National Nuclear Research University MEPhI
Roman A. Ustinov: Financial University under Government of the Russian Federation

DOI: https://doi.org/10.26583/bit.2019.1.07
Journal volume & issue: Vol. 26, no. 1
pp. 64 – 76

Abstract

Voice communication has been and remains one of the main means of human communication and human-machine interaction. Building new, promising systems for processing and protecting speech information is impossible today without modeling effective mechanisms of speech transformation and creating speech-like signals with specified properties. For this purpose, a unique approach is proposed that transforms the halftone image of a speech signal spectrogram into a binary one, modifies it to solve problems of speech information protection and processing, and allows a reverse transition to a halftone image followed by synthesis of a new speech-like signal with the desired properties. Improving the speech production model by exploiting the properties of auditory perception and taking into account the specifics of binary spectrogram formation can significantly reduce the volume of speech information without loss of its semantic content or recognizability. It also makes it possible to use a rich and well-tested arsenal of methods for recognizing and processing binary and halftone images, along with a number of other important advantages. The prospects of applying image analysis-synthesis technologies to narrow-band sonograms and other kinds of images are also evaluated for the problems of acoustic steganography, digital noise cleaning and reconstruction of distorted phonograms, audio labeling of significant information, and speech compression and restoration.

Keywords: information protection, figurative analysis-synthesis, speech signal, binarization of speech spectrum images, short-time Fourier transform.
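
The pipeline outlined in the abstract (spectrogram image, binarization, reverse transition, resynthesis) can be illustrated with a minimal sketch. This is not the authors' algorithm: the relative-magnitude threshold, the reuse of the original short-time phase for the reverse transition, and all function names (`stft`, `binarize`, `resynthesize`) are assumptions made for illustration. A naive pure-Python DFT is used only to keep the example self-contained.

```python
import cmath
import math

def hann(n):
    # Periodic Hann window; copies shifted by n/2 sum to exactly 1.
    return [0.5 - 0.5 * math.cos(2 * math.pi * k / n) for k in range(n)]

def dft(frame, inverse=False):
    # Naive O(n^2) discrete Fourier transform (forward or inverse).
    n = len(frame)
    sign = 1 if inverse else -1
    out = [sum(x * cmath.exp(sign * 2j * math.pi * k * m / n)
               for m, x in enumerate(frame)) for k in range(n)]
    return [v / n for v in out] if inverse else out

def stft(signal, n=64, hop=32):
    # Short-time Fourier transform: one windowed DFT per frame.
    w = hann(n)
    return [dft([signal[s + k] * w[k] for k in range(n)])
            for s in range(0, len(signal) - n + 1, hop)]

def binarize(spec, rel_threshold=0.25):
    # Assumed rule: a cell becomes 1 if its magnitude reaches
    # rel_threshold times the largest magnitude in the spectrogram.
    peak = max(abs(v) for frame in spec for v in frame)
    return [[1 if abs(v) >= rel_threshold * peak else 0 for v in frame]
            for frame in spec]

def resynthesize(spec, mask, n=64, hop=32):
    # Assumed reverse transition: keep the original complex values where
    # the binary image is 1, zero the rest, then inverse DFT + overlap-add.
    out = [0.0] * (hop * (len(spec) - 1) + n)
    for i, (frame, bits) in enumerate(zip(spec, mask)):
        samples = dft([v * b for v, b in zip(frame, bits)], inverse=True)
        for k, v in enumerate(samples):
            out[i * hop + k] += v.real
    return out

# Toy input: a strong tone plus a weak interfering tone.
N, HOP = 64, 32
clean = [math.cos(2 * math.pi * 8 * t / N) for t in range(256)]
noisy = [c + 0.05 * math.cos(2 * math.pi * 20 * t / N)
         for t, c in enumerate(clean)]

spec = stft(noisy, N, HOP)
mask = binarize(spec)            # binary "image" of the spectrogram
restored = resynthesize(spec, mask, N, HOP)
```

In this toy case the binary image keeps only the cells around the strong tone, so the resynthesized signal matches the clean tone away from the signal edges. Real speech would need a perceptually motivated binarization rule and a phase-recovery step, neither of which is shown here.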

Published in Безопасность информационных технологий

ISSN: 2074-7128 (Print); 2074-7136 (Online)
Publisher: Joint Stock Company "Experimental Scientific and Production Association SPELS"
Country of publisher: Russian Federation
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Science (General): Cybernetics: Information theory
Website: https://bit.spels.ru
