Data in Brief (Jun 2023)

Database description: Russian fricatives recorded in 198 real speech sentences from 59 speakers

  • Natalja Ulrich

Journal volume & issue
Vol. 48
p. 109205

Abstract

Read online

This speech dataset is primarily designed to investigate linguistic and speaker information in fricative sounds in Russian.Acoustic recordings were obtained from 59 students (30 females and 29 males) between 18 and 30 years. Eighteen participants were recorded in a second session. The participants were born and lived since their early childhood in St. Petersburg. The participants did not report any speech or hearing impairment. The recording sessions were conducted at the phonetic laboratory of the Phonetic Institute in St. Petersburg, in an audiometric booth using the recording program Speech-Recorder version 3.28.0 at a sample rate of 44.1 kHz (16-bit encoding). During the recordings, a clip-on microphone (Sennheiser MKE 2-P) was placed at a distance of 15cm from the speakers’ mouth and connected through an audio interface (Zoom U-22) to a laptop computer.The participants were instructed to read 198 randomized sentences from a computer screen. The fricatives [f], [s], [ʃ], [x], [v], [z], [ʒ], [sʲ], [ɕ], [vʲ], [zʲ] were embedded into those sentences. Two sentence structures were designed to obtain each real-word lexemes produced in three different contexts. The first type of sentence is a so-called carrier sentence with the structure of “She said ”X” and not “Y” ”. Minimal pairs of real words, containing one of the 11 tested fricatives were placed in both “X” and “Y” positions. The second type of pre-designed sentence was a natural language sentence including each of the lexemes.All raw audio files were first automatically pre-processed by applying the online tool Munich Automatic Segmentation system. Then, the files of the first recording session were filtered below 80 and above 20050 Hz, and the boundaries were manually corrected using Praat.The dataset consists of 22,561 fricative tokens. The number of observations per sound differs across categories, because of their natural distribution. The dataset is made available as a collection of audio files in wav format along with companion Praat TextGrid files for each sentence. Target fricatives are furthermore available as individual wav files.The whole dataset can be accessed with the DOI https://doi.org/10.48656/4q9c-gz16.Additionally, the experimental design allows the investigation of other sound categories. The number of speakers recorded gives further possibilities for phonetic-oriented speaker identification studies.

Keywords