Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie (Feb 2021)
Copies: Composition, Structure, Analytical Markup
Abstract
The article substantiates the markup of machine-readable transcriptions of medieval Slavic manuscripts that serve as textual material for historical corpora; the marked-up excerpts are treated as having valuable codicological or textological characteristics. Analyzing the composition of four manuscripts of the Slavonic Parimejnik (12th–14th cc.) and modeling their structure with generally accepted tools of linguistic analysis made it possible to solve the following tasks: identifying the analytical units, elaborating a text format, and algorithmizing the search process on the basis of the characteristics of natural-language units. The suggested format for describing lectionaries includes data on the section (sub-section) of the liturgical year, the number of the lectionary within that section, its textual composition in relation to the texts of the Bible, and its topic. The format for the dates, days and times at which lectionaries are read throughout the year includes data on the section (sub-section), the date in the fixed calendar or the week and day within the section (sub-section) of the non-fixed calendar, the time of the church service, and the event to which the service is devoted. The markup of the Parimejnik texts involved indicating the boundaries of the excerpts and determining their relations to the dictionary unit, with the aim of providing a parallel corpus of four Slavic manuscripts of the Parimejnik in which the corresponding excerpts are aligned at the level of lectionaries and Bible verses within the corpus "Manuscript" (manuscripts.ru).
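The two description formats and the lectionary-level alignment summarized above can be pictured with a minimal Python sketch. It is an illustration only, not the format used in the corpus "Manuscript": every class name, field name, manuscript label and sample value is a hypothetical assumption, and the only things taken from the abstract are the list of fields and the rule that a reading is dated either by the fixed calendar or by week and day in the non-fixed calendar.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, Optional, Tuple


@dataclass(frozen=True)
class LectionaryDescription:
    """Hypothetical record for the lectionary description format."""
    section: str                 # section (sub-section) of the liturgical year
    number: int                  # number of the lectionary within the section
    bible_refs: Tuple[str, ...]  # textual composition relative to the Bible
    topic: str


@dataclass(frozen=True)
class ReadingOccasion:
    """Hypothetical record for the date/day/time format."""
    section: str
    service_time: str                 # time of the church service
    event: str                        # event to which the service is devoted
    fixed_date: Optional[str] = None  # date in the fixed calendar, e.g. "MM-DD"
    week: Optional[int] = None        # week within the non-fixed section
    day: Optional[str] = None         # day within that week

    def __post_init__(self) -> None:
        # Assumption: exactly one of the two dating schemes is filled in.
        fixed = self.fixed_date is not None
        movable = self.week is not None or self.day is not None
        if fixed == movable:
            raise ValueError("give either fixed_date or week and day")
        if movable and (self.week is None or self.day is None):
            raise ValueError("non-fixed dating needs both week and day")


@dataclass(frozen=True)
class Excerpt:
    """A marked-up excerpt of one manuscript (labels are made up)."""
    manuscript: str
    lectionary: LectionaryDescription
    text: str


def align_by_lectionary(
    excerpts: Iterable[Excerpt],
) -> Dict[Tuple[str, int], Dict[str, str]]:
    """Group excerpts from different manuscripts under (section, number)."""
    aligned: Dict[Tuple[str, int], Dict[str, str]] = defaultdict(dict)
    for excerpt in excerpts:
        key = (excerpt.lectionary.section, excerpt.lectionary.number)
        aligned[key][excerpt.manuscript] = excerpt.text
    return dict(aligned)


# Illustrative placeholder values only; none are taken from the corpus.
sample = LectionaryDescription("section-A", 1, ("Gen 1:1-13",), "placeholder topic")
parallel = align_by_lectionary([
    Excerpt("MS-1", sample, "transcription from the first manuscript"),
    Excerpt("MS-2", sample, "transcription from the second manuscript"),
])
```

Keying the alignment on the pair of section and lectionary number is one possible design choice; a verse-level alignment could be sketched the same way by keying on individual Bible references instead.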