RUDN Journal of Language Studies, Semiotics and Semantics (Dec 2024)
Towards a Taxonomy of Textbooks as a Genre: the Case of Russian Textbooks
Abstract
The project is presented in the paper initially is launched to design a functional recognition or classification model of a modern Russian school textbook as a genre. In this study we test and confirm the hypothesis that detection of domain (subject area) and complexity level of a textbook can be reduced to a limited number of quantitative linguistic parameters provided with accurately identified and verified value ranges. We outlined our approach to genre analysis as multi-dimensional, compiled a corpus of over 1 mln. tokens, measured values of 15 linguistic parameters in 19 textbooks of two different subject areas and complexity levels, revealed 7 complexity predictors, 7 subject area predictors, and one - frequency - a metaparameter able to discriminate textbooks of History and Social Studies from texts of other genres. Our findings highlight the significance of the following parameters for textbooks across the selected subject areas: incidence of nouns, verb tenses (present, past and future), local and global argument overlap, type-token ratio. Complexity classification model is ascertained to be a function of sentence length, word length, incidence of nouns in genitive case and verbs, Abstractness score, verb/noun ratio, and adjective/noun ratio. The outcomes of this analysis will be used to interpret quantitative linguistic descriptions and classify texts.
Keywords