Scientific Data (Nov 2023)

Grammars Across Time Analyzed (GATA): a dataset of 52 languages

  • Frederic Blum,
  • Carlos Barrientos,
  • Adriano Ingunza,
  • Damián E. Blasi,
  • Roberto Zariquiey

DOI
https://doi.org/10.1038/s41597-023-02659-1
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 11

Abstract

Grammars Across Time Analyzed (GATA) is a resource capturing two snapshots of the grammatical structure of a diverse range of languages separated in time, aimed at furthering research on historical linguistics, language evolution, and cultural change. GATA comprises grammatical information on 52 diverse languages across all continents, featuring morphological, syntactic, and phonological information based on published grammars of the same language at two different time points. Here we introduce the coding scheme and design features of GATA, and we describe some salient patterns related to language change and the coverage of grammatical descriptions over time.