Zeitschrift für Sprachwissenschaft (Nov 2021)

Quantifying graphemic variation via large text corpora

  • Lüschow Hanna

DOI
https://doi.org/10.1515/zfs-2021-2038
Journal volume & issue
Vol. 40, no. 3
pp. 421 – 440

Abstract

Read online

The use of some basic computer science concepts could expand the possibilities of (manual) graphematic text corpus analysis. With these it can be shown that graphematic variation decreases constantly in printed German texts from 1600 to 1900. While the variability is continuously lesser on a text-internal level, it decreases faster for the whole available writing system of individual decades. But which changes took place exactly? Which types of variation went away more quickly, which ones persisted? How do we deal with large amounts of data which cannot be processed manually anymore? Which aspects are of special importance or go missing while working with a large textual base?

Keywords