Journal of Open Humanities Data (Jun 2023)

The ChildPoeDE Corpus: 1082 German Children’s Poems for Computational and Experimental Studies on Poetry Reception

  • Marina Lehmann,
  • Anne Heumann,
  • Moniek M. Kuijpers,
  • Gerhard Lauer,
  • Jana Lüdtke

DOI
https://doi.org/10.5334/johd.102
Journal volume & issue
Vol. 9
pp. 6 – 6

Abstract

Read online

We introduce childPoeDE: the first corpus of German poetry for children comprising poems which are still read today and cover a wide range of topics and authors. ChildPoeDE contains poem texts and both poem-level and token-level metadata. Poem-level metadata includes information about the anthologies and authors, quantitative text features, rhyme and lexical richness. Token-level metadata covers word length, position and frequency, parts-of-speech, onomatopoeia and sonority. This corpus can be used for computational text analysis, but also as a source for stimulus material in experimental studies. The corpus metadata is freely accessible via Zenodo. The poem texts are protected by copyright.

Keywords