Lexikos (Dec 2018)

Corpus-driven Bantu Lexicography Part 2: Lemmatisation and Rulers for Lusoga

  • Gilles-Maurice de Schryver
  • Minah Nabirye

DOI
https://doi.org/10.5788/28-1-1458
Journal volume & issue
Vol. 28
pp. 79 – 111

Abstract

This article is the second in a trilogy that deals with corpus-driven Bantu lexicography, which is illustrated for Lusoga. The focus here is on the macrostructure and in particular on the building of a lemmatised frequency list directly within a dictionary-writing system. The programming code for the parts of the lemmatisation that may be automated is included as addenda. A second focus is on the embedded part-of-speech and alphabetical rulers, for which it is shown how these may be used to plan the actual compilation of the dictionary entries.

Keywords