Ibérica (Jan 2017)

SCAP-TT

  • Patrick Goethals,
  • Els Lefever,
  • Lieve Macken

Journal volume & issue
no. 33

Abstract

Read online

In this research note we report on the first results of SCAP, the Spanish Corpus Annotation Project, applied to tourism discourse. In particular, we present and assess a new TreeTagger parameter set for Spanish (SCAP-TT), which has been trained for the Part-of-Speech tagging (POS-tagging) and lemmatisation of Spanish promotional tourism texts. Although SCAP-TT has been trained for specialized tourism discourse, we also show promising results for the annotation of other text genres such as essays and literary text

Keywords