Ibérica (Jan 2017)
SCAP-TT
Abstract
In this research note we report on the first results of SCAP, the Spanish Corpus Annotation Project, applied to tourism discourse. In particular, we present and assess a new TreeTagger parameter set for Spanish (SCAP-TT), which has been trained for the Part-of-Speech tagging (POS-tagging) and lemmatisation of Spanish promotional tourism texts. Although SCAP-TT has been trained for specialized tourism discourse, we also show promising results for the annotation of other text genres such as essays and literary text