Anuario del Seminario de Filología Vasca "Julio de Urquijo" (Apr 2009)

Corpusen etiketatze linguistikoa

  • Izaskun Aldezabal Roteta,
  • María Jesús Aranzabe,
  • Arantza Díaz de Ilarraza,
  • Ainara Estarrona,
  • Nerea Ezeiza,
  • Larraitz Uria

DOI
https://doi.org/10.1387/asju.1672
Journal volume & issue
Vol. 43, no. 1-2

Abstract

Read online

In this article, we shall comment on the steps that have to be taken to give a linguistic label to a corpus and the difficulties that appear in this process. Our main objective was to highlight the importance of the labelling when preparing a corpus that is useful for linguistic research, and the need to establish criteria and to take the decisions that this entails. We also explain how semi-automatic methods are applied and how the manual revision that guarantees the quality of the corpus is carried out. Once the corpus has been revised and labelled, it will be useful both for carrying out linguistic analyses and for improving or assessing the linguistic tools and resources, and also for channelling automatic study.