Eesti Rakenduslingvistika Ühingu Aastaraamat (Apr 2015)

Korpusleksikograafia uued võimalused eesti keele kollokatsioonisõnastiku näitel

  • Jelena Kallas,
  • Kristina Koppel,
  • Maria Tuulik

DOI
https://doi.org/10.5128/ERYa11.05
Journal volume & issue
Vol. 11
pp. 75 – 94

Abstract

Read online

This article aims to introduce new resources and methods used in Estonian corpus lexicography to create monolingual Estonian dictionaries. Corpora can be used in many ways: headwords list development, grammatical and frequency labels, word sense division, identifying collocations, good dictionary examples, translation equivalents (Kilgarriff 2013). The paper focuses on features offered by Sketch Engine (Kilgarriff et al. 2004), a state-of-the-art lexicographic tool for corpus analysis. For Estonian, Sketch Engine contains different types of corpora, including the recently created 260 million-word web corpus etTenTen13 and the 463 million-word Esto- nian National Corpus.

Keywords