RIDE (Nov 2018)

Review of the “Corpus Oral de Referencia de la Lengua Española Contemporánea”

  • Katrin Betz

DOI
https://doi.org/10.18716/ride.a.9.5
Journal volume & issue
Vol. 9

Abstract

In this paper we review the “Corpus Oral de Referencia de la Lengua Española Contemporánea” (CORLEC). The corpus is a carefully compiled collection of orthographically transcribed recordings of oral conversations. Because it was compiled as a reference corpus, it covers texts of different styles and registers and comprises about 1,100,000 words. CORLEC is therefore an important resource for researchers working on spoken language. The corpus is freely available as SGML-based files or as plain-text files. As the two variants are similar in structure and content, the review addresses both formats. The review first provides some background information and an overview of the structure of the corpus. It then describes the structure and content of the transcription files. Finally, it gives a short overview of the integration of the corpus into other projects and closes with a summary.

Keywords