Journal of Open Humanities Data (Sep 2024)

Corpus Bootstrapping for Syriac Linguistics

  • Charbel El-Khaissi

DOI
https://doi.org/10.5334/johd.229
Journal volume & issue
Vol. 10
pp. 46 – 46

Abstract

Read online

The present article summarises a bootstrapping approach to Syriac corpus linguistics that gives freedom for Syriac researchers to apply part-of-speech (POS) tagging technology on texts of their choice using the SEDRA API mechanism and offers an annotated corpus based on this method using a representative and well-balanced selection of Syriac manuscripts spanning one millennium.

Keywords