The Journal for Transdisciplinary Research in Southern Africa (Apr 2010)

Constructing an XML database of linguistics data

  • J H Kroeze,
  • T JD Bothma,
  • M C Matthee

DOI
https://doi.org/10.4102/td.v6i1.118
Journal volume & issue
Vol. 6, no. 1
pp. e1 – e36

Abstract

Read online

A language-oriented, multi-dimensional database of the linguistic characteristics of the Hebrew text of the Old Testament can enable researchers to do ad hoc queries. XML is a suitable technology to transform free text into a database. A clause’s word order can be kept intact while other features such as syntactic and semantic functions can be marked as elements or attributes. The elements or attributes from the XML “database” can be accessed and proces sed by a 4th generation programming language, such as Visual Basic. XML is explored as an option to build an exploitable database of linguistic data by representing inherently multi-dimensional data, including syntactic and semantic analyses of free text.

Keywords