Journal of Computing Research and Innovation (Apr 2021)

Detection of Adjective Compound Word in Malay Language using Enhanced Syntactic Rules

  • Zamri Abu Bakar,
  • Normaly Kamal Ismail,
  • Nurhilyana Anuar,
  • Aminatul Solehah Idris

Journal volume & issue
Vol. 6, no. 2

Abstract

A compound word is defined as a combination of two or more words that produces a new meaning. Compound words exist in many languages, such as English, Mandarin, and Arabic. Although existing methods for detecting compound words have been discussed, they have limitations in detecting Malay compound words. This study therefore aims to improve the accuracy of detecting adjective compound words. The training data consisted of digitized Malay story books. Pre-processing, comprising tokenization, stemming, bi-gram generation, and part-of-speech (POS) tagging, was applied to produce candidate compound words. Applying the enhanced syntactic rules yielded a precision of 70.3%. This study thus contributes to academic research on improving search and document summarization applications.
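The pipeline described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the tiny stem table, the POS lexicon, and the two tag patterns (`ADJ NOUN` and `ADJ ADJ`) are hypothetical stand-ins, not the enhanced syntactic rules, stemmer, or tagger used in the study, which the abstract does not specify. The `precision` helper uses the standard definition, correct detections divided by all detections.

```python
import re

# Hypothetical toy resources (assumptions for illustration only).
# A real system would use a proper Malay stemmer and a trained POS tagger.
STEMS = {"bajunya": "baju"}
POS_LEXICON = {
    "baju": "NOUN", "merah": "ADJ", "jambu": "NOUN",
    "dan": "CONJ", "sangat": "ADV", "cantik": "ADJ",
}

# Assumed tag patterns for adjective compound candidates.
# These are NOT the study's enhanced syntactic rules.
CANDIDATE_PATTERNS = {("ADJ", "NOUN"), ("ADJ", "ADJ")}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def stem(token):
    """Map a token to its stem via the toy lookup table."""
    return STEMS.get(token, token)


def bigrams(tokens):
    """Return all pairs of adjacent tokens."""
    return list(zip(tokens, tokens[1:]))


def tag(token):
    """Look up a POS tag; unknown words get the tag 'UNK'."""
    return POS_LEXICON.get(token, "UNK")


def candidates(text):
    """Tokenize, stem, form bi-grams, and keep pairs matching a tag pattern."""
    tokens = [stem(t) for t in tokenize(text)]
    return [
        (a, b) for a, b in bigrams(tokens)
        if (tag(a), tag(b)) in CANDIDATE_PATTERNS
    ]


def precision(predicted, gold):
    """Fraction of predicted pairs that appear in the gold set."""
    if not predicted:
        return 0.0
    correct = sum(1 for pair in predicted if pair in gold)
    return correct / len(predicted)


found = candidates("Bajunya merah jambu dan sangat cantik.")
print(found)  # [('merah', 'jambu')]
print(precision(found, {("merah", "jambu")}))  # 1.0
```

In this toy run, "merah jambu" (pink) is the only bi-gram whose tags match an assumed pattern. In practice, the tag patterns would be replaced by the actual rule set and the gold set by manually annotated compounds.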

Keywords