EAI Endorsed Transactions on Industrial Networks and Intelligent Systems (Jun 2023)

Transforming Data with Ontology and Word Embedding for an Efficient Classification Framework

  • Thi Thanh Sang Nguyen,
  • Pham Minh Thu Do,
  • Thanh Tuan Nguyen,
  • Thanh Tho Quan

DOI
https://doi.org/10.4108/eetinis.v10i2.2726
Journal volume & issue
Vol. 10, no. 2

Abstract

Read online

Transforming data into appropriate formats is crucial because it can speed up the training process and enhance the performance of classification algorithms. It is, however, challenging due to the complicated process, resource-intensive and preserved meaning of the data. This study proposes new approaches to building knowledge representation models using word-embedding and ontology techniques, which can transform text data into digital data and still keep semantic/context information of themselves in order to enhance modeling data later. To evaluate the effectiveness of the built models, a classification framework is proposed and performed on a public real dataset. Experimental results show that the constructed knowledge representation models contribute significantly to the performance of classification methods.

Keywords