Journal of King Saud University: Computer and Information Sciences (May 2022)
Multi task learning with general vector space for cross-lingual semantic relation detection
Abstract
Semantic relation detection has an important role in natural language processing. In a supervised approach, the training process requires a sufficient amount of labeled data. However, in low-resource languages, labeled data are limited, whereas in rich-resource languages, labeled data are available in large quantities. In addition, various studies tend to model the single-task problem without considering the generalization with other tasks. Hence, a strategy that can utilize the availability of labeled data in rich-resource languages and generalize models to improve the identification of relations in a cross-lingual manner is needed. In this paper, we propose a framework to identify cross-lingual semantic relation using multi-task learning with a general vector space. The proposed method was designed to construct a general vector space and semantic relation identification. The experiments were conducted over three datasets: Indonesian–Arabic, English–Arabic, and English–Indonesia. The results show that the use of multi-task learning with a general vector space can overcome the problem of cross-lingual semantic relation identification. This is shown by the accuracy of the synonym and hypernym tasks that reached 84.9% and 84.8%, respectively.