Multi task learning with general vector space for cross-lingual semantic relation detection

Rizka W. Sholikah; Agus Z. Arifin; Chastine Fatichah; Ayu Purwarianti

Journal of King Saud University: Computer and Information Sciences (May 2022)

Multi task learning with general vector space for cross-lingual semantic relation detection

Rizka W. Sholikah,
Agus Z. Arifin,
Chastine Fatichah,
Ayu Purwarianti

Affiliations

Rizka W. Sholikah: Department of Informatics, Faculty of Intelligent Electrical and Informatics Technology, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia; Corresponding author at: Department of Informatics, Faculty of Intelligent Electrical and Informatics Technology, Institut Teknologi Sepuluh Nopember, Surabaya 60111, Indonesia.
Agus Z. Arifin: Department of Informatics, Faculty of Intelligent Electrical and Informatics Technology, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
Chastine Fatichah: Department of Informatics, Faculty of Intelligent Electrical and Informatics Technology, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
Ayu Purwarianti: Informatics, School of Electrical and Informatics, Institut Teknologi Bandung, Bandung, Indonesia

Journal volume & issue: Vol. 34, no. 5
pp. 2161 – 2169

Abstract

Read online

Semantic relation detection has an important role in natural language processing. In a supervised approach, the training process requires a sufficient amount of labeled data. However, in low-resource languages, labeled data are limited, whereas in rich-resource languages, labeled data are available in large quantities. In addition, various studies tend to model the single-task problem without considering the generalization with other tasks. Hence, a strategy that can utilize the availability of labeled data in rich-resource languages and generalize models to improve the identification of relations in a cross-lingual manner is needed. In this paper, we propose a framework to identify cross-lingual semantic relation using multi-task learning with a general vector space. The proposed method was designed to construct a general vector space and semantic relation identification. The experiments were conducted over three datasets: Indonesian–Arabic, English–Arabic, and English–Indonesia. The results show that the use of multi-task learning with a general vector space can overcome the problem of cross-lingual semantic relation identification. This is shown by the accuracy of the synonym and hypernym tasks that reached 84.9% and 84.8%, respectively.

Published in Journal of King Saud University: Computer and Information Sciences

ISSN: 1319-1578 (Print)
Publisher: Elsevier
Country of publisher: Saudi Arabia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.journals.elsevier.com/journal-of-king-saud-university-computer-and-information-sciences/

About the journal

Abstract

Keywords