Mixture-of-Experts Model for Hypernymy Discrimination

ZENG Nan, XIE Zhipeng

doi:10.11896/jsjkx.211200066

Jisuanji kexue (Feb 2023)

Mixture-of-Experts Model for Hypernymy Discrimination

ZENG Nan, XIE Zhipeng

Affiliations

ZENG Nan, XIE Zhipeng: School of Computer Science,Fudan University,Shanghai 200438,China

DOI: https://doi.org/10.11896/jsjkx.211200066
Journal volume & issue: Vol. 50, no. 2
pp. 285 – 291

Abstract

Read online

Hypernymy discrimination is an essential and challenging task in NLP.Traditional supervised methods usually model all the hypernymies in the global semantic space,which has achieved fair performance.However,the distributed semantic representation of hypernymies is rather complex,and their manifestations may differ significantly in different areas of the semantic space,making it difficult to learn the global model.This paper employs the mixture-of-experts framework as a solution.It works on the basis of a divide-and-conquer strategy,which divides the semantic space into multiple subspaces,and each subspace corres-ponds to a local expert(model).A number of localized experts(models) focus on their own domains(or subspaces) to learn their specialties,and a gating mechanism determines the space partitioning and the expert aggregation.Experimental results show that the mixture-of-experts model outperforms the traditional global ones on public datasets.

hypernymy discrimination|mixture-of-experts|local model

Published in Jisuanji kexue

ISSN: 1002-137X (Print)
Publisher: Editorial office of Computer Science
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software; Technology: Technology (General)
Website: http://www.jsjkx.com/CN/1002-137X/home.shtml

About the journal

Abstract

Keywords