AI Open (Jan 2023)
Word sense induction with agglomerative clustering and mutual information maximization
Abstract
Word sense induction (WSI) is a challenging problem in natural language processing that involves the unsupervised automatic detection of a word’s senses (i.e., meanings). Recent work achieves significant results on the WSI task by pre-training a language model dedicated exclusively to disambiguating word senses. In contrast, other approaches employ off-the-shelf pre-trained language models with additional strategies to induce senses. This paper proposes a novel unsupervised method based on hierarchical clustering and invariant information clustering (IIC). The IIC loss is used to train a small model to maximize the mutual information between two vector representations of a target word occurring in a pair of synthetic paraphrases. This model is later used in inference mode to extract a higher-quality vector representation to be used in the hierarchical clustering. We evaluate our method on two WSI tasks and in two distinct clustering configurations (fixed and dynamic number of clusters). We empirically show that our approach is at least on par with the state-of-the-art baselines, outperforming them in several configurations. The code and data to reproduce this work are publicly available at https://github.com/hadi-abdine/wsi-mim.
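To make the two ingredients of the abstract concrete, below is a minimal, hedged sketch (not the authors' released code) of an IIC-style loss that maximizes mutual information between the soft cluster assignments of a target word's two contextual vectors (one per synthetic paraphrase), followed by agglomerative clustering at inference time. Names such as `SmallHead`, `hidden_dim`, and `n_clusters`, and the placeholder tensors, are illustrative assumptions rather than details taken from the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from sklearn.cluster import AgglomerativeClustering


class SmallHead(nn.Module):
    """Small model mapping a contextual word vector to soft cluster assignments."""

    def __init__(self, hidden_dim: int = 768, n_clusters: int = 10):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, n_clusters),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return F.softmax(self.net(x), dim=-1)


def iic_loss(p: torch.Tensor, p_prime: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Negative mutual information between paired soft assignments (IIC objective)."""
    # Joint distribution over cluster pairs, averaged over the batch and symmetrized.
    joint = (p.unsqueeze(2) * p_prime.unsqueeze(1)).mean(dim=0)  # (C, C)
    joint = ((joint + joint.t()) / 2).clamp(min=eps)
    marg_p = joint.sum(dim=1, keepdim=True)   # marginal over rows
    marg_q = joint.sum(dim=0, keepdim=True)   # marginal over columns
    # I(z, z') = sum_ij P_ij (log P_ij - log P_i - log P_j); minimize its negative.
    return -(joint * (joint.log() - marg_p.log() - marg_q.log())).sum()


# Training step: `vecs` and `vecs_para` stand in for the target word's contextual
# embeddings in an instance and in its synthetic paraphrase (placeholder data here).
head = SmallHead()
optimizer = torch.optim.Adam(head.parameters(), lr=1e-4)
vecs, vecs_para = torch.randn(32, 768), torch.randn(32, 768)
loss = iic_loss(head(vecs), head(vecs_para))
loss.backward()
optimizer.step()

# Inference: feed the trained head's outputs to agglomerative (hierarchical)
# clustering; n_clusters may be fixed or chosen dynamically, mirroring the
# paper's two evaluation configurations.
with torch.no_grad():
    features = head(vecs).numpy()
labels = AgglomerativeClustering(n_clusters=4).fit_predict(features)
```

The design choice illustrated here is that the small head is trained only to make the two paraphrase views agree in their cluster assignments (high mutual information), while the final sense partition is produced by a separate hierarchical clustering step applied to the learned representations.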