Improving Word Embedding Using Variational Dropout

Zainab Albujasim; Diana Inkpen; Xuejun Han; Yuhong Guo

doi:10.32473/flairs.36.133326

Proceedings of the International Florida Artificial Intelligence Research Society Conference (May 2023)

Improving Word Embedding Using Variational Dropout

Zainab Albujasim,
Diana Inkpen,
Xuejun Han,
Yuhong Guo

Affiliations

Zainab Albujasim: ORCiD; Carleton University
Diana Inkpen: ORCiD; University of Ottawa
Xuejun Han: ORCiD; Carleton University
Yuhong Guo: ORCiD; Carleton University

DOI: https://doi.org/10.32473/flairs.36.133326
Journal volume & issue: Vol. 36

Abstract

Read online

Pre-trained word embeddings are essential in natural language processing (NLP). In recent years, many post-processing algorithms have been proposed to improve the pre-trained word embeddings. We present a novel method - Orthogonal Auto Encoder with Variational Dropout (OAEVD) for improving word embeddings based on orthogonal autoencoders and variational dropout. Specifically, the orthogonality constraint encourages more diversity in the latent space and increases semantic similarities between similar words, and variational dropout makes it more robust to overfitting. Empirical evaluation on a range of downstream NLP tasks, including semantic similarity, text classification, and concept categorization shows that our proposed method effectively improves the quality of pre-trained word embeddings. Moreover, the proposed method successfully reduces the dimensionality of pre-trained word embeddings while maintaining high performance.

Published in Proceedings of the International Florida Artificial Intelligence Research Society Conference

ISSN: 2334-0754 (Print); 2334-0762 (Online)
Publisher: LibraryPress@UF
Country of publisher: United States
LCC subjects: Technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://journals.flvc.org/FLAIRS/index

About the journal

Abstract

Keywords