Neural Embedding Allocation: Distributed Representations of Topic Models

Kamrun Naher Keya; Yannis Papanikolaou; James R. Foulds

doi:10.1162/coli_a_00457

Computational Linguistics (Aug 2022)

Neural Embedding Allocation: Distributed Representations of Topic Models

Kamrun Naher Keya,
Yannis Papanikolaou,
James R. Foulds

Affiliations

Kamrun Naher Keya
Yannis Papanikolaou
James R. Foulds

DOI: https://doi.org/10.1162/coli_a_00457
Journal volume & issue: Vol. 48, no. 4

Abstract

Read online

We propose a method that uses neural embeddings to improve the performance of any given LDA-style topic model. Our method, called neural embedding allocation (NEA), deconstructs topic models (LDA or otherwise) into interpretable vector-space embeddings of words, topics, documents, authors, and so on, by learning neural embeddings to mimic the topic model. We demonstrate that NEA improves coherence scores of the original topic model by smoothing out the noisy topics when the number of topics is large. Furthermore, we show NEA’s effectiveness and generality in deconstructing and smoothing LDA, author-topic models, and the recent mixed membership skip-gram topic model and achieve better performance with the embeddings compared to several state-of-the-art models.

Published in Computational Linguistics

ISSN: 0891-2017 (Print); 1530-9312 (Online)
Publisher: The MIT Press
Country of publisher: United States
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing
Website: https://direct.mit.edu/coli

About the journal