Transactions of the Association for Computational Linguistics (Jan 2021)

Unsupervised Abstractive Opinion Summarization by Generating Sentences with Tree-Structured Topic Guidance

  • Masaru Isonuma,
  • Junichiro Mori,
  • Danushka Bollegala,
  • Ichiro Sakata

DOI
https://doi.org/10.1162/tacl_a_00406
Journal volume & issue
Vol. 9
pp. 945 – 961

Abstract

This paper presents a novel unsupervised abstractive summarization method for opinionated texts. While basic variational autoencoder-based models assume a unimodal Gaussian prior for the latent code of sentences, we replace it with a recursive Gaussian mixture, where each mixture component corresponds to the latent code of a topic sentence and is mixed by a tree-structured topic distribution. By decoding each Gaussian component, we generate sentences with tree-structured topic guidance, where the root sentence conveys generic content and the leaf sentences describe specific topics. Experimental results demonstrate that the generated topic sentences are appropriate as a summary of opinionated texts: they are more informative and cover more of the input content than those generated by a recent unsupervised summarization model (Bražinskas et al., 2020). Furthermore, we demonstrate that the variance of the latent Gaussians represents the granularity of sentences, analogous to Gaussian word embeddings (Vilnis and McCallum, 2015).
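
To make the prior concrete, the following is a minimal sketch (not the authors' implementation) of a recursive Gaussian mixture over a small hypothetical topic tree: each tree node carries a Gaussian whose mean is sampled around its parent's mean with shrinking variance, and a sentence latent code is drawn from the mixture under an assumed tree-structured topic distribution. All names, the tree shape, and the mixture weights are illustrative assumptions.

```python
import numpy as np

# Sketch of a tree-structured (recursive) Gaussian mixture prior.
# Each node k has a Gaussian N(mu_k, sigma_k^2 I); a child's mean is
# drawn around its parent's mean (the recursive part), and a sentence
# latent code z is sampled from the mixture weighted by a
# tree-structured topic distribution pi. Not the paper's code.

rng = np.random.default_rng(0)
latent_dim = 16

# Hypothetical two-level topic tree: root 0 with children 1..3.
tree = {0: [1, 2, 3]}

# Build node Gaussians recursively: children are centred on the parent,
# with smaller standard deviation at deeper levels (finer-grained topics).
means = {0: np.zeros(latent_dim)}
stds = {0: 1.0}
for parent, children in tree.items():
    for c in children:
        means[c] = means[parent] + stds[parent] * rng.standard_normal(latent_dim)
        stds[c] = 0.5 * stds[parent]

# Assumed fixed topic distribution over nodes (in the paper this is
# inferred from the input reviews rather than fixed by hand).
nodes = sorted(means)
pi = np.array([0.4, 0.2, 0.2, 0.2])

def sample_latent():
    """Draw a sentence latent code z from the recursive Gaussian mixture."""
    k = rng.choice(nodes, p=pi)
    return means[k] + stds[k] * rng.standard_normal(latent_dim)

# Decoding means[k] for each node k would yield a "topic sentence":
# the root gives generic content, deeper nodes give more specific topics.
z = sample_latent()
print(z.shape)  # (16,)
```

The shrinking per-level standard deviation mirrors the paper's observation that the variance of the latent Gaussians tracks sentence granularity: broad, generic content near the root and specific topics at the leaves.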