Controllable Summarization with Constrained Markov Decision Process

Hou Pong Chan; Lu Wang; Irwin King

doi:10.1162/tacl_a_00423

Transactions of the Association for Computational Linguistics (Jan 2021)

Controllable Summarization with Constrained Markov Decision Process

Hou Pong Chan,
Lu Wang,
Irwin King

Affiliations

Hou Pong Chan: University of Macau, Macau SAR, China. [email protected]
Lu Wang: University of Michigan, Ann Arbor, MI, USA. [email protected]
Irwin King: The Chinese University of Hong Kong, Hong Kong SAR, China. [email protected]

DOI: https://doi.org/10.1162/tacl_a_00423
Journal volume & issue: Vol. 9
pp. 1213 – 1232

Abstract

Read online

AbstractWe study controllable text summarization, which allows users to gain control on a particular attribute (e.g., length limit) of the generated summaries. In this work, we propose a novel training framework based on Constrained Markov Decision Process (CMDP), which conveniently includes a reward function along with a set of constraints, to facilitate better summarization control. The reward function encourages the generation to resemble the human-written reference, while the constraints are used to explicitly prevent the generated summaries from violating user-imposed requirements. Our framework can be applied to control important attributes of summarization, including length, covered entities, and abstractiveness, as we devise specific constraints for each of these aspects. Extensive experiments on popular benchmarks show that our CMDP framework helps generate informative summaries while complying with a given attribute’s requirement.1

Published in Transactions of the Association for Computational Linguistics

ISSN: 2307-387X (Online)
Publisher: The MIT Press
Country of publisher: United States
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing
Website: https://direct.mit.edu/tacl

About the journal