Multi-Task Learning for Abstractive and Extractive Summarization

Yangbin Chen; Yun Ma; Xudong Mao; Qing Li

doi:10.1007/s41019-019-0087-7

Data Science and Engineering (Apr 2019)

Multi-Task Learning for Abstractive and Extractive Summarization

Yangbin Chen,
Yun Ma,
Xudong Mao,
Qing Li

Affiliations

Yangbin Chen: City University of Hong Kong
Yun Ma: City University of Hong Kong
Xudong Mao: The Hong Kong Polytechnic University
Qing Li: The Hong Kong Polytechnic University

DOI: https://doi.org/10.1007/s41019-019-0087-7
Journal volume & issue: Vol. 4, no. 1
pp. 14 – 23

Abstract

Read online

Abstract The abstractive method and extractive method are two main approaches for automatic document summarization. In this paper, to fully integrate the relatedness and advantages of both approaches, we propose a general unified framework for abstractive summarization which incorporates extractive summarization as an auxiliary task. In particular, our framework is composed of a shared hierarchical document encoder, a hierarchical attention mechanism-based decoder, and an extractor. We adopt multi-task learning method to train these two tasks jointly, which enables the shared encoder to better capture the semantics of the document. Moreover, as our main task is abstractive summarization, we constrain the attention learned in the abstractive task with the labels of the extractive task to strengthen the consistency between the two tasks. Experiments on the CNN/DailyMail dataset demonstrate that both the auxiliary task and the attention constraint contribute to improve the performance significantly, and our model is comparable to the state-of-the-art abstractive models. In addition, we cut half number of labels of the extractive task, pretrain the extractor, and jointly train the two tasks using the estimated sentence salience of the extractive task to constrain the attention of the abstractive task. The results do not decrease much compared with using full-labeled data of the auxiliary task.

Published in Data Science and Engineering

ISSN: 2364-1185 (Print); 2364-1541 (Online)
Publisher: SpringerOpen
Country of publisher: Germany
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.springer.com/41019

About the journal

Abstract

Keywords