Journal of Hebei University of Science and Technology (Dec 2020)

Khmer multi-document extractive summarization method based on hierarchical maximal marginal relevance

  • Zhaolin ZENG,
  • Xin YAN,
  • Bingbing YU,
  • Feng ZHOU,
  • Guangyi XU

DOI
https://doi.org/10.7535/hbkd.2020yx06005
Journal volume & issue
Vol. 41, no. 6
pp. 508 – 517

Abstract

Read online

In order to solve the problem of ineffective utilization of the semantic information between documents in the traditional multi-document extractive summarization method and the excessive redundant content in the summary result, a Khmer multi-document extractive summarization method based on hierarchical maximal marginal relevance(MMR)was proposed. Firstly, the Khmer multi-document text was input into the trained deep learning model to extract all the single-document summaries. Then, all single document summaries were iteratively merged according to a similar hierarchical waterfall method, and the improved MMR algorithm was used to reasonably select summary sentences to obtain the final multi-document summary. The experimental results show that the R1, R2, R3, RL values of the Khmer multi-document summary obtained by using the deep learning method combined with the hierarchical MMR algorithm increases by 4.31%, 5.33%, 645% and 4.26% respectively compared with other methods. The Khmer multi-document extractive summarization method based on hierarchical MMR can effectively improve the quality of Khmer multi-document summary while ensuring the diversity and difference of the summary sentences.

Keywords