Pre-trained language models for keyphrase prediction: A review

Muhammad Umair; Tangina Sultana; Young-Koo Lee

ICT Express (Aug 2024)

Pre-trained language models for keyphrase prediction: A review

Muhammad Umair,
Tangina Sultana,
Young-Koo Lee

Affiliations

Muhammad Umair: Department of Computer Science and Engineering, Kyung Hee University, Global Campus. Yongin-si, South Korea
Tangina Sultana: Department of Computer Science and Engineering, Kyung Hee University, Global Campus. Yongin-si, South Korea; Department of Electronics and Communication Engineering, Hajee Mohammad Danesh science and Technology University, Bangladesh
Young-Koo Lee: Department of Computer Science and Engineering, Kyung Hee University, Global Campus. Yongin-si, South Korea; Corresponding author.

Journal volume & issue: Vol. 10, no. 4
pp. 871 – 890

Abstract

Read online

Keyphrase Prediction (KP) is essential for identifying keyphrases in a document that can summarize its content. However, recent Natural Language Processing (NLP) advances have developed more efficient KP models using deep learning techniques. The limitation of a comprehensive exploration jointly both keyphrase extraction and generation using pre-trained language models spotlights a critical gap in the literature, compelling our survey paper to bridge this deficiency and offer a unified and in-depth analysis to address limitations in previous surveys. This paper extensively examines the topic of pre-trained language models for keyphrase prediction (PLM-KP), which are trained on large text corpora via different learning (supervisor, unsupervised, semi-supervised, and self-supervised) techniques, to provide respective insights into these two types of tasks in NLP, precisely, Keyphrase Extraction (KPE) and Keyphrase Generation (KPG). We introduce appropriate taxonomies for PLM-KPE and KPG to highlight these two main tasks of NLP. Moreover, we point out some promising future directions for predicting keyphrases.

Published in ICT Express

ISSN: 2405-9595 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.journals.elsevier.com/ict-express/

About the journal

Abstract

Keywords