VPN: Variation on Prompt Tuning for Named-Entity Recognition

Niu Hu; Xuan Zhou; Bing Xu; Hanqing Liu; Xiangjin Xie; Hai-Tao Zheng

doi:10.3390/app13148359

Applied Sciences (Jul 2023)

VPN: Variation on Prompt Tuning for Named-Entity Recognition

Niu Hu,
Xuan Zhou,
Bing Xu,
Hanqing Liu,
Xiangjin Xie,
Hai-Tao Zheng

Affiliations

Niu Hu: Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
Xuan Zhou: PAII Inc., Palo Alto, CA 94306, USA
Bing Xu: Ping An Technology (Shenzhen) Co., Ltd., Shenzhen 518063, China
Hanqing Liu: Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
Xiangjin Xie: Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
Hai-Tao Zheng: Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China

DOI: https://doi.org/10.3390/app13148359
Journal volume & issue: Vol. 13, no. 14
p. 8359

Abstract

Read online

Recently, prompt-based methods have achieved a promising performance in many natural language processing benchmarks. Despite success in sentence-level classification tasks, prompt-based methods work poorly in token-level tasks, such as named entity recognition (NER), due to the sophisticated design of entity-related templates. Note that the nature of prompt tuning makes full use of the parameters of the mask language model (MLM) head, while previous methods solely utilized the last hidden layer of language models (LMs) and the power of the MLM head is overlooked. In this work, we discovered the characteristics of semantic feature changes in samples after being processed using MLMs. Based on this characteristic, we designed a prompt-tuning variant for NER tasks. We let the pre-trained model predict the label words derived from the training dataset at each position and fed the generated logits (non-normalized probability) to the CRF layer. We evaluated our method on three popular datasets, and the experiments showed that our proposed method outperforms the state-of-the-art model in all three Chinese datasets.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords