Applied Sciences (Nov 2022)

Mixup Based Cross-Consistency Training for Named Entity Recognition

  • Geonsik Youn,
  • Bohan Yoon,
  • Seungbin Ji,
  • Dahee Ko,
  • Jongtae Rhee

DOI
https://doi.org/10.3390/app122111084
Journal volume & issue
Vol. 12, no. 21
p. 11084

Abstract

Named Entity Recognition (NER) is at the core of natural language understanding. The quality and amount of datasets determine the performance of deep-learning-based NER models. Because NER datasets require token-level or word-level labels, annotating them is expensive and time-consuming. To reduce the effort of manual annotation, many prior studies have used weak supervision for NER tasks. However, using weak supervision directly hinders the training of deep networks because the automatically annotated labels contain a lot of noise. In this study, we propose a framework for better training deep models for NER tasks with weakly labeled data. The proposed framework stems from the idea that mixup, recently considered a data augmentation strategy, would be an obstacle to deep model training for NER tasks. Inspired by this idea, we use mixup as a perturbation function for consistency regularization, one of the semi-supervised learning strategies. To support our idea, we conducted several experiments on NER benchmarks. The experimental results show that directly applying mixup to NER tasks hinders deep model training, while also demonstrating that the proposed framework achieves improved performance compared to training on only a small amount of human-annotated data.
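The sketch below illustrates, in broad strokes, how mixup can act as a perturbation function inside a token-level consistency-regularization loss rather than as a label-mixing data augmentation. It is not the authors' implementation: the model interface (a Hugging Face-style `inputs_embeds` argument), the Beta(α, α) sampling, and all function and variable names are assumptions made for illustration only.

```python
import torch
import torch.nn.functional as F
from torch.distributions import Beta


def mixup_consistency_loss(model, emb_a, emb_b, mask_a, mask_b, alpha=0.4):
    """Hypothetical sketch: mixup as a perturbation for consistency training.

    emb_a, emb_b: token embeddings of two weakly labeled sentences,
                  shape (batch, seq_len, hidden), padded to the same length.
    mask_a, mask_b: boolean attention masks of shape (batch, seq_len).
    """
    # Pseudo-label distributions from the unperturbed inputs (no gradient),
    # so the consistency target does not drift with the perturbed branch.
    with torch.no_grad():
        p_a = F.softmax(model(inputs_embeds=emb_a).logits, dim=-1)
        p_b = F.softmax(model(inputs_embeds=emb_b).logits, dim=-1)

    # Sample the mixup coefficient and interpolate embeddings and targets.
    lam = Beta(alpha, alpha).sample().item()
    mixed_emb = lam * emb_a + (1.0 - lam) * emb_b
    mixed_target = lam * p_a + (1.0 - lam) * p_b

    # Prediction on the mixed (perturbed) input.
    log_q = F.log_softmax(model(inputs_embeds=mixed_emb).logits, dim=-1)

    # Token-level KL divergence, averaged over positions valid in both inputs.
    token_kl = F.kl_div(log_q, mixed_target, reduction="none").sum(-1)
    mask = (mask_a & mask_b).float()
    return (token_kl * mask).sum() / mask.sum().clamp(min=1.0)
```

In this reading, the loss encourages the model's prediction on the mixed embeddings to stay consistent with the interpolated pseudo-label distributions of the two source sentences, which is the consistency-regularization role the abstract attributes to mixup.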

Keywords