IEEE Access (Jan 2023)

Read-All-in-Once (RAiO): Multi-Layer Contextual Architecture for Long-Text Machine Reading Comprehension

  • Tuan-Anh Phan,
  • Jason J. Jung,
  • Khac-Hoai Nam Bui

DOI
https://doi.org/10.1109/ACCESS.2023.3298100
Journal volume & issue
Vol. 11
pp. 77873–77879

Abstract

Machine reading comprehension (MRC) is a cutting-edge technology in natural language processing (NLP) that focuses on teaching machines to read and understand the meaning of texts, driven by the emergence of large-scale datasets and neural network models. Recently, with the successful development of pre-trained transformer models (e.g., BERT), MRC has advanced significantly, surpassing human parity on several public datasets and being applied in various NLP tasks (e.g., QA systems). Nevertheless, long-document MRC remains a challenge since transformer-based models are limited by their input length; for instance, several well-known pre-trained language models such as BERT and RoBERTa are limited to 512 tokens. This study provides a new, simple approach for long-document MRC. Specifically, recent state-of-the-art models follow an architecture with two crucial stages for reading long texts in order to enable local and global context representations. In this study, we present a new architecture that enriches the global information of the context in a single stage by exploiting the interaction between different levels of semantic units of the context (i.e., the sentence and word levels). Accordingly, we name the proposed model RAiO (Read-All-in-Once). For the experiments, we evaluate RAiO on two benchmark long-document MRC datasets, NewsQA and NLQuAD. The results show that the proposed approach is promising compared with strong baselines in this research field.
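As a rough illustration of the single-stage idea described in the abstract, the sketch below lets word-level (local) states attend over sentence-level (global) states via cross-attention, so every token can see document-wide context in one pass rather than through a separate second reading stage. This is a minimal sketch in PyTorch, not the authors' implementation; the module name MultiLevelContextLayer, the random stand-in for sentence embeddings, and all tensor shapes are assumptions made purely for illustration.

```python
# Hypothetical sketch of one-pass word/sentence interaction
# (an illustration of the idea, NOT the RAiO authors' code).
import torch
import torch.nn as nn


class MultiLevelContextLayer(nn.Module):
    """Enrich word-level states with global, sentence-level context."""

    def __init__(self, hidden_size: int = 768, num_heads: int = 8):
        super().__init__()
        # Cross-attention: word states act as queries, sentence states
        # as keys/values, so each token can attend over the whole document.
        self.cross_attn = nn.MultiheadAttention(
            hidden_size, num_heads, batch_first=True
        )
        self.norm = nn.LayerNorm(hidden_size)

    def forward(self, word_states, sent_states):
        # word_states: (batch, num_words, hidden) -- local context
        # sent_states: (batch, num_sents, hidden) -- global context
        attended, _ = self.cross_attn(word_states, sent_states, sent_states)
        # Residual connection preserves the original word-level signal.
        return self.norm(word_states + attended)


if __name__ == "__main__":
    batch, num_words, num_sents, hidden = 2, 1024, 40, 768
    words = torch.randn(batch, num_words, hidden)
    # Stand-in for sentence embeddings, e.g., mean-pooled token states
    # per sentence from any encoder.
    sents = torch.randn(batch, num_sents, hidden)
    layer = MultiLevelContextLayer(hidden)
    enriched = layer(words, sents)
    print(enriched.shape)  # torch.Size([2, 1024, 768])
```

In a full model, the sentence states would typically be pooled from the same encoder's token states, and the enriched word states would feed a standard span-prediction head; the sketch shows only the interaction step between the two semantic levels.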