Understanding and Improving Disability Identification in Medical Documents

Hermenegildo Fabregat; Juan Martinez-Romo; Lourdes Araujo

doi:10.1109/ACCESS.2020.3019178

IEEE Access (Jan 2020)

Understanding and Improving Disability Identification in Medical Documents

Hermenegildo Fabregat,
Juan Martinez-Romo,
Lourdes Araujo

Affiliations

Hermenegildo Fabregat: ORCiD; Department of Lenguajes y Sistemas Informáticos, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
Juan Martinez-Romo: ORCiD; Department of Lenguajes y Sistemas Informáticos, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain
Lourdes Araujo: ORCiD; Department of Lenguajes y Sistemas Informáticos, Universidad Nacional de Educación a Distancia (UNED), Madrid, Spain

DOI: https://doi.org/10.1109/ACCESS.2020.3019178
Journal volume & issue: Vol. 8
pp. 155399 – 155408

Abstract

Read online

Disabilities are a problem that affects a large number of people in the world. Gathering information about them is crucial to improve the daily life of the people who suffer from them but, since disabilities are often strongly associated with different types of diseases, the available data are widely dispersed. In this work we review existing proposal for the problem, making an in-depth analysis, and from it we make a proposal that improves the results of previous systems. The analysis focuses on the results of the participants in DIANN shared task was proposed (IberEval 2018), devoted to the detection of named disabilities in electronic documents. In order to evaluate the proposed systems using a common evaluation framework, a corpus of documents, in both English and Spanish, was gathered and annotated. Several teams participated in the task, either using classic methods or proposing specific approaches to deal effectively with the complexities of the task. Our aim is to provide insight for future advances in the field by analyzing the participating systems and identifying the most effective approaches and elements to tackle the problem. We have validated the lessons learned from this analysis through a new proposal that includes the most promising elements used by the participating teams. The proposed system improves, for both languages, the results obtained during the task.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords