Exploring semantic deep learning for building reliable and reusable one health knowledge from PubMed systematic reviews and veterinary clinical notes

Mercedes Arguello-Casteleiro; Robert Stevens; Julio Des-Diz; Chris Wroe; Maria Jesus Fernandez-Prieto; Nava Maroto; Diego Maseda-Fernandez; George Demetriou; Simon Peters; Peter-John M. Noble; Phil H. Jones; Jo Dukes-McEwan; Alan D. Radford; John Keane; Goran Nenadic

doi:10.1186/s13326-019-0212-6

Journal of Biomedical Semantics (Nov 2019)

Exploring semantic deep learning for building reliable and reusable one health knowledge from PubMed systematic reviews and veterinary clinical notes

Mercedes Arguello-Casteleiro,
Robert Stevens,
Julio Des-Diz,
Chris Wroe,
Maria Jesus Fernandez-Prieto,
Nava Maroto,
Diego Maseda-Fernandez,
George Demetriou,
Simon Peters,
Peter-John M. Noble,
Phil H. Jones,
Jo Dukes-McEwan,
Alan D. Radford,
John Keane,
Goran Nenadic

Affiliations

Mercedes Arguello-Casteleiro: School of Computer Science, University of Manchester
Robert Stevens: School of Computer Science, University of Manchester
Julio Des-Diz: Hospital do Salnés, Villagarcía de Arousa
Chris Wroe: BMJ, Tavistock Square
Maria Jesus Fernandez-Prieto: Salford Languages, University of Salford
Nava Maroto: Departamento de Lingüística Aplicada a la Ciencia y a la Tecnología, Universidad Politécnica de Madrid
Diego Maseda-Fernandez: Midcheshire Hospital Foundation Trust, NHS England
George Demetriou: School of Computer Science, University of Manchester
Simon Peters: School of Social Sciences, University of Manchester
Peter-John M. Noble: Small Animal Veterinary Surveillance Network, University of Liverpool
Phil H. Jones: Small Animal Veterinary Surveillance Network, University of Liverpool
Jo Dukes-McEwan: Small Animal Teaching Hospital, University of Liverpool
Alan D. Radford: Small Animal Veterinary Surveillance Network, University of Liverpool
John Keane: School of Computer Science, University of Manchester
Goran Nenadic: School of Computer Science, University of Manchester

DOI: https://doi.org/10.1186/s13326-019-0212-6
Journal volume & issue: Vol. 10, no. S1
pp. 1 – 28

Abstract

Read online

Abstract Background Deep Learning opens up opportunities for routinely scanning large bodies of biomedical literature and clinical narratives to represent the meaning of biomedical and clinical terms. However, the validation and integration of this knowledge on a scale requires cross checking with ground truths (i.e. evidence-based resources) that are unavailable in an actionable or computable form. In this paper we explore how to turn information about diagnoses, prognoses, therapies and other clinical concepts into computable knowledge using free-text data about human and animal health. We used a Semantic Deep Learning approach that combines the Semantic Web technologies and Deep Learning to acquire and validate knowledge about 11 well-known medical conditions mined from two sets of unstructured free-text data: 300 K PubMed Systematic Review articles (the PMSB dataset) and 2.5 M veterinary clinical notes (the VetCN dataset). For each target condition we obtained 20 related clinical concepts using two deep learning methods applied separately on the two datasets, resulting in 880 term pairs (target term, candidate term). Each concept, represented by an n-gram, is mapped to UMLS using MetaMap; we also developed a bespoke method for mapping short forms (e.g. abbreviations and acronyms). Existing ontologies were used to formally represent associations. We also create ontological modules and illustrate how the extracted knowledge can be queried. The evaluation was performed using the content within BMJ Best Practice. Results MetaMap achieves an F measure of 88% (precision 85%, recall 91%) when applied directly to the total of 613 unique candidate terms for the 880 term pairs. When the processing of short forms is included, MetaMap achieves an F measure of 94% (precision 92%, recall 96%). Validation of the term pairs with BMJ Best Practice yields precision between 98 and 99%. Conclusions The Semantic Deep Learning approach can transform neural embeddings built from unstructured free-text data into reliable and reusable One Health knowledge using ontologies and content from BMJ Best Practice.

Published in Journal of Biomedical Semantics

ISSN: 2041-1480 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://jbiomedsem.biomedcentral.com

About the journal

Abstract

Keywords