Comparison of MetaMap and cTAKES for entity extraction in clinical notes

Ruth Reátegui; Sylvie Ratté

doi:10.1186/s12911-018-0654-2

BMC Medical Informatics and Decision Making (Sep 2018)

Affiliations

Ruth Reátegui: École de technologie supérieure
Sylvie Ratté: École de technologie supérieure

Journal volume & issue: Vol. 18, no. S3, pp. 13-19

Abstract

Background: Clinical notes such as discharge summaries have a semi-structured or unstructured format. These documents contain information about diseases, treatments, drugs, etc. Their narrative format makes extracting meaningful information from them challenging. In this context, we aimed to compare how well two tools, MetaMap and cTAKES, automatically extract medical entities.

Methods: We worked with the i2b2 (Informatics for Integrating Biology to the Bedside) Obesity Challenge data. Two experiments were constructed. In the first, only a single UMLS concept related to the annotated diseases was extracted. In the second, some UMLS concepts were aggregated.

Results: Results were evaluated against manually annotated medical entities. Results improved with the aggregation process. MetaMap had an average of 0.88 in recall, 0.89 in precision, and 0.88 in F-score. With cTAKES, the average recall, precision, and F-score were 0.91, 0.89, and 0.89, respectively.

Conclusions: Aggregating concepts (with similar and different semantic types) was shown to be a good strategy for improving the extraction of medical entities, and automatic aggregation could be considered in future work.
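The evaluation described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the note names, the placeholder concept identifiers (CUI_A, CUI_B, CUI_C), and the helper functions are all hypothetical, and the real experiments used MetaMap and cTAKES output on the i2b2 data. The sketch only shows how precision, recall, and F-score are computed against manual annotations, and why accepting several aggregated concepts for one disease can raise recall.

```python
from typing import Dict, Set, Tuple


def precision_recall_f1(gold: Set[str], predicted: Set[str]) -> Tuple[float, float, float]:
    """Compare predicted notes with manually annotated (gold) notes."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return precision, recall, f1


def notes_with_disease(extracted: Dict[str, Set[str]], concept_ids: Set[str]) -> Set[str]:
    """Return the notes in which at least one accepted concept was extracted."""
    return {note for note, concepts in extracted.items() if concepts & concept_ids}


# Hypothetical tool output: concept identifiers found in each note.
# CUI_A, CUI_B, CUI_C are placeholders, not real UMLS identifiers.
extracted = {
    "note1": {"CUI_A"},
    "note2": {"CUI_B"},
    "note3": {"CUI_C"},
    "note4": {"CUI_A", "CUI_C"},
}

# Hypothetical manual annotation: notes that mention the disease.
gold = {"note1", "note2", "note4"}

# Experiment 1 (assumed setup): a single concept stands for the disease.
single = notes_with_disease(extracted, {"CUI_A"})

# Experiment 2 (assumed setup): several related concepts are aggregated.
aggregated = notes_with_disease(extracted, {"CUI_A", "CUI_B"})

single_scores = precision_recall_f1(gold, single)
aggregated_scores = precision_recall_f1(gold, aggregated)

print("single concept:", single_scores)
print("aggregated concepts:", aggregated_scores)
```

In this toy data the single concept misses note2, so recall is lower than with the aggregated concept set. The numbers are illustrative only and are unrelated to the averages reported in the abstract.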

Published in BMC Medical Informatics and Decision Making

ISSN: 1472-6947 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://bmcmedinformdecismak.biomedcentral.com
