Journal of Integrative Bioinformatics (Jun 2011)
Automatic extraction of microorganisms and their habitats from free text using text mining workflows
Abstract
In this paper we illustrate the usage of text mining workflows to automatically extract instances of microorganisms and their habitats from free text; these entries can then be curated and added to different databases. To this end, we use a Conditional Random Field (CRF) based classifier, as part of the workflows, to extract the mention of microorganisms, habitats and the inter-relation between organisms and their habitats.