An integrated pharmacokinetics ontology and corpus for text mining

Wu Heng-Yi; Karnik Shreyas; Subhadarshini Abhinita; Wang Zhiping; Philips Santosh; Han Xu; Chiang Chienwei; Liu Lei; Boustani Malaz; Rocha Luis M; Quinney Sara K; Flockhart David; Li Lang

doi:10.1186/1471-2105-14-35

BMC Bioinformatics (Feb 2013)

An integrated pharmacokinetics ontology and corpus for text mining

Wu Heng-Yi,
Karnik Shreyas,
Subhadarshini Abhinita,
Wang Zhiping,
Philips Santosh,
Han Xu,
Chiang Chienwei,
Liu Lei,
Boustani Malaz,
Rocha Luis M,
Quinney Sara K,
Flockhart David,
Li Lang

Affiliations

Wu Heng-Yi
Karnik Shreyas
Subhadarshini Abhinita
Wang Zhiping
Philips Santosh
Han Xu
Chiang Chienwei
Liu Lei
Boustani Malaz
Rocha Luis M
Quinney Sara K
Flockhart David
Li Lang

DOI: https://doi.org/10.1186/1471-2105-14-35
Journal volume & issue: Vol. 14, no. 1
p. 35

Abstract

Read online

Abstract Background Drug pharmacokinetics parameters, drug interaction parameters, and pharmacogenetics data have been unevenly collected in different databases and published extensively in the literature. Without appropriate pharmacokinetics ontology and a well annotated pharmacokinetics corpus, it will be difficult to develop text mining tools for pharmacokinetics data collection from the literature and pharmacokinetics data integration from multiple databases. Description A comprehensive pharmacokinetics ontology was constructed. It can annotate all aspects of in vitro pharmacokinetics experiments and in vivo pharmacokinetics studies. It covers all drug metabolism and transportation enzymes. Using our pharmacokinetics ontology, a PK-corpus was constructed to present four classes of pharmacokinetics abstracts: in vivo pharmacokinetics studies, in vivo pharmacogenetic studies, in vivo drug interaction studies, and in vitro drug interaction studies. A novel hierarchical three level annotation scheme was proposed and implemented to tag key terms, drug interaction sentences, and drug interaction pairs. The utility of the pharmacokinetics ontology was demonstrated by annotating three pharmacokinetics studies; and the utility of the PK-corpus was demonstrated by a drug interaction extraction text mining analysis. Conclusions The pharmacokinetics ontology annotates both in vitro pharmacokinetics experiments and in vivo pharmacokinetics studies. The PK-corpus is a highly valuable resource for the text mining of pharmacokinetics parameters and drug interactions.

Published in BMC Bioinformatics

ISSN: 1471-2105 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Biology (General)
Website: http://www.biomedcentral.com/bmcbioinformatics/

About the journal