BMC Bioinformatics (Feb 2013)

An integrated pharmacokinetics ontology and corpus for text mining

  • Wu Heng-Yi,
  • Karnik Shreyas,
  • Subhadarshini Abhinita,
  • Wang Zhiping,
  • Philips Santosh,
  • Han Xu,
  • Chiang Chienwei,
  • Liu Lei,
  • Boustani Malaz,
  • Rocha Luis M,
  • Quinney Sara K,
  • Flockhart David,
  • Li Lang

DOI
https://doi.org/10.1186/1471-2105-14-35
Journal volume & issue
Vol. 14, no. 1
p. 35

Abstract

Read online

Abstract Background Drug pharmacokinetics parameters, drug interaction parameters, and pharmacogenetics data have been unevenly collected in different databases and published extensively in the literature. Without appropriate pharmacokinetics ontology and a well annotated pharmacokinetics corpus, it will be difficult to develop text mining tools for pharmacokinetics data collection from the literature and pharmacokinetics data integration from multiple databases. Description A comprehensive pharmacokinetics ontology was constructed. It can annotate all aspects of in vitro pharmacokinetics experiments and in vivo pharmacokinetics studies. It covers all drug metabolism and transportation enzymes. Using our pharmacokinetics ontology, a PK-corpus was constructed to present four classes of pharmacokinetics abstracts: in vivo pharmacokinetics studies, in vivo pharmacogenetic studies, in vivo drug interaction studies, and in vitro drug interaction studies. A novel hierarchical three level annotation scheme was proposed and implemented to tag key terms, drug interaction sentences, and drug interaction pairs. The utility of the pharmacokinetics ontology was demonstrated by annotating three pharmacokinetics studies; and the utility of the PK-corpus was demonstrated by a drug interaction extraction text mining analysis. Conclusions The pharmacokinetics ontology annotates both in vitro pharmacokinetics experiments and in vivo pharmacokinetics studies. The PK-corpus is a highly valuable resource for the text mining of pharmacokinetics parameters and drug interactions.