Cancer Informatics (Jan 2009)

Semantic Web-Based Integration of Cancer Pathways and Allele Frequency Data

  • Kenneth K. Kidd,
  • Kei-Hoi Cheung,
  • Hongyu Zhao,
  • E. Holford,
  • Haseena Rajeevan

Journal volume & issue
Vol. 8, no. Semantic Technologie
pp. 19 – 30

Abstract

Read online

We demonstrate the use of Semantic Web technology to integrate the ALFRED allele frequency database and the Starpath pathway resource. The linking of population-specific genotype data with cancer-related pathway data is potentially useful given the growing interest in personalized medicine and the exploitation of pathway knowledge for cancer drug discovery. We model our data using the Web Ontology Language (OWL), drawing upon ideas from existing standard formats BioPAX for pathway data and PML for allele frequency data. We store our data within an Oracle database, using Oracle Semantic Technologies. We then query the data using Oracle’s rule-based inference engine and SPARQL-like RDF query language. The ability to perform queries across the domains of population genetics and pathways offers the potential to answer a number of cancer-related research questions. Among the possibilities is the ability to identify genetic variants which are associated with cancer pathways and whose frequency varies significantly between ethnic groups. This sort of information could be useful for designing clinical studies and for providing background data in personalized medicine. It could also assist with the interpretation of genetic analysis results such as those from genome-wide association studies.

Keywords