Cancer Informatics (Sep 2018)

Using Semantic Web Technologies to Enable Cancer Genomics Discovery at Petabyte Scale

  • Jovan Cejovic,
  • Jelena Radenkovic,
  • Vladimir Mladenovic,
  • Adam Stanojevic,
  • Milica Miletic,
  • Stevan Radanovic,
  • Dragan Bajcic,
  • Dragan Djordjevic,
  • Filip Jelic,
  • Milos Nesic,
  • Jessica Lau,
  • Patrick Grady,
  • Nick Groves-Kirkby,
  • Deniz Kural,
  • Brandi Davis-Dusenbery

DOI
https://doi.org/10.1177/1176935118774787
Journal volume & issue
Vol. 17

Abstract


Increased efforts in cancer genomics research and bioinformatics are producing tremendous amounts of data. These data are diverse in origin, format, and content. As the amount of available sequencing data increases, technologies that make them discoverable and usable are critically needed. In response, we have developed a Semantic Web–based Data Browser, a tool that allows users to visually build and execute ontology-driven queries. This approach simplifies access to available data and improves the process of using them in analyses on the Seven Bridges Cancer Genomics Cloud (CGC; www.cancergenomicscloud.org). The Data Browser makes large data sets easily explorable and simplifies the retrieval of specific data of interest. Although initially implemented on top of The Cancer Genome Atlas (TCGA) data set, the Data Browser’s architecture allows for seamless integration of other data sets. By deploying it on the CGC, we have enabled remote researchers to access data and perform collaborative investigations.