BioData Mining (Jul 2017)

Arete – candidate gene prioritization using biological network topology with additional evidence types

  • Artem Lysenko,
  • Keith Anthony Boroevich,
  • Tatsuhiko Tsunoda

DOI
https://doi.org/10.1186/s13040-017-0141-9
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 12

Abstract

Read online

Abstract Background Refinement of candidate gene lists to select the most promising candidates for further experimental verification remains an essential step between high-throughput exploratory analysis and the discovery of specific causal genes. Given the qualitative and semantic complexity of biological data, successfully addressing this challenge requires development of flexible and interoperable solutions for making the best possible use of the largest possible fraction of all available data. Results We have developed an easily accessible framework that links two established network-based gene prioritization approaches with a supporting isolation forest-based integrative ranking method. The defining feature of the method is that both topological information of the biological networks and additional sources of evidence can be considered at the same time. The implementation was realized as an app extension for the Cytoscape graph analysis suite, and therefore can further benefit from the synergy with other analysis methods available as part of this system. Conclusions We provide efficient reference implementations of two popular gene prioritization algorithms – DIAMOnD and random walk with restart for the Cytoscape system. An extension of those methods was also developed that allows outputs of these algorithms to be combined with additional data. To demonstrate the utility of our software, we present two example disease gene prioritization application cases and show how our tool can be used to evaluate these different approaches.

Keywords