InfAcrOnt: calculating cross-ontology term similarities using information flow by a random walk

Liang Cheng; Yue Jiang; Hong Ju; Jie Sun; Jiajie Peng; Meng Zhou; Yang Hu

doi:10.1186/s12864-017-4338-6

BMC Genomics (Jan 2018)

Affiliations

Liang Cheng: College of Bioinformatics Science and Technology, Harbin Medical University
Yue Jiang: Hospital for Sick Children
Hong Ju: Department of Information Engineering, Heilongjiang Biological Science and Technology Career Academy
Jie Sun: College of Bioinformatics Science and Technology, Harbin Medical University
Jiajie Peng: School of Computer Science, Northwestern Polytechnical University
Meng Zhou: College of Bioinformatics Science and Technology, Harbin Medical University
Yang Hu: School of Life Science and Technology, Harbin Institute of Technology

Journal volume & issue: Vol. 19, no. S1
pp. 125 – 134

Abstract

Background: Since the establishment of the first biomedical ontology, the Gene Ontology (GO), the number of biomedical ontologies has increased dramatically. Over 300 ontologies have now been built, including the extensively used Disease Ontology (DO) and Human Phenotype Ontology (HPO). Because it can reveal novel relationships between terms, calculating similarity between ontology terms is one of the major tasks in this research area. Although similarities between terms within a single ontology have been studied with in silico methods, term similarities across different ontologies have not been investigated as deeply. The latest method took advantage of a gene functional interaction network (GFIN) to explore such inter-ontology term similarities. However, it used only the gene interactions themselves and failed to make full use of the connectivity among the gene nodes of the network. In addition, all existing methods are designed specifically for GO, and their performance on the wider ontology community remains unknown.

Results: We propose InfAcrOnt, a method that infers similarities between terms across ontologies using the entire GFIN. InfAcrOnt builds a term-gene-gene network comprising ontology annotations and the GFIN, and obtains similarities between terms across ontologies by modeling the information flow within the network with a random walk. In our benchmark experiments on sub-ontologies of GO, InfAcrOnt achieves a high average area under the receiver operating characteristic curve (AUC) (0.9322 and 0.9309) with low standard deviations (1.8746e-6 and 3.0977e-6) on both the human and yeast benchmark datasets, exhibiting superior performance. In addition, comparisons of InfAcrOnt results with prior knowledge on pair-wise DO-HPO terms and pair-wise DO-GO terms show high correlations.

Conclusions: The experimental results show that InfAcrOnt significantly improves the performance of inferring similarities between terms across ontologies on the benchmark sets.
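The idea of scoring cross-ontology terms by information flow over a term-gene-gene network can be illustrated with a minimal sketch. The code below is not the InfAcrOnt implementation: it uses a generic random walk with restart, and the restart probability, the unit edge weights, and every term and gene label are hypothetical choices made only for illustration. The abstract does not specify the exact walk formulation.

```python
from collections import defaultdict


def add_edge(network, u, v, weight=1.0):
    """Add an undirected, weighted edge between nodes u and v."""
    network[u][v] = weight
    network[v][u] = weight


def random_walk_with_restart(network, seed, restart=0.5, tol=1e-12, max_iter=10_000):
    """Return the stationary visiting probabilities of a walk started at `seed`.

    `restart` is an assumed parameter: at each step the walker jumps back
    to the seed with this probability, otherwise it moves to a neighbour
    chosen in proportion to the edge weight.
    """
    if seed not in network:
        raise KeyError(f"seed node {seed!r} is not in the network")
    nodes = list(network)
    # Total edge weight leaving each node, used to normalise transitions.
    out_weight = {u: sum(network[u].values()) for u in nodes}
    start = {u: (1.0 if u == seed else 0.0) for u in nodes}
    prob = dict(start)
    for _ in range(max_iter):
        nxt = {u: restart * start[u] for u in nodes}
        for u in nodes:
            for v, w in network[u].items():
                nxt[v] += (1.0 - restart) * prob[u] * w / out_weight[u]
        change = sum(abs(nxt[u] - prob[u]) for u in nodes)
        prob = nxt
        if change < tol:
            break
    return prob


# Toy term-gene-gene network (all identifiers are made up).
network = defaultdict(dict)
# Term-gene edges stand in for ontology annotations.
add_edge(network, "DO:toy_disease", "gene1")
add_edge(network, "DO:toy_disease", "gene2")
add_edge(network, "HPO:toy_phenotype_near", "gene2")
add_edge(network, "HPO:toy_phenotype_near", "gene3")
add_edge(network, "HPO:toy_phenotype_far", "gene4")
# Gene-gene edges stand in for the functional interaction network.
add_edge(network, "gene1", "gene3")
add_edge(network, "gene3", "gene4")

scores = random_walk_with_restart(network, "DO:toy_disease")
for term in ("HPO:toy_phenotype_near", "HPO:toy_phenotype_far"):
    print(term, scores[term])
```

In this toy network the phenotype term that shares an annotated gene with the disease term receives more of the walk's probability mass than the term reachable only through a chain of gene-gene interactions, which is the intuition behind using connectivity among gene nodes rather than direct interactions alone.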

Published in BMC Genomics

ISSN: 1471-2164 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Technology: Chemical technology: Biotechnology; Science: Biology (General): Genetics
Website: http://bmcgenomics.biomedcentral.com
