BMC Medical Genomics (Dec 2017)

Probability-based collaborative filtering model for predicting gene–disease associations

  • Xiangxiang Zeng,
  • Ningxiang Ding,
  • Alfonso Rodríguez-Patón,
  • Quan Zou

DOI
https://doi.org/10.1186/s12920-017-0313-y
Journal volume & issue
Vol. 10, no. S5
pp. 45 – 53

Abstract

Read online

Abstract Background Accurately predicting pathogenic human genes has been challenging in recent research. Considering extensive gene–disease data verified by biological experiments, we can apply computational methods to perform accurate predictions with reduced time and expenses. Methods We propose a probability-based collaborative filtering model (PCFM) to predict pathogenic human genes. Several kinds of data sets, containing data of humans and data of other nonhuman species, are integrated in our model. Firstly, on the basis of a typical latent factorization model, we propose model I with an average heterogeneous regularization. Secondly, we develop modified model II with personal heterogeneous regularization to enhance the accuracy of aforementioned models. In this model, vector space similarity or Pearson correlation coefficient metrics and data on related species are also used. Results We compared the results of PCFM with the results of four state-of-arts approaches. The results show that PCFM performs better than other advanced approaches. Conclusions PCFM model can be leveraged for predictions of disease genes, especially for new human genes or diseases with no known relationships.

Keywords