Frontiers in Microbiology (Mar 2021)

Finding Colon Cancer- and Colorectal Cancer-Related Microbes Based on Microbe–Disease Association Prediction

  • Yu Chen,
  • Hongjian Sun,
  • Mengzhe Sun,
  • Changguo Shi,
  • Hongmei Sun,
  • Xiaoli Shi,
  • Xiaoli Shi,
  • Binbin Ji,
  • Binbin Ji,
  • Jinpeng Cui

DOI
https://doi.org/10.3389/fmicb.2021.650056
Journal volume & issue
Vol. 12

Abstract

Read online

Microbes are closely associated with the formation and development of diseases. The identification of the potential associations between microbes and diseases can boost the understanding of various complex diseases. Wet experiments applied to microbe–disease association (MDA) identification are costly and time-consuming. In this manuscript, we developed a novel computational model, NLLMDA, to find unobserved MDAs, especially for colon cancer and colorectal carcinoma. NLLMDA integrated negative MDA selection, linear neighborhood similarity, label propagation, information integration, and known biological data. The Gaussian association profile (GAP) similarity of microbes and GAPs similarity and symptom similarity of diseases were firstly computed. Secondly, linear neighborhood method was then applied to the above computed similarity matrices to obtain more stable performance. Thirdly, negative MDA samples were selected, and the label propagation algorithm was used to score for microbe–disease pairs. The final association probabilities can be computed based on the information integration method. NLLMDA was compared with the other five classical MDA methods and obtained the highest area under the curve (AUC) value of 0.9031 and 0.9335 on cross-validations of diseases and microbe–disease pairs. The results suggest that NLLMDA was an effective prediction method. More importantly, we found that Acidobacteriaceae may have a close link with colon cancer and Tannerella may densely associate with colorectal carcinoma.

Keywords