PLoS Computational Biology (Jun 2023)

Knowledge graph embedding for profiling the interaction between transcription factors and their target genes.

  • Yang-Han Wu,
  • Yu-An Huang,
  • Jian-Qiang Li,
  • Zhu-Hong You,
  • Peng-Wei Hu,
  • Lun Hu,
  • Victor C M Leung,
  • Zhi-Hua Du

DOI
https://doi.org/10.1371/journal.pcbi.1011207
Journal volume & issue
Vol. 19, no. 6
p. e1011207

Abstract

Read online

Interactions between transcription factor and target gene form the main part of gene regulation network in human, which are still complicating factors in biological research. Specifically, for nearly half of those interactions recorded in established database, their interaction types are yet to be confirmed. Although several computational methods exist to predict gene interactions and their type, there is still no method available to predict them solely based on topology information. To this end, we proposed here a graph-based prediction model called KGE-TGI and trained in a multi-task learning manner on a knowledge graph that we specially constructed for this problem. The KGE-TGI model relies on topology information rather than being driven by gene expression data. In this paper, we formulate the task of predicting interaction types of transcript factor and target genes as a multi-label classification problem for link types on a heterogeneous graph, coupled with solving another link prediction problem that is inherently related. We constructed a ground truth dataset as benchmark and evaluated the proposed method on it. As a result of the 5-fold cross experiments, the proposed method achieved average AUC values of 0.9654 and 0.9339 in the tasks of link prediction and link type classification, respectively. In addition, the results of a series of comparison experiments also prove that the introduction of knowledge information significantly benefits to the prediction and that our methodology achieve state-of-the-art performance in this problem.