BMC Genomics (Aug 2021)

CeTF: an R/Bioconductor package for transcription factor co-expression networks using regulatory impact factors (RIF) and partial correlation and information (PCIT) analysis

  • Carlos Alberto Oliveira de Biagi,
  • Ricardo Perecin Nociti,
  • Danielle Barbosa Brotto,
  • Breno Osvaldo Funicheli,
  • Patrícia de Cássia Ruy,
  • João Paulo Bianchi Ximenez,
  • David Livingstone Alves Figueiredo,
  • Wilson Araújo Silva

DOI
https://doi.org/10.1186/s12864-021-07918-2
Journal volume & issue
Vol. 22, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Background Finding meaningful gene-gene interaction and the main Transcription Factors (TFs) in co-expression networks is one of the most important challenges in gene expression data mining. Results Here, we developed the R package “CeTF” that integrates the Partial Correlation with Information Theory (PCIT) and Regulatory Impact Factors (RIF) algorithms applied to gene expression data from microarray, RNA-seq, or single-cell RNA-seq platforms. This approach allows identifying the transcription factors most likely to regulate a given network in different biological systems — for example, regulation of gene pathways in tumor stromal cells and tumor cells of the same tumor. This pipeline can be easily integrated into the high-throughput analysis. To demonstrate the CeTF package application, we analyzed gastric cancer RNA-seq data obtained from TCGA (The Cancer Genome Atlas) and found the HOXB3 gene as the second most relevant TFs with a high regulatory impact (TFs-HRi) regulating gene pathways in the cell cycle. Conclusion This preliminary finding shows the potential of CeTF to list master regulators of gene networks. CeTF was designed as a user-friendly tool that provides many highly automated functions without requiring the user to perform many complicated processes. It is available on Bioconductor ( http://bioconductor.org/packages/CeTF ) and GitHub ( http://github.com/cbiagii/CeTF ).

Keywords