IET Systems Biology (Feb 2024)

Machine learning unveils RNA polymerase II binding as a predictor for SMAD2‐dependent transcription dynamics in response to Activin signalling

  • Dan Shi,
  • Weihua Feng,
  • Zhike Zi

DOI
https://doi.org/10.1049/syb2.12085
Journal volume & issue
Vol. 18, no. 1
pp. 14 – 22

Abstract

The transforming growth factor‐β (TGF‐β) superfamily, including Nodal and Activin, plays a critical role in various cellular processes. Understanding the intricate regulation and gene expression dynamics of TGF‐β signalling is of interest due to its diverse biological roles. A machine learning approach is used to predict gene expression patterns induced by Activin using features such as histone modifications, RNA polymerase II binding, SMAD2‐binding, and mRNA half‐life. RNA sequencing and ChIP sequencing datasets were analysed and differentially expressed SMAD2‐binding genes were identified. These genes were classified into activated and repressed categories based on their expression patterns. The predictive power of individual features and their combinations was evaluated using logistic regression models and their performances were assessed. Results showed that RNA polymerase II binding was the most informative feature for predicting the expression patterns of SMAD2‐binding genes. The authors provide insights into the interplay between transcriptional regulation and Activin signalling and offer a computational framework for predicting gene expression patterns in response to cell signalling.
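
The evaluation strategy described in the abstract, fitting a logistic regression model for each feature combination and comparing performance, can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The feature names (`pol2_binding`, `histone_mark`, `mrna_half_life`), the gradient-descent fit, the train/test split and the use of AUC as the performance measure are all assumptions made for illustration. The data are synthetic and deliberately constructed so that the `pol2_binding` column carries the signal, so the printed scores demonstrate the procedure only and say nothing about the study's results.

```python
import itertools
import math
import random

random.seed(0)  # fixed seed so the synthetic example is reproducible

# Hypothetical feature names; the study derives its features from ChIP-seq and RNA-seq data.
FEATURES = ["pol2_binding", "histone_mark", "mrna_half_life"]


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, lr=0.5, epochs=200):
    """Fit logistic regression by batch gradient descent; returns (weights, bias)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(X, w, b):
    """Predicted probability of the 'activated' class for each row of X."""
    return [sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) for xi in X]


def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count as 0.5)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Synthetic "genes": label 1 = activated, 0 = repressed.
# ASSUMPTION: labels are generated from pol2_binding plus noise, by construction.
genes, labels = [], []
for _ in range(400):
    row = {name: random.gauss(0.0, 1.0) for name in FEATURES}
    activated = row["pol2_binding"] + 0.5 * random.gauss(0.0, 1.0) > 0
    genes.append(row)
    labels.append(1 if activated else 0)

train, test = slice(0, 300), slice(300, 400)

# Fit one model per non-empty feature combination and score it on held-out genes.
results = {}
for k in range(1, len(FEATURES) + 1):
    for combo in itertools.combinations(FEATURES, k):
        X = [[g[name] for name in combo] for g in genes]
        w, b = fit_logistic(X[train], labels[train])
        results[combo] = auc(predict(X[test], w, b), labels[test])

for combo, score in sorted(results.items(), key=lambda kv: -kv[1]):
    print(f"{' + '.join(combo):45s} AUC = {score:.2f}")
```

Ranking the combinations by held-out score is one simple way to judge which feature is most informative. With real data, cross-validation and an established library implementation would be the more usual choice.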

Keywords