PLoS ONE (Jan 2016)

iFORM: Incorporating Find Occurrence of Regulatory Motifs.

  • Chao Ren,
  • Hebing Chen,
  • Bite Yang,
  • Feng Liu,
  • Zhangyi Ouyang,
  • Xiaochen Bo,
  • Wenjie Shu

DOI
https://doi.org/10.1371/journal.pone.0168607
Journal volume & issue
Vol. 11, no. 12
p. e0168607

Abstract

Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present Incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Performance assessments with both a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher's combined probability test. We have used iFORM to provide accurate results on a variety of data from the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating the individual roles of functional elements. Both the source code and binaries for iFORM are freely available at https://github.com/wenjiegroup/iFORM. The TF binding sites identified with iFORM across human cell and tissue types have been deposited in the Gene Expression Omnibus under accession ID GSE53962.
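
As an illustration of the combination step named above, the following is a minimal sketch of Fisher's combined probability test. It is not taken from the iFORM source. The function name `fisher_combined_p` and the example p-values are hypothetical, and the sketch assumes the per-program p-values are independent, which may hold only approximately for programs scanning the same sequence. Fisher's statistic is X = -2 Σ ln(p_i), which follows a chi-square distribution with 2k degrees of freedom under the null hypothesis. Because the degrees of freedom are even, the tail probability has a closed form and needs only the standard library.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p-values with Fisher's method.

    Hypothetical helper for illustration; not part of iFORM.
    """
    if not p_values:
        raise ValueError("need at least one p-value")
    if any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")

    k = len(p_values)
    # Half of Fisher's statistic: X / 2 = -sum(ln p_i).
    half_stat = -sum(math.log(p) for p in p_values)

    # Chi-square survival function with 2k degrees of freedom:
    # exp(-X/2) * sum_{i=0}^{k-1} (X/2)^i / i!
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return min(1.0, math.exp(-half_stat) * total)


if __name__ == "__main__":
    # Illustrative p-values, one per scanning program (assumed, not real data).
    per_program = [0.01, 0.03, 0.20, 0.04, 0.008]
    print(fisher_combined_p(per_program))
```

A candidate site that several programs score as moderately significant receives a small combined p-value even if no single program is decisive, which is the intuition behind integrating multiple scanners.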