Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Kamesh Narasimhan,
Samuel A Lambert,
Ally WH Yang,
Jeremy Riddell,
Sanie Mnaimneh,
Hong Zheng,
Mihai Albu,
Hamed S Najafabadi,
John S Reece-Hoyes,
Juan I Fuxman Bass,
Albertha JM Walhout,
Matthew T Weirauch,
Timothy R Hughes
Affiliations
Kamesh Narasimhan
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
Samuel A Lambert
Department of Molecular Genetics, University of Toronto, Toronto, Canada
Ally WH Yang
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
Jeremy Riddell
Department of Molecular and Cellular Physiology, Systems Biology and Physiology Program, University of Cincinnati, Cincinnati, United States
Sanie Mnaimneh
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
Hong Zheng
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
Mihai Albu
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
Hamed S Najafabadi
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
John S Reece-Hoyes
Program in Systems Biology, University of Massachusetts Medical School, Worcester, United States
Juan I Fuxman Bass
Program in Systems Biology, University of Massachusetts Medical School, Worcester, United States
Albertha JM Walhout
Program in Systems Biology, University of Massachusetts Medical School, Worcester, United States
Matthew T Weirauch
Center for Autoimmune Genomics and Etiology, Cincinnati Children's Hospital Medical Center, Cincinnati, United States; Divisions of Biomedical Informatics and Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, United States; Department of Pediatrics, University of Cincinnati, Cincinnati, United States
Timothy R Hughes
Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada; Department of Molecular Genetics, University of Toronto, Toronto, Canada; Canadian Institutes For Advanced Research, Toronto, Canada
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs.