Academic Commons

Articles

Novel computational analysis of protein binding array data identifies direct targets of Nkx2.2 in the pancreas

Hill, Jonathon; Anderson, Keith; Mastracci, Teresa; Kaestner, Klaus; Sussel, Lori

The creation of a complete genome-wide map of transcription factor binding sites is essential for understanding gene regulatory networks in vivo. However, current prediction methods generally rely on statistical models that imperfectly model transcription factor binding. Generation of new prediction methods that are based on protein binding data, but do not rely on these models may improve prediction sensitivity and specificity. We propose a method for predicting transcription factor binding sites in the genome by directly mapping data generated from protein binding microarrays (PBM) to the genome and calculating a moving average of several overlapping octamers. Using this unique algorithm, we predicted binding sites for the essential pancreatic islet transcription factor Nkx2.2 in the mouse genome and confirmed >90% of the tested sites by EMSA and ChIP. Scores generated from this method more accurately predicted relative binding affinity than PWM based methods. We have also identified an alternative core sequence recognized by the Nkx2.2 homeodomain. Furthermore, we have shown that this method correctly identified binding sites in the promoters of two critical pancreatic islet β-cell genes, NeuroD1 and insulin2, that were not predicted by traditional methods. Finally, we show evidence that the algorithm can also be applied to predict binding sites for the nuclear receptor Hnf4α. PBM-mapping is an accurate method for predicting Nkx2.2 binding sites and may be widely applicable for the creation of genome-wide maps of transcription factor binding sites.

Files

  • thumnail for 1471-2105-12-62-S1.PDF 1471-2105-12-62-S1.PDF binary/octet-stream 23.5 KB Download File
  • thumnail for 1471-2105-12-62-S4.PDF 1471-2105-12-62-S4.PDF binary/octet-stream 29.8 KB Download File
  • thumnail for 1471-2105-12-62-S5.PDF 1471-2105-12-62-S5.PDF binary/octet-stream 5.52 MB Download File
  • thumnail for 1471-2105-12-62-S6.PDF 1471-2105-12-62-S6.PDF binary/octet-stream 47.3 KB Download File
  • thumnail for 1471-2105-12-62-S7.PDF 1471-2105-12-62-S7.PDF binary/octet-stream 40.5 KB Download File
  • thumnail for 547168fdf745640005bb46c064a3b4cc.zip 547168fdf745640005bb46c064a3b4cc.zip binary/octet-stream 8.87 MB Download File

Also Published In

Title
BMC Bioinformatics
DOI
https://doi.org/10.1186/1471-2105-12-62

More About This Work

Academic Units
Genetics and Development
Publisher
BioMed Central
Published Here
September 8, 2014
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.