Academic Commons

Articles

Reconstruction of novel transcription factor regulons through inference of their binding sites

Elmas, Abdulkadir; Wang, Xiaodong; Samoilov, Michael S.

Background
In most sequenced organisms the number of known regulatory genes (e.g., transcription factors (TFs)) vastly exceeds the number of experimentally-verified regulons that could be associated with them. At present, identification of TF regulons is mostly done through comparative genomics approaches. Such methods could miss organism-specific regulatory interactions and often require expensive and time-consuming experimental techniques to generate the underlying data.
Results
In this work, we present an efficient algorithm that aims to identify a given transcription factor’s regulon through inference of its unknown binding sites, based on the discovery of its binding motif. The proposed approach relies on computational methods that utilize gene expression data sets and knockout fitness data sets which are available or may be straightforwardly obtained for many organisms. We computationally constructed the profiles of putative regulons for the TFs LexA, PurR and Fur in E. coli K12 and identified their binding motifs. Comparisons with an experimentally-verified database showed high recovery rates of the known regulon members, and indicated good predictions for the newly found genes with high biological significance. The proposed approach is also applicable to novel organisms for predicting unknown regulons of the transcriptional regulators. Results for the hypothetical protein D d e0289 in D. alaskensis include the discovery of a Fis-type TF binding motif.
Conclusions
The proposed motif-based regulon inference approach can discover the organism-specific regulatory interactions on a single genome, which may be missed by current comparative genomics techniques due to their limitations.

Files

  • thumnail for art_3A10.1186_2Fs12859-015-0685-y.pdf art_3A10.1186_2Fs12859-015-0685-y.pdf application/pdf 1.48 MB Download File

Also Published In

Title
BMC Bioinformatics
DOI
https://doi.org/10.1186/s12859-015-0685-y

More About This Work

Academic Units
Electrical Engineering
Published Here
March 30, 2016