Predicting transcription factor specificity with all-atom models

The binding of a transcription factor (TF) to a DNA operator site can initiate or repress the expression of a gene. Computational prediction of sites recognized by a TF has traditionally relied upon knowledge of several cognate sites, rather than an ab initio approach. Here, we examine the possibili...

Full description

Bibliographic Details
Main Authors: Virnau, Peter (Author), Rahi, Sahand Jamal (Contributor), Mirny, Leonid A. (Contributor), Kardar, Mehran (Contributor)
Other Authors: Harvard University- (Contributor), Massachusetts Institute of Technology. Department of Physics (Contributor)
Format: Article
Language:English
Published: Oxford University Press (OUP), 2012-05-25T19:57:05Z.
Subjects:
Online Access:Get fulltext
Description
Summary:The binding of a transcription factor (TF) to a DNA operator site can initiate or repress the expression of a gene. Computational prediction of sites recognized by a TF has traditionally relied upon knowledge of several cognate sites, rather than an ab initio approach. Here, we examine the possibility of using structure-based energy calculations that require no knowledge of bound sites but rather start with the structure of a protein-DNA complex. We study the PurR Escherichia coli TF, and explore to which extent atomistic models of protein-DNA complexes can be used to distinguish between cognate and noncognate DNA sites. Particular emphasis is placed on systematic evaluation of this approach by comparing its performance with bioinformatic methods, by testing it against random decoys and sites of homologous TFs. We also examine a set of experimental mutations in both DNA and the protein. Using our explicit estimates of energy, we show that the specificity for PurR is dominated by direct protein-DNA interactions, and weakly influenced by bending of DNA.
National Science Foundation (U.S.) (Grant DMR-08- 03315)
Deutsche Forschungsgemeinschaft (DFG) (Grant VI237/1)
NEC Research Support Fund
National Institutes of Health. National Centers for Biomedical Computing (Informatics for Integrating Biology and the Bedside)
National Institutes of Health (U.S.) (3U54LM008748-04S1)