2016 Theses Doctoral
Understanding complex biomolecular systems through the synergy of molecular dynamics simulations, NMR spectroscopy and X-Ray crystallography
Proteins and DNA are essential to life as we know it and understanding their function is understanding their structure and dynamics. The importance of the latter is being appreciated more in recent years and has led to the development of novel interdisciplinary techniques and approaches to studying protein function. Three techniques to study protein structure and dynamics have been used and combined in different ways in the context of this thesis and have led to a better understanding of the three systems described herein.
X-ray crystallography is the oldest and still arguably most popular technique to study macromolecular structures. Nuclear magnetic resonance (NMR) spectroscopy is a not much younger technique that is a powerful tool not only to probe molecular structure but also dynamics. The last technique described herein are molecular dynamics (MD) simulations, which are only just growing out of their infancy. MD simulations are computer simulations of macromolecules based on structures solved by X-ray crystallography or NMR spectroscopy, that can give mechanistic insight into dynamic processes of macromolecules whose amplitudes can be estimated by the former two techniques.
MD simulations of the model protein GB3 (B3 immunoglobulin-binding domain of streptococcal protein G) were conducted to identify origins of discrepancies between order parameters derived from different sets of MD simulations and NMR relaxation experiments.The results highlight the importance of time scales as well as sampling when comparing MD simulations to NMR experiments. Discrepancies are seen for unstructured regions like loops and termini and often correspond to nanosecond time scale transitions between conformational substates that are either over- or undersampled in simulation. Sampling biases can be somewhat remedied by running longer (microsecond time scale) simulations. However, some discrepancies persist over even very long trajectories. We show that these discrepancies can be due to the choice of the starting structure and more specifically even differences in protonation procedures. A test for convergence on the nanosecond time scale is shown to be able to correct for many of the observed discrepancies.
Next, MD simulations were used to predict in vitro thermostability of members of the bacterial Ribonuclease HI (RNase H) family of endonucleases. Thermodynamic stability is a central requirement for protein function and a goal of protein engineering is improvement of stability, particularly for applications in biotechnology. The temperature dependence of the generalized order parameter, S, for four RNase H homologs, from psychrotrophic, mesophilic and thermophilic organisms, is highly correlated with experimentally determined melting temperatures and with calculated free energies of folding at the midpoint temperature of the simulations. This study provides an approach for in silico mutational screens to improve thermostability of biologically and industrially relevant enzymes.
Lastly, we used a combination of X-ray crystallography, NMR spectroscopy and MD simulations to study specificity of the interaction between Drosophila Hox proteins and their DNA target sites. Hox proteins are transcription factors specifying segment identity during embryogenesis of bilaterian animals. The DNA binding homeodomains have been shown to confer specificity to the different Hox paralogs, while being very similar in sequence and structure. Our results underline earlier findings about the importance of the N-terminal arm and linker region of Hox homeodomains, the cofactor Exd, as well as DNA shape, for specificity. A comparison of predicted DNA shapes based on sequence alone with the shapes observed for different DNA target sequences in four crystal structures when in complex with the Drosophila Hox protein AbdB and the cofactor Exd, shows that a combined ”induced fit”/”conformational selection” mechanism is the most likely mechanism by which Hox homeodomains recognize DNA shape and achieve specificity.
The minor groove widths for all sequences is close to identical for all ternary complexes found in the different crystal structures, whereas predicted shapes vary between the different DNA sequences. The sequences that have shown higher affinity to AbdB in vitro have a predicted DNA shape that matches the observed DNA shape in the ternary complexes more closely than the sequences that show low in vitro affinity to AbdB. This strongly suggests that the AbdB-Exd complex selects DNA sequences with a higher propensity to adopt the final shape in their unbound form, leading to higher affinity.
An additional AbdB monomer binding site with a strongly preformed binding competent shape is observed for one of the oligomers in the reverse complement strand of one of the canonical (weak) Hox-Exd complex binding site. The shape preference seems strong enough for AbdB monomer binding to compete with AbdB-Exd dimer binding to that same oligomer, suggested by the presence of both binding modes in the same crystal. The monomer binding site is essentially able to compete with the dimer binding site, even though binding with the cofactor is not possible, because its shape is very close to the ideal shape.
A comparison of different crystal structures solved herein and in the literature as well as a set of molecular dynamics simulations was performed and led to insights about the importance of residues in the Hox N-terminal arm for the preference of certain Hox paralogs to certain DNA shapes. Taken together all these insights contribute to our understanding of Hox specificity in particular as well as protein-DNA interactions in general.
- Zeiske_columbia_0054D_13121.pdf binary/octet-stream 17.2 MB Download File
More About This Work
- Academic Units
- Biochemistry and Molecular Biophysics
- Thesis Advisors
- Palmer III, Arthur G.
- Ph.D., Columbia University
- Published Here
- January 29, 2016