51
|
Voelz VA, Zhou G. Bayesian inference of conformational state populations from computational models and sparse experimental observables. J Comput Chem 2014; 35:2215-24. [PMID: 25250719 DOI: 10.1002/jcc.23738] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2014] [Revised: 08/25/2014] [Accepted: 08/31/2014] [Indexed: 12/29/2022]
Abstract
We present a Bayesian inference approach to estimating conformational state populations from a combination of molecular modeling and sparse experimental data. Unlike alternative approaches, our method is designed for use with small molecules and emphasizes high-resolution structural models, using inferential structure determination with reference potentials, and Markov Chain Monte Carlo to sample the posterior distribution of conformational states. As an application of the method, we determine solution-state conformational populations of the 14-membered macrocycle cineromycin B, using a combination of previously published sparse Nuclear Magnetic Resonance (NMR) observables and replica-exchange molecular dynamic/Quantum Mechanical (QM)-refined conformational ensembles. Our results agree better with experimental data compared to previous modeling efforts. Bayes factors are calculated to quantify the consistency of computational modeling with experiment, and the relative importance of reference potentials and other model parameters.
Collapse
|
52
|
Razavi AM, Wuest WM, Voelz VA. Computational screening and selection of cyclic peptide hairpin mimetics by molecular simulation and kinetic network models. J Chem Inf Model 2014; 54:1425-32. [PMID: 24754484 DOI: 10.1021/ci500102y] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Designing peptidomimetic compounds to have a preorganized structure in solution is highly nontrivial. To show how simulation-based approaches can help speed this process, we performed an extensive simulation study of designed cyclic peptide mimics of a β-hairpin from bacterial protein LapD involved in a protein-protein interaction (PPI) pertinent to bacterial biofilm formation. We used replica exchange molecular dynamics (REMD) simulation to screen 20 covalently cross-linked designs with varying stereochemistry and selected the most favorable of these for massively parallel simulation on Folding@home in explicit solvent. Markov state models (MSMs) built from the trajectory data reveal how subtle chemical modifications can have a significant effect on conformational populations, leading to the overall stabilization of the target structure. In particular, we identify a key steric interaction between a methyl substituent and a valine side chain that acts to allosterically shift population between native and near-native states, which could be exploited in future designs. Visualization of this mechanism is aided considerably by the tICA method, which identifies degrees of freedom most important in slow conformational transitions. The combination of quantitative detail and human comprehension provided by MSMs suggests such approaches will be increasingly useful for design.
Collapse
|
53
|
Kortkhonjia E, Brandman R, Zhou JZ, Voelz VA, Chorny I, Kabakoff B, Patapoff TW, Dill KA, Swartz TE. Probing antibody internal dynamics with fluorescence anisotropy and molecular dynamics simulations. MAbs 2013; 5:306-22. [PMID: 23396076 PMCID: PMC3893241 DOI: 10.4161/mabs.23651] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The solution dynamics of antibodies are critical to antibody function. We explore the internal solution dynamics of antibody molecules through the combination of time-resolved fluorescence anisotropy experiments on IgG1 with more than two microseconds of all-atom molecular dynamics (MD) simulations in explicit water, an order of magnitude more than in previous simulations. We analyze the correlated motions with a mutual information entropy quantity, and examine state transition rates in a Markov-state model, to give coarse-grained descriptors of the motions. Our MD simulations show that while there are many strongly correlated motions, antibodies are highly flexible, with Fab and Fc domains constantly forming and breaking contacts, both polar and non-polar. We find that salt bridges break and reform, and not always with the same partners. While the MD simulations in explicit water give the right time scales for the motions, the simulated motions are about 3-fold faster than the experiments. Overall, the picture that emerges is that antibodies do not simply fluctuate around a single state of atomic contacts. Rather, in these large molecules, different atoms come in contact during different motions.
Collapse
|
54
|
Voelz VA, Jäger M, Yao S, Chen Y, Zhu L, Waldauer SA, Bowman GR, Friedrichs M, Bakajin O, Lapidus LJ, Weiss S, Pande VS. Slow unfolded-state structuring in Acyl-CoA binding protein folding revealed by simulation and experiment. J Am Chem Soc 2012; 134:12565-77. [PMID: 22747188 DOI: 10.1021/ja302528z] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Protein folding is a fundamental process in biology, key to understanding many human diseases. Experimentally, proteins often appear to fold via simple two- or three-state mechanisms involving mainly native-state interactions, yet recent network models built from atomistic simulations of small proteins suggest the existence of many possible metastable states and folding pathways. We reconcile these two pictures in a combined experimental and simulation study of acyl-coenzyme A binding protein (ACBP), a two-state folder (folding time ~10 ms) exhibiting residual unfolded-state structure, and a putative early folding intermediate. Using single-molecule FRET in conjunction with side-chain mutagenesis, we first demonstrate that the denatured state of ACBP at near-zero denaturant is unusually compact and enriched in long-range structure that can be perturbed by discrete hydrophobic core mutations. We then employ ultrafast laminar-flow mixing experiments to study the folding kinetics of ACBP on the microsecond time scale. These studies, along with Trp-Cys quenching measurements of unfolded-state dynamics, suggest that unfolded-state structure forms on a surprisingly slow (~100 μs) time scale, and that sequence mutations strikingly perturb both time-resolved and equilibrium smFRET measurements in a similar way. A Markov state model (MSM) of the ACBP folding reaction, constructed from over 30 ms of molecular dynamics trajectory data, predicts a complex network of metastable stables, residual unfolded-state structure, and kinetics consistent with experiment but no well-defined intermediate preceding the main folding barrier. Taken together, these experimental and simulation results suggest that the previously characterized fast kinetic phase is not due to formation of a barrier-limited intermediate but rather to a more heterogeneous and slow acquisition of unfolded-state structure.
Collapse
|
55
|
Voelz VA, Pande VS. Calculation of rate spectra from noisy time series data. Proteins 2012; 80:342-51. [PMID: 22095854 PMCID: PMC3291796 DOI: 10.1002/prot.23171] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Revised: 08/03/2011] [Accepted: 08/05/2011] [Indexed: 11/07/2022]
Abstract
As the resolution of experiments to measure folding kinetics continues to improve, it has become imperative to avoid bias that may come with fitting data to a predetermined mechanistic model. Toward this end, we present a rate spectrum approach to analyze timescales present in kinetic data. Computing rate spectra of noisy time series data via numerical discrete inverse Laplace transform is an ill-conditioned inverse problem, so a regularization procedure must be used to perform the calculation. Here, we show the results of different regularization procedures applied to noisy multiexponential and stretched exponential time series, as well as data from time-resolved folding kinetics experiments. In each case, the rate spectrum method recapitulates the relevant distribution of timescales present in the data, with different priors on the rate amplitudes naturally corresponding to common biases toward simple phenomenological models. These results suggest an attractive alternative to the "Occam's razor" philosophy of simply choosing models with the fewest number of relaxation rates.
Collapse
|
56
|
Voelz VA, Dill KA, Chorny I. Peptoid conformational free energy landscapes from implicit-solvent molecular simulations in AMBER. Biopolymers 2012; 96:639-50. [PMID: 21184487 DOI: 10.1002/bip.21575] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
To test the accuracy of existing AMBER force field models in predicting peptoid conformation and dynamics, we simulated a set of model peptoid molecules recently examined by Butterfoss et al. (JACS 2009, 131, 16798-16807) using QM methods as well as three peptoid sequences with experimentally determined structures. We found that AMBER force fields, when used with a Generalized Born/Surface Area (GBSA) implicit solvation model, could accurately reproduce the peptoid torsional landscape as well as the major conformers of known peptoid structures. Enhanced sampling by replica exchange molecular dynamics (REMD) using temperatures from 300 to 800 K was used to sample over cis-trans isomerization barriers. Compared to (Nrch)5 and cyclo-octasarcosyl, the free energy of N-(2-nitro-3-hydroxyl phenyl)glycine-N-(phenyl)glycine has the most "foldable" free energy landscape, due to deep trans-amide minima dictated by N-aryl sidechains. For peptoids with (S)-N (1-phenylethyl) (Nspe) side chains, we observe a discrepancy in backbone dihedral propensities between molecular simulations and QM calculations, which may be due to force field effects or the inability to capture n --> n* interactions. For these residues, an empirical phi-angle biasing potential can "rescue" the backbone propensities seen in QM. This approach can serve as a general strategy for addressing force fields without resorting to a complete reparameterization. Overall, this study demonstrates the utility of implicit-solvent REMD simulations for efficient sampling to predict peptoid conformational landscapes, providing a potential tool for first-principles design of sequences with specific folding properties.
Collapse
|
57
|
Lane TJ, Bowman GR, Beauchamp K, Voelz VA, Pande VS. Markov state model reveals folding and functional dynamics in ultra-long MD trajectories. J Am Chem Soc 2011; 133:18413-9. [PMID: 21988563 DOI: 10.1021/ja207470h] [Citation(s) in RCA: 131] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Two strategies have been recently employed to push molecular simulation to long, biologically relevant time scales: projection-based analysis of results from specialized hardware producing a small number of ultralong trajectories and the statistical interpretation of massive parallel sampling performed with Markov state models (MSMs). Here, we assess the MSM as an analysis method by constructing a Markov model from ultralong trajectories, specifically two previously reported 100 μs trajectories of the FiP35 WW domain (Shaw, D. E. Science 2010, 330, 341-346). We find that the MSM approach yields novel insights. It discovers new statistically significant folding pathways, in which either beta-hairpin of the WW domain can form first. The rates of this process approach experimental values in a direct quantitative comparison (time scales of 5.0 μs and 100 ns), within a factor of ∼2. Finally, the hub-like topology of the MSM and identification of a holo conformation predicts how WW domains may function through a conformational selection mechanism.
Collapse
|
58
|
Bowman GR, Voelz VA, Pande VS. Atomistic folding simulations of the five-helix bundle protein λ(6−85). J Am Chem Soc 2011; 133:664-7. [PMID: 21174461 DOI: 10.1021/ja106936n] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Protein folding is a classic grand challenge that is relevant to numerous human diseases, such as protein misfolding diseases like Alzheimer’s disease. Solving the folding problem will ultimately require a combination of theory, simulation, and experiment, with theory and simulation providing an atomically detailed picture of both the thermodynamics and kinetics of folding and experimental tests grounding these models in reality. However, theory and simulation generally fall orders of magnitude short of biologically relevant time scales. Here we report significant progress toward closing this gap: an atomistic model of the folding of an 80-residue fragment of the λ repressor protein with explicit solvent that captures dynamics on a 10 milliseconds time scale. In addition, we provide a number of predictions that warrant further experimental investigation. For example, our model’s native state is a kinetic hub, and biexponential kinetics arises from the presence of many free-energy basins separated by barriers of different heights rather than a single low barrier along one reaction coordinate (the previously proposed incipient downhill folding scenario).
Collapse
|
59
|
Bowman GR, Voelz VA, Pande VS. Taming the complexity of protein folding. Curr Opin Struct Biol 2011; 21:4-11. [PMID: 21081274 PMCID: PMC3042729 DOI: 10.1016/j.sbi.2010.10.006] [Citation(s) in RCA: 137] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2010] [Revised: 10/21/2010] [Accepted: 10/24/2010] [Indexed: 12/01/2022]
Abstract
Protein folding is an important problem in structural biology with significant medical implications, particularly for misfolding disorders like Alzheimer's disease. Solving the folding problem will ultimately require a combination of theory and experiment, with theoretical models providing a comprehensive view of folding and experiments grounding these models in reality. Here we review progress towards this goal over the past decade, with an emphasis on recent theoretical advances that are empowering chemically detailed models of folding and the new results these technologies are providing. In particular, we discuss new insights made possible by Markov state models (MSMs), including the role of non-native contacts and the hub-like character of protein folded states.
Collapse
|
60
|
Chen Y, Voelz VA, Bakajin O, Pande VS, Lapidus LJ. Diffusion of Unfolded Acyl-Coenzyme A-Binding Protein Over a Complete Range of Denaturant. Biophys J 2011. [DOI: 10.1016/j.bpj.2010.12.1362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
|
61
|
Voelz VA, Jäger M, Zhu L, Yao S, Bakajin O, Weiss S, Lapidus LJ, Pande VS. Markov State Models of Millisecond Folder ACBP Reveals New Views of the Folding Reaction. Biophys J 2011. [DOI: 10.1016/j.bpj.2010.12.3015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
62
|
Zhu L, Voelz VA, Bakajin O, Pande VS, Lapidus LJ. Revealing the Early Events of ACBP Folding by Ultrarapid Mixing. Biophys J 2011. [DOI: 10.1016/j.bpj.2010.12.2353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
|
63
|
Voelz VA, Singh VR, Wedemeyer WJ, Lapidus LJ, Pande VS. Unfolded-state dynamics and structure of protein L characterized by simulation and experiment. J Am Chem Soc 2010; 132:4702-9. [PMID: 20218718 DOI: 10.1021/ja908369h] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
While several experimental techniques now exist for characterizing protein unfolded states, all-atom simulation of unfolded states has been challenging due to the long time scales and conformational sampling required. We address this problem by using a combination of accelerated calculations on graphics processor units and distributed computing to simulate tens of thousands of molecular dynamics trajectories each up to approximately 10 mus (for a total aggregate simulation time of 127 ms). We used this approach in conjunction with Trp-Cys contact quenching experiments to characterize the unfolded structure and dynamics of protein L. We employed a polymer theory method to make quantitative comparisons between high-temperature simulated and chemically denatured experimental ensembles and find that reaction-limited quenching rates calculated from simulation agree remarkably well with experiment. In both experiment and simulation, we find that unfolded-state intramolecular diffusion rates are very slow compared to highly denatured chains and that a single-residue mutation can significantly alter unfolded-state dynamics and structure. This work suggests a view of the unfolded state in which surprisingly low diffusion rates could limit folding and opens the door for all-atom molecular simulation to be a useful predictive tool for characterizing protein unfolded states along with experiments that directly measure intramolecular diffusion.
Collapse
|
64
|
Voelz VA, Bowman GR, Beauchamp K, Pande VS. Molecular simulation of ab initio protein folding for a millisecond folder NTL9(1-39). J Am Chem Soc 2010; 132:1526-8. [PMID: 20070076 DOI: 10.1021/ja9090353] [Citation(s) in RCA: 380] [Impact Index Per Article: 27.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
To date, the slowest-folding proteins folded ab initio by all-atom molecular dynamics simulations have had folding times in the range of nanoseconds to microseconds. We report simulations of several folding trajectories of NTL9(1-39), a protein which has a folding time of approximately 1.5 ms. Distributed molecular dynamics simulations in implicit solvent on GPU processors were used to generate ensembles of trajectories out to approximately 40 micros for several temperatures and starting states. At a temperature less than the melting point of the force field, we observe a small number of productive folding events, consistent with predictions from a model of parallel uncoupled two-state simulations. The posterior distribution of the folding rate predicted from the data agrees well with the experimental folding rate (approximately 640/s). Markov State Models (MSMs) built from the data show a gap in the implied time scales indicative of two-state folding and heterogeneous pathways connecting diffuse mesoscopic substates. Structural analysis of the 14 out of 2000 macrostates transited by the top 10 folding pathways reveals that native-like pairing between strands 1 and 2 only occurs for macrostates with p(fold) > 0.5, suggesting beta(12) hairpin formation may be rate-limiting. We believe that using simulation data such as these to seed adaptive resampling simulations will be a promising new method for achieving statistically converged descriptions of folding landscapes at longer time scales than ever before.
Collapse
|
65
|
Voelz VA, Luttmann E, Bowman GR, Pande VS. Probing the nanosecond dynamics of a designed three-stranded beta-sheet with a massively parallel molecular dynamics simulation. Int J Mol Sci 2009; 10:1013-30. [PMID: 19399235 PMCID: PMC2672016 DOI: 10.3390/ijms10031013] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2009] [Revised: 03/04/2009] [Accepted: 03/09/2009] [Indexed: 11/16/2022] Open
Abstract
Recently a temperature-jump FTIR study of a designed three-stranded sheet showing a fast relaxation time of approximately 140 +/- 20 ns was published. We performed massively parallel molecular dynamics simulations in explicit solvent to probe the structural events involved in this relaxation. While our simulations produce similar relaxation rates, the structural ensemble is broad. We observe the formation of turn structure, but only very weak interaction in the strand regions, which is consistent with the lack of strong backbone-backbone NOEs in previous structural NMR studies. These results suggest that either (D)P(D)P-II folds at time scales longer than 240 ns, or that (D)P(D)P-II is not a well-defined three-stranded beta-sheet. This work also provides an opportunity to compare the performance of several popular forcefield models against one another.
Collapse
|
66
|
Voelz VA, Shell MS, Dill KA. Predicting peptide structures in native proteins from physical simulations of fragments. PLoS Comput Biol 2009; 5:e1000281. [PMID: 19197352 PMCID: PMC2629132 DOI: 10.1371/journal.pcbi.1000281] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2008] [Accepted: 12/17/2008] [Indexed: 11/25/2022] Open
Abstract
It has long been proposed that much of the information encoding how a protein folds is contained locally in the peptide chain. Here we present a large-scale simulation study designed to examine the extent to which conformations of peptide fragments in water predict native conformations in proteins. We perform replica exchange molecular dynamics (REMD) simulations of 872 8-mer, 12-mer, and 16-mer peptide fragments from 13 proteins using the AMBER 96 force field and the OBC implicit solvent model. To analyze the simulations, we compute various contact-based metrics, such as contact probability, and then apply Bayesian classifier methods to infer which metastable contacts are likely to be native vs. non-native. We find that a simple measure, the observed contact probability, is largely more predictive of a peptide's native structure in the protein than combinations of metrics or multi-body components. Our best classification model is a logistic regression model that can achieve up to 63% correct classifications for 8-mers, 71% for 12-mers, and 76% for 16-mers. We validate these results on fragments of a protein outside our training set. We conclude that local structure provides information to solve some but not all of the conformational search problem. These results help improve our understanding of folding mechanisms, and have implications for improving physics-based conformational sampling and structure prediction using all-atom molecular simulations. Proteins must fold to unique native structures in order to perform their functions. To do this, proteins must solve a complicated conformational search problem, the details of which remain difficult to study experimentally. Predicting folding pathways and the mechanisms by which proteins fold is thus central to understanding how proteins work. One longstanding question is the extent to which proteins solve the search problem locally, by folding into sub-structures that are dictated primarily by local sequence. Here, we address this question by conducting a large-scale molecular dynamics simulation study of protein fragments in water. The simulation data was then used to optimize a statistical model that predicted native and non-native contacts. The performance of the resulting model suggests that local structuring provides some but not all of the information to solve the folding problem, and that molecular dynamics simulation of fragments can be useful for protein structure prediction and design.
Collapse
|
67
|
Dill KA, Ozkan SB, Weikl TR, Chodera JD, Voelz VA. The protein folding problem: when will it be solved? Curr Opin Struct Biol 2007; 17:342-6. [PMID: 17572080 DOI: 10.1016/j.sbi.2007.06.001] [Citation(s) in RCA: 163] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2007] [Revised: 04/11/2007] [Accepted: 06/06/2007] [Indexed: 01/29/2023]
Abstract
The protein folding problem can be viewed as three different problems: defining the thermodynamic folding code; devising a good computational structure prediction algorithm; and answering Levinthal's question regarding the kinetic mechanism of how proteins can fold so quickly. Once regarded as a grand challenge, protein folding has seen much progress in recent years. Folding codes are now being used to successfully design proteins and non-biological foldable polymers; aided by the Critical Assessment of Techniques for Structure Prediction (CASP) competition, protein structure prediction has now become quite good. Even the once-challenging Levinthal puzzle now seems to have an answer--a protein can avoid searching irrelevant conformations and fold quickly by making local independent decisions first, followed by non-local global decisions later.
Collapse
|
68
|
Abstract
It has been proposed that proteins fold by a process called "Zipping and Assembly" (Z&A). Zipping refers to the growth of local substructures within the chain, and assembly refers to the coming together of already-formed pieces. Our interest here is in whether Z&A is a general method that can fold most of sequence space, to global minima, efficiently. Using the HP model, we can address this question by enumerating full conformation and sequence spaces. We find that Z&A reaches the global energy minimum native states, even though it searches only a very small fraction of conformational space, for most sequences in the full sequence space. We find that Z&A, a mechanism-based search, is more efficient in our tests than the replica exchange search method. Folding efficiency is increased for chains having: (a) small loop-closure steps, consistent with observations by Plaxco et al. 1998;277;985-994 that folding rates correlate with contact order, (b) neither too few nor too many nucleation sites per chain, and (c) assembly steps that do not occur too early in the folding process. We find that the efficiency increases with chain length, although our range of chain lengths is limited. We believe these insights may be useful for developing faster protein conformational search algorithms.
Collapse
|
69
|
Sullivan DC, Aynechi T, Voelz VA, Kuntz ID. Information content of molecular structures. Biophys J 2003; 85:174-90. [PMID: 12829474 PMCID: PMC1303075 DOI: 10.1016/s0006-3495(03)74464-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2002] [Accepted: 03/13/2003] [Indexed: 11/21/2022] Open
Abstract
For a completely enumerated set of conformers of a macromolecule or for exhaustive lattice walks of model polymers it is straightforward to use Shannon information theory to deduce the information content of the ensemble. It is also practicable to develop numerical measures of the information content of sets of exact distance constraints applied to specific conformational ensembles. We examine the effects of experimental uncertainties by considering "noisy" constraints. The introduction of noise requires additional assumptions about noise distribution and conformational clustering protocols that make the problem of measuring information content more complex. We make use of a standard concept in communication theory, the "noise sphere," to link uncertainty in measurements to information loss. Most of our numerical results are derived from two-dimensional lattice ensembles. Expressing results in terms of information per degree of freedom removes almost all of the chain length dependence. We also explore off-lattice polyalanine chains that yield surprisingly similar results.
Collapse
|