1
|
Abaturov LV, Nosova NG. Structure of crambin in solution, crystal and in the trajectories of molecular dynamics simulations. Biophysics (Nagoya-shi) 2013. [DOI: 10.1134/s0006350913030020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
2
|
Park JK, Jernigan R, Wu Z. Coarse grained normal mode analysis vs. refined Gaussian Network Model for protein residue-level structural fluctuations. Bull Math Biol 2013; 75:124-60. [PMID: 23296997 DOI: 10.1007/s11538-012-9797-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2012] [Accepted: 11/08/2012] [Indexed: 11/26/2022]
Abstract
We investigate several approaches to coarse grained normal mode analysis on protein residual-level structural fluctuations by choosing different ways of representing the residues and the forces among them. Single-atom representations using the backbone atoms C(α), C, N, and C(β) are considered. Combinations of some of these atoms are also tested. The force constants between the representative atoms are extracted from the Hessian matrix of the energy function and served as the force constants between the corresponding residues. The residue mean-square-fluctuations and their correlations with the experimental B-factors are calculated for a large set of proteins. The results are compared with all-atom normal mode analysis and the residue-level Gaussian Network Model. The coarse-grained methods perform more efficiently than all-atom normal mode analysis, while their B-factor correlations are also higher. Their B-factor correlations are comparable with those estimated by the Gaussian Network Model and in many cases better. The extracted force constants are surveyed for different pairs of residues with different numbers of separation residues in sequence. The statistical averages are used to build a refined Gaussian Network Model, which is able to predict residue-level structural fluctuations significantly better than the conventional Gaussian Network Model in many test cases.
Collapse
Affiliation(s)
- Jun-Koo Park
- Department of Mathematics, Iowa State University, Ames, IA 50010, USA.
| | | | | |
Collapse
|
3
|
Huang Y, Bonett S, Kloczkowski A, Jernigan R, Wu Z. P.R.E.S.S.--an R-package for exploring residual-level protein structural statistics. J Bioinform Comput Biol 2012; 10:1242007. [PMID: 22809383 DOI: 10.1142/s0219720012420073] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
P.R.E.S.S. is an R-package developed to allow researchers to get access to and manipulate a large set of statistical data on protein residue-level structural properties such as residue-level virtual bond lengths, virtual bond angles, and virtual torsion angles. A large set of high-resolution protein structures is downloaded and surveyed. Their residue-level structural properties are calculated and documented. The statistical distributions and correlations of these properties can be queried and displayed. Tools are also provided for modeling and analyzing a given structure in terms of its residue-level structural properties. In particular, new tools for computing residue-level statistical potentials and displaying residue-level Ramachandran-like plots are developed for structural analysis and refinement. P.R.E.S.S. has been released in R as an open source software package, with a user-friendly GUI, accessible and executable by a public user in any R environment. P.R.E.S.S. can also be downloaded directly at http://www.math.iastate.edu/press/.
Collapse
Affiliation(s)
- Yuanyuan Huang
- Program on Bioinformatics and Computational Biology, Department of Mathematics, Iowa State University, Ames, Iowa 50011, USA
| | | | | | | | | |
Collapse
|
4
|
Abaturov LV, Nosova NG. Crystallographic and NMR spectroscopic protein structures: Interresidue contacts. Mol Biol 2012. [DOI: 10.1134/s0026893312020021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
5
|
Koppole S, Schaefer M. A discriminative Ramachandran potential of mean force aimed at minimizing secondary structure bias. J Comput Chem 2012; 33:791-9. [DOI: 10.1002/jcc.22908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2011] [Revised: 10/24/2011] [Accepted: 11/20/2011] [Indexed: 11/12/2022]
|
6
|
Abstract
More than 50% of RNA secondary structure is estimated to be A-form helices, which are linked together by various junctions. Here we describe a protocol for computing three interhelical Euler angles describing the relative orientation of helices across RNA junctions. 5' and 3' helices, H1 and H2, respectively, are assigned based on the junction topology. A reference canonical helix is constructed using an appropriate molecular builder software consisting of two continuous idealized A-form helices (iH1 and iH2) with helix axis oriented along the molecular Z-direction running toward the positive direction from iH1 to iH2. The phosphate groups and the carbon and oxygen atoms of the sugars are used to superimpose helix H1 of a target interhelical junction onto the corresponding iH1 of the reference helix. A copy of iH2 is then superimposed onto the resulting H2 helix to generate iH2'. A rotation matrix R is computed, which rotates iH2' into iH2 and expresses the rotation parameters in terms of three Euler angles α(h), β(h) and γ(h). The angles are processed to resolve a twofold degeneracy and to select an overall rotation around the axis of the reference helix. The three interhelical Euler angles define clockwise rotations around the 5' (-γ(h)) and 3' (α(h)) helices and an interhelical bend angle (β(h)). The angles can be depicted graphically to provide a 'Ramachandran'-type view of RNA global structure that can be used to identify unusual conformations as well as to understand variations due to changes in sequence, junction topology and other parameters.
Collapse
|
7
|
Huang Y, Bonett S, Kloczkowski A, Jernigan R, Wu Z. Statistical measures on residue-level protein structural properties. ACTA ACUST UNITED AC 2011; 12:119-36. [PMID: 21452025 DOI: 10.1007/s10969-011-9104-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2010] [Accepted: 03/14/2011] [Indexed: 11/30/2022]
Abstract
The atomic-level structural properties of proteins, such as bond lengths, bond angles, and torsion angles, have been well studied and understood based on either chemistry knowledge or statistical analysis. Similar properties on the residue-level, such as the distances between two residues and the angles formed by short sequences of residues, can be equally important for structural analysis and modeling, but these have not been examined and documented on a similar scale. While these properties are difficult to measure experimentally, they can be statistically estimated in meaningful ways based on their distributions in known proteins structures. Residue-level structural properties including various types of residue distances and angles are estimated statistically. A software package is built to provide direct access to the statistical data for the properties including some important correlations not previously investigated. The distributions of residue distances and angles may vary with varying sequences, but in most cases, are concentrated in some high probability ranges, corresponding to their frequent occurrences in either α-helices or β-sheets. Strong correlations among neighboring residue angles, similar to those between neighboring torsion angles at the atomic-level, are revealed based on their statistical measures. Residue-level statistical potentials can be defined using the statistical distributions and correlations of the residue distances and angles. Ramachandran-like plots for strongly correlated residue angles are plotted and analyzed. Their applications to structural evaluation and refinement are demonstrated. With the increase in both number and quality of known protein structures, many structural properties can be derived from sets of protein structures by statistical analysis and data mining, and these can even be used as a supplement to the experimental data for structure determinations. Indeed, the statistical measures on various types of residue distances and angles provide more systematic and quantitative assessments on these properties, which can otherwise be estimated only individually and qualitatively. Their distributions and correlations in known protein structures show their importance for providing insights into how proteins may fold naturally to various residue-level structures.
Collapse
Affiliation(s)
- Yuanyuan Huang
- Program on Bioinformatics and Computational Biology, Iowa State University, Ames, IA 50014, USA.
| | | | | | | | | |
Collapse
|
8
|
Application of biasing-potential replica-exchange simulations for loop modeling and refinement of proteins in explicit solvent. Proteins 2010; 78:2809-19. [DOI: 10.1002/prot.22796] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
9
|
Rey-Stolle MF, Enciso M, Rey A. Topology-based models and NMR structures in protein folding simulations. J Comput Chem 2009; 30:1212-9. [DOI: 10.1002/jcc.21149] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
10
|
Kirchmair J, Markt P, Distinto S, Schuster D, Spitzer GM, Liedl KR, Langer T, Wolber G. The Protein Data Bank (PDB), its related services and software tools as key components for in silico guided drug discovery. J Med Chem 2009; 51:7021-40. [PMID: 18975926 DOI: 10.1021/jm8005977] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Johannes Kirchmair
- Department of Pharmaceutical Chemistry, Faculty of Chemistry and Pharmacy and Center for Molecular Biosciences, University of Innsbruck, Innrain 52, A-6020 Innsbruck, Austria
| | | | | | | | | | | | | | | |
Collapse
|
11
|
Kloczkowski A, Jernigan RL, Wu Z, Song G, Yang L, Kolinski A, Pokarowski P. Distance matrix-based approach to protein structure prediction. ACTA ACUST UNITED AC 2009; 10:67-81. [PMID: 19224393 DOI: 10.1007/s10969-009-9062-2] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2008] [Accepted: 02/01/2009] [Indexed: 10/21/2022]
Abstract
Much structural information is encoded in the internal distances; a distance matrix-based approach can be used to predict protein structure and dynamics, and for structural refinement. Our approach is based on the square distance matrix D = [r(ij)(2)] containing all square distances between residues in proteins. This distance matrix contains more information than the contact matrix C, that has elements of either 0 or 1 depending on whether the distance r (ij) is greater or less than a cutoff value r (cutoff). We have performed spectral decomposition of the distance matrices D = sigma lambda(k)V(k)V(kT), in terms of eigenvalues lambda kappa and the corresponding eigenvectors v kappa and found that it contains at most five nonzero terms. A dominant eigenvector is proportional to r (2)--the square distance of points from the center of mass, with the next three being the principal components of the system of points. By predicting r (2) from the sequence we can approximate a distance matrix of a protein with an expected RMSD value of about 7.3 A, and by combining it with the prediction of the first principal component we can improve this approximation to 4.0 A. We can also explain the role of hydrophobic interactions for the protein structure, because r is highly correlated with the hydrophobic profile of the sequence. Moreover, r is highly correlated with several sequence profiles which are useful in protein structure prediction, such as contact number, the residue-wise contact order (RWCO) or mean square fluctuations (i.e. crystallographic temperature factors). We have also shown that the next three components are related to spatial directionality of the secondary structure elements, and they may be also predicted from the sequence, improving overall structure prediction. We have also shown that the large number of available HIV-1 protease structures provides a remarkable sampling of conformations, which can be viewed as direct structural information about the dynamics. After structure matching, we apply principal component analysis (PCA) to obtain the important apparent motions for both bound and unbound structures. There are significant similarities between the first few key motions and the first few low-frequency normal modes calculated from a static representative structure with an elastic network model (ENM) that is based on the contact matrix C (related to D), strongly suggesting that the variations among the observed structures and the corresponding conformational changes are facilitated by the low-frequency, global motions intrinsic to the structure. Similarities are also found when the approach is applied to an NMR ensemble, as well as to atomic molecular dynamics (MD) trajectories. Thus, a sufficiently large number of experimental structures can directly provide important information about protein dynamics, but ENM can also provide a similar sampling of conformations. Finally, we use distance constraints from databases of known protein structures for structure refinement. We use the distributions of distances of various types in known protein structures to obtain the most probable ranges or the mean-force potentials for the distances. We then impose these constraints on structures to be refined or include the mean-force potentials directly in the energy minimization so that more plausible structural models can be built. This approach has been successfully used by us in 2006 in the CASPR structure refinement (http://predictioncenter.org/caspR).
Collapse
Affiliation(s)
- Andrzej Kloczkowski
- Laurence H. Baker Center for Bioinformatics and Biological Statistics, Iowa State University, 112 Office and Lab Bldg, Ames, IA 50011-3020, USA.
| | | | | | | | | | | | | |
Collapse
|
12
|
Cui F, Jernigan R, Wu Z. Knowledge-based versus experimentally acquired distance and angle constraints for NMR structure refinement. J Bioinform Comput Biol 2008; 6:283-300. [PMID: 18464323 DOI: 10.1142/s0219720008003448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2007] [Revised: 10/30/2007] [Accepted: 11/17/2007] [Indexed: 11/18/2022]
Abstract
Nuclear Overhauser effects (NOE) distance constraints and torsion angle constraints are major conformational constraints for nuclear magnetic resonance (NMR) structure refinement. In particular, the number of NOE constraints has been considered as an important determinant for the quality of NMR structures. Of course, the availability of torsion angle constraints is also critical for the formation of correct local conformations. In our recent work, we have shown how a set of knowledge-based short-range distance constraints can also be utilized for NMR structure refinement, as a complementary set of conformational constraints to the NOE and torsion angle constraints. In this paper, we show the results from a series of structure refinement experiments by using different types of conformational constraints--NOE, torsion angle, or knowledge-based constraints--or their combinations, and make a quantitative assessment on how the experimentally acquired constraints contribute to the quality of structural models and whether or not they can be combined with or substituted by the knowledge-based constraints. We have carried out the experiments on a small set of NMR structures. Our preliminary calculations have revealed that the torsion angle constraints contribute substantially to the quality of the structures, but require to be combined with the NOE constraints to be fully effective. The knowledge-based constraints can be functionally as crucial as the torsion angle constraints, although they are statistical constraints after all and are not meant to be able to replace the latter.
Collapse
Affiliation(s)
- Feng Cui
- Laboratory of Cell Biology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | |
Collapse
|
13
|
Abstract
Protein Inter-atomic Distance Distributions (PIDD) is a dedicated database and structural bio-informatics system for distance based protein modeling. The database is developed to host and analyze the statistical data for protein inter-atomic distances based on their distributions in databases of known protein structures such as in the Protein Data Bank (PDB). PIDD is capable of generating, caching, and displaying the statistical distributions of the distances of various types and ranges. The collected information can be used to extract geometric restraints or mean-force potentials for protein structure determination including nuclear magnetic resonance structure determination and comparative model refinement. PIDD is supported with a friendly designed web interface so that users can easily specify the distance types and ranges, and retrieve, visualize or download the distributions of the distances as they desire. PIDD is freely accessible at http://www.math.iastate.edu/pidd.
Collapse
Affiliation(s)
| | | | - Robert Jernigan
- Department of Biochemistry, Biophysics, and Molecular Biology, Iowa State UniversityAmes, IA 50011, USA
| | - Zhijun Wu
- Department of Mathematics, Iowa State UniversityAmes, IA 50011, USA
- To whom correspondence should be addressed. Tel: +1 515 294 8165; Fax: +1 515 294 5454;
| |
Collapse
|