Reference Citation Analysis

For: Koehl P, Levitt M. Protein topology and stability define the space of allowed sequences. Proc Natl Acad Sci U S A 2002;99:1280-5. [PMID: 11805293 PMCID: PMC122181 DOI: 10.1073/pnas.032405199] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.3] [Indexed: 11/18/2022] Open

Cited by Other Article(s)

Pereira de Araújo AF. Sequence-dependent and -independent information in a combined random energy model for protein folding and coding. Proteins 2024;92:679-687. [PMID: 38158239 DOI: 10.1002/prot.26658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/07/2023] [Revised: 12/11/2023] [Accepted: 12/15/2023] [Indexed: 01/03/2024]

Abstract

Random energy models (REMs) provide a simple description of the energy landscapes that guide protein folding and evolution. The requirement of a large energy gap between the native structure and unfolded conformations, considered necessary for cooperative, protein-like, folding behavior, indicates that proteins differ markedly from random heteropolymers. It has been suggested, therefore, that natural selection might have acted to choose nonrandom amino acid sequences satisfying this particular condition, implying that a large fraction of possible, unselected random sequences, would not fold to any structure. From an informational perspective, however, this scenario could indicate that protein structures, regarded as messages to be transmitted through a communication channel, would not be efficiently encoded in amino acid sequences, regarded as the communication channel for this transmission, since a large fraction of possible channel states would not be used. Here, we use a combined REM for conformations and sequences, with previously estimated parameters for natural proteins, to explore an alternative possibility in which the appropriate shape of the landscape results mainly from the deviation from randomness of possible native structures instead of sequences. We observe that this situation emerges naturally if the distribution of conformational energies happens to arise from two independent contributions corresponding to sequence-dependent and -independent terms. This construction is consistent with the hypothesis of a protein burial folding code, with native structures being determined by a modest amount of sequence-dependent atomic burial information with sequence-independent constraints imposed by unspecific hydrogen bond formation. 
More generally, an appropriate combination of sequence-dependent and -independent information accommodates the possibility of an efficient structural encoding with the main physical requirement for folding, providing possible insight not only into the folding process but also into several aspects of sequence evolution such as neutral networks, conformational coverage, and de novo gene emergence.

Koch J, Romero-Romero S, Höcker B. Stepwise introduction of stabilizing mutations reveals nonlinear additive effects in de novo TIM barrels. Protein Sci 2024;33:e4926. [PMID: 38380781 PMCID: PMC10880431 DOI: 10.1002/pro.4926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/05/2023] [Revised: 01/29/2024] [Accepted: 01/30/2024] [Indexed: 02/22/2024]

Abstract

Over the past decades, the TIM-barrel fold has served as a model system for the exploration of how changes in protein sequences affect their structural, stability, and functional characteristics, and moreover, how this information can be leveraged to design proteins from the ground up. After numerous attempts to design de novo proteins with this specific fold, sTIM11 was the first validated de novo design of an idealized four-fold symmetric TIM barrel. Subsequent efforts to enhance the stability of this initial design resulted in the development of DeNovoTIMs, a family of de novo TIM barrels with various stabilizing mutations. In this study, we present an investigation into the biophysical and thermodynamic effects upon introducing a varying number of stabilizing mutations per quarter along the sequence of a four-fold symmetric TIM barrel. We compared the base design DeNovoTIM0 without any stabilizing mutations with variants containing mutations in one, two, three, and all four quarters, designated TIM1q, TIM2q, TIM3q, and DeNovoTIM6, respectively. This analysis revealed a stepwise and nonlinear change in the thermodynamic properties that correlated with the number of mutated quarters, suggesting positive nonadditive effects. To shed light on the significance of the location of stabilized quarters, we engineered two variants of TIM2q which contain the same number of mutations but positioned in different quarter locations. Characterization of these TIM2q variants revealed that the mutations exhibit varying effects on the overall protein stability, contingent upon the specific region in which they are introduced. These findings emphasize that the amount and location of stabilized interfaces among the four quarters play a crucial role in shaping the conformational stability of these four-fold symmetric TIM barrels.
Analysis of de novo proteins, as described in this study, enhances our understanding of how sequence variations can finely modulate stability in both naturally occurring and computationally designed proteins.

Vila JA. Protein folding rate evolution upon mutations. Biophys Rev 2023;15:661-669. [PMID: 37681091 PMCID: PMC10480377 DOI: 10.1007/s12551-023-01088-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/10/2023] [Accepted: 06/24/2023] [Indexed: 09/09/2023] Open

van der Linden MG, Ferreira DC, Pereira de Araújo AF. Constrained Layer Assignment for the Protein Burial Folding Code Accounting for Chain Connectivity. J Phys Chem B 2022;126:6159-6170. [PMID: 35952378 DOI: 10.1021/acs.jpcb.2c03931] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/29/2022]

Abstract

The connection between protein sequences and tertiary structures has intrigued investigators for decades. A plausible hypothesis for the coding scheme postulates that atomic burial information obtainable from the sequence could be sufficient for structural determination when combined with sequence-independent constraints. Accordingly, folding simulations using native burial information expressed by atomic central distances, discretized into a small number L of equiprobable burial layers, have indeed been successful in reaching and distinguishing the native structure of several globular proteins. Attempted predictions of layers from sequence, however, turned out to be insufficiently accurate for most proteins. Here we explore the possibility that a nonuniform assignment of layers, which is intended to account for constraints imposed by chain connectivity, might provide a more efficient burial encoding of tertiary structures. We consider the condition that adjacent Cα atoms along the sequence cannot occupy nonadjacent layers, in which case the information required to specify sequences of burials would be smaller. It is shown that appropriate folding behavior can still be observed in this explicitly more constrained scenario with a structure-dependent assignment intended to produce the thinnest possible layers still compatible with the imposed burial constraint. This thinnest assignment turns out to be sufficiently restrictive for the observed examples and provides appropriately thinner layers or, equivalently, a larger number of layers, for examples previously observed to indeed require more restrictive constraints when compared to counterparts of similar size, as well as the appropriate increase in number of layers for larger proteins. Implications for the general understanding of the protein folding code are discussed.

Magi Meconi G, Sasselli IR, Bianco V, Onuchic JN, Coluzza I. Key aspects of the past 30 years of protein design. Rep Prog Phys 2022;85:086601. [PMID: 35704983 DOI: 10.1088/1361-6633/ac78ef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/03/2021] [Accepted: 06/15/2022] [Indexed: 06/15/2023]

About the Protein Space Vastness. Protein J 2020;39:472-475. [PMID: 33130957 DOI: 10.1007/s10930-020-09939-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Accepted: 10/27/2020] [Indexed: 12/24/2022]

Tian P, Best RB. Exploring the sequence fitness landscape of a bridge between protein folds. PLoS Comput Biol 2020;16:e1008285. [PMID: 33048928 PMCID: PMC7553338 DOI: 10.1371/journal.pcbi.1008285] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 05/22/2020] [Accepted: 08/24/2020] [Indexed: 12/15/2022] Open

Loshbaugh AL, Kortemme T. Comparison of Rosetta flexible-backbone computational protein design methods on binding interactions. Proteins 2019;88:206-226. [PMID: 31344278 DOI: 10.1002/prot.25790] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Received: 04/05/2019] [Revised: 07/15/2019] [Accepted: 07/19/2019] [Indexed: 01/03/2023]

Yan Z, Wang J. Superfunneled Energy Landscape of Protein Evolution Unifies the Principles of Protein Evolution, Folding, and Design. Phys Rev Lett 2019;122:018103. [PMID: 31012725 DOI: 10.1103/physrevlett.122.018103] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 12/06/2017] [Revised: 11/08/2018] [Indexed: 06/09/2023]

Ludwiczak J, Jarmula A, Dunin-Horkawicz S. Combining Rosetta with molecular dynamics (MD): A benchmark of the MD-based ensemble protein design. J Struct Biol 2018;203:54-61. [PMID: 29454111 DOI: 10.1016/j.jsb.2018.02.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Received: 11/05/2017] [Revised: 01/25/2018] [Accepted: 02/13/2018] [Indexed: 01/15/2023]

Kinjo AR. Cooperative "folding transition" in the sequence space facilitates function-driven evolution of protein families. J Theor Biol 2018;443:18-27. [PMID: 29355538 DOI: 10.1016/j.jtbi.2018.01.019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 08/27/2017] [Revised: 01/16/2018] [Accepted: 01/17/2018] [Indexed: 12/23/2022]

Tian P, Best RB. How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis. Biophys J 2017;113:1719-1730. [PMID: 29045866 PMCID: PMC5647607 DOI: 10.1016/j.bpj.2017.08.039] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Received: 04/24/2017] [Revised: 08/03/2017] [Accepted: 08/08/2017] [Indexed: 12/23/2022] Open

Tertiary alphabet for the observable protein structural universe. Proc Natl Acad Sci U S A 2016;113:E7438-E7447. [PMID: 27810958 DOI: 10.1073/pnas.1607178113] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Indexed: 11/18/2022] Open

Mignon D, Simonson T. Comparing three stochastic search algorithms for computational protein design: Monte Carlo, replica exchange Monte Carlo, and a multistart, steepest-descent heuristic. J Comput Chem 2016;37:1781-93. [PMID: 27197555 DOI: 10.1002/jcc.24393] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Received: 12/03/2015] [Revised: 02/26/2016] [Accepted: 03/27/2016] [Indexed: 01/11/2023]

Carcamo-Noriega EN, Saab-Rincon G. Identification of fibrillogenic regions in human triosephosphate isomerase. PeerJ 2016;4:e1676. [PMID: 26870617 PMCID: PMC4748702 DOI: 10.7717/peerj.1676] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 11/21/2015] [Accepted: 01/20/2016] [Indexed: 12/31/2022] Open

Abstract

Background. Amyloid secondary structure relies on the intermolecular assembly of polypeptide chains through main-chain interaction. Accordingly, all proteins have the potential to form amyloid structure; nevertheless, in nature only a few proteins aggregate into toxic or functional amyloids. Structural characteristics differ greatly among the amyloid proteins reported, so it has been difficult to link fibrillogenic propensity with structural topology. However, there are ubiquitous topologies not represented in the amyloidome that could be considered amyloid-resistant owing to their structural features, as is the case for the TIM barrel topology.

Methods. This work aimed to study the fibrillogenic propensity of human triosephosphate isomerase (HsTPI) as a model of TIM barrels. To do so, aggregation of HsTPI was evaluated under native-like and destabilizing conditions. Fibrillogenic regions were identified by bioinformatics approaches, protein fragmentation and peptide aggregation.

Results. We identified four fibrillogenic regions in HsTPI, corresponding to the β3, β6, β7 and α8 elements of the TIM barrel. Of these, the β3-strand region (residues 59–66) was highly fibrillogenic. In aggregation assays, HsTPI under native-like conditions led to amorphous assemblies, while under partially denaturing conditions (urea 3.2 M) it formed more structured aggregates. These slightly structured aggregates exhibited residual cross-β structure, as demonstrated by recognition by the WO1 antibody and by ATR-FTIR analysis.

Discussion. Despite the fibrillogenic regions present in HsTPI, the enzyme maintained under native-favoring conditions displayed low fibrillogenic propensity. This amyloid resistance can be attributed to the three-dimensional arrangement of the protein, where β-strands, susceptible to aggregation, are protected in the core of the molecule. Destabilization of the protein structure may expose inner regions, promoting β-aggregation as well as the formation of hydrophobic disordered aggregates, with this last pathway being kinetically favored over the thermodynamically more stable fibril aggregation pathway.

Ferreira DC, van der Linden MG, de Oliveira LC, Onuchic JN, de Araújo AFP. Information and redundancy in the burial folding code of globular proteins within a wide range of shapes and sizes. Proteins 2016;84:515-31. [PMID: 26815167 DOI: 10.1002/prot.24998] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 10/22/2015] [Revised: 12/28/2015] [Accepted: 01/19/2016] [Indexed: 11/09/2022]

Nepal R, Spencer J, Bhogal G, Nedunuri A, Poelman T, Kamath T, Chung E, Kantardjieff K, Gottlieb A, Lustig B. Logistic regression models to predict solvent accessible residues using sequence- and homology-based qualitative and quantitative descriptors applied to a domain-complete X-ray structure learning set. J Appl Crystallogr 2015;48:1976-1984. [PMID: 26664348 PMCID: PMC4665666 DOI: 10.1107/s1600576715018531] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 11/25/2014] [Accepted: 10/03/2015] [Indexed: 11/11/2022] Open

Ollikainen N, Kortemme T. Computational protein design quantifies structural constraints on amino acid covariation. PLoS Comput Biol 2013;9:e1003313. [PMID: 24244128 PMCID: PMC3828131 DOI: 10.1371/journal.pcbi.1003313] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Received: 06/20/2013] [Accepted: 09/20/2013] [Indexed: 02/02/2023] Open

Abstract

Amino acid covariation, where the identities of amino acids at different sequence positions are correlated, is a hallmark of naturally occurring proteins. This covariation can arise from multiple factors, including selective pressures for maintaining protein structure, requirements imposed by a specific function, or from phylogenetic sampling bias. Here we employed flexible backbone computational protein design to quantify the extent to which protein structure has constrained amino acid covariation for 40 diverse protein domains. We find significant similarities between the amino acid covariation in alignments of natural protein sequences and sequences optimized for their structures by computational protein design methods. These results indicate that the structural constraints imposed by protein architecture play a dominant role in shaping amino acid covariation and that computational protein design methods can capture these effects. We also find that the similarity between natural and designed covariation is sensitive to the magnitude and mechanism of backbone flexibility used in computational protein design. Our results thus highlight the necessity of including backbone flexibility to correctly model precise details of correlated amino acid changes and give insights into the pressures underlying these correlations.

Proteins generally fold into specific three-dimensional structures to perform their cellular functions, and the presence of misfolded proteins is often deleterious for cellular and organismal fitness. For these reasons, maintenance of protein structure is thought to be one of the major fitness pressures acting on proteins. Consequently, the sequences of today's naturally occurring proteins contain signatures reflecting the constraints imposed by protein structure. Here we test the ability of computational protein design methods to recapitulate and explain these signatures. We focus on the physical basis of evolutionary pressures that act on interactions between amino acids in folded proteins, which are critical in determining protein structure and function. Such pressures can be observed from the appearance of amino acid covariation, where the amino acids at certain positions in protein sequences are correlated with each other. We find similar patterns of amino acid covariation in natural sequences and sequences optimized for their structures using computational protein design, demonstrating the importance of structural constraints in protein molecular evolution and providing insights into the structural mechanisms leading to covariation. In addition, these results characterize the ability of computational methods to model the precise details of correlated amino acid changes, which is critical for engineering new proteins with useful functions beyond those seen in nature.

Mach P, Koehl P. Capturing protein sequence-structure specificity using computational sequence design. Proteins 2013;81:1556-70. [DOI: 10.1002/prot.24307] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 12/04/2012] [Revised: 03/28/2013] [Accepted: 04/11/2013] [Indexed: 02/05/2023]

Mohanty S, Purwar M, Srinivasan N, Rekha N. Tethering preferences of domain families co-occurring in multi-domain proteins. Mol Biosyst 2013;9:1708-25. [PMID: 23571467 DOI: 10.1039/c3mb25481j] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 12/25/2022]

Zhang J, Grigoryan G. Mining tertiary structural motifs for assessment of designability. Methods Enzymol 2013;523:21-40. [PMID: 23422424 PMCID: PMC4222026 DOI: 10.1016/b978-0-12-394292-0.00002-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Indexed: 02/04/2023]

Diez-García F, Chakrabartty A, González C, Laurents DV. An Arg-rich putative prebiotic protein is as stable as its Lys-rich variant. Arch Biochem Biophys 2012;528:118-26. [DOI: 10.1016/j.abb.2012.09.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 06/22/2012] [Revised: 09/14/2012] [Accepted: 09/15/2012] [Indexed: 10/27/2022]

De novo automated design of small RNA circuits for engineering synthetic riboregulation in living cells. Proc Natl Acad Sci U S A 2012;109:15271-6. [PMID: 22949707 DOI: 10.1073/pnas.1203831109] [Citation(s) in RCA: 116] [Impact Index Per Article: 9.7] [Indexed: 01/02/2023] Open

Rocha JR, van der Linden MG, Ferreira DC, Azevêdo PH, Pereira de Araújo AF. Information-theoretic analysis and prediction of protein atomic burials: on the search for an informational intermediate between sequence and structure. Bioinformatics 2012;28:2755-62. [PMID: 22923297 DOI: 10.1093/bioinformatics/bts512] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Indexed: 12/11/2022]

Abstract

MOTIVATION

It has been recently suggested that atomic burials, as expressed by molecular central distances, contain sufficient information to determine the tertiary structure of small globular proteins. A possible approach to structural determination from sequence could therefore involve a sequence-to-burial intermediate prediction step whose accuracy, however, is theoretically limited by the mutual information between these two variables. We use a non-redundant set of globular protein structures to estimate the mutual information between local amino acid sequence and atomic burials. Discretizing central distances of Cα or Cβ atoms in equiprobable burial levels, we estimate relevant mutual information measures that are compared with actual predictions obtained from a Naive Bayesian Classifier (NBC) and a Hidden Markov Model (HMM).

RESULTS

Mutual information density for 20 amino acids and two or three burial levels was estimated to be roughly 15% of the unconditional burial entropy density. Lower estimates for the mutual information between local amino acid sequence and burial of a single residue indicated an increase in mutual information with the number of burial levels up to at least five or six levels. Prediction schemes were found to efficiently extract the available burial information from local sequence. Lower estimates for the mutual information involving single burials are consistently approached by predictions from the NBC and actually surpassed by predictions from the HMM. Near-optimal prediction for the HMM is indicated by the agreement between its density of prediction information and the corresponding density of mutual information between input and output representations.

AVAILABILITY

The dataset of protein structures and the prediction implementations are available at http://www.btc.unb.br/ (in 'Software').

Rorick M. Quantifying protein modularity and evolvability: a comparison of different techniques. Biosystems 2012;110:22-33. [PMID: 22796584 DOI: 10.1016/j.biosystems.2012.06.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 12/15/2011] [Revised: 06/20/2012] [Accepted: 06/27/2012] [Indexed: 10/28/2022]

Analytic markovian rates for generalized protein structure evolution. PLoS One 2012;7:e34228. [PMID: 22693543 PMCID: PMC3367531 DOI: 10.1371/journal.pone.0034228] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 12/16/2011] [Accepted: 02/26/2012] [Indexed: 12/24/2022] Open

Systematic assessment of accuracy of comparative model of proteins belonging to different structural fold classes. J Mol Model 2011;17:2831-7. [PMID: 21301906 DOI: 10.1007/s00894-011-0976-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 12/08/2010] [Accepted: 01/17/2011] [Indexed: 10/18/2022]

Shukla P. Thermodynamics of protein folding: a random matrix formulation. J Phys Condens Matter 2010;22:415106. [PMID: 21386596 DOI: 10.1088/0953-8984/22/41/415106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/30/2023]

Grigoryan G, Degrado WF. Probing designability via a generalized model of helical bundle geometry. J Mol Biol 2010;405:1079-100. [PMID: 20932976 DOI: 10.1016/j.jmb.2010.08.058] [Citation(s) in RCA: 171] [Impact Index Per Article: 12.2] [Received: 04/02/2010] [Revised: 08/26/2010] [Accepted: 08/31/2010] [Indexed: 10/19/2022]

Abstract

Because the space of folded protein structures is highly degenerate, with recurring secondary and tertiary motifs, methods for representing protein structure in terms of collective physically relevant coordinates are of great interest. By collapsing structural diversity to a handful of parameters, such methods can be used to delineate the space of designable structures (i.e., conformations that can be stabilized with a large number of sequences), a crucial task for de novo protein design. We first demonstrate this on natural α-helical coiled coils using the Crick parameterization. We show that over 95% of known coiled-coil structures are within 1-Å Cα root mean square deviation of a Crick-ideal backbone. Derived parameters show that natural geometric space of coiled coils is highly restricted and can be represented by "allowed" conformations amidst a potential continuum of conformers. Allowed structures have (1) restricted axial offsets between helices, which differ starkly between parallel and anti-parallel structures; (2) preferred superhelical radii, which depend linearly on the oligomerization state; (3) pronounced radius-dependent a- and d-position amino acid propensities; and (4) discrete angles of rotation of helices about their axes, which are surprisingly independent of oligomerization state or orientation. In all, we estimate the space of designable coiled-coil structures to be reduced at least 160-fold relative to the space of geometrically feasible structures. To extend the benefits of structural parameterization to other systems, we developed a general mathematical framework for parameterizing arbitrary helical structures, which reduces to the Crick parameterization as a special case. The method is successfully validated on a set of non-coiled-coil helical bundles, frequent in channels and transporter proteins, which show significant helix bending but not supercoiling.
Programs for coiled-coil parameter fitting and structure generation are provided via a web interface at http://www.gevorggrigoryan.com/cccp/, and code for generalized helical parameterization is available upon request.

Fromer M, Yanover C, Linial M. Design of multispecific protein sequences using probabilistic graphical modeling. Proteins 2010;78:530-47. [PMID: 19842166 DOI: 10.1002/prot.22575] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Indexed: 11/10/2022]

Kanapin AA, Mulder N, Kuznetsov VA. Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity. BMC Genomics 2010;11 Suppl 1:S4. [PMID: 20158875 PMCID: PMC2822532 DOI: 10.1186/1471-2164-11-s1-s4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/10/2022] Open

am Busch MS, Mignon D, Simonson T. Computational protein design as a tool for fold recognition. Proteins 2009;77:139-58. [PMID: 19408297 DOI: 10.1002/prot.22426] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Indexed: 12/13/2022]

A sequence-compatible amount of native burial information is sufficient for determining the structure of small globular proteins. Proc Natl Acad Sci U S A 2009;106:19001-4. [PMID: 19858496 DOI: 10.1073/pnas.0910851106] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Indexed: 11/18/2022] Open

Babor M, Kortemme T. Multi-constraint computational design suggests that native sequences of germline antibody H3 loops are nearly optimal for conformational flexibility. Proteins 2009;75:846-58. [PMID: 19194863 DOI: 10.1002/prot.22293] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Indexed: 11/07/2022]

Abstract

The limited size of the germline antibody repertoire has to recognize a far larger number of potential antigens. The ability of a single antibody to bind multiple ligands due to conformational flexibility in the antigen-binding site can significantly enlarge the repertoire. Among the six complementarity determining regions (CDRs) that generally comprise the binding site, the CDR H3 loop is particularly variable. Computational protein design studies showed that predicted low energy sequences compatible with a given backbone structure often have considerable similarity to the corresponding native sequences of naturally occurring proteins, indicating that native protein sequences are close to optimal for their structures. Here, we take a step forward to determine whether conformational flexibility, believed to play a key functional role in germline antibodies, is also central in shaping their native sequence. In particular, we use a multi-constraint computational design strategy, along with the Rosetta scoring function, to propose that the native sequences of CDR H3 loops from germline antibodies are nearly optimal for conformational flexibility. Moreover, we find that antibody maturation may lead to sequences with a higher degree of optimization for a single conformation, while disfavoring sequences that are intrinsically flexible. In addition, this computational strategy allows us to predict mutations in the CDR H3 loop to stabilize the antigen-bound conformation, a computational mimic of affinity maturation, that may increase antigen binding affinity by preorganizing the antigen binding loop. In vivo affinity maturation data are consistent with our predictions. The method described here can be useful to design antibodies with higher selectivity and affinity by reducing conformational diversity.

Jha AN, Ananthasuresh GK, Vishveshwara S. A search for energy minimized sequences of proteins. PLoS One 2009;4:e6684. [PMID: 19690619 PMCID: PMC2724685 DOI: 10.1371/journal.pone.0006684] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Received: 02/03/2009] [Accepted: 07/23/2009] [Indexed: 11/21/2022] Open

Backbone flexibility in computational protein design. Curr Opin Biotechnol 2009;20:420-8. [DOI: 10.1016/j.copbio.2009.07.006] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.8] [Received: 05/14/2009] [Revised: 07/17/2009] [Accepted: 07/25/2009] [Indexed: 11/22/2022]

Prediction of protein-protein interface sequence diversity using flexible backbone computational protein design. Structure 2009;16:1777-88. [PMID: 19081054 DOI: 10.1016/j.str.2008.09.012] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Received: 07/01/2008] [Revised: 09/26/2008] [Accepted: 09/30/2008] [Indexed: 11/21/2022]

Morra G, Colombo G. Relationship between energy distribution and fold stability: Insights from molecular dynamics simulations of native and mutant proteins. Proteins 2008;72:660-72. [PMID: 18247351 DOI: 10.1002/prot.21963] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Indexed: 11/06/2022]

am Busch MS, Lopes A, Amara N, Bathelt C, Simonson T. Testing the Coulomb/Accessible Surface Area solvent model for protein stability, ligand binding, and protein design. BMC Bioinformatics 2008;9:148. [PMID: 18366628 PMCID: PMC2292695 DOI: 10.1186/1471-2105-9-148] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Received: 10/09/2007] [Accepted: 03/13/2008] [Indexed: 11/10/2022] Open

Abstract

Background

Protein structure prediction and computational protein design require efficient yet sufficiently accurate descriptions of aqueous solvent. We continue to evaluate the performance of the Coulomb/Accessible Surface Area (CASA) implicit solvent model, in combination with the Charmm19 molecular mechanics force field. We test a set of model parameters optimized earlier, and we also carry out a new optimization in this work, using as a target a set of experimental stability changes for single point mutations of various proteins and peptides. The optimization procedure is general, and could be used with other force fields. The computation of stability changes requires a model for the unfolded state of the protein. In our approach, this state is represented by tripeptide structures of the sequence Ala-X-Ala for each amino acid type X. We followed an iterative optimization scheme which, at each cycle, optimizes the solvation parameters and a set of tripeptide structures for the unfolded state. This protocol uses a set of 140 experimental stability mutations and a large set of tripeptide conformations to find the best tripeptide structures and solvation parameters.

Results

Using the optimized parameters, we obtain a mean unsigned error of 2.28 kcal/mol for the stability mutations. The performance of the CASA model is assessed by two further applications: (i) calculation of protein-ligand binding affinities and (ii) computational protein design. For these two applications, the previous parameters and the ones optimized here give a similar performance. For ligand binding, we obtain reasonable agreement with a set of 55 experimental mutation data, with a mean unsigned error of 1.76 kcal/mol with the new parameters and 1.47 kcal/mol with the earlier ones. We show that the optimized CASA model is not inferior to the Generalized Born/Surface Area (GB/SA) model for the prediction of these binding affinities. Likewise, the new parameters perform well for the design of 8 SH3 domain proteins where an average of 32.8% sequence identity relative to the native sequences was achieved. Further, it was shown that the computed sequences have the character of naturally occurring homologues of the native sequences.

Conclusion

Overall, the two CASA variants explored here perform very well for a wide variety of applications. Both variants provide an efficient solvent treatment for the computational engineering of ligands and proteins.

Visual Analysis of Biomolecular Surfaces. ACTA ACUST UNITED AC 2008. [DOI: 10.1007/978-3-540-72630-2_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0]

Schmidt Am Busch M, Lopes A, Mignon D, Simonson T. Computational protein design: Software implementation, parameter optimization, and performance of a simple model. J Comput Chem 2008;29:1092-102. [DOI: 10.1002/jcc.20870] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Indexed: 12/20/2022]

Ingolfsson H, Yona G. Protein domain prediction. Methods Mol Biol 2008;426:117-143. [PMID: 18542860 DOI: 10.1007/978-1-60327-058-8_7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Indexed: 05/26/2023]

Armstrong KA, Tidor B. Computationally mapping sequence space to understand evolutionary protein engineering. Biotechnol Prog 2007;24:62-73. [PMID: 18020358 DOI: 10.1021/bp070134h] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Indexed: 12/29/2022]

Launay G, Mendez R, Wodak S, Simonson T. Recognizing protein-protein interfaces with empirical potentials and reduced amino acid alphabets. BMC Bioinformatics 2007;8:270. [PMID: 17662112 PMCID: PMC2034607 DOI: 10.1186/1471-2105-8-270] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Received: 01/26/2007] [Accepted: 07/27/2007] [Indexed: 11/25/2022] Open

Abstract

Background

In structural genomics, an important goal is the detection and classification of protein–protein interactions, given the structures of the interacting partners. We have developed empirical energy functions to identify native structures of protein–protein complexes among sets of decoy structures. To understand the role of amino acid diversity, we parameterized a series of functions, using a hierarchy of amino acid alphabets of increasing complexity, with 2, 3, 4, 6, and 20 amino acid groups. Compared to previous work, we used the simplest possible functional form, with residue–residue interactions and a stepwise distance-dependence. We used increased computational resources, however, constructing 290,000 decoys for 219 protein–protein complexes, with a realistic docking protocol where the protein partners are flexible and interact through a molecular mechanics energy function. The energy parameters were optimized to correctly assign as many native complexes as possible. To resolve the multiple minimum problem in parameter space, over 64,000 starting parameter guesses were tried for each energy function. The optimized functions were tested by cross validation on subsets of our native and decoy structures, by blind tests on series of native and decoy structures available on the Web, and on models for 13 complexes submitted to the CAPRI structure prediction experiment.

Results

Performance is similar to several other statistical potentials of the same complexity. For example, the CAPRI target structure is correctly ranked ahead of 90% of its decoys in 6 cases out of 13. The hierarchy of amino acid alphabets leads to a coherent hierarchy of energy functions, with qualitatively similar parameters for similar amino acid types at all levels. Most remarkably, the performance with six amino acid classes is equivalent to that of the most detailed, 20-class energy function.

Conclusion

This suggests that six carefully chosen amino acid classes are sufficient to encode specificity in protein–protein interactions, and provide a starting point to develop more complicated energy functions.

Meyerguz L, Kleinberg J, Elber R. The network of sequence flow between protein structures. Proc Natl Acad Sci U S A 2007;104:11627-32. [PMID: 17596339 PMCID: PMC1913895 DOI: 10.1073/pnas.0701393104] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.4] [Received: 02/14/2007] [Indexed: 12/24/2022] Open

Lopes A, Alexandrov A, Bathelt C, Archontis G, Simonson T. Computational sidechain placement and protein mutagenesis with implicit solvent models. Proteins 2007;67:853-67. [PMID: 17348031 DOI: 10.1002/prot.21379] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.3] [Indexed: 11/10/2022]

Abstract

Structure prediction and computational protein design should benefit from accurate solvent models. We have applied implicit solvent models to two problems that are central to this area. First, we performed sidechain placement for 29 proteins, using a solvent model that combines a screened Coulomb term with an Accessible Surface Area term (CASA model). With optimized parameters, the prediction quality is comparable with earlier work that omitted electrostatics and solvation altogether. Second, we computed the stability changes associated with point mutations involving ionized sidechains. For over 1000 mutations, including many fully or partly buried positions, we compared CASA and two generalized Born models (GB) with a more accurate model, which solves the Poisson equation of continuum electrostatics numerically. CASA predicts the correct sign and order of magnitude of the stability change for 81% of the mutations, compared to 97% with the best GB. We also considered 140 mutations for which experimental data are available. Comparing to experiment requires additional assumptions about the unfolded protein structure, protein relaxation in response to the mutations, and contributions from the hydrophobic effect. With a simple, commonly used unfolded state model, the mean unsigned error is 2.1 kcal/mol with both CASA and the best GB. Overall, the electrostatic model is not important for sidechain placement; CASA and GB are equivalent for surface mutations, while GB is far superior for fully or partly buried positions. Thus, for problems like protein design that involve all these aspects, the most recent GB models represent an important step forward. Along with the recent discovery of efficient, pairwise implementations of GB, this will open new possibilities for the computational engineering of proteins.

Woycechowsky KJ, Vamvaca K, Hilvert D. Novel enzymes through design and evolution. Adv Enzymol Relat Areas Mol Biol 2007;75:241-94, xiii. [PMID: 17124869 DOI: 10.1002/9780471224464.ch4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Indexed: 02/07/2023]

The Structurally Constrained Neutral Model of Protein Evolution. ACTA ACUST UNITED AC 2007. [DOI: 10.1007/978-3-540-35306-5_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0]

Jernigan RL, Kloczkowski A. Packing regularities in biological structures relate to their dynamics. Methods Mol Biol 2006;350:251-76. [PMID: 16957327 PMCID: PMC2039702 DOI: 10.1385/1-59745-189-4:251] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Indexed: 03/28/2023]

Abstract

The high packing density inside proteins leads to certain geometric regularities and also is one of the most important contributors to the high extent of cooperativity manifested by proteins in their cohesive domain motions. The orientations between neighboring nonbonded residues in proteins substantially follow similar geometric regularities, regardless of whether the residues are on the surface or buried, a direct result of hydrophobicity forces. These orientations are relatively fixed and correspond closely to small deformations from those of the face-centered cubic lattice, which is the way in which identical spheres pack at the highest density. Packing density also is related to the extent of conservation of residues, and we show this relationship for residue packing densities by averaging over a large sample of residue packings. There are three regimes: (1) over a broad range of packing densities the relationship between sequence entropy and inverse packing density is nearly linear, (2) over a limited range of low packing densities the sequence entropy is nearly constant, and (3) at extremely low packing densities the sequence entropy is highly variable. These packing results provide important justification for the simple elastic network models that have been shown for a large number of proteins to represent protein dynamics so successfully, even when the models are extremely coarse grained. Elastic network models for polymeric chains are simple and could be combined with these protein elastic networks to represent partially denatured parts of proteins. Finally, we show results of applications of the elastic network model to study the functional motions of the ribosome, based on its known structure. These results indicate expected correlations among its components for the stepwise processing steps in protein synthesis, and suggest ways to use these elastic network models to develop more detailed mechanisms, an important possibility because most experiments yield only static structures.

Ma BG, Guo JX, Zhang HY. Direct correlation between proteins' folding rates and their amino acid compositions: An ab initio folding rate prediction. Proteins 2006;65:362-72. [PMID: 16937389 DOI: 10.1002/prot.21140] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Indexed: 01/02/2023]