1
|
Pal T, Wessén J, Das S, Chan HS. Differential Effects of Sequence-Local versus Nonlocal Charge Patterns on Phase Separation and Conformational Dimensions of Polyampholytes as Model Intrinsically Disordered Proteins. J Phys Chem Lett 2024; 15:8248-8256. [PMID: 39105804 DOI: 10.1021/acs.jpclett.4c01973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/07/2024]
Abstract
Conformational properties of intrinsically disordered proteins (IDPs) are governed by a sequence-ensemble relationship. To differentiate the impact of sequence-local versus sequence-nonlocal features of an IDP's charge pattern on its conformational dimensions and its phase-separation propensity, the charge "blockiness" κ and the nonlocality-weighted sequence charge decoration (SCD) parameters are compared for their correlations with isolated-chain radii of gyration (Rgs) and upper critical solution temperatures (UCSTs) of polyampholytes modeled by random phase approximation, field-theoretic simulation, and coarse-grained molecular dynamics. SCD is superior to κ in predicting Rg because SCD accounts for effects of contact order, i.e., nonlocality, on dimensions of isolated chains. In contrast, κ and SCD are comparably good, though nonideal, predictors of UCST because frequencies of interchain contacts in the multiple-chain condensed phase are less sensitive to sequence positions than frequencies of intrachain contacts of an isolated chain, as reflected by κ correlating better with condensed-phase interaction energy than SCD.
Collapse
Affiliation(s)
- Tanmoy Pal
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Jonas Wessén
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Suman Das
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Department of Chemistry, Gandhi Institute of Technology and Management, Visakhapatnam, Andhra Pradesh 530045, India
| | - Hue Sun Chan
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
2
|
Badonyi M, Marsh JA. Hallmarks and evolutionary drivers of cotranslational protein complex assembly. FEBS J 2024; 291:3557-3567. [PMID: 37202910 DOI: 10.1111/febs.16869] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 05/11/2023] [Accepted: 05/18/2023] [Indexed: 05/20/2023]
Abstract
Recent discoveries have highlighted the prevalence of cotranslational assembly in proteomes, revealing a range of mechanisms that enables the assembly of protein complex subunits on the ribosome. Structural analyses have uncovered emergent properties that may inherently control whether a subunit undergoes cotranslational assembly. However, the evolutionary paths that have yielded such complexes over an extended timescale remain largely unclear. In this review, we reflect on historical experiments that contributed to the field, including breakthroughs that have made possible the proteome-wide detection of cotranslational assembly, and the technical challenges yet to be overcome. We introduce a simple framework that encapsulates the hallmarks of cotranslational assembly and discuss how results from new experiments are shaping our view of the mechanistic, structural and evolutionary factors driving the phenomenon.
Collapse
Affiliation(s)
- Mihaly Badonyi
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, UK
| | - Joseph A Marsh
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, UK
| |
Collapse
|
3
|
Das D, Ainavarapu SRK. Protein engineering using circular permutation - structure, function, stability, and applications. FEBS J 2024; 291:3581-3596. [PMID: 38676939 DOI: 10.1111/febs.17146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 03/13/2024] [Accepted: 04/12/2024] [Indexed: 04/29/2024]
Abstract
Protein engineering is important for creating novel variants from natural proteins, enabling a wide range of applications. Approaches such as rational design and directed evolution are routinely used to make new protein variants. Computational tools like de novo design can introduce new protein folds. Expanding the amino acid repertoire to include unnatural amino acids with non-canonical side chains in vitro by native chemical ligation and in vivo via codon expansion methods broadens sequence and structural possibilities. Circular permutation (CP) is an invaluable approach to redesigning a protein by rearranging the amino acid sequence, where the connectivity of the secondary structural elements is altered without changing the overall structure of the protein. Artificial CP proteins (CPs) are employed in various applications such as biocatalysis, sensing of small molecules by fluorescence, genome editing, ligand-binding protein switches, and optogenetic engineering. Many studies have shown that CP can lead to either reduced or enhanced stability or catalytic efficiency. The effects of CP on a protein's energy landscape cannot be predicted a priori. Thus, it is important to understand how CP can affect the thermodynamic and kinetic stability of a protein. In this review, we discuss the discovery and advancement of techniques to create protein CP, and existing reviews on CP. We delve into the plethora of biological applications for designed CP proteins. We subsequently discuss the experimental and computational reports on the effects of CP on the thermodynamic and kinetic stabilities of proteins of various topologies. An understanding of the various aspects of CP will allow the reader to design robust CP proteins for their specific purposes.
Collapse
Affiliation(s)
- Debanjana Das
- Department of Chemical Sciences, Tata Institute of Fundamental Research, Mumbai, India
| | | |
Collapse
|
4
|
An L, Said M, Tran L, Majumder S, Goreshnik I, Lee GR, Juergens D, Dauparas J, Anishchenko I, Coventry B, Bera AK, Kang A, Levine PM, Alvarez V, Pillai A, Norn C, Feldman D, Zorine D, Hicks DR, Li X, Sanchez MG, Vafeados DK, Salveson PJ, Vorobieva AA, Baker D. Binding and sensing diverse small molecules using shape-complementary pseudocycles. Science 2024; 385:276-282. [PMID: 39024436 DOI: 10.1126/science.adn3780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 06/03/2024] [Indexed: 07/20/2024]
Abstract
We describe an approach for designing high-affinity small molecule-binding proteins poised for downstream sensing. We use deep learning-generated pseudocycles with repeating structural units surrounding central binding pockets with widely varying shapes that depend on the geometry and number of the repeat units. We dock small molecules of interest into the most shape complementary of these pseudocycles, design the interaction surfaces for high binding affinity, and experimentally screen to identify designs with the highest affinity. We obtain binders to four diverse molecules, including the polar and flexible methotrexate and thyroxine. Taking advantage of the modular repeat structure and central binding pockets, we construct chemically induced dimerization systems and low-noise nanopore sensors by splitting designs into domains that reassemble upon ligand addition.
Collapse
Affiliation(s)
- Linna An
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Meerit Said
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Long Tran
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Chemistry, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - Sagardip Majumder
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Inna Goreshnik
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Gyu Rie Lee
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - David Juergens
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Molecular Engineering, University of Washington, Seattle, WA, USA
| | - Justas Dauparas
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Ivan Anishchenko
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Asim K Bera
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Alex Kang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Paul M Levine
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Valentina Alvarez
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Arvind Pillai
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | | | - Dmitri Zorine
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Derrick R Hicks
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Xinting Li
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | - Dionne K Vafeados
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Patrick J Salveson
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Anastassia A Vorobieva
- VIB-VUB Center for Structural Biology, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Chemistry, University of Washington, Seattle, WA, USA
| |
Collapse
|
5
|
Cea PA, Pérez M, Herrera SM, Muñoz SM, Fuentes-Ugarte N, Coche-Miranda J, Maturana P, Guixé V, Castro-Fernandez V. Deciphering Structural Traits for Thermal and Kinetic Stability across Protein Family Evolution through Ancestral Sequence Reconstruction. Mol Biol Evol 2024; 41:msae127. [PMID: 38913681 PMCID: PMC11229819 DOI: 10.1093/molbev/msae127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 05/17/2024] [Accepted: 06/17/2024] [Indexed: 06/26/2024] Open
Abstract
Natural proteins are frequently marginally stable, and an increase in environmental temperature can easily lead to unfolding. As a result, protein engineering to improve protein stability is an area of intensive research. Nonetheless, since there is usually a high degree of structural homology between proteins from thermophilic organisms and their mesophilic counterparts, the identification of structural determinants for thermoadaptation is challenging. Moreover, in many cases, it has become clear that the success of stabilization strategies is often dependent on the evolutionary history of a protein family. In the last few years, the use of ancestral sequence reconstruction (ASR) as a tool for elucidation of the evolutionary history of functional traits of a protein family has gained strength. Here, we used ASR to trace the evolutionary pathways between mesophilic and thermophilic kinases that participate in the biosynthetic pathway of vitamin B1 in bacteria. By combining biophysics approaches, X-ray crystallography, and molecular dynamics simulations, we found that the thermal stability of these enzymes correlates with their kinetic stability, where the highest thermal/kinetic stability is given by an increase in small hydrophobic amino acids that allow a higher number of interatomic hydrophobic contacts, making this type of interaction the main support for stability in this protein architecture. The results highlight the potential benefits of using ASR to explore the evolutionary history of protein sequence and structure to identify traits responsible for the kinetic and thermal stability of any protein architecture.
Collapse
Affiliation(s)
- Pablo A Cea
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Myriam Pérez
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Sixto M Herrera
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Sebastián M Muñoz
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Nicolás Fuentes-Ugarte
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - José Coche-Miranda
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Pablo Maturana
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Victoria Guixé
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| | - Victor Castro-Fernandez
- Departamento de Biología, Facultad de Ciencias, Laboratorio de Bioquímica y Biología Molecular, Universidad de Chile, Santiago, Chile
| |
Collapse
|
6
|
Campos LA, Muñoz V. Targeting the protein folding transition state by mutation: Large scale (un)folding rate accelerations without altering native stability. Protein Sci 2024; 33:e5031. [PMID: 38864692 PMCID: PMC11168068 DOI: 10.1002/pro.5031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 05/06/2024] [Accepted: 05/09/2024] [Indexed: 06/13/2024]
Abstract
Proteins are constantly undergoing folding and unfolding transitions, with rates that determine their homeostasis in vivo and modulate their biological function. The ability to optimize these rates without affecting overall native stability is hence highly desirable for protein engineering and design. The great challenge is, however, that mutations generally affect folding and unfolding rates with inversely complementary fractions of the net free energy change they inflict on the native state. Here we address this challenge by targeting the folding transition state (FTS) of chymotrypsin inhibitor 2 (CI2), a very slow and stable two-state folding protein with an FTS known to be refractory to change by mutation. We first discovered that the CI2's FTS is energetically taxed by the desolvation of several, highly conserved, charges that form a buried salt bridge network in the native structure. Based on these findings, we designed a CI2 variant that bears just four mutations and aims to selectively stabilize the FTS. This variant has >250-fold faster rates in both directions and hence identical native stability, demonstrating the success of our FTS-centric design strategy. With an optimized FTS, CI2 also becomes 250-fold more sensitive to proteolytic degradation by its natural substrate chymotrypsin, and completely loses its activity as inhibitor. These results indicate that CI2 has been selected through evolution to have a very unstable FTS in order to attain the kinetic stability needed to effectively function as protease inhibitor. Moreover, the CI2 case showcases that protein (un)folding rates can critically pivot around a few key residues-interactions, which can strongly modify the general effects of known structural factors such as domain size and fold topology. From a practical standpoint, our results suggest that future efforts should perhaps focus on identifying such critical residues-interactions in proteins as best strategy to significantly improve our ability to predict and engineer protein (un)folding rates.
Collapse
Affiliation(s)
- Luis A. Campos
- Instituto Madrileño de Estudios Avanzados en Nanociencia (IMDEA Nanociencia)MadridSpain
- Unidad de Nanobiotecnología Asociada al Centro Nacional de Biotecnología (CSIC)MadridSpain
| | - Victor Muñoz
- Department of BioengineeringUniversity of CaliforniaMercedCaliforniaUSA
- Center for Cellular and Biomolecular MachinesUniversity of CaliforniaMercedCaliforniaUSA
| |
Collapse
|
7
|
Saha D, Kothari S, Kulkarni SD, Thambiraja M, Yennamalli RM, Das DK. Genetic heterogeneity and respiratory chain enzyme analysis in pediatric Indian patients with mitochondrial disorder: Report of novel variants in POLG1 gene and their functional implication using molecular dynamic simulation. Mitochondrion 2024; 76:101870. [PMID: 38471579 DOI: 10.1016/j.mito.2024.101870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 03/04/2024] [Accepted: 03/07/2024] [Indexed: 03/14/2024]
Abstract
Mitochondrial disorders are a heterogeneous group of disorders caused by mutations in the mitochondrial DNA or in nuclear genes encoding the mitochondrial proteins and subunits. Polymerase Gamma (POLG) is a nuclear gene and mutation in the POLG gene are one of the major causes of inherited mitochondrial disorders. In this study, 15 pediatric patients, with a wide spectrum of clinical phenotypes were screened using blood samples (n = 15) and muscle samples (n = 4). Respiratory chain enzyme analysis in the muscle samples revealed multi-complex deficiencies with Complex I deficiency present in (1/4) patients, Complex II (2/4), Complex III (3/4) and Complex IV (2/4) patients. Multiple large deletions were observed in 4/15 patients using LR-PCR. Whole exome sequencing (WES) revealed a compound heterozygous mutation consisting of a POLG1 novel variant (NP_002684.1:p.Trp261X) and a missense variant (NP_002684.1:p. Leu304Arg) in one patient and another patient harboring a novel homozygous POLG1 variant (NP_002684.1:p. Phe750Val). These variants (NP_002684.1:p. Leu304Arg) and (NP_002684.1:p. Phe750Val) and their interactions with DNA were modelled using molecular docking and molecular dynamics (MD) simulation studies. The protein conformation was analyzed as root mean square deviation (RMSD), root mean square fluctuation (RMSF) which showed local fluctuations in the mutants compared to the wildtype. However, Solvent Accessible Surface Area (SASA) significantly increased for NP_002684.1:p.Leu304Arg and decreased in NP_002684.1:p.Phe750Val mutants. Further, Contact Order analysis indicated that the Aromatic-sulfur interactions were destabilizing in the mutants. Overall, these in-silico analysis has revealed a destabilizing mutations suggesting pathogenic variants in POLG1 gene.
Collapse
Affiliation(s)
- Debolina Saha
- Stem Cell Biology Department, ICMR-National Institute for Research in Reproductive and Child Health, JM Street, Parel, Mumbai 400012, India
| | - Sonam Kothari
- Department of Pediatric Neurology, Bai Jerbai Wadia Hospital for Children, Acharya Donde Marg, Parel, Mumbai, Maharashtra 400012, India
| | - Shilpa Duttaprasanna Kulkarni
- Department of Pediatric Neurology, Bai Jerbai Wadia Hospital for Children, Acharya Donde Marg, Parel, Mumbai, Maharashtra 400012, India
| | - Menaka Thambiraja
- Department of Bioinformatics, School of Chemical and Biotechnology, SASTRA Deemed to be University, Thanjavur, Tamilnadu 613401, India
| | - Ragothaman M Yennamalli
- Department of Bioinformatics, School of Chemical and Biotechnology, SASTRA Deemed to be University, Thanjavur, Tamilnadu 613401, India.
| | - Dhanjit K Das
- Stem Cell Biology Department, ICMR-National Institute for Research in Reproductive and Child Health, JM Street, Parel, Mumbai 400012, India.
| |
Collapse
|
8
|
Martin P, Kurth EA, Budean D, Momplaisir N, Qu E, Simien JM, Orellana GE, Brautigam CA, Smrcka AV, Haglund E. Biophysical characterization of the CXC chemokine receptor 2 ligands. PLoS One 2024; 19:e0298418. [PMID: 38625857 PMCID: PMC11020491 DOI: 10.1371/journal.pone.0298418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 01/24/2024] [Indexed: 04/18/2024] Open
Abstract
The chemokines of the immune system act as first responders by operating as chemoattractants, directing immune cells to specific locations of inflamed tissues. This promiscuous network is comprised of 50 ligands and 18 receptors where the ligands may interact with the receptors in various oligomeric states i.e., monomers, homodimers, and heterodimers. Chemokine receptors are G-protein coupled receptors (GPCRs) present in the membrane of immune cells. The migration of immune cells occurs in response to a concentration gradient of the ligands. Chemotaxis of neutrophils is directed by CXC-ligand (CXCL) activation of the membrane bound CXC chemokine receptor 2 (CXCR2). CXCR2 plays an important role in human health and is linked to disorders such as autoimmune disorders, inflammation, and cancer. Yet, despite their important role, little is known about the biophysical characteristics controlling ligand:ligand and ligand:receptor interaction essential for biological activity. In this work, we study the homodimers of three of the CXCR2 cognate ligands, CXCL1, CXCL5, and CXCL8. The ligands share high structural integrity but a low sequence identity. We show that the sequence diversity has evolved different binding affinities and stabilities for the CXC-ligands resulting in diverse agonist/antagonist behavior. Furthermore, CXC-ligands fold through a three-state mechanism, populating a folded monomeric state before associating into an active dimer.
Collapse
Affiliation(s)
- Patrick Martin
- Department of Chemistry, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America
| | - Emily A. Kurth
- Department of Chemistry, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America
| | - David Budean
- Department of Chemistry, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America
| | - Nathalie Momplaisir
- Department of Pharmacology, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Elaine Qu
- Department of Pharmacology, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Jennifer M. Simien
- Department of Chemistry, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America
| | - Grace E. Orellana
- Department of Chemistry, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America
| | - Chad A. Brautigam
- Department of Biophysics and the Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Alan V. Smrcka
- Department of Pharmacology, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
| | - Ellinor Haglund
- Department of Chemistry, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America
| |
Collapse
|
9
|
Harihar B, Saravanan KM, Gromiha MM, Selvaraj S. Importance of Inter-residue Contacts for Understanding Protein Folding and Unfolding Rates, Remote Homology, and Drug Design. Mol Biotechnol 2024:10.1007/s12033-024-01119-4. [PMID: 38498284 DOI: 10.1007/s12033-024-01119-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 02/10/2024] [Indexed: 03/20/2024]
Abstract
Inter-residue interactions in protein structures provide valuable insights into protein folding and stability. Understanding these interactions can be helpful in many crucial applications, including rational design of therapeutic small molecules and biologics, locating functional protein sites, and predicting protein-protein and protein-ligand interactions. The process of developing machine learning models incorporating inter-residue interactions has been improved recently. This review highlights the theoretical models incorporating inter-residue interactions in predicting folding and unfolding rates of proteins. Utilizing contact maps to depict inter-residue interactions aids researchers in developing computer models for detecting remote homologs and interface residues within protein-protein complexes which, in turn, enhances our knowledge of the relationship between sequence and structure of proteins. Further, the application of contact maps derived from inter-residue interactions is highlighted in the field of drug discovery. Overall, this review presents an extensive assessment of the significant models that use inter-residue interactions to investigate folding rates, unfolding rates, remote homology, and drug development, providing potential future advancements in constructing efficient computational models in structural biology.
Collapse
Affiliation(s)
- Balasubramanian Harihar
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Konda Mani Saravanan
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India
- Department of Biotechnology, Bharath Institute of Higher Education and Research, Chennai, Tamil Nadu, 600073, India
| | - Michael M Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, 600036, India
| | - Samuel Selvaraj
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, Tamil Nadu, 620024, India.
| |
Collapse
|
10
|
Duran-Romaña R, Houben B, De Vleeschouwer M, Louros N, Wilson MP, Matthijs G, Schymkowitz J, Rousseau F. N-glycosylation as a eukaryotic protective mechanism against protein aggregation. SCIENCE ADVANCES 2024; 10:eadk8173. [PMID: 38295165 PMCID: PMC10830103 DOI: 10.1126/sciadv.adk8173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 12/28/2023] [Indexed: 02/02/2024]
Abstract
The tendency for proteins to form aggregates is an inherent part of every proteome and arises from the self-assembly of short protein segments called aggregation-prone regions (APRs). While posttranslational modifications (PTMs) have been implicated in modulating protein aggregation, their direct role in APRs remains poorly understood. In this study, we used a combination of proteome-wide computational analyses and biophysical techniques to investigate the potential involvement of PTMs in aggregation regulation. Our findings reveal that while most PTM types are disfavored near APRs, N-glycosylation is enriched and evolutionarily selected, especially in proteins prone to misfolding. Experimentally, we show that N-glycosylation inhibits the aggregation of peptides in vitro through steric hindrance. Moreover, mining existing proteomics data, we find that the loss of N-glycans at the flanks of APRs leads to specific protein aggregation in Neuro2a cells. Our findings indicate that, among its many molecular functions, N-glycosylation directly prevents protein aggregation in higher eukaryotes.
Collapse
Affiliation(s)
- Ramon Duran-Romaña
- Switch Laboratory, VIB Center for Brain and Disease Research, 3000 Leuven, Belgium
- Switch Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Bert Houben
- Switch Laboratory, VIB Center for Brain and Disease Research, 3000 Leuven, Belgium
- Switch Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Matthias De Vleeschouwer
- Switch Laboratory, VIB Center for Brain and Disease Research, 3000 Leuven, Belgium
- Switch Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Nikolaos Louros
- Switch Laboratory, VIB Center for Brain and Disease Research, 3000 Leuven, Belgium
- Switch Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Matthew P. Wilson
- Laboratory for Molecular Diagnosis, Center for Human Genetics, KU Leuven, 3000 Leuven, Belgium
| | - Gert Matthijs
- Laboratory for Molecular Diagnosis, Center for Human Genetics, KU Leuven, 3000 Leuven, Belgium
| | - Joost Schymkowitz
- Switch Laboratory, VIB Center for Brain and Disease Research, 3000 Leuven, Belgium
- Switch Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Frederic Rousseau
- Switch Laboratory, VIB Center for Brain and Disease Research, 3000 Leuven, Belgium
- Switch Laboratory, Department of Cellular and Molecular Medicine, KU Leuven, 3000 Leuven, Belgium
| |
Collapse
|
11
|
Xiao N, Yang W, Wang J, Li J, Zhao R, Li M, Li C, Liu K, Li Y, Yin C, Chen Z, Li X, Jiang Y. Protein structuromics: A new method for protein structure-function crosstalk in glioma. Proteins 2024; 92:24-36. [PMID: 37497743 DOI: 10.1002/prot.26555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2023] [Revised: 06/16/2023] [Accepted: 07/04/2023] [Indexed: 07/28/2023]
Abstract
Glioma is a type of tumor that starts in the glial cells of the brain or spine. Since the 1800s, when the disease was first named, its survival rates have always been unsatisfactory. Despite great advances in molecular biology and traditional treatment methods, many questions regarding cancer occurrence and the underlying mechanism remain to be answered. In this study, we assessed the protein structural features of 20 oncogenes and 20 anti-oncogenes via protein structure and dynamic analysis methods and 3D structural and systematic analyses of the structure-function relationships of proteins. All of these results directly indicate that unfavorable group proteins show more complex structures than favorable group proteins. As the tumor cell microenvironment changes, the balance of oncogene-related and anti-oncogene-related proteins is disrupted, and most of the structures of the two groups of proteins will be disrupted. However, more unfavorable group proteins will maintain and refold to achieve their correct shape faster and perform their functions more quickly than favorable group proteins, and the former thus support cancer development. We hope that these analyses will help promote mechanistic research and the development of new treatments for glioma.
Collapse
Affiliation(s)
- Nan Xiao
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Wenming Yang
- Department of Neurosurgery, The First Affiliated Hospital of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Jin Wang
- Department of Rehabilitation, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Jiarong Li
- Department of Rehabilitation, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Ruoxuan Zhao
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Muzheng Li
- Department of Rehabilitation, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Chi Li
- Department of Anesthesiology, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Kang Liu
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Yingxin Li
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Chaoqun Yin
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Zhibo Chen
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Xingqi Li
- Department of Medicine, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| | - Yun Jiang
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou, Liaoning, China
| |
Collapse
|
12
|
Hofmaier M, Heger JE, Lentz S, Schwarz S, Müller-Buschbaum P, Scheibel T, Fery A, Müller M. Influence of the Sequence Motive Repeating Number on Protein Folding in Spider Silk Protein Films. Biomacromolecules 2023; 24:5707-5721. [PMID: 37934893 DOI: 10.1021/acs.biomac.3c00688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2023]
Abstract
Like multiblock copolymers, spider silk proteins are built of repetitive sequence motives. One prominent repetitive motif is based on the consensus sequence of spidroin 4 of the spider Araneus diadematus ADF4. The number x of the repeating sequence motives (C) determines the molecular weight of the recombinant ADF4-based, engineered spider silk protein denoted as eADF4(Cx). eADF4(Cx) can be used as a model for intrinsically disordered proteins (IDP) and to elucidate their folding. Herein, the influence of the variation of the sequence motive repeating number x (x = 1, 2, 4, 8, 16) on the protein folding within eADF4(Cx) films was investigated. eADF4(Cx) films were cast from 1,1,1,3,3,3-hexafluoropropan-2-ol (HFIP) solutions onto planar silicon model substrates, revealing mainly helical or random coil structure. Upon treatment with methanol vapor (ptm), the formation of crystalline β-sheets was triggered. Dichroic Fourier-transform infrared (FTIR) spectroscopy, circular dichroism, spectroscopic ellipsometry, atomic force microscopy, grazing-incidence small-angle X-ray scattering (GISAXS), grazing-incidence wide-angle X-ray scattering (GIWAXS), and electrokinetic and contact angle measurements were used to get information concerning the secondary structure and folding kinetics, orientation of β-sheets, the ratio of parallel/antiparallel β-sheets, domain sizes and distributions, surface topography, surface potential, hydrophobicity and the film integrity under water. Significant differences in the final β-sheet content, the share of antiparallel β-sheet structures, film integrity, surface potential, and isoelectric points between eADF4(Cx) with x = 1, 2 and eADF4(Cx) with x = 4, 8, 16 gave new insights in the molecular weight-dependent structure formation and film properties of IDP systems. GISAXS and kinetic measurements confirmed a relation between β-sheet crystal growth rate and final β-sheet crystal size. Further, competing effects of reduced diffusibility hindering accelerated crystal growth and enhanced backfolding promoting accelerated crystal growth with increasing molecular weight were discussed.
Collapse
Affiliation(s)
- Mirjam Hofmaier
- Institute of Physical Chemistry and Polymer Physics, Leibniz Institute of Polymer Research Dresden (IPF), Dresden 01069, Germany
- Chair of Physical Chemistry of Polymeric Materials, Technical University Dresden (TUD), Dresden 01069, Germany
| | - Julian E Heger
- TUM School of Natural Sciences, Department of Physics, Chair for Functional Materials, Technical University of Munich, Garching 85748, Germany
| | - Sarah Lentz
- Functional Polymer Interfaces Group, University of Bayreuth, Bayreuth 95447, Germany
| | - Simona Schwarz
- Institute of Physical Chemistry and Polymer Physics, Leibniz Institute of Polymer Research Dresden (IPF), Dresden 01069, Germany
| | - Peter Müller-Buschbaum
- TUM School of Natural Sciences, Department of Physics, Chair for Functional Materials, Technical University of Munich, Garching 85748, Germany
- Heinz Maier-Leibnitz Zentrum (MLZ), Technical University of Munich, Garching 85748, Germany
| | - Thomas Scheibel
- Chair of Biomaterials, University of Bayreuth, Bayreuth 95447, Germany
- Bayreuth Center of Colloids and Interfaces (BZKG), University of Bayreuth, Bayreuth 95440, Germany
- Bayreuth Center for Molecular Bioscience (BZMB), University of Bayreuth, Bayreuth 95440, Germany
- Bayreuth Center for Material Science and Engineering (BayMAT), Universität Bayreuth, Bayreuth 95440, Germany
- Bavarian Polymer Institute (BPI), University of Bayreuth, Bayreuth 95440, Germany
| | - Andreas Fery
- Institute of Physical Chemistry and Polymer Physics, Leibniz Institute of Polymer Research Dresden (IPF), Dresden 01069, Germany
- Chair of Physical Chemistry of Polymeric Materials, Technical University Dresden (TUD), Dresden 01069, Germany
| | - Martin Müller
- Institute of Physical Chemistry and Polymer Physics, Leibniz Institute of Polymer Research Dresden (IPF), Dresden 01069, Germany
- Chair of Macromolecular Chemistry, Technical University of Dresden (TUD), Dresden 01062, Germany
| |
Collapse
|
13
|
Hsu STD. Folding and functions of knotted proteins. Curr Opin Struct Biol 2023; 83:102709. [PMID: 37778185 DOI: 10.1016/j.sbi.2023.102709] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 09/02/2023] [Accepted: 09/05/2023] [Indexed: 10/03/2023]
Abstract
Topologically knotted proteins have entangled structural elements within their native structures that cannot be disentangled simply by pulling from the N- and C-termini. Systematic surveys have identified different types of knotted protein structures, constituting as much as 1% of the total entries within the Protein Data Bank. Many knotted proteins rely on their knotted structural elements to carry out evolutionarily conserved biological functions. Being knotted may also provide mechanical stability to withstand unfolding-coupled proteolysis. Reconfiguring a knotted protein topology by circular permutation or cyclization provides insights into the importance of being knotted in the context of folding and functions. With the explosion of predicted protein structures by artificial intelligence, we are now entering a new era of exploring the entangled protein universe.
Collapse
Affiliation(s)
- Shang-Te Danny Hsu
- Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan; Institute of Biochemical Sciences, National Taiwan University, Taipei 10617, Taiwan; International Institute for Sustainability with Knotted Chiral Meta Matter (WPI-SKCM(2)), Hiroshima University, Higashi-Hiroshima, Hiroshima 739-8526, Japan.
| |
Collapse
|
14
|
Das D, Ainavarapu SRK. Circular permutation at azurin's active site slows down its folding. J Biol Inorg Chem 2023; 28:737-749. [PMID: 37957357 DOI: 10.1007/s00775-023-02023-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 09/26/2023] [Indexed: 11/15/2023]
Abstract
Circular permutation (CP) is a technique by which the primary sequence of a protein is rearranged to create new termini. The connectivity of the protein is altered but the overall protein structure generally remains unperturbed. Understanding the effect of CP can help design robust proteins for numerous applications such as in genetic engineering, optoelectronics, and improving catalytic activity. Studies on different protein topologies showed that CP usually affects protein stability as well as unfolding rates. Though a significant number of proteins contain metals or other cofactors, reports of metalloprotein CPs are rare. Thus, we chose a bacterial metalloprotein, azurin, and its CP within the metal-binding site (cpF114). We studied the stabilities, folding, and unfolding rates of apo- and Zn2+-bound CP azurin using fluorescence and circular dichroism. The introduced CP had destabilizing effects on the protein. Also, the folding of the Zn2+-CP protein was much slower than that of the Zn2+-WT or apo-protein. We compared this study to our previously reported azurin-cpN42, where we had observed an equilibrium and kinetic intermediate. cpF114 exhibits an apparent two-state equilibrium unfolding but has an off-pathway kinetic intermediate. Our study hinted at CP as a method to modify the energy landscape of proteins to alter their folding pathways. WT azurin, being a faster folder, may have evolved to optimize the folding rate of metal-bound protein compared to its CPs, albeit all of them have the same structure and function. Our study underscores that protein sequence and protein termini positions are crucial for metalloproteins. TOC Figure. (Top) Zn2+-azurin WT structure (PDB code: 1E67) and 2-D topology diagram of Zn2+-cpF114 azurin. (Bottom) Cartoon diagram representing folding (red arrows) and unfolding (blue arrows) of apo- and Zn2+- WT and cpF114 azurins. The width of the arrows represents the rate of the corresponding processes.
Collapse
Affiliation(s)
- Debanjana Das
- Department of Chemical Sciences, Tata Institute of Fundamental Research, Dr. Homi Bhabha Road, Colaba, Mumbai, 400005, India
| | - Sri Rama Koti Ainavarapu
- Department of Chemical Sciences, Tata Institute of Fundamental Research, Dr. Homi Bhabha Road, Colaba, Mumbai, 400005, India.
| |
Collapse
|
15
|
Liu D, Zhang B, Liu J, Li H, Song L, Zhang G. Assessing protein model quality based on deep graph coupled networks using protein language model. Brief Bioinform 2023; 25:bbad420. [PMID: 38018909 PMCID: PMC10685403 DOI: 10.1093/bib/bbad420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 10/19/2023] [Accepted: 10/31/2023] [Indexed: 11/30/2023] Open
Abstract
Model quality evaluation is a crucial part of protein structural biology. How to distinguish high-quality models from low-quality models, and to assess which high-quality models have relatively incorrect regions for improvement, are remain a challenge. More importantly, the quality assessment of multimer models is a hot topic for structure prediction. In this study, we propose GraphCPLMQA, a novel approach for evaluating residue-level model quality that combines graph coupled networks and embeddings from protein language models. The GraphCPLMQA consists of a graph encoding module and a transform-based convolutional decoding module. In encoding module, the underlying relational representations of sequence and high-dimensional geometry structure are extracted by protein language models with Evolutionary Scale Modeling. In decoding module, the mapping connection between structure and quality is inferred by the representations and low-dimensional features. Specifically, the triangular location and residue level contact order features are designed to enhance the association between the local structure and the overall topology. Experimental results demonstrate that GraphCPLMQA using single-sequence embedding achieves the best performance compared with the CASP15 residue-level interface evaluation methods among 9108 models in the local residue interface test set of CASP15 multimers. In CAMEO blind test (20 May 2022 to 13 August 2022), GraphCPLMQA ranked first compared with other servers (https://www.cameo3d.org/quality-estimation). GraphCPLMQA also outperforms state-of-the-art methods on 19, 035 models in CASP13 and CASP14 monomer test set.
Collapse
Affiliation(s)
- Dong Liu
- College of Information Engineering, Zhejiang University of Technology
| | - Biao Zhang
- College of Information Engineering, Zhejiang University of Technology
| | - Jun Liu
- College of Information Engineering, Zhejiang University of Technology
| | - Hui Li
- researcher of AI in the BioMap
| | - Le Song
- Chief Scientist of AI in the BioMap & MBZUAI
| | - Guijun Zhang
- College of Information Engineering, Zhejiang University of Technology
| |
Collapse
|
16
|
Rat C, Heindl C, Neuweiler H. Domain swap facilitates structural transitions of spider silk protein C-terminal domains. Protein Sci 2023; 32:e4783. [PMID: 37712205 PMCID: PMC10578117 DOI: 10.1002/pro.4783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 08/25/2023] [Accepted: 09/12/2023] [Indexed: 09/16/2023]
Abstract
Domain swap is a mechanism of protein dimerization where the two interacting domains exchange parts of their structure. Web spiders make use of the process in the connection of C-terminal domains (CTDs) of spidroins, the soluble protein building blocks that form tough silk fibers. Besides providing connectivity and solubility, spidroin CTDs are responsible for inducing structural transitions during passage through an acidified assembly zone within spinning ducts. The underlying molecular mechanisms are elusive. Here, we studied the folding of five homologous spidroin CTDs from different spider species or glands. Four of these are domain-swapped dimers formed by five-helix bundles from spidroins of major and minor ampullate glands. The fifth is a dimer that lacks domain swap, formed by four-helix bundles from a spidroin of a flagelliform gland. Spidroins from this gland do not undergo structural transitions whereas the others do. We found a three-state mechanism of folding and dimerization that was conserved across homologues. In chemical denaturation experiments the native CTD dimer unfolded to a dimeric, partially structured intermediate, followed by full unfolding to denatured monomers. The energetics of the individual folding steps varied between homologues. Contrary to the common belief that domain swap stabilizes protein assemblies, the non-swapped homologue was most stable and folded four orders of magnitude faster than a swapped variant. Domain swap of spidroin CTDs induces an entropic penalty to the folding of peripheral helices, thus unfastening them for acid-induced unfolding within a spinning duct, which primes them for refolding into alternative structures during silk formation.
Collapse
Affiliation(s)
- Charlotte Rat
- Department of Biotechnology & BiophysicsJulius‐Maximilians‐University WürzburgWürzburgGermany
| | - Cedric Heindl
- Department of Biotechnology & BiophysicsJulius‐Maximilians‐University WürzburgWürzburgGermany
| | - Hannes Neuweiler
- Department of Biotechnology & BiophysicsJulius‐Maximilians‐University WürzburgWürzburgGermany
| |
Collapse
|
17
|
Salicari L, Baiesi M, Orlandini E, Trovato A. Folding kinetics of an entangled protein. PLoS Comput Biol 2023; 19:e1011107. [PMID: 37956216 PMCID: PMC10681328 DOI: 10.1371/journal.pcbi.1011107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 11/27/2023] [Accepted: 11/02/2023] [Indexed: 11/15/2023] Open
Abstract
The possibility of the protein backbone adopting lasso-like entangled motifs has attracted increasing attention. After discovering the surprising abundance of natively entangled protein domain structures, it was shown that misfolded entangled subpopulations might become thermosensitive or escape the homeostasis network just after translation. To investigate the role of entanglement in shaping folding kinetics, we introduce a novel indicator and analyze simulations of a coarse-grained, structure-based model for two small single-domain proteins. The model recapitulates the well-known two-state folding mechanism of a non-entangled SH3 domain. However, despite its small size, a natively entangled antifreeze RD1 protein displays a rich refolding behavior, populating two distinct kinetic intermediates: a short-lived, entangled, near-unfolded state and a longer-lived, non-entangled, near-native state. The former directs refolding along a fast pathway, whereas the latter is a kinetic trap, consistently with known experimental evidence of two different characteristic times. Upon trapping, the natively entangled loop folds without being threaded by the N-terminal residues. After trapping, the native entangled structure emerges by either backtracking to the unfolded state or threading through the already formed but not yet entangled loop. Along the fast pathway, trapping does not occur because the native contacts at the closure of the lasso-like loop fold after those involved in the N-terminal thread, confirming previous predictions. Despite this, entanglement may appear already in unfolded configurations. Remarkably, a longer-lived, near-native intermediate, with non-native entanglement properties, recalls what was observed in cotranslational folding.
Collapse
Affiliation(s)
- Leonardo Salicari
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| | - Marco Baiesi
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| | - Enzo Orlandini
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| | - Antonio Trovato
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| |
Collapse
|
18
|
Ingraham JB, Baranov M, Costello Z, Barber KW, Wang W, Ismail A, Frappier V, Lord DM, Ng-Thow-Hing C, Van Vlack ER, Tie S, Xue V, Cowles SC, Leung A, Rodrigues JV, Morales-Perez CL, Ayoub AM, Green R, Puentes K, Oplinger F, Panwar NV, Obermeyer F, Root AR, Beam AL, Poelwijk FJ, Grigoryan G. Illuminating protein space with a programmable generative model. Nature 2023; 623:1070-1078. [PMID: 37968394 PMCID: PMC10686827 DOI: 10.1038/s41586-023-06728-8] [Citation(s) in RCA: 46] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 10/06/2023] [Indexed: 11/17/2023]
Abstract
Three billion years of evolution has produced a tremendous diversity of protein molecules1, but the full potential of proteins is likely to be much greater. Accessing this potential has been challenging for both computation and experiments because the space of possible protein molecules is much larger than the space of those likely to have functions. Here we introduce Chroma, a generative model for proteins and protein complexes that can directly sample novel protein structures and sequences, and that can be conditioned to steer the generative process towards desired properties and functions. To enable this, we introduce a diffusion process that respects the conformational statistics of polymer ensembles, an efficient neural architecture for molecular systems that enables long-range reasoning with sub-quadratic scaling, layers for efficiently synthesizing three-dimensional structures of proteins from predicted inter-residue geometries and a general low-temperature sampling algorithm for diffusion models. Chroma achieves protein design as Bayesian inference under external constraints, which can involve symmetries, substructure, shape, semantics and even natural-language prompts. The experimental characterization of 310 proteins shows that sampling from Chroma results in proteins that are highly expressed, fold and have favourable biophysical properties. The crystal structures of two designed proteins exhibit atomistic agreement with Chroma samples (a backbone root-mean-square deviation of around 1.0 Å). With this unified approach to protein design, we hope to accelerate the programming of protein matter to benefit human health, materials science and synthetic biology.
Collapse
Affiliation(s)
| | | | | | | | - Wujie Wang
- Generate Biomedicines, Somerville, MA, USA
| | | | | | | | | | | | - Shan Tie
- Generate Biomedicines, Somerville, MA, USA
| | | | | | - Alan Leung
- Generate Biomedicines, Somerville, MA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Dias RVR, Pedro RP, Sanches MN, Moreira GC, Leite VBP, Caruso IP, de Melo FA, de Oliveira LC. Unveiling Metastable Ensembles of GRB2 and the Relevance of Interdomain Communication during Folding. J Chem Inf Model 2023; 63:6344-6353. [PMID: 37824286 DOI: 10.1021/acs.jcim.3c00955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
The folding process of multidomain proteins is a highly intricate phenomenon involving the assembly of distinct domains into a functional three-dimensional structure. During this process, each domain may fold independently while interacting with others. The folding of multidomain proteins can be influenced by various factors, including their composition, the structure of each domain, or the presence of disordered regions, as well as the surrounding environment. Misfolding of multidomain proteins can lead to the formation of nonfunctional structures associated with a range of diseases, including cancers or neurodegenerative disorders. Understanding this process is an important step for many biophysical analyses such as stability, interaction, malfunctioning, and rational drug design. One such multidomain protein is growth factor receptor-bound protein 2 (GRB2), an adaptor protein that is essential in regulating cell survival. GRB2 consists of one central Src homology 2 (SH2) domain flanked by two Src homology 3 (SH3) domains. The SH2 domain interacts with phosphotyrosine regions in other proteins, while the SH3 domains recognize proline-rich regions on protein partners during cell signaling. Here, we combined computational and experimental techniques to investigate the folding process of GRB2. Through computational simulations, we sampled the conformational space and mapped the mechanisms involved by the free energy profiles, which may indicate possible intermediate states. From the molecular dynamics trajectories, we used the energy landscape visualization method (ELViM), which allowed us to visualize a three-dimensional (3D) representation of the overall energy surface. We identified two possible parallel folding routes that cannot be seen in a one-dimensional analysis, with one occurring more frequently during folding. Supporting these results, we used differential scanning calorimetry (DSC) and fluorescence spectroscopy techniques to confirm these intermediate states in vitro. Finally, we analyzed the deletion of domains to compare our model outputs to previously published results, supporting the presence of interdomain modulation. Overall, our study highlights the significance of interdomain communication within the GRB2 protein and its impact on the formation, stability, and structural plasticity of the protein, which are crucial for its interaction with other proteins in key signaling pathways.
Collapse
Affiliation(s)
- Raphael V R Dias
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
- Multiuser Center for Biomolecular Innovation (CMIB), São Paulo State University (UNESP), São José do Rio Preto, SP 15054-000, Brazil
| | - Renan P Pedro
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
- Multiuser Center for Biomolecular Innovation (CMIB), São Paulo State University (UNESP), São José do Rio Preto, SP 15054-000, Brazil
| | - Murilo N Sanches
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
| | - Giovana C Moreira
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
- Multiuser Center for Biomolecular Innovation (CMIB), São Paulo State University (UNESP), São José do Rio Preto, SP 15054-000, Brazil
| | - Vitor B P Leite
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
| | - Icaro P Caruso
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
- Multiuser Center for Biomolecular Innovation (CMIB), São Paulo State University (UNESP), São José do Rio Preto, SP 15054-000, Brazil
| | - Fernando A de Melo
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
- Multiuser Center for Biomolecular Innovation (CMIB), São Paulo State University (UNESP), São José do Rio Preto, SP 15054-000, Brazil
| | - Leandro C de Oliveira
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities, and Exact Sciences, São José do Rio Preto, SP 15054-000, Brazil
| |
Collapse
|
20
|
Casier R, Duhamel J. Appraisal of blob-Based Approaches in the Prediction of Protein Folding Times. J Phys Chem B 2023; 127:8852-8859. [PMID: 37793094 DOI: 10.1021/acs.jpcb.3c04958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023]
Abstract
A series of reports published in the last 3 years has illustrated that a blob-based model (BBM) can predict the folding time of proteins from their primary amino acid (aa) sequence based on three simple rules established to characterize the long-range backbone dynamics (LRBD) of racemic polypeptides. The sole use of LRBD to predict protein folding times with the BBM represents a radical departure from all other prediction methods currently applied to determine protein folding times, which rely instead on parameters such as the structure content, folding kinetics, chain length, amino acid properties, or contact topography of proteins. Furthermore, the built-in modularity of the BBM enables the parametrization and inclusion of new phenomena affecting the LRBD of polypeptides, while its conceptual simplicity makes it an interesting new mathematical tool for studying protein folding. However, its novelty implies that its relationship with many other methods used to predict protein folding times has not been well researched. Consequently, the purpose of this report is to uncover the physical phenomena encountered during protein folding that are best described by the BBM through the identification of parameters that have been recognized over the years as being strong predictors for protein folding, such as protein size, topology, structural class, and folding kinetics. This was accomplished by determining the parameters most strongly correlated with the folding times predicted by the BBM. While the BBM in its present form appears to be a good indicator of the folding times of the vast majority of the 195 proteins considered so far, this report finds that it excels for moderately large proteins that are primarily composed of locally formed structural motifs such as α-helices or for proteins that fold in multiple steps. Altogether, these observations based on the use of the BBM support the notion that proteins fold the way they do because the LRBD of polypeptides is mostly driven by the local interactions experienced between aa's within reach of one another.
Collapse
Affiliation(s)
- Remi Casier
- Institute for Polymer Research, Waterloo Institute for Nanotechnology, Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L3G1, Canada
| | - Jean Duhamel
- Institute for Polymer Research, Waterloo Institute for Nanotechnology, Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L3G1, Canada
| |
Collapse
|
21
|
Turina P, Fariselli P, Capriotti E. K-Pro: Kinetics Data on Proteins and Mutants. J Mol Biol 2023; 435:168245. [PMID: 37625584 DOI: 10.1016/j.jmb.2023.168245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 08/16/2023] [Accepted: 08/17/2023] [Indexed: 08/27/2023]
Abstract
The study of protein folding plays a crucial role in improving our understanding of protein function and of the relationship between genetics and phenotypes. In particular, understanding the thermodynamics and kinetics of the folding process is important for uncovering the mechanisms behind human disorders caused by protein misfolding. To address this issue, it is essential to collect and curate experimental kinetic and thermodynamic data on protein folding. K-Pro is a new database designed for collecting and storing experimental kinetic data on monomeric proteins, with a two-state folding mechanism. With 1,529 records from 62 proteins corresponding to 65 structures, K-Pro contains various kinetic parameters such as the logarithm of the folding and unfolding rates, Tanford's β and the ϕ values. When available, the database also includes thermodynamic parameters associated with the kinetic data. K-Pro features a user-friendly interface that allows browsing and downloading kinetic data of interest. The graphical interface provides a visual representation of the protein and mutants, and it is cross-linked to key databases such as PDB, UniProt, and PubMed. K-Pro is open and freely accessible through https://folding.biofold.org/k-pro and supports the latest versions of popular browsers.
Collapse
Affiliation(s)
- Paola Turina
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via F. Selmi 3, 40126 Bologna, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Via Santena 19, 10126 Torino, Italy
| | - Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via F. Selmi 3, 40126 Bologna, Italy.
| |
Collapse
|
22
|
Ramakrishna Reddy P, Kulandaisamy A, Michael Gromiha M. TMH Stab-pred: Predicting the stability of α-helical membrane proteins using sequence and structural features. Methods 2023; 218:118-124. [PMID: 37572768 DOI: 10.1016/j.ymeth.2023.08.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Revised: 08/02/2023] [Accepted: 08/04/2023] [Indexed: 08/14/2023] Open
Abstract
The folding and stability of transmembrane proteins (TMPs) are governed by the insertion of secondary structural elements into the cell membrane followed by their assembly. Understanding the important features that dictate the stability of TMPs is important for elucidating their functions. In this work, we related sequence and structure-based parameters with free energy (ΔG0) of α-helical membrane proteins. Our results showed that the free energy transfer of hydrophobic peptides, relative contact order, total interaction energy, number of hydrogen bonds and lipid accessibility of transmembrane regions are important for stability. Further, we have developed multiple-regression models to predict the stability of α-helical membrane proteins using these features and our method can predict the stability with a correlation and mean absolute error (MAE) of 0.89 and 1.21 kcal/mol, respectively, on jack-knife test. The method was validated with a blind test set of three recently reported experimental ΔG0, which could predict the stability within an average MAE of 0.51 kcal/mol. Further, we developed a webserver for predicting the stability and it is freely available at (https://web.iitm.ac.in/bioinfo2/TMHS/). The importance of selected parameters and limitations are discussed.
Collapse
Affiliation(s)
- P Ramakrishna Reddy
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - A Kulandaisamy
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India; Basic and Translational Research Division, Department of Cardiology, Boston Children's Hospital, Boston, MA 02115, USA
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India; Department of Computer Science, Tokyo Institute of Technology, Yokohama, Japan; Department of Computer Science, National University of Singapore, Singapore.
| |
Collapse
|
23
|
Liu F, Dang Y, Li L, Feng H, Li J, Wang H, Zhang X, Zhang Z, Ye S, Tian Y, Chen Q. Structure and mechanism of a neuropeptide-activated channel in the ENaC/DEG superfamily. Nat Chem Biol 2023; 19:1276-1285. [PMID: 37550431 DOI: 10.1038/s41589-023-01401-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 07/10/2023] [Indexed: 08/09/2023]
Abstract
Phe-Met-Arg-Phe-amide (FMRFamide)-activated sodium channels (FaNaCs) are a family of channels activated by the neuropeptide FMRFamide, and, to date, the underlying ligand gating mechanism remains unknown. Here we present the high-resolution cryo-electron microscopy structures of Aplysia californica FaNaC in both apo and FMRFamide-bound states. AcFaNaC forms a chalice-shaped trimer and possesses several notable features, including two FaNaC-specific insertion regions, a distinct finger domain and non-domain-swapped transmembrane helix 2 in the transmembrane domain (TMD). One FMRFamide binds to each subunit in a cleft located in the top-most region of the extracellular domain, with participation of residues from the neighboring subunit. Bound FMRFamide adopts an extended conformation. FMRFamide binds tightly to A. californica FaNaC in an N terminus-in manner, which causes collapse of the binding cleft and induces large local conformational rearrangements. Such conformational changes are propagated downward toward the TMD via the palm domain, possibly resulting in outward movement of the TMD and dilation of the ion conduction pore.
Collapse
Affiliation(s)
- Fenglian Liu
- State Key Laboratory of Conservation and Utilization of Bio-Resources in Yunnan and Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, China
| | - Yu Dang
- State Key Laboratory of Membrane Biology, Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Lu Li
- Academy of Medical Engineering and Translational Medicine (AMT) and Tianjin Key Laboratory of Brain Science and Neural Engineering, Tianjin University, Tianjin, China
| | - Hao Feng
- Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin Key Laboratory of Function and Application of Biological Macromolecular Structures, School of Life Sciences, Tianjin University, Tianjin, China
| | - Jianlin Li
- Academy of Medical Engineering and Translational Medicine (AMT) and Tianjin Key Laboratory of Brain Science and Neural Engineering, Tianjin University, Tianjin, China
| | - Haowei Wang
- State Key Laboratory of Conservation and Utilization of Bio-Resources in Yunnan and Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, China
| | - Xu Zhang
- State Key Laboratory of Conservation and Utilization of Bio-Resources in Yunnan and Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, China
| | - Zhe Zhang
- State Key Laboratory of Membrane Biology, Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China.
- Center for Life Sciences, School of Life Sciences, Peking University, Beijing, China.
| | - Sheng Ye
- Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin Key Laboratory of Function and Application of Biological Macromolecular Structures, School of Life Sciences, Tianjin University, Tianjin, China.
| | - Yutao Tian
- Academy of Medical Engineering and Translational Medicine (AMT) and Tianjin Key Laboratory of Brain Science and Neural Engineering, Tianjin University, Tianjin, China.
| | - Qingfeng Chen
- State Key Laboratory of Conservation and Utilization of Bio-Resources in Yunnan and Center for Life Sciences, School of Life Sciences, Yunnan University, Kunming, China.
| |
Collapse
|
24
|
Roel-Touris J, Nadal M, Marcos E. Single-chain dimers from de novo immunoglobulins as robust scaffolds for multiple binding loops. Nat Commun 2023; 14:5939. [PMID: 37741853 PMCID: PMC10517939 DOI: 10.1038/s41467-023-41717-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 09/15/2023] [Indexed: 09/25/2023] Open
Abstract
Antibody derivatives have sought to recapitulate the antigen binding properties of antibodies, but with improved biophysical attributes convenient for therapeutic, diagnostic and research applications. However, their success has been limited by the naturally occurring structure of the immunoglobulin dimer displaying hypervariable binding loops, which is hard to modify by traditional engineering approaches. Here, we devise geometrical principles for de novo designing single-chain immunoglobulin dimers, as a tunable two-domain architecture that optimizes biophysical properties through more favorable dimer interfaces. Guided by these principles, we computationally designed protein scaffolds that were hyperstable, structurally accurate and robust for accommodating multiple functional loops, both individually and in combination, as confirmed through biochemical assays and X-ray crystallography. We showcase the modularity of this architecture by deep-learning-based diversification, opening up the possibility for tailoring the number, positioning, and relative orientation of ligand-binding loops targeting one or two distal epitopes. Our results provide a route to custom-design robust protein scaffolds for harboring multiple functional loops.
Collapse
Affiliation(s)
- Jorge Roel-Touris
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB), CSIC, Baldiri Reixac 10, 08028, Barcelona, Spain
| | - Marta Nadal
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB), CSIC, Baldiri Reixac 10, 08028, Barcelona, Spain
| | - Enrique Marcos
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB), CSIC, Baldiri Reixac 10, 08028, Barcelona, Spain.
| |
Collapse
|
25
|
Huang Z, Cui X, Xia Y, Zhao K, Zhang G. Pathfinder: Protein folding pathway prediction based on conformational sampling. PLoS Comput Biol 2023; 19:e1011438. [PMID: 37695768 PMCID: PMC10513300 DOI: 10.1371/journal.pcbi.1011438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 09/21/2023] [Accepted: 08/17/2023] [Indexed: 09/13/2023] Open
Abstract
The study of protein folding mechanism is a challenge in molecular biology, which is of great significance for revealing the movement rules of biological macromolecules, understanding the pathogenic mechanism of folding diseases, and designing protein engineering materials. Based on the hypothesis that the conformational sampling trajectory contain the information of folding pathway, we propose a protein folding pathway prediction algorithm named Pathfinder. Firstly, Pathfinder performs large-scale sampling of the conformational space and clusters the decoys obtained in the sampling. The heterogeneous conformations obtained by clustering are named seed states. Then, a resampling algorithm that is not constrained by the local energy basin is designed to obtain the transition probabilities of seed states. Finally, protein folding pathways are inferred from the maximum transition probabilities of seed states. The proposed Pathfinder is tested on our developed test set (34 proteins). For 11 widely studied proteins, we correctly predicted their folding pathways and specifically analyzed 5 of them. For 13 proteins, we predicted their folding pathways to be further verified by biological experiments. For 6 proteins, we analyzed the reasons for the low prediction accuracy. For the other 4 proteins without biological experiment results, potential folding pathways were predicted to provide new insights into protein folding mechanism. The results reveal that structural analogs may have different folding pathways to express different biological functions, homologous proteins may contain common folding pathways, and α-helices may be more prone to early protein folding than β-strands.
Collapse
Affiliation(s)
- Zhaohong Huang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, China
| | - Xinyue Cui
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, China
| | - Yuhao Xia
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, China
| | - Kailong Zhao
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, China
| | - Guijun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, China
| |
Collapse
|
26
|
Nikolsky KS, Kulikova LI, Petrovskiy DV, Rudnev VR, Butkova TV, Malsagova KA, Kopylov AT, Kaysheva AL. Three-helix bundle and SH3-type barrels: autonomously stable structural motifs in small and large proteins. J Biomol Struct Dyn 2023:1-15. [PMID: 37640007 DOI: 10.1080/07391102.2023.2250450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 08/12/2023] [Indexed: 08/31/2023]
Abstract
In this study, we investigated two variants of a three-helix bundle and SH3-type barrel, compact in space, present in small and large proteins of various living organisms. Using a neural graph network, proteins with three-helix bundle (n = 1377) and SH3-type barrels (n = 1914) spatial folds were selected. Molecular experiments were performed for small proteins with these folds, and motifs were studied autonomously outside the protein environment at 300, 340, and 370 K. A comparative analysis of the main parameters of the structures in the course of the experiment was performed, including gyration radius, area accessible to the solvent, number of hydrophobic and hydrogen bonds, and root-mean-square deviation of atomic positions (RMSD). We exhibited an autonomous stability of the studied folds outside the protein environment in an aquatic medium. We aimed to demonstrate the possibility of analyzing three-helix bundle and SH3-type barrels autonomously outside the protein globule, thereby reducing the computational time and increasing performance without significant loss of information.Communicated by Ramaswamy H. Sarma.
Collapse
|
27
|
Aina A, Hsueh SCC, Gibbs E, Peng X, Cashman NR, Plotkin SS. De Novo Design of a β-Helix Tau Protein Scaffold: An Oligomer-Selective Vaccine Immunogen Candidate for Alzheimer's Disease. ACS Chem Neurosci 2023; 14:2603-2617. [PMID: 37458595 DOI: 10.1021/acschemneuro.3c00007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/03/2023] Open
Abstract
Tau pathology is associated with many neurodegenerative disorders, including Alzheimer's disease (AD), where the spatio-temporal pattern of tau neurofibrillary tangles strongly correlates with disease progression, which motivates therapeutics selective for misfolded tau. Here, we introduce a new avidity-enhanced, multi-epitope approach for protein-misfolding immunogen design, which is predicted to mimic the conformational state of an exposed epitope in toxic tau oligomers. A predicted oligomer-selective tau epitope 343KLDFK347 was scaffolded by designing a β-helix structure that incorporated multiple instances of the 16-residue tau fragment 339VKSEKLDFKDRVQSKI354. Large-scale conformational ensemble analyses involving Jensen-Shannon Divergence and the embedding depth D showed that the multi-epitope scaffolding approach, employed in designing the β-helix scaffold, was predicted to better discriminate toxic tau oligomers than other "monovalent" strategies utilizing a single instance of an epitope for vaccine immunogen design. Using Rosetta, 10,000 sequences were designed and screened for the linker portions of the β-helix scaffold, along with a C-terminal stabilizing α-helix that interacts with the linkers, to optimize the folded structure and stability of the scaffold. Structures were ranked by energy, and the lowest 1% (82 unique sequences) were verified using AlphaFold. Several selection criteria involving AlphaFold are implemented to obtain a lead-designed sequence. The structure was further predicted to have free energetic stability by using Hamiltonian replica exchange molecular dynamics (MD) simulations. The synthesized β-helix scaffold showed direct binding in surface plasmon resonance (SPR) experiments to several antibodies that were raised to the structured epitope using a designed cyclic peptide. Moreover, the strength of binding of these antibodies to in vitro tau oligomers correlated with the strength of binding to the β-helix construct, suggesting that the construct presents an oligomer-like conformation and may thus constitute an effective oligomer-selective immunogen.
Collapse
Affiliation(s)
- Adekunle Aina
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada
| | - Shawn C C Hsueh
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada
| | - Ebrima Gibbs
- Djavad Mowafaghian Centre for Brain Health, The University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada
| | - Xubiao Peng
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada
| | - Neil R Cashman
- Djavad Mowafaghian Centre for Brain Health, The University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada
| | - Steven S Plotkin
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada
- Genome Science and Technology Program, The University of British Columbia, Vancouver, British Columbia V6T 1Z1, Canada
| |
Collapse
|
28
|
Wijker S, Palmans ARA. Protein-Inspired Control over Synthetic Polymer Folding for Structured Functional Nanoparticles in Water. Chempluschem 2023; 88:e202300260. [PMID: 37417828 DOI: 10.1002/cplu.202300260] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 07/06/2023] [Accepted: 07/06/2023] [Indexed: 07/08/2023]
Abstract
The folding of proteins into functional nanoparticles with defined 3D structures has inspired chemists to create simple synthetic systems mimicking protein properties. The folding of polymers into nanoparticles in water proceeds via different strategies, resulting in the global compaction of the polymer chain. Herein, we review the different methods available to control the conformation of synthetic polymers and collapse/fold them into structured, functional nanoparticles, such as hydrophobic collapse, supramolecular self-assembly, and covalent cross-linking. A comparison is made between the design principles of protein folding to synthetic polymer folding and the formation of structured nanocompartments in water, highlighting similarities and differences in design and function. We also focus on the importance of structure for functional stability and diverse applications in complex media and cellular environments.
Collapse
Affiliation(s)
- Stefan Wijker
- Institute for Complex Molecular Systems, Laboratory of Macromolecular and Organic Chemistry, Eindhoven University of Technology, 5600 MB, Eindhoven, The Netherlands
| | - Anja R A Palmans
- Institute for Complex Molecular Systems, Laboratory of Macromolecular and Organic Chemistry, Eindhoven University of Technology, 5600 MB, Eindhoven, The Netherlands
| |
Collapse
|
29
|
Roy A, Ray S. An in-silico study to understand the effect of lineage diversity on cold shock response: unveiling protein-RNA interactions among paralogous CSPs of E. coli. 3 Biotech 2023; 13:236. [PMID: 37333716 PMCID: PMC10272043 DOI: 10.1007/s13205-023-03656-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 05/30/2023] [Indexed: 06/20/2023] Open
Abstract
Cold shock proteins (CSPs) are small, cytoplasmic, ubiquitous and acidic proteins. They have a single nucleic acid-binding domain and pose as "RNA chaperones" by binding to ssRNA in a low sequence specificity and cooperative manner. They are found in a family of nine homologous CSPs in E. coli. CspA, CspB, CspG and CspI are immensely cold inducible, CspE and CspC are consistently released at usual physiological temperatures and CspD is also induced under nutrient stress. The paralogous protein pairs CSPA/CSPB, CSPC/CSPE, CSPG/CSPI and CSPF/CSPH were first identified. The eight proteins were subjected to molecular modelling and simulation to obtain the most stable conformation in correspondence to their equilibrated RMSD and RMSF graph. The results were compared and it was observed that CSPB, CSPE, CSPF and CSPI were more stable than their paralogous partner conforming to their near equilibrated RMSD curve and low fluctuating RMSF graph. The paralogous proteins were docked with ssRNA and simultaneously binding affinity, interaction types, electrostatic surface potential, hydrophobicity, conformational analysis and SASA were calculated to minutely study and understand the molecular mechanism initiated by these proteins. It was found that CSPB, CSPC, CSPH and CSPI displayed higher affinity towards ssRNA than their paralogous partner. The results further corroborated with ΔGmmgbsa and ΔGfold energy. Between the paralogous pairs CSPC, CSPH and CSPI exhibited higher binding free energy than their partner. Further, CSPB, CSPC and CSPI exhibited higher folding free energy than their paralogous pair. CSPH exhibited highest ΔGmmgbsa of - 522.2 kcal/mol and lowest was displayed by CSPG of around - 309.3 kcal/mol. Highest number of mutations were recognised in CSPF/CSPH and CSPG/CSPI pair. Difference in interaction pattern was maximum in CSPF/CSPH owing to their high number of non-synonymous substitutions. Maximum difference in surface electrostatic potential was observed in case of CSPA, CSPG and CSPF. This research work emphasizes on discerning the molecular mechanism initiated by these proteins with a structural, mutational and functional approach. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-023-03656-2.
Collapse
Affiliation(s)
- Alankar Roy
- Amity Institute of Biotechnology, Amity University, Kolkata, India
| | - Sujay Ray
- Amity Institute of Biotechnology, Amity University, Kolkata, India
| |
Collapse
|
30
|
Xiao N, Ma H, Gao H, Yang J, Tong D, Gan D, Yang J, Li C, Liu K, Li Y, Chen Z, Yin C, Li X, Wang H. Structure-function crosstalk in liver cancer research: Protein structuromics. Int J Biol Macromol 2023:125291. [PMID: 37315670 DOI: 10.1016/j.ijbiomac.2023.125291] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 06/04/2023] [Accepted: 06/07/2023] [Indexed: 06/16/2023]
Abstract
Liver cancer can be primary (starting in the liver) or secondary (cancer that has spread from elsewhere to the liver, known as liver metastasis). Liver metastasis is more common than primary liver cancer. Despite great advances in molecular biology methods and treatments, liver cancer is still associated with a poor survival rate and a high death rate, and there is no cure. Many questions remain regarding the mechanisms of liver cancer occurrence and development as well as tumor reoccurrence after treatment. In this study, we assessed the protein structural features of 20 oncogenes and 20 anti-oncogenes via protein structure and dynamic analysis methods and 3D structural and systematic analyses of the structure-function relationships of proteins. Our aim was to provide new insights that may inform research on the development and treatment of liver cancer.
Collapse
Affiliation(s)
- Nan Xiao
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China.
| | - Hongming Ma
- Department of Oncology, China Emergency General Hospital City, Beijing, China
| | - Hong Gao
- Department of Oncology, China Emergency General Hospital City, Beijing, China
| | - Jing Yang
- Department of Computer Center, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Dan Tong
- Department of Nurse, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Dingzhu Gan
- Department of Publicity, Peking Union Medical College, Beijing, China
| | - Jinhua Yang
- Department of Development and Production, Institute of Medical Biology, Peking Union Medical College, Kunming City, Yunnan Province, China
| | - Chi Li
- Department of Anesthesiology, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Kang Liu
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Yingxin Li
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Zhibo Chen
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Chaoqun Yin
- Department of Medical Science, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Xingqi Li
- Department of Medicine, Medical College of Jinzhou Medical University, Jinzhou City, Liaoning Province, China
| | - Hongwu Wang
- Department of Respiratory and Critical Care Medicine, Dongzhimen Hospital Affiliated to Beijing University of Chinese Medicine, Beijing, China
| |
Collapse
|
31
|
Das D, Yadav P, Mitra S, Ainavarapu SRK. Metal-binding and circular permutation-dependent thermodynamic and kinetic stability of azurin. Proteins 2023; 91:634-648. [PMID: 36511110 DOI: 10.1002/prot.26454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Revised: 11/18/2022] [Accepted: 11/25/2022] [Indexed: 12/15/2022]
Abstract
Native topology is known to determine the folding kinetics and the energy landscape of proteins. Furthermore, the circular permutation (CP) of proteins alters the order of the secondary structure connectivity while retaining the three-dimensional structure, making it an elegant and powerful approach to altering native topology. Previous studies elucidated the influence of CP in proteins with different folds such as Greek key β-barrel, β-sandwich, β-α-β, and all α-Greek key. CP mainly affects the protein stability and unfolding kinetics, while folding kinetics remains mostly unaltered. However, the effect of CP on metalloproteins is yet to be elaborately studied. The active site of metalloproteins poses an additional complexity in studying protein folding. Here, we investigate a CP variant (cpN42) of azurin-in both metal-free and metal-bound (holo) forms. As observed earlier in other proteins, apo-forms of wild-type (WT) and cpN42 fold with similar rates. In contrast, zinc-binding accelerates the folding of WT but decelerates the folding of cpN42. On zinc-binding, the spontaneous folding rate of WT increases by >250 times that of cpN42, which is unprecedented and the highest for any CP to date. On the other hand, zinc-binding reduces the spontaneous unfolding rate of cpN42 by ~100 times, making the WT and CP azurins unfold at similar rates. Our study demonstrates metal binding as a novel way to modulate the unfolding and folding rates of CPs compared to their WT counterparts. We hope our study increases the understanding of the effect of CP on the folding mechanism and energy landscape of metalloproteins.
Collapse
Affiliation(s)
- Debanjana Das
- Department of Chemical Sciences, Dr. Homi Bhabha Road, Colaba, Tata Institute of Fundamental Research, Mumbai, India
| | - Priya Yadav
- Department of Chemical Sciences, Dr. Homi Bhabha Road, Colaba, Tata Institute of Fundamental Research, Mumbai, India
| | - Soumyajit Mitra
- Department of Chemical Sciences, Dr. Homi Bhabha Road, Colaba, Tata Institute of Fundamental Research, Mumbai, India
| | - Sri Rama Koti Ainavarapu
- Department of Chemical Sciences, Dr. Homi Bhabha Road, Colaba, Tata Institute of Fundamental Research, Mumbai, India
| |
Collapse
|
32
|
Lauzon D, Vallée-Bélisle A. Functional advantages of building nanosystems using multiple molecular components. Nat Chem 2023; 15:458-467. [PMID: 36759713 DOI: 10.1038/s41557-022-01127-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Accepted: 12/15/2022] [Indexed: 02/11/2023]
Abstract
Over half of all the natural nanomachines in living organisms are multimeric and likely exploit the self-assembly of their components to provide functional benefits. However, the advantages and disadvantages of building nanosystems using multiple molecular components remain relatively unexplored at the thermodynamic, kinetic and functional levels. In this study we used theory and a simple DNA-based model that forms the same nanostructures with different numbers of components to advance our knowledge in this area. Despite its lower assembly rate, we found that a system built with three components may undergo a more cooperative assembly transition from less preorganized components, which facilitates the emergence of functionalities. Using simple variations of its components, we also found that trimeric nanosystems display a much higher level of programmability than their dimeric counterparts because they can assemble with various levels of cooperativity, self-inhibition and time-dependent properties. We show here how two simple strategies (for example, cutting and adding components) can be employed to efficiently programme the regulatory function of a more complex, artificially selected, RNA-cleaving catalytic nanosystem.
Collapse
Affiliation(s)
- D Lauzon
- Laboratoire de Biosenseurs & Nanomachines, Département de Chimie, Université de Montréal, Montréal, Québec, Canada
| | - A Vallée-Bélisle
- Laboratoire de Biosenseurs & Nanomachines, Département de Chimie, Université de Montréal, Montréal, Québec, Canada.
| |
Collapse
|
33
|
Nithiyanandam S, Sangaraju VK, Manavalan B, Lee G. Computational prediction of protein folding rate using structural parameters and network centrality measures. Comput Biol Med 2023; 155:106436. [PMID: 36848800 DOI: 10.1016/j.compbiomed.2022.106436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 11/28/2022] [Accepted: 12/13/2022] [Indexed: 02/17/2023]
Abstract
Protein folding is a complex physicochemical process whereby a polymer of amino acids samples numerous conformations in its unfolded state before settling on an essentially unique native three-dimensional (3D) structure. To understand this process, several theoretical studies have used a set of 3D structures, identified different structural parameters, and analyzed their relationships using the natural logarithmic protein folding rate (ln(kf)). Unfortunately, these structural parameters are specific to a small set of proteins that are not capable of accurately predicting ln(kf) for both two-state (TS) and non-two-state (NTS) proteins. To overcome the limitations of the statistical approach, a few machine learning (ML)-based models have been proposed using limited training data. However, none of these methods can explain plausible folding mechanisms. In this study, we evaluated the predictive capabilities of ten different ML algorithms using eight different structural parameters and five different network centrality measures based on newly constructed datasets. In comparison to the other nine regressors, support vector machine was found to be the most appropriate for predicting ln(kf) with mean absolute differences of 1.856, 1.55, and 1.745 for the TS, NTS, and combined datasets, respectively. Furthermore, combining structural parameters and network centrality measures improves the prediction performance compared to individual parameters, indicating that multiple factors are involved in the folding process.
Collapse
Affiliation(s)
- Saraswathy Nithiyanandam
- Department of Molecular Science and Technology, Ajou University, 206 World Cup-ro, Suwon, 16499, South Korea
| | - Vinoth Kumar Sangaraju
- Department of Physiology, Ajou University School of Medicine, 206 World Cup-ro, Suwon, 16499, South Korea
| | - Balachandran Manavalan
- Department of Physiology, Ajou University School of Medicine, 206 World Cup-ro, Suwon, 16499, South Korea.
| | - Gwang Lee
- Department of Molecular Science and Technology, Ajou University, 206 World Cup-ro, Suwon, 16499, South Korea; Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, 16419, Gyeonggi-do, South Korea.
| |
Collapse
|
34
|
Wand AJ. Deep mining of the protein energy landscape. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2023; 10:020901. [PMID: 37124940 PMCID: PMC10147411 DOI: 10.1063/4.0000180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 03/30/2023] [Indexed: 05/03/2023]
Abstract
For over half a century, it has been known that protein molecules naturally undergo extensive structural fluctuations, and that these internal motions are intimately related to their functional properties. The energy landscape view has provided a powerful framework for describing the various physical states that proteins visit during their lifetimes. This Perspective focuses on the commonly neglected and often disparaged axis of the protein energy landscape: entropy. Initially seen largely as a barrier to functionally relevant states of protein molecules, it has recently become clear that proteins retain considerable conformational entropy in the "native" state, and that this entropy can and often does contribute significantly to the free energy of fundamental protein properties, processes, and functions. NMR spectroscopy, molecular dynamics simulations, and emerging crystallographic views have matured in parallel to illuminate dynamic disorder of the "ground state" of proteins and their importance in not only transiting between biologically interesting structures but also greatly influencing their stability, cooperativity, and contribution to critical properties such as allostery.
Collapse
|
35
|
Casier R, Duhamel J. Synergetic Effects of Alanine and Glycine in Blob-Based Methods for Predicting Protein Folding Times. J Phys Chem B 2023; 127:1325-1337. [PMID: 36749707 DOI: 10.1021/acs.jpcb.2c08155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
The polypeptide PGlyAlaGlu was prepared with 20 mol % glycine (Gly), 36 mol % d,l-alanine (Ala), and 44 mol % d,l-glutamic acid (Glu) and labeled with the dye 1-pyrenemethylamine to yield a series of Py-PGlyAlaGlu samples. The fluorescence decays of the Py-PGlyAlaGlu samples were analyzed according to the fluorescence blob model (FBM) to obtain the number Nblobexp of amino acids (aa's) encompassed inside the subvolume Vblob of the polypeptide probed by an excited pyrene. An Nblobexp value of 29 (±2) was retrieved for Py-PGlyAlaGlu, which was much larger than for any of the copolypeptide PGlyGlu or PAlaGlu prepared with either Gly and Glu or Ala and Glu, respectively. The continuous increase in Nblobexp with decreasing side chain size (SCS) from 10 aa's for PGlu to 16 aa's for PAlaGlu and 22 aa's for PGlyGlu was used earlier to define the reach of an aa and determine the groups of aa's that could interact with each other along a polypeptide backbone according to their SCS. These groups of aa's, referred to as blobs, led to the implementation of blob-based models (BBM) to predict the folding time τFtheo,BBM of 145 proteins, which was found to match their experimental folding time τFexp with a relatively high 0.71 correlation coefficient. Nevertheless, the much higher Nblobexp value found for Py-PGlyAlaGlu compared to all other pyrene-labeled polypeptides studied to date indicates that the reach of aa's along a polypeptide sequence is affected not only by SCS but also by synergetic effects between different aa's. Following this new insight, a revised BBM was implemented to predict τFtheo,BBM for 195 proteins assuming the existence or absence of synergies to control the interactions between aa's along a polypeptide sequence. Similarly good correlation coefficients of 0.71 and 0.74 were obtained for a direct 1:1 comparison of τFexp and τFtheo,BBM for the 195 proteins without and with synergies, respectively. This result suggests that synergetic effects between different aa's have little effect on τFtheo,BBM predicted from BBM underlying the robustness of this methodology.
Collapse
Affiliation(s)
- Remi Casier
- Institute for Polymer Research, Waterloo Institute for Nanotechnology, Department of Chemistry, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| | - Jean Duhamel
- Institute for Polymer Research, Waterloo Institute for Nanotechnology, Department of Chemistry, University of Waterloo, Waterloo, ON N2L 3G1, Canada
| |
Collapse
|
36
|
Factors That Control the Force Needed to Unfold a Membrane Protein in Silico Depend on the Mode of Denaturation. Int J Mol Sci 2023; 24:ijms24032654. [PMID: 36768981 PMCID: PMC9917119 DOI: 10.3390/ijms24032654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 02/01/2023] Open
Abstract
Single-molecule force spectroscopy methods, such as AFM and magnetic tweezers, have proved extremely beneficial in elucidating folding pathways for soluble and membrane proteins. To identify factors that determine the force rupture levels in force-induced membrane protein unfolding, we applied our near-atomic-level Upside molecular dynamics package to study the vertical and lateral pulling of bacteriorhodopsin (bR) and GlpG, respectively. With our algorithm, we were able to selectively alter the magnitudes of individual interaction terms and identify that, for vertical pulling, hydrogen bond strength had the strongest effect, whereas other non-bonded protein and membrane-protein interactions had only moderate influences, except for the extraction of the last helix where the membrane-protein interactions had a stronger influence. The up-down topology of the transmembrane helices caused helices to be pulled out as pairs. The rate-limiting rupture event often was the loss of H-bonds and the ejection of the first helix, which then propagated tension to the second helix, which rapidly exited the bilayer. The pulling of the charged linkers across the membrane had minimal influence, as did changing the bilayer thickness. For the lateral pulling of GlpG, the rate-limiting rupture corresponded to the separation of the helices within the membrane, with the H-bonds generally being broken only afterward. Beyond providing a detailed picture of the rupture events, our study emphasizes that the pulling mode greatly affects the factors that determine the forces needed to unfold a membrane protein.
Collapse
|
37
|
Andrews B, Ruggiero T, Urbanc B. How do salt and lipids affect conformational dynamics of Aβ42 monomers in water? Phys Chem Chem Phys 2023; 25:2566-2583. [PMID: 36602150 DOI: 10.1039/d2cp05044g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
It is well established that amyloid β-protein (Aβ) self-assembly is involved in triggering of Alzheimer's disease. On the other hand, evidence of physiological function of Aβ interacting with lipids has only begun to emerge. Details of Aβ-lipid interactions, which may underlie physiological and pathological activities of Aβ, are not well understood. Here, the effects of salt and 1,2-dimyristoyl-sn-glycero-3-phosphocholine (DMPC) lipids on conformational dynamics of Aβ42 monomer in water are examined by all-atom molecular dynamics (MD). We acquired six sets of 250 ns long MD trajectories for each of the three lipid concentrations (0, 27, and 109 mM) in the absence and presence of 150 mM salt. Ten replica trajectories per set are used to enhance sampling of Aβ42 conformational space. We show that salt facilitates long-range tertiary contacts in Aβ42, resulting in more compact Aβ42 conformations. By contrast, addition of lipids results in lipid-concentration dependent Aβ42 unfolding concomitant with enhanced stability of the turn in the A21-A30 region. At the high lipid concentration, salt enables the N-terminal region of Aβ42 to form long-range tertiary contacts and interact with lipids, which results in formation of a parallel β-strand. Aβ42 forms stable lipid-protein complexes whereby the protein is adhered to the lipid cluster rather than embedded into it. We propose that the inability of Aβ42 monomer to get embedded into the lipid cluster may be important for facilitating repair of leaks in the blood-brain barrier without penetrating and damaging cellular membranes.
Collapse
Affiliation(s)
- Brian Andrews
- Department of Physics, Drexel University, Philadelphia, Pennsylvania, USA.
| | - Thomas Ruggiero
- Department of Physics, Drexel University, Philadelphia, Pennsylvania, USA.
| | - Brigita Urbanc
- Department of Physics, Drexel University, Philadelphia, Pennsylvania, USA.
| |
Collapse
|
38
|
Holm L, Laiho A, Törönen P, Salgado M. DALI shines a light on remote homologs: One hundred discoveries. Protein Sci 2023; 32:e4519. [PMID: 36419248 PMCID: PMC9793968 DOI: 10.1002/pro.4519] [Citation(s) in RCA: 139] [Impact Index Per Article: 139.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 11/15/2022] [Accepted: 11/20/2022] [Indexed: 11/25/2022]
Abstract
Structural comparison reveals remote homology that often fails to be detected by sequence comparison. The DALI web server (http://ekhidna2.biocenter.helsinki.fi/dali) is a platform for structural analysis that provides database searches and interactive visualization, including structural alignments annotated with secondary structure, protein families and sequence logos, and 3D structure superimposition supported by color-coded sequence and structure conservation. Here, we are using DALI to mine the AlphaFold Database version 1, which increased the structural coverage of protein families by 20%. We found 100 remote homologous relationships hitherto unreported in the current reference database for protein domains, Pfam 35.0. In particular, we linked 35 domains of unknown function (DUFs) to the previously characterized families, generating a functional hypothesis that can be explored downstream in structural biology studies. Other findings include gene fusions, tandem duplications, and adjustments to domain boundaries. The evidence for homology can be browsed interactively through live examples on DALI's website.
Collapse
Affiliation(s)
- Liisa Holm
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences & Institute of Biotechnology, Helsinki Institute of Life SciencesUniversity of HelsinkiHelsinkiFinland
| | - Aleksi Laiho
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences & Institute of Biotechnology, Helsinki Institute of Life SciencesUniversity of HelsinkiHelsinkiFinland
| | - Petri Törönen
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences & Institute of Biotechnology, Helsinki Institute of Life SciencesUniversity of HelsinkiHelsinkiFinland
| | - Marco Salgado
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences & Institute of Biotechnology, Helsinki Institute of Life SciencesUniversity of HelsinkiHelsinkiFinland
| |
Collapse
|
39
|
Nam K, Wolf-Watz M. Protein dynamics: The future is bright and complicated! STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2023; 10:014301. [PMID: 36865927 PMCID: PMC9974214 DOI: 10.1063/4.0000179] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 02/03/2023] [Indexed: 06/18/2023]
Abstract
Biological life depends on motion, and this manifests itself in proteins that display motion over a formidable range of time scales spanning from femtoseconds vibrations of atoms at enzymatic transition states, all the way to slow domain motions occurring on micro to milliseconds. An outstanding challenge in contemporary biophysics and structural biology is a quantitative understanding of the linkages among protein structure, dynamics, and function. These linkages are becoming increasingly explorable due to conceptual and methodological advances. In this Perspective article, we will point toward future directions of the field of protein dynamics with an emphasis on enzymes. Research questions in the field are becoming increasingly complex such as the mechanistic understanding of high-order interaction networks in allosteric signal propagation through a protein matrix, or the connection between local and collective motions. In analogy to the solution to the "protein folding problem," we argue that the way forward to understanding these and other important questions lies in the successful integration of experiment and computation, while utilizing the present rapid expansion of sequence and structure space. Looking forward, the future is bright, and we are in a period where we are on the doorstep to, at least in part, comprehend the importance of dynamics for biological function.
Collapse
Affiliation(s)
- Kwangho Nam
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington, Texas 76019, USA
| | | |
Collapse
|
40
|
Finkelstein AV, Bogatyreva NS, Ivankov DN, Garbuzynskiy SO. Protein folding problem: enigma, paradox, solution. Biophys Rev 2022; 14:1255-1272. [PMID: 36659994 PMCID: PMC9842845 DOI: 10.1007/s12551-022-01000-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 09/19/2022] [Indexed: 01/22/2023] Open
Abstract
The ability of protein chains to spontaneously form their three-dimensional structures is a long-standing mystery in molecular biology. The most conceptual aspect of this mystery is how the protein chain can find its native, "working" spatial structure (which, for not too big protein chains, corresponds to the global free energy minimum) in a biologically reasonable time, without exhaustive enumeration of all possible conformations, which would take billions of years. This is the so-called "Levinthal's paradox." In this review, we discuss the key ideas and discoveries leading to the current understanding of protein folding kinetics, including folding landscapes and funnels, free energy barriers at the folding/unfolding pathways, and the solution of Levinthal's paradox. A special role here is played by the "all-or-none" phase transition occurring at protein folding and unfolding and by the point of thermodynamic (and kinetic) equilibrium between the "native" and the "unfolded" phases of the protein chain (where the theory obtains the simplest form). The modern theory provides an understanding of key features of protein folding and, in good agreement with experiments, it (i) outlines the chain length-dependent range of protein folding times, (ii) predicts the observed maximal size of "foldable" proteins and domains. Besides, it predicts the maximal size of proteins and domains that fold under solely thermodynamic (rather than kinetic) control. Complementarily, a theoretical analysis of the number of possible protein folding patterns, performed at the level of formation and assembly of secondary structures, correctly outlines the upper limit of protein folding times.
Collapse
Affiliation(s)
- Alexei V. Finkelstein
- Institute of Protein Research of the Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
- Biotechnology Department of the Lomonosov Moscow State University, 4 Institutskaya Str, 142290 Pushchino, Moscow Region, Russia
- Biology Department of the Lomonosov Moscow State University, 1-12 Leninskie Gory, 119991 Moscow, Russia
| | - Natalya S. Bogatyreva
- Institute of Protein Research of the Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
| | - Dmitry N. Ivankov
- Center of Life Sciences, Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
| | - Sergiy O. Garbuzynskiy
- Institute of Protein Research of the Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
| |
Collapse
|
41
|
Tajana M, Trovato A, Tiana G. Key interaction patterns in proteins revealed by cluster expansion of the partition function. THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER 2022; 45:95. [PMID: 36447074 DOI: 10.1140/epje/s10189-022-00250-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 11/19/2022] [Indexed: 06/16/2023]
Abstract
The native conformation of structured proteins is stabilized by a complex network of interactions. We analyzed the elementary patterns that constitute such network and ranked them according to their importance in shaping protein sequence design. To achieve this goal, we employed a cluster expansion of the partition function in the space of sequences and evaluated numerically the statistical importance of each cluster. An important feature of this procedure is that it is applied to a dense finite system. We found that patterns that contribute most to the partition function are cycles with even numbers of nodes, while cliques are typically detrimental. Each cluster also gives a contribute to the sequence entropy, which is a measure of the evolutionary designability of a fold. We compared the entropies associated with different interaction patterns to their abundances in the native structures of real proteins.
Collapse
Affiliation(s)
- Matteo Tajana
- Department of Physics, Università degli Studi di Milano, Via Celoria 16, 20133, Milan, Italy
| | - Antonio Trovato
- Department of Physics and Astronomy "G. Galilei", Università degli Studi di Padova and INFN, Via Marzolo 8, 35121, Padova, Italy
| | - Guido Tiana
- Department of Physics and Center for Complexity and Biosystems, Università degli Studi di Milano and INFN, Via Celoria 16, 20133, Milan, Italy.
| |
Collapse
|
42
|
Stan G, Lorimer GH, Thirumalai D. Friends in need: How chaperonins recognize and remodel proteins that require folding assistance. Front Mol Biosci 2022; 9:1071168. [PMID: 36479385 PMCID: PMC9720267 DOI: 10.3389/fmolb.2022.1071168] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Accepted: 11/07/2022] [Indexed: 08/19/2023] Open
Abstract
Chaperonins are biological nanomachines that help newly translated proteins to fold by rescuing them from kinetically trapped misfolded states. Protein folding assistance by the chaperonin machinery is obligatory in vivo for a subset of proteins in the bacterial proteome. Chaperonins are large oligomeric complexes, with unusual seven fold symmetry (group I) or eight/nine fold symmetry (group II), that form double-ring constructs, enclosing a central cavity that serves as the folding chamber. Dramatic large-scale conformational changes, that take place during ATP-driven cycles, allow chaperonins to bind misfolded proteins, encapsulate them into the expanded cavity and release them back into the cellular environment, regardless of whether they are folded or not. The theory associated with the iterative annealing mechanism, which incorporated the conformational free energy landscape description of protein folding, quantitatively explains most, if not all, the available data. Misfolded conformations are associated with low energy minima in a rugged energy landscape. Random disruptions of these low energy conformations result in higher free energy, less folded, conformations that can stochastically partition into the native state. Two distinct mechanisms of annealing action have been described. Group I chaperonins (GroEL homologues in eubacteria and endosymbiotic organelles), recognize a large number of misfolded proteins non-specifically and operate through highly coordinated cooperative motions. By contrast, the less well understood group II chaperonins (CCT in Eukarya and thermosome/TF55 in Archaea), assist a selected set of substrate proteins. Sequential conformational changes within a CCT ring are observed, perhaps promoting domain-by-domain substrate folding. Chaperonins are implicated in bacterial infection, autoimmune disease, as well as protein aggregation and degradation diseases. Understanding the chaperonin mechanism and the specific proteins they rescue during the cell cycle is important not only for the fundamental aspect of protein folding in the cellular environment, but also for effective therapeutic strategies.
Collapse
Affiliation(s)
- George Stan
- Department of Chemistry, University of Cincinnati, Cincinnati, OH, United States
| | - George H. Lorimer
- Center for Biomolecular Structure and Organization, Department of Chemistry and Biochemistry, University of Maryland, College Park, MD, United States
| | - D. Thirumalai
- Department of Chemistry, University of Texas, Austin, TX, United States
- Department of Physics, University of Texas, Austin, TX, United States
| |
Collapse
|
43
|
Effects of Residual Composition and Distribution on the Structural Characteristics of the Protein. Int J Mol Sci 2022; 23:ijms232214263. [PMID: 36430742 PMCID: PMC9699447 DOI: 10.3390/ijms232214263] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 11/11/2022] [Accepted: 11/14/2022] [Indexed: 11/19/2022] Open
Abstract
The effect of ratio and consecutive number of hydrophobic residues in the repeating unit of protein chains was investigated by MD simulation. The modified off-lattice HNP model was applied in this study. The protein chains constituted by different HNP ratios or different numbers of consecutively hydrophobic residues with the same chain length were simulated under a broad temperature range. We concluded that the proteins with higher ratio or larger number of sequentially hydrophobic residues present more orientated and compact structure under a certain low temperature. It is attributed to the lower non-bonded potential energy between H-H residual pairs, especially more hydrophobic residues in a procession among the protein chain. Considering the microscopic structure of the protein, more residue contacts are achieved with the proteins with higher ratios and sequential H residues under the low temperature. Meanwhile, with the ratio and consecutive number of H residues increasing, the distribution of stem length showed a transition from exponential decline to unimodal and even multiple peaks, indicating the specific ordered structure formed. These results provide an insight into 3D structural properties of proteins from their residue sequences, which has a primary structure at molecular level and, ultimately, a practical possibility of applying in biotechnological applications.
Collapse
|
44
|
Manriquez‐Sandoval E, Fried SD. DomainMapper: Accurate domain structure annotation including those with non-contiguous topologies. Protein Sci 2022; 31:e4465. [PMID: 36208126 PMCID: PMC9601794 DOI: 10.1002/pro.4465] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 09/30/2022] [Accepted: 10/03/2022] [Indexed: 11/11/2022]
Abstract
Automated domain annotation is an important tool for structural informatics. These pipelines typically involve searching query sequences against hidden Markov model (HMM) profiles, yielding matches to profiles for various domains. However, domain annotation can be ambiguous or inaccurate when proteins contain domains with non-contiguous residue ranges, and especially when insertional domains are hosted within them. Here, we present DomainMapper, an algorithm that accurately assigns a unique domain structure annotation to a query sequence, including those with complex topologies. We validate our domain assignments using the AlphaFold database and confirm that non-contiguity is pervasive (10.74% of all domains in yeast and 4.52% in human). Using this resource, we find that certain folds have strong propensities to be non-contiguous or insertional across the Tree of Life. DomainMapper is freely available and can be ran as a single command-line function.
Collapse
Affiliation(s)
| | - Stephen D. Fried
- T. C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMDUSA
- Department of ChemistryJohns Hopkins UniversityBaltimoreMDUSA
| |
Collapse
|
45
|
Contreras-Torres E, Marrero-Ponce Y, Terán JE, Agüero-Chapin G, Antunes A, García-Jacas CR. Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions. Front Chem 2022; 10:959143. [PMID: 36277354 PMCID: PMC9585278 DOI: 10.3389/fchem.2022.959143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 08/15/2022] [Indexed: 11/13/2022] Open
Abstract
This study introduces a set of fuzzy spherically truncated three-dimensional (3D) multi-linear descriptors for proteins. These indices codify geometric structural information from kth spherically truncated spatial-(dis)similarity two-tuple and three-tuple tensors. The coefficients of these truncated tensors are calculated by applying a smoothing value to the 3D structural encoding based on the relationships between two and three amino acids of a protein embedded into a sphere. At considering, the geometrical center of the protein matches with center of the sphere, the distance between each amino acid involved in any specific interaction and the geometrical center of the protein can be computed. Then, the fuzzy membership degree of each amino acid from an spherical region of interest is computed by fuzzy membership functions (FMFs). The truncation value is finally a combination of the membership degrees from interacting amino acids, by applying the arithmetic mean as fusion rule. Several fuzzy membership functions with diverse biases on the calculation of amino acids memberships (e.g., Z-shaped (close to the center), PI-shaped (middle region), and A-Gaussian (far from the center)) were considered as well as traditional truncation functions (e.g., Switching). Such truncation functions were comparatively evaluated by exploring: 1) the frequency of membership degrees, 2) the variability and orthogonality analyses among them based on the Shannon Entropy’s and Principal Component’s methods, respectively, and 3) the prediction performance of alignment-free prediction of protein folding rates and structural classes. These analyses unraveled the singularity of the proposed fuzzy spherically truncated MDs with respect to the classical (non-truncated) ones and respect to the MDs truncated with traditional functions. They also showed an improved prediction power by attaining an external correlation coefficient of 95.82% in the folding rate modelling and an accuracy of 100% in distinguishing structural protein classes. These outcomes are better than the ones attained by existing approaches, justifying the theoretical contribution of this report. Thus, the fuzzy spherically truncated-based protein descriptors from MuLiMs-MCoMPAs (http://tomocomd.com/mulims-mcompas) are promising alignment-free predictors for modeling protein functions and properties.
Collapse
Affiliation(s)
- Ernesto Contreras-Torres
- Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Universidad San Francisco de Quito (USFQ), Quito, Pichincha, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Quito, Pichincha, Ecuador
- BCAM—Basque Center for Applied Mathematics, Bilbao, Spain
| | - Yovani Marrero-Ponce
- Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Universidad San Francisco de Quito (USFQ), Quito, Pichincha, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Quito, Pichincha, Ecuador
- Computer-Aided Molecular “Biosilico” Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Quito, Ecuador
- *Correspondence: Yovani Marrero-Ponce, , , César R. García-Jacas, , ,
| | - Julio E. Terán
- Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Universidad San Francisco de Quito (USFQ), Quito, Pichincha, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Quito, Pichincha, Ecuador
- Department of Textile Engineering, Chemistry and Science, College of Textiles, North Carolina State University, Raleigh, NC, United States
| | - Guillermin Agüero-Chapin
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Porto, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
| | - Agostinho Antunes
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Porto, Portugal
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
| | - César R. García-Jacas
- Cátedras Conacyt—Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California, Mexico
- *Correspondence: Yovani Marrero-Ponce, , , César R. García-Jacas, , ,
| |
Collapse
|
46
|
Chidyausiku TM, Mendes SR, Klima JC, Nadal M, Eckhard U, Roel-Touris J, Houliston S, Guevara T, Haddox HK, Moyer A, Arrowsmith CH, Gomis-Rüth FX, Baker D, Marcos E. De novo design of immunoglobulin-like domains. Nat Commun 2022; 13:5661. [PMID: 36192397 PMCID: PMC9530121 DOI: 10.1038/s41467-022-33004-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 08/17/2022] [Indexed: 11/21/2022] Open
Abstract
Antibodies, and antibody derivatives such as nanobodies, contain immunoglobulin-like (Ig) β-sandwich scaffolds which anchor the hypervariable antigen-binding loops and constitute the largest growing class of drugs. Current engineering strategies for this class of compounds rely on naturally existing Ig frameworks, which can be hard to modify and have limitations in manufacturability, designability and range of action. Here, we develop design rules for the central feature of the Ig fold architecture—the non-local cross-β structure connecting the two β-sheets—and use these to design highly stable Ig domains de novo, confirm their structures through X-ray crystallography, and show they can correctly scaffold functional loops. Our approach opens the door to the design of antibody-like scaffolds with tailored structures and superior biophysical properties. The immunoglobulin domain framework of antibodies has been a long standing design challenge. Here, the authors describe design rules for tailoring these domains and show they can be accurately designed, de novo, with high stability and the ability to scaffold functional loops.
Collapse
Affiliation(s)
- Tamuka M Chidyausiku
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA.,Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.,Novartis Institutes for BioMedical Research Inc., San Diego, CA, 92121, USA
| | - Soraia R Mendes
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Jason C Klima
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA.,Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA.,Encodia, Inc., San Diego, CA, 92121, USA
| | - Marta Nadal
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Ulrich Eckhard
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Jorge Roel-Touris
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Scott Houliston
- Structural Genomics Consortium, University of Toronto, Toronto, ON, M5G 1L7, Canada.,Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 2M9, Canada
| | - Tibisay Guevara
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain
| | - Hugh K Haddox
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Adam Moyer
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Cheryl H Arrowsmith
- Structural Genomics Consortium, University of Toronto, Toronto, ON, M5G 1L7, Canada.,Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, ON, M5G 2M9, Canada
| | - F Xavier Gomis-Rüth
- Proteolysis Laboratory, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain.
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA. .,Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA. .,Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.
| | - Enrique Marcos
- Protein Design and Modeling Lab, Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), Baldiri Reixac 15, 08028, Barcelona, Spain.
| |
Collapse
|
47
|
Colberg M, Schofield J. Configurational entropy, transition rates, and optimal interactions for rapid folding in coarse-grained model proteins. J Chem Phys 2022; 157:125101. [PMID: 36182418 DOI: 10.1063/5.0098612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Under certain conditions, the dynamics of coarse-grained models of solvated proteins can be described using a Markov state model, which tracks the evolution of populations of configurations. The transition rates among states that appear in the Markov model can be determined by computing the relative entropy of states and their mean first passage times. In this paper, we present an adaptive method to evaluate the configurational entropy and the mean first passage times for linear chain models with discontinuous potentials. The approach is based on event-driven dynamical sampling in a massively parallel architecture. Using the fact that the transition rate matrix can be calculated for any choice of interaction energies at any temperature, it is demonstrated how each state's energy can be chosen such that the average time to transition between any two states is minimized. The methods are used to analyze the optimization of the folding process of two protein systems: the crambin protein and a model with frustration and misfolding. It is shown that the folding pathways for both systems are comprised of two regimes: first, the rapid establishment of local bonds, followed by the subsequent formation of more distant contacts. The state energies that lead to the most rapid folding encourage multiple pathways, and they either penalize folding pathways through kinetic traps by raising the energies of trapping states or establish an escape route from the trapping states by lowering free energy barriers to other states that rapidly reach the native state.
Collapse
Affiliation(s)
- Margarita Colberg
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Jeremy Schofield
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| |
Collapse
|
48
|
Woodard J, Iqbal S, Mashaghi A. Circuit topology predicts pathogenicity of missense mutations. Proteins 2022; 90:1634-1644. [PMID: 35394672 PMCID: PMC9543832 DOI: 10.1002/prot.26342] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 03/07/2022] [Accepted: 03/30/2022] [Indexed: 12/05/2022]
Abstract
The contact topology of a protein determines important aspects of the folding process. The topological measure of contact order has been shown to be predictive of the rate of folding. Circuit topology is emerging as another fundamental descriptor of biomolecular structure, with predicted effects on the folding rate. We analyze the residue‐based circuit topological environments of 21 K mutations labeled as pathogenic or benign. Multiple statistical lines of reasoning support the conclusion that the number of contacts in two specific circuit topological arrangements, namely inverse parallel and cross relations, with contacts involving the mutated residue have discriminatory value in determining the pathogenicity of human variants. We investigate how results vary with residue type and according to whether the gene is essential. We further explore the relationship to a number of structural features and find that circuit topology provides nonredundant information on protein structures and pathogenicity of mutations. Results may have implications for the polymer physics of protein folding and suggest that “local” topological information, including residue‐based circuit topology and residue contact order, could be useful in improving state‐of‐the‐art machine learning algorithms for pathogenicity prediction.
Collapse
Affiliation(s)
- Jaie Woodard
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Sumaiya Iqbal
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Alireza Mashaghi
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands.,Centre for Interdisciplinary Genome Research, Faculty of Science, Leiden University, Leiden, The Netherlands
| |
Collapse
|
49
|
Ma X, Sun H, Hong H, Guo Z, Su H, Chen H. Free-energy landscape of two-state protein acylphosphatase with large contact order revealed by force-dependent folding and unfolding dynamics. Phys Rev E 2022; 106:024404. [PMID: 36109974 DOI: 10.1103/physreve.106.024404] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 07/15/2022] [Indexed: 06/15/2023]
Abstract
Acylphosphatase (AcP) is a small protein with 98 amino acid residues that catalyzes the hydrolysis of carboxyl-phosphate bonds. AcP is a typical two-state protein with slow folding rate due to its relatively large contact order in the native structure. The mechanical properties and unfolding behavior of AcP has been studied by atomic force microscope. Here using stable magnetic tweezers, we measured the force-dependent folding rates within a force range 1-3 pN, and unfolding rates 15-40 pN. The obtained unfolding rates show different force sensitivities at forces below and above ∼27 pN, which determines a free-energy landscape with two energy barriers. Our results indicate that the free-energy landscape of small globule proteins have general Bactrian camel shape, and large contact order of the native state produces a high barrier dominate at low forces.
Collapse
Affiliation(s)
- Xuening Ma
- Research Institute for Biomimetics and Soft Matter, Fujian Provincial Key Lab for Soft Functional Materials Research, Department of Physics, Xiamen University, Xiamen 361005, China
| | - Hao Sun
- Research Institute for Biomimetics and Soft Matter, Fujian Provincial Key Lab for Soft Functional Materials Research, Department of Physics, Xiamen University, Xiamen 361005, China
- Center of Biomedical Physics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325000, China
- Oujiang Laboratory, Wenzhou, Zhejiang 325000, China
| | - Haiyan Hong
- Research Institute for Biomimetics and Soft Matter, Fujian Provincial Key Lab for Soft Functional Materials Research, Department of Physics, Xiamen University, Xiamen 361005, China
| | - Zilong Guo
- Center of Biomedical Physics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325000, China
- Oujiang Laboratory, Wenzhou, Zhejiang 325000, China
| | - Huanhuan Su
- Research Institute for Biomimetics and Soft Matter, Fujian Provincial Key Lab for Soft Functional Materials Research, Department of Physics, Xiamen University, Xiamen 361005, China
| | - Hu Chen
- Research Institute for Biomimetics and Soft Matter, Fujian Provincial Key Lab for Soft Functional Materials Research, Department of Physics, Xiamen University, Xiamen 361005, China
- Center of Biomedical Physics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325000, China
- Oujiang Laboratory, Wenzhou, Zhejiang 325000, China
| |
Collapse
|
50
|
Baldwin Q, Sumpter B, Panagiotou E. The Local Topological Free Energy of the SARS-CoV-2 Spike Protein. Polymers (Basel) 2022; 14:polym14153014. [PMID: 35893978 PMCID: PMC9332627 DOI: 10.3390/polym14153014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 07/15/2022] [Accepted: 07/19/2022] [Indexed: 02/04/2023] Open
Abstract
The novel coronavirus SARS-CoV-2 infects human cells using a mechanism that involves binding and structural rearrangement of its Spike protein. Understanding protein rearrangement and identifying specific amino acids where mutations affect protein rearrangement has attracted much attention for drug development. In this manuscript, we use a mathematical method to characterize the local topology/geometry of the SARS-CoV-2 Spike protein backbone. Our results show that local conformational changes in the FP, HR1, and CH domains are associated with global conformational changes in the RBD domain. The SARS-CoV-2 variants analyzed in this manuscript (alpha, beta, gamma, delta Mink, G614, N501) show differences in the local conformations of the FP, HR1, and CH domains as well. Finally, most mutations of concern are either in or in the vicinity of high local topological free energy conformations, suggesting that high local topological free energy conformations could be targets for mutations with significant impact of protein function. Namely, the residues 484, 570, 614, 796, and 969, which are present in variants of concern and are targeted as important in protein function, are predicted as such from our model.
Collapse
Affiliation(s)
- Quenisha Baldwin
- Department of Biology, Tuskegee University, Tuskegee, AL 36088, USA;
| | - Bobby Sumpter
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA;
| | - Eleni Panagiotou
- Department of Mathematics and SimCenter, University of Tennessee at Chattanooga, Chattanooga, TN 37403, USA
- Correspondence: or
| |
Collapse
|