1
|
Louros N, Schymkowitz J, Rousseau F. Mechanisms and pathology of protein misfolding and aggregation. Nat Rev Mol Cell Biol 2023; 24:912-933. [PMID: 37684425 DOI: 10.1038/s41580-023-00647-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/28/2023] [Indexed: 09/10/2023]
Abstract
Despite advances in machine learning-based protein structure prediction, we are still far from fully understanding how proteins fold into their native conformation. The conventional notion that polypeptides fold spontaneously to their biologically active states has gradually been replaced by our understanding that cellular protein folding often requires context-dependent guidance from molecular chaperones in order to avoid misfolding. Misfolded proteins can aggregate into larger structures, such as amyloid fibrils, which perpetuate the misfolding process, creating a self-reinforcing cascade. A surge in amyloid fibril structures has deepened our comprehension of how a single polypeptide sequence can exhibit multiple amyloid conformations, known as polymorphism. The assembly of these polymorphs is not a random process but is influenced by the specific conditions and tissues in which they originate. This observation suggests that, similar to the folding of native proteins, the kinetics of pathological amyloid assembly are modulated by interactions specific to cells and tissues. Here, we review the current understanding of how intrinsic protein conformational propensities are modulated by physiological and pathological interactions in the cell to shape protein misfolding and aggregation pathology.
Collapse
Affiliation(s)
- Nikolaos Louros
- Switch Laboratory, VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Cellular and Molecular Medicine, KU Leuven, Leuven, Belgium
| | - Joost Schymkowitz
- Switch Laboratory, VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium.
- Department of Cellular and Molecular Medicine, KU Leuven, Leuven, Belgium.
| | - Frederic Rousseau
- Switch Laboratory, VIB-KU Leuven Center for Brain & Disease Research, Leuven, Belgium.
- Department of Cellular and Molecular Medicine, KU Leuven, Leuven, Belgium.
| |
Collapse
|
2
|
AbsoluRATE: An in-silico method to predict the aggregation kinetics of native proteins. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2021; 1869:140682. [PMID: 34102324 DOI: 10.1016/j.bbapap.2021.140682] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 05/12/2021] [Accepted: 06/04/2021] [Indexed: 12/12/2022]
Abstract
Protein aggregation has two aspects, namely, mechanistic and kinetics. Understanding protein aggregation kinetics is critical for prediction of progression of diseases caused by amyloidosis, accumulation of aggregates in biotherapeutics during storage and engineering commercial nano-biomaterials. In this work, we have collected experimentally determined absolute protein aggregation rates and developed an SVM based regression model to predict absolute rates of protein and peptide aggregation near-physiological conditions. The regression model achieved a correlation coefficient of 0.72 with MAE of 0.91 (natural log of kapp, where kapp is in hour-1) using leave-one-out cross-validation on a dataset of 82 non-redundant proteins/peptides. The model accounts for the experimental conditions (such as temperature, pH, ionic and protein concentration) and sequence-based properties. The amino acid sequence features revealed by this model as being important for aggregation kinetics, are also associated with the aggregation mechanism. In particular, inherent aggregation propensity of the protein/peptide sequence and number of aggregation prone regions (APRs) unpunctuated by the gatekeeping residues, were found to play important roles in the prediction of the absolute aggregation rates. This analysis shows that mechanism and kinetics of protein aggregation are coupled via common sequence attributes. The aggregation kinetic prediction method developed in this work is available at https://web.iitm.ac.in/bioinfo2/absolurate-pred/index.html.
Collapse
|
3
|
Prabakaran R, Rawat P, Kumar S, Michael Gromiha M. ANuPP: A Versatile Tool to Predict Aggregation Nucleating Regions in Peptides and Proteins. J Mol Biol 2020; 433:166707. [PMID: 33972019 DOI: 10.1016/j.jmb.2020.11.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 10/28/2020] [Accepted: 11/05/2020] [Indexed: 12/22/2022]
Abstract
Short aggregation prone sequence motifs can trigger aggregation in peptide and protein sequences. Most algorithms developed so far to identify potential aggregation prone regions (APRs) use amino acid residue composition and/or sequence pattern features. In this work, we have investigated the importance of atomic-level characteristics rather than residue level to understand the initiation of aggregation in proteins and peptides. Using atomic-level features an ensemble-classifier, ANuPP has been developed to predict the aggregation-nucleating regions in peptides and proteins. In a dataset of 1279 hexapeptides, ANuPP achieved an area under the curve (AUC) of 0.831 with 77% accuracy on 10-fold cross-validation and an AUC of 0.883 with 83% accuracy in a blind test dataset of 142 hexapeptides. Further, it showed an average SOV of 48.7% on identifying APR regions in 37 proteins. The performance of ANuPP is better than other methods reported in the literature on both amyloidogenic hexapeptide prediction and APR identification. We have developed a web server for ANuPP and it is available at https://web.iitm.ac.in/bioinfo2/ANuPP/. Insights gained from this work demonstrate the importance of atomic and functional group characteristics towards diversity of atomic level origins as well as mechanisms of protein aggregation.
Collapse
Affiliation(s)
- R Prabakaran
- Protein Bioinformatics Lab, Department of Biotechnology, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Puneet Rawat
- Protein Bioinformatics Lab, Department of Biotechnology, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Sandeep Kumar
- Biotherapeutics Discovery, Boehringer Ingelheim Pharmaceutical Inc., Ridgefield, CT, USA.
| | - M Michael Gromiha
- Protein Bioinformatics Lab, Department of Biotechnology, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India; School of Computing, Institute of Innovative Research, Tokyo Institute of Technology, Yokohama, Kanagawa, Japan.
| |
Collapse
|
4
|
Saravanan KM, Zhang H, Zhang H, Xi W, Wei Y. On the Conformational Dynamics of β-Amyloid Forming Peptides: A Computational Perspective. Front Bioeng Biotechnol 2020; 8:532. [PMID: 32656188 PMCID: PMC7325929 DOI: 10.3389/fbioe.2020.00532] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 05/04/2020] [Indexed: 12/12/2022] Open
Abstract
Understanding the conformational dynamics of proteins and peptides involved in important functions is still a difficult task in computational structural biology. Because such conformational transitions in β-amyloid (Aβ) forming peptides play a crucial role in many neurological disorders, researchers from different scientific fields have been trying to address issues related to the folding of Aβ forming peptides together. Many theoretical models have been proposed in the recent years for studying Aβ peptides using mathematical, physicochemical, and molecular dynamics simulation, and machine learning approaches. In this article, we have comprehensively reviewed the developmental advances in the theoretical models for Aβ peptide folding and interactions, particularly in the context of neurological disorders. Furthermore, we have extensively reviewed the advances in molecular dynamics simulation as a tool used for studying the conversions between polymorphic amyloid forms and applications of using machine learning approaches in predicting Aβ peptides and aggregation-prone regions in proteins. We have also provided details on the theoretical advances in the study of Aβ peptides, which would enhance our understanding of these peptides at the molecular level and eventually lead to the development of targeted therapies for certain acute neurological disorders such as Alzheimer's disease in the future.
Collapse
Affiliation(s)
| | | | | | - Wenhui Xi
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Yanjie Wei
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| |
Collapse
|
5
|
Draceni Y, Pechmann S. Pervasive convergent evolution and extreme phenotypes define chaperone requirements of protein homeostasis. Proc Natl Acad Sci U S A 2019; 116:20009-20014. [PMID: 31527276 PMCID: PMC6778244 DOI: 10.1073/pnas.1904611116] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Maintaining protein homeostasis is an essential requirement for cell and organismal viability. An elaborate regulatory system within cells, the protein homeostasis network, safeguards that proteins are correctly folded and functional. At the heart of this regulatory system lies a class of specialized protein quality control enzymes called chaperones that are tasked with assisting proteins in their folding, avoiding aggregation and degradation. Failure and decline of protein homeostasis are directly associated with conditions of aging and aging-related neurodegeneration. However, it is not clear what tips the balance of protein homeostasis and leads to onset of aging and diseases. Here, using a comparative genomics approach we report general principles of maintaining protein homeostasis across the eukaryotic tree of life. Expanding a previous study of 16 eukaryotes to the quantitative analysis of 216 eukaryotic genomes, we find a strong correlation between the composition of eukaryotic chaperone networks and genome complexity that is distinct for different species kingdoms. Organisms with pronounced phenotypes clearly buck this trend. Northobranchius furzeri, the shortest-lived vertebrate and a widely used model for fragile protein homeostasis, is found to be chaperone limited while Heterocephalus glaber as the longest-lived rodent and thus an especially robust organism is characterized by above-average numbers of chaperones. Strikingly, the relative size of chaperone networks is found to generally correlate with longevity in Metazoa. Our results thus indicate that the balance in protein homeostasis may be a key variable in explaining organismal robustness.
Collapse
Affiliation(s)
- Yasmine Draceni
- Département de Biochimie, Université de Montréal, Montréal, QC H3T 1J4, Canada
| | - Sebastian Pechmann
- Département de Biochimie, Université de Montréal, Montréal, QC H3T 1J4, Canada
| |
Collapse
|
6
|
Foy SG, Wilson BA, Bertram J, Cordes MHJ, Masel J. A Shift in Aggregation Avoidance Strategy Marks a Long-Term Direction to Protein Evolution. Genetics 2019; 211:1345-1355. [PMID: 30692195 PMCID: PMC6456324 DOI: 10.1534/genetics.118.301719] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Accepted: 01/25/2019] [Indexed: 01/06/2023] Open
Abstract
To detect a direction to evolution, without the pitfalls of reconstructing ancestral states, we need to compare "more evolved" to "less evolved" entities. But because all extant species have the same common ancestor, none are chronologically more evolved than any other. However, different gene families were born at different times, allowing us to compare young protein-coding genes to those that are older and hence have been evolving for longer. To be retained during evolution, a protein must not only have a function, but must also avoid toxic dysfunction such as protein aggregation. There is conflict between the two requirements: hydrophobic amino acids form the cores of protein folds, but also promote aggregation. Young genes avoid strongly hydrophobic amino acids, which is presumably the simplest solution to the aggregation problem. Here we show that young genes' few hydrophobic residues are clustered near one another along the primary sequence, presumably to assist folding. The higher aggregation risk created by the higher hydrophobicity of older genes is counteracted by more subtle effects in the ordering of the amino acids, including a reduction in the clustering of hydrophobic residues until they eventually become more interspersed than if distributed randomly. This interspersion has previously been reported to be a general property of proteins, but here we find that it is restricted to old genes. Quantitatively, the index of dispersion delineates a gradual trend, i.e., a decrease in the clustering of hydrophobic amino acids over billions of years.
Collapse
Affiliation(s)
- Scott G Foy
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Benjamin A Wilson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Jason Bertram
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona 85721
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| |
Collapse
|
7
|
Wang W, Roberts CJ. Protein aggregation – Mechanisms, detection, and control. Int J Pharm 2018; 550:251-268. [DOI: 10.1016/j.ijpharm.2018.08.043] [Citation(s) in RCA: 100] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 08/18/2018] [Accepted: 08/20/2018] [Indexed: 12/19/2022]
|
8
|
An in-silico method for identifying aggregation rate enhancer and mitigator mutations in proteins. Int J Biol Macromol 2018; 118:1157-1167. [DOI: 10.1016/j.ijbiomac.2018.06.102] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Revised: 06/19/2018] [Accepted: 06/20/2018] [Indexed: 12/27/2022]
|
9
|
Prabakaran R, Goel D, Kumar S, Gromiha MM. Aggregation prone regions in human proteome: Insights from large-scale data analyses. Proteins 2017; 85:1099-1118. [DOI: 10.1002/prot.25276] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2016] [Revised: 02/10/2017] [Accepted: 02/24/2017] [Indexed: 12/25/2022]
Affiliation(s)
- R. Prabakaran
- Department of Biotechnology, Bhupat Jyoti Mehta School of Biosciences; Indian Institute of Technology Madras; Chennai 600036 India
| | - Dhruv Goel
- Department of Computer Science and Engineering; Motilal Nehru National Institute of Technology; Allahabad 211004 India
| | - Sandeep Kumar
- Biotherapeutics Pharmaceutical Sciences, Pfizer Inc; 700 Chesterfield Parkway West Chesterfield Missouri 63017, USA
| | - M. Michael Gromiha
- Department of Biotechnology, Bhupat Jyoti Mehta School of Biosciences; Indian Institute of Technology Madras; Chennai 600036 India
| |
Collapse
|
10
|
Hauschild P, Röttig A, Madkour MH, Al-Ansari AM, Almakishah NH, Steinbüchel A. Lipid accumulation in prokaryotic microorganisms from arid habitats. Appl Microbiol Biotechnol 2017; 101:2203-2216. [PMID: 28175949 DOI: 10.1007/s00253-017-8149-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Revised: 01/23/2017] [Accepted: 01/25/2017] [Indexed: 12/20/2022]
Abstract
This review shall provide support for the suitability of arid environments as preferred location to search for unknown lipid-accumulative bacteria. Bacterial lipids are attracting more and more attention as sustainable replacement for mineral oil in fuel and plastic production. The development of prokaryotic microorganisms in arid desert habitats is affected by its harsh living conditions. Drought, nutrient limitation, strong radiation, and extreme temperatures necessitate effective adaption mechanisms. Accumulation of storage lipids as energy reserve and source of metabolic water represents a common adaption in desert animals and presumably in desert bacteria and archaea as well. Comparison of corresponding literature resulted in several bacterial species from desert habitats, which had already been described as lipid-accumulative elsewhere. Based on the gathered information, literature on microbial communities in hot desert, cold desert, and humid soil were analyzed on its content of lipid-accumulative bacteria. With more than 50% of the total community size in single studies, hot deserts appear to be more favorable for lipid-accumulative species then humid soil (≤20%) and cold deserts (≤17%). Low bacterial lipid accumulation in cold deserts is assumed to result from the influence of low temperatures on fatty acids and the increased necessity of permanent adaption methods.
Collapse
Affiliation(s)
- Philippa Hauschild
- Institut für Molekulare Mikrobiologie und Biotechnologie, Westfälische Wilhelms-Universität Münster, Corrensstraße 3, D-48149, Münster, Germany
| | - Annika Röttig
- Institut für Molekulare Mikrobiologie und Biotechnologie, Westfälische Wilhelms-Universität Münster, Corrensstraße 3, D-48149, Münster, Germany
| | - Mohamed H Madkour
- Environmental Sciences Department, Faculty of Meteorology, Environment and Arid Land Agriculture, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
| | - Ahmed M Al-Ansari
- Environmental Sciences Department, Faculty of Meteorology, Environment and Arid Land Agriculture, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
| | - Naief H Almakishah
- Environmental Sciences Department, Faculty of Meteorology, Environment and Arid Land Agriculture, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
| | - Alexander Steinbüchel
- Institut für Molekulare Mikrobiologie und Biotechnologie, Westfälische Wilhelms-Universität Münster, Corrensstraße 3, D-48149, Münster, Germany. .,Environmental Sciences Department, Faculty of Meteorology, Environment and Arid Land Agriculture, King Abdulaziz University, Jeddah, 21589, Saudi Arabia.
| |
Collapse
|
11
|
Fonin AV, Uversky VN, Kuznetsova IM, Turoverov KK. Protein folding and stability in the presence of osmolytes. Biophysics (Nagoya-shi) 2016. [DOI: 10.1134/s0006350916020056] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
12
|
CPAD, Curated Protein Aggregation Database: A Repository of Manually Curated Experimental Data on Protein and Peptide Aggregation. PLoS One 2016; 11:e0152949. [PMID: 27043825 PMCID: PMC4820268 DOI: 10.1371/journal.pone.0152949] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Accepted: 03/20/2016] [Indexed: 11/24/2022] Open
Abstract
Accurate distinction between peptide sequences that can form amyloid-fibrils or amorphous β-aggregates, identification of potential aggregation prone regions in proteins, and prediction of change in aggregation rate of a protein upon mutation(s) are critical to research on protein misfolding diseases, such as Alzheimer’s and Parkinson’s, as well as biotechnological production of protein based therapeutics. We have developed a Curated Protein Aggregation Database (CPAD), which has collected results from experimental studies performed by scientific community aimed at understanding protein/peptide aggregation. CPAD contains more than 2300 experimentally observed aggregation rates upon mutations in known amyloidogenic proteins. Each entry includes numerical values for the following parameters: change in rate of aggregation as measured by fluorescence intensity or turbidity, name and source of the protein, Uniprot and Protein Data Bank codes, single point as well as multiple mutations, and literature citation. The data in CPAD has been supplemented with five different types of additional information: (i) Amyloid fibril forming hexa-peptides, (ii) Amorphous β-aggregating hexa-peptides, (iii) Amyloid fibril forming peptides of different lengths, (iv) Amyloid fibril forming hexa-peptides whose crystal structures are available in the Protein Data Bank (PDB) and (v) Experimentally validated aggregation prone regions found in amyloidogenic proteins. Furthermore, CPAD is linked to other related databases and resources, such as Uniprot, Protein Data Bank, PUBMED, GAP, TANGO, WALTZ etc. We have set up a web interface with different search and display options so that users have the ability to get the data in multiple ways. CPAD is freely available at http://www.iitm.ac.in/bioinfo/CPAD/. The potential applications of CPAD have also been discussed.
Collapse
|
13
|
Abstract
Owing to its association with a diverse range of human diseases, the determinants of protein aggregation are studied intensively. It is generally accepted that the effective aggregation tendency of a protein depends on many factors such as folding efficiency towards the native state, thermodynamic stability of that conformation, intrinsic aggregation propensity of the polypeptide sequence and its ability to be recognized by the protein quality control system. The intrinsic aggregation propensity of a polypeptide sequence is related to the presence of short APRs (aggregation-prone regions) that self-associate to form intermolecular β-structured assemblies. These are typically short sequence segments (5-15 amino acids) that display high hydrophobicity, low net charge and a high tendency to form β-structures. As the presence of such APRs is a prerequisite for aggregation, a plethora of methods have been developed to identify APRs in amino acid sequences. In the present chapter, the methodological basis of these approaches is discussed, as well as some practical applications.
Collapse
|
14
|
Sudha G, Nussinov R, Srinivasan N. An overview of recent advances in structural bioinformatics of protein-protein interactions and a guide to their principles. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2014; 116:141-50. [PMID: 25077409 DOI: 10.1016/j.pbiomolbio.2014.07.004] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Accepted: 07/13/2014] [Indexed: 12/20/2022]
Abstract
Rich data bearing on the structural and evolutionary principles of protein-protein interactions are paving the way to a better understanding of the regulation of function in the cell. This is particularly the case when these interactions are considered in the framework of key pathways. Knowledge of the interactions may provide insights into the mechanisms of crucial 'driver' mutations in oncogenesis. They also provide the foundation toward the design of protein-protein interfaces and inhibitors that can abrogate their formation or enhance them. The main features to learn from known 3-D structures of protein-protein complexes and the extensive literature which analyzes them computationally and experimentally include the interaction details which permit undertaking structure-based drug discovery, the evolution of complexes and their interactions, the consequences of alterations such as post-translational modifications, ligand binding, disease causing mutations, host pathogen interactions, oligomerization, aggregation and the roles of disorder, dynamics, allostery and more to the protein and the cell. This review highlights some of the recent advances in these areas, including design, inhibition and prediction of protein-protein complexes. The field is broad, and much work has been carried out in these areas, making it challenging to cover it in its entirety. Much of this is due to the fast increase in the number of molecules whose structures have been determined experimentally and the vast increase in computational power. Here we provide a concise overview.
Collapse
Affiliation(s)
- Govindarajan Sudha
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India.
| | - Ruth Nussinov
- Cancer and Inflammation Program, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research, Inc., National Cancer Institute, Frederick, MD 21702, USA; Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | | |
Collapse
|
15
|
Thangakani AM, Kumar S, Nagarajan R, Velmurugan D, Gromiha MM. GAP: towards almost 100 percent prediction for β-strand-mediated aggregating peptides with distinct morphologies. Bioinformatics 2014; 30:1983-90. [DOI: 10.1093/bioinformatics/btu167] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
|
16
|
Buck PM, Kumar S, Singh SK. On the role of aggregation prone regions in protein evolution, stability, and enzymatic catalysis: insights from diverse analyses. PLoS Comput Biol 2013; 9:e1003291. [PMID: 24146608 PMCID: PMC3798281 DOI: 10.1371/journal.pcbi.1003291] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2013] [Accepted: 08/30/2013] [Indexed: 11/18/2022] Open
Abstract
The various roles that aggregation prone regions (APRs) are capable of playing in proteins are investigated here via comprehensive analyses of multiple non-redundant datasets containing randomly generated amino acid sequences, monomeric proteins, intrinsically disordered proteins (IDPs) and catalytic residues. Results from this study indicate that the aggregation propensities of monomeric protein sequences have been minimized compared to random sequences with uniform and natural amino acid compositions, as observed by a lower average aggregation propensity and fewer APRs that are shorter in length and more often punctuated by gate-keeper residues. However, evidence for evolutionary selective pressure to disrupt these sequence regions among homologous proteins is inconsistent. APRs are less conserved than average sequence identity among closely related homologues (≥80% sequence identity with a parent) but APRs are more conserved than average sequence identity among homologues that have at least 50% sequence identity with a parent. Structural analyses of APRs indicate that APRs are three times more likely to contain ordered versus disordered residues and that APRs frequently contribute more towards stabilizing proteins than equal length segments from the same protein. Catalytic residues and APRs were also found to be in structural contact significantly more often than expected by random chance. Our findings suggest that proteins have evolved by optimizing their risk of aggregation for cellular environments by both minimizing aggregation prone regions and by conserving those that are important for folding and function. In many cases, these sequence optimizations are insufficient to develop recombinant proteins into commercial products. Rational design strategies aimed at improving protein solubility for biotechnological purposes should carefully evaluate the contributions made by candidate APRs, targeted for disruption, towards protein structure and activity. Biotechnology requires the large-scale expression, yield, and storage of recombinant proteins. Each step in protein production has the potential to cause aggregation as proteins, not evolved to exist outside the cell, endure the various steps involved in commercial manufacturing processes. Mechanistic studies into protein aggregation have revealed that certain sequence regions contribute more to the aggregation propensity of a protein than other sequence regions do. Efforts to disrupt these regions have thus far indicated that rational sequence engineering is a useful technique to reduce the aggregation of biotechnologically relevant proteins. To improve our ability to rationally engineer proteins with enhanced expression, solubility, and shelf-life we conducted extensive analyses of aggregation prone regions (APRs) within protein sequences to characterize the various roles these regions play in proteins. Findings from this work indicate that protein sequences have evolved by minimizing their aggregation propensities. However, we also found that many APRs are conserved in protein families and are essential to maintain protein stability and function. Therefore, the contributions that APRs, targeted for disruption, make towards protein stability and function should be carefully evaluated when improving protein solubility via rational design.
Collapse
Affiliation(s)
- Patrick M Buck
- Pharmaceutical Research and Development, Biotherapeutics Pharmaceutical Sciences, Pfizer Inc., Chesterfield, Missouri, United States of America
| | | | | |
Collapse
|