1
|
Bevacqua A, Bakshi S, Xia Y. Principal component analysis of alpha-helix deformations in transmembrane proteins. PLoS One 2021; 16:e0257318. [PMID: 34525125 PMCID: PMC8443038 DOI: 10.1371/journal.pone.0257318] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 08/30/2021] [Indexed: 12/03/2022] Open
Abstract
α-helices are deformable secondary structural components regularly observed in protein folds. The overall flexibility of an α-helix can be resolved into constituent physical deformations such as bending in two orthogonal planes and twisting along the principal axis. We used Principal Component Analysis to identify and quantify the contribution of each of these dominant deformation modes in transmembrane α-helices, extramembrane α-helices, and α-helices in soluble proteins. Using three α-helical samples from Protein Data Bank entries spanning these three cellular contexts, we determined that the relative contributions of these modes towards total deformation are independent of the α-helix's surroundings. This conclusion is supported by the observation that the identities of the top three deformation modes, the scaling behaviours of mode eigenvalues as a function of α-helix length, and the percentage contribution of individual modes on total variance were comparable across all three α-helical samples. These findings highlight that α-helical deformations are independent of cellular location and will prove to be valuable in furthering the development of flexible templates in de novo protein design.
Collapse
Affiliation(s)
- Alexander Bevacqua
- Department of Bioengineering, McGill University, Montreal, Quebec, Canada
| | - Sachit Bakshi
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, Quebec, Canada
| |
Collapse
|
2
|
Sharir-Ivry A, Xia Y. Quantifying evolutionary importance of protein sites: A Tale of two measures. PLoS Genet 2021; 17:e1009476. [PMID: 33826605 PMCID: PMC8026052 DOI: 10.1371/journal.pgen.1009476] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 03/09/2021] [Indexed: 12/05/2022] Open
Abstract
A key challenge in evolutionary biology is the accurate quantification of selective pressure on proteins and other biological macromolecules at single-site resolution. The evolutionary importance of a protein site under purifying selection is typically measured by the degree of conservation of the protein site itself. A possible alternative measure is the strength of the site-induced conservation gradient in the rest of the protein structure. However, the quantitative relationship between these two measures remains unknown. Here, we show that despite major differences, there is a strong linear relationship between the two measures such that more conserved protein sites also induce stronger conservation gradient in the rest of the protein. This linear relationship is universal as it holds for different types of proteins and functional sites in proteins. Our results show that the strong selective pressure acting on the functional site in general percolates through the rest of the protein via residue-residue contacts. Surprisingly however, catalytic sites in enzymes are the principal exception to this rule. Catalytic sites induce significantly stronger conservation gradients in the rest of the protein than expected from the degree of conservation of the site alone. The unique requirement for the active site to selectively stabilize the transition state of the catalyzed chemical reaction imposes additional selective constraints on the rest of the enzyme. Sites within proteins which are important for stability or function are under stronger selective pressure and evolve more slowly than other sites. Catalytic sites in enzymes are such highly conserved sites with relatively low evolutionary rates. Recently, catalytic sites were shown to induce a strong gradient of conservation such that the closer a residue is to the catalytic site, the more conserved it is. Here we show that there is a universal linear relationship between the degree of evolutionary conservation of a protein site and the conservation gradient it induces in the protein tertiary structure, applicable to all types of sites. Our findings suggest that selective pressure acting on a protein site generally percolates through the rest of the protein via residue-residue contacts. Remarkably however, catalytic sites induce significantly stronger conservation gradients than expected from their degree of conservation alone. Our results indicate that the strong conservation gradient induced by catalytic sites is driven by the unique function of enzyme catalysis, which requires the participation of many residues beyond the few key catalytic residues. Our results provide insights into evolutionary conservation patterns of and surrounding proteins functional sites, with implications for functional site prediction and protein design.
Collapse
Affiliation(s)
- Avital Sharir-Ivry
- Department of Bioengineering, McGill University, Montreal, Quebec, Canada
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, Quebec, Canada
- * E-mail:
| |
Collapse
|
3
|
Sharir-Ivry A, Xia Y. Non-catalytic Binding Sites Induce Weaker Long-Range Evolutionary Rate Gradients than Catalytic Sites in Enzymes. J Mol Biol 2019; 431:3860-3870. [DOI: 10.1016/j.jmb.2019.07.019] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 06/26/2019] [Accepted: 07/11/2019] [Indexed: 01/02/2023]
|
4
|
Sharir-Ivry A, Xia Y. Using Pseudoenzymes to Probe Evolutionary Design Principles of Enzymes. Evol Bioinform Online 2019; 15:1176934319855937. [PMID: 31236007 PMCID: PMC6572901 DOI: 10.1177/1176934319855937] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Accepted: 05/15/2019] [Indexed: 12/24/2022] Open
Abstract
Enzymes are governed by unique evolutionary design principles as their catalytic sites were shown to induce long-range evolutionary conservation gradients. We have recently used a comparative bioinformatics approach to disentangle structural determinants from other possible determinants of the evolutionary conservation gradients. The approach is based on comparing the evolutionary patterns of enzymes to those of pseudoenzymes with the same tertiary structure where the catalytic functionality is turned off. This approach provides a way to evaluate several hypotheses regarding the origin of the observed evolutionary conservation gradient in enzymes. The conclusions from such comparative analyses are important for a better understanding of the unique evolutionary design principles of enzymes, which can in turn potentially guide the design of new and improved enzymes.
Collapse
Affiliation(s)
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, QC, Canada
| |
Collapse
|
5
|
Yampolsky LY, Wolf YI, Bouzinier MA. Net Evolutionary Loss of Residue Polarity in Drosophilid Protein Cores Indicates Ongoing Optimization of Amino Acid Composition. Genome Biol Evol 2018; 9:2879-2892. [PMID: 28985302 PMCID: PMC5737390 DOI: 10.1093/gbe/evx191] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/16/2017] [Indexed: 02/07/2023] Open
Abstract
Amino acid frequencies in proteins may not be at equilibrium. We consider two possible explanations for the nonzero net residue fluxes in drosophilid proteins. First, protein interiors may have a suboptimal residue composition and be under a selective pressure favoring stability, that is, leading to the loss of polar (and the gain of large) amino acids. One would then expect stronger net fluxes on the protein interior than at the exposed sites. Alternatively, if most of the polarity loss occurs at the exposed sites and the selective constraint on amino acid composition at such sites decreases over time, net loss of polarity may be neutral and caused by disproportionally high occurrence of polar residues at exposed, least constrained sites. We estimated net evolutionary fluxes of residue polarity and volume at sites with different solvent accessibility in conserved protein families from 12 species of Drosophila. Net loss of polarity, miniscule in magnitude, but consistent across all lineages, occurred at all sites except the most exposed ones, where net flux of polarity was close to zero or, in membrane proteins, even positive. At the intermediate solvent accessibility the net fluxes of polarity and volume were similar to neutral predictions, whereas much of the polarity loss not attributable to neutral expectations occurred at the buried sites. These observations are consistent with the hypothesis that residue composition in many proteins is structurally suboptimal and continues to evolve toward lower polarity in the protein interior, in particular in proteins with intracellular localization. The magnitude of polarity and volume changes was independent from the protein’s evolutionary age, indicating that the approach to equilibrium has been slow or that no such single equilibrium exists.
Collapse
Affiliation(s)
- Lev Y Yampolsky
- Department of Biological Sciences, East Tennessee State University
| | - Yuri I Wolf
- National Center for Biotechnology Information, NIH, Bethesda, Maryland
| | | |
Collapse
|
6
|
Sharir-Ivry A, Xia Y. The Impact of Native State Switching on Protein Sequence Evolution. Mol Biol Evol 2017; 34:1378-1390. [DOI: 10.1093/molbev/msx071] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
|
7
|
Sojo V, Dessimoz C, Pomiankowski A, Lane N. Membrane Proteins Are Dramatically Less Conserved than Water-Soluble Proteins across the Tree of Life. Mol Biol Evol 2016; 33:2874-2884. [PMID: 27501943 PMCID: PMC5062322 DOI: 10.1093/molbev/msw164] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Membrane proteins are crucial in transport, signaling, bioenergetics, catalysis, and as drug targets. Here, we show that membrane proteins have dramatically fewer detectable orthologs than water-soluble proteins, less than half in most species analyzed. This sparse distribution could reflect rapid divergence or gene loss. We find that both mechanisms operate. First, membrane proteins evolve faster than water-soluble proteins, particularly in their exterior-facing portions. Second, we demonstrate that predicted ancestral membrane proteins are preferentially lost compared with water-soluble proteins in closely related species of archaea and bacteria. These patterns are consistent across the whole tree of life, and in each of the three domains of archaea, bacteria, and eukaryotes. Our findings point to a fundamental evolutionary principle: membrane proteins evolve faster due to stronger adaptive selection in changing environments, whereas cytosolic proteins are under more stringent purifying selection in the homeostatic interior of the cell. This effect should be strongest in prokaryotes, weaker in unicellular eukaryotes (with intracellular membranes), and weakest in multicellular eukaryotes (with extracellular homeostasis). We demonstrate that this is indeed the case. Similarly, we show that extracellular water-soluble proteins exhibit an even stronger pattern of low homology than membrane proteins. These striking differences in conservation of membrane proteins versus water-soluble proteins have important implications for evolution and medicine.
Collapse
Affiliation(s)
- Victor Sojo
- CoMPLEX, University College London, London, United Kingdom Department of Genetics, Evolution and Environment, University College London, London, United Kingdom Systems Biophysics, Faculty of Physics, Ludwig-Maximilian University of Munich, Munich, Germany
| | - Christophe Dessimoz
- Department of Genetics, Evolution and Environment, University College London, London, United Kingdom Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Andrew Pomiankowski
- CoMPLEX, University College London, London, United Kingdom Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| | - Nick Lane
- CoMPLEX, University College London, London, United Kingdom Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| |
Collapse
|
8
|
Echave J, Spielman SJ, Wilke CO. Causes of evolutionary rate variation among protein sites. Nat Rev Genet 2016; 17:109-21. [PMID: 26781812 DOI: 10.1038/nrg.2015.18] [Citation(s) in RCA: 176] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
It has long been recognized that certain sites within a protein, such as sites in the protein core or catalytic residues in enzymes, are evolutionarily more conserved than other sites. However, our understanding of rate variation among sites remains surprisingly limited. Recent progress to address this includes the development of a wide array of reliable methods to estimate site-specific substitution rates from sequence alignments. In addition, several molecular traits have been identified that correlate with site-specific mutation rates, and novel mechanistic biophysical models have been proposed to explain the observed correlations. Nonetheless, current models explain, at best, approximately 60% of the observed variance, highlighting the limitations of current methods and models and the need for new research directions.
Collapse
Affiliation(s)
- Julian Echave
- Escuela de Ciencia y Tecnología, Universidad Nacional de San Martín, 1650 San Martín, Buenos Aires, Argentina
| | - Stephanie J Spielman
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Claus O Wilke
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, and Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
9
|
Spielman SJ, Kumar K, Wilke CO. Comprehensive, structurally-informed alignment and phylogeny of vertebrate biogenic amine receptors. PeerJ 2015; 3:e773. [PMID: 25737813 PMCID: PMC4338800 DOI: 10.7717/peerj.773] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 01/26/2015] [Indexed: 01/29/2023] Open
Abstract
Biogenic amine receptors play critical roles in regulating behavior and physiology in both vertebrates and invertebrates, particularly within the central nervous system. Members of the G-protein coupled receptor (GPCR) family, these receptors interact with endogenous bioamine ligands such as dopamine, serotonin, and epinephrine, and are targeted by a wide array of pharmaceuticals. Despite the clear clinical and biological importance of these receptors, their evolutionary history remains poorly characterized. In particular, the relationships among biogenic amine receptors and any specific evolutionary constraints acting within distinct receptor subtypes are largely unknown. To advance and facilitate studies in this receptor family, we have constructed a comprehensive, high-quality sequence alignment of vertebrate biogenic amine receptors. In particular, we have integrated a traditional multiple sequence approach with robust structural domain predictions to ensure that alignment columns accurately capture the highly-conserved GPCR structural domains, and we demonstrate how ignoring structural information produces spurious inferences of homology. Using this alignment, we have constructed a structurally-partitioned maximum-likelihood phylogeny from which we deduce novel biogenic amine receptor relationships and uncover previously unrecognized lineage-specific receptor clades. Moreover, we find that roughly 1% of the 3039 sequences in our final alignment are either misannotated or unclassified, and we propose updated classifications for these receptors. We release our comprehensive alignment and its corresponding phylogeny as a resource for future research into the evolution and diversification of biogenic amine receptors.
Collapse
Affiliation(s)
| | - Keerthana Kumar
- Department of Integrative Biology, The University of Texas at Austin, Austin, USA
| | - Claus O. Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, USA
| |
Collapse
|