1
|
Gerbino KR, Borin JM, Ardell SM, Lee JJ, Corbett KD, Meyer JR. Bacteriophage Φ21's receptor-binding protein evolves new functions through destabilizing mutations that generate non-genetic phenotypic heterogeneity. Virus Evol 2024; 10:veae049. [PMID: 39170727 PMCID: PMC11336670 DOI: 10.1093/ve/veae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 06/21/2024] [Accepted: 07/10/2024] [Indexed: 08/23/2024] Open
Abstract
How viruses evolve to expand their host range is a major question with implications for predicting the next pandemic. Gain-of-function experiments have revealed that host-range expansions can occur through relatively few mutations in viral receptor-binding proteins, and the search for molecular mechanisms that explain such expansions is underway. Previous research on expansions of receptor use in bacteriophage λ has shown that mutations that destabilize λ's receptor-binding protein cause it to fold into new conformations that can utilize novel receptors but have weakened thermostability. These observations led us to hypothesize that other viruses may take similar paths to expand their host range. Here, we find support for our hypothesis by studying another virus, bacteriophage 21 (Φ21), which evolves to use two new host receptors within 2 weeks of laboratory evolution. By measuring the thermodynamic stability of Φ21 and its descendants, we show that as Φ21 evolves to use new receptors and expands its host range, it becomes less stable and produces viral particles that are genetically identical but vary in their thermostabilities. Next, we show that this non-genetic heterogeneity between particles is directly associated with receptor use innovation, as phage particles with more derived receptor-use capabilities are more unstable and decay faster. Lastly, by manipulating the expression of protein chaperones during Φ21 infection, we demonstrate that heterogeneity in receptor use of phage particles arises during protein folding. Altogether, our results provide support for the hypothesis that viruses can evolve new receptor-use tropisms through mutations that destabilize the receptor-binding protein and produce multiple protein conformers.
Collapse
Affiliation(s)
- Krista R Gerbino
- School of Biological Sciences, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| | - Joshua M Borin
- School of Biological Sciences, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| | - Sarah M Ardell
- School of Biological Sciences, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| | - Justin J Lee
- School of Biological Sciences, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| | - Kevin D Corbett
- School of Biological Sciences, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
- Department of Cellular and Molecular Medicine, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| | - Justin R Meyer
- School of Biological Sciences, University of California San Diego, 9500 Gilman Dr, La Jolla, CA 92093, United States
| |
Collapse
|
2
|
Moss MJ, Chamness LM, Clark PL. The Effects of Codon Usage on Protein Structure and Folding. Annu Rev Biophys 2024; 53:87-108. [PMID: 38134335 PMCID: PMC11227313 DOI: 10.1146/annurev-biophys-030722-020555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
The rate of protein synthesis is slower than many folding reactions and varies depending on the synonymous codons encoding the protein sequence. Synonymous codon substitutions thus have the potential to regulate cotranslational protein folding mechanisms, and a growing number of proteins have been identified with folding mechanisms sensitive to codon usage. Typically, these proteins have complex folding pathways and kinetically stable native structures. Kinetically stable proteins may fold only once over their lifetime, and thus, codon-mediated regulation of the pioneer round of protein folding can have a lasting impact. Supporting an important role for codon usage in folding, conserved patterns of codon usage appear in homologous gene families, hinting at selection. Despite these exciting developments, there remains few experimental methods capable of quantifying translation elongation rates and cotranslational folding mechanisms in the cell, which challenges the development of a predictive understanding of how biology uses codons to regulate protein folding.
Collapse
Affiliation(s)
- McKenze J Moss
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA; , ,
| | - Laura M Chamness
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA; , ,
| | - Patricia L Clark
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA; , ,
| |
Collapse
|
3
|
Nguyen A, Zhao H, Myagmarsuren D, Srinivasan S, Wu D, Chen J, Piszczek G, Schuck P. Modulation of biophysical properties of nucleocapsid protein in the mutant spectrum of SARS-CoV-2. eLife 2024; 13:RP94836. [PMID: 38941236 PMCID: PMC11213569 DOI: 10.7554/elife.94836] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024] Open
Abstract
Genetic diversity is a hallmark of RNA viruses and the basis for their evolutionary success. Taking advantage of the uniquely large genomic database of SARS-CoV-2, we examine the impact of mutations across the spectrum of viable amino acid sequences on the biophysical phenotypes of the highly expressed and multifunctional nucleocapsid protein. We find variation in the physicochemical parameters of its extended intrinsically disordered regions (IDRs) sufficient to allow local plasticity, but also observe functional constraints that similarly occur in related coronaviruses. In biophysical experiments with several N-protein species carrying mutations associated with major variants, we find that point mutations in the IDRs can have nonlocal impact and modulate thermodynamic stability, secondary structure, protein oligomeric state, particle formation, and liquid-liquid phase separation. In the Omicron variant, distant mutations in different IDRs have compensatory effects in shifting a delicate balance of interactions controlling protein assembly properties, and include the creation of a new protein-protein interaction interface in the N-terminal IDR through the defining P13L mutation. A picture emerges where genetic diversity is accompanied by significant variation in biophysical characteristics of functional N-protein species, in particular in the IDRs.
Collapse
Affiliation(s)
- Ai Nguyen
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Huaying Zhao
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Dulguun Myagmarsuren
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Sanjana Srinivasan
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Di Wu
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, United States
| | - Jiji Chen
- Advanced Imaging and Microscopy Resource, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| | - Grzegorz Piszczek
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, United States
| | - Peter Schuck
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, United States
| |
Collapse
|
4
|
Strobel HM, Labador SD, Basu D, Sane M, Corbett KD, Meyer JR. Viral Receptor-Binding Protein Evolves New Function through Mutations That Cause Trimer Instability and Functional Heterogeneity. Mol Biol Evol 2024; 41:msae056. [PMID: 38586942 PMCID: PMC10999833 DOI: 10.1093/molbev/msae056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 02/07/2024] [Accepted: 03/11/2024] [Indexed: 04/09/2024] Open
Abstract
When proteins evolve new activity, a concomitant decrease in stability is often observed because the mutations that confer new activity can destabilize the native fold. In the conventional model of protein evolution, reduced stability is considered a purely deleterious cost of molecular innovation because unstable proteins are prone to aggregation and are sensitive to environmental stressors. However, recent work has revealed that nonnative, often unstable protein conformations play an important role in mediating evolutionary transitions, raising the question of whether instability can itself potentiate the evolution of new activity. We explored this question in a bacteriophage receptor-binding protein during host-range evolution. We studied the properties of the receptor-binding protein of bacteriophage λ before and after host-range evolution and demonstrated that the evolved protein is relatively unstable and may exist in multiple conformations with unique receptor preferences. Through a combination of structural modeling and in vitro oligomeric state analysis, we found that the instability arises from mutations that interfere with trimer formation. This study raises the intriguing possibility that protein instability might play a previously unrecognized role in mediating host-range expansions in viruses.
Collapse
Affiliation(s)
- Hannah M Strobel
- School of Biological Sciences, University of California San Diego, La Jolla, CA, USA
| | - Sweetzel D Labador
- School of Biological Sciences, University of California San Diego, La Jolla, CA, USA
| | - Dwaipayan Basu
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
| | - Mrudula Sane
- School of Biological Sciences, University of California San Diego, La Jolla, CA, USA
| | - Kevin D Corbett
- School of Biological Sciences, University of California San Diego, La Jolla, CA, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
| | - Justin R Meyer
- School of Biological Sciences, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
5
|
Nguyen A, Zhao H, Myagmarsuren D, Srinivasan S, Wu D, Chen J, Piszczek G, Schuck P. Modulation of Biophysical Properties of Nucleocapsid Protein in the Mutant Spectrum of SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.21.568093. [PMID: 38045241 PMCID: PMC10690151 DOI: 10.1101/2023.11.21.568093] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Genetic diversity is a hallmark of RNA viruses and the basis for their evolutionary success. Taking advantage of the uniquely large genomic database of SARS-CoV-2, we examine the impact of mutations across the spectrum of viable amino acid sequences on the biophysical phenotypes of the highly expressed and multifunctional nucleocapsid protein. We find variation in the physicochemical parameters of its extended intrinsically disordered regions (IDRs) sufficient to allow local plasticity, but also exhibiting functional constraints that similarly occur in related coronaviruses. In biophysical experiments with several N-protein species carrying mutations associated with major variants, we find that point mutations in the IDRs can have nonlocal impact and modulate thermodynamic stability, secondary structure, protein oligomeric state, particle formation, and liquid-liquid phase separation. In the Omicron variant, distant mutations in different IDRs have compensatory effects in shifting a delicate balance of interactions controlling protein assembly properties, and include the creation of a new protein-protein interaction interface in the N-terminal IDR through the defining P13L mutation. A picture emerges where genetic diversity is accompanied by significant variation in biophysical characteristics of functional N-protein species, in particular in the IDRs.
Collapse
Affiliation(s)
- Ai Nguyen
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Huaying Zhao
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Dulguun Myagmarsuren
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Sanjana Srinivasan
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Di Wu
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Jiji Chen
- Advanced Imaging and Microscopy Resource, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| | - Grzegorz Piszczek
- Biophysics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Peter Schuck
- Laboratory of Dynamics of Macromolecular Assembly, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
6
|
Rai S, Pramanik S, Mukherjee S. Deciphering the liquid-liquid phase separation induced modulation in the structure, dynamics, and enzymatic activity of an ordered protein β-lactoglobulin. Chem Sci 2024; 15:3936-3948. [PMID: 38487243 PMCID: PMC10935713 DOI: 10.1039/d3sc06802a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 01/23/2024] [Indexed: 03/17/2024] Open
Abstract
Owing to the significant role in the subcellular organization of biomolecules, physiology, and the realm of biomimetic materials, studies related to biomolecular condensates formed through liquid-liquid phase separation (LLPS) have emerged as a growing area of research. Despite valuable contributions of prior research, there is untapped potential in exploring the influence of phase separation on the conformational dynamics and enzymatic activities of native proteins. Herein, we investigate the LLPS of β-lactoglobulin (β-LG), a non-intrinsically disordered protein, under crowded conditions. In-depth characterization through spectroscopic and microscopic techniques revealed the formation of dynamic liquid-like droplets, distinct from protein aggregates, driven by hydrophobic interactions. Our analyses revealed that phase separation can alter structural flexibility and photophysical properties. Importantly, the phase-separated β-LG exhibited efficient enzymatic activity as an esterase; a characteristic seemingly exclusive to β-LG droplets. The droplets acted as robust catalytic crucibles, providing an ideal environment for efficient ester hydrolysis. Further investigation into the catalytic mechanism suggested the involvement of specific amino acid residues, rather than general acid or base catalysis. Also, the alteration in conformational distribution caused by phase separation unveils the latent functionality. Our study delineates the understanding of protein phase separation and insights into the diverse catalytic strategies employed by proteins. It opens exciting possibilities for designing functional artificial compartments based on phase-separated biomolecules.
Collapse
Affiliation(s)
- Saurabh Rai
- Department of Chemistry, Indian Institute of Science Education and Research Bhopal Bhopal Bypass Road Bhopal 462066 Madhya Pradesh India
| | - Srikrishna Pramanik
- Department of Chemistry, Indian Institute of Science Education and Research Bhopal Bhopal Bypass Road Bhopal 462066 Madhya Pradesh India
| | - Saptarshi Mukherjee
- Department of Chemistry, Indian Institute of Science Education and Research Bhopal Bhopal Bypass Road Bhopal 462066 Madhya Pradesh India
| |
Collapse
|
7
|
Yang M, Harrison BR, Promislow DEL. Cellular age explains variation in age-related cell-to-cell transcriptome variability. Genome Res 2023; 33:1906-1916. [PMID: 37973195 PMCID: PMC10760448 DOI: 10.1101/gr.278144.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 10/12/2023] [Indexed: 11/19/2023]
Abstract
Organs and tissues age at different rates within a single individual. Such asynchrony in aging has been widely observed at multiple levels, from functional hallmarks, such as anatomical structures and physiological processes, to molecular endophenotypes, such as the transcriptome and metabolome. However, we lack a conceptual framework to understand why some components age faster than others. Just as demographic models explain why aging evolves, here we test the hypothesis that demographic differences among cell types, determined by cell-specific differences in turnover rate, can explain why the transcriptome shows signs of aging in some cell types but not others. Through analysis of mouse single-cell transcriptome data across diverse tissues and ages, we find that cellular age explains a large proportion of the variation in the age-related increase in transcriptome variance. We further show that long-lived cells are characterized by relatively high expression of genes associated with proteostasis and that the transcriptome of long-lived cells shows greater evolutionary constraint than short-lived cells. In contrast, in short-lived cell types, the transcriptome is enriched for genes associated with DNA repair. Based on these observations, we develop a novel heuristic model that explains how and why aging rates differ among cell types.
Collapse
Affiliation(s)
- Ming Yang
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Benjamin R Harrison
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | - Daniel E L Promislow
- Department of Laboratory Medicine and Pathology, University of Washington School of Medicine, Seattle, Washington 98195, USA;
- Department of Biology, University of Washington, Seattle, Washington 98195, USA
| |
Collapse
|
8
|
Prabantu VM, Tandon H, Sandhya S, Sowdhamini R, Srinivasan N. The alteration of structural network upon transient association between proteins studied using graph theory. Proteins 2023. [PMID: 37902388 DOI: 10.1002/prot.26606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 08/31/2023] [Accepted: 09/14/2023] [Indexed: 10/31/2023]
Abstract
Proteins such as enzymes perform their function by predominant non-covalent bond interactions between transiently interacting units. There is an impact on the overall structural topology of the protein, albeit transient nature of such interactions, that enable proteins to deactivate or activate. This aspect of the alteration of the structural topology is studied by employing protein structural networks, which are node-edge representative models of protein structure, reported as a robust tool for capturing interactions between residues. Several methods have been optimized to collect meaningful, functionally relevant information by studying alteration of structural networks. In this article, different methods of comparing protein structural networks are employed, along with spectral decomposition of graphs to study the subtle impact of protein-protein interactions. A detailed analysis of the structural network of interacting partners is performed across a dataset of around 900 pairs of bound complexes and corresponding unbound protein structures. The variation in network parameters at, around, and far away from the interface are analyzed. Finally, we present interesting case studies, where an allosteric mechanism of structural impact is understood from communication-path detection methods. The results of this analysis are beneficial in understanding protein stability, for future engineering, and docking studies.
Collapse
Affiliation(s)
| | - Himani Tandon
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Sankaran Sandhya
- Faculty of Life and Health Sciences, Department of Biotechnology, Ramaiah University of Applied Sciences, Bangalore, India
| | - Ramanathan Sowdhamini
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- National Centre for Biological Sciences (TIFR), Bangalore, India
- Institute of Bioinformatics and Applied Biotechnology, Bangalore, India
| | | |
Collapse
|
9
|
Gollapalli P, Rudrappa S, Kumar V, Santosh Kumar HS. Domain Architecture Based Methods for Comparative Functional Genomics Toward Therapeutic Drug Target Discovery. J Mol Evol 2023; 91:598-615. [PMID: 37626222 DOI: 10.1007/s00239-023-10129-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Accepted: 08/06/2023] [Indexed: 08/27/2023]
Abstract
Genes duplicate, mutate, recombine, fuse or fission to produce new genes, or when genes are formed from de novo, novel functions arise during evolution. Researchers have tried to quantify the causes of these molecular diversification processes to know how these genes increase molecular complexity over a period of time, for instance protein domain organization. In contrast to global sequence similarity, protein domain architectures can capture key structural and functional characteristics, making them better proxies for describing functional equivalence. In Prokaryotes and eukaryotes it has proven that, domain designs are retained over significant evolutionary distances. Protein domain architectures are now being utilized to categorize and distinguish evolutionarily related proteins and find homologs among species that are evolutionarily distant from one another. Additionally, structural information stored in domain structures has accelerated homology identification and sequence search methods. Tools for functional protein annotation have been developed to discover, protein domain content, domain order, domain recurrence, and domain position as all these contribute to the prediction of protein functional accuracy. In this review, an attempt is made to summarise facts and speculations regarding the use of protein domain architecture and modularity to identify possible therapeutic targets among cellular activities based on the understanding their linked biological processes.
Collapse
Affiliation(s)
- Pavan Gollapalli
- Center for Bioinformatics and Biostatistics, Nitte (Deemed to be University), Mangalore, Karnataka, 575018, India
| | - Sushmitha Rudrappa
- Department of Biotechnology and Bioinformatics, Jnana Sahyadri Campus, Kuvempu University, Shankaraghatta, Shivamogga, Karnataka, 577451, India
| | - Vadlapudi Kumar
- Department of Biochemistry, Davangere University, Shivagangothri, Davangere, Karnataka, 577007, India
| | - Hulikal Shivashankara Santosh Kumar
- Department of Biotechnology and Bioinformatics, Jnana Sahyadri Campus, Kuvempu University, Shankaraghatta, Shivamogga, Karnataka, 577451, India.
| |
Collapse
|
10
|
Vallina Estrada E, Zhang N, Wennerström H, Danielsson J, Oliveberg M. Diffusive intracellular interactions: On the role of protein net charge and functional adaptation. Curr Opin Struct Biol 2023; 81:102625. [PMID: 37331204 DOI: 10.1016/j.sbi.2023.102625] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 05/16/2023] [Accepted: 05/16/2023] [Indexed: 06/20/2023]
Abstract
A striking feature of nucleic acids and lipid membranes is that they all carry net negative charge and so is true for the majority of intracellular proteins. It is suggested that the role of this negative charge is to assure a basal intermolecular repulsion that keeps the cytosolic content suitably 'fluid' for function. We focus in this review on the experimental, theoretical and genetic findings which serve to underpin this idea and the new questions they raise. Unlike the situation in test tubes, any functional protein-protein interaction in the cytosol is subject to competition from the densely crowded background, i.e. surrounding stickiness. At the nonspecific limit of this stickiness is the 'random' protein-protein association, maintaining profuse populations of transient and constantly interconverting complexes at physiological protein concentrations. The phenomenon is readily quantified in studies of the protein rotational diffusion, showing that the more net negatively charged a protein is the less it is retarded by clustering. It is further evident that this dynamic protein-protein interplay is under evolutionary control and finely tuned across organisms to maintain optimal physicochemical conditions for the cellular processes. The emerging picture is then that specific cellular function relies on close competition between numerous weak and strong interactions, and where all parts of the protein surfaces are involved. The outstanding challenge is now to decipher the very basics of this many-body system: how the detailed patterns of charged, polar and hydrophobic side chains not only control protein-protein interactions at close- and long-range but also the collective properties of the cellular interior as a whole.
Collapse
Affiliation(s)
- Eloy Vallina Estrada
- Department of Biochemistry and Biophysics, Arrhenius Laboratories of Natural Sciences, Stockholm University, S-106 91 Stockholm, Sweden
| | - Nannan Zhang
- Department of Biochemistry and Biophysics, Arrhenius Laboratories of Natural Sciences, Stockholm University, S-106 91 Stockholm, Sweden
| | - Håkan Wennerström
- Division of Physical Chemistry, Department of Chemistry, Lund University, Box 124, 22100 Lund, Sweden
| | - Jens Danielsson
- Department of Biochemistry and Biophysics, Arrhenius Laboratories of Natural Sciences, Stockholm University, S-106 91 Stockholm, Sweden
| | - Mikael Oliveberg
- Department of Biochemistry and Biophysics, Arrhenius Laboratories of Natural Sciences, Stockholm University, S-106 91 Stockholm, Sweden.
| |
Collapse
|
11
|
Chakravarty D, Sreenivasan S, Swint-Kruse L, Porter LL. Identification of a covert evolutionary pathway between two protein folds. Nat Commun 2023; 14:3177. [PMID: 37264049 DOI: 10.1038/s41467-023-38519-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 05/03/2023] [Indexed: 06/03/2023] Open
Abstract
Although homologous protein sequences are expected to adopt similar structures, some amino acid substitutions can interconvert α-helices and β-sheets. Such fold switching may have occurred over evolutionary history, but supporting evidence has been limited by the: (1) abundance and diversity of sequenced genes, (2) quantity of experimentally determined protein structures, and (3) assumptions underlying the statistical methods used to infer homology. Here, we overcome these barriers by applying multiple statistical methods to a family of ~600,000 bacterial response regulator proteins. We find that their homologous DNA-binding subunits assume divergent structures: helix-turn-helix versus α-helix + β-sheet (winged helix). Phylogenetic analyses, ancestral sequence reconstruction, and AlphaFold2 models indicate that amino acid substitutions facilitated a switch from helix-turn-helix into winged helix. This structural transformation likely expanded DNA-binding specificity. Our approach uncovers an evolutionary pathway between two protein folds and provides a methodology to identify secondary structure switching in other protein families.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Shwetha Sreenivasan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Lauren L Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
Collapse
|
12
|
Chauhan R, Bhattacharya J, Solanki R, Ahmad FJ, Alankar B, Kaur H. GUD-VE visualization tool for physicochemical properties of proteins. MethodsX 2023; 10:102226. [PMID: 37424755 PMCID: PMC10326500 DOI: 10.1016/j.mex.2023.102226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 05/17/2023] [Indexed: 07/11/2023] Open
Abstract
The physicochemical properties of primary sequences of proteins helps in determining both the structure and biological functions. The sequence analysis of the proteins and nucleic acids is most fundamental element of bioinformatics. Without these elements, it is impossible to gain insight deeper molecular and biochemical mechanisms. For this purpose, the computational methods like bioinformatics tools assist experts and novices alike in resolving issues relating to protein analysis. Similarly, this proposed work, for the graphical user interface (GUI) based prediction and visualization through the computations-based method done on Jupyter Notebook with tkinter package which allows the creation of a program on a local host platform and accessed by the programmer.•When it is queried with a protein sequence, it predicts physicochemical parameters of the peptides.•Users can choose to visualize the findings acquired either anonymously or on the user-specified email address and compare the biophysical properties of one protein with other using amino acids (AA) sequences. The aim of this paper is to meet the requirements of experimentalists, not just hardcore bioinformaticians related to biophysical properties prediction and comparison with other proteins. The code for it has been uploaded on GitHub (an online repository of codes) in private mode.
Collapse
Affiliation(s)
- Ritu Chauhan
- Amity University, Noida 201313, Uttar Pradesh, India
| | | | - Rubi Solanki
- School of Interdisciplinary Sciences and Technology, Jamia Hamdard, New Delhi 110062, India
| | - Farhan Jalees Ahmad
- School of Interdisciplinary Sciences and Technology, Jamia Hamdard, New Delhi 110062, India
| | - Bhavya Alankar
- Department of Computer Science and Engineering, School of Engineering Sciences and Technology, Jamia Hamdard, New Delhi 110062, India
| | - Harleen Kaur
- Department of Computer Science and Engineering, School of Engineering Sciences and Technology, Jamia Hamdard, New Delhi 110062, India
| |
Collapse
|
13
|
Nag M, Clark AC. Conserved folding landscape of monomeric initiator caspases. J Biol Chem 2023; 299:103075. [PMID: 36858199 PMCID: PMC10074801 DOI: 10.1016/j.jbc.2023.103075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 02/23/2023] [Accepted: 02/24/2023] [Indexed: 03/03/2023] Open
Abstract
The apoptotic caspase subfamily evolved into two subfamilies-monomeric initiators and dimeric effectors; both subfamilies share a conserved caspase-hemoglobinase fold with a protease domain containing a large subunit and a small subunit. Sequence variations in the conserved caspase-hemoglobinase fold resulted in changes in oligomerization, enzyme specificity, and regulation, making caspases an excellent model for examining the mechanisms of molecular evolution in fine-tuning structure, function, and allosteric regulation. We examined the urea-induced equilibrium folding/unfolding of two initiator caspases, monomeric caspase-8 and cFLIPL, over a broad pH range. Both proteins unfold by a three-state equilibrium mechanism that includes a partially folded intermediate. In addition, both proteins undergo a conserved pH-dependent conformational change that is controlled by an evolutionarily conserved mechanism. We show that the conformational free energy landscape of the caspase monomer is conserved in the monomeric and dimeric subfamilies. Molecular dynamics simulations in the presence or the absence of urea, coupled with limited trypsin proteolysis and mass spectrometry, show that the small subunit is unstable in the protomer and unfolds prior to the large subunit. In addition, the unfolding of helix 2 in the large subunit results in disruption of a conserved allosteric site. Because the small subunit forms the interface for dimerization, our results highlight an important driving force for the evolution of the dimeric caspase subfamily through stabilizing the small subunit.
Collapse
Affiliation(s)
- Mithun Nag
- Department of Biology, University of Texas at Arlington, Arlington, Texas, USA
| | - A Clay Clark
- Department of Biology, University of Texas at Arlington, Arlington, Texas, USA.
| |
Collapse
|
14
|
Medina-Ortiz K, Navia F, Mosquera-Gil C, Sánchez A, Sterling G, Fierro L, Castaño S. Identification of the NA +/K +-ATPase α-Isoforms in Six Species of Poison Dart Frogs and their Sensitivity to Cardiotonic Steroids. J Chem Ecol 2023; 49:116-132. [PMID: 36877397 PMCID: PMC10102066 DOI: 10.1007/s10886-023-01404-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Revised: 01/17/2023] [Accepted: 01/18/2023] [Indexed: 03/07/2023]
Abstract
Cardiotonic steroids (CTS) are a group of compounds known to be toxic due to their ability to inhibit the Na+/K+-ATPase (NKA), which is essential to maintain the balance of ions in animal cells. An evolutionary strategy of molecular adaptation to avoid self-intoxication acquired by CTS defended organisms and their predators is the structural modification of their NKA where specific amino acid substitutions confer resistant phenotypes. Several lineages of poison dart frogs (Dendrobatidae) are well known to sequester a wide variety of lipophilic alkaloids from their arthropod diet, however there is no evidence of CTS-sequestration or dietary exposure. Interestingly this study identified the presence of α-NKA isoforms (α1 and α2) with amino acid substitutions indicative of CTS-resistant phenotypes in skeletal muscle transcriptomes obtained from six species of dendrobatids: Phyllobates aurotaenia, Oophaga anchicayensis, Epipedobates boulengeri, Andinobates bombetes, Andinobates minutus, and Leucostethus brachistriatus, collected in the Valle del Cauca (Colombia). P. aurotaenia, A. minutus, and E. boulengeri presented two variants for α1-NKA, with one of them having these substitutions. In contrast, O. anchicayensis and A. bombetes have only one α1-NKA isoform with an amino acid sequence indicative of CTS susceptibility and an α2-NKA with one substitution that could confer a reduced affinity for CTS. The α1 and α2 isoforms of L. brachistriatus do not contain substitutions imparting CTS resistance. Our findings indicate that poison dart frogs express α-NKA isoforms with different affinities for CTS and the pattern of this expression might be influenced by factors related to evolutionary, physiological, ecological, and geographical burdens.
Collapse
Affiliation(s)
- Katherine Medina-Ortiz
- Laboratorio de Herpetología Y Toxinología, Department of Physiological Sciences, Faculty of Health, Universidad del Valle, Cali, Colombia.
| | - Felipe Navia
- Laboratorio de Herpetología Y Toxinología, Department of Physiological Sciences, Faculty of Health, Universidad del Valle, Cali, Colombia
| | - Claudia Mosquera-Gil
- Laboratorio de Herpetología Y Toxinología, Department of Physiological Sciences, Faculty of Health, Universidad del Valle, Cali, Colombia
| | - Adalberto Sánchez
- Laboratorio de Herpetología Y Toxinología, Department of Physiological Sciences, Faculty of Health, Universidad del Valle, Cali, Colombia
| | - Gonzalo Sterling
- Laboratorio de Herpetología Y Toxinología, Department of Physiological Sciences, Faculty of Health, Universidad del Valle, Cali, Colombia
| | - Leonardo Fierro
- Laboratorio de Herpetología Y Toxinología, Department of Physiological Sciences, Faculty of Health, Universidad del Valle, Cali, Colombia
| | - Santiago Castaño
- Laboratorio de Herpetología Y Toxinología, Department of Physiological Sciences, Faculty of Health, Universidad del Valle, Cali, Colombia.
| |
Collapse
|
15
|
Kiefl E, Esen OC, Miller SE, Kroll KL, Willis AD, Rappé MS, Pan T, Eren AM. Structure-informed microbial population genetics elucidate selective pressures that shape protein evolution. SCIENCE ADVANCES 2023; 9:eabq4632. [PMID: 36812328 DOI: 10.1126/sciadv.abq4632] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Accepted: 01/18/2023] [Indexed: 06/18/2023]
Abstract
Comprehensive sampling of natural genetic diversity with metagenomics enables highly resolved insights into the interplay between ecology and evolution. However, resolving adaptive, neutral, or purifying processes of evolution from intrapopulation genomic variation remains a challenge, partly due to the sole reliance on gene sequences to interpret variants. Here, we describe an approach to analyze genetic variation in the context of predicted protein structures and apply it to a marine microbial population within the SAR11 subclade 1a.3.V, which dominates low-latitude surface oceans. Our analyses reveal a tight association between genetic variation and protein structure. In a central gene in nitrogen metabolism, we observe decreased occurrence of nonsynonymous variants from ligand-binding sites as a function of nitrate concentrations, revealing genetic targets of distinct evolutionary pressures maintained by nutrient availability. Our work yields insights into the governing principles of evolution and enables structure-aware investigations of microbial population genetics.
Collapse
Affiliation(s)
- Evan Kiefl
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Ozcan C Esen
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Samuel E Miller
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543, USA
| | - Kourtney L Kroll
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Amy D Willis
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Michael S Rappé
- Hawai'i Institute of Marine Biology, University of Hawai'i at Mānoa, Kāne'ohe, HI 96822, USA
| | - Tao Pan
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL 60637, USA
| | - A Murat Eren
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543, USA
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Oldenburg, Germany
- Alfred Wegener Institute for Polar and Marine Research, Bremerhaven, Germany
- Helmholtz Institute for Functional Marine Biodiversity, Oldenburg, Germany
| |
Collapse
|
16
|
Design of an artificial phage-display library based on a new scaffold improved for average stability of the randomized proteins. Sci Rep 2023; 13:1339. [PMID: 36693880 PMCID: PMC9873692 DOI: 10.1038/s41598-023-27710-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 01/06/2023] [Indexed: 01/25/2023] Open
Abstract
Scaffold-based protein libraries are designed to be both diverse and rich in functional/folded proteins. However, introducing an extended diversity while preserving stability of the initial scaffold remains a challenge. Here we developed an original approach to select the ensemble of folded proteins from an initial library. The thermostable CheY protein from Thermotoga maritima was chosen as scaffold. Four loops of CheY were diversified to create a new binding surface. The subset of the library giving rise to folded proteins was first selected using a natural protein partner of the template scaffold. Then, a gene shuffling approach based on a single restriction enzyme was used to recombine DNA sequences encoding these filtrated variants. Taken together, the filtration strategy and the shuffling of the filtrated sequences were shown to enrich the library in folded and stable sequences while maintaining a large diversity in the final library (Lib-Cheytins 2.1). Binders of the Oplophorus luciferase Kaz domain were then selected by phage display from the final library, showing affinities in the μM range. One of the best variants induced a loss of 92% of luminescent activity, suggesting that this Cheytin preferentially binds to the Kaz active site.
Collapse
|
17
|
Aledo P, Aledo JC. Proteome-Wide Structural Computations Provide Insights into Empirical Amino Acid Substitution Matrices. Int J Mol Sci 2023; 24:ijms24010796. [PMID: 36614247 PMCID: PMC9821064 DOI: 10.3390/ijms24010796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 12/24/2022] [Accepted: 12/29/2022] [Indexed: 01/04/2023] Open
Abstract
The relative contribution of mutation and selection to the amino acid substitution rates observed in empirical matrices is unclear. Herein, we present a neutral continuous fitness-stability model, inspired by the Arrhenius law (qij=aije-ΔΔGij). The model postulates that the rate of amino acid substitution (i→j) is determined by the product of a pre-exponential factor, which is influenced by the genetic code structure, and an exponential term reflecting the relative fitness of the amino acid substitutions. To assess the validity of our model, we computed changes in stability of 14,094 proteins, for which 137,073,638 in silico mutants were analyzed. These site-specific data were summarized into a 20 square matrix, whose entries, ΔΔGij, were obtained after averaging through all the sites in all the proteins. We found a significant positive correlation between these energy values and the disease-causing potential of each substitution, suggesting that the exponential term accurately summarizes the fitness effect. A remarkable observation was that amino acids that were highly destabilizing when acting as the source, tended to have little effect when acting as the destination, and vice versa (source → destination). The Arrhenius model accurately reproduced the pattern of substitution rates collected in the empirical matrices, suggesting a relevant role for the genetic code structure and a tuning role for purifying selection exerted via protein stability.
Collapse
|
18
|
Michel F, Shanmugaratnam S, Romero-Romero S, Höcker B. Structures of permuted halves of a modern ribose-binding protein. Acta Crystallogr D Struct Biol 2023; 79:40-49. [PMID: 36601806 PMCID: PMC9815098 DOI: 10.1107/s205979832201186x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 12/13/2022] [Indexed: 12/31/2022] Open
Abstract
Periplasmic binding proteins (PBPs) are a class of proteins that participate in the cellular transport of various ligands. They have been used as model systems to study mechanisms in protein evolution, such as duplication, recombination and domain swapping. It has been suggested that PBPs evolved from precursors half their size. Here, the crystal structures of two permuted halves of a modern ribose-binding protein (RBP) from Thermotoga maritima are reported. The overexpressed proteins are well folded and show a monomer-dimer equilibrium in solution. Their crystal structures show partially noncanonical PBP-like fold type I conformations with structural deviations from modern RBPs. One of the half variants forms a dimer via segment swapping, suggesting a high degree of malleability. The structural findings on these permuted halves support the evolutionary hypothesis that PBPs arose via a duplication event of a flavodoxin-like protein and further support a domain-swapping step that might have occurred during the evolution of the PBP-like fold, a process that is necessary to generate the characteristic motion of PBPs essential to perform their functions.
Collapse
Affiliation(s)
- Florian Michel
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | | | | | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany,Correspondence e-mail:
| |
Collapse
|
19
|
Chakraborty C, Bhattacharya M, Chatterjee S, Sharma AR, Saha RP, Dhama K, Agoramoorthy G. Integrative Bioinformatics Approaches Indicate a Particular Pattern of Some SARS-CoV-2 and Non-SARS-CoV-2 Proteins. Vaccines (Basel) 2022; 11:vaccines11010038. [PMID: 36679883 PMCID: PMC9864461 DOI: 10.3390/vaccines11010038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/12/2022] [Accepted: 12/20/2022] [Indexed: 12/28/2022] Open
Abstract
Pattern recognition plays a critical role in integrative bioinformatics to determine the structural patterns of proteins of viruses such as SARS-CoV-2. This study identifies the pattern of SARS-CoV-2 proteins to depict the structure-function relationships of the protein alphabets of SARS-CoV-2 and COVID-19. The assembly enumeration algorithm, Anisotropic Network Model, Gaussian Network Model, Markovian Stochastic Model, and image comparison protein-like alphabets were used. The distance score was the lowest with 22 for "I" and highest with 40 for "9". For post-processing and decision, two protein alphabets "C" (PDB ID: 6XC3) and "S" (PDB ID: 7OYG) were evaluated to understand the structural, functional, and evolutionary relationships, and we found uniqueness in the functionality of proteins. Here, models were constructed using "SARS-CoV-2 proteins" (12 numbers) and "non-SARS-CoV-2 proteins" (14 numbers) to create two words, "SARS-CoV-2" and "COVID-19". Similarly, we developed two slogans: "Vaccinate the world against COVID-19" and "Say no to SARS-CoV-2", which were made with the proteins structure. It might generate vaccine-related interest to broad reader categories. Finally, the evolutionary process appears to enhance the protein structure smoothly to provide suitable functionality shaped by natural selection.
Collapse
Affiliation(s)
- Chiranjib Chakraborty
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata 700126, West Bengal, India
- Correspondence:
| | - Manojit Bhattacharya
- Department of Zoology, Fakir Mohan University, Vyasa Vihar, Balasore 756020, Odisha, India
| | - Srijan Chatterjee
- Institute for Skeletal Aging and Orthopaedic Surgery, Hallym University-Chuncheon Sacred Heart Hospital, Chuncheon-si 24252, Gangwon-do, Republic of Korea
| | - Ashish Ranjan Sharma
- Institute for Skeletal Aging and Orthopaedic Surgery, Hallym University-Chuncheon Sacred Heart Hospital, Chuncheon-si 24252, Gangwon-do, Republic of Korea
| | - Rudra P. Saha
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata 700126, West Bengal, India
| | - Kuldeep Dhama
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, Uttar Pradesh, India
| | | |
Collapse
|
20
|
Evolutionary entrenchment in immune proteins selects against stabilizing mutations. Proc Natl Acad Sci U S A 2022; 119:e2216087119. [PMID: 36449544 PMCID: PMC9894131 DOI: 10.1073/pnas.2216087119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
|
21
|
Fareed MM, Dutta K, Dandekar T, Tarabonda H, Skorb EV, Shityakov S. In silico investigation of nonsynonymous single nucleotide polymorphisms in BCL2 apoptosis regulator gene to design novel protein-based drugs against cancer. J Cell Biochem 2022; 123:2044-2056. [PMID: 36146908 DOI: 10.1002/jcb.30330] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 09/04/2022] [Accepted: 09/10/2022] [Indexed: 12/24/2022]
Abstract
BCL2 apoptosis regulator gene encodes Bcl-2 pro-survival protein, which plays an important role to evade apoptosis in various cancers. Moreover, single nucleotide polymorphisms (SNPs) in the BCL2 gene can be nonsynonymous (nsSNPs), which might affect the protein stability and probably its function. Therefore, we implement cutting-edge computational techniques based on the Spherical Polar Fourier and Monte-Carlo algorithms to investigate the impact of these SNPs on the B cell lymphoma-2 (Bcl-2) stability and therapeutic potential of protein-based molecules to inhibit this protein. As a result, we identified two nsSNPs (Q118R and R129C) to be deleterious and highly conserved, having a negative effect on protein stability. Additionally, molecular docking and molecular dynamics simulations confirmed the decreased binding affinity of mutated Bcl-2 variants to bind three-helix bundle protein inhibitor as these mutations occurred in the protein-protein binding site. Overall, this computational approach investigating nsSNPs provides a useful basis for designing novel molecules to inhibit Bcl-2 pro-survival pathway in malignant cells.
Collapse
Affiliation(s)
- Muhammad Mazhar Fareed
- Department of Computer Science, School of Science and Engineering, Università degli studi di Verona, Verona, Italy
| | - Kunal Dutta
- Department of Human Physiology, Vidyasagar University, Midnapore, West Bengal, India
| | - Thomas Dandekar
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
| | - Herman Tarabonda
- Laboratory of Chemoinformatics, Infochemistry Scientific Center, ITMO University, Saint-Petersburg, Russia
| | - Ekaterina V Skorb
- Laboratory of Chemoinformatics, Infochemistry Scientific Center, ITMO University, Saint-Petersburg, Russia
| | - Sergey Shityakov
- Laboratory of Chemoinformatics, Infochemistry Scientific Center, ITMO University, Saint-Petersburg, Russia
| |
Collapse
|
22
|
Harman JL, Reardon PN, Costello SM, Warren GD, Phillips SR, Connor PJ, Marqusee S, Harms MJ. Evolution avoids a pathological stabilizing interaction in the immune protein S100A9. Proc Natl Acad Sci U S A 2022; 119:e2208029119. [PMID: 36194634 PMCID: PMC9565474 DOI: 10.1073/pnas.2208029119] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 09/07/2022] [Indexed: 01/03/2023] Open
Abstract
Stability constrains evolution. While much is known about constraints on destabilizing mutations, less is known about the constraints on stabilizing mutations. We recently identified a mutation in the innate immune protein S100A9 that provides insight into such constraints. When introduced into human S100A9, M63F simultaneously increases the stability of the protein and disrupts its natural ability to activate Toll-like receptor 4. Using chemical denaturation, we found that M63F stabilizes a calcium-bound conformation of hS100A9. We then used NMR to solve the structure of the mutant protein, revealing that the mutation distorts the hydrophobic binding surface of hS100A9, explaining its deleterious effect on function. Hydrogen-deuterium exchange (HDX) experiments revealed stabilization of the region around M63F in the structure, notably Phe37. In the structure of the M63F mutant, the Phe37 and Phe63 sidechains are in contact, plausibly forming an edge-face π-stack. Mutating Phe37 to Leu abolished the stabilizing effect of M63F as probed by both chemical denaturation and HDX. It also restored the biological activity of S100A9 disrupted by M63F. These findings reveal that Phe63 creates a molecular staple with Phe37 that stabilizes a nonfunctional conformation of the protein, thus disrupting function. Using a bioinformatic analysis, we found that S100A9 proteins from different organisms rarely have Phe at both positions 37 and 63, suggesting that avoiding a pathological stabilizing interaction indeed constrains S100A9 evolution. This work highlights an important evolutionary constraint on stabilizing mutations, namely, that they must avoid inappropriately stabilizing nonfunctional protein conformations.
Collapse
Affiliation(s)
- Joseph L Harman
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Patrick N Reardon
- College of Science, NMR Facility, Oregon State University, Corvallis, OR 97331
| | - Shawn M Costello
- Biophysics Graduate Program, University of California, Berkeley, Berkeley, CA 94720
| | - Gus D Warren
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Sophia R Phillips
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Patrick J Connor
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
- Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA 94720
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| |
Collapse
|
23
|
Kim I, Dubrow A, Zuniga B, Zhao B, Sherer N, Bastiray A, Li P, Cho JH. Energy landscape reshaped by strain-specific mutations underlies epistasis in NS1 evolution of influenza A virus. Nat Commun 2022; 13:5775. [PMID: 36182933 PMCID: PMC9526705 DOI: 10.1038/s41467-022-33554-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 09/22/2022] [Indexed: 11/24/2022] Open
Abstract
Elucidating how individual mutations affect the protein energy landscape is crucial for understanding how proteins evolve. However, predicting mutational effects remains challenging because of epistasis—the nonadditive interactions between mutations. Here, we investigate the biophysical mechanism of strain-specific epistasis in the nonstructural protein 1 (NS1) of influenza A viruses (IAVs). We integrate structural, kinetic, thermodynamic, and conformational dynamics analyses of four NS1s of influenza strains that emerged between 1918 and 2004. Although functionally near-neutral, strain-specific NS1 mutations exhibit long-range epistatic interactions with residues at the p85β-binding interface. We reveal that strain-specific mutations reshaped the NS1 energy landscape during evolution. Using NMR spin dynamics, we find that the strain-specific mutations altered the conformational dynamics of the hidden network of tightly packed residues, underlying the evolution of long-range epistasis. This work shows how near-neutral mutations silently alter the biophysical energy landscapes, resulting in diverse background effects during molecular evolution. Influenza A virus (IAV) nonstructural protein 1 (NS1) is a multifunctional virulence factor that interacts with several host factors such as phosphatidylinositol-3-kinase (PI3K). NS1 binds specifically to the p85β regulatory subunit of PI3K and subsequently activates PI3K signaling. Here, Kim et al. show that functionally near-neutral, strain-specific NS1 mutations lead to variations in binding kinetics to p85β exhibit long-range epistatic interactions. Applying NMR they provide evidence that the structural dynamics of the NS1 hydrophobic core have evolved over time and contributed to epistasis.
Collapse
Affiliation(s)
- Iktae Kim
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Alyssa Dubrow
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Bryan Zuniga
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Baoyu Zhao
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Noah Sherer
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Abhishek Bastiray
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Pingwei Li
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Jae-Hyun Cho
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA.
| |
Collapse
|
24
|
Strobel HM, Stuart EC, Meyer JR. A Trait-Based Approach to Predicting Viral Host-Range Evolvability. Annu Rev Virol 2022; 9:139-156. [PMID: 36173699 DOI: 10.1146/annurev-virology-091919-092003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Predicting the evolution of virus host range has proven to be extremely difficult, in part because of the sheer diversity of viruses, each with unique biology and ecological interactions. We have not solved this problem, but to make the problem more tractable, we narrowed our focus to three traits intrinsic to all viruses that may play a role in host-range evolvability: mutation rate, recombination rate, and phenotypic heterogeneity. Although each trait should increase evolvability, they cannot do so unbounded because fitness trade-offs limit the ability of all three traits to maximize evolvability. By examining these constraints, we can begin to identify groups of viruses with suites of traits that make them especially concerning, as well as ecological and environmental conditions that might push evolution toward accelerating host-range expansion.
Collapse
Affiliation(s)
- Hannah M Strobel
- Division of Biological Sciences, University of California, San Diego, La Jolla, California, USA;
| | - Elizabeth C Stuart
- Division of Biological Sciences, University of California, San Diego, La Jolla, California, USA;
| | - Justin R Meyer
- Division of Biological Sciences, University of California, San Diego, La Jolla, California, USA;
| |
Collapse
|
25
|
Abdelzaher H, Tawfik SM, Nour A, Abdelkader S, Elbalkiny ST, Abdelkader M, Abbas WA, Abdelnaser A. Climate change, human health, and the exposome: Utilizing OMIC technologies to navigate an era of uncertainty. Front Public Health 2022; 10:973000. [PMID: 36211706 PMCID: PMC9533016 DOI: 10.3389/fpubh.2022.973000] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 08/17/2022] [Indexed: 01/25/2023] Open
Abstract
Climate change is an anthropogenic phenomenon that is alarming scientists and non-scientists alike. The emission of greenhouse gases is causing the temperature of the earth to rise and this increase is accompanied by a multitude of climate change-induced environmental exposures with potential health impacts. Tracking human exposure has been a major research interest of scientists worldwide. This has led to the development of exposome studies that examine internal and external individual exposures over their lifetime and correlate them to health. The monitoring of health has also benefited from significant technological advances in the field of "omics" technologies that analyze physiological changes on the nucleic acid, protein, and metabolism levels, among others. In this review, we discuss various climate change-induced environmental exposures and their potential health implications. We also highlight the potential integration of the technological advancements in the fields of exposome tracking, climate monitoring, and omics technologies shedding light on important questions that need to be answered.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Anwar Abdelnaser
- Institute of Global Health and Human Ecology, The American University in Cairo, New Cairo, Egypt
| |
Collapse
|
26
|
Tang QY, Ren W, Wang J, Kaneko K. The Statistical Trends of Protein Evolution: A Lesson from AlphaFold Database. Mol Biol Evol 2022; 39:6701686. [PMID: 36108094 PMCID: PMC9550990 DOI: 10.1093/molbev/msac197] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The recent development of artificial intelligence provides us with new and powerful tools for studying the mysterious relationship between organism evolution and protein evolution. In this work, based on the AlphaFold Protein Structure Database (AlphaFold DB), we perform comparative analyses of the proteins of different organisms. The statistics of AlphaFold-predicted structures show that, for organisms with higher complexity, their constituent proteins will have larger radii of gyration, higher coil fractions, and slower vibrations, statistically. By conducting normal mode analysis and scaling analyses, we demonstrate that higher organismal complexity correlates with lower fractal dimensions in both the structure and dynamics of the constituent proteins, suggesting that higher functional specialization is associated with higher organismal complexity. We also uncover the topology and sequence bases of these correlations. As the organismal complexity increases, the residue contact networks of the constituent proteins will be more assortative, and these proteins will have a higher degree of hydrophilic-hydrophobic segregation in the sequences. Furthermore, by comparing the statistical structural proximity across the proteomes with the phylogenetic tree of homologous proteins, we show that, statistical structural proximity across the proteomes may indirectly reflect the phylogenetic proximity, indicating a statistical trend of protein evolution in parallel with organism evolution. This study provides new insights into how the diversity in the functionality of proteins increases and how the dimensionality of the manifold of protein dynamics reduces during evolution, contributing to the understanding of the origin and evolution of lives.
Collapse
Affiliation(s)
| | - Weitong Ren
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - Jun Wang
- School of Physics, National Laboratory of Solid State Microstructure, and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, People’s Republic of China
| | | |
Collapse
|
27
|
Jayaraman V, Toledo‐Patiño S, Noda‐García L, Laurino P. Mechanisms of protein evolution. Protein Sci 2022; 31:e4362. [PMID: 35762715 PMCID: PMC9214755 DOI: 10.1002/pro.4362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 05/11/2022] [Accepted: 05/14/2022] [Indexed: 11/06/2022]
Abstract
How do proteins evolve? How do changes in sequence mediate changes in protein structure, and in turn in function? This question has multiple angles, ranging from biochemistry and biophysics to evolutionary biology. This review provides a brief integrated view of some key mechanistic aspects of protein evolution. First, we explain how protein evolution is primarily driven by randomly acquired genetic mutations and selection for function, and how these mutations can even give rise to completely new folds. Then, we also comment on how phenotypic protein variability, including promiscuity, transcriptional and translational errors, may also accelerate this process, possibly via "plasticity-first" mechanisms. Finally, we highlight open questions in the field of protein evolution, with respect to the emergence of more sophisticated protein systems such as protein complexes, pathways, and the emergence of pre-LUCA enzymes.
Collapse
Affiliation(s)
- Vijay Jayaraman
- Department of Molecular Cell BiologyWeizmann Institute of ScienceRehovotIsrael
| | - Saacnicteh Toledo‐Patiño
- Protein Engineering and Evolution UnitOkinawa Institute of Science and Technology Graduate UniversityOkinawaJapan
| | - Lianet Noda‐García
- Department of Plant Pathology and Microbiology, Institute of Environmental Sciences, Robert H. Smith Faculty of Agriculture, Food and EnvironmentHebrew University of JerusalemRehovotIsrael
| | - Paola Laurino
- Protein Engineering and Evolution UnitOkinawa Institute of Science and Technology Graduate UniversityOkinawaJapan
| |
Collapse
|
28
|
Yadahalli S, Jayanthi LP, Gosavi S. A Method for Assessing the Robustness of Protein Structures by Randomizing Packing Interactions. Front Mol Biosci 2022; 9:849272. [PMID: 35832734 PMCID: PMC9271847 DOI: 10.3389/fmolb.2022.849272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 04/27/2022] [Indexed: 12/02/2022] Open
Abstract
Many single-domain proteins are not only stable and water-soluble, but they also populate few to no intermediates during folding. This reduces interactions between partially folded proteins, misfolding, and aggregation, and makes the proteins tractable in biotechnological applications. Natural proteins fold thus, not necessarily only because their structures are well-suited for folding, but because their sequences optimize packing and fit their structures well. In contrast, folding experiments on the de novo designed Top7 suggest that it populates several intermediates. Additionally, in de novo protein design, where sequences are designed for natural and new non-natural structures, tens of sequences still need to be tested before success is achieved. Both these issues may be caused by the specific scaffolds used in design, i.e., some protein scaffolds may be more tolerant to packing perturbations and varied sequences. Here, we report a computational method for assessing the response of protein structures to packing perturbations. We then benchmark this method using designed proteins and find that it can identify scaffolds whose folding gets disrupted upon perturbing packing, leading to the population of intermediates. The method can also isolate regions of both natural and designed scaffolds that are sensitive to such perturbations and identify contacts which when present can rescue folding. Overall, this method can be used to identify protein scaffolds that are more amenable to whole protein design as well as to identify protein regions which are sensitive to perturbations and where further mutations should be avoided during protein engineering.
Collapse
|
29
|
Abstract
The hypervariable residues that compose the major part of proteins’ surfaces are generally considered outside evolutionary control. Yet, these “nonconserved” residues determine the outcome of stochastic encounters in crowded cells. It has recently become apparent that these encounters are not as random as one might imagine, but carefully orchestrated by the intracellular electrostatics to optimize protein diffusion, interactivity, and partner search. The most influential factor here is the protein surface-charge density, which takes different optimal values across organisms with different intracellular conditions. In this study, we examine how far the net-charge density and other physicochemical properties of proteomes will take us in terms of distinguishing organisms in general. The results show that these global proteome properties not only follow the established taxonomical hierarchy, but also provide clues to functional adaptation. In many cases, the proteome–property divergence is even resolved at species level. Accordingly, the variable parts of the genes are not as free to drift as they seem in sequence alignment, but present a complementary tool for functional, taxonomic, and evolutionary assignment.
Collapse
|
30
|
Rhoades R, Henry B, Prichett D, Fang Y, Teng S. Computational Saturation Mutagenesis to Investigate the Effects of Neurexin-1 Mutations on AlphaFold Structure. Genes (Basel) 2022; 13:789. [PMID: 35627176 PMCID: PMC9141173 DOI: 10.3390/genes13050789] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 04/20/2022] [Accepted: 04/23/2022] [Indexed: 02/05/2023] Open
Abstract
Neurexin-1 (NRXN1) is a membrane protein essential in synapse formation and cell signaling as a cell-adhesion molecule and cell-surface receptor. NRXN1 and its binding partner neuroligin have been associated with deficits in cognition. Recent genetics research has linked NRXN1 missense mutations to increased risk for brain disorders, including schizophrenia (SCZ) and autism spectrum disorder (ASD). Investigation of the structure-function relationship in NRXN1 has proven difficult due to a lack of the experimental full-length membrane protein structure. AlphaFold, a deep learning-based predictor, succeeds in high-quality protein structure prediction and offers a solution for membrane protein model construction. In the study, we applied a computational saturation mutagenesis method to analyze the systemic effects of missense mutations on protein functions in a human NRXN1 structure predicted from AlphaFold and an experimental Bos taurus structure. The folding energy changes were calculated to estimate the effects of the 29,540 mutations of AlphaFold model on protein stability. The comparative study on the experimental and computationally predicted structures shows that these energy changes are highly correlated, demonstrating the reliability of the AlphaFold structure for the downstream bioinformatics analysis. The energy calculation revealed that some target mutations associated with SCZ and ASD could make the protein unstable. The study can provide helpful information for characterizing the disease-causing mutations and elucidating the molecular mechanisms by which the variations cause SCZ and ASD. This methodology could provide the bioinformatics protocol to investigate the effects of target mutations on multiple AlphaFold structures.
Collapse
Affiliation(s)
- Raina Rhoades
- Department of Biology, Howard University, Washington, DC 20059, USA; (R.R.); (B.H.); (D.P.)
| | - Brianna Henry
- Department of Biology, Howard University, Washington, DC 20059, USA; (R.R.); (B.H.); (D.P.)
| | - Dominique Prichett
- Department of Biology, Howard University, Washington, DC 20059, USA; (R.R.); (B.H.); (D.P.)
| | - Yayin Fang
- Department of Biochemistry and Molecular Biology, Howard University, Washington, DC 20059, USA;
| | - Shaolei Teng
- Department of Biology, Howard University, Washington, DC 20059, USA; (R.R.); (B.H.); (D.P.)
| |
Collapse
|
31
|
Markthaler D, Fleck M, Stankiewicz B, Hansen N. Exploring the Effect of Enhanced Sampling on Protein Stability Prediction. J Chem Theory Comput 2022; 18:2569-2583. [PMID: 35298174 DOI: 10.1021/acs.jctc.1c01012] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Changes in protein stability due to side-chain mutations are evaluated by alchemical free-energy calculations based on classical molecular dynamics (MD) simulations in explicit solvent using the GROMOS force field. Three proteins of different complexity with a total number of 93 single-point mutations are analyzed, and the relative free-energy differences are discussed with respect to configurational sampling and (dis)agreement with experimental data. For the smallest protein studied, a 34-residue WW domain, the starting structure dependence of the alchemical free-energy changes, is discussed in detail. Deviations from previous simulations for the two other proteins are shown to result from insufficient sampling in the earlier studies. Hamiltonian replica exchange in combination with multiple starting structures and sufficient sampling time of more than 100 ns per intermediate alchemical state is required in some cases to reach convergence.
Collapse
Affiliation(s)
- Daniel Markthaler
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| | - Maximilian Fleck
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| | - Bartosz Stankiewicz
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| | - Niels Hansen
- Institute of Thermodynamics and Thermal Process Engineering, University of Stuttgart, Pfaffenwaldring 9, 70569 Stuttgart, Germany
| |
Collapse
|
32
|
Golla H, Kannan A, Gopi S, Murugan S, Perumalsamy LR, Naganathan AN. Structural-Energetic Basis for Coupling between Equilibrium Fluctuations and Phosphorylation in a Protein Native Ensemble. ACS CENTRAL SCIENCE 2022; 8:282-293. [PMID: 35233459 PMCID: PMC8880421 DOI: 10.1021/acscentsci.1c01548] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Indexed: 06/14/2023]
Abstract
The functioning of proteins is intimately tied to their fluctuations in the native ensemble. The structural-energetic features that determine fluctuation amplitudes and hence the shape of the underlying landscape, which in turn determine the magnitude of the functional output, are often confounded by multiple variables. Here, we employ the FF1 domain from human p190A RhoGAP protein as a model system to uncover the molecular basis for phosphorylation of a buried tyrosine, which is crucial to the transcriptional activity associated with transcription factor TFII-I. Combining spectroscopy, calorimetry, statistical-mechanical modeling, molecular simulations, and in vitro phosphorylation assays, we show that the FF1 domain samples a diverse array of conformations in its native ensemble, some of which are phosphorylation-competent. Upon eliminating unfavorable charge-charge interactions through a single charge-reversal (K53E) or charge-neutralizing (K53Q) mutation, we observe proportionately lower phosphorylation extents due to the altered structural coupling, damped equilibrium fluctuations, and a more compact native ensemble. We thus establish a conformational selection mechanism for phosphorylation in the FF1 domain with K53 acting as a "gatekeeper", modulating the solvent exposure of the buried tyrosine. Our work demonstrates the role of unfavorable charge-charge interactions in governing functional events through the modulation of native ensemble characteristics, a feature that could be prevalent in ordered protein domains.
Collapse
Affiliation(s)
- Hemashree Golla
- Department
of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - Adithi Kannan
- Department
of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - Soundhararajan Gopi
- Department
of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - Sowmiya Murugan
- Department
of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - Lakshmi R Perumalsamy
- Department
of Biomedical Sciences, Sri Ramachandra
Institute of Higher Education and Research, Chennai 600116, India
| | - Athi N. Naganathan
- Department
of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| |
Collapse
|
33
|
Palenchar PM. The Influence of Codon Usage, Protein Abundance, and Protein Stability on Protein Evolution Vary by Evolutionary Distance and the Type of Protein. Protein J 2022; 41:216-229. [PMID: 35147896 DOI: 10.1007/s10930-022-10045-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2022] [Indexed: 12/01/2022]
Abstract
In general, the evolutionary rate of proteins is not primarily related to protein and amino acid functions, and factors such as protein abundance, codon usage, and the protein's TM are more important. To better understand the factors that affect protein evolution, E. coli MG1655 orthologs were compared to those in closely related bacteria and to more distantly related prokaryotes, eukaryotes, and archaea. Also, the evolution of different types of proteins was studied. The analyses indicate that the amino acid conservation of enzymes that do not use macromolecules (e.g. DNA, RNA, and proteins) as substrates and that carry out metabolic processes involving small molecules (i.e. small molecule enzymes) is different than other enzymes. For example, the small molecule enzymes have a lower percent identity than other enzymes when sequences from closely related bacteria are compared. Analyses indicate the lower percent identity is not a result of the amino acid or codon usage of the small molecule enzymes. The small molecule enzymes also don't have a significantly lower protein abundance indicating that is also not likely an important factor driving differences in amino acid conservation. Analyses indicate different methods to measure the TM of proteins have different relationships between amino acid conservation over different evolutionary distances. In totality, the results demonstrate that the relationship between the factors thought to affect protein evolution (protein abundance, codon usage, and proteins TMs) and protein evolution are complex and depend on the factor, the organisms, and the type of proteins being analyzed.
Collapse
Affiliation(s)
- Peter M Palenchar
- Department of Chemistry, Villanova University, 800 E. Lancaster Ave, Villanova, PA, 19805, USA.
| |
Collapse
|
34
|
Strobel HM, Horwitz EK, Meyer JR. Viral protein instability enhances host-range evolvability. PLoS Genet 2022; 18:e1010030. [PMID: 35176040 PMCID: PMC8890733 DOI: 10.1371/journal.pgen.1010030] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 03/02/2022] [Accepted: 01/11/2022] [Indexed: 12/29/2022] Open
Abstract
Viruses are highly evolvable, but what traits endow this property? The high mutation rates of viruses certainly play a role, but factors that act above the genetic code, like protein thermostability, are also expected to contribute. We studied how the thermostability of a model virus, bacteriophage λ, affects its ability to evolve to use a new receptor, a key evolutionary transition that can cause host-range evolution. Using directed evolution and synthetic biology techniques we generated a library of host-recognition protein variants with altered stabilities and then tested their capacity to evolve to use a new receptor. Variants fell within three stability classes: stable, unstable, and catastrophically unstable. The most evolvable were the two unstable variants, whereas seven of eight stable variants were significantly less evolvable, and the two catastrophically unstable variants could not grow. The slowly evolving stable variants were delayed because they required an additional destabilizing mutation. These results are particularly noteworthy because they contradict a widely supported contention that thermostabilizing mutations enhance evolvability of proteins by increasing mutational robustness. Our work suggests that the relationship between thermostability and evolvability is more complex than previously thought, provides evidence for a new molecular model of host-range expansion evolution, and identifies instability as a potential predictor of viral host-range evolution.
Collapse
Affiliation(s)
- Hannah M. Strobel
- Division of Biological Sciences, University of California San Diego, La Jolla, California, United States of America
| | - Elijah K. Horwitz
- Division of Biological Sciences, University of California San Diego, La Jolla, California, United States of America
| | - Justin R. Meyer
- Division of Biological Sciences, University of California San Diego, La Jolla, California, United States of America
| |
Collapse
|
35
|
Conformational dynamics promotes disordered regions from function-dispensable to essential in evolved site-specific DNA recombinases. Comput Struct Biotechnol J 2022; 20:989-1001. [PMID: 35242289 PMCID: PMC8860914 DOI: 10.1016/j.csbj.2022.01.010] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 01/11/2022] [Accepted: 01/11/2022] [Indexed: 12/11/2022] Open
Abstract
New functional regions emerging in evolution of DNA site-specific recombinase tails. Transient structural nucleation promotes function-dispensable regions to essential. Molecular dynamics reveals conformational diversity and its functional implications. Evolved disordered molecular mechanisms of N-term tails for protein stability. Structural disorder-based link between protein evolution, stability and function.
Protein intrinsically disordered regions (IDRs) play pivotal roles in molecular recognition and regulatory processes through structural disorder-to-order transitions. To understand and exploit the distinctive functional implications of IDRs and to unravel the underlying molecular mechanisms, structural disorder-to-function relationships need to be deciphered. The DNA site-specific recombinase system Cre/loxP represents an attractive model to investigate functional molecular mechanisms of IDRs. Cre contains a functionally dispensable disordered N-terminal tail, which becomes indispensable in the evolved Tre/loxLTR recombinase system. The difficulty to experimentally obtain structural information about this tail has so far precluded any mechanistic study on its involvement in DNA recombination. Here, we use in vitro and in silico evolution data, conformational dynamics, AI-based folding simulations, thermodynamic stability calculations, mutagenesis and DNA recombination assays to investigate how evolution and the dynamic behavior of this IDR may determine distinct functional properties. Our studies suggest that partial conformational order in the N-terminal tail of Tre recombinase and its packing to a conserved hydrophobic surface on the protein provide thermodynamic stability. Based on our results, we propose a link between protein stability and function, offering new plausible atom-detailed mechanistic insights into disorder-function relationships. Our work highlights the potential of N-terminal tails to be exploited for regulation of the activity of Cre-like tyrosine-type SSRs, which merits future investigations and could be of relevance in future rational engineering for their use in biotechnology and genomic medicine.
Collapse
|
36
|
Coyote-Maestas W, Nedrud D, Suma A, He Y, Matreyek KA, Fowler DM, Carnevale V, Myers CL, Schmidt D. Probing ion channel functional architecture and domain recombination compatibility by massively parallel domain insertion profiling. Nat Commun 2021; 12:7114. [PMID: 34880224 PMCID: PMC8654947 DOI: 10.1038/s41467-021-27342-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 11/16/2021] [Indexed: 11/10/2022] Open
Abstract
Protein domains are the basic units of protein structure and function. Comparative analysis of genomes and proteomes showed that domain recombination is a main driver of multidomain protein functional diversification and some of the constraining genomic mechanisms are known. Much less is known about biophysical mechanisms that determine whether protein domains can be combined into viable protein folds. Here, we use massively parallel insertional mutagenesis to determine compatibility of over 300,000 domain recombination variants of the Inward Rectifier K+ channel Kir2.1 with channel surface expression. Our data suggest that genomic and biophysical mechanisms acted in concert to favor gain of large, structured domain at protein termini during ion channel evolution. We use machine learning to build a quantitative biophysical model of domain compatibility in Kir2.1 that allows us to derive rudimentary rules for designing domain insertion variants that fold and traffic to the cell surface. Positional Kir2.1 responses to motif insertion clusters into distinct groups that correspond to contiguous structural regions of the channel with distinct biophysical properties tuned towards providing either folding stability or gating transitions. This suggests that insertional profiling is a high-throughput method to annotate function of ion channel structural regions.
Collapse
Affiliation(s)
- Willow Coyote-Maestas
- grid.17635.360000000419368657Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455 USA
| | - David Nedrud
- grid.17635.360000000419368657Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455 USA
| | - Antonio Suma
- grid.264727.20000 0001 2248 3398Department of Chemistry, Temple University, Philadelphia, PA 19122 USA
| | - Yungui He
- grid.17635.360000000419368657Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, MN 55455 USA
| | - Kenneth A. Matreyek
- grid.67105.350000 0001 2164 3847Department of Pathology, Case Western Reserve University School of Medicine, Cleveland, OH 44106 USA
| | - Douglas M. Fowler
- grid.34477.330000000122986657Department of Genome Sciences, University of Washington, Seattle, WA 98115 USA ,grid.34477.330000000122986657Department of Bioengineering, University of Washington, Seattle, WA 98115 USA
| | - Vincenzo Carnevale
- grid.264727.20000 0001 2248 3398Department of Chemistry, Temple University, Philadelphia, PA 19122 USA
| | - Chad L. Myers
- grid.17635.360000000419368657Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455 USA
| | - Daniel Schmidt
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, MN, 55455, USA.
| |
Collapse
|
37
|
Blondel L, Besse S, Rivard EL, Ylla G, Extavour CG. Evolution of a cytoplasmic determinant: evidence for the biochemical basis of functional evolution of the novel germ line regulator oskar. Mol Biol Evol 2021; 38:5491-5513. [PMID: 34550378 PMCID: PMC8662646 DOI: 10.1093/molbev/msab284] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Germ line specification is essential in sexually reproducing organisms. Despite their critical role, the evolutionary history of the genes that specify animal germ cells is heterogeneous and dynamic. In many insects, the gene oskar is required for the specification of the germ line. However, the germ line role of oskar is thought to be a derived role resulting from co-option from an ancestral somatic role. To address how evolutionary changes in protein sequence could have led to changes in the function of Oskar protein that enabled it to regulate germ line specification, we searched for oskar orthologs in 1,565 publicly available insect genomic and transcriptomic data sets. The earliest-diverging lineage in which we identified an oskar ortholog was the order Zygentoma (silverfish and firebrats), suggesting that oskar originated before the origin of winged insects. We noted some order-specific trends in oskar sequence evolution, including whole gene duplications, clade-specific losses, and rapid divergence. An alignment of all known 379 Oskar sequences revealed new highly conserved residues as candidates that promote dimerization of the LOTUS domain. Moreover, we identified regions of the OSK domain with conserved predicted RNA binding potential. Furthermore, we show that despite a low overall amino acid conservation, the LOTUS domain shows higher conservation of predicted secondary structure than the OSK domain. Finally, we suggest new key amino acids in the LOTUS domain that may be involved in the previously reported Oskar−Vasa physical interaction that is required for its germ line role.
Collapse
Affiliation(s)
- Leo Blondel
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Savandara Besse
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Emily L Rivard
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA
| | - Guillem Ylla
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Cassandra G Extavour
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, USA.,Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| |
Collapse
|
38
|
Ferruz N, Michel F, Lobos F, Schmidt S, Höcker B. Fuzzle 2.0: Ligand Binding in Natural Protein Building Blocks. Front Mol Biosci 2021; 8:715972. [PMID: 34485385 PMCID: PMC8416435 DOI: 10.3389/fmolb.2021.715972] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/06/2021] [Indexed: 11/13/2022] Open
Abstract
Modern proteins have been shown to share evolutionary relationships via subdomain-sized fragments. The assembly of such fragments through duplication and recombination events led to the complex structures and functions we observe today. We previously implemented a pipeline that identified more than 1,000 of these fragments that are shared by different protein folds and developed a web interface to analyze and search for them. This resource named Fuzzle helps structural and evolutionary biologists to identify and analyze conserved parts of a protein but it also provides protein engineers with building blocks for example to design proteins by fragment combination. Here, we describe a new version of this web resource that was extended to include ligand information. This addition is a significant asset to the database since now protein fragments that bind specific ligands can be identified and analyzed. Often the mode of ligand binding is conserved in proteins thereby supporting a common evolutionary origin. The same can now be explored for subdomain-sized fragments within this database. This ligand binding information can also be used in protein engineering to graft binding pockets into other protein scaffolds or to transfer functional sites via recombination of a specific fragment. Fuzzle 2.0 is freely available at https://fuzzle.uni-bayreuth.de/2.0.
Collapse
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Francisco Lobos
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Steffen Schmidt
- Computational Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
Collapse
|
39
|
Abstract
Correlated motions in proteins arising from the collective movements of residues have long been proposed to be fundamentally important to key properties of proteins, from allostery and catalysis to evolvability. Recent breakthroughs in structural biology have made it possible to capture proteins undergoing complex conformational changes, yet intrinsic correlated motions within a conformation remain one of the least understood facets of protein structure. For many decades, the analysis of total X-ray scattering held the promise of animating crystal structures with correlated motions. With recent advances in both X-ray detectors and data interpretation methods, this long-held promise can now be met. In this Perspective, we will introduce how correlated motions are captured in total scattering and provide guidelines for the collection, interpretation, and validation of data. As structural biology continues to push the boundaries, we see an opportunity to gain atomistic insight into correlated motions using total scattering as a bridge between theory and experiment.
Collapse
Affiliation(s)
- Da Xu
- Department of Chemistry and Chemical Biology, Cornell University, 259 East Avenue, Ithaca, New York 14853, United States
| | - Steve P Meisburger
- Department of Chemistry and Chemical Biology, Cornell University, 259 East Avenue, Ithaca, New York 14853, United States
| | - Nozomi Ando
- Department of Chemistry and Chemical Biology, Cornell University, 259 East Avenue, Ithaca, New York 14853, United States
| |
Collapse
|
40
|
Evolutionary Tracking of SARS-CoV-2 Genetic Variants Highlights an Intricate Balance of Stabilizing and Destabilizing Mutations. mBio 2021; 12:e0118821. [PMID: 34281387 PMCID: PMC8406184 DOI: 10.1128/mbio.01188-21] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
The currently ongoing COVID-19 pandemic caused by SARS-CoV-2 has accounted for millions of infections and deaths across the globe. Genome sequences of SARS-CoV-2 are being published daily in public databases and the availability of these genome data sets has allowed unprecedented access to the mutational patterns of SARS-CoV-2 evolution. We made use of the same genomic information for conducting phylogenetic analysis and identifying lineage-specific mutations. The catalogued lineage-defining mutations were analyzed for their stabilizing or destabilizing impact on viral proteins. We recorded persistence of D614G, S477N, A222V, and V1176F variants and a global expansion of the PANGOLIN variant B.1. In addition, a retention of Q57H (B.1.X), R203K/G204R (B.1.1.X), T85I (B.1.2-B.1.3), G15S+T428I (C.X), and I120F (D.X) variations was observed. Overall, we recorded a striking balance between stabilizing and destabilizing mutations, therefore leading to well-maintained protein structures. With selection pressures in the form of newly developed vaccines and therapeutics to mount in the coming months, the task of mapping viral mutations and recording their impact on key viral proteins should be crucial to preemptively catch any escape mechanism for which SARS-CoV-2 may evolve.
Collapse
|
41
|
Norn C, André I, Theobald DL. A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices. Protein Sci 2021; 30:2057-2068. [PMID: 34218472 PMCID: PMC8442976 DOI: 10.1002/pro.4155] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 06/25/2021] [Accepted: 06/29/2021] [Indexed: 12/30/2022]
Abstract
Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life.
Collapse
Affiliation(s)
- Christoffer Norn
- Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Ingemar André
- Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Douglas L Theobald
- Biochemistry Department, Brandeis University, Waltham, Massachusetts, USA
| |
Collapse
|
42
|
Rao SJA, Shetty NP. Evolutionary selectivity of amino acid is inspired from the enhanced structural stability and flexibility of the folded protein. Life Sci 2021; 281:119774. [PMID: 34197884 DOI: 10.1016/j.lfs.2021.119774] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 06/16/2021] [Accepted: 06/18/2021] [Indexed: 12/18/2022]
Abstract
AIM The present study attempts to decipher the site-specific amino acid alterations at certain positions experiencing preferential selectivity and their effect on proteins' stability and flexibility. The study examines the selection preferences by considering pair-wise non-bonded interaction energies of adjacent and interacting amino acids present at the interacting site, along with their evolutionary history. MATERIALS AND METHODS For the study, variations in the interacting residues of spike protein (S-Protein) receptor-binding domain (RBD) of different coronaviruses were examined. The MD simulation trajectory analysis revealed that, though all the variants studied were structurally stable at their native and bound confirmations, the RBD of 2019-nCoV/SARS-CoV-2 was found to be more flexible and more dynamic. Furthermore, a noticeable change observed in the non-bonded interaction energies of the amino acids interacting with the receptor corroborated their selection at respective positions. KEY FINDINGS The conformational changes exerted by the altered amino acids could be the reason for a broader range of interacting receptors among the selected proteins. SIGNIFICANCE The results envisage a strong indication that the residue selection at certain positions is governed by a well-orchestrated feedback mechanism, which follows increased stability and flexibility in the folded structure compared to its evolutionary predecessor.
Collapse
Affiliation(s)
- S J Aditya Rao
- Plant Cell Biotechnology Department, CSIR-Central Food Technological Research Institute, Mysore, Karnataka, India.
| | - Nandini P Shetty
- Plant Cell Biotechnology Department, CSIR-Central Food Technological Research Institute, Mysore, Karnataka, India
| |
Collapse
|
43
|
Romero-Romero S, Kordes S, Michel F, Höcker B. Evolution, folding, and design of TIM barrels and related proteins. Curr Opin Struct Biol 2021; 68:94-104. [PMID: 33453500 PMCID: PMC8250049 DOI: 10.1016/j.sbi.2020.12.007] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/13/2020] [Accepted: 12/14/2020] [Indexed: 12/16/2022]
Abstract
Proteins are chief actors in life that perform a myriad of exquisite functions. This diversity has been enabled through the evolution and diversification of protein folds. Analysis of sequences and structures strongly suggest that numerous protein pieces have been reused as building blocks and propagated to many modern folds. This information can be traced to understand how the protein world has diversified. In this review, we discuss the latest advances in the analysis of protein evolutionary units, and we use as a model system one of the most abundant and versatile topologies, the TIM-barrel fold, to highlight the existing common principles that interconnect protein evolution, structure, folding, function, and design.
Collapse
Affiliation(s)
| | - Sina Kordes
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
44
|
Dubreuil B, Levy ED. Abundance Imparts Evolutionary Constraints of Similar Magnitude on the Buried, Surface, and Disordered Regions of Proteins. Front Mol Biosci 2021; 8:626729. [PMID: 33996892 PMCID: PMC8119896 DOI: 10.3389/fmolb.2021.626729] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 03/29/2021] [Indexed: 12/02/2022] Open
Abstract
An understanding of the forces shaping protein conservation is key, both for the fundamental knowledge it represents and to allow for optimal use of evolutionary information in practical applications. Sequence conservation is typically examined at one of two levels. The first is a residue-level, where intra-protein differences are analyzed and the second is a protein-level, where inter-protein differences are studied. At a residue level, we know that solvent-accessibility is a prime determinant of conservation. By inverting this logic, we inferred that disordered regions are slightly more solvent-accessible on average than the most exposed surface residues in domains. By integrating abundance information with evolutionary data within and across proteins, we confirmed a previously reported strong surface-core association in the evolution of structured regions, but we found a comparatively weak association between disordered and structured regions. The facts that disordered and structured regions experience different structural constraints and evolve independently provide a unique setup to examine an outstanding question: why is a protein’s abundance the main determinant of its sequence conservation? Indeed, any structural or biophysical property linked to the abundance-conservation relationship should increase the relative conservation of regions concerned with that property (e.g., disordered residues with mis-interactions, domain residues with misfolding). Surprisingly, however, we found the conservation of disordered and structured regions to increase in equal proportion with abundance. This observation implies that either abundance-related constraints are structure-independent, or multiple constraints apply to different regions and perfectly balance each other.
Collapse
Affiliation(s)
- Benjamin Dubreuil
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
45
|
On the Emergence of Orientational Order in Folded Proteins with Implications for Allostery. Symmetry (Basel) 2021. [DOI: 10.3390/sym13050770] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
The beautiful structures of single- and multi-domain proteins are clearly ordered in some fashion but cannot be readily classified using group theory methods that are successfully used to describe periodic crystals. For this reason, protein structures are considered to be aperiodic, and may have evolved this way for functional purposes, especially in instances that require a combination of softness and rigidity within the same molecule. By analyzing the solved protein structures, we show that orientational symmetry is broken in the aperiodic arrangement of the secondary structure elements (SSEs), which we deduce by calculating the nematic order parameter, P2. We find that the folded structures are nematic droplets with a broad distribution of P2. We argue that a non-zero value of P2, leads to an arrangement of the SSEs that can resist external forces, which is a requirement for allosteric proteins. Such proteins, which resist mechanical forces in some regions while being flexible in others, transmit signals from one region of the protein to another (action at a distance) in response to binding of ligands (oxygen, ATP, or other small molecules).
Collapse
|
46
|
Jalal ASB, Tran NT, Stevenson CE, Chan EW, Lo R, Tan X, Noy A, Lawson DM, Le TBK. Diversification of DNA-Binding Specificity by Permissive and Specificity-Switching Mutations in the ParB/Noc Protein Family. Cell Rep 2021; 32:107928. [PMID: 32698006 PMCID: PMC7383237 DOI: 10.1016/j.celrep.2020.107928] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 03/25/2020] [Accepted: 06/26/2020] [Indexed: 12/17/2022] Open
Abstract
Specific interactions between proteins and DNA are essential to many biological processes. Yet, it remains unclear how the diversification in DNA-binding specificity was brought about, and the mutational paths that led to changes in specificity are unknown. Using a pair of evolutionarily related DNA-binding proteins, each with a different DNA preference (ParB [Partitioning Protein B] and Noc [Nucleoid Occlusion Factor], which both play roles in bacterial chromosome maintenance), we show that specificity is encoded by a set of four residues at the protein-DNA interface. Combining X-ray crystallography and deep mutational scanning of the interface, we suggest that permissive mutations must be introduced before specificity-switching mutations to reprogram specificity and that mutational paths to new specificity do not necessarily involve dual-specificity intermediates. Overall, our results provide insight into the possible evolutionary history of ParB and Noc and, in a broader context, might be useful for understanding the evolution of other classes of DNA-binding proteins. DNA-binding specificity for parS and NBS is conserved within ParB and Noc family Specificity is encoded by a set of four residues at the protein-DNA interface Mutations must be introduced in a defined order to reprogram specificity
Collapse
Affiliation(s)
- Adam S B Jalal
- Department of Molecular Microbiology, John Innes Centre, Norwich NR4 7UH, UK
| | - Ngat T Tran
- Department of Molecular Microbiology, John Innes Centre, Norwich NR4 7UH, UK
| | - Clare E Stevenson
- Department of Biological Chemistry, John Innes Centre, Norwich NR4 7UH, UK
| | - Elliot W Chan
- Department of Physics, Biological Physical Sciences Institute, University of York, York YO10, UK
| | - Rebecca Lo
- Department of Molecular Microbiology, John Innes Centre, Norwich NR4 7UH, UK
| | - Xiao Tan
- Department of Molecular Microbiology, John Innes Centre, Norwich NR4 7UH, UK
| | - Agnes Noy
- Department of Physics, Biological Physical Sciences Institute, University of York, York YO10, UK
| | - David M Lawson
- Department of Biological Chemistry, John Innes Centre, Norwich NR4 7UH, UK
| | - Tung B K Le
- Department of Molecular Microbiology, John Innes Centre, Norwich NR4 7UH, UK.
| |
Collapse
|
47
|
Madhurima K, Nandi B, Sekhar A. Metamorphic proteins: the Janus proteins of structural biology. Open Biol 2021; 11:210012. [PMID: 33878950 PMCID: PMC8059507 DOI: 10.1098/rsob.210012] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The structural paradigm that the sequence of a protein encodes for a unique three-dimensional native fold does not acknowledge the intrinsic plasticity encapsulated in conformational free energy landscapes. Metamorphic proteins are a recently discovered class of biomolecules that illustrate this plasticity by folding into at least two distinct native state structures of comparable stability in the absence of ligands or cofactors to facilitate fold-switching. The expanding list of metamorphic proteins clearly shows that these proteins are not mere aberrations in protein evolution, but may have actually been a consequence of distinctive patterns in selection pressure such as those found in virus–host co-evolution. In this review, we describe the structure–function relationships observed in well-studied metamorphic protein systems, with specific focus on how functional residues are sequestered or exposed in the two folds of the protein. We also discuss the implications of metamorphosis for protein evolution and the efforts that are underway to predict metamorphic systems from sequence properties alone.
Collapse
Affiliation(s)
- Kulkarni Madhurima
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India
| | - Bodhisatwa Nandi
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India
| | - Ashok Sekhar
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India
| |
Collapse
|
48
|
Downard KM. Protein phylogenetics with mass spectrometry. A comparison of methods. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2021; 13:1442-1454. [PMID: 33725067 DOI: 10.1039/d1ay00153a] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Advances in protein mass spectrometry have provided the ability to identify and sequence proteins with unprecedented speed, sensitivity and accuracy. These benefits now offer advantages for studies of protein evolution and phylogeny avoiding the need to generate and align DNA sequences which can prove time consuming, costly and difficult in the case of large genomes and for highly diverse organisms. The methods of phylogenetic analysis using protein mass spectrometry can be classified into three categories: (1) de novo protein sequencing followed by multiple sequence alignment for classical phylogenetic reconstruction, (2) direct phylogenetic reconstruction using expressed protein mass profiles exploited in microbial biotyping applications, and (3) the construction of trees using proteolytic peptide mass map or fingerprint data. This review describes the three approaches together with the relevant tools and algorithms required to implement them. It also compares each of these alternative protein based methods alongside conventional gene sequence based phylogenetics.
Collapse
Affiliation(s)
- Kevin M Downard
- Infectious Disease Responses Laboratory, Prince of Wales Clinical Research Sciences, Sydney, NSW 2031, Australia.
| |
Collapse
|
49
|
Structure and function of naturally evolved de novo proteins. Curr Opin Struct Biol 2021; 68:175-183. [PMID: 33567396 DOI: 10.1016/j.sbi.2020.11.010] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 11/16/2020] [Accepted: 11/27/2020] [Indexed: 01/05/2023]
Abstract
Comparative evolutionary genomics has revealed that novel protein coding genes can emerge randomly from non-coding DNA. While most of the myriad of transcripts which continuously emerge vanish rapidly, some attain regulatory regions, become translated and survive. More surprisingly, sequence properties of de novo proteins are almost indistinguishable from randomly obtained sequences, yet de novo proteins may gain functions and integrate into eukaryotic cellular networks quite easily. We here discuss current knowledge on de novo proteins, their structures, functions and evolution. Since the existence of de novo proteins seems at odds with decade-long attempts to construct proteins with novel structures and functions from scratch, we suggest that a better understanding of de novo protein evolution may fuel new strategies for protein design.
Collapse
|
50
|
Berger D, Stångberg J, Baur J, Walters RJ. Elevated temperature increases genome-wide selection on de novo mutations. Proc Biol Sci 2021; 288:20203094. [PMID: 33529558 DOI: 10.1098/rspb.2020.3094] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Adaptation in new environments depends on the amount of genetic variation available for evolution, and the efficacy by which natural selection discriminates among this variation. However, whether some ecological factors reveal more genetic variation, or impose stronger selection pressures than others, is typically not known. Here, we apply the enzyme kinetic theory to show that rising global temperatures are predicted to intensify natural selection throughout the genome by increasing the effects of DNA sequence variation on protein stability. We test this prediction by (i) estimating temperature-dependent fitness effects of induced mutations in seed beetles adapted to ancestral or elevated temperature, and (ii) calculate 100 paired selection estimates on mutations in benign versus stressful environments from unicellular and multicellular organisms. Environmental stress per se did not increase mean selection on de novo mutation, suggesting that the cost of adaptation does not generally increase in new ecological settings to which the organism is maladapted. However, elevated temperature increased the mean strength of selection on genome-wide polymorphism, signified by increases in both mutation load and mutational variance in fitness. These results have important implications for genetic diversity gradients and the rate and repeatability of evolution under climate change.
Collapse
Affiliation(s)
- David Berger
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Josefine Stångberg
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Julian Baur
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Richard J Walters
- Centre for Environmental and Climate Research, Lund University, Sölvegatan 37, 223 62 Lund, Sweden
| |
Collapse
|