1
Das D, Ainavarapu SRK. Protein engineering using circular permutation - structure, function, stability, and applications. FEBS J 2024; 291:3581-3596. [PMID: 38676939 DOI: 10.1111/febs.17146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/12/2023] [Revised: 03/13/2024] [Accepted: 04/12/2024] [Indexed: 04/29/2024]
Abstract
Protein engineering is important for creating novel variants from natural proteins, enabling a wide range of applications. Approaches such as rational design and directed evolution are routinely used to make new protein variants. Computational tools like de novo design can introduce new protein folds. Expanding the amino acid repertoire to include unnatural amino acids with non-canonical side chains in vitro by native chemical ligation and in vivo via codon expansion methods broadens sequence and structural possibilities. Circular permutation (CP) is an invaluable approach to redesigning a protein by rearranging the amino acid sequence, where the connectivity of the secondary structural elements is altered without changing the overall structure of the protein. Artificial CP proteins (CPs) are employed in various applications such as biocatalysis, sensing of small molecules by fluorescence, genome editing, ligand-binding protein switches, and optogenetic engineering. Many studies have shown that CP can lead to either reduced or enhanced stability or catalytic efficiency. The effects of CP on a protein's energy landscape cannot be predicted a priori. Thus, it is important to understand how CP can affect the thermodynamic and kinetic stability of a protein. In this review, we discuss the discovery and advancement of techniques to create protein CP, and existing reviews on CP. We delve into the plethora of biological applications for designed CP proteins. We subsequently discuss the experimental and computational reports on the effects of CP on the thermodynamic and kinetic stabilities of proteins of various topologies. An understanding of the various aspects of CP will allow the reader to design robust CP proteins for their specific purposes.
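The sequence rearrangement described in this abstract can be illustrated with a minimal sketch. The function name `circular_permute`, the 10-residue parent sequence, and the `GGS` linker are all hypothetical choices made for illustration; none of them comes from the review.

```python
def circular_permute(sequence: str, new_start: int, linker: str = "") -> str:
    """Return the circular permutant that begins at residue `new_start` (1-based).

    The original C- and N-termini are joined, optionally through `linker`,
    and the chain is reopened just before residue `new_start`.
    """
    if not 1 <= new_start <= len(sequence):
        raise ValueError("new_start must be between 1 and len(sequence)")
    cut = new_start - 1
    # Residues from the new N-terminus to the old C-terminus, then the linker,
    # then the residues that used to precede the cut site.
    return sequence[cut:] + linker + sequence[:cut]


parent = "MKTAYIAKQR"  # made-up sequence, for illustration only
permutant = circular_permute(parent, new_start=4, linker="GGS")
print(permutant)  # AYIAKQRGGSMKT
```

The amino acid composition is unchanged apart from the optional linker; only the order in which the segments appear in the chain differs.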
Affiliation(s)
- Debanjana Das
  - Department of Chemical Sciences, Tata Institute of Fundamental Research, Mumbai, India
2
Chaisupa P, Wright RC. State-of-the-art in engineering small molecule biosensors and their applications in metabolic engineering. SLAS Technol 2024; 29:100113. [PMID: 37918525 PMCID: PMC11314541 DOI: 10.1016/j.slast.2023.10.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2023] [Revised: 10/18/2023] [Accepted: 10/25/2023] [Indexed: 11/04/2023]
Abstract
Genetically encoded biosensors are crucial for enhancing our understanding of how molecules regulate biological systems. Small molecule biosensors, in particular, help us understand the interaction between chemicals and biological processes. They also accelerate metabolic engineering by increasing screening throughput and eliminating the need for sample preparation through traditional chemical analysis. Additionally, they offer significantly higher spatial and temporal resolution in cellular analyte measurements. In this review, we discuss recent progress in in vivo biosensors and control systems (biosensor-based controllers) for metabolic engineering. We also specifically explore protein-based biosensors that utilize less commonly exploited signaling mechanisms, such as protein stability and induced degradation, compared to the more prevalent transcription factor and allosteric regulation mechanisms. We propose that these lesser-used mechanisms will be significant for engineering eukaryotic systems and slower-growing prokaryotic systems where protein turnover may facilitate more rapid and reliable measurement and regulation of the current cellular state. Lastly, we emphasize the utilization of cutting-edge and state-of-the-art techniques in the development of protein-based biosensors, achieved through rational design, directed evolution, and collaborative approaches.
Affiliation(s)
- Patarasuda Chaisupa
  - Department of Biological Systems Engineering, Virginia Tech, Blacksburg, VA 24061, United States
- R Clay Wright
  - Department of Biological Systems Engineering, Virginia Tech, Blacksburg, VA 24061, United States; Translational Plant Sciences Center (TPSC), Virginia Tech, Blacksburg, VA 24061, United States.
3
Coleman T, Shin J, Silberg JJ, Shamoo Y, Atkinson JT. The Biochemical Impact of Extracting an Embedded Adenylate Kinase Domain Using Circular Permutation. Biochemistry 2024; 63:599-609. [PMID: 38357768 DOI: 10.1021/acs.biochem.3c00605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/16/2024]
Abstract
Adenylate kinases (AKs) have evolved AMP-binding and lid domains that are encoded as continuous polypeptides embedded at different locations within the discontinuous polypeptide encoding the core domain. A prior study showed that AK homologues of different stabilities consistently retain cellular activity following circular permutation that splits a region with high energetic frustration within the AMP-binding domain into discontinuous fragments. Herein, we show that mesophilic and thermophilic AKs having this topological restructuring retain activity and substrate-binding characteristics of the parental AK. While permutation decreased the activity of both AK homologues at physiological temperatures, the catalytic activity of the thermophilic AK increased upon permutation when assayed >30 °C below the melting temperature of the native AK. The thermostabilities of the permuted AKs were uniformly lower than those of native AKs, and they exhibited multiphasic unfolding transitions, unlike the native AKs, which presented cooperative thermal unfolding. In addition, proteolytic digestion revealed that permutation destabilized each AK in differing manners, and mass spectrometry suggested that the new termini within the AMP-binding domain were responsible for the increased proteolysis sensitivity. These findings illustrate how changes in contact order can be used to tune enzyme activity and alter folding dynamics in multidomain enzymes.
Affiliation(s)
- Tom Coleman
  - Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
- John Shin
  - Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
- Jonathan J Silberg
  - Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
  - Department of Chemical and Biomolecular Engineering, Rice University, MS-362, 6100 Main Street, Houston, Texas 77005, United States
  - Department of Bioengineering, Rice University, MS-142, 6100 Main Street, Houston, Texas 77005, United States
- Yousif Shamoo
  - Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
- Joshua T Atkinson
  - Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
  - Department of Physics and Astronomy, University of Southern California, Los Angeles, California 90007, United States
  - Department of Civil and Environmental Engineering, Princeton University, Princeton, New Jersey 08544, United States
  - Omenn-Darling Bioengineering Institute, Princeton University, Princeton, New Jersey 08544, United States
4
Kolesnik VV, Nurtdinov RF, Oloruntimehin ES, Karabelsky AV, Malogolovkin AS. Optimization strategies and advances in the research and development of AAV-based gene therapy to deliver large transgenes. Clin Transl Med 2024; 14:e1607. [PMID: 38488469 PMCID: PMC10941601 DOI: 10.1002/ctm2.1607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/30/2023] [Revised: 02/07/2024] [Accepted: 02/15/2024] [Indexed: 03/18/2024] Open
Abstract
Adeno-associated virus (AAV)-based therapies are recognized as one of the most potent next-generation treatments for inherited and genetic diseases. However, several biological and technological aspects of AAV vectors remain a critical issue for their widespread clinical application. Among them, the limited capacity of the AAV genome significantly hinders the development of AAV-based gene therapy. In this context, genetically modified transgenes compatible with AAV are opening up new opportunities for unlimited gene therapies for many genetic disorders. Recent advances in de novo protein design and remodelling are paving the way for new, more efficient and targeted gene therapeutics. Using computational and genetic tools, AAV expression cassette and transgenic DNA can be split, miniaturized, shuffled or created from scratch to mediate efficient gene transfer into targeted cells. In this review, we highlight recent advances in AAV-based gene therapy with a focus on its use in translational research. We summarize recent research and development in gene therapy, with an emphasis on large transgenes (>4.8 kb) and optimizing strategies applied by biomedical companies in the research pipeline. We critically discuss the prospects for AAV-based treatment and some emerging challenges. We anticipate that the continued development of novel computational tools will lead to rapid advances in basic gene therapy research and translational studies.
Affiliation(s)
- Valeria V. Kolesnik
  - Martsinovsky Institute of Medical Parasitology, Tropical and Vector-Borne Diseases, Sechenov University, Moscow, Russia
- Ruslan F. Nurtdinov
  - Martsinovsky Institute of Medical Parasitology, Tropical and Vector-Borne Diseases, Sechenov University, Moscow, Russia
- Ezekiel Sola Oloruntimehin
  - Martsinovsky Institute of Medical Parasitology, Tropical and Vector-Borne Diseases, Sechenov University, Moscow, Russia
- Alexander S. Malogolovkin
  - Martsinovsky Institute of Medical Parasitology, Tropical and Vector-Borne Diseases, Sechenov University, Moscow, Russia
  - Center for Translational Medicine, Sirius University of Science and Technology, Sochi, Russia
5
Das D, Ainavarapu SRK. Circular permutation at azurin's active site slows down its folding. J Biol Inorg Chem 2023; 28:737-749. [PMID: 37957357 DOI: 10.1007/s00775-023-02023-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/08/2023] [Accepted: 09/26/2023] [Indexed: 11/15/2023]
Abstract
Circular permutation (CP) is a technique by which the primary sequence of a protein is rearranged to create new termini. The connectivity of the protein is altered but the overall protein structure generally remains unperturbed. Understanding the effect of CP can help design robust proteins for numerous applications such as in genetic engineering, optoelectronics, and improving catalytic activity. Studies on different protein topologies showed that CP usually affects protein stability as well as unfolding rates. Though a significant number of proteins contain metals or other cofactors, reports of metalloprotein CPs are rare. Thus, we chose a bacterial metalloprotein, azurin, and its CP within the metal-binding site (cpF114). We studied the stabilities, folding, and unfolding rates of apo- and Zn2+-bound CP azurin using fluorescence and circular dichroism. The introduced CP had destabilizing effects on the protein. Also, the folding of the Zn2+-CP protein was much slower than that of the Zn2+-WT or apo-protein. We compared this study to our previously reported azurin-cpN42, where we had observed an equilibrium and kinetic intermediate. cpF114 exhibits an apparent two-state equilibrium unfolding but has an off-pathway kinetic intermediate. Our study hinted at CP as a method to modify the energy landscape of proteins to alter their folding pathways. WT azurin, being a faster folder, may have evolved to optimize the folding rate of metal-bound protein compared to its CPs, albeit all of them have the same structure and function. Our study underscores that protein sequence and protein termini positions are crucial for metalloproteins. TOC Figure. (Top) Zn2+-azurin WT structure (PDB code: 1E67) and 2-D topology diagram of Zn2+-cpF114 azurin. (Bottom) Cartoon diagram representing folding (red arrows) and unfolding (blue arrows) of apo- and Zn2+- WT and cpF114 azurins. The width of the arrows represents the rate of the corresponding processes.
Affiliation(s)
- Debanjana Das
  - Department of Chemical Sciences, Tata Institute of Fundamental Research, Dr. Homi Bhabha Road, Colaba, Mumbai, 400005, India
- Sri Rama Koti Ainavarapu
  - Department of Chemical Sciences, Tata Institute of Fundamental Research, Dr. Homi Bhabha Road, Colaba, Mumbai, 400005, India.
6
Fang J, Li T, Lee J, Im D, Xu L, Liu Y, Seo J, Zhang WB. A single-domain protein catenane of dihydrofolate reductase. Natl Sci Rev 2023; 10:nwad304. [PMID: 38188024 PMCID: PMC10769465 DOI: 10.1093/nsr/nwad304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/09/2023] [Revised: 08/07/2023] [Accepted: 08/07/2023] [Indexed: 01/09/2024] Open
Abstract
A single-domain protein catenane refers to two mechanically interlocked polypeptide rings that fold synergistically into a compact and integrated structure, which is extremely rare in nature. Here, we report a single-domain protein catenane of dihydrofolate reductase (cat-DHFR). This design was achieved by rewiring the connectivity between secondary motifs to introduce artificial entanglement and synthesis was readily accomplished through a series of programmed and streamlined post-translational processing events in cells without any additional in vitro reactions. The target molecule contained few exogenous motifs and was thoroughly characterized using a combination of ultra-performance liquid chromatography-mass spectrometry, sodium dodecyl sulfate-polyacrylamide gel electrophoresis, protease cleavage experiments and ion mobility spectrometry-mass spectrometry. Compared with the linear control, cat-DHFR retained its catalytic capability and exhibited enhanced stability against thermal or chemical denaturation due to conformational restriction. These results suggest that linear proteins may be converted into their concatenated single-domain counterparts with almost identical chemical compositions, well-preserved functions and elevated stabilities, representing an entirely new horizon in protein science.
Affiliation(s)
- Jing Fang
  - Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Tianzuo Li
  - Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Jiyeon Lee
  - Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea
- Dahye Im
  - Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea
- Lianjie Xu
  - Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Yajie Liu
  - Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Jongcheol Seo
  - Department of Chemistry, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea
- Wen-Bin Zhang
  - Beijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
  - Beijing Academy of Artificial Intelligence, Beijing 100084, China
7
Puri S, Liu CY, Hu IC, Lai CH, Hsu STD, Lyu PC. Elucidation of the folding pathway of a circular permutant of topologically knotted YbeA by tryptophan substitutions. Biochem Biophys Res Commun 2023; 672:81-88. [PMID: 37343318 DOI: 10.1016/j.bbrc.2023.06.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/29/2023] [Revised: 06/05/2023] [Accepted: 06/06/2023] [Indexed: 06/23/2023]
Abstract
CP74 is an engineered circular permutant of a deep trefoil knotted SpoU-TrmD (SPOUT) RNA methyl transferase protein YbeA from E. coli. We have previously established that the circular permutation unties the knotted topology of YbeA and CP74 forms a domain-swapped dimer with a large dimeric interface of ca. 4600 Å². To understand the impact of domain-swapping and the newly formed hinge region joining the two folded domains on the folding and stability of CP74, the five equally spaced tryptophan residues were individually substituted with phenylalanine, and the resulting conformational and stability changes were monitored by a battery of biophysical tools. Far-UV circular dichroism, intrinsic fluorescence, and small-angle X-ray scattering indicated minimal global conformational perturbations to the native structures in the tryptophan variants. The structures of the tryptophan variants also showed the conservation of the domain-swapped ternary structure with the exception that the W72F exhibited significant asymmetry in the α-helix 5. Comparative global thermal and chemical stability analyses indicated the pivotal role of W100 in the folding of CP74 followed by W19 and W72. Solution-state NMR spectroscopy and hydrogen-deuterium exchange mass spectrometry further revealed the accumulation of a native-like intermediate state in which the hinge region made important contributions to maintain the domain-swapped ternary structure of CP74.
Affiliation(s)
- Sarita Puri
  - Institute of Biological Chemistry, Academia Sinica, Taipei, 11529, Taiwan
- Cheng-Yu Liu
  - Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 30013, Taiwan
- I-Chen Hu
  - Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 30013, Taiwan
- Chih-Hsuan Lai
  - Institute of Biological Chemistry, Academia Sinica, Taipei, 11529, Taiwan; Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 30013, Taiwan
- Shang-Te Danny Hsu
  - Institute of Biological Chemistry, Academia Sinica, Taipei, 11529, Taiwan; Institute of Biochemical Sciences, National Taiwan University, Taipei, 10617, Taiwan; International Institute for Sustainability with Knotted Chiral Meta Matter, Hiroshima University, Higashihiroshima, 739-8527, Japan.
- Ping-Chiang Lyu
  - Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 30013, Taiwan; Department of Medical Science, National Tsing Hua University, Hsinchu, 30013, Taiwan.
8
Li H, Schneider T, Tan Y, Zhang D. Ribonuclease T2 represents a distinct circularly permutated version of the BECR RNases. Protein Sci 2023; 32:e4531. [PMID: 36477982 PMCID: PMC9793965 DOI: 10.1002/pro.4531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/08/2022] [Revised: 11/07/2022] [Accepted: 11/30/2022] [Indexed: 12/13/2022]
Abstract
Detection of homologous relationships among proteins and understanding their mechanisms of diversification are major topics in the fields of protein science, bioinformatics, and phylogenetics. Recent developments in sequence/profile-based and structural similarity-based methods have greatly facilitated the unification and classification of many protein families into superfamilies or folds, yet many proteins remain unclassified in current protein databases. As one of the three earliest identified RNases in biology, ribonuclease T2, also known as RNase I in Escherichia coli, RNase Rh in fungi, or S-RNase in plants, is thought to be an ancient RNase family due to its widespread distribution and distinct structure. In this study, we present evidence that RNase T2 represents a circularly permutated version of the BECR (Barnase-EndoU-Colicin E5/D-RelE) fold RNases. This subtle relationship cannot be detected by traditional methods such as sequence/profile-based comparisons, structure-similarity searches, and circular permutation detections. However, we were able to identify the structural similarity using rational reconstruction of a theoretical RNase T2 ancestor via a reverse circular permutation process, followed by structural modeling using AlphaFold2, and structural comparisons. This relationship is further supported by the fact that RNase T2 and other typical BECR RNases, namely Colicin D, RNase A, and BrnT, share similar catalytic site configurations, all involving an analogous set of conserved residues on the α0 helix and the β4 strand of the BECR fold. This study revealed a hidden root of RNase T2 in bacterial toxin systems and demonstrated that reconstruction and modeling of ancestral topology is an effective strategy to identify remote relationships between proteins.
Affiliation(s)
- Huan Li
  - Department of Biology, College of Arts & Sciences, Saint Louis University, Saint Louis, Missouri, USA
- Theresa Schneider
  - Department of Biology, College of Arts & Sciences, Saint Louis University, Saint Louis, Missouri, USA
- Yongjun Tan
  - Department of Biology, College of Arts & Sciences, Saint Louis University, Saint Louis, Missouri, USA
- Dapeng Zhang
  - Department of Biology, College of Arts & Sciences, Saint Louis University, Saint Louis, Missouri, USA
  - Program of Bioinformatics and Computational Biology, School of Science and Engineering, Saint Louis University, Saint Louis, Missouri, USA
9
SeqCP: A sequence-based algorithm for searching circularly permuted proteins. Comput Struct Biotechnol J 2022; 21:185-201. [PMID: 36582435 PMCID: PMC9763678 DOI: 10.1016/j.csbj.2022.11.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2022] [Revised: 11/10/2022] [Accepted: 11/10/2022] [Indexed: 11/16/2022] Open
Abstract
Circular permutation (CP) is a protein sequence rearrangement in which the amino- and carboxyl-termini of a protein can be created in different positions along the imaginary circularized sequence. Circularly permutated proteins usually exhibit conserved three-dimensional structures and functions. By comparing the structures of circular permutants (CPMs), protein research and bioengineering applications can be approached in ways that are difficult to achieve by traditional mutagenesis. Most current CP detection algorithms depend on structural information. Because there is a vast number of proteins with unknown structures, many CP pairs may remain unidentified. An efficient sequence-based CP detector will help identify more CP pairs and advance many protein studies. For instance, some hypothetical proteins may have CPMs with known functions and structures that are informative for functional annotation, but existing structure-based CP search methods cannot be applied when those hypothetical proteins lack structural information. Despite the considerable potential for applications, sequence-based CP search methods have not been well developed. We present a sequence-based method, SeqCP, which analyzes normal and duplicated sequence alignments to identify CPMs and determine candidate CP sites for proteins. SeqCP was trained by data obtained from the Circular Permutation Database and tested with nonredundant datasets from the Protein Data Bank. It shows high reliability in CP identification and achieves an AUC of 0.9. SeqCP has been implemented into a web server available at: http://pcnas.life.nthu.edu.tw/SeqCP/.
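The idea behind comparing a sequence against a duplicated copy of another can be shown with a deliberately simplified sketch. This is not the SeqCP algorithm: it uses exact string matching instead of sequence alignment, and the function name `find_cp_site` and the example sequences are hypothetical.

```python
from typing import Optional


def find_cp_site(reference: str, query: str) -> Optional[int]:
    """Return the 1-based reference position that becomes the query's first residue.

    Returns None when `query` is not an exact circular permutant of `reference`.
    A result of 1 means the two sequences are identical (no permutation).
    """
    if not reference or len(reference) != len(query):
        return None
    # Every rotation of `reference` appears as a substring of its duplicate.
    doubled = reference + reference
    index = doubled.find(query)
    return None if index == -1 else index + 1


print(find_cp_site("MKTAYIAKQR", "AYIAKQRMKT"))  # 4
print(find_cp_site("MKTAYIAKQR", "AYIAKQRMKA"))  # None
```

Real homologous permutants differ by substitutions, insertions, and linkers, so a practical detector would need alignment scores rather than exact matches; the duplication trick only shows why the permuted segments line up again.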
Key Words
- AUC, area under the ROC curve
- CE, combinatorial extension
- CE-CP, CE with Circular Permutations
- CP, circular permutation
- CPDB, Circular Permutation Database
- CPMs, circular permutants
- CPSARST, Circular Permutation Search Aided by Ramachandran Sequential Transformation
- Circular permutants
- Circular permutation
- MCC, Matthews correlation coefficient
- Protein sequence analysis
- Protein structure modeling
- RMSD, root-mean-square distance
- ROC, receiver operating characteristic
10
Comparative phylogeny and evolutionary analysis of Dicer-like protein family in two plant monophyletic lineages. J Genet Eng Biotechnol 2022; 20:103. [PMID: 35821291 PMCID: PMC9276914 DOI: 10.1186/s43141-022-00380-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/06/2022] [Accepted: 06/14/2022] [Indexed: 11/10/2022]
Abstract
BACKGROUND Small RNAs (sRNAs) that are not translated into proteins play a pivotal role in regulating the expression of their cognate gene(s) in almost all eukaryotic lineages, including plants. Numerous protein families, such as Dicer, a unique class of Ribonuclease III, have been reported to be involved in sRNA processing pathways and silencing. In this study, we aimed to investigate the phylogenetic relationship and evolutionary history of the Dicer-like (DCL) protein family. RESULTS Our results showed that the DCL family of proteins groups into four main subfamilies (DCLs 1-4) present in either Eudicotyledons or Liliopsida. Close inspection of the phylogenetic trees supports the independent expansion of DCL proteins among the Eudicotyledons and Liliopsida species. They share a common origin, and the main duplication events that formed the DCL subfamilies occurred before the Eudicotyledons/Liliopsida split from their ancestral DCL. In addition, the evidence suggests that the divergence happened when multicellularization started and complex gene regulation became a necessity for organisms. Since that time, they have evolved independently among the monophyletic lineages. Another finding was that the DCL protein subfamilies carry several highly conserved functional domains in plant species that originated from their ancestral architecture. The conservation of these domains is both lineage-specific and inter-lineage-specific. CONCLUSIONS DCL subfamilies (i.e., DCL1-DCL4) fall into their own single clades after diverging from their common ancestor and before the emergence of higher plants. Therefore, it seems that the main duplication events that formed the DCL subfamilies occurred before the Eudicotyledons/Liliopsida split and before the appearance of moss, and after the single-cell green algae. We also observed the same trends among the main DCL subfamilies in functional unit composition and architecture. Despite the long evolutionary course since the divergence of the Liliopsida lineage from the Eudicotyledons, a significant diversifying force on domain composition and orientation was absent. The results of this study provide a deeper insight into DCL protein evolutionary history and possible sequence and structural relationships between DCL protein subfamilies in the main higher plant monophyletic lineages, i.e., Eudicotyledons and Liliopsida.
11
Liu S, Wu K, Chen C. Obtaining protein foldability information from computational models of AlphaFold2 and RoseTTAFold. Comput Struct Biotechnol J 2022; 20:4481-4489. [PMID: 36051869 PMCID: PMC9421090 DOI: 10.1016/j.csbj.2022.08.034] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 07/04/2022] [Revised: 08/15/2022] [Accepted: 08/15/2022] [Indexed: 12/02/2022] Open
Abstract
The recent breakthrough from AlphaFold2 and RoseTTAFold set a profound milestone for solving the protein folding problem, but they were not explicitly trained to predict protein foldability, i.e., whether a protein can really fold into the predicted 3D structure. We wondered if the computational models from AlphaFold2 and RoseTTAFold might carry protein foldability information. Therefore, we predicted the structural models of 159 circular permutants and 158 alanine insertion mutants of the 159-residue dihydrofolate reductase. Our data showed that although AlphaFold2 and RoseTTAFold cannot directly identify unfoldable proteins, the RMSD values of computational models are correlated with protein foldability, with higher RMSD values indicating lower protein foldability. Furthermore, this correlation is independent of secondary structures, and the RMSD values of computational models are quantitatively correlated with protein foldability but not protein functions. Additionally, using a dataset of 129 de novo designed proteins, we showed that inter-model RMSD values between AlphaFold2 models and RoseTTAFold models are a good indicator of protein foldability. Finally, we showed that inter-model RMSD values are also useful for evaluating protein solubility by modeling 1664 natural proteins. Our work could be of great value to the design of novel proteins and the prediction of protein foldability.
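As a reminder of what an inter-model RMSD measures, the following is a minimal sketch of the calculation for two models of the same protein. It assumes the two coordinate lists are already superimposed and list the same atoms in the same order; the abstract does not say how the authors aligned or selected atoms, and the function name `rmsd` and the toy coordinates are hypothetical.

```python
import math
from typing import Sequence, Tuple

Coordinate = Tuple[float, float, float]


def rmsd(model_a: Sequence[Coordinate], model_b: Sequence[Coordinate]) -> float:
    """Root-mean-square deviation between two equally long, pre-aligned coordinate sets."""
    if not model_a or len(model_a) != len(model_b):
        raise ValueError("models must be non-empty and contain the same number of atoms")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(model_a, model_b):
        # Squared Euclidean distance between the paired atoms.
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(model_a))


# Toy coordinates: every atom in model_b is shifted by 1 unit along z.
model_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
model_b = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(rmsd(model_a, model_b))  # 1.0
```

In practice the models would first be optimally superimposed (for example with the Kabsch algorithm) before this formula is applied; that step is omitted here to keep the sketch dependency-free.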
Affiliation(s)
- Sen Liu
  - Key Laboratory of Fermentation Engineering (Ministry of Education) & Cooperative Innovation Center of Industrial Fermentation (Ministry of Education & Hubei Province), Hubei University of Technology, Wuhan 430068, China
  - National “111” Center for Cellular Regulation and Molecular Pharmaceutics, Hubei University of Technology, Wuhan 430068, China
  - Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan 430068, China
  - Corresponding author at: Key Laboratory of Fermentation Engineering (Ministry of Education) & Cooperative Innovation Center of Industrial Fermentation (Ministry of Education & Hubei Province), Hubei University of Technology, Wuhan 430068, China.
- Kan Wu
  - Key Laboratory of Fermentation Engineering (Ministry of Education) & Cooperative Innovation Center of Industrial Fermentation (Ministry of Education & Hubei Province), Hubei University of Technology, Wuhan 430068, China
  - National “111” Center for Cellular Regulation and Molecular Pharmaceutics, Hubei University of Technology, Wuhan 430068, China
  - Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan 430068, China
- Cheng Chen
  - Key Laboratory of Fermentation Engineering (Ministry of Education) & Cooperative Innovation Center of Industrial Fermentation (Ministry of Education & Hubei Province), Hubei University of Technology, Wuhan 430068, China
  - National “111” Center for Cellular Regulation and Molecular Pharmaceutics, Hubei University of Technology, Wuhan 430068, China
  - Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan 430068, China
|
12
|
Bandyopadhyay B, Peleg Y. Application of Restriction Free (RF) Cloning in Circular Permutation. Methods Mol Biol 2022; 2461:149-163. [PMID: 35727449 DOI: 10.1007/978-1-0716-2152-3_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/18/2022]
Abstract
Restriction free (RF) cloning has emerged as a highly efficient technique in the area of genetic engineering. RF cloning has a wide range of applications in plasmid DNA manipulation, including cloning of a single gene, simultaneous assembly of multiple DNA fragments, and mutagenesis from single to multiple simultaneous alterations of a target DNA. Recently, we have developed a new technique of circular permutation using RF cloning. Circular permutation is widely used to investigate the mechanisms of protein folding and function. Previously, restriction enzyme-based cloning was used to introduce circular permutation. Our RF cloning method made the protocol faster and more cost-effective. In this chapter, we describe a step-by-step protocol for generating circular permutants using RF methodology.
|
13
|
Shinohara N, Nishitani K. Cryogenian Origin and Subsequent Diversification of the Plant Cell-Wall Enzyme XTH Family. PLANT & CELL PHYSIOLOGY 2021; 62:1874-1889. [PMID: 34197607 PMCID: PMC8711696 DOI: 10.1093/pcp/pcab093] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 04/08/2021] [Revised: 05/24/2021] [Accepted: 07/01/2021] [Indexed: 05/22/2023]
Abstract
All land plants encode large multigene families of xyloglucan endotransglucosylase/hydrolases (XTHs), plant-specific enzymes that cleave and reconnect plant cell-wall polysaccharides. Despite the ubiquity of these enzymes, considerable uncertainty remains regarding the evolutionary history of the XTH family. Phylogenomic and comparative analyses in this study traced the non-plant origins of the XTH family to Alphaproteobacteria ExoKs, bacterial enzymes involved in loosening biofilms, rather than Firmicutes licheninases, plant biomass digesting enzymes, as previously supposed. The relevant horizontal gene transfer (HGT) event was mapped to the divergence of non-swimming charophycean algae in the Cryogenian geological period. This HGT event was the likely origin of charophycean EG16-2s, which are putative intermediates between ExoKs and XTHs. Another HGT event in the Cryogenian may have led from EG16-2s or ExoKs to fungal Congo Red Hypersensitive proteins (CRHs), enzymes that cleave and reconnect chitin and glucans in fungal cell walls. This successive transfer of enzyme-encoding genes may have supported the adaptation of plants and fungi to the ancient icy environment by facilitating their sessile lifestyles. Furthermore, several protein evolutionary steps, including coevolution of substrate-interacting residues and putative intra-family gene fusion, occurred in the land plant lineage and drove diversification of the XTH family. At least some of those events correlated with the evolutionary gain of broader substrate specificities, which may have underpinned the expansion of the XTH family by enhancing duplicated gene survival. Together, this study highlights the Precambrian evolution of life and the mode of multigene family expansion in the evolutionary history of the XTH family.
Affiliation(s)
- Naoki Shinohara
- *Corresponding authors: Naoki Shinohara; Kazuhiko Nishitani
| | - Kazuhiko Nishitani
- *Corresponding authors: Naoki Shinohara; Kazuhiko Nishitani
| |
|
14
|
Purification and characterization of a highly thermostable GlcNAc-binding lectin from Collaea speciosa seeds. Int J Biol Macromol 2021; 193:1562-1571. [PMID: 34740693 DOI: 10.1016/j.ijbiomac.2021.10.219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2021] [Revised: 10/29/2021] [Accepted: 10/29/2021] [Indexed: 11/21/2022]
Abstract
Lectins from plants of the Diocleinae subtribe often exhibit specificity towards mannose/glucose and derived sugars, with some plants also displaying a second lectin specific to lactose/GalNAc. Here, we present a novel lectin from Collaea speciosa, named CsL, that displays specificity for GlcNAc/glucose. The lectin was extracted from Collaea speciosa seeds and purified by a single chromatographic step on a Sephadex G-50 matrix. In solution, the lectin appears as a dimeric protein composed of 25 kDa monomers. The protein is stable at pH 7-8 and dependent on divalent cations. CsL maintained its agglutination activity after heating to 90 °C for 1 h. Glycan array studies revealed that CsL binds to N-glycans with terminal GlcNAc residues, chitobiose and chitotriose moieties. The partial amino acid sequence of the lectin is similar to that of some lactose-specific lectins from the same subtribe. In contrast to other ConA-like lectins, CsL is not toxic to Artemia. Because of its remarkably different properties and specificity, this lectin could be the first member of a new group inside the Diocleinae lectins.
|
15
|
Nonis SG, Haywood J, Schmidberger JW, Mackie ERR, Soares da Costa TP, Bond CS, Mylne JS. Structural and biochemical analyses of concanavalin A circular permutation by jack bean asparaginyl endopeptidase. THE PLANT CELL 2021; 33:2794-2811. [PMID: 34235541 PMCID: PMC8408470 DOI: 10.1093/plcell/koab130] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 10/30/2020] [Accepted: 05/05/2021] [Indexed: 06/01/2023]
Abstract
Over 30 years ago, an intriguing posttranslational modification was found responsible for creating concanavalin A (conA), a carbohydrate-binding protein from jack bean (Canavalia ensiformis) seeds and a common carbohydrate chromatography reagent. ConA biosynthesis involves what was then an unprecedented rearrangement in amino-acid sequence, whereby the N-terminal half of the gene-encoded conA precursor (pro-conA) is swapped to become the C-terminal half of conA. Asparaginyl endopeptidase (AEP) was shown to be involved, but its mechanism was not fully elucidated. To understand the structural basis and consequences of circular permutation, we generated recombinant jack bean pro-conA plus jack bean AEP (CeAEP1) and solved crystal structures for each to 2.1 and 2.7 Å, respectively. By reconstituting conA biosynthesis in vitro, we prove CeAEP1 alone can perform both cleavage and cleavage-coupled transpeptidation to form conA. CeAEP1 structural analysis reveals how it is capable of carrying out both reactions. Biophysical assays illustrated that pro-conA is less stable than conA. This observation was explained by fewer intermolecular interactions between subunits in the pro-conA crystal structure and is consistent with a difference in the propensity for tetramerization in solution. These findings elucidate the consequences of circular permutation in the only posttranslational example known to occur in nature.
Affiliation(s)
- Samuel G. Nonis
- School of Molecular Sciences, The University of Western Australia, Crawley, Perth 6009, Australia
- The ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Perth 6009, Australia
| | - Joel Haywood
- School of Molecular Sciences, The University of Western Australia, Crawley, Perth 6009, Australia
- The ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Perth 6009, Australia
| | - Jason W. Schmidberger
- School of Molecular Sciences, The University of Western Australia, Crawley, Perth 6009, Australia
| | - Emily R. R. Mackie
- Department of Biochemistry and Genetics, La Trobe Institute for Molecular Science, La Trobe University, Victoria 3086, Australia
| | - Tatiana P. Soares da Costa
- Department of Biochemistry and Genetics, La Trobe Institute for Molecular Science, La Trobe University, Victoria 3086, Australia
| | - Charles S. Bond
- School of Molecular Sciences, The University of Western Australia, Crawley, Perth 6009, Australia
| | - Joshua S. Mylne
- School of Molecular Sciences, The University of Western Australia, Crawley, Perth 6009, Australia
- The ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Perth 6009, Australia
| |
|
16
|
Cheung LKY, Dupuis JH, Dee DR, Bryksa BC, Yada RY. Roles of Plant-Specific Inserts in Plant Defense. TRENDS IN PLANT SCIENCE 2020; 25:682-694. [PMID: 32526173 DOI: 10.1016/j.tplants.2020.02.009] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 11/18/2019] [Revised: 02/10/2020] [Accepted: 02/14/2020] [Indexed: 06/11/2023]
Abstract
Ubiquitously expressed in plants, the plant-specific insert (PSI) of typical plant aspartic proteases (tpAPs) has been associated with plant development, stress response, and defense processes against invading pathogens. Despite sharing high sequence identity, structural studies revealed possible different mechanisms of action among species. The PSI induces signaling pathways of defense hormones in vivo and demonstrates broad-spectrum activity against phytopathogens in vitro. Recent characterization of the PSI-tpAP relationship uncovered novel, nonconventional intracellular protein transport pathways and improved tpAP production yields for industrial applications. In spite of research to date, relatively little is known about the structure-function relationships of PSIs. A comprehensive understanding of their biological roles may benefit plant protection strategies against virulent phytopathogens.
Affiliation(s)
- Lennie K Y Cheung
- Faculty of Land and Food Systems, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - John H Dupuis
- Faculty of Land and Food Systems, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Derek R Dee
- Faculty of Land and Food Systems, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Brian C Bryksa
- Faculty of Land and Food Systems, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Rickey Y Yada
- Faculty of Land and Food Systems, University of British Columbia, Vancouver, BC V6T 1Z4, Canada.
| |
|
17
|
Yurkova MS, Zenin VA, Nagibina GS, Melnik BS, Fedorov AN. Physico-Chemical Characterization of Permutated Variants of Chaperone GroEL Apical Domain. APPL BIOCHEM MICRO+ 2019. [DOI: 10.1134/s0003683819130027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]
|
18
|
Yurkova MS, Sharapova OA, Zenin VA, Fedorov AN. Versatile format of minichaperone-based protein fusion system. Sci Rep 2019; 9:15063. [PMID: 31636289 PMCID: PMC6803692 DOI: 10.1038/s41598-019-51015-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 07/22/2019] [Accepted: 09/17/2019] [Indexed: 11/23/2022]
Abstract
Hydrophobic recombinant proteins often tend to aggregate upon expression into inclusion bodies and are difficult to refold. Producing them in soluble forms constitutes a common bottleneck problem. A fusion system for production of insoluble hydrophobic proteins in soluble stable forms with thermophilic minichaperone, GroEL apical domain (GrAD) as a carrier, has recently been developed. To provide the utmost flexibility of the system for interactions between the carrier and various target protein moieties a strategy of making permutated protein variants by gene engineering has been applied: the original N- and C-termini of the minichaperone were linked together by a polypeptide linker and new N- and C-termini were made at desired parts of the protein surface. Two permutated GrAD forms were created and analyzed. Constructs of GrAD and both of its permutated forms fused with the initially insoluble N-terminal fragment of hepatitis C virus' E2 protein were tested. Expressed fusions formed inclusion bodies. After denaturation, all fusions were completely renatured in stable soluble forms. A variety of permutated GrAD variants can be created. The versatile format of the system provides opportunities for choosing an optimal pair between particular target protein moiety and the best-suited original or specific permutated carrier.
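The permutation strategy described in this abstract (join the original termini with a linker, then open the chain at a new site) can be sketched at the sequence level. This is only an illustration and not the authors' cloning protocol. The function `circular_permutant`, the `GGS` linker, and the example sequence are hypothetical.

```python
def circular_permutant(sequence, new_start, linker=""):
    """Return the circular permutant whose N-terminus is residue `new_start` (1-based).

    The original C-terminus is joined to the original N-terminus through
    `linker`, and the chain is reopened just before residue `new_start`.
    """
    if not 1 <= new_start <= len(sequence):
        raise ValueError("new_start must lie within the sequence")
    cut = new_start - 1
    if cut == 0:
        # Opening the chain at the original N-terminus gives back the parent,
        # so no linker is needed.
        return sequence
    return sequence[cut:] + linker + sequence[:cut]


parent = "MKTAYIAKQR"  # made-up 10-residue sequence, for illustration only
print(circular_permutant(parent, 4, linker="GGS"))  # AYIAKQRGGSMKT
```

Looping `new_start` over every position of the parent enumerates all possible cut sites, which is how a full set of permutants for one protein could be listed before any are chosen for expression.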
Affiliation(s)
- Maria S Yurkova
- Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russian Federation
- Tropogen Inc, Moscow, Russia
| | - Olga A Sharapova
- Alder BioPharmaceuticals, Inc., 11804 N Creek Pkwy S, Bothell, WA, 98011, USA
| | - Vladimir A Zenin
- Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russian Federation
| | - Alexey N Fedorov
- Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 119071, Moscow, Russian Federation.
- Tropogen Inc, Moscow, Russia.
| |
|
19
|
Pavlenko OS, Gra OA, Mustafaev ON, Kabarbaeva KV, Sadovskaya NS, Tyurin AA, Fadeev VS, Goldenkova-Pavlova IV. Thermostable Lichenase from Clostridium thermocellum as a Host Protein in the Domain Insertion Approach. BIOCHEMISTRY. BIOKHIMIIA 2019; 84:931-940. [PMID: 31522675 DOI: 10.1134/s0006297919080091] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 03/03/2019] [Revised: 04/29/2019] [Accepted: 04/30/2019] [Indexed: 12/17/2023]
Abstract
Clostridium thermocellum lichenase (endo-β-1,3;1,4-glucan-D-glycosyl hydrolase, EC 3.2.1.73 (P29716)) has been tested for the insertion of two model fluorescent proteins (EGFP and TagRFP) into two regions of this enzyme. Functional folding of the resulting proteins was confirmed by retention of lichenase activity and EGFP and TagRFP fluorescence. These results convincingly demonstrate that (i) the two experimentally selected lichenase loop regions may serve as the areas for domain insertion without disturbing enzyme folding in vivo; (ii) lichenase permits not only single but also tandem insertions of large protein domains. High specific activity, outstanding thermostability, and efficient in vitro refolding of thermostable lichenase make it an attractive new host protein for the insertional fusion of domains in the engineering of multifunctional proteins.
Affiliation(s)
- O S Pavlenko
- Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, Moscow, 127276, Russia.
| | - O A Gra
- Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, Moscow, 127276, Russia
| | - O N Mustafaev
- Baku State University, Department of Biophysics and Molecular Biology, Baku, AZ1106, Azerbaijan.
| | - K V Kabarbaeva
- Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, Moscow, 127276, Russia
| | - N S Sadovskaya
- Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, Moscow, 127276, Russia
| | - A A Tyurin
- Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, Moscow, 127276, Russia
| | - V S Fadeev
- Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, Moscow, 127276, Russia
| | - I V Goldenkova-Pavlova
- Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, Moscow, 127276, Russia.
| |
|
20
|
Lafita A, Tian P, Best RB, Bateman A. Tandem domain swapping: determinants of multidomain protein misfolding. Curr Opin Struct Biol 2019; 58:97-104. [PMID: 31260947 PMCID: PMC6863430 DOI: 10.1016/j.sbi.2019.05.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Received: 05/03/2019] [Accepted: 05/13/2019] [Indexed: 11/25/2022]
Abstract
Domain swapping refers to the exchange of structural elements between protein domains. Experiments show that tandem homologous domains are prone to domain swapping. Recent studies establish a framework to understand the formation of tandem domain swaps. Prediction of tandem domain swaps is possible but hindered by the amount of available data.
Tandem homologous domains in proteins are susceptible to misfolding through the formation of domain swaps, non-native conformations involving the exchange of equivalent structural elements between adjacent domains. Cutting-edge biophysical experiments have recently allowed the observation of tandem domain swapping events at the single-molecule level. In addition, computer simulations have shed light on the molecular mechanisms of domain swap formation and serve as the basis for methods to systematically predict them. At present, the number of studies on tandem domain swaps is still small and limited to a few domain folds, but they offer important insights into the folding and evolution of multidomain proteins, with applications in the field of protein design.
Affiliation(s)
- Aleix Lafita
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
| | - Pengfei Tian
- Novozymes A/S, Krogshøjvej 36, DK-2880 Bagsværd, Denmark
| | - Robert B Best
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| |
|
21
|
Bandyopadhyay B, Peleg Y. Facilitating circular permutation using Restriction Free (RF) cloning. Protein Eng Des Sel 2019; 31:65-68. [PMID: 29319799 DOI: 10.1093/protein/gzx061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 09/23/2017] [Accepted: 11/14/2017] [Indexed: 02/02/2023]
Abstract
Circular permutation is a powerful tool to test the role of topology in protein folding and function. Previous methods for generating circular permutants were based on rearranging gene elements using restriction enzyme-based cloning. Here, we present a Restriction Free (RF) approach to achieve circular permutation that is faster and more cost-effective.
Affiliation(s)
| | - Yoav Peleg
- The Israel Structural Proteomics Center (ISPC), Weizmann Institute of Science, Rehovot 7610001, Israel
| |
|
22
|
Kovacs NA, Penev PI, Venapally A, Petrov AS, Williams LD. Circular Permutation Obscures Universality of a Ribosomal Protein. J Mol Evol 2018; 86:581-592. [PMID: 30306205 DOI: 10.1007/s00239-018-9869-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 08/23/2018] [Accepted: 09/28/2018] [Indexed: 12/29/2022]
Abstract
Functions, origins, and evolution of the translation system are best understood in the context of unambiguous and phylogenetically based taxonomy and nomenclature. Here, we map ribosomal proteins onto the tree of life and provide a nomenclature for ribosomal proteins that is consistent with phylogenetic relationships. We have increased the accuracy of homology relationships among ribosomal proteins, providing a more informative picture of their lineages. We demonstrate that bL33 (bacteria) and eL42 (archaea/eukarya) are homologs with common ancestry and acute similarities in sequence and structure. Their similarities were previously obscured by circular permutation. The most likely mechanism of permutation between bL33 and eL42 is duplication followed by fusion and deletion of both the first and last β-hairpins. bL33 and eL42 are composed of zinc ribbon protein folds, one of the most common zinc finger fold groups and the one most frequently observed in translation-related domains. Bacterial-specific ribosomal protein bL33 and archaeal/eukaryotic-specific ribosomal protein eL42 are now both assigned the name of uL33, indicating a universal ribosomal protein. We provide a phylogenetic naming scheme for all ribosomal proteins that is based on phylogenetic relationships to be used as a tool for studying the systemics, evolution, and origins of the ribosome.
Affiliation(s)
- Nicholas A Kovacs
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA
| | - Petar I Penev
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA
| | - Amitej Venapally
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA
| | - Anton S Petrov
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA.
| | - Loren Dean Williams
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA.
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332-0400, USA.
| |
|
23
|
Herrera-Morande A, Castro-Fernández V, Merino F, Ramírez-Sarmiento CA, Fernández FJ, Vega MC, Guixé V. Protein topology determines substrate-binding mechanism in homologous enzymes. Biochim Biophys Acta Gen Subj 2018; 1862:2869-2878. [PMID: 30251675 DOI: 10.1016/j.bbagen.2018.09.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Received: 05/22/2018] [Revised: 08/21/2018] [Accepted: 09/11/2018] [Indexed: 10/28/2022]
Abstract
During evolution, some homologous proteins appear with different connectivity between secondary structures (different topology) while conserving their three-dimensional arrangement (same architecture). These events can produce two types of arrangements: circular permutations or non-cyclic permutations. The first results in the N- and C-termini moving to a different position in the protein sequence, while the second refers to a more complex rearrangement of the structural elements. In the ribokinase superfamily, two different topologies can be identified, which are related to each other by a non-cyclic permutation that occurred during evolution. Interestingly, this change in topology is correlated with the nucleotide specificity of its members. Thereby, the connectivity of the secondary elements allows us to distinguish an ATP-dependent and an ADP-dependent topology. Here we address the impact of introducing the topology of a homologous ATP-dependent kinase into an ADP-dependent kinase (Thermococcus litoralis glucokinase, TlGK) on the structure, nucleotide specificity, and substrate binding order of the engineered enzyme. Structural evidence demonstrates that rewiring the topology of TlGK leads to an active and soluble enzyme without modifications to its three-dimensional architecture. The permuted enzyme (PerGK) retains the nucleotide preference of the parent TlGK enzyme but shows a change in the substrate binding order. Our results illustrate how the rearrangement of the protein folding topology during the evolution of the ribokinase superfamily enzymes may have dictated the substrate-binding order in homologous enzymes of this superfamily.
Affiliation(s)
| | | | - Felipe Merino
- Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago, Chile
| | | | - Francisco J Fernández
- Centro de Investigaciones Biológicas (CIB-CSIC), Structural and Chemical Biology Dep., Madrid, Spain
| | - M Cristina Vega
- Centro de Investigaciones Biológicas (CIB-CSIC), Structural and Chemical Biology Dep., Madrid, Spain.
| | - Victoria Guixé
- Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago, Chile.
| |
|
24
|
Albert P, Varga B, Zsibrita N, Kiss A. Circularly permuted variants of two CG-specific prokaryotic DNA methyltransferases. PLoS One 2018; 13:e0197232. [PMID: 29746549 PMCID: PMC5944983 DOI: 10.1371/journal.pone.0197232] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 01/26/2018] [Accepted: 04/27/2018] [Indexed: 01/06/2023]
Abstract
The highly similar prokaryotic DNA (cytosine-5) methyltransferases (C5-MTases) M.MpeI and M.SssI share the specificity of eukaryotic C5-MTases (5'-CG), and can be useful research tools in the study of eukaryotic DNA methylation and epigenetic regulation. In an effort to improve the stability and solubility of complementing fragments of the two MTases, genes encoding circularly permuted (CP) variants of M.MpeI and M.SssI were created, and cloned in a plasmid vector downstream of an arabinose-inducible promoter. MTase activity of the CP variants was tested by digestion of the plasmids with methylation-sensitive restriction enzymes. Eleven of the fourteen M.MpeI permutants and six of the seven M.SssI permutants had detectable MTase activity as indicated by the full or partial protection of the plasmid carrying the cpMTase gene. Permutants cp62M.MpeI and cp58M.SssI, in which the new N-termini are located between conserved motifs II and III, had by far the highest activity. The activity of cp62M.MpeI was comparable to the activity of wild-type M.MpeI. Based on the location of the split sites, the permutants possessing MTase activity can be classified into ten types. Although most permutation sites were designed to fall outside of conserved motifs, and the MTase activity of the permutants measured in cell extracts was in most cases substantially lower than that of the wild-type enzyme, the high proportion of circular permutation topologies compatible with MTase activity is remarkable, and is new evidence for the structural plasticity of C5-MTases. A computer search of the REBASE database identified putative C5-MTases with CP arrangement. Interestingly, all natural circularly permuted C5-MTases appear to represent only one of the ten types of permutation topology created in this work.
Affiliation(s)
- Pál Albert
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
- Doctoral School in Biology, Faculty of Science and Informatics, University of Szeged, Szeged, Hungary
| | - Bence Varga
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Nikolett Zsibrita
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
- Doctoral School in Biology, Faculty of Science and Informatics, University of Szeged, Szeged, Hungary
| | - Antal Kiss
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| |
|
25
|
Lateef Z, Gimenez G, Baker ES, Ward VK. Transcriptomic analysis of human norovirus NS1-2 protein highlights a multifunctional role in murine monocytes. BMC Genomics 2017; 18:39. [PMID: 28056773 PMCID: PMC5217272 DOI: 10.1186/s12864-016-3417-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Received: 08/19/2016] [Accepted: 12/12/2016] [Indexed: 12/22/2022]
Abstract
Background: The GII.4 Sydney 2012 strain of human norovirus (HuNoV) is a pandemic strain that is responsible for the majority of norovirus outbreaks in healthcare settings. The function of the non-structural (NS)1-2 protein from HuNoV is unknown. Results: In silico analysis of human norovirus NS1-2 protein showed that it shares features with the murine NS1-2 protein, including a disordered region, a transmembrane domain and H-box and NC sequence motifs. The proteins also contain caspase cleavage and phosphorylation sites, indicating that processing and phosphorylation may be a conserved feature of norovirus NS1-2 proteins. In this study, RNA transcripts of human and murine norovirus full-length and the disordered region of NS1-2 were transfected into monocytes, and next generation sequencing was used to analyse the transcriptomic profile of cells expressing virus proteins. The profiles were then compared to the transcriptomic profile of MNV-infected cells. Conclusions: RNAseq analysis showed that NS1-2 proteins from human and murine noroviruses affect multiple immune systems (chemokine, cytokine, and Toll-like receptor signaling) and intracellular pathways (NFκB, MAPK, PI3K-Akt signaling) in murine monocytes. Comparison to the transcriptomic profile of MNV-infected cells indicated the pathways that NS1-2 may affect during norovirus infection.
Affiliation(s)
- Zabeen Lateef
- Department of Microbiology and Immunology, Otago School of Medical Sciences, University of Otago, 720 Cumberland St, Dunedin, 9054, New Zealand.
| | - Gregory Gimenez
- Otago Genomics and Bioinformatics Facility, University of Otago, Dunedin, 9054, New Zealand
| | - Estelle S Baker
- Department of Microbiology and Immunology, Otago School of Medical Sciences, University of Otago, 720 Cumberland St, Dunedin, 9054, New Zealand
| | - Vernon K Ward
- Department of Microbiology and Immunology, Otago School of Medical Sciences, University of Otago, 720 Cumberland St, Dunedin, 9054, New Zealand
| |
|
26
|
Structure prediction and functional analysis of a non-permutated lectin from Dioclea grandiflora. Biochimie 2016; 131:54-67. [DOI: 10.1016/j.biochi.2016.09.010] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 02/23/2015] [Accepted: 09/19/2016] [Indexed: 01/22/2023]
|
27
|
Shishovs M, Rumnieks J, Diebolder C, Jaudzems K, Andreas LB, Stanek J, Kazaks A, Kotelovica S, Akopjana I, Pintacuda G, Koning RI, Tars K. Structure of AP205 Coat Protein Reveals Circular Permutation in ssRNA Bacteriophages. J Mol Biol 2016; 428:4267-4279. [PMID: 27591890 DOI: 10.1016/j.jmb.2016.08.025] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Received: 06/13/2016] [Revised: 08/18/2016] [Accepted: 08/27/2016] [Indexed: 12/18/2022]
Abstract
AP205 is a single-stranded RNA bacteriophage that has a coat protein sequence not similar to any other known single-stranded RNA phage. Here, we report an atomic-resolution model of the AP205 virus-like particle based on a crystal structure of an unassembled coat protein dimer and a cryo-electron microscopy reconstruction of the assembled particle, together with secondary structure information from site-specific solid-state NMR data. The AP205 coat protein dimer adopts the conserved Leviviridae coat protein fold except for the N-terminal region, which forms a beta-hairpin in the other known single-stranded RNA phages. AP205 has a similar structure at the same location formed by N- and C-terminal beta-strands, making it a circular permutant compared to the other coat proteins. The permutation moves the coat protein termini to the most surface-exposed part of the assembled particle, which explains its increased tolerance to long N- and C-terminal fusions.
Affiliation(s)
- Mihails Shishovs
- Latvian Biomedical Research and Study Center, Rātsupītes 1, LV1067 Riga, Latvia
- Janis Rumnieks
- Latvian Biomedical Research and Study Center, Rātsupītes 1, LV1067 Riga, Latvia
- Christoph Diebolder
- Netherlands Centre for Electron Nanoscopy, Institute of Biology Leiden, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands
- Kristaps Jaudzems
- Université de Lyon, Centre de RMN à Très Hauts Champs, Institut des Sciences Analytiques (UMR 5280 - CNRS, ENS Lyon, UCB Lyon 1), 69100 Villeurbanne, France
- Loren B Andreas
- Université de Lyon, Centre de RMN à Très Hauts Champs, Institut des Sciences Analytiques (UMR 5280 - CNRS, ENS Lyon, UCB Lyon 1), 69100 Villeurbanne, France
- Jan Stanek
- Université de Lyon, Centre de RMN à Très Hauts Champs, Institut des Sciences Analytiques (UMR 5280 - CNRS, ENS Lyon, UCB Lyon 1), 69100 Villeurbanne, France
- Andris Kazaks
- Latvian Biomedical Research and Study Center, Rātsupītes 1, LV1067 Riga, Latvia
- Svetlana Kotelovica
- Latvian Biomedical Research and Study Center, Rātsupītes 1, LV1067 Riga, Latvia
- Inara Akopjana
- Latvian Biomedical Research and Study Center, Rātsupītes 1, LV1067 Riga, Latvia
- Guido Pintacuda
- Université de Lyon, Centre de RMN à Très Hauts Champs, Institut des Sciences Analytiques (UMR 5280 - CNRS, ENS Lyon, UCB Lyon 1), 69100 Villeurbanne, France
- Roman I Koning
- Netherlands Centre for Electron Nanoscopy, Institute of Biology Leiden, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands; Department of Cell Biology, Leiden University Medical Center, Postal Zone S1-P, P.O.Box 9600, 2300 RC Leiden, The Netherlands
- Kaspars Tars
- Latvian Biomedical Research and Study Center, Rātsupītes 1, LV1067 Riga, Latvia; Faculty of Biology, Department of Molecular Biology, University of Latvia, Jelgavas 1, LV-1004 Riga, Latvia.
28
|
Tian P, Best RB. Structural Determinants of Misfolding in Multidomain Proteins. PLoS Comput Biol 2016; 12:e1004933. [PMID: 27163669 PMCID: PMC4862688 DOI: 10.1371/journal.pcbi.1004933] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Accepted: 04/21/2016] [Indexed: 12/02/2022] Open
Abstract
Recent single molecule experiments, using either atomic force microscopy (AFM) or Förster resonance energy transfer (FRET) have shown that multidomain proteins containing tandem repeats may form stable misfolded structures. Topology-based simulation models have been used successfully to generate models for these structures with domain-swapped features, fully consistent with the available data. However, it is also known that some multidomain protein folds exhibit no evidence for misfolding, even when adjacent domains have identical sequences. Here we pose the question: what factors influence the propensity of a given fold to undergo domain-swapped misfolding? Using a coarse-grained simulation model, we can reproduce the known propensities of multidomain proteins to form domain-swapped misfolds, where data is available. Contrary to what might be naively expected based on the previously described misfolding mechanism, we find that the extent of misfolding is not determined by the relative folding rates or barrier heights for forming the domains present in the initial intermediates leading to folded or misfolded structures. Instead, it appears that the propensity is more closely related to the relative stability of the domains present in folded and misfolded intermediates. We show that these findings can be rationalized if the folded and misfolded domains are part of the same folding funnel, with commitment to one structure or the other occurring only at a relatively late stage of folding. Nonetheless, the results are still fully consistent with the kinetic models previously proposed to explain misfolding, with a specific interpretation of the observed rate coefficients. Finally, we investigate the relation between interdomain linker length and misfolding, and propose a simple alchemical model to predict the propensity for domain-swapped misfolding of multidomain proteins.
Affiliation(s)
- Pengfei Tian
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, United States of America
- Robert B. Best
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, United States of America
29
|
Tyurin A, Sadovskaya N, Nikiforova K, Mustafaev O, Komakhin R, Fadeev V, Goldenkova-Pavlova I. Clostridium thermocellum thermostable lichenase with circular permutations and modifications in the N-terminal region retains its activity and thermostability. Biochimica et Biophysica Acta - Proteins and Proteomics 2015; 1854:10-9. [DOI: 10.1016/j.bbapap.2014.10.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Revised: 09/25/2014] [Accepted: 10/15/2014] [Indexed: 11/30/2022]
30
|
Bliven SE, Bourne PE, Prlić A. Detection of circular permutations within protein structures using CE-CP. Bioinformatics 2014; 31:1316-8. [PMID: 25505094 DOI: 10.1093/bioinformatics/btu823] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Accepted: 12/08/2014] [Indexed: 12/19/2022] Open
Abstract
Motivation: Circular permutation is an important type of protein rearrangement. Natural circular permutations have implications for protein function, stability and evolution. Artificial circular permutations have also been used for protein studies. However, such relationships are difficult to detect for many sequence and structure comparison algorithms and require special consideration. Results: We developed a new algorithm, called Combinatorial Extension for Circular Permutations (CE-CP), which allows the structural comparison of circularly permuted proteins. CE-CP was designed to be user friendly and is integrated into the RCSB Protein Data Bank. It was tested on two collections of circularly permuted proteins. Pairwise alignments can be visualized both in a desktop application or on the web using Jmol and exported to other programs in a variety of formats. Availability and implementation: The CE-CP algorithm can be accessed through the RCSB website at http://www.rcsb.org/pdb/workbench/workbench.do. Source code is available under the LGPL 2.1 as part of BioJava 3 (http://biojava.org; http://github.com/biojava/biojava). Contact: sbliven@ucsd.edu or info@rcsb.org.
Affiliation(s)
- Spencer E Bliven
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92093, USA, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA and RCSB Protein Data Bank, San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA
- Philip E Bourne
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92093, USA, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA and RCSB Protein Data Bank, San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA
- Andreas Prlić
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92093, USA, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA and RCSB Protein Data Bank, San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA
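As a rough illustration of why circularly permuted proteins need special handling in comparison algorithms (entry 30), the relationship can be sketched at the sequence level in Python. This is not the CE-CP algorithm, which aligns 3D structures; it only shows the underlying idea that a circular permutant of a sequence appears as a contiguous window inside that sequence joined to itself. The function names `circular_permutant` and `find_circular_shift`, the `linker` parameter, and the toy sequences are hypothetical and chosen for illustration only.

```python
def circular_permutant(seq: str, new_start: int, linker: str = "") -> str:
    """Build a circular permutant: the residue at index `new_start` becomes the
    new N-terminus, and the original C- and N-termini are joined by `linker`."""
    if not 0 <= new_start < len(seq):
        raise ValueError("new_start must be a valid index into seq")
    return seq[new_start:] + linker + seq[:new_start]


def find_circular_shift(a: str, b: str):
    """Return k such that b == a[k:] + a[:k], or None if b is not an exact
    circular permutant of a. Assumes no linker and no mutations."""
    if len(a) != len(b):
        return None
    if not a:
        return 0
    # Every rotation of `a` occurs as a window of length len(a) inside a + a.
    k = (a + a).find(b)
    return k if k != -1 else None


if __name__ == "__main__":
    parent = "ABCDEFGH"            # toy sequence, not a real protein
    cp = circular_permutant(parent, 3)
    print(cp)
    print(find_circular_shift(parent, cp))
```

Real permutants usually carry a linker and sequence divergence, so an exact string match like this would miss them; that gap is one reason structure-based methods are used instead.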
31
|
Mrozek D, Małysiak-Mrozek B, Kłapciński A. Cloud4Psi: cloud computing for 3D protein structure similarity searching. Bioinformatics 2014; 30:2822-5. [PMID: 24930141 PMCID: PMC4173022 DOI: 10.1093/bioinformatics/btu389] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Summary: Popular methods for 3D protein structure similarity searching, especially those that generate high-quality alignments such as Combinatorial Extension (CE) and Flexible structure Alignment by Chaining Aligned fragment pairs allowing Twists (FATCAT) are still time consuming. As a consequence, performing similarity searching against large repositories of structural data requires increased computational resources that are not always available. Cloud computing provides huge amounts of computational power that can be provisioned on a pay-as-you-go basis. We have developed the cloud-based system that allows scaling of the similarity searching process vertically and horizontally. Cloud4Psi (Cloud for Protein Similarity) was tested in the Microsoft Azure cloud environment and provided good, almost linearly proportional acceleration when scaled out onto many computational units. Availability and implementation: Cloud4Psi is available as Software as a Service for testing purposes at: http://cloud4psi.cloudapp.net/. For source code and software availability, please visit the Cloud4Psi project home page at http://zti.polsl.pl/dmrozek/science/cloud4psi.htm. Contact: dariusz.mrozek@polsl.pl
Affiliation(s)
- Dariusz Mrozek
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
- Bożena Małysiak-Mrozek
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
- Artur Kłapciński
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
32
|
Myers-Turnbull D, Bliven SE, Rose PW, Aziz ZK, Youkharibache P, Bourne PE, Prlić A. Systematic detection of internal symmetry in proteins using CE-Symm. J Mol Biol 2014; 426:2255-68. [PMID: 24681267 DOI: 10.1016/j.jmb.2014.03.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 03/17/2014] [Accepted: 03/18/2014] [Indexed: 11/26/2022]
Abstract
Symmetry is an important feature of protein tertiary and quaternary structures that has been associated with protein folding, function, evolution, and stability. Its emergence and ensuing prevalence has been attributed to gene duplications, fusion events, and subsequent evolutionary drift in sequence. This process maintains structural similarity and is further supported by this study. To further investigate the question of how internal symmetry evolved, how symmetry and function are related, and the overall frequency of internal symmetry, we developed an algorithm, CE-Symm, to detect pseudo-symmetry within the tertiary structure of protein chains. Using a large manually curated benchmark of 1007 protein domains, we show that CE-Symm performs significantly better than previous approaches. We use CE-Symm to build a census of symmetry among domain superfamilies in SCOP and note that 18% of all superfamilies are pseudo-symmetric. Our results indicate that more domains are pseudo-symmetric than previously estimated. We establish a number of recurring types of symmetry-function relationships and describe several characteristic cases in detail. With the use of the Enzyme Commission classification, symmetry was found to be enriched in some enzyme classes but depleted in others. CE-Symm thus provides a methodology for a more complete and detailed study of the role of symmetry in tertiary protein structure [availability: CE-Symm can be run from the Web at http://source.rcsb.org/jfatcatserver/symmetry.jsp. Source code and software binaries are also available under the GNU Lesser General Public License (version 2.1) at https://github.com/rcsb/symmetry. An interactive census of domains identified as symmetric by CE-Symm is available from http://source.rcsb.org/jfatcatserver/scopResults.jsp].
Affiliation(s)
- Douglas Myers-Turnbull
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
- Spencer E Bliven
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
- Peter W Rose
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA
- Zaid K Aziz
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA 92093, USA
- Philip E Bourne
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA.
- Andreas Prlić
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA.
33
|
Dai X, Zhu M, Wang YP. Circular permutation of E. coli EPSP synthase: increased inhibitor resistance, improved catalytic activity, and an indicator for protein fragment complementation. Chem Commun (Camb) 2014; 50:1830-2. [PMID: 24402609 DOI: 10.1039/c3cc48722a] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
We performed the first circular permutation analysis for E. coli 5-enolpyruvylshikimate-3-phosphate synthase, and identified one circular permutant with notably increased resistance to its specific inhibitor and several others with moderately improved catalytic activity. Valid circular permutation sites can be used as effective split sites of protein fragment complementation.
Affiliation(s)
- Xiongfeng Dai
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, 100871, China.
34
|
Bacterial co-expression of the α and β protomers of human l-asparaginase-3: Achieving essential N-terminal exposure of a catalytically critical threonine located in the β-subunit. Protein Expr Purif 2014; 93:1-10. [DOI: 10.1016/j.pep.2013.10.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2013] [Revised: 10/11/2013] [Accepted: 10/14/2013] [Indexed: 02/06/2023]
35
|
Abstract
Rhodopsins are photochemically reactive membrane proteins that covalently bind retinal chromophores. Type I rhodopsins are found in both prokaryotes and eukaryotic microbes, whereas type II rhodopsins function as photoactivated G-protein coupled receptors (GPCRs) in animal vision. Both rhodopsin families share the seven transmembrane α-helix GPCR fold and a Schiff base linkage from a conserved lysine to retinal in helix G. Nevertheless, rhodopsins are widely cited as a striking example of evolutionary convergence, largely because the two families lack detectable sequence similarity and differ in many structural and mechanistic details. Convergence entails that the shared rhodopsin fold is so especially suited to photosensitive function that proteins from separate origins were selected for this architecture twice. Here we show, however, that the rhodopsin fold is not required for photosensitive activity. We engineered functional bacteriorhodopsin variants with novel folds, including radical noncircular permutations of the α-helices, circular permutations of an eight-helix construct, and retinal linkages relocated to other helices. These results contradict a key prediction of convergence and thereby provide an experimental attack on one of the most intractable problems in molecular evolution: how to establish structural homology for proteins devoid of discernible sequence similarity.
36
|
Debès C, Wang M, Caetano-Anollés G, Gräter F. Evolutionary optimization of protein folding. PLoS Comput Biol 2013; 9:e1002861. [PMID: 23341762 PMCID: PMC3547816 DOI: 10.1371/journal.pcbi.1002861] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2012] [Accepted: 11/09/2012] [Indexed: 01/07/2023] Open
Abstract
Nature has shaped the make up of proteins since their appearance, 3.8 billion years ago. However, the fundamental drivers of structural change responsible for the extraordinary diversity of proteins have yet to be elucidated. Here we explore if protein evolution affects folding speed. We estimated folding times for the present-day catalog of protein domains directly from their size-modified contact order. These values were mapped onto an evolutionary timeline of domain appearance derived from a phylogenomic analysis of protein domains in 989 fully-sequenced genomes. Our results show a clear overall increase of folding speed during evolution, with known ultra-fast downhill folders appearing rather late in the timeline. Remarkably, folding optimization depends on secondary structure. While alpha-folds showed a tendency to fold faster throughout evolution, beta-folds exhibited a trend of folding time increase during the last 1.5 billion years that began during the “big bang” of domain combinations. As a consequence, these domain structures are on average slow folders today. Our results suggest that fast and efficient folding of domains shaped the universe of protein structure. This finding supports the hypothesis that optimization of the kinetic and thermodynamic accessibility of the native fold reduces protein aggregation propensities that hamper cellular functions.
Nature has come up with an enormous variety of protein three-dimensional structures, each of which is thought to be optimized for its specific function. A fundamental biological endeavor is to uncover the driving evolutionary forces for discovering and optimizing new folds. A long-standing hypothesis is that fold evolution obeys constraints to properly fold into native structure. We here test this hypothesis by analyzing trends of proteins to fold fast during evolution. Using phylogenomic and structural analyses, we observe an overall decrease in folding times between 3.8 and 1.5 billion years ago, which can be interpreted as an evolutionary optimization for rapid folding. This trend towards fast folding probably resulted in manifold advantages, including high protein accessibility for the cell and a reduction of protein aggregation during misfolding.
Affiliation(s)
- Cédric Debès
- Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Minglei Wang
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, United States of America
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, United States of America
- * E-mail: (GCA); (FG)
- Frauke Gräter
- Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- CAS-MPG Partner Institute and Key Laboratory for Computational Biology, Shanghai, China
- * E-mail: (GCA); (FG)
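The folding-time estimates described in entry 36 are derived from contact order. A minimal Python sketch of the plain relative contact order is given here for orientation: the average sequence separation of residue pairs in contact, divided by chain length. It assumes one coordinate per residue (for example Cα atoms), an 8 Å distance cutoff, and a minimum sequence separation of 1; these choices and the function name `relative_contact_order` are illustrative assumptions, and the size modification used in the paper is not reproduced.

```python
import math


def relative_contact_order(coords, cutoff=8.0, min_sep=1):
    """Relative contact order: (1 / (L * N)) * sum of |i - j| over the N residue
    pairs in contact, where L is the chain length.

    coords  : list of (x, y, z) tuples, one per residue (assumed C-alpha atoms)
    cutoff  : contact distance threshold in angstroms (assumed value)
    min_sep : minimum sequence separation for a pair to be considered
    """
    n_res = len(coords)
    total_separation = 0
    n_contacts = 0
    for i in range(n_res):
        for j in range(i + min_sep, n_res):
            if math.dist(coords[i], coords[j]) <= cutoff:
                total_separation += j - i
                n_contacts += 1
    if n_contacts == 0:
        return 0.0
    return total_separation / (n_res * n_contacts)


if __name__ == "__main__":
    # Toy chain: four residues on a straight line, 5 angstroms apart.
    chain = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (10.0, 0.0, 0.0), (15.0, 0.0, 0.0)]
    print(relative_contact_order(chain))
```

In the toy chain only sequence neighbours are within the cutoff, so the value is low; folds with many contacts between residues far apart in sequence give higher values, which is generally associated with slower folding.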
37
|
Searls DB. A primer in macromolecular linguistics. Biopolymers 2012; 99:203-17. [PMID: 23034580 DOI: 10.1002/bip.22101] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2012] [Accepted: 05/25/2012] [Indexed: 01/01/2023]
Abstract
Polymeric macromolecules, when viewed abstractly as strings of symbols, can be treated in terms of formal language theory, providing a mathematical foundation for characterizing such strings both as collections and in terms of their individual structures. In addition this approach offers a framework for analysis of macromolecules by tools and conventions widely used in computational linguistics. This article introduces the ways that linguistics can be and has been applied to molecular biology, covering the relevant formal language theory at a relatively nontechnical level. Analogies between macromolecules and human natural language are used to provide intuitive insights into the relevance of grammars, parsing, and analysis of language complexity to biology.
38
|
Prlić A, Yates A, Bliven SE, Rose PW, Jacobsen J, Troshin PV, Chapman M, Gao J, Koh CH, Foisy S, Holland R, Rimsa G, Heuer ML, Brandstätter-Müller H, Bourne PE, Willis S. BioJava: an open-source framework for bioinformatics in 2012. Bioinformatics 2012; 28:2693-5. [PMID: 22877863 PMCID: PMC3467744 DOI: 10.1093/bioinformatics/bts494] [Citation(s) in RCA: 149] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Motivation: BioJava is an open-source project for processing of biological data in the Java programming language. We have recently released a new version (3.0.5), which is a major update to the code base that greatly extends its functionality. Results: BioJava now consists of several independent modules that provide state-of-the-art tools for protein structure comparison, pairwise and multiple sequence alignments, working with DNA and protein sequences, analysis of amino acid properties, detection of protein modifications and prediction of disordered regions in proteins as well as parsers for common file formats using a biologically meaningful data model. Availability: BioJava is an open-source project distributed under the Lesser GPL (LGPL). BioJava can be downloaded from the BioJava website (http://www.biojava.org). BioJava requires Java 1.6 or higher. All inquiries should be directed to the BioJava mailing lists. Details are available at http://biojava.org/wiki/BioJava:MailingLists Contact: andreas.prlic@gmail.com
Affiliation(s)
- Andreas Prlić
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA.
39
|
Wodak SJ, Mietchen D, Collings AM, Russell RB, Bourne PE. Topic pages: PLoS Computational Biology meets Wikipedia. PLoS Comput Biol 2012; 8:e1002446. [PMID: 22479174 PMCID: PMC3315447 DOI: 10.1371/journal.pcbi.1002446] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Shoshana J. Wodak
- Hospital for Sick Children, Toronto, Canada
- Department of Biochemistry, University of Toronto, Toronto, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
- Daniel Mietchen
- EvoMRI Communications, Jena, Germany
- Open Knowledge Foundation Germany, Berlin, Germany
- Philip E. Bourne
- Department of Pharmacology, University of California San Diego, La Jolla, California, United States of America
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America