1
|
Kinch LN, Schaeffer RD, Zhang J, Cong Q, Orth K, Grishin N. Insights into virulence: structure classification of the Vibrio parahaemolyticus RIMD mobilome. mSystems 2023; 8:e0079623. [PMID: 38014954 PMCID: PMC10734457 DOI: 10.1128/msystems.00796-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 10/17/2023] [Indexed: 11/29/2023] Open
Abstract
IMPORTANCE The pandemic Vpar strain RIMD causes seafood-borne illness worldwide. Previous comparative genomic studies have revealed pathogenicity islands in RIMD that contribute to the success of the strain in infection. However, not all virulence determinants have been identified, and many of the proteins encoded in known pathogenicity islands are of unknown function. Based on the EOCD database, we used evolution-based classification of structure models for the RIMD proteome to improve our functional understanding of virulence determinants acquired by the pandemic strain. We further identify and classify previously unknown mobile protein domains as well as fast evolving residue positions in structure models that contribute to virulence and adaptation with respect to a pre-pandemic strain. Our work highlights key contributions of phage in mediating seafood born illness, suggesting this strain balances its avoidance of phage predators with its successful colonization of human hosts.
Collapse
Affiliation(s)
- Lisa N. Kinch
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - R. Dustin Schaeffer
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Jing Zhang
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Qian Cong
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Kim Orth
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Nick Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| |
Collapse
|
2
|
Nicastro GG, Burroughs AM, Iyer L, Aravind L. Functionally comparable but evolutionarily distinct nucleotide-targeting effectors help identify conserved paradigms across diverse immune systems. Nucleic Acids Res 2023; 51:11479-11503. [PMID: 37889040 PMCID: PMC10681802 DOI: 10.1093/nar/gkad879] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 09/21/2023] [Accepted: 09/28/2023] [Indexed: 10/28/2023] Open
Abstract
While nucleic acid-targeting effectors are known to be central to biological conflicts and anti-selfish element immunity, recent findings have revealed immune effectors that target their building blocks and the cellular energy currency-free nucleotides. Through comparative genomics and sequence-structure analysis, we identified several distinct effector domains, which we named Calcineurin-CE, HD-CE, and PRTase-CE. These domains, along with specific versions of the ParB and MazG domains, are widely present in diverse prokaryotic immune systems and are predicted to degrade nucleotides by targeting phosphate or glycosidic linkages. Our findings unveil multiple potential immune systems associated with at least 17 different functional themes featuring these effectors. Some of these systems sense modified DNA/nucleotides from phages or operate downstream of novel enzymes generating signaling nucleotides. We also uncovered a class of systems utilizing HSP90- and HSP70-related modules as analogs of STAND and GTPase domains that are coupled to these nucleotide-targeting- or proteolysis-induced complex-forming effectors. While widespread in bacteria, only a limited subset of nucleotide-targeting effectors was integrated into eukaryotic immune systems, suggesting barriers to interoperability across subcellular contexts. This work establishes nucleotide-degrading effectors as an emerging immune paradigm and traces their origins back to homologous domains in housekeeping systems.
Collapse
Affiliation(s)
- Gianlucca G Nicastro
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| | - A Maxwell Burroughs
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| | - Lakshminarayan M Iyer
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| | - L Aravind
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, USA
| |
Collapse
|
3
|
Zabrady M, Zabrady K, Li AH, Doherty AJ. Reverse transcriptases prime DNA synthesis. Nucleic Acids Res 2023; 51:7125-7142. [PMID: 37279911 PMCID: PMC10415136 DOI: 10.1093/nar/gkad478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 05/11/2023] [Accepted: 05/19/2023] [Indexed: 06/08/2023] Open
Abstract
The discovery of reverse transcriptases (RTs) challenged the central dogma by establishing that genetic information can also flow from RNA to DNA. Although they act as DNA polymerases, RTs are distantly related to replicases that also possess de novo primase activity. Here we identify that CRISPR associated RTs (CARTs) directly prime DNA synthesis on both RNA and DNA. We demonstrate that RT-dependent priming is utilized by some CRISPR-Cas complexes to synthesise new spacers and integrate these into CRISPR arrays. Expanding our analyses, we show that primer synthesis activity is conserved in representatives of other major RT classes, including group II intron RT, telomerase and retroviruses. Together, these findings establish a conserved innate ability of RTs to catalyse de novo DNA primer synthesis, independently of accessory domains or alternative priming mechanisms, which likely plays important roles in a wide variety of biological pathways.
Collapse
Affiliation(s)
- Matej Zabrady
- Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Brighton BN1 9RQ, UK
| | - Katerina Zabrady
- Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Brighton BN1 9RQ, UK
| | - Arthur W H Li
- Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Brighton BN1 9RQ, UK
| | - Aidan J Doherty
- Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Brighton BN1 9RQ, UK
| |
Collapse
|
4
|
Wozniak K, Brzezinski K. Biological Catalysis and Information Storage Have Relied on N-Glycosyl Derivatives of β-D-Ribofuranose since the Origins of Life. Biomolecules 2023; 13:biom13050782. [PMID: 37238652 DOI: 10.3390/biom13050782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 04/24/2023] [Accepted: 04/29/2023] [Indexed: 05/28/2023] Open
Abstract
Most naturally occurring nucleotides and nucleosides are N-glycosyl derivatives of β-d-ribose. These N-ribosides are involved in most metabolic processes that occur in cells. They are essential components of nucleic acids, forming the basis for genetic information storage and flow. Moreover, these compounds are involved in numerous catalytic processes, including chemical energy production and storage, in which they serve as cofactors or coribozymes. From a chemical point of view, the overall structure of nucleotides and nucleosides is very similar and simple. However, their unique chemical and structural features render these compounds versatile building blocks that are crucial for life processes in all known organisms. Notably, the universal function of these compounds in encoding genetic information and cellular catalysis strongly suggests their essential role in the origins of life. In this review, we summarize major issues related to the role of N-ribosides in biological systems, especially in the context of the origin of life and its further evolution, through the RNA-based World(s), toward the life we observe today. We also discuss possible reasons why life has arisen from derivatives of β-d-ribofuranose instead of compounds based on other sugar moieties.
Collapse
Affiliation(s)
- Katarzyna Wozniak
- Department of Structural Biology of Prokaryotic Organisms, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-074 Poznan, Poland
| | - Krzysztof Brzezinski
- Department of Structural Biology of Prokaryotic Organisms, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-074 Poznan, Poland
| |
Collapse
|
5
|
Burroughs A, Aravind L. New biochemistry in the Rhodanese-phosphatase superfamily: emerging roles in diverse metabolic processes, nucleic acid modifications, and biological conflicts. NAR Genom Bioinform 2023; 5:lqad029. [PMID: 36968430 PMCID: PMC10034599 DOI: 10.1093/nargab/lqad029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/10/2023] [Accepted: 03/09/2023] [Indexed: 03/25/2023] Open
Abstract
The protein-tyrosine/dual-specificity phosphatases and rhodanese domains constitute a sprawling superfamily of Rossmannoid domains that use a conserved active site with a cysteine to catalyze a range of phosphate-transfer, thiotransfer, selenotransfer and redox activities. While these enzymes have been extensively studied in the context of protein/lipid head group dephosphorylation and various thiotransfer reactions, their overall diversity and catalytic potential remain poorly understood. Using comparative genomics and sequence/structure analysis, we comprehensively investigate and develop a natural classification for this superfamily. As a result, we identified several novel clades, both those which retain the catalytic cysteine and those where a distinct active site has emerged in the same location (e.g. diphthine synthase-like methylases and RNA 2' OH ribosyl phosphate transferases). We also present evidence that the superfamily has a wider range of catalytic capabilities than previously known, including a set of parallel activities operating on various sugar/sugar alcohol groups in the context of NAD+-derivatives and RNA termini, and potential phosphate transfer activities involving sugars and nucleotides. We show that such activities are particularly expanded in the RapZ-C-DUF488-DUF4326 clade, defined here for the first time. Some enzymes from this clade are predicted to catalyze novel DNA-end processing activities as part of nucleic-acid-modifying systems that are likely to function in biological conflicts between viruses and their hosts.
Collapse
Affiliation(s)
- A Maxwell Burroughs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
6
|
Janzen E, Shen Y, Vázquez-Salazar A, Liu Z, Blanco C, Kenchel J, Chen IA. Emergent properties as by-products of prebiotic evolution of aminoacylation ribozymes. Nat Commun 2022; 13:3631. [PMID: 35752631 PMCID: PMC9233669 DOI: 10.1038/s41467-022-31387-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2021] [Accepted: 06/16/2022] [Indexed: 11/24/2022] Open
Abstract
Systems of catalytic RNAs presumably gave rise to important evolutionary innovations, such as the genetic code. Such systems may exhibit particular tolerance to errors (error minimization) as well as coding specificity. While often assumed to result from natural selection, error minimization may instead be an emergent by-product. In an RNA world, a system of self-aminoacylating ribozymes could enforce the mapping of amino acids to anticodons. We measured the activity of thousands of ribozyme mutants on alternative substrates (activated analogs for tryptophan, phenylalanine, leucine, isoleucine, valine, and methionine). Related ribozymes exhibited shared preferences for substrates, indicating that adoption of additional amino acids by existing ribozymes would itself lead to error minimization. Furthermore, ribozyme activity was positively correlated with specificity, indicating that selection for increased activity would also lead to increased specificity. These results demonstrate that by-products of ribozyme evolution could lead to adaptive value in specificity and error tolerance.
Collapse
Affiliation(s)
- Evan Janzen
- Program in Biomolecular Science and Engineering, University of California, Santa Barbara, CA, 93106, USA.,Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA, 93106, USA
| | - Yuning Shen
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA, 93106, USA.,Department of Chemical and Biomolecular Engineering, Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA
| | - Alberto Vázquez-Salazar
- Department of Chemical and Biomolecular Engineering, Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA
| | - Ziwei Liu
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge, CB2 0QH, UK
| | - Celia Blanco
- Department of Chemical and Biomolecular Engineering, Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA
| | - Josh Kenchel
- Program in Biomolecular Science and Engineering, University of California, Santa Barbara, CA, 93106, USA.,Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA, 93106, USA.,Department of Chemical and Biomolecular Engineering, Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA
| | - Irene A Chen
- Program in Biomolecular Science and Engineering, University of California, Santa Barbara, CA, 93106, USA. .,Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA, 93106, USA. .,Department of Chemical and Biomolecular Engineering, Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, 90095, USA.
| |
Collapse
|
7
|
Tang Y, Dong Q, Wang T, Gong L, Gu Y. PNET2 is a component of the plant nuclear lamina and is required for proper genome organization and activity. Dev Cell 2021; 57:19-31.e6. [PMID: 34822788 DOI: 10.1016/j.devcel.2021.11.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 09/24/2021] [Accepted: 10/29/2021] [Indexed: 01/01/2023]
Abstract
The interaction between chromatin and the nuclear lamina (NL) is intrinsically important to the establishment of three-dimensional chromatin architecture and spatiotemporal regulation of gene expression. However, critical regulators involved in this process are poorly understood in plants. Here, we report that Arabidopsis PNET2 and its two homologs are bona fide inner nuclear membrane proteins and integral components of the NL. PNET2s physically interact with the plant nucleoskeleton and engage nucleosome-enriched chromatin at the nuclear periphery. Loss of all three PNET2s leads to severely disrupted growth and development, concomitant activation of abiotic and biotic stress responses, and ultimate lethality in Arabidopsis. The pent2 triple mutant also displays drastic transcriptome changes accompanied by a globally altered chromatin architecture revealed by HiC analysis. Our study identified PNET2 as an inner nuclear membrane (INM) component of the NL, which associates with chromatin and play a critical role in orchestrating gene expression and chromatin organization in plants.
Collapse
Affiliation(s)
- Yu Tang
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, CA 94720, USA
| | - Qianli Dong
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Tianya Wang
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Lei Gong
- Key Laboratory of Molecular Epigenetics of the Ministry of Education (MOE), Northeast Normal University, Changchun 130024, China
| | - Yangnan Gu
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, CA 94720, USA.
| |
Collapse
|
8
|
Hajredini F, Ghose R. A Conserved Structural Role for the Walker-A Lysine in P-Loop Containing Kinases. Front Mol Biosci 2021; 8:747206. [PMID: 34660698 PMCID: PMC8517177 DOI: 10.3389/fmolb.2021.747206] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Accepted: 09/20/2021] [Indexed: 12/01/2022] Open
Abstract
Bacterial tyrosine kinases (BY-kinases) and shikimate kinases (SKs) comprise two structurally divergent P-loop containing enzyme families that share similar catalytic site geometries, most notably with respect to their Walker-A, Walker-B, and DxD motifs. We had previously demonstrated that in BY-kinases, a specific interaction between the Walker-A and Walker-B motifs, driven by the conserved “catalytic” lysine housed on the former, leads to a conformation that is unable to efficiently coordinate Mg2+•ATP and is therefore incapable of chemistry. Here, using enhanced sampling molecular dynamics simulations, we demonstrate that structurally similar interactions between the Walker-A and Walker-B motifs, also mediated by the catalytic lysine, stabilize a state in SKs that deviates significantly from one that is necessary for the optimal coordination of Mg2+•ATP. This structural role of the Walker-A lysine is a general feature in SKs and is found to be present in members that encode a Walker-B sequence characteristic of the family (Coxiella burnetii SK), and in those that do not (Mycobacterium tuberculosis SK). Thus, the structural role of the Walker-A lysine in stabilizing an inactive state, distinct from its catalytic function, is conserved between two distantly related P-loop containing kinase families, the SKs and the BY-kinases. The universal conservation of this element, and of the key characteristics of its associated interaction partners within the Walker motifs of P-loop containing enzymes, suggests that this structural role of the Walker-A lysine is perhaps a widely deployed regulatory mechanism within this ancient family.
Collapse
Affiliation(s)
- Fatlum Hajredini
- Department of Chemistry and Biochemistry, The City College of New York, New York, NY, United States.,PhD Program in Biochemistry, The Graduate Center of CUNY, New York, NY, United States
| | - Ranajeet Ghose
- Department of Chemistry and Biochemistry, The City College of New York, New York, NY, United States.,PhD Program in Biochemistry, The Graduate Center of CUNY, New York, NY, United States.,PhD Program in Chemistry, The Graduate Center of CUNY, New York, NY, United States.,PhD Program in Physics, The Graduate Center of CUNY, New York, NY, United States
| |
Collapse
|
9
|
Structure of an open conformation of T7 DNA polymerase reveals novel structural features regulating primer-template stabilization at the polymerization active site. Biochem J 2021; 478:2665-2679. [PMID: 34160020 DOI: 10.1042/bcj20200922] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 06/21/2021] [Accepted: 06/23/2021] [Indexed: 01/25/2023]
Abstract
The crystal structure of full-length T7 DNA polymerase in complex with its processivity factor thioredoxin and double-stranded DNA in the polymerization active site exhibits two novel structural motifs in family-A DNA polymerases: an extended β-hairpin at the fingers subdomain, that interacts with the DNA template strand downstream the primer-terminus, and a helix-loop-helix motif (insertion1) located between residues 102 to 122 in the exonuclease domain. The extended β-hairpin is involved in nucleotide incorporation on substrates with 5'-overhangs longer than 2 nt, suggesting a role in stabilizing the template strand into the polymerization domain. Our biochemical data reveal that insertion1 of the exonuclease domain makes stabilizing interactions that facilitate proofreading by shuttling the primer strand into the exonuclease active site. Overall, our studies evidence conservation of the 3'-5' exonuclease domain fold between family-A DNA polymerases and highlight the modular architecture of T7 DNA polymerase. Our data suggest that the intercalating β-hairpin guides the template-strand into the polymerization active site after the T7 primase-helicase unwinds the DNA double helix ameliorating the formation of secondary structures and decreasing the appearance of indels.
Collapse
|
10
|
Kolodny R, Nepomnyachiy S, Tawfik DS, Ben-Tal N. Bridging Themes: Short Protein Segments Found in Different Architectures. Mol Biol Evol 2021; 38:2191-2208. [PMID: 33502503 PMCID: PMC8136508 DOI: 10.1093/molbev/msab017] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The vast majority of theoretically possible polypeptide chains do not fold, let alone confer function. Hence, protein evolution from preexisting building blocks has clear potential advantages over ab initio emergence from random sequences. In support of this view, sequence similarities between different proteins is generally indicative of common ancestry, and we collectively refer to such homologous sequences as "themes." At the domain level, sequence homology is routinely detected. However, short themes which are segments, or fragments of intact domains, are particularly interesting because they may provide hints about the emergence of domains, as opposed to divergence of preexisting domains, or their mixing-and-matching to form multi-domain proteins. Here we identified 525 representative short themes, comprising 20-80 residues that are unexpectedly shared between domains considered to have emerged independently. Among these "bridging themes" are ones shared between the most ancient domains, for example, Rossmann, P-loop NTPase, TIM-barrel, flavodoxin, and ferredoxin-like. We elaborate on several particularly interesting cases, where the bridging themes mediate ligand binding. Ligand binding may have contributed to the stability and the plasticity of these building blocks, and to their ability to invade preexisting domains or serve as starting points for completely new domains.
Collapse
Affiliation(s)
- Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | | | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Nir Ben-Tal
- George S. Wise Faculty of Life Sciences, Department of Biochemistry and Molecular Biology, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
11
|
Affiliation(s)
- Dragana Despotovic
- Department of Biomolecular Sciences Weizmann Institute of Science Rehovot 7610001 Israel
| | - Dan S. Tawfik
- Department of Biomolecular Sciences Weizmann Institute of Science Rehovot 7610001 Israel
| |
Collapse
|
12
|
M. Iyer L, Anantharaman V, Krishnan A, Burroughs AM, Aravind L. Jumbo Phages: A Comparative Genomic Overview of Core Functions and Adaptions for Biological Conflicts. Viruses 2021; 13:v13010063. [PMID: 33466489 PMCID: PMC7824862 DOI: 10.3390/v13010063] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 12/31/2020] [Accepted: 12/31/2020] [Indexed: 02/07/2023] Open
Abstract
Jumbo phages have attracted much attention by virtue of their extraordinary genome size and unusual aspects of biology. By performing a comparative genomics analysis of 224 jumbo phages, we suggest an objective inclusion criterion based on genome size distributions and present a synthetic overview of their manifold adaptations across major biological systems. By means of clustering and principal component analysis of the phyletic patterns of conserved genes, all known jumbo phages can be classified into three higher-order groups, which include both myoviral and siphoviral morphologies indicating multiple independent origins from smaller predecessors. Our study uncovers several under-appreciated or unreported aspects of the DNA replication, recombination, transcription and virion maturation systems. Leveraging sensitive sequence analysis methods, we identify novel protein-modifying enzymes that might help hijack the host-machinery. Focusing on host–virus conflicts, we detect strategies used to counter different wings of the bacterial immune system, such as cyclic nucleotide- and NAD+-dependent effector-activation, and prevention of superinfection during pseudolysogeny. We reconstruct the RNA-repair systems of jumbo phages that counter the consequences of RNA-targeting host effectors. These findings also suggest that several jumbo phage proteins provide a snapshot of the systems found in ancient replicons preceding the last universal ancestor of cellular life.
Collapse
Affiliation(s)
- Lakshminarayan M. Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (L.M.I.); (V.A.); (A.M.B.)
| | - Vivek Anantharaman
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (L.M.I.); (V.A.); (A.M.B.)
| | - Arunkumar Krishnan
- Department of Biological Sciences, Indian Institute of Science Education and Research (IISER) Berhampur, Odisha 760010, India;
| | - A. Maxwell Burroughs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (L.M.I.); (V.A.); (A.M.B.)
| | - L. Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (L.M.I.); (V.A.); (A.M.B.)
- Correspondence:
| |
Collapse
|
13
|
Medvedev KE, Kinch LN, Dustin Schaeffer R, Pei J, Grishin NV. A Fifth of the Protein World: Rossmann-like Proteins as an Evolutionarily Successful Structural unit. J Mol Biol 2020; 433:166788. [PMID: 33387532 DOI: 10.1016/j.jmb.2020.166788] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 11/26/2020] [Accepted: 12/18/2020] [Indexed: 10/22/2022]
Abstract
The Rossmann-like fold is the most prevalent and diversified doubly-wound superfold of ancient evolutionary origin. Rossmann-like domains are present in a variety of metabolic enzymes and are capable of binding diverse ligands. Discerning evolutionary relationships among these domains is challenging because of their diverse functions and ancient origin. We defined a minimal Rossmann-like structural motif (RLM), identified RLM-containing domains among known 3D structures (20%) and classified them according to their homologous relationships. New classifications were incorporated into our Evolutionary Classification of protein Domains (ECOD) database. We defined 156 homology groups (H-groups), which were further clustered into 123 possible homology groups (X-groups). Our analysis revealed that RLM-containing proteins constitute approximately 15% of the human proteome. We found that disease-causing mutations are more frequent within RLM domains than within non-RLM domains of these proteins, highlighting the importance of RLM-containing proteins for human health.
Collapse
Affiliation(s)
- Kirill E Medvedev
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, United States.
| | - Lisa N Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - R Dustin Schaeffer
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Jimin Pei
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX, United States
| | - Nick V Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, United States; Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX, United States; Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, United States.
| |
Collapse
|
14
|
Brzezinski K. S-adenosyl-l-homocysteine Hydrolase: A Structural Perspective on the Enzyme with Two Rossmann-Fold Domains. Biomolecules 2020; 10:biom10121682. [PMID: 33339190 PMCID: PMC7765604 DOI: 10.3390/biom10121682] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 12/09/2020] [Accepted: 12/15/2020] [Indexed: 12/27/2022] Open
Abstract
S-adenosyl-l-homocysteine hydrolase (SAHase) is a major regulator of cellular methylation reactions that occur in eukaryotic and prokaryotic organisms. SAHase activity is also a significant source of l-homocysteine and adenosine, two compounds involved in numerous vital, as well as pathological processes. Therefore, apart from cellular methylation, the enzyme may also influence other processes important for the physiology of particular organisms. Herein, presented is the structural characterization and comparison of SAHases of eukaryotic and prokaryotic origin, with an emphasis on the two principal domains of SAHase subunit based on the Rossmann motif. The first domain is involved in the binding of a substrate, e.g., S-adenosyl-l-homocysteine or adenosine and the second domain binds the NAD+ cofactor. Despite their structural similarity, the molecular interactions between an adenosine-based ligand molecule and macromolecular environment are different in each domain. As a consequence, significant differences in the conformation of d-ribofuranose rings of nucleoside and nucleotide ligands, especially those attached to adenosine moiety, are observed. On the other hand, the chemical nature of adenine ring recognition, as well as an orientation of the adenine ring around the N-glycosidic bond are of high similarity for the ligands bound in the substrate- and cofactor-binding domains.
Collapse
Affiliation(s)
- Krzysztof Brzezinski
- Laboratory of Structural Microbiology, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| |
Collapse
|
15
|
Longo LM, Jabłońska J, Vyas P, Kanade M, Kolodny R, Ben-Tal N, Tawfik DS. On the emergence of P-Loop NTPase and Rossmann enzymes from a Beta-Alpha-Beta ancestral fragment. eLife 2020; 9:e64415. [PMID: 33295875 PMCID: PMC7758060 DOI: 10.7554/elife.64415] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 12/04/2020] [Indexed: 12/14/2022] Open
Abstract
This article is dedicated to the memory of Michael G. Rossmann. Dating back to the last universal common ancestor, P-loop NTPases and Rossmanns comprise the most ubiquitous and diverse enzyme lineages. Despite similarities in their overall architecture and phosphate binding motif, a lack of sequence identity and some fundamental structural differences currently designates them as independent emergences. We systematically searched for structure and sequence elements shared by both lineages. We detected homologous segments that span the first βαβ motif of both lineages, including the phosphate binding loop and a conserved aspartate at the tip of β2. The latter ligates the catalytic metal in P-loop NTPases, while in Rossmanns it binds the nucleotide's ribose moiety. Tubulin, a Rossmann GTPase, demonstrates the potential of the β2-Asp to take either one of these two roles. While convergence cannot be completely ruled out, we show that both lineages likely emerged from a common βαβ segment that comprises the core of these enzyme families to this very day.
Collapse
Affiliation(s)
- Liam M Longo
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Jagoda Jabłońska
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Pratik Vyas
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Manil Kanade
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Rachel Kolodny
- University of Haifa, Department of Computer ScienceHaifaIsrael
| | - Nir Ben-Tal
- Tel Aviv University, George S. Wise Faculty of Life Sciences, Department of Biochemistry and Molecular BiologyTel AvivIsrael
| | - Dan S Tawfik
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| |
Collapse
|
16
|
Aline Dias da P, Nathalia Marins de A, Gabriel Guarany de A, Robson Francisco de S, Cristiane Rodrigues G. The World of Cyclic Dinucleotides in Bacterial Behavior. Molecules 2020; 25:molecules25102462. [PMID: 32466317 PMCID: PMC7288161 DOI: 10.3390/molecules25102462] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2019] [Revised: 03/05/2020] [Accepted: 03/17/2020] [Indexed: 02/07/2023] Open
Abstract
The regulation of multiple bacterial phenotypes was found to depend on different cyclic dinucleotides (CDNs) that constitute intracellular signaling second messenger systems. Most notably, c-di-GMP, along with proteins related to its synthesis, sensing, and degradation, was identified as playing a central role in the switching from biofilm to planktonic modes of growth. Recently, this research topic has been under expansion, with the discoveries of new CDNs, novel classes of CDN receptors, and the numerous functions regulated by these molecules. In this review, we comprehensively describe the three main bacterial enzymes involved in the synthesis of c-di-GMP, c-di-AMP, and cGAMP focusing on description of their three-dimensional structures and their structural similarities with other protein families, as well as the essential residues for catalysis. The diversity of CDN receptors is described in detail along with the residues important for the interaction with the ligand. Interestingly, genomic data strongly suggest that there is a tendency for bacterial cells to use both c-di-AMP and c-di-GMP signaling networks simultaneously, raising the question of whether there is crosstalk between different signaling systems. In summary, the large amount of sequence and structural data available allows a broad view of the complexity and the importance of these CDNs in the regulation of different bacterial behaviors. Nevertheless, how cells coordinate the different CDN signaling networks to ensure adaptation to changing environmental conditions is still open for much further exploration.
Collapse
|
17
|
Medvedev KE, Kinch LN, Schaeffer RD, Grishin NV. Functional analysis of Rossmann-like domains reveals convergent evolution of topology and reaction pathways. PLoS Comput Biol 2019; 15:e1007569. [PMID: 31869345 PMCID: PMC6957218 DOI: 10.1371/journal.pcbi.1007569] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Revised: 01/13/2020] [Accepted: 11/26/2019] [Indexed: 12/18/2022] Open
Abstract
Rossmann folds are ancient, frequently diverged domains found in many biological reaction pathways where they have adapted for different functions. Consequently, discernment and classification of their homologous relations and function can be complicated. We define a minimal Rossmann-like structure motif (RLM) that corresponds for the common core of known Rossmann domains and use this motif to identify all RLM domains in the Protein Data Bank (PDB), thus finding they constitute about 20% of all known 3D structures. The Evolutionary Classification of protein structure Domains (ECOD) classifies RLM domains in a number of groups that lack evidence for homology (X-groups), which suggests that they could have evolved independently multiple times. Closely related, homologous RLM enzyme families can diverge to bind different ligands using similar binding sites and to catalyze different reactions. Conversely, non-homologous RLM domains can converge to catalyze the same reactions or to bind the same ligand with alternate binding modes. We discuss a special case of such convergent evolution that is relevant to the polypharmacology paradigm, wherein the same drug (methotrexate) binds to multiple non-homologous RLM drug targets with different topologies. Finally, assigning proteins with RLM domain to the Enzyme Commission classification suggest that RLM enzymes function mainly in metabolism (and comprise 38% of reference metabolic pathways) and are overrepresented in extant pathways that represent ancient biosynthetic routes such as nucleotide metabolism, energy metabolism, and metabolism of amino acids. In fact, RLM enzymes take part in five out of eight enzymatic reactions of the Wood-Ljungdahl metabolic pathway thought to be used by the last universal common ancestor (LUCA). The prevalence of RLM domains in this ancient metabolism might explain their wide distribution among enzymes.
Collapse
Affiliation(s)
- Kirill E. Medvedev
- Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Lisa N. Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - R. Dustin Schaeffer
- Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| | - Nick V. Grishin
- Departments of Biophysics and Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, United States of America
| |
Collapse
|
18
|
Martínez-Jiménez MI, Calvo PA, García-Gómez S, Guerra-González S, Blanco L. The Zn-finger domain of human PrimPol is required to stabilize the initiating nucleotide during DNA priming. Nucleic Acids Res 2019; 46:4138-4151. [PMID: 29608762 PMCID: PMC5934617 DOI: 10.1093/nar/gky230] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 03/19/2018] [Indexed: 11/30/2022] Open
Abstract
Human PrimPol is a monomeric enzyme whose DNA primase activity is required to rescue stalled replication forks during nuclear and mitochondrial DNA replication. PrimPol contains an Archeal-Eukaryotic Primases (AEP) core followed by a C-terminal Zn finger-containing domain (ZnFD), that is exclusively required for primer formation and for PrimPol function in vivo. The present study describes the sequential substrate interactions of human PrimPol during primer synthesis, and the relevance of the ZnFD at each individual step. Both the formation of a PrimPol:ssDNA binary complex and the upcoming interaction with the 3′-nucleotide (pre-ternary complex) remained intact when lacking the ZnFD. Conversely, the ZnFD was required for the subsequent binding and selection of the 5′-nucleotide that will become the first nucleotide of the new primer strand. Providing different 5′-site nucleotides, we can conclude that the ZnFD of PrimPol most likely interacts with the γ-phosphate moiety of the 5′-site nucleotide, optimizing formation of the initial dimer. Moreover, the ZnFD also contributes to recognize the cryptic G at the preferred priming sequence 3′GTC5′. Dimer elongation to obtain long DNA primers occurs processively and is facilitated by the 5′-terminal triphosphate, indicating that the ZnFD is also essential in the subsequent translocation/elongation events during DNA primer synthesis.
Collapse
Affiliation(s)
- María I Martínez-Jiménez
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), c/ Nicolás Cabrera 1, 28049 Cantoblanco, Madrid, Spain
| | - Patricia A Calvo
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), c/ Nicolás Cabrera 1, 28049 Cantoblanco, Madrid, Spain
| | - Sara García-Gómez
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), c/ Nicolás Cabrera 1, 28049 Cantoblanco, Madrid, Spain
| | - Susana Guerra-González
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), c/ Nicolás Cabrera 1, 28049 Cantoblanco, Madrid, Spain
| | - Luis Blanco
- Centro de Biología Molecular Severo Ochoa (CSIC-UAM), c/ Nicolás Cabrera 1, 28049 Cantoblanco, Madrid, Spain
| |
Collapse
|
19
|
Blanco L, Calvo PA, Diaz-Talavera A, Carvalho G, Calero N, Martínez-Carrón A, Velázquez-Ruiz C, Villadangos S, Guerra S, Martínez-Jiménez MI. Mechanism of DNA primer synthesis by human PrimPol. Enzymes 2019; 45:289-310. [PMID: 31627881 DOI: 10.1016/bs.enz.2019.06.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
PrimPol is the second primase discovered in eukaryotic cells, whose function is to restart the stalled replication forks during both mitochondrial and nuclear DNA replication. This chapter revises our current knowledge about the mechanism of synthesis of DNA primers by human PrimPol, and the importance of its distinctive Zn-finger domain (ZnFD). After PrimPol forms a binary complex with ssDNA, the formation of the pre-ternary complex strictly requires the presence of Mn2+ ions to stabilize the interaction of the incoming deoxynucleotide at the 3'-site. The capacity to bind both ssDNA template and 3'-deoxynucleotide was shown to reside in the AEP core of PrimPol, with ZnFD being dispensable at these two early steps of the primase reaction. Sugar selection favoring dNTPs versus NTPs at the 3' site is mediated by a specific tyrosine (Tyr100) acting as a steric gate. Besides, a specific glutamate residue (Glu116) conforming a singular A motif (DxE) promotes the use of Mn2+ to stabilize the pre-ternary complex. Mirroring the function of the PriL subunit of dimeric AEP primases, the ZnFD of PrimPol is crucial to stabilize the initiating 5'-nucleotide, specifically interacting with the gamma-phosphate. Such an interaction is crucial to optimize dimer formation and the subsequent translocation events leading to the processive synthesis of a mature DNA primer. Finally, the capacity of PrimPol to tolerate lesions is discussed in the context of its DNA primase function, and its potential as a TLS primase.
Collapse
Affiliation(s)
- Luis Blanco
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, Madrid, Spain.
| | - Patricia A Calvo
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, Madrid, Spain
| | | | - Gustavo Carvalho
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, Madrid, Spain
| | - Nieves Calero
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, Madrid, Spain
| | | | | | | | - Susana Guerra
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, Madrid, Spain
| | | |
Collapse
|
20
|
Chouhan BPS, Maimaiti S, Gade M, Laurino P. Rossmann-Fold Methyltransferases: Taking a "β-Turn" around Their Cofactor, S-Adenosylmethionine. Biochemistry 2018; 58:166-170. [PMID: 30406995 DOI: 10.1021/acs.biochem.8b00994] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Methyltransferases (MTases) are superfamilies of enzymes that catalyze the transfer of a methyl group from S-adenosylmethionine (SAM), a nucleoside-based cofactor, to a wide variety of substrates such as DNA, RNA, proteins, small molecules, and lipids. Depending upon their structural features, the MTases can be further classified into different classes; we consider exclusively the largest class of MTases, the Rossmann-fold MTases. It has been shown that the nucleoside cofactor-binding Rossmann enzymes, particularly the nicotinamide adenine dinucleotide (NAD)-, flavin adenine dinucleotide (FAD)-, and SAM-binding MTases enzymes, share common binding motifs that include a Gly-rich loop region that interacts with the cofactor and a highly conserved acidic residue (Asp/Glu) that interacts with the ribose moiety of the cofactor. Here, we observe that the Gly-rich loop region of the Rossmann MTases adapts a specific type II' β-turn in the proximity of the cofactor (<4 Å), and it appears to be a key feature of these superfamilies. Additionally, we demonstrate that the conservation of this β-turn could play a critical role in the enzyme-cofactor interaction, thereby shedding new light on the structural conformation of the Gly-rich loop region from Rossmann MTases.
Collapse
Affiliation(s)
- Bhanu Pratap Singh Chouhan
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Shayida Maimaiti
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Madhuri Gade
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Paola Laurino
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| |
Collapse
|
21
|
Blacklock KM, Yang L, Mulligan VK, Khare SD. A computational method for the design of nested proteins by loop-directed domain insertion. Proteins 2018; 86:354-369. [PMID: 29250820 DOI: 10.1002/prot.25445] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 12/04/2017] [Accepted: 12/15/2017] [Indexed: 12/23/2022]
Abstract
The computational design of novel nested proteins-in which the primary structure of one protein domain (insert) is flanked by the primary structure segments of another (parent)-would enable the generation of multifunctional proteins. Here we present a new algorithm, called Loop-Directed Domain Insertion (LooDo), implemented within the Rosetta software suite, for the purpose of designing nested protein domain combinations connected by flexible linker regions. Conformational space for the insert domain is sampled using large libraries of linker fragments for linker-to-parent domain superimposition followed by insert-to-linker superimposition. The relative positioning of the two domains (treated as rigid bodies) is sampled efficiently by a grid-based, mutual placement compatibility search. The conformations of the loop residues, and the identities of loop as well as interface residues, are simultaneously optimized using a generalized kinematic loop closure algorithm and Rosetta EnzymeDesign, respectively, to minimize interface energy. The algorithm was found to consistently sample near-native conformations and interface sequences for a benchmark set of structurally similar but functionally divergent domain-inserted enzymes from the α/β hydrolase superfamily, and discriminates well between native and nonnative conformations and sequences, although loop conformations tended to deviate from the native conformations. Furthermore, in cross-domain placement tests, native insert-parent domain combinations were ranked as the best-scoring structures compared to nonnative domain combinations. This algorithm should be broadly applicable to the design of multi-domain protein complexes with any combination of inserted or tandem domain connections.
Collapse
Affiliation(s)
- Kristin M Blacklock
- Institute for Quantitative Biomedicine, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Department of Chemistry and Chemical Biology, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Center for Integrative Proteomics Research, Rutgers The State University of New Jersey, Piscataway, New Jersey
| | - Lu Yang
- Department of Chemistry and Chemical Biology, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Center for Integrative Proteomics Research, Rutgers The State University of New Jersey, Piscataway, New Jersey
| | - Vikram K Mulligan
- Institute for Protein Design and Department of Biochemistry, University of Washington, Seattle, Washington
| | - Sagar D Khare
- Institute for Quantitative Biomedicine, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Department of Chemistry and Chemical Biology, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Center for Integrative Proteomics Research, Rutgers The State University of New Jersey, Piscataway, New Jersey
| |
Collapse
|
22
|
Affiliation(s)
- Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Artem S. Novozhilov
- Department of Mathematics, North Dakota State University, Fargo, North Dakota 58108, USA
| |
Collapse
|
23
|
Nikolskaya AN, Arighi CN, Huang H, Barker WC, Wu CH. PIRSF Family Classification System for Protein Functional and Evolutionary Analysis. Evol Bioinform Online 2017. [DOI: 10.1177/117693430600200033] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The PIRSF protein classification system ( http://pir.georgetown.edu/pirsf/ ) reflects evolutionary relationships of full-length proteins and domains. The primary PIRSF classification unit is the homeomorphic family, whose members are both homologous (evolved from a common ancestor) and homeomorphic (sharing full-length sequence similarity and a common domain architecture). PIRSF families are curated systematically based on literature review and integrative sequence and functional analysis, including sequence and structure similarity, domain architecture, functional association, genome context, and phyletic pattern. The results of classification and expert annotation are summarized in PIRSF family reports with graphical viewers for taxonomic distribution, domain architecture, family hierarchy, and multiple alignment and phylogenetic tree. The PIRSF system provides a comprehensive resource for bioinformatics analysis and comparative studies of protein function and evolution. Domain or fold-based searches allow identification of evolutionarily related protein families sharing domains or structural folds. Functional convergence and functional divergence are revealed by the relationships between protein classification and curated family functions. The taxonomic distribution allows the identification of lineage-specific or broadly conserved protein families and can reveal horizontal gene transfer. Here we demonstrate, with illustrative examples, how to use the web-based PIRSF system as a tool for functional and evolutionary studies of protein families.
Collapse
Affiliation(s)
| | - Cecilia N. Arighi
- Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology
| | - Hongzhan Huang
- Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology
| | - Winona C. Barker
- Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology
| | - Cathy H. Wu
- Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology
| |
Collapse
|
24
|
Harding T, Roger AJ, Simpson AGB. Adaptations to High Salt in a Halophilic Protist: Differential Expression and Gene Acquisitions through Duplications and Gene Transfers. Front Microbiol 2017; 8:944. [PMID: 28611746 PMCID: PMC5447177 DOI: 10.3389/fmicb.2017.00944] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2017] [Accepted: 05/11/2017] [Indexed: 11/13/2022] Open
Abstract
The capacity of halophiles to thrive in extreme hypersaline habitats derives partly from the tight regulation of ion homeostasis, the salt-dependent adjustment of plasma membrane fluidity, and the increased capability to manage oxidative stress. Halophilic bacteria, and archaea have been intensively studied, and substantial research has been conducted on halophilic fungi, and the green alga Dunaliella. By contrast, there have been very few investigations of halophiles that are phagotrophic protists, i.e., protozoa. To gather fundamental knowledge about salt adaptation in these organisms, we studied the transcriptome-level response of Halocafeteria seosinensis (Stramenopiles) grown under contrasting salinities. We provided further evolutionary context to our analysis by identifying genes that underwent recent duplications. Genes that were highly responsive to salinity variations were involved in stress response (e.g., chaperones), ion homeostasis (e.g., Na+/H+ transporter), metabolism and transport of lipids (e.g., sterol biosynthetic genes), carbohydrate metabolism (e.g., glycosidases), and signal transduction pathways (e.g., transcription factors). A significantly high proportion (43%) of duplicated genes were also differentially expressed, accentuating the importance of gene expansion in adaptation by H. seosinensis to high salt environments. Furthermore, we found two genes that were lateral acquisitions from bacteria, and were also highly up-regulated and highly expressed at high salt, suggesting that this evolutionary mechanism could also have facilitated adaptation to high salt. We propose that a transition toward high-salt adaptation in the ancestors of H. seosinensis required the acquisition of new genes via duplication, and some lateral gene transfers (LGTs), as well as the alteration of transcriptional programs, leading to increased stress resistance, proper establishment of ion gradients, and modification of cell structure properties like membrane fluidity.
Collapse
Affiliation(s)
- Tommy Harding
- Department of Biochemistry and Molecular Biology, Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie UniversityHalifax, NS, Canada
| | - Andrew J. Roger
- Department of Biochemistry and Molecular Biology, Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie UniversityHalifax, NS, Canada
| | - Alastair G. B. Simpson
- Department of Biology and Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie UniversityHalifax, NS, Canada
| |
Collapse
|
25
|
Frozen Accident Pushing 50: Stereochemistry, Expansion, and Chance in the Evolution of the Genetic Code. Life (Basel) 2017; 7:life7020022. [PMID: 28545255 PMCID: PMC5492144 DOI: 10.3390/life7020022] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Revised: 05/19/2017] [Accepted: 05/20/2017] [Indexed: 12/31/2022] Open
Abstract
Nearly 50 years ago, Francis Crick propounded the frozen accident scenario for the evolution of the genetic code along with the hypothesis that the early translation system consisted primarily of RNA. Under the frozen accident perspective, the code is universal among modern life forms because any change in codon assignment would be highly deleterious. The frozen accident can be considered the default theory of code evolution because it does not imply any specific interactions between amino acids and the cognate codons or anticodons, or any particular properties of the code. The subsequent 49 years of code studies have elucidated notable features of the standard code, such as high robustness to errors, but failed to develop a compelling explanation for codon assignments. In particular, stereochemical affinity between amino acids and the cognate codons or anticodons does not seem to account for the origin and evolution of the code. Here, I expand Crick’s hypothesis on RNA-only translation system by presenting evidence that this early translation already attained high fidelity that allowed protein evolution. I outline an experimentally testable scenario for the evolution of the code that combines a distinct version of the stereochemical hypothesis, in which amino acids are recognized via unique sites in the tertiary structure of proto-tRNAs, rather than by anticodons, expansion of the code via proto-tRNA duplication, and the frozen accident.
Collapse
|
26
|
Chatterjee S, Patra MM, Samaddar S, Basu A, Das Gupta SK. Mutual interaction enables the mycobacterial plasmid pAL5000 origin binding protein RepB to recruit RepA, the plasmid replicase, to the origin. Microbiology (Reading) 2017; 163:595-610. [DOI: 10.1099/mic.0.000447] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Affiliation(s)
- Soniya Chatterjee
- Department of Microbiology, Bose Institute, P-1/12 C.I.T. Scheme VII-M Kolkata 700054, India
| | - Madhu Manti Patra
- Department of Microbiology, Bose Institute, P-1/12 C.I.T. Scheme VII-M Kolkata 700054, India
| | - Sourabh Samaddar
- Department of Microbiology, Bose Institute, P-1/12 C.I.T. Scheme VII-M Kolkata 700054, India
| | - Arnab Basu
- Department of Biochemistry and Molecular Biology, Saint Louis University, One North Grand, MO 63103, USA
| | - Sujoy K Das Gupta
- Department of Microbiology, Bose Institute, P-1/12 C.I.T. Scheme VII-M Kolkata 700054, India
| |
Collapse
|
27
|
Zhang D, Burroughs AM, Vidal ND, Iyer LM, Aravind L. Transposons to toxins: the provenance, architecture and diversification of a widespread class of eukaryotic effectors. Nucleic Acids Res 2016; 44:3513-33. [PMID: 27060143 PMCID: PMC4857004 DOI: 10.1093/nar/gkw221] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2016] [Accepted: 03/22/2016] [Indexed: 01/13/2023] Open
Abstract
Enzymatic effectors targeting nucleic acids, proteins and other cellular components are the mainstay of conflicts across life forms. Using comparative genomics we identify a large class of eukaryotic proteins, which include effectors from oomycetes, fungi and other parasites. The majority of these proteins have a characteristic domain architecture with one of several N-terminal 'Header' domains, which are predicted to play a role in trafficking of these effectors, including a novel version of the Ubiquitin fold. The Headers are followed by one or more diverse C-terminal domains, such as restriction endonuclease (REase), protein kinase, HNH endonuclease, LK-nuclease (a RNase) and multiple distinct peptidase domains, which are predicted to carry their toxicity determinants. The most common types of these proteins appear to have originated from prokaryotic transposases (e.g. TN7 and Mu) and combine a CDC6/ORC1-STAND clade NTPase domain with a C-terminal REase domain. Other than the so-called Crinkler effectors of oomycetes and fungi, these effectors are encoded by other eukaryotic parasites such as trypanosomatids (the RHS proteins) and the rhizarian Plasmodiophora, and symbionts like Capsaspora Remarkably, we also find these proteins in free-living eukaryotes, including several viridiplantae, fungi, amoebozoans and animals. These versions might either still be transposons or function in other poorly understood eukaryote-specific inter-organismal and inter-genomic conflicts. These include the Medea1 selfish element of Tribolium that spreads via post-zygotic killing. We present a unified mechanism for the recombination-dependent diversification and action of this widespread class of molecular weaponry deployed across diverse conflicts ranging from parasitic to free-living forms.
Collapse
Affiliation(s)
- Dapeng Zhang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - A Maxwell Burroughs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Newton D Vidal
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
28
|
Laurino P, Tóth-Petróczy Á, Meana-Pañeda R, Lin W, Truhlar DG, Tawfik DS. An Ancient Fingerprint Indicates the Common Ancestry of Rossmann-Fold Enzymes Utilizing Different Ribose-Based Cofactors. PLoS Biol 2016; 14:e1002396. [PMID: 26938925 PMCID: PMC4777477 DOI: 10.1371/journal.pbio.1002396] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2015] [Accepted: 01/29/2016] [Indexed: 01/30/2023] Open
Abstract
Nucleoside-based cofactors are presumed to have preceded proteins. The Rossmann fold is one of the most ancient and functionally diverse protein folds, and most Rossmann enzymes utilize nucleoside-based cofactors. We analyzed an omnipresent Rossmann ribose-binding interaction: a carboxylate side chain at the tip of the second β-strand (β2-Asp/Glu). We identified a canonical motif, defined by the β2-topology and unique geometry. The latter relates to the interaction being bidentate (both ribose hydroxyls interacting with the carboxylate oxygens), to the angle between the carboxylate and the ribose, and to the ribose's ring configuration. We found that this canonical motif exhibits hallmarks of divergence rather than convergence. It is uniquely found in Rossmann enzymes that use different cofactors, primarily SAM (S-adenosyl methionine), NAD (nicotinamide adenine dinucleotide), and FAD (flavin adenine dinucleotide). Ribose-carboxylate bidentate interactions in other folds are not only rare but also have a different topology and geometry. We further show that the canonical geometry is not dictated by a physical constraint--geometries found in noncanonical interactions have similar calculated bond energies. Overall, these data indicate the divergence of several major Rossmann-fold enzyme classes, with different cofactors and catalytic chemistries, from a common pre-LUCA (last universal common ancestor) ancestor that possessed the β2-Asp/Glu motif.
Collapse
Affiliation(s)
- Paola Laurino
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Ágnes Tóth-Petróczy
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Rubén Meana-Pañeda
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Wei Lin
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Donald G. Truhlar
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Dan S. Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
29
|
Zhang J, Landick R. A Two-Way Street: Regulatory Interplay between RNA Polymerase and Nascent RNA Structure. Trends Biochem Sci 2016; 41:293-310. [PMID: 26822487 DOI: 10.1016/j.tibs.2015.12.009] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Revised: 12/21/2015] [Accepted: 12/22/2015] [Indexed: 02/06/2023]
Abstract
The vectorial (5'-to-3' at varying velocity) synthesis of RNA by cellular RNA polymerases (RNAPs) creates a rugged kinetic landscape, demarcated by frequent, sometimes long-lived, pauses. In addition to myriad gene-regulatory roles, these pauses temporally and spatially program the co-transcriptional, hierarchical folding of biologically active RNAs. Conversely, these RNA structures, which form inside or near the RNA exit channel, interact with the polymerase and adjacent protein factors to influence RNA synthesis by modulating pausing, termination, antitermination, and slippage. Here, we review the evolutionary origin, mechanistic underpinnings, and regulatory consequences of this interplay between RNAP and nascent RNA structure. We categorize and rationalize the extensive linkage between the transcriptional machinery and its product, and provide a framework for future studies.
Collapse
Affiliation(s)
- Jinwei Zhang
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, MD 20892, USA.
| | - Robert Landick
- Departments of Biochemistry and Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
30
|
Mohanta TK, Mohanta N, Mohanta YK, Bae H. Genome-Wide Identification of Calcium Dependent Protein Kinase Gene Family in Plant Lineage Shows Presence of Novel D-x-D and D-E-L Motifs in EF-Hand Domain. FRONTIERS IN PLANT SCIENCE 2015; 6:1146. [PMID: 26734045 PMCID: PMC4690006 DOI: 10.3389/fpls.2015.01146] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Accepted: 12/02/2015] [Indexed: 05/04/2023]
Abstract
Calcium ions are considered ubiquitous second messengers in eukaryotic signal transduction pathways. Intracellular Ca(2+) concentration are modulated by various signals such as hormones and biotic and abiotic stresses. Modulation of Ca(2+) ion leads to stimulation of calcium dependent protein kinase genes (CPKs), which results in regulation of gene expression and therefore mediates plant growth and development as well as biotic and abiotic stresses. Here, we reported the CPK gene family of 40 different plant species (950 CPK genes) and provided a unified nomenclature system for all of them. In addition, we analyzed their genomic, biochemical and structural conserved features. Multiple sequence alignment revealed that the kinase domain, auto-inhibitory domain and EF-hands regions of regulatory domains are highly conserved in nature. Additionally, the EF-hand domains of higher plants were found to contain four D-x-D and two D-E-L motifs, while lower eukaryotic plants had two D-x-D and one D-x-E motifs in their EF-hands. Phylogenetic analysis showed that CPK genes are clustered into four different groups. By studying the CPK gene family across the plant lineage, we provide the first evidence of the presence of D-x-D motif in the calcium binding EF-hand domain of CPK proteins.
Collapse
Affiliation(s)
- Tapan K. Mohanta
- School of Biotechnology, Yeungnam UniversityGyeongsan, South Korea
| | - Nibedita Mohanta
- Department Of Biotechnology, North Orissa UniversityBaripada, India
| | | | - Hanhong Bae
- School of Biotechnology, Yeungnam UniversityGyeongsan, South Korea
| |
Collapse
|
31
|
Iyer LM, Zhang D, Aravind L. Adenine methylation in eukaryotes: Apprehending the complex evolutionary history and functional potential of an epigenetic modification. Bioessays 2015; 38:27-40. [PMID: 26660621 PMCID: PMC4738411 DOI: 10.1002/bies.201500104] [Citation(s) in RCA: 104] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
While N6‐methyladenosine (m6A) is a well‐known epigenetic modification in bacterial DNA, it remained largely unstudied in eukaryotes. Recent studies have brought to fore its potential epigenetic role across diverse eukaryotes with biological consequences, which are distinct and possibly even opposite to the well‐studied 5‐methylcytosine mark. Adenine methyltransferases appear to have been independently acquired by eukaryotes on at least 13 occasions from prokaryotic restriction‐modification and counter‐restriction systems. On at least four to five instances, these methyltransferases were recruited as RNA methylases. Thus, m6A marks in eukaryotic DNA and RNA might be more widespread and diversified than previously believed. Several m6A‐binding protein domains from prokaryotes were also acquired by eukaryotes, facilitating prediction of potential readers for these marks. Further, multiple lineages of the AlkB family of dioxygenases have been recruited as m6A demethylases. Although members of the TET/JBP family of dioxygenases have also been suggested to be m6A demethylases, this proposal needs more careful evaluation. Also watch the Video Abstract.
Collapse
Affiliation(s)
- Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Dapeng Zhang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
32
|
Bhargav SP, Vahokoski J, Kallio JP, Torda AE, Kursula P, Kursula I. Two independently folding units of Plasmodium profilin suggest evolution via gene fusion. Cell Mol Life Sci 2015; 72:4193-203. [PMID: 26012696 PMCID: PMC11113795 DOI: 10.1007/s00018-015-1932-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Revised: 05/13/2015] [Accepted: 05/18/2015] [Indexed: 10/23/2022]
Abstract
Gene fusion is a common mechanism of protein evolution that has mainly been discussed in the context of multidomain or symmetric proteins. Less is known about fusion of ancestral genes to produce small single-domain proteins. Here, we show with a domain-swapped mutant Plasmodium profilin that this small, globular, apparently single-domain protein consists of two foldons. The separation of binding sites for different protein ligands in the two halves suggests evolution via an ancient gene fusion event, analogous to the formation of multidomain proteins. Finally, the two fragments can be assembled together after expression as two separate gene products. The possibility to engineer both domain-swapped dimers and half-profilins that can be assembled back to a full profilin provides perspectives for engineering of novel protein folds, e.g., with different scaffolding functions.
Collapse
Affiliation(s)
| | - Juha Vahokoski
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, P.O. Box 5400, 90014, Oulu, Finland
| | - Juha Pekka Kallio
- Helmholtz Centre for Infection Research, Notkestrasse 85, 22607, Hamburg, Germany
- German Electron Synchrotron (DESY), Notkestrasse 85, 22607, Hamburg, Germany
| | - Andrew E Torda
- Centre for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146, Hamburg, Germany
| | - Petri Kursula
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, P.O. Box 5400, 90014, Oulu, Finland
- Biocenter Oulu, University of Oulu, P.O. Box 5000, 90014, Oulu, Finland
- Department of Biomedicine, University of Bergen, Jonas Lies vei 91, 5009, Bergen, Norway
| | - Inari Kursula
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, P.O. Box 5400, 90014, Oulu, Finland.
- Helmholtz Centre for Infection Research, Notkestrasse 85, 22607, Hamburg, Germany.
- German Electron Synchrotron (DESY), Notkestrasse 85, 22607, Hamburg, Germany.
- Department of Biomedicine, University of Bergen, Jonas Lies vei 91, 5009, Bergen, Norway.
| |
Collapse
|
33
|
Černý J, Černá Bolfíková B, de A Zanotto PM, Grubhoffer L, Růžek D. A deep phylogeny of viral and cellular right-hand polymerases. INFECTION GENETICS AND EVOLUTION 2015; 36:275-286. [PMID: 26431690 DOI: 10.1016/j.meegid.2015.09.026] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Revised: 09/22/2015] [Accepted: 09/28/2015] [Indexed: 12/27/2022]
Abstract
Right-hand polymerases are important players in genome replication and repair in cellular organisms as well as in viruses. All right-hand polymerases are grouped into seven related protein families: viral RNA-dependent RNA polymerases, reverse transcriptases, single-subunit RNA polymerases, and DNA polymerase families A, B, D, and Y. Although the evolutionary relationships of right-hand polymerases within each family have been proposed, evolutionary relationships between families remain elusive because their sequence similarity is too low to allow classical phylogenetic analyses. The structure of viral RNA-dependent RNA polymerases recently was shown to be useful in inferring their evolution. Here, we address evolutionary relationships between right-hand polymerase families by combining sequence and structure information. We used a set of 22 viral and cellular polymerases representing all right-hand polymerase families with known protein structure. In contrast to previous studies, which focused only on the evolution of particular families, the current approach allowed us to present the first robust phylogenetic analysis unifying evolution of all right-hand polymerase families. All polymerase families branched into discrete lineages, following a fairly robust adjacency pattern. Only single-subunit RNA polymerases formed an inner group within DNA polymerase family A. RNA-dependent RNA polymerases of RNA viruses and reverse transcriptases of retroviruses formed two sister groups and were distinguishable from all other polymerases. DNA polymerases of DNA bacteriophages did not form a monophyletic group and are phylogenetically mixed with cellular DNA polymerase families A and B. Based on the highest genetic variability and structural simplicity, we assume that RNA-dependent RNA polymerases are the most ancient group of right-hand polymerases, in agreement with the RNA World hypothesis, because RNA-dependent RNA polymerases are enzymes that could serve in replication of RNA genomes. Moreover, our results show that protein structure can be used in phylogenetic analyses of distantly related proteins that share only limited sequence similarity.
Collapse
Affiliation(s)
- Jiří Černý
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Faculty of Science, University of South Bohemia in České Budějovice, Branišovská 31, 370 05 České Budějovice, Czech Republic.
| | - Barbora Černá Bolfíková
- Faculty of Tropical AgriSciences, Czech University of Life Sciences Prague, Kamýcká 126, Suchdol, 165 21 Prague 6, Czech Republic
| | - Paolo M de A Zanotto
- Department of Microbiology, Biomedical Sciences Institute, ICB II University of Sao Paulo, 05508-000 Sao Paulo, Brazil
| | - Libor Grubhoffer
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Faculty of Science, University of South Bohemia in České Budějovice, Branišovská 31, 370 05 České Budějovice, Czech Republic
| | - Daniel Růžek
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Veterinary Research Institute, Hudcova 296/70, 621 00 Brno, Czech Republic
| |
Collapse
|
34
|
Koonin EV. Why the Central Dogma: on the nature of the great biological exclusion principle. Biol Direct 2015; 10:52. [PMID: 26377089 PMCID: PMC4573691 DOI: 10.1186/s13062-015-0084-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Accepted: 09/14/2015] [Indexed: 11/12/2022] Open
Abstract
Abstract The Central Dogma of molecular biology posits that transfer of information from proteins back to nucleic acids does not occur in biological systems. I argue that the impossibility of reverse translation is indeed a major, physical exclusion principle that emerges due to the transition from the digital information carriers, nucleic acids, to analog information carriers, proteins, which involves irreversible suppression of the digital information. Reviewers This article was reviewed by Itai Yanai, Martin Lercher and Frank Eisenhaber.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Instittues of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
35
|
Abstract
LAP2-emerin-MAN1 (LEM)-domain proteins are modular proteins characterized by the presence of a conserved motif of about 50 residues. Most LEM-domain proteins localize at the inner nuclear membrane, but some are also found in the endoplasmic reticulum or nuclear interior. Their architecture has been analyzed by predicting the limits of their globular domains, determining the 3D structure of these domains and in a few cases calculating the 3D structure of specific domains bound to biological targets. The LEM domain adopts an α-helical fold also found in SAP and HeH domains of prokaryotes and unicellular eukaryotes. The LEM domain binds to BAF (barrier-to-autointegration factor; BANF1), which interacts with DNA and tethers chromatin to the nuclear envelope. LAP2 isoforms also share an N-terminal LEM-like domain, which binds DNA. The structure and function of other globular domains that distinguish LEM-domain proteins from each other have been characterized, including the C-terminal dimerization domain of LAP2α and C-terminal WH and UHM domains of MAN1. LEM-domain proteins also have large intrinsically disordered regions that are involved in intra- and intermolecular interactions and are highly regulated by posttranslational modifications in vivo.
Collapse
|
36
|
Mohanta TK, Mohanta N, Mohanta YK, Parida P, Bae H. Genome-wide identification of Calcineurin B-Like (CBL) gene family of plants reveals novel conserved motifs and evolutionary aspects in calcium signaling events. BMC PLANT BIOLOGY 2015; 15:189. [PMID: 26245459 PMCID: PMC4527274 DOI: 10.1186/s12870-015-0543-0] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Accepted: 06/09/2015] [Indexed: 05/21/2023]
Abstract
BACKGROUND Calcium ions, the most versatile secondary messenger found in plants, are involved in the regulation of diverse arrays of plant growth and development, as well as biotic and abiotic stress responses. The calcineurin B-like proteins are one of the most important genes that act as calcium sensors. RESULTS In this study, we identified calcineurin B-like gene family members from 38 different plant species and assigned a unique nomenclature to each of them. Sequence analysis showed that, the CBL proteins contain three calcium binding EF-hand domain that contains several conserved Asp and Glu amino acid residues. The third EF-hand of the CBL protein was found to posses the D/E-x-D calcium binding sensor motif. Phylogenetic analysis showed that, the CBL genes fall into six different groups. Additionally, except group B CBLs, all the CBL proteins were found to contain N-terminal palmitoylation and myristoylation sites. An evolutionary study showed that, CBL genes are evolved from a common ancestor and subsequently diverged during the course of evolution of land plants. Tajima's neutrality test showed that, CBL genes are highly polymorphic and evolved via decreasing population size due to balanced selection. Differential expression analysis with cold and heat stress treatment led to differential modulation of OsCBL genes. CONCLUSIONS The basic architecture of plant CBL genes is conserved throughout the plant kingdom. Evolutionary analysis showed that, these genes are evolved from a common ancestor of lower eukaryotic plant lineage and led to broadening of the calcium signaling events in higher eukaryotic organisms.
Collapse
Affiliation(s)
- Tapan Kumar Mohanta
- School of Biotechnology, Yeungnam University Gyeongsan, Gyeongbook, 712-749, Republic of Korea.
| | - Nibedita Mohanta
- Department of Biotechnology, North Orissa University, Sri Ramchandra Vihar, Takatpur, Baripada, Mayurbhanj, Orissa, 757003, India.
| | - Yugal Kishore Mohanta
- Department of Botany, North Orissa University, Sri Ramchandra Vihar, Takatpur, Baripada, Mayurbhanj, Orissa, 757003, India.
| | - Pratap Parida
- Center for studies in Biotechnology, Dibrugarh University, Dibrugarh, 786004, Assam, India.
| | - Hanhong Bae
- School of Biotechnology, Yeungnam University Gyeongsan, Gyeongbook, 712-749, Republic of Korea.
| |
Collapse
|
37
|
Faure G, Koonin EV. Universal distribution of mutational effects on protein stability, uncoupling of protein robustness from sequence evolution and distinct evolutionary modes of prokaryotic and eukaryotic proteins. Phys Biol 2015; 12:035001. [PMID: 25927823 DOI: 10.1088/1478-3975/12/3/035001] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Robustness to destabilizing effects of mutations is thought of as a key factor of protein evolution. The connections between two measures of robustness, the relative core size and the computationally estimated effect of mutations on protein stability (ΔΔG), protein abundance and the selection pressure on protein-coding genes (dN/dS) were analyzed for the organisms with a large number of available protein structures including four eukaryotes, two bacteria and one archaeon. The distribution of the effects of mutations in the core on protein stability is universal and indistinguishable in eukaryotes and bacteria, centered at slightly destabilizing amino acid replacements, and with a heavy tail of more strongly destabilizing replacements. The distribution of mutational effects in the hyperthermophilic archaeon Thermococcus gammatolerans is significantly shifted toward strongly destabilizing replacements which is indicative of stronger constraints that are imposed on proteins in hyperthermophiles. The median effect of mutations is strongly, positively correlated with the relative core size, in evidence of the congruence between the two measures of protein robustness. However, both measures show only limited correlations to the expression level and selection pressure on protein-coding genes. Thus, the degree of robustness reflected in the universal distribution of mutational effects appears to be a fundamental, ancient feature of globular protein folds whereas the observed variations are largely neutral and uncoupled from short term protein evolution. A weak anticorrelation between protein core size and selection pressure is observed only for surface residues in prokaryotes but a stronger anticorrelation is observed for all residues in eukaryotic proteins. This substantial difference between proteins of prokaryotes and eukaryotes is likely to stem from the demonstrable higher compactness of prokaryotic proteins.
Collapse
Affiliation(s)
- Guilhem Faure
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | |
Collapse
|
38
|
Biswas KH, Badireddy S, Rajendran A, Anand GS, Visweswariah SS. Cyclic nucleotide binding and structural changes in the isolated GAF domain of Anabaena adenylyl cyclase, CyaB2. PeerJ 2015; 3:e882. [PMID: 25922789 PMCID: PMC4411481 DOI: 10.7717/peerj.882] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2014] [Accepted: 03/18/2015] [Indexed: 01/01/2023] Open
Abstract
GAF domains are a large family of regulatory domains, and a subset are found associated with enzymes involved in cyclic nucleotide (cNMP) metabolism such as adenylyl cyclases and phosphodiesterases. CyaB2, an adenylyl cyclase from Anabaena, contains two GAF domains in tandem at the N-terminus and an adenylyl cyclase domain at the C-terminus. Cyclic AMP, but not cGMP, binding to the GAF domains of CyaB2 increases the activity of the cyclase domain leading to enhanced synthesis of cAMP. Here we show that the isolated GAFb domain of CyaB2 can bind both cAMP and cGMP, and enhanced specificity for cAMP is observed only when both the GAFa and the GAFb domains are present in tandem (GAFab domain). In silico docking and mutational analysis identified distinct residues important for interaction with either cAMP or cGMP in the GAFb domain. Structural changes associated with ligand binding to the GAF domains could not be detected by bioluminescence resonance energy transfer (BRET) experiments. However, amide hydrogen-deuterium exchange mass spectrometry (HDXMS) experiments provided insights into the structural basis for cAMP-induced allosteric regulation of the GAF domains, and differences in the changes induced by cAMP and cGMP binding to the GAF domain. Thus, our findings could allow the development of molecules that modulate the allosteric regulation by GAF domains present in pharmacologically relevant proteins.
Collapse
Affiliation(s)
- Kabir Hassan Biswas
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Science , Bangalore , India
| | - Suguna Badireddy
- Department of Biological Sciences, National University of Singapore , Singapore , Singapore
| | - Abinaya Rajendran
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Science , Bangalore , India
| | | | - Sandhya S Visweswariah
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Science , Bangalore , India
| |
Collapse
|
39
|
Boza G, Szilágyi A, Kun Á, Santos M, Szathmáry E. Evolution of the division of labor between genes and enzymes in the RNA world. PLoS Comput Biol 2014; 10:e1003936. [PMID: 25474573 PMCID: PMC4256009 DOI: 10.1371/journal.pcbi.1003936] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Accepted: 09/26/2014] [Indexed: 11/18/2022] Open
Abstract
The RNA world is a very likely interim stage of the evolution after the first replicators and before the advent of the genetic code and translated proteins. Ribozymes are known to be able to catalyze many reaction types, including cofactor-aided metabolic transformations. In a metabolically complex RNA world, early division of labor between genes and enzymes could have evolved, where the ribozymes would have been transcribed from the genes more often than the other way round, benefiting the encapsulating cells through this dosage effect. Here we show, by computer simulations of protocells harboring unlinked RNA replicators, that the origin of replicational asymmetry producing more ribozymes from a gene template than gene strands from a ribozyme template is feasible and robust. Enzymatic activities of the two modeled ribozymes are in trade-off with their replication rates, and the relative replication rates compared to those of complementary strands are evolvable traits of the ribozymes. The degree of trade-off is shown to have the strongest effect in favor of the division of labor. Although some asymmetry between gene and enzymatic strands could have evolved even in earlier, surface-bound systems, the shown mechanism in protocells seems inevitable and under strong positive selection. This could have preadapted the genetic system for transcription after the subsequent origin of chromosomes and DNA.
Collapse
Affiliation(s)
- Gergely Boza
- Department of Plant Systematics, Ecology and Theoretical Biology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary
- MTA-ELTE-MTMT Ecology Research Group, Budapest, Hungary
| | - András Szilágyi
- Department of Plant Systematics, Ecology and Theoretical Biology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary
- Parmenides Center for the Conceptual Foundations of Science, Pullach, Germany
- MTA-ELTE Research Group in Theoretical Biology and Evolutionary Ecology, Budapest, Hungary
| | - Ádám Kun
- Department of Plant Systematics, Ecology and Theoretical Biology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary
- MTA-ELTE-MTMT Ecology Research Group, Budapest, Hungary
- Parmenides Center for the Conceptual Foundations of Science, Pullach, Germany
| | - Mauro Santos
- Departament de Genètica i de Microbiologia, Grup de Biologia Evolutiva, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Eörs Szathmáry
- Department of Plant Systematics, Ecology and Theoretical Biology, Institute of Biology, Eötvös Loránd University, Budapest, Hungary
- Parmenides Center for the Conceptual Foundations of Science, Pullach, Germany
- MTA-ELTE Research Group in Theoretical Biology and Evolutionary Ecology, Budapest, Hungary
- * E-mail:
| |
Collapse
|
40
|
Stryjewska A, Kiepura K, Librowski T, Lochyński S. Biotechnology and genetic engineering in the new drug development. Part I. DNA technology and recombinant proteins. Pharmacol Rep 2014; 65:1075-85. [PMID: 24399704 DOI: 10.1016/s1734-1140(13)71466-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2012] [Revised: 05/13/2013] [Indexed: 11/17/2022]
Abstract
Pharmaceutical biotechnology has a long tradition and is rooted in the last century, first exemplified by penicillin and streptomycin as low molecular weight biosynthetic compounds. Today, pharmaceutical biotechnology still has its fundamentals in fermentation and bioprocessing, but the paradigmatic change affected by biotechnology and pharmaceutical sciences has led to an updated definition. The biotechnology revolution redrew the research, development, production and even marketing processes of drugs. Powerful new instruments and biotechnology related scientific disciplines (genomics, proteomics) make it possible to examine and exploit the behavior of proteins and molecules. Recombinant DNA (rDNA) technologies (genetic, protein, and metabolic engineering) allow the production of a wide range of peptides, proteins, and biochemicals from naturally nonproducing cells. This technology, now approximately 25 years old, is becoming one of the most important technologies developed in the 20(th) century. Pharmaceutical products and industrial enzymes were the first biotech products on the world market made by means of rDNA. Despite important advances regarding rDNA applications in mammalian cells, yeasts still represent attractive hosts for the production of heterologous proteins. In this review we describe these processes.
Collapse
Affiliation(s)
- Agnieszka Stryjewska
- Department of Bioorganic Chemistry, Faculty of Chemistry, Wrocław University of Technology, Wyb. Wyspiańskiego 27, PL 50-370 Wrocław, Poland. ;
| | | | | | | |
Collapse
|
41
|
Abstract
All life on earth can be naturally classified into cellular life forms and virus-like selfish elements, the latter being fully dependent on the former for their reproduction. Cells are reproducers that not only replicate their genome but also reproduce the cellular organization that depends on semipermeable, energy-transforming membranes and cannot be recovered from the genome alone, under the famous dictum of Rudolf Virchow, Omnis cellula e cellula. In contrast, simple selfish elements are replicators that can complete their life cycles within the host cell starting from genomic RNA or DNA alone. The origin of the cellular organization is the central and perhaps the hardest problem of evolutionary biology. I argue that the origin of cells can be understood only in conjunction with the origin and evolution of selfish genetic elements. A scenario of precellular evolution is presented that involves cohesion of the genomes of the emerging cellular life forms from primordial pools of small genetic elements that eventually segregated into hosts and parasites. I further present a model of the coevolution of primordial membranes and membrane proteins, discuss protocellular and non-cellular models of early evolution, and examine the habitats on the primordial earth that could have been conducive to precellular evolution and the origin of cells.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institute of Health, Bethesda, MD, 20894, USA,
| |
Collapse
|
42
|
Pandya C, Dunaway-Mariano D, Xia Y, Allen KN. Structure-guided approach for detecting large domain inserts in protein sequences as illustrated using the haloacid dehalogenase superfamily. Proteins 2014; 82:1896-906. [PMID: 24577717 DOI: 10.1002/prot.24543] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Revised: 02/19/2014] [Accepted: 02/22/2014] [Indexed: 11/11/2022]
Abstract
In multi-domain proteins, the domains typically run end-to-end, that is, one domain follows the C-terminus of another domain. However, approximately 10% of multi-domain proteins are formed by insertion of one domain sequence into that of another domain. Detecting such insertions within protein sequences is a fundamental challenge in structural biology. The haloacid dehalogenase superfamily (HADSF) serves as a challenging model system wherein a variable cap domain (∼5-200 residues in length) accessorizes the ubiquitous Rossmann-fold core domain, with variations in insertion site and topology corresponding to different classes of cap types. Herein, we describe a comprehensive computational strategy, CapPredictor, for determining large, variable domain insertions in protein sequences. Using a novel sequence-alignment algorithm in conjunction with a structure-guided sequence profile from 154 core-domain-only structures, more than 40,000 HADSF member sequences were assigned cap types. The resulting data set afforded insight into HADSF evolution. Notably, a similar distribution of cap-type classes across different phyla was observed, indicating that all cap types existed in the last universal common ancestor. In addition, comparative analyses of the predicted cap-type and functional assignments showed that different cap types carry out similar chemistries. Thus, while cap domains play a role in substrate recognition and chemical reactivity, cap-type does not strictly define functional class. Through this example, we have shown that CapPredictor is an effective new tool for the study of form and function in protein families where domain insertion occurs.
Collapse
Affiliation(s)
- Chetanya Pandya
- Bioinformatics Graduate Program, Boston University, 24 Cummington Mall, Boston, Massachusetts, 02215
| | | | | | | |
Collapse
|
43
|
Abstract
In a series of conceptual articles published around the millennium, Carl Woese emphasized that evolution of cells is the central problem of evolutionary biology, that the three-domain ribosomal tree of life is an essential framework for reconstructing cellular evolution, and that the evolutionary dynamics of functionally distinct cellular systems are fundamentally different, with the information processing systems “crystallizing” earlier than operational systems. The advances of evolutionary genomics over the last decade vindicate major aspects of Woese’s vision. Despite the observations of pervasive horizontal gene transfer among bacteria and archaea, the ribosomal tree of life comes across as a central statistical trend in the “forest” of phylogenetic trees of individual genes, and hence, an appropriate scaffold for evolutionary reconstruction. The evolutionary stability of information processing systems, primarily translation, becomes ever more striking with the accumulation of comparative genomic data indicating that nearly allof the few universal genes encode translation system components. Woese’s view on the fundamental distinctions between the three domains of cellular life also withstand the test of comparative genomics, although his non-acceptance of symbiogenetic scenarios for the origin of eukaryotes might not. Above all, Woese’s key prediction that understanding evolution of microbes will be the core of the new evolutionary biology appears to be materializing.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institute of Health, Bethesda, MD 20894
| |
Collapse
|
44
|
Bettendorff L, Wins P. Thiamine triphosphatase and the CYTH superfamily of proteins. FEBS J 2013; 280:6443-55. [DOI: 10.1111/febs.12498] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Accepted: 07/01/2013] [Indexed: 11/28/2022]
Affiliation(s)
| | - Pierre Wins
- GIGA-Neuroscience; University of Liège; Belgium
| |
Collapse
|
45
|
Stryjewska A, Kiepura K, Librowski T, Lochyński S. Biotechnology and genetic engineering in the new drug development. Part III. Biocatalysis, metabolic engineering and molecular modelling. Pharmacol Rep 2013; 65:1102-11. [DOI: 10.1016/s1734-1140(13)71468-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2012] [Revised: 05/13/2013] [Indexed: 02/03/2023]
|
46
|
Consequences of domain insertion on sequence-structure divergence in a superfold. Proc Natl Acad Sci U S A 2013; 110:E3381-7. [PMID: 23959887 DOI: 10.1073/pnas.1305519110] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Although the universe of protein structures is vast, these innumerable structures can be categorized into a finite number of folds. New functions commonly evolve by elaboration of existing scaffolds, for example, via domain insertions. Thus, understanding structural diversity of a protein fold evolving via domain insertions is a fundamental challenge. The haloalkanoic dehalogenase superfamily serves as an excellent model system wherein a variable cap domain accessorizes the ubiquitous Rossmann-fold core domain. Here, we determine the impact of the cap-domain insertion on the sequence and structure divergence of the core domain. Through quantitative analysis on a unique dataset of 154 core-domain-only and cap-domain-only structures, basic principles of their evolution have been uncovered. The relationship between sequence and structure divergence of the core domain is shown to be monotonic and independent of the corresponding type of domain insert, reflecting the robustness of the Rossmann fold to mutation. However, core domains with the same cap type share greater similarity at the sequence and structure levels, suggesting interplay between the cap and core domains. Notably, results reveal that the variance in structure maps to α-helices flanking the central β-sheet and not to the domain-domain interface. Collectively, these results hint at intramolecular coevolution where the fold diverges differentially in the context of an accessory domain, a feature that might also apply to other multidomain superfamilies.
Collapse
|
47
|
Kushwaha HR, Singla-Pareek SL, Pareek A. Putative osmosensor--OsHK3b--a histidine kinase protein from rice shows high structural conservation with its ortholog AtHK1 from Arabidopsis. J Biomol Struct Dyn 2013; 32:1318-32. [PMID: 23869567 PMCID: PMC4017273 DOI: 10.1080/07391102.2013.818576] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Revised: 06/19/2013] [Indexed: 11/10/2022]
Abstract
Prokaryotes and eukaryotes respond to various environmental stimuli using the two-component system (TCS). Essentially, it consists of membrane-bound histidine kinase (HK) which senses the stimuli and further transfers the signal to the response regulator, which in turn, regulates expression of various target genes. Recently, sequence-based genome wide analysis has been carried out in Arabidopsis and rice to identify all the putative members of TCS family. One of the members of this family i.e. AtHK1, (a putative osmosensor, hybrid-type sensory histidine kinase) is known to interact with AtHPt1 (phosphotransfer proteins) in Arabidopsis. Based on predicted rice interactome network (PRIN), the ortholog of AtHK1 in rice, OsHK3b, was found to be interacting with OsHPt2. The analysis of amino acid sequence of AtHK1 showed the presence of transmitter domain (TD) and receiver domain (RD), while OsHK3b showed presence of three conserved domains namely CHASE (signaling domain), TD, and RD. In order to elaborate on structural details of functional domains of hybrid-type HK and phosphotransfer proteins in both these genera, we have modeled them using homology modeling approach. The structural motifs present in various functional domains of the orthologous proteins were found to be highly conserved. Binding analysis of the RD domain of these sensory proteins in Arabidopsis and rice revealed the role of various residues such as histidine in HPt protein which are essential for their interaction.
Collapse
Affiliation(s)
- Hemant Ritturaj Kushwaha
- Synthetic Biology and Biofuel Group, International Center for Genetic Engineering and Biotechnology, New Delhi 110067, India
| | - Sneh Lata Singla-Pareek
- Plant Molecular Biology, International Center for Genetic Engineering and Biotechnology, New Delhi 110067, India
| | - Ashwani Pareek
- Stress Physiology and Molecular Biology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| |
Collapse
|
48
|
Zhang D, Iyer LM, He F, Aravind L. Discovery of Novel DENN Proteins: Implications for the Evolution of Eukaryotic Intracellular Membrane Structures and Human Disease. Front Genet 2012; 3:283. [PMID: 23248642 PMCID: PMC3521125 DOI: 10.3389/fgene.2012.00283] [Citation(s) in RCA: 183] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Accepted: 11/20/2012] [Indexed: 12/14/2022] Open
Abstract
The tripartite DENN module, comprised of a N-terminal longin domain, followed by DENN, and d-DENN domains, is a GDP-GTP exchange factor (GEFs) for Rab GTPases, which are regulators of practically all membrane trafficking events in eukaryotes. Using sequence and structure analysis we identify multiple novel homologs of the DENN module, many of which can be traced back to the ancestral eukaryote. These findings provide unexpected leads regarding key cellular processes such as autophagy, vesicle-vacuole interactions, chromosome segregation, and human disease. Of these, SMCR8, the folliculin interacting protein-1 and 2 (FNIP1 and FNIP2), nitrogen permease regulator 2 (NPR2), and NPR3 are proposed to function in recruiting Rab GTPases during different steps of autophagy, fusion of autophagosomes with the vacuole and regulation of cellular metabolism. Another novel DENN protein identified in this study is C9ORF72; expansions of the hexanucleotide GGGGCC in its first intron have been recently implicated in amyotrophic lateral sclerosis (ALS) and fronto-temporal dementia (FTD). While this mutation is proposed to cause a RNA-level defect, the identification of C9ORF72 as a potential DENN-type GEF raises the possibility that at least part of the pathology might relate to a specific Rab-dependent vesicular trafficking process, as has been observed in the case of some other neurological conditions with similar phenotypes. We present evidence that the longin domain, such as those found in the DENN module, are likely to have been ultimately derived from the related domains found in prokaryotic GTPase-activating proteins of MglA-like GTPases. Thus, the origin of the longin domains from this ancient GTPase-interacting domain, concomitant with the radiation of GTPases, especially of the Rab clade, played an important role in the dynamics of eukaryotic intracellular membrane systems.
Collapse
Affiliation(s)
- Dapeng Zhang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health Bethesda, MD, USA
| | | | | | | |
Collapse
|
49
|
Bernhardt HS. The RNA world hypothesis: the worst theory of the early evolution of life (except for all the others)(a). Biol Direct 2012; 7:23. [PMID: 22793875 PMCID: PMC3495036 DOI: 10.1186/1745-6150-7-23] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2012] [Accepted: 07/11/2012] [Indexed: 01/16/2023] Open
Abstract
The problems associated with the RNA world hypothesis are well known. In the following I discuss some of these difficulties, some of the alternative hypotheses that have been proposed, and some of the problems with these alternative models. From a biosynthetic - as well as, arguably, evolutionary - perspective, DNA is a modified RNA, and so the chicken-and-egg dilemma of "which came first?" boils down to a choice between RNA and protein. This is not just a question of cause and effect, but also one of statistical likelihood, as the chance of two such different types of macromolecule arising simultaneously would appear unlikely. The RNA world hypothesis is an example of a 'top down' (or should it be 'present back'?) approach to early evolution: how can we simplify modern biological systems to give a plausible evolutionary pathway that preserves continuity of function? The discovery that RNA possesses catalytic ability provides a potential solution: a single macromolecule could have originally carried out both replication and catalysis. RNA - which constitutes the genome of RNA viruses, and catalyzes peptide synthesis on the ribosome - could have been both the chicken and the egg! However, the following objections have been raised to the RNA world hypothesis: (i) RNA is too complex a molecule to have arisen prebiotically; (ii) RNA is inherently unstable; (iii) catalysis is a relatively rare property of long RNA sequences only; and (iv) the catalytic repertoire of RNA is too limited. I will offer some possible responses to these objections in the light of work by our and other labs. Finally, I will critically discuss an alternative theory to the RNA world hypothesis known as 'proteins first', which holds that proteins either preceded RNA in evolution, or - at the very least - that proteins and RNA coevolved. I will argue that, while theoretically possible, such a hypothesis is probably unprovable, and that the RNA world hypothesis, although far from perfect or complete, is the best we currently have to help understand the backstory to contemporary biology.
Collapse
Affiliation(s)
- Harold S Bernhardt
- Department of Biochemistry, University of Otago, P,O, Box 56, Dunedin, New Zealand.
| |
Collapse
|
50
|
Joseph AP, Valadié H, Srinivasan N, de Brevern AG. Local structural differences in homologous proteins: specificities in different SCOP classes. PLoS One 2012; 7:e38805. [PMID: 22745680 PMCID: PMC3382195 DOI: 10.1371/journal.pone.0038805] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2011] [Accepted: 05/10/2012] [Indexed: 11/19/2022] Open
Abstract
The constant increase in the number of solved protein structures is of great help in understanding the basic principles behind protein folding and evolution. 3-D structural knowledge is valuable in designing and developing methods for comparison, modelling and prediction of protein structures. These approaches for structure analysis can be directly implicated in studying protein function and for drug design. The backbone of a protein structure favours certain local conformations which include α-helices, β-strands and turns. Libraries of limited number of local conformations (Structural Alphabets) were developed in the past to obtain a useful categorization of backbone conformation. Protein Block (PB) is one such Structural Alphabet that gave a reasonable structure approximation of 0.42 Å. In this study, we use PB description of local structures to analyse conformations that are preferred sites for structural variations and insertions, among group of related folds. This knowledge can be utilized in improving tools for structure comparison that work by analysing local structure similarities. Conformational differences between homologous proteins are known to occur often in the regions comprising turns and loops. Interestingly, these differences are found to have specific preferences depending upon the structural classes of proteins. Such class-specific preferences are mainly seen in the all-β class with changes involving short helical conformations and hairpin turns. A test carried out on a benchmark dataset also indicates that the use of knowledge on the class specific variations can improve the performance of a PB based structure comparison approach. The preference for the indel sites also seem to be confined to a few backbone conformations involving β-turns and helix C-caps. These are mainly associated with short loops joining the regular secondary structures that mediate a reversal in the chain direction. Rare β-turns of type I’ and II’ are also identified as preferred sites for insertions.
Collapse
Affiliation(s)
- Agnel Praveen Joseph
- INSERM, UMR-S 665, Dynamique des Structures et Interactions des Macromolécules Biologiques (DSIMB), Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMR 665, Paris, France
- Institut National de la Transfusion Sanguine (INTS), Paris, France
| | - Hélène Valadié
- INSERM UMR-S 726, DSIMB, Université Paris Diderot - Paris 7, Paris, France
| | | | - Alexandre G. de Brevern
- INSERM, UMR-S 665, Dynamique des Structures et Interactions des Macromolécules Biologiques (DSIMB), Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMR 665, Paris, France
- Institut National de la Transfusion Sanguine (INTS), Paris, France
- * E-mail:
| |
Collapse
|