1
|
Rogers SO. Integrated evolution of ribosomal RNAs, introns, and intron nurseries. Genetica 2018; 147:103-119. [PMID: 30578455 DOI: 10.1007/s10709-018-0050-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2018] [Accepted: 12/13/2018] [Indexed: 12/21/2022]
Abstract
The initial components of ribosomes first appeared more than 3.8 billion years ago during a time when many types of RNAs were evolving. While modern ribosomes are complex molecular machines consisting of rRNAs and proteins, they were assembled during early evolution by the association and joining of small functional RNA units. Introns may have provided the means to ligate many of these pieces together. All four classes of introns (group I, group II, spliceosomal, and archaeal) are present in many rRNA gene loci over a broad phylogenetic range. A survey of rRNA intron sequences across the three major life domains suggests that some of the classes of introns may have diverged from one another within rRNA gene loci. Analyses of rRNA sequences revealed self-splicing group I and group II introns are present in ancestral regions of the SSU (small subunit) and LSU (large subunit), whereas spliceosomal and archaeal introns appeared in sections of the rRNA that evolved later. Most classes of introns increased in number for approximately 1 billion years. However, their frequencies are low in the most recently evolved regions added to the SSU and LSU rRNAs. Furthermore, many of the introns appear to have been in the same locations for billions of years, suggesting an ancient origin for these sequences. In this Perspectives paper, I reviewed and analyzed rRNA intron sequences, locations, structural characteristics, and splicing mechanisms; and suggest that rRNA gene loci may have served as evolutionary nurseries for intron formation and diversification.
Collapse
Affiliation(s)
- Scott O Rogers
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH, 43403, USA.
| |
Collapse
|
2
|
Batot G, Michalska K, Ekberg G, Irimpan EM, Joachimiak G, Jedrzejczak R, Babnigg G, Hayes CS, Joachimiak A, Goulding CW. The CDI toxin of Yersinia kristensenii is a novel bacterial member of the RNase A superfamily. Nucleic Acids Res 2017; 45:5013-5025. [PMID: 28398546 PMCID: PMC5435912 DOI: 10.1093/nar/gkx230] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 03/31/2017] [Indexed: 01/01/2023] Open
Abstract
Contact-dependent growth inhibition (CDI) is an important mechanism of inter-bacterial competition found in many Gram-negative pathogens. CDI+ cells express cell-surface CdiA proteins that bind neighboring bacteria and deliver C-terminal toxin domains (CdiA-CT) to inhibit target-cell growth. CDI+ bacteria also produce CdiI immunity proteins, which specifically neutralize cognate CdiA-CT toxins to prevent self-inhibition. Here, we present the crystal structure of the CdiA-CT/CdiIYkris complex from Yersinia kristensenii ATCC 33638. CdiA-CTYkris adopts the same fold as angiogenin and other RNase A paralogs, but the toxin does not share sequence similarity with these nucleases and lacks the characteristic disulfide bonds of the superfamily. Consistent with the structural homology, CdiA-CTYkris has potent RNase activity in vitro and in vivo. Structure-guided mutagenesis reveals that His175, Arg186, Thr276 and Tyr278 contribute to CdiA-CTYkris activity, suggesting that these residues participate in substrate binding and/or catalysis. CdiIYkris binds directly over the putative active site and likely neutralizes toxicity by blocking access to RNA substrates. Significantly, CdiA-CTYkris is the first non-vertebrate protein found to possess the RNase A superfamily fold, and homologs of this toxin are associated with secretion systems in many Gram-negative and Gram-positive bacteria. These observations suggest that RNase A-like toxins are commonly deployed in inter-bacterial competition.
Collapse
Affiliation(s)
- Gaëlle Batot
- Department of Molecular Biology & Biochemistry, University of California Irvine, Irvine, CA 92697, USA
- These authors contributed equally to this work as first authors
| | - Karolina Michalska
- Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne, IL 60439, USA
- Structural Biology Center, Biosciences Division, Argonne National Laboratory, Argonne, IL 60439, USA
- These authors contributed equally to this work as first authors
| | - Greg Ekberg
- Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, CA 93106-9625, USA
- These authors contributed equally to this work as first authors
| | - Ervin M. Irimpan
- Department of Molecular Biology & Biochemistry, University of California Irvine, Irvine, CA 92697, USA
| | - Grazyna Joachimiak
- Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Robert Jedrzejczak
- Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Gyorgy Babnigg
- Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Christopher S. Hayes
- Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, CA 93106-9625, USA
- Biomolecular Science and Engineering Program, University of California, Santa Barbara, Santa Barbara, CA 93106-9625, USA
| | - Andrzej Joachimiak
- Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne, IL 60439, USA
- Structural Biology Center, Biosciences Division, Argonne National Laboratory, Argonne, IL 60439, USA
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL 60637, USA
| | - Celia W. Goulding
- Department of Molecular Biology & Biochemistry, University of California Irvine, Irvine, CA 92697, USA
- Department of Pharmaceutical Sciences, University of California Irvine, Irvine, CA 92697, USA
- To whom correspondence should be addressed. Tel: +1 949 824 0337; Fax: +1 949 824 8551
| |
Collapse
|
3
|
Johnson PM, Gucinski GC, Garza-Sánchez F, Wong T, Hung LW, Hayes CS, Goulding CW. Functional Diversity of Cytotoxic tRNase/Immunity Protein Complexes from Burkholderia pseudomallei. J Biol Chem 2016; 291:19387-400. [PMID: 27445337 DOI: 10.1074/jbc.m116.736074] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Indexed: 12/23/2022] Open
Abstract
Contact-dependent growth inhibition (CDI) is a widespread mechanism of inter-bacterial competition. CDI(+) bacteria deploy large CdiA effector proteins, which carry variable C-terminal toxin domains (CdiA-CT). CDI(+) cells also produce CdiI immunity proteins that specifically neutralize cognate CdiA-CT toxins to prevent auto-inhibition. Here, we present the crystal structure of the CdiA-CT/CdiI(E479) toxin/immunity protein complex from Burkholderia pseudomallei isolate E479. The CdiA-CT(E479) tRNase domain contains a core α/β-fold that is characteristic of PD(D/E)XK superfamily nucleases. Unexpectedly, the closest structural homolog of CdiA-CT(E479) is another CDI toxin domain from B. pseudomallei 1026b. Although unrelated in sequence, the two B. pseudomallei nuclease domains share similar folds and active-site architectures. By contrast, the CdiI(E479) and CdiI(1026b) immunity proteins share no significant sequence or structural homology. CdiA-CT(E479) and CdiA-CT(1026b) are both tRNases; however, each nuclease cleaves tRNA at a distinct position. We used a molecular docking approach to model each toxin bound to tRNA substrate. The resulting models fit into electron density envelopes generated by small-angle x-ray scattering analysis of catalytically inactive toxin domains bound stably to tRNA. CdiA-CT(E479) is the third CDI toxin found to have structural homology to the PD(D/E)XK superfamily. We propose that CDI systems exploit the inherent sequence variability and active-site plasticity of PD(D/E)XK nucleases to generate toxin diversity. These findings raise the possibility that many other uncharacterized CDI toxins may belong to the PD(D/E)XK superfamily.
Collapse
Affiliation(s)
| | | | - Fernando Garza-Sánchez
- Department of Molecular, Cellular and Developmental Biology, University of California at Santa Barbara, Santa Barbara, California 93106-9625, and
| | - Timothy Wong
- From the Departments of Molecular Biology and Biochemistry and
| | - Li-Wei Hung
- the Physics Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545
| | - Christopher S Hayes
- the Biomolecular Science and Engineering Program and Department of Molecular, Cellular and Developmental Biology, University of California at Santa Barbara, Santa Barbara, California 93106-9625, and
| | - Celia W Goulding
- From the Departments of Molecular Biology and Biochemistry and Pharmaceutical Sciences, University of California at Irvine, Irvine, California 92697,
| |
Collapse
|
4
|
Abstract
Varkud Satellite (VS) ribozyme mediates rolling circle replication of a plasmid found in the Neurospora mitochondria. We report crystal structures of this ribozyme at 3.1Å resolution, revealing an intertwined dimer formed by an exchange of substrate helices. Within each protomer, an arrangement of three-way helical junctions organizes seven helices into a global fold that creates a docking site for the substrate helix of the other protomer, resulting in the formation of two active sites in trans. This mode of RNA-RNA association resembles the process of domain swapping in proteins and has implications for RNA regulation and evolution. Within each active site, adenine and guanine nucleobases abut the scissile phosphate, poised to serve direct roles in catalysis. Similarities to the active sites of the hairpin and hammerhead ribozymes highlight the functional significance of active site features, underscore the ability of RNA to access functional architectures from distant regions of sequence space, and suggest convergent evolution.
Collapse
|
5
|
Battini R, D'Arrigo S, Cassandrini D, Guzzetta A, Fiorillo C, Pantaleoni C, Romano A, Alfei E, Cioni G, Santorelli FM. Novel mutations in TSEN54 in pontocerebellar hypoplasia type 2. J Child Neurol 2014; 29:520-5. [PMID: 23307886 DOI: 10.1177/0883073812470002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Pontocerebellar hypoplasias represent a group of neurodegenerative autosomal recessive disorders characterized by hypoplasia/atrophy of the cerebellum, hypoplastic ventral pons, and microcephaly and associated with various clinical features. Pontocerebellar hypolasia type 2 is the most common form, and different mutations in genes encoding subunits of the transfer ribonucleic acid (RNA)-splicing endonuclease (TSEN) complex were identified in patients. The authors report clinical, imaging, and molecular studies in 2 unrelated patients with different clinical pictures of the pontocerebellar hypoplasia type 2 spectrum and novel mutations in TSEN54, aiming to further define the clinical spectrum of the disease and possible indicators of more favorable progression. They identified a novel missense mutation c.355T>G/p.Y119D in compound heterozygosity with the "common" c.919G>T/p.A307S (patient 1) and a novel homozygous c.7ins6(CCGGAG)/p.E2-P3insPE variant (patient 2). An expanded array of mutations might contribute in defining possible differences in severity and phenotype-genotype correlations.
Collapse
|
6
|
Dunin-Horkawicz S, Feder M, Bujnicki JM. Phylogenomic analysis of the GIY-YIG nuclease superfamily. BMC Genomics 2006; 7:98. [PMID: 16646971 PMCID: PMC1564403 DOI: 10.1186/1471-2164-7-98] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2006] [Accepted: 04/28/2006] [Indexed: 11/28/2022] Open
Abstract
Background The GIY-YIG domain was initially identified in homing endonucleases and later in other selfish mobile genetic elements (including restriction enzymes and non-LTR retrotransposons) and in enzymes involved in DNA repair and recombination. However, to date no systematic search for novel members of the GIY-YIG superfamily or comparative analysis of these enzymes has been reported. Results We carried out database searches to identify all members of known GIY-YIG nuclease families. Multiple sequence alignments together with predicted secondary structures of identified families were represented as Hidden Markov Models (HMM) and compared by the HHsearch method to the uncharacterized protein families gathered in the COG, KOG, and PFAM databases. This analysis allowed for extending the GIY-YIG superfamily to include members of COG3680 and a number of proteins not classified in COGs and to predict that these proteins may function as nucleases, potentially involved in DNA recombination and/or repair. Finally, all old and new members of the GIY-YIG superfamily were compared and analyzed to infer the phylogenetic tree. Conclusion An evolutionary classification of the GIY-YIG superfamily is presented for the very first time, along with the structural annotation of all (sub)families. It provides a comprehensive picture of sequence-structure-function relationships in this superfamily of nucleases, which will help to design experiments to study the mechanism of action of known members (especially the uncharacterized ones) and will facilitate the prediction of function for the newly discovered ones.
Collapse
Affiliation(s)
- Stanislaw Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - Marcin Feder
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| |
Collapse
|
7
|
Rigden DJ. An inactivated nuclease-like domain in RecC with novel function: implications for evolution. BMC STRUCTURAL BIOLOGY 2005; 5:9. [PMID: 15985153 PMCID: PMC1185551 DOI: 10.1186/1472-6807-5-9] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2005] [Accepted: 06/28/2005] [Indexed: 02/03/2023]
Abstract
BACKGROUND The PD-(D/E)xK superfamily, containing a wide variety of other exo- and endonucleases, is a notable example of general function conservation in the face of extreme sequence and structural variation. Almost all members employ a small number of shared conserved residues to bind catalytically essential metal ions and thereby effect DNA cleavage. The crystal structure of the RecBCD prokaryotic DNA repair machinery shows that RecB contains such a nuclease domain at its C-terminus. The RecC C-terminal region was reported as having a novel fold. RESULTS The RecC C-terminal region can be divided into an alpha/beta domain and a smaller alpha-helical bundle domain. Here we show that the alpha/beta domain is homologous to the RecB nuclease domain but lacks the features necessary for catalysis. Instead, the domain has a novel function within the nuclease superfamily--providing a hoop through which single-stranded DNA passes. Comparison with other structures of nuclease domains bound to DNA reveals strikingly different modes of ligand binding. The alpha-helical bundle domain contributes the pin which splits the DNA duplex. CONCLUSION The demonstrated homology of RecB and RecC shows how evolution acted to produce the present RecBCD complex through aggregation of new domains as well as functional divergence and structural redeployment of existing domains. Distantly homologous nuclease(-like) domains bind DNA in highly diverse manners.
Collapse
Affiliation(s)
- Daniel John Rigden
- School of Biological Sciences, University of Liverpool, Crown St., Liverpool L69 7ZB, UK.
| |
Collapse
|
8
|
Feder M, Bujnicki JM. Identification of a new family of putative PD-(D/E)XK nucleases with unusual phylogenomic distribution and a new type of the active site. BMC Genomics 2005; 6:21. [PMID: 15720711 PMCID: PMC551604 DOI: 10.1186/1471-2164-6-21] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2004] [Accepted: 02/18/2005] [Indexed: 12/18/2022] Open
Abstract
Background Prediction of structure and function for uncharacterized protein families by identification of evolutionary links to characterized families and known structures is one of the cornerstones of genomics. Theoretical assignment of three-dimensional folds and prediction of protein function even at a very general level can facilitate the experimental determination of the molecular mechanism of action and the role that members of a given protein family fulfill in the cell. Here, we predict the three-dimensional fold and study the phylogenomic distribution of members of a large family of uncharacterized proteins classified in the Clusters of Orthologous Groups database as COG4636. Results Using protein fold-recognition we found that members of COG4636 are remotely related to Holliday junction resolvases and other nucleases from the PD-(D/E)XK superfamily. Structure modeling and sequence analyses suggest that most members of COG4636 exhibit a new, unusual variant of the putative active site, in which the catalytic Lys residue migrated in the sequence, but retained similar spatial position with respect to other functionally important residues. Sequence analyses revealed that members of COG4636 and their homologs are found mainly in Cyanobacteria, but also in other bacterial phyla. They undergo horizontal transfer and extensive proliferation in the colonized genomes; for instance in Gloeobacter violaceus PCC 7421 they comprise over 2% of all protein-encoding genes. Thus, members of COG4636 appear to be a new type of selfish genetic elements, which may fulfill an important role in the genome dynamics of Cyanobacteria and other species they invaded. Our analyses provide a platform for experimental determination of the molecular and cellular function of members of this large protein family. Conclusion After submission of this manuscript, a crystal structure of one of the COG4636 members was released in the Protein Data Bank (code 1wdj; Idaka, M., Wada, T., Murayama, K., Terada, T., Kuramitsu, S., Shirouzu, M., Yokoyama, S.: Crystal structure of Tt1808 from Thermus thermophilus Hb8, to be published). Our analysis of the Tt1808 structure reveals that we correctly predicted all functionally important features of the COG4636 family, including the membership in the PD-(D/E)xK superfamily of nucleases, the three-dimensional fold, the putative catalytic residues, and the unusual configuration of the active site.
Collapse
Affiliation(s)
- Marcin Feder
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland
| |
Collapse
|
9
|
|
10
|
Iyer LM, Koonin EV, Aravind L. Extensive domain shuffling in transcription regulators of DNA viruses and implications for the origin of fungal APSES transcription factors. Genome Biol 2002; 3:RESEARCH0012. [PMID: 11897024 PMCID: PMC88810 DOI: 10.1186/gb-2002-3-3-research0012] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2001] [Revised: 01/09/2002] [Accepted: 01/10/2002] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Viral DNA-binding proteins have served as good models to study the biochemistry of transcription regulation and chromatin dynamics. Computational analysis of viral DNA-binding regulatory proteins and identification of their previously undetected homologs encoded by cellular genomes might lead to a better understanding of their function and evolution in both viral and cellular systems. RESULTS The phyletic range and the conserved DNA-binding domains of the viral regulatory proteins of the poxvirus D6R/N1R and baculoviral Bro protein families have not been previously defined. Using computational analysis, we show that the amino-terminal module of the D6R/N1R proteins defines a novel, conserved DNA-binding domain (the KilA-N domain) that is found in a wide range of proteins of large bacterial and eukaryotic DNA viruses. The KilA-N domain is suggested to be homologous to the fungal DNA-binding APSES domain. We provide evidence for the KilA-N and APSES domains sharing a common fold with the nucleic acid-binding modules of the LAGLIDADG nucleases and the amino-terminal domains of the tRNA endonuclease. The amino-terminal module of the Bro proteins is another, distinct DNA-binding domain (the Bro-N domain) that is present in proteins whose domain architectures parallel those of the KilA-N domain-containing proteins. A detailed analysis of the KilA-N and Bro-N domains and the associated domains points to extensive domain shuffling and lineage-specific gene family expansion within DNA virus genomes. CONCLUSIONS We define a large class of novel viral DNA-binding proteins and their cellular homologs and identify their domain architectures. On the basis of phyletic pattern analysis we present evidence for a probable viral origin of the fungus-specific cell-cycle regulatory transcription factors containing the APSES DNA-binding domain. We also demonstrate the extensive role of lineage-specific gene expansion and domain shuffling, within a limited set of approximately 24 domains, in the generation of the diversity of virus-specific regulatory proteins.
Collapse
Affiliation(s)
- Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | | | |
Collapse
|
11
|
Bujnicki JM, Rotkiewicz P, Kolinski A, Rychlewski L. Three-dimensional modeling of the I-TevI homing endonuclease catalytic domain, a GIY-YIG superfamily member, using NMR restraints and Monte Carlo dynamics. PROTEIN ENGINEERING 2001; 14:717-21. [PMID: 11739889 DOI: 10.1093/protein/14.10.717] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Using a recent version of the SICHO algorithm for in silico protein folding, we made a blind prediction of the tertiary structure of the N-terminal, independently folded, catalytic domain (CD) of the I-TevI homing endonuclease, a representative of the GIY-YIG superfamily of homing endonucleases. The secondary structure of the I-TevI CD has been determined using NMR spectroscopy, but computational sequence analysis failed to detect any protein of known tertiary structure related to the GIY-YIG nucleases (Kowalski et al., Nucleic Acids Res., 1999, 27, 2115-2125). To provide further insight into the structure-function relationships of all GIY-YIG superfamily members, including the recently described subfamily of type II restriction enzymes (Bujnicki et al., Trends Biochem. Sci., 2000, 26, 9-11), we incorporated the experimentally determined and predicted secondary and tertiary restraints in a reduced (side chain only) protein model, which was minimized by Monte Carlo dynamics and simulated annealing. The subsequently elaborated full atomic model of the I-TevI CD allows the available experimental data to be put into a structural context and suggests that the GIY-YIG domain may dimerize in order to bring together the conserved residues of the active site.
Collapse
Affiliation(s)
- J M Bujnicki
- Bioinformatics Laboratory, International Institute of Molecular and Cell Biology, ul. ks. Trojdena 4, 02-109 Warsaw, Poland.
| | | | | | | |
Collapse
|