26
|
Sugimoto H, Hirano M, Tanaka H, Tanaka T, Kitagawa-Yogo R, Muramoto N, Mitsukawa N. Plastid-targeted forms of restriction endonucleases enhance the plastid genome rearrangement rate and trigger the reorganization of its genomic architecture. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 102:1042-1057. [PMID: 31925982 DOI: 10.1111/tpj.14687] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 12/25/2019] [Accepted: 01/02/2020] [Indexed: 06/10/2023]
Abstract
Plant cells have acquired chloroplasts (plastids) with a unique genome (ptDNA), which developed during the evolution of endosymbiosis. The gene content and genome structure of ptDNAs in land plants are considerably stable, although those of algal ptDNAs are highly varied. Plant cells seem, therefore, to be intolerant of any structural or organizational changes in the ptDNA. Genome rearrangement functions as a driver of genomic evolutionary divergence. Here, we aimed to create various types of rearrangements in the ptDNA of Arabidopsis genomes using plastid-targeted forms of restriction endonucleases (pREs). Arabidopsis plants expressing each of the three specific pREs, i.e., pTaqI, pHinP1I, and pMseI, were generated; they showed the leaf variegation phenotypes associated with impaired chloroplast development. We confirmed that these pREs caused double-stranded breaks (DSB) at their recognition sites in ptDNAs. Genome-wide analysis of ptDNAs revealed that the transgenic lines exhibited a large number of rearrangements such as inversions and deletions/duplications, which were dominantly repaired by microhomology-mediated recombination and microhomology-mediated end-joining, and less by non-homologous end-joining. Notably, pHinP1I, which recognized a small number of sites in ptDNA, induced drastic structural changes, including regional copy number variations throughout ptDNAs. In contrast, the transient expression of either pTaqI or pMseI, whose recognition site numbers were relatively larger, resulted in small-scale changes at the whole genome level. These results indicated that DSB frequencies and their distribution are major determinants in shaping ptDNAs.
Collapse
|
27
|
Kim YK, Jo S, Cheon SH, Joo MJ, Hong JR, Kwak M, Kim KJ. Corrigendum: Plastome Evolution and Phylogeny of Orchidaceae, With 24 New Sequences. FRONTIERS IN PLANT SCIENCE 2020; 11:322. [PMID: 32265969 PMCID: PMC7099975 DOI: 10.3389/fpls.2020.00322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 03/04/2020] [Indexed: 06/11/2023]
Abstract
[This corrects the article DOI: 10.3389/fpls.2020.00022.].
Collapse
|
28
|
Kim YK, Jo S, Cheon SH, Joo MJ, Hong JR, Kwak M, Kim KJ. Plastome Evolution and Phylogeny of Orchidaceae, With 24 New Sequences. FRONTIERS IN PLANT SCIENCE 2020; 11:22. [PMID: 32153600 PMCID: PMC7047749 DOI: 10.3389/fpls.2020.00022] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 01/10/2020] [Indexed: 05/08/2023]
Abstract
In order to understand the evolution of the orchid plastome, we annotated and compared 124 complete plastomes of Orchidaceae representing all the major lineages in their structures, gene contents, gene rearrangements, and IR contractions/expansions. Forty-two of these plastomes were generated from the corresponding author's laboratory, and 24 plastomes-including nine genera (Amitostigma, Bulbophyllum, Dactylorhiza, Dipodium, Galearis, Gymnadenia, Hetaeria, Oreorchis, and Sedirea)-are new in this study. All orchid plastomes, except Aphyllorchis montana, Epipogium aphyllum, and Gastrodia elata, have a quadripartite structure consisting of a large single copy (LSC), two inverted repeats (IRs), and a small single copy (SSC) region. The IR region was completely lost in the A. montana and G. elata plastomes. The SSC is lost in the E. aphyllum plastome. The smallest plastome size was 19,047 bp, in E. roseum, and the largest plastome size was 178,131 bp, in Cypripedium formosanum. The small plastome sizes are primarily the result of gene losses associated with mycoheterotrophic habitats, while the large plastome sizes are due to the expansion of noncoding regions. The minimal number of common genes among orchid plastomes to maintain minimal plastome activity was 15, including the three subunits of rpl (14, 16, and 36), seven subunits of rps (2, 3, 4, 7, 8, 11, and 14), three subunits of rrn (5, 16, and 23), trnC-GCA, and clpP genes. Three stages of gene loss were observed among the orchid plastomes. The first was ndh gene loss, which is widespread in Apostasioideae, Vanilloideae, Cypripedioideae, and Epidendroideae, but rare in the Orchidoideae. The second stage was the loss of photosynthetic genes (atp, pet, psa, and psb) and rpo gene subunits, which are restricted to Aphyllorchis, Hetaeria, Hexalectris, and some species of Corallorhiza and Neottia. The third stage was gene loss related to prokaryotic gene expression (rpl, rps, trn, and others), which was observed in Epipogium, Gastrodia, Lecanorchis, and Rhizanthella. In addition, an intermediate stage between the second and third stage was observed in Cyrtosia (Vanilloideae). The majority of intron losses are associated with the loss of their corresponding genes. In some orchid taxa, however, introns have been lost in rpl16, rps16, and clpP(2) without their corresponding gene being lost. A total of 104 gene rearrangements were counted when comparing 116 orchid plastomes. Among them, many were concentrated near the IRa/b-SSC junction area. The plastome phylogeny of 124 orchid species confirmed the relationship of {Apostasioideae [Vanilloideae (Cypripedioideae (Orchidoideae, Epidendroideae))]} at the subfamily level and the phylogenetic relationships of 17 tribes were also established. Molecular clock analysis based on the whole plastome sequences suggested that Orchidaceae diverged from its sister family 99.2 mya, and the estimated divergence times of five subfamilies are as follows: Apostasioideae (79.91 mya), Vanilloideae (69.84 mya), Cypripedioideae (64.97 mya), Orchidoideae (59.16 mya), and Epidendroideae (59.16 mya). We also released the first nuclear ribosomal (nr) DNA unit (18S-ITS1-5.8S-ITS2-28S-NTS-ETS) sequences for the 42 species of Orchidaceae. Finally, the phylogenetic tree based on the nrDNA unit sequences is compared to the tree based on the 42 identical plastome sequences, and the differences between the two datasets are discussed in this paper.
Collapse
|
29
|
Ritschard EA, Whitelaw B, Albertin CB, Cooke IR, Strugnell JM, Simakov O. Coupled Genomic Evolutionary Histories as Signatures of Organismal Innovations in Cephalopods: Co-evolutionary Signatures Across Levels of Genome Organization May Shed Light on Functional Linkage and Origin of Cephalopod Novelties. Bioessays 2019; 41:e1900073. [PMID: 31664724 DOI: 10.1002/bies.201900073] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Revised: 09/05/2019] [Indexed: 12/07/2023]
Abstract
How genomic innovation translates into organismal organization remains largely unanswered. Possessing the largest invertebrate nervous system, in conjunction with many species-specific organs, coleoid cephalopods (octopuses, squids, cuttlefishes) provide exciting model systems to investigate how organismal novelties evolve. However, dissecting these processes requires novel approaches that enable deeper interrogation of genome evolution. Here, the existence of specific sets of genomic co-evolutionary signatures between expanded gene families, genome reorganization, and novel genes is posited. It is reasoned that their co-evolution has contributed to the complex organization of cephalopod nervous systems and the emergence of ecologically unique organs. In the course of reviewing this field, how the first cephalopod genomic studies have begun to shed light on the molecular underpinnings of morphological novelty is illustrated and their impact on directing future research is described. It is argued that the application and evolutionary profiling of evolutionary signatures from these studies will help identify and dissect the organismal principles of cephalopod innovations. By providing specific examples, the implications of this approach both within and beyond cephalopod biology are discussed.
Collapse
|
30
|
Tsushima A, Gan P, Kumakura N, Narusaka M, Takano Y, Narusaka Y, Shirasu K. Genomic Plasticity Mediated by Transposable Elements in the Plant Pathogenic Fungus Colletotrichum higginsianum. Genome Biol Evol 2019; 11:1487-1500. [PMID: 31028389 PMCID: PMC6535813 DOI: 10.1093/gbe/evz087] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/17/2019] [Indexed: 12/22/2022] Open
Abstract
Phytopathogen genomes are under constant pressure to change, as pathogens are locked in an evolutionary arms race with their hosts, where pathogens evolve effector genes to manipulate their hosts, whereas the hosts evolve immune components to recognize the products of these genes. Colletotrichum higginsianum (Ch), a fungal pathogen with no known sexual morph, infects Brassicaceae plants including Arabidopsis thaliana. Previous studies revealed that Ch differs in its virulence toward various Arabidopsis thaliana ecotypes, indicating the existence of coevolutionary selective pressures. However, between-strain genomic variations in Ch have not been studied. Here, we sequenced and assembled the genome of a Ch strain, resulting in a highly contiguous genome assembly, which was compared with the chromosome-level genome assembly of another strain to identify genomic variations between strains. We found that the two closely related strains vary in terms of large-scale rearrangements, the existence of strain-specific regions, and effector candidate gene sets and that these variations are frequently associated with transposable elements (TEs). Ch has a compartmentalized genome consisting of gene-sparse, TE-dense regions with more effector candidate genes and gene-dense, TE-sparse regions harboring conserved genes. Additionally, analysis of the conservation patterns and syntenic regions of effector candidate genes indicated that the two strains vary in their effector candidate gene sets because of de novo evolution, horizontal gene transfer, or gene loss after divergence. Our results reveal mechanisms for generating genomic diversity in this asexual pathogen, which are important for understanding its adaption to hosts.
Collapse
|
31
|
Chen X, Jiang Y, Gao F, Zheng W, Krock TJ, Stover NA, Lu C, Katz LA, Song W. Genome analyses of the new model protist Euplotes vannus focusing on genome rearrangement and resistance to environmental stressors. Mol Ecol Resour 2019; 19:1292-1308. [PMID: 30985983 DOI: 10.1111/1755-0998.13023] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 04/05/2019] [Accepted: 04/08/2019] [Indexed: 12/11/2022]
Abstract
As a model organism for studies of cell and environmental biology, the free-living and cosmopolitan ciliate Euplotes vannus shows intriguing features like dual genome architecture (i.e., separate germline and somatic nuclei in each cell/organism), "gene-sized" chromosomes, stop codon reassignment, programmed ribosomal frameshifting (PRF) and strong resistance to environmental stressors. However, the molecular mechanisms that account for these remarkable traits remain largely unknown. Here we report a combined analysis of de novo assembled high-quality macronuclear (MAC; i.e., somatic) and partial micronuclear (MIC; i.e., germline) genome sequences for E. vannus, and transcriptome profiling data under varying conditions. The results demonstrate that: (a) the MAC genome contains more than 25,000 complete "gene-sized" nanochromosomes (~85 Mb haploid genome size) with the N50 ~2.7 kb; (b) although there is a high frequency of frameshifting at stop codons UAA and UAG, we did not observe impaired transcript abundance as a result of PRF in this species as has been reported for other euplotids; (c) the sequence motif 5'-TA-3' is conserved at nearly all internally-eliminated sequence (IES) boundaries in the MIC genome, and chromosome breakage sites (CBSs) are duplicated and retained in the MAC genome; (d) by profiling the weighted correlation network of genes in the MAC under different environmental stressors, including nutrient scarcity, extreme temperature, salinity and the presence of ammonia, we identified gene clusters that respond to these external physical or chemical stimulations, and (e) we observed a dramatic increase in HSP70 gene transcription under salinity and chemical stresses but surprisingly, not under temperature changes; we link this temperature-resistance to the evolved loss of temperature stress-sensitive elements in regulatory regions. Together with the genome resources generated in this study, which are available online at Euplotes vannus Genome Database (http://evan.ciliate.org), these data provide molecular evidence for understanding the unique biology of highly adaptable microorganisms.
Collapse
|
32
|
Cunha LFI, Protti F. Genome Rearrangements on Multigenomic Models: Applications of Graph Convexity Problems. J Comput Biol 2019; 26:1214-1222. [PMID: 31120333 DOI: 10.1089/cmb.2019.0091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Genome rearrangements are events where large blocks of DNA exchange pieces during evolution. The analysis of such events is a tool for understanding evolutionary genomics, in whose context many rearrangement distances have been proposed, based on finding the minimum number of rearrangements to transform one genome into another, using some predefined operation. However, when more than two genomes are considered, we have new challenging problems. Studying such problems from a combinatorial point of view has been shown to be a useful tool to approach such problems, for example, the reconstruction of phylogenetic trees. We focus on genome rearrangement problems related to graph convexity. Such an approach is in connection with some other well-known studies on multigenomic models, for example, those based on the median and on the closest string. We propose an association between graph convexities and genome rearrangements in such a way that graph convexity problems deal with input sets of vertices and try to answer questions concerning the closure of such inputs. The concept of closure is useful for studies on genome rearrangement by suggesting mechanisms to reduce the genomic search space. Regarding the computational complexity, and considering the Hamming distance on strings, we solve the following problems: decide if a given set is convex; compute the interval and the convex hull of a given set; and determine the convexity number, interval number, and hull number of a Hamming graph. All such problems are solved for three types of convexities: geodetic, monophonic, and P3. Considering the Cayley distance on permutations, we solve the convexity number and interval determination problems for the geodetic convexity.
Collapse
|
33
|
Frequency of DNA end joining in trans is not determined by the predamage spatial proximity of double-strand breaks in yeast. Proc Natl Acad Sci U S A 2019; 116:9481-9490. [PMID: 31019070 DOI: 10.1073/pnas.1818595116] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
DNA double-strand breaks (DSBs) are serious genomic insults that can lead to chromosomal rearrangements if repaired incorrectly. To gain insight into the nuclear mechanisms contributing to these rearrangements, we developed an assay in yeast to measure cis (same site) vs. trans (different site) repair for the majority process of precise nonhomologous end joining (NHEJ). In the assay, the HO endonuclease gene is placed between two HO cut sites such that HO expression is self-terminated upon induction. We further placed an additional cut site in various genomic loci such that NHEJ in trans led to expression of a LEU2 reporter gene. Consistent with prior reports, cis NHEJ was more efficient than trans NHEJ. However, unlike homologous recombination, where spatial distance between a single DSB and donor locus was previously shown to correlate with repair efficiency, trans NHEJ frequency remained essentially constant regardless of the position of the two DSB loci, even when they were on the same chromosome or when two trans repair events were put in competition. Repair of similar DSBs via single-strand annealing of short terminal direct repeats showed substantially higher repair efficiency and trans repair frequency, but still without a strong correlation of trans repair to genomic position. Our results support a model in which yeast cells mobilize, and perhaps compartmentalize, multiple DSBs in a manner that no longer reflects the predamage position of two broken loci.
Collapse
|
34
|
Deng L, Wu RA, Sonneville R, Kochenova OV, Labib K, Pellman D, Walter JC. Mitotic CDK Promotes Replisome Disassembly, Fork Breakage, and Complex DNA Rearrangements. Mol Cell 2019; 73:915-929.e6. [PMID: 30849395 PMCID: PMC6410736 DOI: 10.1016/j.molcel.2018.12.021] [Citation(s) in RCA: 92] [Impact Index Per Article: 18.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2018] [Revised: 10/03/2018] [Accepted: 12/21/2018] [Indexed: 12/27/2022]
Abstract
DNA replication errors generate complex chromosomal rearrangements and thereby contribute to tumorigenesis and other human diseases. One mechanism that triggers these errors is mitotic entry before the completion of DNA replication. To address how mitosis might affect DNA replication, we used Xenopus egg extracts. When mitotic CDK (Cyclin B1-CDK1) is used to drive interphase egg extracts into a mitotic state, the replicative CMG (CDC45/MCM2-7/GINS) helicase undergoes ubiquitylation on its MCM7 subunit, dependent on the E3 ubiquitin ligase TRAIP. Whether replisomes have stalled or undergone termination, CMG ubiquitylation is followed by its extraction from chromatin by the CDC48/p97 ATPase. TRAIP-dependent CMG unloading during mitosis is also seen in C. elegans early embryos. At stalled forks, CMG removal results in fork breakage and end joining events involving deletions and templated insertions. Our results identify a mitotic pathway of global replisome disassembly that can trigger replication fork collapse and DNA rearrangements.
Collapse
|
35
|
Rodrigues Oliveira A, Lima Brito K, Dias Z, Dias U. Sorting by Weighted Reversals and Transpositions. J Comput Biol 2019; 26:420-431. [PMID: 30785331 DOI: 10.1089/cmb.2018.0257] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Genome rearrangements are global mutations that change large stretches of DNA sequence throughout genomes. They are rare but accumulate during the evolutionary process leading to organisms with similar genetic material in different places and orientations within the genome. Sorting by Genome Rearrangements problems seek for minimum-length sequences of rearrangements that transform one genome into the other. These problems accept alternative versions that assign weights for each event, and the goal is to find a minimum-weight sequence. We study the Sorting by Weighted Reversals and Transpositions problem on signed permutations. In this study, we use weight 2 for reversals and 3 for transpositions and consider theoretical and practical aspects in our analysis. We present two algorithms with approximation factors of 5/3 and 3/2. We also developed a generic approximation algorithm to deal with different weights for reversals and transpositions, and we show the approximation factor reached in each scenario.
Collapse
|
36
|
Repeats of Unusual Size in Plant Mitochondrial Genomes: Identification, Incidence and Evolution. G3-GENES GENOMES GENETICS 2019; 9:549-559. [PMID: 30563833 PMCID: PMC6385970 DOI: 10.1534/g3.118.200948] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Plant mitochondrial genomes have excessive size relative to coding capacity, a low mutation rate in genes and a high rearrangement rate. They also have abundant non-tandem repeats often including pairs of large repeats which cause isomerization of the genome by recombination, and numerous repeats of up to several hundred base pairs that recombine only when the genome is stressed by DNA damaging agents or mutations in DNA repair pathway genes. Early work on mitochondrial genomes led to the suggestion that repeats in the size range from several hundred to a few thousand base pair are underrepresented. The repeats themselves are not well-conserved between species, and are not always annotated in mitochondrial sequence assemblies. We systematically identified and compared these repeats, which are important clues to mechanisms of DNA maintenance in mitochondria. We developed a tool to find and curate non-tandem repeats larger than 50bp and analyzed the complete mitochondrial sequences from 157 plant species. We observed an interesting difference between taxa: the repeats are larger and more frequent in the vascular plants. Analysis of closely related species also shows that plant mitochondrial genomes evolve in dramatic bursts of breakage and rejoining, complete with DNA sequence gain and loss. We suggest an adaptive explanation for the existence of the repeats and their evolution.
Collapse
|
37
|
Fukui K, Harada A, Wakamatsu T, Minobe A, Ohshita K, Ashiuchi M, Yano T. The GIY-YIG endonuclease domain of Arabidopsis MutS homolog 1 specifically binds to branched DNA structures. FEBS Lett 2018; 592:4066-4077. [PMID: 30372520 DOI: 10.1002/1873-3468.13279] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 10/16/2018] [Accepted: 10/24/2018] [Indexed: 01/18/2023]
Abstract
In plant organelle genomes, homeologous recombination between heteroallelic positions of repetitive sequences is increased by dysfunction of the gene encoding MutS homolog 1 (MSH1), a plant organelle-specific homolog of bacterial mismatch-binding protein MutS1. The C-terminal region of plant MSH1 contains the GIY-YIG endonuclease motif. The biochemical characteristics of plant MSH1 have not been investigated; accordingly, the molecular mechanism by which plant MSH1 suppresses homeologous recombination is unknown. Here, we characterized the recombinant GIY-YIG domain of Arabidopsis thaliana MSH1, showing that the domain possesses branched DNA-specific DNA-binding activity. Interestingly, the domain exhibited no endonuclease activity, suggesting that the mismatch-binding domain is required for DNA incision. Based on these results, we propose a possible mechanism for MSH1-dependent suppression of homeologous recombination.
Collapse
|
38
|
Algady W, Louzada S, Carpenter D, Brajer P, Färnert A, Rooth I, Ngasala B, Yang F, Shaw MA, Hollox EJ. The Malaria-Protective Human Glycophorin Structural Variant DUP4 Shows Somatic Mosaicism and Association with Hemoglobin Levels. Am J Hum Genet 2018; 103:769-776. [PMID: 30388403 PMCID: PMC6218809 DOI: 10.1016/j.ajhg.2018.10.008] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Accepted: 10/04/2018] [Indexed: 01/23/2023] Open
Abstract
Glycophorin A and glycophorin B are red blood cell surface proteins and are both receptors for the parasite Plasmodium falciparum, which is the principal cause of malaria in sub-Saharan Africa. DUP4 is a complex structural genomic variant that carries extra copies of a glycophorin A-glycophorin B fusion gene and has a dramatic effect on malaria risk by reducing the risk of severe malaria by up to 40%. Using fiber-FISH and Illumina sequencing, we validate the structural arrangement of the glycophorin locus in the DUP4 variant and reveal somatic variation in copy number of the glycophorin B-glycophorin A fusion gene. By developing a simple, specific, PCR-based assay for DUP4, we show that the DUP4 variant reaches a frequency of 13% in the population of a malaria-endemic village in south-eastern Tanzania. We genotype a substantial proportion of that village and demonstrate an association of DUP4 genotype with hemoglobin levels, a phenotype related to malaria, using a family-based association test. Taken together, we show that DUP4 is a complex structural variant that may be susceptible to somatic variation and show that DUP4 is associated with a malarial-related phenotype in a longitudinally followed population.
Collapse
|
39
|
Burger G, Valach M. Perfection of eccentricity: Mitochondrial genomes of diplonemids. IUBMB Life 2018; 70:1197-1206. [PMID: 30304578 DOI: 10.1002/iub.1927] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Revised: 07/09/2018] [Accepted: 07/10/2018] [Indexed: 01/14/2023]
Abstract
Mitochondria are the sandbox of evolution as exemplified most particularly by the diplonemids, a group of marine microeukaryotes. These protists are uniquely characterized by their highly multipartite mitochondrial genome and systematically fragmented genes whose pieces are spread out over several dozens of chromosomes. The type species Diplonema papillatum was the first member of this group in which the expression of fragmented mitochondrial genes was investigated experimentally. We now know that gene expression involves separate transcription of gene pieces (modules), RNA editing of module transcripts, and module joining to mature mRNAs and rRNAs. The mechanism of cognate module recognition and ligation is distinct from known intron splicing and remains to be uncovered. Here, we review the current status of research on mitochondrial genome architecture, as well as gene complement, structure, and expression modes in diplonemids. Further, we discuss the potential molecular mechanisms of posttranscriptional processing, and finally reflect on the evolutionary trajectories and trends of mtDNA evolution as seen in this protist group. © 2018 IUBMB Life, 70(12):1197-1206, 2018.
Collapse
|
40
|
Genome Rearrangement Shapes Prochlorococcus Ecological Adaptation. Appl Environ Microbiol 2018; 84:AEM.01178-18. [PMID: 29915114 PMCID: PMC6102989 DOI: 10.1128/aem.01178-18] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Accepted: 06/10/2018] [Indexed: 12/13/2022] Open
Abstract
Prochlorococcus, the most abundant and smallest known free-living photosynthetic microorganism, plays a key role in marine ecosystems and biogeochemical cycles. Prochlorococcus genome evolution is a fundamental issue related to how Prochlorococcus clades adapted to different ecological niches. Recent studies revealed that the gene gain and loss is crucial to the clade differentiation. The significance of our research is that we interpreted the Prochlorococcus genome evolution from the perspective of genome structure and associated the genome rearrangement with the Prochlorococcus clade differentiation and subsequent ecological adaptation. Prochlorococcus is the most abundant and smallest known free-living photosynthetic microorganism and is a key player in marine ecosystems and biogeochemical cycles. Prochlorococcus can be broadly divided into high-light-adapted (HL) and low-light-adapted (LL) clades. In this study, we isolated two low-light-adapted clade I (LLI) strains from the western Pacific Ocean and obtained their genomic data. We reconstructed Prochlorococcus evolution based on genome rearrangement. Our results showed that genome rearrangement might have played an important role in Prochlorococcus evolution. We also found that the Prochlorococcus clades with streamlined genomes maintained relatively high synteny throughout most of their genomes, and several regions served as rearrangement hotspots. Backbone analysis showed that different clades shared a conserved backbone but also had clade-specific regions, and the genes in these regions were associated with ecological adaptations. IMPORTANCEProchlorococcus, the most abundant and smallest known free-living photosynthetic microorganism, plays a key role in marine ecosystems and biogeochemical cycles. Prochlorococcus genome evolution is a fundamental issue related to how Prochlorococcus clades adapted to different ecological niches. Recent studies revealed that the gene gain and loss is crucial to the clade differentiation. The significance of our research is that we interpreted the Prochlorococcus genome evolution from the perspective of genome structure and associated the genome rearrangement with the Prochlorococcus clade differentiation and subsequent ecological adaptation.
Collapse
|
41
|
Furrer DI, Swart EC, Kraft MF, Sandoval PY, Nowacki M. Two Sets of Piwi Proteins Are Involved in Distinct sRNA Pathways Leading to Elimination of Germline-Specific DNA. Cell Rep 2018; 20:505-520. [PMID: 28700949 PMCID: PMC5522536 DOI: 10.1016/j.celrep.2017.06.050] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Revised: 06/02/2017] [Accepted: 06/20/2017] [Indexed: 12/22/2022] Open
Abstract
Piwi proteins and piRNAs protect eukaryotic germlines against the spread of transposons. During development in the ciliate Paramecium, two Piwi-dependent sRNA classes are involved in the elimination of transposons and transposon-derived DNA: scan RNAs (scnRNAs), associated with Ptiwi01 and Ptiwi09, and iesRNAs, whose binding partners we now identify as Ptiwi10 and Ptiwi11. scnRNAs derive from the maternal genome and initiate DNA elimination during development, whereas iesRNAs continue DNA targeting until the removal process is complete. Here, we show that scnRNAs and iesRNAs are processed by distinct Dicer-like proteins and bind Piwi proteins in a mutually exclusive manner, suggesting separate biogenesis pathways. We also demonstrate that the PTIWI10 gene is transcribed from the developing nucleus and that its transcription depends on prior DNA excision, suggesting a mechanism of gene expression control triggered by the removal of short DNA segments interrupting the gene. Identification of two Piwi proteins (Ptiwi10/11) associated with iesRNAs Piwi proteins bind Dicer-produced sRNAs and remove passenger strands Ptiwi10 is expressed from the new somatic macronucleus DNA elimination activates gene transcription
Collapse
|
42
|
Spencer-Smith R, Gould SW, Pulijala M, Snyder LAS. Investigating Potential Chromosomal Rearrangements during Laboratory Culture of Neisseria gonorrhoeae. Microorganisms 2018; 6:microorganisms6010010. [PMID: 29361673 PMCID: PMC5874624 DOI: 10.3390/microorganisms6010010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Revised: 12/19/2017] [Accepted: 01/19/2018] [Indexed: 01/02/2023] Open
Abstract
Comparisons of genome sequence data between different strains and isolates of Neisseria spp., such as Neisseria gonorrhoeae, reveal that over the evolutionary history of these organisms, large scale chromosomal rearrangements have occurred. Factors within the genomes, such as repetitive sequences and prophage, are believed to have contributed to these observations. However, the timescale in which rearrangements occur is not clear, nor whether it might be expected for them to happen in the laboratory. In this study, N. gonorrhoeae was repeatedly passaged in the laboratory and assessed for large scale chromosomal rearrangements. Using gonococcal strain NCCP11945, for which there is a complete genome sequence, cultures were passaged for eight weeks in the laboratory. The resulting genomic DNA was assessed using Pulsed Field Gel Electrophoresis, comparing the results to the predicted results from the genome sequence data. Three cultures generated Pulsed Field Gel Electrophoresis patterns that varied from the genomic data and were further investigated for potential chromosomal rearrangements.
Collapse
|
43
|
Suhren JH, Noto T, Kataoka K, Gao S, Liu Y, Mochizuki K. Negative Regulators of an RNAi-Heterochromatin Positive Feedback Loop Safeguard Somatic Genome Integrity in Tetrahymena. Cell Rep 2017; 18:2494-2507. [PMID: 28273462 PMCID: PMC5357732 DOI: 10.1016/j.celrep.2017.02.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Revised: 12/22/2016] [Accepted: 02/06/2017] [Indexed: 11/05/2022] Open
Abstract
RNAi-mediated positive feedback loops are pivotal for the maintenance of heterochromatin, but how they are downregulated at heterochromatin-euchromatin borders is not well understood. In the ciliated protozoan Tetrahymena, heterochromatin is formed exclusively on the sequences that are removed from the somatic genome by programmed DNA elimination, and an RNAi-mediated feedback loop is important for assembling heterochromatin on the eliminated sequences. In this study, we show that the heterochromatin protein 1 (HP1)-like protein Coi6p, its interaction partners Coi7p and Lia5p, and the histone demethylase Jmj1p are crucial for confining the production of small RNAs and the formation of heterochromatin to the eliminated sequences. The loss of Coi6p, Coi7p, or Jmj1p causes ectopic DNA elimination. The results provide direct evidence for the existence of a dedicated mechanism that counteracts a positive feedback loop between RNAi and heterochromatin at heterochromatin-euchromatin borders to maintain the integrity of the somatic genome. The HP1-like protein Coi6p confines small RNA and heterochromatin formation Two Coi6p-binding proteins and the histone demethylase Jmj1p likely act with Coi6p Coi6p and Jmj1p are important for preventing ectopic DNA elimination Suppression of RNAi-heterochromatin feedback loop maintains somatic genome integrity
Collapse
|
44
|
Wu CS, Wang TJ, Wu CW, Wang YN, Chaw SM. Plastome Evolution in the Sole Hemiparasitic Genus Laurel Dodder (Cassytha) and Insights into the Plastid Phylogenomics of Lauraceae. Genome Biol Evol 2017; 9:2604-2614. [PMID: 28985306 PMCID: PMC5737380 DOI: 10.1093/gbe/evx177] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/06/2017] [Indexed: 12/29/2022] Open
Abstract
To date, little is known about the evolution of plastid genomes (plastomes) in Lauraceae. As one of the top five largest families in tropical forests, the Lauraceae contain many species that are important ecologically and economically. Lauraceous species also provide wonderful materials to study the evolutionary trajectory in response to parasitism because they contain both nonparasitic and parasitic species. This study compared the plastomes of nine Lauraceous species, including the sole hemiparasitic and herbaceous genus Cassytha (laurel dodder; here represented by Cassytha filiformis). We found differential contractions of the canonical inverted repeat (IR), resulting in two IR types present in Lauraceae. These two IR types reinforce Cryptocaryeae and Neocinnamomum-Perseeae-Laureae as two separate clades. Our data reveal several traits unique to Cas. filiformis, including loss of IRs, loss or pseudogenization of 11 ndh and rpl23 genes, richness of repeats, and accelerated rates of nucleotide substitutions in protein-coding genes. Although Cas. filiformis is low in chlorophyll content, our analysis based on dN/dS ratios suggests that both its plastid house-keeping and photosynthetic genes are under strong selective constraints. Hence, we propose that short generation time and herbaceous lifestyle rather than reduced photosynthetic ability drive the accelerated rates of nucleotide substitutions in Cas. filiformis.
Collapse
|
45
|
Zeira R, Zehavi M, Shamir R. A Linear-Time Algorithm for the Copy Number Transformation Problem. J Comput Biol 2017; 24:1179-1194. [PMID: 28837352 DOI: 10.1089/cmb.2017.0060] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Problems of genome rearrangement are central in both evolution and cancer. Most evolutionary scenarios have been studied under the assumption that the genome contains a single copy of each gene. In contrast, tumor genomes undergo deletions and duplications, and thus, the number of copies of genes varies. The number of copies of each segment along a chromosome is called its copy number profile (CNP). Understanding CNP changes can assist in predicting disease progression and treatment. To date, questions related to distances between CNPs gained little scientific attention. Here we focus on the following fundamental problem, introduced by Schwarz et al.: given two CNPs, u and v, compute the minimum number of operations transforming u into v, where the edit operations are segmental deletions and amplifications. We establish the computational complexity of this problem, showing that it is solvable in linear time and constant space.
Collapse
|
46
|
Weng ML, Ruhlman TA, Jansen RK. Expansion of inverted repeat does not decrease substitution rates in Pelargonium plastid genomes. THE NEW PHYTOLOGIST 2017; 214:842-851. [PMID: 27991660 DOI: 10.1111/nph.14375] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 11/04/2016] [Indexed: 05/23/2023]
Abstract
For species with minor inverted repeat (IR) boundary changes in the plastid genome (plastome), nucleotide substitution rates were previously shown to be lower in the IR than the single copy regions (SC). However, the impact of large-scale IR expansion/contraction on plastid nucleotide substitution rates among closely related species remains unclear. We included plastomes from 22 Pelargonium species, including eight newly sequenced genomes, and used both pairwise and model-based comparisons to investigate the impact of the IR on sequence evolution in plastids. Ten types of plastome organization with different inversions or IR boundary changes were identified in Pelargonium. Inclusion in the IR was not sufficient to explain the variation of nucleotide substitution rates. Instead, the rate heterogeneity in Pelargonium plastomes was a mixture of locus-specific, lineage-specific and IR-dependent effects. Our study of Pelargonium plastomes that vary in IR length and gene content demonstrates that the evolutionary consequences of retaining these repeats are more complicated than previously suggested.
Collapse
|
47
|
Hamilton EP, Kapusta A, Huvos PE, Bidwell SL, Zafar N, Tang H, Hadjithomas M, Krishnakumar V, Badger JH, Caler EV, Russ C, Zeng Q, Fan L, Levin JZ, Shea T, Young SK, Hegarty R, Daza R, Gujja S, Wortman JR, Birren BW, Nusbaum C, Thomas J, Carey CM, Pritham EJ, Feschotte C, Noto T, Mochizuki K, Papazyan R, Taverna SD, Dear PH, Cassidy-Hanley DM, Xiong J, Miao W, Orias E, Coyne RS. Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome. eLife 2016; 5. [PMID: 27892853 PMCID: PMC5182062 DOI: 10.7554/elife.19090] [Citation(s) in RCA: 108] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 11/14/2016] [Indexed: 12/30/2022] Open
Abstract
The germline genome of the binucleated ciliate Tetrahymena thermophila undergoes programmed chromosome breakage and massive DNA elimination to generate the somatic genome. Here, we present a complete sequence assembly of the germline genome and analyze multiple features of its structure and its relationship to the somatic genome, shedding light on the mechanisms of genome rearrangement as well as the evolutionary history of this remarkable germline/soma differentiation. Our results strengthen the notion that a complex, dynamic, and ongoing interplay between mobile DNA elements and the host genome have shaped Tetrahymena chromosome structure, locally and globally. Non-standard outcomes of rearrangement events, including the generation of short-lived somatic chromosomes and excision of DNA interrupting protein-coding regions, may represent novel forms of developmental gene regulation. We also compare Tetrahymena's germline/soma differentiation to that of other characterized ciliates, illustrating the wide diversity of adaptations that have occurred within this phylum.
Collapse
|
48
|
Avdeyev P, Jiang S, Aganezov S, Hu F, Alekseyev MA. Reconstruction of Ancestral Genomes in Presence of Gene Gain and Loss. J Comput Biol 2016; 23:150-64. [PMID: 26885568 DOI: 10.1089/cmb.2015.0160] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Since most dramatic genomic changes are caused by genome rearrangements as well as gene duplications and gain/loss events, it becomes crucial to understand their mechanisms and reconstruct ancestral genomes of the given genomes. This problem was shown to be NP-complete even in the "simplest" case of three genomes, thus calling for heuristic rather than exact algorithmic solutions. At the same time, a larger number of input genomes may actually simplify the problem in practice as it was earlier illustrated with MGRA, a state-of-the-art software tool for reconstruction of ancestral genomes of multiple genomes. One of the key obstacles for MGRA and other similar tools is presence of breakpoint reuses when the same breakpoint region is broken by several different genome rearrangements in the course of evolution. Furthermore, such tools are often limited to genomes composed of the same genes with each gene present in a single copy in every genome. This limitation makes these tools inapplicable for many biological datasets and degrades the resolution of ancestral reconstructions in diverse datasets. We address these deficiencies by extending the MGRA algorithm to genomes with unequal gene contents. The developed next-generation tool MGRA2 can handle gene gain/loss events and shares the ability of MGRA to reconstruct ancestral genomes uniquely in the case of limited breakpoint reuse. Furthermore, MGRA2 employs a number of novel heuristics to cope with higher breakpoint reuse and process datasets inaccessible for MGRA. In practical experiments, MGRA2 shows superior performance for simulated and real genomes as compared to other ancestral genome reconstruction tools.
Collapse
|
49
|
Yu S, Hao F, Leong HW. An O([Formula: see text]) algorithm for sorting signed genomes by reversals, transpositions, transreversals and block-interchanges. J Bioinform Comput Biol 2015; 14:1640002. [PMID: 26707923 DOI: 10.1142/s0219720016400023] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We consider the problem of sorting signed permutations by reversals, transpositions, transreversals, and block-interchanges. The problem arises in the study of species evolution via large-scale genome rearrangement operations. Recently, Hao et al. gave a 2-approximation scheme called genome sorting by bridges (GSB) for solving this problem. Their result extended and unified the results of (i) He and Chen - a 2-approximation algorithm allowing reversals, transpositions, and block-interchanges (by also allowing transversals) and (ii) Hartman and Sharan - a 1.5-approximation algorithm allowing reversals, transpositions, and transversals (by also allowing block-interchanges). The GSB result is based on introduction of three bridge structures in the breakpoint graph, the L-bridge, T-bridge, and X-bridge that models goodreversal, transposition/transreversal, and block-interchange, respectively. However, the paper by Hao et al. focused on proving the 2-approximation GSB scheme and only mention a straightforward [Formula: see text] algorithm. In this paper, we give an [Formula: see text] algorithm for implementing the GSB scheme. The key idea behind our faster GSB algorithm is to represent cycles in the breakpoint graph by their canonical sequences, which greatly simplifies the search for these bridge structures. We also give some comparison results (running time and computed distances) against the original GSB implementation.
Collapse
|
50
|
Bacterial clade with the ribosomal RNA operon on a small plasmid rather than the chromosome. Proc Natl Acad Sci U S A 2015; 112:14343-7. [PMID: 26534993 DOI: 10.1073/pnas.1514326112] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
rRNA is essential for life because of its functional importance in protein synthesis. The rRNA (rrn) operon encoding 16S, 23S, and 5S rRNAs is located on the "main" chromosome in all bacteria documented to date and is frequently used as a marker of chromosomes. Here, our genome analysis of a plant-associated alphaproteobacterium, Aureimonas sp. AU20, indicates that this strain has its sole rrn operon on a small (9.4 kb), high-copy-number replicon. We designated this unusual replicon carrying the rrn operon on the background of an rrn-lacking chromosome (RLC) as the rrn-plasmid. Four of 12 strains close to AU20 also had this RLC/rrn-plasmid organization. Phylogenetic analysis showed that those strains having the RLC/rrn-plasmid organization represented one clade within the genus Aureimonas. Our finding introduces a previously unaddressed viewpoint into studies of genetics, genomics, and evolution in microbiology and biology in general.
Collapse
|