1
|
Shukla HG, Chakraborty M, Emerson J. Genetic variation in recalcitrant repetitive regions of the Drosophila melanogaster genome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.11.598575. [PMID: 38915508 PMCID: PMC11195212 DOI: 10.1101/2024.06.11.598575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Many essential functions of organisms are encoded in highly repetitive genomic regions, including histones involved in DNA packaging, centromeres that are core components of chromosome segregation, ribosomal RNA comprising the protein translation machinery, telomeres that ensure chromosome integrity, piRNA clusters encoding host defenses against selfish elements, and virtually the entire Y chromosome. These regions, formed by highly similar tandem arrays, pose significant challenges for experimental and informatic study, impeding sequence-level descriptions essential for understanding genetic variation. Here, we report the assembly and variation analysis of such repetitive regions in Drosophila melanogaster, offering significant improvements to the existing community reference assembly. Our work successfully recovers previously elusive segments, including complete reconstructions of the histone locus and the pericentric heterochromatin of the X chromosome, spanning the Stellate locus to the distal flank of the rDNA cluster. To infer structural changes in these regions where alignments are often not practicable, we introduce landmark anchors based on unique variants that are putatively orthologous. These regions display considerable structural variation between different D. melanogaster strains, exhibiting differences in copy number and organization of homologous repeat units between haplotypes. In the histone cluster, although we observe minimal genetic exchange indicative of crossing over, the variation patterns suggest mechanisms such as unequal sister chromatid exchange. We also examine the prevalence and scale of concerted evolution in the histone and Stellate clusters and discuss the mechanisms underlying these observed patterns.
Collapse
Affiliation(s)
- Harsh G. Shukla
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Graduate Program in Mathematical, Computational and Systems Biology, University of California Irvine, Irvine, California 92697, USA
| | - Mahul Chakraborty
- Department of Biology, Texas A&M University, College Station, Texas 77843, USA
| | - J.J. Emerson
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
| |
Collapse
|
2
|
Tunjić-Cvitanić M, García-Souto D, Pasantes JJ, Šatović-Vukšić E. Dominance of transposable element-related satDNAs results in great complexity of "satDNA library" and invokes the extension towards "repetitive DNA library". MARINE LIFE SCIENCE & TECHNOLOGY 2024; 6:236-251. [PMID: 38827134 PMCID: PMC11136912 DOI: 10.1007/s42995-024-00218-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 02/26/2024] [Indexed: 06/04/2024]
Abstract
Research on bivalves is fast-growing, including genome-wide analyses and genome sequencing. Several characteristics qualify oysters as a valuable model to explore repetitive DNA sequences and their genome organization. Here we characterize the satellitomes of five species in the family Ostreidae (Crassostrea angulata, C. virginica, C. hongkongensis, C. ariakensis, Ostrea edulis), revealing a substantial number of satellite DNAs (satDNAs) per genome (ranging between 33 and 61) and peculiarities in the composition of their satellitomes. Numerous satDNAs were either associated to or derived from transposable elements, displaying a scarcity of transposable element-unrelated satDNAs in these genomes. Due to the non-conventional satellitome constitution and dominance of Helitron-associated satDNAs, comparative satellitomics demanded more in-depth analyses than standardly employed. Comparative analyses (including C. gigas, the first bivalve species with a defined satellitome) revealed that 13 satDNAs occur in all six oyster genomes, with Cg170/HindIII satDNA being the most abundant in all of them. Evaluating the "satDNA library model" highlighted the necessity to adjust this term when studying tandem repeat evolution in organisms with such satellitomes. When repetitive sequences with potential variation in the organizational form and repeat-type affiliation are examined across related species, the introduction of the terms "TE library" and "repetitive DNA library" becomes essential. Supplementary Information The online version contains supplementary material available at 10.1007/s42995-024-00218-0.
Collapse
Affiliation(s)
| | - Daniel García-Souto
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, 15706 Santiago de Compostela, Spain
- Department of Zoology, Genetics and Physical Anthropology, Universidade de Santiago de Compostela, 15706 Santiago de Compostela, Spain
| | - Juan J. Pasantes
- Centro de Investigación Mariña, Dpto de Bioquímica, Xenética e Inmunoloxía, Universidade de Vigo, 36310 Vigo, Spain
| | - Eva Šatović-Vukšić
- Division of Molecular Biology, Ruđer Bošković Institute, 10000 Zagreb, Croatia
| |
Collapse
|
3
|
Petraccioli A, Maio N, Carotenuto R, Odierna G, Guarino FM. The Satellite DNA PcH-Sat, Isolated and Characterized in the Limpet Patella caerulea (Mollusca, Gastropoda), Suggests the Origin from a Nin-SINE Transposable Element. Genes (Basel) 2024; 15:541. [PMID: 38790169 PMCID: PMC11121367 DOI: 10.3390/genes15050541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 04/16/2024] [Accepted: 04/23/2024] [Indexed: 05/26/2024] Open
Abstract
Satellite DNA (sat-DNA) was previously described as junk and selfish DNA in the cellular economy, without a clear functional role. However, during the last two decades, evidence has been accumulated about the roles of sat-DNA in different cellular functions and its probable involvement in tumorigenesis and adaptation to environmental changes. In molluscs, studies on sat-DNAs have been performed mainly on bivalve species, especially those of economic interest. Conversely, in Gastropoda (which includes about 80% of the currently described molluscs species), studies on sat-DNA have been largely neglected. In this study, we isolated and characterized a sat-DNA, here named PcH-sat, in the limpet Patella caerulea using the restriction enzyme method, particularly HaeIII. Monomeric units of PcH-sat are 179 bp long, AT-rich (58.7%), and with an identity among monomers ranging from 91.6 to 99.8%. Southern blot showed that PcH-sat is conserved in P. depressa and P. ulyssiponensis, while a smeared signal of hybridization was present in the other three investigated limpets (P. ferruginea, P. rustica and P. vulgata). Dot blot showed that PcH-sat represents about 10% of the genome of P. caerulea, 5% of that of P. depressa, and 0.3% of that of P. ulyssiponensis. FISH showed that PcH-sat was mainly localized on pericentromeric regions of chromosome pairs 2 and 4-7 of P. caerulea (2n = 18). A database search showed that PcH-sat contains a large segment (of 118 bp) showing high identity with a homologous trait of the Nin-SINE transposable element (TE) of the patellogastropod Lottia gigantea, supporting the hypothesis that TEs are involved in the rising and tandemization processes of sat-DNAs.
Collapse
Affiliation(s)
| | | | | | - Gaetano Odierna
- Department of Biology, University of Naples Federico II, Via Cinthia, I-80126 Naples, Italy; (A.P.); (N.M.); (R.C.); (F.M.G.)
| | | |
Collapse
|
4
|
Flynn JM, Yamashita YM. The implications of satellite DNA instability on cellular function and evolution. Semin Cell Dev Biol 2024; 156:152-159. [PMID: 37852904 DOI: 10.1016/j.semcdb.2023.10.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/21/2023] [Accepted: 10/11/2023] [Indexed: 10/20/2023]
Abstract
Abundant tandemly repeated satellite DNA is present in most eukaryotic genomes. Previous limitations including a pervasive view that it was uninteresting junk DNA, combined with challenges in studying it, are starting to dissolve - and recent studies have found important functions for satellite DNAs. The observed rapid evolution and implied instability of satellite DNA now has important significance for their functions and maintenance within the genome. In this review, we discuss the processes that lead to satellite DNA copy number instability, and the importance of mechanisms to manage the potential negative effects of instability. Satellite DNA is vulnerable to challenges during replication and repair, since it forms difficult-to-process secondary structures and its homology within tandem arrays can result in various types of recombination. Satellite DNA instability may be managed by DNA or chromatin-binding proteins ensuring proper nuclear localization and repair, or by proteins that process aberrant structures that satellite DNAs tend to form. We also discuss the pattern of satellite DNA mutations from recent mutation accumulation (MA) studies that have tracked changes in satellite DNA for up to 1000 generations with minimal selection. Finally, we highlight examples of satellite evolution from studies that have characterized satellites across millions of years of Drosophila fruit fly evolution, and discuss possible ways that selection might act on the satellite DNA composition.
Collapse
Affiliation(s)
- Jullien M Flynn
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA.
| | - Yukiko M Yamashita
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA; Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
5
|
Jedlička P, Tokan V, Kejnovská I, Hobza R, Kejnovský E. Telomeric retrotransposons show propensity to form G-quadruplexes in various eukaryotic species. Mob DNA 2023; 14:3. [PMID: 37038191 PMCID: PMC10088271 DOI: 10.1186/s13100-023-00291-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 03/07/2023] [Indexed: 04/12/2023] Open
Abstract
BACKGROUND Canonical telomeres (telomerase-synthetised) are readily forming G-quadruplexes (G4) on the G-rich strand. However, there are examples of non-canonical telomeres among eukaryotes where telomeric tandem repeats are invaded by specific retrotransposons. Drosophila melanogaster represents an extreme example with telomeres composed solely by three retrotransposons-Het-A, TAHRE and TART (HTT). Even though non-canonical telomeres often show strand biased G-distribution, the evidence for the G4-forming potential is limited. RESULTS Using circular dichroism spectroscopy and UV absorption melting assay we have verified in vitro G4-formation in the HTT elements of D. melanogaster. Namely 3 in Het-A, 8 in TART and 2 in TAHRE. All the G4s are asymmetrically distributed as in canonical telomeres. Bioinformatic analysis showed that asymmetric distribution of potential quadruplex sequences (PQS) is common in telomeric retrotransposons in other Drosophila species. Most of the PQS are located in the gag gene where PQS density correlates with higher DNA sequence conservation and codon selection favoring G4-forming potential. The importance of G4s in non-canonical telomeres is further supported by analysis of telomere-associated retrotransposons from various eukaryotic species including green algae, Diplomonadida, fungi, insects and vertebrates. Virtually all analyzed telomere-associated retrotransposons contained PQS, frequently with asymmetric strand distribution. Comparison with non-telomeric elements showed independent selection of PQS-rich elements from four distinct LINE clades. CONCLUSION Our findings of strand-biased G4-forming motifs in telomere-associated retrotransposons from various eukaryotic species support the G4-formation as one of the prerequisites for the recruitment of specific retrotransposons to chromosome ends and call for further experimental studies.
Collapse
Affiliation(s)
- Pavel Jedlička
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200, Brno, Czech Republic
| | - Viktor Tokan
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200, Brno, Czech Republic.
| | - Iva Kejnovská
- Department of Biophysics of Nucleic Acids, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200, Brno, Czech Republic
| | - Roman Hobza
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200, Brno, Czech Republic
| | - Eduard Kejnovský
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, 61200, Brno, Czech Republic.
| |
Collapse
|
6
|
Šatović-Vukšić E, Plohl M. Satellite DNAs-From Localized to Highly Dispersed Genome Components. Genes (Basel) 2023; 14:genes14030742. [PMID: 36981013 PMCID: PMC10048060 DOI: 10.3390/genes14030742] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 03/15/2023] [Accepted: 03/16/2023] [Indexed: 03/30/2023] Open
Abstract
According to the established classical view, satellite DNAs are defined as abundant non-coding DNA sequences repeated in tandem that build long arrays located in heterochromatin. Advances in sequencing methodologies and development of specialized bioinformatics tools enabled defining a collection of all repetitive DNAs and satellite DNAs in a genome, the repeatome and the satellitome, respectively, as well as their reliable annotation on sequenced genomes. Supported by various non-model species included in recent studies, the patterns of satellite DNAs and satellitomes as a whole showed much more diversity and complexity than initially thought. Differences are not only in number and abundance of satellite DNAs but also in their distribution across the genome, array length, interspersion patterns, association with transposable elements, localization in heterochromatin and/or in euchromatin. In this review, we compare characteristic organizational features of satellite DNAs and satellitomes across different animal and plant species in order to summarize organizational forms and evolutionary processes that may lead to satellitomes' diversity and revisit some basic notions regarding repetitive DNA landscapes in genomes.
Collapse
Affiliation(s)
- Eva Šatović-Vukšić
- Division of Molecular Biology, Ruđer Bošković Institute, 10000 Zagreb, Croatia
| | - Miroslav Plohl
- Division of Molecular Biology, Ruđer Bošković Institute, 10000 Zagreb, Croatia
| |
Collapse
|
7
|
Peona V, Kutschera VE, Blom MPK, Irestedt M, Suh A. Satellite DNA evolution in Corvoidea inferred from short and long reads. Mol Ecol 2023; 32:1288-1305. [PMID: 35488497 DOI: 10.1111/mec.16484] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 04/11/2022] [Accepted: 04/17/2022] [Indexed: 11/29/2022]
Abstract
Satellite DNA (satDNA) is a fast-evolving portion of eukaryotic genomes. The homogeneous and repetitive nature of such satDNA causes problems during the assembly of genomes, and therefore it is still difficult to study it in detail in nonmodel organisms as well as across broad evolutionary timescales. Here, we combined the use of short- and long-read data to explore the diversity and evolution of satDNA between individuals of the same species and between genera of birds spanning ~40 millions of years of bird evolution using birds-of-paradise (Paradisaeidae) and crow (Corvus) species. These avian species highlighted the presence of a GC-rich Corvoidea satellitome composed of 61 satellite families and provided a set of candidate satDNA monomers for being centromeric on the basis of length, abundance, homogeneity and transcription. Surprisingly, we found that the satDNA of crow species rapidly diverged between closely related species while the satDNA appeared more similar between birds-of-paradise species belonging to different genera.
Collapse
Affiliation(s)
- Valentina Peona
- Department of Organismal Biology - Systematic Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Verena E Kutschera
- Department of Biochemistry and Biophysics, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Stockholm University, Solna, Sweden
| | - Mozes P K Blom
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden.,Museum für Naturkunde, Leibniz Institut für Evolutions- und Biodiversitätsforschung, Berlin, Germany
| | - Martin Irestedt
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - Alexander Suh
- Department of Organismal Biology - Systematic Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.,School of Biological Sciences-Organisms and the Environment, University of East Anglia, Norwich, UK
| |
Collapse
|
8
|
Koga A, Hashimoto K, Honda Y, Nishihara H. Marsupial genome analysis suggests that satellite DNA formation from walb endogenous retrovirus is an event specific to the red-necked wallaby. Genes Cells 2023; 28:149-155. [PMID: 36527312 DOI: 10.1111/gtc.12999] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 11/29/2022] [Accepted: 12/11/2022] [Indexed: 12/23/2022]
Abstract
We recently identified walbRep, a satellite DNA residing in the genome of the red-necked wallaby Notamacropus rufogriseus. It originates from the walb endogenous retrovirus and is organized in a manner in which the provirus structure is retained. The walbRep repeat units feature an average pairwise nucleotide identity as high as 99.5%, raising the possibility of a recent origin. The tammar wallaby N. eugenii is a species estimated to have diverged from the red-necked wallaby 2-3 million years ago. In PCR analyses of these two and other related species, walbRep-specific fragment amplification was observed only in the red-necked wallaby. Sequence database searches for the tammar wallaby resulted in sequence alignment lists that were sufficiently powerful to exclude the possibility of walbRep existence. These results suggested that the walbRep formation occurred in the red-necked wallaby lineage after its divergence from the tammar wallaby lineage, thus in a time span of maximum 3 million years.
Collapse
Affiliation(s)
- Akihiko Koga
- Center for Evolutionary Origins of Human Behavior, Kyoto University, Inuyama, Japan
| | | | - Yusuke Honda
- Noichi Zoological Park of Kochi Prefecture, Konan, Japan
| | - Hidenori Nishihara
- School of Life Science and Technology, Tokyo Institute of Technology, Yokohama, Japan
| |
Collapse
|
9
|
Zattera ML, Bruschi DP. Transposable Elements as a Source of Novel Repetitive DNA in the Eukaryote Genome. Cells 2022; 11:3373. [PMID: 36359770 PMCID: PMC9659126 DOI: 10.3390/cells11213373] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 09/23/2022] [Accepted: 09/26/2022] [Indexed: 12/02/2022] Open
Abstract
The impact of transposable elements (TEs) on the evolution of the eukaryote genome has been observed in a number of biological processes, such as the recruitment of the host's gene expression network or the rearrangement of genome structure. However, TEs may also provide a substrate for the emergence of novel repetitive elements, which contribute to the generation of new genomic components during the course of the evolutionary process. In this review, we examine published descriptions of TEs that give rise to tandem sequences in an attempt to comprehend the relationship between TEs and the emergence of de novo satellite DNA families in eukaryotic organisms. We evaluated the intragenomic behavior of the TEs, the role of their molecular structure, and the chromosomal distribution of the paralogous copies that generate arrays of repeats as a substrate for the emergence of new repetitive elements in the genome. We highlight the involvement and importance of TEs in the eukaryote genome and its remodeling processes.
Collapse
Affiliation(s)
- Michelle Louise Zattera
- Departamento de Genética, Programa de Pós-Graduação em Genética, Setor de Ciências Biológicas, Universidade Federal do Paraná, Curitiba 81530-000, PR, Brazil
| | - Daniel Pacheco Bruschi
- Departamento de Genética, Laboratorio de Citogenética Evolutiva e Conservação Animal, Setor de Ciências Biológicas, Universidade Federal do Paraná, Curitiba 81530-000, PR, Brazil
| |
Collapse
|
10
|
Ubi BE, Gorafi YSA, Yaakov B, Monden Y, Kashkush K, Tsujimoto H. Exploiting the miniature inverted-repeat transposable elements insertion polymorphisms as an efficient DNA marker system for genome analysis and evolutionary studies in wheat and related species. FRONTIERS IN PLANT SCIENCE 2022; 13:995586. [PMID: 36119578 PMCID: PMC9479669 DOI: 10.3389/fpls.2022.995586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Accepted: 08/09/2022] [Indexed: 06/15/2023]
Abstract
Transposable elements (TEs) constitute ~80% of the complex bread wheat genome and contribute significantly to wheat evolution and environmental adaptation. We studied 52 TE insertion polymorphism markers to ascertain their efficiency as a robust DNA marker system for genetic studies in wheat and related species. Significant variation was found in miniature inverted-repeat transposable element (MITE) insertions in relation to ploidy with the highest number of "full site" insertions occurring in the hexaploids (32.6 ± 3.8), while the tetraploid and diploid progenitors had 22.3 ± 0.6 and 15.0 ± 3.5 "full sites," respectively, which suggested a recent rapid activation of these transposons after the formation of wheat. Constructed phylogenetic trees were consistent with the evolutionary history of these species which clustered mainly according to ploidy and genome types (SS, AA, DD, AABB, and AABBDD). The synthetic hexaploids sub-clustered near the tetraploid species from which they were re-synthesized. Preliminary genotyping in 104 recombinant inbred lines (RILs) showed predominantly 1:1 segregation for simplex markers, with four of these markers already integrated into our current DArT-and SNP-based linkage map. The MITE insertions also showed stability with no single excision observed. The MITE insertion site polymorphisms uncovered in this study are very promising as high-potential evolutionary markers for genomic studies in wheat.
Collapse
Affiliation(s)
- Benjamin Ewa Ubi
- Molecular Breeding Laboratory, Arid Land Research Center, Tottori University, Tottori, Japan
- Department of Biotechnology, Ebonyi State University, Abakaliki, Abakaliki, Ebonyi, Nigeria
| | - Yasir Serag Alnor Gorafi
- International Platform for Dryland Research and Education, Tottori University, Tottori, Japan
- Agricultural Research Corporation, Wad Medani, Sudan
| | - Beery Yaakov
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Yuki Monden
- Graduate School of Environmental and Life Science, Okayama University, Okayama, Japan
| | - Khalil Kashkush
- Department of Life Sciences, Ben-Gurion University, Beer-Sheva, Israel
| | - Hisashi Tsujimoto
- Molecular Breeding Laboratory, Arid Land Research Center, Tottori University, Tottori, Japan
| |
Collapse
|
11
|
Hayashi S, Honda Y, Kanesaki E, Koga A. Marsupial satellite DNA as faithful reflections of long terminal repeat (LTR) retroelement structure. Genome 2022; 65:469-478. [PMID: 35930809 DOI: 10.1139/gen-2022-0039] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Long terminal repeat (LTR) retroelements, including endogenous retroviruses, are one of the origins of satellite DNAs. However, the vast majority of satellite DNAs originating from LTR retroelements consist of parts of the element. In addition, they frequently contain sequences unrelated to that element. Here we report a novel marsupial satellite DNA (named walbRep) that contains, and consists solely of, the entire sequence of an LTR retroelement (the walb element). As is common with LTR retroelements, walb copies exhibit length variation. We focused on the abundance of copies of a specific length (2.7 kb) in the genome of the red-necked wallaby. Cloning and analyses of long genomic DNA fragments revealed a satellite DNA in which the LTR sequence (0.4 kb) and the sequence of the internal region of a nonautonomous walb copy (2.3 kb) were repeated alternately. The junctions between these two components exhibited the same end-to-end arrangements as those in the walb element. This satellite organization could be accounted for by a simple formation model that includes slippage during chromosome pairing followed by homologous recombination but does not invoke any other types of rearrangements. We discuss the possible reasons why satellite DNAs having such structures are rarely found in mammals.
Collapse
Affiliation(s)
| | - Yusuke Honda
- Noichi Zoological Park of Kochi Prefecture, Konan, Japan;
| | | | | |
Collapse
|
12
|
Li F, Lee M, Esnault C, Wendover K, Guo Y, Atkins P, Zaratiegui M, Levin HL. Identification of an integrase-independent pathway of retrotransposition. SCIENCE ADVANCES 2022; 8:eabm9390. [PMID: 35767609 DOI: 10.1126/sciadv.abm9390] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Retroviruses and long terminal repeat retrotransposons rely on integrase (IN) to insert their complementary DNA (cDNA) into the genome of host cells. Nevertheless, in the absence of IN, retroelements can retain notable levels of insertion activity. We have characterized the IN-independent pathway of Tf1 and found that insertion sites had homology to the primers of reverse transcription, which form single-stranded DNAs at the termini of the cDNA. In the absence of IN activity, a similar bias was observed with HIV-1. Our studies showed that the Tf1 insertions result from single-strand annealing, a noncanonical form of homologous recombination mediated by Rad52. By expanding our analysis of insertions to include repeat sequences, we found most formed tandem elements by inserting at preexisting copies of a related transposable element. Unexpectedly, we found that wild-type Tf1 uses the IN-independent pathway as an alternative mode of insertion.
Collapse
Affiliation(s)
- Feng Li
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| | - Michael Lee
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| | - Caroline Esnault
- Bioinformatics and Scientific Programming Core, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| | - Katie Wendover
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| | - Yabin Guo
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| | - Paul Atkins
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| | - Mikel Zaratiegui
- Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Nelson Biological Laboratories A133, 604 Allison Rd., Piscataway, NJ 08854, USA
| | - Henry L Levin
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
13
|
Delihas N. An ancestral genomic sequence that serves as a nucleation site for de novo gene birth. PLoS One 2022; 17:e0267864. [PMID: 35552551 PMCID: PMC9097989 DOI: 10.1371/journal.pone.0267864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 04/17/2022] [Indexed: 11/24/2022] Open
Abstract
The process of gene birth is of major interest with current excitement concerning de novo gene formation. We report a new and different mechanism of de novo gene birth based on the finding and the characteristics of a short non-coding sequence situated between two protein genes, termed a spacer sequence. This non-coding sequence is present in genomes of Mus musculus, the house mouse and Philippine tarsier, a primitive ancestral primate. The ancestral sequence is highly conserved during primate evolution with certain base pairs totally invariant from mouse to humans. By following the birth of the sequence of human lincRNA BCRP3 (BCR activator of RhoGEF and GTPase 3 pseudogene) during primate evolution, we find diverse genes, long non-coding RNA and protein genes (and sequences that do not appear to encode a gene) that all stem from the 3’ end of the spacer, and all begin with a similar sequence. During primate evolution, part of the BCRP3 sequence initially formed in the Old World Monkeys and developed into different primate genes before evolving into the BCRP3 gene in humans. The gene developmental process consists of the initiation of DNA synthesis at spacer 3’ ends, addition of a complex of tandem transposable elements and the addition of a segment of another gene. The findings support the concept of the spacer sequence as a starting site for DNA synthesis that leads to formation of different genes with the addition of other sequences. These data suggest a new process of de novo gene birth.
Collapse
Affiliation(s)
- Nicholas Delihas
- Department of Microbiology and Immunology, Renaissance School of Medicine, Stony Brook University, Stony Brook, New York, United States of America
- * E-mail:
| |
Collapse
|
14
|
Wei KHC, Mai D, Chatla K, Bachtrog D. Dynamics and Impacts of Transposable Element Proliferation in the Drosophila nasuta Species Group Radiation. Mol Biol Evol 2022; 39:msac080. [PMID: 35485457 PMCID: PMC9075770 DOI: 10.1093/molbev/msac080] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Transposable element (TE) mobilization is a constant threat to genome integrity. Eukaryotic organisms have evolved robust defensive mechanisms to suppress their activity, yet TEs can escape suppression and proliferate, creating strong selective pressure for host defense to adapt. This genomic conflict fuels a never-ending arms race that drives the rapid evolution of TEs and recurrent positive selection of genes involved in host defense; the latter has been shown to contribute to postzygotic hybrid incompatibility. However, how TE proliferation impacts genome and regulatory divergence remains poorly understood. Here, we report the highly complete and contiguous (N50 = 33.8-38.0 Mb) genome assemblies of seven closely related Drosophila species that belong to the nasuta species group-a poorly studied group of flies that radiated in the last 2 My. We constructed a high-quality de novo TE library and gathered germline RNA-seq data, which allowed us to comprehensively annotate and compare TE insertion patterns between the species, and infer the evolutionary forces controlling their spread. We find a strong negative association between TE insertion frequency and expression of genes nearby; this likely reflects survivor bias from reduced fitness impact of TEs inserting near lowly expressed, nonessential genes, with limited TE-induced epigenetic silencing. Phylogenetic analyses of insertions of 147 TE families reveal that 53% of them show recent amplification in at least one species. The most highly amplified TE is a nonautonomous DNA element (Drosophila INterspersed Element; DINE) which has gone through multiple bouts of expansions with thousands of full-length copies littered throughout each genome. Across all TEs, we find that TEs expansions are significantly associated with high expression in the expanded species consistent with suppression escape. Thus, whereas horizontal transfer followed by the invasion of a naïve genome has been highlighted to explain the long-term survival of TEs, our analysis suggests that evasion of host suppression of resident TEs is a major strategy to persist over evolutionary times. Altogether, our results shed light on the heterogenous and context-dependent nature in which TEs affect gene regulation and the dynamics of rampant TE proliferation amidst a recently radiated species group.
Collapse
Affiliation(s)
- Kevin H.-C. Wei
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Dat Mai
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Kamalakar Chatla
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Doris Bachtrog
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
15
|
Nguyen A, Wang W, Chong E, Chatla K, Bachtrog D. Transposable element accumulation drives size differences among polymorphic Y Chromosomes in Drosophila. Genome Res 2022; 32:1074-1088. [PMID: 35501131 DOI: 10.1101/gr.275996.121] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 04/15/2022] [Indexed: 11/24/2022]
Abstract
Y Chromosomes of many species are gene poor and show low levels of nucleotide variation, yet often display high amounts of structural diversity. Dobzhansky cataloged several morphologically distinct Y Chromosomes in Drosophila pseudoobscura that differ in size and shape, but the molecular causes of their dramatic size differences are unclear. Here we use cytogenetics and long-read sequencing to study the sequence content of polymorphic Y Chromosomes in D. pseudoobscura We show that Y Chromosomes differ almost 2-fold in size, ranging from 30 to 60 Mb. Most of this size difference is caused by a handful of active transposable elements (TEs) that have recently expanded on the largest Y Chromosome, with different elements being responsible for Y expansion on differently sized D. pseudoobscura Y's. We show that Y Chromosomes differ in their heterochromatin enrichment, expression of Y-enriched TEs, and also influence expression of dozens of autosomal and X-linked genes. The same helitron element that showed the most drastic amplification on the largest Y in D. pseudoobscura independently amplified on a polymorphic large Y Chromosome in D. affinis, suggesting that some TEs are inherently more prone to become deregulated on Y Chromosomes.
Collapse
|
16
|
Zinshteyn D, Barbash DA. Stonewall prevents expression of ectopic genes in the ovary and accumulates at insulator elements in D. melanogaster. PLoS Genet 2022; 18:e1010110. [PMID: 35324887 PMCID: PMC8982855 DOI: 10.1371/journal.pgen.1010110] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 04/05/2022] [Accepted: 02/18/2022] [Indexed: 11/29/2022] Open
Abstract
Germline stem cells (GSCs) are the progenitor cells of the germline for the lifetime of an animal. In Drosophila, these cells reside in a cellular niche that is required for both their maintenance (self-renewal) and differentiation (asymmetric division resulting in a daughter cell that differs from the GSC). The stem cell—daughter cell transition is tightly regulated by a number of processes, including an array of proteins required for genome stability. The germline stem-cell maintenance factor Stonewall (Stwl) associates with heterochromatin, but its molecular function is poorly understood. We performed RNA-Seq on stwl mutant ovaries and found significant derepression of many transposon families but not heterochromatic genes. We also discovered inappropriate expression of multiple classes of genes. Most prominent are testis-enriched genes, including the male germline sex-determination switch Phf7, the differentiation factor bgcn, and a large testis-specific gene cluster on chromosome 2, all of which are upregulated or ectopically expressed in stwl mutant ovaries. Surprisingly, we also found that RNAi knockdown of stwl in somatic S2 cells results in ectopic expression of these testis genes. Using parallel ChIP-Seq and RNA-Seq experiments in S2 cells, we discovered that Stwl localizes upstream of transcription start sites and at heterochromatic sequences including repetitive sequences associated with telomeres. Stwl is also enriched at bgcn, suggesting that it directly regulates this essential differentiation factor. Finally, we identify Stwl binding motifs that are shared with known insulator binding proteins. We propose that Stwl affects gene regulation, including repression of male transcripts in the female germline, by binding insulators and establishing chromatin boundaries. Stem cells are defined by their ability to divide asymmetrically, resulting in a differentiated cell and a stem cell daughter. In fruit flies, sperm and egg production begins with germline stem cells (GSCs). The ability of a GSC to differentiate or self-renew is tightly regulated by a myriad of factors. Some of these are transcription factors, which are responsible for activating or suppressing other genes to promote one state in favor of another. Stonewall is an ovarian nuclear protein required for GSC self-renewal, whose molecular function is poorly understood. Here we show that Stonewall is responsible for preventing the activation of “male” molecular programming in the fruit fly ovary. When Stonewall is absent from the ovary, egg production is terminated and testis-specific genes become highly expressed, including the male transcript of Phf7, which induces male sexual identity in female germ cells. We also show that Stonewall is likely localizing to genomic insulators, which are regions of the genome that shield genes from nearby regulators. Our findings suggest that Stonewall helps to organize the genome in ovarian germ cells and prevent expression of male genes.
Collapse
Affiliation(s)
- Daniel Zinshteyn
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Daniel A. Barbash
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
- * E-mail:
| |
Collapse
|
17
|
Constitutive Heterochromatin in Eukaryotic Genomes: A Mine of Transposable Elements. Cells 2022; 11:cells11050761. [PMID: 35269383 PMCID: PMC8909793 DOI: 10.3390/cells11050761] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 02/10/2022] [Accepted: 02/18/2022] [Indexed: 12/22/2022] Open
Abstract
Transposable elements (TEs) are abundant components of constitutive heterochromatin of the most diverse evolutionarily distant organisms. TEs enrichment in constitutive heterochromatin was originally described in the model organism Drosophila melanogaster, but it is now considered as a general feature of this peculiar portion of the genomes. The phenomenon of TE enrichment in constitutive heterochromatin has been proposed to be the consequence of a progressive accumulation of transposable elements caused by both reduced recombination and lack of functional genes in constitutive heterochromatin. However, this view does not take into account classical genetics studies and most recent evidence derived by genomic analyses of heterochromatin in Drosophila and other species. In particular, the lack of functional genes does not seem to be any more a general feature of heterochromatin. Sequencing and annotation of Drosophila melanogaster constitutive heterochromatin have shown that this peculiar genomic compartment contains hundreds of transcriptionally active genes, generally larger in size than that of euchromatic ones. Together, these genes occupy a significant fraction of the genomic territory of heterochromatin. Moreover, transposable elements have been suggested to drive the formation of heterochromatin by recruiting HP1 and repressive chromatin marks. In addition, there are several pieces of evidence that transposable elements accumulation in the heterochromatin might be important for centromere and telomere structure. Thus, there may be more complexity to the relationship between transposable elements and constitutive heterochromatin, in that different forces could drive the dynamic of this phenomenon. Among those forces, preferential transposition may be an important factor. In this article, we present an overview of experimental findings showing cases of transposon enrichment into the heterochromatin and their positive evolutionary interactions with an impact to host genomes.
Collapse
|
18
|
Palazzo A, Caizzi R, Moschetti R, Marsano RM. What Have We Learned in 30 Years of Investigations on Bari Transposons? Cells 2022; 11:583. [PMID: 35159391 PMCID: PMC8834629 DOI: 10.3390/cells11030583] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 02/03/2022] [Accepted: 02/07/2022] [Indexed: 12/17/2022] Open
Abstract
Transposable elements (TEs) have been historically depicted as detrimental genetic entities that selfishly aim at perpetuating themselves, invading genomes, and destroying genes. Scientists often co-opt "special" TEs to develop new and powerful genetic tools, that will hopefully aid in changing the future of the human being. However, many TEs are gentle, rarely unleash themselves to harm the genome, and bashfully contribute to generating diversity and novelty in the genomes they have colonized, yet they offer the opportunity to develop new molecular tools. In this review we summarize 30 years of research focused on the Bari transposons. Bari is a "normal" transposon family that has colonized the genomes of several Drosophila species and introduced genomic novelties in the melanogaster species. We discuss how these results have contributed to advance the field of TE research and what future studies can still add to the current knowledge.
Collapse
|
19
|
Said I, McGurk MP, Clark AG, Barbash DA. Patterns of piRNA Regulation in Drosophila Revealed through Transposable Element Clade Inference. Mol Biol Evol 2022; 39:msab336. [PMID: 34921315 PMCID: PMC8788220 DOI: 10.1093/molbev/msab336] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Transposable elements (TEs) are self-replicating "genetic parasites" ubiquitous to eukaryotic genomes. In addition to conflict between TEs and their host genomes, TEs of the same family are in competition with each other. They compete for the same genomic niches while experiencing the same regime of copy-number selection. This suggests that competition among TEs may favor the emergence of new variants that can outcompete their ancestral forms. To investigate the sequence evolution of TEs, we developed a method to infer clades: collections of TEs that share SNP variants and represent distinct TE family lineages. We applied this method to a panel of 85 Drosophila melanogaster genomes and found that the genetic variation of several TE families shows significant population structure that arises from the population-specific expansions of single clades. We used population genetic theory to classify these clades into younger versus older clades and found that younger clades are associated with a greater abundance of sense and antisense piRNAs per copy than older ones. Further, we find that the abundance of younger, but not older clades, is positively correlated with antisense piRNA production, suggesting a general pattern where hosts preferentially produce antisense piRNAs from recently active TE variants. Together these findings suggest a pattern whereby new TE variants arise by mutation and then increase in copy number, followed by the host producing antisense piRNAs that may be used to silence these emerging variants.
Collapse
Affiliation(s)
- Iskander Said
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Michael P McGurk
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Daniel A Barbash
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| |
Collapse
|
20
|
Affiliation(s)
| | - Francisco J. Ruiz-Ruano
- Department of Organismal Biology – Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- School of Biological Sciences, Norwich Research Park University of East Anglia, Norwich, UK
| |
Collapse
|
21
|
Taming, Domestication and Exaptation: Trajectories of Transposable Elements in Genomes. Cells 2021; 10:cells10123590. [PMID: 34944100 PMCID: PMC8700633 DOI: 10.3390/cells10123590] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Revised: 11/30/2021] [Accepted: 12/06/2021] [Indexed: 02/06/2023] Open
Abstract
During evolution, several types of sequences pass through genomes. Along with mutations and internal genetic tinkering, they are a useful source of genetic variability for adaptation and evolution. Most of these sequences are acquired by horizontal transfers (HT), but some of them may come from the genomes themselves. If they are not lost or eliminated quickly, they can be tamed, domesticated, or even exapted. Each of these processes results from a series of events, depending on the interactions between these sequences and the host genomes, but also on environmental constraints, through their impact on individuals or population fitness. After a brief reminder of the characteristics of each of these states (taming, domestication, exaptation), the evolutionary trajectories of these new or acquired sequences will be presented and discussed, emphasizing that they are not totally independent insofar as the first can constitute a step towards the second, and the second is another step towards the third.
Collapse
|
22
|
Berloco MF, Minervini CF, Moschetti R, Palazzo A, Viggiano L, Marsano RM. Evidence of the Physical Interaction between Rpl22 and the Transposable Element Doc5, a Heterochromatic Transposon of Drosophila melanogaster. Genes (Basel) 2021; 12:1997. [PMID: 34946947 PMCID: PMC8701128 DOI: 10.3390/genes12121997] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2021] [Revised: 12/06/2021] [Accepted: 12/12/2021] [Indexed: 11/16/2022] Open
Abstract
Chromatin is a highly dynamic biological entity that allows for both the control of gene expression and the stabilization of chromosomal domains. Given the high degree of plasticity observed in model and non-model organisms, it is not surprising that new chromatin components are frequently described. In this work, we tested the hypothesis that the remnants of the Doc5 transposable element, which retains a heterochromatin insertion pattern in the melanogaster species complex, can be bound by chromatin proteins, and thus be involved in the organization of heterochromatic domains. Using the Yeast One Hybrid approach, we found Rpl22 as a potential interacting protein of Doc5. We further tested in vitro the observed interaction through Electrophoretic Mobility Shift Assay, uncovering that the N-terminal portion of the protein is sufficient to interact with Doc5. However, in situ localization of the native protein failed to detect Rpl22 association with chromatin. The results obtained are discussed in the light of the current knowledge on the extra-ribosomal role of ribosomal protein in eukaryotes, which suggests a possible role of Rpl22 in the determination of the heterochromatin in Drosophila.
Collapse
Affiliation(s)
- Maria Francesca Berloco
- Department of Biology, University of Bari “Aldo Moro”, 70126 Bari, Italy; (M.F.B.); (R.M.); (A.P.)
| | - Crescenzio Francesco Minervini
- Department of Emergency and Organ Transplantation (D.E.T.O.), Hematology and Stem Cell Transplantation Unit, University of Bari “Aldo Moro”, 70124 Bari, Italy;
| | - Roberta Moschetti
- Department of Biology, University of Bari “Aldo Moro”, 70126 Bari, Italy; (M.F.B.); (R.M.); (A.P.)
| | - Antonio Palazzo
- Department of Biology, University of Bari “Aldo Moro”, 70126 Bari, Italy; (M.F.B.); (R.M.); (A.P.)
| | - Luigi Viggiano
- Department of Biology, University of Bari “Aldo Moro”, 70126 Bari, Italy; (M.F.B.); (R.M.); (A.P.)
| | | |
Collapse
|
23
|
Kuhn GCS, Heringer P, Dias GB. Structure, Organization, and Evolution of Satellite DNAs: Insights from the Drosophila repleta and D. virilis Species Groups. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2021; 60:27-56. [PMID: 34386871 DOI: 10.1007/978-3-030-74889-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The fact that satellite DNAs (satDNAs) in eukaryotes are abundant genomic components, can perform functional roles, but can also change rapidly across species while being homogenous within a species, makes them an intriguing and fascinating genomic component to study. It is also becoming clear that satDNAs represent an important piece in genome architecture and that changes in their structure, organization, and abundance can affect the evolution of genomes and species in many ways. Since the discovery of satDNAs more than 50 years ago, species from the Drosophila genus have continuously been used as models to study several aspects of satDNA biology. These studies have been largely concentrated in D. melanogaster and closely related species from the Sophophora subgenus, even though the vast majority of all Drosophila species belong to the Drosophila subgenus. This chapter highlights some studies on the satDNA structure, organization, and evolution in two species groups from the Drosophila subgenus: the repleta and virilis groups. We also discuss and review the classification of other abundant tandem repeats found in these species in the light of the current information available.
Collapse
Affiliation(s)
- Gustavo C S Kuhn
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, MG, Brazil.
| | - Pedro Heringer
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, MG, Brazil
| | - Guilherme Borges Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| |
Collapse
|
24
|
Heitkam T, Schulte L, Weber B, Liedtke S, Breitenbach S, Kögler A, Morgenstern K, Brückner M, Tröber U, Wolf H, Krabel D, Schmidt T. Comparative Repeat Profiling of Two Closely Related Conifers ( Larix decidua and Larix kaempferi) Reveals High Genome Similarity With Only Few Fast-Evolving Satellite DNAs. Front Genet 2021; 12:683668. [PMID: 34322154 PMCID: PMC8312256 DOI: 10.3389/fgene.2021.683668] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Accepted: 05/25/2021] [Indexed: 12/26/2022] Open
Abstract
In eukaryotic genomes, cycles of repeat expansion and removal lead to large-scale genomic changes and propel organisms forward in evolution. However, in conifers, active repeat removal is thought to be limited, leading to expansions of their genomes, mostly exceeding 10 giga base pairs. As a result, conifer genomes are largely littered with fragmented and decayed repeats. Here, we aim to investigate how the repeat landscapes of two related conifers have diverged, given the conifers' accumulative genome evolution mode. For this, we applied low-coverage sequencing and read clustering to the genomes of European and Japanese larch, Larix decidua (Lamb.) Carrière and Larix kaempferi (Mill.), that arose from a common ancestor, but are now geographically isolated. We found that both Larix species harbored largely similar repeat landscapes, especially regarding the transposable element content. To pin down possible genomic changes, we focused on the repeat class with the fastest sequence turnover: satellite DNAs (satDNAs). Using comparative bioinformatics, Southern, and fluorescent in situ hybridization, we reveal the satDNAs' organizational patterns, their abundances, and chromosomal locations. Four out of the five identified satDNAs are widespread in the Larix genus, with two even present in the more distantly related Pseudotsuga and Abies genera. Unexpectedly, the EulaSat3 family was restricted to L. decidua and absent from L. kaempferi, indicating its evolutionarily young age. Taken together, our results exemplify how the accumulative genome evolution of conifers may limit the overall divergence of repeats after speciation, producing only few repeat-induced genomic novelties.
Collapse
Affiliation(s)
- Tony Heitkam
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Luise Schulte
- Institute of Botany, Technische Universität Dresden, Dresden, Germany.,Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | - Beatrice Weber
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Susan Liedtke
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Sarah Breitenbach
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Anja Kögler
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Kristin Morgenstern
- Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, Tharandt, Germany
| | | | - Ute Tröber
- Staatsbetrieb Sachsenforst, Pirna, Germany
| | - Heino Wolf
- Staatsbetrieb Sachsenforst, Pirna, Germany
| | - Doris Krabel
- Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, Tharandt, Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
25
|
Wei KHC, Chan C, Bachtrog D. Establishment of H3K9me3-dependent heterochromatin during embryogenesis in Drosophila miranda. eLife 2021; 10:55612. [PMID: 34128466 PMCID: PMC8285105 DOI: 10.7554/elife.55612] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 06/14/2021] [Indexed: 12/27/2022] Open
Abstract
Heterochromatin is a key architectural feature of eukaryotic genomes crucial for silencing of repetitive elements. During Drosophila embryonic cellularization, heterochromatin rapidly appears over repetitive sequences, but the molecular details of how heterochromatin is established are poorly understood. Here, we map the genome-wide distribution of H3K9me3-dependent heterochromatin in individual embryos of Drosophila miranda at precisely staged developmental time points. We find that canonical H3K9me3 enrichment is established prior to cellularization and matures into stable and broad heterochromatin domains through development. Intriguingly, initial nucleation sites of H3K9me3 enrichment appear as early as embryonic stage 3 over transposable elements (TEs) and progressively broaden, consistent with spreading to neighboring nucleosomes. The earliest nucleation sites are limited to specific regions of a small number of recently active retrotransposon families and often appear over promoter and 5' regions of LTR retrotransposons, while late nucleation sites develop broadly across the entirety of most TEs. Interestingly, early nucleating TEs are strongly associated with abundant maternal piRNAs and show early zygotic transcription. These results support a model of piRNA-associated co-transcriptional silencing while also suggesting additional mechanisms for site-restricted H3K9me3 nucleation at TEs in pre-cellular Drosophila embryos.
Collapse
Affiliation(s)
- Kevin H-C Wei
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Carolus Chan
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Doris Bachtrog
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| |
Collapse
|
26
|
Sklyar T, Kurahina N, Lavrentieva K, Burlaka V, Lykholat T, Lykholat O. Autonomic (Mobile) Genetic Elements of Bacteria and Their Hierarchy. CYTOL GENET+ 2021. [DOI: 10.3103/s0095452721030099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
27
|
Mustafin RN, Khusnutdinova EK. Involvement of transposable elements in neurogenesis. Vavilovskii Zhurnal Genet Selektsii 2021; 24:209-218. [PMID: 33659801 PMCID: PMC7893149 DOI: 10.18699/vj20.613] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The article is about the role of transposons in the regulation of functioning of neuronal stem cells and mature neurons of the human brain. Starting from the first division of the zygote, embryonic development is governed by regular activations of transposable elements, which are necessary for the sequential regulation of the expression of genes specific for each cell type. These processes include differentiation of neuronal stem cells, which requires the finest tuning of expression of neuron genes in various regions of the brain. Therefore, in the hippocampus, the center of human neurogenesis, the highest transposon activity has been identified, which causes somatic mosaicism of cells during the formation of specific brain structures. Similar data were obtained in studies on experimental animals. Mobile genetic elements are the most important sources of long non-coding RNAs that are coexpressed with important brain protein-coding genes. Significant activity of long non-coding RNA was detected in the hippocampus, which confirms the role of transposons in the regulation of brain function. MicroRNAs, many of which arise from transposon transcripts, also play an important role in regulating the differentiation of neuronal stem cells. Therefore, transposons, through their own processed transcripts, take an active part in the epigenetic regulation of differentiation of neurons. The global regulatory role of transposons in the human brain is due to the emergence of protein-coding genes in evolution by their exonization, duplication and domestication. These genes are involved in an epigenetic regulatory network with the participation of transposons, since they contain nucleotide sequences complementary to miRNA and long non-coding RNA formed from transposons. In the memory formation, the role of the exchange of virus-like mRNA with the help of the Arc protein of endogenous retroviruses HERV between neurons has been revealed. A possible mechanism for the implementation of this mechanism may be reverse transcription of mRNA and site-specific insertion into the genome with a regulatory effect on the genes involved in the memory.
Collapse
Affiliation(s)
| | - E K Khusnutdinova
- Institute of Biochemistry and Genetics - Subdivision of the Ufa Federal Research Centre of the Russian Academy of Sciences, Ufa, Russia
| |
Collapse
|
28
|
McGurk MP, Dion-Côté AM, Barbash DA. Rapid evolution at the Drosophila telomere: transposable element dynamics at an intrinsically unstable locus. Genetics 2021; 217:iyaa027. [PMID: 33724410 PMCID: PMC8045721 DOI: 10.1093/genetics/iyaa027] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 12/03/2020] [Indexed: 12/26/2022] Open
Abstract
Drosophila telomeres have been maintained by three families of active transposable elements (TEs), HeT-A, TAHRE, and TART, collectively referred to as HTTs, for tens of millions of years, which contrasts with an unusually high degree of HTT interspecific variation. While the impacts of conflict and domestication are often invoked to explain HTT variation, the telomeres are unstable structures such that neutral mutational processes and evolutionary tradeoffs may also drive HTT evolution. We leveraged population genomic data to analyze nearly 10,000 HTT insertions in 85 Drosophila melanogaster genomes and compared their variation to other more typical TE families. We observe that occasional large-scale copy number expansions of both HTTs and other TE families occur, highlighting that the HTTs are, like their feral cousins, typically repressed but primed to take over given the opportunity. However, large expansions of HTTs are not caused by the runaway activity of any particular HTT subfamilies or even associated with telomere-specific TE activity, as might be expected if HTTs are in strong genetic conflict with their hosts. Rather than conflict, we instead suggest that distinctive aspects of HTT copy number variation and sequence diversity largely reflect telomere instability, with HTT insertions being lost at much higher rates than other TEs elsewhere in the genome. We extend previous observations that telomere deletions occur at a high rate, and surprisingly discover that more than one-third do not appear to have been healed with an HTT insertion. We also report that some HTT families may be preferentially activated by the erosion of whole telomeres, implying the existence of HTT-specific host control mechanisms. We further suggest that the persistent telomere localization of HTTs may reflect a highly successful evolutionary strategy that trades away a stable insertion site in order to have reduced impact on the host genome. We propose that HTT evolution is driven by multiple processes, with niche specialization and telomere instability being previously underappreciated and likely predominant.
Collapse
Affiliation(s)
- Michael P McGurk
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Anne-Marie Dion-Côté
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Daniel A Barbash
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
29
|
Maiwald S, Weber B, Seibt KM, Schmidt T, Heitkam T. The Cassandra retrotransposon landscape in sugar beet (Beta vulgaris) and related Amaranthaceae: recombination and re-shuffling lead to a high structural variability. ANNALS OF BOTANY 2021; 127:91-109. [PMID: 33009553 PMCID: PMC7750724 DOI: 10.1093/aob/mcaa176] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 09/28/2020] [Indexed: 05/26/2023]
Abstract
BACKGROUND AND AIMS Plant genomes contain many retrotransposons and their derivatives, which are subject to rapid sequence turnover. As non-autonomous retrotransposons do not encode any proteins, they experience reduced selective constraints leading to their diversification into multiple families, usually limited to a few closely related species. In contrast, the non-coding Cassandra terminal repeat retrotransposons in miniature (TRIMs) are widespread in many plants. Their hallmark is a conserved 5S rDNA-derived promoter in their long terminal repeats (LTRs). As sugar beet (Beta vulgaris) has a well-described LTR retrotransposon landscape, we aim to characterize TRIMs in beet and related genomes. METHODS We identified Cassandra retrotransposons in the sugar beet reference genome and characterized their structural relationships. Genomic organization, chromosomal localization, and distribution of Cassandra-TRIMs across the Amaranthaceae were verified by Southern and fluorescent in situ hybridization. KEY RESULTS All 638 Cassandra sequences in the sugar beet genome contain conserved LTRs and thus constitute a single family. Nevertheless, variable internal regions required a subdivision into two Cassandra subfamilies within B. vulgaris. The related Chenopodium quinoa harbours a third subfamily. These subfamilies vary in their distribution within Amaranthaceae genomes, their insertion times and the degree of silencing by small RNAs. Cassandra retrotransposons gave rise to many structural variants, such as solo LTRs or tandemly arranged Cassandra retrotransposons. These Cassandra derivatives point to an interplay of template switch and recombination processes - mechanisms that likely caused Cassandra's subfamily formation and diversification. CONCLUSIONS We traced the evolution of Cassandra in the Amaranthaceae and detected a considerable variability within the short internal regions, whereas the LTRs are strongly conserved in sequence and length. Presumably these hallmarks make Cassandra a prime target for unequal recombination, resulting in the observed structural diversity, an example of the impact of LTR-mediated evolutionary mechanisms on the host genome.
Collapse
Affiliation(s)
- Sophie Maiwald
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Beatrice Weber
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Kathrin M Seibt
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| | - Tony Heitkam
- Institute of Botany, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
30
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
31
|
de Lima LG, Hanlon SL, Gerton JL. Origins and Evolutionary Patterns of the 1.688 Satellite DNA Family in Drosophila Phylogeny. G3 (BETHESDA, MD.) 2020; 10:4129-4146. [PMID: 32934018 PMCID: PMC7642928 DOI: 10.1534/g3.120.401727] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Accepted: 09/09/2020] [Indexed: 12/11/2022]
Abstract
Satellite DNAs (satDNAs) are a ubiquitous feature of eukaryotic genomes and are usually the major components of constitutive heterochromatin. The 1.688 satDNA, also known as the 359 bp satellite, is one of the most abundant repetitive sequences in Drosophila melanogaster and has been linked to several different biological functions. We investigated the presence and evolution of the 1.688 satDNA in 16 Drosophila genomes. We find that the 1.688 satDNA family is much more ancient than previously appreciated, being shared among part of the melanogaster group that diverged from a common ancestor ∼27 Mya. We found that the 1.688 satDNA family has two major subfamilies spread throughout Drosophila phylogeny (∼360 bp and ∼190 bp). Phylogenetic analysis of ∼10,000 repeats extracted from 14 of the species revealed that the 1.688 satDNA family is present within heterochromatin and euchromatin. A high number of euchromatic repeats are gene proximal, suggesting the potential for local gene regulation. Notably, heterochromatic copies display concerted evolution and a species-specific pattern, whereas euchromatic repeats display a more typical evolutionary pattern, suggesting that chromatin domains may influence the evolution of these sequences. Overall, our data indicate the 1.688 satDNA as the most perduring satDNA family described in Drosophila phylogeny to date. Our study provides a strong foundation for future work on the functional roles of 1.688 satDNA across many Drosophila species.
Collapse
Affiliation(s)
| | - Stacey L Hanlon
- Stowers Institute for Medical Research, Kansas City, Missouri 64110
| | | |
Collapse
|
32
|
Vojvoda Zeljko T, Pavlek M, Meštrović N, Plohl M. Satellite DNA-like repeats are dispersed throughout the genome of the Pacific oyster Crassostrea gigas carried by Helentron non-autonomous mobile elements. Sci Rep 2020; 10:15107. [PMID: 32934255 PMCID: PMC7492417 DOI: 10.1038/s41598-020-71886-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 08/11/2020] [Indexed: 01/31/2023] Open
Abstract
Satellite DNAs (satDNAs) are long arrays of tandem repeats typically located in heterochromatin and span the centromeres of eukaryotic chromosomes. Despite the wealth of knowledge about satDNAs, little is known about a fraction of short, satDNA-like arrays dispersed throughout the genome. Our survey of the Pacific oyster Crassostrea gigas sequenced genome revealed genome assembly replete with satDNA-like tandem repeats. We focused on the most abundant arrays, grouped according to sequence similarity into 13 clusters, and explored their flanking sequences. Structural analysis showed that arrays of all 13 clusters represent central repeats of 11 non-autonomous elements named Cg_HINE, which are classified into the Helentron superfamily of DNA transposons. Each of the described elements is formed by a unique combination of flanking sequences and satDNA-like central repeats, coming from one, exceptionally two clusters in a consecutive order. While some of the detected Cg_HINE elements are related according to sequence similarities in flanking and repetitive modules, others evidently arose in independent events. In addition, some of the Cg_HINE's central repeats are related to the classical C. gigas satDNA, interconnecting mobile elements and satDNAs. Genome-wide distribution of Cg_HINE implies non-autonomous Helentrons as a dynamic system prone to efficiently propagate tandem repeats in the C. gigas genome.
Collapse
Affiliation(s)
- Tanja Vojvoda Zeljko
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia
| | - Martina Pavlek
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia
| | - Nevenka Meštrović
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia
| | - Miroslav Plohl
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia.
| |
Collapse
|
33
|
Sproul JS, Khost DE, Eickbush DG, Negm S, Wei X, Wong I, Larracuente AM. Dynamic Evolution of Euchromatic Satellites on the X Chromosome in Drosophila melanogaster and the simulans Clade. Mol Biol Evol 2020; 37:2241-2256. [PMID: 32191304 PMCID: PMC7403614 DOI: 10.1093/molbev/msaa078] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Satellite DNAs (satDNAs) are among the most dynamically evolving components of eukaryotic genomes and play important roles in genome regulation, genome evolution, and speciation. Despite their abundance and functional impact, we know little about the evolutionary dynamics and molecular mechanisms that shape satDNA distributions in genomes. Here, we use high-quality genome assemblies to study the evolutionary dynamics of two complex satDNAs, Rsp-like and 1.688 g/cm3, in Drosophila melanogaster and its three nearest relatives in the simulans clade. We show that large blocks of these repeats are highly dynamic in the heterochromatin, where their genomic location varies across species. We discovered that small blocks of satDNA that are abundant in X chromosome euchromatin are similarly dynamic, with repeats changing in abundance, location, and composition among species. We detail the proliferation of a rare satellite (Rsp-like) across the X chromosome in D. simulans and D. mauritiana. Rsp-like spread by inserting into existing clusters of the older, more abundant 1.688 satellite, in events likely facilitated by microhomology-mediated repair pathways. We show that Rsp-like is abundant on extrachromosomal circular DNA in D. simulans, which may have contributed to its dynamic evolution. Intralocus satDNA expansions via unequal exchange and the movement of higher order repeats also contribute to the fluidity of the repeat landscape. We find evidence that euchromatic satDNA repeats experience cycles of proliferation and diversification somewhat analogous to bursts of transposable element proliferation. Our study lays a foundation for mechanistic studies of satDNA proliferation and the functional and evolutionary consequences of satDNA movement.
Collapse
Affiliation(s)
- John S Sproul
- Department of Biology, University of Rochester, Rochester, NY
| | | | | | - Sherif Negm
- Department of Biology, University of Rochester, Rochester, NY
| | - Xiaolu Wei
- Department of Biomedical Genetics, University of Rochester Medical Center, Rochester, NY
| | - Isaac Wong
- Department of Biology, University of Rochester, Rochester, NY
| | | |
Collapse
|
34
|
|
35
|
Kelleher ES, Barbash DA, Blumenstiel JP. Taming the Turmoil Within: New Insights on the Containment of Transposable Elements. Trends Genet 2020; 36:474-489. [PMID: 32473745 DOI: 10.1016/j.tig.2020.04.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 04/15/2020] [Accepted: 04/17/2020] [Indexed: 12/28/2022]
Abstract
Transposable elements (TEs) are mobile genetic parasites that can exponentially increase their genomic abundance through self-propagation. Classic theoretical papers highlighted the importance of two potentially escalating forces that oppose TE spread: regulated transposition and purifying selection. Here, we review new insights into mechanisms of TE regulation and purifying selection, which reveal the remarkable foresight of these theoretical models. We further highlight emergent connections between transcriptional control enacted by small RNAs and the contribution of TE insertions to structural mutation and host-gene regulation. Finally, we call for increased comparative analysis of TE dynamics and fitness effects, as well as host control mechanisms, to reveal how interconnected forces shape the differential prevalence and distribution of TEs across the tree of life.
Collapse
|
36
|
Sultana N, Menzel G, Heitkam T, Kojima KK, Bao W, Serçe S. Bioinformatic and Molecular Analysis of Satellite Repeat Diversity in Vaccinium Genomes. Genes (Basel) 2020; 11:E527. [PMID: 32397417 PMCID: PMC7290377 DOI: 10.3390/genes11050527] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Revised: 05/06/2020] [Accepted: 05/06/2020] [Indexed: 12/11/2022] Open
Abstract
Bioinformatic and molecular characterization of satellite repeats was performed to understand the impact of their diversification on Vaccinium genome evolution. Satellite repeat diversity was evaluated in four cultivated and wild species, including the diploid species Vaccinium myrtillus and Vaccinium uliginosum, as well as the tetraploid species Vaccinium corymbosum and Vaccinium arctostaphylos. We comparatively characterized six satellite repeat families using in total 76 clones with 180 monomers. We observed that the monomer units of VaccSat1, VaccSat2, VaccSat5, and VaccSat6 showed a higher order repeat (HOR) structure, likely originating from the organization of two adjacent subunits with differing similarity, length and size. Moreover, VaccSat1, VaccSat3, VaccSat6, and VaccSat7 were found to have sequence similarity to parts of transposable elements. We detected satellite-typical tandem organization for VaccSat1 and VaccSat2 in long arrays, while VaccSat5 and VaccSat6 distributed in multiple sites over all chromosomes of tetraploid V. corymbosum, presumably in long arrays. In contrast, very short arrays of VaccSat3 and VaccSat7 are dispersedly distributed over all chromosomes in the same species, likely as internal parts of transposable elements. We provide a comprehensive overview on satellite species specificity in Vaccinium, which are potentially useful as molecular markers to address the taxonomic complexity of the genus, and provide information for genome studies of this genus.
Collapse
Affiliation(s)
- Nusrat Sultana
- Faculty of Life and Earth Sciences, Jagannath University, Dhaka 1100, Bangladesh
- Faculty of Biology, Technische Universität Dresden, D-01062 Dresden, Germany; (G.M.); (T.H.)
| | - Gerhard Menzel
- Faculty of Biology, Technische Universität Dresden, D-01062 Dresden, Germany; (G.M.); (T.H.)
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, D-01062 Dresden, Germany; (G.M.); (T.H.)
| | - Kenji K. Kojima
- Genetic Information Research Institute, Cupertino, CA 95014, USA; (K.K.K.); (W.B.)
| | - Weidong Bao
- Genetic Information Research Institute, Cupertino, CA 95014, USA; (K.K.K.); (W.B.)
| | - Sedat Serçe
- Department of Agricultural Genetic Engineering, Ayhan Şahenk Faculty of Agricultural Sciences and Technologies, Niğde Ömer Halisdemir University, 51240 Niğde, Turkey;
| |
Collapse
|
37
|
Shatskikh AS, Kotov AA, Adashev VE, Bazylev SS, Olenina LV. Functional Significance of Satellite DNAs: Insights From Drosophila. Front Cell Dev Biol 2020; 8:312. [PMID: 32432114 PMCID: PMC7214746 DOI: 10.3389/fcell.2020.00312] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Accepted: 04/08/2020] [Indexed: 12/12/2022] Open
Abstract
Since their discovery more than 60 years ago, satellite repeats are still one of the most enigmatic parts of eukaryotic genomes. Being non-coding DNA, satellites were earlier considered to be non-functional “junk,” but recently this concept has been extensively revised. Satellite DNA contributes to the essential processes of formation of crucial chromosome structures, heterochromatin establishment, dosage compensation, reproductive isolation, genome stability and development. Genomic abundance of satellites is under stabilizing selection owing of their role in the maintenance of vital regions of the genome – centromeres, pericentromeric regions, and telomeres. Many satellites are transcribed with the generation of long or small non-coding RNAs. Misregulation of their expression is found to lead to various defects in the maintenance of genomic architecture, chromosome segregation and gametogenesis. This review summarizes our current knowledge concerning satellite functions, the mechanisms of regulation and evolution of satellites, focusing on recent findings in Drosophila. We discuss here experimental and bioinformatics data obtained in Drosophila in recent years, suggesting relevance of our analysis to a wide range of eukaryotic organisms.
Collapse
Affiliation(s)
- Aleksei S Shatskikh
- Laboratory of Analysis of Clinical and Model Tumor Pathologies on the Organismal Level, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Alexei A Kotov
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Vladimir E Adashev
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Sergei S Bazylev
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Ludmila V Olenina
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
38
|
Talbert PB, Henikoff S. What makes a centromere? Exp Cell Res 2020; 389:111895. [PMID: 32035948 DOI: 10.1016/j.yexcr.2020.111895] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Revised: 01/18/2020] [Accepted: 02/05/2020] [Indexed: 12/26/2022]
Abstract
Centromeres are the eukaryotic chromosomal sites at which the kinetochore forms and attaches to spindle microtubules to orchestrate chromosomal segregation in mitosis and meiosis. Although centromeres are essential for cell division, their sequences are not conserved and evolve rapidly. Centromeres vary dramatically in size and organization. Here we categorize their diversity and explore the evolutionary forces shaping them. Nearly all centromeres favor AT-rich DNA that is gene-free and transcribed at a very low level. Repair of frequent centromere-proximal breaks probably contributes to their rapid sequence evolution. Point centromeres are only ~125 bp and are specified by common protein-binding motifs, whereas short regional centromeres are 1-5 kb, typically have unique sequences, and may have pericentromeric repeats adapted to facilitate centromere clustering. Transposon-rich centromeres are often ~100-300 kb and are favored by RNAi machinery that silences transposons, by suppression of meiotic crossovers at centromeres, and by the ability of some transposons to target centromeres. Megabase-length satellite centromeres arise in plants and animals with asymmetric female meiosis that creates centromere competition, and favors satellite monomers one or two nucleosomes in length that position and stabilize centromeric nucleosomes. Holocentromeres encompass the length of a chromosome and may differ dramatically between mitosis and meiosis. We propose a model in which low level transcription of centromeres facilitates the formation of non-B DNA that specifies centromeres and promotes loading of centromeric nucleosomes.
Collapse
Affiliation(s)
- Paul B Talbert
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA
| | - Steven Henikoff
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA.
| |
Collapse
|
39
|
Vondrak T, Ávila Robledillo L, Novák P, Koblížková A, Neumann P, Macas J. Characterization of repeat arrays in ultra-long nanopore reads reveals frequent origin of satellite DNA from retrotransposon-derived tandem repeats. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:484-500. [PMID: 31559657 PMCID: PMC7004042 DOI: 10.1111/tpj.14546] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 09/09/2019] [Accepted: 09/12/2019] [Indexed: 05/21/2023]
Abstract
Amplification of monomer sequences into long contiguous arrays is the main feature distinguishing satellite DNA from other tandem repeats, yet it is also the main obstacle in its investigation because these arrays are in principle difficult to assemble. Here we explore an alternative, assembly-free approach that utilizes ultra-long Oxford Nanopore reads to infer the length distribution of satellite repeat arrays, their association with other repeats and the prevailing sequence periodicities. Using the satellite DNA-rich legume plant Lathyrus sativus as a model, we demonstrated this approach by analyzing 11 major satellite repeats using a set of nanopore reads ranging from 30 to over 200 kb in length and representing 0.73× genome coverage. We found surprising differences between the analyzed repeats because only two of them were predominantly organized in long arrays typical for satellite DNA. The remaining nine satellites were found to be derived from short tandem arrays located within LTR-retrotransposons that occasionally expanded in length. While the corresponding LTR-retrotransposons were dispersed across the genome, this array expansion occurred mainly in the primary constrictions of the L. sativus chromosomes, which suggests that these genome regions are favourable for satellite DNA accumulation.
Collapse
Affiliation(s)
- Tihana Vondrak
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
- Faculty of ScienceUniversity of South BohemiaČeské BudějoviceCzech Republic
| | - Laura Ávila Robledillo
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
- Faculty of ScienceUniversity of South BohemiaČeské BudějoviceCzech Republic
| | - Petr Novák
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| | - Andrea Koblížková
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| | - Pavel Neumann
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| | - Jiří Macas
- Biology CentreCzech Academy of SciencesBranišovská 31České BudějoviceCZ‐37005Czech Republic
| |
Collapse
|
40
|
Bracewell R, Chatla K, Nalley MJ, Bachtrog D. Dynamic turnover of centromeres drives karyotype evolution in Drosophila. eLife 2019; 8:e49002. [PMID: 31524597 PMCID: PMC6795482 DOI: 10.7554/elife.49002] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Accepted: 09/12/2019] [Indexed: 12/21/2022] Open
Abstract
Centromeres are the basic unit for chromosome inheritance, but their evolutionary dynamics is poorly understood. We generate high-quality reference genomes for multiple Drosophila obscura group species to reconstruct karyotype evolution. All chromosomes in this lineage were ancestrally telocentric and the creation of metacentric chromosomes in some species was driven by de novo seeding of new centromeres at ancestrally gene-rich regions, independently of chromosomal rearrangements. The emergence of centromeres resulted in a drastic size increase due to repeat accumulation, and dozens of genes previously located in euchromatin are now embedded in pericentromeric heterochromatin. Metacentric chromosomes secondarily became telocentric in the pseudoobscura subgroup through centromere repositioning and a pericentric inversion. The former (peri)centric sequences left behind shrunk dramatically in size after their inactivation, yet contain remnants of their evolutionary past, including increased repeat-content and heterochromatic environment. Centromere movements are accompanied by rapid turnover of the major satellite DNA detected in (peri)centromeric regions.
Collapse
Affiliation(s)
- Ryan Bracewell
- Department of Integrative BiologyUniversity of California, BerkeleyBerkeleyUnited States
| | - Kamalakar Chatla
- Department of Integrative BiologyUniversity of California, BerkeleyBerkeleyUnited States
| | - Matthew J Nalley
- Department of Integrative BiologyUniversity of California, BerkeleyBerkeleyUnited States
| | - Doris Bachtrog
- Department of Integrative BiologyUniversity of California, BerkeleyBerkeleyUnited States
| |
Collapse
|
41
|
Centromere Repeats: Hidden Gems of the Genome. Genes (Basel) 2019; 10:genes10030223. [PMID: 30884847 PMCID: PMC6471113 DOI: 10.3390/genes10030223] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Revised: 03/07/2019] [Accepted: 03/11/2019] [Indexed: 01/08/2023] Open
Abstract
Satellite DNAs are now regarded as powerful and active contributors to genomic and chromosomal evolution. Paired with mobile transposable elements, these repetitive sequences provide a dynamic mechanism through which novel karyotypic modifications and chromosomal rearrangements may occur. In this review, we discuss the regulatory activity of satellite DNA and their neighboring transposable elements in a chromosomal context with a particular emphasis on the integral role of both in centromere function. In addition, we discuss the varied mechanisms by which centromeric repeats have endured evolutionary processes, producing a novel, species-specific centromeric landscape despite sharing a ubiquitously conserved function. Finally, we highlight the role these repetitive elements play in the establishment and functionality of de novo centromeres and chromosomal breakpoints that underpin karyotypic variation. By emphasizing these unique activities of satellite DNAs and transposable elements, we hope to disparage the conventional exemplification of repetitive DNA in the historically-associated context of ‘junk’.
Collapse
|
42
|
Mustafin RN. Functional Dualism of Transposon Transcripts in Evolution of Eukaryotic Genomes. Russ J Dev Biol 2019. [DOI: 10.1134/s1062360418070019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
43
|
Dalla Benetta E, Akbari OS, Ferree PM. Sequence Expression of Supernumerary B Chromosomes: Function or Fluff? Genes (Basel) 2019; 10:E123. [PMID: 30744010 PMCID: PMC6409846 DOI: 10.3390/genes10020123] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 02/01/2019] [Accepted: 02/05/2019] [Indexed: 12/25/2022] Open
Abstract
B chromosomes are enigmatic heritable elements found in the genomes of numerous plant and animal species. Contrary to their broad distribution, most B chromosomes are non-essential. For this reason, they are regarded as genome parasites. In order to be stably transmitted through generations, many B chromosomes exhibit the ability to "drive", i.e., they transmit themselves at super-Mendelian frequencies to progeny through directed interactions with the cell division apparatus. To date, very little is understood mechanistically about how B chromosomes drive, although a likely scenario is that expression of B chromosome sequences plays a role. Here, we highlight a handful of previously identified B chromosome sequences, many of which are repetitive and non-coding in nature, that have been shown to be expressed at the transcriptional level. We speculate on how each type of expressed sequence could participate in B chromosome drive based on known functions of RNA in general chromatin- and chromosome-related processes. We also raise some challenges to functionally testing these possible roles, a goal that will be required to more fully understand whether and how B chromosomes interact with components of the cell for drive and transmission.
Collapse
Affiliation(s)
- Elena Dalla Benetta
- W. M. Keck Science Department of Claremont McKenna, Pitzer, and Scripps Colleges, Claremont, CA 91711, USA.
- Division of Biological Sciences, Section of Cell and Developmental Biology, University of California, San Diego, La Jolla, CA 92093, USA.
| | - Omar S Akbari
- Division of Biological Sciences, Section of Cell and Developmental Biology, University of California, San Diego, La Jolla, CA 92093, USA.
| | - Patrick M Ferree
- W. M. Keck Science Department of Claremont McKenna, Pitzer, and Scripps Colleges, Claremont, CA 91711, USA.
| |
Collapse
|
44
|
Dennenmoser S, Sedlazeck FJ, Schatz MC, Altmüller J, Zytnicki M, Nolte AW. Genome‐wide patterns of transposon proliferation in an evolutionary young hybrid fish. Mol Ecol 2019; 28:1491-1505. [DOI: 10.1111/mec.14969] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2018] [Revised: 10/15/2018] [Accepted: 10/23/2018] [Indexed: 01/19/2023]
Affiliation(s)
- Stefan Dennenmoser
- Institute for Biology and Environmental Sciences Carl von Ossietzky University Oldenburg Oldenburg Germany
| | | | - Michael C. Schatz
- Cold Spring Harbor Laboratory Cold Spring Harbor New York
- Departments of Computer Science and Biology Johns Hopkins University Baltimore Maryland
| | - Janine Altmüller
- Cologne Center for Genomics, and Institute of Human Genetics University of Cologne Cologne Germany
| | | | - Arne W. Nolte
- Institute for Biology and Environmental Sciences Carl von Ossietzky University Oldenburg Oldenburg Germany
| |
Collapse
|