1
Lesur I, Rogier O, Sow MD, Boury C, Duplan A, Garnier A, Senhaji-Rachik A, Civan P, Daron J, Delaunay A, Duvaux L, Benoit V, Guichoux E, Le Provost G, Sanou E, Ambroise C, Plomion C, Salse J, Segura V, Tost J, Maury S. A strategy for studying epigenetic diversity in natural populations: proof of concept in poplar and oak. J Exp Bot 2024; 75:5568-5584. [PMID: 38889253 DOI: 10.1093/jxb/erae266] [Received: 12/18/2023] [Accepted: 06/17/2024] [Indexed: 06/20/2024]
Abstract
In the last 20 years, several techniques have been developed for quantifying DNA methylation, the most studied epigenetic mark in eukaryotes, including the gold standard method, whole-genome bisulfite sequencing (WGBS). WGBS quantifies genome-wide DNA methylation but has several drawbacks that make it less suitable for population-scale epigenetic studies. The high cost of deep sequencing and the large amounts of data generated prompted us to seek an alternative approach. Restricting studies to parts of the genome would be a satisfactory alternative were it not for a major limitation: the need to select, upstream, targets corresponding to differentially methylated regions. Given the need to study large numbers of samples, we propose a strategy for investigating DNA methylation variation in natural populations, taking into account the structural complexity of genomes, their size, and their content in unique coding regions versus repeated regions such as transposable elements. We first identified regions of highly variable DNA methylation in a subset of genotypes representative of the biological diversity in the population by WGBS. We then analysed the variations of DNA methylation in these targeted regions at the population level by sequencing capture bisulfite (SeqCapBis). The entire strategy was then validated by applying it to another species. Our strategy was developed as a proof of concept on natural populations of two forest species: Populus nigra and Quercus petraea.
Affiliation(s)
- Isabelle Lesur
  - INRAE, Univ. Bordeaux, BIOGECO, F-33610 Cestas, France
  - HelixVenture, F-33700 Mérignac, France
- Mamadou Dia Sow
  - INRAE/UCA UMR GDEC 1095. 5 Chemin de Beaulieu, F-63100 Clermont Ferrand, France
  - P2e, INRAE, Université d'Orleans, EA 1207 USC 1328, F-45067 Orleans, France
- Alexandre Duplan
  - INRAE, ONF, BioForA, F-45075 Orléans, France
  - P2e, INRAE, Université d'Orleans, EA 1207 USC 1328, F-45067 Orleans, France
- Abel Garnier
  - Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie, François Jacob, Université Paris-Saclay, F-91000 Evry, France
- Peter Civan
  - INRAE/UCA UMR GDEC 1095. 5 Chemin de Beaulieu, F-63100 Clermont Ferrand, France
- Josquin Daron
  - Institut Pasteur, Université Paris Cité, CNRS UMR2000, Insect-Virus Interactions Unit, F-75724 Paris, France
- Alain Delaunay
  - P2e, INRAE, Université d'Orleans, EA 1207 USC 1328, F-45067 Orleans, France
- Edmond Sanou
  - LaMME, 23 Bd. de France, F-91037 Évry Cedex, France
- Jérôme Salse
  - INRAE/UCA UMR GDEC 1095. 5 Chemin de Beaulieu, F-63100 Clermont Ferrand, France
- Vincent Segura
  - INRAE, ONF, BioForA, F-45075 Orléans, France
  - UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro Montpellier, F-34398 Montpellier, France
- Jörg Tost
  - Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie, François Jacob, Université Paris-Saclay, F-91000 Evry, France
- Stéphane Maury
  - P2e, INRAE, Université d'Orleans, EA 1207 USC 1328, F-45067 Orleans, France
2
Lin L, Huang Y, McIntyre J, Chang CH, Colmenares S, Lee YCG. Prevalent Fast Evolution of Genes Involved in Heterochromatin Functions. Mol Biol Evol 2024; 41:msae181. [PMID: 39189646 PMCID: PMC11408610 DOI: 10.1093/molbev/msae181] [Received: 03/15/2024] [Revised: 08/14/2024] [Accepted: 08/20/2024] [Indexed: 08/28/2024]
Abstract
Heterochromatin is a gene-poor and repeat-rich genomic compartment universally found in eukaryotes. Despite its low transcriptional activity, heterochromatin plays important roles in maintaining genome stability, organizing chromosomes, and suppressing transposable elements. Given the importance of these functions, it is expected that genes involved in heterochromatin regulation would be highly conserved. Yet, a handful of these genes were found to evolve rapidly. To investigate whether these previous findings are anecdotal or general to genes modulating heterochromatin, we compile an exhaustive list of 106 candidate genes involved in heterochromatin functions and investigate their evolution over short and long evolutionary time scales in Drosophila. Our analyses find that these genes exhibit significantly more frequent evolutionary changes, both in the forms of amino acid substitutions and gene copy number change, when compared to genes involved in Polycomb-based repressive chromatin. While positive selection drives amino acid changes within both structured domains with diverse functions and intrinsically disordered regions, purifying selection may have maintained the proportions of intrinsically disordered regions of these proteins. Together with the observed negative associations between the evolutionary rate of these genes and the genomic abundance of transposable elements, we propose an evolutionary model where the fast evolution of genes involved in heterochromatin functions is an inevitable outcome of the unique functional roles of heterochromatin, while the rapid evolution of transposable elements may be an effect rather than cause. Our study provides an important global view of the evolution of genes involved in this critical cellular domain and provides insights into the factors driving the distinctive evolution of heterochromatin.
Affiliation(s)
- Leila Lin
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
- Yuheng Huang
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
- Jennifer McIntyre
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
- Ching-Ho Chang
  - Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Serafin Colmenares
  - Department of Cell and Molecular Biology, University of California, Berkeley, CA, USA
- Yuh Chwen G Lee
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
3
Wu Y, Wang F, Lyu K, Liu R. Comparative Analysis of Transposable Elements in the Genomes of Citrus and Citrus-Related Genera. Plants (Basel) 2024; 13:2462. [PMID: 39273946 PMCID: PMC11397423 DOI: 10.3390/plants13172462] [Received: 07/17/2024] [Revised: 08/28/2024] [Accepted: 09/01/2024] [Indexed: 09/15/2024]
Abstract
Transposable elements (TEs) significantly contribute to the evolution and diversity of plant genomes. In this study, we explored the roles of TEs in the genomes of Citrus and Citrus-related genera by constructing a pan-genome TE library from 20 published genomes of Citrus and Citrus-related accessions. Our results revealed an increase in TE content and the number of TE types compared to the original annotations, as well as a decrease in the content of unclassified TEs. The average length of TEs per assembly was approximately 194.23 Mb, representing 41.76% (Murraya paniculata) to 64.76% (Citrus gilletiana) of the genomes, with a mean value of 56.95%. A significant positive correlation was found between genome size and both the number of TE types and TE content. Consistent with the difference in mean whole-genome size (39.83 Mb) between Citrus and Citrus-related genera, Citrus genomes contained an average of 34.36 Mb more TE sequences than Citrus-related genomes. Analysis of the estimated insertion time and half-life of long terminal repeat retrotransposons (LTR-RTs) suggested that TE removal was not the primary factor contributing to the differences among genomes. These findings collectively indicate that TEs are the primary determinants of genome size and play a major role in shaping genome structures. Principal coordinate analysis (PCoA) of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) identifiers revealed that the fragmented TEs were predominantly derived from ancestral genomes, while intact TEs were crucial in the recent evolutionary diversification of Citrus. Moreover, the presence or absence of intact TEs near the AdhE superfamily was closely associated with the bitterness trait in the Citrus species. Overall, this study enhances TE annotation in Citrus and Citrus-related genomes and provides valuable data for future genetic breeding and agronomic trait research in Citrus.
Affiliation(s)
- Yilei Wu
  - College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China
  - Center for Agroforestry Mega Data Science, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Fusheng Wang
  - National Citrus Engineering Research Center, Citrus Research Institute, Southwest University, Chongqing 400712, China
- Keliang Lyu
  - Center for Agroforestry Mega Data Science, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Renyi Liu
  - Center for Agroforestry Mega Data Science, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
4
Betancourt AJ, Wei KHC, Huang Y, Lee YCG. Causes and Consequences of Varying Transposable Element Activity: An Evolutionary Perspective. Annu Rev Genomics Hum Genet 2024; 25:1-25. [PMID: 38603565 DOI: 10.1146/annurev-genom-120822-105708] [Indexed: 04/13/2024]
Abstract
Transposable elements (TEs) are genomic parasites found in nearly all eukaryotes, including humans. This evolutionary success of TEs is due to their replicative activity, involving insertion into new genomic locations. TE activity varies at multiple levels, from between taxa to within individuals. The rapidly accumulating evidence of the influence of TE activity on human health, as well as the rapid growth of new tools to study it, motivated an evaluation of what we know about TE activity thus far. Here, we discuss why TE activity varies, and the consequences of this variation, from an evolutionary perspective. By studying TE activity in nonhuman organisms in the context of evolutionary theories, we can shed light on the factors that affect TE activity. While the consequences of TE activity are usually deleterious, some have lasting evolutionary impacts by conferring benefits on the host or affecting other evolutionary processes.
Affiliation(s)
- Andrea J Betancourt
  - Institute of Infection, Veterinary, and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom
- Kevin H-C Wei
  - Department of Zoology, University of British Columbia, Vancouver, British Columbia, Canada
- Yuheng Huang
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
- Yuh Chwen G Lee
  - Center for Complex Biological Systems, University of California, Irvine, California, USA
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California, USA
5
Lin L, Huang Y, McIntyre J, Chang CH, Colmenares S, Lee YCG. Prevalent fast evolution of genes involved in heterochromatin functions. bioRxiv 2024:2024.03.03.583199. [PMID: 38496614 PMCID: PMC10942301 DOI: 10.1101/2024.03.03.583199] [Indexed: 03/19/2024]
Abstract
Heterochromatin is a gene-poor and repeat-rich genomic compartment universally found in eukaryotes. Despite its low transcriptional activity, heterochromatin plays important roles in maintaining genome stability, organizing chromosomes, and suppressing transposable elements (TEs). Given the importance of these functions, it is expected that the genes involved in heterochromatin regulation would be highly conserved. Yet, a handful of these genes were found to evolve rapidly. To investigate whether these previous findings are anecdotal or general to genes modulating heterochromatin, we compile an exhaustive list of 106 candidate genes involved in heterochromatin functions and investigate their evolution over short and long evolutionary time scales in Drosophila. Our analyses find that these genes exhibit significantly more frequent evolutionary changes, both in the forms of amino acid substitutions and gene copy number change, when compared to genes involved in Polycomb-based repressive chromatin. While positive selection drives amino acid changes within both structured domains with diverse functions and intrinsically disordered regions (IDRs), purifying selection may have maintained the proportions of IDRs of these proteins. Together with the observed negative associations between evolutionary rates of these genes and genomic TE abundance, we propose an evolutionary model where the fast evolution of genes involved in heterochromatin functions is an inevitable outcome of the unique functional roles of heterochromatin, while the rapid evolution of TEs may be an effect rather than cause. Our study provides an important global view of the evolution of genes involved in this critical cellular domain and provides insights into the factors driving the distinctive evolution of heterochromatin.
6
Huang HY, Zhang S, Choucha FA, Verdenaud M, Tan FQ, Pichot C, Parsa HS, Slavkovic F, Chen Q, Troadec C, Marcel F, Dogimont C, Quadrana L, Boualem A, Bendahmane A. Harbinger transposon insertion in ethylene signaling gene leads to emergence of new sexual forms in cucurbits. Nat Commun 2024; 15:4877. [PMID: 38849342 PMCID: PMC11161486 DOI: 10.1038/s41467-024-49250-9] [Received: 12/12/2023] [Accepted: 05/28/2024] [Indexed: 06/09/2024]
Abstract
In flowering plants, the predominant sexual morph is hermaphroditism, and the emergence of unisexuality is poorly understood. Using Cucumis melo (melon) as a model system, we explore the mechanisms driving sexual forms. We identify a spontaneous mutant exhibiting a transition from bisexual to unisexual male flower, and identify the causal mutation as a Harbinger transposon impairing the expression of the Ethylene Insensitive 2 (CmEIN2) gene. Genetic and transcriptomic analyses reveal a dual role of CmEIN2 in both sex determination and fruit shape formation. Upon expression of CmACS11, EIN2 is recruited to repress the expression of the carpel inhibitor, CmWIP1. Subsequently, EIN2 is recruited to mediate stamina inhibition. Following the sex determination phase, EIN2 promotes fruit shape elongation. Genome-wide analysis reveals that Harbinger transposon mobilization is triggered by environmental cues, and that the transposon integrates preferentially in active chromatin, particularly within promoter regions. Characterization of a large collection of melon germplasm points to active transpositions in the wild, compared to cultivated accessions. Our study underscores the association between chromatin dynamics and the temporal aspects of mobile genetic element insertions, providing valuable insights into plant adaptation and crop genome evolution.
Affiliation(s)
- Hsin-Ya Huang
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Siqi Zhang
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Fadi Abou Choucha
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Marion Verdenaud
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Feng-Quan Tan
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Clement Pichot
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Hadi Shirazi Parsa
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Filip Slavkovic
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Qinghe Chen
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Christelle Troadec
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Fabien Marcel
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Catherine Dogimont
  - INRAE, Génétique et Amélioration des Fruits et Légumes (GAFL), 84143, Montfavet, France
- Leandro Quadrana
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Adnane Boualem
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
- Abdelhafid Bendahmane
  - Université Paris-Saclay, CNRS, INRAE, Université Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91190, Gif-sur-Yvette, France
7
Huang Y, Lee YCG. Blessing or curse: how the epigenetic resolution of host-transposable element conflicts shapes their evolutionary dynamics. Proc Biol Sci 2024; 291:20232775. [PMID: 38593848 PMCID: PMC11003778 DOI: 10.1098/rspb.2023.2775] [Received: 12/08/2023] [Accepted: 03/01/2024] [Indexed: 04/11/2024]
Abstract
Transposable elements (TEs) are selfish genetic elements whose antagonistic interactions with hosts represent a common genetic conflict in eukaryotes. To resolve this conflict, hosts have widely adopted epigenetic silencing that deposits repressive marks at TEs. However, this mechanism is imperfect and fails to fully halt TE replication. Furthermore, TE epigenetic silencing can inadvertently spread repressive marks to adjacent functional sequences, a phenomenon considered a 'curse' of this conflict resolution. Here, we used forward simulations to explore how TE epigenetic silencing and its harmful side effects shape the evolutionary dynamics of TEs and their hosts. Our findings reveal that epigenetic silencing allows TEs and their hosts to stably coexist under a wide range of conditions, because the underlying molecular mechanisms give rise to copy-number dependency of the strength of TE silencing. Interestingly, contrary to intuitive expectations that TE epigenetic silencing should evolve to be as strong as possible, we found a selective benefit for modifier alleles that weaken TE silencing under biologically feasible conditions. These results reveal that the dual nature of TE epigenetic silencing, with both positive and negative effects, complicates its evolutionary trajectory and makes it challenging to determine whether TE epigenetic silencing is a 'blessing' or a 'curse'.
Affiliation(s)
- Yuheng Huang
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, USA
- Yuh Chwen G. Lee
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, USA
8
Jiang J, Xu YC, Zhang ZQ, Chen JF, Niu XM, Hou XH, Li XT, Wang L, Zhang YE, Ge S, Guo YL. Forces driving transposable element load variation during Arabidopsis range expansion. Plant Cell 2024; 36:840-862. [PMID: 38036296 PMCID: PMC10980350 DOI: 10.1093/plcell/koad296] [Received: 08/17/2023] [Revised: 10/25/2023] [Accepted: 11/06/2023] [Indexed: 12/02/2023]
Abstract
Genetic load refers to the accumulated and potentially life-threatening deleterious mutations in populations. Understanding the mechanisms underlying genetic load variation of transposable element (TE) insertion, a major large-effect mutation, during range expansion is an intriguing question in biology. Here, we used 1,115 global natural accessions of Arabidopsis (Arabidopsis thaliana) to study the driving forces of TE load variation during its range expansion. TE load increased with range expansion, especially in the recently established Yangtze River basin population. Effective population size, which explains 62.0% of the variance in TE load, high transposition rate, and selective sweeps contributed to TE accumulation in the expanded populations. We genetically mapped and identified multiple candidate causal genes and TEs, and revealed the genetic architecture of TE load variation. Overall, this study reveals the variation in TE genetic load during Arabidopsis expansion and highlights the causes of TE load variation from the perspectives of both population genetics and quantitative genetics.
Affiliation(s)
- Juan Jiang
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Yong-Chao Xu
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
- Zhi-Qin Zhang
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Jia-Fu Chen
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Xiao-Min Niu
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
- Xing-Hui Hou
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
- Xin-Tong Li
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Li Wang
  - Agricultural Synthetic Biology Center, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518000, China
- Yong E Zhang
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
  - State Key Laboratory of Integrated Management of Pest Insects and Rodents & Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Song Ge
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Ya-Long Guo
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
  - China National Botanical Garden, Beijing 100093, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
9
Wen X, Li J, Yang F, Zhang X, Li Y. Exploring the Effect of High-Energy Heavy Ion Beam on Rice Genome: Transposon Activation. Genes (Basel) 2023; 14:2178. [PMID: 38137000 PMCID: PMC10742395 DOI: 10.3390/genes14122178] [Received: 10/30/2023] [Revised: 11/25/2023] [Accepted: 11/29/2023] [Indexed: 12/24/2023]
Abstract
High-energy heavy ion beams are a new type of physical mutagen that can produce a wide range of phenotypic variations. To understand how high-energy heavy ion beams act, we resequenced the whole genomes of individual rice plants with obvious phenotypic variations. The sequence alignment results revealed a large number of SNPs and InDels, as well as genetic variations related to grain type and heading date. The distribution of SNPs and InDels on chromosomes is random, but they often occur in the up/downstream regions and the intergenic regions. Mutagenesis can cause changes in transposons such as Dasheng, mPing, Osr13 and RIRE2, affecting the stability of the genome. This study obtained the major gene mutation types, discovered differentially active transposons, screened out gene variants related to phenotype, and explored the mechanism of high-energy heavy ion beam radiation on rice genes.
Affiliation(s)
- Xiaoting Wen
  - Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Jingpeng Li
  - Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China
  - Jilin Provincial Laboratory of Crop Germplasm Resources, Changchun 130299, China
- Fu Yang
  - Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China
- Xin Zhang
  - Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China
- Yiwei Li
  - Key Laboratory of Soybean Molecular Design and Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun 130102, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
10
Langmüller AM, Nolte V, Dolezal M, Schlötterer C. The genomic distribution of transposable elements is driven by spatially variable purifying selection. Nucleic Acids Res 2023; 51:9203-9213. [PMID: 37560917 PMCID: PMC10516647 DOI: 10.1093/nar/gkad635] [Received: 10/06/2022] [Revised: 07/10/2023] [Accepted: 07/18/2023] [Indexed: 08/11/2023]
Abstract
It is widely accepted that the genomic distribution of transposable elements (TEs) mainly reflects the outcome of purifying selection and insertion bias (1). Nevertheless, the relative importance of these two evolutionary forces could not be tested thoroughly. Here, we introduce an experimental system that allows purifying selection to be separated from TE insertion bias. We used experimental evolution to study the TE insertion patterns in Drosophila simulans founder populations harboring 1040 insertions of an active P-element. After 10 generations at a large population size, we detected strong selection against P-element insertions. The exceptions were P-element insertions in genomic regions for which a strong insertion bias has been proposed (2-4). Because recurrent P-element insertions cannot explain this pattern, we conclude that purifying selection, with variable strength along the chromosomes, is the major determinant of the genomic distribution of P-elements. Genomic regions with relaxed purifying selection against P-element insertions exhibit normal levels of purifying selection against base substitutions. This suggests that different types of purifying selection operate on base substitutions and P-element insertions. Our results highlight the power of experimental evolution to understand basic evolutionary processes, which are difficult to infer from patterns of natural variation alone.
Affiliation(s)
- Anna M Langmüller
  - Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
  - Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Veterinärplatz 1, 1210 Vienna, Austria
- Viola Nolte
  - Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
- Marlies Dolezal
  - Plattform Bioinformatik und Biostatistik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Vienna, Austria
- Christian Schlötterer
  - Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
11
Chen J, Basting PJ, Han S, Garfinkel DJ, Bergman CM. Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast. Mob DNA 2023; 14:8. [PMID: 37452430 PMCID: PMC10347736 DOI: 10.1186/s13100-023-00296-4] [Received: 03/21/2023] [Accepted: 06/09/2023] [Indexed: 07/18/2023]
Abstract
BACKGROUND: Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors.
RESULTS: We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide consistent estimates of [Formula: see text]50 non-reference TE insertions per strain and that Ty2 has the highest number of non-reference TE insertions in a species-wide panel of [Formula: see text]1000 yeast genomes. Finally, we show that best-in-class predictors for yeast applied to resequencing data have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences revealed previously for experimentally-induced Ty1 insertions to spontaneous insertions for other copia-superfamily retrotransposons in yeast.
CONCLUSION: McClintock (https://github.com/bergmanlab/mcclintock/) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors in other species.
Affiliation(s)
- Jingxuan Chen
  - Institute of Bioinformatics, University of Georgia, Athens, GA, USA
- Shunhua Han
  - Institute of Bioinformatics, University of Georgia, Athens, GA, USA
- David J. Garfinkel
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
- Casey M. Bergman
  - Institute of Bioinformatics, University of Georgia, Athens, GA, USA
  - Department of Genetics, University of Georgia, Athens, GA, USA
|
12
|
Wang Y, McNeil P, Abdulazeez R, Pascual M, Johnston SE, Keightley PD, Obbard DJ. Variation in mutation, recombination, and transposition rates in Drosophila melanogaster and Drosophila simulans. Genome Res 2023; 33:587-598. [PMID: 37037625 PMCID: PMC10234296 DOI: 10.1101/gr.277383.122] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 10/04/2022] [Accepted: 03/28/2023] [Indexed: 04/12/2023]
Abstract
The rates of mutation, recombination, and transposition are core parameters in models of evolution. They impact genetic diversity, responses to ongoing selection, and levels of genetic load. However, even for key evolutionary model species such as Drosophila melanogaster and Drosophila simulans, few estimates of these parameters are available, and we have little idea of how rates vary between individuals, sexes, or populations. Knowledge of this variation is fundamental for parameterizing models of genome evolution. Here, we provide direct estimates of mutation, recombination, and transposition rates and their variation in a West African and a European population of D. melanogaster and a European population of D. simulans. Across 89 flies, we observe 58 single-nucleotide mutations, 286 crossovers, and 89 transposable element (TE) insertions. Compared to the European D. melanogaster, we find the West African population has a lower mutation rate (1.67 × 10⁻⁹ site⁻¹ gen⁻¹ vs. 4.86 × 10⁻⁹ site⁻¹ gen⁻¹) and a lower transposition rate (8.99 × 10⁻⁵ copy⁻¹ gen⁻¹ vs. 23.36 × 10⁻⁵ copy⁻¹ gen⁻¹), but a higher recombination rate (3.44 cM/Mb vs. 2.06 cM/Mb). The European D. simulans population has a similar mutation rate to European D. melanogaster, but a significantly higher recombination rate and a lower, but not significantly different, transposition rate. Overall, we find paternal-derived mutations are more frequent than maternal ones in both species. Our study quantifies the variation in rates of mutation, recombination, and transposition among different populations and sexes, and our direct estimates of these parameters in D. melanogaster and D. simulans will benefit future studies in population and evolutionary genetics.
Affiliation(s)
- Yiguan Wang
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Paul McNeil
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Marta Pascual
  - Departament de Genètica, Microbiologia i Estadística and IRBio, Universitat de Barcelona, 08028 Barcelona, Spain
- Susan E Johnston
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Peter D Keightley
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Darren J Obbard
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
|
13
|
Chen J, Basting PJ, Han S, Garfinkel DJ, Bergman CM. Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.13.528343. [PMID: 36824955 PMCID: PMC9948991 DOI: 10.1101/2023.02.13.528343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/17/2023]
Abstract
BACKGROUND Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors. RESULTS We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide a consistent and biologically meaningful view of non-reference TE insertions in a species-wide panel of ∼1000 yeast genomes, as evaluated by coverage-based abundance estimates and expected patterns of tRNA promoter targeting. Finally, we show that best-in-class predictors for yeast have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences first revealed experimentally for Ty1 to natural insertions and related copia-superfamily retrotransposons in yeast. 
CONCLUSION McClintock (https://github.com/bergmanlab/mcclintock/) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors for other species.
Affiliation(s)
- Jingxuan Chen
  - Institute of Bioinformatics, University of Georgia, Athens, GA
- Shunhua Han
  - Institute of Bioinformatics, University of Georgia, Athens, GA
- David J. Garfinkel
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA
- Casey M. Bergman
  - Institute of Bioinformatics, University of Georgia, Athens, GA
  - Department of Genetics, University of Georgia, Athens, GA
|
14
|
Wierzbicki F, Kofler R, Signor S. Evolutionary dynamics of piRNA clusters in Drosophila. Mol Ecol 2023; 32:1306-1322. [PMID: 34878692 DOI: 10.1111/mec.16311] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Received: 08/30/2021] [Revised: 11/24/2021] [Accepted: 12/01/2021] [Indexed: 12/21/2022]
Abstract
Small RNAs produced from transposable element (TE)-rich sections of the genome, termed piRNA clusters, are a crucial component in the genomic defence against selfish DNA. In animals, it is thought the invasion of a TE is stopped when a copy of the TE inserts into a piRNA cluster, triggering the production of cognate small RNAs that silence the TE. Despite this importance for TE control, little is known about the evolutionary dynamics of piRNA clusters, mostly because these repeat-rich regions are difficult to assemble and compare. Here, we establish a framework for studying the evolution of piRNA clusters quantitatively. Previously introduced quality metrics and a newly developed software for multiple alignments of repeat annotations (Manna) allow us to estimate the level of polymorphism segregating in piRNA clusters and the divergence among homologous piRNA clusters. By studying 20 conserved piRNA clusters in multiple assemblies of four Drosophila species, we show that piRNA clusters are evolving rapidly. While 70%-80% of the clusters are conserved within species, the clusters share almost no similarity between species as closely related as D. melanogaster and D. simulans. Furthermore, abundant insertions and deletions are segregating within the Drosophila species. We show that the evolution of clusters is mainly driven by large insertions of recently active TEs and smaller deletions mostly in older TEs. The effect of these forces is so rapid that homologous clusters often do not contain insertions from the same TE families.
Affiliation(s)
- Filip Wierzbicki
  - Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
  - Vienna Graduate School of Population Genetics, Vienna, Austria
- Robert Kofler
  - Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
- Sarah Signor
  - Biological Sciences, North Dakota State University, Fargo, North Dakota, USA
|
15
|
López-Cortegano E, Craig RJ, Chebib J, Balogun EJ, Keightley PD. Rates and spectra of de novo structural mutations in Chlamydomonas reinhardtii. Genome Res 2023; 33:45-60. [PMID: 36617667 PMCID: PMC9977147 DOI: 10.1101/gr.276957.122] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 05/23/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022]
Abstract
Genetic variation originates from several types of spontaneous mutation, including single-nucleotide substitutions, short insertions and deletions (indels), and larger structural changes. Structural mutations (SMs) drive genome evolution and are thought to play major roles in evolutionary adaptation, speciation, and genetic disease, including cancers. Sequencing of mutation accumulation (MA) lines has provided estimates of rates and spectra of single-nucleotide and indel mutations in many species, yet the rate of new SMs is largely unknown. Here, we use long-read sequencing to determine the full mutation spectrum in MA lines derived from two strains (CC-1952 and CC-2931) of the green alga Chlamydomonas reinhardtii. The SM rate is highly variable between strains and between MA lines, and SMs represent a substantial proportion of all mutations in both strains (CC-1952 6%; CC-2931 12%). The SM spectra differ considerably between the two strains, with almost all inversions and translocations occurring in CC-2931 MA lines. This variation is associated with heterogeneity in the number and type of active transposable elements (TEs), which comprise major proportions of SMs in both strains (CC-1952 22%; CC-2931 38%). In CC-2931, a Crypton and a previously undescribed type of DNA element have caused 71% of chromosomal rearrangements, whereas in CC-1952, a Dualen LINE is associated with 87% of duplications. Other SMs, notably large duplications in CC-2931, are likely products of various double-strand break repair pathways. Our results show that diverse types of SMs occur at substantial rates, and support prominent roles for SMs and TEs in evolution.
Affiliation(s)
- Eugenio López-Cortegano
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Rory J Craig
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
  - California Institute for Quantitative Biosciences, UC Berkeley, Berkeley, California 94720, USA
- Jobran Chebib
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Eniolaye J Balogun
  - Department of Ecology and Evolutionary Biology, University of Toronto, Ontario ON M5S 3B2, Canada
  - Department of Biology, University of Toronto Mississauga, Mississauga ON L5L 1C6, Canada
- Peter D Keightley
  - Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
|
16
|
Miyao A, Yamanouchi U. Transposable element finder (TEF): finding active transposable elements from next generation sequencing data. BMC Bioinformatics 2022; 23:500. [PMID: 36418944 PMCID: PMC9682801 DOI: 10.1186/s12859-022-05011-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/29/2022] [Accepted: 10/26/2022] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Detection of newly transposed events by transposable elements (TEs) from next generation sequence (NGS) data is difficult, due to their multiple distribution sites over the genome containing older TEs. The previously reported Transposon Insertion Finder (TIF) detects TE transpositions on the reference genome from NGS short reads using end sequences of target TE. TIF requires the sequence of target TE and is not able to detect transpositions for TEs with an unknown sequence. RESULT The new algorithm Transposable Element Finder (TEF) enables the detection of TE transpositions, even for TEs with an unknown sequence. TEF is a tool for finding transposed TEs, in contrast to TIF, which detects transposed sites for TEs with a known sequence. The transposition event is often accompanied by a target site duplication (TSD). Focusing on TSD, two algorithms to detect both ends of TE, TSDs and target sites are reported here. One is based on the grouping with TSDs and direct comparison of k-mers from NGS without similarity search. The other is based on the junction mapping of TE end sequence candidates. Both methods succeed in detecting both ends and TSDs of known active TEs in several tests with rice, Arabidopsis and Drosophila data and discover several new TEs in new locations. PCR confirmed the detected transpositions of TEs in several test cases in rice. CONCLUSIONS TEF detects transposed TEs with TSDs as a result of TE transposition, sequences of both ends and their inserted positions of transposed TEs by direct comparison of NGS data between two samples. Genotypes of transpositions are verified by counting of junctions of head and tail, and non-insertion sequences in NGS reads. TEF is easy to run and independent of any TE library, which makes it useful to detect insertions from unknown TEs bypassed by common TE annotation pipelines.
Affiliation(s)
- Akio Miyao
  - Institute of Crop Science, National Agriculture and Food Research Organization, 2-1-2, Kannondai, Tsukuba, Ibaraki 305-8518 Japan
- Utako Yamanouchi
  - Institute of Crop Science, National Agriculture and Food Research Organization, 2-1-2, Kannondai, Tsukuba, Ibaraki 305-8518 Japan
|
17
|
Rech GE, Radío S, Guirao-Rico S, Aguilera L, Horvath V, Green L, Lindstadt H, Jamilloux V, Quesneville H, González J. Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila. Nat Commun 2022; 13:1948. [PMID: 35413957 PMCID: PMC9005704 DOI: 10.1038/s41467-022-29518-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Received: 03/22/2021] [Accepted: 03/15/2022] [Indexed: 12/16/2022] Open
Abstract
High quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation as they are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long-reads were missed by short-reads methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations to genomic studies, which is essential if we are to understand how genomes function and evolve.
Affiliation(s)
- Gabriel E Rech
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
- Santiago Radío
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
- Sara Guirao-Rico
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
- Laura Aguilera
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
- Vivien Horvath
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
- Llewellyn Green
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
- Hannah Lindstadt
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
- Josefa González
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
|
18
|
Lewerentz J, Johansson AM, Larsson J, Stenberg P. Transposon activity, local duplications and propagation of structural variants across haplotypes drive the evolution of the Drosophila S2 cell line. BMC Genomics 2022; 23:276. [PMID: 35392795 PMCID: PMC8991648 DOI: 10.1186/s12864-022-08472-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 10/25/2021] [Accepted: 03/15/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Immortalized cell lines are widely used model systems whose genomes are often highly rearranged and polyploid. However, their genome structure is seldom deciphered and is thus not accounted for during analyses. We therefore used linked short- and long-read sequencing to perform haplotype-level reconstruction of the genome of a Drosophila melanogaster cell line (S2-DRSC) with a complex genome structure. RESULTS Using a custom implementation (that is designed to use ultra-long reads in complex genomes with nested rearrangements) to call structural variants (SVs), we found that the most common SV was repetitive sequence insertion or deletion (> 80% of SVs), with Gypsy retrotransposon insertions dominating. The second most common SV was local sequence duplication. SNPs and other SVs were rarer, but several large chromosomal translocations and mitochondrial genome insertions were observed. Haplotypes were highly similar at the nucleotide level but structurally very different. Insertion SVs existed at various haplotype frequencies and were unlinked on chromosomes, demonstrating that haplotypes have different structures and suggesting the existence of a mechanism that allows SVs to propagate across haplotypes. Finally, using public short-read data, we found that transposable element insertions and local duplications are common in other D. melanogaster cell lines. CONCLUSIONS The S2-DRSC cell line evolved through retrotransposon activity and vast local sequence duplications, that we hypothesize were the products of DNA re-replication events. Additionally, mutations can propagate across haplotypes (possibly explained by mitotic recombination), which enables fine-tuning of mutational impact and prevents accumulation of deleterious events, an inherent problem of clonal reproduction. We conclude that traditional linear homozygous genome representation conceals the complexity when dealing with rearranged and heterozygous clonal cells.
Affiliation(s)
- Jacob Lewerentz
  - Department of Molecular Biology, Umeå University, SE-901 87, Umeå, Västerbotten, Sweden
- Anna-Mia Johansson
  - Department of Molecular Biology, Umeå University, SE-901 87, Umeå, Västerbotten, Sweden
- Jan Larsson
  - Department of Molecular Biology, Umeå University, SE-901 87, Umeå, Västerbotten, Sweden
- Per Stenberg
  - Department of Ecology and Environmental Sciences, Umeå University, SE-901 87, Umeå, Västerbotten, Sweden
|
19
|
Lee YCG. Synergistic epistasis of the deleterious effects of transposable elements. Genetics 2022; 220:iyab211. [PMID: 34888644 PMCID: PMC9097265 DOI: 10.1093/genetics/iyab211] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 09/14/2021] [Accepted: 11/10/2021] [Indexed: 11/12/2022] Open
Abstract
The replicative nature and generally deleterious effects of transposable elements (TEs) raise an outstanding question about how TE copy number is stably contained in host populations. Classic theoretical analyses predict that, when the decline in fitness due to each additional TE insertion is greater than linear, or when there is synergistic epistasis, selection against TEs can result in a stable equilibrium of TE copy number. While several mechanisms are predicted to yield synergistic deleterious effects of TEs, we lack empirical investigations of the presence of such epistatic interactions. Purifying selection with synergistic epistasis generates repulsion linkage between deleterious alleles. We investigated this population genetic signal in the likely ancestral Drosophila melanogaster population and found evidence supporting the presence of synergistic epistasis among TE insertions, especially TEs expected to exert large fitness impacts. Even though synergistic epistasis of TEs has been predicted to arise through ectopic recombination and TE-mediated epigenetic silencing mechanisms, we only found mixed support for the associated predictions. We observed signals of synergistic epistasis for a large number of TE families, which is consistent with the expectation that such epistatic interaction mainly happens among copies of the same family. Curiously, significant repulsion linkage was also found among TE insertions from different families, suggesting the possibility that synergism of TEs' deleterious fitness effects could arise above the family level and through mechanisms similar to those of simple mutations. Our findings set the stage for investigating the prevalence and importance of epistatic interactions in the evolutionary dynamics of TEs.
Affiliation(s)
- Yuh Chwen G Lee
  - Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA 92697, USA
|
20
|
Ho EKH, Schaack S. Intraspecific Variation in the Rates of Mutations Causing Structural Variation in Daphnia magna. Genome Biol Evol 2021; 13:6444992. [PMID: 34849778 PMCID: PMC8691059 DOI: 10.1093/gbe/evab241] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Accepted: 10/21/2021] [Indexed: 12/17/2022] Open
Abstract
Mutations that cause structural variation are important sources of genetic variation upon which other evolutionary forces can act; however, they are difficult to observe and therefore few direct estimates of their rate and spectrum are available. Understanding mutation rate evolution, however, requires adding to the limited number of species for which direct estimates are available, quantifying levels of intraspecific variation in mutation rates, and assessing whether rate estimates co-vary across types of mutation. Here, we report structural variation-causing mutation rates (svcMRs) for six categories of mutations (short insertions and deletions, long deletions and duplications, and deletions and duplications at copy number variable sites) from nine genotypes of Daphnia magna collected from three populations in Finland, Germany, and Israel using a mutation accumulation approach. Based on whole-genome sequence data and validated using simulations, we find svcMRs are high (two orders of magnitude higher than base substitution mutation rates measured in the same lineages), highly variable among populations, and uncorrelated across categories of mutation. Furthermore, to assess the impact of svcMRs on the genome, we calculated rates while adjusting for the lengths of events and ran simulations to determine if the mutations occur in genic regions more or less frequently than expected by chance. Our results pose a challenge to most prevailing theories aimed at explaining the evolution of the mutation rate, underscoring the importance of obtaining additional mutation rate estimates in more genotypes, for more types of mutation, in more species, in order to improve our future understanding of mutation rates, their variation, and their evolution.
Affiliation(s)
- Eddie K H Ho
  - Department of Biology, Reed College, Portland, Oregon, USA
- Sarah Schaack
  - Department of Biology, Reed College, Portland, Oregon, USA
|
21
|
Zakharenko LP. Phenotypically Unstable Mutations as Markers of Chromosomal Rearrangements Involving DNA Transposons. RUSS J GENET+ 2021. [DOI: 10.1134/s1022795421110156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]
|
22
|
Ho EKH, Bellis ES, Calkins J, Adrion JR, Latta IV LC, Schaack S. Engines of change: Transposable element mutation rates are high and variable within Daphnia magna. PLoS Genet 2021; 17:e1009827. [PMID: 34723969 PMCID: PMC8594854 DOI: 10.1371/journal.pgen.1009827] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 07/27/2021] [Revised: 11/16/2021] [Accepted: 09/16/2021] [Indexed: 12/22/2022] Open
Abstract
Transposable elements (TEs) represent a major portion of most eukaryotic genomes, yet little is known about their mutation rates or how their activity is shaped by other evolutionary forces. Here, we compare short- and long-term patterns of genome-wide mutation accumulation (MA) of TEs among 9 genotypes from three populations of Daphnia magna from across a latitudinal gradient. While the overall proportion of the genome comprised of TEs is highly similar among genotypes from Finland, Germany, and Israel, populations are distinguishable based on patterns of insertion site polymorphism. Our direct rate estimates indicate TE movement is highly variable (net rates ranging from −11.98 to 12.79 × 10⁻⁵ per copy per generation among genotypes), differing both among populations and TE families. Although gains outnumber losses when selection is minimized, both types of events appear to be highly deleterious based on their low frequency in control lines where propagation is not limited to random, single-progeny descent. With rate estimates 4 orders of magnitude higher than base substitutions, TEs clearly represent a highly mutagenic force in the genome. Quantifying patterns of intra- and interspecific variation in TE mobility with and without selection provides insight into a powerful mechanism generating genetic variation in the genome.
Affiliation(s)
- Eddie K. H. Ho
  - Department of Biology, Reed College, Portland, Oregon, United States of America
- Emily S. Bellis
  - Department of Biology, Reed College, Portland, Oregon, United States of America
  - Department of Computer Science, Arkansas State University, Jonesboro, Arkansas, United States of America
- Jaclyn Calkins
  - Department of Biology, Reed College, Portland, Oregon, United States of America
  - College of Human Medicine, Michigan State University, East Lansing, Michigan, United States of America
- Jeffrey R. Adrion
  - Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
- Leigh C. Latta IV
  - Department of Biology, Reed College, Portland, Oregon, United States of America
  - Lewis-Clark State College, Lewiston, Idaho, United States of America
- Sarah Schaack
  - Department of Biology, Reed College, Portland, Oregon, United States of America
|
23
|
Han S, Basting PJ, Dias GB, Luhur A, Zelhof AC, Bergman CM. Transposable element profiles reveal cell line identity and loss of heterozygosity in Drosophila cell culture. Genetics 2021; 219:6321957. [PMID: 34849875 PMCID: PMC8633141 DOI: 10.1093/genetics/iyab113] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 05/01/2021] [Accepted: 07/01/2021] [Indexed: 11/28/2022] Open
Abstract
Cell culture systems allow key insights into biological mechanisms yet suffer from irreproducible outcomes in part because of cross-contamination or mislabeling of cell lines. Cell line misidentification can be mitigated by the use of genotyping protocols, which have been developed for human cell lines but are lacking for many important model species. Here, we leverage the classical observation that transposable elements (TEs) proliferate in cultured Drosophila cells to demonstrate that genome-wide TE insertion profiles can reveal the identity and provenance of Drosophila cell lines. We identify multiple cases where TE profiles clarify the origin of Drosophila cell lines (Sg4, mbn2, and OSS_E) relative to published reports, and also provide evidence that insertions from only a subset of long-terminal repeat retrotransposon families are necessary to mark Drosophila cell line identity. We also develop a new bioinformatics approach to detect TE insertions and estimate intra-sample allele frequencies in legacy whole-genome sequencing data (called ngs_te_mapper2), which revealed loss of heterozygosity as a mechanism shaping the unique TE profiles that identify Drosophila cell lines. Our work contributes to the general understanding of the forces impacting metazoan genomes as they evolve in cell culture and paves the way for high-throughput protocols that use TE insertions to authenticate cell lines in Drosophila and other organisms.
Affiliation(s)
- Shunhua Han
  - Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Preston J Basting
  - Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Guilherme B Dias
  - Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA
- Arthur Luhur
  - Drosophila Genomics Resource Center, Indiana University, Bloomington, IN 47405, USA
  - Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Andrew C Zelhof
  - Drosophila Genomics Resource Center, Indiana University, Bloomington, IN 47405, USA
  - Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Casey M Bergman
  - Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
  - Department of Genetics, University of Georgia, Athens, GA 30602, USA
|
24
|
Mérel V, Gibert P, Buch I, Rada VR, Estoup A, Gautier M, Fablet M, Boulesteix M, Vieira C. The worldwide invasion of Drosophila suzukii is accompanied by a large increase of transposable element load and a small number of putatively adaptive insertions. Mol Biol Evol 2021; 38:4252-4267. [PMID: 34021759 PMCID: PMC8476158 DOI: 10.1093/molbev/msab155] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 12/23/2022]
Abstract
Transposable Elements (TEs) are ubiquitous and mobile repeated sequences. They are major determinants of host fitness. Here, we characterized the TE content of the spotted wing fly Drosophila suzukii. Using a recently improved genome assembly, we reconstructed TE sequences de novo, and found that TEs occupy 47% of the genome and are mostly located in gene-poor regions. The majority of TE insertions segregate at low frequencies, indicating a recent and probably ongoing TE activity. To explore TE dynamics in the context of biological invasions, we studied variation of TE abundance in genomic data from 16 invasive and six native populations of D. suzukii. We found a large increase of the TE load in invasive populations, correlated with a reduced Watterson estimate of genetic diversity (θ̂w), a proxy of effective population size. We did not find any correlation between TE contents and bioclimatic variables, indicating a minor effect of environmentally induced TE activity. A genome-wide association study revealed that ca. 2,000 genomic regions are associated with TE abundance. We did not find, however, any evidence in such regions of an enrichment for genes known to interact with TE activity (e.g. transcription factor encoding genes or genes of the piRNA pathway). Finally, the study of TE insertion frequencies revealed 15 putatively adaptive TE insertions, six of them being likely associated with the recent invasion history of the species.
Affiliation(s)
- Vincent Mérel
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Patricia Gibert
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Inessa Buch
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Valentina Rodriguez Rada
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Arnaud Estoup
  - CBGP, Univ Montpellier, CIRAD, INRAE, Institut Agro, IRD, Montpellier, France
- Mathieu Gautier
  - CBGP, Univ Montpellier, CIRAD, INRAE, Institut Agro, IRD, Montpellier, France
- Marie Fablet
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Matthieu Boulesteix
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Cristina Vieira
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France

25
Chen P, Zhang J. Asexual Experimental Evolution of Yeast Does Not Curtail Transposable Elements. Mol Biol Evol 2021; 38:2831-2842. [PMID: 33720342 PMCID: PMC8233515 DOI: 10.1093/molbev/msab073] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 01/03/2023]
Abstract
Compared with asexual reproduction, sex facilitates the transmission of transposable elements (TEs) from one genome to another, but boosts the efficacy of selection against deleterious TEs. Thus, theoretically, it is unclear whether sex has a positive net effect on TE’s proliferation. An empirical study concluded that sex is at the root of TE’s evolutionary success because the yeast TE load was found to decrease rapidly in approximately 1,000 generations of asexual but not sexual experimental evolution. However, this finding contradicts the maintenance of TEs in natural yeast populations where sexual reproduction occurs extremely infrequently. Here, we show that the purported TE load reduction during asexual experimental evolution is likely an artifact of low genomic sequencing coverages. We observe stable TE loads in both sexual and asexual experimental evolution from multiple yeast data sets with sufficient coverages. To understand the evolutionary dynamics of yeast TEs, we turn to asexual mutation accumulation lines that have been under virtually no selection. We find that both TE transposition and excision rates per generation, but not their difference, tend to be higher in environments where yeast grows more slowly. However, the transposition rate is not significantly higher than the excision rate and the variance of the TE number among natural strains is close to its neutral expectation, suggesting that selection against TEs is at best weak in yeast. We conclude that the yeast TE load is maintained largely by a transposition–excision balance and that the influence of sex remains unclear.
Affiliation(s)
- Piaopiao Chen
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
- Jianzhi Zhang
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA

26
Rogers RL, Grizzard SL, Titus-McQuillan JE, Bockrath K, Patel S, Wares JP, Garner JT, Moore CC. Gene family amplification facilitates adaptation in freshwater unionid bivalve Megalonaias nervosa. Mol Ecol 2021; 30:1155-1173. [PMID: 33382161 DOI: 10.1111/mec.15786] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 07/31/2020] [Revised: 12/01/2020] [Accepted: 12/14/2020] [Indexed: 01/05/2023]
Abstract
Freshwater unionid bivalves currently face severe anthropogenic challenges. Over 70% of species in the United States are threatened, endangered or extinct due to pollution, damming of waterways and overfishing. These species are notable for their unusual life history strategy, parasite-host co-evolution and biparental mitochondrial inheritance. Among this clade, the washboard mussel Megalonaias nervosa is one species that remains prevalent across the Southeastern United States, with robust population sizes. We have created a reference genome for M. nervosa to determine how genome content has evolved in the face of these widespread environmental challenges. We observe dynamic changes in genome content, with a burst of recent transposable element proliferation causing a 382 Mb expansion in genome content. Birth-death models suggest rapid expansions among gene families, with a mutation rate of 1.16 × 10⁻⁸ duplications per gene per generation. Cytochrome P450 gene families have experienced exceptional recent amplification beyond expectations based on genome-wide birth-death processes. These genes are associated with increased rates of amino acid changes, a signature of selection driving evolution of detox genes. Fitting evolutionary models of adaptation from standing genetic variation, we can compare adaptive potential across species and mutation types. The large population size in M. nervosa suggests a 4.7-fold advantage in the ability to adapt from standing genetic variation compared with a low diversity endemic E. hopetonensis. Estimates suggest that gene family evolution may offer an exceptional substrate of genetic variation in M. nervosa, with Psgv = 0.185 compared with Psgv = 0.067 for single nucleotide changes. Hence, we suggest that gene family evolution is a source of 'hopeful monsters' within the genome that may facilitate adaptation when selective pressures shift.
These results suggest that gene family expansion is a key driver of adaptive evolution in this key species of freshwater Unionidae that is currently facing widespread environmental challenges. This work has clear implications for conservation genomics on freshwater bivalves as well as evolutionary theory. This genome represents a first step to facilitate reverse ecological genomics in Unionidae and identify the genetic underpinnings of phenotypic diversity.
Affiliation(s)
- Rebekah L Rogers
  - Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, NC, USA
- Stephanie L Grizzard
  - Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, NC, USA
  - Department of Biological Sciences, Old Dominion University, Norfolk, VA, USA
- Katherine Bockrath
  - Department of Genetics, University of Georgia, Athens, GA, USA
  - U.S. Fish and Wildlife Service, Midwest Fisheries Center Whitney Genetics Lab, Onalaska, WI, USA
- Sagar Patel
  - Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, NC, USA
  - Department of Biology, Saint Louis University, St. Louis, MO, USA
  - Donald Danforth Plant Science Center, St. Louis, MO, USA
- John P Wares
  - Department of Genetics, University of Georgia, Athens, GA, USA
  - Odum School of Ecology, University of Georgia, Athens, GA, USA
- Jeffrey T Garner
  - Division of Wildlife and Freshwater Fisheries, Alabama Department of Conservation and Natural Resources, Florence, AL, USA
- Cathy C Moore
  - Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, NC, USA

27
Bergthorsson U, Sheeba CJ, Konrad A, Belicard T, Beltran T, Katju V, Sarkies P. Long-term experimental evolution reveals purifying selection on piRNA-mediated control of transposable element expression. BMC Biol 2020; 18:162. [PMID: 33158445 PMCID: PMC7646084 DOI: 10.1186/s12915-020-00897-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 04/07/2020] [Accepted: 10/13/2020] [Indexed: 11/29/2022]
Abstract
Background: Transposable elements (TEs) are an almost universal constituent of eukaryotic genomes. In animals, Piwi-interacting small RNAs (piRNAs) and repressive chromatin often play crucial roles in preventing TE transcription and thus restricting TE activity. Nevertheless, TE content varies widely across eukaryotes and the dynamics of TE activity and TE silencing across evolutionary time is poorly understood. Results: Here, we used experimentally evolved populations of C. elegans to study the dynamics of TE expression over 409 generations. The experimental populations were evolved at population sizes of 1, 10 and 100 individuals to manipulate the efficiency of natural selection versus genetic drift. We demonstrate increased TE expression relative to the ancestral population, with the largest increases occurring in the smallest populations. We show that the transcriptional activation of TEs within active regions of the genome is associated with failure of piRNA-mediated silencing, whilst desilenced TEs in repressed chromatin domains retain small RNAs. Additionally, we find that the sequence context of the surrounding region influences the propensity of TEs to lose silencing through failure of small RNA-mediated silencing. Conclusions: Our results show that natural selection in C. elegans is responsible for maintaining low levels of TE expression, and provide new insights into the epigenomic features responsible.
Affiliation(s)
- Ulfar Bergthorsson
  - Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77845, USA
- Caroline J Sheeba
  - MRC London Institute of Medical Sciences, Du Cane Road, London, W12 0NN, UK
  - Institute of Clinical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK
- Anke Konrad
  - Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77845, USA
  - Present Address: Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156, Oeiras, Portugal
- Tony Belicard
  - MRC London Institute of Medical Sciences, Du Cane Road, London, W12 0NN, UK
  - Institute of Clinical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK
- Toni Beltran
  - MRC London Institute of Medical Sciences, Du Cane Road, London, W12 0NN, UK
  - Institute of Clinical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK
  - Present Address: Centre for Genomic Regulation, PRBB Building, 08003, Barcelona, Spain
- Vaishali Katju
  - Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77845, USA
- Peter Sarkies
  - MRC London Institute of Medical Sciences, Du Cane Road, London, W12 0NN, UK
  - Institute of Clinical Sciences, Imperial College London, Du Cane Road, London, W12 0NN, UK

28
Woodruff GC, Teterina AA. Degradation of the Repetitive Genomic Landscape in a Close Relative of Caenorhabditis elegans. Mol Biol Evol 2020; 37:2549-2567. [PMID: 32359146 PMCID: PMC7475029 DOI: 10.1093/molbev/msaa107] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 01/15/2023]
Abstract
The abundance, diversity, and genomic distribution of repetitive elements is highly variable among species. These patterns are thought to be driven in part by reproductive mode and the interaction of selection and recombination, and recombination rates typically vary by chromosomal position. In the nematode Caenorhabditis elegans, repetitive elements are enriched at chromosome arms and depleted on centers, and this mirrors the chromosomal distributions of other genomic features such as recombination rate. How conserved is this genomic landscape of repeats, and what evolutionary forces maintain it? To address this, we compared the genomic organization of repetitive elements across five Caenorhabditis species with chromosome-level assemblies. As previously reported, repeat content is enriched on chromosome arms in most Caenorhabditis species, and no obvious patterns of repeat content associated with reproductive mode were observed. However, the fig-associated C. inopinata has experienced repetitive element expansion and reveals no association of global repeat density with chromosome position. Patterns of repeat superfamily specific distributions reveal this global pattern is driven largely by a few repeat superfamilies that in C. inopinata have expanded in number and have weak associations with chromosome position. Additionally, 15% of predicted protein-coding genes in C. inopinata align to transposon-related proteins. When these are excluded, C. inopinata has no enrichment of genes in chromosome centers, in contrast to its close relatives who all have such clusters. Forward evolutionary simulations reveal that chromosomal heterogeneity in recombination rate alone can generate structured repetitive genomic landscapes when insertions are weakly deleterious, whereas chromosomal heterogeneity in the fitness effects of transposon insertion can promote such landscapes across a variety of evolutionary scenarios. 
Thus, patterns of gene density along chromosomes likely contribute to global repetitive landscapes in this group, although other historical or genomic factors are needed to explain the idiosyncrasy of genomic organization of various transposable element taxa within C. inopinata. Taken together, these results highlight the power of comparative genomics and evolutionary simulations in testing hypotheses regarding the causes of genome organization.
Affiliation(s)
- Gavin C Woodruff
  - Institute of Ecology and Evolution, University of Oregon, Eugene, OR
- Anastasia A Teterina
  - Institute of Ecology and Evolution, University of Oregon, Eugene, OR
  - Center of Parasitology, Severtsov Institute of Ecology and Evolution RAS, Moscow, Russia

29
Mérel V, Boulesteix M, Fablet M, Vieira C. Transposable elements in Drosophila. Mob DNA 2020; 11:23. [PMID: 32636946 PMCID: PMC7334843 DOI: 10.1186/s13100-020-00213-z] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Received: 02/16/2020] [Accepted: 04/14/2020] [Indexed: 12/25/2022]
Abstract
Drosophila has been studied as a biological model for many years and many discoveries in biology rely on this species. Research on transposable elements (TEs) is not an exception. Drosophila has contributed significantly to our knowledge on the mechanisms of transposition and their regulation, but above all, it was one of the first organisms on which genetic and genomic studies of populations were done. In this review article, in a very broad way, we will approach the TEs of Drosophila with a historical hindsight as well as recent discoveries in the field.
Affiliation(s)
- Vincent Mérel
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Matthieu Boulesteix
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Marie Fablet
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France
- Cristina Vieira
  - Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, F-69622 Villeurbanne, France

30
Signor S. Transposable elements in individual genotypes of Drosophila simulans. Ecol Evol 2020; 10:3402-3412. [PMID: 32273997 PMCID: PMC7141027 DOI: 10.1002/ece3.6134] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 11/12/2019] [Revised: 01/25/2020] [Accepted: 02/04/2020] [Indexed: 11/17/2022]
Abstract
Transposable elements are abundant, dynamic components of the genome that affect organismal phenotypes and fitness. In Drosophila melanogaster, they have increased in abundance as the species spread out of Africa, and different populations differ in their transposable element content. However, very little is currently known about how transposable elements differ between individual genotypes, and how that relates to the population dynamics of transposable elements overall. The sister species of D. melanogaster, D. simulans, has also recently become cosmopolitan, and panels of inbred genotypes exist from cosmopolitan and African flies. Therefore, we can determine whether the differences in colonizing populations are repeated in D. simulans, what the dynamics of transposable elements are in individual genotypes, and how that compares to wild flies. After estimating copy number in cosmopolitan and African D. simulans, I find that transposable element load is higher in flies from cosmopolitan populations. In addition, transposable element load varies considerably between populations and between genotypes, but not overall between wild and inbred lines. Certain genotypes either contain active transposable elements or are more permissive of transposition and accumulate copies of particular transposable elements. Overall, it is important to quantify genotype-specific transposable element dynamics as well as population averages to understand the dynamics of transposable element accumulation over time.
Affiliation(s)
- Sarah Signor
  - Department of Biological Sciences, North Dakota State University, Fargo, ND, USA

31
Ellison CE, Cao W. Nanopore sequencing and Hi-C scaffolding provide insight into the evolutionary dynamics of transposable elements and piRNA production in wild strains of Drosophila melanogaster. Nucleic Acids Res 2020; 48:290-303. [PMID: 31754714 PMCID: PMC6943127 DOI: 10.1093/nar/gkz1080] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Received: 07/01/2019] [Revised: 10/29/2019] [Accepted: 11/01/2019] [Indexed: 01/29/2023]
Abstract
Illumina sequencing has allowed for population-level surveys of transposable element (TE) polymorphism via split alignment approaches, which has provided important insight into the population dynamics of TEs. However, such approaches are not able to identify insertions of uncharacterized TEs, nor can they assemble the full sequence of inserted elements. Here, we use nanopore sequencing and Hi-C scaffolding to produce de novo genome assemblies for two wild strains of Drosophila melanogaster from the Drosophila Genetic Reference Panel (DGRP). Ovarian piRNA populations and Illumina split-read TE insertion profiles have been previously produced for both strains. We find that nanopore sequencing with Hi-C scaffolding produces highly contiguous, chromosome-length scaffolds, and we identify hundreds of TE insertions that were missed by Illumina-based methods, including a novel micropia-like element that has recently invaded the DGRP population. We also find hundreds of piRNA-producing loci that are specific to each strain. Some of these loci are created by strain-specific TE insertions, while others appear to be epigenetically controlled. Our results suggest that Illumina approaches reveal only a portion of the repetitive sequence landscape of eukaryotic genomes and that population-level resequencing using long reads is likely to provide novel insight into the evolutionary dynamics of repetitive elements.
Affiliation(s)
- Christopher E Ellison
  - Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Weihuan Cao
  - Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA

32
Díaz-González J, Domínguez A. Different structural variants of roo retrotransposon are active in Drosophila melanogaster. Gene 2020; 741:144546. [PMID: 32165306 DOI: 10.1016/j.gene.2020.144546] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 09/23/2019] [Revised: 01/30/2020] [Accepted: 03/08/2020] [Indexed: 11/29/2022]
Abstract
Retrotransposon roo is one of the most active elements in Drosophila melanogaster. The level of nucleotide diversity between copies of roo is very low but structural variation in the 5'-UTR is considerable. Transposition of roo at high frequency (around 5 × 10⁻² per generation) has been shown previously in the set of mutation accumulation lines named Oviedo. Here we isolated thirteen individual insertions by inverse PCR and sequenced the 5' end of the elements (between 1663 and 2039 nt) including the LTR, the 5'-UTR and a fragment of 661 nucleotides from the ORF, to study whether the new transposed copies come from a unique variant (the master copy model) or different elements are able to move (the transposon model). The elements in the Oviedo lines presented the same structural variants as the reference genome. Different structural variants were active, a behaviour compatible with the "transposon model" in which the copies localized in multiple sites in the genome are able to transpose. At the level of sequence, the copies of roo in our lines are highly similar to the elements in the reference genome. The phylogenetic tree shows a shallow diversification with unsupported nodes denoting that all the elements currently active are very young. This observation together with the great polymorphism in insertion sites implies a rapid turnover of the elements.
Affiliation(s)
- J Díaz-González
  - Departamento de Biología Funcional, Área de Genética, Universidad de Oviedo, 33071 Oviedo, Spain
- A Domínguez
  - Departamento de Biología Funcional, Área de Genética, Universidad de Oviedo, 33071 Oviedo, Spain

33
Luo S, Zhang H, Duan Y, Yao X, Clark AG, Lu J. The evolutionary arms race between transposable elements and piRNAs in Drosophila melanogaster. BMC Evol Biol 2020; 20:14. [PMID: 31992188 PMCID: PMC6988346 DOI: 10.1186/s12862-020-1580-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Received: 07/21/2019] [Accepted: 01/13/2020] [Indexed: 01/05/2023]
Abstract
Background: The piwi-interacting RNAs (piRNAs) are small non-coding RNAs that specifically repress transposable elements (TEs) in the germline of Drosophila. Despite our expanding understanding of TE:piRNA interaction, whether there is an evolutionary arms race between TEs and piRNAs was unclear. Results: Here, we studied the population genomics of TEs and piRNAs in the worldwide strains of D. melanogaster. By conducting a correlation analysis between TE contents and the abundance of piRNAs from ovaries of representative strains of D. melanogaster, we find positive correlations between TEs and piRNAs in six TE families. Our simulations further highlight that TE activities and the strength of purifying selection against TEs are important factors shaping the interactions between TEs and piRNAs. Our studies also suggest that the de novo generation of piRNAs is an important mechanism to repress the newly invaded TEs. Conclusions: Our results revealed the existence of an evolutionary arms race between the copy numbers of TEs and the abundance of antisense piRNAs at the population level. Although the interactions between TEs and piRNAs are complex and many factors should be considered to impact their interaction dynamics, our results suggest the emergence, repression specificity and strength of piRNAs on TEs should be considered in studying the landscapes of TE insertions in Drosophila. These results deepen our understanding of the interactions between piRNAs and TEs, and also provide novel insights into the nature of genomic conflicts of other forms.
Affiliation(s)
- Shiqi Luo
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, College of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
  - College of Plant Protection, Beijing Advanced Innovation Center for Food Nutrition and Human Health, China Agricultural University, Beijing, 100193, China
- Hong Zhang
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, College of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
- Yuange Duan
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, College of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
  - Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
- Xinmin Yao
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, College of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
- Andrew G Clark
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14853, USA
- Jian Lu
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, College of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China

34
Vendrell-Mir P, Barteri F, Merenciano M, González J, Casacuberta JM, Castanera R. A benchmark of transposon insertion detection tools using real data. Mob DNA 2019; 10:53. [PMID: 31892957 PMCID: PMC6937713 DOI: 10.1186/s13100-019-0197-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Received: 06/28/2019] [Accepted: 12/17/2019] [Indexed: 02/01/2023]
Abstract
Background: Transposable elements (TEs) are an important source of genomic variability in eukaryotic genomes. Their activity impacts genome architecture and gene expression and can lead to drastic phenotypic changes. Therefore, identifying TE polymorphisms is key to better understand the link between genotype and phenotype. However, most genotype-to-phenotype analyses have concentrated on single nucleotide polymorphisms as they are easier to reliably detect using short-read data. Many bioinformatic tools have been developed to identify transposon insertions from resequencing data using short reads. Nevertheless, the performance of most of these tools has been tested using simulated insertions, which do not accurately reproduce the complexity of natural insertions. Results: We have overcome this limitation by building a dataset of insertions from the comparison of two high-quality rice genomes, followed by extensive manual curation. This dataset contains validated insertions of two very different types of TEs, LTR-retrotransposons and MITEs. Using this dataset, we have benchmarked the sensitivity and precision of 12 commonly used tools, and our results suggest that in general their sensitivity was previously overestimated when using simulated data. Our results also show that increasing coverage leads to better sensitivity but at a cost in precision. Moreover, we found important differences in tool performance, with some tools performing better on a specific type of TEs. We have also used two sets of experimentally validated insertions in Drosophila and humans and show that this trend is maintained in genomes of different size and complexity. Conclusions: We discuss the possible choice of tools depending on the goals of the study and show that the appropriate combination of tools could be an option for most approaches, increasing the sensitivity while maintaining a good precision.
Collapse
Affiliation(s)
- Pol Vendrell-Mir
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| | - Fabio Barteri
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| | - Miriam Merenciano
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim Barceloneta 37-49, 08003 Barcelona, Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim Barceloneta 37-49, 08003 Barcelona, Spain
| | - Josep M Casacuberta
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| | - Raúl Castanera
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193 Barcelona, Spain
| |
|
35
|
Lerat E, Casacuberta J, Chaparro C, Vieira C. On the Importance to Acknowledge Transposable Elements in Epigenomic Analyses. Genes (Basel) 2019; 10:genes10040258. [PMID: 30935103 PMCID: PMC6523952 DOI: 10.3390/genes10040258] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Received: 02/04/2019] [Revised: 03/27/2019] [Accepted: 03/27/2019] [Indexed: 12/21/2022]
Abstract
Eukaryotic genomes comprise a large proportion of repeated sequences, an important fraction of which are transposable elements (TEs). TEs are mobile elements that have a significant impact on genome evolution and on gene functioning. Although some TE insertions could provide adaptive advantages to species, transposition is a highly mutagenic event that has to be tightly controlled to ensure its viability. Genomes have evolved sophisticated mechanisms to control TE activity, the most important being epigenetic silencing. However, the epigenetic control of TEs can also affect genes located nearby that can become epigenetically regulated. It has been proposed that the combination of TE mobilization and the induced changes in the epigenetic landscape could allow a rapid phenotypic adaptation to global environmental changes. In this review, we argue the crucial need to take into account the repeated part of genomes when studying the global impact of epigenetic modifications on an organism. We emphasize more particularly why it is important to carefully consider TEs and what bioinformatic tools can be used to do so.
Affiliation(s)
- Emmanuelle Lerat
- CNRS, Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, UMR 5558, F-69622 Villeurbanne, France.
| | - Josep Casacuberta
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Campus UAB, Cerdanyola del Vallès, 08193 Barcelona, Spain.
| | - Cristian Chaparro
- CNRS, IHPE UMR 5244, University of Perpignan Via Domitia, IFREMER, University Montpellier, F-66860 Perpignan, France.
| | - Cristina Vieira
- CNRS, Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, UMR 5558, F-69622 Villeurbanne, France.
| |
|
36
|
Abstract
Among the multitude of papers published yearly in scientific journals, precious few publications may be worth looking back on in half a century to appreciate the significance of discoveries that would later become common knowledge and shape a field or several adjacent fields. Here, Kimura's fundamental concept of neutral mutation-random drift, which was published 50 years ago, is re-examined in light of its pervasive influence on comparative genomics and, more specifically, on the contribution of transposable elements to eukaryotic genome evolution.
Affiliation(s)
- Irina R Arkhipova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA
| |
|
37
|
Lerat E, Goubert C, Guirao‐Rico S, Merenciano M, Dufour A, Vieira C, González J. Population-specific dynamics and selection patterns of transposable element insertions in European natural populations. Mol Ecol 2019; 28:1506-1522. [PMID: 30506554 PMCID: PMC6849870 DOI: 10.1111/mec.14963] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Received: 07/30/2018] [Revised: 10/30/2018] [Accepted: 11/05/2018] [Indexed: 01/02/2023]
Abstract
Transposable elements (TEs) are ubiquitous sequences in genomes of virtually all species. While TEs have been investigated for several decades, only recently have we had the opportunity to study their genome-wide population dynamics. Most of the studies so far have been restricted either to the analysis of the insertions annotated in the reference genome or to the analysis of a limited number of populations. Taking advantage of the European Drosophila population genomics consortium (DrosEU) sequencing data set, we have identified and measured the dynamics of TEs in a large sample of European Drosophila melanogaster natural populations. We showed that the mobilome landscape is population-specific and highly diverse depending on the TE family. In contrast with previous studies based on SNP variants, no geographical structure was observed for TE abundance or TE divergence in European populations. We further identified de novo individual insertions using two available programs and, as expected, most of the insertions were present at low frequencies. Nevertheless, we identified a subset of TEs present at high frequencies and located in genomic regions with a high recombination rate. These TEs are candidates for being the target of positive selection, although neutral processes should be discarded before reaching any conclusion on the type of selection acting on them. Finally, parallel patterns of association between the frequency of TE insertions and several geographical and temporal variables were found between European and North American populations, suggesting that TEs can be potentially implicated in the adaptation of populations across continents.
Affiliation(s)
- Emmanuelle Lerat
- Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, Université de Lyon, Université Lyon 1, CNRS, Villeurbanne, France
| | - Clément Goubert
- Molecular Biology and Genetics, Cornell University, Ithaca, New York
| | - Sara Guirao‐Rico
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra), Barcelona, Spain
| | - Miriam Merenciano
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra), Barcelona, Spain
| | - Anne‐Béatrice Dufour
- Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, Université de Lyon, Université Lyon 1, CNRS, Villeurbanne, France
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie Evolutive, UMR 5558, Université de Lyon, Université Lyon 1, CNRS, Villeurbanne, France
| | - Josefa González
- Institute of Evolutionary Biology (CSIC‐Universitat Pompeu Fabra), Barcelona, Spain
| |
|
38
|
Adrion JR, Begun DJ, Hahn MW. Patterns of transposable element variation and clinality in Drosophila. Mol Ecol 2019; 28:1523-1536. [DOI: 10.1111/mec.14961] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 06/29/2018] [Revised: 11/14/2018] [Accepted: 11/15/2018] [Indexed: 01/02/2023]
Affiliation(s)
- Jeffrey R. Adrion
- Department of Biology, University of Oregon, Eugene, Oregon
- Department of Biology, Indiana University, Bloomington, Indiana
| | - David J. Begun
- Department of Evolution and Ecology, University of California Davis, Davis, California
| | - Matthew W. Hahn
- Department of Biology, Indiana University, Bloomington, Indiana
- Department of Computer Science, Indiana University, Bloomington, Indiana
| |
|
39
|
Pirogov SA, Maksimenko OG, Georgiev PG. Transposable Elements in the Evolution of Gene Regulatory Networks. RUSS J GENET+ 2019. [DOI: 10.1134/s1022795419010113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]
|
40
|
Guio L, González J. New Insights on the Evolution of Genome Content: Population Dynamics of Transposable Elements in Flies and Humans. Methods Mol Biol 2019; 1910:505-530. [PMID: 31278675 DOI: 10.1007/978-1-4939-9074-0_16] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Indexed: 12/17/2022]
Abstract
Understanding the abundance, diversity, and distribution of TEs in genomes is crucial to understand genome structure, function, and evolution. Advances in whole-genome sequencing techniques, as well as in bioinformatics tools, have increased our ability to detect and analyze the transposable element content in genomes. In addition to reference genomes, we now have access to population datasets in which multiple individuals within a species are sequenced. In this chapter, we highlight the recent advances in the study of TE population dynamics focusing on fruit flies and humans, which represent two extremes in terms of TE abundance, diversity, and activity. We review the most recent methodological approaches applied to the study of TE dynamics as well as the new knowledge on host factors involved in the regulation of TE activity. In addition to transposition rates, we also focus on TE deletion rates and on the selective forces that affect the dynamics of TEs in genomes.
Affiliation(s)
- Lain Guio
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain.
| |
|
41
|
Danzmann RG, Norman JD, Rondeau EB, Messmer AM, Kent MP, Lien S, Igboeli O, Fast MD, Koop BF. A genetic linkage map for the salmon louse (Lepeophtheirus salmonis): evidence for high male:female and inter-familial recombination rate differences. Mol Genet Genomics 2018; 294:343-363. [PMID: 30460550 DOI: 10.1007/s00438-018-1513-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 02/14/2018] [Accepted: 10/15/2018] [Indexed: 01/28/2023]
Abstract
A salmon louse (Lepeophtheirus salmonis salmonis) genetic linkage map was constructed to serve as a genomic resource for future investigations into the biology of this important marine parasitic copepod species, and to provide insights into the inheritance patterns of genetic markers in this species. SNP genotyping of 8 families confirmed the presence of 15 linkage groups based upon the assignment of 93,773 markers. Progeny sample size weight adjusted map sizes in males (with the exception of SL12 and SL15) ranged in size from 96.50 cM (SL11) to 134.61 cM (SL06), and total combined map steps or bins ranged from 143 (SL09) to 203 (SL13). The SL12 male map was the smallest linkage group with a weight-averaged size of 3.05 cM with 6 recombination bins. Male:female specific recombination rate differences are 10.49:1 and represent one of the largest reported sex-specific differences for any animal species. Recombination ratio differences (M:F) ranged from 1.0 (SL12) to 29:1 (SL15). The number of markers exhibiting normal Mendelian segregation within the sex linkage group SL15 was extremely low (N = 80) in comparison to other linkage groups genotyped [range: 1459 (SL12)-10206 markers (SL05)]. Re-evaluation of Mendelian inheritance patterns of markers unassigned to any mapping parent according to hemizygous segregation patterns (models presented) identified matches for many of these markers to hemizygous patterns. The greatest proportion of these markers assigned to SL15 (N increased to 574). Inclusion of the hemizygous markers revised SL15 sex-specific recombination rate differences to 28:1. Recombination hot- and coldspots were identified across all linkage groups with all linkage groups possessing multiple peaks. Nine of 13 linkage groups evaluated possessed adjacent domains with hot-coldspot transitional zones. The most common pattern was for one end of the linkage to show elevated recombination in addition to internal regions. 
For SL01 and SL06, however, a terminal region with high recombination was not evident while a central domain possessing extremely high-recombination levels was present. High levels of recombination were weakly coupled to higher levels of SNP variation within domains, but this association was very strong for the central domains of SL01 and SL06. From the pooled paternal half-sib lots (several virgin females placed with one male), only one or two surviving family lots were obtained. Surviving families possessed parents where both the male and female possessed either inherently low or high recombination rates. This study provides insight into the organization of the sea louse genome, and describes large differences in recombination rate that exist among individuals of the same sex, and between the sexes. These differences in recombination rate may be coupled to the capabilities of this species to adapt to environmental and pharmaceutical treatments, given that family survivorship appears to be enhanced when parents have similar recombination levels.
Affiliation(s)
- Roy G Danzmann
- Department of Integrative Biology, College of Biological Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada.
| | - Joseph D Norman
- Department of Integrative Biology, College of Biological Sciences, University of Guelph, 50 Stone Road East, Guelph, ON, N1G 2W1, Canada.,The Hospital for Sick Children, 686 Bay St., Toronto, ON, M5G 0A4, Canada
| | - Eric B Rondeau
- Department of Biology, Centre for Biomedical Research, University of Victoria, 3800 Finnerty Road, Victoria, BC, V8W 3N5, Canada
| | - Amber M Messmer
- Department of Biology, Centre for Biomedical Research, University of Victoria, 3800 Finnerty Road, Victoria, BC, V8W 3N5, Canada
| | - Matthew P Kent
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences, 1432, Ås, Norway
| | - Sigbjørn Lien
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences, 1432, Ås, Norway
| | - Okechukwu Igboeli
- Department of Pathology and Microbiology, Atlantic Veterinary College, University of Prince Edward Island, 550 University Ave., Charlottetown, PEI, C1A 4P3, Canada
| | - Mark D Fast
- Department of Pathology and Microbiology, Atlantic Veterinary College, University of Prince Edward Island, 550 University Ave., Charlottetown, PEI, C1A 4P3, Canada
| | - Ben F Koop
- Department of Biology, Centre for Biomedical Research, University of Victoria, 3800 Finnerty Road, Victoria, BC, V8W 3N5, Canada
| |
|
42
|
Manee MM, Jackson J, Bergman CM. Conserved Noncoding Elements Influence the Transposable Element Landscape in Drosophila. Genome Biol Evol 2018; 10:1533-1545. [PMID: 29850787 PMCID: PMC6007792 DOI: 10.1093/gbe/evy104] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Accepted: 05/22/2018] [Indexed: 12/15/2022]
Abstract
Highly conserved noncoding elements (CNEs) constitute a significant proportion of the genomes of multicellular eukaryotes. The function of most CNEs remains elusive, but growing evidence indicates they are under some form of purifying selection. Noncoding regions in many species also harbor large numbers of transposable element (TE) insertions, which are typically lineage specific and depleted in exons because of their deleterious effects on gene function or expression. However, it is currently unknown whether the landscape of TE insertions in noncoding regions is random or influenced by purifying selection on CNEs. Here, we combine comparative and population genomic data in Drosophila melanogaster to show that the abundance of TE insertions in intronic and intergenic CNEs is reduced relative to random expectation, supporting the idea that selective constraints on CNEs eliminate a proportion of TE insertions in noncoding regions. However, we find no evidence for differences in the allele frequency spectra for polymorphic TE insertions in CNEs versus those in unconstrained spacer regions, suggesting that the distribution of fitness effects acting on observable TE insertions is similar across different functional compartments in noncoding DNA. Our results provide evidence that selective constraints on CNEs contribute to shaping the landscape of TE insertion in eukaryotic genomes, and provide further evidence that CNEs are indeed functionally constrained and not simply mutational cold spots.
Affiliation(s)
- Manee M Manee
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,National Center for Biotechnology, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia.,Center of Excellence for Genomics (CEG), King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - John Jackson
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Genetics, University of Georgia, Athens, GA.,Institute of Bioinformatics, University of Georgia, Athens, GA
| |
|
43
|
Jiang X, Tang H, Mohammed Ismail W, Lynch M. A Maximum-Likelihood Approach to Estimating the Insertion Frequencies of Transposable Elements from Population Sequencing Data. Mol Biol Evol 2018; 35:2560-2571. [PMID: 30099533 PMCID: PMC6188571 DOI: 10.1093/molbev/msy152] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 11/22/2022]
Abstract
Transposable elements (TEs) contribute to a large fraction of the expansion of many eukaryotic genomes due to the capability of TEs duplicating themselves through transposition. A first step to understanding the roles of TEs in a eukaryotic genome is to characterize the population-wide variation of TE insertions in the species. Here, we present a maximum-likelihood (ML) method for estimating allele frequencies and detecting selection on TE insertions in a diploid population, based on the genotypes at TE insertion sites detected in multiple individuals sampled from the population using paired-end (PE) sequencing reads. Tests of the method on simulated data show that it can accurately estimate the allele frequencies of TE insertions even when the PE sequencing is conducted at a relatively low coverage (=5X). The method can also detect TE insertions under strong selection, and the detection ability increases with sample size in a population, although a substantial fraction of actual TE insertions under selection may be undetected. Application of the ML method to genomic sequencing data collected from a natural Daphnia pulex population shows that, on the one hand, most (>90%) TE insertions present in the reference D. pulex genome are either fixed or nearly fixed (with allele frequencies >0.95); on the other hand, among the nonreference TE insertions (i.e., those detected in some individuals in the population but absent from the reference genome), the majority (>70%) are still at low frequencies (<0.1). Finally, we detected a substantial fraction (∼9%) of nonreference TE insertions under selection.
Affiliation(s)
- Xiaoqian Jiang
- Department of Biology, Indiana University, Bloomington, IN
| | - Haixu Tang
- School of Informatics and Computing, Indiana University, Bloomington, IN
| | | | - Michael Lynch
- Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ
| |
|
44
|
Song MJ, Schaack S. Evolutionary Conflict between Mobile DNA and Host Genomes. Am Nat 2018; 192:263-273. [DOI: 10.1086/698482] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Indexed: 11/04/2022]
|
45
|
Kent TV, Uzunović J, Wright SI. Coevolution between transposable elements and recombination. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0458. [PMID: 29109221 DOI: 10.1098/rstb.2016.0458] [Citation(s) in RCA: 142] [Impact Index Per Article: 23.7] [Accepted: 04/18/2017] [Indexed: 12/24/2022]
Abstract
One of the most striking patterns of genome structure is the tight, typically negative, association between transposable elements (TEs) and meiotic recombination rates. While this is a highly recurring feature of eukaryotic genomes, the mechanisms driving correlations between TEs and recombination remain poorly understood, and distinguishing cause versus effect is challenging. Here, we review the evidence for a relation between TEs and recombination, and discuss the underlying evolutionary forces. Evidence to date suggests that overall TE densities correlate negatively with recombination, but the strength of this correlation varies across element types, and the pattern can be reversed. Results suggest that heterogeneity in the strength of selection against ectopic recombination and gene disruption can drive TE accumulation in regions of low recombination, but there is also strong evidence that the regulation of TEs can influence local recombination rates. We hypothesize that TE insertion polymorphism may be important in driving within-species variation in recombination rates in surrounding genomic regions. Furthermore, the interaction between TEs and recombination may create positive feedback, whereby TE accumulation in non-recombining regions contributes to the spread of recombination suppression. Further investigation of the coevolution between recombination and TEs has important implications for our understanding of the evolution of recombination rates and genome structure. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Affiliation(s)
- Tyler V Kent
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St, Toronto, Ontario, Canada M5S3B2
| | - Jasmina Uzunović
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St, Toronto, Ontario, Canada M5S3B2
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St, Toronto, Ontario, Canada M5S3B2
| |
|
46
|
Oppenheim SJ, Feindt W, DeSalle R, Goldstein PZ. De Novo characterization of transcriptomes from two North American Papaipema stem-borers (Lepidoptera: Noctuidae). PLoS One 2018; 13:e0191061. [PMID: 29364900 PMCID: PMC5783364 DOI: 10.1371/journal.pone.0191061] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Received: 11/17/2017] [Accepted: 12/27/2017] [Indexed: 11/19/2022]
Abstract
Stem-borers in the genus Papaipema (Lepidoptera: Noctuidae) range from highly polyphagous agricultural pests to specialists on more than 20 families of flowering plants, many of them highly toxic. Papaipema is the largest genus of noctuids endemic to North America and provides an excellent study system for the evolution of noctuid host plant use. To improve the availability of genomic resources for such investigations, we performed de novo transcriptome sequencing and assembly for two specialist Papaipema with unusual larval hosts: P. speciosissima, which is associated with ferns, and the undescribed P. “sp. 4,” which is associated with bamboo. The resulting transcriptomes were similar in terms of completeness, gene count, and gene identity, but we identified some 8,000 genes (~17% of each transcriptome) not shared between the two species. While some of these have identifiable orthologs in other Lepidoptera, ~5% of each transcriptome consists of species-specific genes. We examine the function of these genes and find that almost half have retrotransposon-related functional domains. The potential role of species-specific genes is discussed, and the expansion of certain retrotransposon families in Papaipema is examined.
Affiliation(s)
- Sara J. Oppenheim
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
| | - Wiebke Feindt
- Stiftung Tierärztliche Hochschule Hannover, ITZ, Division of Ecology and Evolution, Hannover, Germany
| | - Rob DeSalle
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
| | - Paul Z. Goldstein
- Systematic Entomology Laboratory, USDA, National Museum of Natural History, Washington DC, United States of America
| |
|