1
|
Han S, Dias GB, Basting PJ, Viswanatha R, Perrimon N, Bergman C. Local assembly of long reads enables phylogenomics of transposable elements in a polyploid cell line. Nucleic Acids Res 2022; 50:e124. [PMID: 36156149 PMCID: PMC9757076 DOI: 10.1093/nar/gkac794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 07/21/2022] [Accepted: 09/16/2022] [Indexed: 12/24/2022] Open
Abstract
Animal cell lines often undergo extreme genome restructuring events, including polyploidy and segmental aneuploidy that can impede de novo whole-genome assembly (WGA). In some species like Drosophila, cell lines also exhibit massive proliferation of transposable elements (TEs). To better understand the role of transposition during animal cell culture, we sequenced the genome of the tetraploid Drosophila S2R+ cell line using long-read and linked-read technologies. WGAs for S2R+ were highly fragmented and generated variable estimates of TE content across sequencing and assembly technologies. We therefore developed a novel WGA-independent bioinformatics method called TELR that identifies, locally assembles, and estimates allele frequency of TEs from long-read sequence data (https://github.com/bergmanlab/telr). Application of TELR to a ∼130x PacBio dataset for S2R+ revealed many haplotype-specific TE insertions that arose by transposition after initial cell line establishment and subsequent tetraploidization. Local assemblies from TELR also allowed phylogenetic analysis of paralogous TEs, which revealed that proliferation of TE families in vitro can be driven by single or multiple source lineages. Our work provides a model for the analysis of TEs in complex heterozygous or polyploid genomes that are recalcitrant to WGA and yields new insights into the mechanisms of genome evolution in animal cell culture.
Collapse
Affiliation(s)
| | | | - Preston J Basting
- Institute of Bioinformatics, University of Georgia, 120 E. Green St., Athens, GA, USA
| | - Raghuvir Viswanatha
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, USA
| | - Norbert Perrimon
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, USA,Howard Hughes Medical Institute, Boston, MA, USA
| | - Casey M Bergman
- To whom correspondence should be addressed. Tel: +1 706 542 1764; Fax: +1 706 542 3910;
| |
Collapse
|
2
|
Han S, Dias GB, Basting PJ, Nelson MG, Patel S, Marzo M, Bergman CM. Ongoing transposition in cell culture reveals the phylogeny of diverse Drosophila S2 sublines. Genetics 2022; 221:iyac077. [PMID: 35536183 PMCID: PMC9252272 DOI: 10.1093/genetics/iyac077] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 04/28/2022] [Indexed: 11/13/2022] Open
Abstract
Cultured cells are widely used in molecular biology despite poor understanding of how cell line genomes change in vitro over time. Previous work has shown that Drosophila cultured cells have a higher transposable element content than whole flies, but whether this increase in transposable element content resulted from an initial burst of transposition during cell line establishment or ongoing transposition in cell culture remains unclear. Here, we sequenced the genomes of 25 sublines of Drosophila S2 cells and show that transposable element insertions provide abundant markers for the phylogenetic reconstruction of diverse sublines in a model animal cell culture system. DNA copy number evolution across S2 sublines revealed dramatically different patterns of genome organization that support the overall evolutionary history reconstructed using transposable element insertions. Analysis of transposable element insertion site occupancy and ancestral states support a model of ongoing transposition dominated by episodic activity of a small number of retrotransposon families. Our work demonstrates that substantial genome evolution occurs during long-term Drosophila cell culture, which may impact the reproducibility of experiments that do not control for subline identity.
Collapse
Affiliation(s)
- Shunhua Han
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Guilherme B Dias
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Preston J Basting
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Michael G Nelson
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| | - Sanjai Patel
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| | - Mar Marzo
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| | - Casey M Bergman
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
3
|
Mombach DM, Fontoura Gomes TMFD, Silva MM, Loreto ÉLS. Molecular and biological effects of Cisplatin in Drosophila. Comp Biochem Physiol C Toxicol Pharmacol 2022; 252:109229. [PMID: 34728387 DOI: 10.1016/j.cbpc.2021.109229] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Revised: 10/20/2021] [Accepted: 10/27/2021] [Indexed: 11/24/2022]
Abstract
Cisplatin is widely used in cancer treatment and is one of the best cytostatic agents available for antitumor therapy. Drosophila melanogaster has one of the best annotated genomes and one of the best characterized sets of transposable elements (TE) sequences. This model organism is useful for analyzing the mode of action of several compounds in vivo and evaluating the behavioral consequences of treatments. The aim of our study was to increase the knowledge about the effects of Cisplatin in Drosophila by joining RNA-seq and biological assays. RNA-seq was followed by analyses of differential expression of genes (DEGs) and TEs (DETEs), and of pathways and ontology terms. DETEs were confirmed by qPCR. Cisplatin was evaluated at 50 and 100 μg/mL in Drosophila culture medium for 24 h. The fly locomotor assay, survival analysis, oviposition and development were used as biological assays. Cisplatin induced DEGs in a dose-dependent fashion, and four TEs were up-regulated. Most DEGs are related to DNA damage and detoxification processes. Cisplatin increases Drosophila locomotor activity and interrupts development. Genes and processes related to the assays were also identified. This is the first study to evaluate the effects of Cisplatin in flies using RNA-seq. Gene alteration was almost limited to drug metabolism and DNA damage, and the drug did not vastly affect Drosophila on the molecular level. Contrary to the hypothesis that stress dramatically alters TEs mobilization, only four TEs were up-regulated. Our study, together with previous knowledge, asserts Drosophila as a valuable organism in the study of chemotherapy drugs.
Collapse
Affiliation(s)
- Daniela Moreira Mombach
- Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil
| | | | - Mônica Medeiros Silva
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal de Santa Maria, Santa Maria, RS, Brazil
| | - Élgion Lúcio Silva Loreto
- Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Departamento de Bioquímica e Biologia Molecular, Universidade Federal de Santa Maria, Santa Maria, RS, Brazil.
| |
Collapse
|
4
|
Ullastres A, Merenciano M, González J. Regulatory regions in natural transposable element insertions drive interindividual differences in response to immune challenges in Drosophila. Genome Biol 2021; 22:265. [PMID: 34521452 PMCID: PMC8439047 DOI: 10.1186/s13059-021-02471-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 08/19/2021] [Indexed: 02/08/2023] Open
Abstract
Background Variation in gene expression underlies interindividual variability in relevant traits including immune response. However, the genetic variation responsible for these gene expression changes remains largely unknown. Among the non-coding variants that could be relevant, transposable element insertions are promising candidates as they have been shown to be a rich and diverse source of cis-regulatory elements. Results In this work, we use a population genetics approach to identify transposable element insertions likely to increase the tolerance of Drosophila melanogaster to bacterial infection by affecting the expression of immune-related genes. We identify 12 insertions associated with allele-specific expression changes in immune-related genes. We experimentally validate three of these insertions including one likely to be acting as a silencer, one as an enhancer, and one with a dual role as enhancer and promoter. The direction in the change of gene expression associated with the presence of several of these insertions is consistent with an increased survival to infection. Indeed, for one of the insertions, we show that this is the case by analyzing both natural populations and CRISPR/Cas9 mutants in which the insertion is deleted from its native genomic context. Conclusions We show that transposable elements contribute to gene expression variation in response to infection in D. melanogaster and that this variation is likely to affect their survival capacity. Because the role of transposable elements as regulatory elements is not restricted to Drosophila, transposable elements are likely to play a role in immune response in other organisms as well. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-021-02471-3.
Collapse
Affiliation(s)
- Anna Ullastres
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003, Barcelona, Spain
| | - Miriam Merenciano
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003, Barcelona, Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37-49, 08003, Barcelona, Spain.
| |
Collapse
|
5
|
Chakraborty M, Chang CH, Khost DE, Vedanayagam J, Adrion JR, Liao Y, Montooth KL, Meiklejohn CD, Larracuente AM, Emerson JJ. Evolution of genome structure in the Drosophila simulans species complex. Genome Res 2021; 31:380-396. [PMID: 33563718 PMCID: PMC7919458 DOI: 10.1101/gr.263442.120] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 12/28/2020] [Indexed: 12/25/2022]
Abstract
The rapid evolution of repetitive DNA sequences, including satellite DNA, tandem duplications, and transposable elements, underlies phenotypic evolution and contributes to hybrid incompatibilities between species. However, repetitive genomic regions are fragmented and misassembled in most contemporary genome assemblies. We generated highly contiguous de novo reference genomes for the Drosophila simulans species complex (D. simulans, D. mauritiana, and D. sechellia), which speciated ∼250,000 yr ago. Our assemblies are comparable in contiguity and accuracy to the current D. melanogaster genome, allowing us to directly compare repetitive sequences between these four species. We find that at least 15% of the D. simulans complex species genomes fail to align uniquely to D. melanogaster owing to structural divergence-twice the number of single-nucleotide substitutions. We also find rapid turnover of satellite DNA and extensive structural divergence in heterochromatic regions, whereas the euchromatic gene content is mostly conserved. Despite the overall preservation of gene synteny, euchromatin in each species has been shaped by clade- and species-specific inversions, transposable elements, expansions and contractions of satellite and tRNA tandem arrays, and gene duplications. We also find rapid divergence among Y-linked genes, including copy number variation and recent gene duplications from autosomes. Our assemblies provide a valuable resource for studying genome evolution and its consequences for phenotypic evolution in these genetic model species.
Collapse
Affiliation(s)
- Mahul Chakraborty
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
| | - Ching-Ho Chang
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
| | - Danielle E Khost
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
- FAS Informatics and Scientific Applications, Harvard University, Cambridge, Massachusetts 02138, USA
| | - Jeffrey Vedanayagam
- Department of Developmental Biology, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
| | - Jeffrey R Adrion
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon 97403, USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
| | - Kristi L Montooth
- School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska 68502, USA
| | - Colin D Meiklejohn
- School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska 68502, USA
| | | | - J J Emerson
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
| |
Collapse
|
6
|
Fabian DK, Dönertaş HM, Fuentealba M, Partridge L, Thornton JM. Transposable Element Landscape in Drosophila Populations Selected for Longevity. Genome Biol Evol 2021; 13:6141024. [PMID: 33595657 PMCID: PMC8355499 DOI: 10.1093/gbe/evab031] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2021] [Indexed: 12/11/2022] Open
Abstract
Transposable elements (TEs) inflict numerous negative effects on health and fitness as they replicate by integrating into new regions of the host genome. Even though organisms employ powerful mechanisms to demobilize TEs, transposons gradually lose repression during aging. The rising TE activity causes genomic instability and was implicated in age-dependent neurodegenerative diseases, inflammation, and the determination of lifespan. It is therefore conceivable that long-lived individuals have improved TE silencing mechanisms resulting in reduced TE expression relative to their shorter-lived counterparts and fewer genomic insertions. Here, we test this hypothesis by performing the first genome-wide analysis of TE insertions and expression in populations of Drosophila melanogaster selected for longevity through late-life reproduction for 50–170 generations from four independent studies. Contrary to our expectation, TE families were generally more abundant in long-lived populations compared with nonselected controls. Although simulations showed that this was not expected under neutrality, we found little evidence for selection driving TE abundance differences. Additional RNA-seq analysis revealed a tendency for reducing TE expression in selected populations, which might be more important for lifespan than regulating genomic insertions. We further find limited evidence of parallel selection on genes related to TE regulation and transposition. However, telomeric TEs were genomically and transcriptionally more abundant in long-lived flies, suggesting improved telomere maintenance as a promising TE-mediated mechanism for prolonging lifespan. Our results provide a novel viewpoint indicating that reproduction at old age increases the opportunity of TEs to be passed on to the next generation with little impact on longevity.
Collapse
Affiliation(s)
- Daniel K Fabian
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Institute of Healthy Ageing, Department of Genetics, Evolution and Environment, University College London, United Kingdom
- Corresponding author: E-mail:
| | - Handan Melike Dönertaş
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Matías Fuentealba
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Institute of Healthy Ageing, Department of Genetics, Evolution and Environment, University College London, United Kingdom
| | - Linda Partridge
- Institute of Healthy Ageing, Department of Genetics, Evolution and Environment, University College London, United Kingdom
- Max Planck Institute for Biology of Ageing, Cologne, Germany
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| |
Collapse
|
7
|
Abstract
Drosophila melanogaster, a small dipteran of African origin, represents one of the best-studied model organisms. Early work in this system has uniquely shed light on the basic principles of genetics and resulted in a versatile collection of genetic tools that allow to uncover mechanistic links between genotype and phenotype. Moreover, given its worldwide distribution in diverse habitats and its moderate genome-size, Drosophila has proven very powerful for population genetics inference and was one of the first eukaryotes whose genome was fully sequenced. In this book chapter, we provide a brief historical overview of research in Drosophila and then focus on recent advances during the genomic era. After describing different types and sources of genomic data, we discuss mechanisms of neutral evolution including the demographic history of Drosophila and the effects of recombination and biased gene conversion. Then, we review recent advances in detecting genome-wide signals of selection, such as soft and hard selective sweeps. We further provide a brief introduction to background selection, selection of noncoding DNA and codon usage and focus on the role of structural variants, such as transposable elements and chromosomal inversions, during the adaptive process. Finally, we discuss how genomic data helps to dissect neutral and adaptive evolutionary mechanisms that shape genetic and phenotypic variation in natural populations along environmental gradients. In summary, this book chapter serves as a starting point to Drosophila population genomics and provides an introduction to the system and an overview to data sources, important population genetic concepts and recent advances in the field.
Collapse
|
8
|
Ellison CE, Kagda MS, Cao W. Telomeric TART elements target the piRNA machinery in Drosophila. PLoS Biol 2020; 18:e3000689. [PMID: 33347429 PMCID: PMC7785250 DOI: 10.1371/journal.pbio.3000689] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 01/05/2021] [Accepted: 12/10/2020] [Indexed: 11/23/2022] Open
Abstract
Coevolution between transposable elements (TEs) and their hosts can be antagonistic, where TEs evolve to avoid silencing and the host responds by reestablishing TE suppression, or mutualistic, where TEs are co-opted to benefit their host. The TART-A TE functions as an important component of Drosophila telomeres but has also reportedly inserted into the Drosophila melanogaster nuclear export factor gene nxf2. We find that, rather than inserting into nxf2, TART-A has actually captured a portion of nxf2 sequence. We show that TART-A produces abundant Piwi-interacting small RNAs (piRNAs), some of which are antisense to the nxf2 transcript, and that the TART-like region of nxf2 is evolving rapidly. Furthermore, in D. melanogaster, TART-A is present at higher copy numbers, and nxf2 shows reduced expression, compared to the closely related species Drosophila simulans. We propose that capturing nxf2 sequence allowed TART-A to target the nxf2 gene for piRNA-mediated repression and that these 2 elements are engaged in antagonistic coevolution despite the fact that TART-A is serving a critical role for its host genome. Co-evolution between transposable elements (TEs) and their hosts can be antagonistic, where TEs evolve to avoid silencing and the host responds by re-establishing TE suppression, or mutualistic, where TEs are co-opted to benefit their host. This study shows that a specialized Drosophila retrotransposon that functions as a telomere has captured a portion of a host piRNA gene which may allow it to evade silencing.
Collapse
Affiliation(s)
- Christopher E. Ellison
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey, United States of America
- * E-mail:
| | - Meenakshi S. Kagda
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey, United States of America
| | - Weihuan Cao
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey, United States of America
| |
Collapse
|
9
|
Winbush A, Singh ND. Genomics of Recombination Rate Variation in Temperature-Evolved Drosophila melanogaster Populations. Genome Biol Evol 2020; 13:6008691. [PMID: 33247719 PMCID: PMC7851596 DOI: 10.1093/gbe/evaa252] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/24/2020] [Indexed: 12/14/2022] Open
Abstract
Meiotic recombination is a critical process that ensures proper segregation of chromosome homologs through DNA double-strand break repair mechanisms. Rates of recombination are highly variable among various taxa, within species, and within genomes with far-reaching evolutionary and genomic consequences. The genetic basis of recombination rate variation is therefore crucial in the study of evolutionary biology but remains poorly understood. In this study, we took advantage of a set of experimental temperature-evolved populations of Drosophila melanogaster with heritable differences in recombination rates depending on the temperature regime in which they evolved. We performed whole-genome sequencing and identified several chromosomal regions that appear to be divergent depending on temperature regime. In addition, we identify a set of single-nucleotide polymorphisms and associated genes with significant differences in allele frequency when the different temperature populations are compared. Further refinement of these gene candidates emphasizing those expressed in the ovary and associated with DNA binding reveals numerous potential candidate genes such as Hr38, EcR, and mamo responsible for observed differences in recombination rates in these experimental evolution lines thus providing insight into the genetic basis of recombination rate variation.
Collapse
Affiliation(s)
- Ari Winbush
- Department of Biology, Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, USA
| | - Nadia D Singh
- Department of Biology, Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, USA
- Corresponding author: E-mail:
| |
Collapse
|
10
|
Ellison CE, Cao W. Nanopore sequencing and Hi-C scaffolding provide insight into the evolutionary dynamics of transposable elements and piRNA production in wild strains of Drosophila melanogaster. Nucleic Acids Res 2020; 48:290-303. [PMID: 31754714 PMCID: PMC6943127 DOI: 10.1093/nar/gkz1080] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Revised: 10/29/2019] [Accepted: 11/01/2019] [Indexed: 01/29/2023] Open
Abstract
Illumina sequencing has allowed for population-level surveys of transposable element (TE) polymorphism via split alignment approaches, which has provided important insight into the population dynamics of TEs. However, such approaches are not able to identify insertions of uncharacterized TEs, nor can they assemble the full sequence of inserted elements. Here, we use nanopore sequencing and Hi-C scaffolding to produce de novo genome assemblies for two wild strains of Drosophila melanogaster from the Drosophila Genetic Reference Panel (DGRP). Ovarian piRNA populations and Illumina split-read TE insertion profiles have been previously produced for both strains. We find that nanopore sequencing with Hi-C scaffolding produces highly contiguous, chromosome-length scaffolds, and we identify hundreds of TE insertions that were missed by Illumina-based methods, including a novel micropia-like element that has recently invaded the DGRP population. We also find hundreds of piRNA-producing loci that are specific to each strain. Some of these loci are created by strain-specific TE insertions, while others appear to be epigenetically controlled. Our results suggest that Illumina approaches reveal only a portion of the repetitive sequence landscape of eukaryotic genomes and that population-level resequencing using long reads is likely to provide novel insight into the evolutionary dynamics of repetitive elements.
Collapse
Affiliation(s)
- Christopher E Ellison
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Weihuan Cao
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| |
Collapse
|
11
|
Hill T, Koseva BS, Unckless RL. The Genome of Drosophila innubila Reveals Lineage-Specific Patterns of Selection in Immune Genes. Mol Biol Evol 2019; 36:1405-1417. [PMID: 30865231 PMCID: PMC6573480 DOI: 10.1093/molbev/msz059] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Pathogenic microbes can exert extraordinary evolutionary pressure on their hosts. They can spread rapidly and sicken or even kill their host to promote their own proliferation. Because of this strong selective pressure, immune genes are some of the fastest evolving genes across metazoans, as highlighted in mammals and insects. Drosophila melanogaster serves as a powerful model for studying host/pathogen evolution. While Drosophila melanogaster are frequently exposed to various pathogens, little is known about D. melanogaster's ecology, or if they are representative of other Drosophila species in terms of pathogen pressure. Here, we characterize the genome of Drosophila innubila, a mushroom-feeding species highly diverged from D. melanogaster and investigate the evolution of the immune system. We find substantial differences in the rates of evolution of immune pathways between D. innubila and D. melanogaster. Contrasting what was previously found for D. melanogaster, we find little evidence of rapid evolution of the antiviral RNAi genes and high rates of evolution in the Toll pathway. This suggests that, while immune genes tend to be rapidly evolving in most species, the specific genes that are fastest evolving may depend either on the pathogens faced by the host and/or divergence in the basic architecture of the host's immune system.
Collapse
Affiliation(s)
- Tom Hill
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS
| | | | - Robert L Unckless
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS
| |
Collapse
|
12
|
Manee MM, Jackson J, Bergman CM. Conserved Noncoding Elements Influence the Transposable Element Landscape in Drosophila. Genome Biol Evol 2018; 10:1533-1545. [PMID: 29850787 PMCID: PMC6007792 DOI: 10.1093/gbe/evy104] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/22/2018] [Indexed: 12/15/2022] Open
Abstract
Highly conserved noncoding elements (CNEs) constitute a significant proportion of the genomes of multicellular eukaryotes. The function of most CNEs remains elusive, but growing evidence indicates they are under some form of purifying selection. Noncoding regions in many species also harbor large numbers of transposable element (TE) insertions, which are typically lineage specific and depleted in exons because of their deleterious effects on gene function or expression. However, it is currently unknown whether the landscape of TE insertions in noncoding regions is random or influenced by purifying selection on CNEs. Here, we combine comparative and population genomic data in Drosophila melanogaster to show that the abundance of TE insertions in intronic and intergenic CNEs is reduced relative to random expectation, supporting the idea that selective constraints on CNEs eliminate a proportion of TE insertions in noncoding regions. However, we find no evidence for differences in the allele frequency spectra for polymorphic TE insertions in CNEs versus those in unconstrained spacer regions, suggesting that the distribution of fitness effects acting on observable TE insertions is similar across different functional compartments in noncoding DNA. Our results provide evidence that selective constraints on CNEs contribute to shaping the landscape of TE insertion in eukaryotic genomes, and provide further evidence that CNEs are indeed functionally constrained and not simply mutational cold spots.
Collapse
Affiliation(s)
- Manee M Manee
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,National Center for Biotechnology, King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia.,Center of Excellence for Genomics (CEG), King Abdulaziz City for Science and Technology, Riyadh, Saudi Arabia
| | - John Jackson
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Genetics, University of Georgia, Athens, GA.,Institute of Bioinformatics, University of Georgia, Athens, GA
| |
Collapse
|
13
|
Bergman CM, Han S, Nelson MG, Bondarenko V, Kozeretska I. Genomic analysis of P elements in natural populations of Drosophila melanogaster. PeerJ 2017; 5:e3824. [PMID: 28929030 PMCID: PMC5602686 DOI: 10.7717/peerj.3824] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Accepted: 08/29/2017] [Indexed: 11/20/2022] Open
Abstract
The Drosophila melanogaster P transposable element provides one of the best cases of horizontal transfer of a mobile DNA sequence in eukaryotes. Invasion of natural populations by the P element has led to a syndrome of phenotypes known as P-M hybrid dysgenesis that emerges when strains differing in their P element composition mate and produce offspring. Despite extensive research on many aspects of P element biology, many questions remain about the genomic basis of variation in P-M dysgenesis phenotypes across populations. Here we compare estimates of genomic P element content with gonadal dysgenesis phenotypes for isofemale strains obtained from three worldwide populations of D. melanogaster to illuminate the molecular basis of natural variation in cytotype status. We show that P element abundance estimated from genome sequences of isofemale strains is highly correlated across different bioinformatics approaches, but that abundance estimates are sensitive to method and filtering strategies as well as incomplete inbreeding of isofemale strains. We find that P element content varies significantly across populations, with strains from a North American population having fewer P elements but a higher proportion of full-length elements than strains from populations sampled in Europe or Africa. Despite these geographic differences in P element abundance and structure, neither the number of P elements nor the ratio of full-length to internally-truncated copies is strongly correlated with the degree of gonadal dysgenesis exhibited by an isofemale strain. Thus, variation in P element abundance and structure across different populations does not necessarily lead to corresponding geographic differences in gonadal dysgenesis phenotypes. Finally, we confirm that population differences in the abundance and structure of P elements that are observed from isofemale lines can also be observed in pool-seq samples from the same populations. Our work supports the view that genomic P element content alone is not sufficient to explain variation in gonadal dysgenesis across strains of D. melanogaster, and informs future efforts to decode the genomic basis of geographic and temporal differences in P element induced phenotypes.
Collapse
Affiliation(s)
- Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom.,Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America
| | - Shunhua Han
- Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America
| | - Michael G Nelson
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Vladyslav Bondarenko
- Department of General and Molecular Genetics, Taras Shevchenko University of Kyiv, Kyiv, Ukraine
| | - Iryna Kozeretska
- Department of General and Molecular Genetics, Taras Shevchenko University of Kyiv, Kyiv, Ukraine
| |
Collapse
|
14
|
McClintock: An Integrated Pipeline for Detecting Transposable Element Insertions in Whole-Genome Shotgun Sequencing Data. G3-GENES GENOMES GENETICS 2017. [PMID: 28637810 PMCID: PMC5555480 DOI: 10.1534/g3.117.043893] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Transposable element (TE) insertions are among the most challenging types of variants to detect in genomic data because of their repetitive nature and complex mechanisms of replication . Nevertheless, the recent availability of large resequencing data sets has spurred the development of many new methods to detect TE insertions in whole-genome shotgun sequences. Here we report an integrated bioinformatics pipeline for the detection of TE insertions in whole-genome shotgun data, called McClintock (https://github.com/bergmanlab/mcclintock), which automatically runs and standardizes output for multiple TE detection methods. We demonstrate the utility of McClintock by evaluating six TE detection methods using simulated and real genome data from the model microbial eukaryote, Saccharomyces cerevisiae We find substantial variation among McClintock component methods in their ability to detect nonreference TEs in the yeast genome, but show that nonreference TEs at nearly all biologically realistic locations can be detected in simulated data by combining multiple methods that use split-read and read-pair evidence. In general, our results reveal that split-read methods detect fewer nonreference TE insertions than read-pair methods, but generally have much higher positional accuracy. Analysis of a large sample of real yeast genomes reveals that most McClintock component methods can recover known aspects of TE biology in yeast such as the transpositional activity status of families, target preferences, and target site duplication structure, albeit with varying levels of accuracy. Our work provides a general framework for integrating and analyzing results from multiple TE detection methods, as well as useful guidance for researchers studying TEs in yeast resequencing data.
Collapse
|
15
|
Abstract
Molecular population genetics aims to explain genetic variation and molecular evolution from population genetics principles. The field was born 50 years ago with the first measures of genetic variation in allozyme loci, continued with the nucleotide sequencing era, and is currently in the era of population genomics. During this period, molecular population genetics has been revolutionized by progress in data acquisition and theoretical developments. The conceptual elegance of the neutral theory of molecular evolution or the footprint carved by natural selection on the patterns of genetic variation are two examples of the vast number of inspiring findings of population genetics research. Since the inception of the field, Drosophila has been the prominent model species: molecular variation in populations was first described in Drosophila and most of the population genetics hypotheses were tested in Drosophila species. In this review, we describe the main concepts, methods, and landmarks of molecular population genetics, using the Drosophila model as a reference. We describe the different genetic data sets made available by advances in molecular technologies, and the theoretical developments fostered by these data. Finally, we review the results and new insights provided by the population genomics approach, and conclude by enumerating challenges and new lines of inquiry posed by increasingly large population scale sequence data.
Collapse
|
16
|
Reid NM, Jackson CE, Gilbert D, Minx P, Montague MJ, Hampton TH, Helfrich LW, King BL, Nacci DE, Aluru N, Karchner SI, Colbourne JK, Hahn ME, Shaw JR, Oleksiak MF, Crawford DL, Warren WC, Whitehead A. The landscape of extreme genomic variation in the highly adaptable Atlantic killifish. Genome Biol Evol 2017; 9:659-676. [PMID: 28201664 PMCID: PMC5381573 DOI: 10.1093/gbe/evx023] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 01/30/2017] [Accepted: 02/04/2017] [Indexed: 12/22/2022] Open
Abstract
Understanding and predicting the fate of populations in changing environments require knowledge about the mechanisms that support phenotypic plasticity and the adaptive value and evolutionary fate of genetic variation within populations. Atlantic killifish (Fundulus heteroclitus) exhibit extensive phenotypic plasticity that supports large population sizes in highly fluctuating estuarine environments. Populations have also evolved diverse local adaptations. To yield insights into the genomic variation that supports their adaptability, we sequenced a reference genome and 48 additional whole genomes from a wild population. Evolution of genes associated with cell cycle regulation and apoptosis is accelerated along the killifish lineage, which is likely tied to adaptations for life in highly variable estuarine environments. Genome-wide standing genetic variation, including nucleotide diversity and copy number variation, is extremely high. The highest diversity genes are those associated with immune function and olfaction, whereas genes under greatest evolutionary constraint are those associated with neurological, developmental, and cytoskeletal functions. Reduced genetic variation is detected for tight junction proteins, which in killifish regulate paracellular permeability that supports their extreme physiological flexibility. Low-diversity genes engage in more regulatory interactions than high-diversity genes, consistent with the influence of pleiotropic constraint on molecular evolution. High genetic variation is crucial for continued persistence of species given the pace of contemporary environmental change. Killifish populations harbor among the highest levels of nucleotide diversity yet reported for a vertebrate species, and thus may serve as a useful model system for studying evolutionary potential in variable and changing environments.
Collapse
Affiliation(s)
- Noah M Reid
- Department of Environmental Toxicology, University of California, Davis, CA 95616
| | - Craig E Jackson
- School of Public and Environmental Affairs, Indiana University, Bloomington, IN 47405
| | - Don Gilbert
- Biology Department, Indiana University, Bloomington, IN 47405
| | - Patrick Minx
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO 63108
| | - Michael J Montague
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO 63108
| | - Thomas H Hampton
- Department of Microbiology and Immunology, Dartmouth College Geisel School of Medicine, Hanover, NH 03755
| | - Lily W Helfrich
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - Benjamin L King
- Mount Desert Island Biological Laboratory, Salisbury Cove, ME 04672
| | - Diane E Nacci
- US Environmental Protection Agency, Office of Research and Development, Narragansett, RI, 02882
| | - Neel Aluru
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - Sibel I Karchner
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - John K Colbourne
- School of Biosciences, University of Birmingham, United Kingdom, B15 2TT
| | - Mark E Hahn
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - Joseph R Shaw
- School of Public and Environmental Affairs, Indiana University, Bloomington, IN 47405
| | - Marjorie F Oleksiak
- Department of Marine Biology and Ecology, Rosenstiel School of Marine and Atmospheric Science, University of Miami, Miami, FL 33149
| | - Douglas L Crawford
- Department of Marine Biology and Ecology, Rosenstiel School of Marine and Atmospheric Science, University of Miami, Miami, FL 33149
| | - Wesley C Warren
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO 63108
| | - Andrew Whitehead
- Department of Environmental Toxicology, University of California, Davis, CA 95616
| |
Collapse
|
17
|
Rius N, Guillén Y, Delprat A, Kapusta A, Feschotte C, Ruiz A. Exploration of the Drosophila buzzatii transposable element content suggests underestimation of repeats in Drosophila genomes. BMC Genomics 2016; 17:344. [PMID: 27164953 PMCID: PMC4862133 DOI: 10.1186/s12864-016-2648-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2015] [Accepted: 04/22/2016] [Indexed: 11/10/2022] Open
Abstract
Background Many new Drosophila genomes have been sequenced in recent years using new-generation sequencing platforms and assembly methods. Transposable elements (TEs), being repetitive sequences, are often misassembled, especially in the genomes sequenced with short reads. Consequently, the mobile fraction of many of the new genomes has not been analyzed in detail or compared with that of other genomes sequenced with different methods, which could shed light into the understanding of genome and TE evolution. Here we compare the TE content of three genomes: D. buzzatii st-1, j-19, and D. mojavensis. Results We have sequenced a new D. buzzatii genome (j-19) that complements the D. buzzatii reference genome (st-1) already published, and compared their TE contents with that of D. mojavensis. We found an underestimation of TE sequences in Drosophila genus NGS-genomes when compared to Sanger-genomes. To be able to compare genomes sequenced with different technologies, we developed a coverage-based method and applied it to the D. buzzatii st-1 and j-19 genome. Between 10.85 and 11.16 % of the D. buzzatii st-1 genome is made up of TEs, between 7 and 7,5 % of D. buzzatii j-19 genome, while TEs represent 15.35 % of the D. mojavensis genome. Helitrons are the most abundant order in the three genomes. Conclusions TEs in D. buzzatii are less abundant than in D. mojavensis, as expected according to the genome size and TE content positive correlation. However, TEs alone do not explain the genome size difference. TEs accumulate in the dot chromosomes and proximal regions of D. buzzatii and D. mojavensis chromosomes. We also report a significantly higher TE density in D. buzzatii and D. mojavensis X chromosomes, which is not expected under the current models. Our easy-to-use correction method allowed us to identify recently active families in D. buzzatii st-1 belonging to the LTR-retrotransposon superfamily Gypsy. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2648-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nuria Rius
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain.
| | - Yolanda Guillén
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | - Alejandra Delprat
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | - Aurélie Kapusta
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Alfredo Ruiz
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| |
Collapse
|
18
|
Stanley CE, Kulathinal RJ. Genomic signatures of domestication on neurogenetic genes in Drosophila melanogaster. BMC Evol Biol 2016; 16:6. [PMID: 26728183 PMCID: PMC4700609 DOI: 10.1186/s12862-015-0580-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Accepted: 12/22/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Domesticated animals quickly evolve docile and submissive behaviors after isolation from their wild conspecifics. Model organisms reared for prolonged periods in the laboratory also exhibit similar shifts towards these domesticated behaviors. Yet whether this divergence is due to inadvertent selection in the lab or the fixation of deleterious mutations remains unknown. RESULTS Here, we compare the genomes of lab-reared and wild-caught Drosophila melanogaster to understand the genetic basis of these recently endowed behaviors common to laboratory models. From reassembled genomes of common lab strains, we identify unique, derived variants not present in global populations (lab-specific SNPs). Decreased selective constraints across low frequency SNPs (unique to one or two lab strains) are different from patterns found in the wild and more similar to neutral expectations, suggesting an overall accumulation of deleterious mutations. However, high-frequency lab SNPs found in most or all lab strains reveal an enrichment of X-linked loci and neuro-sensory genes across large extended haplotypes. Among shared polymorphisms, we also find highly differentiated SNPs, in which the derived allele is higher in frequency in the wild (Fst*wild>lab), enriched for similar neurogenetic ontologies, indicative of relaxed selection on more active wild alleles in the lab. CONCLUSIONS Among random mutations that continuously accumulate in the laboratory, we detect common adaptive signatures in domesticated lab strains of fruit flies. Our results demonstrate that lab animals can quickly evolve domesticated behaviors via unconscious selection by humans early on a broad pool of disproportionately large neurogenetic targets followed by the fixation of accumulated deleterious mutations on functionally similar targets.
Collapse
Affiliation(s)
- Craig E Stanley
- Department of Biology, Temple University, Philadelphia, PA, USA.
| | - Rob J Kulathinal
- Department of Biology, Temple University, Philadelphia, PA, USA.
| |
Collapse
|
19
|
Smukowski Heil CS, Ellison C, Dubin M, Noor MAF. Recombining without Hotspots: A Comprehensive Evolutionary Portrait of Recombination in Two Closely Related Species of Drosophila. Genome Biol Evol 2015; 7:2829-42. [PMID: 26430062 PMCID: PMC4684701 DOI: 10.1093/gbe/evv182] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/03/2015] [Indexed: 12/12/2022] Open
Abstract
Meiotic recombination rate varies across the genome within and between individuals, populations, and species in virtually all taxa studied. In almost every species, this variation takes the form of discrete recombination hotspots, determined in some mammals by a protein called PRDM9. Hotspots and their determinants have a profound effect on the genomic landscape, and share certain features that extend across the tree of life. Drosophila, in contrast, are anomalous in their absence of hotspots, PRDM9, and other species-specific differences in the determination of recombination. To better understand the evolution of meiosis and general patterns of recombination across diverse taxa, we present a truly comprehensive portrait of recombination across time, combining recently published cross-based contemporary recombination estimates from each of two sister species with newly obtained linkage-disequilibrium-based historic estimates of recombination from both of these species. Using Drosophila pseudoobscura and Drosophila miranda as a model system, we compare recombination rate between species at multiple scales, and we suggest that Drosophila replicate the pattern seen in human-chimpanzee in which recombination rate is conserved at broad scales. We also find evidence of a species-wide recombination modifier(s), resulting in both a present and historic genome-wide elevation of recombination rates in D. miranda, and identify broad scale effects on recombination from the presence of an inversion. Finally, we reveal an unprecedented view of the distribution of recombination in D. pseudoobscura, illustrating patterns of linked selection and where recombination is taking place. Overall, by combining these estimation approaches, we highlight key similarities and differences in recombination between Drosophila and other organisms.
Collapse
Affiliation(s)
- Caiti S Smukowski Heil
- Biology Department, Duke University Genome Sciences Department, University of Washington
| | - Chris Ellison
- Department of Integrative Biology, University of California, Berkeley
| | | | | |
Collapse
|
20
|
Steele LD, Coates B, Valero MC, Sun W, Seong KM, Muir WM, Clark JM, Pittendrigh BR. Selective sweep analysis in the genomes of the 91-R and 91-C Drosophila melanogaster strains reveals few of the 'usual suspects' in dichlorodiphenyltrichloroethane (DDT) resistance. PLoS One 2015; 10:e0123066. [PMID: 25826265 PMCID: PMC4380341 DOI: 10.1371/journal.pone.0123066] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2014] [Accepted: 02/17/2015] [Indexed: 11/19/2022] Open
Abstract
Adaptation of insect phenotypes for survival after exposure to xenobiotics can result from selection at multiple loci with additive genetic effects. To the authors' knowledge, no selective sweep analysis has been performed to identify such loci in highly dichlorodiphenyltrichloroethane (DDT) resistant insects. Here we compared a highly DDT resistant phenotype in the Drosophila melanogaster (Drosophila) 91-R strain to the DDT susceptible 91-C strain, both of common origin. Whole genome re-sequencing data from pools of individuals was generated separately for 91-R and 91-C, and mapped to the reference Drosophila genome assembly (v. 5.72). Thirteen major and three minor effect chromosome intervals with reduced nucleotide diversity (π) were identified only in the 91-R population. Estimates of Tajima's D (D) showed corresponding evidence of directional selection in these same genome regions of 91-R, however, no similar reductions in π or D estimates were detected in 91-C. An overabundance of non-synonymous proteins coding to synonymous changes were identified in putative open reading frames associated with 91-R. Except for NinaC and Cyp4g1, none of the identified genes were the 'usual suspects' previously observed to be associated with DDT resistance. Additionally, up-regulated ATP-binding cassette transporters have been previously associated with DDT resistance; however, here we identified a structurally altered MDR49 candidate resistance gene. The remaining fourteen genes have not previously been shown to be associated with DDT resistance. These results suggest hitherto unknown mechanisms of DDT resistance, most of which have been overlooked in previous transcriptional studies, with some genes having orthologs in mammals.
Collapse
Affiliation(s)
- Laura D. Steele
- Department of Entomology, University of Illinois, Urbana-Champaign, Illinois, United States of America
- * E-mail:
| | - Brad Coates
- United States Department of Agriculture, Agricultural Research Service, Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, Iowa, United States of America
| | - M. Carmen Valero
- Department of Entomology, University of Illinois, Urbana-Champaign, Illinois, United States of America
| | - Weilin Sun
- Department of Entomology, University of Illinois, Urbana-Champaign, Illinois, United States of America
| | - Keon Mook Seong
- Department of Entomology, University of Illinois, Urbana-Champaign, Illinois, United States of America
| | - William M. Muir
- Department of Animal Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - John M. Clark
- Department of Veterinary & Animal Science, University of Massachusetts, Amherst, Massachusetts, United States of America
| | - Barry R. Pittendrigh
- Department of Entomology, University of Illinois, Urbana-Champaign, Illinois, United States of America
| |
Collapse
|
21
|
Bergman CM. A proposal for the reference-based annotation of de novo transposable element insertions. Mob Genet Elements 2014; 2:51-54. [PMID: 22754753 PMCID: PMC3383450 DOI: 10.4161/mge.19479] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Understanding the causes and consequences of transposable element (TE) activity in the genomic era requires sophisticated bioinformatics approaches to accurately identify individual insertion sites. Next-generation sequencing technology now makes it possible to rapidly identify new TE insertions using resequencing data, opening up new possibilities to study the nature of TE-induced mutation and the target site preferences of different TE families. While the identification of new TE insertion sites is seemingly a simple task, the mechanisms of transposition present unique challenges for the annotation of de novo transposable element insertions mapped to a reference genome. Here I discuss these challenges and propose a framework for the annotation of de novo TE insertions that accommodates known mechanisms of TE insertion and established coordinate systems for genome annotation.
Collapse
Affiliation(s)
- Casey M Bergman
- Faculty of Life Sciences; University of Manchester; Manchester, UK
| |
Collapse
|
22
|
Background selection as baseline for nucleotide variation across the Drosophila genome. PLoS Genet 2014; 10:e1004434. [PMID: 24968283 PMCID: PMC4072542 DOI: 10.1371/journal.pgen.1004434] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Accepted: 04/28/2014] [Indexed: 11/21/2022] Open
Abstract
The constant removal of deleterious mutations by natural selection causes a reduction in neutral diversity and efficacy of selection at genetically linked sites (a process called Background Selection, BGS). Population genetic studies, however, often ignore BGS effects when investigating demographic events or the presence of other types of selection. To obtain a more realistic evolutionary expectation that incorporates the unavoidable consequences of deleterious mutations, we generated high-resolution landscapes of variation across the Drosophila melanogaster genome under a BGS scenario independent of polymorphism data. We find that BGS plays a significant role in shaping levels of variation across the entire genome, including long introns and intergenic regions distant from annotated genes. We also find that a very large percentage of the observed variation in diversity across autosomes can be explained by BGS alone, up to 70% across individual chromosome arms at 100-kb scale, thus indicating that BGS predictions can be used as baseline to infer additional types of selection and demographic events. This approach allows detecting several outlier regions with signal of recent adaptive events and selective sweeps. The use of a BGS baseline, however, is particularly appropriate to investigate the presence of balancing selection and our study exposes numerous genomic regions with the predicted signature of higher polymorphism than expected when a BGS context is taken into account. Importantly, we show that these conclusions are robust to the mutation and selection parameters of the BGS model. Finally, analyses of protein evolution together with previous comparisons of genetic maps between Drosophila species, suggest temporally variable recombination landscapes and, thus, local BGS effects that may differ between extant and past phases. Because genome-wide BGS and temporal changes in linkage effects can skew approaches to estimate demographic and selective events, future analyses should incorporate BGS predictions and capture local recombination variation across genomes and along lineages. The removal of deleterious mutations from natural populations has potential consequences on patterns of variation across genomes. Population genetic analyses, however, often assume that such effects are negligible across recombining regions of species like Drosophila. We use simple models of purifying selection and current knowledge of recombination rates and gene distribution across the genome to obtain a baseline of variation predicted by the constant input and removal of deleterious mutations. We find that purifying selection alone can explain a major fraction of the observed variance in nucleotide diversity across the genome. The use of a baseline of variation predicted by linkage to deleterious mutations as null expectation exposes genomic regions under other selective regimes, including more regions showing the signature of balancing selection than would be evident when using traditional approaches. Our study also indicates that most, if not all, nucleotides across the D. melanogaster genome are significantly influenced by the removal of deleterious mutations, even when located in the middle of highly recombining regions and distant from genes. Additionally, the study of rates of protein evolution confirms previous analyses suggesting that the recombination landscape across the genome has changed in the recent history of D. melanogaster. All these reported factors can skew current analyses designed to capture demographic events or estimate the strength and frequency of adaptive mutations, and illustrate the need for new and more realistic theoretical and modeling approaches to study naturally occurring genetic variation.
Collapse
|
23
|
Steele LD, Muir WM, Seong KM, Valero MC, Rangesa M, Sun W, Clark JM, Coates B, Pittendrigh BR. Genome-wide sequencing and an open reading frame analysis of dichlorodiphenyltrichloroethane (DDT) susceptible (91-C) and resistant (91-R) Drosophila melanogaster laboratory populations. PLoS One 2014; 9:e98584. [PMID: 24915415 PMCID: PMC4051598 DOI: 10.1371/journal.pone.0098584] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2014] [Accepted: 05/05/2014] [Indexed: 11/30/2022] Open
Abstract
The Drosophila melanogaster 91-R and 91-C strains are of common origin, however, 91-R has been intensely selected for dichlorodiphenyltrichloroethane (DDT) resistance over six decades while 91-C has been maintained as the non-selected control strain. These fly strains represent a unique genetic resource to understand the accumulation and fixation of mutations under laboratory conditions over decades of pesticide selection. Considerable research has been done to investigate the differential expression of genes associated with the highly DDT resistant strain 91-R, however, with the advent of whole genome sequencing we can now begin to develop an in depth understanding of the genomic changes associated with this intense decades-long xenobiotic selection pressure. Here we present the first whole genome sequencing analysis of the 91-R and 91-C fly strains to identify genome-wide structural changes within the open reading frames. Between-strain changes in allele frequencies revealed a higher percent of new alleles going to fixation for the 91-R strain, as compared to 91-C (P<0.0001). These results suggest that resistance to DDT in the 91-R laboratory strain could potentially be due primarily to new mutations, as well as being polygenic rather than the result of a few major mutations, two hypotheses that remain to be tested.
Collapse
Affiliation(s)
- Laura D. Steele
- Department of Entomology, University of Illinois, Urbana-Champaign, Urbana, Illinois, United States of America
- * E-mail:
| | - William M. Muir
- Department of Animal Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - Keon Mook Seong
- Department of Entomology, University of Illinois, Urbana-Champaign, Urbana, Illinois, United States of America
| | - M. Carmen Valero
- Department of Entomology, University of Illinois, Urbana-Champaign, Urbana, Illinois, United States of America
| | - Madhumitha Rangesa
- Department of Entomology, University of Illinois, Urbana-Champaign, Urbana, Illinois, United States of America
| | - Weilin Sun
- Department of Entomology, University of Illinois, Urbana-Champaign, Urbana, Illinois, United States of America
| | - John M. Clark
- Veterinary and Animal Sciences, University of Massachusetts, Amherst, Massachusetts, United States of America
| | - Brad Coates
- United States Department of Agriculture, Agricultural Research Service, Corn Insects & Crop Genetics Research Unit, Iowa State University, Ames, Iowa, United States of America
| | - Barry R. Pittendrigh
- Department of Entomology, University of Illinois, Urbana-Champaign, Urbana, Illinois, United States of America
| |
Collapse
|
24
|
Veselkina ER, Rybina OY, Symonenko AV, Alatortsev VE, Roshchina NV, Pasyukova EG. Molecular variability in geographically distant populations of Drosophila melanogaster at the Lim3 gene regulating nervous system development. RUSS J GENET+ 2014. [DOI: 10.1134/s1022795414050111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
25
|
Sequencing, assembling, and correcting draft genomes using recombinant populations. G3-GENES GENOMES GENETICS 2014; 4:669-79. [PMID: 24531727 PMCID: PMC4059239 DOI: 10.1534/g3.114.010264] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Current de novo whole-genome sequencing approaches often are inadequate for organisms lacking substantial preexisting genetic data. Problems with these methods are manifest as: large numbers of scaffolds that are not ordered within chromosomes or assigned to individual chromosomes, misassembly of allelic sequences as separate loci when the individual(s) being sequenced are heterozygous, and the collapse of recently duplicated sequences into a single locus, regardless of levels of heterozygosity. Here we propose a new approach for producing de novo whole-genome sequences—which we call recombinant population genome construction—that solves many of the problems encountered in standard genome assembly and that can be applied in model and nonmodel organisms. Our approach takes advantage of next-generation sequencing technologies to simultaneously barcode and sequence a large number of individuals from a recombinant population. The sequences of all recombinants can be combined to create an initial de novo assembly, followed by the use of individual recombinant genotypes to correct assembly splitting/collapsing and to order and orient scaffolds within linkage groups. Recombinant population genome construction can rapidly accelerate the transformation of nonmodel species into genome-enabled systems by simultaneously producing a high-quality genome assembly and providing genomic tools (e.g., high-confidence single-nucleotide polymorphisms) for immediate applications. In populations segregating for important functional traits, this approach also enables simultaneous mapping of quantitative trait loci. We demonstrate our method using simulated Illumina data from a recombinant population of Caenorhabditis elegans and show that the method can produce a high-fidelity, high-quality genome assembly for both parents of the cross.
Collapse
|
26
|
Abstract
Drosophila melanogaster, an ancestrally African species, has recently spread throughout the world, associated with human activity. The species has served as the focus of many studies investigating local adaptation relating to latitudinal variation in non-African populations, especially those from the United States and Australia. These studies have documented the existence of shared, genetically determined phenotypic clines for several life history and morphological traits. However, there are no studies designed to formally address the degree of shared latitudinal differentiation at the genomic level. Here we present our comparative analysis of such differentiation. Not surprisingly, we find evidence of substantial, shared selection responses on the two continents, probably resulting from selection on standing ancestral variation. The polymorphic inversion In(3R)P has an important effect on this pattern, but considerable parallelism is also observed across the genome in regions not associated with inversion polymorphism. Interestingly, parallel latitudinal differentiation is observed even for variants that are not particularly strongly differentiated, which suggests that very large numbers of polymorphisms are targets of spatially varying selection in this species.
Collapse
|
27
|
Ruan J, Jiang L, Chong Z, Gong Q, Li H, Li C, Tao Y, Zheng C, Zhai W, Turissini D, Cannon CH, Lu X, Wu CI. Pseudo-Sanger sequencing: massively parallel production of long and near error-free reads using NGS technology. BMC Genomics 2013; 14:711. [PMID: 24134808 PMCID: PMC4046676 DOI: 10.1186/1471-2164-14-711] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2013] [Accepted: 10/07/2013] [Indexed: 01/15/2023] Open
Abstract
Background Usually, next generation sequencing (NGS) technology has the property of ultra-high throughput but the read length is remarkably short compared to conventional Sanger sequencing. Paired-end NGS could computationally extend the read length but with a lot of practical inconvenience because of the inherent gaps. Now that Illumina paired-end sequencing has the ability of read both ends from 600 bp or even 800 bp DNA fragments, how to fill in the gaps between paired ends to produce accurate long reads is intriguing but challenging. Results We have developed a new technology, referred to as pseudo-Sanger (PS) sequencing. It tries to fill in the gaps between paired ends and could generate near error-free sequences equivalent to the conventional Sanger reads in length but with the high throughput of the Next Generation Sequencing. The major novelty of PS method lies on that the gap filling is based on local assembly of paired-end reads which have overlaps with at either end. Thus, we are able to fill in the gaps in repetitive genomic region correctly. The PS sequencing starts with short reads from NGS platforms, using a series of paired-end libraries of stepwise decreasing insert sizes. A computational method is introduced to transform these special paired-end reads into long and near error-free PS sequences, which correspond in length to those with the largest insert sizes. The PS construction has 3 advantages over untransformed reads: gap filling, error correction and heterozygote tolerance. Among the many applications of the PS construction is de novo genome assembly, which we tested in this study. Assembly of PS reads from a non-isogenic strain of Drosophila melanogaster yields an N50 contig of 190 kb, a 5 fold improvement over the existing de novo assembly methods and a 3 fold advantage over the assembly of long reads from 454 sequencing. Conclusions Our method generated near error-free long reads from NGS paired-end sequencing. We demonstrated that de novo assembly could benefit a lot from these Sanger-like reads. Besides, the characteristic of the long reads could be applied to such applications as structural variations detection and metagenomics. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-14-711) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Xuemei Lu
- Laboratory of Disease Genomics and Individualized Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, People's Republic of China.
| | | |
Collapse
|
28
|
Campo D, Lehmann K, Fjeldsted C, Souaiaia T, Kao J, Nuzhdin SV. Whole-genome sequencing of two North American Drosophila melanogaster populations reveals genetic differentiation and positive selection. Mol Ecol 2013; 22:5084-97. [PMID: 24102956 DOI: 10.1111/mec.12468] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Revised: 07/15/2013] [Accepted: 07/16/2013] [Indexed: 11/29/2022]
Abstract
The prevailing demographic model for Drosophila melanogaster suggests that the colonization of North America occurred very recently from a subset of European flies that rapidly expanded across the continent. This model implies a sudden population growth and range expansion consistent with very low or no population subdivision. As flies adapt to new environments, local adaptation events may be expected. To describe demographic and selective events during North American colonization, we have generated a data set of 35 individual whole-genome sequences from inbred lines of D. melanogaster from a west coast US population (Winters, California, USA) and compared them with a public genome data set from Raleigh (Raleigh, North Carolina, USA). We analysed nuclear and mitochondrial genomes and described levels of variation and divergence within and between these two North American D. melanogaster populations. Both populations exhibit negative values of Tajima's D across the genome, a common signature of demographic expansion. We also detected a low but significant level of genome-wide differentiation between the two populations, as well as multiple allele surfing events, which can be the result of gene drift in local subpopulations on the edge of an expansion wave. In contrast to this genome-wide pattern, we uncovered a 50-kilobase segment in chromosome arm 3L that showed all the hallmarks of a soft selective sweep in both populations. A comparison of allele frequencies within this divergent region among six populations from three continents allowed us to cluster these populations in two differentiated groups, providing evidence for the action of natural selection on a global scale.
Collapse
Affiliation(s)
- D Campo
- Molecular and Computational Biology, University of Southern California, Los Angeles, CA, 90089, USA
| | | | | | | | | | | |
Collapse
|
29
|
Cridland JM, Macdonald SJ, Long AD, Thornton KR. Abundance and distribution of transposable elements in two Drosophila QTL mapping resources. Mol Biol Evol 2013; 30:2311-27. [PMID: 23883524 PMCID: PMC3773372 DOI: 10.1093/molbev/mst129] [Citation(s) in RCA: 85] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Here we present computational machinery to efficiently and accurately identify transposable element (TE) insertions in 146 next-generation sequenced inbred strains of Drosophila melanogaster. The panel of lines we use in our study is composed of strains from a pair of genetic mapping resources: the Drosophila Genetic Reference Panel (DGRP) and the Drosophila Synthetic Population Resource (DSPR). We identified 23,087 TE insertions in these lines, of which 83.3% are found in only one line. There are marked differences in the distribution of elements over the genome, with TEs found at higher densities on the X chromosome, and in regions of low recombination. We also identified many more TEs per base pair of intronic sequence and fewer TEs per base pair of exonic sequence than expected if TEs are located at random locations in the euchromatic genome. There was substantial variation in TE load across genes. For example, the paralogs derailed and derailed-2 show a significant difference in the number of TE insertions, potentially reflecting differences in the selection acting on these loci. When considering TE families, we find a very weak effect of gene family size on TE insertions per gene, indicating that as gene family size increases the number of TE insertions in a given gene within that family also increases. TEs are known to be associated with certain phenotypes, and our data will allow investigators using the DGRP and DSPR to assess the functional role of TE insertions in complex trait variation more generally. Notably, because most TEs are very rare and often private to a single line, causative TEs resulting in phenotypic differences among individuals may typically fail to replicate across mapping panels since individual elements are unlikely to segregate in both panels. Our data suggest that “burden tests” that test for the effect of TEs as a class may be more fruitful.
Collapse
Affiliation(s)
- Julie M Cridland
- Department of Ecology, Evolution and Physiology, University of California, Irvine
| | | | | | | |
Collapse
|
30
|
Fine-scale heterogeneity in crossover rate in the garnet-scalloped region of the Drosophila melanogaster X chromosome. Genetics 2013; 194:375-87. [PMID: 23410829 DOI: 10.1534/genetics.112.146746] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Homologous recombination affects myriad aspects of genome evolution, from standing levels of nucleotide diversity to the efficacy of natural selection. Rates of crossing over show marked variability at all scales surveyed, including species-, population-, and individual-level differences. Even within genomes, crossovers are nonrandomly distributed in a wide diversity of taxa. Although intra- and intergenomic heterogeneities in crossover distribution have been documented in Drosophila, the scale and degree of crossover rate heterogeneity remain unclear. In addition, the genetic features mediating this heterogeneity are unknown. Here we quantify fine-scale heterogeneity in crossover distribution in a 2.1-Mb region of the Drosophila melanogaster X chromosome by localizing crossover breakpoints in 2500 individuals, each containing a single crossover in this specific X chromosome region. We show 90-fold variation in rates of crossing over at a 5-kb scale, place this variation in the context of several aspects of genome evolution, and identify several genetic features associated with crossover rates. Our results shed new light on the scale and magnitude of crossover rate heterogeneity in D. melanogaster and highlight potential features mediating this heterogeneity.
Collapse
|
31
|
Cardoso-Moreira M, Arguello JR, Clark AG. Mutation spectrum of Drosophila CNVs revealed by breakpoint sequencing. Genome Biol 2012; 13:R119. [PMID: 23259534 PMCID: PMC4056370 DOI: 10.1186/gb-2012-13-12-r119] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2012] [Accepted: 12/22/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The detailed study of breakpoints associated with copy number variants (CNVs) can elucidate the mutational mechanisms that generate them and the comparison of breakpoints across species can highlight differences in genomic architecture that may lead to lineage-specific differences in patterns of CNVs. Here, we provide a detailed analysis of Drosophila CNV breakpoints and contrast it with similar analyses recently carried out for the human genome. RESULTS By applying split-read methods to a total of 10x coverage of 454 shotgun sequence across nine lines of D. melanogaster and by re-examining a previously published dataset of CNVs detected using tiling arrays, we identified the precise breakpoints of more than 600 insertions, deletions, and duplications. Contrasting these CNVs with those found in humans showed that in both taxa CNV breakpoints fall into three classes: blunt breakpoints; simple breakpoints associated with microhomology; and breakpoints with additional nucleotides inserted/deleted and no microhomology. In both taxa CNV breakpoints are enriched with non-B DNA sequence structures, which may impair DNA replication and/or repair. However, in contrast to human genomes, non-allelic homologous-recombination (NAHR) plays a negligible role in CNV formation in Drosophila. In flies, non-homologous repair mechanisms are responsible for simple, recurrent, and complex CNVs, including insertions of de novo sequence as large as 60 bp. CONCLUSIONS Humans and Drosophila differ considerably in the importance of homology-based mechanisms for the formation of CNVs, likely as a consequence of the differences in the abundance and distribution of both segmental duplications and transposable elements between the two genomes.
Collapse
|
32
|
Pool JE, Corbett-Detig RB, Sugino RP, Stevens KA, Cardeno CM, Crepeau MW, Duchen P, Emerson JJ, Saelao P, Begun DJ, Langley CH. Population Genomics of sub-saharan Drosophila melanogaster: African diversity and non-African admixture. PLoS Genet 2012; 8:e1003080. [PMID: 23284287 PMCID: PMC3527209 DOI: 10.1371/journal.pgen.1003080] [Citation(s) in RCA: 229] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Accepted: 09/27/2012] [Indexed: 11/25/2022] Open
Abstract
Drosophila melanogaster has played a pivotal role in the development of modern population genetics. However, many basic questions regarding the demographic and adaptive history of this species remain unresolved. We report the genome sequencing of 139 wild-derived strains of D. melanogaster, representing 22 population samples from the sub-Saharan ancestral range of this species, along with one European population. Most genomes were sequenced above 25X depth from haploid embryos. Results indicated a pervasive influence of non-African admixture in many African populations, motivating the development and application of a novel admixture detection method. Admixture proportions varied among populations, with greater admixture in urban locations. Admixture levels also varied across the genome, with localized peaks and valleys suggestive of a non-neutral introgression process. Genomes from the same location differed starkly in ancestry, suggesting that isolation mechanisms may exist within African populations. After removing putatively admixed genomic segments, the greatest genetic diversity was observed in southern Africa (e.g. Zambia), while diversity in other populations was largely consistent with a geographic expansion from this potentially ancestral region. The European population showed different levels of diversity reduction on each chromosome arm, and some African populations displayed chromosome arm-specific diversity reductions. Inversions in the European sample were associated with strong elevations in diversity across chromosome arms. Genomic scans were conducted to identify loci that may represent targets of positive selection within an African population, between African populations, and between European and African populations. A disproportionate number of candidate selective sweep regions were located near genes with varied roles in gene regulation. Outliers for Europe-Africa F(ST) were found to be enriched in genomic regions of locally elevated cosmopolitan admixture, possibly reflecting a role for some of these loci in driving the introgression of non-African alleles into African populations.
Collapse
Affiliation(s)
- John E Pool
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Langley CH, Stevens K, Cardeno C, Lee YCG, Schrider DR, Pool JE, Langley SA, Suarez C, Corbett-Detig RB, Kolaczkowski B, Fang S, Nista PM, Holloway AK, Kern AD, Dewey CN, Song YS, Hahn MW, Begun DJ. Genomic variation in natural populations of Drosophila melanogaster. Genetics 2012; 192:533-98. [PMID: 22673804 PMCID: PMC3454882 DOI: 10.1534/genetics.112.142018] [Citation(s) in RCA: 243] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2011] [Accepted: 05/24/2012] [Indexed: 02/07/2023] Open
Abstract
This report of independent genome sequences of two natural populations of Drosophila melanogaster (37 from North America and 6 from Africa) provides unique insight into forces shaping genomic polymorphism and divergence. Evidence of interactions between natural selection and genetic linkage is abundant not only in centromere- and telomere-proximal regions, but also throughout the euchromatic arms. Linkage disequilibrium, which decays within 1 kbp, exhibits a strong bias toward coupling of the more frequent alleles and provides a high-resolution map of recombination rate. The juxtaposition of population genetics statistics in small genomic windows with gene structures and chromatin states yields a rich, high-resolution annotation, including the following: (1) 5'- and 3'-UTRs are enriched for regions of reduced polymorphism relative to lineage-specific divergence; (2) exons overlap with windows of excess relative polymorphism; (3) epigenetic marks associated with active transcription initiation sites overlap with regions of reduced relative polymorphism and relatively reduced estimates of the rate of recombination; (4) the rate of adaptive nonsynonymous fixation increases with the rate of crossing over per base pair; and (5) both duplications and deletions are enriched near origins of replication and their density correlates negatively with the rate of crossing over. Available demographic models of X and autosome descent cannot account for the increased divergence on the X and loss of diversity associated with the out-of-Africa migration. Comparison of the variation among these genomes to variation among genomes from D. simulans suggests that many targets of directional selection are shared between these species.
Collapse
Affiliation(s)
- Charles H Langley
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
The role of background selection in shaping patterns of molecular evolution and variation: evidence from variability on the Drosophila X chromosome. Genetics 2012; 191:233-46. [PMID: 22377629 DOI: 10.1534/genetics.111.138073] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In the putatively ancestral population of Drosophila melanogaster, the ratio of silent DNA sequence diversity for X-linked loci to that for autosomal loci is approximately one, instead of the expected "null" value of 3/4. One possible explanation is that background selection (the hitchhiking effect of deleterious mutations) is more effective on the autosomes than on the X chromosome, because of the lack of crossing over in male Drosophila. The expected effects of background selection on neutral variability at sites in the middle of an X chromosome or an autosomal arm were calculated for different models of chromosome organization and methods of approximation, using current estimates of the deleterious mutation rate and distributions of the fitness effects of deleterious mutations. The robustness of the results to different distributions of fitness effects, dominance coefficients, mutation rates, mapping functions, and chromosome size was investigated. The predicted ratio of X-linked to autosomal variability is relatively insensitive to these variables, except for the mutation rate and map length. Provided that the deleterious mutation rate per genome is sufficiently large, it seems likely that background selection can account for the observed X to autosome ratio of variability in the ancestral population of D. melanogaster. The fact that this ratio is much less than one in D. pseudoobscura is also consistent with the model's predictions, since this species has a high rate of crossing over. The results suggest that background selection may play a major role in shaping patterns of molecular evolution and variation.
Collapse
|
35
|
KHADEM M, MUNTÉ A, CAMACHO R, AGUADÉ M, SEGARRA C. Multilocus analysis of nucleotide variation in Drosophila madeirensis, an endemic species of the Laurisilva forest in Madeira. J Evol Biol 2012; 25:726-39. [DOI: 10.1111/j.1420-9101.2012.02467.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
36
|
Linheiro RS, Bergman CM. Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster. PLoS One 2012; 7:e30008. [PMID: 22347367 PMCID: PMC3276498 DOI: 10.1371/journal.pone.0030008] [Citation(s) in RCA: 99] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2011] [Accepted: 12/11/2011] [Indexed: 12/20/2022] Open
Abstract
Transposable elements are mobile DNA sequences that integrate into host genomes using diverse mechanisms with varying degrees of target site specificity. While the target site preferences of some engineered transposable elements are well studied, the natural target preferences of most transposable elements are poorly characterized. Using population genomic resequencing data from 166 strains of Drosophila melanogaster, we identified over 8,000 new insertion sites not present in the reference genome sequence that we used to decode the natural target preferences of 22 families of transposable element in this species. We found that terminal inverted repeat transposon and long terminal repeat retrotransposon families present clade-specific target site duplications and target site sequence motifs. Additionally, we found that the sequence motifs at transposable element target sites are always palindromes that extend beyond the target site duplication. Our results demonstrate the utility of population genomics data for high-throughput inference of transposable element targeting preferences in the wild and establish general rules for terminal inverted repeat transposon and long terminal repeat retrotransposon target site selection in eukaryotic genomes.
Collapse
Affiliation(s)
- Raquel S. Linheiro
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Casey M. Bergman
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| |
Collapse
|
37
|
Abstract
A major challenge of biology is understanding the relationship between molecular genetic variation and variation in quantitative traits, including fitness. This relationship determines our ability to predict phenotypes from genotypes and to understand how evolutionary forces shape variation within and between species. Previous efforts to dissect the genotype-phenotype map were based on incomplete genotypic information. Here, we describe the Drosophila melanogaster Genetic Reference Panel (DGRP), a community resource for analysis of population genomics and quantitative traits. The DGRP consists of fully sequenced inbred lines derived from a natural population. Population genomic analyses reveal reduced polymorphism in centromeric autosomal regions and the X chromosome, evidence for positive and negative selection, and rapid evolution of the X chromosome. Many variants in novel genes, most at low frequency, are associated with quantitative traits and explain a large fraction of the phenotypic variance. The DGRP facilitates genotype-phenotype mapping using the power of Drosophila genetics.
Collapse
|
38
|
Kofler R, Betancourt AJ, Schlötterer C. Sequencing of pooled DNA samples (Pool-Seq) uncovers complex dynamics of transposable element insertions in Drosophila melanogaster. PLoS Genet 2012; 8:e1002487. [PMID: 22291611 PMCID: PMC3266889 DOI: 10.1371/journal.pgen.1002487] [Citation(s) in RCA: 144] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2011] [Accepted: 12/01/2011] [Indexed: 12/16/2022] Open
Abstract
Transposable elements (TEs) are mobile genetic elements that parasitize genomes by semi-autonomously increasing their own copy number within the host genome. While TEs are important for genome evolution, appropriate methods for performing unbiased genome-wide surveys of TE variation in natural populations have been lacking. Here, we describe a novel and cost-effective approach for estimating population frequencies of TE insertions using paired-end Illumina reads from a pooled population sample. Importantly, the method treats insertions present in and absent from the reference genome identically, allowing unbiased TE population frequency estimates. We apply this method to data from a natural Drosophila melanogaster population from Portugal. Consistent with previous reports, we show that low recombining genomic regions harbor more TE insertions and maintain insertions at higher frequencies than do high recombining regions. We conservatively estimate that there are almost twice as many “novel” TE insertion sites as sites known from the reference sequence in our population sample (6,824 novel versus 3,639 reference sites, with on average a 31-fold coverage per insertion site). Different families of transposable elements show large differences in their insertion densities and population frequencies. Our analyses suggest that the history of TE activity significantly contributes to this pattern, with recently active families segregating at lower frequencies than those active in the more distant past. Finally, using our high-resolution TE abundance measurements, we identified 13 candidate positively selected TE insertions based on their high population frequencies and on low Tajima's D values in their neighborhoods. Transposable elements (TE's) are parasitic genetic elements that spread by replicating themselves within a host genome. Most organisms are burdened with transposable elements; in fact, up to 80% of some genomes can consist of TE–derived DNA. Here, we use new sequencing technology to examine variation in genomic TE composition within a population at a finer scale and in a more unbiased fashion than has been possible before. We study a Portuguese population of D. melanogaster and find a large number of TE insertions, most of which occur in few individuals. Our analysis confirms that TE insertions are subject to purifying selection that counteracts their spread, and it suggests that the genome records waves of past TE invasions, with recently active elements occurring at low population frequency. We also find indications that TE insertions may sometimes have beneficial effects.
Collapse
Affiliation(s)
- Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | | | | |
Collapse
|
39
|
Carneiro MO, Taubes CH, Hartl DL. Model transcriptional networks with continuously varying expression levels. BMC Evol Biol 2011; 11:363. [PMID: 22182343 PMCID: PMC3270072 DOI: 10.1186/1471-2148-11-363] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2011] [Accepted: 12/19/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND At a time when genomes are being sequenced by the hundreds, much attention has shifted from identifying genes and phenotypes to understanding the networks of interactions among genes. We developed a gene network developmental model expanding on previous models of transcription regulatory networks. In our model, each network is described by a matrix representing the interactions between transcription factors, and a vector of continuous values representing the transcription factor expression in an individual. RESULTS In this work we used the gene network model to look at the impact of mating as well as insertions and deletions of genes in the evolution of complexity of these networks. We found that the natural process of diploid mating increases the likelihood of maintaining complexity, especially in higher order networks (more than 10 genes). We also show that gene insertion is a very efficient way to add more genes to a network as it provides a much higher chance of developmental stability. CONCLUSIONS The continuous model affords a more complete view of the evolution of interacting genes. The notion of a continuous output vector also incorporates the reality of gene networks and graded concentrations of gene products.
Collapse
Affiliation(s)
- Mauricio O Carneiro
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA, 02138
| | - Clifford H Taubes
- Department of Mathematics, Harvard University, Cambridge, MA, USA, 02138
| | - Daniel L Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA, 02138
| |
Collapse
|
40
|
Weber CC, Pink CJ, Hurst LD. Late-replicating domains have higher divergence and diversity in Drosophila melanogaster. Mol Biol Evol 2011; 29:873-82. [PMID: 22046001 DOI: 10.1093/molbev/msr265] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Several reports from mammals indicate that an increase in the mutation rate in late-replicating regions may, in part, be responsible for the observed genomic heterogeneity in neutral substitution rates and levels of diversity, although the mechanisms for this remain poorly understood. Recent evidence also suggests that late replication is associated with high mutability in yeast. This then raises the question as to whether a similar effect is operating across all eukaryotes. Limited evidence from one chromosome arm in Drosophila melanogaster suggests the opposite pattern, with regions overlapping early-firing origins showing increased levels of diversity and divergence. Given the availability of genome-wide replication timing profiles for D. melanogaster, we now return to this issue. Consistent with what is seen in other taxa, we find that divergence at synonymous sites in exon cores, as well as divergence at putatively unconstrained intronic sites, is elevated in late-replicating regions. Analysis of genes with low codon usage bias suggests a ∼30% difference in mutation rate between the earliest and the latest replicating sequence. Intronic sequence suggests a more modest difference. We additionally show that an increase in diversity in late-replicating sequences is not owing to replication timing covarying with the local recombination rate. If anything, the effects of recombination mask the impact of replication timing. We conclude that, contrary to prior reports and consistent with what is seen in mammals and yeast, there is indeed a relationship between rates of nucleotide divergence and diversity and replication timing that is consistent with an increase in the mutation rate during late S-phase in D. melanogaster. It is therefore plausible that such an effect might be common among eukaryotes. The result may have implications for the inference of positive selection.
Collapse
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | | | | |
Collapse
|
41
|
Esteve-Codina A, Kofler R, Himmelbauer H, Ferretti L, Vivancos AP, Groenen MAM, Folch JM, Rodríguez MC, Pérez-Enciso M. Partial short-read sequencing of a highly inbred Iberian pig and genomics inference thereof. Heredity (Edinb) 2011; 107:256-64. [PMID: 21407255 PMCID: PMC3183945 DOI: 10.1038/hdy.2011.13] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2010] [Revised: 01/20/2011] [Accepted: 01/27/2011] [Indexed: 11/08/2022] Open
Abstract
Despite dramatic reduction in sequencing costs with the advent of next generation sequencing technologies, obtaining a complete mammalian genome sequence at sufficient depth is still costly. An alternative is partial sequencing. Here, we have sequenced a reduced representation library of an Iberian sow from the Guadyerbas strain, a highly inbred strain that has been used in numerous QTL studies because of its extreme phenotypic characteristics. Using the Illumina Genome Analyzer II (San Diego, CA, USA), we resequenced ∼ 1% of the genome with average 4 × depth, identifying 68,778 polymorphisms. Of these, 55,457 were putative fixed differences with respect to the assembly, based on the genome of a Duroc pig, and 13,321 were heterozygous positions within Guadyerbas. Despite being highly inbred, the estimate of heterozygosity within Guadyerbas was ∼ 0.78 kb(-1) in autosomes, after correcting for low depth. Nucleotide variability was consistently higher at the telomeric regions than on the rest of the chromosome, likely a result of increased recombination rates. Further, variability was 50% lower in the X-chromosome than in autosomes, which may be explained by a recent bottleneck or by selection. We divided the whole genome in 500 kb windows and we analyzed overrepresented gene ontology terms in regions of low and high variability. Multi organism process, pigmentation and cell killing were overrepresented in high variability regions and metabolic process ontology, within low variability regions. Further, a genome wide Hudson-Kreitman-Aguadé test was carried out per window; overall, variability was in agreement with neutral expectations.
Collapse
Affiliation(s)
- A Esteve-Codina
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Universitat Autònoma de Barcelona, Bellaterra, Spain
| | - R Kofler
- Centre for Genomic Regulation (CRG), Universitat Pompeu Fabra, Barcelona, Spain
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - H Himmelbauer
- Centre for Genomic Regulation (CRG), Universitat Pompeu Fabra, Barcelona, Spain
| | - L Ferretti
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Department of Animal Science, Centre for Research in Agrigenomics (CRAG), Bellaterra, Spain
| | - A P Vivancos
- Centre for Genomic Regulation (CRG), Universitat Pompeu Fabra, Barcelona, Spain
| | - M A M Groenen
- Animal Breeding and Genomics Centre, Wageningen University and Research Centre, Wageningen, The Netherlands
| | - J M Folch
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Universitat Autònoma de Barcelona, Bellaterra, Spain
| | - M C Rodríguez
- Departamento de Mejora Genética Animal, Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Madrid, Spain
| | - M Pérez-Enciso
- Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Universitat Autònoma de Barcelona, Bellaterra, Spain
- Institut Català de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| |
Collapse
|
42
|
Abstract
SummaryPopulation genomics is the study of the amount and causes of genome-wide variability in natural populations, a topic that has been under discussion since Darwin. This paper first briefly reviews the early development of molecular approaches to the subject: the pioneering unbiased surveys of genetic variability at multiple loci by means of gel electrophoresis and restriction enzyme mapping. The results of surveys of levels of genome-wide variability using DNA resequencing studies are then discussed. Studies of the extent to which variability for different classes of variants (non-synonymous, synonymous and non-coding) are affected by natural selection, or other directional forces such as biased gene conversion, are also described. Finally, the effects of deleterious mutations on population fitness and the possible role of Hill–Robertson interference in shaping patterns of sequence variability are discussed.
Collapse
|
43
|
PoPoolation: a toolbox for population genetic analysis of next generation sequencing data from pooled individuals. PLoS One 2011; 6:e15925. [PMID: 21253599 PMCID: PMC3017084 DOI: 10.1371/journal.pone.0015925] [Citation(s) in RCA: 395] [Impact Index Per Article: 30.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 11/30/2010] [Indexed: 11/19/2022] Open
Abstract
Recent statistical analyses suggest that sequencing of pooled samples provides a cost effective approach to determine genome-wide population genetic parameters. Here we introduce PoPoolation, a toolbox specifically designed for the population genetic analysis of sequence data from pooled individuals. PoPoolation calculates estimates of θWatterson, θπ, and Tajima's D that account for the bias introduced by pooling and sequencing errors, as well as divergence between species. Results of genome-wide analyses can be graphically displayed in a sliding window plot. PoPoolation is written in Perl and R and it builds on commonly used data formats. Its source code can be downloaded from http://code.google.com/p/popoolation/. Furthermore, we evaluate the influence of mapping algorithms, sequencing errors, and read coverage on the accuracy of population genetic parameter estimates from pooled data.
Collapse
|
44
|
Kolaczkowski B, Kern AD, Holloway AK, Begun DJ. Genomic differentiation between temperate and tropical Australian populations of Drosophila melanogaster. Genetics 2011; 187:245-60. [PMID: 21059887 PMCID: PMC3018305 DOI: 10.1534/genetics.110.123059] [Citation(s) in RCA: 163] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2010] [Accepted: 11/03/2010] [Indexed: 11/18/2022] Open
Abstract
Determining the genetic basis of environmental adaptation is a central problem of evolutionary biology. This issue has been fruitfully addressed by examining genetic differentiation between populations that are recently separated and/or experience high rates of gene flow. A good example of this approach is the decades-long investigation of selection acting along latitudinal clines in Drosophila melanogaster. Here we use next-generation genome sequencing to reexamine the well-studied Australian D. melanogaster cline. We find evidence for extensive differentiation between temperate and tropical populations, with regulatory regions and unannotated regions showing particularly high levels of differentiation. Although the physical genomic scale of geographic differentiation is small--on the order of gene sized--we observed several larger highly differentiated regions. The region spanned by the cosmopolitan inversion polymorphism In(3R)P shows higher levels of differentiation, consistent with the major difference in allele frequencies of Standard and In(3R)P karyotypes in temperate vs. tropical Australian populations. Our analysis reveals evidence for spatially varying selection on a number of key biological processes, suggesting fundamental biological differences between flies from these two geographic regions.
Collapse
Affiliation(s)
- Bryan Kolaczkowski
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire 03755 and Department of Evolution and Ecology, University of California, Davis, California 95616
| | - Andrew D. Kern
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire 03755 and Department of Evolution and Ecology, University of California, Davis, California 95616
| | - Alisha K. Holloway
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire 03755 and Department of Evolution and Ecology, University of California, Davis, California 95616
| | - David J. Begun
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire 03755 and Department of Evolution and Ecology, University of California, Davis, California 95616
| |
Collapse
|
45
|
Neafsey DE, Barker BM, Sharpton TJ, Stajich JE, Park DJ, Whiston E, Hung CY, McMahan C, White J, Sykes S, Heiman D, Young S, Zeng Q, Abouelleil A, Aftuck L, Bessette D, Brown A, FitzGerald M, Lui A, Macdonald JP, Priest M, Orbach MJ, Galgiani JN, Kirkland TN, Cole GT, Birren BW, Henn MR, Taylor JW, Rounsley SD. Population genomic sequencing of Coccidioides fungi reveals recent hybridization and transposon control. Genome Res 2010; 20:938-46. [PMID: 20516208 PMCID: PMC2892095 DOI: 10.1101/gr.103911.109] [Citation(s) in RCA: 129] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2009] [Accepted: 04/28/2010] [Indexed: 11/24/2022]
Abstract
We have sequenced the genomes of 18 isolates of the closely related human pathogenic fungi Coccidioides immitis and Coccidioides posadasii to more clearly elucidate population genomic structure, bringing the total number of sequenced genomes for each species to 10. Our data confirm earlier microsatellite-based findings that these species are genetically differentiated, but our population genomics approach reveals that hybridization and genetic introgression have recently occurred between the two species. The directionality of introgression is primarily from C. posadasii to C. immitis, and we find more than 800 genes exhibiting strong evidence of introgression in one or more sequenced isolates. We performed PCR-based sequencing of one region exhibiting introgression in 40 C. immitis isolates to confirm and better define the extent of gene flow between the species. We find more coding sequence than expected by chance in the introgressed regions, suggesting that natural selection may play a role in the observed genetic exchange. We find notable heterogeneity in repetitive sequence composition among the sequenced genomes and present the first detailed genome-wide profile of a repeat-induced point mutation (RIP) process distinctly different from what has been observed in Neurospora. We identify promiscuous HLA-I and HLA-II epitopes in both proteomes and discuss the possible implications of introgression and population genomic data for public health and vaccine candidate prioritization. This study highlights the importance of population genomic data for detecting subtle but potentially important phenomena such as introgression.
Collapse
|
46
|
Lu J, Clark AG. Population dynamics of PIWI-interacting RNAs (piRNAs) and their targets in Drosophila. Genome Res 2009; 20:212-27. [PMID: 19948818 DOI: 10.1101/gr.095406.109] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Transposable elements (TEs) are mobile DNA sequences that make up a large fraction of eukaryotic genomes. Recently it was discovered that PIWI-interacting RNAs (piRNAs), a class of small RNA molecules that are mainly generated from transposable elements, are crucial repressors of active TEs in the germline of fruit flies. By quantifying expression levels of 32 TE families in piRNA pathway mutants relative to wild-type fruit flies, we provide evidence that piRNAs can severely silence the activities of retrotransposons. We incorporate piRNAs into a population genetic framework for retrotransposons and perform forward simulations to model the population dynamics of piRNA loci and their targets. Using parameters optimized for Drosophila melanogaster, our simulation results indicate that (1) piRNAs can significantly reduce the fitness cost of retrotransposons; (2) retrotransposons that generate piRNAs (piRTs) are selectively more advantageous, and such retrotransposon insertions more easily attain high frequency or fixation; (3) retrotransposons that are repressed by piRNAs (targetRTs), however, also have an elevated probability of reaching high frequency or fixation in the population because their deleterious effects are attenuated. By surveying the polymorphisms of piRT and targetRT insertions across nine strains of D. melanogaster, we verified these theoretical predictions with population genomic data. Our theoretical and empirical analysis suggests that piRNAs can significantly increase the fitness of individuals that bear them; however, piRNAs may provide a shelter or Trojan horse for retrotransposons, allowing them to increase in frequency in a population by shielding the host from the deleterious consequences of retrotransposition.
Collapse
Affiliation(s)
- Jian Lu
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | | |
Collapse
|