1
|
Abstract
Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome data sets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate data sets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in >20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This data set, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental metadata. A web-based genome browser and web portal provide easy access to the SNP data set. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan data set. Our resource will enable population geneticists to analyze spatiotemporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
Collapse
|
2
|
The discovery, distribution, and diversity of DNA viruses associated with Drosophila melanogaster in Europe. Virus Evol 2021; 7:veab031. [PMID: 34408913 PMCID: PMC8363768 DOI: 10.1093/ve/veab031] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Drosophila melanogaster is an important model for antiviral immunity in arthropods, but very few DNA viruses have been described from the family Drosophilidae. This deficiency limits our opportunity to use natural host-pathogen combinations in experimental studies, and may bias our understanding of the Drosophila virome. Here, we report fourteen DNA viruses detected in a metagenomic analysis of 6668 pool-sequenced Drosophila, sampled from forty-seven European locations between 2014 and 2016. These include three new nudiviruses, a new and divergent entomopoxvirus, a virus related to Leptopilina boulardi filamentous virus, and a virus related to Musca domestica salivary gland hypertrophy virus. We also find an endogenous genomic copy of galbut virus, a double-stranded RNA partitivirus, segregating at very low frequency. Remarkably, we find that Drosophila Vesanto virus, a small DNA virus previously described as a bidnavirus, may be composed of up to twelve segments and thus represent a new lineage of segmented DNA viruses. Two of the DNA viruses, Drosophila Kallithea nudivirus and Drosophila Vesanto virus are relatively common, found in 2 per cent or more of wild flies. The others are rare, with many likely to be represented by a single infected fly. We find that virus prevalence in Europe reflects the prevalence seen in publicly available datasets, with Drosophila Kallithea nudivirus and Drosophila Vesanto virus the only ones commonly detectable in public data from wild-caught flies and large population cages, and the other viruses being rare or absent. These analyses suggest that DNA viruses are at lower prevalence than RNA viruses in D.melanogaster, and may be less likely to persist in laboratory cultures. Our findings go some way to redressing an earlier bias toward RNA virus studies in Drosophila, and lay the foundation needed to harness the power of Drosophila as a model system for the study of DNA viruses.
Collapse
|
3
|
Genomic Analysis of European Drosophila melanogaster Populations Reveals Longitudinal Structure, Continent-Wide Selection, and Previously Unknown DNA Viruses. Mol Biol Evol 2020; 37:2661-2678. [PMID: 32413142 PMCID: PMC7475034 DOI: 10.1093/molbev/msaa120] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Genetic variation is the fuel of evolution, with standing genetic variation especially important for short-term evolution and local adaptation. To date, studies of spatiotemporal patterns of genetic variation in natural populations have been challenging, as comprehensive sampling is logistically difficult, and sequencing of entire populations costly. Here, we address these issues using a collaborative approach, sequencing 48 pooled population samples from 32 locations, and perform the first continent-wide genomic analysis of genetic variation in European Drosophila melanogaster. Our analyses uncover longitudinal population structure, provide evidence for continent-wide selective sweeps, identify candidate genes for local climate adaptation, and document clines in chromosomal inversion and transposable element frequencies. We also characterize variation among populations in the composition of the fly microbiome, and identify five new DNA viruses in our samples.
Collapse
|
4
|
The molecular genealogy of sequential overlapping inversions implies both homologous chromosomes of a heterokaryotype in an inversion origin. Sci Rep 2019; 9:17009. [PMID: 31740730 PMCID: PMC6861252 DOI: 10.1038/s41598-019-53582-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 10/14/2019] [Indexed: 11/25/2022] Open
Abstract
Cytological and molecular studies have revealed that inversion chromosomal polymorphism is widespread across taxa and that inversions are among the most common structural changes fixed between species. Two major mechanisms have been proposed for the origin of inversions considering that breaks occur at either repetitive or non-homologous sequences. While inversions originating through the first mechanism might have a multiple origin, those originating through the latter mechanism would have a unique origin. Variation at regions flanking inversion breakpoints can be informative on the origin and history of inversions given the reduced recombination in heterokaryotypes. Here, we have analyzed nucleotide variation at a fragment flanking the most centromere-proximal shared breakpoint of several sequential overlapping inversions of the E chromosome of Drosophila subobscura —inversions E1, E2, E9 and E3. The molecular genealogy inferred from variation at this shared fragment does not exhibit the branching pattern expected according to the sequential origin of inversions. The detected discordance between the molecular and cytological genealogies has led us to consider a novel possibility for the origin of an inversion, and more specifically that one of these inversions originated on a heterokaryotype for chromosomal arrangements. Based on this premise, we propose three new models for inversions origin.
Collapse
|
5
|
The High-Quality Genome Sequence of the Oceanic Island Endemic Species Drosophila guanche Reveals Signals of Adaptive Evolution in Genes Related to Flight and Genome Stability. Genome Biol Evol 2018; 10:1956-1969. [PMID: 29947749 PMCID: PMC6101566 DOI: 10.1093/gbe/evy135] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/26/2018] [Indexed: 12/18/2022] Open
Abstract
Drosophila guanche is a member of the obscura group that originated in the Canary Islands archipelago upon its colonization by D. subobscura. It evolved into a new species in the laurisilva, a laurel forest present in wet regions that in the islands have only minor long-term weather fluctuations. Oceanic island endemic species such as D. guanche can become model species to investigate not only the relative role of drift and adaptation in speciation processes but also how population size affects nucleotide variation. Moreover, the previous identification of two satellite DNAs in D. guanche makes this species attractive for studying how centromeric DNA evolves. As a prerequisite for its establishment as a model species suitable to address all these questions, we generated a high-quality D. guanche genome sequence composed of 42 cytologically mapped scaffolds, which are assembled into six super-scaffolds (one per chromosome). The comparative analysis of the D. guanche proteome with that of twelve other Drosophila species identified 151 genes that were subject to adaptive evolution in the D. guanche lineage, with a subset of them being involved in flight and genome stability. For example, the Centromere Identifier (CID) protein, directly interacting with centromeric satellite DNA, shows signals of adaptation in this species. Both genomic analyses and FISH of the two satellites would support an ongoing replacement of centromeric satellite DNA in D. guanche.
Collapse
|
6
|
Inversion evolutionary rates might limit the experimental identification of inversion breakpoints in non-model species. Sci Rep 2017; 7:17281. [PMID: 29222501 PMCID: PMC5722822 DOI: 10.1038/s41598-017-17650-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2017] [Accepted: 11/29/2017] [Indexed: 11/09/2022] Open
Abstract
Chromosomal inversions are structural changes that alter gene order but generally not gene content in the affected region. In Drosophila, extensive cytological studies revealed the widespread character of inversion polymorphism, with evidence for its adaptive character. In Drosophila subobscura, polymorphism affects both its four large autosomal elements and its X (A) chromosome. The characterization of eight of these autosomal inversions breakpoints revealed that most of them originated through the staggered-breaks mechanism. Here, we have performed chromosomal walks to identify the breakpoints of two X-chromosome widely distributed inversions -A2 and A1- of D. subobscura. Inversion A2 is considered a warm-adapted arrangement that exhibits parallel latitudinal clines in the species ancestral distribution area and in both American subcontinents, whereas inversion A1 is only present in the Palearctic region where it presents an east-west cline. The duplication detected at the A2 inversion breakpoints is consistent with its origin by the staggered-breaks mechanism. Inversion A1 breakpoints could not be molecularly identified even though they could be narrowly delimited. This result points to chromosome walking limitations when using as a guide the genome of other species. Limitations stem from the rate of evolution by paracentric inversions, which in Drosophila is highest for the X chromosome.
Collapse
|
7
|
Dense gene physical maps of the non-model species Drosophila subobscura. Chromosome Res 2017; 25:145-154. [PMID: 28078516 DOI: 10.1007/s10577-016-9549-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2016] [Revised: 12/20/2016] [Accepted: 12/22/2016] [Indexed: 11/29/2022]
Abstract
The comparative analysis of genetic and physical maps as well as of whole genome sequences had revealed that in the Drosophila genus, most structural rearrangements occurred within chromosomal elements as a result of paracentric inversions. Genome sequence comparison would seem the best method to estimate rates of chromosomal evolution, but the high-quality reference genomes required for this endeavor are still scanty. Here, we have obtained dense physical maps for Muller elements A, C, and E of Drosophila subobscura, a species with an extensively studied rich and adaptive chromosomal polymorphism. These maps are based on 462 markers: 115, 236, and 111 markers for elements A, C, and E, respectively. The availability of these dense maps will facilitate genome assembly and will thus greatly contribute to obtaining a good reference genome, which is a required step for D. subobscura to attain the model species status. The comparative analysis of these physical maps and those obtained from the D. pseudoobscura and D. melanogaster genomes allowed us to infer the number of fixed inversions and chromosomal evolutionary rates for each pairwise comparison. For all three elements, rates inferred from the more closely related species were higher than those inferred from the more distantly related species, which together with results of relative-rate tests point to an acceleration in the D. subobscura lineage at least for elements A and E.
Collapse
|
8
|
The origin of chromosomal inversions as a source of segmental duplications in the Sophophora subgenus of Drosophila. Sci Rep 2016; 6:30715. [PMID: 27470196 PMCID: PMC4965816 DOI: 10.1038/srep30715] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Accepted: 07/05/2016] [Indexed: 01/06/2023] Open
Abstract
Chromosomal inversions can contribute to the adaptation of organisms to their environment by capturing particular advantageous allelic combinations of a set of genes included in the inverted fragment and also by advantageous functional changes due to the inversion process itself that might affect not only the expression of flanking genes but also their dose and structure. Of the two mechanisms originating inversions —ectopic recombination, and staggered double-strand breaks and subsequent repair— only the latter confers the inversion the potential to have dosage effects and/or to generate advantageous chimeric genes. In Drosophila subobscura, there is ample evidence for the adaptive character of its chromosomal polymorphism, with an important contribution of some warm-climate arrangements such as E1+2+9+12. Here, we have characterized the breakpoints of inversion E12 and established that it originated through the staggered-break mechanism like four of the five inversions of D. subobscura previously studied. This mechanism that also predominates in the D. melanogaster lineage might be prevalent in the Sophophora subgenus and contribute to the adaptive character of the polymorphic and fixed inversions of its species. Finally, we have shown that the D. subobscura inversion breakpoint regions have generally been disrupted by additional structural changes occurred at different time scales.
Collapse
|
9
|
Evidence for a gene involved in multiple and diverse rearrangements in the Drosophila genus. Mol Biol Evol 2014; 31:2998-3001. [PMID: 25135946 DOI: 10.1093/molbev/msu239] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
In Drosophila, chromosomes have been extensively reorganized during evolution, with most rearrangements affecting the gene order in chromosomal elements but not their gene content. The level of reorganization and the evidence for breakpoint reuse vary both between and within elements. The subito gene stands out as a gene involved in multiple rearrangements both because of its active single-gene transposition and because it is the nearest gene to diverse rearrangements breakpoints. Indeed, subito has undergone three single-gene transpositions and it is the nearest gene to the breakpoints of other single-gene transpositions and of two chromosomal inversions. Given that subito is involved in meiosis and therefore active in the female germ line, the high number of nearby fixed breakages might be related among others to the presumed high accessibility of the subito region to the machinery associated with double-strand breaks repair. A second important contributor would be the reduced and simple regulatory region of subito, which would imply that a fraction of the rearrangements originating from subito nearby breakages would have not affected either its pattern or timing of expression and would have, thus, not resulted in reduced fitness.
Collapse
|
10
|
Characterization of the breakpoints of a polymorphic inversion complex detects strict and broad breakpoint reuse at the molecular level. Mol Biol Evol 2014; 31:2331-41. [PMID: 24881049 DOI: 10.1093/molbev/msu177] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Inversions are an integral part of structural variation within species, and they play a leading role in genome reorganization across species. Work at both the cytological and genome sequence levels has revealed heterogeneity in the distribution of inversion breakpoints, with some regions being recurrently used. Breakpoint reuse at the molecular level has mostly been assessed for fixed inversions through genome sequence comparison, and therefore rather broadly. Here, we have identified and sequenced the breakpoints of two polymorphic inversions-E1 and E2 that share a breakpoint-in the extant Est and E1 + 2 chromosomal arrangements of Drosophila subobscura. The breakpoints are two medium-sized repeated motifs that mediated the inversions by two different mechanisms: E1 via staggered breaks and subsequent repair and E2 via repeat-mediated ectopic recombination. The fine delimitation of the shared breakpoint revealed its strict reuse at the molecular level regardless of which was the intermediate arrangement. The occurrence of other rearrangements in the most proximal and distal extended breakpoint regions reveals the broad reuse of these regions. This differential degree of fragility might be related to their sharing the presence outside the inverted region of snoRNA-encoding genes.
Collapse
|
11
|
Polymorphism at genes involved in salt tolerance in Arabidopsis thaliana (Brassicaceae). AMERICAN JOURNAL OF BOTANY 2013; 100:384-390. [PMID: 23345415 DOI: 10.3732/ajb.1200332] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
PREMISE OF THE STUDY Genes involved in relevant functions for environmental adaptation can be considered primary candidates for their variation having been shaped by natural selection. Detecting recent selective events through their footprint on nucleotide variation constitutes a challenging task in species with a complex demographic history such as Arabidopsis thaliana. We have surveyed nucleotide variation in this species at nine genes involved in salt tolerance. The available genomewide information for this species has allowed us to contrast the levels and patterns of variation detected at the candidate genes with empirical distributions obtained from noncandidate regions. METHODS We sequenced nine genes involved in salt tolerance (~32 kb) in 20 ecotypes of A. thaliana and analyzed polymorphism and divergence at the individual gene and multilocus levels. KEY RESULTS Variation at the nine genes studied was characterized by a generalized skew toward polymorphisms with low-frequency variants. Except for genes RCD1 and NHX8, this pattern was similar to that generally detected in the A. thaliana genome and could thus be primarily explained by the species demographic history. The more extreme deviation at the NHX8 gene and its excess of polymorphism relative to divergence points to the recent action of selection on this gene. CONCLUSIONS The analysis of nucleotide polymorphism and divergence at nine genes involved in salt tolerance provided little evidence for the recent action of positive selection. Only the signals detected at NHX8 from both polymorphism and divergence were suggestive of the putative contribution of this gene to local adaptation.
Collapse
|
12
|
Multilocus analysis of variation using a large empirical data set: phenylpropanoid pathway genes in Arabidopsis thaliana. Mol Ecol 2008; 17:1211-23. [PMID: 18221273 DOI: 10.1111/j.1365-294x.2007.03633.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Detecting the signature of adaptation on nucleotide variation is often difficult in species that like Arabidopsis thaliana might have a complex demographic history. Recent re-sequencing surveys in this species provided genome-wide information that would mainly reflect its demographic history. We have used a large empirical data set (LED) as well as multilocus coalescent simulations to analyse sequence variation at loci involved in the phenylpropanoid pathway of this species. We surveyed and examined DNA sequence variation at nine of these loci (about 19.7 kb) in 23 accessions of A. thaliana and one accession of its closely related species Arabidopsis lyrata. Nucleotide variation was lower at nonsynonymous sites than at silent sites in all loci, indicating generalized functional constraint at the protein level. No association between variation and position in the metabolic pathway was detected. When the data were contrasted against the standard neutral model, significant deviations for silent variation were detected with Tajima's D, Fu's F(S) and Fay and Wu's H multilocus test statistics. These deviations were in the same direction than in previous large-scale multilocus analyses, suggesting a genome-wide effect. When the nine-locus data set was contrasted against the large empirical data set, the level (Watterson's theta) and pattern of variation (Tajima's D) detected in these loci did not deviate either at the single-locus or multilocus level from the corresponding empirical distributions. These results would support an important role of the demographic history of A. thaliana in shaping nucleotide variation at the nine studied phenylpropanoid loci. The potential and limitations of the empirical distribution approach are discussed.
Collapse
|
13
|
Characterization of the satellite DNA Msat-160 from the species Chionomys nivalis (Rodentia, Arvicolinae). Genetica 2006; 130:43-51. [PMID: 16897458 DOI: 10.1007/s10709-006-0018-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2006] [Accepted: 05/09/2006] [Indexed: 11/28/2022]
Abstract
The satellite DNA Msat-160 has been previously characterized in several species of the genus Microtus. Here we present the characterization of Msat-160 from Chionomys nivalis, a species with a very primitive karyotype. As in other Microtus species analyzed, C. nivalis Msat-160 is AT rich, has a monomer length of 160 bp, is undermethylated and is mainly located in all the pericentromeric heterochromatin of all autosomes and the X chromosome, but is completely absent from the Y chromosome. Hence, our results support the hypothesis that Msat-160 was initially distributed in the pericentromeric heterochromatin of all autosomes and the X chromosome. The taxonomic status of the genus Chionomys in relation to the genus Microtus is a very interesting issue, so we constructed phylogenetic dendrograms using Msat-160 sequences from several Microtus species. Although the results were not informative about this issue, the presence of Msat-160 in C. nivalis and Microtus species suggested that both genera are closely related and that this satellite DNA was present in the common ancestor. Studies of Msat-160 in different arvicoline species could help to determine the origin of this satellite and, perhaps, to establish the phylogenetic relationships of some arvicoline groups.
Collapse
|
14
|
Distribution of L1-retroposons on the giant sex chromosomes of Microtus cabrerae (Arvicolidae, Rodentia): functional and evolutionary implications. Chromosome Res 2006; 14:177-86. [PMID: 16544191 DOI: 10.1007/s10577-006-1034-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2005] [Accepted: 01/06/2006] [Indexed: 11/28/2022]
Abstract
Long interspersed nuclear elements (L1 or LINE-1) are the most abundant and active retroposons in the mammalian genome. Traditionally, the bulk of L1 sequences have been explained by the 'selfish DNA' hypothesis; however, recently it has been also argued that L1s could play an important role in genome and gene organizations. The non-random chromosomal distribution of these retroelements is a striking feature considered to reflect this functionality. In the present study we have cloned and analyzed three different L1 fragments from the genome of the rodent Microtus cabrerae. In addition, we have examined the chromosomal distribution of this L1 in several species of Microtus, a very interesting group owing to the presence in some species of enlarged ('giant') sex chromosomes. Interestingly, in all species analyzed, L1-retroposons have preferentially accumulated on both the giant- and the normal-sized sex chromosomes compared with the autosomes. Also we have demonstrated that L1-retroposons are not similarly distributed among the heterochromatic blocks of the giant sex chromosomes in M. cabrerae and M. agrestis, which suggest that L1 retroposition and amplification over the sex heterochromatin have been different and independent processes in each species. Finally, we proposed that the main factors responsible for the L1 distribution on the mammalian sex chromosomes are the heterochromatic nature of the Y chromosome and the possible role of L1 sequences during the X-inactivation process.
Collapse
|