1
|
Kim BY, Gellert HR, Church SH, Suvorov A, Anderson SS, Barmina O, Beskid SG, Comeault AA, Crown KN, Diamond SE, Dorus S, Fujichika T, Hemker JA, Hrcek J, Kankare M, Katoh T, Magnacca KN, Martin RA, Matsunaga T, Medeiros MJ, Miller DE, Pitnick S, Schiffer M, Simoni S, Steenwinkel TE, Syed ZA, Takahashi A, Wei KHC, Yokoyama T, Eisen MB, Kopp A, Matute D, Obbard DJ, O’Grady PM, Price DK, Toda MJ, Werner T, Petrov DA. Single-fly genome assemblies fill major phylogenomic gaps across the Drosophilidae Tree of Life. PLoS Biol 2024; 22:e3002697. [PMID: 39024225 PMCID: PMC11257246 DOI: 10.1371/journal.pbio.3002697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 06/03/2024] [Indexed: 07/20/2024] Open
Abstract
Long-read sequencing is driving rapid progress in genome assembly across all major groups of life, including species of the family Drosophilidae, a longtime model system for genetics, genomics, and evolution. We previously developed a cost-effective hybrid Oxford Nanopore (ONT) long-read and Illumina short-read sequencing approach and used it to assemble 101 drosophilid genomes from laboratory cultures, greatly increasing the number of genome assemblies for this taxonomic group. The next major challenge is to address the laboratory culture bias in taxon sampling by sequencing genomes of species that cannot easily be reared in the lab. Here, we build upon our previous methods to perform amplification-free ONT sequencing of single wild flies obtained either directly from the field or from ethanol-preserved specimens in museum collections, greatly improving the representation of lesser studied drosophilid taxa in whole-genome data. Using Illumina Novaseq X Plus and ONT P2 sequencers with R10.4.1 chemistry, we set a new benchmark for inexpensive hybrid genome assembly at US $150 per genome while assembling genomes from as little as 35 ng of genomic DNA from a single fly. We present 183 new genome assemblies for 179 species as a resource for drosophilid systematics, phylogenetics, and comparative genomics. Of these genomes, 62 are from pooled lab strains and 121 from single adult flies. Despite the sample limitations of working with small insects, most single-fly diploid assemblies are comparable in contiguity (>1 Mb contig N50), completeness (>98% complete dipteran BUSCOs), and accuracy (>QV40 genome-wide with ONT R10.4.1) to assemblies from inbred lines. We present a well-resolved multi-locus phylogeny for 360 drosophilid and 4 outgroup species encompassing all publicly available (as of August 2023) genomes for this group. Finally, we present a Progressive Cactus whole-genome, reference-free alignment built from a subset of 298 suitably high-quality drosophilid genomes. The new assemblies and alignment, along with updated laboratory protocols and computational pipelines, are released as an open resource and as a tool for studying evolution at the scale of an entire insect family.
Collapse
Affiliation(s)
- Bernard Y. Kim
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Hannah R. Gellert
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Samuel H. Church
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut United States of America
| | - Anton Suvorov
- Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Sean S. Anderson
- Department of Biology, University of North Carolina Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Olga Barmina
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
| | - Sofia G. Beskid
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Aaron A. Comeault
- School of Environmental and Natural Sciences, Bangor University, Bangor, United Kingdom
| | - K. Nicole Crown
- Department of Biology, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Sarah E. Diamond
- Department of Biology, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Steve Dorus
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, New York, United States of America
| | - Takako Fujichika
- Department of Biological Sciences, Tokyo Metropolitan University, Tokyo, Japan
| | - James A. Hemker
- Department of Developmental Biology, Stanford University, Stanford, California, United States of America
| | - Jan Hrcek
- Institute of Entomology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic
| | - Maaria Kankare
- Department of Biological and Environmental Science, University of Jyväskylä, Jyväskylä, Finland
| | - Toru Katoh
- Department of Biological Sciences, Hokkaido University, Sapporo, Japan
| | - Karl N. Magnacca
- Hawaii Invertebrate Program, Division of Forestry & Wildlife, Honolulu, Hawaii, United States of America
| | - Ryan A. Martin
- Department of Biology, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Teruyuki Matsunaga
- Department of Complexity Science and Engineering, The University of Tokyo, Tokyo, Japan
| | - Matthew J. Medeiros
- Pacific Biosciences Research Center, University of Hawaiʻi, Mānoa, Hawaii, United States of America
| | - Danny E. Miller
- Division of Genetic Medicine, Department of Pediatrics; Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
| | - Scott Pitnick
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, New York, United States of America
| | - Michele Schiffer
- Daintree Rainforest Observatory, James Cook University, Townsville, Australia
| | - Sara Simoni
- Department of Biology, Stanford University, Stanford, California, United States of America
| | | | - Zeeshan A. Syed
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, New York, United States of America
| | - Aya Takahashi
- Department of Biological Sciences, Tokyo Metropolitan University, Tokyo, Japan
| | - Kevin H-C. Wei
- Department of Zoology, The University of British Columbia, Vancouver, Canada
| | - Tsuya Yokoyama
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Michael B. Eisen
- Department of Cell and Molecular Biology, University of California Berkeley, Berkeley, California, United States of America
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, California, United States of America
| | - Artyom Kopp
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
| | - Daniel Matute
- Department of Biology, University of North Carolina Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Darren J. Obbard
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh, United Kingdom
| | - Patrick M. O’Grady
- Department of Entomology, Cornell University, Ithaca, New York, United States of America
| | - Donald K. Price
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, Nevada, United States of America
| | | | - Thomas Werner
- Department of Biological Sciences, Michigan Technological University, Houghton, Michigan, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
- CZ Biohub, Investigator, San Francisco, California, United States of America
| |
Collapse
|
2
|
Im JH, Lazzaro BP. Population genetic analysis of autophagy and phagocytosis genes in Drosophila melanogaster and D. simulans. PLoS One 2018; 13:e0205024. [PMID: 30281656 PMCID: PMC6169979 DOI: 10.1371/journal.pone.0205024] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Accepted: 09/18/2018] [Indexed: 12/03/2022] Open
Abstract
Autophagy and phagocytosis are cellular immune mechanisms for internalization and elimination of intracellular and extracellular pathogens. Some pathogens have evolved the ability to inhibit or manipulate these processes, raising the prospect of adaptive reciprocal co-evolution by the host. We performed population genetic analyses on phagocytosis and autophagy genes in Drosophila melanogaster and D. simulans to test for molecular evolutionary signatures of immune adaptation. We found that phagocytosis and autophagy genes as a whole exhibited an elevated level of haplotype homozygosity in both species. In addition, we detected signatures of recent selection, notably in the Atg14 and Ykt6 genes in D. melanogaster and a pattern of elevated sequence divergence in the genderblind (gb) gene on the D. simulans lineage. These results suggest that the evolution of the host cellular immune system as a whole may be shaped by a dynamic conflict between Drosophila and its pathogens even without pervasive evidence of strong adaptive evolution at the individual gene level.
Collapse
Affiliation(s)
- Joo Hyun Im
- Cornell Institute of Host-Microbe Interactions and Disease, Cornell University, Ithaca, NY, United States of America.,Graduate Field of Genetics, Genomics, and Development, Cornell University, Ithaca, NY, United States of America.,Department of Entomology, Cornell University, Ithaca, NY, United States of America
| | - Brian P Lazzaro
- Cornell Institute of Host-Microbe Interactions and Disease, Cornell University, Ithaca, NY, United States of America.,Graduate Field of Genetics, Genomics, and Development, Cornell University, Ithaca, NY, United States of America.,Department of Entomology, Cornell University, Ithaca, NY, United States of America
| |
Collapse
|
4
|
Singh ND, Larracuente AM, Sackton TB, Clark AG. Comparative Genomics on the Drosophila Phylogenetic Tree. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2009. [DOI: 10.1146/annurev.ecolsys.110308.120214] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
With the sequencing of 12 complete euchromatic Drosophila genomes, the genus Drosophila is a leading model for comparative genomics. In this review, we discuss the novel insights into evolutionary processes afforded by the newly available genomic sequences when placed in the context of the phylogeny. We focus on three levels: insights into whole-genome content, such as changes in genome size and content across the phylogeny; insights into large-scale patterns of divergence and conservation, such as selective constraints on genes and chromosome-level evolution of sex chromosomes; and insights into finer-scale processes in individual lineages and genes, such as lineage-specific evolution in response to ecological context. As the field of comparative genomics is still young, we also discuss current challenges, such as the development of more sophisticated evolutionary models to capture nonequilibrium processes and the improvement of assembly and alignment algorithms to better capture uncertainty in the data.
Collapse
Affiliation(s)
- Nadia D. Singh
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853
| | - Amanda M. Larracuente
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853
| | - Timothy B. Sackton
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138
| | - Andrew G. Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853
| |
Collapse
|
5
|
Obbard DJ, Welch JJ, Kim KW, Jiggins FM. Quantifying adaptive evolution in the Drosophila immune system. PLoS Genet 2009; 5:e1000698. [PMID: 19851448 PMCID: PMC2759075 DOI: 10.1371/journal.pgen.1000698] [Citation(s) in RCA: 203] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2009] [Accepted: 09/23/2009] [Indexed: 11/28/2022] Open
Abstract
It is estimated that a large proportion of amino acid substitutions in Drosophila have been fixed by natural selection, and as organisms are faced with an ever-changing array of pathogens and parasites to which they must adapt, we have investigated the role of parasite-mediated selection as a likely cause. To quantify the effect, and to identify which genes and pathways are most likely to be involved in the host–parasite arms race, we have re-sequenced population samples of 136 immunity and 287 position-matched non-immunity genes in two species of Drosophila. Using these data, and a new extension of the McDonald-Kreitman approach, we estimate that natural selection fixes advantageous amino acid changes in immunity genes at nearly double the rate of other genes. We find the rate of adaptive evolution in immunity genes is also more variable than other genes, with a small subset of immune genes evolving under intense selection. These genes, which are likely to represent hotspots of host–parasite coevolution, tend to share similar functions or belong to the same pathways, such as the antiviral RNAi pathway and the IMD signalling pathway. These patterns appear to be general features of immune system evolution in both species, as rates of adaptive evolution are correlated between the D. melanogaster and D. simulans lineages. In summary, our data provide quantitative estimates of the elevated rate of adaptive evolution in immune system genes relative to the rest of the genome, and they suggest that adaptation to parasites is an important force driving molecular evolution. All organisms are attacked by an ever-changing array of pathogens and parasites, and it is widely supposed that the ensuing host–parasite “arms race” must drive extensive adaptive evolution in genes of the immune system. Here we have taken advantage of new sequencing technologies and analytical approaches to quantify the amount of adaptation that is occurring in immunity genes relative to the rest of the genome. We sampled two species of fruit fly (D. melanogaster and D. simulans) from eight different populations around the world, and sequenced 136 immunity and 287 non-immunity genes from these samples. Based on the differences in the sequences between the two species, and the genetic diversity within each species, we have estimated that natural selection drives twice as much change in immune-related proteins as in proteins with no immune function. Interestingly, the rate of adaptation is also more variable among immunity genes than among other genes in the genome, with a small subset of immunity genes evolving under intense natural selection. We suggest that these genes may represent hotspots of host–parasite coevolution within the genome.
Collapse
Affiliation(s)
- Darren J Obbard
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK.
| | | | | | | |
Collapse
|
6
|
Lehmann T, Hume JCC, Licht M, Burns CS, Wollenberg K, Simard F, Ribeiro JMC. Molecular evolution of immune genes in the malaria mosquito Anopheles gambiae. PLoS One 2009; 4:e4549. [PMID: 19234606 PMCID: PMC2642720 DOI: 10.1371/journal.pone.0004549] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2008] [Accepted: 12/31/2008] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND As pathogens that circumvent the host immune response are favoured by selection, so are host alleles that reduce parasite load. Such evolutionary processes leave their signature on the genes involved. Deciphering modes of selection operating on immune genes might reveal the nature of host-pathogen interactions and factors that govern susceptibility in host populations. Such understanding would have important public health implications. METHODOLOGY/FINDINGS We analyzed polymorphisms in four mosquito immune genes (SP14D1, GNBP, defensin, and gambicin) to decipher selection effects, presumably mediated by pathogens. Using samples of Anopheles arabiensis, An. quadriannulatus and four An. gambiae populations, as well as published sequences from other Culicidae, we contrasted patterns of polymorphisms between different functional units of the same gene within and between populations. Our results revealed selection signatures operating on different time scales. At the most recent time scale, within-population diversity revealed purifying selection. Between populations and between species variation revealed reduced differentiation (GNBP and gambicin) at coding vs. noncoding- regions, consistent with balancing selection. McDonald-Kreitman tests between An. quadriannulatus and both sibling species revealed higher fixation rate of synonymous than nonsynonymous substitutions (GNBP) in accordance with frequency dependent balancing selection. At the longest time scale (>100 my), PAML analysis using distant Culicid taxa revealed positive selection at one codon in gambicin. Patterns of genetic variation were independent of exposure to human pathogens. SIGNIFICANCE AND CONCLUSIONS Purifying selection is the most common form of selection operating on immune genes as it was detected on a contemporary time scale on all genes. Selection for "hypervariability" was not detected, but negative balancing selection, detected at a recent evolutionary time scale between sibling species may be rather common. Detection of positive selection at the deepest evolutionary time scale suggests that it occurs infrequently, possibly in association with speciation events. Our results provided no evidence to support the hypothesis that selection was mediated by pathogens that are transmitted to humans.
Collapse
Affiliation(s)
- Tovi Lehmann
- Division of Parasitic Diseases, Entomology Branch, Centers for Disease Control & Prevention, Chamblee, Georgia, United States of America.
| | | | | | | | | | | | | |
Collapse
|