2
|
Michell C, Wutke S, Aranda M, Nyman T. Genomes of the willow-galling sawflies Euura lappo and Eupontania aestiva (Hymenoptera: Tenthredinidae): a resource for research on ecological speciation, adaptation, and gall induction. G3 (BETHESDA, MD.) 2021; 11:jkab094. [PMID: 33788947 PMCID: PMC8104934 DOI: 10.1093/g3journal/jkab094] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Accepted: 03/09/2021] [Indexed: 12/14/2022]
Abstract
Hymenoptera is a hyperdiverse insect order represented by over 153,000 different species. As many hymenopteran species perform various crucial roles for our environments, such as pollination, herbivory, and parasitism, they are of high economic and ecological importance. There are 99 hymenopteran genomes in the NCBI database, yet only five are representative of the paraphyletic suborder Symphyta (sawflies, woodwasps, and horntails), while the rest represent the suborder Apocrita (bees, wasps, and ants). Here, using a combination of 10X Genomics linked-read sequencing, Oxford Nanopore long-read technology, and Illumina short-read data, we assembled the genomes of two willow-galling sawflies (Hymenoptera: Tenthredinidae: Nematinae: Euurina): the bud-galling species Euura lappo and the leaf-galling species Eupontania aestiva. The final assembly for E. lappo is 259.85 Mbp in size, with a contig N50 of 209.0 kbp and a BUSCO score of 93.5%. The E. aestiva genome is 222.23 Mbp in size, with a contig N50 of 49.7 kbp and a 90.2% complete BUSCO score. De novo annotation of repetitive elements showed that 27.45% of the genome was composed of repetitive elements in E. lappo and 16.89% in E. aestiva, which is a marked increase compared to previously published hymenopteran genomes. The genomes presented here provide a resource for inferring phylogenetic relationships among basal hymenopterans, comparative studies on host-related genomic adaptation in plant-feeding insects, and research on the mechanisms of plant manipulation by gall-inducing insects.
Collapse
Affiliation(s)
- Craig Michell
- Department of Environmental and Biological Sciences, University of Eastern Finland, Joensuu, 80100, Finland
| | - Saskia Wutke
- Department of Environmental and Biological Sciences, University of Eastern Finland, Joensuu, 80100, Finland
| | - Manuel Aranda
- Biological and Environmental Sciences & Engineering Division, Red Sea Research Center, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
| | - Tommi Nyman
- Department of Ecosystems in the Barents Region, Norwegian Institute of Bioeconomy Research, Svanvik, 9925, Norway
| |
Collapse
|
3
|
Guillén Y, Rius N, Delprat A, Williford A, Muyas F, Puig M, Casillas S, Ràmia M, Egea R, Negre B, Mir G, Camps J, Moncunill V, Ruiz-Ruano FJ, Cabrero J, de Lima LG, Dias GB, Ruiz JC, Kapusta A, Garcia-Mas J, Gut M, Gut IG, Torrents D, Camacho JP, Kuhn GCS, Feschotte C, Clark AG, Betrán E, Barbadilla A, Ruiz A. Genomics of ecological adaptation in cactophilic Drosophila. Genome Biol Evol 2014; 7:349-66. [PMID: 25552534 PMCID: PMC4316639 DOI: 10.1093/gbe/evu291] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Cactophilic Drosophila species provide a valuable model to study gene–environment interactions and ecological adaptation. Drosophila buzzatii and Drosophila mojavensis are two cactophilic species that belong to the repleta group, but have very different geographical distributions and primary host plants. To investigate the genomic basis of ecological adaptation, we sequenced the genome and developmental transcriptome of D. buzzatii and compared its gene content with that of D. mojavensis and two other noncactophilic Drosophila species in the same subgenus. The newly sequenced D. buzzatii genome (161.5 Mb) comprises 826 scaffolds (>3 kb) and contains 13,657 annotated protein-coding genes. Using RNA sequencing data of five life-stages we found expression of 15,026 genes, 80% protein-coding genes, and 20% noncoding RNA genes. In total, we detected 1,294 genes putatively under positive selection. Interestingly, among genes under positive selection in the D. mojavensis lineage, there is an excess of genes involved in metabolism of heterocyclic compounds that are abundant in Stenocereus cacti and toxic to nonresident Drosophila species. We found 117 orphan genes in the shared D. buzzatii–D. mojavensis lineage. In addition, gene duplication analysis identified lineage-specific expanded families with functional annotations associated with proteolysis, zinc ion binding, chitin binding, sensory perception, ethanol tolerance, immunity, physiology, and reproduction. In summary, we identified genetic signatures of adaptation in the shared D. buzzatii–D. mojavensis lineage, and in the two separate D. buzzatii and D. mojavensis lineages. Many of the novel lineage-specific genomic features are promising candidates for explaining the adaptation of these species to their distinct ecological niches.
Collapse
Affiliation(s)
- Yolanda Guillén
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain
| | - Núria Rius
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain
| | - Alejandra Delprat
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain
| | | | - Francesc Muyas
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain
| | - Marta Puig
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain
| | - Sònia Casillas
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Spain
| | - Miquel Ràmia
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Spain
| | - Raquel Egea
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Spain
| | - Barbara Negre
- EMBL/CRG Research Unit in Systems Biology, Centre for Genomic Regulation (CRG), Barcelona, Spain Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Gisela Mir
- IRTA, Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Barcelona, Spain The Peter MacCallum Cancer Centre, East Melbourne, Victoria, Australia
| | - Jordi Camps
- Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Torre I, Barcelona, Spain
| | - Valentí Moncunill
- Barcelona Supercomputing Center (BSC), Edifici TG (Torre Girona), Barcelona, Spain and Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | | | - Josefa Cabrero
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Spain
| | - Leonardo G de Lima
- Instituto de Ciências Biológicas, Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Guilherme B Dias
- Instituto de Ciências Biológicas, Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Jeronimo C Ruiz
- Informática de Biossistemas, Centro de Pesquisas René Rachou-Fiocruz Minas, Belo Horizonte, MG, Brazil
| | - Aurélie Kapusta
- Department of Human Genetics, University of Utah School of Medicine
| | - Jordi Garcia-Mas
- IRTA, Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Barcelona, Spain
| | - Marta Gut
- Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Torre I, Barcelona, Spain
| | - Ivo G Gut
- Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Torre I, Barcelona, Spain
| | - David Torrents
- Barcelona Supercomputing Center (BSC), Edifici TG (Torre Girona), Barcelona, Spain and Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | - Juan P Camacho
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Spain
| | - Gustavo C S Kuhn
- Instituto de Ciências Biológicas, Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University
| | - Esther Betrán
- Department of Biology, University of Texas at Arlington
| | - Antonio Barbadilla
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, Spain
| | - Alfredo Ruiz
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Spain
| |
Collapse
|
5
|
Misawa K. A codon substitution model that incorporates the effect of the GC contents, the gene density and the density of CpG islands of human chromosomes. BMC Genomics 2011; 12:397. [PMID: 21819607 PMCID: PMC3169530 DOI: 10.1186/1471-2164-12-397] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2011] [Accepted: 08/06/2011] [Indexed: 11/16/2022] Open
Abstract
Background Developing a model for codon substitutions is essential for the analyses of protein sequences. Recent studies on the mutation rates in the non-coding regions have shown that CpG mutation rates in the human genome are negatively correlated to the local GC content and to the densities of functional elements. This study aimed at understanding the effect of genomic features, namely, GC content, gene density, and frequency of CpG islands, on the rates of codon substitution in human chromosomes. Results Codon substitution rates of CpG to TpG mutations, TpG to CpG mutations, and non-CpG transitions and transversions in humans were estimated by comparing the coding regions of thousands of human and chimpanzee genes and inferring their ancestral sequences by using macaque genes as the outgroup. Since the genomic features are depending on each other, partial regression coefficients of these features were obtained. Conclusion The substitution rates of codons depend on gene densities of the chromosomes. Transcription-associated mutation is one such pressure. On the basis of these results, a model of codon substitutions that incorporates the effect of genomic features on codon substitution in human chromosomes was developed.
Collapse
Affiliation(s)
- Kazuharu Misawa
- Research Program for Computational Science, Research and Development Group for Next-Generation Integrated Living Matter Simulation, Fusion of Data and Analysis Research and Development Team, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa 230-0045, Japan.
| |
Collapse
|