1
|
Kimura A, Go AC, Markow T, Ranz JM. Evidence of Nonrandom Patterns of Functional Chromosome Organization in Danaus plexippus. Genome Biol Evol 2024; 16:evae054. [PMID: 38488057 PMCID: PMC10972686 DOI: 10.1093/gbe/evae054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2024] [Indexed: 05/01/2024] Open
Abstract
Our understanding on the interplay between gene functionality and gene arrangement at different chromosome scales relies on a few Diptera and the honeybee, species with quality reference genome assemblies, accurate gene annotations, and abundant transcriptome data. Using recently generated 'omic resources in the monarch butterfly Danaus plexippus, a species with many more and smaller chromosomes relative to Drosophila species and the honeybee, we examined the organization of genes preferentially expressed at broadly defined developmental stages (larva, pupa, adult males, and adult females) at both fine and whole-chromosome scales. We found that developmental stage-regulated genes do not form more clusters, but do form larger clusters, than expected by chance, a pattern consistent across the gene categories examined. Notably, out of the 30 chromosomes in the monarch genome, 12 of them, plus the fraction of the chromosome Z that corresponds to the ancestral Z in other Lepidoptera, were found enriched for developmental stage-regulated genes. These two levels of nonrandom gene organization are not independent as enriched chromosomes for developmental stage-regulated genes tend to harbor disproportionately large clusters of these genes. Further, although paralogous genes were overrepresented in gene clusters, their presence is not enough to explain two-thirds of the documented cases of whole-chromosome enrichment. The composition of the largest clusters often included paralogs from more than one multigene family as well as unrelated single-copy genes. Our results reveal intriguing patterns at the whole-chromosome scale in D. plexippus while shedding light on the interplay between gene expression and chromosome organization beyond Diptera and Hymenoptera.
Collapse
Affiliation(s)
- Ashlyn Kimura
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA 92647, USA
| | - Alwyn C Go
- Department of Biology, University of Winnipeg, Winnipeg, MB R3B 2E9, Canada
| | - Therese Markow
- Unidad de Genómica Avanzada (Langebio), CINVESTAV, Irapuato, GTO 36824, México
- Section of Cell and Developmental Biology, Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - José M Ranz
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA 92647, USA
| |
Collapse
|
2
|
Wang S, Girardello M, Zhang W. Potential and progress of studying mountain biodiversity by means of butterfly genetics and genomics. J Genet Genomics 2024; 51:292-301. [PMID: 37302475 DOI: 10.1016/j.jgg.2023.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 06/04/2023] [Accepted: 06/05/2023] [Indexed: 06/13/2023]
Abstract
Mountains are rich in biodiversity, and butterflies are species-rich and have a good ecological and evolutionary research foundation. This review addresses the potential and progress of studying mountain biodiversity using butterflies as a model. We discuss the uniqueness of mountain ecosystems, factors influencing the distribution of mountain butterflies, representative genetic and evolutionary models in butterfly research, and evolutionary studies of mountain biodiversity involving butterfly genetics and genomics. Finally, we demonstrate the necessity of studying mountain butterflies and propose future perspectives. This review provides insights for studying the biodiversity of mountain butterflies as well as a summary of research methods for reference.
Collapse
Affiliation(s)
- Shuting Wang
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Marco Girardello
- cE3c - Centre for Ecology, Evolution and Environmental Changes/Azorean Biodiversity Group, Faculdade de Ciências Agrárias e do Ambiente, Universidade dos Açores, 9700-042 Angra do Heroísmo, Terceira, Portugal
| | - Wei Zhang
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China.
| |
Collapse
|
3
|
Llorente-Bousquets J, Nieves-Uribe S, Flores-Gallardo A. Exochorion in the tribe Nymphalini (Lepidoptera: Nymphalidae): the genus Hypanartia Hbner, [1821] and comparison with related genera. Zootaxa 2023; 5330:151-200. [PMID: 38221141 DOI: 10.11646/zootaxa.5330.2.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Indexed: 01/16/2024]
Abstract
We describe and compare the exochorion of six species of Hypanartia (H. paullus, H. bella, H. lethe, H. godmanii, H. dione disjuncta, and H. trimaculata autumna) using specimens collected in the field and deposited in entomological collections. We used a standard staining technique and scanning electron microscopy to visualize and describe the main exochorionic characters, especially the ridges and the differentiation in the micropylar, perimicropylar, and transition zones in the apical region. We included plates with photographs, drawings, and schemes for clearer visualization of these structures, which are tabulated for comparison. For our characterization, we considered the ridges in colonnade as the main exochorionic feature in the Nymphalini. Thus, we made brief comparisons of these features among several species of the tribe, particularly with the genus Antanartia. The exochorionic characters agree with the separation of groups (paullus group and dione group). By examining the characteristics of H. paullus, the basal species of the paullus group, we found that ridges with conspicuous columns (e.g., in one section only) appears to be the plesiomorphic state, while those with elongated shafts and walls only in the intercolumn represent the apomorphic state.
Collapse
Affiliation(s)
- Jorge Llorente-Bousquets
- Museo de Zoologa (Entomologa); Departamento de Biologa Evolutiva; Facultad de Ciencias; Universidad Nacional Autnoma de Mxico; Mxico; 04510; CDMX; Mxico.
| | - Sandra Nieves-Uribe
- Museo de Zoologa (Entomologa); Departamento de Biologa Evolutiva; Facultad de Ciencias; Universidad Nacional Autnoma de Mxico; Mxico; 04510; CDMX; Mxico; Posgrado en Ciencias Biolgicas; Unidad de Posgrado de la Universidad Nacional Autnoma de Mxico; Edificio D; 1 Piso; Circuito de Posgrados; Ciudad Universitaria; 04510; CDMX; Mxico.
| | - Adrin Flores-Gallardo
- Museo de Zoologa (Entomologa); Departamento de Biologa Evolutiva; Facultad de Ciencias; Universidad Nacional Autnoma de Mxico; Mxico; 04510; CDMX; Mxico; Posgrado en Filosofa de la Ciencia; Universidad Nacional Autnoma de Mxico; Mxico.
| |
Collapse
|
4
|
Pei X, Liu X, Zang C, Yu S, Xie J, Lin Y, Liang B, Wu X, Liang C. Genome Resource of Streptomyces atratus PY-1, a Broad-Spectrum Antimicrobial Strain in Particular Antagonistic Against Plasmopara viticola. PLANT DISEASE 2023; 107:2506-2508. [PMID: 36774571 DOI: 10.1094/pdis-09-22-2093-a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Streptomyces atratus PY-1 exhibited promising antimicrobial properties; in particular, it is highly inhibitory to Plasmopara viticola, which causes downy mildew of grape. It is very necessary to carry out systematic and in-depth research on the PY-1 strain for the improvement, application, and promotion of biocontrol agents. The PY-1 genome was fully sequenced and assembled. We present the draft genome sequence of PY-1, with a size of 9, 254, and 781 bp. Preliminary analysis on the PY-1 genome sequence shows that at least 35 gene clusters are involved in the biosynthesis of polyketides, terpenes, and nonribosomally synthesized peptides.
Collapse
Affiliation(s)
- Xue Pei
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
- College of Plant Protection, China Agricultural University, Beijing 100193, China
| | - Xiaozhou Liu
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
| | - Chaoqun Zang
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
| | - Shuyi Yu
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
| | - Jinhui Xie
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
| | - Ying Lin
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
| | - Bingbing Liang
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
| | - Xuehong Wu
- College of Plant Protection, China Agricultural University, Beijing 100193, China
| | - Chunhao Liang
- Institute of Plant Protection, Liaoning Academy of Agricultural Sciences, Shenyang 110161, China
| |
Collapse
|
5
|
Pan Z, Ding Y, Zhang S, Li L, Ma F. Chromosome-Level Genome Assembly of Papilio elwesi Leech, 1889 (Lepidoptera: Papilionidae). INSECTS 2023; 14:304. [PMID: 36975989 PMCID: PMC10058580 DOI: 10.3390/insects14030304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 03/16/2023] [Accepted: 03/17/2023] [Indexed: 06/18/2023]
Abstract
A rarely seen butterfly species, the large swallowtail butterfly Papilio elwesi Leech, 1889 (Lepidoptera: Papilionidae), endemic to the Chinese mainland, has been declared a state-protected animal in China since 2000, but its genome is not yet available. To obtain high-quality genome assembly and annotation, we sequenced the genome and transcriptome of P. elwesi using the PacBio and PromethION platforms, respectively. The final assembled genome was 358.51 Mb, of which 97.59% was anchored to chromosomes (30 autosomes and 1 Z sex chromosome), with a contig/scaffold N50 length of 6.79/12.32 Mb and 99.0% (n = 1367) BUSCO completeness. The genome annotation pointed to 36.82% (131.99 Mb) repetitive elements and 1296 non-coding RNAs in the genome, along with 13,681 protein-coding genes that cover 98.6% (1348) of the BUSCO genes. Among the 11,499 identified gene families, 104 underwent significantly rapid expansions or contractions, and these rapidly expanding families play roles in detoxification and metabolism. Additionally, strong synteny exists between the chromosomes of P. elwesi and P. machaon. The chromosome-level genome of P. elwesi could serve as an important genomic resource for furthering our understanding of butterfly evolution and for more in-depth genomic analyses.
Collapse
Affiliation(s)
- Zhixiang Pan
- School of Life Sciences, Taizhou University, Taizhou 318000, China
| | - Yinhuan Ding
- Department of Agronomy and Horticulture, Jiangsu Vocational College of Agriculture and Forestry, Jurong 212400, China
| | - Shusheng Zhang
- The Management Center of Wuyanling National Natural Reserve in Zhejiang, Wenzhou 325500, China
| | - Luxian Li
- Zhejiang Environment Technology Company Limited, Hangzhou 311100, China
| | - Fangzhou Ma
- Nanjing Institute of Environmental Sciences under Ministry of Ecology and Environment, Nanjing 210042, China
| |
Collapse
|
6
|
Wang J, Yuan L, Tang J, Liu J, Sun C, Itgen MW, Chen G, Sessions SK, Zhang G, Mueller RL. Transposable element and host silencing activity in gigantic genomes. Front Cell Dev Biol 2023; 11:1124374. [PMID: 36910142 PMCID: PMC9998948 DOI: 10.3389/fcell.2023.1124374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 02/09/2023] [Indexed: 02/26/2023] Open
Abstract
Transposable elements (TEs) and the silencing machinery of their hosts are engaged in a germline arms-race dynamic that shapes TE accumulation and, therefore, genome size. In animal species with extremely large genomes (>10 Gb), TE accumulation has been pushed to the extreme, prompting the question of whether TE silencing also deviates from typical conditions. To address this question, we characterize TE silencing via two pathways-the piRNA pathway and KRAB-ZFP transcriptional repression-in the male and female gonads of Ranodon sibiricus, a salamander species with a ∼21 Gb genome. We quantify 1) genomic TE diversity, 2) TE expression, and 3) small RNA expression and find a significant relationship between the expression of piRNAs and TEs they target for silencing in both ovaries and testes. We also quantified TE silencing pathway gene expression in R. sibiricus and 14 other vertebrates with genome sizes ranging from 1 to 130 Gb and find no association between pathway expression and genome size. Taken together, our results reveal that the gigantic R. sibiricus genome includes at least 19 putatively active TE superfamilies, all of which are targeted by the piRNA pathway in proportion to their expression levels, suggesting comprehensive piRNA-mediated silencing. Testes have higher TE expression than ovaries, suggesting that they may contribute more to the species' high genomic TE load. We posit that apparently conflicting interpretations of TE silencing and genomic gigantism in the literature, as well as the absence of a correlation between TE silencing pathway gene expression and genome size, can be reconciled by considering whether the TE community or the host is currently "on the attack" in the arms race dynamic.
Collapse
Affiliation(s)
- Jie Wang
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, China
| | - Liang Yuan
- School of Life Sciences, Xinjiang Normal University, Urumqi, China
| | - Jiaxing Tang
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, China.,College of Life Sciences, Sichuan Normal University, Chengdu, China
| | - Jiongyu Liu
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, China
| | - Cheng Sun
- College of Life Sciences, Capital Normal University, Beijing, China
| | - Michael W Itgen
- Department of Biology, Colorado State University, Fort Collins, CO, United States
| | - Guiying Chen
- College of Life Sciences, Sichuan Normal University, Chengdu, China
| | | | - Guangpu Zhang
- CAS Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization & Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan, China.,College of Life Sciences, Sichuan Normal University, Chengdu, China
| | | |
Collapse
|
7
|
Bastide H, López-Villavicencio M, Ogereau D, Lledo J, Dutrillaux AM, Debat V, Llaurens V. Genome assembly of 3 Amazonian Morpho butterfly species reveals Z-chromosome rearrangements between closely related species living in sympatry. Gigascience 2022; 12:giad033. [PMID: 37216769 PMCID: PMC10202424 DOI: 10.1093/gigascience/giad033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 02/13/2023] [Accepted: 05/12/2023] [Indexed: 05/24/2023] Open
Abstract
The genomic processes enabling speciation and species coexistence in sympatry are still largely unknown. Here we describe the whole-genome sequencing and assembly of 3 closely related species from the butterfly genus Morpho: Morpho achilles (Linnaeus, 1758), Morpho helenor (Cramer, 1776), and Morpho deidamia (Höbner, 1819). These large blue butterflies are emblematic species of the Amazonian rainforest. They live in sympatry in a wide range of their geographical distribution and display parallel diversification of dorsal wing color pattern, suggesting local mimicry. By sequencing, assembling, and annotating their genomes, we aim at uncovering prezygotic barriers preventing gene flow between these sympatric species. We found a genome size of 480 Mb for the 3 species and a chromosomal number ranging from 2n = 54 for M. deidamia to 2n = 56 for M. achilles and M. helenor. We also detected inversions on the sex chromosome Z that were differentially fixed between species, suggesting that chromosomal rearrangements may contribute to their reproductive isolation. The annotation of their genomes allowed us to recover in each species at least 12,000 protein-coding genes and to discover duplications of genes potentially involved in prezygotic isolation like genes controlling color discrimination (L-opsin). Altogether, the assembly and the annotation of these 3 new reference genomes open new research avenues into the genomic architecture of speciation and reinforcement in sympatry, establishing Morpho butterflies as a new eco-evolutionary model.
Collapse
Affiliation(s)
| | - Manuela López-Villavicencio
- Institut de Systématique, Evolution et Biodiversité (UMR 7205 CNRS/MNHN/SU/EPHE/UA), Muséum National d’Histoire Naturelle–CP50, 75005 Paris, France
| | | | - Joanna Lledo
- GeT-PlaGe, Bât G2, INRAe, 31326 Castanet-Tolosan Cedex, France
| | - Anne-Marie Dutrillaux
- Institut de Systématique, Evolution et Biodiversité (UMR 7205 CNRS/MNHN/SU/EPHE/UA), Muséum National d’Histoire Naturelle–CP50, 75005 Paris, France
| | - Vincent Debat
- Institut de Systématique, Evolution et Biodiversité (UMR 7205 CNRS/MNHN/SU/EPHE/UA), Muséum National d’Histoire Naturelle–CP50, 75005 Paris, France
| | - Violaine Llaurens
- Institut de Systématique, Evolution et Biodiversité (UMR 7205 CNRS/MNHN/SU/EPHE/UA), Muséum National d’Histoire Naturelle–CP50, 75005 Paris, France
| |
Collapse
|
8
|
Whiteford S, van’t Hof AE, Krishna R, Marubbi T, Widdison S, Saccheri IJ, Guest M, Morrison NI, Darby AC. Recovering individual haplotypes and a contiguous genome assembly from pooled long-read sequencing of the diamondback moth (Lepidoptera: Plutellidae). G3 (BETHESDA, MD.) 2022; 12:jkac210. [PMID: 35980174 PMCID: PMC9526047 DOI: 10.1093/g3journal/jkac210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 07/25/2022] [Indexed: 06/15/2023]
Abstract
The assembly of divergent haplotypes using noisy long-read data presents a challenge to the reconstruction of haploid genome assemblies, due to overlapping distributions of technical sequencing error, intralocus genetic variation, and interlocus similarity within these data. Here, we present a comparative analysis of assembly algorithms representing overlap-layout-consensus, repeat graph, and de Bruijn graph methods. We examine how postprocessing strategies attempting to reduce redundant heterozygosity interact with the choice of initial assembly algorithm and ultimately produce a series of chromosome-level assemblies for an agricultural pest, the diamondback moth, Plutella xylostella (L.). We compare evaluation methods and show that BUSCO analyses may overestimate haplotig removal processing in long-read draft genomes, in comparison to a k-mer method. We discuss the trade-offs inherent in assembly algorithm and curation choices and suggest that "best practice" is research question dependent. We demonstrate a link between allelic divergence and allele-derived contig redundancy in final genome assemblies and document the patterns of coding and noncoding diversity between redundant sequences. We also document a link between an excess of nonsynonymous polymorphism and haplotigs that are unresolved by assembly or postassembly algorithms. Finally, we discuss how this phenomenon may have relevance for the usage of noisy long-read genome assemblies in comparative genomics.
Collapse
Affiliation(s)
- Samuel Whiteford
- Corresponding author: Institute of Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK.
| | - Arjen E van’t Hof
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Ritesh Krishna
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
- IBM Research UK, STFC Daresbury Laboratory, Warrington WA4 4AD, UK
| | | | - Stephanie Widdison
- General Bioinformatics, Jealott's Hill International Research Centre, Bracknell RG42 6EY, UK
| | - Ilik J Saccheri
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| | - Marcus Guest
- Syngenta, Jealott's Hill International Research Centre, Bracknell, RG42 6EY, UK
| | | | - Alistair C Darby
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK
| |
Collapse
|
9
|
Mackintosh A, Laetsch DR, Baril T, Ebdon S, Jay P, Vila R, Hayward A, Lohse K. The genome sequence of the scarce swallowtail, Iphiclides podalirius. G3 GENES|GENOMES|GENETICS 2022; 12:6656352. [PMID: 35929795 PMCID: PMC9434224 DOI: 10.1093/g3journal/jkac193] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 07/20/2022] [Indexed: 12/04/2022]
Abstract
The scarce swallowtail, Iphiclides podalirius (Linnaeus, 1758), is a species of butterfly in the family Papilionidae. Here, we present a chromosome-level genome assembly for Iphiclides podalirius as well as gene and transposable element annotations. We investigate how the density of genomic features differs between the 30 Iphiclides podalirius chromosomes. We find that shorter chromosomes have higher heterozygosity at four-fold-degenerate sites and a greater density of transposable elements. While the first result is an expected consequence of differences in recombination rate, the second suggests a counter-intuitive relationship between recombination and transposable element evolution. This high-quality genome assembly, the first for any species in the tribe Leptocircini, will be a valuable resource for population genomics in the genus Iphiclides and comparative genomics more generally.
Collapse
Affiliation(s)
- Alexander Mackintosh
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Dominik R Laetsch
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus , Cornwall TR10 9FE, UK
| | - Sam Ebdon
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| | - Paul Jay
- Ecologie Systématique Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay , 91400 Orsay, France
| | - Roger Vila
- Institut de Biologia Evolutiva (CSIC—Universitat Pompeu Fabra) , Barcelona 08003, Spain
| | - Alex Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus , Cornwall TR10 9FE, UK
| | - Konrad Lohse
- Institute of Ecology and Evolution, University of Edinburgh , Edinburgh EH9 3FL, UK
| |
Collapse
|
10
|
Guo R, Papanicolaou A, Fritz ML. Validation of reference-assisted assembly using existing and novel Heliothine genomes. Genomics 2022; 114:110441. [PMID: 35931274 DOI: 10.1016/j.ygeno.2022.110441] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 07/19/2022] [Accepted: 07/29/2022] [Indexed: 11/16/2022]
Abstract
Chloridea subflexa and Chloridea virescens are a pair of closely related noctuid species exhibiting pheromone-based sexual isolation and divergent host plant preferences. We produced a novel Illumina short read C. subflexa genome assembly and an improved C. virescens genome assembly, which offer opportunities to study the genomic basis for evolutionarily important traits in this lepidopteran family with few genomic resources. We then examined the feasibility of reference-assisted assembly, an approach that leverages existing high quality genomic resources for genome improvement in closely related taxa and applied it to our Heliothine genomes. Our work demonstrates that reference-assisted assembly has the potential to enhance contiguity and completeness of existing insect genomic resources with minimal additional laboratory costs. We conclude by discussing both the potential and pitfalls of reference-assisted assembly according to the intended downstream assembly application.
Collapse
Affiliation(s)
- Rong Guo
- Department of Entomology, University of Maryland, College Park, MD 20742, USA; Computational Biology, Bioinformatics and Genomics Program, Department of Biological Sciences, University of Maryland, College Park, MD 20742, USA
| | - Alexie Papanicolaou
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW 2753, Australia.
| | - Megan L Fritz
- Department of Entomology, University of Maryland, College Park, MD 20742, USA; Computational Biology, Bioinformatics and Genomics Program, Department of Biological Sciences, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
11
|
Tunstrom K, Wheat CW, Parmesan C, Singer MC, Mikheyev AS. A genome for Edith's checkerspot butterfly: an insect with complex host-adaptive suites and rapid evolutionary responses to environmental changes. Genome Biol Evol 2022; 14:6649524. [PMID: 35876165 PMCID: PMC9348621 DOI: 10.1093/gbe/evac113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/06/2022] [Indexed: 11/15/2022] Open
Abstract
Insects have been key players in the assessments of biodiversity impacts of anthropogenically driven environmental change, including the evolutionary and ecological impacts of climate change. Populations of Edith’s Checkerspot Butterfly (Euphydryas editha) adapt rapidly to diverse environmental conditions, with numerous high-impact studies documenting these dynamics over several decades. However, studies of the underlying genetic bases of these responses have been hampered by missing genomic resources, limiting the ability to connect genomic responses to environmental change. Using a combination of Oxford Nanopore long reads, haplotype merging, HiC scaffolding followed by Illumina polishing, we generated a highly contiguous and complete assembly (contigs n = 142, N50 = 21.2 Mb, total length = 607.8 Mb; BUSCOs n = 5,286, single copy complete = 97.8%, duplicated = 0.9%, fragmented = 0.3%, missing = 1.0%). A total of 98% of the assembled genome was placed into 31 chromosomes, which displayed large-scale synteny with other well-characterized lepidopteran genomes. The E. editha genome, annotation, and functional descriptions now fill a missing gap for one of the leading field-based ecological model systems in North America.
Collapse
Affiliation(s)
- Kalle Tunstrom
- Department of Zoology, Stockholm University, Stockholm, Sweden
| | | | - Camille Parmesan
- Station d'Écologie Théorique et Expérimentale, CNRS, 2 route du CNRS, 09200 Moulis, France.,Biological and Marine Sciences, University of Plymouth, Plymouth, U.K.,Department of Geological Sciences, University of Texas at Austin, U.S.A
| | - Michael C Singer
- Station d'Écologie Théorique et Expérimentale, CNRS, 2 route du CNRS, 09200 Moulis, France.,Biological and Marine Sciences, University of Plymouth, Plymouth, U.K
| | - Alexander S Mikheyev
- Research School of Biology, Australian National University, Canberra, ACT, Australia
| |
Collapse
|
12
|
Chen X, Zhu W, Wang B, Wang Y, You P. A chromosome-level genome assembly of Paracymoriza distinctalis (Lepidoptera: Crambidae: Acentropinae). ARCHIVES OF INSECT BIOCHEMISTRY AND PHYSIOLOGY 2022; 110:e21883. [PMID: 35294789 DOI: 10.1002/arch.21883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 02/20/2022] [Accepted: 02/24/2022] [Indexed: 06/14/2023]
Abstract
Paracymoriza distinctalis is a semiaquatic lepidopteran insect, which is of great value for studying the differentiation of the Pyraloidea superfamily. However, the understanding of heredity, evolution, and functional genomics of P. distinctalis are limited by few genome-wide resources. Here, we applied PacBio sequencing and the chromosome capture technique to assemble the first P. distinctalis genome from a single female individual. The genome size is 1.2 Gb with 32 chromosomes and the N50 is 38.91 Mb. Approximately 576.37 Mb, accounting for 48.93% of the genome, was identified as repeats. The genome comprises 39,003 protein-coding genes, 66.56% of which were functionally annotated. Comparative genomics analysis suggested that the common ancestor of P. distinctalis and Chilo suppressalis lived ~83.5 million years ago. This chromosome-level genome assembly work is not only conducive to the understanding of P. distinctalis, but also may promote the study of the genomes of other lepidopteran species.
Collapse
Affiliation(s)
- Xiaoning Chen
- College of Life Sciences, Shaanxi Normal University, Xi'an, Shaanxi, China
- Shaanxi Key Laboratory for Animal Conservation, Shaanxi Institute of Zoology, Xi'an, Shaanxi, China
| | - Wenbo Zhu
- College of Life Sciences, Shaanxi Normal University, Xi'an, Shaanxi, China
| | - Biao Wang
- College of Life Sciences, Shaanxi Normal University, Xi'an, Shaanxi, China
| | - Yuqi Wang
- College of Life Sciences, Shaanxi Normal University, Xi'an, Shaanxi, China
| | - Ping You
- College of Life Sciences, Shaanxi Normal University, Xi'an, Shaanxi, China
| |
Collapse
|
13
|
Hai DM, Yen DT, Liem PT, Tam BM, Huong DTT, Hang BTB, Hieu DQ, Garigliany MM, Coppieters W, Kestemont P, Phuong NT, Farnir F. A High-Quality Genome Assembly of Striped Catfish (Pangasianodon hypophthalmus) Based on Highly Accurate Long-Read HiFi Sequencing Data. Genes (Basel) 2022; 13:genes13050923. [PMID: 35627308 PMCID: PMC9141817 DOI: 10.3390/genes13050923] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 05/17/2022] [Accepted: 05/18/2022] [Indexed: 11/26/2022] Open
Abstract
The HiFi sequencing technology yields highly accurate long-read data with accuracies greater than 99.9% that can be used to improve results for complex applications such as genome assembly. Our study presents a high-quality chromosome-scale genome assembly of striped catfish (Pangasianodon hypophthalmus), a commercially important species cultured mainly in Vietnam, integrating HiFi reads and Hi-C data. A 788.4 Mb genome containing 381 scaffolds with an N50 length of 21.8 Mb has been obtained from HiFi reads. These scaffolds have been further ordered and clustered into 30 chromosome groups, ranging from 1.4 to 57.6 Mb, based on Hi-C data. The present updated assembly has a contig N50 of 14.7 Mb, representing a 245-fold and 4.2-fold improvement over the previous Illumina and Illumina-Nanopore-Hi-C based version, respectively. In addition, the proportion of repeat elements and BUSCO genes identified in our genome is remarkably higher than in the two previously released striped catfish genomes. These results highlight the power of using HiFi reads to assemble the highly repetitive regions and to improve the quality of genome assembly. The updated, high-quality genome assembled in this work will provide a valuable genomic resource for future population genetics, conservation biology and selective breeding studies of striped catfish.
Collapse
Affiliation(s)
- Dao Minh Hai
- FARAH/Sustainable Animal Production, Faculty of Veterinary Medicine, University of Liege (B43), 4000 Liege, Belgium;
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Duong Thuy Yen
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Pham Thanh Liem
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Bui Minh Tam
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Do Thi Thanh Huong
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Bui Thi Bich Hang
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Dang Quang Hieu
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Mutien-Marie Garigliany
- FARAH/Veterinary Public Health, Faculty of Veterinary Medicine, University of Liege (B43), 4000 Liege, Belgium;
| | | | - Patrick Kestemont
- Research Unit in Environmental and Evolutionary Biology, Institute of Life, Earth & Environnment, University of Namur, rue de Bruxelles 61, 5000 Namur, Belgium;
| | - Nguyen Thanh Phuong
- College of Aquaculture and Fisheries, Can Tho University, Can Tho 900000, Vietnam; (D.T.Y.); (P.T.L.); (B.M.T.); (D.T.T.H.); (B.T.B.H.); (D.Q.H.); (N.T.P.)
| | - Frédéric Farnir
- FARAH/Sustainable Animal Production, Faculty of Veterinary Medicine, University of Liege (B43), 4000 Liege, Belgium;
- Correspondence:
| |
Collapse
|
14
|
Feron R, Waterhouse RM. Assessing species coverage and assembly quality of rapidly accumulating sequenced genomes. Gigascience 2022; 11:6537158. [PMID: 35217859 PMCID: PMC8881204 DOI: 10.1093/gigascience/giac006] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 12/12/2021] [Accepted: 01/13/2022] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND Ambitious initiatives to coordinate genome sequencing of Earth's biodiversity mean that the accumulation of genomic data is growing rapidly. In addition to cataloguing biodiversity, these data provide the basis for understanding biological function and evolution. Accurate and complete genome assemblies offer a comprehensive and reliable foundation upon which to advance our understanding of organismal biology at genetic, species, and ecosystem levels. However, ever-changing sequencing technologies and analysis methods mean that available data are often heterogeneous in quality. To guide forthcoming genome generation efforts and promote efficient prioritization of resources, it is thus essential to define and monitor taxonomic coverage and quality of the data. FINDINGS Here we present an automated analysis workflow that surveys genome assemblies from the United States NCBI, assesses their completeness using the relevant BUSCO datasets, and collates the results into an interactively browsable resource. We apply our workflow to produce a community resource of available assemblies from the phylum Arthropoda, the Arthropoda Assembly Assessment Catalogue. Using this resource, we survey current taxonomic coverage and assembly quality at the NCBI, examine how key assembly metrics relate to gene content completeness, and compare results from using different BUSCO lineage datasets. CONCLUSIONS These results demonstrate how the workflow can be used to build a community resource that enables large-scale assessments to survey species coverage and data quality of available genome assemblies, and to guide prioritizations for ongoing and future sampling, sequencing, and genome generation initiatives.
Collapse
Affiliation(s)
- Romain Feron
- Department of Ecology and Evolution, Le Biophore UNIL-Sorge, University of Lausanne, Lausanne 1015, Switzerland.,Evolutionary-Functional Genomics Group, L'Amphipole UNIL-Sorge, Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Robert M Waterhouse
- Department of Ecology and Evolution, Le Biophore UNIL-Sorge, University of Lausanne, Lausanne 1015, Switzerland.,Evolutionary-Functional Genomics Group, L'Amphipole UNIL-Sorge, Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| |
Collapse
|
15
|
Abstract
In less than 25 y, the field of animal genome science has transformed from a discipline seeking its first glimpses into genome sequences across the Tree of Life to a global enterprise with ambitions to sequence genomes for all of Earth's eukaryotic diversity [H. A. Lewin et al., Proc. Natl. Acad. Sci. U.S.A. 115, 4325-4333 (2018)]. As the field rapidly moves forward, it is important to take stock of the progress that has been made to best inform the discipline's future. In this Perspective, we provide a contemporary, quantitative overview of animal genome sequencing. We identified the best available genome assemblies in GenBank, the world's most extensive genetic database, for 3,278 unique animal species across 24 phyla. We assessed taxonomic representation, assembly quality, and annotation status for major clades. We show that while tremendous taxonomic progress has occurred, stark disparities in genomic representation exist, highlighted by a systemic overrepresentation of vertebrates and underrepresentation of arthropods. In terms of assembly quality, long-read sequencing has dramatically improved contiguity, whereas gene annotations are available for just 34.3% of taxa. Furthermore, we show that animal genome science has diversified in recent years with an ever-expanding pool of researchers participating. However, the field still appears to be dominated by institutions in the Global North, which have been listed as the submitting institution for 77% of all assemblies. We conclude by offering recommendations for improving genomic resource availability and research value while also broadening global representation.
Collapse
|
16
|
Li X, Ellis E, Plotkin D, Imada Y, Yago M, Heckenhauer J, Cleland TP, Dikow RB, Dikow T, Storer CG, Kawahara AY, Frandsen PB. First Annotated Genome of a Mandibulate Moth, Neomicropteryx cornuta, Generated Using PacBio HiFi Sequencing. Genome Biol Evol 2021; 13:6380144. [PMID: 34599325 PMCID: PMC8557830 DOI: 10.1093/gbe/evab229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/27/2021] [Indexed: 11/14/2022] Open
Abstract
We provide a new, annotated genome assembly of Neomicropteryx cornuta, a species of the so-called mandibulate archaic moths (Lepidoptera: Micropterigidae). These moths belong to a lineage that is thought to have split from all other Lepidoptera more than 300 Ma and are consequently vital to understanding the early evolution of superorder Amphiesmenoptera, which contains the order Lepidoptera (butterflies and moths) and its sister order Trichoptera (caddisflies). Using PacBio HiFi sequencing reads, we assembled a highly contiguous genome with a contig N50 of nearly 17 Mb. The assembled genome length of 541,115,538 bp is about half the length of the largest published Amphiesmenoptera genome (Limnephilus lunatus, Trichoptera) and double the length of the smallest (Papilio polytes, Lepidoptera). We find high recovery of universal single copy orthologs with 98.1% of BUSCO genes present and provide a genome annotation of 15,643 genes aided by resolved isoforms from PacBio IsoSeq data. This high-quality genome assembly provides an important resource for studying ecological and evolutionary transitions in the early evolution of Amphiesmenoptera.
Collapse
Affiliation(s)
- Xuankun Li
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, USA
| | - Emily Ellis
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, USA
| | - David Plotkin
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, USA
| | - Yume Imada
- Graduate School of Science and Engineering, Ehime University, Matsuyama, Japan
| | - Masaya Yago
- The University Museum, The University of Tokyo, Hongo, Bunkyo-ku, Japan
| | - Jacqueline Heckenhauer
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Germany.,Department of Terrestrial Zoology, Entomology III, Senckenberg Research Institute and Natural History Museum Frankfurt, Frankfurt, Germany
| | - Timothy P Cleland
- Museum Conservation Institute, Smithsonian Institution, Suitland, Maryland, USA
| | - Rebecca B Dikow
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, District of Columbia, USA
| | - Torsten Dikow
- Department of Entomology, National Museum of Natural History (USNM), Smithsonian Institution, Washington, District of Columbia, USA
| | - Caroline G Storer
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, USA
| | - Akito Y Kawahara
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, USA
| | - Paul B Frandsen
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Germany.,Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, District of Columbia, USA.,Department of Plant and Wildlife Sciences, Brigham Young University, USA
| |
Collapse
|
17
|
Ellis EA, Storer CG, Kawahara AY. De novo genome assemblies of butterflies. Gigascience 2021; 10:6291117. [PMID: 34076242 PMCID: PMC8170690 DOI: 10.1093/gigascience/giab041] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 07/22/2020] [Accepted: 05/05/2021] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND The availability of thousands of genomes has enabled new advancements in biology. However, many genomes have not been investigated for their quality. Here we examine quality trends in a taxonomically diverse and well-known group, butterflies (Papilionoidea), and provide draft, de novo assemblies for all available butterfly genomes. Owing to massive genome sequencing investment and taxonomic curation, this is an excellent group to explore genome quality. FINDINGS We provide de novo assemblies for all 822 available butterfly genomes and interpret their quality in terms of completeness and continuity. We identify the 50 highest quality genomes across butterflies and conclude that the ringlet, Aphantopus hyperantus, has the highest quality genome. Our post-processing of draft genome assemblies identified 118 butterfly genomes that should not be reused owing to contamination or extremely low quality. However, many draft genomes are of high utility, especially because permissibility of low-quality genomes is dependent on the objective of the study. Our assemblies will serve as a key resource for papilionid genomics, especially for researchers without computational resources. CONCLUSIONS Quality metrics and assemblies are typically presented with annotated genome accessions but rarely with de novo genomes. We recommend that studies presenting genome sequences provide the assembly and some metrics of quality because quality will significantly affect downstream results. Transparency in quality metrics is needed to improve the field of genome science and encourage data reuse.
Collapse
Affiliation(s)
- Emily A Ellis
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Caroline G Storer
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Akito Y Kawahara
- McGuire Center for Lepidoptera and Biodiversity, Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| |
Collapse
|