1
|
Liu J, Xiong Z, Pan Y, Zhao J, Dai W, Guo Q, Liu W, Li Q. Draft genome assemblies of the ponerine ant Odontoponera transversa and the carpenter ant Camponotus friedae (Hymenoptera: Formicidae). BMC Genom Data 2024; 25:70. [PMID: 39009995 PMCID: PMC11251112 DOI: 10.1186/s12863-024-01253-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Accepted: 07/10/2024] [Indexed: 07/17/2024] Open
Abstract
OBJECTIVES Ants are ecologically dominant insects in most terrestrial ecosystems, with more than 14,000 extant species in about 340 genera recorded to date. However, genomic resources are still scarce for most species, especially for species endemic in East or Southeast Asia, limiting the study of phylogeny, speciation and adaptation of this evolutionarily successful animal lineage. Here, we assemble and annotate the genomes of Odontoponera transversa and Camponotus friedae, two ant species with a natural distribution in China, to facilitate future study of ant evolution. DATA DESCRIPTION We obtained a total of 16 Gb and 51 Gb PacBio HiFi data for O. transversa and C. friedae, respectively, which were assembled into the draft genomes of 339 Mb for O. transversa and 233 Mb for C. friedae. Genome assessments by multiple metrics showed good completeness and high accuracy of the two assemblies. Gene annotations assisted by RNA-seq data yielded a comparable number of protein-coding genes in the two genomes (10,892 for O. transversa and 11,296 for C. friedae), while repeat annotations revealed a remarkable difference of repeat content between these two ant species (149.4 Mb for O. transversa versus 49.7 Mb for C. friedae). Besides, complete mitochondrial genomes for the two species were assembled and annotated.
Collapse
|
2
|
Jia H, Tan S, Cai Y, Guo Y, Shen J, Zhang Y, Ma H, Zhang Q, Chen J, Qiao G, Ruan J, Zhang YE. Low-input PacBio sequencing generates high-quality individual fly genomes and characterizes mutational processes. Nat Commun 2024; 15:5644. [PMID: 38969648 PMCID: PMC11226609 DOI: 10.1038/s41467-024-49992-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 06/20/2024] [Indexed: 07/07/2024] Open
Abstract
Long-read sequencing, exemplified by PacBio, revolutionizes genomics, overcoming challenges like repetitive sequences. However, the high DNA requirement ( > 1 µg) is prohibitive for small organisms. We develop a low-input (100 ng), low-cost, and amplification-free library-generation method for PacBio sequencing (LILAP) using Tn5-based tagmentation and DNA circularization within one tube. We test LILAP with two Drosophila melanogaster individuals, and generate near-complete genomes, surpassing preexisting single-fly genomes. By analyzing variations in these two genomes, we characterize mutational processes: complex transpositions (transposon insertions together with extra duplications and/or deletions) prefer regions characterized by non-B DNA structures, and gene conversion of transposons occurs on both DNA and RNA levels. Concurrently, we generate two complete assemblies for the endosymbiotic bacterium Wolbachia in these flies and similarly detect transposon conversion. Thus, LILAP promises a broad PacBio sequencing adoption for not only mutational studies of flies and their symbionts but also explorations of other small organisms or precious samples.
Collapse
|
3
|
Kotb NM, Ulukaya G, Chavan A, Nguyen SC, Proskauer L, Joyce EF, Hasson D, Jagannathan M, Rangan P. Genome organization regulates nuclear pore complex formation and promotes differentiation during Drosophila oogenesis. Genes Dev 2024; 38:436-454. [PMID: 38866556 PMCID: PMC11216175 DOI: 10.1101/gad.351402.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2023] [Accepted: 05/21/2024] [Indexed: 06/14/2024]
Abstract
Genome organization can regulate gene expression and promote cell fate transitions. The differentiation of germline stem cells (GSCs) to oocytes in Drosophila involves changes in genome organization mediated by heterochromatin and the nuclear pore complex (NPC). Heterochromatin represses germ cell genes during differentiation, and NPCs anchor these silenced genes to the nuclear periphery, maintaining silencing to allow for oocyte development. Surprisingly, we found that genome organization also contributes to NPC formation, mediated by the transcription factor Stonewall (Stwl). As GSCs differentiate, Stwl accumulates at boundaries between silenced and active gene compartments. Stwl at these boundaries plays a pivotal role in transitioning germ cell genes into a silenced state and activating a group of oocyte genes and nucleoporins (Nups). The upregulation of these Nups during differentiation is crucial for NPC formation and further genome organization. Thus, cross-talk between genome architecture and NPCs is essential for successful cell fate transitions.
Collapse
|
4
|
Sperling AL, Fabian DK, Garrison E, Glover DM. Protocol for preparing Drosophila genomic DNA to create chromosome-level de novo genome assemblies. STAR Protoc 2024; 5:102974. [PMID: 38581676 PMCID: PMC11002874 DOI: 10.1016/j.xpro.2024.102974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 10/02/2023] [Accepted: 03/06/2024] [Indexed: 04/08/2024] Open
Abstract
De novo genome assemblies are common tools for examining novel biological phenomena in non-model organisms. Here, we present a protocol for preparing Drosophila genomic DNA to create chromosome-level de novo genome assemblies. We describe steps for high-molecular-weight DNA preparation with phenol or Genomic-tips, quality control, long-read nanopore sequencing, short-read DNA library preparation, and sequencing. We then detail procedures of genome assembly, annotation, and assessment that can be used for downstream comparison and functional analysis. For complete details on the use and execution of this protocol, please refer to Sperling et al.1.
Collapse
|
5
|
Rosser N, Seixas F, Queste LM, Cama B, Mori-Pezo R, Kryvokhyzha D, Nelson M, Waite-Hudson R, Goringe M, Costa M, Elias M, Mendes Eleres de Figueiredo C, Freitas AVL, Joron M, Kozak K, Lamas G, Martins ARP, McMillan WO, Ready J, Rueda-Muñoz N, Salazar C, Salazar P, Schulz S, Shirai LT, Silva-Brandão KL, Mallet J, Dasmahapatra KK. Hybrid speciation driven by multilocus introgression of ecological traits. Nature 2024; 628:811-817. [PMID: 38632397 PMCID: PMC11041799 DOI: 10.1038/s41586-024-07263-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Accepted: 03/01/2024] [Indexed: 04/19/2024]
Abstract
Hybridization allows adaptations to be shared among lineages and may trigger the evolution of new species1,2. However, convincing examples of homoploid hybrid speciation remain rare because it is challenging to demonstrate that hybridization was crucial in generating reproductive isolation3. Here we combine population genomic analysis with quantitative trait locus mapping of species-specific traits to examine a case of hybrid speciation in Heliconius butterflies. We show that Heliconius elevatus is a hybrid species that is sympatric with both parents and has persisted as an independently evolving lineage for at least 180,000 years. This is despite pervasive and ongoing gene flow with one parent, Heliconius pardalinus, which homogenizes 99% of their genomes. The remaining 1% introgressed from the other parent, Heliconius melpomene, and is scattered widely across the H. elevatus genome in islands of divergence from H. pardalinus. These islands contain multiple traits that are under disruptive selection, including colour pattern, wing shape, host plant preference, sex pheromones and mate choice. Collectively, these traits place H. elevatus on its own adaptive peak and permit coexistence with both parents. Our results show that speciation was driven by introgression of ecological traits, and that speciation with gene flow is possible with a multilocus genetic architecture.
Collapse
|
6
|
Kyriacou RG, Mulhair PO, Holland PWH. GC Content Across Insect Genomes: Phylogenetic Patterns, Causes and Consequences. J Mol Evol 2024; 92:138-152. [PMID: 38491221 PMCID: PMC10978632 DOI: 10.1007/s00239-024-10160-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 02/06/2024] [Indexed: 03/18/2024]
Abstract
The proportions of A:T and G:C nucleotide pairs are often unequal and can vary greatly between animal species and along chromosomes. The causes and consequences of this variation are incompletely understood. The recent release of high-quality genome sequences from the Darwin Tree of Life and other large-scale genome projects provides an opportunity for GC heterogeneity to be compared across a large number of insect species. Here we analyse GC content along chromosomes, and within protein-coding genes and codons, of 150 insect species from four holometabolous orders: Coleoptera, Diptera, Hymenoptera, and Lepidoptera. We find that protein-coding sequences have higher GC content than the genome average, and that Lepidoptera generally have higher GC content than the other three insect orders examined. GC content is higher in small chromosomes in most Lepidoptera species, but this pattern is less consistent in other orders. GC content also increases towards subtelomeric regions within protein-coding genes in Diptera, Coleoptera and Lepidoptera. Two species of Diptera, Bombylius major and B. discolor, have very atypical genomes with ubiquitous increase in AT content, especially at third codon positions. Despite dramatic AT-biased codon usage, we find no evidence that this has driven divergent protein evolution. We argue that the GC landscape of Lepidoptera, Diptera and Coleoptera genomes is influenced by GC-biased gene conversion, strongest in Lepidoptera, with some outlier taxa affected drastically by counteracting processes.
Collapse
|
7
|
Sawada Y, Minei R, Tabata H, Ikemura T, Wada K, Wada Y, Nagata H, Iwasaki Y. Unsupervised AI reveals insect species-specific genome signatures. PeerJ 2024; 12:e17025. [PMID: 38464746 PMCID: PMC10924456 DOI: 10.7717/peerj.17025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 02/07/2024] [Indexed: 03/12/2024] Open
Abstract
Insects are a highly diverse phylogeny and possess a wide variety of traits, including the presence or absence of wings and metamorphosis. These diverse traits are of great interest for studying genome evolution, and numerous comparative genomic studies have examined a wide phylogenetic range of insects. Here, we analyzed 22 insects belonging to a wide phylogenetic range (Endopterygota, Paraneoptera, Polyneoptera, Palaeoptera, and other insects) by using a batch-learning self-organizing map (BLSOM) for oligonucleotide compositions in their genomic fragments (100-kb or 1-Mb sequences), which is an unsupervised machine learning algorithm that can extract species-specific characteristics of the oligonucleotide compositions (genome signatures). The genome signature is of particular interest in terms of the mechanisms and biological significance that have caused the species-specific difference, and can be used as a powerful search needle to explore the various roles of genome sequences other than protein coding, and can be used to unveil mysteries hidden in the genome sequence. Since BLSOM is an unsupervised clustering method, the clustering of sequences was performed based on the oligonucleotide composition alone, without providing information about the species from which each fragment sequence was derived. Therefore, not only the interspecies separation, but also the intraspecies separation can be achieved. Here, we have revealed the specific genomic regions with oligonucleotide compositions distinct from the usual sequences of each insect genome, e.g., Mb-level structures found for a grasshopper Schistocerca americana. One aim of this study was to compare the genome characteristics of insects with those of vertebrates, especially humans, which are phylogenetically distant from insects. Recently, humans seem to be the "model organism" for which a large amount of information has been accumulated using a variety of cutting-edge and high-throughput technologies. Therefore, it is reasonable to use the abundant information from humans to study insect lineages. The specific regions of Mb length with distinct oligonucleotide compositions have also been previously observed in the human genome. These regions were enriched by transcription factor binding motifs (TFBSs) and hypothesized to be involved in the three-dimensional arrangement of chromosomal DNA in interphase nuclei. The present study characterized the species-specific oligonucleotide compositions (i.e., genome signatures) in insect genomes and identified specific genomic regions with distinct oligonucleotide compositions.
Collapse
|
8
|
Shpak M, Ghanavi HR, Lange JD, Pool JE, Stensmyr MC. Genomes from historical Drosophila melanogaster specimens illuminate adaptive and demographic changes across more than 200 years of evolution. PLoS Biol 2023; 21:e3002333. [PMID: 37824452 PMCID: PMC10569592 DOI: 10.1371/journal.pbio.3002333] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 09/11/2023] [Indexed: 10/14/2023] Open
Abstract
The ability to perform genomic sequencing on long-dead organisms is opening new frontiers in evolutionary research. These opportunities are especially notable in the case of museum collections, from which countless documented specimens may now be suitable for genomic analysis-if data of sufficient quality can be obtained. Here, we report 25 newly sequenced genomes from museum specimens of the model organism Drosophila melanogaster, including the oldest extant specimens of this species. By comparing historical samples ranging from the early 1800s to 1933 against modern-day genomes, we document evolution across thousands of generations, including time periods that encompass the species' initial occupation of northern Europe and an era of rapidly increasing human activity. We also find that the Lund, Sweden population underwent local genetic differentiation during the early 1800s to 1933 interval (potentially due to drift in a small population) but then became more similar to other European populations thereafter (potentially due to increased migration). Within each century-scale time period, our temporal sampling allows us to document compelling candidates for recent natural selection. In some cases, we gain insights regarding previously implicated selection candidates, such as ChKov1, for which our inferred timing of selection favors the hypothesis of antiviral resistance over insecticide resistance. Other candidates are novel, such as the circadian-related gene Ahcy, which yields a selection signal that rivals that of the DDT resistance gene Cyp6g1. These insights deepen our understanding of recent evolution in a model system, and highlight the potential of future museomic studies.
Collapse
|
9
|
Love RR, Sikder JR, Vivero RJ, Matute DR, Schrider DR. Strong Positive Selection in Aedes aegypti and the Rapid Evolution of Insecticide Resistance. Mol Biol Evol 2023; 40:msad072. [PMID: 36971242 PMCID: PMC10118305 DOI: 10.1093/molbev/msad072] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 02/13/2023] [Accepted: 03/23/2023] [Indexed: 03/29/2023] Open
Abstract
Aedes aegypti vectors the pathogens that cause dengue, yellow fever, Zika virus, and chikungunya and is a serious threat to public health in tropical regions. Decades of work has illuminated many aspects of Ae. aegypti's biology and global population structure and has identified insecticide resistance genes; however, the size and repetitive nature of the Ae. aegypti genome have limited our ability to detect positive selection in this mosquito. Combining new whole genome sequences from Colombia with publicly available data from Africa and the Americas, we identify multiple strong candidate selective sweeps in Ae. aegypti, many of which overlap genes linked to or implicated in insecticide resistance. We examine the voltage-gated sodium channel gene in three American cohorts and find evidence for successive selective sweeps in Colombia. The most recent sweep encompasses an intermediate-frequency haplotype containing four candidate insecticide resistance mutations that are in near-perfect linkage disequilibrium with one another in the Colombian sample. We hypothesize that this haplotype may continue to rapidly increase in frequency and perhaps spread geographically in the coming years. These results extend our knowledge of how insecticide resistance has evolved in this species and add to a growing body of evidence suggesting that Ae. aegypti has an extensive genomic capacity to rapidly adapt to insecticide-based vector control.
Collapse
|
10
|
Hotaling S, Wilcox ER, Heckenhauer J, Stewart RJ, Frandsen PB. Highly accurate long reads are crucial for realizing the potential of biodiversity genomics. BMC Genomics 2023; 24:117. [PMID: 36927511 PMCID: PMC10018877 DOI: 10.1186/s12864-023-09193-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 02/17/2023] [Indexed: 03/18/2023] Open
Abstract
BACKGROUND Generating the most contiguous, accurate genome assemblies given available sequencing technologies is a long-standing challenge in genome science. With the rise of long-read sequencing, assembly challenges have shifted from merely increasing contiguity to correctly assembling complex, repetitive regions of interest, ideally in a phased manner. At present, researchers largely choose between two types of long read data: longer, but less accurate sequences, or highly accurate, but shorter reads (i.e., >Q20 or 99% accurate). To better understand how these types of long-read data as well as scale of data (i.e., mean length and sequencing depth) influence genome assembly outcomes, we compared genome assemblies for a caddisfly, Hesperophylax magnus, generated with longer, but less accurate, Oxford Nanopore (ONT) R9.4.1 and highly accurate PacBio HiFi (HiFi) data. Next, we expanded this comparison to consider the influence of highly accurate long-read sequence data on genome assemblies across 6750 plant and animal genomes. For this broader comparison, we used HiFi data as a surrogate for highly accurate long-reads broadly as we could identify when they were used from GenBank metadata. RESULTS HiFi reads outperformed ONT reads in all assembly metrics tested for the caddisfly data set and allowed for accurate assembly of the repetitive ~ 20 Kb H-fibroin gene. Across plants and animals, genome assemblies that incorporated HiFi reads were also more contiguous. For plants, the average HiFi assembly was 501% more contiguous (mean contig N50 = 20.5 Mb) than those generated with any other long-read data (mean contig N50 = 4.1 Mb). For animals, HiFi assemblies were 226% more contiguous (mean contig N50 = 20.9 Mb) versus other long-read assemblies (mean contig N50 = 9.3 Mb). In plants, we also found limited evidence that HiFi may offer a unique solution for overcoming genomic complexity that scales with assembly size. CONCLUSIONS Highly accurate long-reads generated with HiFi or analogous technologies represent a key tool for maximizing genome assembly quality for a wide swath of plants and animals. This finding is particularly important when resources only allow for one type of sequencing data to be generated. Ultimately, to realize the promise of biodiversity genomics, we call for greater uptake of highly accurate long-reads in future studies.
Collapse
|
11
|
Yuan R, Zheng B, Li Z, Ma X, Shu X, Qu Q, Ye X, Li S, Tang P, Chen X. The chromosome-level genome of Chinese praying mantis Tenodera sinensis (Mantodea: Mantidae) reveals its biology as a predator. Gigascience 2022; 12:giad090. [PMID: 37882605 PMCID: PMC10600911 DOI: 10.1093/gigascience/giad090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 09/17/2023] [Accepted: 10/04/2023] [Indexed: 10/27/2023] Open
Abstract
BACKGROUND The Chinese praying mantis, Tenodera sinensis (Saussure), is a carnivorous insect that preys on a variety of arthropods and small vertebrates, including pest species. Several studies have been conducted to understand its behavior and physiology. However, there is limited knowledge about the genetic information underlying its genome evolution, digestive demands, and predatory behaviors. FINDINGS Here we have assembled the chromosome-level genome of T. sinensis, representing the first sequenced genome of the family Mantidae, with a genome size of 2.54 Gb and scaffold N50 of 174.78 Mb. Our analyses revealed that 98.6% of BUSCO genes are present, resulting in a well-annotated assembly compared to other insect genomes, containing 25,022 genes. The reconstructed phylogenetic analysis showed the expected topology placing the praying mantis in an appropriate position. Analysis of transposon elements suggested the Gypsy/Dirs family, which belongs to long terminal repeat (LTR) transposons, may be a key factor resulting in the larger genome size. The genome shows expansions in several digestion and detoxification associated gene families, including trypsin and glycosyl hydrolase (GH) genes, ATP-binding cassette (ABC) transporter, and carboxylesterase (CarE), reflecting the possible genomic basis of digestive demands. Furthermore, we have found 1 ultraviolet-sensitive opsin and 2 long-wavelength-sensitive (LWS) opsins, emphasizing the core role of LWS opsins in regulating predatory behaviors. CONCLUSIONS The high-quality genome assembly of the praying mantis provides a valuable repository for studying the evolutionary patterns of the mantis genomes and the gene expression profiles of insect predators.
Collapse
|
12
|
Zhang S, Wang R, Huang C, Zhang L, Sun L. Modulation of Global Gene Expression by Aneuploidy and CNV of Dosage Sensitive Regulatory Genes. Genes (Basel) 2021; 12:genes12101606. [PMID: 34681000 PMCID: PMC8535535 DOI: 10.3390/genes12101606] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 09/29/2021] [Accepted: 10/08/2021] [Indexed: 11/16/2022] Open
Abstract
Aneuploidy, which disrupts the genetic balance due to partial genome dosage changes, is usually more detrimental than euploidy variation. To investigate the modulation of gene expression in aneuploidy, we analyzed the transcriptome sequencing data of autosomal and sex chromosome trisomy in Drosophila. The results showed that most genes on the varied chromosome (cis) present dosage compensation, while the remainder of the genome (trans) produce widespread inverse dosage effects. Some altered functions and pathways were identified as the common characteristics of aneuploidy, and several possible regulatory genes were screened for an inverse dosage effect. Furthermore, we demonstrated that dosage changes of inverse regulator Inr-a/pcf11 can produce a genome-wide inverse dosage effect. All these findings suggest that the mechanism of genomic imbalance is related to the changes in the stoichiometric relationships of macromolecular complex members that affect the overall function. These studies may deepen the understanding of gene expression regulatory mechanisms.
Collapse
|
13
|
Ahmad A, Wallau GL, Ren Z. Characterization of Mariner transposons in seven species of Rhus gall aphids. Sci Rep 2021; 11:16349. [PMID: 34381125 PMCID: PMC8357937 DOI: 10.1038/s41598-021-95843-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 07/23/2021] [Indexed: 02/07/2023] Open
Abstract
Transposable elements (TEs), also known as jumping genes, are widely spread in the genomes of insects and play a considerable role in genomic evolution. Mariner/DD34D family belongs to class II transposable elements which is widely spread in the genomes of insects and have considerable role in genomic evolution. Mariner like elements (MLEs) were searched in the genomes of seven species of Rhus gall aphids belonging to six genera. In total, 121 MLEs were detected in the genomes of the seven investigated species of Rhus gall aphids, which showed a wide distribution in both close and distant related species. The sequences of MLEs ranged from 1 to 1.4 kb in length and the structural analysis of the MLEs showed that only five copies were potentially active with intact open reading frame (ORF) and terminal inverted repeats (TIRs). Phylogenetic analysis showed that all the 121 MLE sequences belonged to four subfamilies, i.e., Mauritiana, Drosophila, Vertumana and Irritans, among which Drosophila and Vertumana subfamilies were reported in aphids for the first time. Our present report revealed the diversity and distribution of MLEs in Rhus gall aphid genomes and expanded our understandings on the characterization of transposable elements in aphid genomes, which might be useful as genetic markers and tools and would play an important role in genomic evolution and adaptation of aphids.
Collapse
|
14
|
Van Dam MH, Cabras AA, Henderson JB, Rominger AJ, Pérez Estrada C, Omer AD, Dudchenko O, Lieberman Aiden E, Lam AW. The Easter Egg Weevil (Pachyrhynchus) genome reveals syntenic patterns in Coleoptera across 200 million years of evolution. PLoS Genet 2021; 17:e1009745. [PMID: 34460814 PMCID: PMC8432895 DOI: 10.1371/journal.pgen.1009745] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 09/10/2021] [Accepted: 07/27/2021] [Indexed: 01/01/2023] Open
Abstract
Patterns of genomic architecture across insects remain largely undocumented or decoupled from a broader phylogenetic context. For instance, it is unknown whether translocation rates differ between insect orders. We address broad scale patterns of genome architecture across Insecta by examining synteny in a phylogenetic framework from open-source insect genomes. To accomplish this, we add a chromosome level genome to a crucial lineage, Coleoptera. Our assembly of the Pachyrhynchus sulphureomaculatus genome is the first chromosome scale genome for the hyperdiverse Phytophaga lineage and currently the largest insect genome assembled to this scale. The genome is significantly larger than those of other weevils, and this increase in size is caused by repetitive elements. Our results also indicate that, among beetles, there are instances of long-lasting (>200 Ma) localization of genes to a particular chromosome with few translocation events. While some chromosomes have a paucity of translocations, intra-chromosomal synteny was almost absent, with gene order thoroughly shuffled along a chromosome. This large amount of reshuffling within chromosomes with few inter-chromosomal events contrasts with patterns seen in mammals in which the chromosomes tend to exchange larger blocks of material more readily. To place our findings in an evolutionary context, we compared syntenic patterns across Insecta in a phylogenetic framework. For the first time, we find that synteny decays at an exponential rate relative to phylogenetic distance. Additionally, there are significant differences in decay rates between insect orders, this pattern was not driven by Lepidoptera alone which has a substantially different rate.
Collapse
|
15
|
Mishra S, Sharma G, Das MK, Pande V, Singh OP. Intragenomic sequence variations in the second internal transcribed spacer (ITS2) ribosomal DNA of the malaria vector Anopheles stephensi. PLoS One 2021; 16:e0253173. [PMID: 34125861 PMCID: PMC8202910 DOI: 10.1371/journal.pone.0253173] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 05/28/2021] [Indexed: 11/18/2022] Open
Abstract
Second Internal Transcribed Spacer (ITS2) ribosomal DNA (rDNA) sequence is a widely used molecular marker for species-identification or -delimitation due to observed concerted evolution which is believed to homogenize rDNA copies in an interbreeding population. However, intra-specific differences in ITS2 of Anopheles stephensi have been reported. This study reports the presence of intragenomic sequence variation in the ITS2-rDNA of An. stephensi and hypothesizes that observed intra-specific differences in this species may have resulted due to ambiguous DNA sequence-chromatogram resulting from intragenomic heterogeneity. Anopheles stephensi collected from different parts of India were sequenced for complete ITS2 and the variable region of 28S-rDNA (d1-d3 domains). Intragenomic variations were found in ITS2 region of all An. stephensi sequenced, but no such variation was observed in d1 to d3 domains of 28S-rDNA. Cloning and sequencing of ITS2 through the d3 domain of the 28S region of rDNA from representative samples from northern, central, and southern India confirmed the presence of intragenomic variation in ITS2 due to transitions at three loci and two bp indel in a di-nucleotide microsatellite locus. Multiple haplotypes were observed in ITS2 raised from such variations. Due to the absence of detectable intragenomic sequence variation in the d1 to d3 domain of 28S rDNA of An. stephensi, this region can serve as an ideal reference sequence for taxonomic and phylogenetic studies. The presence of intragenomic variation in rDNA should be carefully examined before using this as a molecular marker for species delimitation or phylogenetic analyses.
Collapse
|
16
|
Schrader L, Pan H, Bollazzi M, Schiøtt M, Larabee FJ, Bi X, Deng Y, Zhang G, Boomsma JJ, Rabeling C. Relaxed selection underlies genome erosion in socially parasitic ant species. Nat Commun 2021; 12:2918. [PMID: 34006882 PMCID: PMC8131649 DOI: 10.1038/s41467-021-23178-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 04/06/2021] [Indexed: 02/03/2023] Open
Abstract
Inquiline ants are highly specialized and obligate social parasites that infiltrate and exploit colonies of closely related species. They have evolved many times convergently, are often evolutionarily young lineages, and are almost invariably rare. Focusing on the leaf-cutting ant genus Acromyrmex, we compared genomes of three inquiline social parasites with their free-living, closely-related hosts. The social parasite genomes show distinct signatures of erosion compared to the host lineages, as a consequence of relaxed selective constraints on traits associated with cooperative ant colony life and of inquilines having very small effective population sizes. We find parallel gene losses, particularly in olfactory receptors, consistent with inquiline species having highly reduced social behavioral repertoires. Many of the genomic changes that we uncover resemble those observed in the genomes of obligate non-social parasites and intracellular endosymbionts that branched off into highly specialized, host-dependent niches.
Collapse
|
17
|
Mainali S, Colorado FA, Garzon MH. Foretelling the Phenotype of a Genomic Sequence. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:777-783. [PMID: 32287003 DOI: 10.1109/tcbb.2020.2985349] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Estimating phenotypic features (physical and biochemical traits) in a biological organism from their genomic sequence alone and/or environmental conditions has major applications in anthropological paleontology and criminal forensics, for example. To what extent do genomic sequences generally and causally determine phenotypic features of organisms, environmental conditions aside? We present results of two studies, one in blackfly (Insecta:Diptera:Simuliidae) larvae in two species (Simulium ignescens and S. tunja) with four phenotypic features, including the area and spot pattern of the cephalic apotome (in the form of a latin cross on the dorsal side of the head), the postgenal cleft (area under the head on the ventral side) and general body color in larva specimens; the second in strains of Arabidopsis thaliana. They establish that a substantial component of these phenotypic features (over 75 percent) are at least logically inferable, if not causally determined, by genomic fragments alone, despite the fact that these phenotypic features are not 100 percent determined entirely by genetic traits. These results suggest that it is possible to infer the genetic contribution in the determination of specific phenotypic features of a biological organism, without recourse to the causal chain of metabolomics and proteomic events leading to them from genomic sequences.
Collapse
|
18
|
Chakraborty M, Chang CH, Khost DE, Vedanayagam J, Adrion JR, Liao Y, Montooth KL, Meiklejohn CD, Larracuente AM, Emerson JJ. Evolution of genome structure in the Drosophila simulans species complex. Genome Res 2021; 31:380-396. [PMID: 33563718 PMCID: PMC7919458 DOI: 10.1101/gr.263442.120] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 12/28/2020] [Indexed: 12/25/2022]
Abstract
The rapid evolution of repetitive DNA sequences, including satellite DNA, tandem duplications, and transposable elements, underlies phenotypic evolution and contributes to hybrid incompatibilities between species. However, repetitive genomic regions are fragmented and misassembled in most contemporary genome assemblies. We generated highly contiguous de novo reference genomes for the Drosophila simulans species complex (D. simulans, D. mauritiana, and D. sechellia), which speciated ∼250,000 yr ago. Our assemblies are comparable in contiguity and accuracy to the current D. melanogaster genome, allowing us to directly compare repetitive sequences between these four species. We find that at least 15% of the D. simulans complex species genomes fail to align uniquely to D. melanogaster owing to structural divergence-twice the number of single-nucleotide substitutions. We also find rapid turnover of satellite DNA and extensive structural divergence in heterochromatic regions, whereas the euchromatic gene content is mostly conserved. Despite the overall preservation of gene synteny, euchromatin in each species has been shaped by clade- and species-specific inversions, transposable elements, expansions and contractions of satellite and tRNA tandem arrays, and gene duplications. We also find rapid divergence among Y-linked genes, including copy number variation and recent gene duplications from autosomes. Our assemblies provide a valuable resource for studying genome evolution and its consequences for phenotypic evolution in these genetic model species.
Collapse
|
19
|
Coelho VL, de Brito TF, de Abreu Brito IA, Cardoso MA, Berni MA, Araujo HMM, Sammeth M, Pane A. Analysis of ovarian transcriptomes reveals thousands of novel genes in the insect vector Rhodnius prolixus. Sci Rep 2021; 11:1918. [PMID: 33479356 PMCID: PMC7820597 DOI: 10.1038/s41598-021-81387-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 12/30/2020] [Indexed: 01/29/2023] Open
Abstract
Rhodnius prolixus is a Triatominae insect species and a primary vector of Chagas disease. The genome of R. prolixus has been recently sequenced and partially assembled, but few transcriptome analyses have been performed to date. In this study, we describe the stage-specific transcriptomes obtained from previtellogenic stages of oogenesis and from mature eggs. By analyzing ~ 228 million paired-end RNA-Seq reads, we significantly improved the current genome annotations for 9206 genes. We provide extended 5' and 3' UTRs, complete Open Reading Frames, and alternative transcript variants. Strikingly, using a combination of genome-guided and de novo transcriptome assembly we found more than two thousand novel genes, thus increasing the number of genes in R. prolixus from 15,738 to 17,864. We used the improved transcriptome to investigate stage-specific gene expression profiles during R. prolixus oogenesis. Our data reveal that 11,127 genes are expressed in the early previtellogenic stage of oogenesis and their transcripts are deposited in the developing egg including key factors regulating germline development, genome integrity, and the maternal-zygotic transition. In addition, GO term analyses show that transcripts encoding components of the steroid hormone receptor pathway, cytoskeleton, and intracellular signaling are abundant in the mature eggs, where they likely control early embryonic development upon fertilization. Our results significantly improve the R. prolixus genome and transcriptome and provide novel insight into oogenesis and early embryogenesis in this medically relevant insect.
Collapse
|
20
|
Allio R, Nabholz B, Wanke S, Chomicki G, Pérez-Escobar OA, Cotton AM, Clamens AL, Kergoat GJ, Sperling FAH, Condamine FL. Genome-wide macroevolutionary signatures of key innovations in butterflies colonizing new host plants. Nat Commun 2021; 12:354. [PMID: 33441560 PMCID: PMC7806994 DOI: 10.1038/s41467-020-20507-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 12/03/2020] [Indexed: 01/29/2023] Open
Abstract
The mega-diversity of herbivorous insects is attributed to their co-evolutionary associations with plants. Despite abundant studies on insect-plant interactions, we do not know whether host-plant shifts have impacted both genomic adaptation and species diversification over geological times. We show that the antagonistic insect-plant interaction between swallowtail butterflies and the highly toxic birthworts began 55 million years ago in Beringia, followed by several major ancient host-plant shifts. This evolutionary framework provides a valuable opportunity for repeated tests of genomic signatures of macroevolutionary changes and estimation of diversification rates across their phylogeny. We find that host-plant shifts in butterflies are associated with both genome-wide adaptive molecular evolution (more genes under positive selection) and repeated bursts of speciation rates, contributing to an increase in global diversification through time. Our study links ecological changes, genome-wide adaptations and macroevolutionary consequences, lending support to the importance of ecological interactions as evolutionary drivers over long time periods.
Collapse
|
21
|
Trivedi D, CM V, Bisht K, Janardan V, Pandit A, Basak B, H S, Ramesh N, Raghu P. A genome engineering resource to uncover principles of cellular organization and tissue architecture by lipid signaling. eLife 2020; 9:e55793. [PMID: 33320085 PMCID: PMC7771963 DOI: 10.7554/elife.55793] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Accepted: 12/14/2020] [Indexed: 12/14/2022] Open
Abstract
Phosphoinositides (PI) are key regulators of cellular organization in eukaryotes and genes that tune PI signaling are implicated in human disease mechanisms. Biochemical analyses and studies in cultured cells have identified a large number of proteins that can mediate PI signaling. However, the role of such proteins in regulating cellular processes in vivo and development in metazoans remains to be understood. Here, we describe a set of CRISPR-based genome engineering tools that allow the manipulation of each of these proteins with spatial and temporal control during metazoan development. We demonstrate the use of these reagents to deplete a set of 103 proteins individually in the Drosophila eye and identify several new molecules that control eye development. Our work demonstrates the power of this resource in uncovering the molecular basis of tissue homeostasis during normal development and in human disease biology.
Collapse
|
22
|
Peng W, Feng MJ, Chen H, Han BY. [Progress on genome sequencing of Dipteran insects]. YI CHUAN = HEREDITAS 2020; 42:1093-1109. [PMID: 33229316 DOI: 10.16288/j.yczz.20-130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Diptera is among the most diverse holometabolan insect orders and was the earliest order to have a genome fully sequenced. The genomes of 110 fly species have been sequenced and published and many hundreds of population- level genomes have been obtained in the model organisms Drosophila melanogaster and Anopheles gambiae. Comparative genomics elucidate many aspects of the Dipteran biology, thereby providing insights for on the variability in genome structure, genetic mechanisms, and rates and patterns of evolution in genes, species, and populations. Despite the availability of genomic resources in Diptera, there is still a significant lack of information on many other insects. The sequencing of the genomes in Dipteran insects would be of great value to exhibit multiple origins of key fly behaviors such as blood feeding, parasitism, pollination, and mycophagy. In this review, we briefly summarize the distribution and characteristics of Dipteran genomes, introduce the progress of functional genes such as Cytochrome P450, immunity, sex determination and differentiation related genes in Dipteran genome, and highlight the significant findings generated by comparative genomics approach among Dipteran species. This paper provides the guidelines and references for choosing additional taxa for genome sequencing studies in the rapidly developing genome omics era, and offers a fundamental basis for genome-based pest control and management.
Collapse
|
23
|
Gimenez S, Abdelgaffar H, Goff GL, Hilliou F, Blanco CA, Hänniger S, Bretaudeau A, Legeai F, Nègre N, Jurat-Fuentes JL, d'Alençon E, Nam K. Adaptation by copy number variation increases insecticide resistance in the fall armyworm. Commun Biol 2020; 3:664. [PMID: 33184418 PMCID: PMC7661717 DOI: 10.1038/s42003-020-01382-6] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 10/14/2020] [Indexed: 12/19/2022] Open
Abstract
Understanding the genetic basis of insecticide resistance is a key topic in agricultural ecology. The adaptive evolution of multi-copy detoxification genes has been interpreted as a cause of insecticide resistance, yet the same pattern can also be generated by the adaptation to host-plant defense toxins. In this study, we tested in the fall armyworm, Spodoptera frugiperda (Lepidoptera: Noctuidae), if adaptation by copy number variation caused insecticide resistance in two geographically distinct populations with different levels of resistance and the two host-plant strains. We observed a significant allelic differentiation of genomic copy number variations between the two geographic populations, but not between host-plant strains. A locus with positively selected copy number variation included a CYP gene cluster. Toxicological tests supported a central role for CYP enzymes in deltamethrin resistance. Our results indicate that copy number variation of detoxification genes might be responsible for insecticide resistance in fall armyworm and that evolutionary forces causing insecticide resistance could be independent of host-plant adaptation.
Collapse
|
24
|
Erickson PA, Weller CA, Song DY, Bangerter AS, Schmidt P, Bergland AO. Unique genetic signatures of local adaptation over space and time for diapause, an ecologically relevant complex trait, in Drosophila melanogaster. PLoS Genet 2020; 16:e1009110. [PMID: 33216740 PMCID: PMC7717581 DOI: 10.1371/journal.pgen.1009110] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Revised: 12/04/2020] [Accepted: 09/10/2020] [Indexed: 02/07/2023] Open
Abstract
Organisms living in seasonally variable environments utilize cues such as light and temperature to induce plastic responses, enabling them to exploit favorable seasons and avoid unfavorable ones. Local adapation can result in variation in seasonal responses, but the genetic basis and evolutionary history of this variation remains elusive. Many insects, including Drosophila melanogaster, are able to undergo an arrest of reproductive development (diapause) in response to unfavorable conditions. In D. melanogaster, the ability to diapause is more common in high latitude populations, where flies endure harsher winters, and in the spring, reflecting differential survivorship of overwintering populations. Using a novel hybrid swarm-based genome wide association study, we examined the genetic basis and evolutionary history of ovarian diapause. We exposed outbred females to different temperatures and day lengths, characterized ovarian development for over 2800 flies, and reconstructed their complete, phased genomes. We found that diapause, scored at two different developmental cutoffs, has modest heritability, and we identified hundreds of SNPs associated with each of the two phenotypes. Alleles associated with one of the diapause phenotypes tend to be more common at higher latitudes, but these alleles do not show predictable seasonal variation. The collective signal of many small-effect, clinally varying SNPs can plausibly explain latitudinal variation in diapause seen in North America. Alleles associated with diapause are segregating in Zambia, suggesting that variation in diapause relies on ancestral polymorphisms, and both pro- and anti-diapause alleles have experienced selection in North America. Finally, we utilized outdoor mesocosms to track diapause under natural conditions. We found that hybrid swarms reared outdoors evolved increased propensity for diapause in late fall, whereas indoor control populations experienced no such change. Our results indicate that diapause is a complex, quantitative trait with different evolutionary patterns across time and space.
Collapse
|
25
|
Calfee E, Agra MN, Palacio MA, Ramírez SR, Coop G. Selection and hybridization shaped the rapid spread of African honey bee ancestry in the Americas. PLoS Genet 2020; 16:e1009038. [PMID: 33075065 PMCID: PMC7595643 DOI: 10.1371/journal.pgen.1009038] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 10/29/2020] [Accepted: 08/09/2020] [Indexed: 02/07/2023] Open
Abstract
Recent biological invasions offer 'natural' laboratories to understand the genetics and ecology of adaptation, hybridization, and range limits. One of the most impressive and well-documented biological invasions of the 20th century began in 1957 when Apis mellifera scutellata honey bees swarmed out of managed experimental colonies in Brazil. This newly-imported subspecies, native to southern and eastern Africa, both hybridized with and out-competed previously-introduced European honey bee subspecies. Populations of scutellata-European hybrid honey bees rapidly expanded and spread across much of the Americas in less than 50 years. We use broad geographic sampling and whole genome sequencing of over 300 bees to map the distribution of scutellata ancestry where the northern and southern invasions have presently stalled, forming replicated hybrid zones with European bee populations in California and Argentina. California is much farther from Brazil, yet these hybrid zones occur at very similar latitudes, consistent with the invasion having reached a climate barrier. At these range limits, we observe genome-wide clines for scutellata ancestry, and parallel clines for wing length that span hundreds of kilometers, supporting a smooth transition from climates favoring scutellata-European hybrid bees to climates where they cannot survive winter. We find no large effect loci maintaining exceptionally steep ancestry transitions. Instead, we find most individual loci have concordant ancestry clines across South America, with a build-up of somewhat steeper clines in regions of the genome with low recombination rates, consistent with many loci of small effect contributing to climate-associated fitness trade-offs. Additionally, we find no substantial reductions in genetic diversity associated with rapid expansions nor complete dropout of scutellata ancestry at any individual loci on either continent, which suggests that the competitive fitness advantage of scutellata ancestry at lower latitudes has a polygenic basis and that scutellata-European hybrid bees maintained large population sizes during their invasion. To test for parallel selection across continents, we develop a null model that accounts for drift in ancestry frequencies during the rapid expansion. We identify several peaks within a larger genomic region where selection has pushed scutellata ancestry to high frequency hundreds of kilometers past the present cline centers in both North and South America and that may underlie high-fitness traits driving the invasion.
Collapse
|