26
|
Weedall GD, Riveron JM, Hearn J, Irving H, Kamdem C, Fouet C, White BJ, Wondji CS. An Africa-wide genomic evolution of insecticide resistance in the malaria vector Anopheles funestus involves selective sweeps, copy number variations, gene conversion and transposons. PLoS Genet 2020; 16:e1008822. [PMID: 32497040 PMCID: PMC7297382 DOI: 10.1371/journal.pgen.1008822] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 06/16/2020] [Accepted: 05/01/2020] [Indexed: 01/05/2023] Open
Abstract
Insecticide resistance in malaria vectors threatens to reverse recent gains in malaria control. Deciphering patterns of gene flow and resistance evolution in malaria vectors is crucial to improving control strategies and preventing malaria resurgence. A genome-wide survey of Anopheles funestus genetic diversity Africa-wide revealed evidences of a major division between southern Africa and elsewhere, associated with different population histories. Three genomic regions exhibited strong signatures of selective sweeps, each spanning major resistance loci (CYP6P9a/b, GSTe2 and CYP9K1). However, a sharp regional contrast was observed between populations correlating with gene flow barriers. Signatures of complex molecular evolution of resistance were detected with evidence of copy number variation, transposon insertion and a gene conversion between CYP6P9a/b paralog genes. Temporal analyses of samples before and after bed net scale up suggest that these genomic changes are driven by this control intervention. Multiple independent selective sweeps at the same locus in different parts of Africa suggests that local evolution of resistance in malaria vectors may be a greater threat than trans-regional spread of resistance haplotypes. Malaria control currently relies heavily on insecticide-based vector control interventions. Unfortunately, resistance to insecticides is threatening their continued effectiveness. Metabolic resistance has the greatest operational significance, yet it remains unclear how mosquito populations evolutionarily respond to the massive selection pressure from control interventions including insecticide-treated nets. Deciphering patterns of gene flow between populations of major malaria vectors such as Anopheles funestus and elucidating genomic signature of resistance evolution are crucial for designing resistance management strategies and preventing malaria resurgence. Here, we performed a genome-wide survey of An. funestus genetic diversity from across its continental range using reduced-genome representation (ddRADseq) and whole genome (PoolSeq) approaches revealing evidence of significant barriers to gene flow impacting the spread of insecticide resistance alleles. This study detected signatures of strong selective sweeps occurring in genomic regions controlling cytochrome P450-based and glutathione s-transferase metabolic resistance to insecticides in this species. Fine-scale analysis of the major pyrethroid resistance-associated genomic regions revealed complex molecular evolution with evidence of copy number variation, transposon insertion and gene conversion highlighting the risk that if this level of selection and spread of resistance continues unabated, our ability to control malaria with current interventions will be compromised.
Collapse
|
27
|
Han X, He H, Shen H, Tang J, Dong W, Shi Y, Wu S, Zhang F, Liang G. Comparative mitochondrial genome analysis of Dendrolimus houi (Lepidoptera: Lasiocampidae) and phylogenetic relationship among Lasiocampidae species. PLoS One 2020; 15:e0232527. [PMID: 32407393 PMCID: PMC7224488 DOI: 10.1371/journal.pone.0232527] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 04/16/2020] [Indexed: 11/18/2022] Open
Abstract
Dendrolimus houi is one of the most common caterpillars infesting Gymnosperm trees, and widely distributed in several countries in Southeast Asia, and exists soley or coexists with several congeners and some Lasiocampidae species in various forest habitats. However, natural hybrids occasionally occur among some closely related species in the same habitat, and host preference, extreme climate stress, and geographic isolation probably lead to their uncertain taxonomic consensus. The mitochondrial DNA (mtDNA) of D. houi was extracted and sequenced by using high-throughput technology, and the mitogenome composition and characteristics were compared and analyzed of these species, then the phylogenetic relationship was constructed using the maximum likelihood method (ML) and the Bayesian method (BI) based on their 13 protein-coding genes (PCGs) dataset, which were combined and made available to download which were combined and made available to download among global Lasiocampidae species data. Mitogenome of D. houi was 15,373 bp in length, with 37 genes, including 13 PCGs, 22 tRNA genes (tRNAs) and 2 rRNA genes (rRNAs). The positions and sequences of genes were consistent with those of most known Lasiocampidae species. The nucleotide composition was highly A+T biased, accounting for ~80% of the whole mitogenome. All start codons of PCGs belonged to typical start codons ATN except for COI which used CGA, and most stop codons ended with standard TAA or TAG, while COI, COII, ND4 ended with incomplete T. Only tRNASer (AGN) lacked DHU arm, while the remainder formed a typical "clover-shaped" secondary structure. For Lasiocampidae species, their complete mitochondrial genomes ranged from 15,281 to 15,570 bp in length, and all first genes started from trnM in the same direction. And base composition was biased toward A and T. Finally, both two methods (ML and BI) separately revealed that the same phylogenetic relationship of D. spp. as ((((D. punctatus + D. tabulaeformis) + D. spectabilis) + D. superans) + (D. kikuchii of Hunan population + D. houi) as in previous research, but results were different in that D. kikuchii from a Yunnan population was included, indicating that different geographical populations of insects have differentiated. And the phylogenetic relationship among Lasiocampidae species was ((((Dendrolimus) + Kunugia) + Euthrix) + Trabala). This provides a better theoretical basis for Lasiocampidae evolution and classification for future research directions.
Collapse
|
28
|
Harrison BR, Wang L, Gajda E, Hoffman EV, Chung BY, Pletcher SD, Raftery D, Promislow DEL. The metabolome as a link in the genotype-phenotype map for peroxide resistance in the fruit fly, Drosophila melanogaster. BMC Genomics 2020; 21:341. [PMID: 32366330 PMCID: PMC7199327 DOI: 10.1186/s12864-020-6739-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 04/15/2020] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Genetic association studies that seek to explain the inheritance of complex traits typically fail to explain a majority of the heritability of the trait under study. Thus, we are left with a gap in the map from genotype to phenotype. Several approaches have been used to fill this gap, including those that attempt to map endophenotype such as the transcriptome, proteome or metabolome, that underlie complex traits. Here we used metabolomics to explore the nature of genetic variation for hydrogen peroxide (H2O2) resistance in the sequenced inbred Drosophila Genetic Reference Panel (DGRP). RESULTS We first studied genetic variation for H2O2 resistance in 179 DGRP lines and along with identifying the insulin signaling modulator u-shaped and several regulators of feeding behavior, we estimate that a substantial amount of phenotypic variation can be explained by a polygenic model of genetic variation. We then profiled a portion of the aqueous metabolome in subsets of eight 'high resistance' lines and eight 'low resistance' lines. We used these lines to represent collections of genotypes that were either resistant or sensitive to the stressor, effectively modeling a discrete trait. Across the range of genotypes in both populations, flies exhibited surprising consistency in their metabolomic signature of resistance. Importantly, the resistance phenotype of these flies was more easily distinguished by their metabolome profiles than by their genotypes. Furthermore, we found a metabolic response to H2O2 in sensitive, but not in resistant genotypes. Metabolomic data further implicated at least two pathways, glycogen and folate metabolism, as determinants of sensitivity to H2O2. We also discovered a confounding effect of feeding behavior on assays involving supplemented food. CONCLUSIONS This work suggests that the metabolome can be a point of convergence for genetic variation influencing complex traits, and can efficiently elucidate mechanisms underlying trait variation.
Collapse
|
29
|
Aguiar ERGR, de Almeida JPP, Queiroz LR, Oliveira LS, Olmo RP, de Faria IJDS, Imler JL, Gruber A, Matthews BJ, Marques JT. A single unidirectional piRNA cluster similar to the flamenco locus is the major source of EVE-derived transcription and small RNAs in Aedes aegypti mosquitoes. RNA (NEW YORK, N.Y.) 2020; 26:581-594. [PMID: 31996404 PMCID: PMC7161354 DOI: 10.1261/rna.073965.119] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 01/22/2020] [Indexed: 06/10/2023]
Abstract
Endogenous viral elements (EVEs) are found in many eukaryotic genomes. Despite considerable knowledge about genomic elements such as transposons (TEs) and retroviruses, we still lack information about nonretroviral EVEs. Aedes aegypti mosquitoes have a highly repetitive genome that is covered with EVEs. Here, we identified 129 nonretroviral EVEs in the AaegL5 version of the A. aegypti genome. These EVEs were significantly associated with TEs and preferentially located in repeat-rich clusters within intergenic regions. Genome-wide transcriptome analysis showed that most EVEs generated transcripts although only around 1.4% were sense RNAs. The majority of EVE transcription was antisense and correlated with the generation of EVE-derived small RNAs. A single genomic cluster of EVEs located in a 143 kb repetitive region in chromosome 2 contributed with 42% of antisense transcription and 45% of small RNAs derived from viral elements. This region was enriched for TE-EVE hybrids organized in the same coding strand. These generated a single long antisense transcript that correlated with the generation of phased primary PIWI-interacting RNAs (piRNAs). The putative promoter of this region had a conserved binding site for the transcription factor Cubitus interruptus, a key regulator of the flamenco locus in Drosophila melanogaster Here, we have identified a single unidirectional piRNA cluster in the A. aegypti genome that is the major source of EVE transcription fueling the generation of antisense small RNAs in mosquitoes. We propose that this region is a flamenco-like locus in A. aegypti due to its relatedness to the major unidirectional piRNA cluster in Drosophila melanogaster.
Collapse
|
30
|
Ramos-Nino ME, Fitzpatrick DM, Eckstrom KM, Tighe S, Hattaway LM, Hsueh AN, Stone DM, Dragon JA, Cheetham S. Metagenomic analysis of Aedes aegypti and Culex quinquefasciatus mosquitoes from Grenada, West Indies. PLoS One 2020; 15:e0231047. [PMID: 32282857 PMCID: PMC7153883 DOI: 10.1371/journal.pone.0231047] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Accepted: 03/14/2020] [Indexed: 02/06/2023] Open
Abstract
The mosquitoes Aedes aegypti (Linnaeus, 1762) (Diptera: Culicidae) and Culex quinquefasciatus Say, 1823 (Diptera: Culicidae) are two major vectors of arthropod-borne pathogens in Grenada, West Indies. As conventional vector control methods present many challenges, alternatives are urgently needed. Manipulation of mosquito microbiota is emerging as a field for the development of vector control strategies. Critical to this vector control approach is knowledge of the microbiota of these mosquitoes and finding candidate microorganisms that are common to the vectors with properties that could be used in microbiota modification studies. Results showed that bacteria genera including Asaia, Escherichia, Pantoea, Pseudomonas, and Serratia are common to both major arboviral vectors in Grenada and have previously been shown to be good candidates for transgenetic studies. Also, for the first time, the presence of Grenada mosquito rhabdovirus 1 is reported in C. quinquefasciatus.
Collapse
|
31
|
Bayega A, Djambazian H, Tsoumani KT, Gregoriou ME, Sagri E, Drosopoulou E, Mavragani-Tsipidou P, Giorda K, Tsiamis G, Bourtzis K, Oikonomopoulos S, Dewar K, Church DM, Papanicolaou A, Mathiopoulos KD, Ragoussis J. De novo assembly of the olive fruit fly (Bactrocera oleae) genome with linked-reads and long-read technologies minimizes gaps and provides exceptional Y chromosome assembly. BMC Genomics 2020; 21:259. [PMID: 32228451 PMCID: PMC7106766 DOI: 10.1186/s12864-020-6672-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2019] [Accepted: 03/13/2020] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND The olive fruit fly, Bactrocera oleae, is the most important pest in the olive fruit agribusiness industry. This is because female flies lay their eggs in the unripe fruits and upon hatching the larvae feed on the fruits thus destroying them. The lack of a high-quality genome and other genomic and transcriptomic data has hindered progress in understanding the fly's biology and proposing alternative control methods to pesticide use. RESULTS Genomic DNA was sequenced from male and female Demokritos strain flies, maintained in the laboratory for over 45 years. We used short-, mate-pair-, and long-read sequencing technologies to generate a combined male-female genome assembly (GenBank accession GCA_001188975.2). Genomic DNA sequencing from male insects using 10x Genomics linked-reads technology followed by mate-pair and long-read scaffolding and gap-closing generated a highly contiguous 489 Mb genome with a scaffold N50 of 4.69 Mb and L50 of 30 scaffolds (GenBank accession GCA_001188975.4). RNA-seq data generated from 12 tissues and/or developmental stages allowed for genome annotation. Short reads from both males and females and the chromosome quotient method enabled identification of Y-chromosome scaffolds which were extensively validated by PCR. CONCLUSIONS The high-quality genome generated represents a critical tool in olive fruit fly research. We provide an extensive RNA-seq data set, and genome annotation, critical towards gaining an insight into the biology of the olive fruit fly. In addition, elucidation of Y-chromosome sequences will advance our understanding of the Y-chromosome's organization, function and evolution and is poised to provide avenues for sterile insect technique approaches.
Collapse
|
32
|
Yang L, Dai J, Gao Q, Yuan G, Liu J, Sun Y, Sun Y, Wang L, Qian C, Zhu B, Liu C, Wei G. Characterization of the complete mitochondrial genome of Orthaga olivacea Warre (Lepidoptera Pyralidae) and comparison with other Lepidopteran insects. PLoS One 2020; 15:e0227831. [PMID: 32142522 PMCID: PMC7059908 DOI: 10.1371/journal.pone.0227831] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Accepted: 12/30/2019] [Indexed: 02/01/2023] Open
Abstract
Orthaga olivacea Warre (Lepidoptera: Pyralidae) is an important agricultural pest of camphor trees (Cinnamomum camphora). To further supplement the known genome-level features of related species, the complete mitochondrial genome of Orthaga olivacea is amplified, sequenced, annotated, analyzed, and compared with 58 other species of Lepidopteran. The complete sequence is 15,174 bp, containing 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes, and a putative control region. Base composition is biased toward adenine and thymine (79.02% A+T) and A+T skew are slightly negative. Twelve of the 13 PCGs use typical ATN start codons. The exception is cytochrome oxidase 1 (cox1) that utilizes a CGA initiation codon. Nine PCGs have standard termination codon (TAA); others have incomplete stop codons, a single T or TA nucleotide. All the tRNA genes have the typical clover-leaf secondary structure, except for trnS(AGN), in which dihydrouridine (DHU) arm fails to form a stable stem-loop structure. The A+T-rich region (293 bp) contains a typical Lepidopter motifs 'ATAGA' followed by a 17 bp poly-T stretch, and a microsatellite-like (AT)13 repeat. Codon usage analysis revealed that Asn, Ile, Leu2, Lys, Tyr and Phe were the most frequently used amino acids, while Cys was the least utilized. Phylogenetic analysis suggested that among sequenced lepidopteran mitochondrial genomes, Orthaga olivacea Warre was most closely related to Hypsopygia regina, and confirmed that Orthaga olivacea Warre belongs to the Pyralidae family.
Collapse
|
33
|
Duran DP, Laroche RA, Gough HM, Gwiazdowski RA, Knisley CB, Herrmann DP, Roman SJ, Egan SP. Geographic Life History Differences Predict Genomic Divergence Better than Mitochondrial Barcodes or Phenotype. Genes (Basel) 2020; 11:E265. [PMID: 32121321 PMCID: PMC7140875 DOI: 10.3390/genes11030265] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2020] [Revised: 02/21/2020] [Accepted: 02/26/2020] [Indexed: 01/23/2023] Open
Abstract
Species diversity can be inferred using multiple data types, however, results based on genetic data can be at odds with patterns of phenotypic variation. Tiger beetles of the Cicindelidiapolitula (LeConte, 1875) species complex have been taxonomically problematic due to extreme phenotypic variation within and between populations. To better understand the biology and taxonomy of this group, we used mtDNA genealogies and multilocus nuclear analyses of 34,921 SNPs to elucidate its evolutionary history and evaluate the validity of phenotypically circumscribed species and subspecies. Genetic analyses recovered two divergent species that are also ecologically distinct, based on adult life history. These patterns are incongruous with the phenotypic variation that informed prior taxonomy, and most subspecies were not supported as distinct evolutionary lineages. One of the nominal subspecies was found to be a cryptic species; consequently, we elevate C. p.laetipennis (Horn, 1913) to a full species. Although nuclear and mtDNA datasets recovered broadly similar evolutionary units, mito-nuclear discordance was more common than expected, being observed between nearly all geographically overlapping taxonomic pairs. Additionally, a pattern of 'mitochondrial displacement' was observed, where mitochondria from one species unidirectionally displace others. Overall, we found that geographically associated life history factors better predict genomic divergence than phenotype and mitochondrial genealogies, and consequently taxon identifications based on mtDNA (e.g., DNA barcodes) may be misleading.
Collapse
|
34
|
Chung SH, Parker BJ, Blow F, Brisson JA, Douglas AE. Host and symbiont genetic determinants of nutritional phenotype in a natural population of the pea aphid. Mol Ecol 2020; 29:848-858. [PMID: 31945243 DOI: 10.1111/mec.15355] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 12/20/2019] [Accepted: 01/08/2020] [Indexed: 12/17/2022]
Abstract
A defining feature of the nutritional ecology of plant sap-feeding insects is that the dietary deficit of essential amino acids (EAAs) in plant sap is supplemented by EAA-provisioning microbial symbionts in the insect. Here, we demonstrated substantial variation in the nutritional phenotype of 208 genotypes of the pea aphid Acyrthosiphon pisum collected from a natural population. Specifically, the genotypes varied in performance (larval growth rates) on four test diets lacking the EAAs arginine, histidine and methionine or aromatic EAAs (phenylalanine and tryptophan), relative to the diet containing all EAAs. These data indicate that EAA supply from the symbiotic bacteria Buchnera can meet total aphid nutritional demand for only a subset of the EAA/aphid genotype combinations. We then correlated single nucleotide polymorphisms (SNPs) identified in the aphid and Buchnera genomes by reduced genome sequencing against aphid performance for each EAA deletion diet. This yielded significant associations between performance on the histidine-free diet and Buchnera SNPs, including metabolism genes predicted to influence histidine biosynthesis. Aphid genetic correlates of performance were obtained for all four deletion diets, with associations on the arginine-free diet and aromatic-free diets dominated by genes functioning in the regulation of metabolic and cellular processes. The specific aphid genes associated with performance on different EAA deletion diets are largely nonoverlapping, indicating some independence in the regulatory circuits determining aphid phenotype for the different EAAs. This study demonstrates how variation in the phenotype of associations collected from natural populations can be applied to elucidate the genetic basis of ecologically important traits in systems intractable to traditional forward/reverse genetic techniques.
Collapse
|
35
|
Moest M, Van Belleghem SM, James JE, Salazar C, Martin SH, Barker SL, Moreira GRP, Mérot C, Joron M, Nadeau NJ, Steiner FM, Jiggins CD. Selective sweeps on novel and introgressed variation shape mimicry loci in a butterfly adaptive radiation. PLoS Biol 2020; 18:e3000597. [PMID: 32027643 PMCID: PMC7029882 DOI: 10.1371/journal.pbio.3000597] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 02/19/2020] [Accepted: 01/15/2020] [Indexed: 11/21/2022] Open
Abstract
Natural selection leaves distinct signatures in the genome that can reveal the targets and history of adaptive evolution. By analysing high-coverage genome sequence data from 4 major colour pattern loci sampled from nearly 600 individuals in 53 populations, we show pervasive selection on wing patterns in the Heliconius adaptive radiation. The strongest signatures correspond to loci with the greatest phenotypic effects, consistent with visual selection by predators, and are found in colour patterns with geographically restricted distributions. These recent sweeps are similar between co-mimics and indicate colour pattern turn-over events despite strong stabilising selection. Using simulations, we compare sweep signatures expected under classic hard sweeps with those resulting from adaptive introgression, an important aspect of mimicry evolution in Heliconius butterflies. Simulated recipient populations show a distinct 'volcano' pattern with peaks of increased genetic diversity around the selected target, characteristic of sweeps of introgressed variation and consistent with diversity patterns found in some populations. Our genomic data reveal a surprisingly dynamic history of colour pattern selection and co-evolution in this adaptive radiation.
Collapse
|
36
|
Brand P, Hinojosa-Díaz IA, Ayala R, Daigle M, Yurrita Obiols CL, Eltz T, Ramírez SR. The evolution of sexual signaling is linked to odorant receptor tuning in perfume-collecting orchid bees. Nat Commun 2020; 11:244. [PMID: 31932598 PMCID: PMC6957680 DOI: 10.1038/s41467-019-14162-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 12/18/2019] [Indexed: 12/16/2022] Open
Abstract
Sexual signaling is an important reproductive barrier known to evolve early during the formation of new species, but the genetic mechanisms that facilitate the divergence of sexual signals remain elusive. Here we isolate a gene linked to the rapid evolution of a signaling trait in a pair of nascent neotropical orchid bee lineages, Euglossa dilemma and E. viridissima. Male orchid bees acquire chemical compounds from their environment to concoct species-specific perfumes to later expose during courtship. We find that the two lineages acquire chemically distinct perfumes and are reproductively isolated despite low levels of genome-wide differentiation. Remarkably, variation in perfume chemistry coincides with rapid divergence in few odorant receptor (OR) genes. Using functional assays, we demonstrate that the derived variant of Or41 in E. dilemma is specific towards its species-specific major perfume compound, whereas the ancestral variant in E. viridissima is broadly tuned to multiple odorants. Our results show that OR evolution likely played a role in the divergence of sexual communication in natural populations.
Collapse
|
37
|
Cottee MA, Letham SC, Young GR, Stoye JP, Taylor IA. Structure of Drosophila melanogaster ARC1 reveals a repurposed molecule with characteristics of retroviral Gag. SCIENCE ADVANCES 2020; 6:eaay6354. [PMID: 31911950 PMCID: PMC6938703 DOI: 10.1126/sciadv.aay6354] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 11/06/2019] [Indexed: 06/10/2023]
Abstract
The tetrapod neuronal protein ARC and its Drosophila melanogaster homolog, dARC1, have important but differing roles in neuronal development. Both are thought to originate through exaptation of ancient Ty3/Gypsy retrotransposon Gag, with their novel function relying on an original capacity for self-assembly and encapsidation of nucleic acids. Here, we present the crystal structure of dARC1 CA and examine the relationship between dARC1, mammalian ARC, and the CA protein of circulating retroviruses. We show that while the overall architecture is highly related to that of orthoretroviral and spumaretroviral CA, there are substantial deviations in both amino- and carboxyl-terminal domains, potentially affecting recruitment of partner proteins and particle assembly. The degree of sequence and structural divergence suggests that Ty3/Gypsy Gag has been exapted on two separate occasions and that, although mammalian ARC and dARC1 share functional similarity, the structures have undergone different adaptations after appropriation into the tetrapod and insect genomes.
Collapse
|
38
|
Srinivasan A, Mishra RK. Genomic organization of Polycomb Response Elements and its functional implication in Drosophila and other insects. J Biosci 2020; 45:12. [PMID: 31965990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The epigenetic memory is an essential aspect of multicellular organisms to maintain several cell types and their gene expression pattern. This complex process uses a number of protein factors and specific DNA elements within the developmental cues to achieve this. The protein factors involved in the process are the Polycomb group (PcG) members, and, accordingly, the DNA sequences that interact with these proteins are called Polycomb Response Elements (PREs). Since the PcG proteins are highly conserved among higher eukaryotes, including insects, and function at thousands of sites in the genomes, it is expected that PREs mayalso be present across the genome.However, the studies on PREs in insect species, other thanDrosophila, is currently lacking.We took a bioinformatics approach to develop an inclusive PRE prediction tool, 'PRE Mapper', to address this need. By applying this tool on the Drosophila melanogaster genome, we predicted greater than 20,000 PREs.When comparedwith the available PRE prediction methods, this tool shows far better performance by correctly identifying the in vivo binding sites of PcG proteins, identified by genome-scale ChIP experiments. Further analysis of the predicted PREs shows their cohabitation with chromatin domain boundary elements at several places in the Drosophila genome, possibly defining a composite epigenetic module.We analysed 10 insect genomes in this context and find several conserved features in PREs across the insect species with some variations in their occurrence frequency. These analyses leading to the identification of PREin insect genomes contribute to our understanding of epigenetic mechanisms in these organisms.
Collapse
|
39
|
Antonazzo G, Urbano JM, Marygold SJ, Millburn GH, Brown NH. Building a pipeline to solicit expert knowledge from the community to aid gene summary curation. Database (Oxford) 2020; 2020:baz152. [PMID: 31960022 PMCID: PMC6971343 DOI: 10.1093/database/baz152] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 10/29/2019] [Accepted: 12/12/2019] [Indexed: 11/25/2022]
Abstract
Brief summaries describing the function of each gene's product(s) are of great value to the research community, especially when interpreting genome-wide studies that reveal changes to hundreds of genes. However, manually writing such summaries, even for a single species, is a daunting task; for example, the Drosophila melanogaster genome contains almost 14 000 protein-coding genes. One solution is to use computational methods to generate summaries, but this often fails to capture the key functions or express them eloquently. Here, we describe how we solicited help from the research community to generate manually written summaries of D. melanogaster gene function. Based on the data within the FlyBase database, we developed a computational pipeline to identify researchers who have worked extensively on each gene. We e-mailed these researchers to ask them to draft a brief summary of the main function(s) of the gene's product, which we edited for consistency to produce a 'gene snapshot'. This approach yielded 1800 gene snapshot submissions within a 3-month period. We discuss the general utility of this strategy for other databases that capture data from the research literature. Database URL: https://flybase.org/.
Collapse
|
40
|
Gulati P, Kohli S, Narang A, Brahmachari V. Mining histone methyltransferases and demethylases from whole genome sequence. J Biosci 2020; 45:9. [PMID: 31965987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Epigenetic regulation through post-translational modification of histones, especially methylation, is well conserved in evolution. Although there are several insect genomes sequenced, an analysis with a focus on their epigenetic repertoire is limited. We have utilized a novel work-flow to identify one or more domains as highpriority domain (HPD), if present in at least 50% of the genes of a given functional class in the reference genome, namely, that of Drosophila melanogaster. Based on this approach, we have mined histone methyltransferases and demethylases from the whole genome sequence of Aedes aegypti (Diptera), the pea aphid Acyrthosiphon pisum, the triatomid bug Rhodnius prolixus (Hemiptera), the honeybee Apis mellifera (Hymenoptera), the silkworm Bombyx mori (Lepidoptera) and the red flour beetle Tribolium castaneum (Coleoptera). We identified 38 clusters consisting of arginine methyltransferases, lysine methyltransferases and demethylases using OrthoFinder, and the presence of HPD was queried in these sequences using InterProScan. This approach led us to identify putative novel members and currently inaccurate ones. Other than the highpriority domains, these proteins contain shared and unique domains that can mediate protein-protein interaction. Phylogenetic analysis indicates that there is different extent of protein sequence similarity; average similarity between histone lysine methyltransferases varies from 41% (for active mark) to 48% (for repressive mark), arginine methyltransferases is 51%, and demethylases is 52%. The method utilized here facilitates reliable identification of desired functional class in newly sequenced genomes.
Collapse
|
41
|
Silva BSML, Heringer P, Dias GB, Svartman M, Kuhn GCS. De novo identification of satellite DNAs in the sequenced genomes of Drosophila virilis and D. americana using the RepeatExplorer and TAREAN pipelines. PLoS One 2019; 14:e0223466. [PMID: 31856171 PMCID: PMC6922343 DOI: 10.1371/journal.pone.0223466] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 11/26/2019] [Indexed: 01/10/2023] Open
Abstract
Satellite DNAs are among the most abundant repetitive DNAs found in eukaryote genomes, where they participate in a variety of biological roles, from being components of important chromosome structures to gene regulation. Experimental methodologies used before the genomic era were insufficient, too laborious and time-consuming to recover the collection of all satDNAs from a genome. Today, the availability of whole sequenced genomes combined with the development of specific bioinformatic tools are expected to foster the identification of virtually all the "satellitome" of a particular species. While whole genome assemblies are important to obtain a global view of genome organization, most of them are incomplete and lack repetitive regions. We applied short-read sequencing and similarity clustering in order to perform a de novo identification of the most abundant satellite families in two Drosophila species from the virilis group: Drosophila virilis and D. americana, using the Tandem Repeat Analyzer (TAREAN) and RepeatExplorer pipelines. These species were chosen because they have been used as models to understand satDNA biology since the early 70's. We combined the computational approach with data from the literature and chromosome mapping to obtain an overview of the major tandem repeat sequences of these species. The fact that all of the abundant tandem repeats (TRs) we detected were previously identified in the literature allowed us to evaluate the efficiency of TAREAN in correctly identifying true satDNAs. Our results indicate that raw sequencing reads can be efficiently used to detect satDNAs, but that abundant tandem repeats present in dispersed arrays or associated with transposable elements are frequent false positives. We demonstrate that TAREAN with its parent method RepeatExplorer may be used as resources to detect tandem repeats associated with transposable elements and also to reveal families of dispersed tandem repeats.
Collapse
|
42
|
Heckenhauer J, Frandsen PB, Gupta DK, Paule J, Prost S, Schell T, Schneider JV, Stewart RJ, Pauls SU. Annotated Draft Genomes of Two Caddisfly Species Plectrocnemia conspersa CURTIS and Hydropsyche tenuis NAVAS (Insecta: Trichoptera). Genome Biol Evol 2019; 11:3445-3451. [PMID: 31774498 PMCID: PMC6916706 DOI: 10.1093/gbe/evz264] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/25/2019] [Indexed: 02/06/2023] Open
Abstract
Members of the speciose insect order Trichoptera (caddisflies) provide important ecosystem services, for example, nutrient cycling through breaking down of organic matter. They are also of industrial interest due to their larval silk secretions. These form the basis for their diverse case-making behavior that allows them to exploit a wide range of ecological niches. Only five genomes of this order have been published thus far, with variable qualities regarding contiguity and completeness. A low-cost sequencing strategy, that is, using a single Oxford Nanopore flow cell per individual along with Illumina sequence reads was successfully used to generate high-quality genomes of two Trichoptera species, Plectrocnemia conspersa and Hydropsyche tenuis. Of the de novo assembly methods compared, assembly of low coverage Nanopore reads (∼18×) and subsequent polishing with long reads followed by Illumina short reads (∼80-170× coverage) yielded the highest genome quality both in terms of contiguity and BUSCO completeness. The presented genomes are the shortest to date and extend our knowledge of genome size across caddisfly families. The genomic region that encodes for light (L)-chain fibroin, a protein component of larval caddisfly silk was identified and compared with existing L-fibroin gene clusters. The new genomic resources presented in this paper are among the highest quality Trichoptera genomes and will increase the knowledge of this important insect order by serving as the basis for phylogenomic and comparative genomic studies.
Collapse
|
43
|
Abstract
Balancer chromosomes are multiply inverted and rearranged chromosomes that are widely used in Drosophila genetics. First described nearly 100 years ago, balancers are used extensively in stock maintenance and complex crosses. Recently, the complete molecular structures of several commonly used balancers were determined by whole-genome sequencing. This revealed a surprising amount of variation among balancers derived from a common progenitor, identified genes directly affected by inversion breakpoints, and cataloged mutations shared by balancers. These studies emphasized that it is important to choose the optimal balancer, because different inversions suppress meiotic recombination in different chromosomal regions. In this review, we provide a brief history of balancers in Drosophila, discuss how they are used today, and provide examples of unexpected recombination events involving balancers that can lead to stock breakdown.
Collapse
|
44
|
Matthews NE, White R. Chromatin Architecture in the Fly: Living without CTCF/Cohesin Loop Extrusion?: Alternating Chromatin States Provide a Basis for Domain Architecture in Drosophila. Bioessays 2019; 41:e1900048. [PMID: 31264253 DOI: 10.1002/bies.201900048] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Revised: 05/08/2019] [Indexed: 12/15/2022]
Abstract
The organization of the genome into topologically associated domains (TADs) appears to be a fundamental process occurring across a wide range of eukaryote organisms, and it likely plays an important role in providing an architectural foundation for gene regulation. Initial studies emphasized the remarkable parallels between TAD organization in organisms as diverse as Drosophila and mammals. However, whereas CCCTC-binding factor (CTCF)/cohesin loop extrusion is emerging as a key mechanism for the formation of mammalian topological domains, the genome organization in Drosophila appears to depend primarily on the partitioning of chromatin state domains. Recent work suggesting a fundamental conserved role of chromatin state in building domain architecture is discussed and insights into genome organization from recent studies in Drosophila are considered.
Collapse
|
45
|
Chen H, Lin L, Xie M, Zhong Y, Zhang G, Su W. Survey of the Bradysia odoriphaga Transcriptome Using PacBio Single-Molecule Long-Read Sequencing. Genes (Basel) 2019; 10:genes10060481. [PMID: 31242713 PMCID: PMC6627194 DOI: 10.3390/genes10060481] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Revised: 06/20/2019] [Accepted: 06/22/2019] [Indexed: 11/16/2022] Open
Abstract
The damage caused by Bradysia odoriphaga is the main factor threatening the production of vegetables in the Liliaceae family. However, few genetic studies of B. odoriphaga have been conducted because of a lack of genomic resources. Many long-read sequencing technologies have been developed in the last decade; therefore, in this study, the transcriptome including all development stages of B. odoriphaga was sequenced for the first time by Pacific single-molecule long-read sequencing. Here, 39,129 isoforms were generated, and 35,645 were found to have annotation results when checked against sequences available in different databases. Overall, 18,473 isoforms were distributed in 25 various Clusters of Orthologous Groups, and 11,880 isoforms were categorized into 60 functional groups that belonged to the three main Gene Ontology classifications. Moreover, 30,610 isoforms were assigned into 44 functional categories belonging to six main Kyoto Encyclopedia of Genes and Genomes functional categories. Coding DNA sequence (CDS) prediction showed that 36,419 out of 39,129 isoforms were predicted to have CDS, and 4319 simple sequence repeats were detected in total. Finally, 266 insecticide resistance and metabolism-related isoforms were identified as candidate genes for further investigation of insecticide resistance and metabolism in B. odoriphaga.
Collapse
|
46
|
Shin SC, Kim H, Lee JH, Kim HW, Park J, Choi BS, Lee SC, Kim JH, Lee H, Kim S. Nanopore sequencing reads improve assembly and gene annotation of the Parochlus steinenii genome. Sci Rep 2019; 9:5095. [PMID: 30911035 PMCID: PMC6434015 DOI: 10.1038/s41598-019-41549-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Accepted: 03/08/2019] [Indexed: 02/07/2023] Open
Abstract
Parochlus steinenii is a winged midge from King George Island. It is cold-tolerant and endures the harsh Antarctic winter. Previously, we reported the genome of this midge, but the genome assembly with short reads had limited contig contiguity, which reduced the completeness of the genome assembly and the annotated gene sets. Recently, assembly contiguity has been increased using nanopore technology. A number of methods for enhancing the low base quality of the assembly have been reported, including long-read (e.g. Nanopolish) or short-read (e.g. Pilon) based methods. Based on these advances, we used nanopore technologies to upgrade the draft genome sequence of P. steinenii. The final assembled genome was 145,366,448 bases in length. The contig number decreased from 9,132 to 162, and the N50 contig size increased from 36,946 to 1,989,550 bases. The BUSCO completeness of the assembly increased from 87.8 to 98.7%. Improved assembly statistics helped predict more genes from the draft genome of P. steinenii. The completeness of the predicted gene model increased from 79.5 to 92.1%, but the numbers and types of the predicted repeats were similar to those observed in the short read assembly, with the exception of long interspersed nuclear elements. In the present study, we markedly improved the P. steinenii genome assembly statistics using nanopore sequencing, but found that genome polishing with high-quality reads was essential for improving genome annotation. The number of genes predicted and the lengths of the genes were greater than before, and nanopore technology readily improved genome information.
Collapse
|
47
|
Fuller ZL, Koury SA, Phadnis N, Schaeffer SW. How chromosomal rearrangements shape adaptation and speciation: Case studies in Drosophila pseudoobscura and its sibling species Drosophila persimilis. Mol Ecol 2019; 28:1283-1301. [PMID: 30402909 PMCID: PMC6475473 DOI: 10.1111/mec.14923] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Revised: 09/30/2018] [Accepted: 10/09/2018] [Indexed: 01/01/2023]
Abstract
The gene arrangements of Drosophila have played a prominent role in the history of evolutionary biology from the original quantification of genetic diversity to current studies of the mechanisms for the origin and establishment of new inversion mutations within populations and their subsequent fixation between species supporting reproductive barriers. This review examines the genetic causes and consequences of inversions as recombination suppressors and the role that recombination suppression plays in establishing inversions in populations as they are involved in adaptation within heterogeneous environments. This often results in the formation of clines of gene arrangement frequencies among populations. Recombination suppression leads to the differentiation of the gene arrangements which may accelerate the accumulation of fixed genetic differences among populations. If these fixed mutations cause incompatibilities, then inversions pose important reproductive barriers between species. This review uses the evolution of inversions in Drosophila pseudoobscura and D. persimilis as a case study for how inversions originate, establish and contribute to the evolution of reproductive isolation.
Collapse
|
48
|
Lerat E, Goubert C, Guirao‐Rico S, Merenciano M, Dufour A, Vieira C, González J. Population-specific dynamics and selection patterns of transposable element insertions in European natural populations. Mol Ecol 2019; 28:1506-1522. [PMID: 30506554 PMCID: PMC6849870 DOI: 10.1111/mec.14963] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 10/30/2018] [Accepted: 11/05/2018] [Indexed: 01/02/2023]
Abstract
Transposable elements (TEs) are ubiquitous sequences in genomes of virtually all species. While TEs have been investigated for several decades, only recently we have the opportunity to study their genome-wide population dynamics. Most of the studies so far have been restricted either to the analysis of the insertions annotated in the reference genome or to the analysis of a limited number of populations. Taking advantage of the European Drosophila population genomics consortium (DrosEU) sequencing data set, we have identified and measured the dynamics of TEs in a large sample of European Drosophila melanogaster natural populations. We showed that the mobilome landscape is population-specific and highly diverse depending on the TE family. In contrast with previous studies based on SNP variants, no geographical structure was observed for TE abundance or TE divergence in European populations. We further identified de novo individual insertions using two available programs and, as expected, most of the insertions were present at low frequencies. Nevertheless, we identified a subset of TEs present at high frequencies and located in genomic regions with a high recombination rate. These TEs are candidates for being the target of positive selection, although neutral processes should be discarded before reaching any conclusion on the type of selection acting on them. Finally, parallel patterns of association between the frequency of TE insertions and several geographical and temporal variables were found between European and North American populations, suggesting that TEs can be potentially implicated in the adaptation of populations across continents.
Collapse
|
49
|
Rech GE, Bogaerts-Márquez M, Barrón MG, Merenciano M, Villanueva-Cañas JL, Horváth V, Fiston-Lavier AS, Luyten I, Venkataram S, Quesneville H, Petrov DA, González J. Stress response, behavior, and development are shaped by transposable element-induced mutations in Drosophila. PLoS Genet 2019; 15:e1007900. [PMID: 30753202 PMCID: PMC6372155 DOI: 10.1371/journal.pgen.1007900] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Accepted: 12/16/2018] [Indexed: 11/30/2022] Open
Abstract
Most of the current knowledge on the genetic basis of adaptive evolution is based on the analysis of single nucleotide polymorphisms (SNPs). Despite increasing evidence for their causal role, the contribution of structural variants to adaptive evolution remains largely unexplored. In this work, we analyzed the population frequencies of 1,615 Transposable Element (TE) insertions annotated in the reference genome of Drosophila melanogaster, in 91 samples from 60 worldwide natural populations. We identified a set of 300 polymorphic TEs that are present at high population frequencies, and located in genomic regions with high recombination rate, where the efficiency of natural selection is high. The age and the length of these 300 TEs are consistent with relatively young and long insertions reaching high frequencies due to the action of positive selection. Besides, we identified a set of 21 fixed TEs also likely to be adaptive. Indeed, we, and others, found evidence of selection for 84 of these reference TE insertions. The analysis of the genes located nearby these 84 candidate adaptive insertions suggested that the functional response to selection is related with the GO categories of response to stimulus, behavior, and development. We further showed that a subset of the candidate adaptive TEs affects expression of nearby genes, and five of them have already been linked to an ecologically relevant phenotypic effect. Our results provide a more complete understanding of the genetic variation and the fitness-related traits relevant for adaptive evolution. Similar studies should help uncover the importance of TE-induced adaptive mutations in other species as well.
Collapse
|
50
|
Zhu YN, Wang LZ, Li CC, Cui Y, Wang M, Lin YJ, Zhao RP, Wang W, Xiang H. Artificial selection on storage protein 1 possibly contributes to increase of hatchability during silkworm domestication. PLoS Genet 2019; 15:e1007616. [PMID: 30668559 PMCID: PMC6358105 DOI: 10.1371/journal.pgen.1007616] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Revised: 02/01/2019] [Accepted: 12/16/2018] [Indexed: 11/19/2022] Open
Abstract
Like other domesticates, the efficient utilization of nitrogen resources is also important for the only fully domesticated insect, the silkworm. Deciphering the way in which artificial selection acts on the silkworm genome to improve the utilization of nitrogen resources and to advance human-favored domestication traits, will provide clues from a unique insect model for understanding the general rules of Darwin's evolutionary theory on domestication. Storage proteins (SPs), which belong to a hemocyanin superfamily, basically serve as a source of amino acids and nitrogen during metamorphosis and reproduction in insects. In this study, through blast searching on the silkworm genome and further screening of the artificial selection signature on silkworm SPs, we discovered a candidate domestication gene, i.e., the methionine-rich storage protein 1 (SP1), which is clearly divergent from other storage proteins and exhibits increased expression in the ova of domestic silkworms. Knockout of SP1 via the CRISPR/Cas9 technique resulted in a dramatic decrease in egg hatchability, without obvious impact on egg production, which was similar to the effect in the wild silkworm compared with the domestic type. Larval development and metamorphosis were not affected by SP1 knockout. Comprehensive ova comparative transcriptomes indicated significant higher expression of genes encoding vitellogenin, chorions, and structural components in the extracellular matrix (ECM)-interaction pathway, enzymes in folate biosynthesis, and notably hormone synthesis in the domestic silkworm, compared to both the SP1 mutant and the wild silkworm. Moreover, compared with the wild silkworms, the domestic one also showed generally up-regulated expression of genes enriched in the structural constituent of ribosome and amide, as well as peptide biosynthesis. This study exemplified a novel case in which artificial selection could act directly on nitrogen resource proteins, further affecting egg nutrients and eggshell formation possibly through a hormone signaling mediated regulatory network and the activation of ribosomes, resulting in improved biosynthesis and increased hatchability during domestication. These findings shed new light on both the understanding of artificial selection and silkworm breeding from the perspective of nitrogen and amino acid resources. Like other domesticates, nitrogen resources are also important for the only fully domesticated insect, the silkworm. Deciphering the way in which artificial selection acts on the silkworm genome to improve the utilization of nitrogen resources, thereby advancing human-favored domestication traits, will provide clues from a unique insect model for understanding the general rules of Darwin's theory on artificial selection. However, the mechanisms of domestication in the silkworm remain largely unknown. In this study, we focused on one important nitrogen resource, the storage protein (SP). We discovered that the methionine-rich storage protein 1 (SP1), which is divergent from other SPs, is the only target of artificial selection. Based on functional evidence, together with key findings from the comprehensive comparative transcriptome, we propose that artificial selection favored higher expression of SP1 in the domestic silkworm, which would influence the genes or pathways vital for egg development and eggshell formation. Artificial selection also consistently favored activated ribosome activities and improved amide and peptide biosynthesis in the ova, like what they may act in the silk gland to increase silk-cocoon yield. We highlighted a novel case in which artificial selection could directly act on a nitrogen resource protein associated with a human-desired domestication trait.
Collapse
|