1
|
Carpinteyro-Ponce J, Machado CA. The Complex Landscape of Structural Divergence Between the Drosophila pseudoobscura and D. persimilis Genomes. Genome Biol Evol 2024; 16:evae047. [PMID: 38482945 PMCID: PMC10980976 DOI: 10.1093/gbe/evae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/07/2024] [Indexed: 04/01/2024] Open
Abstract
Structural genomic variants are key drivers of phenotypic evolution. They can span hundreds to millions of base pairs and can thus affect large numbers of genetic elements. Although structural variation is quite common within and between species, its characterization depends upon the quality of genome assemblies and the proportion of repetitive elements. Using new high-quality genome assemblies, we report a complex and previously hidden landscape of structural divergence between the genomes of Drosophila persimilis and D. pseudoobscura, two classic species in speciation research, and study the relationships among structural variants, transposable elements, and gene expression divergence. The new assemblies confirm the already known fixed inversion differences between these species. Consistent with previous studies showing higher levels of nucleotide divergence between fixed inversions relative to collinear regions of the genome, we also find a significant overrepresentation of INDELs inside the inversions. We find that transposable elements accumulate in regions with low levels of recombination, and spatial correlation analyses reveal a strong association between transposable elements and structural variants. We also report a strong association between differentially expressed (DE) genes and structural variants and an overrepresentation of DE genes inside the fixed chromosomal inversions that separate this species pair. Interestingly, species-specific structural variants are overrepresented in DE genes involved in neural development, spermatogenesis, and oocyte-to-embryo transition. Overall, our results highlight the association of transposable elements with structural variants and their importance in driving evolutionary divergence.
Collapse
Affiliation(s)
| | - Carlos A Machado
- Department of Biology, University of Maryland, College Park, MD, USA
| |
Collapse
|
2
|
Stromberg KA, Spain T, Tomlin SA, Powell J, Amarillo KD, Schroeder CM. Evolutionary diversification reveals distinct somatic versus germline cytoskeletal functions of the Arp2 branched actin nucleator protein. Curr Biol 2023; 33:5326-5339.e7. [PMID: 37977138 PMCID: PMC10785674 DOI: 10.1016/j.cub.2023.10.055] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 09/18/2023] [Accepted: 10/25/2023] [Indexed: 11/19/2023]
Abstract
Branched actin networks are critical in many cellular processes, including cell motility and division. Arp2, a protein within the seven-membered Arp2/3 complex, is responsible for generating branched actin. Given its essential roles, Arp2 evolves under stringent sequence conservation throughout eukaryotic evolution. We unexpectedly discovered recurrent evolutionary diversification of Arp2 in Drosophila, yielding independently arising paralogs Arp2D in obscura species and Arp2D2 in montium species. Both paralogs are unusually testis-enriched in expression relative to Arp2. We investigated whether their sequence divergence from canonical Arp2 led to functional specialization by replacing Arp2 in D. melanogaster with either Arp2D or Arp2D2. Despite their divergence, we surprisingly found that both complement Arp2's essential function in somatic tissue, suggesting they have preserved the ability to polymerize branched actin even in a non-native species. However, we found that Arp2D- and Arp2D2-expressing males display defects throughout sperm development, with Arp2D resulting in more pronounced deficiencies and subfertility, suggesting the Arp2 paralogs are cross-species incompatible in the testis. We focused on Arp2D and pinpointed two highly diverged structural regions-the D-loop and C terminus-and found that they contribute to germline defects in D. melanogaster sperm development. However, while the Arp2D C terminus is suboptimal in the D. melanogaster testis, it is essential for Arp2D somatic function. Testis cytology of the paralogs' native species revealed striking differences in germline actin structures, indicating unique cytoskeletal requirements. Our findings suggest canonical Arp2 function differs between somatic versus germline contexts, and Arp2 paralogs may have recurrently evolved for species-specialized actin branching in the testis.
Collapse
Affiliation(s)
- Kaitlin A Stromberg
- Department of Pharmacology, UT Southwestern Medical Center, 6001 Forest Park Road, Dallas, TX 75390, USA
| | - Tristan Spain
- Department of Pharmacology, UT Southwestern Medical Center, 6001 Forest Park Road, Dallas, TX 75390, USA
| | - Sarah A Tomlin
- Division of Basic Sciences, Fred Hutchinson Cancer Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA; Howard Hughes Medical Institute, Fred Hutchinson Cancer Center, 1100 Fairview Avenue North, Seattle, WA 98109, USA
| | - Jordan Powell
- Department of Pharmacology, UT Southwestern Medical Center, 6001 Forest Park Road, Dallas, TX 75390, USA
| | - Kristen Dominique Amarillo
- Department of Pharmacology, UT Southwestern Medical Center, 6001 Forest Park Road, Dallas, TX 75390, USA
| | - Courtney M Schroeder
- Department of Pharmacology, UT Southwestern Medical Center, 6001 Forest Park Road, Dallas, TX 75390, USA.
| |
Collapse
|
3
|
Stromberg KA, Spain T, Tomlin SA, Amarillo KD, Schroeder CM. Evolutionary diversification reveals distinct somatic versus germline cytoskeletal functions of the Arp2 branched actin nucleator protein. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.25.530036. [PMID: 36909544 PMCID: PMC10002617 DOI: 10.1101/2023.02.25.530036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
Abstract
Branched actin networks are critical in many cellular processes, including cell motility and division. Arp2, a protein within the 7-membered Arp2/3 complex, is responsible for generating branched actin. Given its essential roles, Arp2 evolves under stringent sequence conservation throughout eukaryotic evolution. We unexpectedly discovered recurrent evolutionary diversification of Arp2 in Drosophila, yielding independently arising paralogs Arp2D in obscura species and Arp2D2 in montium species. Both paralogs are unusually testis-enriched in expression relative to Arp2. We investigated whether their sequence divergence from canonical Arp2 led to functional specialization by replacing Arp2 in D. melanogaster with either Arp2D or Arp2D2. Despite their divergence, we surprisingly found both complement Arp2's essential function in the soma, suggesting they have preserved the ability to polymerize branched actin even in a non-native species. However, we found that Arp2D-expressing males are subfertile and display many defects throughout sperm development. We pinpointed two highly diverged structural regions in Arp2D that contribute to these defects: subdomain 2 and the C-terminus. We expected that germline function would be rescued by replacing Arp2D's long and charged C-terminus with Arp2's short C-terminus, yet surprisingly, the essential somatic function of Arp2D was lost. Therefore, while Arp2D's structural divergence is incompatible with D. melanogaster sperm development, its unique C-terminus has evolved a critical role in actin polymerization. Our findings suggest canonical Arp2's function differs between somatic versus germline contexts, and Arp2 paralogs have recurrently evolved and specialized for actin branching in the testis.
Collapse
Affiliation(s)
| | - Tristan Spain
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, TX
| | - Sarah A. Tomlin
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, WA
| | | | | |
Collapse
|
4
|
Hine E, Runcie DE, Allen SL, Wang Y, Chenoweth SF, Blows MW, McGuigan K. Maintenance of quantitative genetic variance in complex, multi-trait phenotypes: The contribution of rare, large effect variants in two Drosophila species. Genetics 2022; 222:6663993. [PMID: 35961029 PMCID: PMC9526065 DOI: 10.1093/genetics/iyac122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 08/02/2022] [Indexed: 11/29/2022] Open
Abstract
The interaction of evolutionary processes to determine quantitative genetic variation has implications for contemporary and future phenotypic evolution, as well as for our ability to detect causal genetic variants. While theoretical studies have provided robust predictions to discriminate among competing models, empirical assessment of these has been limited. In particular, theory highlights the importance of pleiotropy in resolving observations of selection and mutation, but empirical investigations have typically been limited to few traits. Here, we applied high-dimensional Bayesian Sparse Factor Genetic modeling to gene expression datasets in 2 species, Drosophila melanogaster and Drosophila serrata, to explore the distributions of genetic variance across high-dimensional phenotypic space. Surprisingly, most of the heritable trait covariation was due to few lines (genotypes) with extreme [>3 interquartile ranges (IQR) from the median] values. Intriguingly, while genotypes extreme for a multivariate factor also tended to have a higher proportion of individual traits that were extreme, we also observed genotypes that were extreme for multivariate factors but not for any individual trait. We observed other consistent differences between heritable multivariate factors with outlier lines vs those factors without extreme values, including differences in gene functions. We use these observations to identify further data required to advance our understanding of the evolutionary dynamics and nature of standing genetic variation for quantitative traits.
Collapse
Affiliation(s)
- Emma Hine
- School of Biological Sciences, The University of Queensland, Brisbane 4072 Australia
| | - Daniel E Runcie
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
| | - Scott L Allen
- School of Biological Sciences, The University of Queensland, Brisbane 4072 Australia
| | - Yiguan Wang
- School of Biological Sciences, The University of Queensland, Brisbane 4072 Australia.,Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - Stephen F Chenoweth
- School of Biological Sciences, The University of Queensland, Brisbane 4072 Australia
| | - Mark W Blows
- School of Biological Sciences, The University of Queensland, Brisbane 4072 Australia
| | - Katrina McGuigan
- School of Biological Sciences, The University of Queensland, Brisbane 4072 Australia
| |
Collapse
|
5
|
Tiedeman Z, Signor S. The transposable elements of the Drosophila serrata reference panel. Genome Biol Evol 2021; 13:6265467. [PMID: 33950180 PMCID: PMC8434751 DOI: 10.1093/gbe/evab100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/30/2021] [Indexed: 11/13/2022] Open
Abstract
Transposable elements (TEs) are an important component of the complex genomic ecosystem. Understanding the tempo and mode of TE proliferation, that is whether it is in maintained in transposition selection balance, or is induced periodically by environmental stress or other factors, is important for understanding the evolution of organismal genomes through time. Although TEs have been characterized in individuals or limited samples, a true understanding of the population genetics of TEs, and therefore the tempo and mode of transposition, is still lacking. Here, we characterize the TE landscape in an important model Drosophila, Drosophila serrata using the D. serrata reference panel, which is comprised of 102 sequenced inbred genotypes. We annotate the families of TEs in the D. serrata genome and investigate variation in TE copy number between genotypes. We find that many TEs have low copy number in the population, but this varies by family and includes a single TE making up to 50% of the genome content of TEs. We find that some TEs proliferate in particular genotypes compared with population levels. In addition, we characterize variation in each TE family allowing copy number to vary in each genotype and find that some TEs have diversified very little between individuals suggesting recent spread. TEs are important sources of spontaneous mutations in Drosophila, making up a large fraction of the total number of mutations in particular genotypes. Understanding the dynamics of TEs within populations will be an important step toward characterizing the origin of variation within and between species.
Collapse
Affiliation(s)
- Zachery Tiedeman
- Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, U.S.A
| | - Sarah Signor
- Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, U.S.A
| |
Collapse
|
6
|
Conner WR, Delaney EK, Bronski MJ, Ginsberg PS, Wheeler TB, Richardson KM, Peckenpaugh B, Kim KJ, Watada M, Hoffmann AA, Eisen MB, Kopp A, Cooper BS, Turelli M. A phylogeny for the Drosophila montium species group: A model clade for comparative analyses. Mol Phylogenet Evol 2020; 158:107061. [PMID: 33387647 DOI: 10.1016/j.ympev.2020.107061] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Revised: 12/18/2020] [Accepted: 12/24/2020] [Indexed: 12/22/2022]
Abstract
The Drosophila montium species group is a clade of 94 named species, closely related to the model species D. melanogaster. The montium species group is distributed over a broad geographic range throughout Asia, Africa, and Australasia. Species of this group possess a wide range of morphologies, mating behaviors, and endosymbiont associations, making this clade useful for comparative analyses. We use genomic data from 42 available species to estimate the phylogeny and relative divergence times within the montium species group, and its relative divergence time from D. melanogaster. To assess the robustness of our phylogenetic inferences, we use 3 non-overlapping sets of 20 single-copy coding sequences and analyze all 60 genes with both Bayesian and maximum likelihood methods. Our analyses support monophyly of the group. Apart from the uncertain placement of a single species, D. baimaii, our analyses also support the monophyly of all seven subgroups proposed within the montium group. Our phylograms and relative chronograms provide a highly resolved species tree, with discordance restricted to estimates of relatively short branches deep in the tree. In contrast, age estimates for the montium crown group, relative to its divergence from D. melanogaster, depend critically on prior assumptions concerning variation in rates of molecular evolution across branches, and hence have not been reliably determined. We discuss methodological issues that limit phylogenetic resolution - even when complete genome sequences are available - as well as the utility of the current phylogeny for understanding the evolutionary and biogeographic history of this clade.
Collapse
Affiliation(s)
- William R Conner
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA; Division of Biological Sciences, University of Montana, Missoula, MT 59812, USA(1)
| | - Emily K Delaney
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Michael J Bronski
- Department of Molecular & Cell Biology, University of California, Berkeley, CA 94720, USA
| | - Paul S Ginsberg
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA; Department of Genetics, University of Georgia, Athens, GA 30602, USA(1)
| | - Timothy B Wheeler
- Division of Biological Sciences, University of Montana, Missoula, MT 59812, USA(1)
| | - Kelly M Richardson
- Bio21 Institute, School of BioScience, University of Melbourne, Victoria 3010, Australia
| | - Brooke Peckenpaugh
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA; Department of Biology, Indiana University, Bloomington, IN 47405, USA(1)
| | - Kevin J Kim
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Masayoshi Watada
- Graduate School of Science and Engineering, Ehime University, Matsuyama, Ehime, Japan
| | - Ary A Hoffmann
- Bio21 Institute, School of BioScience, University of Melbourne, Victoria 3010, Australia
| | - Michael B Eisen
- Department of Molecular & Cell Biology, University of California, Berkeley, CA 94720, USA
| | - Artyom Kopp
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Brandon S Cooper
- Division of Biological Sciences, University of Montana, Missoula, MT 59812, USA(1)
| | - Michael Turelli
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA.
| |
Collapse
|
7
|
Chen J, Yu Y, Kang K, Zhang D. SMRT sequencing of the full-length transcriptome of the white-backed planthopper Sogatella furcifera. PeerJ 2020; 8:e9320. [PMID: 32551204 PMCID: PMC7292024 DOI: 10.7717/peerj.9320] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Accepted: 05/18/2020] [Indexed: 12/28/2022] Open
Abstract
The white-backed planthopper Sogatella furcifera is an economically important rice pest distributed throughout Asia. It damages rice crops by sucking phloem sap, resulting in stunted growth and plant virus transmission. We aimed to obtain the full-length transcriptome data of S. furcifera using PacBio single-molecule real-time (SMRT) sequencing. Total RNA extracted from S. furcifera at various developmental stages (egg, larval, and adult stages) was mixed and used to generate a full-length transcriptome for SMRT sequencing. Long non-coding RNA (lncRNA) identification, full-length coding sequence prediction, full-length non-chimeric (FLNC) read detection, simple sequence repeat (SSR) analysis, transcription factor detection, and transcript functional annotation were performed. A total of 12,514,449 subreads (15.64 Gbp, clean reads) were generated, including 630,447 circular consensus sequences and 388,348 FLNC reads. Transcript cluster analysis of the FLNC reads revealed 251,109 consensus reads including 29,700 high-quality reads. Additionally, 100,360 SSRs and 121,395 coding sequences were identified using SSR analysis and ANGEL software, respectively. Furthermore, 44,324 lncRNAs were annotated using four tools and 1,288 transcription factors were identified. In total, 95,495 transcripts were functionally annotated based on searches of seven different databases. To the best of our knowledge, this is the first study of the full-length transcriptome of the white-backed planthopper obtained using SMRT sequencing. The acquired transcriptome data can facilitate further studies on the ecological and viral-host interactions of this agricultural pest.
Collapse
Affiliation(s)
- Jing Chen
- College of Basic Medical Science, Zunyi Medical University, Zunyi, China
| | - Yaya Yu
- College of Basic Medical Science, Zunyi Medical University, Zunyi, China
| | - Kui Kang
- School of Biological and Agricultural Science and Technology, Zunyi Normal University, Zunyi, China
| | - Daowei Zhang
- School of Biological and Agricultural Science and Technology, Zunyi Normal University, Zunyi, China
| |
Collapse
|
8
|
Whole Genome Sequences of 23 Species from the Drosophila montium Species Group (Diptera: Drosophilidae): A Resource for Testing Evolutionary Hypotheses. G3-GENES GENOMES GENETICS 2020; 10:1443-1455. [PMID: 32220952 PMCID: PMC7202002 DOI: 10.1534/g3.119.400959] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Large groups of species with well-defined phylogenies are excellent systems for testing evolutionary hypotheses. In this paper, we describe the creation of a comparative genomic resource consisting of 23 genomes from the species-rich Drosophila montium species group, 22 of which are presented here for the first time. The montium group is well-positioned for clade genomics. Within the montium clade, evolutionary distances are such that large numbers of sequences can be accurately aligned while also recovering strong signals of divergence; and the distance between the montium group and D. melanogaster is short enough so that orthologous sequence can be readily identified. All genomes were assembled from a single, small-insert library using MaSuRCA, before going through an extensive post-assembly pipeline. Estimated genome sizes within the montium group range from 155 Mb to 223 Mb (mean = 196 Mb). The absence of long-distance information during the assembly process resulted in fragmented assemblies, with the scaffold NG50s varying widely based on repeat content and sample heterozygosity (min = 18 kb, max = 390 kb, mean = 74 kb). The total scaffold length for most assemblies is also shorter than the estimated genome size, typically by 5-15%. However, subsequent analysis showed that our assemblies are highly complete. Despite large differences in contiguity, all assemblies contain at least 96% of known single-copy Dipteran genes (BUSCOs, n = 2,799). Similarly, by aligning our assemblies to the D. melanogaster genome and remapping coordinates for a large set of transcriptional enhancers (n = 3,457), we showed that each montium assembly contains orthologs for at least 91% of D. melanogaster enhancers. Importantly, the genic and enhancer contents of our assemblies are comparable to that of far more contiguous Drosophila assemblies. The alignment of our own D. serrata assembly to a previously published PacBio D. serrata assembly also showed that our longest scaffolds (up to 1 Mb) are free of large-scale misassemblies. Our genome assemblies are a valuable resource that can be used to further resolve the montium group phylogeny; study the evolution of protein-coding genes and cis-regulatory sequences; and determine the genetic basis of ecological and behavioral adaptations.
Collapse
|
9
|
Jayakumar V, Sakakibara Y. Comprehensive evaluation of non-hybrid genome assembly tools for third-generation PacBio long-read sequence data. Brief Bioinform 2020; 20:866-876. [PMID: 29112696 PMCID: PMC6585154 DOI: 10.1093/bib/bbx147] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Revised: 09/22/2017] [Indexed: 12/20/2022] Open
Abstract
Long reads obtained from third-generation sequencing platforms can help overcome the long-standing challenge of the de novo assembly of sequences for the genomic analysis of non-model eukaryotic organisms. Numerous long-read-aided de novo assemblies have been published recently, which exhibited superior quality of the assembled genomes in comparison with those achieved using earlier second-generation sequencing technologies. Evaluating assemblies is important in guiding the appropriate choice for specific research needs. In this study, we evaluated 10 long-read assemblers using a variety of metrics on Pacific Biosciences (PacBio) data sets from different taxonomic categories with considerable differences in genome size. The results allowed us to narrow down the list to a few assemblers that can be effectively applied to eukaryotic assembly projects. Moreover, we highlight how best to use limited genomic resources for effectively evaluating the genome assemblies of non-model organisms.
Collapse
|
10
|
Tan C, Liu H, Ren J, Ye X, Feng H, Liu Z. Single-molecule real-time sequencing facilitates the analysis of transcripts and splice isoforms of anthers in Chinese cabbage (Brassica rapa L. ssp. pekinensis). BMC PLANT BIOLOGY 2019; 19:517. [PMID: 31771515 PMCID: PMC6880451 DOI: 10.1186/s12870-019-2133-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Accepted: 11/12/2019] [Indexed: 05/06/2023]
Abstract
BACKGROUND Anther development has been extensively studied at the transcriptional level, but a systematic analysis of full-length transcripts on a genome-wide scale has not yet been published. Here, the Pacific Biosciences (PacBio) Sequel platform and next-generation sequencing (NGS) technology were combined to generate full-length sequences and completed structures of transcripts in anthers of Chinese cabbage. RESULTS Using single-molecule real-time sequencing (SMRT), a total of 1,098,119 circular consensus sequences (CCSs) were generated with a mean length of 2664 bp. More than 75% of the CCSs were considered full-length non-chimeric (FLNC) reads. After error correction, 725,731 high-quality FLNC reads were estimated to carry 51,501 isoforms from 19,503 loci, consisting of 38,992 novel isoforms from known genes and 3691 novel isoforms from novel genes. Of the novel isoforms, we identified 407 long non-coding RNAs (lncRNAs) and 37,549 open reading frames (ORFs). Furthermore, a total of 453,270 alternative splicing (AS) events were identified and the majority of AS models in anther were determined to be approximate exon skipping (XSKIP) events. Of the key genes regulated during anther development, AS events were mainly identified in the genes SERK1, CALS5, NEF1, and CESA1/3. Additionally, we identified 104 fusion transcripts and 5806 genes that had alternative polyadenylation (APA). CONCLUSIONS Our work demonstrated the transcriptome diversity and complexity of anther development in Chinese cabbage. The findings provide a basis for further genome annotation and transcriptome research in Chinese cabbage.
Collapse
Affiliation(s)
- Chong Tan
- College of Horticulture, Shenyang Agricultural University, Shenyang, Liaoning, 110866, People's Republic of China
| | - Hongxin Liu
- College of Horticulture, Shenyang Agricultural University, Shenyang, Liaoning, 110866, People's Republic of China
| | - Jie Ren
- College of Horticulture, Shenyang Agricultural University, Shenyang, Liaoning, 110866, People's Republic of China
| | - Xueling Ye
- College of Horticulture, Shenyang Agricultural University, Shenyang, Liaoning, 110866, People's Republic of China
| | - Hui Feng
- College of Horticulture, Shenyang Agricultural University, Shenyang, Liaoning, 110866, People's Republic of China
| | - Zhiyong Liu
- College of Horticulture, Shenyang Agricultural University, Shenyang, Liaoning, 110866, People's Republic of China.
| |
Collapse
|
11
|
High-Quality Genome Assemblies Reveal Long Non-coding RNAs Expressed in Ant Brains. Cell Rep 2019; 23:3078-3090. [PMID: 29874592 PMCID: PMC6023404 DOI: 10.1016/j.celrep.2018.05.014] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Revised: 04/04/2018] [Accepted: 05/03/2018] [Indexed: 12/31/2022] Open
Abstract
Ants are an emerging model system for neuroepigenetics, as embryos with virtually identical genomes develop into different adult castes that display diverse physiology, morphology, and behavior. Although a number of ant genomes have been sequenced to date, their draft quality is an obstacle to sophisticated analyses of epigenetic gene regulation. We reassembled de novo high-quality genomes for two ant species, Camponotus floridanus and Harpegnathos saltator. Using long reads enabled us to span large repetitive regions and improve genome contiguity, leading to comprehensive and accurate protein-coding annotations that facilitated the identification of a Gp-9-like gene as differentially expressed in Harpegnathos castes. The new assemblies also enabled us to annotate long non-coding RNAs in ants, revealing caste-, brain-, and developmental-stage-specific long non-coding RNAs (lncRNAs) in Harpegnathos. These upgraded genomes, along with the new gene annotations, will aid future efforts to identify epigenetic mechanisms of phenotypic and behavioral plasticity in ants.
Collapse
|
12
|
Abstract
The increasing affordability of sequencing technologies offers many new and exciting opportunities to address a diverse array of biological questions. This is evidenced in entomological research by numerous genomics and transcriptomics studies that attempt to decipher the often complex relationships among different species or orders and to build "omics" resources to drive advancement of the molecular understanding of insect biology. Being able to gauge the quality of the sequencing data is of critical importance to understanding the potential limitations on the types of questions that these data can be reliably used to address. This chapter details the use of the Benchmarking Universal Single-Copy Orthologue (BUSCO) assessment tool to estimate the completeness of transcriptomes, genome assemblies, and annotated gene sets in terms of their expected gene content.
Collapse
|
13
|
Dupim EG, Goldstein G, Vanderlinde T, Vaz SC, Krsticevic F, Bastos A, Pinhão T, Torres M, David JR, Vilela CR, Carvalho AB. An investigation of Y chromosome incorporations in 400 species of Drosophila and related genera. PLoS Genet 2018; 14:e1007770. [PMID: 30388103 PMCID: PMC6235401 DOI: 10.1371/journal.pgen.1007770] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 11/14/2018] [Accepted: 10/17/2018] [Indexed: 11/23/2022] Open
Abstract
Y chromosomes are widely believed to evolve from a normal autosome through a process of massive gene loss (with preservation of some male genes), shaped by sex-antagonistic selection and complemented by occasional gains of male-related genes. The net result of these processes is a male-specialized chromosome. This might be expected to be an irreversible process, but it was found in 2005 that the Drosophila pseudoobscura Y chromosome was incorporated into an autosome. Y chromosome incorporations have important consequences: a formerly male-restricted chromosome reverts to autosomal inheritance, and the species may shift from an XY/XX to X0/XX sex-chromosome system. In order to assess the frequency and causes of this phenomenon we searched for Y chromosome incorporations in 400 species from Drosophila and related genera. We found one additional large scale event of Y chromosome incorporation, affecting the whole montium subgroup (40 species in our sample); overall 13% of the sampled species (52/400) have Y incorporations. While previous data indicated that after the Y incorporation the ancestral Y disappeared as a free chromosome, the much larger data set analyzed here indicates that a copy of the Y survived as a free chromosome both in montium and pseudoobscura species, and that the current Y of the pseudoobscura lineage results from a fusion between this free Y and the neoY. The 400 species sample also showed that the previously suggested causal connection between X-autosome fusions and Y incorporations is, at best, weak: the new case of Y incorporation (montium) does not have X-autosome fusion, whereas nine independent cases of X-autosome fusions were not followed by Y incorporations. Y incorporation is an underappreciated mechanism affecting Y chromosome evolution; our results show that at least in Drosophila it plays a relevant role and highlight the need of similar studies in other groups. In contrast to other chromosomes (X and autosomes), which are present in males and females, Y chromosomes spend all time in males. Hence it is not surprising that along evolution they became male specialized, e.g., containing a disproportionate amount of male-fertility genes. Interestingly it was found in 2005 that in Drosophila pseudoobscura the Y chromosome reverted to "male-female existence", being incorporated into an autosome. These "Y chromosome incorporations" have important consequences on sex-chromosome evolution, and allow the study of the evolutionary forces that shaped Y chromosomes as they act backwards. As D. pseudoobscura was the second Drosophila species investigated in this respect, it is likely that other cases exist, and that perhaps it is a common phenomenon. In order to answer this question we studied 400 Drosophila species. We found one additional case of Y incorporation, which occurred in the ancestor of Drosophila montium, and currently affects a large number of species; overall 13% of the species we sampled (52/400) have Y incorporations. We also found that a previously suggested cause of Y incorporations (X-autosome fusions) is not a general explanation. Our results show that in Drosophila Y incorporations play a relevant role and highlight the need of similar studies in other groups.
Collapse
Affiliation(s)
- Eduardo G. Dupim
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Gabriel Goldstein
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
- Departamento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Thyago Vanderlinde
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Suzana C. Vaz
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
- Departamento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Flávia Krsticevic
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
- CIFASIS, CONICET, Rosario, Santa Fe, Argentina
| | - Aline Bastos
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Thadeo Pinhão
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Marcos Torres
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Jean R. David
- Laboratoire Evolution, Génomes et Spéciation (LEGS), CNRS, France
| | - Carlos R. Vilela
- Departamento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Antonio Bernardo Carvalho
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
- * E-mail: ,
| |
Collapse
|
14
|
Highly Contiguous Genome Assemblies of 15 Drosophila Species Generated Using Nanopore Sequencing. G3-GENES GENOMES GENETICS 2018; 8:3131-3141. [PMID: 30087105 PMCID: PMC6169393 DOI: 10.1534/g3.118.200160] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The Drosophila genus is a unique group containing a wide range of species that occupy diverse ecosystems. In addition to the most widely studied species, Drosophila melanogaster, many other members in this genus also possess a well-developed set of genetic tools. Indeed, high-quality genomes exist for several species within the genus, facilitating studies of the function and evolution of cis-regulatory regions and proteins by allowing comparisons across at least 50 million years of evolution. Yet, the available genomes still fail to capture much of the substantial genetic diversity within the Drosophila genus. We have therefore tested protocols to rapidly and inexpensively sequence and assemble the genome from any Drosophila species using single-molecule sequencing technology from Oxford Nanopore. Here, we use this technology to present highly contiguous genome assemblies of 15 Drosophila species: 10 of the 12 originally sequenced Drosophila species (ananassae, erecta, mojavensis, persimilis, pseudoobscura, sechellia, simulans, virilis, willistoni, and yakuba), four additional species that had previously reported assemblies (biarmipes, bipectinata, eugracilis, and mauritiana), and one novel assembly (triauraria). Genomes were generated from an average of 29x depth-of-coverage data that after assembly resulted in an average contig N50 of 4.4 Mb. Subsequent alignment of contigs from the published reference genomes demonstrates that our assemblies could be used to close over 60% of the gaps present in the currently published reference genomes. Importantly, the materials and reagents cost for each genome was approximately $1,000 (USD). This study demonstrates the power and cost-effectiveness of long-read sequencing for genome assembly in Drosophila and provides a framework for the affordable sequencing and assembly of additional Drosophila genomes.
Collapse
|
15
|
Abstract
Here we describe a collection of re-sequenced inbred lines of Drosophila serrata, sampled from a natural population situated deep within the species endemic distribution in Brisbane, Australia. D. serrata is a member of the speciose montium group whose members inhabit much of south east Asia and has been well studied for aspects of climatic adaptation, sexual selection, sexual dimorphism, and mate recognition. We sequenced 110 lines that were inbred via 17-20 generations of full-sib mating at an average coverage of 23.5x with paired-end Illumina reads. 15,228,692 biallelic SNPs passed quality control after being called using the Joint Genotyper for Inbred Lines (JGIL). Inbreeding was highly effective and the average levels of residual heterozygosity (0.86%) were well below theoretical expectations. As expected, linkage disequilibrium decayed rapidly, with r2 dropping below 0.1 within 100 base pairs. With the exception of four closely related pairs of lines which may have been due to technical errors, there was no statistical support for population substructure. Consistent with other endemic populations of other Drosophila species, preliminary population genetic analyses revealed high nucleotide diversity and, on average, negative Tajima’s D values. A preliminary GWAS was performed on a cuticular hydrocarbon trait, 2-Me-C28 revealing 4 SNPs passing Bonferroni significance residing in or near genes. One gene Cht9 may be involved in the transport of CHCs from the site of production (oenocytes) to the cuticle. Our panel will facilitate broader population genomic and quantitative genetic studies of this species and serve as an important complement to existing D. melanogaster panels that can be used to test for the conservation of genetic architectures across the Drosophila genus.
Collapse
|
16
|
Allen SL, Bonduriansky R, Sgro CM, Chenoweth SF. Sex-biased transcriptome divergence along a latitudinal gradient. Mol Ecol 2017; 26:1256-1272. [PMID: 28100025 DOI: 10.1111/mec.14015] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Revised: 11/23/2016] [Accepted: 11/28/2016] [Indexed: 12/26/2022]
Abstract
Sex-dependent gene expression is likely an important genomic mechanism that allows sex-specific adaptation to environmental changes. Among Drosophila species, sex-biased genes display remarkably consistent evolutionary patterns; male-biased genes evolve faster than unbiased genes in both coding sequence and expression level, suggesting sex differences in selection through time. However, comparatively little is known of the evolutionary process shaping sex-biased expression within species. Latitudinal clines offer an opportunity to examine how changes in key ecological parameters also influence sex-specific selection and the evolution of sex-biased gene expression. We assayed male and female gene expression in Drosophila serrata along a latitudinal gradient in eastern Australia spanning most of its endemic distribution. Analysis of 11 631 genes across eight populations revealed strong sex differences in the frequency, mode and strength of divergence. Divergence was far stronger in males than females and while latitudinal clines were evident in both sexes, male divergence was often population specific, suggesting responses to localized selection pressures that do not covary predictably with latitude. While divergence was enriched for male-biased genes, there was no overrepresentation of X-linked genes in males. By contrast, X-linked divergence was elevated in females, especially for female-biased genes. Many genes that diverged in D. serrata have homologs also showing latitudinal divergence in Drosophila simulans and Drosophila melanogaster on other continents, likely indicating parallel adaptation in these distantly related species. Our results suggest that sex differences in selection play an important role in shaping the evolution of gene expression over macro- and micro-ecological spatial scales.
Collapse
Affiliation(s)
- Scott L Allen
- The School of Biological Sciences, The University of Queensland, St. Lucia, Qld, 4072, Australia
| | - Russell Bonduriansky
- Evolution & Ecology Research Centre and School of Biological, Earth and Environmental Sciences, University of New South Wales, Sydney, NSW, 2052, Australia
| | - Carla M Sgro
- School of Biological Sciences, Monash University, Melbourne, Vic., 3800, Australia
| | - Stephen F Chenoweth
- The School of Biological Sciences, The University of Queensland, St. Lucia, Qld, 4072, Australia
| |
Collapse
|