1
|
Lineage-Specific Genes and Family Expansions in Dictyostelid Genomes Display Expression Bias and Evolutionary Diversification during Development. Genes (Basel) 2021; 12:genes12101628. [PMID: 34681022 PMCID: PMC8535579 DOI: 10.3390/genes12101628] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Revised: 10/12/2021] [Accepted: 10/13/2021] [Indexed: 12/23/2022] Open
Abstract
Gene duplications generate new genes that can contribute to expression changes and the evolution of new functions. Genomes often consist of gene families that undergo expansions, some of which occur in specific lineages that reflect recent adaptive diversification. In this study, lineage-specific genes and gene family expansions were studied across five dictyostelid species to determine when and how they are expressed during multicellular development. Lineage-specific genes were found to be enriched among genes with biased expression (predominant expression in one developmental stage) in each species and at most developmental time points, suggesting independent functional innovations of new genes throughout the phylogeny. Biased duplicate genes had greater expression divergence than their orthologs and paralogs, consistent with subfunctionalization or neofunctionalization. Lineage-specific expansions in particular had biased genes with both molecular signals of positive selection and high expression, suggesting adaptive genetic and transcriptional diversification following duplication. Our results present insights into the potential contributions of lineage-specific genes and families in generating species-specific phenotypes during multicellular development in dictyostelids.
Collapse
|
2
|
Katoh-Kurasawa M, Hrovatin K, Hirose S, Webb A, Ho HI, Zupan B, Shaulsky G. Transcriptional milestones in Dictyostelium development. Genome Res 2021; 31:1498-1511. [PMID: 34183452 PMCID: PMC8327917 DOI: 10.1101/gr.275496.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 06/23/2021] [Indexed: 02/02/2023]
Abstract
Dictyostelium development begins with single-cell starvation and ends with multicellular fruiting bodies. Developmental morphogenesis is accompanied by sweeping transcriptional changes, encompassing nearly half of the 13,000 genes in the genome. We performed time-series RNA-sequencing analyses of the wild type and 20 mutants to explore the relationships between transcription and morphogenesis. These strains show developmental arrest at different stages, accelerated development, or atypical morphologies. Considering eight major morphological transitions, we identified 1371 milestone genes whose expression changes sharply between consecutive transitions. We also identified 1099 genes as members of 21 regulons, which are groups of genes that remain coordinately regulated despite the genetic, temporal, and developmental perturbations. The gene annotations in these groups validate known transitions and reveal new developmental events. For example, DNA replication genes are tightly coregulated with cell division genes, so they are expressed in mid-development although chromosomal DNA is not replicated. Our data set includes 486 transcriptional profiles that can help identify new relationships between transcription and development and improve gene annotations. We show its utility by showing that cycles of aggregation and disaggregation in allorecognition-defective mutants involve dedifferentiation. We also show sensitivity to genetic and developmental conditions in two commonly used actin genes, act6 and act15, and robustness of the coaA gene. Finally, we propose that gpdA is a better mRNA quantitation standard because it is less sensitive to external conditions than commonly used standards. The data set is available for democratized exploration through the web application dictyExpress and the data mining environment Orange.
Collapse
Affiliation(s)
- Mariko Katoh-Kurasawa
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Karin Hrovatin
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | - Shigenori Hirose
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Amanda Webb
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Hsing-I Ho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Blaž Zupan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | - Gad Shaulsky
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
3
|
Conditional expression explains molecular evolution of social genes in a microbe. Nat Commun 2019; 10:3284. [PMID: 31337766 PMCID: PMC6650454 DOI: 10.1038/s41467-019-11237-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Accepted: 06/25/2019] [Indexed: 12/30/2022] Open
Abstract
Conflict is thought to play a critical role in the evolution of social interactions by promoting diversity or driving accelerated evolution. However, despite our sophisticated understanding of how conflict shapes social traits, we have limited knowledge of how it impacts molecular evolution across the underlying social genes. Here we address this problem by analyzing the genome-wide impact of social interactions using genome sequences from 67 Dictyostelium discoideum strains. We find that social genes tend to exhibit enhanced polymorphism and accelerated evolution. However, these patterns are not consistent with conflict driven processes, but instead reflect relaxed purifying selection. This pattern is most likely explained by the conditional nature of social interactions, whereby selection on genes expressed only in social interactions is diluted by generations of inactivity. This dilution of selection by inactivity enhances the role of drift, leading to increased polymorphism and accelerated evolution, which we call the Red King process.
Collapse
|
4
|
Hayakawa IS, Inouye K. Species recognition in social amoebae. J Biosci 2018; 43:1025-1036. [PMID: 30541961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Aggregative multicellularity requires the ability of cells to recognise conspecifics. Social amoebae are among the best studied of such organisms, but the mechanism and evolutionary background of species recognition remained to be investigated. Here we show that heterologous expression of a single Dictyostelium purpureum gene is sufficient for D. discoideum cells to efficiently make chimaeric fruiting bodies with D. purpureum cells. This gene forms a bidirectional pair with another gene on the D. purpureum genome, and they are both highly polymorphic among independent wild isolates of the same mating group that do not form chimaeric fruiting bodies with each other. These paired genes are both structurally similar to D. discoideum tgrB1/C1 pair, which is responsible for clonal discrimination within that species, suggesting that these tgr genes constitute the species recognition system that has attained a level of precision capable of discriminating between clones within a species. Analysis of the available genome sequences of social amoebae revealed that such gene pairs exist only within the clade composed of species that produce precursors of sterile stalk cells (prestalk cells), suggesting concurrent evolution of a precise allorecognition system and a new 'worker' cell-type dedicated to transporting and supporting the reproductive cells.
Collapse
Affiliation(s)
- Ikumi Shibano Hayakawa
- Department of Botany, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan,
| | | |
Collapse
|
5
|
Species recognition in social amoebae. J Biosci 2018. [DOI: 10.1007/s12038-018-9810-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
6
|
Chen X, Köllner TG, Shaulsky G, Jia Q, Dickschat JS, Gershenzon J, Chen F. Diversity and Functional Evolution of Terpene Synthases in Dictyostelid Social Amoebae. Sci Rep 2018; 8:14361. [PMID: 30254228 PMCID: PMC6156593 DOI: 10.1038/s41598-018-32639-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 09/12/2018] [Indexed: 11/23/2022] Open
Abstract
Dictyostelids, or social amoebae, have a unique life style in forming multicellular fruiting bodies from unicellular amoeboids upon starvation. Recently, dictyostelids were found to contain terpene synthase (TPS) genes, a gene type of secondary metabolism previously known to occur only in plants, fungi and bacteria. Here we report an evolutionary functional study of dictyostelid TPS genes. The number of TPS genes in six species of dictyostelids examined ranges from 1 to 19; and the model species Dictyostelium purpureum contains 12 genes. Using in vitro enzyme assays, the 12 TPS genes from D. purpureum were shown to encode functional enzymes with distinct product profiles. The expression of the 12 TPS genes in D. purpureum is developmentally regulated. During multicellular development, D. purpureum releases a mixture of volatile terpenes dominated by sesquiterpenes that are the in vitro products of a subset of the 12 TPS genes. The quality and quantity of the terpenes released from D. purpureum, however, bear little resemblance to those of D. discoideum, a closely related dictyostelid. Despite these variations, the conserved clade of dictyostelid TPSs, which have an evolutionary distance of more than 600 million years, has the same biochemical function, catalyzing the formation of a sesquiterpene protoillud-7-ene. Taken together, our results indicate that the dynamic evolution of dictyostelid TPS genes includes both purifying selection of an orthologous group and species-specific expansion with functional divergence. Consequently, the terpenes produced by these TPSs most likely have conserved as well as species-adaptive biological functions as chemical languages in dictyostelids.
Collapse
Affiliation(s)
- Xinlu Chen
- Department of Plant Sciences, University of Tennessee, Knoxville, TN, 37996, USA
| | - Tobias G Köllner
- Department of Biochemistry, Max Planck Institute for Chemical Ecology, Jena, Germany
| | - Gad Shaulsky
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Qidong Jia
- Department of Plant Sciences, University of Tennessee, Knoxville, TN, 37996, USA
| | - Jeroen S Dickschat
- Kekulé-Institute of Organic Chemistry and Biochemistry, University of Bonn, Gerhard-Domagk-Straße 1, 53121, Bonn, Germany
| | - Jonathan Gershenzon
- Department of Biochemistry, Max Planck Institute for Chemical Ecology, Jena, Germany
| | - Feng Chen
- Department of Plant Sciences, University of Tennessee, Knoxville, TN, 37996, USA.
| |
Collapse
|
7
|
Swart AL, Harrison CF, Eichinger L, Steinert M, Hilbi H. Acanthamoeba and Dictyostelium as Cellular Models for Legionella Infection. Front Cell Infect Microbiol 2018; 8:61. [PMID: 29552544 PMCID: PMC5840211 DOI: 10.3389/fcimb.2018.00061] [Citation(s) in RCA: 65] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Accepted: 02/13/2018] [Indexed: 12/20/2022] Open
Abstract
Environmental bacteria of the genus Legionella naturally parasitize free-living amoebae. Upon inhalation of bacteria-laden aerosols, the opportunistic pathogens grow intracellularly in alveolar macrophages and can cause a life-threatening pneumonia termed Legionnaires' disease. Intracellular replication in amoebae and macrophages takes place in a unique membrane-bound compartment, the Legionella-containing vacuole (LCV). LCV formation requires the bacterial Icm/Dot type IV secretion system, which translocates literally hundreds of "effector" proteins into host cells, where they modulate crucial cellular processes for the pathogen's benefit. The mechanism of LCV formation appears to be evolutionarily conserved, and therefore, amoebae are not only ecologically significant niches for Legionella spp., but also useful cellular models for eukaryotic phagocytes. In particular, Acanthamoeba castellanii and Dictyostelium discoideum emerged over the last years as versatile and powerful models. Using genetic, biochemical and cell biological approaches, molecular interactions between amoebae and Legionella pneumophila have recently been investigated in detail with a focus on the role of phosphoinositide lipids, small and large GTPases, autophagy components and the retromer complex, as well as on bacterial effectors targeting these host factors.
Collapse
Affiliation(s)
- A Leoni Swart
- Institute of Medical Microbiology, Medical Faculty, University of Zurich, Zurich, Switzerland
| | - Christopher F Harrison
- Max von Pettenkofer Institute, Medical Faculty, Ludwig-Maximilians University Munich, Munich, Germany
| | - Ludwig Eichinger
- Institute for Biochemistry I, Medical Faculty, University Hospital Cologne, Cologne, Germany
| | - Michael Steinert
- Department of Life Sciences, Institute of Microbiology, Technical University of Braunschweig, Braunschweig, Germany
| | - Hubert Hilbi
- Institute of Medical Microbiology, Medical Faculty, University of Zurich, Zurich, Switzerland
| |
Collapse
|
8
|
Hillmann F, Forbes G, Novohradská S, Ferling I, Riege K, Groth M, Westermann M, Marz M, Spaller T, Winckler T, Schaap P, Glöckner G. Multiple Roots of Fruiting Body Formation in Amoebozoa. Genome Biol Evol 2018; 10:591-606. [PMID: 29378020 PMCID: PMC5804921 DOI: 10.1093/gbe/evy011] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/11/2018] [Indexed: 02/03/2023] Open
Abstract
Establishment of multicellularity represents a major transition in eukaryote evolution. A subgroup of Amoebozoa, the dictyosteliids, has evolved a relatively simple aggregative multicellular stage resulting in a fruiting body supported by a stalk. Protosteloid amoeba, which are scattered throughout the amoebozoan tree, differ by producing only one or few single stalked spores. Thus, one obvious difference in the developmental cycle of protosteliids and dictyosteliids seems to be the establishment of multicellularity. To separate spore development from multicellular interactions, we compared the genome and transcriptome of a Protostelium species (Protostelium aurantium var. fungivorum) with those of social and solitary members of the Amoebozoa. During fruiting body formation nearly 4,000 genes, corresponding to specific pathways required for differentiation processes, are upregulated. A comparison with genes involved in the development of dictyosteliids revealed conservation of >500 genes, but most of them are also present in Acanthamoeba castellanii for which fruiting bodies have not been documented. Moreover, expression regulation of those genes differs between P. aurantium and Dictyostelium discoideum. Within Amoebozoa differentiation to fruiting bodies is common, but our current genome analysis suggests that protosteliids and dictyosteliids used different routes to achieve this. Most remarkable is both the large repertoire and diversity between species in genes that mediate environmental sensing and signal processing. This likely reflects an immense adaptability of the single cell stage to varying environmental conditions. We surmise that this signaling repertoire provided sufficient building blocks to accommodate the relatively simple demands for cell-cell communication in the early multicellular forms.
Collapse
Affiliation(s)
- Falk Hillmann
- Junior Research Group Evolution of Microbial Interaction, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knöll Institute (HKI), Jena, Germany
| | - Gillian Forbes
- Division of Cell and Developmental Biology, School of Life Sciences, University of Dundee, United Kingdom
| | - Silvia Novohradská
- Junior Research Group Evolution of Microbial Interaction, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knöll Institute (HKI), Jena, Germany
| | - Iuliia Ferling
- Junior Research Group Evolution of Microbial Interaction, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knöll Institute (HKI), Jena, Germany
| | - Konstantin Riege
- Bioinformatics/High Throughput Analysis, Friedrich Schiller University Jena, Germany
| | - Marco Groth
- CF DNA-Sequencing, Leibniz Institute on Aging Research, Jena, Germany
| | | | - Manja Marz
- Bioinformatics/High Throughput Analysis, Friedrich Schiller University Jena, Germany
| | - Thomas Spaller
- Pharmaceutical Biology, Institute of Pharmacy, Friedrich Schiller University Jena, Germany
| | - Thomas Winckler
- Pharmaceutical Biology, Institute of Pharmacy, Friedrich Schiller University Jena, Germany
| | - Pauline Schaap
- Division of Cell and Developmental Biology, School of Life Sciences, University of Dundee, United Kingdom
| | - Gernot Glöckner
- Institute of Biochemistry I, Medical Faculty, University of Cologne, Germany
| |
Collapse
|
9
|
Transcriptome reprogramming during developmental switching in Physarum polycephalum involves extensive remodeling of intracellular signaling networks. Sci Rep 2017; 7:12304. [PMID: 28951552 PMCID: PMC5615045 DOI: 10.1038/s41598-017-12250-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 08/29/2017] [Indexed: 11/08/2022] Open
Abstract
Activation of a phytochrome photoreceptor triggers a program of Physarum polycephalum plasmodial cell differentiation through which a mitotic multinucleate protoplasmic mass synchronously develops into haploid spores formed by meiosis and rearrangement of cellular components. We have performed a transcriptome-wide RNAseq study of cellular reprogramming and developmental switching. RNAseq analysis revealed extensive remodeling of intracellular signaling and regulation in switching the expression of sets of genes encoding transcription factors, kinases, phosphatases, signal transduction proteins, RNA-binding proteins, ubiquitin ligases, regulators of the mitotic and meiotic cell cycle etc. in conjunction with the regulation of genes encoding metabolic enzymes and cytoskeletal proteins. About 15% of the differentially expressed genes shared similarity with members of the evolutionary conserved set of core developmental genes of social amoebae. Differential expression of genes encoding regulators that act at the transcriptional, translational, and post-translational level indicates the establishment of a new state of cellular function and reveals evolutionary deeply conserved molecular changes involved in cellular reprogramming and differentiation in a prototypical eukaryote.
Collapse
|
10
|
Stajdohar M, Rosengarten RD, Kokosar J, Jeran L, Blenkus D, Shaulsky G, Zupan B. dictyExpress: a web-based platform for sequence data management and analytics in Dictyostelium and beyond. BMC Bioinformatics 2017; 18:291. [PMID: 28578698 PMCID: PMC5457571 DOI: 10.1186/s12859-017-1706-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Accepted: 05/23/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Dictyostelium discoideum, a soil-dwelling social amoeba, is a model for the study of numerous biological processes. Research in the field has benefited mightily from the adoption of next-generation sequencing for genomics and transcriptomics. Dictyostelium biologists now face the widespread challenges of analyzing and exploring high dimensional data sets to generate hypotheses and discovering novel insights. RESULTS We present dictyExpress (2.0), a web application designed for exploratory analysis of gene expression data, as well as data from related experiments such as Chromatin Immunoprecipitation sequencing (ChIP-Seq). The application features visualization modules that include time course expression profiles, clustering, gene ontology enrichment analysis, differential expression analysis and comparison of experiments. All visualizations are interactive and interconnected, such that the selection of genes in one module propagates instantly to visualizations in other modules. dictyExpress currently stores the data from over 800 Dictyostelium experiments and is embedded within a general-purpose software framework for management of next-generation sequencing data. dictyExpress allows users to explore their data in a broader context by reciprocal linking with dictyBase-a repository of Dictyostelium genomic data. In addition, we introduce a companion application called GenBoard, an intuitive graphic user interface for data management and bioinformatics analysis. CONCLUSIONS dictyExpress and GenBoard enable broad adoption of next generation sequencing based inquiries by the Dictyostelium research community. Labs without the means to undertake deep sequencing projects can mine the data available to the public. The entire information flow, from raw sequence data to hypothesis testing, can be accomplished in an efficient workspace. The software framework is generalizable and represents a useful approach for any research community. To encourage more wide usage, the backend is open-source, available for extension and further development by bioinformaticians and data scientists.
Collapse
Affiliation(s)
- Miha Stajdohar
- Genialis d.o.o., Trzaska cesta 315, Ljubljana, 1000, Slovenia
| | | | - Janez Kokosar
- Genialis d.o.o., Trzaska cesta 315, Ljubljana, 1000, Slovenia
| | - Luka Jeran
- Genialis d.o.o., Trzaska cesta 315, Ljubljana, 1000, Slovenia
| | - Domen Blenkus
- Genialis d.o.o., Trzaska cesta 315, Ljubljana, 1000, Slovenia
| | - Gad Shaulsky
- Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, 77030, TX, USA
| | - Blaz Zupan
- Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, 77030, TX, USA.,Faculty of Computer and Information Science, University of Ljubljana, Večna pot 113, Ljubljana, 1000, Slovenia
| |
Collapse
|
11
|
Stumpf M, Müller R, Gaßen B, Wehrstedt R, Fey P, Karow MA, Eichinger L, Glöckner G, Noegel AA. A tripeptidyl peptidase 1 is a binding partner of the Golgi pH regulator (GPHR) in Dictyostelium. Dis Model Mech 2017; 10:897-907. [PMID: 28546289 PMCID: PMC5536908 DOI: 10.1242/dmm.029280] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Accepted: 05/17/2017] [Indexed: 02/02/2023] Open
Abstract
Mutations in tripeptidyl peptidase 1 (TPP1) have been associated with late infantile neuronal ceroid lipofuscinosis (NCL), a neurodegenerative disorder. TPP1 is a lysosomal serine protease, which removes tripeptides from the N-terminus of proteins and is composed of an N-terminal prodomain and a catalytic domain. It is conserved in mammals, amphibians, fish and the amoeba Dictyostelium discoideum. D. discoideum harbors at least six genes encoding TPP1, tpp1A to tpp1F. We identified TPP1F as binding partner of Dictyostelium GPHR (Golgi pH regulator), which is an evolutionarily highly conserved intracellular transmembrane protein. A region encompassing the DUF3735 (GPHR_N) domain of GPHR was responsible for the interaction. In TPP1F, the binding site is located in the prodomain of the protein. The tpp1F gene is transcribed throughout development and translated into a polypeptide of ∼65 kDa. TPP1 activity was demonstrated for TPP1F-GFP immunoprecipitated from D. discoideum cells. Its activity could be inhibited by addition of the recombinant DUF3735 domain of GPHR. Knockout tpp1F mutants did not display any particular phenotype, and TPP1 activity was not abrogated, presumably because tpp1B compensates as it has the highest expression level of all the TPP1 genes during growth. The GPHR interaction was not restricted to TPP1F but occurred also with TPP1B. As previous reports show that the majority of the TPP1 mutations in NCL resulted in reduction or loss of enzyme activity, we suggest that Dicyostelium could be used as a model system in which to test new reagents that could affect the activity of the protein and ameliorate the disease. Summary: Interaction of Dictyostelium tripeptidyl peptidase 1 with GPHR could be relevant for studies of the human enzyme, which is associated with a neurodegenerative disorder.
Collapse
Affiliation(s)
- Maria Stumpf
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| | - Rolf Müller
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| | - Berthold Gaßen
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| | - Regina Wehrstedt
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| | - Petra Fey
- Dicty Base, Northwestern University, Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA
| | - Malte A Karow
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| | - Ludwig Eichinger
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| | - Gernot Glöckner
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| | - Angelika A Noegel
- Institute of Biochemistry I, Medical Faculty, University Hospital Cologne, Center for Molecular Medicine Cologne, University of Cologne, Joseph-Stelzmann-Str. 52, Köln 50931, Germany
| |
Collapse
|
12
|
Singh R, Lawal HM, Schilde C, Glöckner G, Barton GJ, Schaap P, Cole C. Improved annotation with de novo transcriptome assembly in four social amoeba species. BMC Genomics 2017; 18:120. [PMID: 28143409 PMCID: PMC5282741 DOI: 10.1186/s12864-017-3505-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2016] [Accepted: 01/14/2017] [Indexed: 11/27/2022] Open
Abstract
Background Annotation of gene models and transcripts is a fundamental step in genome sequencing projects. Often this is performed with automated prediction pipelines, which can miss complex and atypical genes or transcripts. RNA sequencing (RNA-seq) data can aid the annotation with empirical data. Here we present de novo transcriptome assemblies generated from RNA-seq data in four Dictyostelid species: D. discoideum, P. pallidum, D. fasciculatum and D. lacteum. The assemblies were incorporated with existing gene models to determine corrections and improvement on a whole-genome scale. This is the first time this has been performed in these eukaryotic species. Results An initial de novo transcriptome assembly was generated by Trinity for each species and then refined with Program to Assemble Spliced Alignments (PASA). The completeness and quality were assessed with the Benchmarking Universal Single-Copy Orthologs (BUSCO) and Transrate tools at each stage of the assemblies. The final datasets of 11,315-12,849 transcripts contained 5,610-7,712 updates and corrections to >50% of existing gene models including changes to hundreds or thousands of protein products. Putative novel genes are also identified and alternative splice isoforms were observed for the first time in P. pallidum, D. lacteum and D. fasciculatum. Conclusions In taking a whole transcriptome approach to genome annotation with empirical data we have been able to enrich the annotations of four existing genome sequencing projects. In doing so we have identified updates to the majority of the gene annotations across all four species under study and found putative novel genes and transcripts which could be worthy for follow-up. The new transcriptome data we present here will be a valuable resource for genome curators in the Dictyostelia and we propose this effective methodology for use in other genome annotation projects. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3505-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Reema Singh
- Computational Biology, School of Life Sciences, University of Dundee, Dow Street, Dundee, UK.,Cell and Development Biology, School of Life Sciences, University of Dundee, Dow Street, Dundee, UK
| | - Hajara M Lawal
- Cell and Development Biology, School of Life Sciences, University of Dundee, Dow Street, Dundee, UK
| | - Christina Schilde
- Cell and Development Biology, School of Life Sciences, University of Dundee, Dow Street, Dundee, UK
| | - Gernot Glöckner
- Institute of Biochemistry I, Medical Faculty, University of Cologne, D-50931, Cologne, Germany.,Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Müggelseedamm 301, D-12587, Berlin, Germany
| | - Geoffrey J Barton
- Computational Biology, School of Life Sciences, University of Dundee, Dow Street, Dundee, UK
| | - Pauline Schaap
- Cell and Development Biology, School of Life Sciences, University of Dundee, Dow Street, Dundee, UK
| | - Christian Cole
- Computational Biology, School of Life Sciences, University of Dundee, Dow Street, Dundee, UK.
| |
Collapse
|