51
|
Lubośny M, Przyłucka A, Śmietanka B, Burzyński A. Semimytilus algosus: first known hermaphroditic mussel with doubly uniparental inheritance of mitochondrial DNA. Sci Rep 2020; 10:11256. [PMID: 32647112 PMCID: PMC7347871 DOI: 10.1038/s41598-020-67976-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 06/11/2020] [Indexed: 11/08/2022] Open
Abstract
Doubly uniparental inheritance (DUI) of mitochondrial DNA is a rare phenomenon occurring in some freshwater and marine bivalves and is usually characterized by the mitochondrial heteroplasmy of male individuals. Previous research on freshwater Unionida mussels showed that hermaphroditic species do not have DUI even if their closest gonochoristic counterparts do. No records showing DUI in a hermaphrodite have ever been reported. Here we show for the first time that the hermaphroditic mussel Semimytilus algosus (Mytilida), very likely has DUI, based on the complete sequences of both mitochondrial DNAs and the distribution of mtDNA types between male and female gonads. The two mitogenomes show considerable divergence (34.7%). The presumably paternal M type mitogenome dominated the male gonads of most studied mussels, while remaining at very low or undetectable levels in the female gonads of the same individuals. If indeed DUI can function in the context of simultaneous hermaphroditism, a change of paradigm regarding its involvement in sex determination is needed. It is apparently associated with gonadal differentiation rather than with sex determination in bivalves.
Collapse
Affiliation(s)
- Marek Lubośny
- Department of Genetics and Marine Biotechnology, Institute of Oceanology Polish Academy of Sciences, Sopot, Poland.
| | - Aleksandra Przyłucka
- Department of Genetics and Marine Biotechnology, Institute of Oceanology Polish Academy of Sciences, Sopot, Poland
| | - Beata Śmietanka
- Department of Genetics and Marine Biotechnology, Institute of Oceanology Polish Academy of Sciences, Sopot, Poland
| | - Artur Burzyński
- Department of Genetics and Marine Biotechnology, Institute of Oceanology Polish Academy of Sciences, Sopot, Poland
| |
Collapse
|
52
|
Machado AM, Muñoz-Merida A, Fonseca E, Veríssimo A, Pinto R, Felício M, da Fonseca RR, Froufe E, Castro LFC. Liver transcriptome resources of four commercially exploited teleost species. Sci Data 2020; 7:214. [PMID: 32636445 PMCID: PMC7340784 DOI: 10.1038/s41597-020-0565-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Accepted: 06/09/2020] [Indexed: 01/10/2023] Open
Abstract
The generation of omic resources is central to develop adequate management strategies for species with economic value. Here, we provide high-coverage RNA-seq datasets of liver tissue (containing between 80,2 and 88,4 million of paired-end reads) from four wildtype teleost species with high commercial value: Trachurus trachurus (TTR; Atlantic horse mackerel), Scomber scombrus (SSC; Atlantic mackerel), Trisopterus luscus (TLU; pout), and Micromesistius poutassou (MPO; blue whiting). A comprehensive assembly pipeline, using de novo single and multi-kmer assembly approaches, produced 64 single high-quality liver transcriptomes - 16 per species. The final assemblies, with N50 values ranging from 2,543-3,700 bp and BUSCO (Benchmarking Universal Single-Copy Orthologs) completeness values between 81.8-86.5% of the Actinopterygii gene set, were subjected to open reading frame (ORF) prediction and functional annotation. Our study provides the first transcriptomic resources for these species and offers valuable tools to evaluate both neutral and selected genetic variation among populations, and to identify candidate genes for environmental adaptation assisting in the investigation of the effects of global changes in fisheries.
Collapse
Affiliation(s)
- André M Machado
- CIIMAR - Interdisciplinary Centre of Marine and Environmental Research, U. Porto - University of Porto, Porto, Portugal.
| | - Antonio Muñoz-Merida
- CIBIO-InBIO, Research Network in Biodiversity and Evolutionary Biology, Universidade do Porto, Campus Agrário de Vairão, 4485-661, Vairão, Portugal
| | - Elza Fonseca
- CIIMAR - Interdisciplinary Centre of Marine and Environmental Research, U. Porto - University of Porto, Porto, Portugal
- Department of Biology, Faculty of Sciences, U. Porto - University of Porto, Porto, Portugal
| | - Ana Veríssimo
- CIBIO-InBIO, Research Network in Biodiversity and Evolutionary Biology, Universidade do Porto, Campus Agrário de Vairão, 4485-661, Vairão, Portugal
- Department of Biology, Faculty of Sciences, U. Porto - University of Porto, Porto, Portugal
| | - Rui Pinto
- CIIMAR - Interdisciplinary Centre of Marine and Environmental Research, U. Porto - University of Porto, Porto, Portugal
| | - Mónica Felício
- Portuguese Institute for the Sea and Atmosphere, I.P. (IPMA), Lisbon, Portugal
| | - Rute R da Fonseca
- Center for Macroecology, Evolution and Climate, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Elsa Froufe
- CIIMAR - Interdisciplinary Centre of Marine and Environmental Research, U. Porto - University of Porto, Porto, Portugal
| | - L Filipe C Castro
- CIIMAR - Interdisciplinary Centre of Marine and Environmental Research, U. Porto - University of Porto, Porto, Portugal.
- Department of Biology, Faculty of Sciences, U. Porto - University of Porto, Porto, Portugal.
| |
Collapse
|
53
|
De novo assembly and functional annotation of the heart + hemolymph transcriptome in the Caribbean spiny lobster Panulirus argus. Mar Genomics 2020; 54:100783. [PMID: 32414680 DOI: 10.1016/j.margen.2020.100783] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 04/29/2020] [Accepted: 05/02/2020] [Indexed: 12/29/2022]
Abstract
The spiny lobster, Panulirus argus, is an ecologically relevant species in shallow water coral reefs and a target of the most lucrative fishery in the greater Caribbean region. This study reports, for the first time, the heart + hemolymph transcriptome of the Caribbean spiny lobster Panulirus argus assembled from short Illumina 150bp PE raw reads. A total of 80,152,094 raw reads were assembled using the Oyster River Protocol pipeline. The assembly resulted in a total of 254,773 transcripts. Functional gene annotation was conducted using the software package 'dammit'. Lastly, gene enrichment analyses were conducted using the Gene Ontology (GO) and KEGG pathway (Kaas) databases. This resource will be of utmost importance in future research aiming at exploring the effect of local and regional anthropogenic disturbances, as well as global climate change on the molecular physiology of this overexploited species.
Collapse
|
54
|
Freedman AH, Clamp M, Sackton TB. Error, noise and bias in de novo transcriptome assemblies. Mol Ecol Resour 2020; 21:18-29. [PMID: 32180366 DOI: 10.1111/1755-0998.13156] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 01/25/2020] [Accepted: 03/10/2020] [Indexed: 12/21/2022]
Abstract
De novo transcriptome assembly is a powerful tool, and has been widely used over the last decade for making evolutionary inferences. However, it relies on two implicit assumptions: that the assembled transcriptome is an unbiased representation of the underlying expressed transcriptome, and that expression estimates from the assembly are good, if noisy approximations of the relative abundance of expressed transcripts. Using publicly available data for model organisms, we demonstrate that, across assembly algorithms and data sets, these assumptions are consistently violated. Bias exists at the nucleotide level, with genotyping error rates ranging from 30% to 83%. As a result, diversity is underestimated in transcriptome assemblies, with consistent underestimation of heterozygosity in all but the most inbred samples. Even at the gene level, expression estimates show wide deviations from map-to-reference estimates, and positive bias at lower expression levels. Standard filtering of transcriptome assemblies improves the robustness of gene expression estimates but leads to the loss of a meaningful number of protein-coding genes, including many that are highly expressed. We demonstrate a computational method, length-rescaled CPM, to partly alleviate noise and bias in expression estimates. Researchers should consider ways to minimize the impact of bias in transcriptome assemblies.
Collapse
Affiliation(s)
- Adam H Freedman
- Faculty of Arts and Sciences Informatics Group, Harvard University, Cambridge, MA, USA
| | - Michele Clamp
- Faculty of Arts and Sciences Informatics Group, Harvard University, Cambridge, MA, USA
| | - Timothy B Sackton
- Faculty of Arts and Sciences Informatics Group, Harvard University, Cambridge, MA, USA
| |
Collapse
|
55
|
Hardin A, Nevonen KA, Eckalbar WL, Carbone L, Ahituv N. Comparative Genomic Characterization of the Multimammate Mouse Mastomys coucha. Mol Biol Evol 2020; 36:2805-2812. [PMID: 31424545 PMCID: PMC6878952 DOI: 10.1093/molbev/msz188] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Mastomys are the most widespread African rodent and carriers of various diseases such as the plague or Lassa virus. In addition, mastomys have rapidly gained a large number of mammary glands. Here, we generated a genome, variome, and transcriptomes for Mastomys coucha. As mastomys diverged at similar times from mouse and rat, we demonstrate their utility as a comparative genomic tool for these commonly used animal models. Furthermore, we identified over 500 mastomys accelerated regions, often residing near important mammary developmental genes or within their exons leading to protein sequence changes. Functional characterization of a noncoding mastomys accelerated region, located in the HoxD locus, showed enhancer activity in mouse developing mammary glands. Combined, our results provide genomic resources for mastomys and highlight their potential both as a comparative genomic tool and for the identification of mammary gland number determining factors.
Collapse
Affiliation(s)
- Aaron Hardin
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA.,Institute for Human Genetics, University of California, San Francisco, San Francisco, CA
| | - Kimberly A Nevonen
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR
| | - Walter L Eckalbar
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA.,Institute for Human Genetics, University of California, San Francisco, San Francisco, CA
| | - Lucia Carbone
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR.,Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, OR.,Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR.,Division of Genetics, Oregon National Primate Research Center, Beaverton, OR
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA.,Institute for Human Genetics, University of California, San Francisco, San Francisco, CA
| |
Collapse
|
56
|
Lubośny M, Śmietanka B, Przyłucka A, Burzyński A. Highly divergent mitogenomes ofGeukensia demissa(Bivalvia, Mytilidae) with extreme AT content. J ZOOL SYST EVOL RES 2020. [DOI: 10.1111/jzs.12354] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Marek Lubośny
- Department of Genetics and Marine Biotechnology Institute of Oceanology Polish Academy of Sciences Sopot Poland
| | - Beata Śmietanka
- Department of Genetics and Marine Biotechnology Institute of Oceanology Polish Academy of Sciences Sopot Poland
| | - Aleksandra Przyłucka
- Department of Genetics and Marine Biotechnology Institute of Oceanology Polish Academy of Sciences Sopot Poland
| | - Artur Burzyński
- Department of Genetics and Marine Biotechnology Institute of Oceanology Polish Academy of Sciences Sopot Poland
| |
Collapse
|
57
|
Edenborough KM, Bokelmann M, Lander A, Couacy-Hymann E, Lechner J, Drechsel O, Renard BY, Radonić A, Feldmann H, Kurth A, Prescott J. Dendritic Cells Generated From Mops condylurus, a Likely Filovirus Reservoir Host, Are Susceptible to and Activated by Zaire Ebolavirus Infection. Front Immunol 2019; 10:2414. [PMID: 31681302 PMCID: PMC6797855 DOI: 10.3389/fimmu.2019.02414] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Accepted: 09/26/2019] [Indexed: 12/17/2022] Open
Abstract
Ebola virus infection of human dendritic cells (DCs) induces atypical adaptive immune responses and thereby exacerbates Ebola virus disease (EVD). Human DCs, infected with Ebola virus aberrantly express low levels of the DC activation markers CD80, CD86, and MHC class II. The T cell responses ensuing are commonly anergic rather than protective against EVD. We hypothesize that DCs derived from potential reservoir hosts such as bats, which do not develop disease signs in response to Ebola virus infection, would exhibit features associated with activation. In this study, we have examined Zaire ebolavirus (EBOV) infection of DCs derived from the Angolan free-tailed bat species, Mops condylurus. This species was previously identified as permissive to EBOV infection in vivo, in the absence of disease signs. M. condylurus has also been recently implicated as the reservoir host for Bombali ebolavirus, a virus species that is closely related to EBOV. Due to the absence of pre-existing M. condylurus species-specific reagents, we characterized its de novo assembled transcriptome and defined its phylogenetic similarity to other mammals, which enabled the identification of cross-reactive reagents for M. condylurus bone marrow-derived DC (bat-BMDC) differentiation and immune cell phenotyping. Our results reveal that bat-BMDCs are susceptible to EBOV infection as determined by detection of EBOV specific viral RNA (vRNA). vRNA increased significantly 72 h after EBOV-infection and was detected in both cells and in culture supernatants. Bat-BMDC infection was further confirmed by the observation of GFP expression in DC cultures infected with a recombinant GFP-EBOV. Bat-BMDCs upregulated CD80 and chemokine ligand 3 (CCL3) transcripts in response to EBOV infection, which positively correlated with the expression levels of EBOV vRNA. In contrast to the aberrant responses to EBOV infection that are typical for human-DC, our findings from bat-BMDCs provide evidence for an immunological basis of asymptomatic EBOV infection outcomes.
Collapse
Affiliation(s)
- Kathryn M. Edenborough
- Centre for Biological Threats and Special Pathogens, Robert Koch Institute, Berlin, Germany
| | - Marcel Bokelmann
- Centre for Biological Threats and Special Pathogens, Robert Koch Institute, Berlin, Germany
| | - Angelika Lander
- Centre for Biological Threats and Special Pathogens, Robert Koch Institute, Berlin, Germany
| | - Emmanuel Couacy-Hymann
- LANADA, Laboratoire National d'Appui au Développement Agricole, Bingerville, Côte d'Ivoire
| | - Johanna Lechner
- Methodology and Research Infrastructure, Robert Koch Institute, Berlin, Germany
| | - Oliver Drechsel
- Methodology and Research Infrastructure, Robert Koch Institute, Berlin, Germany
| | - Bernhard Y. Renard
- Methodology and Research Infrastructure, Robert Koch Institute, Berlin, Germany
| | - Aleksandar Radonić
- Methodology and Research Infrastructure, Robert Koch Institute, Berlin, Germany
| | - Heinz Feldmann
- Laboratory of Virology, Division of Intramural Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Hamilton, ON, United States
| | - Andreas Kurth
- Centre for Biological Threats and Special Pathogens, Robert Koch Institute, Berlin, Germany
| | - Joseph Prescott
- Centre for Biological Threats and Special Pathogens, Robert Koch Institute, Berlin, Germany
| |
Collapse
|
58
|
Yohe LR, Davies KTJ, Simmons NB, Sears KE, Dumont ER, Rossiter SJ, Dávalos LM. Evaluating the performance of targeted sequence capture, RNA-Seq, and degenerate-primer PCR cloning for sequencing the largest mammalian multigene family. Mol Ecol Resour 2019; 20:140-153. [PMID: 31523924 DOI: 10.1111/1755-0998.13093] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2019] [Revised: 08/27/2019] [Accepted: 09/06/2019] [Indexed: 12/18/2022]
Abstract
Multigene families evolve from single-copy ancestral genes via duplication, and typically encode proteins critical to key biological processes. Molecular analyses of these gene families require high-confidence sequences, but the high sequence similarity of the members can create challenges for sequencing and downstream analyses. Focusing on the common vampire bat, Desmodus rotundus, we evaluated how different sequencing approaches performed in recovering the largest mammalian protein-coding multigene family: olfactory receptors (OR). Using the genome as a reference, we determined the proportion of intact protein-coding receptors recovered by: (a) amplicons from degenerate primers sequenced via Sanger technology, (b) RNA-Seq of the main olfactory epithelium, and (c) those genes captured with probes designed from transcriptomes of closely-related species. Our initial re-annotation of the high-quality vampire bat genome resulted in >400 intact OR genes, more than doubling the original estimate. Sanger-sequenced amplicons performed the poorest among the three approaches, detecting <33% of receptors in the genome. In contrast, the transcriptome reliably recovered >50% of the annotated genomic ORs, and targeted sequence capture recovered nearly 75% of annotated genes. Each sequencing approach assembled high-quality sequences, even if it did not recover all receptors in the genome. While some variation may be due to limitations of the study design (e.g., different individuals), variation among approaches was mostly caused by low coverage of some receptors rather than high rates of assembly error. Given this variability, we caution against using the counts of intact receptors per species to model the birth-death process of multigene families. Instead, our results support the use of orthologous sequences to explore and model the evolutionary processes shaping these genes.
Collapse
Affiliation(s)
- Laurel R Yohe
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA.,Department of Geology and Geophysics, Yale University, Stony Brook, NY, USA
| | - Kalina T J Davies
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| | - Nancy B Simmons
- Department of Mammalogy, Division of Vertebrate Zoology, American Museum of Natural History, New York, NY, USA
| | - Karen E Sears
- Department of Ecology and Evolutionary Biology, UCLA, Los Angeles, CA, USA
| | - Elizabeth R Dumont
- School of Natural Sciences, University of California Merced, Merced, CA, USA
| | - Stephen J Rossiter
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| | - Liliana M Dávalos
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA.,Consortium for Inter-Disciplinary Environmental Research, Stony Brook University, Stony Brook, NY, USA
| |
Collapse
|
59
|
Yohe LR, Davies KTJ, Rossiter SJ, Dávalos LM. Expressed Vomeronasal Type-1 Receptors (V1rs) in Bats Uncover Conserved Sequences Underlying Social Chemical Signaling. Genome Biol Evol 2019; 11:2741-2749. [PMID: 31424505 PMCID: PMC6777432 DOI: 10.1093/gbe/evz179] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/10/2019] [Indexed: 01/08/2023] Open
Abstract
In mammals, social and reproductive behaviors are mediated by chemical cues encoded by hyperdiverse families of receptors expressed in the vomeronasal organ. Between species, the number of intact receptors can vary by orders of magnitude. However, the evolutionary processes behind variation in receptor number, and its link to fitness-related behaviors are not well understood. From vomeronasal transcriptomes, we discovered the first evidence of intact vomeronasal type-1 receptor (V1r) genes in bats, and we tested whether putatively functional bat receptors were orthologous to those of related taxa, or whether bats have evolved novel receptors. Instead of lineage-specific duplications, we found that bat V1rs show high levels of orthology to those of their relatives, and receptors are under comparative levels of purifying selection as non-bats. Despite widespread vomeronasal organ loss in bats, V1r copies have been retained for >65 million years. The highly conserved nature of bat V1rs challenges our current understanding of mammalian V1r function and suggests roles other than conspecific recognition or mating initiation in social behavior.
Collapse
Affiliation(s)
- Laurel R Yohe
- Department of Ecology and Evolution, Stony Brook University, NY
- Department of Geology & Geophysics, Yale University, New Haven, CT
| | - Kalina T J Davies
- School of Biological and Chemical Sciences, Queen Mary University of London, United Kingdom
| | - Stephen J Rossiter
- School of Biological and Chemical Sciences, Queen Mary University of London, United Kingdom
| | - Liliana M Dávalos
- Department of Ecology and Evolution, Stony Brook University, NY
- Consortium for Inter-Disciplinary Environmental Research, Stony Brook University, Stony Brook, NY
| |
Collapse
|
60
|
Stuckert AMM, Moore E, Coyle KP, Davison I, MacManes MD, Roberts R, Summers K. Variation in pigmentation gene expression is associated with distinct aposematic color morphs in the poison frog Dendrobates auratus. BMC Evol Biol 2019; 19:85. [PMID: 30995908 PMCID: PMC6472079 DOI: 10.1186/s12862-019-1410-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 03/15/2019] [Indexed: 12/28/2022] Open
Abstract
Background Color and pattern phenotypes have clear implications for survival and reproduction in many species. However, the mechanisms that produce this coloration are still poorly characterized, especially at the genomic level. Here we have taken a transcriptomics-based approach to elucidate the underlying genetic mechanisms affecting color and pattern in a highly polytypic poison frog. We sequenced RNA from the skin from four different color morphs during the final stage of metamorphosis and assembled a de novo transcriptome. We then investigated differential gene expression, with an emphasis on examining candidate color genes from other taxa. Results Overall, we found differential expression of a suite of genes that control melanogenesis, melanocyte differentiation, and melanocyte proliferation (e.g., tyrp1, lef1, leo1, and mitf) as well as several differentially expressed genes involved in purine synthesis and iridophore development (e.g., arfgap1, arfgap2, airc, and gart). Conclusions Our results provide evidence that several gene networks known to affect color and pattern in vertebrates play a role in color and pattern variation in this species of poison frog. Electronic supplementary material The online version of this article (10.1186/s12862-019-1410-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Adam M M Stuckert
- Department of Biology, East Carolina University, Greenville, North Carolina, USA. .,Hubbard Center for Genome Studies, University of New Hampshire, Durham, New Hampshire, USA. .,Department of Molecular, Cellular & Biomedical Sciences, University of New Hampshire, Durham, New Hampshire, USA.
| | - Emily Moore
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Kaitlin P Coyle
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Ian Davison
- Department of Biology, East Carolina University, Greenville, North Carolina, USA
| | - Matthew D MacManes
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, New Hampshire, USA.,Department of Molecular, Cellular & Biomedical Sciences, University of New Hampshire, Durham, New Hampshire, USA
| | - Reade Roberts
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Kyle Summers
- Department of Biology, East Carolina University, Greenville, North Carolina, USA
| |
Collapse
|
61
|
Johnson LK, Alexander H, Brown CT. Re-assembly, quality evaluation, and annotation of 678 microbial eukaryotic reference transcriptomes. Gigascience 2019; 8:giy158. [PMID: 30544207 PMCID: PMC6481552 DOI: 10.1093/gigascience/giy158] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Revised: 09/18/2018] [Accepted: 11/29/2018] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND De novo transcriptome assemblies are required prior to analyzing RNA sequencing data from a species without an existing reference genome or transcriptome. Despite the prevalence of transcriptomic studies, the effects of using different workflows, or "pipelines," on the resulting assemblies are poorly understood. Here, a pipeline was programmatically automated and used to assemble and annotate raw transcriptomic short-read data collected as part of the Marine Microbial Eukaryotic Transcriptome Sequencing Project. The resulting transcriptome assemblies were evaluated and compared against assemblies that were previously generated with a different pipeline developed by the National Center for Genome Research. RESULTS New transcriptome assemblies contained the majority of previous contigs as well as new content. On average, 7.8% of the annotated contigs in the new assemblies were novel gene names not found in the previous assemblies. Taxonomic trends were observed in the assembly metrics. Assemblies from the Dinoflagellata showed a higher number of contigs and unique k-mers than transcriptomes from other phyla, while assemblies from Ciliophora had a lower percentage of open reading frames compared to other phyla. CONCLUSIONS Given current bioinformatics approaches, there is no single "best" reference transcriptome for a particular set of raw data. As the optimum transcriptome is a moving target, improving (or not) with new tools and approaches, automated and programmable pipelines are invaluable for managing the computationally intensive tasks required for re-processing large sets of samples with revised pipelines and ensuring a common evaluation workflow is applied to all samples. Thus, re-assembling existing data with new tools using automated and programmable pipelines may yield more accurate identification of taxon-specific trends across samples in addition to novel and useful products for the community.
Collapse
Affiliation(s)
- Lisa K Johnson
- Department of Population Health, and Reproduction, School of Veterinary Medicine, University of California Davis, One Shields Ave, Davis, CA 95616, USA
- Molecular, Cellular, and Integrative Physiology Graduate Group, University of California Davis, One Shields Ave, Davis, CA 95616, USA
| | - Harriet Alexander
- Department of Population Health, and Reproduction, School of Veterinary Medicine, University of California Davis, One Shields Ave, Davis, CA 95616, USA
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543, USA
| | - C Titus Brown
- Department of Population Health, and Reproduction, School of Veterinary Medicine, University of California Davis, One Shields Ave, Davis, CA 95616, USA
- Molecular, Cellular, and Integrative Physiology Graduate Group, University of California Davis, One Shields Ave, Davis, CA 95616, USA
- Genome Center, University of California Davis, 451 Health Sciences Dr, Davis, CA 95616, USA
| |
Collapse
|
62
|
Pinto BJ, Card DC, Castoe TA, Diaz RE, Nielsen SV, Trainor PA, Gamble T. The transcriptome of the veiled chameleon (Chamaeleo calyptratus): A resource for studying the evolution and development of vertebrates. Dev Dyn 2019; 248:702-708. [PMID: 30839129 DOI: 10.1002/dvdy.20] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 02/26/2019] [Accepted: 02/26/2019] [Indexed: 12/14/2022] Open
Abstract
PURPOSE The veiled chameleon (Chamaeleo calyptratus) is an emerging model system for studying functional morphology and evolutionary developmental biology (evo-devo). Chameleons possess body plans that are highly adapted to an arboreal life style, featuring laterally compressed bodies, split hands/ft for grasping, a projectile tongue, turreted independently moving eyes, and a prehensile tail. Despite being one of the most phenotypically divergent clades of tetrapods, genomic resources for chameleons are severely lacking. METHODS To address this lack of resources, we used RNAseq to generate 288 million raw Illumina sequence reads from four adult tissues (male and female eyes and gonads) and whole embryos at three distinct developmental stages. We used these data to assemble a largely complete de novo transcriptome consisting of only 82 952 transcripts. In addition, a majority of assembled transcripts (67%) were successfully annotated. RESULTS We then demonstrated the utility of these data in the context of studying visual system evolution by examining the content of veiled chameleon opsin genes to show that chameleons possess all five ancestral tetrapod opsins. CONCLUSION We present this de novo, annotated, multi-tissue transcriptome assembly for the Veiled Chameleon, Chamaeleo calyptratus, as a resource to address a range of evolutionary and developmental questions. The associated raw reads and final annotated transcriptome assembly are freely available for use on NCBI and Figshare, respectively.
Collapse
Affiliation(s)
- Brendan J Pinto
- Department of Biological Sciences, Marquette University, Milwaukee, Wisconsin
| | - Daren C Card
- Department of Biology, The University of Texas at Arlington, Arlington, Texas
| | - Todd A Castoe
- Department of Biology, The University of Texas at Arlington, Arlington, Texas
| | - Raul E Diaz
- Department of Biological Sciences, Southeastern Louisiana University, Hammond, Louisiana.,Natural History Museum of Los Angeles County, Los Angeles, California
| | - Stuart V Nielsen
- Department of Biological Sciences, Marquette University, Milwaukee, Wisconsin
| | - Paul A Trainor
- Department of Anatomy & Cell Biology, Stowers Institute for Medical Research, Kansas City, Missouri.,Department of Anatomy and Cell Biology, University of Kansas Medical Center, Kansas City, Kansas
| | - Tony Gamble
- Department of Biological Sciences, Marquette University, Milwaukee, Wisconsin.,Milwaukee Public Museum, Milwaukee, Wisconsin.,Bell Museum of Natural History, University of Minnesota, St Paul, Minnesota
| |
Collapse
|
63
|
Alexander H, Johnson LK, Brown CT. Keeping it light: (re)analyzing community-wide datasets without major infrastructure. Gigascience 2019; 8:5241892. [PMID: 30544142 PMCID: PMC6350038 DOI: 10.1093/gigascience/giy159] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 11/29/2018] [Indexed: 12/19/2022] Open
Abstract
DNA sequencing technology has revolutionized the field of biology, shifting biology from a data-limited to data-rich state. Central to the interpretation of sequencing data are the computational tools and approaches that convert raw data into biologically meaningful information. Both the tools and the generation of data are actively evolving, yet the practice of re-analysis of previously generated data with new tools is not commonplace. Re-analysis of existing data provides an affordable means of generating new information and will likely become more routine within biology, yet necessitates a new set of considerations for best practices and resource development. Here, we discuss several practices that we believe to be broadly applicable when re-analyzing data, especially when done by small research groups.
Collapse
Affiliation(s)
- Harriet Alexander
- Population Health and Reproduction, University of California Davis, 1 Shields Ave., Davis, CA, 95616, USA.,Biology Department, Woods Hole Oceanographic Institution, 266 Woods Hole Rd., Woods Hole, MA, 02543, USA
| | - Lisa K Johnson
- Population Health and Reproduction, University of California Davis, 1 Shields Ave., Davis, CA, 95616, USA.,Molecular, Cellular, and Integrative Physiology Graduate Group, University of California Davis, 1 Shields Ave., Davis, CA, 95616, USA
| | - C Titus Brown
- Population Health and Reproduction, University of California Davis, 1 Shields Ave., Davis, CA, 95616, USA.,Molecular, Cellular, and Integrative Physiology Graduate Group, University of California Davis, 1 Shields Ave., Davis, CA, 95616, USA.,Genome Center, University of California Davis, 1 Shields Ave, Davis, CA, 95616, USA
| |
Collapse
|