1
|
Sun H, Zhao W, Lin R, Zhou Z, Huai W, Yao Y. The conserved mitochondrial genome of the jewel beetle (Coleoptera: Buprestidae) and its phylogenetic implications for the suborder Polyphaga. Genomics 2020; 112:3713-3721. [PMID: 32360911 DOI: 10.1016/j.ygeno.2020.04.026] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 04/22/2020] [Accepted: 04/27/2020] [Indexed: 01/04/2023]
Abstract
In this study, we sequenced the mitochondrial (mt) genome of Agrilus mali (Coleoptera: Buprestidae) using next-generation sequencing, and accordingly annotated 13 protein-coding, 22 tRNA, and 2 rRNA genes and a 1458-bp non-coding region. Comparative analysis indicated that the mt genome of A. mali is relatively conserved, with a typical gene content and order identical to those of other coleopterans. However, the newly sequenced mt genome is characterized by a relatively higher A + T content compared with that of other species within the family Buprestidae. Phylogenetic analysis based on Bayesian inference revealed that the evolutionary relationship among the six infraorders of the suborder Polyphaga is (Scirtiformia + (Elateriformia + ((Scarabaeiformia + Staphyliniformia) + (Bostrichiformia + (Cucujiformia))))). However, the topology indicated that the family Buprestidae is a sister group to other Polyphaga infraorders, excluding Scirtiformia as a monophyly, and thus the monophyly of Elateriformia was not supported. This study not only presents the mt genome of a species in the family Buprestidae and a comparative analysis of jewel beetles but also examines the contribution of mt genomes in elucidating phylogenetic relationships within the suborder Polyphaga of Coleoptera.
Collapse
Affiliation(s)
- Huiquan Sun
- Key Laboratory of Forest Protection of National Forestry and Grassland Administration/Research Institute of Forest Ecology, Environment and Protection, Chinese Academy of Forestry, Beijing 100091, China
| | - Wenxia Zhao
- Key Laboratory of Forest Protection of National Forestry and Grassland Administration/Research Institute of Forest Ecology, Environment and Protection, Chinese Academy of Forestry, Beijing 100091, China
| | - Ruozhu Lin
- Key Laboratory of Forest Protection of National Forestry and Grassland Administration/Research Institute of Forest Ecology, Environment and Protection, Chinese Academy of Forestry, Beijing 100091, China
| | - Zhongfu Zhou
- Key Laboratory of Forest Protection of National Forestry and Grassland Administration/Research Institute of Forest Ecology, Environment and Protection, Chinese Academy of Forestry, Beijing 100091, China
| | - Wenxia Huai
- Key Laboratory of Forest Protection of National Forestry and Grassland Administration/Research Institute of Forest Ecology, Environment and Protection, Chinese Academy of Forestry, Beijing 100091, China
| | - Yanxia Yao
- Key Laboratory of Forest Protection of National Forestry and Grassland Administration/Research Institute of Forest Ecology, Environment and Protection, Chinese Academy of Forestry, Beijing 100091, China.
| |
Collapse
|
2
|
Dong Y, Guan M, Wang L, Yuan L, Sun X, Liu S. Transcriptome Analysis of Low-Temperature-Induced Breaking of Garlic Aerial Bulb Dormancy. Int J Genomics 2019; 2019:9140572. [PMID: 31485438 PMCID: PMC6702850 DOI: 10.1155/2019/9140572] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 04/25/2019] [Accepted: 05/19/2019] [Indexed: 11/18/2022] Open
Abstract
The long history of asexual reproduction of garlic using garlic cloves has resulted in virus accumulation and genetic depression. Propagation of garlic seedlings by tissue culture can both eliminate viruses and improve breeding efficiency. Aerial bulbs are the first-choice materials for breeding virus-free garlic seedlings under external conditions, but they show dormancy just like garlic bulbs. However, low temperatures can quickly break dormancy. In this research, we used a high-throughput sequencing method to sequence aerial bulbs during dormancy and after low-temperature-induced breaking of dormancy to screen out the key differentially expressed genes (DEGs) associated with low temperature and to provide a theoretical basis for exploring the molecular mechanism of low-temperature-induced breaking of aerial bulb dormancy. The high-throughput transcriptome sequencing results showed that 6,675 DEGs were upregulated and 36,596 DEGs were downregulated in the aerial bulbs given low-temperature treatment. Then, 19,507 DEGs were assigned KEGG annotations, among which most DEGs were annotated to the metabolism pathway (11,817 genes, accounting for 60.58%), followed by the genetic information processing pathway (4,521 genes, accounting for 23.18%). The DEGs were mostly concentrated in pathways such as protein processing in the endoplasmic reticulum, plant-pathogen interaction, plant hormone signal transduction, and ribosome biogenesis in eukaryotes, with significant enrichment. The key DEGs related to calcium signaling, hormonal signaling, and transcription factors were screened out, including CaM, CDPK, and CML in accessory pathways of calcium signaling; GA20ox, GAI1, and GA2ox in accessory pathways of hormonal signaling; and transcription factor genes such as MYB, AP2/ERF, bHLH, MADS, and bZIP. qRT-PCR verification results were consistent with the sequencing results, indicating that the transcriptome sequencing data were accurate and reliable. Our results provide a theoretical basis for breaking the dormancy of aerial bulbs with low-temperature treatment to produce virus-free seedlings and increase the output and quality of garlic.
Collapse
Affiliation(s)
- Yuhui Dong
- State Key Laboratory of Crop Biology, Shandong Garlic Engineering Research Center, College of Horticulture Science and Engineering, Shandong Agricultural University, No. 61, Daizong Road, Tai'an, Shandong Province 271000, China
| | - Mengjiao Guan
- State Key Laboratory of Crop Biology, Shandong Garlic Engineering Research Center, College of Horticulture Science and Engineering, Shandong Agricultural University, No. 61, Daizong Road, Tai'an, Shandong Province 271000, China
| | - Lixia Wang
- State Key Laboratory of Crop Biology, Shandong Garlic Engineering Research Center, College of Horticulture Science and Engineering, Shandong Agricultural University, No. 61, Daizong Road, Tai'an, Shandong Province 271000, China
| | - Lei Yuan
- State Key Laboratory of Crop Biology, Shandong Garlic Engineering Research Center, College of Horticulture Science and Engineering, Shandong Agricultural University, No. 61, Daizong Road, Tai'an, Shandong Province 271000, China
| | - Xiudong Sun
- State Key Laboratory of Crop Biology, Shandong Garlic Engineering Research Center, College of Horticulture Science and Engineering, Shandong Agricultural University, No. 61, Daizong Road, Tai'an, Shandong Province 271000, China
| | - Shiqi Liu
- State Key Laboratory of Crop Biology, Shandong Garlic Engineering Research Center, College of Horticulture Science and Engineering, Shandong Agricultural University, No. 61, Daizong Road, Tai'an, Shandong Province 271000, China
| |
Collapse
|
3
|
Zhang SQ, Che LH, Li Y, Dan Liang, Pang H, Ślipiński A, Zhang P. Evolutionary history of Coleoptera revealed by extensive sampling of genes and species. Nat Commun 2018; 9:205. [PMID: 29335414 PMCID: PMC5768713 DOI: 10.1038/s41467-017-02644-4] [Citation(s) in RCA: 190] [Impact Index Per Article: 27.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 12/15/2017] [Indexed: 01/04/2023] Open
Abstract
Beetles (Coleoptera) are the most diverse and species-rich group of insects, and a robust, time-calibrated phylogeny is fundamental to understanding macroevolutionary processes that underlie their diversity. Here we infer the phylogeny and divergence times of all major lineages of Coleoptera by analyzing 95 protein-coding genes in 373 beetle species, including ~67% of the currently recognized families. The subordinal relationships are strongly supported as Polyphaga (Adephaga (Archostemata, Myxophaga)). The series and superfamilies of Polyphaga are mostly monophyletic. The species-poor Nosodendridae is robustly recovered in a novel position sister to Staphyliniformia, Bostrichiformia, and Cucujiformia. Our divergence time analyses suggest that the crown group of extant beetles occurred ~297 million years ago (Mya) and that ~64% of families originated in the Cretaceous. Most of the herbivorous families experienced a significant increase in diversification rate during the Cretaceous, thus suggesting that the rise of angiosperms in the Cretaceous may have been an 'evolutionary impetus' driving the hyperdiversity of herbivorous beetles.
Collapse
Affiliation(s)
- Shao-Qian Zhang
- State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510006, China
| | - Li-Heng Che
- State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510006, China
| | - Yun Li
- State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510006, China
| | - Dan Liang
- State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510006, China
| | - Hong Pang
- State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510006, China
| | - Adam Ślipiński
- Australian National Insect Collection, CSIRO, GPO Box 1700, Canberra, ACT 2601, Australia.
| | - Peng Zhang
- State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510006, China.
| |
Collapse
|
4
|
Abstract
With a million described species and more than half a billion preserved specimens, the large scale of insect collections is unequaled by those of any other group. Advances in genomics, collection digitization, and imaging have begun to more fully harness the power that such large data stores can provide. These new approaches and technologies have transformed how entomological collections are managed and utilized. While genomic research has fundamentally changed the way many specimens are collected and curated, advances in technology have shown promise for extracting sequence data from the vast holdings already in museums. Efforts to mainstream specimen digitization have taken root and have accelerated traditional taxonomic studies as well as distribution modeling and global change research. Emerging imaging technologies such as microcomputed tomography and confocal laser scanning microscopy are changing how morphology can be investigated. This review provides an overview of how the realization of big data has transformed our field and what may lie in store.
Collapse
Affiliation(s)
- Andrew Edward Z Short
- Department of Ecology and Evolutionary Biology; and Division of Entomology, Biodiversity Institute, University of Kansas, Lawrence, Kansas 66045, USA;
| | - Torsten Dikow
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20013, USA;
| | - Corrie S Moreau
- Department of Science and Education, Field Museum of Natural History, Chicago, Illinois 60605, USA;
| |
Collapse
|
5
|
Che LH, Zhang SQ, Li Y, Liang D, Pang H, Ślipiński A, Zhang P. Genome-wide survey of nuclear protein-coding markers for beetle phylogenetics and their application in resolving both deep and shallow-level divergences. Mol Ecol Resour 2017; 17:1342-1358. [DOI: 10.1111/1755-0998.12664] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Revised: 01/09/2017] [Accepted: 02/14/2017] [Indexed: 11/27/2022]
Affiliation(s)
- Li-Heng Che
- State Key Laboratory of Biocontrol; College of Ecology and Evolution; School of Life Sciences; Sun Yat-Sen University; Guangzhou 510006; Guangdong Province China
| | - Shao-Qian Zhang
- State Key Laboratory of Biocontrol; College of Ecology and Evolution; School of Life Sciences; Sun Yat-Sen University; Guangzhou 510006; Guangdong Province China
| | - Yun Li
- State Key Laboratory of Biocontrol; College of Ecology and Evolution; School of Life Sciences; Sun Yat-Sen University; Guangzhou 510006; Guangdong Province China
| | - Dan Liang
- State Key Laboratory of Biocontrol; College of Ecology and Evolution; School of Life Sciences; Sun Yat-Sen University; Guangzhou 510006; Guangdong Province China
| | - Hong Pang
- State Key Laboratory of Biocontrol; College of Ecology and Evolution; School of Life Sciences; Sun Yat-Sen University; Guangzhou 510006; Guangdong Province China
| | - Adam Ślipiński
- Australian National Insect Collection; CSIRO; GPO Box 1700 Canberra ACT 2601 Australia
| | - Peng Zhang
- State Key Laboratory of Biocontrol; College of Ecology and Evolution; School of Life Sciences; Sun Yat-Sen University; Guangzhou 510006; Guangdong Province China
| |
Collapse
|
6
|
Yuan ML, Zhang QL, Zhang L, Guo ZL, Liu YJ, Shen YY, Shao R. High-level phylogeny of the Coleoptera inferred with mitochondrial genome sequences. Mol Phylogenet Evol 2016; 104:99-111. [DOI: 10.1016/j.ympev.2016.08.002] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2015] [Revised: 07/26/2016] [Accepted: 08/02/2016] [Indexed: 01/03/2023]
|
7
|
Genomic Mining of Phylogenetically Informative Nuclear Markers in Bark and Ambrosia Beetles. PLoS One 2016; 11:e0163529. [PMID: 27668729 PMCID: PMC5036811 DOI: 10.1371/journal.pone.0163529] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Accepted: 09/10/2016] [Indexed: 11/19/2022] Open
Abstract
Deep level insect relationships are generally difficult to resolve, especially within taxa of the most diverse and species rich holometabolous orders. In beetles, the major diversity occurs in the Phytophaga, including charismatic groups such as leaf beetles, longhorn beetles and weevils. Bark and ambrosia beetles are wood boring weevils that contribute 12 percent of the diversity encountered in Curculionidae, one of the largest families of beetles with more than 50000 described species. Phylogenetic resolution in groups of Cretaceous age has proven particularly difficult and requires large quantity of data. In this study, we investigated 100 nuclear genes in order to select a number of markers with low evolutionary rates and high phylogenetic signal. A PCR screening using degenerate primers was applied to 26 different weevil species. We obtained sequences from 57 of the 100 targeted genes. Sequences from each nuclear marker were aligned and examined for detecting multiple copies, pseudogenes and introns. Phylogenetic informativeness (PI) and the capacity for reconstruction of previously established phylogenetic relationships were used as proxies for selecting a subset of the 57 amplified genes. Finally, we selected 16 markers suitable for large-scale phylogenetics of Scolytinae and related weevil taxa.
Collapse
|
8
|
Phylogenetic analyses and evolutionary timescale of Coleoptera based on mitochondrial sequence. BIOCHEM SYST ECOL 2016. [DOI: 10.1016/j.bse.2016.04.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
9
|
Timmermans MJTN, Barton C, Haran J, Ahrens D, Culverwell CL, Ollikainen A, Dodsworth S, Foster PG, Bocak L, Vogler AP. Family-Level Sampling of Mitochondrial Genomes in Coleoptera: Compositional Heterogeneity and Phylogenetics. Genome Biol Evol 2015; 8:161-75. [PMID: 26645679 PMCID: PMC4758238 DOI: 10.1093/gbe/evv241] [Citation(s) in RCA: 132] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/28/2015] [Indexed: 12/29/2022] Open
Abstract
Mitochondrial genomes are readily sequenced with recent technology and thus evolutionary lineages can be densely sampled. This permits better phylogenetic estimates and assessment of potential biases resulting from heterogeneity in nucleotide composition and rate of change. We gathered 245 mitochondrial sequences for the Coleoptera representing all 4 suborders, 15 superfamilies of Polyphaga, and altogether 97 families, including 159 newly sequenced full or partial mitogenomes. Compositional heterogeneity greatly affected 3rd codon positions, and to a lesser extent the 1st and 2nd positions, even after RY coding. Heterogeneity also affected the encoded protein sequence, in particular in the nad2, nad4, nad5, and nad6 genes. Credible tree topologies were obtained with the nhPhyML ("nonhomogeneous") algorithm implementing a model for branch-specific equilibrium frequencies. Likelihood searches using RAxML were improved by data partitioning by gene and codon position. Finally, the PhyloBayes software, which allows different substitution processes for amino acid replacement at various sites, produced a tree that best matched known higher level taxa and defined basal relationships in Coleoptera. After rooting with Neuropterida outgroups, suborder relationships were resolved as (Polyphaga (Myxophaga (Archostemata + Adephaga))). The infraorder relationships in Polyphaga were (Scirtiformia (Elateriformia ((Staphyliniformia + Scarabaeiformia) (Bostrichiformia (Cucujiformia))))). Polyphagan superfamilies were recovered as monophyla except Staphylinoidea (paraphyletic for Scarabaeiformia) and Cucujoidea, which can no longer be considered a valid taxon. The study shows that, although compositional heterogeneity is not universal, it cannot be eliminated for some mitochondrial genes, but dense taxon sampling and the use of appropriate Bayesian analyses can still produce robust phylogenetic trees.
Collapse
Affiliation(s)
- Martijn J T N Timmermans
- Department of Life Sciences, Natural History Museum, London, United Kingdom Department of Life Sciences, Imperial College London - Silwood Park Campus, Ascot, United Kingdom Department of Natural Sciences, Middlesex University, Hendon Campus, London, United Kingdom
| | - Christopher Barton
- Department of Life Sciences, Natural History Museum, London, United Kingdom
| | - Julien Haran
- Department of Life Sciences, Natural History Museum, London, United Kingdom Present address: INRA, UR633 Zoologie Forestière, Orléans, France
| | - Dirk Ahrens
- Department of Life Sciences, Natural History Museum, London, United Kingdom Zoologisches Forschungsmuseum Alexander Koenig Bonn, Bonn, Germany
| | - C Lorna Culverwell
- Department of Life Sciences, Natural History Museum, London, United Kingdom Present address: Haartman Institute, Haartmaninkatu 3, University of Helsinki, Helsinki, Finland
| | - Alison Ollikainen
- Department of Life Sciences, Natural History Museum, London, United Kingdom Present address: Department of Medical Genetics, Genome-Scale Biology Research Program, University of Helsinki, Helsinki, Finland
| | - Steven Dodsworth
- Department of Life Sciences, Natural History Museum, London, United Kingdom Present address: Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey, United Kingdom
| | - Peter G Foster
- Department of Life Sciences, Natural History Museum, London, United Kingdom
| | - Ladislav Bocak
- Department of Life Sciences, Natural History Museum, London, United Kingdom Department of Zoology, Faculty of Science UP, Olomouc, Czech Republic
| | - Alfried P Vogler
- Department of Life Sciences, Natural History Museum, London, United Kingdom Department of Life Sciences, Imperial College London - Silwood Park Campus, Ascot, United Kingdom
| |
Collapse
|
10
|
Behura SK. Insect phylogenomics. INSECT MOLECULAR BIOLOGY 2015; 24:403-11. [PMID: 25963452 PMCID: PMC4503476 DOI: 10.1111/imb.12174] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2014] [Revised: 03/10/2015] [Accepted: 04/04/2015] [Indexed: 05/08/2023]
Abstract
Phylogenomics, the integration of phylogenetics with genome data, has emerged as a powerful approach to study the evolution and systematics of species. Recently, several studies employing phylogenomic tools have provided better insights into insect evolution. Next-generation sequencing methods are now increasingly used by entomologists to generate genomic and transcript sequences of various insect species and strains. These data provide opportunities for comparative genomics and large-scale multigene phylogenies of diverse lineages of insects. Phy-logenomic investigations help us to better understand systematic and evolutionary relationships of insect species that play important roles as herbivores, predators, detritivores, pollinators and disease vectors. It is important that we critically assess the prospects and limitations of phylogenomic methods. In this review, I describe the current status, outline the major challenges and remark on potential future applications of phylogenomic tools in studying insect systematics and evolution.
Collapse
Affiliation(s)
- S K Behura
- Eck Institute for Global Health and Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, USA
| |
Collapse
|
11
|
Abstract
Over the last three decades, transcriptomic studies of venom gland cells have continuously evolved, opening up new possibilities for exploring the molecular diversity of animal venoms, a prerequisite for the discovery of new drug candidates and molecular phylogenetics. The molecular complexity of animal venoms is much greater than initially thought. In this review, we describe the different technologies available for transcriptomic studies of venom, from the original individual cloning approaches to the more recent global Next Generation Sequencing strategies. Our understanding of animal venoms is evolving, with the discovery of complex and diverse bio-optimized cocktails of compounds, including mostly peptides and proteins, which are now beginning to be studied by academic and industrial researchers.
Collapse
|
12
|
Breinholt JW, Kawahara AY. Phylotranscriptomics: saturated third codon positions radically influence the estimation of trees based on next-gen data. Genome Biol Evol 2014; 5:2082-92. [PMID: 24148944 PMCID: PMC3845638 DOI: 10.1093/gbe/evt157] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Recent advancements in molecular sequencing techniques have led to a surge in the number of phylogenetic studies that incorporate large amounts of genetic data. We test the assumption that analyzing large number of genes will lead to improvements in tree resolution and branch support using moths in the superfamily Bombycoidea, a group with some interfamilial relationships that have been difficult to resolve. Specifically, we use a next-gen data set that included 19 taxa and 938 genes (∼1.2M bp) to examine how codon position and saturation might influence resolution and node support among three key families. Maximum likelihood, parsimony, and species tree analysis using gene tree parsimony, on different nucleotide and amino acid data sets, resulted in largely congruent topologies with high bootstrap support compared with prior studies that included fewer loci. However, for a few shallow nodes, nucleotide and amino acid data provided high support for conflicting relationships. The third codon position was saturated and phylogenetic analysis of this position alone supported a completely different, potentially misleading sister group relationship. We used the program RADICAL to assess the number of genes needed to fix some of these difficult nodes. One such node originally needed a total of 850 genes but only required 250 when synonymous signal was removed. Our study shows that, in order to effectively use next-gen data to correctly resolve difficult phylogenetic relationships, it is necessary to assess the effects of synonymous substitutions and third codon positions.
Collapse
|
13
|
Safikhani Z, Sadeghi M, Pezeshk H, Eslahchi C. SSP: an interval integer linear programming for de novo transcriptome assembly and isoform discovery of RNA-seq reads. Genomics 2013; 102:507-14. [PMID: 24161398 DOI: 10.1016/j.ygeno.2013.10.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Revised: 09/28/2013] [Accepted: 10/16/2013] [Indexed: 11/19/2022]
Abstract
Recent advances in the sequencing technologies have provided a handful of RNA-seq datasets for transcriptome analysis. However, reconstruction of full-length isoforms and estimation of the expression level of transcripts with a low cost are challenging tasks. We propose a novel de novo method named SSP that incorporates interval integer linear programming to resolve alternatively spliced isoforms and reconstruct the whole transcriptome from short reads. Experimental results show that SSP is fast and precise in determining different alternatively spliced isoforms along with the estimation of reconstructed transcript abundances. The SSP software package is available at http://www.bioinf.cs.ipm.ir/software/ssp.
Collapse
Affiliation(s)
- Zhaleh Safikhani
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Mehdi Sadeghi
- National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran.
| | - Hamid Pezeshk
- School of Mathematics, Statistics and Computer Sciences, Center of Excellence in Biomathematics, College of Science, University of Tehran, Tehran, Iran
| | - Changiz Eslahchi
- Department of Computer Science, Shahid Beheshti University, GC., Tehran, Iran; School of Computer Science, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
| |
Collapse
|
14
|
Deep metazoan phylogeny: When different genes tell different stories. Mol Phylogenet Evol 2013; 67:223-33. [DOI: 10.1016/j.ympev.2013.01.010] [Citation(s) in RCA: 200] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Revised: 01/08/2013] [Accepted: 01/12/2013] [Indexed: 11/30/2022]
|
15
|
Johnson KP, Walden KK, Robertson HM. Next-generation phylogenomics using a Target Restricted Assembly Method. Mol Phylogenet Evol 2013; 66:417-22. [DOI: 10.1016/j.ympev.2012.09.007] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2012] [Revised: 09/07/2012] [Accepted: 09/10/2012] [Indexed: 11/25/2022]
|
16
|
Zou M, Guo B, Tao W, Arratia G, He S. Integrating multi-origin expression data improves the resolution of deep phylogeny of ray-finned fish (Actinopterygii). Sci Rep 2012; 2:665. [PMID: 22993690 PMCID: PMC3444750 DOI: 10.1038/srep00665] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 08/20/2012] [Indexed: 11/09/2022] Open
Abstract
The actinopterygians comprise nearly one-half of all extant vertebrate species and are very important for human well-being. However, the phylogenetic relationships among certain groups within the actinopterygians are still uncertain, and debates about these relationships have continued for a long time. Along with the progress achieved in sequencing technologies, phylogenetic analyses based on multi-gene sequences, termed phylogenomic approaches, are becoming increasingly common and often result in well-resolved and highly supported phylogenetic hypotheses. Based on the transcriptome sequences generated in this study and the extensive expression data currently available from public databases, we obtained alignments of 274 orthologue groups for 26 scientifically and commercially important actinopterygians, representing 17 out of 44 orders within the class Actinopterygii. Using these alignments and probabilistic methods, we recovered relationships between basal actinopterygians and teleosts, among teleosts within protacanthopterygians and related lineages, and also within acanthomorphs. These relationships were recovered with high confidence.
Collapse
Affiliation(s)
- Ming Zou
- The key Laboratory of Aquatic Biodiversity and Conservation of Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, PR China
- Graduate University of the Chinese Academy of Sciences, Beijing 100039, PR China
| | - Baocheng Guo
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, 8057 Zurich, Switzerland
- The Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015 Lausanne, Switzerland
| | - Wenjing Tao
- The key Laboratory of Aquatic Biodiversity and Conservation of Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, PR China
- Graduate University of the Chinese Academy of Sciences, Beijing 100039, PR China
| | - Gloria Arratia
- Biodiversity Research Institute, The University of Kansas, Lawrence, Kansas 66045, U.S.A
| | - Shunping He
- The key Laboratory of Aquatic Biodiversity and Conservation of Chinese Academy of Sciences, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, PR China
| |
Collapse
|
17
|
Beutel RG, Wang B, Tan JJ, Ge SQ, Ren D, Yang XK. On the phylogeny and evolution of Mesozoic and extant lineages of Adephaga (Coleoptera, Insecta). Cladistics 2012; 29:147-165. [DOI: 10.1111/j.1096-0031.2012.00420.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
18
|
Zhang N, Zeng L, Shan H, Ma H. Highly conserved low-copy nuclear genes as effective markers for phylogenetic analyses in angiosperms. THE NEW PHYTOLOGIST 2012; 195:923-937. [PMID: 22783877 DOI: 10.1111/j.1469-8137.2012.04212.x] [Citation(s) in RCA: 131] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Organismal phylogeny provides a crucial evolutionary framework for many studies and the angiosperm phylogeny has been greatly improved recently, largely using organellar and rDNA genes. However, low-copy protein-coding nuclear genes have not been widely used on a large scale in spite of the advantages of their biparental inheritance and vast number of choices. Here, we identified 1083 highly conserved low-copy nuclear genes by genome comparison. Furthermore, we demonstrated the use of five nuclear genes in 91 angiosperms representing 46 orders (73% of orders) and three gymnosperms as outgroups for a highly resolved phylogeny. These nuclear genes are easy to clone and align, and more phylogenetically informative than widely used organellar genes. The angiosperm phylogeny reconstructed using these genes was largely congruent with previous ones mainly inferred from organellar genes. Intriguingly, several new placements were uncovered for some groups, including those among the rosids, the asterids, and between the eudicots and several basal angiosperm groups. These conserved universal nuclear genes have several inherent qualities enabling them to be good markers for reconstructing angiosperm phylogeny, even eukaryotic relationships, further providing new insights into the evolutionary history of angiosperms.
Collapse
Affiliation(s)
- Ning Zhang
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China
| | - Liping Zeng
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China
| | - Hongyan Shan
- State Key Laboratory of Systematic and Evolution Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Hong Ma
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China
- Institutes of Biomedical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai 200032, China
| |
Collapse
|
19
|
Trautwein MD, Wiegmann BM, Beutel R, Kjer KM, Yeates DK. Advances in insect phylogeny at the dawn of the postgenomic era. ANNUAL REVIEW OF ENTOMOLOGY 2012; 57:449-468. [PMID: 22149269 DOI: 10.1146/annurev-ento-120710-100538] [Citation(s) in RCA: 146] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Most species on Earth are insects and thus, understanding their evolutionary relationships is key to understanding the evolution of life. Insect relationships are increasingly well supported, due largely to technological advances in molecular sequencing and phylogenetic computational analysis. In this postgenomic era, insect systematics will be furthered best by integrative methods aimed at hypothesis corroboration from molecular, morphological, and paleontological evidence. This review of the current consensus of insect relationships provides a foundation for comparative study and offers a framework to evaluate incoming genomic evidence. Notable recent phylogenetic successes include the resolution of Holometabola, including the identification of the enigmatic Strepsiptera as a beetle relative and the early divergence of Hymenoptera; the recognition of hexapods as a crustacean lineage within Pancrustacea; and the elucidation of Dictyoptera orders, with termites placed as social cockroaches. Regions of the tree that require further investigation include the earliest winged insects (Palaeoptera) and Polyneoptera (orthopteroid lineages).
Collapse
Affiliation(s)
- Michelle D Trautwein
- Department of Entomology, North Carolina State University, Raleigh, North Carolina 27695, USA.
| | | | | | | | | |
Collapse
|
20
|
Simon S, Narechania A, Desalle R, Hadrys H. Insect phylogenomics: exploring the source of incongruence using new transcriptomic data. Genome Biol Evol 2012; 4:1295-309. [PMID: 23175716 PMCID: PMC3542558 DOI: 10.1093/gbe/evs104] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/15/2012] [Indexed: 11/13/2022] Open
Abstract
The evolution of the diverse insect lineages is one of the most fascinating issues in evolutionary biology. Despite extensive research in this area, the resolution of insect phylogeny especially of interordinal relationships has turned out to be still a great challenge. One of the challenges for insect systematics is the radiation of the polyneopteran lineages with several contradictory and/or unresolved relationships. Here, we provide the first transcriptomic data for three enigmatic polyneopteran orders (Dermaptera, Plecoptera, and Zoraptera) to clarify one of the most debated issues among higher insect systematics. We applied different approaches to generate 3 data sets comprising 78 species and 1,579 clusters of orthologous genes. Using these three matrices, we explored several key mechanistic problems of phylogenetic reconstruction including missing data, matrix selection, gene and taxa number/choice, and the biological function of the genes. Based on the first phylogenomic approach including these three ambiguous polyneopteran orders, we provide here conclusive support for monophyletic Polyneoptera, contesting the hypothesis of Zoraptera + Paraneoptera and Plecoptera + remaining Neoptera. In addition, we employ various approaches to evaluate data quality and highlight problematic nodes within the Insect Tree that still exist despite our phylogenomic approach. We further show how the support for these nodes or alternative hypotheses might depend on the taxon- and/or gene-sampling.
Collapse
Affiliation(s)
- Sabrina Simon
- ITZ, Ecology & Evolution, Stiftung Tieraerztliche Hochschule Hannover, Hannover, Germany.
| | | | | | | |
Collapse
|
21
|
Ebersberger I, de Matos Simoes R, Kupczok A, Gube M, Kothe E, Voigt K, von Haeseler A. A consistent phylogenetic backbone for the fungi. Mol Biol Evol 2011; 29:1319-34. [PMID: 22114356 PMCID: PMC3339314 DOI: 10.1093/molbev/msr285] [Citation(s) in RCA: 89] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved, can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data—a common practice in phylogenomic analyses—introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses.
Collapse
Affiliation(s)
- Ingo Ebersberger
- Center for Integrative Bioinformatics Vienna, University of Vienna, Medical University of Vienna, University of Veterinary Medicine Vienna, Vienna, Austria.
| | | | | | | | | | | | | |
Collapse
|
22
|
Simmons MP. Misleading results of likelihood-based phylogenetic analyses in the presence of missing data. Cladistics 2011; 28:208-222. [DOI: 10.1111/j.1096-0031.2011.00375.x] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
23
|
Abstract
The whole-genome sequencing of mosquitoes has facilitated our understanding of fundamental biological processes at their basic molecular levels and holds potential for application to mosquito control and prevention of mosquito-borne disease transmission. Draft genome sequences are available for Anopheles gambiae, Aedes aegypti, and Culex quinquefasciatus. Collectively, these represent the major vectors of African malaria, dengue fever and yellow fever viruses, and lymphatic filariasis, respectively. Rapid advances in genome technologies have revealed detailed information on genome architecture as well as phenotype-specific transcriptomics and proteomics. These resources allow for detailed comparative analyses within and across populations as well as species. Next-generation sequencing technologies will likely promote a proliferation of genome sequences for additional mosquito species as well as for individual insects. Here we review the current status of genome research in mosquitoes and identify potential areas for further investigations.
Collapse
Affiliation(s)
- David W Severson
- Eck Institute for Global Health, Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana 46556, USA.
| | | |
Collapse
|
24
|
Fong JJ, Fujita MK. Evaluating phylogenetic informativeness and data-type usage for new protein-coding genes across Vertebrata. Mol Phylogenet Evol 2011; 61:300-7. [PMID: 21742044 DOI: 10.1016/j.ympev.2011.06.016] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2010] [Revised: 05/31/2011] [Accepted: 06/21/2011] [Indexed: 11/30/2022]
Abstract
As a resource for vertebrate phylogenetics, we developed 75 new protein-coding genes using a combination of expressed sequence tags (ESTs) available in Genbank, and targeted amplification of complementary DNA (cDNA). In addition, we performed three additional analyses in order to assess the utility of our approach. First, we profiled the phylogenetic informativeness of these new markers using the online program PhyDesign. Next, we compared the utility of four different data-types used in phylogenetics: nucleotides (NUCL), amino acids (AA), 1st and 2nd codon positions only (N12), and modified sequences to account for codon degeneracy (DEGEN1; Regier et al., 2010). Lastly, we use these new markers to construct a vertebrate phylogeny and address the uncertain relationship between higher-level mammal groups: monotremes, marsupials, and placentals. Our results show that phylogenetic informativeness of the 75 new markers varies, both in the amount of phylogenetic signal and optimal timescale. When comparing the four data-types, we find that the NUCL data-type, due to the high level of phylogenetic signal, performs the best across all divergence times. The remaining three data-types (AA, N12, DEGEN1) are less subject to homoplasy, but have greatly reduced levels of phylogenetic signal relative to NUCL. Our phylogenetic inference supports the Theria hypothesis of mammalian relationships, with marsupials and placentals being sister groups.
Collapse
Affiliation(s)
- Jonathan J Fong
- Museum of Vertebrate Zoology, Department of Integrative Biology, University of California, Berkeley, CA 94720, USA.
| | | |
Collapse
|
25
|
Timmermans MJTN, Dodsworth S, Culverwell CL, Bocak L, Ahrens D, Littlewood DTJ, Pons J, Vogler AP. Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics. Nucleic Acids Res 2010; 38:e197. [PMID: 20876691 PMCID: PMC2995086 DOI: 10.1093/nar/gkq807] [Citation(s) in RCA: 139] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2010] [Revised: 08/09/2010] [Accepted: 08/29/2010] [Indexed: 02/03/2023] Open
Abstract
Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags ('barcodes'). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ∼10 to 100× per contig. Species identity of individual contigs was established via three 'bait' sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species 'barcodes' that currently use the cox1 gene only.
Collapse
Affiliation(s)
- M. J. T. N. Timmermans
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| | - S. Dodsworth
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| | - C. L. Culverwell
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| | - L. Bocak
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| | - D. Ahrens
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| | - D. T. J. Littlewood
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| | - J. Pons
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| | - A. P. Vogler
- Department of Entomology, Natural History Museum, Cromwell Road, London SW7 5BD, Division of Biology, Imperial College London, Silwood Park Campus, Ascot SL5 7PY, UK, Department of Zoology, Science Faculty, Palacky University, tr. Svobody 26, 771 46 Olomouc, Czech Republic, Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK and IMEDEA (CSIC-UIB), Miquel Marqués, 21 Esporlas, 07190 Illes Balears, Spain
| |
Collapse
|
26
|
Sharanowski BJ, Robbertse B, Walker J, Voss SR, Yoder R, Spatafora J, Sharkey MJ. Expressed sequence tags reveal Proctotrupomorpha (minus Chalcidoidea) as sister to Aculeata (Hymenoptera: Insecta). Mol Phylogenet Evol 2010; 57:101-12. [PMID: 20637293 DOI: 10.1016/j.ympev.2010.07.006] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2009] [Revised: 07/06/2010] [Accepted: 07/07/2010] [Indexed: 10/19/2022]
|
27
|
Surget-Groba Y, Montoya-Burgos JI. Optimization of de novo transcriptome assembly from next-generation sequencing data. Genome Res 2010; 20:1432-40. [PMID: 20693479 DOI: 10.1101/gr.103846.109] [Citation(s) in RCA: 260] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Transcriptome analysis has important applications in many biological fields. However, assembling a transcriptome without a known reference remains a challenging task requiring algorithmic improvements. We present two methods for substantially improving transcriptome de novo assembly. The first method relies on the observation that the use of a single k-mer length by current de novo assemblers is suboptimal to assemble transcriptomes where the sequence coverage of transcripts is highly heterogeneous. We present the Multiple-k method in which various k-mer lengths are used for de novo transcriptome assembly. We demonstrate its good performance by assembling de novo a published next-generation transcriptome sequence data set of Aedes aegypti, using the existing genome to check the accuracy of our method. The second method relies on the use of a reference proteome to improve the de novo assembly. We developed the Scaffolding using Translation Mapping (STM) method that uses mapping against the closest available reference proteome for scaffolding contigs that map onto the same protein. In a controlled experiment using simulated data, we show that the STM method considerably improves the assembly, with few errors. We applied these two methods to assemble the transcriptome of the non-model catfish Loricaria gr. cataphracta. Using the Multiple-k and STM methods, the assembly increases in contiguity and in gene identification, showing that our methods clearly improve quality and can be widely used. The new methods were used to assemble successfully the transcripts of the core set of genes regulating tooth development in vertebrates, while classic de novo assembly failed.
Collapse
Affiliation(s)
- Yann Surget-Groba
- Department of Zoology and Animal Biology, University of Geneva, 1211 Geneva 4, Switzerland
| | | |
Collapse
|
28
|
Pons J, Ribera I, Bertranpetit J, Balke M. Nucleotide substitution rates for the full set of mitochondrial protein-coding genes in Coleoptera. Mol Phylogenet Evol 2010; 56:796-807. [DOI: 10.1016/j.ympev.2010.02.007] [Citation(s) in RCA: 121] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2009] [Revised: 02/01/2010] [Accepted: 02/03/2010] [Indexed: 10/19/2022]
|
29
|
Ribosomal protein genes of holometabolan insects reject the Halteria, instead revealing a close affinity of Strepsiptera with Coleoptera. Mol Phylogenet Evol 2010; 55:846-59. [PMID: 20348001 DOI: 10.1016/j.ympev.2010.03.024] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2009] [Revised: 02/18/2010] [Accepted: 03/19/2010] [Indexed: 11/23/2022]
Abstract
The phylogenetic relationships among holometabolan insect orders remain poorly known, despite a wealth of previous studies. In particular, past attempts to clarify the sister-group of the enigmatic order Strepsiptera with rRNA genes have led to intense debate about long-branch attraction (the 'Strepsiptera problem'), without resolving the taxonomic question at hand. Here, we appealed to alternative nuclear sequences of 27 ribosomal proteins (RPs) to generate a data matrix of 10,731 nucleotides for 22 holometabolan taxa, including two strepsipteran species. Phylogenetic relationships among holometabolan insects were analyzed under several nucleotide-coding schemes to explore differences in signal and systematic biases. Saturation and compositional bias particularly affected third positions, which greatly differed in AT content (18-72%). Such confounding factors were best reduced by R-Y coding and removal of third codon positions, resulting in more strongly supported topologies, whereas amino acid coding gave poor resolution. The placement of Strepsiptera with Coleoptera (the Coleopterida) was recovered under most coding schemes and analytical methods, if often with modest support and ambiguity. In contrast, an alternative sister-group with Diptera (the Halteria) was only found in one analysis using parsimony, and weakly supported. The topologies here generally support a Coleoptera+Strepsiptera as sister-group to Mecopterida (Siphonaptera+Mecoptera+Diptera+Lepidoptera+Trichoptera), while Hymenoptera were always recovered as sister-group to the remaining Holometabola.
Collapse
|
30
|
Duarte JM, Wall PK, Edger PP, Landherr LL, Ma H, Pires JC, Leebens-Mack J, dePamphilis CW. Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels. BMC Evol Biol 2010. [PMID: 20181251 DOI: 10.1186/1471-2148-1110-1161] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023] Open
Abstract
BACKGROUND Although the overwhelming majority of genes found in angiosperms are members of gene families, and both gene- and genome-duplication are pervasive forces in plant genomes, some genes are sufficiently distinct from all other genes in a genome that they can be operationally defined as 'single copy'. Using the gene clustering algorithm MCL-tribe, we have identified a set of 959 single copy genes that are shared single copy genes in the genomes of Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa. To characterize these genes, we have performed a number of analyses examining GO annotations, coding sequence length, number of exons, number of domains, presence in distant lineages, such as Selaginella and Physcomitrella, and phylogenetic analysis to estimate copy number in other seed plants and to demonstrate their phylogenetic utility. We then provide examples of how these genes may be used in phylogenetic analyses to reconstruct organismal history, both by using extant coverage in EST databases for seed plants and de novo amplification via RT-PCR in the family Brassicaceae. RESULTS There are 959 single copy nuclear genes shared in Arabidopsis, Populus, Vitis and Oryza ["APVO SSC genes"]. The majority of these genes are also present in the Selaginella and Physcomitrella genomes. Public EST sets for 197 species suggest that most of these genes are present across a diverse collection of seed plants, and appear to exist as single or very low copy genes, though exceptions are seen in recently polyploid taxa and in lineages where there is significant evidence for a shared large-scale duplication event. Genes encoding proteins localized in organelles are more commonly single copy than expected by chance, but the evolutionary forces responsible for this bias are unknown.Regardless of the evolutionary mechanisms responsible for the large number of shared single copy genes in diverse flowering plant lineages, these genes are valuable for phylogenetic and comparative analyses. Eighteen of the APVO SSC single copy genes were amplified in the Brassicaceae using RT-PCR and directly sequenced. Alignments of these sequences provide improved resolution of Brassicaceae phylogeny compared to recent studies using plastid and ITS sequences. An analysis of sequences from 13 APVO SSC genes from 69 species of seed plants, derived mainly from public EST databases, yielded a phylogeny that was largely congruent with prior hypotheses based on multiple plastid sequences. Whereas single gene phylogenies that rely on EST sequences have limited bootstrap support as the result of limited sequence information, concatenated alignments result in phylogenetic trees with strong bootstrap support for already established relationships. Overall, these single copy nuclear genes are promising markers for phylogenetics, and contain a greater proportion of phylogenetically-informative sites than commonly used protein-coding sequences from the plastid or mitochondrial genomes. CONCLUSIONS Putatively orthologous, shared single copy nuclear genes provide a vast source of new evidence for plant phylogenetics, genome mapping, and other applications, as well as a substantial class of genes for which functional characterization is needed. Preliminary evidence indicates that many of the shared single copy nuclear genes identified in this study may be well suited as markers for addressing phylogenetic hypotheses at a variety of taxonomic levels.
Collapse
Affiliation(s)
- Jill M Duarte
- Department of Biology and the Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | | | | | | | | | |
Collapse
|
31
|
Duarte JM, Wall PK, Edger PP, Landherr LL, Ma H, Pires JC, Leebens-Mack J, dePamphilis CW. Identification of shared single copy nuclear genes in Arabidopsis, Populus, Vitis and Oryza and their phylogenetic utility across various taxonomic levels. BMC Evol Biol 2010; 10:61. [PMID: 20181251 PMCID: PMC2848037 DOI: 10.1186/1471-2148-10-61] [Citation(s) in RCA: 229] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2008] [Accepted: 02/24/2010] [Indexed: 01/27/2023] Open
Abstract
Background Although the overwhelming majority of genes found in angiosperms are members of gene families, and both gene- and genome-duplication are pervasive forces in plant genomes, some genes are sufficiently distinct from all other genes in a genome that they can be operationally defined as 'single copy'. Using the gene clustering algorithm MCL-tribe, we have identified a set of 959 single copy genes that are shared single copy genes in the genomes of Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa. To characterize these genes, we have performed a number of analyses examining GO annotations, coding sequence length, number of exons, number of domains, presence in distant lineages, such as Selaginella and Physcomitrella, and phylogenetic analysis to estimate copy number in other seed plants and to demonstrate their phylogenetic utility. We then provide examples of how these genes may be used in phylogenetic analyses to reconstruct organismal history, both by using extant coverage in EST databases for seed plants and de novo amplification via RT-PCR in the family Brassicaceae. Results There are 959 single copy nuclear genes shared in Arabidopsis, Populus, Vitis and Oryza ["APVO SSC genes"]. The majority of these genes are also present in the Selaginella and Physcomitrella genomes. Public EST sets for 197 species suggest that most of these genes are present across a diverse collection of seed plants, and appear to exist as single or very low copy genes, though exceptions are seen in recently polyploid taxa and in lineages where there is significant evidence for a shared large-scale duplication event. Genes encoding proteins localized in organelles are more commonly single copy than expected by chance, but the evolutionary forces responsible for this bias are unknown. Regardless of the evolutionary mechanisms responsible for the large number of shared single copy genes in diverse flowering plant lineages, these genes are valuable for phylogenetic and comparative analyses. Eighteen of the APVO SSC single copy genes were amplified in the Brassicaceae using RT-PCR and directly sequenced. Alignments of these sequences provide improved resolution of Brassicaceae phylogeny compared to recent studies using plastid and ITS sequences. An analysis of sequences from 13 APVO SSC genes from 69 species of seed plants, derived mainly from public EST databases, yielded a phylogeny that was largely congruent with prior hypotheses based on multiple plastid sequences. Whereas single gene phylogenies that rely on EST sequences have limited bootstrap support as the result of limited sequence information, concatenated alignments result in phylogenetic trees with strong bootstrap support for already established relationships. Overall, these single copy nuclear genes are promising markers for phylogenetics, and contain a greater proportion of phylogenetically-informative sites than commonly used protein-coding sequences from the plastid or mitochondrial genomes. Conclusions Putatively orthologous, shared single copy nuclear genes provide a vast source of new evidence for plant phylogenetics, genome mapping, and other applications, as well as a substantial class of genes for which functional characterization is needed. Preliminary evidence indicates that many of the shared single copy nuclear genes identified in this study may be well suited as markers for addressing phylogenetic hypotheses at a variety of taxonomic levels.
Collapse
Affiliation(s)
- Jill M Duarte
- Department of Biology and the Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Hittinger CT, Johnston M, Tossberg JT, Rokas A. Leveraging skewed transcript abundance by RNA-Seq to increase the genomic depth of the tree of life. Proc Natl Acad Sci U S A 2010; 107:1476-81. [PMID: 20080632 PMCID: PMC2824393 DOI: 10.1073/pnas.0910449107] [Citation(s) in RCA: 91] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Assembling the tree of life is a major goal of biology, but progress has been hindered by the difficulty and expense of obtaining the orthologous DNA required for accurate and fully resolved phylogenies. Next-generation DNA sequencing technologies promise to accelerate progress, but sequencing the genomes of hundreds of thousands of eukaryotic species remains impractical. Eukaryotic transcriptomes, which are smaller than genomes and biased toward highly expressed genes that tend to be conserved, could potentially provide a rich set of phylogenetic characters. We sampled the transcriptomes of 10 mosquito species by assembling 36-bp sequence reads into phylogenomic data matrices containing hundreds of thousands of orthologous nucleotides from hundreds of genes. Analysis of these data matrices yielded robust phylogenetic inferences, even with data matrices constructed from surprisingly few sequence reads. This approach is more efficient, data-rich, and economical than traditional PCR-based and EST-based methods and provides a scalable strategy for generating phylogenomic data matrices to infer the branches and twigs of the tree of life.
Collapse
Affiliation(s)
- Chris Todd Hittinger
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver Health Sciences Center, Aurora, CO 80045
- Center for Genome Sciences, Department of Genetics, Washington University School of Medicine, St. Louis, MO 63108
| | - Mark Johnston
- Department of Biochemistry and Molecular Genetics, University of Colorado Denver Health Sciences Center, Aurora, CO 80045
- Center for Genome Sciences, Department of Genetics, Washington University School of Medicine, St. Louis, MO 63108
| | - John T. Tossberg
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235
| |
Collapse
|
33
|
|
34
|
Simon S, Strauss S, von Haeseler A, Hadrys H. A phylogenomic approach to resolve the basal pterygote divergence. Mol Biol Evol 2009; 26:2719-30. [PMID: 19713325 DOI: 10.1093/molbev/msp191] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
One of the most fascinating Bauplan transitions in the animal kingdom was the invention of insect wings, a change that also contributed to the success and enormous diversity of this animal group. However, the origin of insect flight and the relationships of basal winged insect orders are still controversial. Three hypotheses have been proposed to explain the phylogeny of winged insects: 1) the traditional Palaeoptera hypothesis (Ephemeroptera + Odonata, Neoptera), 2) the Metapterygota hypothesis (Ephemeroptera, Odonata + Neoptera), and 3) the Chiastomyaria hypothesis (Odonata, Ephemeroptera + Neoptera). Neither phylogenetic analyses of single genes nor even multiple marker systems (e.g., molecular markers + morphological characters) have yet been able to conclusively resolve basal pterygote divergences. A possible explanation for the lack of resolution is that the divergences took place in the mid-Devonian within a short period of time and attempts to solve this problem have been confounded by the major challenge of finding molecular markers to accurately track these short ancient internodes. Although phylogenomic data are available for Neoptera and some wingless (apterygote) orders, they are lacking for the crucial Odonata and Ephemeroptera orders. We adopt a multigene approach including data from two new expressed sequence tag projects-from the orders Ephemeroptera (Baetis sp.) and Odonata (Ischnura elegans)-to evaluate the potential of phylogenomic analyses in clarifying this unresolved issue. We analyzed two data sets that differed in represented taxa, genes, and overall sequence lengths: maxspe (15 taxa, 125 genes, and 31,643 amino acid positions) and maxgen (8 taxa, 150 genes, and 42,541 amino acid positions). Maximum likelihood and Bayesian inference analyses both place the Odonata at the base of the winged insects. Furthermore, statistical hypotheses testing rejected both the Palaeoptera and the Metapterygota hypotheses. The comprehensive molecular data set developed here provides conclusive support for odonates as the most basal winged insect order (Chiastomyaria hypothesis). Data quality assessment indicates that proteins involved in cellular processes and signaling harbor the most informative phylogenetic signal.
Collapse
Affiliation(s)
- Sabrina Simon
- Institute of Ecology & Evolution, Stiftung Tieraerztliche Hochschule Hannover, Hannover, Germany.
| | | | | | | |
Collapse
|
35
|
Gibbons JG, Janson EM, Hittinger CT, Johnston M, Abbot P, Rokas A. Benchmarking next-generation transcriptome sequencing for functional and evolutionary genomics. Mol Biol Evol 2009; 26:2731-44. [PMID: 19706727 DOI: 10.1093/molbev/msp188] [Citation(s) in RCA: 129] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Next-generation sequencing has opened the door to genomic analysis of nonmodel organisms. Technologies generating long-sequence reads (200-400 bp) are increasingly used in evolutionary studies of nonmodel organisms, but the short-sequence reads (30-50 bp) that can be produced at lower cost are thought to be of limited utility for de novo sequencing applications. Here, we tested this assumption by short-read sequencing the transcriptomes of the tropical disease vectors Aedes aegypti and Anopheles gambiae, for which complete genome sequences are available. Comparison of our results to the reference genomes allowed us to accurately evaluate the quantity, quality, and functional and evolutionary information content of our "test" data. We produced more than 0.7 billion nucleotides of sequenced data per species that assembled into more than 21,000 test contigs larger than 100 bp per species and covered approximately 27% of the Aedes reference transcriptome. Remarkably, the substitution error rate in the test contigs was approximately 0.25% per site, with very few indels or assembly errors. Test contigs of both species were enriched for genes involved in energy production and protein synthesis and underrepresented in genes involved in transcription and differentiation. Ortholog prediction using the test contigs was accurate across hundreds of millions of years of evolution. Our results demonstrate the considerable utility of short-read transcriptome sequencing for genomic studies of nonmodel organisms and suggest an approach for assessing the information content of next-generation data for evolutionary studies.
Collapse
Affiliation(s)
- John G Gibbons
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | | | | | | | | | | |
Collapse
|
36
|
Witek A, Herlyn H, Ebersberger I, Mark Welch DB, Hankeln T. Support for the monophyletic origin of Gnathifera from phylogenomics. Mol Phylogenet Evol 2009; 53:1037-41. [PMID: 19654049 DOI: 10.1016/j.ympev.2009.07.031] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2009] [Revised: 07/16/2009] [Accepted: 07/28/2009] [Indexed: 11/15/2022]
Abstract
The monophyletic origin of Spiralia within the metazoan tree of life is supported by many large-scale phylogenomic data. While there is now substantial molecular evidence for Lophotrochozoa being a monophyletic taxon within Spiralia, the phylogenetic affiliations of many other spiralian phyla remain unclear. Here we focus on the question of a monophyletic taxon Gnathifera, which was originally characterized by jaw morphology as comprising the taxa Rotifera, Acanthocephala and Gnathostomulida. Based on a large-scale molecular sequence dataset of 11,146 amino acid residues, we reconstructed phylogenetic trees of spiralian phyla using maximum-likelihood and Bayesian approaches. We obtain the first phylogenomic evidence for the clade Gnathifera, linking Syndermata (Rotifera+Acanthocephala) with Gnathostomulida. Furthermore, our data support recent findings concerning the paraphyly of Eurotatoria.
Collapse
Affiliation(s)
- Alexander Witek
- Institute of Molecular Genetics, Johannes Gutenberg-University Mainz, J.J. Becherweg 30a, D-55099 Mainz, Germany
| | | | | | | | | |
Collapse
|
37
|
Sheffield NC, Song H, Cameron SL, Whiting MF. Nonstationary Evolution and Compositional Heterogeneity in Beetle Mitochondrial Phylogenomics. Syst Biol 2009; 58:381-94. [DOI: 10.1093/sysbio/syp037] [Citation(s) in RCA: 139] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Nathan C. Sheffield
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
- Program in Computational Biology & Bioinformatics, Institute for Genome Sciences and Policy, Duke University, Box 90090, Durham, NC 27708, USA
| | - Hojun Song
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Stephen L. Cameron
- Australian National Insect Collection, Commonwealth Scientific and Industrial Research Organisation, Entomology, PO Box 1700, Canberra, Australian Capital Territory, 2601, Australia
| | | |
Collapse
|
38
|
Ebersberger I, Strauss S, von Haeseler A. HaMStR: profile hidden markov model based search for orthologs in ESTs. BMC Evol Biol 2009; 9:157. [PMID: 19586527 PMCID: PMC2723089 DOI: 10.1186/1471-2148-9-157] [Citation(s) in RCA: 208] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2008] [Accepted: 07/08/2009] [Indexed: 12/05/2022] Open
Abstract
Background EST sequencing is a versatile approach for rapidly gathering protein coding sequences. They provide direct access to an organism's gene repertoire bypassing the still error-prone procedure of gene prediction from genomic data. Therefore, ESTs are often the only source for biological sequence data from taxa outside mainstream interest. The widespread use of ESTs in evolutionary studies and particularly in molecular systematics studies is still hindered by the lack of efficient and reliable approaches for automated ortholog predictions in ESTs. Existing methods either depend on a known species tree or cannot cope with redundancy in EST data. Results We present a novel approach (HaMStR) to mine EST data for the presence of orthologs to a curated set of genes. HaMStR combines a profile Hidden Markov Model search and a subsequent BLAST search to extend existing ortholog cluster with sequences from further taxa. We show that the HaMStR results are consistent with those obtained with existing orthology prediction methods that require completely sequenced genomes. A case study on the phylogeny of 35 fungal taxa illustrates that HaMStR is well suited to compile informative data sets for phylogenomic studies from ESTs and protein sequence data. Conclusion HaMStR extends in a standardized manner a pre-defined set of orthologs with ESTs from further taxa. In the same fashion HaMStR can be applied to protein sequence data, and thus provides a comprehensive approach to compile ortholog cluster from any protein coding data. The resulting orthology predictions serve as the data basis for a variety of evolutionary studies. Here, we have demonstrated the application of HaMStR in a molecular systematics study. However, we envision that studies tracing the evolutionary fate of individual genes or functional complexes of genes will greatly benefit from HaMStR orthology predictions as well.
Collapse
Affiliation(s)
- Ingo Ebersberger
- Center for Integrative Bioinformatics Vienna, Max F, Perutz Laboratories, Vienna, Austria.
| | | | | |
Collapse
|
39
|
Bleidorn C, Podsiadlowski L, Zhong M, Eeckhaut I, Hartmann S, Halanych KM, Tiedemann R. On the phylogenetic position of Myzostomida: can 77 genes get it wrong? BMC Evol Biol 2009; 9:150. [PMID: 19570199 PMCID: PMC2716322 DOI: 10.1186/1471-2148-9-150] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2009] [Accepted: 07/01/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Phylogenomic analyses recently became popular to address questions about deep metazoan phylogeny. Ribosomal proteins (RP) dominate many of these analyses or are, in some cases, the only genes included. Despite initial hopes, phylogenomic analyses including tens to hundreds of genes still fail to robustly place many bilaterian taxa. RESULTS Using the phylogenetic position of myzostomids as an example, we show that phylogenies derived from RP genes and mitochondrial genes produce incongruent results. Whereas the former support a position within a clade of platyzoan taxa, mitochondrial data recovers an annelid affinity, which is strongly supported by the gene order data and is congruent with morphology. Using hypothesis testing, our RP data significantly rejects the annelids affinity, whereas a platyzoan relationship is significantly rejected by the mitochondrial data. CONCLUSION We conclude (i) that reliance of a set of markers belonging to a single class of macromolecular complexes might bias the analysis, and (ii) that concatenation of all available data might introduce conflicting signal into phylogenetic analyses. We therefore strongly recommend testing for data incongruence in phylogenomic analyses. Furthermore, judging all available data, we consider the annelid affinity hypothesis more plausible than a possible platyzoan affinity for myzostomids, and suspect long branch attraction is influencing the RP data. However, this hypothesis needs further confirmation by future analyses.
Collapse
Affiliation(s)
- Christoph Bleidorn
- Unit of Evolutionary Biology/Systematic Zoology, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Strasse 24-25, Haus 26, D-14476 Potsdam-Golm, Germany
| | - Lars Podsiadlowski
- Institute of Evolutionary Biology and Ecology, Rheinische Friedrich-Wilhelms-Universität Bonn, An der Immenburg 1, D-53121 Bonn, Germany
| | - Min Zhong
- Department of Biological Sciences, Auburn University, 101 Life Science Building, AL 36849, USA
| | - Igor Eeckhaut
- Marine Biology Laboratory, Natural Sciences Building, University of Mons-Hainaut, Av. Champs de Mars 6, B-7000 Mons, Belgium
| | - Stefanie Hartmann
- Unit of Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Strasse 24-25, Haus 26, D-14476 Potsdam-Golm, Germany
| | - Kenneth M Halanych
- Department of Biological Sciences, Auburn University, 101 Life Science Building, AL 36849, USA
| | - Ralph Tiedemann
- Unit of Evolutionary Biology/Systematic Zoology, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Strasse 24-25, Haus 26, D-14476 Potsdam-Golm, Germany
| |
Collapse
|
40
|
Baker WJ, Savolainen V, Asmussen-Lange CB, Chase MW, Dransfield J, Forest F, Harley MM, Uhl NW, Wilkinson M. Complete Generic-Level Phylogenetic Analyses of Palms (Arecaceae) with Comparisons of Supertree and Supermatrix Approaches. Syst Biol 2009; 58:240-56. [PMID: 20525581 DOI: 10.1093/sysbio/syp021] [Citation(s) in RCA: 152] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
| | - Vincent Savolainen
- Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AB, UK
- Imperial College London, Silwood Park Campus, Buckhurst Road, Ascot, Berkshire SL5 7PY, UK
| | - Conny B. Asmussen-Lange
- Department of Ecology, University of Copenhagen, Rolighedsvej 21, DK-1958 Frederiksberg C, Denmark
| | - Mark W. Chase
- Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AB, UK
| | | | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AB, UK
| | | | - Natalie W. Uhl
- Department of Plant Biology, Cornell University, 412 Mann Library Building, Ithaca, NY 14853, USA
| | - Mark Wilkinson
- Department of Zoology, Natural History Museum, Cromwell Road, London SW7 5BD, UK
| |
Collapse
|
41
|
CARDOSO A, SERRANO A, VOGLER AP. Morphological and molecular variation in tiger beetles of theCicindela hybridacomplex: is an ‘integrative taxonomy’ possible? Mol Ecol 2009; 18:648-64. [DOI: 10.1111/j.1365-294x.2008.04048.x] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
42
|
Abstract
To date the genomes of over 600 organisms have been generated of which 100 are from eukaryotes. Together with partial genome data for an additional 700 eukaryotic organisms, these exceptional sequence resources offer new opportunities to explore phylogenetic relationships and species diversity. The identification of highly diverse sequences specific to an EST-based sequence dataset offers insights into the extent of genetic novelty within that dataset. Sequences that are only shared with other related species from the same taxon might represent genes associated with taxon-specific innovations. On the other hand, sequences that are highly conserved across many other species offer valuable resources for performing more in-depth phylogenetic analyses. In the following chapter, we guide the reader through the process of examining their sequence datasets in the context of phylogenetic relationships. Performed across large-scale datasets, such analyses are termed Phylogenomics. Two complementary approaches are described, both based on the use of BLAST similarity metrics. The first uses an established Java tool - SimiTri - to visualize sequence similarity relationships between the EST dataset and three user-defined datasets. The second focuses on the use of phylogenetic profiles to identify groups of taxonomically related sequences.
Collapse
|
43
|
Abstract
Expressed sequence tags (ESTs) are fragments of mRNA sequences derived through single sequencing reactions performed on randomly selected clones from cDNA libraries. To date, over 45 million ESTs have been generated from over 1400 different species of eukaryotes. For the most part, EST projects are used to either complement existing genome projects or serve as low-cost alternatives for purposes of gene discovery. However, with improvements in accuracy and coverage, they are beginning to find application in fields such as phylogenetics, transcript profiling and proteomics. This volume provides practical details on the generation and analysis of ESTs. Chapters are presented which cover creation of cDNA libraries; generation and processing of sequence data; bioinformatics analysis of ESTs; and their application to phylogenetics and transcript profiling.
Collapse
Affiliation(s)
- John Parkinson
- Molecular Structure and Function, Hospital for Sick Children, Toronto, ON, Canada
| | | |
Collapse
|
44
|
Marvaldi AE, Duckett CN, Kjer KM, Gillespie JJ. Structural alignment of 18S and 28S rDNA sequences provides insights into phylogeny of Phytophaga (Coleoptera: Curculionoidea and Chrysomeloidea). ZOOL SCR 2009. [DOI: 10.1111/j.1463-6409.2008.00360.x] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
45
|
Lambkin CL, Trueman JWH, Yeates DK, Holston KC, Webb DW, Hauser M, Metz MA, Hill HN, Skevington JH, Yang L, Irwin ME, Wiegmann BM. Supertrees and the Tree of Life: generating a metaphylogeny for a diverse invertebrate family (Insecta:Diptera:Therevidae) using constraint trees and the parsimony ratchet to overcome low taxon overlap. INVERTEBR SYST 2009. [DOI: 10.1071/is08035] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The dipteran family Therevidae (stiletto flies) is cosmopolitan and has been the focus of many taxonomic and phylogenetic studies over the last 25 years. Despite this work, questions remain concerning the relationships between subfamilies, genera and generic groups and membership of those groups. We use the supertree method to produce an inclusive phylogeny for the family Therevidae from 24 phylogenetic studies using matrix representation with parsimony (MRP) analysis. The supertree method, one of the most common approaches to calculating globally inclusive phylogenies from smaller more exclusive analyses, produced the therevid metaphylogeny despite only 34% of the terminal taxa being found in more than one source tree. We describe a method for handling low taxon overlap in supertree analyses, in combination with the parsimony ratchet and constraint tree techniques. The supertree presented here is an overarching phylogenetic hypothesis of the Therevidae, incorporating extensive sampling of major lineages and summarising past phylogenetic work on the family. The inclusive metaphylogeny for 362 therevid taxa robustly retrieves the subfamilies Agapophytinae, Phycinae, Therevinae and Xestomyzinae, and the tribes Cyclotelini and Therevini. The Phycinae and Xestomyzinae form a clade, sister to the remaining Therevidae. The Australasian and South American Taenogera Kröber genus-group is monophyletic and sister to a clade of Therevinae and the Australian endemic Agapophytinae. The Therevinae consists of the Anabarhynchus Macquart genus-group of Australian, South American, New Caledonian and New Zealand taxa as sister to the non-Australasian ‘higher Therevinae’, which contains the tribes Cyclotelini and Therevini. The Therevini includes the Hoplosathe Lyneborg & Zaitzev, Litolinga Irwin & Lyneborg, Baryphora Loew, Pandivirilia Irwin & Lyneborg and Thereva Latreille generic-groups. MRP supertree methods can be used to produce inclusive metaphylogenies in situations where source trees have poor data overlap and low taxon overlap, and are therefore valuable in species-rich groups such as arthropods. These methods may be necessary for constructing the ‘Tree of Life’, representing phylogenetic relationships among the millions of known species. However, our analyses show that in situations of source tree conflict, MRP supertree analyses present only the majority signal. We also show that conflict between source trees can be hidden in MRP supertrees, thus our results emphasise the need to evaluate the resulting clades with reference to the source trees.
Collapse
|
46
|
Witek A, Herlyn H, Meyer A, Boell L, Bucher G, Hankeln T. EST based phylogenomics of Syndermata questions monophyly of Eurotatoria. BMC Evol Biol 2008; 8:345. [PMID: 19113997 PMCID: PMC2654452 DOI: 10.1186/1471-2148-8-345] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2008] [Accepted: 12/29/2008] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND The metazoan taxon Syndermata comprising Rotifera (in the classical sense of Monogononta+Bdelloidea+Seisonidea) and Acanthocephala has raised several hypotheses connected to the phylogeny of these animal groups and the included subtaxa. While the monophyletic origin of Syndermata and Acanthocephala is well established based on morphological and molecular data, the phylogenetic position of Syndermata within Spiralia, the monophyletic origin of Monogononta, Bdelloidea, and Seisonidea and the acanthocephalan sister group are still a matter of debate. The comparison of the alternative hypotheses suggests that testing the phylogenetic validity of Eurotatoria (Monogononta+Bdelloidea) is the key to unravel the phylogenetic relations within Syndermata. The syndermatan phylogeny in turn is a prerequisite for reconstructing the evolution of the acanthocephalan endoparasitism. RESULTS Here we present our results from a phylogenomic approach studying i) the phylogenetic position of Syndermata within Spiralia, ii) the monophyletic origin of monogononts and bdelloids and iii) the phylogenetic relations of the latter two taxa to acanthocephalans. For this analysis we have generated EST libraries of Pomphorhynchus laevis, Echinorhynchus truttae (Acanthocephala) and Brachionus plicatilis (Monogononta). By extending these data with database entries of B. plicatilis, Philodina roseola (Bdelloidea) and 25 additional metazoan species, we conducted phylogenetic reconstructions based on 79 ribosomal proteins using maximum likelihood and bayesian approaches. Our findings suggest that the phylogenetic position of Syndermata within Spiralia is close to Platyhelminthes, that Eurotatoria are not monophyletic and that bdelloids are more closely related to acanthocephalans than monogononts. CONCLUSION Mapping morphological character evolution onto molecular phylogeny suggests the (partial or complete) reduction of the corona and the emergence of a retractable anterior end (rostrum, proboscis) before the separation of Acanthocephala. In particular, the evolution of a rostrum might have been a key event leading to the later evolution of the acanthocephalan endoparasitism, given the enormous relevance of the proboscis for anchoring of the adults to the definitive hosts' intestinal wall.
Collapse
Affiliation(s)
- Alexander Witek
- Institute of Molecular Genetics, Johannes Gutenberg-University Mainz, J. J.-Becherweg 32, D-55099 Mainz, Germany
| | - Holger Herlyn
- Institute of Anthropology, Johannes Gutenberg-University Mainz, Colonel-Kleinmann-Weg 2, D-55099 Mainz, Germany
| | - Achim Meyer
- Institute of Zoology, Johannes Gutenberg-University Mainz, Müllerweg 6, D-55099 Mainz, Germany
| | - Louis Boell
- Johann Friedrich Blumenbach Institute of Anthropology and Zoology, Georg-August-University Göttingen, J. v. Liebig-Weg 11, D-37077 Göttingen, Germany
| | - Gregor Bucher
- Johann Friedrich Blumenbach Institute of Anthropology and Zoology, Georg-August-University Göttingen, J. v. Liebig-Weg 11, D-37077 Göttingen, Germany
| | - Thomas Hankeln
- Institute of Molecular Genetics, Johannes Gutenberg-University Mainz, J. J.-Becherweg 32, D-55099 Mainz, Germany
| |
Collapse
|
47
|
Peregrín-Alvarez JM, Parkinson J. The global landscape of sequence diversity. Genome Biol 2008; 8:R238. [PMID: 17996061 PMCID: PMC2258180 DOI: 10.1186/gb-2007-8-11-r238] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2007] [Revised: 10/18/2007] [Accepted: 11/08/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Systematic comparisons between genomic sequence datasets have revealed a wide spectrum of sequence specificity from sequences that are highly conserved to those that are specific to individual species. Due to the limited number of fully sequenced eukaryotic genomes, analyses of this spectrum have largely focused on prokaryotes. Combining existing genomic datasets with the partial genomes of 193 eukaryotes derived from collections of expressed sequence tags, we performed a quantitative analysis of the sequence specificity spectrum to provide a global view of the origins and extent of sequence diversity across the three domains of life. RESULTS Comparisons with prokaryotic datasets reveal a greater genetic diversity within eukaryotes that may be related to differences in modes of genetic inheritance. Mapping this diversity within a phylogenetic framework revealed that the majority of sequences are either highly conserved or specific to the species or taxon from which they derive. Between these two extremes, several evolutionary landmarks consisting of large numbers of sequences conserved within specific taxonomic groups were identified. For example, 8% of sequences derived from metazoan species are specific and conserved within the metazoan lineage. Many of these sequences likely mediate metazoan specific functions, such as cell-cell communication and differentiation. CONCLUSION Through the use of partial genome datasets, this study provides a unique perspective of sequence conservation across the three domains of life. The provision of taxon restricted sequences should prove valuable for future computational and biochemical analyses aimed at understanding evolutionary and functional relationships.
Collapse
Affiliation(s)
- José Manuel Peregrín-Alvarez
- Molecular Structure and Function, Hospital for Sick Children, 555 University Avenue, Toronto, ON M5G 1X8, Canada.
| | | |
Collapse
|
48
|
Abstract
Molecular sequence data have been sampled from 10% of all species known to science. Although it is not yet feasible to assemble these data into a single phylogenetic tree of life, it is possible to quantify how much phylogenetic signal is present. Analysis of 14,289 phylogenies built from 2.6 million sequences in GenBank suggests that signal is strong in vertebrates and specific groups of nonvertebrate model organisms. Across eukaryotes, however, although phylogenetic evidence is very broadly distributed, for the average species in the database it is equivalent to less than one well-supported gene tree. This analysis shows that a stronger sampling effort aimed at genomic depth, in addition to taxonomic breadth, will be required to build high-resolution phylogenetic trees at this scale.
Collapse
Affiliation(s)
- Michael J Sanderson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA.
| |
Collapse
|
49
|
Wild AL, Maddison DR. Evaluating nuclear protein-coding genes for phylogenetic utility in beetles. Mol Phylogenet Evol 2008; 48:877-91. [PMID: 18644735 DOI: 10.1016/j.ympev.2008.05.023] [Citation(s) in RCA: 133] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2007] [Revised: 04/29/2008] [Accepted: 05/20/2008] [Indexed: 10/22/2022]
Abstract
Although nuclear protein-coding genes have proven broadly useful for phylogenetic inference, relatively few such genes are regularly employed in studies of Coleoptera, the most diverse insect order. We increase the number of loci available for beetle systematics by developing protocols for three genes previously unused in beetles (alpha-spectrin, RNA polymerase II and topoisomerase I) and by refining protocols for five genes already in use (arginine kinase, CAD, enolase, PEPCK and wingless). We evaluate the phylogenetic performance of each gene in a Bayesian framework against a presumably known test phylogeny. The test phylogeny covers 31 beetle specimens and two outgroup taxa of varying age, including three of the four extant beetle suborders and a denser sampling in Adephaga and in the carabid genus Bembidion. All eight genes perform well for Cenozoic divergences and accurately separate closely related species within Bembidion, but individual genes differ markedly in accuracy over the older Mesozoic and Permian divergences. The concatenated data reconstruct the test phylogeny with high support in both Bayesian and parsimony analyses, indicating that combining data from multiple nuclear loci will be a fruitful approach for assembling the beetle tree of life.
Collapse
Affiliation(s)
- Alexander L Wild
- Department of Entomology, University of Arizona, Tucson, AZ 85721, USA.
| | | |
Collapse
|
50
|
Timmermans MJTN, Roelofs D, Mariën J, van Straalen NM. Revealing pancrustacean relationships: phylogenetic analysis of ribosomal protein genes places Collembola (springtails) in a monophyletic Hexapoda and reinforces the discrepancy between mitochondrial and nuclear DNA markers. BMC Evol Biol 2008; 8:83. [PMID: 18366624 PMCID: PMC2315649 DOI: 10.1186/1471-2148-8-83] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2007] [Accepted: 03/12/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In recent years, several new hypotheses on phylogenetic relations among arthropods have been proposed on the basis of DNA sequences. One of the challenged hypotheses is the monophyly of hexapods. This discussion originated from analyses based on mitochondrial DNA datasets that, due to an unusual positioning of Collembola, suggested that the hexapod body plan evolved at least twice. Here, we re-evaluate the position of Collembola using ribosomal protein gene sequences. RESULTS In total 48 ribosomal proteins were obtained for the collembolan Folsomia candida. These 48 sequences were aligned with sequence data on 35 other ecdysozoans. Each ribosomal protein gene was available for 25% to 86% of the taxa. However, the total sequence information was unequally distributed over the taxa and ranged between 4% and 100%. A concatenated dataset was constructed (5034 inferred amino acids in length), of which ~66% of the positions were filled. Phylogenetic tree reconstructions, using Maximum Likelihood, Maximum Parsimony, and Bayesian methods, resulted in a topology that supports monophyly of Hexapoda. CONCLUSION Although ribosomal proteins in general may not evolve independently, they once more appear highly valuable for phylogenetic reconstruction. Our analyses clearly suggest that Hexapoda is monophyletic. This underpins the inconsistency between nuclear and mitochondrial datasets when analyzing pancrustacean relationships. Caution is needed when applying mitochondrial markers in deep phylogeny.
Collapse
Affiliation(s)
- MJTN Timmermans
- Department of Animal Ecology, VU University Amsterdam, Amsterdam, The Netherlands
| | - D Roelofs
- Department of Animal Ecology, VU University Amsterdam, Amsterdam, The Netherlands
| | - J Mariën
- Department of Animal Ecology, VU University Amsterdam, Amsterdam, The Netherlands
| | - NM van Straalen
- Department of Animal Ecology, VU University Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|