1
|
Wang Z, Kang M, Li J, Zhang Z, Wang Y, Chen C, Yang Y, Liu J. Genomic evidence for homoploid hybrid speciation between ancestors of two different genera. Nat Commun 2022; 13:1987. [PMID: 35418567 PMCID: PMC9008057 DOI: 10.1038/s41467-022-29643-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 03/25/2022] [Indexed: 11/29/2022] Open
Abstract
Homoploid hybrid speciation (HHS) has been increasingly recognized as occurring widely during species diversification of both plants and animals. However, previous studies on HHS have mostly focused on closely-related species while it has been rarely reported or tested between ancestors of different genera. Here, we explore the likely HHS origin of Carpinus sect. Distegocarpus between sect. Carpinus and Ostrya in the family Betulaceae. We generate a chromosome-level reference genome for C. viminea of sect. Carpinus and re-sequence genomes of 44 individuals from the genera Carpinus and Ostrya. Our integrated analyses of all genomic data suggest that sect. Distegocarpus, which has three species, likely originates through HHS during the early divergence between Carpinus and Ostrya. Our study highlights the likelihood of an HHS event between ancestors of the extant genera during their initial divergences, which may have led to reticulate phylogenies at higher taxonomic levels.
Collapse
Affiliation(s)
- Zefu Wang
- State Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Minghui Kang
- State Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Jialiang Li
- State Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Zhiyang Zhang
- State Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Yufei Wang
- State Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Chunlin Chen
- State Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Yongzhi Yang
- State Key Laboratory of Grassland Agro-Ecosystem, College of Ecology, Lanzhou University, Lanzhou, Gansu, China
| | - Jianquan Liu
- State Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, China.
- State Key Laboratory of Grassland Agro-Ecosystem, College of Ecology, Lanzhou University, Lanzhou, Gansu, China.
| |
Collapse
|
2
|
de Oliveira Martins L, Bloomfield S, Stoakes E, Grant AJ, Page AJ, Mather AE. Tatajuba: exploring the distribution of homopolymer tracts. NAR Genom Bioinform 2022; 4:lqac003. [PMID: 35118377 PMCID: PMC8808543 DOI: 10.1093/nargab/lqac003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 11/18/2021] [Accepted: 01/05/2022] [Indexed: 11/14/2022] Open
Abstract
Length variation of homopolymeric tracts, which induces phase variation, is known to regulate gene expression leading to phenotypic variation in a wide range of bacterial species. There is no specialized bioinformatics software which can, at scale, exhaustively explore and describe these features from sequencing data. Identifying these is non-trivial as sequencing and bioinformatics methods are prone to introducing artefacts when presented with homopolymeric tracts due to the decreased base diversity. We present tatajuba, which can automatically identify potential homopolymeric tracts and help predict their putative phenotypic impact, allowing for rapid investigation. We use it to detect all tracts in two separate datasets, one of Campylobacter jejuni and one of three Bordetella species, and to highlight those tracts that are polymorphic across samples. With this we confirm homopolymer tract variation with phenotypic impact found in previous studies and additionally find many more with potential variability. The software is written in C and is available under the open source licence GNU GPLv3.
Collapse
Affiliation(s)
| | - Samuel Bloomfield
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Emily Stoakes
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Andrew J Grant
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Andrew J Page
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| | - Alison E Mather
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| |
Collapse
|
3
|
Ding Y, Cuddy WS, Wellings CR, Zhang P, Thach T, Hovmøller MS, Qutob D, Brar GS, Kutcher HR, Park RF. Incursions of divergent genotypes, evolution of virulence and host jumps shape a continental clonal population of the stripe rust pathogen Puccinia striiformis. Mol Ecol 2021; 30:6566-6584. [PMID: 34543497 DOI: 10.1111/mec.16182] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 08/22/2021] [Accepted: 09/13/2021] [Indexed: 11/30/2022]
Abstract
Long-distance migration and host adaptation by transboundary plant pathogens often brings detrimental effects to important agroecosystems. Efficient surveillance as a basis for responding to the dynamics of such pathogens is often hampered by a lack of information on incursion origin, evolutionary pathways and the genetic basis of rapidly evolving virulence across larger timescales. Here, we studied these genetic features by using historical isolates of the obligate biotrophic pathogen Puccinia striiformis f. sp. tritici (Pst), which causes one of the most widespread and devastating diseases, stripe (yellow) rust, of wheat. Through a combination of genotypic, phenotypic and genomic analyses, we assigned eight Pst isolates representing putative exotic Pst incursions into Australia to four previously defined genetic groups, PstS0, PstS1, PstS10 and PstS13. We showed that isolates of an additional incursion of P. striiformis, known locally as P. striiformis f. sp. pseudo-hordei, had a new and unique multilocus SSR genotype (MLG). We provide results of overall genomic variation of representative Pst isolates from each genetic group by comparative genomic analyses. We showed that isolates within the PstS1 and PstS13 genetic groups are most distinct at the whole-genome variant level from isolates belonging to genetic group PstS0, whereas the isolate from the PstS10 genetic group is intermediate. We further explored variable gene content, including putative effectors, representing both shared but also unique genetic changes that have occurred following introduction, some of which may additionally account for local adaptation of these isolates to triticale. Our genotypic and genomic data revealed new genetic insights into the evolution of diverse phenotypes of rust pathogens following incursion into a geographically isolated continental region.
Collapse
Affiliation(s)
- Yi Ding
- School of Life and Environmental Sciences, Plant Breeding Institute, The University of Sydney, Cobbitty, NSW, Australia
| | - Will S Cuddy
- NSW Department of Primary Industries, Elizabeth Macarthur Agricultural Institute, Menangle, NSW, Australia
| | - Colin R Wellings
- School of Life and Environmental Sciences, Plant Breeding Institute, The University of Sydney, Cobbitty, NSW, Australia.,NSW Department of Primary Industries, Elizabeth Macarthur Agricultural Institute, Menangle, NSW, Australia
| | - Peng Zhang
- School of Life and Environmental Sciences, Plant Breeding Institute, The University of Sydney, Cobbitty, NSW, Australia
| | - Tine Thach
- Department of Agroecology, Global Rust Reference Center, Aarhus University, Slagelse, Denmark
| | - Mogens S Hovmøller
- Department of Agroecology, Global Rust Reference Center, Aarhus University, Slagelse, Denmark
| | - Dinah Qutob
- Department of Biological Sciences, Kent State University at Stark, North Canton, ON, USA
| | - Gurcharn S Brar
- Faculty of Land and Food Systems, The University of British Columbia, Vancouver, BC, Canada
| | - Hadley R Kutcher
- College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, SK, Canada
| | - Robert F Park
- School of Life and Environmental Sciences, Plant Breeding Institute, The University of Sydney, Cobbitty, NSW, Australia
| |
Collapse
|
4
|
Li L, Hu Y, He M, Zhang B, Wu W, Cai P, Huo D, Hong Y. Comparative chloroplast genomes: insights into the evolution of the chloroplast genome of Camellia sinensis and the phylogeny of Camellia. BMC Genomics 2021; 22:138. [PMID: 33637038 PMCID: PMC7912895 DOI: 10.1186/s12864-021-07427-2] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Accepted: 02/05/2021] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Chloroplast genome resources can provide useful information for the evolution of plant species. Tea plant (Camellia sinensis) is among the most economically valuable member of Camellia. Here, we determined the chloroplast genome of the first natural triploid Chinary type tea ('Wuyi narcissus' cultivar of Camellia sinensis var. sinensis, CWN) and conducted the genome comparison with the diploid Chinary type tea (Camellia sinensis var. sinensis, CSS) and two types of diploid Assamica type teas (Camellia sinensis var. assamica: Chinese Assamica type tea, CSA and Indian Assamica type tea, CIA). Further, the evolutionary mechanism of the chloroplast genome of Camellia sinensis and the relationships of Camellia species based on chloroplast genome were discussed. RESULTS Comparative analysis showed the evolutionary dynamics of chloroplast genome of Camellia sinensis were the repeats and insertion-deletions (indels), and distribution of the repeats, indels and substitutions were significantly correlated. Chinese tea and Indian tea had significant differences in the structural characteristic and the codon usage of the chloroplast genome. Analysis of sequence characterized amplified region (SCAR) using sequences of the intergenic spacers (trnE/trnT) showed none of 292 different Camellia sinensis cultivars had similar sequence characteristic to triploid CWN, but the other four Camellia species did. Estimations of the divergence time showed that CIA diverged from the common ancestor of two Assamica type teas about 6.2 Mya (CI: 4.4-8.1 Mya). CSS and CSA diverged to each other about 0.8 Mya (CI: 0.4-1.5 Mya). Moreover, phylogenetic clustering was not exactly consistent with the current taxonomy of Camellia. CONCLUSIONS The repeat-induced and indel-induced mutations were two important dynamics contributed to the diversification of the chloroplast genome in Camellia sinensis, which were not mutually exclusive. Chinese tea and Indian tea might have undergone different selection pressures. Chloroplast transfer occurred during the polyploid evolution in Camellia sinensis. In addition, our results supported the three different domestication origins of Chinary type tea, Chinese Assamica type tea and Indian Assamica type tea. And, the current classification of some Camellia species might need to be further discussed.
Collapse
Affiliation(s)
- Li Li
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China.
| | - Yunfei Hu
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Min He
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Bo Zhang
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Wei Wu
- College of Mathematics and Computer Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Pumo Cai
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Da Huo
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China
| | - Yongcong Hong
- College of Tea and Food Science, Wuyi University, 358# Baihua Road, Wuyishan, 354300, China.
| |
Collapse
|
5
|
Jiang Y, Yuan Z, Hu H, Ye X, Zheng Z, Wei Y, Zheng YL, Wang YG, Liu C. Differentiating homoploid hybridization from ancestral subdivision in evaluating the origin of the D lineage in wheat. THE NEW PHYTOLOGIST 2020; 228:409-414. [PMID: 32255512 DOI: 10.1111/nph.16578] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2019] [Accepted: 03/19/2020] [Indexed: 06/11/2023]
Affiliation(s)
- Yunfeng Jiang
- Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
- CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
| | - Zhongwei Yuan
- Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
- CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
| | - Haiyan Hu
- CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
- College of Life Science and Technology, Henan Institute of Science and Technology, Xinxiang, Henan, 453003, China
| | - Xueling Ye
- Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
- CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
| | - Zhi Zheng
- CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
| | - Yuming Wei
- Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
| | - You-Liang Zheng
- Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
| | - You-Gan Wang
- Science and Engineering Facility, Queensland University of Technology, Brisbane, Qld, 4000, Australia
| | - Chunji Liu
- CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
| |
Collapse
|
6
|
Liu JT, Corbett JL, Heslop JA, Duncan SA. Enhanced genome editing in human iPSCs with CRISPR-CAS9 by co-targeting ATP1a1. PeerJ 2020; 8:e9060. [PMID: 32391204 PMCID: PMC7197401 DOI: 10.7717/peerj.9060] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Accepted: 04/04/2020] [Indexed: 12/11/2022] Open
Abstract
Genome editing in human induced pluripotent stem cells (iPSCs) provides the potential for disease modeling and cell therapy. By generating iPSCs with specific mutations, researchers can differentiate the modified cells to their lineage of interest for further investigation. However, the low efficiency of targeting in iPSCs has hampered the application of genome editing. In this study we used a CRISPR-Cas9 system that introduces a specific point substitution into the sequence of the Na+/K+-ATPase subunit ATP1A1. The introduced mutation confers resistance to cardiac glycosides, which can then be used to select successfully targeted cells. Using this system, we introduced different formats of donor DNA for homology-directed repair (HDR), including single-strand DNAs, double-strand DNAs, and plasmid donors. We achieved a 35-fold increase in HDR when using plasmid donor with a 400 bp repair template. We further co-targeted ATP1A1 and a second locus of interest to determine the enrichment of mutagenesis after cardiac glycoside selection. Through this approach, INDEL rate was increased after cardiac glycoside treatment, while HDR enrichment was only observed at certain loci. Collectively, these results suggest that a plasmid donor with a 400 bp repair template is an optimal donor DNA for targeted substitution and co-targeting ATP1A1 with the second locus enriches for mutagenesis events through cardiac glycoside selection in human iPSCs.
Collapse
Affiliation(s)
- Jui-Tung Liu
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| | - James L Corbett
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| | - James A Heslop
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| | - Stephen A Duncan
- Department of Regenerative Medicine and Cell Biology, Medical University of South Carolina, Charleston, SC, United States of America
| |
Collapse
|
7
|
Whole genome detection of sequence and structural polymorphism in six diverse horses. PLoS One 2020; 15:e0230899. [PMID: 32271776 PMCID: PMC7144971 DOI: 10.1371/journal.pone.0230899] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Accepted: 03/12/2020] [Indexed: 12/30/2022] Open
Abstract
The domesticated horse has played a unique role in human history, serving not just as a source of animal protein, but also as a catalyst for long-distance migration and military conquest. As a result, the horse developed unique physiological adaptations to meet the demands of both their climatic environment and their relationship with man. Completed in 2009, the first domesticated horse reference genome assembly (EquCab 2.0) produced most of the publicly available genetic variations annotations in this species. Yet, there are around 400 geographically and physiologically diverse breeds of horse. To enrich the current collection of genetic variants in the horse, we sequenced whole genomes from six horses of six different breeds: an American Miniature, a Percheron, an Arabian, a Mangalarga Marchador, a Native Mongolian Chakouyi, and a Tennessee Walking Horse, and mapped them to EquCab3.0 genome. Aside from extreme contrasts in body size, these breeds originate from diverse global locations and each possess unique adaptive physiology. A total of 1.3 billion reads were generated for the six horses with coverage between 15x to 24x per horse. After applying rigorous filtration, we identified and functionally annotated 17,514,723 Single Nucleotide Polymorphisms (SNPs), and 1,923,693 Insertions/Deletions (INDELs), as well as an average of 1,540 Copy Number Variations (CNVs) and 3,321 Structural Variations (SVs) per horse. Our results revealed putative functional variants including genes associated with size variation like LCORL gene (found in all horses), ZFAT in the Arabian, American Miniature and Percheron horses and ANKRD1 in the Native Mongolian Chakouyi horse. We detected a copy number variation in the Latherin gene that may be the result of evolutionary selection impacting thermoregulation by sweating, an important component of athleticism and heat tolerance. The newly discovered variants were formatted into user-friendly browser tracks and will provide a foundational database for future studies of the genetic underpinnings of diverse phenotypes within the horse. The domesticated horse played a unique role in human history, serving not just as a source of dietary animal protein, but also as a catalyst for long-distance migration and military conquest. As a result, the horse developed unique physiological adaptations to meet the demands of both their climatic environment and their relationship with man. Although the completion of the horse reference genome allowed for the discovery of many genetic variants, the remarkable diversity across breeds of horse calls for additional effort to quantify the complete span of genetic polymorphism within this unique species. In this work, we present genome re-sequencing and variant detection analysis for six horses belonging to six different breeds representing different morphology, origins and vary in their physiological demands and response. We identified and annotated not just single nucleotide polymorphisms (SNPs), but also insertions and deletions (INDELs), copy number variations (CNVs) and structural variations (SVs). Our results illustrate novel sources of polymorphism and highlight potentially impactful variations for phenotypes of body size and conformation. We also detected a copy number loss in the Latherin gene that could be the result of an evolutionary selection affecting thermoregulation through sweating. Our newly discovered variants were formatted into easy-to-use tracks that can be easily accessed by researchers around the globe.
Collapse
|
8
|
Sharma R, Gupta RS. Novel Molecular Synapomorphies Demarcate Different Main Groups/Subgroups of Plasmodium and Piroplasmida Species Clarifying Their Evolutionary Relationships. Genes (Basel) 2019; 10:genes10070490. [PMID: 31261747 PMCID: PMC6678196 DOI: 10.3390/genes10070490] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Revised: 06/11/2019] [Accepted: 06/14/2019] [Indexed: 12/19/2022] Open
Abstract
The class Hematozoa encompasses several clinically important genera, including Plasmodium, whose members cause the major life-threating disease malaria. Hence, a good understanding of the interrelationships of organisms from this class and reliable means for distinguishing them are of much importance. This study reports comprehensive phylogenetic and comparative analyses on protein sequences on the genomes of 28 hematozoa species to understand their interrelationships. In addition to phylogenetic trees based on two large datasets of protein sequences, detailed comparative analyses were carried out on the genomes of hematozoa species to identify novel molecular synapomorphies consisting of conserved signature indels (CSIs) in protein sequences. These studies have identified 79 CSIs that are exclusively present in specific groups of Hematozoa/Plasmodium species, also supported by phylogenetic analysis, providing reliable means for the identification of these species groups and understanding their interrelationships. Of these CSIs, six CSIs are specifically shared by all hematozoa species, two CSIs serve to distinguish members of the order Piroplasmida, five CSIs are uniquely found in all Piroplasmida species except B. microti and two CSIs are specific for the genus Theileria. Additionally, we also describe 23 CSIs that are exclusively present in all genome-sequenced Plasmodium species and two, nine, ten and eight CSIs which are specific for members of the Plasmodium subgenera Haemamoeba, Laverania, Vinckeia and Plasmodium (excluding P. ovale and P. malariae), respectively. Additionally, our work has identified several CSIs that support species relationships which are not evident from phylogenetic analysis. Of these CSIs, one CSI supports the ancestral nature of the avian-Plasmodium species in comparison to the mammalian-infecting groups of Plasmodium species, four CSIs strongly support a specific relationship of species between the subgenera Plasmodium and Vinckeia and three CSIs each that reliably group P. malariae with members of the subgenus Plasmodium and P. ovale within the subgenus Vinckeia, respectively. These results provide a reliable framework for understanding the evolutionary relationships among the Plasmodium/Piroplasmida species. Further, in view of the exclusivity of the described molecular markers for the indicated groups of hematozoa species, particularly large numbers of unique characteristics that are specific for all Plasmodium species, they provide important molecular tools for biochemical/genetic studies and for developing novel diagnostics and therapeutics for these organisms.
Collapse
Affiliation(s)
- Rahul Sharma
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario L8N 3Z5, Canada.
| | - Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario L8N 3Z5, Canada.
| |
Collapse
|
9
|
Zong D, Gan P, Zhou A, Li J, Xie Z, Duan A, He C. Comparative analysis of the complete chloroplast genomes of seven Populus species: Insights into alternative female parents of Populus tomentosa. PLoS One 2019; 14:e0218455. [PMID: 31216332 PMCID: PMC6583991 DOI: 10.1371/journal.pone.0218455] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2018] [Accepted: 06/03/2019] [Indexed: 12/25/2022] Open
Abstract
Populus tomentosa, of section Populus, is distributed mainly in northern China. This species has high resistance to many diseases and insects, and it plays key roles in shelterbelts and urban afforestation in northern China. It has long been suspected to be a hybrid, but its parents remain unknown. In the present study, we report four newly sequenced complete cp genomes from section Populus and comparative genomic analyses of these new sequences and three published cp genome sequences. The seven cp genomes ranged from 155,853 bp (P. tremula var. davidiana) to 156,746 bp (P. adenopoda) in length, and their gene orders, gene numbers and GC contents were similar. We analyzed SNPs, indels, SSRs and repeats among the seven cp genomes, and eight small inversions were detected in the ndhC-trnV, rbcL-accD, petA-psbJ, trnW-trnP, rpl16-rps3, trnL-ycf15, ycf15-trnL, and ndhF-trnL intergenic regions. Furthermore, seven divergent regions (trnH-psbA, matK, psbM-psbD, ndhC-trnV, ycf1, ndhF-ccsA and ccsA-ndhD) were found in more highly variable regions. The phylogenetic tree reveals that P. tomentosa is closely related to P. alba and P. alba var. pyramidalis. Hence, P. alba was involved in the formation of P. tomentosa.
Collapse
Affiliation(s)
- Dan Zong
- Key Laboratory for Forest Genetic and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory of Biodiversity Conservation in Southwest China, State Forestry Administration, Southwest Forestry University, Kunming, Yunnan, China
| | - Peihua Gan
- Key Laboratory for Forest Genetic and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory of Biodiversity Conservation in Southwest China, State Forestry Administration, Southwest Forestry University, Kunming, Yunnan, China
| | - Anpei Zhou
- Key Laboratory for Forest Genetic and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory of Biodiversity Conservation in Southwest China, State Forestry Administration, Southwest Forestry University, Kunming, Yunnan, China
| | - Jinyu Li
- Key Laboratory for Forest Genetic and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory of Biodiversity Conservation in Southwest China, State Forestry Administration, Southwest Forestry University, Kunming, Yunnan, China
| | - Zhongli Xie
- Key Laboratory for Forest Genetic and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory of Biodiversity Conservation in Southwest China, State Forestry Administration, Southwest Forestry University, Kunming, Yunnan, China
| | - Anan Duan
- Key Laboratory for Forest Genetic and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory of Biodiversity Conservation in Southwest China, State Forestry Administration, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, Southwest Forestry University, Kunming, Yunnan, China
| | - Chengzhong He
- Key Laboratory for Forest Genetic and Tree Improvement and Propagation in Universities of Yunnan Province, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory of Biodiversity Conservation in Southwest China, State Forestry Administration, Southwest Forestry University, Kunming, Yunnan, China
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, Southwest Forestry University, Kunming, Yunnan, China
| |
Collapse
|
10
|
Zhang D, Li K, Gao J, Liu Y, Gao LZ. The Complete Plastid Genome Sequence of the Wild Rice Zizania latifolia and Comparative Chloroplast Genomics of the Rice Tribe Oryzeae, Poaceae. Front Ecol Evol 2016. [DOI: 10.3389/fevo.2016.00088] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
|
11
|
Gupta RS. Impact of genomics on the understanding of microbial evolution and classification: the importance of Darwin's views on classification. FEMS Microbiol Rev 2016; 40:520-53. [PMID: 27279642 DOI: 10.1093/femsre/fuw011] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/14/2016] [Indexed: 12/24/2022] Open
Abstract
Analyses of genome sequences, by some approaches, suggest that the widespread occurrence of horizontal gene transfers (HGTs) in prokaryotes disguises their evolutionary relationships and have led to questioning of the Darwinian model of evolution for prokaryotes. These inferences are critically examined in the light of comparative genome analysis, characteristic synapomorphies, phylogenetic trees and Darwin's views on examining evolutionary relationships. Genome sequences are enabling discovery of numerous molecular markers (synapomorphies) such as conserved signature indels (CSIs) and conserved signature proteins (CSPs), which are distinctive characteristics of different prokaryotic taxa. Based on these molecular markers, exhibiting high degree of specificity and predictive ability, numerous prokaryotic taxa of different ranks, currently identified based on the 16S rRNA gene trees, can now be reliably demarcated in molecular terms. Within all studied groups, multiple CSIs and CSPs have been identified for successive nested clades providing reliable information regarding their hierarchical relationships and these inferences are not affected by HGTs. These results strongly support Darwin's views on evolution and classification and supplement the current phylogenetic framework based on 16S rRNA in important respects. The identified molecular markers provide important means for developing novel diagnostics, therapeutics and for functional studies providing important insights regarding prokaryotic taxa.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
12
|
Gupta RS. Molecular signatures that are distinctive characteristics of the vertebrates and chordates and supporting a grouping of vertebrates with the tunicates. Mol Phylogenet Evol 2015; 94:383-91. [PMID: 26419477 DOI: 10.1016/j.ympev.2015.09.019] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2015] [Revised: 09/16/2015] [Accepted: 09/18/2015] [Indexed: 12/12/2022]
Abstract
Members of the phylum Chordata and the subphylum Vertebrata are presently distinguished solely on the basis of morphological characteristics. The relationship of the vertebrates to the two non-vertebrate chordate subphyla is also a subject of debate. Analyses of protein sequences have identified multiple conserved signature indels (CSIs) that are specific for Chordata or for Vertebrata. Five CSIs in 4 important proteins are specific for the Vertebrata, whereas two other CSIs are uniquely found in all sequenced chordate species including Ciona intestinalis and Oikapleura dioica (Tunicates) as well as Branchiostoma floridae (Cephalochordates). The shared presence of these molecular signatures by all vertebrates/chordate species, but in no other animal taxa, strongly indicates that the genetic changes represented by the identified CSIs diagnose monophyletic groups. Two other discovered CSIs are uniquely shared by different vertebrate species and by either one (Ciona intestinalis) or both tunicate (Ciona and Oikapleura) species, but they are not found in Branchiostoma or other animal species. Specific presence of these CSIs in different vertebrates and either one or both tunicate species provides strong independent evidence that the vertebrate species are more closely related to the urochordates (tunicates) than to the cephalochordates.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario L8N 3Z5, Canada.
| |
Collapse
|
13
|
Abstract
The large phylogenetic distance separating eukaryotic genes and their archaeal orthologs has prevented identification of the position of the eukaryotic root in phylogenomic studies. Recently, an innovative approach has been proposed to circumvent this issue: the use as phylogenetic markers of proteins that have been transferred from bacterial donor sources to eukaryotes, after their emergence from Archaea. Using this approach, two recent independent studies have built phylogenomic datasets based on bacterial sequences, leading to different predictions of the eukaryotic root. Taking advantage of additional genome sequences from the jakobid Andalucia godoyi and the two known malawimonad species (Malawimonas jakobiformis and Malawimonas californiana), we reanalyzed these two phylogenomic datasets. We show that both datasets pinpoint the same phylogenetic position of the eukaryotic root that is between "Unikonta" and "Bikonta," with malawimonad and collodictyonid lineages on the Unikonta side of the root. Our results firmly indicate that (i) the supergroup Excavata is not monophyletic and (ii) the last common ancestor of eukaryotes was a biflagellate organism. Based on our results, we propose to rename the two major eukaryotic groups Unikonta and Bikonta as Opimoda and Diphoda, respectively.
Collapse
|
14
|
Kumar A, Bhandari A, Goswami C. Surveying genetic variants and molecular phylogeny of cerebral cavernous malformation gene, CCM3/PDCD10. Biochem Biophys Res Commun 2014; 455:98-106. [DOI: 10.1016/j.bbrc.2014.10.105] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 10/21/2014] [Indexed: 11/29/2022]
|
15
|
Ashkenazy H, Cohen O, Pupko T, Huchon D. Indel reliability in indel-based phylogenetic inference. Genome Biol Evol 2014; 6:3199-209. [PMID: 25409663 PMCID: PMC4986452 DOI: 10.1093/gbe/evu252] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
It is often assumed that it is unlikely that the same insertion or deletion (indel) event occurred at the same position in two independent evolutionary lineages, and thus, indel-based inference of phylogeny should be less subject to homoplasy compared with standard inference which is based on substitution events. Indeed, indels were successfully used to solve debated evolutionary relationships among various taxonomical groups. However, indels are never directly observed but rather inferred from the alignment and thus indel-based inference may be sensitive to alignment errors. It is hypothesized that phylogenetic reconstruction would be more accurate if it relied only on a subset of reliable indels instead of the entire indel data. Here, we developed a method to quantify the reliability of indel characters by measuring how often they appear in a set of alternative multiple sequence alignments. Our approach is based on the assumption that indels that are consistently present in most alternative alignments are more reliable compared with indels that appear only in a small subset of these alignments. Using simulated and empirical data, we studied the impact of filtering and weighting indels by their reliability scores on the accuracy of indel-based phylogenetic reconstruction. The new method is available as a web-server at http://guidance.tau.ac.il/RELINDEL/.
Collapse
Affiliation(s)
- Haim Ashkenazy
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| | - Ofir Cohen
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel Present address: Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Tal Pupko
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| | - Dorothée Huchon
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| |
Collapse
|
16
|
Banguera-Hinestroza E, Hayano A, Crespo E, Hoelzel AR. Delphinid systematics and biogeography with a focus on the current genus Lagenorhynchus: multiple pathways for antitropical and trans-oceanic radiation. Mol Phylogenet Evol 2014; 80:217-30. [PMID: 25130419 DOI: 10.1016/j.ympev.2014.08.005] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2013] [Revised: 07/20/2014] [Accepted: 08/03/2014] [Indexed: 10/24/2022]
Abstract
The six species currently classified within the genus Lagenorhynchus exhibit a pattern of antitropical distribution common among marine taxa. In spite of their morphological similarities they are now considered an artificial grouping, and include both recent and the oldest representatives of the Delphinidae radiation. They are, therefore, a good model for studying questions about the evolutionary processes that have driven dolphin speciation, dispersion and distribution. Here we used two different approaches. First we constructed a multigenic phylogeny with a minimum amount of missing data (based on 9 genes, 11,030bp, using the 6 species of the genus and their closest relatives) to infer their relationships. Second, we built a supermatrix phylogeny (based on 33 species and 27 genes) to test the effect of taxon sampling on the phylogeny of the genus, to provide inference on biogeographic history, and provide inference on the main events shaping the dispersion and radiation of delphinids. Our analyses suggested an early evolutionary history of marine dolphins in the North Atlantic Ocean and revealed multiple pathways of migration and radiation, probably guided by paleoceanographic changes during the Miocene and Pliocene. L. acutus and L. albirostris likely shared a common ancestor that arose in the North Atlantic around the Middle Miocene, predating the radiation of subfamilies Delphininae, Globicephalinae and Lissodelphininae.
Collapse
Affiliation(s)
| | - Azusa Hayano
- Wildlife Research Center, Kyoto University, Sakyo, Kyoto 606-8203, Japan
| | - Enrique Crespo
- Centro Nacional Patagonico (CONICET), Blvd. Brown 3600 (9120), Puerto Madryn, Chubut, Argentina
| | - A Rus Hoelzel
- Department of Biological and Biomedical Sciences, University of Durham, South Road DH1 3LE, UK.
| |
Collapse
|
17
|
Huang H, Shi C, Liu Y, Mao SY, Gao LZ. Thirteen Camellia chloroplast genome sequences determined by high-throughput sequencing: genome structure and phylogenetic relationships. BMC Evol Biol 2014; 14:151. [PMID: 25001059 PMCID: PMC4105164 DOI: 10.1186/1471-2148-14-151] [Citation(s) in RCA: 248] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2014] [Accepted: 06/20/2014] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Camellia is an economically and phylogenetically important genus in the family Theaceae. Owing to numerous hybridization and polyploidization, it is taxonomically and phylogenetically ranked as one of the most challengingly difficult taxa in plants. Sequence comparisons of chloroplast (cp) genomes are of great interest to provide a robust evidence for taxonomic studies, species identification and understanding mechanisms that underlie the evolution of the Camellia species. RESULTS The eight complete cp genomes and five draft cp genome sequences of Camellia species were determined using Illumina sequencing technology via a combined strategy of de novo and reference-guided assembly. The Camellia cp genomes exhibited typical circular structure that was rather conserved in genomic structure and the synteny of gene order. Differences of repeat sequences, simple sequence repeats, indels and substitutions were further examined among five complete cp genomes, representing a wide phylogenetic diversity in the genus. A total of fifteen molecular markers were identified with more than 1.5% sequence divergence that may be useful for further phylogenetic analysis and species identification of Camellia. Our results showed that, rather than functional constrains, it is the regional constraints that strongly affect sequence evolution of the cp genomes. In a substantial improvement over prior studies, evolutionary relationships of the section Thea were determined on basis of phylogenomic analyses of cp genome sequences. CONCLUSIONS Despite a high degree of conservation between the Camellia cp genomes, sequence variation among species could still be detected, representing a wide phylogenetic diversity in the genus. Furthermore, phylogenomic analysis was conducted using 18 complete cp genomes and 5 draft cp genome sequences of Camellia species. Our results support Chang's taxonomical treatment that C. pubicosta may be classified into sect. Thea, and indicate that taxonomical value of the number of ovaries should be reconsidered when classifying the Camellia species. The availability of these cp genomes provides valuable genetic information for accurately identifying species, clarifying taxonomy and reconstructing the phylogeny of the genus Camellia.
Collapse
Affiliation(s)
- Hui Huang
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - Chao Shi
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
- University of the Chinese Academy of Sciences, Beijing 100039, China
| | - Yuan Liu
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
- University of the Chinese Academy of Sciences, Beijing 100039, China
| | - Shu-Yan Mao
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - Li-Zhi Gao
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| |
Collapse
|
18
|
An alternative root for the eukaryote tree of life. Curr Biol 2014; 24:465-70. [PMID: 24508168 DOI: 10.1016/j.cub.2014.01.036] [Citation(s) in RCA: 153] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Revised: 12/12/2013] [Accepted: 01/16/2014] [Indexed: 01/02/2023]
Abstract
The root of the eukaryote tree of life defines some of the most fundamental relationships among species. It is also critical for defining the last eukaryote common ancestor (LECA), the shared heritage of all extant species. The unikont-bikont root has been the reigning paradigm for eukaryotes for more than 10 years but is becoming increasingly controversial. We developed a carefully vetted data set, consisting of 37 nuclear-encoded proteins of close bacterial ancestry (euBacs) and their closest bacterial relatives, augmented by deep sequencing of the Acrasis kona (Heterolobosea, Discoba) transcriptome. Phylogenetic analysis of these data produces a highly robust, fully resolved global phylogeny of eukaryotes. The tree sorts all examined eukaryotes into three megagroups and identifies the Discoba, and potentially its parent taxon Excavata, as the sister group to the bulk of known eukaryote diversity, the proposed Neozoa (Amorphea + Stramenopila+Alveolata+Rhizaria+Plantae [SARP]). All major alternative hypotheses are rejected with as little as ∼50% of the data, and this resolution is unaffected by the presence of fast-evolving alignment positions or distant outgroup sequences. This "neozoan-excavate" root revises hypotheses of early eukaryote evolution and highlights the importance of the poorly studied Discoba for understanding the evolution of eukaryotic diversity and basic cellular processes.
Collapse
|
19
|
Li R, Ma PF, Wen J, Yi TS. Complete sequencing of five araliaceae chloroplast genomes and the phylogenetic implications. PLoS One 2013; 8:e78568. [PMID: 24205264 PMCID: PMC3799623 DOI: 10.1371/journal.pone.0078568] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Accepted: 09/13/2013] [Indexed: 11/29/2022] Open
Abstract
Background The ginseng family (Araliaceae) includes a number of economically important plant species. Previously phylogenetic studies circumscribed three major clades within the core ginseng plant family, yet the internal relationships of each major group have been poorly resolved perhaps due to rapid radiation of these lineages. Recent studies have shown that phyogenomics based on chloroplast genomes provides a viable way to resolve complex relationships. Methodology/Principal Findings We report the complete nucleotide sequences of five Araliaceae chloroplast genomes using next-generation sequencing technology. The five chloroplast genomes are 156,333–156,459 bp in length including a pair of inverted repeats (25,551–26,108 bp) separated by the large single-copy (86,028–86,566 bp) and small single-copy (18,021–19,117 bp) regions. Each chloroplast genome contains the same 114 unique genes consisting of 30 transfer RNA genes, four ribosomal RNA genes, and 80 protein coding genes. Gene size, content, and order, AT content, and IR/SC boundary structure are similar among all Araliaceae chloroplast genomes. A total of 140 repeats were identified in the five chloroplast genomes with palindromic repeat as the most common type. Phylogenomic analyses using parsimony, likelihood, and Bayesian inference based on the complete chloroplast genomes strongly supported the monophyly of the Asian Palmate group and the Aralia-Panax group. Furthermore, the relationships among the sampled taxa within the Asian Palmate group were well resolved. Twenty-six DNA markers with the percentage of variable sites higher than 5% were identified, which may be useful for phylogenetic studies of Araliaceae. Conclusion The chloroplast genomes of Araliaceae are highly conserved in all aspects of genome features. The large-scale phylogenomic data based on the complete chloroplast DNA sequences is shown to be effective for the phylogenetic reconstruction of Araliaceae.
Collapse
Affiliation(s)
- Rong Li
- Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People’s Republic of China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People’s Republic of China
| | - Peng-Fei Ma
- Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People’s Republic of China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People’s Republic of China
| | - Jun Wen
- Department of Botany, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, United States of America
- * E-mail: (JW); (T-SY)
| | - Ting-Shuang Yi
- Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People’s Republic of China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People’s Republic of China
- * E-mail: (JW); (T-SY)
| |
Collapse
|
20
|
Wang S, Shi C, Gao LZ. Plastid genome sequence of a wild woody oil species, Prinsepia utilis, provides insights into evolutionary and mutational patterns of Rosaceae chloroplast genomes. PLoS One 2013; 8:e73946. [PMID: 24023915 PMCID: PMC3759469 DOI: 10.1371/journal.pone.0073946] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2013] [Accepted: 07/22/2013] [Indexed: 11/18/2022] Open
Abstract
Background Prinsepiautilis Royle is a wild woody oil species of Rosaceae that yields edible oil which has been proved to possess particular benefits for human health and medical therapy. However, the lack of bred varieties has largely impeded exploiting immense potentials for high quality of its seed oil. It is urgently needed to enlarge the knowledge of genetic basis of the species and develop genetic markers to enhance modern breeding programs. Results Here we reported the complete chloroplast (cp) genome of 156,328 bp. Comparative cp sequence analyses of P. utilis along with other four Rosaceae species resulted in similar genome structures, gene orders, and gene contents. Contraction/expansion of inverted repeat regions (IRs) explained part of the length variation in the Rosaceae cp genomes. Genome sequence alignments revealed that nucleotide diversity was associated with AT content, and large single copy regions (LSC) and small single copy regions (SSC) harbored higher sequence variations in both coding and non-coding regions than IRs. Simple sequence repeats (SSRs) were detected in the P. utilis and compared with those of the other four Rosaceae cp genomes. Almost all the SSR loci were composed of A or T, therefore it might contribute to the A-T richness of cp genomes and be associated with AT biased sequence variation. Among all the protein-coding genes, ycf1 showed the highest sequence divergence, indicating that it could accomplish the discrimination of species within Rosaceae as well as within angiosperms better than other genes. Conclusions With the addition of this new sequenced cp genome, high nucleotide substitution rate and abundant deletions/insertions were observed, suggesting a greater genomic dynamics than previously explored in Rosaceae. The availability of the complete cp genome of P. utilis will provide chloroplast markers and genetic information to better enhance the conservation and utilization of this woody oil plant.
Collapse
Affiliation(s)
- Shuo Wang
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
| | - Chao Shi
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Li-Zhi Gao
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, China
- * E-mail:
| |
Collapse
|
21
|
Characterization of bud emergence 46 (BEM46) protein: sequence, structural, phylogenetic and subcellular localization analyses. Biochem Biophys Res Commun 2013; 438:526-32. [PMID: 23916612 DOI: 10.1016/j.bbrc.2013.07.103] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 07/25/2013] [Indexed: 02/04/2023]
Abstract
The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved "known unknown" eukaryotic genes. This warrants for a comprehensive phylogenetic analysis of this superfamily to unravel origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Usingcomparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow to differentiate fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineage. Furthermore, we unravel that BEM46 protein from N. crassa possess a novel endoplasmic-retention signal (PEKK) using GFP-fusion tagging experiments. We propose that three residues namely a serine 188S, a histidine 292H and an aspartic acid 262D are most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional analyses. The evolutionary history of BEM46 proteins is characterized by exonic indels in lineage specific manner.
Collapse
|
22
|
Ajawatanawong P, Baldauf SL. Evolution of protein indels in plants, animals and fungi. BMC Evol Biol 2013; 13:140. [PMID: 23826714 PMCID: PMC3706215 DOI: 10.1186/1471-2148-13-140] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Accepted: 06/24/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Insertions/deletions (indels) in protein sequences are useful as drug targets, protein structure predictors, species diagnostics and evolutionary markers. However there is limited understanding of indel evolutionary patterns. We sought to characterize indel patterns focusing first on the major groups of multicellular eukaryotes. RESULTS Comparisons of complete proteomes from a taxonically broad set of primarily Metazoa, Fungi and Viridiplantae yielded 299 substantial (>250aa) universal, single-copy (in-paralog only) proteins, from which 901 simple (present/absent) and 3,806 complex (multistate) indels were extracted. Simple indels are mostly small (1-7aa) with a most frequent size class of 1aa. However, even these simple looking indels show a surprisingly high level of hidden homoplasy (multiple independent origins). Among the apparently homoplasy-free simple indels, we identify 69 potential clade-defining indels (CDIs) that may warrant closer examination. CDIs show a very uneven taxonomic distribution among Viridiplante (13 CDIs), Fungi (40 CDIs), and Metazoa (0 CDIs). An examination of singleton indels shows an excess of insertions over deletions in nearly all examined taxa. This excess averages 2.31 overall, with a maximum observed value of 7.5 fold. CONCLUSIONS We find considerable potential for identifying taxon-marker indels using an automated pipeline. However, it appears that simple indels in universal proteins are too rare and homoplasy-rich to be used for pure indel-based phylogeny. The excess of insertions over deletions seen in nearly every genome and major group examined maybe useful in defining more realistic gap penalties for sequence alignment. This bias also suggests that insertions in highly conserved proteins experience less purifying selection than do deletions.
Collapse
Affiliation(s)
- Pravech Ajawatanawong
- Department of Systematic Biology, Evolutionary Biology Centre (EBC), Uppsala University, Uppsala 75236, Sweden.
| | | |
Collapse
|
23
|
[Phylogenetic application and analysis of horizontal transfer based on the prokaryote eno gene]. YI CHUAN = HEREDITAS 2012; 34:907-18. [PMID: 22805218 DOI: 10.3724/sp.j.1005.2012.00907] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The phenomenon of conflicting gene trees has become a remarkable and difficult problem. Application of multiple genes has been a widespread practice to reconstruct phylogenies in phylogenetic studies. Enolase is a key glycolytic enzyme, The enzymes from a large variety of organisms, including archaebacteria, eubacteria and eukaryotes, were studied. We downloaded eno sequences from the genomes of bacteria and archaea that have been completely sequenced. The comprehensive homology search and phylogenetic analysis of the eno were used, and nineteen horizontally transferred genes were identified. The results of analysis showed lots of differences between the features of horizontal transferred genes and the ones of whole genomic genes, such as nucleotide composition, gene combination, codon usage bias, and selection pressure. These results reconfirmed that the horizontally transferred genes were exogenous. The result revealed that prokaryote eno genes were highly conserved, medium-sized, is a good material in the phylogenetic. This paper can provide a reference in study of life habit and evolutionary history of donor and receptor, and enolase structure and function.
Collapse
|
24
|
Moe KT, Kwon SW, Park YJ. Trends in genomics and molecular marker systems for the development of some underutilized crops. Genes Genomics 2012. [DOI: 10.1007/s13258-012-0049-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
25
|
Denton JS, Wheeler WC. Indel information eliminates trivial sequence alignment in maximum likelihood phylogenetic analysis. Cladistics 2012; 28:514-528. [DOI: 10.1111/j.1096-0031.2012.00402.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
26
|
Gao B, Gupta RS. Phylogenetic framework and molecular signatures for the main clades of the phylum Actinobacteria. Microbiol Mol Biol Rev 2012; 76:66-112. [PMID: 22390973 PMCID: PMC3294427 DOI: 10.1128/mmbr.05011-11] [Citation(s) in RCA: 167] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria.
Collapse
Affiliation(s)
- Beile Gao
- Department of Biochemistry and Biomedical Science, McMaster University, Hamilton, Ontario, Canada
| | | |
Collapse
|
27
|
Assis R, Kondrashov AS. A strong deletion bias in nonallelic gene conversion. PLoS Genet 2012; 8:e1002508. [PMID: 22359514 PMCID: PMC3280953 DOI: 10.1371/journal.pgen.1002508] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2011] [Accepted: 12/12/2011] [Indexed: 11/19/2022] Open
Abstract
Gene conversion is the unidirectional transfer of genetic information between orthologous (allelic) or paralogous (nonallelic) genomic segments. Though a number of studies have examined nucleotide replacements, little is known about length difference mutations produced by gene conversion. Here, we investigate insertions and deletions produced by nonallelic gene conversion in 338 Drosophila and 10,149 primate paralogs. Using a direct phylogenetic approach, we identify 179 insertions and 614 deletions in Drosophila paralogs, and 132 insertions and 455 deletions in primate paralogs. Thus, nonallelic gene conversion is strongly deletion-biased in both lineages, with almost 3.5 times as many conversion-induced deletions as insertions. In primates, the deletion bias is considerably stronger for long indels and, in both lineages, the per-site rate of gene conversion is orders of magnitudes higher than that of ordinary mutation. Due to this high rate, deletion-biased nonallelic gene conversion plays a key role in genome size evolution, leading to the cooperative shrinkage and eventual disappearance of selectively neutral paralogs. Gene conversion is a process whereby a DNA sequence is copied from one segment of the genome (donor) to another (recipient), resulting in the replacement, insertion, or deletion of a DNA sequence in the recipient. This exchange is facilitated by the high sequence similarity of the two segments, which is due to their evolutionary relationship. Here, we study insertions and deletions produced by gene conversion between paralogs, segments related by DNA duplication events. By comparing paralog sequences in multiple species of fruit flies and primates, we find that deletions occur more than three times as frequently as insertions. We also discover that the rate of gene conversion between paralogs is quite high. The deletion bias and high rate of this process causes paralogs to shrink cooperatively and eventually be eliminated from the genome. Because of the abundance of paralogs in animal genomes, this phenomenon can lead to a significant reduction in genome size. Therefore, our finding enhances our understanding of the forces that lead to changes in genome size during evolution.
Collapse
Affiliation(s)
- Raquel Assis
- Department of Integrative Biology, Center for Theoretical Evolutionary Genomics, University of California Berkeley, Berkeley, California, USA.
| | | |
Collapse
|
28
|
Derelle R, Lang BF. Rooting the eukaryotic tree with mitochondrial and bacterial proteins. Mol Biol Evol 2011; 29:1277-89. [PMID: 22135192 DOI: 10.1093/molbev/msr295] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
By exploiting the large body of genome data and the considerable progress in phylogenetic methodology, recent phylogenomic studies have provided new insights into the relationships among major eukaryotic groups. However, confident placement of the eukaryotic root remains a major challenge. This is due to the large evolutionary distance separating eukaryotes from their closest relatives, the Archaea, implying a weak phylogenetic signal and strong long-branch attraction artifacts. Here, we apply a new approach to the rooting of the eukaryotic tree by using a subset of genomic information with more recent evolutionary origin-mitochondrial sequences, whose closest relatives are α-Proteobacteria. For this, we identified and assembled a data set of 42 mitochondrial proteins (mainly encoded by the nuclear genome) and performed Bayesian and maximum likelihood analyses. Taxon sampling includes the recently sequenced Thecamonas trahens, a member of the phylogenetically elusive Apusozoa. This data set confirms the relationships of several eukaryotic supergroups seen before and places the eukaryotic root between the monophyletic "unikonts" and "bikonts." We further show that T. trahens branches sister to Opisthokonta with significant statistical support and question the bikont/excavate affiliation of Malawimonas species. The mitochondrial data set developed here (to be expanded in the future) constitutes a unique alternative means in resolving deep eukaryotic relationships.
Collapse
Affiliation(s)
- Romain Derelle
- Bioinformatics and Genomics Program, Centre for Genomic Regulation and Universitat Pompeu Fabra, Barcelona, Spain.
| | | |
Collapse
|
29
|
Zhang YJ, Ma PF, Li DZ. High-throughput sequencing of six bamboo chloroplast genomes: phylogenetic implications for temperate woody bamboos (Poaceae: Bambusoideae). PLoS One 2011; 6:e20596. [PMID: 21655229 PMCID: PMC3105084 DOI: 10.1371/journal.pone.0020596] [Citation(s) in RCA: 195] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2011] [Accepted: 05/05/2011] [Indexed: 11/25/2022] Open
Abstract
Background Bambusoideae is the only subfamily that contains woody members in the grass family, Poaceae. In phylogenetic analyses, Bambusoideae, Pooideae and Ehrhartoideae formed the BEP clade, yet the internal relationships of this clade are controversial. The distinctive life history (infrequent flowering and predominance of asexual reproduction) of woody bamboos makes them an interesting but taxonomically difficult group. Phylogenetic analyses based on large DNA fragments could only provide a moderate resolution of woody bamboo relationships, although a robust phylogenetic tree is needed to elucidate their evolutionary history. Phylogenomics is an alternative choice for resolving difficult phylogenies. Methodology/Principal Findings Here we present the complete nucleotide sequences of six woody bamboo chloroplast (cp) genomes using Illumina sequencing. These genomes are similar to those of other grasses and rather conservative in evolution. We constructed a phylogeny of Poaceae from 24 complete cp genomes including 21 grass species. Within the BEP clade, we found strong support for a sister relationship between Bambusoideae and Pooideae. In a substantial improvement over prior studies, all six nodes within Bambusoideae were supported with ≥0.95 posterior probability from Bayesian inference and 5/6 nodes resolved with 100% bootstrap support in maximum parsimony and maximum likelihood analyses. We found that repeats in the cp genome could provide phylogenetic information, while caution is needed when using indels in phylogenetic analyses based on few selected genes. We also identified relatively rapidly evolving cp genome regions that have the potential to be used for further phylogenetic study in Bambusoideae. Conclusions/Significance The cp genome of Bambusoideae evolved slowly, and phylogenomics based on whole cp genome could be used to resolve major relationships within the subfamily. The difficulty in resolving the diversification among three clades of temperate woody bamboos, even with complete cp genome sequences, suggests that these lineages may have diverged very rapidly.
Collapse
Affiliation(s)
- Yun-Jie Zhang
- Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People's Republic of China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People's Republic of China
- Graduate University of Chinese Academy of Sciences, Beijing, People's Republic of China
| | - Peng-Fei Ma
- Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People's Republic of China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People's Republic of China
- Graduate University of Chinese Academy of Sciences, Beijing, People's Republic of China
| | - De-Zhu Li
- Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People's Republic of China
- Plant Germplasm and Genomics Center, Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, People's Republic of China
- * E-mail:
| |
Collapse
|
30
|
The utility of indels in population genetics: The Tpi intron for host race genealogy of Acrocercops transecta (Insecta: Lepidoptera). Mol Phylogenet Evol 2011; 59:469-76. [DOI: 10.1016/j.ympev.2011.03.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2010] [Revised: 02/09/2011] [Accepted: 03/04/2011] [Indexed: 11/21/2022]
|
31
|
Gollapalli DR, Macpherson IS, Liechti G, Gorla SK, Goldberg JB, Hedstrom L. Structural determinants of inhibitor selectivity in prokaryotic IMP dehydrogenases. ACTA ACUST UNITED AC 2011; 17:1084-91. [PMID: 21035731 DOI: 10.1016/j.chembiol.2010.07.014] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2010] [Revised: 06/26/2010] [Accepted: 07/20/2010] [Indexed: 10/18/2022]
Abstract
The protozoan parasite Cryptosporidium parvum is a major cause of gastrointestinal disease; no effective drug therapy exists to treat this infection. Curiously, C. parvum IMPDH (CpIMPDH) is most closely related to prokaryotic IMPDHs, suggesting that the parasite obtained its IMPDH gene via horizontal transfer. We previously identified inhibitors of CpIMPDH that do not inhibit human IMPDHs. Here, we show that these compounds also inhibit IMPDHs from Helicobacter pylori, Borrelia burgdorferi, and Streptococcus pyogenes, but not from Escherichia coli. Residues Ala165 and Tyr358 comprise a structural motif that defines susceptible enzymes. Importantly, a second-generation CpIMPDH inhibitor has bacteriocidal activity on H. pylori but not E. coli. We propose that CpIMPDH-targeted inhibitors can be developed into a new class of antibiotics that will spare some commensal bacteria.
Collapse
|
32
|
Molecular signatures for the Crenarchaeota and the Thaumarchaeota. Antonie van Leeuwenhoek 2010; 99:133-57. [PMID: 20711675 DOI: 10.1007/s10482-010-9488-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Accepted: 07/26/2010] [Indexed: 10/19/2022]
Abstract
Crenarchaeotes found in mesophilic marine environments were recently placed into a new phylum of Archaea called the Thaumarchaeota. However, very few molecular characteristics of this new phylum are currently known which can be used to distinguish them from the Crenarchaeota. In addition, their relationships to deep-branching archaeal lineages are unclear. We report here detailed analyses of protein sequences from Crenarchaeota and Thaumarchaeota that have identified many conserved signature indels (CSIs) and signature proteins (SPs) (i.e., proteins for which all significant blast hits are from these groups) that are specific for these archaeal groups. Of the identified signatures 6 CSIs and 13 SPs are specific for the Crenarchaeota phylum; 6 CSIs and >250 SPs are uniquely found in various Thaumarchaeota (viz. Cenarchaeum symbiosum, Nitrosopumilus maritimus and a number of uncultured marine crenarchaeotes) and 3 CSIs and ~10 SPs are found in both Thaumarchaeota and Crenarchaeota species. Some of the molecular signatures are also present in Korarchaeum cryptofilum, which forms the independent phylum Korarchaeota. Although some of these molecular signatures suggest a distant shared ancestry between Thaumarchaeota and Crenarchaeota, our identification of large numbers of Thaumarchaeota-specific proteins and their deep branching between the Crenarchaeota and Euryarchaeota phyla in phylogenetic trees shows that they are distinct from both Crenarchaeota and Euryarchaeota in both genetic and phylogenetic terms. These observations support the placement of marine mesophilic archaea into the separate phylum Thaumarchaeota. Additionally, many CSIs and SPs have been found that are specific for different orders within Crenarchaeota (viz. Sulfolobales-3 CSIs and 169 SPs, Thermoproteales-5 CSIs and 25 SPs, Desulfurococcales-4 SPs, and Sulfolobales and Desulfurococcales-2 CSIs and 18 SPs). The signatures described here provide novel means for distinguishing the Crenarchaeota and the Thaumarchaeota and for the classification of related and novel species in different environments. Functional studies on these signature proteins could lead to discovery of novel biochemical properties that are unique to these groups of archaea.
Collapse
|
33
|
Dagan T, Roettger M, Bryant D, Martin W. Genome networks root the tree of life between prokaryotic domains. Genome Biol Evol 2010; 2:379-92. [PMID: 20624742 PMCID: PMC2997548 DOI: 10.1093/gbe/evq025] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Eukaryotes arose from prokaryotes, hence the root in the tree of life resides among the prokaryotic domains. The position of the root is still debated, although pinpointing it would aid our understanding of the early evolution of life. Because prokaryote evolution was long viewed as a tree-like process of lineage bifurcations, efforts to identify the most ancient microbial lineage split have traditionally focused on positioning a root on a phylogenetic tree constructed from one or several genes. Such studies have delivered widely conflicting results on the position of the root, this being mainly due to methodological problems inherent to deep gene phylogeny and the workings of lateral gene transfer among prokaryotes over evolutionary time. Here, we report the position of the root determined with whole genome data using network-based procedures that take into account both gene presence or absence and the level of sequence similarity among all individual gene families that are shared across genomes. On the basis of 562,321 protein-coding gene families distributed across 191 genomes, we find that the deepest divide in the prokaryotic world is interdomain, that is, separating the archaebacteria from the eubacteria. This result resonates with some older views but conflicts with the results of most studies over the last decade that have addressed the issue. In particular, several studies have suggested that the molecular distinctness of archaebacteria is not evidence for their antiquity relative to eubacteria but instead stems from some kind of inherently elevated rate of archaebacterial sequence change. Here, we specifically test for such a rate elevation across all prokaryotic lineages through the analysis of all possible quartets among eight genes duplicated in all prokaryotes, hence the last common ancestor thereof. The results show that neither the archaebacteria as a group nor the eubacteria as a group harbor evidence for elevated evolutionary rates in the sampled genes, either in the recent evolutionary past or in their common ancestor. The interdomain prokaryotic position of the root is thus not attributable to lineage-specific rate variation.
Collapse
Affiliation(s)
- Tal Dagan
- Institute of Botany III, Heinrich-Heine University of Düsseldorf, Düsseldorf, Germany.
| | | | | | | |
Collapse
|
34
|
Abstract
High levels of alignment errors associated with gaps have generally meant their exclusion from phylogenetic analysis. Conserved inserts and deletions (indels) may in some cases be less subject to errors than amino acid substitutions for inferring the history of genomes and identifying recently laterally transferred genes, but alignment error near gaps must be evaluated prior to using indels as phylogenetic characters. A method is presented for evaluating the phylogenetic unambiguity of gaps in multiple sequence alignments by allowing a defined amount of pairwise alignment ambiguity. This work considers the bacterial genus Shewanella, which is of particular interest for applications of bioremediation and environmental engineering. Understanding the genetic history of these species is vital for these applications. A set of pairwise dynamic programming alignments is constructed to test positions in multiple alignments for phylogenetic unambiguity, and a whole genome scan is done on protein sequences from 11 sequenced species of the bacterial genus Shewanella. The splits defined by phylogenetically unambiguous indels are then used as characters for phylogenetic analysis, and results are compared to whole genome Maximum Likelihood phylogeny. A comparable description of the history of the species is found, as well as a set of lateral gene transfer candidates undetectable by traditional analysis of amino acid substitutions. This analysis is applicable to other taxonomic units at all levels and has the potential to allow cataloging of clear genome-wide phylogenetic markers for taxonomic profiling down to the species level.
Collapse
|
35
|
Wilson JJ. Assessing the value of DNA barcodes and other priority gene regions for molecular phylogenetics of Lepidoptera. PLoS One 2010; 5:e10525. [PMID: 20479871 PMCID: PMC2866325 DOI: 10.1371/journal.pone.0010525] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2010] [Accepted: 04/14/2010] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Despite apparently abundant amounts of observable variation and species diversity, the order Lepidoptera exhibits a morphological homogeneity that has provided only a limited number of taxonomic characters and led to widespread use of nucleotides for inferring relationships. This study aims to characterize and develop methods to quantify the value of priority gene regions designated for Lepidoptera molecular systematics. In particular, I assess how the DNA barcode segment of the mitochondrial COI gene performs across a broad temporal range given its number one position of priority, most sequenced status, and the conflicting opinions on its phylogenetic performance. METHODOLOGY/PRINCIPAL FINDINGS Gene regions commonly sequenced for lepidoptera phylogenetics were scored using multiple measures across three categories: practicality, which includes universality of primers and sequence quality; phylogenetic utility; and phylogenetic signal. I found that alternative measures within a category often appeared correlated, but high scores in one category did not necessarily translate into high scores in another. The DNA barcode was easier to sequence than other genes, and had high scores for utility but low signal above the genus level. CONCLUSIONS/SIGNIFICANCE Given limited financial resources and time constraints, careful selection of gene regions for molecular phylogenetics is crucial to avoid wasted effort producing partially informative data. This study introduces an approach to assessing the value of gene regions prior to the initiation of new studies and presents empirical results to help guide future selections.
Collapse
Affiliation(s)
- John James Wilson
- Department of Integrative Biology, University of Guelph, Guelph, Ontario, Canada.
| |
Collapse
|
36
|
Belinky F, Cohen O, Huchon D. Large-scale parsimony analysis of metazoan indels in protein-coding genes. Mol Biol Evol 2009; 27:441-51. [PMID: 19864469 DOI: 10.1093/molbev/msp263] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Insertions and deletions (indels) are considered to be rare evolutionary events, the analysis of which may resolve controversial phylogenetic relationships. Indeed, indel characters are often assumed to be less homoplastic than amino acid and nucleotide substitutions and, consequently, more reliable markers for phylogenetic reconstruction. In this study, we analyzed indels from over 1,000 metazoan orthologous genes. We studied the impact of different species sampling, ortholog data sets, lengths of included indels, and indel-coding methods on the resulting metazoan tree. Our results show that, similar to sequence substitutions, indels are homoplastic characters, and their analysis is sensitive to the long-branch attraction artifact. Furthermore, improving the taxon sampling and choosing a closely related outgroup greatly impact the phylogenetic inference. Our indel-based inferences support the Ecdysozoa hypothesis over the Coelomata hypothesis and suggest that sponges are a sister clade to other animals.
Collapse
|
37
|
Liang LJ, Weiss RE, Redelings B, Suchard MA. Improving phylogenetic analyses by incorporating additional information from genetic sequence databases. ACTA ACUST UNITED AC 2009; 25:2530-6. [PMID: 19661240 DOI: 10.1093/bioinformatics/btp473] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION Statistical analyses of phylogenetic data culminate in uncertain estimates of underlying model parameters. Lack of additional data hinders the ability to reduce this uncertainty, as the original phylogenetic dataset is often complete, containing the entire gene or genome information available for the given set of taxa. Informative priors in a Bayesian analysis can reduce posterior uncertainty; however, publicly available phylogenetic software specifies vague priors for model parameters by default. We build objective and informative priors using hierarchical random effect models that combine additional datasets whose parameters are not of direct interest but are similar to the analysis of interest. RESULTS We propose principled statistical methods that permit more precise parameter estimates in phylogenetic analyses by creating informative priors for parameters of interest. Using additional sequence datasets from our lab or public databases, we construct a fully Bayesian semiparametric hierarchical model to combine datasets. A dynamic iteratively reweighted Markov chain Monte Carlo algorithm conveniently recycles posterior samples from the individual analyses. We demonstrate the value of our approach by examining the insertion-deletion (indel) process in the enolase gene across the Tree of Life using the phylogenetic software BALI-PHY; we incorporate prior information about indels from 82 curated alignments downloaded from the BAliBASE database.
Collapse
Affiliation(s)
- Li-Jung Liang
- Department of Medicine Statistics Core, UCLA School of Public Health, Los Angeles, CA 90095, USA.
| | | | | | | |
Collapse
|
38
|
Brinkmann H, Philippe H. The Diversity Of Eukaryotes And The Root Of The Eukaryotic Tree. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2008; 607:20-37. [DOI: 10.1007/978-0-387-74021-8_2] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
39
|
Simmons MP, Richardson D, Reddy ASN. Incorporation of gap characters and lineage-specific regions into phylogenetic analyses of gene families from divergent clades: an example from the kinesin superfamily across eukaryotes. Cladistics 2008. [DOI: 10.1111/j.1096-0031.2007.00183.x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
40
|
Mußmann M, Hu FZ, Richter M, de Beer D, Preisler A, Jørgensen BB, Huntemann M, Glöckner FO, Amann R, Koopman WJH, Lasken RS, Janto B, Hogg J, Stoodley P, Boissy R, Ehrlich GD. Insights into the genome of large sulfur bacteria revealed by analysis of single filaments. PLoS Biol 2007; 5:e230. [PMID: 17760503 PMCID: PMC1951784 DOI: 10.1371/journal.pbio.0050230] [Citation(s) in RCA: 136] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2007] [Accepted: 06/26/2007] [Indexed: 11/19/2022] Open
Abstract
Marine sediments are frequently covered by mats of the filamentous Beggiatoa and other large nitrate-storing bacteria that oxidize hydrogen sulfide using either oxygen or nitrate, which they store in intracellular vacuoles. Despite their conspicuous metabolic properties and their biogeochemical importance, little is known about their genetic repertoire because of the lack of pure cultures. Here, we present a unique approach to access the genome of single filaments of Beggiatoa by combining whole genome amplification, pyrosequencing, and optical genome mapping. Sequence assemblies were incomplete and yielded average contig sizes of approximately 1 kb. Pathways for sulfur oxidation, nitrate and oxygen respiration, and CO2 fixation confirm the chemolithoautotrophic physiology of Beggiatoa. In addition, Beggiatoa potentially utilize inorganic sulfur compounds and dimethyl sulfoxide as electron acceptors. We propose a mechanism of vacuolar nitrate accumulation that is linked to proton translocation by vacuolar-type ATPases. Comparative genomics indicates substantial horizontal gene transfer of storage, metabolic, and gliding capabilities between Beggiatoa and cyanobacteria. These capabilities enable Beggiatoa to overcome non-overlapping availabilities of electron donors and acceptors while gliding between oxic and sulfidic zones. The first look into the genome of these filamentous sulfur-oxidizing bacteria substantially deepens the understanding of their evolution and their contribution to sulfur and nitrogen cycling in marine sediments.
Collapse
Affiliation(s)
- Marc Mußmann
- Max Planck Institute for Marine Microbiology, Bremen, Germany
- * To whom correspondence should be addressed. E-mail: (MM); (FOG); (GDE)
| | - Fen Z Hu
- Center for Genomic Sciences, Allegheny General Hospital/Allegheny-Singer Research Institute, Pittsburgh, Pennsylvania, United States of America
| | - Michael Richter
- Max Planck Institute for Marine Microbiology, Bremen, Germany
- School of Engineering and Sciences, Jacobs University Bremen, Bremen, Germany
| | - Dirk de Beer
- Max Planck Institute for Marine Microbiology, Bremen, Germany
| | - André Preisler
- Max Planck Institute for Marine Microbiology, Bremen, Germany
| | - Bo B Jørgensen
- Max Planck Institute for Marine Microbiology, Bremen, Germany
| | - Marcel Huntemann
- Max Planck Institute for Marine Microbiology, Bremen, Germany
- School of Engineering and Sciences, Jacobs University Bremen, Bremen, Germany
| | - Frank Oliver Glöckner
- Max Planck Institute for Marine Microbiology, Bremen, Germany
- School of Engineering and Sciences, Jacobs University Bremen, Bremen, Germany
- * To whom correspondence should be addressed. E-mail: (MM); (FOG); (GDE)
| | - Rudolf Amann
- Max Planck Institute for Marine Microbiology, Bremen, Germany
| | - Werner J. H Koopman
- Department of Membrane Biochemistry, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Roger S Lasken
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Benjamin Janto
- Center for Genomic Sciences, Allegheny General Hospital/Allegheny-Singer Research Institute, Pittsburgh, Pennsylvania, United States of America
| | - Justin Hogg
- Center for Genomic Sciences, Allegheny General Hospital/Allegheny-Singer Research Institute, Pittsburgh, Pennsylvania, United States of America
| | - Paul Stoodley
- Center for Genomic Sciences, Allegheny General Hospital/Allegheny-Singer Research Institute, Pittsburgh, Pennsylvania, United States of America
| | - Robert Boissy
- Center for Genomic Sciences, Allegheny General Hospital/Allegheny-Singer Research Institute, Pittsburgh, Pennsylvania, United States of America
| | - Garth D Ehrlich
- Center for Genomic Sciences, Allegheny General Hospital/Allegheny-Singer Research Institute, Pittsburgh, Pennsylvania, United States of America
- * To whom correspondence should be addressed. E-mail: (MM); (FOG); (GDE)
| |
Collapse
|
41
|
Rodríguez-Ezpeleta N, Brinkmann H, Burger G, Roger AJ, Gray MW, Philippe H, Lang BF. Toward Resolving the Eukaryotic Tree: The Phylogenetic Positions of Jakobids and Cercozoans. Curr Biol 2007; 17:1420-5. [PMID: 17689961 DOI: 10.1016/j.cub.2007.07.036] [Citation(s) in RCA: 157] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2006] [Revised: 07/10/2007] [Accepted: 07/17/2007] [Indexed: 11/18/2022]
Abstract
Resolving the global phylogeny of eukaryotes has proven to be challenging. Among the eukaryotic groups of uncertain phylogenetic position are jakobids, a group of bacterivorous flagellates that possess the most bacteria-like mitochondrial genomes known. Jakobids share several ultrastructural features with malawimonads and an assemblage of anaerobic protists (e.g., diplomonads and oxymonads). These lineages together with Euglenozoa and Heterolobosea have collectively been designated "excavates". However, published molecular phylogenies based on the sequences of nuclear rRNAs and up to six nucleus-encoded proteins do not provide convincing support for the monophyly of excavates, nor do they uncover their relationship to other major eukaryotic groups. Here, we report the first large-scale eukaryotic phylogeny, inferred from 143 nucleus-encoded proteins comprising 31,604 amino acid positions, that includes jakobids, malawimonads and cercozoans. We obtain compelling support for the monophyly of jakobids, Euglenozoa plus Heterolobosea (JEH group), and for the association of cercozoans with stramenopiles plus alveolates. Furthermore, we observe a sister-group relationship between the JEH group and malawimonads after removing fast-evolving species from the dataset. We discuss the implications of these results for the concept of "excavates" and for the elucidation of eukaryotic phylogeny in general.
Collapse
Affiliation(s)
- Naiara Rodríguez-Ezpeleta
- Centre Robert Cedergren, Département de Biochimie, Université de Montréal, 2900 Boulevard Edouard-Montpetit, Montréal, Québec H3T 1J4, Canada
| | | | | | | | | | | | | |
Collapse
|
42
|
Simmons MP, Müller K, Norton AP. The relative performance of indel-coding methods in simulations. Mol Phylogenet Evol 2007; 44:724-40. [PMID: 17512758 DOI: 10.1016/j.ympev.2007.04.001] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Revised: 04/02/2007] [Accepted: 04/04/2007] [Indexed: 11/26/2022]
Abstract
We used simulations to compare the performance of 10 approaches that have been used for treating unambiguously aligned gaps in phylogenetic analyses. We examined how these approaches perform under the ideal conditions of correct alignments, as well as how robust they are to errors caused by use of inferred alignments. Our results indicate that 5th-state coding dramatically outperformed all other coding methods, which in turn all outperformed treating gaps as missing data or excluding gapped positions. Simple indel coding (SIC) and modified complex indel coding (MCIC) performed about the same, and generally outperformed the other indel-coding methods. The high performance of 5th-state coding was largely found to be a weighting artifact. We suggest that MCIC-coded gap characters be scored for all unambiguously aligned gaps in parsimony-based molecular phylogenetic analyses. When the number of terminals sampled precludes the use of MCIC, SIC may be used as an effective substitute.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO 80523-1878, USA.
| | | | | |
Collapse
|
43
|
Abstract
Phylogenetic analysis has changed greatly in the last decade, and the most important themes in that change are reviewed here. Sequence data have become the most common source of phylogenetic information. This means that explicit models for evolutionary processes have been developed in a likelihood context, which allow more realistic data analyses. These models are becoming increasingly complex, both for nucleotides and for amino acid sequences, and so all such models need to be quantitatively assessed for each data set, to find the most appropriate one for use in any particular tree-building analysis. Bayesian analysis has been developed for tree-building and is greatly increasing in popularity. This is because a good heuristic strategy exists, which allows large data sets to be analyzed with complex evolutionary models in a practical time. Perhaps the most disappointing aspect of tree interpretation is the ongoing confusion between rooted and unrooted trees, while the effect of taxon and character sampling is often overlooked when constructing a phylogeny (especially in parasitology). The review finishes with a detailed consideration of the analysis of a multi-gene data set for several dozen taxa of Cryptosporidium (Apicomplexa), illustrating many of the theoretical and practical points highlighted in the review.
Collapse
Affiliation(s)
- David A Morrison
- Department of Parasitology (SWEPAR), National Veterinary Institute and Swedish University of Agricultural Sciences, 751 89 Uppsala, Sweden
| |
Collapse
|
44
|
Liapounova NA, Hampl V, Gordon PMK, Sensen CW, Gedamu L, Dacks JB. Reconstructing the mosaic glycolytic pathway of the anaerobic eukaryote Monocercomonoides. EUKARYOTIC CELL 2006; 5:2138-46. [PMID: 17071828 PMCID: PMC1694820 DOI: 10.1128/ec.00258-06] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
All eukaryotes carry out glycolysis, interestingly, not all using the same enzymes. Anaerobic eukaryotes face the challenge of fewer molecules of ATP extracted per molecule of glucose due to their lack of a complete tricarboxylic acid cycle. This may have pressured anaerobic eukaryotes to acquire the more ATP-efficient alternative glycolytic enzymes, such as pyrophosphate-fructose 6-phosphate phosphotransferase and pyruvate orthophosphate dikinase, through lateral gene transfers from bacteria and other eukaryotes. Most studies of these enzymes in eukaryotes involve pathogenic anaerobes; Monocercomonoides, an oxymonad belonging to the eukaryotic supergroup Excavata, is a nonpathogenic anaerobe representing an evolutionarily and ecologically distinct sampling of an anaerobic glycolytic pathway. We sequenced cDNA encoding glycolytic enzymes from a previously established cDNA library of Monocercomonoides and analyzed the relationships of these enzymes to those from other organisms spanning the major groups of Eukaryota, Bacteria, and Archaea. We established that, firstly, Monocercomonoides possesses alternative versions of glycolytic enzymes: fructose-6-phosphate phosphotransferase, both pyruvate kinase and pyruvate orthophosphate dikinase, cofactor-independent phosphoglycerate mutase, and fructose-bisphosphate aldolase (class II, type B). Secondly, we found evidence for the monophyly of oxymonads, kinetoplastids, diplomonads, and parabasalids, the major representatives of the Excavata. We also found several prokaryote-to-eukaryote as well as eukaryote-to-eukaryote lateral gene transfers involving glycolytic enzymes from anaerobic eukaryotes, further suggesting that lateral gene transfer was an important factor in the evolution of this pathway for denizens of this environment.
Collapse
Affiliation(s)
- Natalia A Liapounova
- Department of Biological Sciences, the University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | | | | | | | | | | |
Collapse
|
45
|
Ohkuma M, Saita K, Inoue T, Kudo T. Comparison of four protein phylogeny of parabasalian symbionts in termite guts. Mol Phylogenet Evol 2006; 42:847-53. [PMID: 17098447 DOI: 10.1016/j.ympev.2006.09.016] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2005] [Revised: 09/22/2006] [Accepted: 09/22/2006] [Indexed: 11/27/2022]
Affiliation(s)
- Moriya Ohkuma
- Environmental Molecular Biology Laboratory, RIKEN, Wako, Saitama 351-0198, Japan.
| | | | | | | |
Collapse
|
46
|
Roger AJ, Hug LA. The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation. Philos Trans R Soc Lond B Biol Sci 2006; 361:1039-54. [PMID: 16754613 PMCID: PMC1578731 DOI: 10.1098/rstb.2006.1845] [Citation(s) in RCA: 138] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Determining the relationships among and divergence times for the major eukaryotic lineages remains one of the most important and controversial outstanding problems in evolutionary biology. The sequencing and phylogenetic analyses of ribosomal RNA (rRNA) genes led to the first nearly comprehensive phylogenies of eukaryotes in the late 1980s, and supported a view where cellular complexity was acquired during the divergence of extant unicellular eukaryote lineages. More recently, however, refinements in analytical methods coupled with the availability of many additional genes for phylogenetic analysis showed that much of the deep structure of early rRNA trees was artefactual. Recent phylogenetic analyses of a multiple genes and the discovery of important molecular and ultrastructural phylogenetic characters have resolved eukaryotic diversity into six major hypothetical groups. Yet relationships among these groups remain poorly understood because of saturation of sequence changes on the billion-year time-scale, possible rapid radiations of major lineages, phylogenetic artefacts and endosymbiotic or lateral gene transfer among eukaryotes. Estimating the divergence dates between the major eukaryote lineages using molecular analyses is even more difficult than phylogenetic estimation. Error in such analyses comes from a myriad of sources including: (i) calibration fossil dates, (ii) the assumed phylogenetic tree, (iii) the nucleotide or amino acid substitution model, (iv) substitution number (branch length) estimates, (v) the model of how rates of evolution change over the tree, (vi) error inherent in the time estimates for a given model and (vii) how multiple gene data are treated. By reanalysing datasets from recently published molecular clock studies, we show that when errors from these various sources are properly accounted for, the confidence intervals on inferred dates can be very large. Furthermore, estimated dates of divergence vary hugely depending on the methods used and their assumptions. Accurate dating of divergence times among the major eukaryote lineages will require a robust tree of eukaryotes, a much richer Proterozoic fossil record of microbial eukaryotes assignable to extant groups for calibration, more sophisticated relaxed molecular clock methods and many more genes sampled from the full diversity of microbial eukaryotes.
Collapse
Affiliation(s)
- Andrew J Roger
- Canadian Institute for Advanced Research, Department of Biochemistry and Molecular Biology, Dalhousie University, Program in Evolutionary Biology Halifax, Nova Scotia, B3H 1X5 Canada.
| | | |
Collapse
|
47
|
Inagaki Y, Susko E, Roger AJ. Recombination between elongation factor 1alpha genes from distantly related archaeal lineages. Proc Natl Acad Sci U S A 2006; 103:4528-33. [PMID: 16537397 PMCID: PMC1450205 DOI: 10.1073/pnas.0600744103] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2005] [Indexed: 11/18/2022] Open
Abstract
Homologous recombination (HR) and lateral gene transfer are major processes in genome evolution. The combination of the two processes, HR between genes in different species, has been documented but is thought to be restricted to very similar sequences in relatively closely related organisms. Here we report two cases of interspecific HR in the gene encoding the core translational protein translation elongation factor 1alpha (EF-1alpha) between distantly related archaeal groups. Maximum-likelihood sliding window analyses indicate that a fragment of the EF-1alpha gene from the archaeal lineage represented by Methanopyrus kandleri was recombined into the orthologous gene in a common ancestor of the Thermococcales. A second recombination event appears to have occurred between the EF-1alpha gene of the genus Methanothermobacter and its ortholog in a common ancestor of the Methanosarcinales, a distantly related euryarchaeal lineage. These findings suggest that HR occurs across a much larger evolutionary distance than generally accepted and affects highly conserved essential "informational" genes. Although difficult to detect by standard whole-gene phylogenetic analyses, interspecific HR in highly conserved genes may occur at an appreciable frequency, potentially confounding deep phylogenetic inference and hypothesis testing.
Collapse
Affiliation(s)
- Yuji Inagaki
- *Center for Computational Sciences and Institute of Biological Sciences, University of Tsukuba, Tsukuba, Ibaraki 305-8577, Japan
| | - Edward Susko
- Department of Mathematics and Statistics and Genome Atlantic, Dalhousie University, Halifax, NS, Canada B3H 3J5; and
| | - Andrew J. Roger
- Canadian Institute for Advanced Research and Genome Atlantic, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada B3H 1X5
| |
Collapse
|
48
|
Suzuki T, Hashimoto T, Yabu Y, Majiwa PAO, Ohshima S, Suzuki M, Lu S, Hato M, Kido Y, Sakamoto K, Nakamura K, Kita K, Ohta N. Alternative oxidase (AOX) genes of African trypanosomes: phylogeny and evolution of AOX and plastid terminal oxidase families. J Eukaryot Microbiol 2005; 52:374-81. [PMID: 16014016 DOI: 10.1111/j.1550-7408.2005.00050.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
To clarify evolution and phylogenetic relationships of trypanosome alternative oxidase (AOX) molecules, AOX genes (cDNAs) of the African trypanosomes, Trypanosoma congolense and Trypanosoma evansi, were cloned by PCR. Both AOXs possess conserved consensus motifs (-E-, -EXXH-). The putative amino acid sequence of the AOX of T. evansi was exactly the same as that of T. brucei. A protein phylogeny of trypanosome AOXs revealed that three genetically and pathogenically distinct strains of T. congolense are closely related to each other. When all known AOX sequences collected from current databases were analyzed, the common ancestor of these three Trypanosoma species shared a sister-group position to T. brucei/T. evansi. Monophyly of Trypanosoma spp. was clearly supported (100% bootstrap value) with Trypanosoma vivax placed at the most basal position of the Trypanosoma clade. Monophyly of other eukaryotic lineages, terrestrial plants + red algae, Metazoa, diatoms, Alveolata, oomycetes, green algae, and Fungi, was reconstructed in the best AOX tree obtained from maximum likelihood analysis, although some of these clades were not strongly supported. The terrestrial plants + red algae clade showed the closest affinity with an alpha-proteobacterium, Novosphingobium aromaticivorans, and the common ancestor of these lineages, was separated from other eukaryotes. Although the root of the AOX subtree was not clearly determined, subsequent phylogenetic analysis of the composite tree for AOX and plastid terminal oxidase (PTOX) demonstrated that PTOX and related cyanobacterial sequences are of a monophyletic origin and their common ancestor is linked to AOX sequences.
Collapse
Affiliation(s)
- Takashi Suzuki
- Department of Molecular Parasitology, Nagoya City University, Graduate School of Medical Sciences, Kawasumi, Mizuho 467-8601 Nagoya, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Richards TA, Cavalier-Smith T. Myosin domain evolution and the primary divergence of eukaryotes. Nature 2005; 436:1113-8. [PMID: 16121172 DOI: 10.1038/nature03949] [Citation(s) in RCA: 341] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2005] [Accepted: 06/24/2005] [Indexed: 11/09/2022]
Abstract
Eukaryotic cells have two contrasting cytoskeletal and ciliary organizations. The simplest involves a single cilium-bearing centriole, nucleating a cone of individual microtubules (probably ancestral for unikonts: animals, fungi, Choanozoa and Amoebozoa). In contrast, bikonts (plants, chromists and all other protozoa) were ancestrally biciliate with a younger anterior cilium, converted every cell cycle into a dissimilar posterior cilium and multiple ciliary roots of microtubule bands. Here we show by comparative genomic analysis that this fundamental cellular dichotomy also involves different myosin molecular motors. We found 37 different protein domain combinations, often lineage-specific, and many previously unidentified. The sequence phylogeny and taxonomic distribution of myosin domain combinations identified five innovations that strongly support unikont monophyly and the primary bikont/unikont bifurcation. We conclude that the eukaryotic cenancestor (last common ancestor) had a cilium, mitochondria, pseudopodia, and myosins with three contrasting domain combinations and putative functions.
Collapse
Affiliation(s)
- Thomas A Richards
- Department of Zoology, The Natural History Museum, Cromwell Road, London SW7 5BD, UK.
| | | |
Collapse
|
50
|
Takishita K, Patron NJ, Ishida KI, Maruyama T, Keeling PJ. A Transcriptional Fusion of Genes Encoding Glyceraldehyde-3-Phosphate Dehydrogenase (GAPDH) and Enolase in Dinoflagellates. J Eukaryot Microbiol 2005; 52:343-8. [PMID: 16014012 DOI: 10.1111/j.1550-7408.2005.00042x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and enolase are enzymes essential for glycolysis and gluconeogenesis. Dinoflagellates possess several types of both GAPDH and enolase genes. Here, we identify a novel cytosolic GAPDH-enolase fusion protein in several dinoflagellate species. Phylogenetic analyses revealed that the GAPDH moiety of this fusion is weakly related to a cytosolic GAPDH previously reported in dinoflagellates, ciliates, and an apicomplexan. The enolase moiety has phylogenetic affinity with sequences from ciliates and apicomplexans, as expected for dinoflagellate genes. Furthermore, the enolase moiety has two insertions in a highly conserved region of the gene that are shared with ciliate and apicomplexan homologues, as well as with land plants, stramenopiles, haptophytes, and a chlorarachniophyte. Another glycolytic gene fusion in eukaryotes is the mitochondrion-targeted triose-phosphate isomerase (TPI) and GAPDH fusion in stramenopiles (i.e. diatoms and oomycetes). However, unlike the mitochondrial TPI-GAPDH fusion, the GAPDH-enolase fusion protein appears to exist in the same compartment as stand-alone homologues of each protein, and the metabolic reactions they catalyze in glycolysis and gluconeogenesis are not directly sequential. It is possible that the fusion is post-translationally processed to give separate GAPDH and enolase products, or that the fusion protein may function as a single bifunctional polypeptide in glycolysis, gluconeogenesis, or perhaps more likely in some previously unrecognized metabolic capacity.
Collapse
Affiliation(s)
- Kiyotaka Takishita
- Japan Agency for Marine-Earth Science and Technology (JAMSTEC), Yokosuka, Kanagawa 237-0061, Japan.
| | | | | | | | | |
Collapse
|