1
|
Patané JSL, Martins J, Setubal JC. A Guide to Phylogenomic Inference. Methods Mol Biol 2024; 2802:267-345. [PMID: 38819564 DOI: 10.1007/978-1-0716-3838-5_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Phylogenomics aims at reconstructing the evolutionary histories of organisms taking into account whole genomes or large fractions of genomes. Phylogenomics has significant applications in fields such as evolutionary biology, systematics, comparative genomics, and conservation genetics, providing valuable insights into the origins and relationships of species and contributing to our understanding of biological diversity and evolution. This chapter surveys phylogenetic concepts and methods aimed at both gene tree and species tree reconstruction while also addressing common pitfalls, providing references to relevant computer programs. A practical phylogenomic analysis example including bacterial genomes is presented at the end of the chapter.
Collapse
Affiliation(s)
- José S L Patané
- Laboratório de Genética e Cardiologia Molecular, Instituto do Coração/Heart Institute Hospital das Clínicas - Faculdade de Medicina da Universidade de São Paulo São Paulo, São Paulo, SP, Brazil
| | - Joaquim Martins
- Integrative Omics group, Biorenewables National Laboratory, Brazilian Center for Research in Energy and Materials, Campinas, SP, Brazil
| | - João Carlos Setubal
- Departmento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, SP, Brazil.
| |
Collapse
|
2
|
Liu J, Yang Y, Yan Z, Wang H, Bai M, Shi C, Li J. Analysis of the Mitogenomes of Two Helotid Species Provides New Insights into the Phylogenetic Relationship of the Basal Cucujoidea (Insecta: Coleoptera). BIOLOGY 2023; 12:biology12010135. [PMID: 36671827 PMCID: PMC9855730 DOI: 10.3390/biology12010135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 01/09/2023] [Accepted: 01/11/2023] [Indexed: 01/18/2023]
Abstract
Helotid beetles are commonly found in places where sap flows from tree trunks and in crevices in bark. The Helotidae family is a rare and primitive group of Cucujoidea. To date, no complete mitochondrial (mt) genome has been sequenced for this family. To better understand the characteristics of the mt genome and the evolution of Cucujoidea, we sequenced and annotated the complete mt genomes of Helota thoracica (Ritsema, 1895) and Helota yehi Lee, 2017 using next-generation sequencing. These are the first record of Helotidae mt genomes. The RNA secondary structures of both species were also predicted in this study. The mt genomes of H. thoracica and H. yehi are circular, with total lengths of 16,112 bp and 16,401 bp, respectively. After comparing the mt genomes of H. thoracica and H. yehi, we observed the gene arrangement, codon usage patterns, base content, and RNA secondary structures of both species to be similar, which has also been noted in other Coleoptera insects. The nucleotide sequence of the coding regions and the control region has small differences. The phylogenetic analysis indicated that Helotidae and Protocucujidae are sister groups and revealed the relationship between seven families; however, the validity of the two series (Erotylid series and Nitidulid series) as larger groups in the superfamily was not supported. The mt phylogenomic relationships have strong statistical support. Therefore, the division of Cucujoidea into series should be re-examined. Our results will provide a better understanding of the mt genome and phylogeny of Helotidae and Cucujoidea and will provide valuable molecular markers for further genetic studies.
Collapse
Affiliation(s)
- Jing Liu
- College of Plant Protection, Hebei Agricultural University, Baoding 071000, China
| | - Yuhang Yang
- College of Plant Protection, Hebei Agricultural University, Baoding 071000, China
| | - Zihan Yan
- College of Plant Protection, Hebei Agricultural University, Baoding 071000, China
| | - Haishan Wang
- College of Plant Protection, Hebei Agricultural University, Baoding 071000, China
| | - Ming Bai
- College of Plant Protection, Hebei Agricultural University, Baoding 071000, China
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Correspondence: (M.B.); (J.L.)
| | - Chengmin Shi
- College of Plant Protection, Hebei Agricultural University, Baoding 071000, China
| | - Jing Li
- College of Plant Protection, Hebei Agricultural University, Baoding 071000, China
- Correspondence: (M.B.); (J.L.)
| |
Collapse
|
3
|
Hibdige SGS, Raimondeau P, Christin PA, Dunning LT. Widespread lateral gene transfer among grasses. THE NEW PHYTOLOGIST 2021; 230:2474-2486. [PMID: 33887801 DOI: 10.1111/nph.17328] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Accepted: 02/28/2021] [Indexed: 06/12/2023]
Abstract
Lateral gene transfer (LGT) occurs in a broad range of prokaryotes and eukaryotes, occasionally promoting adaptation. LGT of functional nuclear genes has been reported among some plants, but systematic studies are needed to assess the frequency and facilitators of LGT. We scanned the genomes of a diverse set of 17 grass species that span more than 50 Ma of divergence and include major crops to identify grass-to-grass protein-coding LGT. We identified LGTs in 13 species, with significant variation in the amount each received. Rhizomatous species acquired statistically more genes, probably because this growth habit boosts opportunities for transfer into the germline. In addition, the amount of LGT increases with phylogenetic relatedness, which might reflect genomic compatibility among close relatives facilitating successful transfers. However, genetic exchanges among highly divergent species indicates that transfers can occur across almost the entire family. Overall, we showed that LGT is a widespread phenomenon in grasses that has moved functional genes across the grass family into domesticated and wild species alike. Successful LGTs appear to increase with both opportunity and compatibility.
Collapse
Affiliation(s)
- Samuel G S Hibdige
- Animal and Plant Sciences, University of Sheffield, Western Bank, Sheffield, S10 2TN, UK
| | - Pauline Raimondeau
- Animal and Plant Sciences, University of Sheffield, Western Bank, Sheffield, S10 2TN, UK
| | | | - Luke T Dunning
- Animal and Plant Sciences, University of Sheffield, Western Bank, Sheffield, S10 2TN, UK
| |
Collapse
|
4
|
Abramson NI, Golenishchev FN, Bodrov SY, Bondareva OV, Genelt-Yanovskiy EA, Petrova TV. Phylogenetic relationships and taxonomic position of genus Hyperacrius (Rodentia: Arvicolinae) from Kashmir based on evidences from analysis of mitochondrial genome and study of skull morphology. PeerJ 2020; 8:e10364. [PMID: 33240667 PMCID: PMC7680025 DOI: 10.7717/peerj.10364] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 10/24/2020] [Indexed: 11/24/2022] Open
Abstract
In this article, we present the nearly complete mitochondrial genome of the Subalpine Kashmir vole Hyperacrius fertilis (Arvicolinae, Cricetidae, Rodentia), assembled using data from Illumina next-generation sequencing (NGS) of the DNA from a century-old museum specimen. De novo assembly consisted of 16,341 bp and included all mitogenome protein-coding genes as well as 12S and 16S RNAs, tRNAs and D-loop. Using the alignment of protein-coding genes of 14 previously published Arvicolini tribe mitogenomes, seven Clethrionomyini mitogenomes, and also Ondatra and Dicrostonyx outgroups, we conducted phylogenetic reconstructions based on a dataset of 13 protein-coding genes (PCGs) under maximum likelihood and Bayesian inference. Phylogenetic analyses robustly supported the phylogenetic position of this species within the tribe Arvicolini. Among the Arvicolini, Hyperacrius represents one of the early-diverged lineages. This result of phylogenetic analysis altered the conventional view on phylogenetic relatedness between Hyperacrius and Alticola and prompted the revision of morphological characters underlying the former assumption. Morphological analysis performed here confirmed molecular data and provided additional evidence for taxonomic replacement of the genus Hyperacrius from the tribe Clethrionomyini to the tribe Arvicolini.
Collapse
Affiliation(s)
- Natalia I. Abramson
- Department of Molecular Systematics, Zoological Institute Russian Academy of Sciences, Saint-Petersburg, Russian Federation
| | - Fedor N. Golenishchev
- Department of Mammals, Zoological Institute Russian Academy of Sciences, Saint-Petersburg, Russian Federation
| | - Semen Yu. Bodrov
- Department of Molecular Systematics, Zoological Institute Russian Academy of Sciences, Saint-Petersburg, Russian Federation
| | - Olga V. Bondareva
- Department of Molecular Systematics, Zoological Institute Russian Academy of Sciences, Saint-Petersburg, Russian Federation
| | - Evgeny A. Genelt-Yanovskiy
- Department of Molecular Systematics, Zoological Institute Russian Academy of Sciences, Saint-Petersburg, Russian Federation
| | - Tatyana V. Petrova
- Department of Molecular Systematics, Zoological Institute Russian Academy of Sciences, Saint-Petersburg, Russian Federation
| |
Collapse
|
5
|
Galen SC, Borner J, Martinsen ES, Schaer J, Austin CC, West CJ, Perkins SL. The polyphyly of Plasmodium: comprehensive phylogenetic analyses of the malaria parasites (order Haemosporida) reveal widespread taxonomic conflict. ROYAL SOCIETY OPEN SCIENCE 2018; 5:171780. [PMID: 29892372 PMCID: PMC5990803 DOI: 10.1098/rsos.171780] [Citation(s) in RCA: 98] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Accepted: 04/20/2018] [Indexed: 05/29/2023]
Abstract
The evolutionary relationships among the apicomplexan blood pathogens known as the malaria parasites (order Haemosporida), some of which infect nearly 200 million humans each year, has remained a vexing phylogenetic problem due to limitations in taxon sampling, character sampling and the extreme nucleotide base composition biases that are characteristic of this clade. Previous phylogenetic work on the malaria parasites has often lacked sufficient representation of the broad taxonomic diversity within the Haemosporida or the multi-locus sequence data needed to resolve deep evolutionary relationships, rendering our understanding of haemosporidian life-history evolution and the origin of the human malaria parasites incomplete. Here we present the most comprehensive phylogenetic analysis of the malaria parasites conducted to date, using samples from a broad diversity of vertebrate hosts that includes numerous enigmatic and poorly known haemosporidian lineages in addition to genome-wide multi-locus sequence data. We find that if base composition differences were corrected for during phylogenetic analysis, we recovered a well-supported topology indicating that the evolutionary history of the malaria parasites was characterized by a complex series of transitions in life-history strategies and host usage. Notably we find that Plasmodium, the malaria parasite genus that includes the species of human medical concern, is polyphyletic with the life-history traits characteristic of this genus having evolved in a dynamic manner across the phylogeny. We find support for multiple instances of gain and loss of asexual proliferation in host blood cells and production of haemozoin pigment, two traits that have been used for taxonomic classification as well as considered to be important factors for parasite virulence and used as drug targets. Lastly, our analysis illustrates the need for a widespread reassessment of malaria parasite taxonomy.
Collapse
Affiliation(s)
- Spencer C. Galen
- Sackler Institute for Comparative Genomics, American Museum of Natural History, Central Park West at 79th St., New York, NY 10024, USA
- Richard Gilder Graduate School, American Museum of Natural History, Central Park West at 79th St., New York, NY 10024, USA
| | - Janus Borner
- Institute of Zoology, Biocenter Grindel, University of Hamburg, Martin-Luther-King-Platz 3, D-20146 Hamburg, Germany
| | - Ellen S. Martinsen
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute, National Zoological Park, PO Box 37012, MRC5503, Washington, DC 20013-7012, USA
| | - Juliane Schaer
- Department of Biology, Humboldt University, 10115, Berlin, Germany
| | - Christopher C. Austin
- Department of Biological Sciences, Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA
| | | | - Susan L. Perkins
- Sackler Institute for Comparative Genomics, American Museum of Natural History, Central Park West at 79th St., New York, NY 10024, USA
| |
Collapse
|
6
|
Abstract
Phylogenomics aims at reconstructing the evolutionary histories of organisms taking into account whole genomes or large fractions of genomes. The abundance of genomic data for an enormous variety of organisms has enabled phylogenomic inference of many groups, and this has motivated the development of many computer programs implementing the associated methods. This chapter surveys phylogenetic concepts and methods aimed at both gene tree and species tree reconstruction while also addressing common pitfalls, providing references to relevant computer programs. A practical phylogenomic analysis example including bacterial genomes is presented at the end of the chapter.
Collapse
Affiliation(s)
- José S L Patané
- Department of Biochemistry, Institute of Chemistry, University of São Paulo, Av. Prof. Lineu Prestes 748, São Paulo, SP, 05508-000, Brazil
| | - Joaquim Martins
- Department of Biochemistry, Institute of Chemistry, University of São Paulo, Av. Prof. Lineu Prestes 748, São Paulo, SP, 05508-000, Brazil
| | - João C Setubal
- Department of Biochemistry, Institute of Chemistry, University of São Paulo, Av. Prof. Lineu Prestes 748, São Paulo, SP, 05508-000, Brazil.
| |
Collapse
|
7
|
Bickelmann C, Morrow JM, Du J, Schott RK, van Hazel I, Lim S, Müller J, Chang BSW. The molecular origin and evolution of dim-light vision in mammals. Evolution 2015; 69:2995-3003. [PMID: 26536060 DOI: 10.1111/evo.12794] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 09/23/2015] [Accepted: 09/27/2015] [Indexed: 01/19/2023]
Abstract
The nocturnal origin of mammals is a longstanding hypothesis that is considered instrumental for the evolution of endothermy, a potential key innovation in this successful clade. This hypothesis is primarily based on indirect anatomical inference from fossils. Here, we reconstruct the evolutionary history of rhodopsin--the vertebrate visual pigment mediating the first step in phototransduction at low-light levels--via codon-based model tests for selection, combined with gene resurrection methods that allow for the study of ancient proteins. Rhodopsin coding sequences were reconstructed for three key nodes: Amniota, Mammalia, and Theria. When expressed in vitro, all sequences generated stable visual pigments with λMAX values similar to the well-studied bovine rhodopsin. Retinal release rates of mammalian and therian ancestral rhodopsins, measured via fluorescence spectroscopy, were significantly slower than those of the amniote ancestor, indicating altered molecular function possibly related to nocturnality. Positive selection along the therian branch suggests adaptive evolution in rhodopsin concurrent with therian ecological diversification events during the Mesozoic that allowed for an exploration of the environment at varying light levels.
Collapse
Affiliation(s)
- Constanze Bickelmann
- Museum für Naturkunde, Leibniz-Institut für Evolutions- und Biodiversitätsforschung, 10115, Berlin, Germany.
| | - James M Morrow
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON, M5S 3G5, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, M5S 3B2, Canada
| | - Jing Du
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, M5S 3B2, Canada
| | - Ryan K Schott
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, M5S 3B2, Canada
| | - Ilke van Hazel
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, M5S 3B2, Canada
| | - Steve Lim
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON, M5S 3G5, Canada
| | - Johannes Müller
- Museum für Naturkunde, Leibniz-Institut für Evolutions- und Biodiversitätsforschung, 10115, Berlin, Germany
| | - Belinda S W Chang
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON, M5S 3G5, Canada. .,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, M5S 3B2, Canada. .,Centre for the Analysis of Genome Evolution and Function, Toronto, ON, M5S 3B2, Canada.
| |
Collapse
|
8
|
Arisue N, Hashimoto T. Phylogeny and evolution of apicoplasts and apicomplexan parasites. Parasitol Int 2015; 64:254-9. [DOI: 10.1016/j.parint.2014.10.005] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2014] [Revised: 10/02/2014] [Accepted: 10/08/2014] [Indexed: 12/31/2022]
|
9
|
Du J, Dungan SZ, Sabouhanian A, Chang BSW. Selection on synonymous codons in mammalian rhodopsins: a possible role in optimizing translational processes. BMC Evol Biol 2014; 14:96. [PMID: 24884412 PMCID: PMC4021273 DOI: 10.1186/1471-2148-14-96] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 04/11/2014] [Indexed: 01/21/2023] Open
Abstract
Background Synonymous codon usage can affect many cellular processes, particularly those associated with translation such as polypeptide elongation and folding, mRNA degradation/stability, and splicing. Highly expressed genes are thought to experience stronger selection pressures on synonymous codons. This should result in codon usage bias even in species with relatively low effective population sizes, like mammals, where synonymous site selection is thought to be weak. Here we use phylogenetic codon-based likelihood models to explore patterns of codon usage bias in a dataset of 18 mammalian rhodopsin sequences, the protein mediating the first step in vision in the eye, and one of the most highly expressed genes in vertebrates. We use these patterns to infer selection pressures on key translational mechanisms including polypeptide elongation, protein folding, mRNA stability, and splicing. Results Overall, patterns of selection in mammalian rhodopsin appear to be correlated with post-transcriptional and translational processes. We found significant evidence for selection at synonymous sites using phylogenetic mutation-selection likelihood models, with C-ending codons found to have the highest relative fitness, and to be significantly more abundant at conserved sites. In general, these codons corresponded with the most abundant tRNAs in mammals. We found significant differences in codon usage bias between rhodopsin loops versus helices, though there was no significant difference in mean synonymous substitution rate between these motifs. We also found a significantly higher proportion of GC-ending codons at paired sites in rhodopsin mRNA secondary structure, and significantly lower synonymous mutation rates in putative exonic splicing enhancer (ESE) regions than in non-ESE regions. Conclusions By focusing on a single highly expressed gene we both distinguish synonymous codon selection from mutational effects and analytically explore underlying functional mechanisms. Our results suggest that codon bias in mammalian rhodopsin arises from selection to optimally balance high overall translational speed, accuracy, and proper protein folding, especially in structurally complicated regions. Selection at synonymous sites may also be contributing to mRNA stability and splicing efficiency at exonic-splicing-enhancer (ESE) regions. Our results highlight the importance of investigating highly expressed genes in a broader phylogenetic context in order to better understand the evolution of synonymous substitutions.
Collapse
Affiliation(s)
| | | | | | - Belinda S W Chang
- Department of Ecology & Evolutionary Biology, University of Toronto, 25 Harbord Street, Toronto, ON M5S 3G5, Canada.
| |
Collapse
|
10
|
Kenaley CP, DeVaney SC, Fjeran TT. THE COMPLEX EVOLUTIONARY HISTORY OF SEEING RED: MOLECULAR PHYLOGENY AND THE EVOLUTION OF AN ADAPTIVE VISUAL SYSTEM IN DEEP-SEA DRAGONFISHES (STOMIIFORMES: STOMIIDAE). Evolution 2014; 68:996-1013. [DOI: 10.1111/evo.12322] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2012] [Accepted: 11/12/2013] [Indexed: 11/30/2022]
Affiliation(s)
- Christopher P. Kenaley
- Department of Organismic and Evolutionary Biology; Harvard University; Cambridge Massachusetts 02138
| | - Shannon C. DeVaney
- Life Science Department; Los Angeles Pierce College; Woodland Hills California 91371
| | - Taylor T. Fjeran
- College of Forestry; Oregon State University; Corvallis Oregon 97331
| |
Collapse
|
11
|
Inference of functional divergence among proteins when the evolutionary process is non-stationary. J Mol Evol 2013; 76:205-15. [PMID: 23443835 DOI: 10.1007/s00239-013-9549-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2012] [Accepted: 02/09/2013] [Indexed: 10/27/2022]
Abstract
Functional shifts during protein evolution are expected to yield shifts in substitution rate, and statistical methods can test for this at both codon and amino acid levels. Although methods based on models of sequence evolution serve as powerful tools for studying evolutionary processes, violating underlying assumptions can lead to false biological conclusions. It is not unusual for functional shifts to be accompanied by changes in other aspects of the evolutionary process, such as codon or amino acid frequencies. However, models used to test for functional divergence assume these frequencies remain constant over time. We employed simulation to investigate the impact of non-stationary evolution on functional divergence inference. We investigated three likelihood ratio tests based on codon models and found varying degrees of sensitivity. Joint effects of shifts in frequencies and selection pressures can be large, leading to false signals for positive selection. Amino acid-based tests (FunDi and Bivar) were also compromised when several aspects of the substitution process were not adequately modeled. We applied the same tests to a core genome "scan" for functional divergence between light-adapted ecotypes of the cyanobacteria Prochlorococcus, and carried out gene-specific simulations for ten genes. Results of those simulations illustrated how the inference of functional divergence at the genomic level can be seriously impacted by model misspecification. Although computationally costly, simulations motivated by data in hand are warranted when several aspects of the substitution process are either misspecified or not included in the models upon which the statistical tests were built.
Collapse
|
12
|
Ishikawa SA, Inagaki Y, Hashimoto T. RY-Coding and Non-Homogeneous Models Can Ameliorate the Maximum-Likelihood Inferences From Nucleotide Sequence Data with Parallel Compositional Heterogeneity. Evol Bioinform Online 2012; 8:357-71. [PMID: 22798721 PMCID: PMC3394461 DOI: 10.4137/ebo.s9017] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
In phylogenetic analyses of nucleotide sequences, 'homogeneous' substitution models, which assume the stationarity of base composition across a tree, are widely used, albeit individual sequences may bear distinctive base frequencies. In the worst-case scenario, a homogeneous model-based analysis can yield an artifactual union of two distantly related sequences that achieved similar base frequencies in parallel. Such potential difficulty can be countered by two approaches, 'RY-coding' and 'non-homogeneous' models. The former approach converts four bases into purine and pyrimidine to normalize base frequencies across a tree, while the heterogeneity in base frequency is explicitly incorporated in the latter approach. The two approaches have been applied to real-world sequence data; however, their basic properties have not been fully examined by pioneering simulation studies. Here, we assessed the performances of the maximum-likelihood analyses incorporating RY-coding and a non-homogeneous model (RY-coding and non-homogeneous analyses) on simulated data with parallel convergence to similar base composition. Both RY-coding and non-homogeneous analyses showed superior performances compared with homogeneous model-based analyses. Curiously, the performance of RY-coding analysis appeared to be significantly affected by a setting of the substitution process for sequence simulation relative to that of non-homogeneous analysis. The performance of a non-homogeneous analysis was also validated by analyzing a real-world sequence data set with significant base heterogeneity.
Collapse
Affiliation(s)
- Sohta A Ishikawa
- Graduate School of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki 305-8572, Japan
| | | | | |
Collapse
|
13
|
Sumner JG, Jarvis PD, Fernández-Sánchez J, Kaine BT, Woodhams MD, Holland BR. Is the general time-reversible model bad for molecular phylogenetics? Syst Biol 2012; 61:1069-74. [PMID: 22442193 DOI: 10.1093/sysbio/sys042] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Jeremy G Sumner
- School of Mathematics and Physics, University of Tasmania, Hobart 7001,
| | | | | | | | | | | |
Collapse
|
14
|
O'Connell MJ, Loughran NB, Walsh TA, Donoghue MTA, Schmid KJ, Spillane C. A phylogenetic approach to test for evidence of parental conflict or gene duplications associated with protein-encoding imprinted orthologous genes in placental mammals. Mamm Genome 2010; 21:486-98. [PMID: 20931201 DOI: 10.1007/s00335-010-9283-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2010] [Accepted: 09/01/2010] [Indexed: 12/21/2022]
Abstract
There are multiple theories on the evolution of genomic imprinting. We investigated whether the molecular evolution of true orthologs of known imprinted genes provides support for theories based on gene duplication or parental conflicts (where mediated by amino-acid changes). Our analysis of 34 orthologous genes demonstrates that the vast majority of mammalian imprinted genes have not undergone any subsequent significant gene duplication within placental species, suggesting that selection pressures against gene duplication events could be operating for imprinted loci. As antagonistic co-evolution between imprinted genes can regulate offspring growth, proteins mediating this interaction could be subject to rapid evolution via positive selection. Supporting this, we detect evidence of site specific positive selection for the imprinted genes OSBPL5 (and GNASXL), and detect lineage-specific positive selection for 14 imprinted genes where it is known that the gene is imprinted in a specific lineage, namely for: PLAGL1, IGF2, SLC22A18, OSBPL5, DCN, DLK1, RASGRF1, IGF2R, IMPACT, GRB10, NAPIL4, UBE3A, GATM and GABRG3. However, there is an overall lack of concordance between the known imprinting status of each gene (i.e. whether the gene is imprinted or biallelically expressed in a particular mammalian lineage) and positive selection. While only a small number of orthologs of imprinted loci display evidence of positive selection, we observe that the majority of orthologs of imprinted loci display high levels of micro-synteny conservation and have undergone very few cis- or trans-duplications in placental mammalian lineages.
Collapse
Affiliation(s)
- Mary J O'Connell
- Genetics and Biotechnology Lab, Department of Biochemistry and Biosciences Institute, University College Cork (UCC), Lee Maltings 2.10, Cork, Ireland.
| | | | | | | | | | | |
Collapse
|
15
|
Wernegreen JJ, Kauppinen SN, Brady SG, Ward PS. One nutritional symbiosis begat another: phylogenetic evidence that the ant tribe Camponotini acquired Blochmannia by tending sap-feeding insects. BMC Evol Biol 2009; 9:292. [PMID: 20015388 PMCID: PMC2810300 DOI: 10.1186/1471-2148-9-292] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2009] [Accepted: 12/16/2009] [Indexed: 11/28/2022] Open
Abstract
Background Bacterial endosymbiosis has a recurring significance in the evolution of insects. An estimated 10-20% of insect species depend on bacterial associates for their nutrition and reproductive viability. Members of the ant tribe Camponotini, the focus of this study, possess a stable, intracellular bacterial mutualist. The bacterium, Blochmannia, was first discovered in Camponotus and has since been documented in a distinct subgenus of Camponotus, Colobopsis, and in the related genus Polyrhachis. However, the distribution of Blochmannia throughout the Camponotini remains in question. Documenting the true host range of this bacterial mutualist is an important first step toward understanding the various ecological contexts in which it has evolved, and toward identifying its closest bacterial relatives. In this study, we performed a molecular screen, based on PCR amplification of 16S rDNA, to identify bacterial associates of diverse Camponotini species. Results Phylogenetic analyses of 16S rDNA gave four important insights: (i) Blochmannia occurs in a broad range of Camponotini genera including Calomyrmex, Echinopla, and Opisthopsis, and did not occur in outgroups related to this tribe (e.g., Notostigma). This suggests that the mutualism originated in the ancestor of the tribe Camponotini. (ii) The known bacteriocyte-associated symbionts of ants, in Formica, Plagiolepis, and the Camponotini, arose independently. (iii) Blochmannia is nestled within a diverse clade of endosymbionts of sap-feeding hemipteran insects, such as mealybugs, aphids, and psyllids. In our analyses, a group of secondary symbionts of mealybugs are the closest relatives of Blochmannia. (iv) Blochmannia has cospeciated with its known hosts, although deep divergences at the genus level remain uncertain. Conclusions The Blochmannia mutualism occurs in Calomyrmex, Echinopla, and Opisthopsis, in addition to Camponotus, and probably originated in the ancestral lineage leading to the Camponotini. This significant expansion of its known host range implies that the mutualism is more ancient and ecologically diverse than previously documented. Blochmannia is most closely related to endosymbionts of sap-feeding hemipterans, which ants tend for their carbohydrate-rich honeydew. Based on phylogenetic results, we propose Camponotini might have originally acquired this bacterial mutualist through a nutritional symbiosis with other insects.
Collapse
Affiliation(s)
- Jennifer J Wernegreen
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543, USA.
| | | | | | | |
Collapse
|
16
|
Simon DM, Kelchner SA, Zimmerly S. A broadscale phylogenetic analysis of group II intron RNAs and intron-encoded reverse transcriptases. Mol Biol Evol 2009; 26:2795-808. [PMID: 19713327 DOI: 10.1093/molbev/msp193] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Group II introns are self-splicing RNAs that are frequently assumed to be the ancestors of spliceosomal introns. They are widely distributed in bacteria and are also found in organelles of plants, fungi, and protists. In this study, we present a broadscale phylogenetic analysis of group II introns using sequence data from both the conserved RNA structure and the intron-encoded reverse transcriptase (RT). Two similar phylogenies are estimated for the RT open reading frame (ORF), based on either amino acid or nucleotide sequence, whereas one phylogeny is produced for the RNA. In making these estimates, we confronted nearly all the classic challenges to phylogenetic inference, including positional saturation, base composition heterogeneity, short internodes with low support, and sensitivity to taxon sampling. Although the major lineages are well-defined, robust resolution of topology is not possible between these lineages. The approximately unbiased (AU) and Shimodaira-Hasegawa topology tests indicated that the RT ORF and RNA ribozyme data sets are in significant conflict under a variety of models, revealing the possibility of imperfect coevolution between group II introns and their intron-encoded ORFs. The high level of sequence divergence, large timescale, and limited number of alignable characters in our study are representative of many RTs and group I introns, and our results suggest that phylogenetic analyses of any of these sequences could suffer from the same sources of error and instability identified in this study.
Collapse
Affiliation(s)
- Dawn M Simon
- Department of Biological Sciences, University of Calgary, Calgary, Alberta, Canada
| | | | | |
Collapse
|
17
|
Jermiin LS, Ho JWK, Lau KW, Jayaswal V. SeqVis: a tool for detecting compositional heterogeneity among aligned nucleotide sequences. Methods Mol Biol 2009; 537:65-91. [PMID: 19378140 DOI: 10.1007/978-1-59745-251-9_4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]
Abstract
Compositional heterogeneity is a poorly appreciated attribute of aligned nucleotide and amino acid sequences. It is a common property of molecular phylogenetic data, and it has been found to occur across sequences and/or across sites. Most molecular phylogenetic methods assume that the sequences have evolved under globally stationary, reversible, and homogeneous conditions, implying that the sequences should be compositionally homogeneous. The presence of the above-mentioned compositional heterogeneity implies that the sequences must have evolved under more general conditions than is commonly assumed. Consequently, there is a need for reliable methods to detect under what conditions alignments of nucleotides or amino acids may have evolved. In this chapter, we describe one such program. SeqVis is designed to survey aligned nucleotide sequences. We discuss pros-et-cons of this program in the context of other methods to detect compositional heterogeneity and violated phylogenetic assumptions. The benefits provided by SeqVis are demonstrated in two studies of alignments of nucleotides, one of which contained 7542 nucleotides from 53 species.
Collapse
Affiliation(s)
- Lars Sommer Jermiin
- School of Biological Sciences, Centre for Mathematical Biology and Sydney Bioinformatics, University of Sydney, Sydney, Australia
| | | | | | | |
Collapse
|
18
|
Sheffield NC, Song H, Cameron SL, Whiting MF. Nonstationary Evolution and Compositional Heterogeneity in Beetle Mitochondrial Phylogenomics. Syst Biol 2009; 58:381-94. [DOI: 10.1093/sysbio/syp037] [Citation(s) in RCA: 139] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Nathan C. Sheffield
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
- Program in Computational Biology & Bioinformatics, Institute for Genome Sciences and Policy, Duke University, Box 90090, Durham, NC 27708, USA
| | - Hojun Song
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Stephen L. Cameron
- Australian National Insect Collection, Commonwealth Scientific and Industrial Research Organisation, Entomology, PO Box 1700, Canberra, Australian Capital Territory, 2601, Australia
| | | |
Collapse
|
19
|
Pohl N, Sison-Mangus MP, Yee EN, Liswi SW, Briscoe AD. Impact of duplicate gene copies on phylogenetic analysis and divergence time estimates in butterflies. BMC Evol Biol 2009; 9:99. [PMID: 19439087 PMCID: PMC2689175 DOI: 10.1186/1471-2148-9-99] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2008] [Accepted: 05/13/2009] [Indexed: 12/05/2022] Open
Abstract
Background The increase in availability of genomic sequences for a wide range of organisms has revealed gene duplication to be a relatively common event. Encounters with duplicate gene copies have consequently become almost inevitable in the context of collecting gene sequences for inferring species trees. Here we examine the effect of incorporating duplicate gene copies evolving at different rates on tree reconstruction and time estimation of recent and deep divergences in butterflies. Results Sequences from ultraviolet-sensitive (UVRh), blue-sensitive (BRh), and long-wavelength sensitive (LWRh) opsins,EF-1α and COI were obtained from 27 taxa representing the five major butterfly families (5535 bp total). Both BRh and LWRh are present in multiple copies in some butterfly lineages and the different copies evolve at different rates. Regardless of the phylogenetic reconstruction method used, we found that analyses of combined data sets using either slower or faster evolving copies of duplicate genes resulted in a single topology in agreement with our current understanding of butterfly family relationships based on morphology and molecules. Interestingly, individual analyses of BRh and LWRh sequences also recovered these family-level relationships. Two different relaxed clock methods resulted in similar divergence time estimates at the shallower nodes in the tree, regardless of whether faster or slower evolving copies were used, with larger discrepancies observed at deeper nodes in the phylogeny. The time of divergence between the monarch butterfly Danaus plexippus and the queen D. gilippus (15.3–35.6 Mya) was found to be much older than the time of divergence between monarch co-mimic Limenitis archippus and red-spotted purple L. arthemis (4.7–13.6 Mya), and overlapping with the time of divergence of the co-mimetic passionflower butterflies Heliconius erato and H. melpomene (13.5–26.1 Mya). Our family-level results are congruent with recent estimates found in the literature and indicate an age of 84–113 million years for the divergence of all butterfly families. Conclusion These results are consistent with diversification of the butterfly families following the radiation of angiosperms and suggest that some classes of opsin genes may be usefully employed for both phylogenetic reconstruction and divergence time estimation.
Collapse
Affiliation(s)
- Nélida Pohl
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA.
| | | | | | | | | |
Collapse
|
20
|
Dettaï A, di Prisco G, Lecointre G, Parisi E, Verde C. Inferring evolution of fish proteins: the globin case study. Methods Enzymol 2008; 436:539-70. [PMID: 18237653 DOI: 10.1016/s0076-6879(08)36030-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Because hemoglobins (Hbs) of all animal species have the same heme group, differences in their properties, including oxygen affinity, electrophoretic mobility, and pH sensitivity, must result from the interaction of the prosthetic group with specific amino acid residues in the primary structure. For this reason, fish globins have been the object of extensive studies in the past few years, not only for their structural characteristics but also because they offer the possibility to investigate the evolutionary history of Hbs in marine and freshwater species living in a large variety of environmental conditions. For such a purpose, phylogenetic analysis of globin sequences can be combined with knowledge of the phylogenetic relationships between species. In addition, Type I functional-divergence analysis is aimed toward predicting the amino acid residues that are more likely responsible for biochemical diversification of different Hb families. These residues, mapped on the three-dimensional Hb structure, can provide insights into functional and structural divergence.
Collapse
Affiliation(s)
- Agnes Dettaï
- UMR, Département Systématique et Evolution, Muséum National d'Histoire Naturelle, Paris, France
| | | | | | | | | |
Collapse
|
21
|
Wägele JW, Mayer C. Visualizing differences in phylogenetic information content of alignments and distinction of three classes of long-branch effects. BMC Evol Biol 2007; 7:147. [PMID: 17725833 PMCID: PMC2040160 DOI: 10.1186/1471-2148-7-147] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2007] [Accepted: 08/28/2007] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Published molecular phylogenies are usually based on data whose quality has not been explored prior to tree inference. This leads to errors because trees obtained with conventional methods suppress conflicting evidence, and because support values may be high even if there is no distinct phylogenetic signal. Tools that allow an a priori examination of data quality are rarely applied. RESULTS Using data from published molecular analyses on the phylogeny of crustaceans it is shown that tree topologies and popular support values do not show existing differences in data quality. To visualize variations in signal distinctness, we use network analyses based on split decomposition and split support spectra. Both methods show the same differences in data quality and the same clade-supporting patterns. Both methods are useful to discover long-branch effects. We discern three classes of long branch effects. Class I effects consist of attraction of terminal taxa caused by symplesiomorphies, which results in a false monophyly of paraphyletic groups. Addition of carefully selected taxa can fix this effect. Class II effects are caused by drastic signal erosion. Long branches affected by this phenomenon usually slip down the tree to form false clades that in reality are polyphyletic. To recover the correct phylogeny, more conservative genes must be used. Class III effects consist of attraction due to accumulated chance similarities or convergent character states. This sort of noise can be reduced by selecting less variable portions of the data set, avoiding biases, and adding slower genes. CONCLUSION To increase confidence in molecular phylogenies an exploratory analysis of the signal to noise ratio can be conducted with split decomposition methods. If long-branch effects are detected, it is necessary to discern between three classes of effects to find the best approach for an improvement of the raw data.
Collapse
Affiliation(s)
| | - Christoph Mayer
- Lehrstuhl Spezielle Zoologie, Faculty of Biology, University Bochum, 44780 Bochum, Germany
| |
Collapse
|
22
|
Jayaswal V, Robinson J, Jermiin L. Estimation of phylogeny and invariant sites under the general Markov model of nucleotide sequence evolution. Syst Biol 2007; 56:155-62. [PMID: 17454972 DOI: 10.1080/10635150701247921] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
The models of nucleotide substitution used by most maximum likelihood-based methods assume that the evolutionary process is stationary, reversible, and homogeneous. We present an extension of the Barry and Hartigan model, which can be used to estimate parameters by maximum likelihood (ML) when the data contain invariant sites and there are violations of the assumptions of stationarity, reversibility, and homogeneity. Unlike most ML methods for estimating invariant sites, we estimate the nucleotide composition of invariant sites separately from that of variable sites. We analyze a bacterial data set where problems due to lack of stationarity and homogeneity have been previously well noted and use the parametric bootstrap to show that the data are consistent with our general Markov model. We also show that estimates of invariant sites obtained using our method are fairly accurate when applied to data simulated under the general Markov model.
Collapse
Affiliation(s)
- Vivek Jayaswal
- Sydney Bioinformatics, University of Sydney, NSW 2006, Australia
| | | | | |
Collapse
|
23
|
Winterton SL, Wiegmann BM, Schlinger EI. Phylogeny and Bayesian divergence time estimations of small-headed flies (Diptera: Acroceridae) using multiple molecular markers. Mol Phylogenet Evol 2007; 43:808-32. [PMID: 17196837 DOI: 10.1016/j.ympev.2006.08.015] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2006] [Revised: 07/19/2006] [Accepted: 08/13/2006] [Indexed: 11/22/2022]
Abstract
The first formal analysis of phylogenetic relationships among small-headed flies (Acroceridae) is presented based on DNA sequence data from two ribosomal (16S and 28S) and two protein-encoding genes: carbomoylphosphate synthase (CPS) domain of CAD (i.e., rudimentary locus) and cytochrome oxidase I (COI). DNA sequences from 40 species in 22 genera of Acroceridae (representing all three subfamilies) were compared with outgroup exemplars from Nemestrinidae, Stratiomyidae, Tabanidae, and Xylophagidae. Parsimony and Bayesian simultaneous analyses of the full data set recover a well-resolved and strongly supported hypothesis of phylogenetic relationships for major lineages within the family. Molecular evidence supports the monophyly of traditionally recognised subfamilies Philopotinae and Panopinae, but Acrocerinae are polyphyletic. Panopinae, sometimes considered "primitive" based on morphology and host-use, are always placed in a more derived position in the current study. Furthermore, these data support emerging morphological evidence that the type genus Acrocera Meigen, and its sister genus Sphaerops, are atypical acrocerids, comprising a sister lineage to all other Acroceridae. Based on the phylogeny generated in the simultaneous analysis, historical divergence times were estimated using Bayesian methodology constrained with fossil data. These estimates indicate Acroceridae likely evolved during the late Triassic but did not diversify greatly until the Cretaceous.
Collapse
MESH Headings
- Animals
- Carbamoyl-Phosphate Synthase (Ammonia)/genetics
- Diptera/classification
- Diptera/genetics
- Electron Transport Complex IV/genetics
- Evolution, Molecular
- Molecular Sequence Data
- Nucleic Acid Conformation
- Phylogeny
- RNA, Ribosomal, 16S/chemistry
- RNA, Ribosomal, 16S/genetics
- RNA, Ribosomal, 28S/chemistry
- RNA, Ribosomal, 28S/genetics
- Sequence Analysis, DNA
Collapse
Affiliation(s)
- Shaun L Winterton
- Department of Entomology, North Carolina State University, Raleigh, NC, USA.
| | | | | |
Collapse
|
24
|
Gruber KF, Voss RS, Jansa SA. Base-compositional heterogeneity in the RAG1 locus among didelphid marsupials: implications for phylogenetic inference and the evolution of GC content. Syst Biol 2007; 56:83-96. [PMID: 17366139 DOI: 10.1080/10635150601182939] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
Although theoretical studies have suggested that base-compositional heterogeneity can adversely affect phylogenetic reconstruction, only a few empirical examples of this phenomenon, mostly among ancient lineages (with divergence dates > 100 Mya), have been reported. In the course of our phylogenetic research on the New World marsupial family Didelphidae, we sequenced 2790 bp of the RAG1 exon from exemplar species of most extant genera. Phylogenetic analysis of these sequences recovered an anomalous node consisting of two clades previously shown to be distantly related based on analyses of other molecular data. These two clades show significantly increased GC content at RAG1 third codon positions, and the resulting convergence in base composition is strong enough to overwhelm phylogenetic signal from other genes (and morphology) in most analyses of concatenated datasets. This base-compositional convergence occurred relatively recently (over tens rather than hundreds of millions of years), and the affected gene region is still in a state of evolutionary disequilibrium. Both mutation rate and substitution rate are higher in GC-rich didelphid taxa, observations consistent with RAG1 sequences having experienced a higher rate of recombination in the convergent lineages.
Collapse
Affiliation(s)
- Karl F Gruber
- Bell Museum of Natural History and Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, Minnesota 55108, USA
| | | | | |
Collapse
|
25
|
Short-wavelength sensitive opsin (SWS1) as a new marker for vertebrate phylogenetics. BMC Evol Biol 2006; 6:97. [PMID: 17107620 PMCID: PMC1664589 DOI: 10.1186/1471-2148-6-97] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2006] [Accepted: 11/15/2006] [Indexed: 11/23/2022] Open
Abstract
Background Vertebrate SWS1 visual pigments mediate visual transduction in response to light at short wavelengths. Due to their importance in vision, SWS1 genes have been isolated from a surprisingly wide range of vertebrates, including lampreys, teleosts, amphibians, reptiles, birds, and mammals. The SWS1 genes exhibit many of the characteristics of genes typically targeted for phylogenetic analyses. This study investigates both the utility of SWS1 as a marker for inferring vertebrate phylogenetic relationships, and the characteristics of the gene that contribute to its phylogenetic utility. Results Phylogenetic analyses of vertebrate SWS1 genes produced topologies that were remarkably congruent with generally accepted hypotheses of vertebrate evolution at both higher and lower taxonomic levels. The few exceptions were generally associated with areas of poor taxonomic sampling, or relationships that have been difficult to resolve using other molecular markers. The SWS1 data set was characterized by a substantial amount of among-site rate variation, and a relatively unskewed substitution rate matrix, even when the data were partitioned into different codon sites and individual taxonomic groups. Although there were nucleotide biases in some groups at third positions, these biases were not convergent across different taxonomic groups. Conclusion Our results suggest that SWS1 may be a good marker for vertebrate phylogenetics due to the variable yet consistent patterns of sequence evolution exhibited across fairly wide taxonomic groups. This may result from constraints imposed by the functional role of SWS1 pigments in visual transduction.
Collapse
|
26
|
Feau N, Hamelin RC, Bernier L. Attributes and congruence of three molecular data sets: Inferring phylogenies among Septoria-related species from woody perennial plants. Mol Phylogenet Evol 2006; 40:808-29. [PMID: 16707264 DOI: 10.1016/j.ympev.2006.03.029] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2005] [Revised: 03/23/2006] [Accepted: 03/24/2006] [Indexed: 11/17/2022]
Abstract
To improve our understanding of phylogenetic relationships within the anamorphic genus Septoria, three molecular data sets representing 2,417 bp of nuclear and mitochondrial genes were evaluated. Separate gene analyses and combined analyses were performed using first, the maximum parsimony criterion and second, a Bayesian framework. The homogeneity of data partitions was evaluated via a combination of homogeneity partition tests and tree topology incongruence tests before conducting combined analyses. A last incongruence re-evaluation using partitioned Bremer support was performed on the combined tree, which corroborated the previous estimates. After each separate data set attributes were examined, simple explanations were advocated as the causes of the significant incongruences detected. The analysis of multiple gene partitions showed unprecedented phylogenetic resolution within the genus Septoria that supported the results from previously published single gene phylogenies. Specifically, we have delimited distinct but closely related species representing monophyletic groups that frequently correlated with their respective host families. Conversely, the occurrence of well-supported groups including closely related but distinct molecular taxa sampled on unrelated host-plants allowed us to reject, in these particular cases, the co-evolutionary concept expected between a parasite and its host and to discuss alternative evolutionary models recently proposed for these pathogens.
Collapse
Affiliation(s)
- Nicolas Feau
- Centre de Recherche en Biologie Forestière, Université Laval, Sainte-Foy, Que., Canada G1K 7P4
| | | | | |
Collapse
|
27
|
Tavares ES, Baker AJ, Pereira SL, Miyaki CY. Phylogenetic relationships and historical biogeography of neotropical parrots (Psittaciformes: Psittacidae: Arini) inferred from mitochondrial and nuclear DNA sequences. Syst Biol 2006; 55:454-70. [PMID: 16861209 DOI: 10.1080/10635150600697390] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open
Abstract
Previous hypotheses of phylogenetic relationships among Neotropical parrots were based on limited taxon sampling and lacked support for most internal nodes. In this study we increased the number of taxa (29 species belonging to 25 of the 30 genera) and gene sequences (6388 base pairs of RAG-1, cyt b, NADH2, ATPase 6, ATPase 8, COIII, 12S rDNA, and 16S rDNA) to obtain a stronger molecular phylogenetic hypothesis for this group of birds. Analyses of the combined gene sequences using maximum likelihood and Bayesian methods resulted in a well-supported phylogeny and indicated that amazons and allies are a sister clade to macaws, conures, and relatives, and these two clades are in turn a sister group to parrotlets. Key morphological and behavioral characters used in previous classifications were mapped on the molecular tree and were phylogenetically uninformative. We estimated divergence times of taxa using the molecular tree and Bayesian and penalized likelihood methods that allow for rate variation in DNA substitutions among sites and taxa. Our estimates suggest that the Neotropical parrots shared a common ancestor with Australian parrots 59 Mya (million of years ago; 95% credibility interval (CrI) 66, 51 Mya), well before Australia separated from Antarctica and South America, implying that ancestral parrots were widespread in Gondwanaland. Thus, the divergence of Australian and Neotropical parrots could be attributed to vicariance. The three major clades of Neotropical parrots originated about 50 Mya (95% CrI 57, 41 Mya), coinciding with periods of higher sea level when both Antarctica and South America were fragmented with transcontinental seaways, and likely isolated the ancestors of modern Neotropical parrots in different regions in these continents. The correspondence between major paleoenvironmental changes in South America and the diversification of genera in the clade of amazons and allies between 46 and 16 Mya suggests they diversified exclusively in South America. Conversely, ancestors of parrotlets and of macaws, conures, and allies may have been isolated in Antarctica and/or the southern cone of South America, and only dispersed out of these southern regions when climate cooled and Antarctica became ice-encrusted about 35 Mya. The subsequent radiation of macaws and their allies in South America beginning about 28 Mya (95% CrI 22, 35 Mya) coincides with the uplift of the Andes and the subsequent formation of dry, open grassland habitats that would have facilitated ecological speciation via niche expansion from forested habitats.
Collapse
Affiliation(s)
- Erika Sendra Tavares
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, R. do Matão 277, (C.Y.M.), 05508-090, São Paulo, SP, Brazil
| | | | | | | |
Collapse
|
28
|
Jansa SA, Forsman JF, Voss RS. Different patterns of selection on the nuclear genes IRBP and DMP-1 affect the efficiency but not the outcome of phylogeny estimation for didelphid marsupials. Mol Phylogenet Evol 2006; 38:363-80. [PMID: 16054401 DOI: 10.1016/j.ympev.2005.06.007] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2005] [Revised: 06/03/2005] [Accepted: 06/07/2005] [Indexed: 11/29/2022]
Abstract
Selection at the protein-level can influence nucleotide substitution patterns for protein-coding genes, which in turn can affect their performance as phylogenetic characters. In this study, we compare two protein-coding nuclear genes that appear to have evolved under markedly different selective constraints and evaluate how selection has shaped their phylogenetic signal. We sequenced 1,100+ bp of exon 6 of the gene encoding dentin matrix protein 1 (DMP1) from most of the currently recognized genera of New World opossums (family: Didelphidae) and compared these data to an existing matrix of sequences from the interphotoreceptor retinoid-binding protein gene (IRBP) and morphological characters. In comparison to IRBP, DMP1 has far fewer sites under strong purifying selection and exhibits a number of sites under positive directional selection. Furthermore, selection on the DMP1 protein appears to conserve short, acidic, serine-rich domains rather than primary amino acid sequence; as a result, DMP1 has significantly different nucleotide substitution patterns from IRBP. Using Bayesian methods, we determined that DMP1 evolves almost 30% faster than IRBP, has 2.5 times more variable sites, has less among-site rate heterogeneity, is skewed toward A and away from CT (IRBP has relatively even base frequencies), and has a significantly lower rate of change between adenine and any other nucleotide. Despite these different nucleotide substitution patterns, estimates of didelphid relationships based on separate phylogenetic analyses of these genes are remarkably congruent whether patterns of nucleotide substitution are explicitly modeled or not. Nonetheless, DMP1 contains more phylogenetically informative characters per unit sequence and resolves more nodes with higher support than does IRBP. Thus, for these two genes, relaxed functional constraints and positive selection appear to improve the efficiency of phylogenetic estimation without compromising its accuracy.
Collapse
Affiliation(s)
- Sharon A Jansa
- Bell Museum of Natural History and Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, 55108, USA.
| | | | | |
Collapse
|
29
|
Gissi C, San Mauro D, Pesole G, Zardoya R. Mitochondrial phylogeny of Anura (Amphibia): A case study of congruent phylogenetic reconstruction using amino acid and nucleotide characters. Gene 2006; 366:228-37. [PMID: 16307849 DOI: 10.1016/j.gene.2005.07.034] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2005] [Revised: 07/11/2005] [Accepted: 07/20/2005] [Indexed: 11/15/2022]
Abstract
We explore whether phylogenetic analyses of the same sequence data set at the amino acid and nucleotide level are able to recover congruent topologies, as well as the advantages and limitations of both alternative approaches. As a case study, mitochondrial protein-coding genes were used to discern among competing hypotheses on the phylogenetic relationships of major anuran amphibian lineages. To properly address this phylogenetic question, the complete nucleotide sequences of the mitochondrial genomes of two archaeobatrachian species, Ascaphus truei and Pelobates cultripes, were determined anew. Bayesian and maximum likelihood phylogenetic inferences of the same sequence data set were performed based on both amino acid and nucleotide characters, with the latter analysed either as codons or as a reduced data set of first+second (P12) codon positions. In addition, likelihood-based ratio tests were performed to evaluate the support of alternative topologies. The different data sets arrived at congruent and highly supported topologies, suggesting a similar phylogenetic resolving power of the two character types provided that correctly selected sites and appropriate evolutionary models are used. The reconstructed anuran mitochondrial phylogeny supports the paraphyly of Archaeobatrachia, with Ascaphus as sister group to all the remaining anurans, and Pelobates as sister group of Neobatrachia. However, the employed tree reconstruction methods and likelihood-based ratio tests seemed to be negatively affected by the fast evolving sequences of neobatrachians, suggesting that the phylogeny of Anura here presented is not definitive, and needs further investigation using an extended taxon sampling.
Collapse
Affiliation(s)
- Carmela Gissi
- Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano, Via Celoria, 26- 20133 Milano, Italy.
| | | | | | | |
Collapse
|
30
|
Dettai A, Lecointre G. Further support for the clades obtained by multiple molecular phylogenies in the acanthomorph bush. C R Biol 2005; 328:674-89. [PMID: 15992750 DOI: 10.1016/j.crvi.2005.04.002] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2004] [Revised: 04/11/2005] [Accepted: 04/12/2005] [Indexed: 10/25/2022]
Abstract
Several recent molecular studies have begun to clarify the phylogeny of Acanthomorpha (Teleostei), a wide clade of teleost fishes. However, different molecular datasets do not agree on a single history of the taxa, probably because of marker-specific biases. The 'total-evidence' approach maximizes character congruence, but may be biased by a single robust, but non-phylogenetic constraint from one dataset. We have therefore taken the approach to analyse also each dataset separately prior to their combination, and detect repeated groups: signal common to markers is more probably a reflection of shared ancestry than marker-specific signal. Partial sequences (678+527 base pairs) of exons of the MLL gene (Mixed Lineage Leukaemia-like) gene were used, as well as the datasets of Chen et al. (ribosomal 28S, rhodopsin gene, mitochondrial 12S and 16S). Most of the repeated clades of Chen et al. are supported by the new dataset. Some new groups were repeatedly found: a Scarus-Labrus group (clade M), the presence of Gasterosteidae as a sister taxon or within the clade Zoarcoidei-Cottoidei (clade Is), Polymixia as a sister-group to the clade Zeoidei-Gadiformes (clade O), the clade Q grouping Mugiloidei, Cichlidae, Atherinomorpha, Blennioidei and Gobiesocoidei; and the interesting clade N, reducing potential sister-groups to Tetraodontiformes to either Caproidei, Lophiiformes, Acanthuroidei, Drepanidae, Chaetodontidae, and Pomacanthidae.
Collapse
Affiliation(s)
- Agnès Dettai
- UMR 7138 CNRS Systématique, Adaptation, Evolution, département Systématique et Evolution, Muséum national d'histoire naturelle, 57, rue Cuvier, case postale 26, 75231 Paris cedex 05, France
| | | |
Collapse
|
31
|
Simmons MP, Carr TG, O'Neill K. Relative character-state space, amount of potential phylogenetic information, and heterogeneity of nucleotide and amino acid characters. Mol Phylogenet Evol 2005; 32:913-26. [PMID: 15288066 DOI: 10.1016/j.ympev.2004.04.011] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2003] [Revised: 03/10/2004] [Indexed: 11/16/2022]
Abstract
We examined a broad selection of protein-coding loci from a diverse array of clades and genomes to quantify three factors that determine whether nucleotide or amino acid characters should be preferred for phylogenetic inference. First, we quantified the difference in observed character-state space between nucleotides and amino acids. Second, we quantified the loss of potential phylogenetic signal from silent substitutions when amino acids are used. Third, we used the disparity index to quantify the relative compositional heterogeneity of nucleotides and amino acids and then determined how commonly convergent (rather than unique) shifts in nucleotide and amino acid composition occur in a phylogenetic context. The greater potential phylogenetic signal for nucleotide characters was found to be enormous (on average 440% that of amino acids), whereas the greater observed character-state space for amino acids was less impressive (on average 150.4% that of nucleotides). While matrices of amino acid sequences had less compositional heterogeneity than their corresponding nucleotide sequences, heterogeneity in amino acid composition may be more homoplasious than heterogeneity in nucleotide composition. Given the ability of increased taxon sampling to better utilize the greater potential phylogenetic signal of nucleotide characters and decrease the potential for artifacts caused by heterogeneous nucleotide composition among taxa, we suggest that increased taxon sampling be performed whenever possible instead of restricting analyses to amino acid characters.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA.
| | | | | |
Collapse
|
32
|
Holcroft NI. A molecular test of alternative hypotheses of tetraodontiform (Acanthomorpha: Tetraodontiformes) sister group relationships using data from the RAG1 gene. Mol Phylogenet Evol 2005; 32:749-60. [PMID: 15288052 DOI: 10.1016/j.ympev.2004.04.002] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2003] [Revised: 04/12/2004] [Indexed: 11/30/2022]
Abstract
Two primary competing hypotheses regarding the identity of the sister group of the order Tetraodontiformes exist. The first hypothesis holds that some or all acanthuroid fishes represent the sister of Tetraodontiformes. The second, proposed in 1984 by Rosen, holds that the order Zeiformes is sister to Tetraodontiformes and that the family Caproidae is sister to this Zeiformes + Tetraodontiformes clade. These two hypotheses were tested using data from the single-copy nuclear gene RAG1. Representatives of most major orders of acanthomorph fishes were included to provide an appropriate context in which to place Tetraodontiformes and its hypothesized sister groups. The results of an unweighted parsimony analysis indicate that Zeiformes is not the sister group of Tetraodontiformes. In addition, Caproidae appears unrelated to Zeiformes. A monophyletic Tetraodontiformes was recovered as the sister group of the clade Ephippidae + Drepanidae and was more distantly related to the included zeiform and caproid representatives.
Collapse
Affiliation(s)
- Nancy I Holcroft
- Division of Ichthyology, Natural History Museum and Biodiversity Research Center, Department of Ecology and Evolutionary Biology, The University of Kansas, Lawrence, KS 66045-7451, USA.
| |
Collapse
|
33
|
Jermiin L, Ho SY, Ababneh F, Robinson J, Larkum AW. The biasing effect of compositional heterogeneity on phylogenetic estimates may be underestimated. Syst Biol 2004; 53:638-43. [PMID: 15371251 DOI: 10.1080/10635150490468648] [Citation(s) in RCA: 179] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
Affiliation(s)
- Lars Jermiin
- School of Biological Sciences, University of Sydney, NSW 2006, Australia.
| | | | | | | | | |
Collapse
|
34
|
Herbeck JT, Degnan PH, Wernegreen JJ. Nonhomogeneous model of sequence evolution indicates independent origins of primary endosymbionts within the enterobacteriales (gamma-Proteobacteria). Mol Biol Evol 2004; 22:520-32. [PMID: 15525700 DOI: 10.1093/molbev/msi036] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Standard methods of phylogenetic reconstruction are based on models that assume homogeneity of nucleotide composition among taxa. However, this assumption is often violated in biological data sets. In this study, we examine possible effects of nucleotide heterogeneity among lineages on the phylogenetic reconstruction of a bacterial group that spans a wide range of genomic nucleotide contents: obligately endosymbiotic bacteria and free-living or commensal species in the gamma-Proteobacteria. We focus on AT-rich primary endosymbionts to better understand the origins of obligately intracellular lifestyles. Previous phylogenetic analyses of this bacterial group point to the importance of accounting for base compositional variation in estimating relationships, particularly between endosymbiotic and free-living taxa. Here, we develop an approach to compare susceptibility of various phylogenetic reconstruction methods to the effects of nucleotide heterogeneity. First, we identify candidate trees of gamma-Proteobacteria groEL and 16S rRNA using approaches that assume homogeneous and stationary base composition, including Bayesian, maximum likelihood, parsimony, and distance methods. We then create permutations of the resulting candidate trees by varying the placement of the AT-rich endosymbiont Buchnera. These permutations are evaluated under the nonhomogeneous and nonstationary maximum likelihood model of Galtier and Gouy, which allows equilibrium base content to vary among examined lineages. Our results show that commonly used phylogenetic methods produce incongruent trees of the Enterobacteriales, and that the placement of Buchnera is especially unstable. However, under a nonhomogeneous model, various groEL and 16S rRNA phylogenies that separate Buchnera from other AT-rich endosymbionts (Blochmannia and Wigglesworthia) have consistently and significantly higher likelihood scores. Blochmannia and Wigglesworthia appear to have evolved from secondary endosymbionts, and represent an origin of primary endosymbiosis that is independent from Buchnera. This application of a nonhomogeneous model offers a computationally feasible way to test specific phylogenetic hypotheses for taxa with heterogeneous and nonstationary base composition.
Collapse
Affiliation(s)
- Joshua T Herbeck
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, USA.
| | | | | |
Collapse
|
35
|
Abstract
Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invariant sites, and covariotides, may interfere adversely with the recovery of the historical signal. The effect of the interaction of these signals on phylogenetic inference is not well understood and may, in many cases, even be underappreciated. In this study, we investigate this matter and present results based on Monte Carlo simulations. We explored the success of four phylogenetic methods in recovering the true tree from data that had evolved under conditions where the equilibrium base frequencies and substitution rates were allowed to vary among lineages. Seven scenarios with increasingly complex conditions were investigated. All of the methods tested, with the exception of neighbor-joining using LogDet distances, were sensitive to compositional convergence in nonsister lineages. Maximum parsimony was also susceptible to attraction between long edges. In many cases, however, phylogenetic inference methods can still recover the true tree when misleading signals are present, in some instances even when the historical signal is no longer dominant. These results highlight the growing need for simple methods to detect violation of the phylogenetic assumptions.
Collapse
Affiliation(s)
- Simon Y Ho
- 1School of Biological Sciences, University of Sydney, NSW 2006, Australia
| | | |
Collapse
|
36
|
Wägele JW, Holland B, Dreyer H, Hackethal B. Searching factors causing implausible non-monophyly: ssu rDNA phylogeny of Isopoda Asellota (Crustacea: Peracarida) and faster evolution in marine than in freshwater habitats. Mol Phylogenet Evol 2003; 28:536-51. [PMID: 12927137 DOI: 10.1016/s1055-7903(03)00053-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This contribution addresses two questions: which alignment patterns are causing non-monophyly of the Asellota and what is the phylogenetic history of this group? The Asellota are small benthic crustaceans occurring in most aquatic habitats. In view of the complex morphological apomorphies known for this group, monophyly of the Asellota has never been questioned. Using ssu rDNA sequences of outgroups and of 16 asellote species from fresh water, littoral marine habitats and from deep-sea localities, the early divergence between the lineages in fresh water and in the ocean, and the monophyly of the deep-sea taxon Munnopsidae are confirmed. Relative substitution rates of freshwater species are much lower than in other isopod species, rates being highest in some littoral marine genera (Carpias and Jaera). Furthermore, more sequence sites are variable in marine than in freshwater species, the latter conserve outgroup character states. Monophyly is recovered with parsimony methods, but not with distance and maximum likelihood analyses, which tear apart the marine from the freshwater species. The information content of alignments was studied with spectra of supporting positions. The scarcity of signal (=apomorphic nucleotides) supporting monophyly of the Asellota is attributed to a short stem-line of this group or to erosion of signal in fast evolving marine species. Parametric boostrapping in combination with spectra indicates that a tree model cannot explain the data and that monophyly of the Asellota should not be rejected even though many topologies do not recover this taxon.
Collapse
Affiliation(s)
- Johann-Wolfgang Wägele
- Lehrstuhl Spezielle Zoologie, Fakultät Biologie, Ruhr-Universität Bochum, 44780 Bochum, Germany.
| | | | | | | |
Collapse
|
37
|
Chen WJ, Bonillo C, Lecointre G. Repeatability of clades as a criterion of reliability: a case study for molecular phylogeny of Acanthomorpha (Teleostei) with larger number of taxa. Mol Phylogenet Evol 2003; 26:262-88. [PMID: 12565036 DOI: 10.1016/s1055-7903(02)00371-8] [Citation(s) in RCA: 256] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Although much progress has been made recently in teleostean phylogeny, relationships among the main lineages of the higher teleosts (Acanthomorpha), containing more than 60% of all fish species, remain poorly defined. This study represents the most extensive taxonomic sampling effort to date to collect new molecular characters for phylogenetic analysis of acanthomorph fishes. We compiled and analyzed three independent data sets, including: (i) mitochondrial ribosomal fragments from 12S and 16s (814bp for 97 taxa); (ii) nuclear ribosomal 28S sequences (847bp for 74 taxa); and (iii) a nuclear protein-coding gene, rhodopsin (759bp for 86 taxa). Detailed analyses were conducted on each data set separately and the principle of taxonomic congruence without consensus trees was used to assess confidence in the results as follows. Repeatability of clades from separate analyses was considered the primary criterion to establish reliability, rather than bootstrap proportions from a single combined (total evidence) data matrix. The new and reliable clades emerging from this study of the acanthomorph radiation were: Gadiformes (cods) with Zeioids (dories); Beloniformes (needlefishes) with Atheriniformes (silversides); blenioids (blennies) with Gobiesocoidei (clingfishes); Channoidei (snakeheads) with Anabantoidei (climbing gouramies); Mastacembeloidei (spiny eels) with Synbranchioidei (swamp-eels); the last two pairs of taxa grouping together, Syngnathoidei (aulostomids, macroramphosids) with Dactylopteridae (flying gurnards); Scombroidei (mackerels) plus Stromatoidei plus Chiasmodontidae; Ammodytidae (sand lances) with Cheimarrhichthyidae (torrentfish); Zoarcoidei (eelpouts) with Cottoidei; Percidae (perches) with Notothenioidei (Antarctic fishes); and a clade grouping Carangidae (jacks), Echeneidae (remoras), Sphyraenidae (barracudas), Menidae (moonfish), Polynemidae (threadfins), Centropomidae (snooks), and Pleuronectiformes (flatfishes).
Collapse
Affiliation(s)
- Wei-Jen Chen
- Laboratoire d'Ichtyologie générale et appliquée, et service de systématique moléculaire (IFR CNRS 1541), Muséun National d'Histoire Naturelle, 43 rue Cuvier, 75231 Paris cedex 05, France.
| | | | | |
Collapse
|
38
|
Simmons MP, Ochoterena H, Freudenstein JV. Amino acid vs. nucleotide characters: challenging preconceived notions. Mol Phylogenet Evol 2002; 24:78-90. [PMID: 12128030 DOI: 10.1016/s1055-7903(02)00202-6] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The 567-terminal analysis of atpB, rbcL, and 18S rDNA was used as an empirical example to test the use of amino acid vs. nucleotide characters for protein-coding genes at deeper taxonomic levels. Nucleotides for atpB and rbcL had 6.5 times the amount of possible synapomorphy as amino acids. Based on parsimony analyses with unordered character states, nucleotides outperformed amino acids for all three measures of phylogenetic signal used (resolution, branch support, and congruence with independent evidence). The nucleotide tree was much more resolved than the amino acid tree, for both large and small clades. Nearly twice the percentage of well-supported clades resolved in the 18S rDNA tree were resolved using nucleotides (91.8%) relative to amino acids (49.2%). The well-supported clades resolved by both character types were much better supported by nucleotides (98.7% vs. 83.8% average jackknife support). The faster evolving nucleotides with a smaller average character-state space outperformed the slower evolving amino acids with a larger average character-state space. Nucleotides outperformed amino acids even with 90% of the terminals deleted. The lack of resolution on the amino acid trees appears to be caused by a lack of congruence among the amino acids, not a lack of replacement substitutions.
Collapse
Affiliation(s)
- Mark P Simmons
- The Ohio State University Herbarium, 1315 Kinnear Road, Columbus, OH 43212, USA.
| | | | | |
Collapse
|
39
|
Paton T, Haddrath O, Baker AJ. Complete mitochondrial DNA genome sequences show that modern birds are not descended from transitional shorebirds. Proc Biol Sci 2002; 269:839-46. [PMID: 11958716 PMCID: PMC1690957 DOI: 10.1098/rspb.2002.1961] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
To test the hypothesis put forward by Feduccia of the origin of modern birds from transitional birds, we sequenced the first two complete mitochondrial genomes of shorebirds (ruddy turnstone and blackish oystercatcher) and compared their sequences with those of already published avian genomes. When corrected for rate heterogeneity across sites and non-homogeneous nucleotide compositions among lineages in maximum likelihood (ML), the optimal tree places palaeognath birds as sister to the neognaths including shorebirds. This optimal topology is a re-rooting of recently published ordinal-level avian trees derived from mitochondrial sequences. Using a penalized likelihood (PL) rate-smoothing process in conjunction with dates estimated from fossils, we show that the basal splits in the bird tree are much older than the Cretaceous-Tertiary (K-T) boundary, reinforcing previous molecular studies that rejected the derivation of modern birds from transitional shorebirds. Our mean estimate for the origin of modern birds at about 123 million years ago (Myr ago) is quite close to recent estimates using both nuclear and mitochondrial genes, and supports theories of continental break-up as a driving force in avian diversification. Not only did many modern orders of birds originate well before the K-T boundary, but the radiation of major clades occurred over an extended period of at least 40 Myr ago, thus also falsifying Feduccia's rapid radiation scenario following a K-T bottleneck.
Collapse
Affiliation(s)
- Tara Paton
- Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, 100 Queen's Park, Toronto, ON, Canada M5S 2C6.
| | | | | |
Collapse
|
40
|
Braun EL, Grotewold E. Fungal Zuotin proteins evolved from MIDA1-like factors by lineage-specific loss of MYB domains. Mol Biol Evol 2001; 18:1401-12. [PMID: 11420378 DOI: 10.1093/oxfordjournals.molbev.a003924] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Proteins are often characterized by the presence of multiple domains, which make specific contributions to their cellular function. While the gain of domains in proteins by duplication and shuffling is well established, domain loss is poorly documented. Here, we provide evidence that domain loss has played an important role in the evolution of protein architecture and function by demonstrating that fungal Zuotin proteins evolved from MIDA1-like proteins, present in animals and plants, by complete loss of the carboxyl-terminal MYB domains. Phylogenetic analyses of the DnaJ motif (the J domain) present in both Zuotin and MIDA1 proteins were complicated by the limited length and profound differences in evolutionary rates exhibited by this domain. To rigorously examine J domain phylogeny, we combined the nonparametric bootstrap with Monte Carlo simulation. This method, which we have designated the resampled parametric bootstrap, allowed us to assess type I and type II error associated with these analyses. These results revealed significant support for domain loss rather than domain gain or gene loss involving paralogs. The absence of sequences related to the MIDA1 MYB domains in Saccharomyces cerevisiae further indicates that the domains have been completely lost, consistent with known functional differences between Zuotin and MIDA1 proteins. These analyses suggest that the description of additional examples of complete domain loss may provide a method to identify orthologous proteins exhibiting functional differences using genomic sequence data.
Collapse
Affiliation(s)
- E L Braun
- Department of Plant Biology and Plant Biotechnology Center, Ohio State University, Columbus, 43210, USA.
| | | |
Collapse
|
41
|
Ascher JS, Danforth BN, Ji S. Phylogenetic utility of the major opsin in bees (Hymenoptera: Apoidea): a reassessment. Mol Phylogenet Evol 2001; 19:76-93. [PMID: 11286493 DOI: 10.1006/mpev.2001.0911] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Major opsin (LW Rh) DNA sequence has been reported to provide useful data for resolving phylogenetic relationships among tribes of corbiculate bees based on analyses of 502 bp of coding sequence. However, the corbiculate tribes are believed to be of Cretaceous age, and strong support for insect clades of this age from small data sets of nucleotide sequence data has rarely been demonstrated. To more critically assess opsin's phylogenetic utility we generated an expanded LW Rh data set by sequencing the same gene fragment from 52 additional bee species from 24 tribes and all six extant bee families. Analyses of this data set failed to provide substantial support for monophyly of corbiculate bees, for relationships among corbiculate tribes, or for most other well-established higher-level relationships among long-tongued bees. However, monophyly of nearly all genera and tribes is strongly supported, indicating that LW Rh provides useful phylogenetic signal at lower taxonomic levels. When our expanded LW Rh data set is combined with a morphological and behavioral data set for corbiculate bees, the results unambiguously support the traditional phylogeny of the corbiculate bee tribes: (Euglossini + (Bombini + (Meliponini + Apini))). This implies a single origin of advanced eusocial behavior among bees rather than dual origins, as proposed by several recent studies.
Collapse
Affiliation(s)
- J S Ascher
- Department of Entomology, Cornell University, Ithaca, New York 14853-0901, USA.
| | | | | |
Collapse
|
42
|
Monteiro A, Pierce NE. Phylogeny of Bicyclus (Lepidoptera: Nymphalidae) inferred from COI, COII, and EF-1alpha gene sequences. Mol Phylogenet Evol 2001; 18:264-81. [PMID: 11161761 DOI: 10.1006/mpev.2000.0872] [Citation(s) in RCA: 137] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Despite the fact that Bicyclus anynana has become an important model species for wing-pattern developmental biology and studies of phenotypic plasticity, little is known of the evolutionary history of the genus Bicyclus and the position of B. anynana. Understanding the evolution of development as well as the evolution of plasticity can be attempted in this species-rich genus that displays a large range of wing patterns with variable degrees of phenotypic responses to the environment. A context to guide extrapolations from population genetic studies within B. anynana to those between closely related species has been long overdue. A phylogeny of 54 of the 80 known Bicyclus species is presented based on the combined 3000-bp sequences of two mitochondrial genes, cytochrome oxidase I and II, and the nuclear gene, elongation factor 1alpha. A series of tree topologies, constructed either from the individual genes or from the combined data, using heuristic searches under a variety of weighting schemes were compared under the best maximum-likelihood models fitted for each gene separately. The most likely tree topology to have generated the three data sets was found to be a tree resulting from a combined MP analysis with equal weights. Most phylogenetic signal for the analysis comes from silent substitutions at the third position, and despite the faster rate of evolution and higher levels of homoplasy of the mitochondrial genes relative to the nuclear gene, the latter does not show substantially stronger support for basal clades. Finally, moving branches from the chosen tree topology to other positions on the tree so as to comply better with a previous morphological study did not significantly affect tree length.
Collapse
Affiliation(s)
- A Monteiro
- Museum of Comparative Zoology Labs, Harvard University, Boston, Massachusetts, USA
| | | |
Collapse
|
43
|
Abstract
Base composition varies at all levels of the phylogenetic hierarchy and throughout the genome, and can be caused by active selection or passive mutation pressure. This variation can make reconstructing trees difficult. However, recent tree-based analyses have shed light on the forces responsible for the evolution of base composition, forces that might be very general. More explicit tree-based work is encouraged.
Collapse
|