1
|
Near TJ, MacGuigan DJ, Parker E, Struthers CD, Jones CD, Dornburg A. Phylogenetic analysis of Antarctic notothenioids illuminates the utility of RADseq for resolving Cenozoic adaptive radiations. Mol Phylogenet Evol 2018; 129:268-279. [PMID: 30195039 DOI: 10.1016/j.ympev.2018.09.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Revised: 08/31/2018] [Accepted: 09/01/2018] [Indexed: 10/28/2022]
Abstract
Notothenioids are a clade of ∼120 species of marine fishes distributed in extreme southern hemisphere temperate near-shore habitats and in the Southern Ocean surrounding Antarctica. Over the past 25 years, molecular and morphological approaches have redefined hypotheses of relationships among notothenioid lineages as well as their relationships among major lineages of percomorph teleosts. These phylogenies provide a basis for investigation of mechanisms of evolutionary diversification within the clade and have enhanced our understanding of the notothenioid adaptive radiation. Despite extensive efforts, there remain several questions concerning the phylogeny of notothenioids. In this study, we deploy DNA sequences of ∼100,000 loci obtained using RADseq to investigate the phylogenetic relationships of notothenioids and to assess the utility of RADseq loci for lineages that exhibit divergence times ranging from the Paleogene to the Quaternary. The notothenioid phylogenies inferred from the RADseq loci provide unparalleled resolution and node support for several long-standing problems including, (1) relationships among species of Trematomus, (2) resolution of Indonotothenia cyanobrancha as the sister lineage of Trematomus, (3) the deep paraphyly of Nototheniidae, (4) the paraphyly of Lepidonotothen s.l., (5) paraphyly of Artedidraco, and 6) the monophyly of the Bathydraconidae. Assessment of site rates demonstrates that RADseq loci are similar to mtDNA protein coding genes and exhibit peak phylogenetic informativeness at the time interval during which the major Antarctic notothenioid lineages originated and diversified. In addition to providing a well-resolved phylogenetic hypothesis for notothenioids, our analyses quantify the predicted utility of RADseq loci for Cenozoic phylogenetic inferences.
Collapse
Affiliation(s)
- Thomas J Near
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA; Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA.
| | - Daniel J MacGuigan
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA
| | - Elyse Parker
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA
| | - Carl D Struthers
- Museum of New Zealand Te Papa Tongarewa, Wellington, New Zealand
| | - Christopher D Jones
- Antarctic Ecosystem Research Division, NOAA Southwest Fisheries Science Center, La Jolla, CA 92037, USA
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC 27601, USA
| |
Collapse
|
2
|
Dong W, Xu C, Wu P, Cheng T, Yu J, Zhou S, Hong DY. Resolving the systematic positions of enigmatic taxa: Manipulating the chloroplast genome data of Saxifragales. Mol Phylogenet Evol 2018; 126:321-330. [PMID: 29702217 DOI: 10.1016/j.ympev.2018.04.033] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2017] [Revised: 04/20/2018] [Accepted: 04/20/2018] [Indexed: 01/30/2023]
Abstract
Accurately resolving the phylogeny of enigmatic taxa is always a challenge in phylogenetic inference. Such uncertainties could be due to systematic errors or model violations. Here, we provide an example demonstrating how these factors affect the positioning of Paeoniaceae within Saxifragales based on chloroplast genome data. We newly assembled 14 chloroplast genomes from Saxifragales, and by combining these genomes with those of 63 other angiosperms, three datasets were assembled to test different hypotheses proposed by recent studies. These datasets were subjected to maximum parsimony, maximum likelihood and Bayesian analyses with site-homogeneous/heterogeneous models, different data partitioning strategies, and the inclusion/exclusion of weak phylogenetic signals. Three datasets exhibited remarkable heterogeneity among sites and among taxa of Saxifragales. Phylogenetic analyses under homogeneous models or maximum parsimony showed a closer relationship of Paeoniaceae with herbaceous families in the order. Data partitioning strategies did not change the general tree topology. However, PhyloBayes analysis under the CAT+GTR model resulted in a relationship closer to woody families. We conclude that although genomic data significantly increase the phylogenetic resolution of enigmatic taxa with high support, the phylogenetic results inferred from such data might be analysis or signal dependent. The analytical pipeline outlined here combines phylogenomic inference methods with evaluation of lineage-specific rates of substitution, model selection, and assessment of systematic error. These methods would be applicable to resolve similar difficult questions in the tree of life.
Collapse
Affiliation(s)
- Wenpan Dong
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Chao Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ping Wu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tao Cheng
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Jing Yu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Shiliang Zhou
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| | - De-Yuan Hong
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China.
| |
Collapse
|
3
|
Simmons MP, Gatesy J. Biases of tree-independent-character-subsampling methods. Mol Phylogenet Evol 2016; 100:424-443. [DOI: 10.1016/j.ympev.2016.04.022] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Revised: 03/16/2016] [Accepted: 04/15/2016] [Indexed: 12/21/2022]
|
4
|
Xi Z, Liu L, Davis CC. The Impact of Missing Data on Species Tree Estimation. Mol Biol Evol 2015; 33:838-60. [DOI: 10.1093/molbev/msv266] [Citation(s) in RCA: 101] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
|
5
|
Iwamoto A, Izumidate R, Ronse De Craene LP. Floral anatomy and vegetative development in Ceratophyllum demersum: a morphological picture of an "unsolved" plant. AMERICAN JOURNAL OF BOTANY 2015; 102:1578-89. [PMID: 26419811 DOI: 10.3732/ajb.1500124] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 08/31/2015] [Indexed: 05/26/2023]
Abstract
PREMISE OF THE STUDY The phylogenetic position of Ceratophyllum is still controversial in recent molecular analyses of angiosperms, with various suggestions of a sister group relation to all other angiosperms, eudicots, monocots, eudicots + monocots, and magnoliids. Therefore, the morphological characters of Ceratophyllum are important for resolving the phylogeny of angiosperms. In this study, we observed the detailed developmental anatomy of all lateral organs and their configurations to elucidate the floral development and phyllotactic pattern of Ceratophyllum demersum. METHODS We observed fixed shoots of C. demersum with scanning electron microscopy and serial sections of the samples with light microscopy. KEY RESULTS Bract primordia arise first, followed by the stamen primordia in staminate flowers. Both bracts and stamens initiate unidirectionally, first on the abaxial side of the floral apex and later on the adaxial side, most likely due to the contact pressure imposed by the leaf primordium at the superior node. In pistillate flowers, bract primordia on the abaxial side were also initiated first. The configuration of buds at one node showed six patterns and each pattern included at least one vegetative bud, and flower buds were always accompanied by vegetative buds at the same node. CONCLUSIONS The initiation pattern of organs in the outer whorls of C. demersum flowers is distorted by mechanical pressure, resulting in the phyllotactic variation of staminate flowers. Vegetative buds are the main axillary buds with floral buds as accessory buds, which suggests that the shoot of C. demersum has been modified from a decussate phyllotaxis.
Collapse
Affiliation(s)
- Akitoshi Iwamoto
- Department of Biology, Tokyo Gakugei University, 4-1-1 Nukui Kita-machi, Koganei-shi, Tokyo 184-8501, Japan
| | - Ryoko Izumidate
- Department of Biology, Tokyo Gakugei University, 4-1-1 Nukui Kita-machi, Koganei-shi, Tokyo 184-8501, Japan
| | | |
Collapse
|
6
|
Les DH. Water from the rock: Ancient aquatic angiosperms flow from the fossil record. Proc Natl Acad Sci U S A 2015; 112:10825-6. [PMID: 26290578 PMCID: PMC4568265 DOI: 10.1073/pnas.1514280112] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Donald H Les
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269
| |
Collapse
|
7
|
Goremykin VV, Nikiforova SV, Cavalieri D, Pindo M, Lockhart P. The Root of Flowering Plants and Total Evidence. Syst Biol 2015; 64:879-91. [DOI: 10.1093/sysbio/syv028] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Accepted: 05/05/2015] [Indexed: 11/14/2022] Open
|
8
|
Dornburg A, Friedman M, Near TJ. Phylogenetic analysis of molecular and morphological data highlights uncertainty in the relationships of fossil and living species of Elopomorpha (Actinopterygii: Teleostei). Mol Phylogenet Evol 2015; 89:205-18. [PMID: 25899306 DOI: 10.1016/j.ympev.2015.04.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Revised: 03/31/2015] [Accepted: 04/07/2015] [Indexed: 02/05/2023]
Abstract
Elopomorpha is one of the three main clades of living teleost fishes and includes a range of disparate lineages including eels, tarpons, bonefishes, and halosaurs. Elopomorphs were among the first groups of fishes investigated using Hennigian phylogenetic methods and continue to be the object of intense phylogenetic scrutiny due to their economic significance, diversity, and crucial evolutionary status as the sister group of all other teleosts. While portions of the phylogenetic backbone for Elopomorpha are consistent between studies, the relationships among Albula, Pterothrissus, Notacanthiformes, and Anguilliformes remain contentious and difficult to evaluate. This lack of phylogenetic resolution is problematic as fossil lineages are often described and placed taxonomically based on an assumed sister group relationship between Albula and Pterothrissus. In addition, phylogenetic studies using morphological data that sample elopomorph fossil lineages often do not include notacanthiform or anguilliform lineages, potentially introducing a bias toward interpreting fossils as members of the common stem of Pterothrissus and Albula. Here we provide a phylogenetic analysis of DNA sequences sampled from multiple nuclear genes that include representative taxa from Albula, Pterothrissus, Notacanthiformes and Anguilliformes. We integrate our molecular dataset with a morphological character matrix that spans both living and fossil elopomorph lineages. Our results reveal substantial uncertainty in the placement of Pterothrissus as well as all sampled fossil lineages, questioning the stability of the taxonomy of fossil Elopomorpha. However, despite topological uncertainty, our integration of fossil lineages into a Bayesian time calibrated framework provides divergence time estimates for the clade that are consistent with previously published age estimates based on the elopomorph fossil record and molecular estimates resulting from traditional node-dating methods.
Collapse
Affiliation(s)
- Alex Dornburg
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA.
| | - Matt Friedman
- Department of Earth Sciences, University of Oxford, South Parks Road, Oxford OX1 3AN, UK
| | - Thomas J Near
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA; Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
9
|
Breinholt JW, Kawahara AY. Phylotranscriptomics: saturated third codon positions radically influence the estimation of trees based on next-gen data. Genome Biol Evol 2014; 5:2082-92. [PMID: 24148944 PMCID: PMC3845638 DOI: 10.1093/gbe/evt157] [Citation(s) in RCA: 87] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Recent advancements in molecular sequencing techniques have led to a surge in the number of phylogenetic studies that incorporate large amounts of genetic data. We test the assumption that analyzing large number of genes will lead to improvements in tree resolution and branch support using moths in the superfamily Bombycoidea, a group with some interfamilial relationships that have been difficult to resolve. Specifically, we use a next-gen data set that included 19 taxa and 938 genes (∼1.2M bp) to examine how codon position and saturation might influence resolution and node support among three key families. Maximum likelihood, parsimony, and species tree analysis using gene tree parsimony, on different nucleotide and amino acid data sets, resulted in largely congruent topologies with high bootstrap support compared with prior studies that included fewer loci. However, for a few shallow nodes, nucleotide and amino acid data provided high support for conflicting relationships. The third codon position was saturated and phylogenetic analysis of this position alone supported a completely different, potentially misleading sister group relationship. We used the program RADICAL to assess the number of genes needed to fix some of these difficult nodes. One such node originally needed a total of 850 genes but only required 250 when synonymous signal was removed. Our study shows that, in order to effectively use next-gen data to correctly resolve difficult phylogenetic relationships, it is necessary to assess the effects of synonymous substitutions and third codon positions.
Collapse
|
10
|
Xi Z, Liu L, Rest JS, Davis CC. Coalescent versus Concatenation Methods and the Placement of Amborella as Sister to Water Lilies. Syst Biol 2014; 63:919-32. [DOI: 10.1093/sysbio/syu055] [Citation(s) in RCA: 142] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Zhenxiang Xi
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; 2Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA; 3Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
| | - Liang Liu
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; 2Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA; 3Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
| | - Joshua S. Rest
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; 2Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA; 3Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
| | - Charles C. Davis
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; 2Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA; 3Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
| |
Collapse
|
11
|
Drew BT, Ruhfel BR, Smith SA, Moore MJ, Briggs BG, Gitzendanner MA, Soltis PS, Soltis DE. Another Look at the Root of the Angiosperms Reveals a Familiar Tale. Syst Biol 2014; 63:368-82. [DOI: 10.1093/sysbio/syt108] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
12
|
Wu CS, Chaw SM, Huang YY. Chloroplast phylogenomics indicates that Ginkgo biloba is sister to cycads. Genome Biol Evol 2013; 5:243-54. [PMID: 23315384 PMCID: PMC3595029 DOI: 10.1093/gbe/evt001] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/04/2013] [Indexed: 12/23/2022] Open
Abstract
Molecular phylogenetic studies have not yet reached a consensus on the placement of Ginkgoales, which is represented by the only living species, Ginkgo biloba (common name: ginkgo). At least six discrepant placements of ginkgo have been proposed. This study aimed to use the chloroplast phylogenomic approach to examine possible factors that lead to such disagreeing placements. We found the sequence types used in the analyses as the most critical factor in the conflicting placements of ginkgo. In addition, the placement of ginkgo varied in the trees inferred from nucleotide (NU) sequences, which notably depended on breadth of taxon sampling, tree-building methods, codon positions, positions of Gnetopsida (common name: gnetophytes), and including or excluding gnetophytes in data sets. In contrast, the trees inferred from amino acid (AA) sequences congruently supported the monophyly of a ginkgo and Cycadales (common name: cycads) clade, regardless of which factors were examined. Our site-stripping analysis further revealed that the high substitution saturation of NU sequences mainly derived from the third codon positions and contributed to the variable placements of ginkgo. In summary, the factors we surveyed did not affect results inferred from analyses of AA sequences. Congruent topologies in our AA trees give more confidence in supporting the ginkgo-cycad sister-group hypothesis.
Collapse
Affiliation(s)
- Chung-Shien Wu
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Shu-Miaw Chaw
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Ya-Yi Huang
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
13
|
Zhang N, Zeng L, Shan H, Ma H. Highly conserved low-copy nuclear genes as effective markers for phylogenetic analyses in angiosperms. THE NEW PHYTOLOGIST 2012; 195:923-937. [PMID: 22783877 DOI: 10.1111/j.1469-8137.2012.04212.x] [Citation(s) in RCA: 127] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Organismal phylogeny provides a crucial evolutionary framework for many studies and the angiosperm phylogeny has been greatly improved recently, largely using organellar and rDNA genes. However, low-copy protein-coding nuclear genes have not been widely used on a large scale in spite of the advantages of their biparental inheritance and vast number of choices. Here, we identified 1083 highly conserved low-copy nuclear genes by genome comparison. Furthermore, we demonstrated the use of five nuclear genes in 91 angiosperms representing 46 orders (73% of orders) and three gymnosperms as outgroups for a highly resolved phylogeny. These nuclear genes are easy to clone and align, and more phylogenetically informative than widely used organellar genes. The angiosperm phylogeny reconstructed using these genes was largely congruent with previous ones mainly inferred from organellar genes. Intriguingly, several new placements were uncovered for some groups, including those among the rosids, the asterids, and between the eudicots and several basal angiosperm groups. These conserved universal nuclear genes have several inherent qualities enabling them to be good markers for reconstructing angiosperm phylogeny, even eukaryotic relationships, further providing new insights into the evolutionary history of angiosperms.
Collapse
Affiliation(s)
- Ning Zhang
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China
| | - Liping Zeng
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China
| | - Hongyan Shan
- State Key Laboratory of Systematic and Evolution Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Hong Ma
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, Center for Evolutionary Biology, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China
- Institutes of Biomedical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai 200032, China
| |
Collapse
|
14
|
Parks M, Cronn R, Liston A. Separating the wheat from the chaff: mitigating the effects of noise in a plastome phylogenomic data set from Pinus L. (Pinaceae). BMC Evol Biol 2012; 12:100. [PMID: 22731878 PMCID: PMC3475122 DOI: 10.1186/1471-2148-12-100] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2012] [Accepted: 06/14/2012] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Through next-generation sequencing, the amount of sequence data potentially available for phylogenetic analyses has increased exponentially in recent years. Simultaneously, the risk of incorporating 'noisy' data with misleading phylogenetic signal has also increased, and may disproportionately influence the topology of weakly supported nodes and lineages featuring rapid radiations and/or elevated rates of evolution. RESULTS We investigated the influence of phylogenetic noise in large data sets by applying two fundamental strategies, variable site removal and long-branch exclusion, to the phylogenetic analysis of a full plastome alignment of 107 species of Pinus and six Pinaceae outgroups. While high overall phylogenetic resolution resulted from inclusion of all data, three historically recalcitrant nodes remained conflicted with previous analyses. Close investigation of these nodes revealed dramatically different responses to data removal. Whereas topological resolution and bootstrap support for two clades peaked with removal of highly variable sites, the third clade resolved most strongly when all sites were included. Similar trends were observed using long-branch exclusion, but patterns were neither as strong nor as clear. When compared to previous phylogenetic analyses of nuclear loci and morphological data, the most highly supported topologies seen in Pinus plastome analysis are congruent for the two clades gaining support from variable site removal and long-branch exclusion, but in conflict for the clade with highest support from the full data set. CONCLUSIONS These results suggest that removal of misleading signal in phylogenomic datasets can result not only in increased resolution for poorly supported nodes, but may serve as a tool for identifying erroneous yet highly supported topologies. For Pinus chloroplast genomes, removal of variable sites appears to be more effective than long-branch exclusion for clarifying phylogenetic hypotheses.
Collapse
Affiliation(s)
- Matthew Parks
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331-2902, USA
| | - Richard Cronn
- Pacific Northwest Research Station, USDA Forest Service, Corvallis, OR 97331, USA
| | - Aaron Liston
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331-2902, USA
| |
Collapse
|
15
|
Wu CS, Wang YN, Hsu CY, Lin CP, Chaw SM. Loss of different inverted repeat copies from the chloroplast genomes of Pinaceae and cupressophytes and influence of heterotachy on the evaluation of gymnosperm phylogeny. Genome Biol Evol 2011; 3:1284-95. [PMID: 21933779 PMCID: PMC3219958 DOI: 10.1093/gbe/evr095] [Citation(s) in RCA: 119] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/12/2011] [Indexed: 12/13/2022] Open
Abstract
The relationships among the extant five gymnosperm groups--gnetophytes, Pinaceae, non-Pinaceae conifers (cupressophytes), Ginkgo, and cycads--remain equivocal. To clarify this issue, we sequenced the chloroplast genomes (cpDNAs) from two cupressophytes, Cephalotaxus wilsoniana and Taiwania cryptomerioides, and 53 common chloroplast protein-coding genes from another three cupressophytes, Agathis dammara, Nageia nagi, and Sciadopitys verticillata, and a non-Cycadaceae cycad, Bowenia serrulata. Comparative analyses of 11 conifer cpDNAs revealed that Pinaceae and cupressophytes each lost a different copy of inverted repeats (IRs), which contrasts with the view that the same IR has been lost in all conifers. Based on our structural finding, the character of an IR loss no longer conflicts with the "gnepines" hypothesis (gnetophytes sister to Pinaceae). Chloroplast phylogenomic analyses of amino acid sequences recovered incongruent topologies using different tree-building methods; however, we demonstrated that high heterotachous genes (genes that have highly different rates in different lineages) contributed to the long-branch attraction (LBA) artifact, resulting in incongruence of phylogenomic estimates. Additionally, amino acid compositions appear more heterogeneous in high than low heterotachous genes among the five gymnosperm groups. Removal of high heterotachous genes alleviated the LBA artifact and yielded congruent and robust tree topologies in which gnetophytes and Pinaceae formed a sister clade to cupressophytes (the gnepines hypothesis) and Ginkgo clustered with cycads. Adding more cupressophyte taxa could not improve the accuracy of chloroplast phylogenomics for the five gymnosperm groups. In contrast, removal of high heterotachous genes from data sets is simple and can increase confidence in evaluating the phylogeny of gymnosperms.
Collapse
Affiliation(s)
- Chung-Shien Wu
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Ya-Nan Wang
- School of Forestry and Resource Conservation, National Taiwan University, Taipei, Taiwan
| | - Chi-Yao Hsu
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Ching-Ping Lin
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Shu-Miaw Chaw
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
16
|
Goremykin VV, Nikiforova SV, Bininda-Emonds ORP. Automated Removal of Noisy Data in Phylogenomic Analyses. J Mol Evol 2010; 71:319-31. [DOI: 10.1007/s00239-010-9398-z] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2010] [Accepted: 10/06/2010] [Indexed: 10/18/2022]
|
17
|
Schwartz RS, Mueller RL. Branch length estimation and divergence dating: estimates of error in Bayesian and maximum likelihood frameworks. BMC Evol Biol 2010; 10:5. [PMID: 20064267 PMCID: PMC2827399 DOI: 10.1186/1471-2148-10-5] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2009] [Accepted: 01/11/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Estimates of divergence dates between species improve our understanding of processes ranging from nucleotide substitution to speciation. Such estimates are frequently based on molecular genetic differences between species; therefore, they rely on accurate estimates of the number of such differences (i.e. substitutions per site, measured as branch length on phylogenies). We used simulations to determine the effects of dataset size, branch length heterogeneity, branch depth, and analytical framework on branch length estimation across a range of branch lengths. We then reanalyzed an empirical dataset for plethodontid salamanders to determine how inaccurate branch length estimation can affect estimates of divergence dates. RESULTS The accuracy of branch length estimation varied with branch length, dataset size (both number of taxa and sites), branch length heterogeneity, branch depth, dataset complexity, and analytical framework. For simple phylogenies analyzed in a Bayesian framework, branches were increasingly underestimated as branch length increased; in a maximum likelihood framework, longer branch lengths were somewhat overestimated. Longer datasets improved estimates in both frameworks; however, when the number of taxa was increased, estimation accuracy for deeper branches was less than for tip branches. Increasing the complexity of the dataset produced more misestimated branches in a Bayesian framework; however, in an ML framework, more branches were estimated more accurately. Using ML branch length estimates to re-estimate plethodontid salamander divergence dates generally resulted in an increase in the estimated age of older nodes and a decrease in the estimated age of younger nodes. CONCLUSIONS Branch lengths are misestimated in both statistical frameworks for simulations of simple datasets. However, for complex datasets, length estimates are quite accurate in ML (even for short datasets), whereas few branches are estimated accurately in a Bayesian framework. Our reanalysis of empirical data demonstrates the magnitude of effects of Bayesian branch length misestimation on divergence date estimates. Because the length of branches for empirical datasets can be estimated most reliably in an ML framework when branches are <1 substitution/site and datasets are > or =1 kb, we suggest that divergence date estimates using datasets, branch lengths, and/or analytical techniques that fall outside of these parameters should be interpreted with caution.
Collapse
Affiliation(s)
- Rachel S Schwartz
- Department of Biology, Colorado State University, Fort Collins, CO 80523-1878, USA.
| | | |
Collapse
|