1
Mahbub S, Sawmya S, Saha A, Reaz R, Rahman MS, Bayzid MS. Quartet Based Gene Tree Imputation Using Deep Learning Improves Phylogenomic Analyses Despite Missing Data. J Comput Biol 2022; 29:1156-1172. [PMID: 36048555 DOI: 10.1089/cmb.2022.0212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022]
Abstract
Species tree estimation is frequently based on phylogenomic approaches that use multiple genes from throughout the genome. However, for a combination of reasons (ranging from sampling biases to more biological causes, as in gene birth and loss), gene trees are often incomplete, meaning that not all species of interest have a common set of genes. Incomplete gene trees can potentially impact the accuracy of phylogenomic inference. We, for the first time, introduce the problem of imputing the quartet distribution induced by a set of incomplete gene trees, which involves adding the missing quartets back to the quartet distribution. We present Quartet based Gene tree Imputation using Deep Learning (QT-GILD), an automated and specially tailored unsupervised deep learning technique, accompanied by cues from natural language processing, which learns the quartet distribution in a given set of incomplete gene trees and generates a complete set of quartets accordingly. QT-GILD is a general-purpose technique needing no explicit modeling of the subject system or reasons for missing data or gene tree heterogeneity. Experimental studies on a collection of simulated and empirical datasets suggest that QT-GILD can effectively impute the quartet distribution, which results in a dramatic improvement in the species tree accuracy. Remarkably, QT-GILD not only imputes the missing quartets but can also account for gene tree estimation error. Therefore, QT-GILD advances the state-of-the-art in species tree estimation from gene trees in the face of missing data.
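To make the notion of a "quartet distribution induced by a set of incomplete gene trees" concrete, the following is a minimal sketch and not QT-GILD itself. It assumes gene trees are given as nested Python tuples of taxon names (a hypothetical representation chosen for illustration) and counts, for every four-taxon subset present in a tree, which of the three possible quartet topologies that tree induces. A tree that lacks a taxon contributes nothing for quartets involving that taxon, which is the gap the imputation problem targets.

```python
from collections import Counter
from itertools import combinations


def clades(tree):
    """Return (leaf set, leaf sets of all internal nodes) for a nested-tuple tree."""
    if isinstance(tree, str):
        return frozenset([tree]), []
    leaves, found = frozenset(), []
    for child in tree:
        child_leaves, child_found = clades(child)
        leaves |= child_leaves
        found.extend(child_found)
    found.append(leaves)
    return leaves, found


def quartet_distribution(gene_trees):
    """Count induced quartet topologies over a list of (possibly incomplete) trees."""
    counts = Counter()
    for tree in gene_trees:
        leaves, internal = clades(tree)
        for quartet in combinations(sorted(leaves), 4):
            q = frozenset(quartet)
            for clade in internal:
                side = clade & q
                # A clade that separates two taxa from the other two fixes the topology.
                if len(side) == 2:
                    counts[frozenset([side, q - side])] += 1
                    break
    return counts


# Hypothetical input: the second gene tree is missing taxon "E".
trees = [
    ((("A", "B"), "C"), ("D", "E")),
    (("A", "C"), ("B", "D")),
]
dist = quartet_distribution(trees)
```

In this toy input, every quartet containing "E" is supported by only one tree, so its entry in the distribution rests on less evidence than the quartet on A, B, C, D.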
Affiliation(s)
- Sazan Mahbub
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
  - Department of Computer Science, University of Maryland, College Park, Maryland, USA
- Shashata Sawmya
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
- Arpita Saha
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
- Rezwana Reaz
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
- M Sohel Rahman
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
- Md Shamsuzzoha Bayzid
  - Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
2
de Lima Ferreira P, Batista R, Andermann T, Groppo M, Bacon CD, Antonelli A. Target sequence capture of Barnadesioideae (Compositae) demonstrates the utility of low coverage loci in phylogenomic analyses. Mol Phylogenet Evol 2022; 169:107432. [DOI: 10.1016/j.ympev.2022.107432] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 05/18/2021] [Revised: 12/21/2021] [Accepted: 01/14/2022] [Indexed: 11/26/2022]
3
Li HT, Luo Y, Gan L, Ma PF, Gao LM, Yang JB, Cai J, Gitzendanner MA, Fritsch PW, Zhang T, Jin JJ, Zeng CX, Wang H, Yu WB, Zhang R, van der Bank M, Olmstead RG, Hollingsworth PM, Chase MW, Soltis DE, Soltis PS, Yi TS, Li DZ. Plastid phylogenomic insights into relationships of all flowering plant families. BMC Biol 2021; 19:232. [PMID: 34711223 PMCID: PMC8555322 DOI: 10.1186/s12915-021-01166-2] [Citation(s) in RCA: 83] [Impact Index Per Article: 27.7] [Received: 06/17/2021] [Accepted: 10/14/2021] [Indexed: 11/17/2022]
Abstract
BACKGROUND Flowering plants (angiosperms) are dominant components of global terrestrial ecosystems, but phylogenetic relationships at the familial level and above remain only partially resolved, greatly impeding our full understanding of their evolution and early diversification. The plastome, typically mapped as a circular genome, has been the most important molecular data source for plant phylogeny reconstruction for decades. RESULTS Here, we assembled by far the largest plastid dataset of angiosperms, composed of 80 genes from 4792 plastomes of 4660 species in 2024 genera representing all currently recognized families. Our phylogenetic tree (PPA II) is essentially congruent with those of previous plastid phylogenomic analyses but generally provides greater clade support. In the PPA II tree, 75% of nodes at or above the ordinal level and 78% at or above the familial level were resolved with high bootstrap support (BP ≥ 90). We obtained strong support for many interordinal and interfamilial relationships that were poorly resolved previously within the core eudicots, such as Dilleniales, Saxifragales, and Vitales being resolved as successive sisters to the remaining rosids, and Santalales, Berberidopsidales, and Caryophyllales as successive sisters to the asterids. However, the placement of magnoliids, although resolved as sister to all other Mesangiospermae, is not well supported and disagrees with topologies inferred from nuclear data. Relationships among the five major clades of Mesangiospermae remain intractable despite increased sampling, probably due to an ancient rapid radiation. CONCLUSIONS We provide the most comprehensive dataset of plastomes to date and a well-resolved phylogenetic tree, which together provide a strong foundation for future evolutionary studies of flowering plants.
Affiliation(s)
- Hong-Tao Li
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Yang Luo
  - CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Lu Gan
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Peng-Fei Ma
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Lian-Ming Gao
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Lijiang Forest Ecosystem National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, 674100, Yunnan, China
- Jun-Bo Yang
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Jie Cai
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Matthew A Gitzendanner
  - Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
  - Biodiversity Institute, University of Florida, Gainesville, FL, 32611, USA
- Peter W Fritsch
  - Botanical Research Institute of Texas, 1700 University Drive, Fort Worth, TX, 76017, USA
- Ting Zhang
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Jian-Jun Jin
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Department of Ecology, Evolution and Environmental Biology, Columbia University, New York, NY, 10025, USA
- Chun-Xia Zeng
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Hong Wang
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Wen-Bin Yu
  - Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, 666303, Yunnan, China
- Rong Zhang
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- Michelle van der Bank
  - Department of Botany & Plant Biotechnology, University of Johannesburg, PO Box 524, Auckland Park, Johannesburg, Gauteng, 2006, South Africa
- Richard G Olmstead
  - Department of Biology and Burke Museum, University of Washington, Seattle, WA, 98195-5325, USA
- Mark W Chase
  - Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3DS, England, UK
  - Department of Environment and Agriculture, Curtin University, Bentley, Western Australia, 6102, Australia
- Douglas E Soltis
  - Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
  - Biodiversity Institute, University of Florida, Gainesville, FL, 32611, USA
- Pamela S Soltis
  - Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
  - Biodiversity Institute, University of Florida, Gainesville, FL, 32611, USA
  - Department of Biology, University of Florida, Gainesville, FL, 32611, USA
- Ting-Shuang Yi
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- De-Zhu Li
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
  - CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
4
Thomas SK, Liu X, Du Z, Dong Y, Cummings A, Pokorny L, Xiang Q(J), Leebens‐Mack JH. Comprehending Cornales: phylogenetic reconstruction of the order using the Angiosperms353 probe set. Am J Bot 2021; 108:1112-1121. [PMID: 34263456 PMCID: PMC8361741 DOI: 10.1002/ajb2.1696] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 10/19/2020] [Accepted: 05/12/2021] [Indexed: 05/08/2023]
Abstract
PREMISE Cornales is an order of flowering plants containing ecologically and horticulturally important families, including Cornaceae (dogwoods) and Hydrangeaceae (hydrangeas), among others. While many relationships in Cornales are strongly supported by previous studies, some uncertainty remains with regards to the placement of Hydrostachyaceae and to relationships among families in Cornales and within Cornaceae. Here we analyzed hundreds of nuclear loci to test published phylogenetic hypotheses and estimated a robust species tree for Cornales. METHODS Using the Angiosperms353 probe set and existing data sets, we generated phylogenomic data for 158 samples, representing all families in the Cornales, with intensive sampling in the Cornaceae. RESULTS We curated an average of 312 genes per sample, constructed maximum likelihood gene trees, and inferred a species tree using the summary approach implemented in ASTRAL-III, a method statistically consistent with the multispecies coalescent model. CONCLUSIONS The species tree we constructed generally shows high support values and a high degree of concordance among individual nuclear gene trees. Relationships among families are largely congruent with previous molecular studies, except for the placement of the nyssoids and the Grubbiaceae-Curtisiaceae clades. Furthermore, we were able to place Hydrostachyaceae within Cornales, and within Cornaceae, the monophyly of known morphogroups was well supported. However, patterns of gene tree discordance suggest potential ancient reticulation, gene flow, and/or ILS in the Hydrostachyaceae lineage and the early diversification of Cornus. Our findings reveal new insights into the diversification process across Cornales and demonstrate the utility of the Angiosperms353 probe set.
Affiliation(s)
- Shawn K. Thomas
  - Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
  - Division of Biological Sciences, University of Missouri, Columbia, MO 65203, USA
- Xiang Liu
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27695, USA
  - Syngenta, Research Triangle Park, NC 27709, USA
- Zhi‐Yuan Du
  - Wuhan Botanical Garden, The Chinese Academy of Sciences, Wuhan, Hubei 430074, China
- Yibo Dong
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27695, USA
  - Global Health Infectious Disease Research, College of Public Health, University of South Florida, Tampa, FL 33612, USA
- Amanda Cummings
  - Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
- Lisa Pokorny
  - Royal Botanic Gardens, Kew, Richmond, London TW9 3AE, UK
  - Computational/Systems Biology and Genomics Program, Centre for Plant Biotechnology and Genomics, UPM‐INIA‐CSIC, Pozuelo de Alarcón (Madrid) 28223, Spain
- Qui‐Yun (Jenny) Xiang
  - Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27695, USA
5
Xu Z, Tian J, Rapanarivo SHJV, Letsara R, Rakotonasolo RA, Onjalalaina GE, Hu GW, Wang QF. Hydrostachys flabellifera (Hydrostachyaceae), a new species from Madagascar. PhytoKeys 2020; 167:45-56. [PMID: 33304118 PMCID: PMC7695676 DOI: 10.3897/phytokeys.167.58538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/10/2020] [Accepted: 11/06/2020] [Indexed: 06/12/2023]
Abstract
Hydrostachys flabellifera, a new species of Hydrostachyaceae found in a stream in Manandriana, Madagascar, is described and illustrated herein. It is similar to H. verruculosa and H. laciniata in morphology, but can be distinguished from them by its leaves with sparsely arranged, flabelliform and palmately parted emergences, obvious rachis and the pattern of segments arranged on the male bracts. Molecular phylogenetic analysis of the nuclear ribosomal internal transcribed spacer (ITS) dataset also provides robust support for it as a new species.
Affiliation(s)
- Zhun Xu
  - Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
  - Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Jing Tian
  - Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
  - Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
- Rokiman Letsara
  - Département Flore, Parc Botanique et Zoologique de Tsimbazaza, Antananarivo 101, Madagascar
- Rivontsoa A. Rakotonasolo
  - University of Chinese Academy of Sciences, Beijing 100049, China
  - Département Flore, Parc Botanique et Zoologique de Tsimbazaza, Antananarivo 101, Madagascar
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
- Guy E. Onjalalaina
  - Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
  - Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Guang-Wan Hu
  - Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
  - Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
- Qing-Feng Wang
  - Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
  - Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
6
Valencia-D J, Murillo-A J, Orozco CI, Parra-O C, Neubig KM. Complete plastid genome sequences of two species of the Neotropical genus Brunellia (Brunelliaceae). PeerJ 2020; 8:e8392. [PMID: 32025370 PMCID: PMC6993752 DOI: 10.7717/peerj.8392] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 07/23/2019] [Accepted: 12/13/2019] [Indexed: 11/20/2022]
Abstract
Here we present the first two complete plastid genomes for Brunelliaceae, a Neotropical family with a single genus, Brunellia. We surveyed the entire plastid genome in order to find variable cpDNA regions for further phylogenetic analyses across the family. We sampled morphologically different species, B. antioquensis and B. trianae, and found that the plastid genomes are 157,685 and 157,775 bp in length and display the typical quadripartite structure found in angiosperms. Despite the clear morphological distinction between both species, the molecular data show a very low level of divergence. The amount of nucleotide substitutions per site is one of the lowest reported to date among published congeneric studies (π = 0.00025). The plastid genomes have gene order and content coincident with other COM (Celastrales, Oxalidales, Malpighiales) relatives. Phylogenetic analyses of selected superrosid representatives show high bootstrap support for the ((C,M)O) topology. The N-fixing clade appears as the sister group of the COM clade and Zygophyllales as the sister to the rest of the fabids group.
Affiliation(s)
- Janice Valencia-D
  - School of Biological Sciences, Southern Illinois University at Carbondale, Carbondale, IL, United States of America
- José Murillo-A
  - Instituto de Ciencias Naturales, Universidad Nacional de Colombia, Bogotá D.C., Colombia
- Clara Inés Orozco
  - Instituto de Ciencias Naturales, Universidad Nacional de Colombia, Bogotá D.C., Colombia
- Carlos Parra-O
  - Instituto de Ciencias Naturales, Universidad Nacional de Colombia, Bogotá D.C., Colombia
- Kurt M. Neubig
  - School of Biological Sciences, Southern Illinois University at Carbondale, Carbondale, IL, United States of America
7
Bell D, Lin Q, Gerelle WK, Joya S, Chang Y, Taylor ZN, Rothfels CJ, Larsson A, Villarreal JC, Li FW, Pokorny L, Szövényi P, Crandall-Stotler B, DeGironimo L, Floyd SK, Beerling DJ, Deyholos MK, von Konrat M, Ellis S, Shaw AJ, Chen T, Wong GKS, Stevenson DW, Palmer JD, Graham SW. Organellomic data sets confirm a cryptic consensus on (unrooted) land-plant relationships and provide new insights into bryophyte molecular evolution. Am J Bot 2020; 107:91-115. [PMID: 31814117 DOI: 10.1002/ajb2.1397] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 06/04/2019] [Accepted: 11/04/2019] [Indexed: 06/10/2023]
Abstract
PREMISE Phylogenetic trees of bryophytes provide important evolutionary context for land plants. However, published inferences of overall embryophyte relationships vary considerably. We performed phylogenomic analyses of bryophytes and relatives using both mitochondrial and plastid gene sets, and investigated bryophyte plastome evolution. METHODS We employed diverse likelihood-based analyses to infer large-scale bryophyte phylogeny for mitochondrial and plastid data sets. We tested for changes in purifying selection in plastid genes of a mycoheterotrophic liverwort (Aneura mirabilis) and a putatively mycoheterotrophic moss (Buxbaumia), and compared 15 bryophyte plastomes for major structural rearrangements. RESULTS Overall land-plant relationships conflict across analyses, generally weakly. However, an underlying (unrooted) four-taxon tree is consistent across most analyses and published studies. Despite gene coverage patchiness, relationships within mosses, liverworts, and hornworts are largely congruent with previous studies, with plastid results generally better supported. Exclusion of RNA edit sites restores cases of unexpected non-monophyly to monophyly for Takakia and two hornwort genera. Relaxed purifying selection affects multiple plastid genes in mycoheterotrophic Aneura but not Buxbaumia. Plastid genome structure is nearly invariant across bryophytes, but the tufA locus, presumed lost in embryophytes, is unexpectedly retained in several mosses. CONCLUSIONS A common unrooted tree underlies embryophyte phylogeny, [(liverworts, mosses), (hornworts, vascular plants)]; rooting inconsistency across studies likely reflects substantial distance to algal outgroups. Analyses combining genomic and transcriptomic data may be misled locally for heavily RNA-edited taxa. The Buxbaumia plastome lacks hallmarks of relaxed selection found in mycoheterotrophic Aneura. Autotrophic bryophyte plastomes, including Buxbaumia, hardly vary in overall structure.
Affiliation(s)
- David Bell
  - Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
  - UBC Botanical Garden and Centre for Plant Research, University of British Columbia, 6804 Marine Drive SW, Vancouver, British Columbia, V6T 1Z4, Canada
  - Royal Botanic Garden, 20A Inverleith Row, Edinburgh, EH3 5LR, UK
- Qianshi Lin
  - Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
  - UBC Botanical Garden and Centre for Plant Research, University of British Columbia, 6804 Marine Drive SW, Vancouver, British Columbia, V6T 1Z4, Canada
- Wesley K Gerelle
  - Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
  - UBC Botanical Garden and Centre for Plant Research, University of British Columbia, 6804 Marine Drive SW, Vancouver, British Columbia, V6T 1Z4, Canada
- Steve Joya
  - Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
- Ying Chang
  - Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, 97331, USA
- Z Nathan Taylor
  - Department of Biology, Indiana University, Bloomington, Indiana, 47405, USA
- Carl J Rothfels
  - University Herbarium and Department of Integrative Biology, University of California Berkeley, Berkeley, California, 94702, USA
- Anders Larsson
  - Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Juan Carlos Villarreal
  - Department of Biology, Université Laval, Québec, G1V 0A6, Canada
  - Smithsonian Tropical Research Institute, Panama City, Panama
- Fay-Wei Li
  - Boyce Thompson Institute, Ithaca, New York, 14853, USA
  - Plant Biology Section, Cornell University, Ithaca, New York, 14853, USA
- Lisa Pokorny
  - Royal Botanic Gardens, Kew, Richmond, TW9 3DS, Surrey, UK
  - Centre for Plant Biotechnology and Genomics (CBGP, UPM-INIA), 28223, Pozuelo de Alarcón (Madrid), Spain
- Péter Szövényi
  - Department of Systematic and Evolutionary Botany, University of Zurich, Zollikerstrasse 107, 8008, Zurich, Switzerland
- Lisa DeGironimo
  - Department of Biology, College of Arts and Science, New York University, New York, New York, 10003, USA
- Sandra K Floyd
  - School of Biological Sciences, Monash University, Melbourne, Victoria, 3800, Australia
- David J Beerling
  - Department of Animal and Plant Sciences, University of Sheffield, Sheffield, S10 2TN, UK
- Michael K Deyholos
  - Department of Biology, University of British Columbia, Kelowna, British Columbia, V1V 1V7, Canada
- Matt von Konrat
  - Field Museum of Natural History, Chicago, Illinois, 60605, USA
- Shona Ellis
  - Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
- A Jonathan Shaw
  - Department of Biology, Duke University, Durham, North Carolina, 27708, USA
- Tao Chen
  - Shenzhen Fairy Lake Botanical Garden, Chinese Academy of Sciences, Shenzhen, Guangdong, 518004, China
- Gane K-S Wong
  - Department of Biological Sciences, University of Alberta, Edmonton, Alberta, T6G 2E9, Canada
  - Department of Medicine, University of Alberta, Edmonton, Alberta, T6G 2E1, Canada
  - BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
- Jeffrey D Palmer
  - Department of Biology, Indiana University, Bloomington, Indiana, 47405, USA
- Sean W Graham
  - Department of Botany, University of British Columbia, 6270 University Boulevard, Vancouver, British Columbia, V6T 1Z4, Canada
  - UBC Botanical Garden and Centre for Plant Research, University of British Columbia, 6804 Marine Drive SW, Vancouver, British Columbia, V6T 1Z4, Canada
8
Boreotropical range expansion and long-distance dispersal explain two amphi-Pacific tropical disjunctions in Sabiaceae. Mol Phylogenet Evol 2018; 124:181-191. [DOI: 10.1016/j.ympev.2018.03.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Received: 09/10/2017] [Revised: 03/05/2018] [Accepted: 03/06/2018] [Indexed: 11/15/2022]
9
Christensen S, Molloy EK, Vachaspati P, Warnow T. OCTAL: Optimal Completion of gene trees in polynomial time. Algorithms Mol Biol 2018; 13:6. [PMID: 29568323 PMCID: PMC5853121 DOI: 10.1186/s13015-018-0124-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Received: 10/25/2017] [Accepted: 03/06/2018] [Indexed: 12/16/2022]
Abstract
Background For a combination of reasons (including data generation protocols, approaches to taxon and gene sampling, and gene birth and loss), estimated gene trees are often incomplete, meaning that they do not contain all of the species of interest. As incomplete gene trees can impact downstream analyses, accurate completion of gene trees is desirable. Results We introduce the Optimal Tree Completion problem, a general optimization problem that involves completing an unrooted binary tree (i.e., adding missing leaves) so as to minimize its distance from a reference tree on a superset of the leaves. We present OCTAL, an algorithm that finds an optimal solution to this problem when the distance between trees is defined using the Robinson–Foulds (RF) distance, and we prove that OCTAL runs in $O(n^2)$ time, where n is the total number of species. We report on a simulation study in which gene trees can differ from the species tree due to incomplete lineage sorting, and estimated gene trees are completed using OCTAL with a reference tree based on a species tree estimated from the multi-locus dataset. OCTAL produces completed gene trees that are closer to the true gene trees than an existing heuristic approach in ASTRAL-II, but the accuracy of a completed gene tree computed by OCTAL depends on how topologically similar the reference tree (typically an estimated species tree) is to the true gene tree. Conclusions OCTAL is a useful technique for adding missing taxa to incomplete gene trees and provides good accuracy under a wide range of model conditions. However, results show that OCTAL’s accuracy can be reduced when incomplete lineage sorting is high, as the reference tree can be far from the true gene tree. Hence, this study suggests that OCTAL would benefit from using other types of reference trees instead of species trees when there are large topological distances between true gene trees and species trees. Electronic supplementary material The online version of this article (10.1186/s13015-018-0124-5) contains supplementary material, which is available to authorized users.
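The Robinson–Foulds (RF) distance that the Optimal Tree Completion problem minimizes can be illustrated with a short sketch. This is not the OCTAL algorithm; it only computes the RF distance between two trees on the same leaf set, counting the nontrivial bipartitions found in exactly one of them. The nested-tuple tree representation and the function names are assumptions made for illustration.

```python
def leaf_sets(tree, out):
    """Return the leaf set of a nested-tuple tree, appending each internal node's leaf set to out."""
    if isinstance(tree, str):
        return frozenset([tree])
    leaves = frozenset().union(*(leaf_sets(child, out) for child in tree))
    out.append(leaves)
    return leaves


def bipartitions(tree):
    """Nontrivial bipartitions of the unrooted tree, each stored as the side without the anchor leaf."""
    internal = []
    all_leaves = leaf_sets(tree, internal)
    anchor = min(all_leaves)
    result = set()
    for clade in internal:
        side = clade if anchor not in clade else all_leaves - clade
        # Keep only splits with at least two leaves on each side.
        if 2 <= len(side) <= len(all_leaves) - 2:
            result.add(side)
    return result


def rf_distance(t1, t2):
    """Number of bipartitions present in exactly one of the two trees."""
    return len(bipartitions(t1) ^ bipartitions(t2))


# Hypothetical five-taxon trees that differ in the placement of B and C.
t1 = ((("A", "B"), "C"), ("D", "E"))
t2 = ((("A", "C"), "B"), ("D", "E"))
d = rf_distance(t1, t2)
```

A completion procedure in this setting would add the missing leaves to an incomplete tree so that this count, measured against the reference tree, is as small as possible.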
10
Zhao L, Li X, Zhang N, Zhang SD, Yi TS, Ma H, Guo ZH, Li DZ. Phylogenomic analyses of large-scale nuclear genes provide new insights into the evolutionary relationships within the rosids. Mol Phylogenet Evol 2016; 105:166-176. [DOI: 10.1016/j.ympev.2016.06.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Received: 11/28/2015] [Revised: 06/06/2016] [Accepted: 06/27/2016] [Indexed: 12/28/2022]
11
Biogeography and diversification of Brassicales: A 103 million year tale. Mol Phylogenet Evol 2016; 99:204-224. [PMID: 26993763 DOI: 10.1016/j.ympev.2016.02.021] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Received: 07/22/2015] [Revised: 02/24/2016] [Accepted: 02/25/2016] [Indexed: 11/23/2022]
Abstract
Brassicales is a diverse order perhaps most famous because it houses Brassicaceae and its premier member, Arabidopsis thaliana. This widely distributed and species-rich lineage has been overlooked as a promising system to investigate patterns of disjunct distributions and diversification rates. We analyzed plastid and mitochondrial sequence data from five gene regions (>8000 bp) across 151 taxa to: (1) produce a chronogram for major lineages in Brassicales, including Brassicaceae and Arabidopsis, based on greater taxon sampling across the order and previously overlooked fossil evidence, (2) examine biogeographical ancestral range estimations and disjunct distributions in BioGeoBEARS, and (3) determine where shifts in species diversification occur using BAMM. The evolution and radiation of the Brassicales began 103 Mya and was linked to a series of inter-continental vicariant, long-distance dispersal, and land bridge migration events. North America appears to be a significant area for early stem lineages in the order. Shifts to Australia then Africa are evident at nodes near the core Brassicales, which diverged 68.5 Mya (HPD = 75.6-62.0). This estimated age, combined with fossil evidence, indicates that some New World clades embedded amongst Old World relatives (e.g., New World capparoids) are the result of different long-distance dispersal events, whereas others may be best explained by land bridge migration (e.g., Forchhammeria). Based on these analyses, the Brassicaceae crown group diverged in Europe/Northern Africa in the Eocene, circa 43.4 Mya (HPD = 46.6-40.3), and Arabidopsis separated from close congeners circa 10.4 Mya. These ages fall between divergence dates that were previously published, suggesting we are slowly converging on a robust age estimate for the family. Three significant shifts in species diversification are observed in the order: (1) 58 Mya at the crown of Capparaceae, Cleomaceae and Brassicaceae, (2) 38 Mya at the crown of the Resedaceae+Stixis clade, and (3) 21 Mya at the crown of the tribes Brassiceae and Sisymbrieae within Brassicaceae.
|
12
|
Crowl AA, Miles NW, Visger CJ, Hansen K, Ayers T, Haberle R, Cellinese N. A global perspective on Campanulaceae: Biogeographic, genomic, and floral evolution. AMERICAN JOURNAL OF BOTANY 2016; 103:233-45. [PMID: 26865121 DOI: 10.3732/ajb.1500450] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Received: 10/26/2015] [Accepted: 01/04/2016] [Indexed: 05/12/2023]
Abstract
PREMISE OF THE STUDY: The Campanulaceae are a diverse clade of flowering plants encompassing more than 2300 species in myriad habitats from tropical rainforests to arctic tundra. A robust, multigene phylogeny, including all major lineages, is presented to provide a broad, evolutionary perspective of this cosmopolitan clade. METHODS: We used a phylogenetic framework, in combination with divergence dating, ancestral range estimation, chromosome modeling, and morphological character reconstruction analyses, to infer phylogenetic placement and timing of major biogeographic, genomic, and morphological changes in the history of the group and provide insights into the diversification of this clade across six continents. KEY RESULTS: Ancestral range estimation supports an out-of-Africa diversification following the Cretaceous-Tertiary extinction event. Chromosomal modeling, with corroboration from the distribution of synonymous substitutions among gene duplicates, provides evidence for as many as 20 genome-wide duplication events before large radiations. Morphological reconstructions support the hypothesis that switches in floral symmetry and anther dehiscence were important in the evolution of secondary pollen presentation mechanisms. CONCLUSIONS: This study provides a broad, phylogenetic perspective on the evolution of the Campanulaceae clade. The remarkable habitat diversity and cosmopolitan distribution of this lineage appear to be the result of a complex history of genome duplications and numerous long-distance dispersal events. We failed to find evidence for an ancestral polyploidy event for this clade, and our analyses indicate an ancestral base number of nine for the group. This study will serve as a framework for future studies in diverse areas of research in Campanulaceae.
Affiliation(s)
- Andrew A Crowl
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA; Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Nicholas W Miles
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
- Clayton J Visger
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA; Department of Biology, University of Florida, Gainesville, Florida 32611 USA
- Kimberly Hansen
- Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona 86011 USA
- Tina Ayers
- Department of Biological Sciences, Northern Arizona University, Flagstaff, Arizona 86011 USA
- Rosemarie Haberle
- Biology Department, Pacific Lutheran University, Tacoma, Washington 98447 USA
- Nico Cellinese
- Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611 USA
|
13
|
Goodheart JA, Bazinet AL, Collins AG, Cummings MP. Relationships within Cladobranchia (Gastropoda: Nudibranchia) based on RNA-Seq data: an initial investigation. ROYAL SOCIETY OPEN SCIENCE 2015; 2:150196. [PMID: 26473045 PMCID: PMC4593679 DOI: 10.1098/rsos.150196] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Received: 05/07/2015] [Accepted: 08/26/2015] [Indexed: 05/28/2023]
Abstract
Cladobranchia (Gastropoda: Nudibranchia) is a diverse (approx. 1000 species) but understudied group of sea slug molluscs. In order to fully comprehend the diversity of nudibranchs and the evolution of character traits within Cladobranchia, a solid understanding of evolutionary relationships is necessary. To date, only two direct attempts have been made to understand the evolutionary relationships within Cladobranchia, neither of which resulted in well-supported phylogenetic hypotheses. In addition to these studies, several others have addressed some of the relationships within this clade while investigating the evolutionary history of more inclusive groups (Nudibranchia and Euthyneura). However, all of the resulting phylogenetic hypotheses contain conflicting topologies within Cladobranchia. In this study, we address some of these long-standing issues regarding the evolutionary history of Cladobranchia using RNA-Seq data (transcriptomes). We sequenced 16 transcriptomes and combined these with four transcriptomes from the NCBI Sequence Read Archive. Transcript assembly using Trinity and orthology determination using HaMStR yielded 839 orthologous groups for analysis. These data provide a well-supported and almost fully resolved phylogenetic hypothesis for Cladobranchia. Our results support the monophyly of Cladobranchia and the sub-clade Aeolidida, but reject the monophyly of Dendronotida.
Affiliation(s)
- Jessica A. Goodheart
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
- NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC 20013, USA
- Adam L. Bazinet
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
- Allen G. Collins
- NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC 20013, USA
- Michael P. Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
|
14
|
Lam VKY, Soto Gomez M, Graham SW. The Highly Reduced Plastome of Mycoheterotrophic Sciaphila (Triuridaceae) Is Colinear with Its Green Relatives and Is under Strong Purifying Selection. Genome Biol Evol 2015; 105:480-494. [PMID: 26170229 DOI: 10.1002/ajb2.1070] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 10/10/2017] [Accepted: 02/02/2018] [Indexed: 05/03/2023]
Abstract
The enigmatic monocot family Triuridaceae provides a potentially useful model system for studying the effects of an ancient loss of photosynthesis on the plant plastid genome, as all of its members are mycoheterotrophic and achlorophyllous. However, few studies have placed the family in a comparative context, and its phylogenetic placement is only partly resolved. It was also unclear whether any taxa in this family have retained a plastid genome. Here, we used genome survey sequencing to retrieve plastid genome data for Sciaphila densiflora (Triuridaceae) and ten autotrophic relatives in the orders Dioscoreales and Pandanales. We recovered a highly reduced plastome for Sciaphila that is nearly colinear with Carludovica palmata, a photosynthetic relative that belongs to its sister group in Pandanales, Cyclanthaceae-Pandanaceae. This phylogenetic placement is well supported and robust to a broad range of analytical assumptions in maximum-likelihood inference, and is congruent with recent findings based on nuclear and mitochondrial evidence. The 28 genes retained in the S. densiflora plastid genome are involved in translation and other nonphotosynthetic functions, and we demonstrate that nearly all of the 18 protein-coding genes are under strong purifying selection. Our study confirms the utility of whole plastid genome data in phylogenetic studies of highly modified heterotrophic plants, even when they have substantially elevated rates of substitution.
Affiliation(s)
- Vivienne K Y Lam
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada; UBC Botanical Garden & Centre for Plant Research, University of British Columbia, Vancouver, British Columbia, Canada
- Marybel Soto Gomez
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada; UBC Botanical Garden & Centre for Plant Research, University of British Columbia, Vancouver, British Columbia, Canada
- Sean W Graham
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada; UBC Botanical Garden & Centre for Plant Research, University of British Columbia, Vancouver, British Columbia, Canada
|
15
|
Wikström N, Kainulainen K, Razafimandimbison SG, Smedmark JEE, Bremer B. A revised time tree of the asterids: establishing a temporal framework for evolutionary studies of the coffee family (Rubiaceae). PLoS One 2015; 10:e0126690. [PMID: 25996595 PMCID: PMC4462594 DOI: 10.1371/journal.pone.0126690] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Received: 12/09/2014] [Accepted: 04/07/2015] [Indexed: 11/19/2022]
Abstract
Divergence time analyses in the coffee family (Rubiaceae) have all relied on the same Gentianales crown group age estimate, reported by an earlier analysis of the asterids, for defining the upper age bound of the root node in their analyses. However, not only did the asterid analysis suffer from several analytical shortcomings, but the estimate itself has been used in highly inconsistent ways in these Rubiaceae analyses. Based on the original data, we here reanalyze the divergence times of the asterids using relaxed-clock models and 14 fossil-based minimum age constraints. We also expand the data set to include an additional 67 taxa from Rubiaceae sampled across all three subfamilies recognized in the family. Three analyses are conducted: a separate analysis of the asterids, which completely mirrors the original asterid analysis in terms of taxon sample and data; a separate analysis of the Gentianales, where the result from the first analysis is used for defining a secondary root calibration point; and a combined analysis where all taxa are analyzed simultaneously. Results are presented in the form of a time-calibrated phylogeny, and age estimates for asterid groups, Gentianales, and major groups of Rubiaceae are compared and discussed in relation to previously published estimates. Our updated age estimates for major groups of Rubiaceae provide a significant step forward towards the long term goal of establishing a robust temporal framework for the divergence of this biologically diverse and fascinating group of plants.
Affiliation(s)
- Niklas Wikström
- Bergius Foundation, The Royal Swedish Academy of Sciences and Department of Ecology, Environment and Plant Sciences, Stockholm University, SE-10691, Stockholm, Sweden
- Kent Kainulainen
- Bergius Foundation, The Royal Swedish Academy of Sciences and Department of Ecology, Environment and Plant Sciences, Stockholm University, SE-10691, Stockholm, Sweden
- Sylvain G. Razafimandimbison
- Bergius Foundation, The Royal Swedish Academy of Sciences and Department of Ecology, Environment and Plant Sciences, Stockholm University, SE-10691, Stockholm, Sweden
- Jenny E. E. Smedmark
- University of Bergen, University Museum of Bergen, The Natural History Collections, Post Box 7800, NO-5020 Bergen, Norway
- Birgitta Bremer
- Bergius Foundation, The Royal Swedish Academy of Sciences and Department of Ecology, Environment and Plant Sciences, Stockholm University, SE-10691, Stockholm, Sweden
|
16
|
Zheng Y, Wiens JJ. Do missing data influence the accuracy of divergence-time estimation with BEAST? Mol Phylogenet Evol 2015; 85:41-9. [PMID: 25681677 DOI: 10.1016/j.ympev.2015.02.002] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Received: 10/08/2014] [Revised: 01/26/2015] [Accepted: 02/01/2015] [Indexed: 10/24/2022]
Abstract
Time-calibrated phylogenies have become essential to evolutionary biology. A recurrent and unresolved question for dating analyses is whether genes with missing data cells should be included or excluded. This issue is particularly unclear for the most widely used dating method, the uncorrelated lognormal approach implemented in BEAST. Here, we test the robustness of this method to missing data. We compare divergence-time estimates from a nearly complete dataset (20 nuclear genes for 32 species of squamate reptiles) to those from subsampled matrices, including those with 5 or 2 complete loci only and those with 5 or 8 incomplete loci added. In general, missing data had little impact on estimated dates (mean error of ∼5Myr per node or less, given an overall age of ∼220Myr in squamates), even when 80% of sampled genes had 75% missing data. Mean errors were somewhat higher when all genes were 75% incomplete (∼17Myr). However, errors increased dramatically when only 2 of 9 fossil calibration points were included (∼40Myr), regardless of missing data. Overall, missing data (and even numbers of genes sampled) may have only minor impacts on the accuracy of divergence dating with BEAST, relative to the dramatic effects of fossil calibrations.
Affiliation(s)
- Yuchi Zheng
- Department of Herpetology, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China; Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-088, USA.
- John J Wiens
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-088, USA.
|
17
|
Bromham L, Hua X, Lanfear R, Cowman PF. Exploring the Relationships between Mutation Rates, Life History, Genome Size, Environment, and Species Richness in Flowering Plants. Am Nat 2015; 185:507-24. [PMID: 25811085 DOI: 10.1086/680052] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Indexed: 11/03/2022]
Abstract
A new view is emerging of the interplay between mutation at the genomic level, substitution at the population level, and diversification at the lineage level. Many studies have suggested that rate of molecular evolution is linked to rate of diversification, but few have evaluated competing hypotheses. By analyzing sequences from 130 families of angiosperms, we show that variation in the synonymous substitution rate is correlated among genes from the mitochondrial, chloroplast, and nuclear genomes and linked to differences in traits among families (average height and genome size). Within each genome, synonymous rates are correlated to nonsynonymous substitution rates, suggesting that increasing the mutation rate results in a faster rate of genome evolution. Substitution rates are correlated with species richness in protein-coding sequences from the chloroplast and nuclear genomes. These data suggest that species traits contribute to lineage-specific differences in the mutation rate that drive both synonymous and nonsynonymous rates of change across all three genomes, which in turn contribute to greater rates of divergence between populations, generating higher rates of diversification. These observations link mutation in individuals to population-level processes and to patterns of lineage divergence.
Affiliation(s)
- Lindell Bromham
- Centre for Macroevolution and Macroecology, Division of Evolution, Ecology and Genetics, Research School of Biology, Australian National University, Canberra, Australian Capital Territory 0200, Australia
|
18
|
Hileman LC. Trends in flower symmetry evolution revealed through phylogenetic and developmental genetic advances. Philos Trans R Soc Lond B Biol Sci 2015; 369:rstb.2013.0348. [PMID: 24958922 DOI: 10.1098/rstb.2013.0348] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Indexed: 11/12/2022]
Abstract
A striking aspect of flowering plant (angiosperm) diversity is variation in flower symmetry. From an ancestral form of radial symmetry (polysymmetry, actinomorphy), multiple evolutionary transitions have contributed to instances of non-radial forms, including bilateral symmetry (monosymmetry, zygomorphy) and asymmetry. Advances in flowering plant molecular phylogenetic research and studies of character evolution as well as detailed flower developmental genetic studies in a few model species (e.g. Antirrhinum majus, snapdragon) have provided a foundation for deep insights into flower symmetry evolution. From phylogenetic studies, we have a better understanding of where during flowering plant diversification transitions from radial to bilateral flower symmetry (and back to radial symmetry) have occurred. From developmental studies, we know that a genetic programme largely dependent on the functional action of the CYCLOIDEA gene is necessary for differentiation along the snapdragon dorsoventral flower axis. Bringing these two lines of inquiry together has provided surprising insights into both the parallel recruitment of a CYC-dependent developmental programme during independent transitions to bilateral flower symmetry, and the modifications to this programme in transitions back to radial flower symmetry, during flowering plant evolution.
Affiliation(s)
- Lena C Hileman
- Ecology and Evolutionary Biology, University of Kansas, 1200 Sunnyside Avenue, Lawrence, KS 66045, USA
|
19
|
Sun M, Soltis DE, Soltis PS, Zhu X, Burleigh JG, Chen Z. Deep phylogenetic incongruence in the angiosperm clade Rosidae. Mol Phylogenet Evol 2015; 83:156-66. [DOI: 10.1016/j.ympev.2014.11.003] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Received: 06/10/2014] [Revised: 11/01/2014] [Accepted: 11/05/2014] [Indexed: 10/24/2022]
|
20
|
Wickett NJ, Mirarab S, Nguyen N, Warnow T, Carpenter E, Matasci N, Ayyampalayam S, Barker MS, Burleigh JG, Gitzendanner MA, Ruhfel BR, Wafula E, Der JP, Graham SW, Mathews S, Melkonian M, Soltis DE, Soltis PS, Miles NW, Rothfels CJ, Pokorny L, Shaw AJ, DeGironimo L, Stevenson DW, Surek B, Villarreal JC, Roure B, Philippe H, dePamphilis CW, Chen T, Deyholos MK, Baucom RS, Kutchan TM, Augustin MM, Wang J, Zhang Y, Tian Z, Yan Z, Wu X, Sun X, Wong GKS, Leebens-Mack J. Phylotranscriptomic analysis of the origin and early diversification of land plants. Proc Natl Acad Sci U S A 2014; 111:E4859-68. [PMID: 25355905 PMCID: PMC4234587 DOI: 10.1073/pnas.1323926111] [Citation(s) in RCA: 767] [Impact Index Per Article: 76.7] [Indexed: 01/17/2023]
Abstract
Reconstructing the origin and evolution of land plants and their algal relatives is a fundamental problem in plant phylogenetics, and is essential for understanding how critical adaptations arose, including the embryo, vascular tissue, seeds, and flowers. Despite advances in molecular systematics, some hypotheses of relationships remain weakly resolved. Inferring deep phylogenies with bouts of rapid diversification can be problematic; however, genome-scale data should significantly increase the number of informative characters for analyses. Recent phylogenomic reconstructions focused on the major divergences of plants have resulted in promising but inconsistent results. One limitation is sparse taxon sampling, likely resulting from the difficulty and cost of data generation. To address this limitation, transcriptome data for 92 streptophyte taxa were generated and analyzed along with 11 published plant genome sequences. Phylogenetic reconstructions were conducted using up to 852 nuclear genes and 1,701,170 aligned sites. Sixty-nine analyses were performed to test the robustness of phylogenetic inferences to permutations of the data matrix or to phylogenetic method, including supermatrix, supertree, and coalescent-based approaches, maximum-likelihood and Bayesian methods, partitioned and unpartitioned analyses, and amino acid versus DNA alignments. Among other results, we find robust support for a sister-group relationship between land plants and one group of streptophyte green algae, the Zygnematophyceae. Strong and robust support for a clade comprising liverworts and mosses is inconsistent with a widely accepted view of early land plant evolution, and suggests that phylogenetic hypotheses used to understand the evolution of fundamental plant traits should be reevaluated.
Affiliation(s)
- Norman J Wickett
- Chicago Botanic Garden, Glencoe, IL 60022; Program in Biological Sciences, Northwestern University, Evanston, IL 60208
- Siavash Mirarab
- Department of Computer Science, University of Texas, Austin, TX 78712
- Nam Nguyen
- Department of Computer Science, University of Texas, Austin, TX 78712
- Tandy Warnow
- Department of Computer Science, University of Texas, Austin, TX 78712
- Eric Carpenter
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada T6G 2E9
- Naim Matasci
- iPlant Collaborative, Tucson, AZ 85721; Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721
- Michael S Barker
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721
- Matthew A Gitzendanner
- Department of Biology and Genetics Institute, University of Florida, Gainesville, FL 32611
- Brad R Ruhfel
- Department of Biology and Department of Biological Sciences, Eastern Kentucky University, Richmond, KY 40475; Florida Museum of Natural History, Gainesville, FL 32611
- Eric Wafula
- Department of Biology, Pennsylvania State University, University Park, PA 16803
- Joshua P Der
- Department of Biology, Pennsylvania State University, University Park, PA 16803
- Sarah Mathews
- Arnold Arboretum of Harvard University, Cambridge, MA 02138
- Douglas E Soltis
- Department of Biology and Genetics Institute, University of Florida, Gainesville, FL 32611; Florida Museum of Natural History, Gainesville, FL 32611
- Pamela S Soltis
- Department of Biology and Genetics Institute, University of Florida, Gainesville, FL 32611; Florida Museum of Natural History, Gainesville, FL 32611
- Carl J Rothfels
- Department of Biology, Duke University, Durham, NC 27708; Department of Zoology, University of British Columbia, Vancouver, BC, Canada V6T 1Z4
- Lisa Pokorny
- Department of Biology, Duke University, Durham, NC 27708; Department of Biodiversity and Conservation, Real Jardín Botánico-Consejo Superior de Investigaciones Cientificas, 28014 Madrid, Spain
- Barbara Surek
- Botanical Institute, Universität zu Köln, Cologne D-50674, Germany
- Juan Carlos Villarreal
- Department fur Biologie, Systematische Botanik und Mykologie, Ludwig-Maximilians-Universitat, 80638 Munich, Germany
- Béatrice Roure
- Département de Biochimie, Centre Robert-Cedergren, Université de Montréal, Succursale Centre-Ville, Montreal, QC, Canada H3C 3J7
- Hervé Philippe
- Département de Biochimie, Centre Robert-Cedergren, Université de Montréal, Succursale Centre-Ville, Montreal, QC, Canada H3C 3J7; CNRS, Station d' Ecologie Expérimentale du CNRS, Moulis, 09200, France
- Tao Chen
- Shenzhen Fairy Lake Botanical Garden, The Chinese Academy of Sciences, Shenzhen, Guangdong 518004, China
- Michael K Deyholos
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada T6G 2E9
- Regina S Baucom
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109
- Toni M Kutchan
- Donald Danforth Plant Science Center, St. Louis, MO 63132
- Jun Wang
- BGI-Shenzhen, Bei shan Industrial Zone, Yantian District, Shenzhen 518083, China
- Yong Zhang
- CNRS, Station d' Ecologie Expérimentale du CNRS, Moulis, 09200, France
- Zhijian Tian
- BGI-Shenzhen, Bei shan Industrial Zone, Yantian District, Shenzhen 518083, China
- Zhixiang Yan
- BGI-Shenzhen, Bei shan Industrial Zone, Yantian District, Shenzhen 518083, China
- Xiaolei Wu
- BGI-Shenzhen, Bei shan Industrial Zone, Yantian District, Shenzhen 518083, China
- Xiao Sun
- BGI-Shenzhen, Bei shan Industrial Zone, Yantian District, Shenzhen 518083, China
- Gane Ka-Shu Wong
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada T6G 2E9; BGI-Shenzhen, Bei shan Industrial Zone, Yantian District, Shenzhen 518083, China; Department of Medicine, University of Alberta, Edmonton, AB, Canada T6G 2E1
|
21
|
Misof B, Liu S, Meusemann K, Peters RS, Donath A, Mayer C, Frandsen PB, Ware J, Flouri T, Beutel RG, Niehuis O, Petersen M, Izquierdo-Carrasco F, Wappler T, Rust J, Aberer AJ, Aspock U, Aspock H, Bartel D, Blanke A, Berger S, Bohm A, Buckley TR, Calcott B, Chen J, Friedrich F, Fukui M, Fujita M, Greve C, Grobe P, Gu S, Huang Y, Jermiin LS, Kawahara AY, Krogmann L, Kubiak M, Lanfear R, Letsch H, Li Y, Li Z, Li J, Lu H, Machida R, Mashimo Y, Kapli P, McKenna DD, Meng G, Nakagaki Y, Navarrete-Heredia JL, Ott M, Ou Y, Pass G, Podsiadlowski L, Pohl H, von Reumont BM, Schutte K, Sekiya K, Shimizu S, Slipinski A, Stamatakis A, Song W, Su X, Szucsich NU, Tan M, Tan X, Tang M, Tang J, Timelthaler G, Tomizuka S, Trautwein M, Tong X, Uchifune T, Walzl MG, Wiegmann BM, Wilbrandt J, Wipfler B, Wong TKF, Wu Q, Wu G, Xie Y, Yang S, Yang Q, Yeates DK, Yoshizawa K, Zhang Q, Zhang R, Zhang W, Zhang Y, Zhao J, Zhou C, Zhou L, Ziesmann T, Zou S, Li Y, Xu X, Zhang Y, Yang H, Wang J, Wang J, Kjer KM, Zhou X. Phylogenomics resolves the timing and pattern of insect evolution. Science 2014; 346:763-7. [DOI: 10.1126/science.1257570] [Citation(s) in RCA: 1672] [Impact Index Per Article: 167.2] [Indexed: 02/06/2023]
|
22
|
Jiang W, Chen SY, Wang H, Li DZ, Wiens JJ. Should genes with missing data be excluded from phylogenetic analyses? Mol Phylogenet Evol 2014; 80:308-18. [DOI: 10.1016/j.ympev.2014.08.006] [Citation(s) in RCA: 83] [Impact Index Per Article: 8.3] [Received: 01/25/2014] [Revised: 07/15/2014] [Accepted: 08/03/2014] [Indexed: 10/24/2022]
|
23
|
The Evolution of Reproduction-Related NLRP Genes. J Mol Evol 2014; 78:194-201. [DOI: 10.1007/s00239-014-9614-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Received: 01/17/2013] [Accepted: 02/19/2014] [Indexed: 12/23/2022]
|
24
|
Massoni J, Forest F, Sauquet H. Increased sampling of both genes and taxa improves resolution of phylogenetic relationships within Magnoliidae, a large and early-diverging clade of angiosperms. Mol Phylogenet Evol 2014; 70:84-93. [DOI: 10.1016/j.ympev.2013.09.010] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Received: 12/11/2012] [Revised: 08/30/2013] [Accepted: 09/11/2013] [Indexed: 11/25/2022]
|
25
|
Taller plants have lower rates of molecular evolution. Nat Commun 2013; 4:1879. [DOI: 10.1038/ncomms2836] [Citation(s) in RCA: 149] [Impact Index Per Article: 13.5] [Received: 08/20/2012] [Accepted: 04/05/2013] [Indexed: 01/20/2023]
|
26
|
Phylogenomics and a posteriori data partitioning resolve the Cretaceous angiosperm radiation Malpighiales. Proc Natl Acad Sci U S A 2012; 109:17519-24. [PMID: 23045684 DOI: 10.1073/pnas.1205818109] [Citation(s) in RCA: 198] [Impact Index Per Article: 16.5] [Indexed: 11/18/2022]
Abstract
The angiosperm order Malpighiales includes ~16,000 species and constitutes up to 40% of the understory tree diversity in tropical rain forests. Despite remarkable progress in angiosperm systematics during the last 20 y, relationships within Malpighiales remain poorly resolved, possibly owing to its rapid rise during the mid-Cretaceous. Using phylogenomic approaches, including analyses of 82 plastid genes from 58 species, we identified 12 additional clades in Malpighiales and substantially increased resolution along the backbone. This greatly improved phylogeny revealed a dynamic history of shifts in net diversification rates across Malpighiales, with bursts of diversification noted in the Barbados cherries (Malpighiaceae), cocas (Erythroxylaceae), and passion flowers (Passifloraceae). We found that commonly used a priori approaches for partitioning concatenated data in maximum likelihood analyses, by gene or by codon position, performed poorly relative to the use of partitions identified a posteriori using a Bayesian mixture model. We also found better branch support in trees inferred from a taxon-rich, data-sparse matrix, which deeply sampled only the phylogenetically critical placeholders, than in trees inferred from a taxon-sparse matrix with little missing data. Although this matrix has more missing data, our a posteriori partitioning strategy reduced the possibility of producing multiple distinct but equally optimal topologies and increased phylogenetic decisiveness, compared with the strategy of partitioning by gene. These approaches are likely to help improve phylogenetic resolution in other poorly resolved major clades of angiosperms and to be more broadly useful in studies across the Tree of Life.
|
27
|
Phylogenetics and evolution of host-plant use in leaf-mining sawflies (Hymenoptera: Tenthredinidae: Heterarthrinae). Mol Phylogenet Evol 2012; 64:331-41. [DOI: 10.1016/j.ympev.2012.04.005] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Received: 10/27/2011] [Revised: 03/21/2012] [Accepted: 04/06/2012] [Indexed: 11/17/2022]
|
28
|
Rothfels CJ, Larsson A, Kuo LY, Korall P, Chiou WL, Pryer KM. Overcoming Deep Roots, Fast Rates, and Short Internodes to Resolve the Ancient Rapid Radiation of Eupolypod II Ferns. Syst Biol 2012; 61:490-509. [DOI: 10.1093/sysbio/sys001] [Citation(s) in RCA: 96] [Impact Index Per Article: 8.0] [Indexed: 11/13/2022]
Affiliation(s)
- Carl J. Rothfels
- Department of Biology, Duke University, Box 90338, Durham, NC 27708, USA
- Anders Larsson
- Systematic Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
- Li-Yaung Kuo
- Institute of Ecology and Evolutionary Biology, National Taiwan University, No. 1, Section 4, Roosevelt Road, Taipei 10617, Taiwan
- Petra Korall
- Systematic Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
- Wen-Liang Chiou
- Botanical Garden Division, Taiwan Forestry Research Institute, 53 Nan-hai Road, Taipei 10066, Taiwan
- Kathleen M. Pryer
- Department of Biology, Duke University, Box 90338, Durham, NC 27708, USA
|
29
|
Sauquet H, Ho SYW, Gandolfo MA, Jordan GJ, Wilf P, Cantrill DJ, Bayly MJ, Bromham L, Brown GK, Carpenter RJ, Lee DM, Murphy DJ, Sniderman JMK, Udovicic F. Testing the Impact of Calibration on Molecular Divergence Times Using a Fossil-Rich Group: The Case of Nothofagus (Fagales). Syst Biol 2011; 61:289-313. [DOI: 10.1093/sysbio/syr116] [Citation(s) in RCA: 296] [Impact Index Per Article: 22.8] [Indexed: 11/15/2022] Open
Affiliation(s)
- Hervé Sauquet
  - Laboratoire Écologie, Systématique, Évolution, Université Paris-Sud, CNRS UMR 8079, 91405 Orsay, France
- Simon Y. W. Ho
  - Centre for Macroevolution and Macroecology, Research School of Biology, Australian National University, Canberra, ACT 0200, Australia
  - School of Biological Sciences, University of Sydney, Sydney, NSW 2006, Australia
- Maria A. Gandolfo
  - L.H. Bailey Hortorium, Department of Plant Biology, Cornell University, Ithaca, NY 14853, USA
- Gregory J. Jordan
  - School of Plant Science, University of Tasmania, Private bag 55, Hobart, TAS 7001, Australia
- Peter Wilf
  - Department of Geosciences, Pennsylvania State University, University Park, PA 16802, USA
- David J. Cantrill
  - National Herbarium of Victoria, Royal Botanic Gardens Melbourne, Private Bag 2000, South Yarra, VIC 3141, Australia
- Michael J. Bayly
  - School of Botany, The University of Melbourne, Melbourne, VIC 3010, Australia
- Lindell Bromham
  - Centre for Macroevolution and Macroecology, Research School of Biology, Australian National University, Canberra, ACT 0200, Australia
- Gillian K. Brown
  - National Herbarium of Victoria, Royal Botanic Gardens Melbourne, Private Bag 2000, South Yarra, VIC 3141, Australia
  - School of Botany, The University of Melbourne, Melbourne, VIC 3010, Australia
- Raymond J. Carpenter
  - Department of Ecology and Environmental Biology, School of Earth and Environmental Sciences, University of Adelaide, Adelaide, SA 5005, Australia
- Daphne M. Lee
  - Department of Geology, University of Otago, PO Box 56, Dunedin 9054, New Zealand
- Daniel J. Murphy
  - National Herbarium of Victoria, Royal Botanic Gardens Melbourne, Private Bag 2000, South Yarra, VIC 3141, Australia
- J. M. Kale Sniderman
  - School of Geography and Environmental Science, Monash University, Melbourne, VIC 3800, Australia
- Frank Udovicic
  - National Herbarium of Victoria, Royal Botanic Gardens Melbourne, Private Bag 2000, South Yarra, VIC 3141, Australia
|
30
|
Lee EK, Cibrian-Jaramillo A, Kolokotronis SO, Katari MS, Stamatakis A, Ott M, Chiu JC, Little DP, Stevenson DW, McCombie WR, Martienssen RA, Coruzzi G, DeSalle R. A functional phylogenomic view of the seed plants. PLoS Genet 2011; 7:e1002411. [PMID: 22194700 PMCID: PMC3240601 DOI: 10.1371/journal.pgen.1002411] [Citation(s) in RCA: 123] [Impact Index Per Article: 9.5] [Received: 10/19/2010] [Accepted: 10/21/2011] [Indexed: 12/01/2022] Open
Abstract
A novel result of the current research is the development and implementation of a unique functional phylogenomic approach that explores the genomic origins of seed plant diversification. We first use 22,833 sets of orthologs from the nuclear genomes of 101 genera across land plants to reconstruct their phylogenetic relationships. One of the more salient results is the resolution of some enigmatic relationships in seed plant phylogeny, such as the placement of Gnetales as sister to the rest of the gymnosperms. In using this novel phylogenomic approach, we were also able to identify overrepresented functional gene ontology categories in genes that provide positive branch support for major nodes prompting new hypotheses for genes associated with the diversification of angiosperms. For example, RNA interference (RNAi) has played a significant role in the divergence of monocots from other angiosperms, which has experimental support in Arabidopsis and rice. This analysis also implied that the second largest subunit of RNA polymerase IV and V (NRPD2) played a prominent role in the divergence of gymnosperms. This hypothesis is supported by the lack of 24nt siRNA in conifers, the maternal control of small RNA in the seeds of flowering plants, and the emergence of double fertilization in angiosperms. Our approach takes advantage of genomic data to define orthologs, reconstruct relationships, and narrow down candidate genes involved in plant evolution within a phylogenomic view of species' diversification.
Affiliation(s)
- Ernest K. Lee
  - Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
- Angelica Cibrian-Jaramillo
  - Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
  - Cullman Program in Molecular Systematics, The New York Botanical Garden, Bronx, New York, United States of America
  - Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America
- Sergios-Orestis Kolokotronis
  - Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
- Manpreet S. Katari
  - Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America
- Michael Ott
  - Department of Computer Science, Technische Universität München, Munich, Germany
- Joanna C. Chiu
  - Department of Entomology, University of California Davis, Davis, California, United States of America
- Damon P. Little
  - Cullman Program in Molecular Systematics, The New York Botanical Garden, Bronx, New York, United States of America
- Dennis Wm. Stevenson
  - Cullman Program in Molecular Systematics, The New York Botanical Garden, Bronx, New York, United States of America
- W. Richard McCombie
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Robert A. Martienssen
  - Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
- Gloria Coruzzi
  - Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America
- Rob DeSalle
  - Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, United States of America
|
31
|
Cho S, Zwick A, Regier JC, Mitter C, Cummings MP, Yao J, Du Z, Zhao H, Kawahara AY, Weller S, Davis DR, Baixeras J, Brown JW, Parr C. Can deliberately incomplete gene sample augmentation improve a phylogeny estimate for the advanced moths and butterflies (Hexapoda: Lepidoptera)? Syst Biol 2011; 60:782-96. [PMID: 21840842 PMCID: PMC3193767 DOI: 10.1093/sysbio/syr079] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Received: 12/24/2009] [Revised: 03/04/2010] [Accepted: 04/12/2011] [Indexed: 11/15/2022] Open
Abstract
This paper addresses the question of whether one can economically improve the robustness of a molecular phylogeny estimate by increasing gene sampling in only a subset of taxa, without having the analysis invalidated by artifacts arising from large blocks of missing data. Our case study stems from an ongoing effort to resolve poorly understood deeper relationships in the large clade Ditrysia ( > 150,000 species) of the insect order Lepidoptera (butterflies and moths). Seeking to remedy the overall weak support for deeper divergences in an initial study based on five nuclear genes (6.6 kb) in 123 exemplars, we nearly tripled the total gene sample (to 26 genes, 18.4 kb) but only in a third (41) of the taxa. The resulting partially augmented data matrix (45% intentionally missing data) consistently increased bootstrap support for groupings previously identified in the five-gene (nearly) complete matrix, while introducing no contradictory groupings of the kind that missing data have been predicted to produce. Our results add to growing evidence that data sets differing substantially in gene and taxon sampling can often be safely and profitably combined. The strongest overall support for nodes above the family level came from including all nucleotide changes, while partitioning sites into sets undergoing mostly nonsynonymous versus mostly synonymous change. In contrast, support for the deepest node for which any persuasive molecular evidence has yet emerged (78-85% bootstrap) was weak or nonexistent unless synonymous change was entirely excluded, a result plausibly attributed to compositional heterogeneity. This node (Gelechioidea + Apoditrysia), tentatively proposed by previous authors on the basis of four morphological synapomorphies, is the first major subset of ditrysian superfamilies to receive strong statistical support in any phylogenetic study. 
A "more-genes-only" data set (41 taxa×26 genes) also gave strong signal for a second deep grouping (Macrolepidoptera) that was obscured, but not strongly contradicted, in more taxon-rich analyses.
Affiliation(s)
- Soowon Cho
  - Department of Entomology, University of Maryland, College Park, MD 20742, USA
  - Present address: Department of Plant Medicine, Chungbuk National University, Cheongju, Korea
- Andreas Zwick
  - Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
- Jerome C. Regier
  - Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
- Charles Mitter
  - Department of Entomology, University of Maryland, College Park, MD 20742, USA
- Michael P. Cummings
  - Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
- Jianxiu Yao
  - Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
  - Present address: Department of Entomology, Kansas State University, Manhattan, KS 66506, USA
- Zaile Du
  - Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
- Hong Zhao
  - Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
- Akito Y. Kawahara
  - Department of Entomology, University of Maryland, College Park, MD 20742, USA
- Susan Weller
  - Department of Entomology, University of Minnesota, Saint Paul, MN 55108, USA
- Donald R. Davis
  - Department of Entomology, Smithsonian Institution, Washington, DC 20560, USA
- Joaquin Baixeras
  - Cavanilles Institute of Biodiversity and Evolutionary Biology, University of Valencia, Valencia, Spain
- John W. Brown
  - Systematic Entomology Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705, USA
- Cynthia Parr
  - Encyclopedia of Life, Smithsonian Institution, Washington, DC 20560, USA
|
32
|
Kawahara AY, Ohshima I, Kawakita A, Regier JC, Mitter C, Cummings MP, Davis DR, Wagner DL, De Prins J, Lopez-Vaamonde C. Increased gene sampling strengthens support for higher-level groups within leaf-mining moths and relatives (Lepidoptera: Gracillariidae). BMC Evol Biol 2011; 11:182. [PMID: 21702958 PMCID: PMC3145599 DOI: 10.1186/1471-2148-11-182] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Received: 11/08/2010] [Accepted: 06/24/2011] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND: Researchers conducting molecular phylogenetic studies are frequently faced with the decision of what to do when weak branch support is obtained for key nodes of importance. As one solution, the researcher may choose to sequence additional orthologous genes of appropriate evolutionary rate for the taxa in the study. However, generating large, complete data matrices can become increasingly difficult as the number of characters increases. A few empirical studies have shown that augmenting genes even for a subset of taxa can improve branch support. However, because each study differs in the number of characters and taxa, there is still a need for additional studies that examine whether incomplete sampling designs are likely to aid at increasing deep node resolution. We target Gracillariidae, a Cretaceous-age (~100 Ma) group of leaf-mining moths to test whether the strategy of adding genes for a subset of taxa can improve branch support for deep nodes. We initially sequenced ten genes (8,418 bp) for 57 taxa that represent the major lineages of Gracillariidae plus outgroups. After finding that many deep divergences remained weakly supported, we sequenced eleven additional genes (6,375 bp) for a 27-taxon subset. We then compared results from different data sets to assess whether one sampling design can be favored over another. The concatenated data set comprising all genes and all taxa and three other data sets of different taxon and gene sub-sampling design were analyzed with maximum likelihood. Each data set was subject to five different models and partitioning schemes of non-synonymous and synonymous changes. Statistical significance of non-monophyly was examined with the Approximately Unbiased (AU) test.
RESULTS: Partial augmentation of genes led to high support for deep divergences, especially when non-synonymous changes were analyzed alone. Increasing the number of taxa without an increase in number of characters led to lower bootstrap support; increasing the number of characters without increasing the number of taxa generally increased bootstrap support. More than three-quarters of nodes were supported with bootstrap values greater than 80% when all taxa and genes were combined. Gracillariidae, Lithocolletinae + Leucanthiza, and Acrocercops and Parectopa groups were strongly supported in nearly every analysis. Gracillaria group was well supported in some analyses, but less so in others. We find strong evidence for the exclusion of Douglasiidae from Gracillarioidea sensu Davis and Robinson (1998). Our results strongly support the monophyly of a G.B.R.Y. clade, a group comprised of Gracillariidae + Bucculatricidae + Roeslerstammiidae + Yponomeutidae, when analyzed with non-synonymous changes only, but this group was frequently split when synonymous and non-synonymous substitutions were analyzed together.
CONCLUSIONS: 1) Partially or fully augmenting a data set with more characters increased bootstrap support for particular deep nodes, and this increase was dramatic when non-synonymous changes were analyzed alone. Thus, the addition of sites that have low levels of saturation and compositional heterogeneity can greatly improve results. 2) Gracillarioidea, as defined by Davis and Robinson (1998), clearly do not include Douglasiidae, and changes to current classification will be required. 3) Gracillariidae were monophyletic in all analyses conducted, and nearly all species can be placed into one of six strongly supported clades though relationships among these remain unclear. 4) The difficulty in determining the phylogenetic placement of Bucculatricidae is probably attributable to compositional heterogeneity at the third codon position. From our tests for compositional heterogeneity and strong bootstrap values obtained when synonymous changes are excluded, we tentatively conclude that Bucculatricidae is closely related to Gracillariidae + Roeslerstammiidae + Yponomeutidae.
Affiliation(s)
- Akito Y Kawahara
  - Department of Entomology, University of Maryland, College Park, MD, USA
- Issei Ohshima
  - Division of Evolutionary Biology, National Institute for Basic Biology, Okazaki, Japan
- Jerome C Regier
  - Institute for Bioscience and Biotechnology Research, University of Maryland, College Park, MD, USA
- Charles Mitter
  - Department of Entomology, University of Maryland, College Park, MD, USA
- Michael P Cummings
  - Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, USA
- Donald R Davis
  - Department of Entomology, Smithsonian Institution, Washington, D.C., USA
- David L Wagner
  - Department of Ecology & Evolutionary Biology, University of Connecticut, Storrs, CT, USA
|
33
|
Soltis DE, Smith SA, Cellinese N, Wurdack KJ, Tank DC, Brockington SF, Refulio-Rodriguez NF, Walker JB, Moore MJ, Carlsward BS, Bell CD, Latvis M, Crawley S, Black C, Diouf D, Xi Z, Rushworth CA, Gitzendanner MA, Sytsma KJ, Qiu YL, Hilu KW, Davis CC, Sanderson MJ, Beaman RS, Olmstead RG, Judd WS, Donoghue MJ, Soltis PS. Angiosperm phylogeny: 17 genes, 640 taxa. American Journal of Botany 2011; 98:704-30. [PMID: 21613169 DOI: 10.3732/ajb.1000404] [Citation(s) in RCA: 352] [Impact Index Per Article: 27.1] [Indexed: 05/02/2023]
Abstract
PREMISE OF THE STUDY: Recent analyses employing up to five genes have provided numerous insights into angiosperm phylogeny, but many relationships have remained unresolved or poorly supported. In the hope of improving our understanding of angiosperm phylogeny, we expanded sampling of taxa and genes beyond previous analyses.
METHODS: We conducted two primary analyses based on 640 species representing 330 families. The first included 25260 aligned base pairs (bp) from 17 genes (representing all three plant genomes, i.e., nucleus, plastid, and mitochondrion). The second included 19846 aligned bp from 13 genes (representing only the nucleus and plastid).
KEY RESULTS: Many important questions of deep-level relationships in the nonmonocot angiosperms have now been resolved with strong support. Amborellaceae, Nymphaeales, and Austrobaileyales are successive sisters to the remaining angiosperms (Mesangiospermae), which are resolved into Chloranthales + Magnoliidae as sister to Monocotyledoneae + [Ceratophyllaceae + Eudicotyledoneae]. Eudicotyledoneae contains a basal grade subtending Gunneridae. Within Gunneridae, Gunnerales are sister to the remainder (Pentapetalae), which comprises (1) Superrosidae, consisting of Rosidae (including Vitaceae) and Saxifragales; and (2) Superasteridae, comprising Berberidopsidales, Santalales, Caryophyllales, Asteridae, and, based on this study, Dilleniaceae (although other recent analyses disagree with this placement). Within the major subclades of Pentapetalae, most deep-level relationships are resolved with strong support.
CONCLUSIONS: Our analyses confirm that with large amounts of sequence data, most deep-level relationships within the angiosperms can be resolved. We anticipate that this well-resolved angiosperm tree will be of broad utility for many areas of biology, including physiology, ecology, paleobiology, and genomics.
Affiliation(s)
- Douglas E Soltis
  - Department of Biology, University of Florida, Gainesville, Florida 32611-8525, USA.
|
34
|
(Jenny) Xiang QY, Thomas DT, Xiang QP. Resolving and dating the phylogeny of Cornales – Effects of taxon sampling, data partitions, and fossil calibrations. Mol Phylogenet Evol 2011; 59:123-38. [DOI: 10.1016/j.ympev.2011.01.016] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.0] [Received: 09/29/2010] [Revised: 01/18/2011] [Accepted: 01/26/2011] [Indexed: 10/18/2022]
|
35
|
Conservation and divergence of plant LHP1 protein sequences and expression patterns in angiosperms and gymnosperms. Mol Genet Genomics 2011; 285:357-73. [DOI: 10.1007/s00438-011-0609-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 09/18/2010] [Accepted: 02/09/2011] [Indexed: 12/21/2022]
|
36
|
Abstract
Flies are one of four superradiations of insects (along with beetles, wasps, and moths) that account for the majority of animal life on Earth. Diptera includes species known for their ubiquity (Musca domestica house fly), their role as pests (Anopheles gambiae malaria mosquito), and their value as model organisms across the biological sciences (Drosophila melanogaster). A resolved phylogeny for flies provides a framework for genomic, developmental, and evolutionary studies by facilitating comparisons across model organisms, yet recent research has suggested that fly relationships have been obscured by multiple episodes of rapid diversification. We provide a phylogenomic estimate of fly relationships based on molecules and morphology from 149 of 157 families, including 30 kb from 14 nuclear loci and complete mitochondrial genomes combined with 371 morphological characters. Multiple analyses show support for traditional groups (Brachycera, Cyclorrhapha, and Schizophora) and corroborate contentious findings, such as the anomalous Deuterophlebiidae as the sister group to all remaining Diptera. Our findings reveal that the closest relatives of the Drosophilidae are highly modified parasites (including the wingless Braulidae) of bees and other insects. Furthermore, we use micro-RNAs to resolve a node with implications for the evolution of embryonic development in Diptera. We demonstrate that flies experienced three episodes of rapid radiation--lower Diptera (220 Ma), lower Brachycera (180 Ma), and Schizophora (65 Ma)--and a number of life history transitions to hematophagy, phytophagy, and parasitism in the history of fly evolution over 260 million y.
|
37
|
Neves SS, Forrest LL. Plant DNA sequencing for phylogenetic analyses: from plants to sequences. Methods Mol Biol 2011; 781:183-235. [PMID: 21877283 DOI: 10.1007/978-1-61779-276-2_10] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Indexed: 12/13/2022]
Abstract
DNA sequences are important sources of data for phylogenetic analysis. Nowadays, DNA sequencing is a routine technique in molecular biology laboratories. However, there are specific questions associated with project design and sequencing of plant samples for phylogenetic analysis, which may not be familiar to researchers starting in the field. This chapter gives an overview of methods and protocols involved in the sequencing of plant samples, including general recommendations on the selection of species/taxa and DNA regions to be sequenced, and field collection of plant samples. Protocols of plant sample preparation, DNA extraction, PCR and cloning, which are critical to the success of molecular phylogenetic projects, are described in detail. Common problems of sequencing (using the Sanger method) are also addressed. Possible applications of second-generation sequencing techniques in plant phylogenetics are briefly discussed. Finally, orientation on the preparation of sequence data for phylogenetic analyses and submission to public databases is also given.
Affiliation(s)
- Susana S Neves
  - Plant Cell Biotechnology Laboratory, ITQB Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Oeiras, Portugal.
|
38
|
Schäferhoff B, Fleischmann A, Fischer E, Albach DC, Borsch T, Heubl G, Müller KF. Towards resolving Lamiales relationships: insights from rapidly evolving chloroplast sequences. BMC Evol Biol 2010; 10:352. [PMID: 21073690 PMCID: PMC2992528 DOI: 10.1186/1471-2148-10-352] [Citation(s) in RCA: 112] [Impact Index Per Article: 8.0] [Received: 05/25/2010] [Accepted: 11/12/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: In the large angiosperm order Lamiales, a diverse array of highly specialized life strategies such as carnivory, parasitism, epiphytism, and desiccation tolerance occur, and some lineages possess drastically accelerated DNA substitutional rates or miniaturized genomes. However, understanding the evolution of these phenomena in the order, and clarifying borders of and relationships among lamialean families, has been hindered by largely unresolved trees in the past.
RESULTS: Our analysis of the rapidly evolving trnK/matK, trnL-F and rps16 chloroplast regions enabled us to infer more precise phylogenetic hypotheses for the Lamiales. Relationships among the nine first-branching families in the Lamiales tree are now resolved with very strong support. Subsequent to Plocospermataceae, a clade consisting of Carlemanniaceae plus Oleaceae branches, followed by Tetrachondraceae and a newly inferred clade composed of Gesneriaceae plus Calceolariaceae, which is also supported by morphological characters. Plantaginaceae (incl. Gratioleae) and Scrophulariaceae are well separated in the backbone grade; Lamiaceae and Verbenaceae appear in distant clades, while the recently described Linderniaceae are confirmed to be monophyletic and in an isolated position.
CONCLUSIONS: Confidence about deep nodes of the Lamiales tree is an important step towards understanding the evolutionary diversification of a major clade of flowering plants. The degree of resolution obtained here now provides a first opportunity to discuss the evolution of morphological and biochemical traits in Lamiales. The multiple independent evolution of the carnivorous syndrome, once in Lentibulariaceae and a second time in Byblidaceae, is strongly supported by all analyses and topological tests. The evolution of selected morphological characters such as flower symmetry is discussed. The addition of further sequence data from introns and spacers holds promise to eventually obtain a fully resolved plastid tree of Lamiales.
Affiliation(s)
- Bastian Schäferhoff
  - Institute for Evolution and Biodiversity, University of Muenster, Hüfferstraße 1, 48149 Münster, Germany
|
39
|
Cibrián-Jaramillo A, De la Torre-Bárcena JE, Lee EK, Katari MS, Little DP, Stevenson DW, Martienssen R, Coruzzi GM, DeSalle R. Using phylogenomic patterns and gene ontology to identify proteins of importance in plant evolution. Genome Biol Evol 2010; 2:225-39. [PMID: 20624728 PMCID: PMC2997538 DOI: 10.1093/gbe/evq012] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Accepted: 03/14/2010] [Indexed: 01/01/2023] Open
Abstract
We use measures of congruence on a combined expressed sequenced tag genome phylogeny to identify proteins that have potential significance in the evolution of seed plants. Relevant proteins are identified based on the direction of partitioned branch and hidden support on the hypothesis obtained on a 16-species tree, constructed from 2,557 concatenated orthologous genes. We provide a general method for detecting genes or groups of genes that may be under selection in directions that are in agreement with the phylogenetic pattern. Gene partitioning methods and estimates of the degree and direction of support of individual gene partitions to the overall data set are used. Using this approach, we correlate positive branch support of specific genes for key branches in the seed plant phylogeny. In addition to basic metabolic functions, such as photosynthesis or hormones, genes involved in posttranscriptional regulation by small RNAs were significantly overrepresented in key nodes of the phylogeny of seed plants. Two genes in our matrix are of critical importance as they are involved in RNA-dependent regulation, essential during embryo and leaf development. These are Argonaute and the RNA-dependent RNA polymerase 6 found to be overrepresented in the angiosperm clade. We use these genes as examples of our phylogenomics approach and show that identifying partitions or genes in this way provides a platform to explain some of the more interesting organismal differences among species, and in particular, in the evolution of plants.
Affiliation(s)
- Angélica Cibrián-Jaramillo
  - Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, USA.
|
40
|
Arnold C, Stadler PF. Polynomial algorithms for the Maximal Pairing Problem: efficient phylogenetic targeting on arbitrary trees. Algorithms Mol Biol 2010; 5:25. [PMID: 20525185 PMCID: PMC2902485 DOI: 10.1186/1748-7188-5-25] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 04/08/2010] [Accepted: 06/02/2010] [Indexed: 11/24/2022] Open
Abstract
Background: The Maximal Pairing Problem (MPP) is the prototype of a class of combinatorial optimization problems that are of considerable interest in bioinformatics: Given an arbitrary phylogenetic tree T and weights ω_xy for the paths between any two pairs of leaves (x, y), what is the collection of edge-disjoint paths between pairs of leaves that maximizes the total weight? Special cases of the MPP for binary trees and equal weights have been described previously; algorithms to solve the general MPP are still missing, however.
Results: We describe a relatively simple dynamic programming algorithm for the special case of binary trees. We then show that the general case of multifurcating trees can be treated by interleaving solutions to certain auxiliary Maximum Weighted Matching problems with an extension of this dynamic programming approach, resulting in an overall polynomial-time solution of complexity O(n^4 log n) w.r.t. the number n of leaves. The source code of a C implementation can be obtained under the GNU Public License from http://www.bioinf.uni-leipzig.de/Software/Targeting. For binary trees, we furthermore discuss several constrained variants of the MPP as well as a partition function approach to the probabilistic version of the MPP.
Conclusions: The algorithms introduced here make it possible to solve the MPP also for large trees with high-degree vertices. This has practical relevance in the field of comparative phylogenetics and, for example, in the context of phylogenetic targeting, i.e., data collection with resource limitations.
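The binary-tree special case described in this abstract can be illustrated with a small dynamic program. The following is a minimal Python sketch written for this listing, not the authors' C implementation and not necessarily the recurrence they use. The function name `max_pairing`, the nested-tuple tree encoding, and the `weight` callback are all assumptions made for illustration. It assumes a rooted binary tree and returns only the optimal total weight, not the paths themselves.

```python
from itertools import product


def max_pairing(tree, weight):
    """Best total weight of edge-disjoint leaf-to-leaf paths (sketch).

    tree   : a leaf name (str) or a 2-tuple (left, right) of subtrees
             (hypothetical encoding chosen for this example)
    weight : function (x, y) -> number, the weight of the path x..y
    """
    def solve(node):
        # Returns (closed, open_from):
        #   closed       = best score inside this subtree when no path
        #                  crosses the edge above this node
        #   open_from[x] = best score inside this subtree when a path
        #                  starting at leaf x leaves through that edge
        if isinstance(node, str):
            return 0, {node: 0}
        left, right = node
        closed_l, open_l = solve(left)
        closed_r, open_r = solve(right)

        # Option 1: solve the two child subtrees independently.
        best = closed_l + closed_r
        # Option 2: one path joins a left leaf to a right leaf via this node.
        for x, y in product(open_l, open_r):
            best = max(best, open_l[x] + open_r[y] + weight(x, y))

        # A path leaving upward from one child uses that child's edge and
        # the parent edge, so the other child must be solved as closed.
        open_from = {x: v + closed_r for x, v in open_l.items()}
        open_from.update({y: v + closed_l for y, v in open_r.items()})
        return best, open_from

    return solve(tree)[0]


# Example weights (made up): pairing a with c beats pairing a-b and c-d.
w = {frozenset("ab"): 1, frozenset("cd"): 1, frozenset("ac"): 5,
     frozenset("ad"): 0, frozenset("bc"): 0, frozenset("bd"): 0}
tree = (("a", "b"), ("c", "d"))
print(max_pairing(tree, lambda x, y: w[frozenset((x, y))]))  # prints 5
```

Each pair of leaves is examined once, at their lowest common ancestor, so this sketch does quadratic work in the number of leaves. Handling multifurcating trees, which the abstract addresses through auxiliary matching problems, is outside its scope.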
|
41
|
Phylogenetic analysis of 83 plastid genes further resolves the early diversification of eudicots. Proc Natl Acad Sci U S A 2010; 107:4623-8. [PMID: 20176954 DOI: 10.1073/pnas.0907801107] [Citation(s) in RCA: 454] [Impact Index Per Article: 32.4] [Indexed: 11/18/2022] Open
Abstract
Although Pentapetalae (comprising all core eudicots except Gunnerales) include approximately 70% of all angiosperms, the origin of and relationships among the major lineages of this clade have remained largely unresolved. Phylogenetic analyses of 83 protein-coding and rRNA genes from the plastid genome for 86 species of seed plants, including new sequences from 25 eudicots, indicate that soon after its origin, Pentapetalae diverged into three clades: (i) a "superrosid" clade consisting of Rosidae, Vitaceae, and Saxifragales; (ii) a "superasterid" clade consisting of Berberidopsidales, Santalales, Caryophyllales, and Asteridae; and (iii) Dilleniaceae. Maximum-likelihood analyses support the position of Dilleniaceae as sister to superrosids, but topology tests did not reject alternative positions of Dilleniaceae as sister to Asteridae or all remaining Pentapetalae. Molecular dating analyses suggest that the major lineages within both superrosids and superasterids arose in as little as 5 million years. This phylogenetic hypothesis provides a crucial historical framework for future studies aimed at elucidating the underlying causes of the morphological and species diversity in Pentapetalae.
|
42
|
Abstract
The potyviruses are one of the two most speciose taxa of plant viruses. Our expanded knowledge of the breadth and depth of their diversity and its origins has depended greatly on the use of computing and the Internet in biological research and is reviewed here. We report a fully supported phylogeny based on gene sequence data for approximately half the named species. The phylogeny shows that the genus probably originated from a virus of monocotyledonous plants and that it first diverged approximately 7250 years ago in Southwest Eurasia or North Africa. The use of computer programs to better understand the structure and evolutionary trajectory of potyvirus populations is illustrated. The review concludes with recommendations for improving potyvirus nomenclature and the databasing of potyvirus information.
Affiliation(s)
- Adrian Gibbs
  - Emeritus Faculty, Australian National University, Canberra, ACT 0200, Australia.
|