1
|
Field JT, Abrams AJ, Cartee JC, McTavish EJ. Rapid alignment updating with Extensiphy. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jasper Toscani Field
- Quantitative and Systems Biology Program School of Natural Sciences University of California Merced CA USA
| | - A. Jeanine Abrams
- Division of STD Prevention National Centers for HIV/AIDS Viral Hepatitis, STD, and TB Prevention Atlanta GA USA
| | - John C. Cartee
- Division of STD Prevention National Centers for HIV/AIDS Viral Hepatitis, STD, and TB Prevention Atlanta GA USA
| | - Emily Jane McTavish
- Life and Environmental Sciences Department School of Natural Sciences University of California Merced CA USA
| |
Collapse
|
2
|
Mctavish EJ, Sánchez-Reyes LL, Holder MT. OpenTree: A Python Package for Accessing and Analyzing Data from the Open Tree of Life. Syst Biol 2021; 70:1295-1301. [PMID: 33970279 PMCID: PMC8513759 DOI: 10.1093/sysbio/syab033] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 04/27/2021] [Accepted: 05/03/2021] [Indexed: 11/14/2022] Open
Abstract
The Open Tree of Life project constructs a comprehensive, dynamic, and digitally available tree of life by synthesizing published phylogenetic trees along with taxonomic data. Open Tree of Life provides web-service application programming interfaces (APIs) to make the tree estimate, unified taxonomy, and input phylogenetic data available to anyone. Here, we describe the Python package opentree, which provides a user friendly Python wrapper for these APIs and a set of scripts and tutorials for straightforward downstream data analyses. We demonstrate the utility of these tools by generating an estimate of the phylogenetic relationships of all bird families, and by capturing a phylogenetic estimate for all taxa observed at the University of California Merced Vernal Pools and Grassland Reserve.[Evolution; open science; phylogenetics; Python; taxonomy.].
Collapse
Affiliation(s)
- Emily Jane Mctavish
- Department of Life and Environmental Sciences, University of California, Merced, CA 95343, USA
| | | | - Mark T Holder
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA
- Biodiversity Institute, University of Kansas, Lawrence, KS 66045, USA
| |
Collapse
|
3
|
Sánchez-Reyes LL, Kandziora M, McTavish EJ. Physcraper: a Python package for continually updated phylogenetic trees using the Open Tree of Life. BMC Bioinformatics 2021; 22:355. [PMID: 34187366 PMCID: PMC8244228 DOI: 10.1186/s12859-021-04274-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Accepted: 06/16/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Phylogenies are a key part of research in many areas of biology. Tools that automate some parts of the process of phylogenetic reconstruction, mainly molecular character matrix assembly, have been developed for the advantage of both specialists in the field of phylogenetics and non-specialists. However, interpretation of results, comparison with previously available phylogenetic hypotheses, and selection of one phylogeny for downstream analyses and discussion still impose difficulties to one that is not a specialist either on phylogenetic methods or on a particular group of study. RESULTS Physcraper is a command-line Python program that automates the update of published phylogenies by adding public DNA sequences to underlying alignments of previously published phylogenies. It also provides a framework for straightforward comparison of published phylogenies with their updated versions, by leveraging upon tools from the Open Tree of Life project to link taxonomic information across databases. The program can be used by the nonspecialist, as a tool to generate phylogenetic hypotheses based on publicly available expert phylogenetic knowledge. Phylogeneticists and taxonomic group specialists will find it useful as a tool to facilitate molecular dataset gathering and comparison of alternative phylogenetic hypotheses (topologies). CONCLUSION The Physcraper workflow showcases the benefits of doing open science for phylogenetics, encouraging researchers to strive for better scientific sharing practices. Physcraper can be used with any OS and is released under an open-source license. Detailed instructions for installation and usage are available at https://physcraper.readthedocs.io.
Collapse
Affiliation(s)
| | - Martha Kandziora
- School of Natural Sciences, University of California, Merced, USA.,Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic
| | | |
Collapse
|
4
|
Hong Y, Guo M, Wang J. ENJ algorithm can construct triple phylogenetic trees. MOLECULAR THERAPY-NUCLEIC ACIDS 2020; 23:286-293. [PMID: 33425487 PMCID: PMC7779534 DOI: 10.1016/j.omtn.2020.11.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 11/05/2020] [Indexed: 11/20/2022]
Abstract
Phylogenetic analysis is used to analyze the evolution of species according to the characteristics of biological sequences. The analytical results are generally represented by phylogenetic trees. NJ (neighbor joining) is a frequently used algorithm for constructing phylogenetic trees because of its few assumptions, fast operation, and high accuracy, and is based on the distance between taxa. It is known that NJ usually constructs different phylogenetic trees for the same dataset with differences in input order, which are known as “tied trees.” This article proposes an improved method of NJ, called ENJ (extended neighbor joining). The ENJ can join several (currently limited to three) nodes with the same minimum distance into a new node, rather than joining two nodes in one iteration, so it can construct triple phylogenetic trees. We have inferred the formulas for updating the distance values and calculating the branch lengths for the ENJ algorithm. We have tested the ENJ with simulated and real data. The experimental results show that, compared with other methods, the trees constructed by the ENJ have greater similarity to the initial trees, and the ENJ is much faster than the NJ algorithm. Moreover, we have constructed a phylogenetic tree for the novel coronavirus (COVID-19) and related coronaviruses by ENJ, which shows that COVID-19 and SARS-CoV are closer than other coronaviruses. Because it differs from the existing phylogenetic trees for those coronaviruses, we constructed a phylogenetic network for them. The network shows those species have had a reticulate evolution.
Collapse
Affiliation(s)
- Yan Hong
- School of Computer Science, Inner Mongolia University, Hohhot 010021, P.R. China
| | - Maozu Guo
- School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, P.R. China.,Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing 100044, P.R. China
| | - Juan Wang
- School of Computer Science, Inner Mongolia University, Hohhot 010021, P.R. China.,Stage Key Laboratories of Reproductive Regulation & Breeding of Grassland Livestock, Hohhot 010021, Inner Mongolia, P.R. China
| |
Collapse
|
5
|
How to Study Classification. Cladistics 2020. [DOI: 10.1017/9781139047678.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
6
|
Classification. Cladistics 2020. [DOI: 10.1017/9781139047678.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
7
|
Systematics Association Special Volumes. Cladistics 2020. [DOI: 10.1017/9781139047678.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
8
|
Relationship Diagrams. Cladistics 2020. [DOI: 10.1017/9781139047678.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
9
|
The Separation of Classification and Phylogenetics. Cladistics 2020. [DOI: 10.1017/9781139047678.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
10
|
Beyond Classification. Cladistics 2020. [DOI: 10.1017/9781139047678.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
11
|
The Interrelationships of Organisms. Cladistics 2020. [DOI: 10.1017/9781139047678.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
12
|
How to Study Classification. Cladistics 2020. [DOI: 10.1017/9781139047678.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
13
|
Modern Artificial Methods and Raw Data. Cladistics 2020. [DOI: 10.1017/9781139047678.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
14
|
Further Myths and More Misunderstandings. Cladistics 2020. [DOI: 10.1017/9781139047678.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
15
|
Afterword. Cladistics 2020. [DOI: 10.1017/9781139047678.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
16
|
Systematics: Exposing Myths. Cladistics 2020. [DOI: 10.1017/9781139047678.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
17
|
Essentialism and Typology. Cladistics 2020. [DOI: 10.1017/9781139047678.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
18
|
Beyond Classification: How to Study Phylogeny. Cladistics 2020. [DOI: 10.1017/9781139047678.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
19
|
How to Study Classification: ‘Total Evidence’ vs. ‘Consensus’, Character Congruence vs. Taxonomic Congruence, Simultaneous Analysis vs. Partitioned Data. Cladistics 2020. [DOI: 10.1017/9781139047678.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
|
20
|
What This Book Is About. Cladistics 2020. [DOI: 10.1017/9781139047678.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
21
|
How to Study Classification. Cladistics 2020. [DOI: 10.1017/9781139047678.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
22
|
The Cladistic Programme. Cladistics 2020. [DOI: 10.1017/9781139047678.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
23
|
Index. Cladistics 2020. [DOI: 10.1017/9781139047678.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
24
|
Parameters of Classification: Ordo Ab Chao. Cladistics 2020. [DOI: 10.1017/9781139047678.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
25
|
Monothetic and Polythetic Taxa. Cladistics 2020. [DOI: 10.1017/9781139047678.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
26
|
How to Study Classification: Consensus Techniques and General Classifications. Cladistics 2020. [DOI: 10.1017/9781139047678.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
27
|
Non-taxa or the Absence of –Phyly: Paraphyly and Aphyly. Cladistics 2020. [DOI: 10.1017/9781139047678.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
28
|
Introduction: Carving Nature at Its Joints, or Why Birds Are Not Dinosaurs and Men Are Not Apes. Cladistics 2020. [DOI: 10.1017/9781139047678.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
29
|
Preface. Cladistics 2020. [DOI: 10.1017/9781139047678.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
30
|
Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res 2020; 47:W256-W259. [PMID: 30931475 PMCID: PMC6602468 DOI: 10.1093/nar/gkz239] [Citation(s) in RCA: 3953] [Impact Index Per Article: 790.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 03/16/2019] [Accepted: 03/25/2019] [Indexed: 11/13/2022] Open
Abstract
The Interactive Tree Of Life (https://itol.embl.de) is an online tool for the display, manipulation and annotation of phylogenetic and other trees. It is freely available and open to everyone. The current version introduces four new dataset types, together with numerous new features. Annotation options have been expanded and new control options added for many display elements. An interactive spreadsheet-like editor has been implemented, providing dataset creation and editing directly in the web interface. Font support has been rewritten with full support for UTF-8 character encoding throughout the user interface. Google Web Fonts are now fully supported in the tree text labels. iTOL v4 is the first tool which supports direct visualization of Qiime 2 trees and associated annotations. The user account system has been streamlined and expanded with new navigation options, and currently handles >700 000 trees from more than 40 000 individual users. Full batch access has been implemented allowing programmatic upload and export of trees and annotations.
Collapse
Affiliation(s)
- Ivica Letunic
- biobyte solutions GmbH, Bothestr 142, 69126 Heidelberg, Germany
| | - Peer Bork
- EMBL, Meyerhofstrasse 1, 69117 Heidelberg, Germany
| |
Collapse
|
31
|
Kaufer A, Stark D, Ellis J. A review of the systematics, species identification and diagnostics of the Trypanosomatidae using the maxicircle kinetoplast DNA: from past to present. Int J Parasitol 2020; 50:449-460. [PMID: 32333942 DOI: 10.1016/j.ijpara.2020.03.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2019] [Revised: 02/28/2020] [Accepted: 03/09/2020] [Indexed: 11/25/2022]
Abstract
The Trypanosomatid family are a diverse and widespread group of protozoan parasites that belong to the higher order class Kinetoplastida. Containing predominantly monoxenous species (i.e. those having only a single host) that are confined to invertebrate hosts, this class is primarily known for its pathogenic dixenous species (i.e. those that have two hosts), serving as the aetiological agents of the important neglected tropical diseases including leishmaniasis, American trypanosomiasis (Chagas disease) and human African trypanosomiasis. Over the past few decades, a multitude of studies have investigated the diversity, classification and evolutionary history of the trypanosomatid family using different approaches and molecular targets. The mitochondrial-like DNA of the trypanosomatid parasites, also known as the kinetoplast, has emerged as a unique taxonomic and diagnostic target for exploring the evolution of this diverse group of parasitic eukaryotes. This review discusses recent advancements and important developments that have made a significant impact in the field of trypanosomatid systematics and diagnostics in recent years.
Collapse
Affiliation(s)
- Alexa Kaufer
- School of Life Sciences, University of Technology Sydney, Ultimo, NSW 2007, Australia.
| | - Damien Stark
- Department of Microbiology, St Vincent's Hospital Sydney, Darlinghurst, NSW 2010, Australia
| | - John Ellis
- School of Life Sciences, University of Technology Sydney, Ultimo, NSW 2007, Australia
| |
Collapse
|
32
|
Nguyen VD, Nguyen TH, Tayeen ASM, Laughinghouse HD, Sánchez-Reyes LL, Wiggins J, Pontelli E, Mozzherin D, O’Meara B, Stoltzfus A. Phylotastic: Improving Access to Tree-of-Life Knowledge With Flexible, on-the-Fly Delivery of Trees. Evol Bioinform Online 2020; 16:1176934319899384. [PMID: 32372858 PMCID: PMC7192527 DOI: 10.1177/1176934319899384] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 11/20/2019] [Indexed: 11/15/2022] Open
Abstract
A comprehensive phylogeny of species, i.e., a tree of life, has potential uses in a variety of contexts, including research, education, and public policy. Yet, accessing the tree of life typically requires special knowledge, complex software, or long periods of training. The Phylotastic project aims make it as easy to get a phylogeny of species as it is to get driving directions from mapping software. In prior work, we presented a design for an open system to validate and manage taxon names, find phylogeny resources, extract subtrees matching a user's taxon list, scale trees to time, and integrate related resources such as species images. Here, we report the implementation of a set of tools that together represent a robust, accessible system for on-the-fly delivery of phylogenetic knowledge. This set of tools includes a web portal to execute several customizable workflows to obtain species phylogenies (scaled by geologic time and decorated with thumbnail images); more than 30 underlying web services (accessible via a common registry); and code toolkits in R and Python (allowing others to develop custom applications using Phylotastic services). The Phylotastic system, accessible via http://www.phylotastic.org, provides a unique resource to access the current state of phylogenetic knowledge, useful for a variety of cases in which a tree extracted quickly from online resources (as distinct from a tree custom-made from character data) is sufficient, as it is for many casual uses of trees identified here.
Collapse
Affiliation(s)
- Van D Nguyen
- Department of Computer Science, New Mexico State University, Las Cruces, NM, USA
| | - Thanh H Nguyen
- Department of Computer Science, New Mexico State University, Las Cruces, NM, USA
| | - Abu Saleh Md Tayeen
- Department of Computer Science, New Mexico State University, Las Cruces, NM, USA
| | - H Dail Laughinghouse
- Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Fort Lauderdale Research and Education Center, University of Florida/IFAS, Davie, FL, USA
| | - Luna L Sánchez-Reyes
- Department of Ecology and Evolutionary Biology, The University of Tennessee, Knoxville, Knoxville, TN, USA
| | - Jodie Wiggins
- Department of Ecology and Evolutionary Biology, The University of Tennessee, Knoxville, Knoxville, TN, USA
| | - Enrico Pontelli
- Department of Computer Science, New Mexico State University, Las Cruces, NM, USA
| | - Dmitry Mozzherin
- Illinois Natural History Survey, Species File Group, University of Illinois at Urbana–Champaign, Champaign, IL, USA
| | - Brian O’Meara
- Department of Ecology and Evolutionary Biology, The University of Tennessee, Knoxville, Knoxville, TN, USA
| | - Arlin Stoltzfus
- Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Office of Data and Informatics, Material Measurement Laboratory, NIST, Gaithersburg, MD, USA
| |
Collapse
|
33
|
Domingo E. Long-term virus evolution in nature. VIRUS AS POPULATIONS 2020. [PMCID: PMC7153321 DOI: 10.1016/b978-0-12-816331-3.00007-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Viruses spread to give rise to epidemics and pandemics, and some key parameters that include virus and host population numbers determine virus persistence or extinction in nature. Viruses evolve at different rates depending on the polymerase copying fidelity during genome replication and a number of environmental influences. Calculated rates of evolution in nature vary depending on the time interval between virus isolations. In particular, intrahost evolution is generally more rapid that interhost evolution, and several possible mechanisms for this difference are considered. The mechanisms by which the error-prone viruses evolve are very unlikely to render the operation of a molecular clock (constant rate of incorporation of mutations in the evolving genomes), although a clock is assumed in many calculations. Several computational tools permit the alignment of viral sequences and the establishment of phylogenetic relationships among viruses. The evolution of the virus in the form of dynamic mutant clouds in each infected individual, together with multiple environmental parameters renders the emergence and reemergence of viral pathogens an unpredictable event, another facet of biological complexity.
Collapse
|
34
|
Chang J, Rabosky DL, Smith SA, Alfaro ME. An
r
package and online resource for macroevolutionary studies using the ray‐finned fish tree of life. Methods Ecol Evol 2019. [DOI: 10.1111/2041-210x.13182] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Jonathan Chang
- School of Biological Sciences Monash University Clayton VIC Australia
| | - Daniel L. Rabosky
- Museum of Zoology Department of Ecology and Evolutionary Biology University of Michigan Ann Arbor MI
| | - Stephen A. Smith
- Museum of Zoology Department of Ecology and Evolutionary Biology University of Michigan Ann Arbor MI
| | - Michael E. Alfaro
- Department of Ecology and Evolutionary BiologyUniversity of CaliforniaLos AngelesCA
| |
Collapse
|
35
|
Bravo GA, Antonelli A, Bacon CD, Bartoszek K, Blom MPK, Huynh S, Jones G, Knowles LL, Lamichhaney S, Marcussen T, Morlon H, Nakhleh LK, Oxelman B, Pfeil B, Schliep A, Wahlberg N, Werneck FP, Wiedenhoeft J, Willows-Munro S, Edwards SV. Embracing heterogeneity: coalescing the Tree of Life and the future of phylogenomics. PeerJ 2019; 7:e6399. [PMID: 30783571 PMCID: PMC6378093 DOI: 10.7717/peerj.6399] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 01/07/2019] [Indexed: 12/23/2022] Open
Abstract
Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress.
Collapse
Affiliation(s)
- Gustavo A. Bravo
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
| | - Alexandre Antonelli
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
- Gothenburg Botanical Garden, Göteborg, Sweden
| | - Christine D. Bacon
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Krzysztof Bartoszek
- Department of Computer and Information Science, Linköping University, Linköping, Sweden
| | - Mozes P. K. Blom
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - Stella Huynh
- Institut de Biologie, Université de Neuchâtel, Neuchâtel, Switzerland
| | - Graham Jones
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - L. Lacey Knowles
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Sangeet Lamichhaney
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
| | - Thomas Marcussen
- Centre for Ecological and Evolutionary Synthesis, University of Oslo, Oslo, Norway
| | - Hélène Morlon
- Institut de Biologie, Ecole Normale Supérieure de Paris, Paris, France
| | - Luay K. Nakhleh
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Bengt Oxelman
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Bernard Pfeil
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Alexander Schliep
- Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
| | | | - Fernanda P. Werneck
- Coordenação de Biodiversidade, Programa de Coleções Científicas Biológicas, Instituto Nacional de Pesquisa da Amazônia, Manaus, AM, Brazil
| | - John Wiedenhoeft
- Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
- Department of Computer Science, Rutgers University, Piscataway, NJ, USA
| | - Sandi Willows-Munro
- School of Life Sciences, University of Kwazulu-Natal, Pietermaritzburg, South Africa
| | - Scott V. Edwards
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Gothenburg Centre for Advanced Studies in Science and Technology, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
| |
Collapse
|
36
|
Eiserhardt WL, Antonelli A, Bennett DJ, Botigué LR, Burleigh JG, Dodsworth S, Enquist BJ, Forest F, Kim JT, Kozlov AM, Leitch IJ, Maitner BS, Mirarab S, Piel WH, Pérez-Escobar OA, Pokorny L, Rahbek C, Sandel B, Smith SA, Stamatakis A, Vos RA, Warnow T, Baker WJ. A roadmap for global synthesis of the plant tree of life. AMERICAN JOURNAL OF BOTANY 2018; 105:614-622. [PMID: 29603138 DOI: 10.1002/ajb2.1041] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Accepted: 11/08/2017] [Indexed: 06/08/2023]
Abstract
Providing science and society with an integrated, up-to-date, high quality, open, reproducible and sustainable plant tree of life would be a huge service that is now coming within reach. However, synthesizing the growing body of DNA sequence data in the public domain and disseminating the trees to a diverse audience are often not straightforward due to numerous informatics barriers. While big synthetic plant phylogenies are being built, they remain static and become quickly outdated as new data are published and tree-building methods improve. Moreover, the body of existing phylogenetic evidence is hard to navigate and access for non-experts. We propose that our community of botanists, tree builders, and informaticians should converge on a modular framework for data integration and phylogenetic analysis, allowing easy collaboration, updating, data sourcing and flexible analyses. With support from major institutions, this pipeline should be re-run at regular intervals, storing trees and their metadata long-term. Providing the trees to a diverse global audience through user-friendly front ends and application development interfaces should also be a priority. Interactive interfaces could be used to solicit user feedback and thus improve data quality and to coordinate the generation of new data. We conclude by outlining a number of steps that we suggest the scientific community should take to achieve global phylogenetic synthesis.
Collapse
Affiliation(s)
- Wolf L Eiserhardt
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
- Department of Bioscience, Aarhus University, Ny Munkegade 116, 8000, Aarhus C, Denmark
| | - Alexandre Antonelli
- Gothenburg Global Biodiversity Centre, Box 461, 405 30, Gothenburg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, 405 30, Gothenburg, Sweden
- Gothenburg Botanical Garden, Carl Skottsbergs Gata 22B, SE-413 19, Gothenburg, Sweden
| | - Dominic J Bennett
- Gothenburg Global Biodiversity Centre, Box 461, 405 30, Gothenburg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, 405 30, Gothenburg, Sweden
- Gothenburg Botanical Garden, Carl Skottsbergs Gata 22B, SE-413 19, Gothenburg, Sweden
| | | | | | | | - Brian J Enquist
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
- The Santa Fe Institute, Santa Fe, NM, 87501, USA
| | - Félix Forest
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Jan T Kim
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Alexey M Kozlov
- Scientific Computing Group, Heidelberg Institute for Theoretical Studies, 69118, Heidelberg, Germany
| | - Ilia J Leitch
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Brian S Maitner
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, University of California, San Diego, San Diego, CA, 92093, USA
| | - William H Piel
- Yale-NUS College, 16 College Avenue West, Singapore, 138527, Republic of Singapore
| | | | - Lisa Pokorny
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Carsten Rahbek
- Center for Macroecology, Evolution and Climate, University of Copenhagen, Universitetsparken 15, DK-2100, Copenhagen O, Denmark
- Imperial College London, Silwood Park, Buckhurst Road, Ascot, Berkshire, SL5 7PY, UK
| | - Brody Sandel
- Department of Biology, Santa Clara University, Santa Clara, CA, 95053, USA
| | - Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Alexandros Stamatakis
- Scientific Computing Group, Heidelberg Institute for Theoretical Studies, 69118, Heidelberg, Germany
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, 76128, Karlsruhe, Germany
| | - Rutger A Vos
- Naturalis Biodiversity Center, P.O. Box 9517, 2300RA, Leiden, The Netherlands
- Institute of Biology Leiden, P.O. Box 9505, 2300RA, Leiden, The Netherlands
| | - Tandy Warnow
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | | |
Collapse
|