1
|
Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol 2016; 33:1870-4. [PMID: 27004904 DOI: 10.1093/molbev/msw054] [Citation(s) in RCA: 28563] [Impact Index Per Article: 3173.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We present the latest version of the Molecular Evolutionary Genetics Analysis (Mega) software, which contains many sophisticated methods and tools for phylogenomics and phylomedicine. In this major upgrade, Mega has been optimized for use on 64-bit computing systems for analyzing larger datasets. Researchers can now explore and analyze tens of thousands of sequences in Mega The new version also provides an advanced wizard for building timetrees and includes a new functionality to automatically predict gene duplication events in gene family trees. The 64-bit Mega is made available in two interfaces: graphical and command line. The graphical user interface (GUI) is a native Microsoft Windows application that can also be used on Mac OS X. The command line Mega is available as native applications for Windows, Linux, and Mac OS X. They are intended for use in high-throughput and scripted analysis. Both versions are available from www.megasoftware.net free of charge.
Collapse
|
Research Support, Non-U.S. Gov't |
9 |
28563 |
2
|
Hedges SB, Marin J, Suleski M, Paymer M, Kumar S. Tree of life reveals clock-like speciation and diversification. Mol Biol Evol 2015; 32:835-45. [PMID: 25739733 PMCID: PMC4379413 DOI: 10.1093/molbev/msv037] [Citation(s) in RCA: 659] [Impact Index Per Article: 65.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Genomic data are rapidly resolving the tree of living species calibrated to time, the timetree of life, which will provide a framework for research in diverse fields of science. Previous analyses of taxonomically restricted timetrees have found a decline in the rate of diversification in many groups of organisms, often attributed to ecological interactions among species. Here, we have synthesized a global timetree of life from 2,274 studies representing 50,632 species and examined the pattern and rate of diversification as well as the timing of speciation. We found that species diversity has been mostly expanding overall and in many smaller groups of species, and that the rate of diversification in eukaryotes has been mostly constant. We also identified, and avoided, potential biases that may have influenced previous analyses of diversification including low levels of taxon sampling, small clade size, and the inclusion of stem branches in clade analyses. We found consistency in time-to-speciation among plants and animals, ∼2 My, as measured by intervals of crown and stem species times. Together, this clock-like change at different levels suggests that speciation and diversification are processes dominated by random events and that adaptive change is largely a separate process.
Collapse
|
Research Support, U.S. Gov't, Non-P.H.S. |
10 |
659 |
3
|
Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, Stecher G, Hedges SB. TimeTree 5: An Expanded Resource for Species Divergence Times. Mol Biol Evol 2022; 39:msac174. [PMID: 35932227 PMCID: PMC9400175 DOI: 10.1093/molbev/msac174] [Citation(s) in RCA: 516] [Impact Index Per Article: 172.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2022] [Revised: 07/28/2022] [Accepted: 08/03/2022] [Indexed: 12/04/2022] Open
Abstract
We present the fifth edition of the TimeTree of Life resource (TToL5), a product of the timetree of life project that aims to synthesize published molecular timetrees and make evolutionary knowledge easily accessible to all. Using the TToL5 web portal, users can retrieve published studies and divergence times between species, the timeline of a species' evolution beginning with the origin of life, and the timetree for a given evolutionary group at the desired taxonomic rank. TToL5 contains divergence time information on 137,306 species, 41% more than the previous edition. The TToL5 web interface is now ADA-compliant and mobile-friendly, a result of comprehensive source code refactoring. TToL5 also offers programmatic access to species divergence times and timelines through an application programming interface, which is accessible at timetree.temple.edu/api. TToL5 is publicly available at timetree.org.
Collapse
|
brief-report |
3 |
516 |
4
|
Nyakatura K, Bininda-Emonds ORP. Updating the evolutionary history of Carnivora (Mammalia): a new species-level supertree complete with divergence time estimates. BMC Biol 2012; 10:12. [PMID: 22369503 PMCID: PMC3307490 DOI: 10.1186/1741-7007-10-12] [Citation(s) in RCA: 247] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2011] [Accepted: 02/27/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Although it has proven to be an important foundation for investigations of carnivoran ecology, biology and evolution, the complete species-level supertree for Carnivora of Bininda-Emonds et al. is showing its age. Additional, largely molecular sequence data are now available for many species and the advancement of computer technology means that many of the limitations of the original analysis can now be avoided. We therefore sought to provide an updated estimate of the phylogenetic relationships within all extant Carnivora, again using supertree analysis to be able to analyze as much of the global phylogenetic database for the group as possible. RESULTS In total, 188 source trees were combined, representing 114 trees from the literature together with 74 newly constructed gene trees derived from nearly 45,000 bp of sequence data from GenBank. The greater availability of sequence data means that the new supertree is almost completely resolved and also better reflects current phylogenetic opinion (for example, supporting a monophyletic Mephitidae, Eupleridae and Prionodontidae; placing Nandinia binotata as sister to the remaining Feliformia). Following an initial rapid radiation, diversification rate analyses indicate a downturn in the net speciation rate within the past three million years as well as a possible increase some 18.0 million years ago; numerous diversification rate shifts within the order were also identified. CONCLUSIONS Together, the two carnivore supertrees remain the only complete phylogenetic estimates for all extant species and the new supertree, like the old one, will form a key tool in helping us to further understand the biology of this charismatic group of carnivores.
Collapse
|
research-article |
13 |
247 |
5
|
Kenrick P, Wellman CH, Schneider H, Edgecombe GD. A timeline for terrestrialization: consequences for the carbon cycle in the Palaeozoic. Philos Trans R Soc Lond B Biol Sci 2012; 367:519-36. [PMID: 22232764 PMCID: PMC3248713 DOI: 10.1098/rstb.2011.0271] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The geochemical carbon cycle is strongly influenced by life on land, principally through the effects of carbon sequestration and the weathering of calcium and magnesium silicates in surface rocks and soils. Knowing the time of origin of land plants and animals and also of key organ systems (e.g. plant vasculature, roots, wood) is crucial to understand the development of the carbon cycle and its effects on other Earth systems. Here, we compare evidence from fossils with calibrated molecular phylogenetic trees (timetrees) of living plants and arthropods. We show that different perspectives conflict in terms of the relative timing of events, the organisms involved and the pattern of diversification of various groups. Focusing on the fossil record, we highlight a number of key biases that underpin some of these conflicts, the most pervasive and far-reaching being the extent and nature of major facies changes in the rock record. These effects probably mask an earlier origin of life on land than is evident from certain classes of fossil data. If correct, this would have major implications in understanding the carbon cycle during the Early Palaeozoic.
Collapse
|
Review |
13 |
104 |
6
|
Shen XX, Liang D, Chen MY, Mao RL, Wake DB, Zhang P. Enlarged Multilocus Data set Provides Surprisingly Younger Time of Origin for the Plethodontidae, the Largest Family of Salamanders. Syst Biol 2015; 65:66-81. [PMID: 26385618 DOI: 10.1093/sysbio/syv061] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2014] [Accepted: 08/15/2015] [Indexed: 11/14/2022] Open
Abstract
Deep phylogenetic relationships of the largest salamander family Plethodontidae have been difficult to resolve, probably reflecting a rapid diversification early in their evolutionary history. Here, data from 50 independent nuclear markers (total 48,582 bp) are used to reconstruct the phylogeny and divergence times for plethodontid salamanders, using both concatenation and coalescence-based species tree analyses. Our results robustly resolve the position of the enigmatic eastern North American four-toed salamander (Hemidactylium) as the sister taxon of Batrachoseps + Tribe Bolitoglossini, thus settling a long-standing question. Furthermore, we statistically reject sister taxon status of Karsenia and Hydromantes, the only plethodontids to occur outside the Americas, leading us to new biogeographic hypotheses. Contrary to previous long-standing arguments that plethodontid salamanders are an old lineage originating in the Cretaceous (more than 90 Ma), our analyses lead to the hypothesis that these salamanders are much younger, arising close to the K-T boundary (~66 Ma). These time estimates are highly stable using alternative calibration schemes and dating methods. Our data simulation highlights the potential risk of making strong arguments about phylogenetic timing based on inferences from a handful of nuclear genes, a common practice. Based on the newly obtained timetree and ancestral area reconstruction results, we argue that (i) the classic "Out of Appalachia" hypothesis of plethodontid origins is problematic; (ii) the common ancestor of extant plethodontids may have originated in northwestern North America in the early Paleocene; (iii) origins of Eurasian plethodontids likely result from two separate dispersal events from western North America via Beringia in the late Eocene (~42 Ma) and the early Miocene (~23 Ma), respectively.
Collapse
|
Research Support, Non-U.S. Gov't |
10 |
56 |
7
|
Marin J, Battistuzzi FU, Brown AC, Hedges SB. The Timetree of Prokaryotes: New Insights into Their Evolution and Speciation. Mol Biol Evol 2017; 34:437-446. [PMID: 27965376 DOI: 10.1093/molbev/msw245] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The increasing size of timetrees in recent years has led to a focus on diversification analyses to better understand patterns of macroevolution. Thus far, nearly all studies have been conducted with eukaryotes primarily because phylogenies have been more difficult to reconstruct and calibrate to geologic time in prokaryotes. Here, we have estimated a timetree of 11,784 'species' of prokaryotes and explored their pattern of diversification. We used data from the small subunit ribosomal RNA along with an evolutionary framework from previous multi-gene studies to produce three alternative timetrees. For each timetree we surprisingly found a constant net diversification rate derived from an exponential increase of lineages and showing no evidence of saturation (rate decline), the same pattern found previously in eukaryotes. The implication is that prokaryote diversification as a whole is the result of the random splitting of lineages and is neither limited by existing diversity (filled niches) nor responsive in any major way to environmental changes.
Collapse
|
Research Support, U.S. Gov't, Non-P.H.S. |
8 |
40 |
8
|
Filipski A, Murillo O, Freydenzon A, Tamura K, Kumar S. Prospects for building large timetrees using molecular data with incomplete gene coverage among species. Mol Biol Evol 2014; 31:2542-50. [PMID: 24974376 DOI: 10.1093/molbev/msu200] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Scientists are assembling sequence data sets from increasing numbers of species and genes to build comprehensive timetrees. However, data are often unavailable for some species and gene combinations, and the proportion of missing data is often large for data sets containing many genes and species. Surprisingly, there has not been a systematic analysis of the effect of the degree of sparseness of the species-gene matrix on the accuracy of divergence time estimates. Here, we present results from computer simulations and empirical data analyses to quantify the impact of missing gene data on divergence time estimation in large phylogenies. We found that estimates of divergence times were robust even when sequences from a majority of genes for most of the species were absent. From the analysis of such extremely sparse data sets, we found that the most egregious errors occurred for nodes in the tree that had no common genes for any pair of species in the immediate descendant clades of the node in question. These problematic nodes can be easily detected prior to computational analyses based only on the input sequence alignment and the tree topology. We conclude that it is best to use larger alignments, because adding both genes and species to the alignment augments the number of genes available for estimating divergence events deep in the tree and improves their time estimates.
Collapse
|
Research Support, U.S. Gov't, Non-P.H.S. |
11 |
35 |
9
|
Marshall CR. Using the Fossil Record to Evaluate Timetree Timescales. Front Genet 2019; 10:1049. [PMID: 31803226 PMCID: PMC6871265 DOI: 10.3389/fgene.2019.01049] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 09/30/2019] [Indexed: 12/11/2022] Open
Abstract
The fossil and geologic records provide the primary data used to established absolute timescales for timetrees. For the paleontological evaluation of proposed timetree timescales, and for node-based methods for constructing timetrees, the fossil record is used to bracket divergence times. Minimum brackets (minimum ages) can be established robustly using well-dated fossils that can be reliably assigned to lineages based on positive morphological evidence. Maximum brackets are much harder to establish, largely because it is difficult to establish definitive evidence that the absence of a taxon in the fossil record is real and not just due to the incompleteness of the fossil and rock records. Five primary methods have been developed to estimate maximum age brackets, each of which is discussed. The fact that the fossilization potential of a group typically decreases the closer one approaches its time of origin increases the challenge of estimating maximum age brackets. Additional complications arise: 1) because fossil data actually bracket the time of origin of the first relevant fossilizable morphology (apomorphy), not the divergence time itself; 2) due to the phylogenetic uncertainty in the placement of fossils; 3) because of idiosyncratic temporal and geographic gaps in the rock and fossil records; and 4) if the preservation potential of a group changed significantly during its history. In contrast, uncertainties in the absolute ages of fossils are typically relatively unimportant, even though the vast majority of fossil cannot be dated directly. These issues and relevant quantitative methods are reviewed, and their relative magnitudes assessed, which typically correlate with the age of the group, its geographic range, and species richness.
Collapse
|
Review |
6 |
33 |
10
|
Siu-Ting K, Torres-Sánchez M, San Mauro D, Wilcockson D, Wilkinson M, Pisani D, O'Connell MJ, Creevey CJ. Inadvertent Paralog Inclusion Drives Artifactual Topologies and Timetree Estimates in Phylogenomics. Mol Biol Evol 2019; 36:1344-1356. [PMID: 30903171 PMCID: PMC6526904 DOI: 10.1093/molbev/msz067] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Increasingly, large phylogenomic data sets include transcriptomic data from nonmodel organisms. This not only has allowed controversial and unexplored evolutionary relationships in the tree of life to be addressed but also increases the risk of inadvertent inclusion of paralogs in the analysis. Although this may be expected to result in decreased phylogenetic support, it is not clear if it could also drive highly supported artifactual relationships. Many groups, including the hyperdiverse Lissamphibia, are especially susceptible to these issues due to ancient gene duplication events and small numbers of sequenced genomes and because transcriptomes are increasingly applied to resolve historically conflicting taxonomic hypotheses. We tested the potential impact of paralog inclusion on the topologies and timetree estimates of the Lissamphibia using published and de novo sequencing data including 18 amphibian species, from which 2,656 single-copy gene families were identified. A novel paralog filtering approach resulted in four differently curated data sets, which were used for phylogenetic reconstructions using Bayesian inference, maximum likelihood, and quartet-based supertrees. We found that paralogs drive strongly supported conflicting hypotheses within the Lissamphibia (Batrachia and Procera) and older divergence time estimates even within groups where no variation in topology was observed. All investigated methods, except Bayesian inference with the CAT-GTR model, were found to be sensitive to paralogs, but with filtering convergence to the same answer (Batrachia) was observed. This is the first large-scale study to address the impact of orthology selection using transcriptomic data and emphasizes the importance of quality over quantity particularly for understanding relationships of poorly sampled taxa.
Collapse
|
Research Support, Non-U.S. Gov't |
6 |
33 |
11
|
Wray GA. Molecular clocks and the early evolution of metazoan nervous systems. Philos Trans R Soc Lond B Biol Sci 2016; 370:rstb.2015.0046. [PMID: 26554040 DOI: 10.1098/rstb.2015.0046] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The timing of early animal evolution remains poorly resolved, yet remains critical for understanding nervous system evolution. Methods for estimating divergence times from sequence data have improved considerably, providing a more refined understanding of key divergences. The best molecular estimates point to the origin of metazoans and bilaterians tens to hundreds of millions of years earlier than their first appearances in the fossil record. Both the molecular and fossil records are compatible, however, with the possibility of tiny, unskeletonized, low energy budget animals during the Proterozoic that had planktonic, benthic, or meiofaunal lifestyles. Such animals would likely have had relatively simple nervous systems equipped primarily to detect food, avoid inhospitable environments and locate mates. The appearance of the first macropredators during the Cambrian would have changed the selective landscape dramatically, likely driving the evolution of complex sense organs, sophisticated sensory processing systems, and diverse effector systems involved in capturing prey and avoiding predation.
Collapse
|
Review |
9 |
31 |
12
|
Liu Z, Zhang Y, Xu M, Li X, Zhang Z. Distribution of hepatitis B virus genotypes and subgenotypes: A meta-analysis. Medicine (Baltimore) 2021; 100:e27941. [PMID: 34918643 PMCID: PMC8678021 DOI: 10.1097/md.0000000000027941] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 11/04/2021] [Indexed: 02/07/2023] Open
Abstract
Hepatitis B virus (HBV) genotypes and subgenotypes have distinct geographical distributions and influence a number of clinical disease features and responses to treatment. There are many reports on the distribution of HBV genotypes, but great differences are present between studies. What's more, a meta-analysis of HBV genotype- and subgenotype-distribution by country is lacking.A comprehensive literature search was performed in PubMed and a systematic search of full-length HBV sequences and S gene sequences was conducted in the GenBank database. HBV genotypes were checked and subgenotypes were determined by phylogenetic comparison of full-length HBV sequences or S gene sequences. STATA 12.0 was used for the analysis for countries with multiple datasets. BEAST 2.5.2 was used for Bayesian phylogenetic analysis to infer the evolutionary time scales of HBV.This study includes 309 datasets from 110 countries, including 188 relevant studies, 58 full-length gene datasets, and 63 S gene datasets. The meta-analysis was performed on 274 datasets from 75 countries. The distribution of genotypes is more detailed than those described by previous studies. While the overall genotype distribution is similar to that reported in previous studies, some notable aspects were different. The main genotypes present in south-eastern Africa, North Africa, and West Africa are genotypes A, D, and E, respectively. Genotypes G and H are mainly distributed in Mexico. Genotype F is mainly distributed in central and South America, but genotypes A and D are also common in Brazil, Cuba, and Haiti.This study provides a more accurate description of the distribution of HBV genotypes and subgenotypes in different countries and suggests that the differences in genotype distribution may be related to ethnicity and human migration.
Collapse
|
Meta-Analysis |
4 |
31 |
13
|
Mahony S, Foley NM, Biju SD, Teeling EC. Evolutionary History of the Asian Horned Frogs (Megophryinae): Integrative Approaches to Timetree Dating in the Absence of a Fossil Record. Mol Biol Evol 2017; 34:744-771. [PMID: 28100792 DOI: 10.1093/molbev/msw267] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Molecular dating studies typically need fossils to calibrate the analyses. Unfortunately, the fossil record is extremely poor or presently nonexistent for many species groups, rendering such dating analysis difficult. One such group is the Asian horned frogs (Megophryinae). Sampling all generic nomina, we combined a novel ∼5 kb dataset composed of four nuclear and three mitochondrial gene fragments to produce a robust phylogeny, with an extensive external morphological study to produce a working taxonomy for the group. Expanding the molecular dataset to include out-groups of fossil-represented ancestral anuran families, we compared the priorless RelTime dating method with the widely used prior-based Bayesian timetree method, MCMCtree, utilizing a novel combination of fossil priors for anuran phylogenetic dating. The phylogeny was then subjected to ancestral phylogeographic analyses, and dating estimates were compared with likely biogeographic vicariant events. Phylogenetic analyses demonstrated that previously proposed systematic hypotheses were incorrect due to the paraphyly of genera. Molecular phylogenetic, morphological, and timetree results support the recognition of Megophryinae as a single genus, Megophrys, with a subgenus level classification. Timetree results using RelTime better corresponded with the known fossil record for the out-group anuran tree. For the priorless in-group, it also outperformed MCMCtree when node date estimates were compared with likely influential historical biogeographic events, providing novel insights into the evolutionary history of this pan-Asian anuran group. Given a relatively small molecular dataset, and limited prior knowledge, this study demonstrates that the computationally rapid RelTime dating tool may outperform more popular and complex prior reliant timetree methodologies.
Collapse
|
Research Support, Non-U.S. Gov't |
8 |
27 |
14
|
Powell CLE, Waskin S, Battistuzzi FU. Quantifying the Error of Secondary vs. Distant Primary Calibrations in a Simulated Environment. Front Genet 2020; 11:252. [PMID: 32265987 PMCID: PMC7099002 DOI: 10.3389/fgene.2020.00252] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 03/02/2020] [Indexed: 12/25/2022] Open
Abstract
Using calibrations to obtain absolute divergence times is standard practice in molecular clock studies. While the use of primary (e.g., fossil) calibrations is preferred, this approach can be limiting because of their rarity in fast-growing datasets. Thus, alternatives need to be explored, such as the use of secondary (molecularly-derived) calibrations that can anchor a timetree in a larger number of nodes. However, the use of secondary calibrations has been discouraged in the past because of concerns in the error rates of the node estimates they produce with an apparent high precision. Here, we quantify the amount of errors in estimates produced by the use of secondary calibrations relative to true times and primary calibrations placed on distant nodes. We find that, overall, the inaccuracies in estimates based on secondary calibrations are predictable and mirror errors associated with primary calibrations and their confidence intervals. Additionally, we find comparable error rates in estimated times from secondary calibrations and distant primary calibrations, although the precision of estimates derived from distant primary calibrations is roughly twice as good as that of estimates derived from secondary calibrations. This suggests that increasing dataset size to include primary calibrations may produce divergence times that are about as accurate as those from secondary calibrations, albeit with a higher precision. Overall, our results suggest that secondary calibrations may be useful to explore the parameter space of plausible evolutionary scenarios when compared to time estimates obtained with distant primary calibrations.
Collapse
|
research-article |
5 |
12 |
15
|
Almeida EAB, Bossert S, Danforth BN, Porto DS, Freitas FV, Davis CC, Murray EA, Blaimer BB, Spasojevic T, Ströher PR, Orr MC, Packer L, Brady SG, Kuhlmann M, Branstetter MG, Pie MR. The evolutionary history of bees in time and space. Curr Biol 2023; 33:3409-3422.e6. [PMID: 37506702 DOI: 10.1016/j.cub.2023.07.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 07/04/2023] [Accepted: 07/04/2023] [Indexed: 07/30/2023]
Abstract
Bees are the most significant pollinators of flowering plants. This partnership began ca. 120 million years ago, but the uncertainty of how and when bees spread across the planet has greatly obscured investigations of this key mutualism. We present a novel analysis of bee biogeography using extensive new genomic and fossil data to demonstrate that bees originated in Western Gondwana (Africa and South America). Bees likely originated in the Early Cretaceous, shortly before the breakup of Western Gondwana, and the early evolution of any major bee lineage is associated with either the South American or African land masses. Subsequently, bees colonized northern continents via a complex history of vicariance and dispersal. The notable early absences from large landmasses, particularly in Australia and India, have important implications for understanding the assembly of local floras and diverse modes of pollination. How bees spread around the world from their hypothesized Southern Hemisphere origin parallels the histories of numerous flowering plant clades, providing an essential step to studying the evolution of angiosperm pollination syndromes in space and time.
Collapse
|
|
2 |
6 |
16
|
Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol 2016; 33:1870-1874. [PMID: 27004904 PMCID: PMC8210823 DOI: 10.1093/molbev/msw054%20] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/14/2024] Open
Abstract
We present the latest version of the Molecular Evolutionary Genetics Analysis (Mega) software, which contains many sophisticated methods and tools for phylogenomics and phylomedicine. In this major upgrade, Mega has been optimized for use on 64-bit computing systems for analyzing larger datasets. Researchers can now explore and analyze tens of thousands of sequences in Mega The new version also provides an advanced wizard for building timetrees and includes a new functionality to automatically predict gene duplication events in gene family trees. The 64-bit Mega is made available in two interfaces: graphical and command line. The graphical user interface (GUI) is a native Microsoft Windows application that can also be used on Mac OS X. The command line Mega is available as native applications for Windows, Linux, and Mac OS X. They are intended for use in high-throughput and scripted analysis. Both versions are available from www.megasoftware.net free of charge.
Collapse
|
research-article |
9 |
6 |
17
|
Marjanović D. The Making of Calibration Sausage Exemplified by Recalibrating the Transcriptomic Timetree of Jawed Vertebrates. Front Genet 2021; 12:521693. [PMID: 34054911 PMCID: PMC8149952 DOI: 10.3389/fgene.2021.521693] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 03/22/2021] [Indexed: 01/20/2023] Open
Abstract
Molecular divergence dating has the potential to overcome the incompleteness of the fossil record in inferring when cladogenetic events (splits, divergences) happened, but needs to be calibrated by the fossil record. Ideally but unrealistically, this would require practitioners to be specialists in molecular evolution, in the phylogeny and the fossil record of all sampled taxa, and in the chronostratigraphy of the sites the fossils were found in. Paleontologists have therefore tried to help by publishing compendia of recommended calibrations, and molecular biologists unfamiliar with the fossil record have made heavy use of such works (in addition to using scattered primary sources and copying from each other). Using a recent example of a large node-dated timetree inferred from molecular data, I reevaluate all 30 calibrations in detail, present the current state of knowledge on them with its various uncertainties, rerun the dating analysis, and conclude that calibration dates cannot be taken from published compendia or other secondary or tertiary sources without risking strong distortions to the results, because all such sources become outdated faster than they are published: 50 of the (primary) sources I cite to constrain calibrations were published in 2019, half of the total of 280 after mid-2016, and 90% after mid-2005. It follows that the present work cannot serve as such a compendium either; in the slightly longer term, it can only highlight known and overlooked problems. Future authors will need to solve each of these problems anew through a thorough search of the primary paleobiological and chronostratigraphic literature on each calibration date every time they infer a new timetree, and that literature is not optimized for that task, but largely has other objectives.
Collapse
|
research-article |
4 |
3 |
18
|
Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol 2016. [PMID: 27004904 DOI: 10.1093/molbev/msw054+] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We present the latest version of the Molecular Evolutionary Genetics Analysis (Mega) software, which contains many sophisticated methods and tools for phylogenomics and phylomedicine. In this major upgrade, Mega has been optimized for use on 64-bit computing systems for analyzing larger datasets. Researchers can now explore and analyze tens of thousands of sequences in Mega The new version also provides an advanced wizard for building timetrees and includes a new functionality to automatically predict gene duplication events in gene family trees. The 64-bit Mega is made available in two interfaces: graphical and command line. The graphical user interface (GUI) is a native Microsoft Windows application that can also be used on Mac OS X. The command line Mega is available as native applications for Windows, Linux, and Mac OS X. They are intended for use in high-throughput and scripted analysis. Both versions are available from www.megasoftware.net free of charge.
Collapse
|
|
9 |
1 |
19
|
Wang S, Luo H. Dating the bacterial tree of life based on ancient symbiosis. Syst Biol 2025:syae071. [PMID: 39847448 DOI: 10.1093/sysbio/syae071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Indexed: 01/24/2025] Open
Abstract
Obtaining a timescale for bacterial evolution is crucial to understand early life evolution but is difficult owing to the scarcity of bacterial fossils. Here, we introduce multiple new time constraints to calibrate bacterial evolution based on ancient symbiosis. This idea is implemented using a bacterial tree constructed with genes found in the mitochondrial lineages phylogenetically embedded within Proteobacteria. The expanded mitochondria-bacterial tree allows the node age constraints of eukaryotes established by their abundant fossils to be propagated to ancient co-evolving bacterial symbionts and across the bacterial tree of life. Importantly, we formulate a new probabilistic framework that considers uncertainty in inference of the ancestral lifestyle of modern symbionts to apply 19 relative time constraints (RTC) each informed by host-symbiont association to constrain bacterial symbionts no older than their eukaryotic host. Moreover, we develop an approach to incorporating substitution mixture models that better accommodate substitutional saturation and compositional heterogeneity for dating deep phylogenies. Our analysis estimates that the last bacterial common ancestor (LBCA) occurred approximately 4.0-3.5 billion years ago (Ga), followed by rapid divergence of major bacterial clades. It is generally robust to alternative root ages, root positions, tree topologies, fossil ages, ancestral lifestyle reconstruction, gene sets, among other factors. The obtained timetree serves as a foundation for testing hypotheses regarding bacterial diversification and its correlation with geobiological events across different timescales.
Collapse
|
|
1 |
|
20
|
Barba-Montoya J, Sharma S, Kumar S. Molecular timetrees using relaxed clocks and uncertain phylogenies. FRONTIERS IN BIOINFORMATICS 2023; 3:1225807. [PMID: 37600967 PMCID: PMC10435864 DOI: 10.3389/fbinf.2023.1225807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Accepted: 07/21/2023] [Indexed: 08/22/2023] Open
Abstract
A common practice in molecular systematics is to infer phylogeny and then scale it to time by using a relaxed clock method and calibrations. This sequential analysis practice ignores the effect of phylogenetic uncertainty on divergence time estimates and their confidence/credibility intervals. An alternative is to infer phylogeny and times jointly to incorporate phylogenetic errors into molecular dating. We compared the performance of these two alternatives in reconstructing evolutionary timetrees using computer-simulated and empirical datasets. We found sequential and joint analyses to produce similar divergence times and phylogenetic relationships, except for some nodes in particular cases. The joint inference performed better when the phylogeny was not well resolved, situations in which the joint inference should be preferred. However, joint inference can be infeasible for large datasets because available Bayesian methods are computationally burdensome. We present an alternative approach for joint inference that combines the bag of little bootstraps, maximum likelihood, and RelTime approaches for simultaneously inferring evolutionary relationships, divergence times, and confidence intervals, incorporating phylogeny uncertainty. The new method alleviates the high computational burden imposed by Bayesian methods while achieving a similar result.
Collapse
|
methods-article |
2 |
|
21
|
Pi HW, Chiang YR, Li WH. Mapping Geological Events and Nitrogen Fixation Evolution Onto the Timetree of the Evolution of Nitrogen-Fixation Genes. Mol Biol Evol 2024; 41:msae023. [PMID: 38319744 PMCID: PMC10881105 DOI: 10.1093/molbev/msae023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 11/17/2023] [Accepted: 11/21/2023] [Indexed: 02/08/2024] Open
Abstract
Nitrogen is essential for all organisms, but biological nitrogen fixation (BNF) occurs only in a small fraction of prokaryotes. Previous studies divided nitrogenase-gene-carrying prokaryotes into Groups I to IV and provided evidence that BNF first evolved in bacteria. This study constructed a timetree of the evolution of nitrogen-fixation genes and estimated that archaea evolved BNF much later than bacteria and that nitrogen-fixing cyanobacteria evolved later than 1,900 MYA, considerably younger than the previous estimate of 2,200 MYA. Moreover, Groups III and II/I diverged ∼2,280 MYA, after the Kenorland supercontinent breakup (∼2,500-2,100 MYA) and the Great Oxidation Event (∼2,400-2,100 MYA); Groups III and Vnf/Anf diverged ∼2,086 MYA, after the Yarrabubba impact (∼2,229 MYA); and Groups II and I diverged ∼1,920 MYA, after the Vredefort impact (∼2,023 MYA). In summary, this study provided a timescale of BNF events and discussed the possible effects of geological events on BNF evolution.
Collapse
|
research-article |
1 |
|
22
|
Chaves JCM, Hepp F, Schrago CG, Mello B. A time-calibrated phylogeny of the diversification of Holoadeninae frogs. FRONTIERS IN BIOINFORMATICS 2024; 4:1441373. [PMID: 39415894 PMCID: PMC11480671 DOI: 10.3389/fbinf.2024.1441373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Accepted: 09/19/2024] [Indexed: 10/19/2024] Open
Abstract
The phylogeny of the major lineages of Amphibia has received significant attention in recent years, although evolutionary relationships within families remain largely neglected. One such overlooked group is the subfamily Holoadeninae, comprising 73 species across nine genera and characterized by a disjunct geographical distribution. The lack of a fossil record for this subfamily hampers the formulation of a comprehensive evolutionary hypothesis for their diversification. Aiming to fill this gap, we inferred the phylogenetic relationships and divergence times for Holoadeninae using molecular data and calibration information derived from the fossil record of Neobatrachia. Our inferred phylogeny confirmed most genus-level associations, and molecular dating analysis placed the origin of Holoadeninae in the Eocene, with subsequent splits also occurring during this period. The climatic and geological events that occurred during the Oligocene-Miocene transition were crucial to the dynamic biogeographical history of the subfamily. However, the wide highest posterior density intervals in our divergence time estimates are primarily attributed to the absence of Holoadeninae fossil information and, secondarily, to the limited number of sampled nucleotide sites.
Collapse
|
brief-report |
1 |
|
23
|
Hardies SC, Cho BC, Jang GI, Wang Z, Hwang CY. Identification of Structural and Morphogenesis Genes of Sulfitobacter Phage ΦGT1 and Placement within the Evolutionary History of the Podoviruses. Viruses 2023; 15:1475. [PMID: 37515163 PMCID: PMC10386132 DOI: 10.3390/v15071475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 06/23/2023] [Accepted: 06/28/2023] [Indexed: 07/30/2023] Open
Abstract
ΦGT1 is a lytic podovirus of an alphaproteobacterial Sulfitobacter species, with few closely matching sequences among characterized phages, thus defying a useful description by simple sequence clustering methods. The history of the ΦGT1 core structure module was reconstructed using timetrees, including numerous related prospective prophages, to flesh out the evolutionary lineages spanning from the origin of the ejectosomal podovirus >3.2 Gya to the present genes of ΦGT1 and its closest relatives. A peculiarity of the ΦGT1 structural proteome is that it contains two paralogous tubular tail A (tubeA) proteins. The origin of the dual tubeA arrangement was traced to a recombination between two more ancient podoviral lineages occurring ~0.7 Gya in the alphaproteobacterial order Rhizobiales. Descendants of the ancestral dual A recombinant were tracked forward forming both temperate and lytic phage clusters and exhibiting both vertical transmission with patchy persistence and horizontal transfer with respect to host taxonomy. The two ancestral lineages were traced backward, making junctions with a major metagenomic podoviral family, the LUZ24-like gammaproteobacterial phages, and Myxococcal phage Mx8, and finally joining near the origin of podoviruses with P22. With these most conservative among phage genes, deviations from uncomplicated vertical and nonrecombinant descent are numerous but countable. The use of timetrees allowed conceptualization of the phage's evolution in the context of a sequence of ancestors spanning the time of life on Earth.
Collapse
|
|
2 |
|
24
|
Yuan H, Dickson ED, Martinez Q, Arnold P, Asher RJ. The origin and evolution of shrews (Soricidae, Mammalia). Proc Biol Sci 2024; 291:20241856. [PMID: 39689883 DOI: 10.1098/rspb.2024.1856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 10/11/2024] [Accepted: 10/22/2024] [Indexed: 12/19/2024] Open
Abstract
Shrews are among the most speciose of mammalian clades, but their evolutionary history is poorly understood. Their fossil record is fragmentary and even the anatomy of living groups is not well documented. Here, we incorporate the oldest, most complete fossil shrew yet known into the first phylogenetic analysis of the group to include molecular, morphological and temporal data. Our study reveals previously unknown diversity among total- and crown-group soricids. This includes a novel element of the mammalian skeleton: a robust, needle-like sesamoid extending cranially from the second thoracic neural arch in Myosoricini, comparable in length to these species' humeri. Additionally, 'red-toothed' shrews have an unusually elongate basicranium, and 'white-toothed' shrews probably evolved from a common ancestor with dental pigmentation. The fossil Domnina and crown soricids have a double-jaw articulation and incomplete zygomatic arch, but unlike nearly all crown species, Domnina has open vomeronasal canals and a tympanic process of the basisphenoid. Domnina and other heterosoricids are phylogenetically outside crown Soricidae. The oldest, well-supported total-group soricoids are North American, not Asian, and Soricidae probably originated during the Palaeocene or early Eocene. The diverse mammalian genus Crocidura originated and began to diversify during the Miocene.
Collapse
|
|
1 |
|