1
|
Chen Q, Deng M, Dai X, Wang W, Wang X, Chen LS, Huang GH. Phylogenomic data exploration with increased sampling provides new insights into the higher-level relationships of butterflies and moths (Lepidoptera). Mol Phylogenet Evol 2024; 197:108113. [PMID: 38796071 DOI: 10.1016/j.ympev.2024.108113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 05/13/2024] [Accepted: 05/22/2024] [Indexed: 05/28/2024]
Abstract
A robust and stable phylogenetic framework is a fundamental goal of evolutionary biology. As the third largest insect order in the world following Coleoptera and Diptera, Lepidoptera (butterflies and moths) play a central role in almost every terrestrial ecosystem as indicators of environmental change and serve as important models for biologists exploring questions related to ecology and evolutionary biology. However, for such a charismatic insect group, the higher-level phylogenetic relationships among its superfamilies are still poorly resolved. Compared to earlier phylogenomic studies, we increased taxon sampling among Lepidoptera (37 superfamilies and 68 families containing 263 taxa) and acquired a series of large amino-acid datasets from 69,680 to 400,330 for phylogenomic reconstructions. Using these datasets, we explored the effect of different taxon sampling with significant increases in the number of included genes on tree topology by considering a series of systematic errors using maximum-likelihood (ML) and Bayesian inference (BI) methods. Moreover, we also tested the effectiveness in topology robustness among the three ML-based models. The results showed that taxon sampling is an important determinant in tree robustness of accurate lepidopteran phylogenetic estimation. Long-branch attraction (LBA) caused by site-wise heterogeneity is a significant source of bias giving rise to unstable positions of ditrysian groups in phylogenomic reconstruction. Phylogenetic inference showed the most comprehensive framework to reveal the relationships among lepidopteran superfamilies, and presented some newly relationships with strong supports (Papilionoidea was sister to Gelechioidea and Immoidea was sister to Galacticoidea, respectively), but limited by taxon sampling, the relationships within the species-rich and relatively rapid radiation Ditrysia and especially Apoditrysia remain poorly resolved, which need to increase taxon sampling for further phylogenomic reconstruction. The present study demonstrates that taxon sampling is an important determinant for an accurate lepidopteran tree of life and provides some essential insights for future lepidopteran phylogenomic studies.
Collapse
Affiliation(s)
- Qi Chen
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China; Tropical Biodiversity and Bioresource Utilization Laboratory, College of Science, Qiongtai Normal University, Haikou 571127, Hainan, China
| | - Min Deng
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China; Qiannan Polytechnic for Nationality, Duyun 558022, Guizhou, China
| | - Xuan Dai
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China
| | - Wei Wang
- Research Center for Wild Animal and Plant Resource Protection and Utilization, Qiongtai Normal University, Haikou 571127, Hainan, China
| | - Xing Wang
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China; Tropical Biodiversity and Bioresource Utilization Laboratory, College of Science, Qiongtai Normal University, Haikou 571127, Hainan, China.
| | - Liu-Sheng Chen
- Guangdong Provincial Key Laboratory of Silviculture, Protection and Utilization, Guangdong Academy of Forestry, Guangzhou 510520, Guangdong, China.
| | - Guo-Hua Huang
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China.
| |
Collapse
|
2
|
Alhawatema M. DendroSSR: SSRs and sequence alignment as tools for building phylogeny trees. BRAZ J BIOL 2023; 83:e275386. [PMID: 37851777 DOI: 10.1590/1519-6984.275386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Accepted: 08/06/2023] [Indexed: 10/20/2023] Open
Abstract
This study introduces a new method to construct phylogenetic trees by combining both of the Simple Sequence Repeats (SSRs) and sequence alignments. The purpose of this work is to present the DendroSSR program and show it via a case study involving diverse Aspergillus species. To show how the DendroSSR program works to resolve complicated species relationships in phylogenetic trees, we employed the Aspergillus species as an example of a research case. The DendroSSR employs a technique containing multiple phases beginning with, detecting SSRs, computing SSRs similarities, sequences alignment, building a distance matrix based on SSRs similarity and sequences alignments, and then hierarchical clustering, and presenting the findings in a dendrogram. Sometimes sequence alignments alone may not give adequate information to generate a phylogenetic tree to resolve complicated species relationships. Therefore, establishing a distance matrix that is formed of addition of SSRs similarity across sequences to the traditional sequence alignment helps the process substantially and resolves the connections of complex species on phylogenetic trees. Additionally, it may be hard to distinguish complex relationships across species when studying conserved sequences, which could lead to an incomplete representation of their evolutionary relationships. These limitations are addressed by DendroSSR, which offers a technique to produce phylogenetic trees by incorporating SSRs similarity across species into the approach of generating phylogenetic trees. As it is known, SSRs are extensively scattered across the genomes of species and exhibit a great variation. Therefore, SSRs may support the knowledge gathered from sequence alignments by providing more information on genetic variation and even evolutionary relationships. The use of DendroSSR analysis might be considered for creating phylogenetic trees as a complementary or secondary strategy among the species under examination in circumstances where traditional phylogenetic analysis fails to clarify the species complex phylogenetic relationships.
Collapse
Affiliation(s)
- M Alhawatema
- Tafila Technical University, Faculty of Science, Department of Applied Biological Science, Tafila, Jordan
| |
Collapse
|
3
|
Mongiardino Koch N, Tilic E, Miller AK, Stiller J, Rouse GW. Confusion will be my epitaph: genome-scale discordance stifles phylogenetic resolution of Holothuroidea. Proc Biol Sci 2023; 290:20230988. [PMID: 37434530 PMCID: PMC10336381 DOI: 10.1098/rspb.2023.0988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 06/12/2023] [Indexed: 07/13/2023] Open
Abstract
Sea cucumbers (Holothuroidea) are a diverse clade of echinoderms found from intertidal waters to the bottom of the deepest oceanic trenches. Their reduced skeletons and limited number of phylogenetically informative traits have long obfuscated morphological classifications. Sanger-sequenced molecular datasets have also failed to constrain the position of major lineages. Noteworthy, topological uncertainty has hindered a resolution for Neoholothuriida, a highly diverse clade of Permo-Triassic age. We perform the first phylogenomic analysis of Holothuroidea, combining existing datasets with 13 novel transcriptomes. Using a highly curated dataset of 1100 orthologues, our efforts recapitulate previous results, struggling to resolve interrelationships among neoholothuriid clades. Three approaches to phylogenetic reconstruction (concatenation under both site-homogeneous and site-heterogeneous models, and coalescent-aware inference) result in alternative resolutions, all of which are recovered with strong support and across a range of datasets filtered for phylogenetic usefulness. We explore this intriguing result using gene-wise log-likelihood scores and attempt to correlate these with a large set of gene properties. While presenting novel ways of exploring and visualizing support for alternative trees, we are unable to discover significant predictors of topological preference, and our efforts fail to favour one topology. Neoholothuriid genomes seem to retain an amalgam of signals derived from multiple phylogenetic histories.
Collapse
Affiliation(s)
| | - Ekin Tilic
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Department of Marine Zoology, Senckenberg Research Institute and Museum, Frankfurt, Germany
| | - Allison K. Miller
- Anatomy Department, University of Otago, Dunedin, Otago, New Zealand
| | - Josefin Stiller
- Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Greg W. Rouse
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
4
|
Pouchon C, Boluda CG. REFMAKER: make your own reference to target nuclear loci in low coverage genome skimming libraries. Phylogenomic application in Sapotaceae. Mol Phylogenet Evol 2023:107826. [PMID: 37257798 DOI: 10.1016/j.ympev.2023.107826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 04/24/2023] [Accepted: 05/25/2023] [Indexed: 06/02/2023]
Abstract
Genome skimming approach is widely used in plant systematics to infer phylogenies mostly from organelle genomes. However, organelles represent only 10% of the produced libraries, and the low coverage associated with these libraries (< 3X) prevents the capture of nuclear sequences, which are not always available in non-model organisms or limited to the ribosomal regions. We developed REFMAKER, a user-friendly pipeline, to create specific sets of nuclear loci that can next be extracted directly from the genome skimming libraries. For this, a catalogue is built from the meta-assembly of each library contigs and cleaned by selecting the nuclear regions and removing duplicates from clustering steps. Libraries are next mapped onto this catalogue and consensus sequences are generated to produce a ready-to-use phylogenetic matrix following different filtering parameters aiming at removing putative errors and paralogous sequences. REFMAKER allowed us to infer a well resolved phylogeny in Capurodendron (Sapotaceae) on 67 nuclear loci from low-coverage libraries (<1X). The resulting phylogeny is concomitant with one previously inferred on 638 nuclear genes from target enrichment libraries. While it remains preliminary because of this low sequencing depth, REFMAKER therefore opens perspectives in phylogenomics by allowing nuclear phylogeny reconstructions with genome skimming datasets.
Collapse
Affiliation(s)
- Charles Pouchon
- Conservatoire et Jardin botaniques de la Ville de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland; PhyloLab, Department of Plant Sciences, Université de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland.
| | - Carlos G Boluda
- Conservatoire et Jardin botaniques de la Ville de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland; PhyloLab, Department of Plant Sciences, Université de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland
| |
Collapse
|
5
|
Dornburg A, Mallik R, Wang Z, Bernal MA, Thompson B, Bruford EA, Nebert DW, Vasiliou V, Yohe LR, Yoder JA, Townsend JP. Placing human gene families into their evolutionary context. Hum Genomics 2022; 16:56. [PMID: 36369063 PMCID: PMC9652883 DOI: 10.1186/s40246-022-00429-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 10/12/2022] [Indexed: 11/13/2022] Open
Abstract
Following the draft sequence of the first human genome over 20 years ago, we have achieved unprecedented insights into the rules governing its evolution, often with direct translational relevance to specific diseases. However, staggering sequence complexity has also challenged the development of a more comprehensive understanding of human genome biology. In this context, interspecific genomic studies between humans and other animals have played a critical role in our efforts to decode human gene families. In this review, we focus on how the rapid surge of genome sequencing of both model and non-model organisms now provides a broader comparative framework poised to empower novel discoveries. We begin with a general overview of how comparative approaches are essential for understanding gene family evolution in the human genome, followed by a discussion of analyses of gene expression. We show how homology can provide insights into the genes and gene families associated with immune response, cancer biology, vision, chemosensation, and metabolism, by revealing similarity in processes among distant species. We then explain methodological tools that provide critical advances and show the limitations of common approaches. We conclude with a discussion of how these investigations position us to gain fundamental insights into the evolution of gene families among living organisms in general. We hope that our review catalyzes additional excitement and research on the emerging field of comparative genomics, while aiding the placement of the human genome into its existentially evolutionary context.
Collapse
Affiliation(s)
- Alex Dornburg
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA.
| | - Rittika Mallik
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
| | - Zheng Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Moisés A Bernal
- Department of Biological Sciences, College of Science and Mathematics, Auburn University, Auburn, AL, USA
| | - Brian Thompson
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Elspeth A Bruford
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Daniel W Nebert
- Department of Environmental Health, Center for Environmental Genetics, University of Cincinnati Medical Center, P.O. Box 670056, Cincinnati, OH, 45267, USA
- Department of Pediatrics and Molecular Developmental Biology, Division of Human Genetics, Cincinnati Children's Hospital, Cincinnati, OH, 45229, USA
| | - Vasilis Vasiliou
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Laurel R Yohe
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
| | - Jeffrey A Yoder
- Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| | - Jeffrey P Townsend
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
| |
Collapse
|
6
|
Rachtman E, Sarmashghi S, Bafna V, Mirarab S. Quantifying the uncertainty of assembly-free genome-wide distance estimates and phylogenetic relationships using subsampling. Cell Syst 2022; 13:817-829.e3. [PMID: 36265468 PMCID: PMC9589918 DOI: 10.1016/j.cels.2022.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 03/14/2022] [Accepted: 06/28/2022] [Indexed: 01/26/2023]
Abstract
Computing distance between two genomes without alignments or even access to assemblies has many downstream analyses. However, alignment-free methods, including in the fast-growing field of genome skimming, are hampered by a significant methodological gap. While accurate methods (many k-mer-based) for assembly-free distance calculation exist, measuring the uncertainty of estimated distances has not been sufficiently studied. In this paper, we show that bootstrapping, the standard non-parametric method of measuring estimator uncertainty, is not accurate for k-mer-based methods that rely on k-mer frequency profiles. Instead, we propose using subsampling (with no replacement) in combination with a correction step to reduce the variance of the inferred distribution. We show that the distribution of distances using our procedure matches the true uncertainty of the estimator. The resulting phylogenetic support values effectively differentiate between correct and incorrect branches and identify controversial branches that change across alignment-free and alignment-based phylogenies reported in the literature.
Collapse
Affiliation(s)
- Eleonora Rachtman
- Bioinformatics and Systems Biology Graduate Program, UC San Diego, San Diego, CA 92093, USA
| | - Shahab Sarmashghi
- Department of Electrical and Computer Engineering, UC San Diego, San Diego, CA 92093, USA
| | - Vineet Bafna
- Department of Computer Science and Engineering, UC San Diego, San Diego, CA 92093, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, UC San Diego, San Diego, CA 92093, USA.
| |
Collapse
|
7
|
Interpreting phylogenetic conflict: Hybridization in the most speciose genus of lichen-forming fungi. Mol Phylogenet Evol 2022; 174:107543. [PMID: 35690378 DOI: 10.1016/j.ympev.2022.107543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 02/06/2022] [Accepted: 05/13/2022] [Indexed: 11/24/2022]
Abstract
While advances in sequencing technologies have been invaluable for understanding evolutionary relationships, increasingly large genomic data sets may result in conflicting evolutionary signals that are often caused by biological processes, including hybridization. Hybridization has been detected in a variety of organisms, influencing evolutionary processes such as generating reproductive barriers and mixing standing genetic variation. Here, we investigate the potential role of hybridization in the diversification of the most speciose genus of lichen-forming fungi, Xanthoparmelia. As Xanthoparmelia is projected to have gone through recent, rapid diversification, this genus is particularly suitable for investigating and interpreting the origins of phylogenomic conflict. Focusing on a clade of Xanthoparmelia largely restricted to the Holarctic region, we used a genome skimming approach to generate 962 single-copy gene regions representing over 2 Mbp of the mycobiont genome. From this genome-scale dataset, we inferred evolutionary relationships using both concatenation and coalescent-based species tree approaches. We also used three independent tests for hybridization. Although different species tree reconstruction methods recovered largely consistent and well-supported trees, there was widespread incongruence among individual gene trees. Despite challenges in differentiating hybridization from ILS in situations of recent rapid radiations, our genome-wide analyses detected multiple potential hybridization events in the Holarctic clade, suggesting one possible source of trait variability in this hyperdiverse genus. This study highlights the value in using a pluralistic approach for characterizing genome-scale conflict, even in groups with well-resolved phylogenies, while highlighting current challenges in detecting the specific impacts of hybridization.
Collapse
|
8
|
Xiong Q, Wan ATY, Liu X, Fung CSH, Xiao X, Malainual N, Hou J, Wang L, Wang M, Yang KY, Cui Y, Leung ELH, Nong W, Shin SK, Au SWN, Jeong KY, Chew FT, Hui JHL, Leung TF, Tungtrongchitr A, Zhong N, Liu Z, Tsui SKW. Comparative Genomics Reveals Insights into the Divergent Evolution of Astigmatic Mites and Household Pest Adaptations. Mol Biol Evol 2022; 39:6582989. [PMID: 35535514 PMCID: PMC9113151 DOI: 10.1093/molbev/msac097] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Highly diversified astigmatic mites comprise many medically important human household pests such as house dust mites causing ∼1–2% of all allergic diseases globally; however, their evolutionary origin and diverse lifestyles including reversible parasitism have not been illustrated at the genomic level, which hampers allergy prevention and our exploration of these household pests. Using six high-quality assembled and annotated genomes, this study not only refuted the monophyly of mites and ticks, but also thoroughly explored the divergence of Acariformes and the diversification of astigmatic mites. In monophyletic Acariformes, Prostigmata known as notorious plant pests first evolved, and then rapidly evolving Astigmata diverged from soil oribatid mites. Within astigmatic mites, a wide range of gene families rapidly expanded via tandem gene duplications, including ionotropic glutamate receptors, triacylglycerol lipases, serine proteases and UDP glucuronosyltransferases. Gene diversification after tandem duplications provides many genetic resources for adaptation to sensing environmental signals, digestion, and detoxification in rapidly changing household environments. Many gene decay events only occurred in the skin-burrowing parasitic mite Sarcoptes scabiei. Throughout the evolution of Acariformes, massive horizontal gene transfer events occurred in gene families such as UDP glucuronosyltransferases and several important fungal cell wall lytic enzymes, which enable detoxification and digestive functions and provide perfect drug targets for pest control. This comparative study sheds light on the divergent evolution and quick adaptation to human household environments of astigmatic mites and provides insights into the genetic adaptations and even control of human household pests.
Collapse
Affiliation(s)
- Qing Xiong
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong.,Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Hong Kong
| | - Angel Tsz-Yau Wan
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong.,Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Hong Kong
| | - Xiaoyu Liu
- Shenzhen Key Laboratory of Allergy and Immunology, School of Medicine, Shenzhen University, China
| | - Cathy Sin-Hang Fung
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong
| | - Xiaojun Xiao
- Shenzhen Key Laboratory of Allergy and Immunology, School of Medicine, Shenzhen University, China
| | - Nat Malainual
- Department of Parasitology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Jinpao Hou
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong.,Centre for Microbial Genomics and Proteomics, The Chinese University of Hong Kong, Hong Kong
| | - Lingyi Wang
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong
| | - Mingqiang Wang
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong.,Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Hong Kong
| | - Kevin Yi Yang
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong.,Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Hong Kong
| | - Yubao Cui
- Department of Clinical Laboratory, Wuxi People's Hospital Affiliated to Nanjing Medical University, Wuxi, China
| | - Elaine Lai-Han Leung
- Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau
| | - Wenyan Nong
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong
| | - Soo-Kyung Shin
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong
| | | | - Kyoung Yong Jeong
- Institute of Allergy, Department of Internal Medicine, College of Medicine, Yonsei University, Seoul, Korea
| | - Fook-Tim Chew
- Department of Biological Sciences, National University of Singapore, Singapore
| | - Jerome Ho-Lam Hui
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong
| | - Ting-Fan Leung
- Department of Paediatrics, The Chinese University of Hong Kong, Hong Kong
| | - Anchalee Tungtrongchitr
- Department of Parasitology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Nanshan Zhong
- State Key Laboratory of Respiratory Disease, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Zhigang Liu
- Shenzhen Key Laboratory of Allergy and Immunology, School of Medicine, Shenzhen University, China
| | - Stephen Kwok-Wing Tsui
- School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong.,Hong Kong Bioinformatics Centre, The Chinese University of Hong Kong, Hong Kong.,Centre for Microbial Genomics and Proteomics, The Chinese University of Hong Kong, Hong Kong
| |
Collapse
|
9
|
Porto DS, Dahdul WM, Lapp H, Balhoff JP, Vision TJ, Mabee PM, Uyeda J. Assessing Bayesian Phylogenetic Information Content of Morphological Data Using Knowledge from Anatomy Ontologies. Syst Biol 2022; 71:1290-1306. [PMID: 35285502 PMCID: PMC9558846 DOI: 10.1093/sysbio/syac022] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 02/09/2022] [Accepted: 03/05/2022] [Indexed: 11/18/2022] Open
Abstract
Morphology remains a primary source of phylogenetic information for many groups of organisms, and the only one for most fossil taxa. Organismal anatomy is not a collection of randomly assembled and independent “parts”, but instead a set of dependent and hierarchically nested entities resulting from ontogeny and phylogeny. How do we make sense of these dependent and at times redundant characters? One promising approach is using ontologies—structured controlled vocabularies that summarize knowledge about different properties of anatomical entities, including developmental and structural dependencies. Here, we assess whether evolutionary patterns can explain the proximity of ontology-annotated characters within an ontology. To do so, we measure phylogenetic information across characters and evaluate if it matches the hierarchical structure given by ontological knowledge—in much the same way as across-species diversity structure is given by phylogeny. We implement an approach to evaluate the Bayesian phylogenetic information (BPI) content and phylogenetic dissonance among ontology-annotated anatomical data subsets. We applied this to data sets representing two disparate animal groups: bees (Hexapoda: Hymenoptera: Apoidea, 209 chars) and characiform fishes (Actinopterygii: Ostariophysi: Characiformes, 463 chars). For bees, we find that BPI is not substantially explained by anatomy since dissonance is often high among morphologically related anatomical entities. For fishes, we find substantial information for two clusters of anatomical entities instantiating concepts from the jaws and branchial arch bones, but among-subset information decreases and dissonance increases substantially moving to higher-level subsets in the ontology. We further applied our approach to address particular evolutionary hypotheses with an example of morphological evolution in miniature fishes. While we show that phylogenetic information does match ontology structure for some anatomical entities, additional relationships and processes, such as convergence, likely play a substantial role in explaining BPI and dissonance, and merit future investigation. Our work demonstrates how complex morphological data sets can be interrogated with ontologies by allowing one to access how information is spread hierarchically across anatomical concepts, how congruent this information is, and what sorts of processes may play a role in explaining it: phylogeny, development, or convergence. [Apidae; Bayesian phylogenetic information; Ostariophysi; Phenoscape; phylogenetic dissonance; semantic similarity.]
Collapse
Affiliation(s)
- Diego S Porto
- Department of Biological Sciences, Virginia Polytechnic Institute and State University, 926 West Campus Drive, Blacksburg, VA 24061, USA
| | - Wasila M Dahdul
- UCI Libraries,University of California, Irvine, Irvine, CA 92623, USA
- Department of Biology, University of South Dakota, 414 East Clark Street, Vermillion, SD 57069, USA
| | - Hilmar Lapp
- Center for Genomic and Computational Biology, Duke University, 101 Science Drive, Durham, NC 27708, USA
| | - James P Balhoff
- Renaissance Computing Institute, University of North Carolina, 100 Europa Drive, Suite 540, Chapel Hill, NC 27517, USA
| | - Todd J Vision
- Department of Biology and School of Information and Library Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Paula M Mabee
- Department of Biology, University of South Dakota, 414 East Clark Street, Vermillion, SD 57069, USA
- Battelle, National Ecological Observatory Network, Boulder, CO 80301, USA
| | - Josef Uyeda
- Department of Biological Sciences, Virginia Polytechnic Institute and State University, 926 West Campus Drive, Blacksburg, VA 24061, USA
| |
Collapse
|
10
|
Abstract
Over the past three decades, computational capabilities have grown at such a rapid rate that they have given rise to many computationally heavy science fields such as phylogenomics. As increasingly more genomes are sequenced in the three domains of life, larger and more species-complete phylogenetic tree reconstructions are leading to a better understanding of the tree of life and the evolutionary histories in deep times. However, these large datasets pose unique challenges from a modeling and computational perspective: accurately describing the evolutionary process of thousands of species is still beyond the capability of current models, while the computational burden limits our ability to test multiple hypotheses. Thus, it is common practice to reduce the size of a dataset by selecting species to represent a clade (taxon sampling). Unfortunately, this process is subjective, and comparisons of large tree of life studies show that choice and number of species used in a dataset can alter the topology obtained. Thus, taxon sampling is, in itself, a process that needs to be fully investigated to determine its effect on phylogenetic stability. Here, we present the theory and practical application of an automated pipeline that can be easily implemented to explore the effect of taxon sampling on phylogenetic reconstructions. The application of this approach was recently discussed in a study of Terrabacteria and shows its power in investigating the accuracy of deep nodes of a phylogeny.
Collapse
Affiliation(s)
| | - Fabia Ursula Battistuzzi
- Department of Biological Sciences, Oakland University, Rochester, MI, USA.
- Center for Data Science and Big Data Analytics, Oakland University, Rochester, MI, USA.
| |
Collapse
|
11
|
Sun X, Cheng J. Phylogenetic Signal Dissection of Heterogeneous 28S and 16S rRNA Genes in Spinicaudata (Branchiopoda, Diplostraca). Genes (Basel) 2021; 12:1705. [PMID: 34828311 PMCID: PMC8625258 DOI: 10.3390/genes12111705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 10/20/2021] [Accepted: 10/26/2021] [Indexed: 12/05/2022] Open
Abstract
It is still a challenge to reconstruct the deep phylogenetic relationships within spinicaudatans, and there are several different competing hypotheses regarding the interrelationships among Eocyzicidae, Cyzicidae s. s., Leptestheriidae, and Limnadiidae of the Suborder Spinicaudata. In order to explore the source of the inconsistencies, we focus on the sequence variation and the structure model of two rRNA genes based on extensive taxa sampling. The comparative sequence analysis revealed heterogeneity across species and the existence of conserved motifs in all spinicaudatan species. The level of intraspecific heterogeneity differed among species, which suggested that some species might have undergone a relaxed concerted evolution with respect to the 28S rRNA gene. The Bayesian analyses were performed on nuclear (28S rRNA, EF1α) and mitochondrial (16S rRNA, COI) genes. Further, we investigated compositional heterogeneity between lineages and assessed the potential for phylogenetic noise compared to signal in the combined data set. Reducing the non-phylogenetic signals and application of optimal rRNA model recovered a topology congruent with inference from the transcriptome data, whereby Limnadiidae was placed as a sister group to Leptestheriidae + Eocyzicidae with high support (topology I). Tests of alternative hypotheses provided implicit support for four competing topologies, and topology I was the best.
Collapse
Affiliation(s)
| | - Jinhui Cheng
- State Key Laboratory of Palaeobiology and Stratigraphy, Nanjing Institute of Geology and Palaeontology and Center for Excellence in Life and Palaeoenvironment, Chinese Academy of Sciences, No. 39, Beijing Eastroad, Nanjing 210008, China;
| |
Collapse
|
12
|
Duchêne DA, Mather N, Van Der Wal C, Ho SYW. Excluding loci with substitution saturation improves inferences from phylogenomic data. Syst Biol 2021; 71:676-689. [PMID: 34508605 PMCID: PMC9016599 DOI: 10.1093/sysbio/syab075] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Accepted: 09/07/2021] [Indexed: 11/21/2022] Open
Abstract
The historical signal in nucleotide sequences becomes eroded over time by substitutions occurring repeatedly at the same sites. This phenomenon, known as substitution saturation, is recognized as one of the primary obstacles to deep-time phylogenetic inference using genome-scale data sets. We present a new test of substitution saturation and demonstrate its performance in simulated and empirical data. For some of the 36 empirical phylogenomic data sets that we examined, we detect substitution saturation in around 50% of loci. We found that saturation tends to be flagged as problematic in loci with highly discordant phylogenetic signals across sites. Within each data set, the loci with smaller numbers of informative sites are more likely to be flagged as containing problematic levels of saturation. The entropy saturation test proposed here is sensitive to high evolutionary rates relative to the evolutionary timeframe, while also being sensitive to several factors known to mislead phylogenetic inference, including short internal branches relative to external branches, short nucleotide sequences, and tree imbalance. Our study demonstrates that excluding loci with substitution saturation can be an effective means of mitigating the negative impact of multiple substitutions on phylogenetic inferences. [Phylogenetic model performance; phylogenomics; substitution model; substitution saturation; test statistics.]
Collapse
Affiliation(s)
- David A Duchêne
- Centre for Evolutionary Hologenomics, University of Copenhagen, 1352 Copenhagen, Denmark
| | - Niklas Mather
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| | - Cara Van Der Wal
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| | - Simon Y W Ho
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| |
Collapse
|
13
|
Mongiardino Koch N. Phylogenomic Subsampling and the Search for Phylogenetically Reliable Loci. Mol Biol Evol 2021; 38:4025-4038. [PMID: 33983409 DOI: 10.1101/2021.02.13.431075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/21/2023] Open
Abstract
Phylogenomic subsampling is a procedure by which small sets of loci are selected from large genome-scale data sets and used for phylogenetic inference. This step is often motivated by either computational limitations associated with the use of complex inference methods or as a means of testing the robustness of phylogenetic results by discarding loci that are deemed potentially misleading. Although many alternative methods of phylogenomic subsampling have been proposed, little effort has gone into comparing their behavior across different data sets. Here, I calculate multiple gene properties for a range of phylogenomic data sets spanning animal, fungal, and plant clades, uncovering a remarkable predictability in their patterns of covariance. I also show how these patterns provide a means for ordering loci by both their rate of evolution and their relative phylogenetic usefulness. This method of retrieving phylogenetically useful loci is found to be among the top performing when compared with alternative subsampling protocols. Relatively common approaches such as minimizing potential sources of systematic bias or increasing the clock-likeness of the data are found to fare worse than selecting loci at random. Likewise, the general utility of rate-based subsampling is found to be limited: loci evolving at both low and high rates are among the least effective, and even those evolving at optimal rates can still widely differ in usefulness. This study shows that many common subsampling approaches introduce unintended effects in off-target gene properties and proposes an alternative multivariate method that simultaneously optimizes phylogenetic signal while controlling for known sources of bias.
Collapse
|
14
|
Abstract
Phylogenomic subsampling is a procedure by which small sets of loci are selected from large genome-scale data sets and used for phylogenetic inference. This step is often motivated by either computational limitations associated with the use of complex inference methods or as a means of testing the robustness of phylogenetic results by discarding loci that are deemed potentially misleading. Although many alternative methods of phylogenomic subsampling have been proposed, little effort has gone into comparing their behavior across different data sets. Here, I calculate multiple gene properties for a range of phylogenomic data sets spanning animal, fungal, and plant clades, uncovering a remarkable predictability in their patterns of covariance. I also show how these patterns provide a means for ordering loci by both their rate of evolution and their relative phylogenetic usefulness. This method of retrieving phylogenetically useful loci is found to be among the top performing when compared with alternative subsampling protocols. Relatively common approaches such as minimizing potential sources of systematic bias or increasing the clock-likeness of the data are found to fare worse than selecting loci at random. Likewise, the general utility of rate-based subsampling is found to be limited: loci evolving at both low and high rates are among the least effective, and even those evolving at optimal rates can still widely differ in usefulness. This study shows that many common subsampling approaches introduce unintended effects in off-target gene properties and proposes an alternative multivariate method that simultaneously optimizes phylogenetic signal while controlling for known sources of bias.
Collapse
|
15
|
Vankan M, Ho SYW, Duchêne DA. Evolutionary Rate Variation Among Lineages in Gene Trees has a Negative Impact on Species-Tree Inference. Syst Biol 2021; 71:490-500. [PMID: 34255084 PMCID: PMC8830059 DOI: 10.1093/sysbio/syab051] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 06/18/2021] [Indexed: 11/12/2022] Open
Abstract
Phylogenetic analyses of genomic data provide a powerful means of reconstructing the evolutionary relationships among organisms, yet such analyses are often hindered by conflicting phylogenetic signals among loci. Identifying the signals that are most influential to species-tree estimation can help to inform the choice of data for phylogenomic analysis. We investigated this in an analysis of 30 phylogenomic data sets. For each data set, we examined the association between several branch-length characteristics of gene trees and the distance between these gene trees and the corresponding species trees. We found that the distance of each gene tree to the species tree inferred from the full data set was positively associated with variation in root-to-tip distances and negatively associated with mean branch support. However, no such associations were found for gene-tree length, a measure of the overall substitution rate at each locus. We further explored the usefulness of the best-performing branch-based characteristics for selecting loci for phylogenomic analyses. We found that loci that yield gene trees with high variation in root-to-tip distances have a disproportionately distant signal of tree topology compared with the complete data sets. These results suggest that rate variation across lineages should be taken into consideration when exploring and even selecting loci for phylogenomic analysis.[Branch support; data filtering; nucleotide substitution model; phylogenomics; substitution rate; summary coalescent methods.]
Collapse
Affiliation(s)
- Mezzalina Vankan
- School of Life and Environmental Sciences, University of Sydney, NSW 2006, Australia.,Research School of Biology, Australian National University, ACT 2601, Australia
| | - Simon Y W Ho
- School of Life and Environmental Sciences, University of Sydney, NSW 2006, Australia
| | - David A Duchêne
- Research School of Biology, Australian National University, ACT 2601, Australia.,Centre for Evolutionary Hologenomics, University of Copenhagen, Copenhagen 1352, Denmark
| |
Collapse
|
16
|
Thomas AE, Igea J, Meudt HM, Albach DC, Lee WG, Tanentzap AJ. Using target sequence capture to improve the phylogenetic resolution of a rapid radiation in New Zealand Veronica. AMERICAN JOURNAL OF BOTANY 2021; 108:1289-1306. [PMID: 34173225 DOI: 10.1002/ajb2.1678] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 03/10/2021] [Indexed: 05/08/2023]
Abstract
PREMISE Recent, rapid radiations present a challenge for phylogenetic reconstruction. Fast successive speciation events typically lead to low sequence divergence and poorly resolved relationships with standard phylogenetic markers. Target sequence capture of many independent nuclear loci has the potential to improve phylogenetic resolution for rapid radiations. METHODS Here we applied target sequence capture with 353 protein-coding genes (Angiosperms353 bait kit) to Veronica sect. Hebe (common name hebe) to determine its utility for improving the phylogenetic resolution of rapid radiations. Veronica section Hebe originated 5-10 million years ago in New Zealand, forming a monophyletic radiation of ca 130 extant species. RESULTS We obtained approximately 150 kbp of 353 protein-coding exons and an additional 200 kbp of flanking noncoding sequences for each of 77 hebe and two outgroup species. When comparing coding, noncoding, and combined data sets, we found that the latter provided the best overall phylogenetic resolution. While some deep nodes in the radiation remained unresolved, our phylogeny provided broad and often improved support for subclades identified by both morphology and standard markers in previous studies. Gene-tree discordance was nonetheless widespread, indicating that additional methods are needed to disentangle fully the history of the radiation. CONCLUSIONS Phylogenomic target capture data sets both increase phylogenetic signal and deliver new insights into the complex evolutionary history of rapid radiations as compared with traditional markers. Improving methods to resolve remaining discordance among loci from target sequence capture is now important to facilitate the further study of rapid radiations.
Collapse
Affiliation(s)
- Anne E Thomas
- Ecosystems and Global Change Group, Department of Plant Sciences, University of Cambridge, Cambridge, UK
| | - Javier Igea
- Ecosystems and Global Change Group, Department of Plant Sciences, University of Cambridge, Cambridge, UK
| | - Heidi M Meudt
- Museum of New Zealand Te Papa Tongarewa, Wellington, New Zealand
| | - Dirk C Albach
- Carl von Ossietzky-University, Oldenburg, D-26111, Germany
| | - William G Lee
- Manaaki Whenua - Landcare Research Otago, Dunedin, New Zealand
| | - Andrew J Tanentzap
- Ecosystems and Global Change Group, Department of Plant Sciences, University of Cambridge, Cambridge, UK
| |
Collapse
|
17
|
Shah T, Schneider JV, Zizka G, Maurin O, Baker W, Forest F, Brewer GE, Savolainen V, Darbyshire I, Larridon I. Joining forces in Ochnaceae phylogenomics: a tale of two targeted sequencing probe kits. AMERICAN JOURNAL OF BOTANY 2021; 108:1201-1216. [PMID: 34180046 DOI: 10.1002/ajb2.1682] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 02/23/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Both universal and family-specific targeted sequencing probe kits are becoming widely used for reconstruction of phylogenetic relationships in angiosperms. Within the pantropical Ochnaceae, we show that with careful data filtering, universal kits are equally as capable in resolving intergeneric relationships as custom probe kits. Furthermore, we show the strength in combining data from both kits to mitigate bias and provide a more robust result to resolve evolutionary relationships. METHODS We sampled 23 Ochnaceae genera and used targeted sequencing with two probe kits, the universal Angiosperms353 kit and a family-specific kit. We used maximum likelihood inference with a concatenated matrix of loci and multispecies-coalescence approaches to infer relationships in the family. We explored phylogenetic informativeness and the impact of missing data on resolution and tree support. RESULTS For the Angiosperms353 data set, the concatenation approach provided results more congruent with those of the Ochnaceae-specific data set. Filtering missing data was most impactful on the Angiosperms353 data set, with a relaxed threshold being the optimum scenario. The Ochnaceae-specific data set resolved consistent topologies using both inference methods, and no major improvements were obtained after data filtering. Merging of data obtained with the two kits resulted in a well-supported phylogenetic tree. CONCLUSIONS The Angiosperms353 data set improved upon data filtering, and missing data played an important role in phylogenetic reconstruction. The Angiosperms353 data set resolved the phylogenetic backbone of Ochnaceae as equally well as the family specific data set. All analyses indicated that both Sauvagesia L. and Campylospermum Tiegh. as currently circumscribed are polyphyletic and require revised delimitation.
Collapse
Affiliation(s)
- Toral Shah
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | - Julio V Schneider
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
| | - Georg Zizka
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
- Institute of Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Str. 13, Frankfurt am Main, 60438, Germany
| | - Olivier Maurin
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - William Baker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Grace E Brewer
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Vincent Savolainen
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | | | - Isabel Larridon
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Systematic and Evolutionary Botany Lab, Department of Biology, Ghent University, K.L., Ledeganckstraat 35, Gent, 9000, Belgium
| |
Collapse
|
18
|
Santaquiteria A, Siqueira AC, Duarte-Ribeiro E, Carnevale G, White W, Pogonoski J, Baldwin CC, Ortí G, Arcila D, Betancur RR. Phylogenomics and Historical Biogeography of Seahorses, Dragonets, Goatfishes, and Allies (Teleostei: Syngnatharia): Assessing Factors Driving Uncertainty in Biogeographic Inferences. Syst Biol 2021; 70:1145-1162. [PMID: 33892493 DOI: 10.1093/sysbio/syab028] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 04/19/2021] [Indexed: 11/14/2022] Open
Abstract
The charismatic trumpetfishes, goatfishes, dragonets, flying gurnards, seahorses, and pipefishes encompass a recently defined yet extraordinarily diverse clade of percomorph fishes-the series Syngnatharia. This group is widely distributed in tropical and warm-temperate regions, with a great proportion of its extant diversity occurring in the Indo-Pacific. Because most syngnatharians feature long-range dispersal capabilities, tracing their biogeographic origins is challenging. Here, we applied an integrative phylogenomic approach to elucidate the evolutionary biogeography of syngnatharians. We built upon a recently published phylogenomic study that examined ultraconserved elements by adding 62 species (total 169 species) and one family (Draconettidae), to cover ca. 25% of the species diversity and all 10 families in the group. We inferred a set of time-calibrated trees and conducted ancestral range estimations. We also examined the sensitivity of these analyses to phylogenetic uncertainty (estimated from multiple genomic subsets), area delimitation, and biogeographic models that include or exclude the jump-dispersal parameter (j). Of the three factors examined, we found that the j parameter has the strongest effect in ancestral range estimates, followed by number of areas defined, and tree topology and divergence times. After accounting for these uncertainties, our results reveal that syngnatharians originated in the ancient Tethys Sea ca. 87 Ma (84-94 Ma; Late Cretaceous) and subsequently occupied the Indo-Pacific. Throughout syngnatharian history, multiple independent lineages colonized the eastern Pacific (6-8 times) and the Atlantic (6-14 times) from their center of origin, with most events taking place following an east-to-west route prior to the closure of the Tethys Seaway ca. 12-18 Ma. Ultimately, our study highlights the importance of accounting for different factors generating uncertainty in macroevolutionary and biogeographic inferences.
Collapse
Affiliation(s)
- Aintzane Santaquiteria
- Department of Biology, The University of Oklahoma, 730 Van Vleet Oval, Norman, OK 73019, USA
| | - Alexandre C Siqueira
- Research Hub for Coral Reef Ecosystem Functions, College of Science and Engineering, James Cook University, Townsville, QLD 4811, Australia
| | - Emanuell Duarte-Ribeiro
- Department of Biology, The University of Oklahoma, 730 Van Vleet Oval, Norman, OK 73019, USA
| | - Giorgio Carnevale
- Dipartimento di Scienze della Terra, Università degli Studi di Torino, via Valperga Caluso 35, 10125, Torino, Italy
| | - William White
- CSIRO Australian National Fish Collection, National Research Collections of Australia, Hobart, TAS, Australia
| | - John Pogonoski
- CSIRO Australian National Fish Collection, National Research Collections of Australia, Hobart, TAS, Australia
| | - Carole C Baldwin
- Department of Vertebrate Zoology, Smithsonian National Museum of Natural History, 10th St. & Constitution Ave. NW, Washington, DC 20560, USA
| | - Guillermo Ortí
- Department of Vertebrate Zoology, Smithsonian National Museum of Natural History, 10th St. & Constitution Ave. NW, Washington, DC 20560, USA.,Department of Biological Sciences, George Washington University, 2029 G St. NW, Washington, DC 20052, USA
| | - Dahiana Arcila
- Department of Biology, The University of Oklahoma, 730 Van Vleet Oval, Norman, OK 73019, USA.,Sam Noble Oklahoma Museum of Natural History, 2401 Chautauqua Ave, Norman, OK 73072, USA
| | - Ricardo R Betancur
- Department of Biology, The University of Oklahoma, 730 Van Vleet Oval, Norman, OK 73019, USA
| |
Collapse
|
19
|
Arcila D, Hughes LC, Meléndez-Vazquez F, Baldwin CC, White W, Carpenter K, Williams JT, Santos MD, Pogonoski J, Miya M, Ortí G, Betancur-R R. Testing the utility of alternative metrics of branch support to address the ancient evolutionary radiation of tunas, stromateoids, and allies (Teleostei: Pelagiaria). Syst Biol 2021; 70:1123-1144. [PMID: 33783539 DOI: 10.1093/sysbio/syab018] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 03/13/2021] [Indexed: 12/19/2022] Open
Abstract
The use of high-throughput sequencing technologies to produce genome-scale datasets was expected to settle some long-standing controversies across the Tree of Life, particularly in areas where short branches occur at deep timescales. Instead, these datasets have often yielded many well-supported but conflicting topologies, and highly variable gene-tree distributions. A variety of branch-support metrics beyond the nonparametric bootstrap are now available to assess how robust a phylogenetic hypothesis may be, as well as new methods to quantify gene-tree discordance. We applied multiple branch support metrics to an ancient group of marine fishes (Teleostei: Pelagiaria) whose interfamilial relationships have proven difficult to resolve due to a rapid accumulation of lineages very early in its history. We analyzed hundreds of loci including published UCE data and newly generated exonic data along with their flanking regions to represent all 16 extant families for more than 150 out of 284 valid species in the group. Branch support was lower for interfamilial relationships (except the SH-like aLRT and aBayes methods) regardless of the type of marker used. Several nodes that were highly supported with bootstrap had very low site and gene-tree concordance, revealing underlying conflict. Despite this conflict, we were able to identify four consistent interfamilial clades, each comprised of two or three families. Combining exons with their flanking regions also produced increased branch lengths in the deep branches of the pelagiarian tree. Our results demonstrate the limitations of employing current metrics of branch support and species-tree estimation when assessing the confidence of ancient evolutionary radiations and emphasize the necessity to embrace alternative measurements to explore phylogenetic uncertainty and discordance in phylogenomic datasets.
Collapse
Affiliation(s)
- Dahiana Arcila
- Department of Ichthyology, Sam Noble Oklahoma Museum of Natural History, Norman, Oklahoma, U.S.A.,Department of Biology, University of Oklahoma, Norman, Oklahoma, U.S.A
| | - Lily C Hughes
- Department of Biological Sciences, The George Washington University, Washington, District of Columbia, U.S.A.,Department of Organismal Biology and Anatomy, The University of Chicago, Illinois, Chicago, U.S.A.,Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | - Fernando Meléndez-Vazquez
- Department of Ichthyology, Sam Noble Oklahoma Museum of Natural History, Norman, Oklahoma, U.S.A.,Department of Biology, University of Oklahoma, Norman, Oklahoma, U.S.A
| | - Carole C Baldwin
- Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | - William White
- CSIRO Australian National Fish Collection, National Research Collections Australia, Hobart, Hobart, Tasmania, Australia
| | - Kent Carpenter
- Department of Biological Sciences, Old Dominion University, Norfolk, Virginia, U.S.A
| | - Jeffrey T Williams
- Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | | | - John Pogonoski
- CSIRO Australian National Fish Collection, National Research Collections Australia, Hobart, Hobart, Tasmania, Australia
| | - Masaki Miya
- Natural History Museum and Institute, Chiba, Aoba-cho, Chuo-ku, Chiba, Japan
| | - Guillermo Ortí
- Department of Biological Sciences, The George Washington University, Washington, District of Columbia, U.S.A.,Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | | |
Collapse
|
20
|
Takezaki N. Resolving the Early Divergence Pattern of Teleost Fish Using Genome-Scale Data. Genome Biol Evol 2021; 13:6178791. [PMID: 33739405 PMCID: PMC8103497 DOI: 10.1093/gbe/evab052] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/10/2021] [Indexed: 12/13/2022] Open
Abstract
Regarding the phylogenetic relationship of the three primary groups of teleost fishes, Osteoglossomorpha (bonytongues and others), Elopomorpha (eels and relatives), Clupeocephala (the remaining teleost fish), early morphological studies hypothesized the first divergence of Osteoglossomorpha, whereas the recent prevailing view is the first divergence of Elopomorpha. Molecular studies supported all the possible relationships of the three primary groups. This study analyzed genome-scale data from four previous studies: 1) 412 genes from 12 species, 2) 772 genes from 15 species, 3) 1,062 genes from 30 species, and 4) 491 UCE loci from 27 species. The effects of the species, loci, and models used on the constructed tree topologies were investigated. In the analyses of the data sets (1)–(3), although the first divergence of Clupeocephala that left the other two groups in a sister relationship was supported by concatenated sequences and gene trees of all the species and genes, the first divergence of Elopomorpha among the three groups was supported using species and/or genes with low divergence of sequence and amino-acid frequencies. This result corresponded to that of the UCE data set (4), whose sequence divergence was low, which supported the first divergence of Elopomorpha with high statistical significance. The increase in accuracy of the phylogenetic construction by using species and genes with low sequence divergence was predicted by a phylogenetic informativeness approach and confirmed by computer simulation. These results supported that Elopomorpha was the first basal group of teleost fish to have diverged, consistent with the prevailing view of recent morphological studies.
Collapse
Affiliation(s)
- Naoko Takezaki
- Life Science Research Center, Kagawa University, Mikicho, Kitagun, Kagawa, Japan
| |
Collapse
|
21
|
Ballesteros JA, Hormiga G. Molecular phylogeny of the orb-weaving spider genus Leucauge and the intergeneric relationships of Leucauginae (Araneae, Tetragnathidae). INVERTEBR SYST 2021. [DOI: 10.1071/is21029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The tetragnathid genus Leucauge includes some of the most common orb-weaving spiders in the tropics. Although some species in this genus have attained relevance as model systems for several aspects of spider biology, our understanding of the generic diversity and evolutionary relationships among the species is poor. In this study we present the first attempt to determine the phylogenetic structure within Leucauge and the relationship of this genus with other genera of Leucauginae. This is based on DNA sequences from the five loci commonly used and Histone H4, used for the first time in spider phylogenetics. We also assess the informativeness of the standard markers and test for base composition biases in the dataset. Our results suggest that Leucauge is not monophyletic since species of the genera Opas, Opadometa, Mecynometa and Alcimosphenus are included within the current circumscription of the genus. Based on a phylogenetic re-circumscription of the genus to fulfil the requirement for monophyly of taxa, Leucauge White, 1841 is deemed to be a senior synonym of the genera Opas Pickard-Cambridge, 1896 revalidated synonymy, Mecynometa Simon, 1894 revalidated synonymy, Opadometa Archer, 1951 new synonymy and Alcimosphenus Simon, 1895 new synonymy. We identify groups of taxa critical for resolving relationships within Leucauginae and describe the limitations of the standard loci for accomplishing these resolutions.
Collapse
|
22
|
Lv X, Hu J, Hu Y, Li Y, Xu D, Ryder OA, Irwin DM, Yu L. Diverse phylogenomic datasets uncover a concordant scenario of laurasiatherian interordinal relationships. Mol Phylogenet Evol 2020; 157:107065. [PMID: 33387649 DOI: 10.1016/j.ympev.2020.107065] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 12/22/2020] [Accepted: 12/24/2020] [Indexed: 10/22/2022]
Abstract
Resolving the interordinal relationships in the mammalian superorder Laurasiatheria has been among the most intractable problems in higher-level mammalian systematics, with many conflicting hypotheses having been proposed. The present study collected three different sources of genome-scale data with comprehensive taxon sampling of laurasiatherian species, including two protein-coding datasets (4,186 protein-coding genes for an amino acid dataset comprising 2,761,247 amino acid residues and a nucleotide dataset comprising 5,516,340 nucleotides from 1st and 2nd codon positions), an intronic dataset (1,210 introns comprising 1,162,723 nucleotides) and an ultraconserved elements (UCEs) dataset (1,246 UCEs comprising 1,946,472 nucleotides) from 40 species representing all six laurasiatherian orders and 7 non-laurasiatherian outgroups. Remarkably, phylogenetic trees reconstructed with the four datasets using different tree-building methods (RAxML, FastTree, ASTRAL and MP-EST) all supported the relationship (Eulipotyphla, (Chiroptera, ((Carnivora, Pholidota), (Cetartiodactyla, Perissodactyla)))). We find a resolution of interordinal relationships of Laurasiatheria among all types of markers used in the present study, and the likelihood ratio tests for tree comparisons confirmed that the present tree topology is the optimal hypothesis compared to other examined hypotheses. Jackknifing subsampling analyses demonstrate that the results of laurasiatherian tree reconstruction varied with the number of loci and ordinal representatives used, which are likely the two main contributors to phylogenetic disagreements of Laurasiatheria seen in previous studies. Our study provides significant insight into laurasiatherian evolution, and moreover, an important methodological strategy and reference for resolving phylogenies of adaptive radiation, which have been a long-standing challenge in the field of phylogenetics.
Collapse
Affiliation(s)
- Xue Lv
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Jingyang Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Yiwen Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Yitian Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Dongming Xu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming, China
| | - Oliver A Ryder
- Institute for Conservation Research, San Diego Zoo Global, Escondido, CA, USA
| | - David M Irwin
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.
| |
Collapse
|
23
|
Unraveling the Global Phylodynamic and Phylogeographic Expansion of Mycoplasma gallisepticum: Understanding the Origin and Expansion of This Pathogen in Ecuador. Pathogens 2020; 9:pathogens9090674. [PMID: 32825097 PMCID: PMC7557814 DOI: 10.3390/pathogens9090674] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2020] [Revised: 07/31/2020] [Accepted: 08/18/2020] [Indexed: 12/17/2022] Open
Abstract
Mycoplasma gallisepticum (MG) is among the most significant problems in the poultry industry worldwide, representing a serious threat to international trade. Despite the fact that the mgc2 gene has been widely used for diagnostic and molecular characterization purposes, there is a lack of evidence supporting the reliability of this gene as a marker for molecular epidemiology approaches. Therefore, the current study aimed to assess the accuracy of the mgc2 gene for phylogenetic, phylodynamic, and phylogeographic evaluations. Furthermore, the global phylodynamic expansion of MG is described, and the origin and extension of the outbreak caused by MG in Ecuador were tracked and characterized. The results obtained strongly supported the use of the mgc2 gene as a reliable phylogenetic marker and accurate estimator for the temporal and phylogeographic structure reconstruction of MG. The phylodynamic analysis denoted the failures in the current policies to control MG and highlighted the imperative need to implement more sensitive methodologies of diagnosis and more efficient vaccines. Framed in Ecuador, the present study provides the first piece of evidence of the circulation of virulent field MG strains in Ecuadorian commercial poultry. The findings derived from the current study provide novel and significant insights into the origin, diversification, and evolutionary process of MG globally.
Collapse
|
24
|
Esquerré D, Donnellan S, Brennan IG, Lemmon AR, Moriarty Lemmon E, Zaher H, Grazziotin FG, Keogh JS. Phylogenomics, Biogeography, and Morphometrics Reveal Rapid Phenotypic Evolution in Pythons After Crossing Wallace’s Line. Syst Biol 2020; 69:1039-1051. [DOI: 10.1093/sysbio/syaa024] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Revised: 03/02/2020] [Accepted: 03/17/2020] [Indexed: 11/13/2022] Open
Abstract
Abstract
Ecological opportunities can be provided to organisms that cross stringent biogeographic barriers towards environments with new ecological niches. Wallace’s and Lyddeker’s lines are arguably the most famous biogeographic barriers, separating the Asian and Australo-Papuan biotas. One of the most ecomorphologically diverse groups of reptiles, the pythons, is distributed across these lines, and are remarkably more diverse in phenotype and ecology east of Lydekker’s line in Australo-Papua. We used an anchored hybrid enrichment approach, with near complete taxon sampling, to extract mitochondrial genomes and 376 nuclear loci to resolve and date their phylogenetic history. Biogeographic reconstruction demonstrates that they originated in Asia around 38-45 Ma and then invaded Australo-Papua around 23 Ma. Australo-Papuan pythons display a sizeable expansion in morphological space, with shifts towards numerous new adaptive optima in head and body shape, coupled with the evolution of new micro-habitat preferences. We provide an updated taxonomy of pythons and our study also demonstrates how ecological opportunity following colonization of novel environments can promote morphological diversification in a formerly ecomorphologically conservative group. [Adaptive radiation; anchored hybrid enrichment; biogeography; morphometrics; snakes.]
Collapse
Affiliation(s)
- Damien Esquerré
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT 0200, Australia
| | - Stephen Donnellan
- School of Biological Sciences, The University of Adelaide, Adelaide, SA 5005, Australia
- South Australian Museum, North Terrace, Adelaide SA 5000 Australia
| | - Ian G Brennan
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT 0200, Australia
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, 400 Dirac Science Library Tallahassee, FL 32306-4120, United States
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, 319 Stadium Drive, PO Box 3064295, Tallahassee, FL 32306-4295, United States
| | - Hussam Zaher
- Museu de Zoologia, Universidade de São Paulo, Avenida Nazaré 481, CEP 04263-000, São Paulo, SP, Brazil
- CR2P – Centre de Recherche em Paléontologie – MNHN – Sorbonne Université – CNRS, 8 rue Buffon, CP 38, 75005 Paris, France
| | - Felipe G Grazziotin
- Laboratório de Coleções Zoológicas, Instituto Butantan, São Paulo, SP, Brazil
| | - J Scott Keogh
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT 0200, Australia
| |
Collapse
|
25
|
Karin BR, Gamble T, Jackman TR. Optimizing Phylogenomics with Rapidly Evolving Long Exons: Comparison with Anchored Hybrid Enrichment and Ultraconserved Elements. Mol Biol Evol 2020; 37:904-922. [PMID: 31710677 PMCID: PMC7038749 DOI: 10.1093/molbev/msz263] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Marker selection has emerged as an important component of phylogenomic study design due to rising concerns of the effects of gene tree estimation error, model misspecification, and data-type differences. Researchers must balance various trade-offs associated with locus length and evolutionary rate among other factors. The most commonly used reduced representation data sets for phylogenomics are ultraconserved elements (UCEs) and Anchored Hybrid Enrichment (AHE). Here, we introduce Rapidly Evolving Long Exon Capture (RELEC), a new set of loci that targets single exons that are both rapidly evolving (evolutionary rate faster than RAG1) and relatively long in length (>1,500 bp), while at the same time avoiding paralogy issues across amniotes. We compare the RELEC data set to UCEs and AHE in squamate reptiles by aligning and analyzing orthologous sequences from 17 squamate genomes, composed of 10 snakes and 7 lizards. The RELEC data set (179 loci) outperforms AHE and UCEs by maximizing per-locus genetic variation while maintaining presence and orthology across a range of evolutionary scales. RELEC markers show higher phylogenetic informativeness than UCE and AHE loci, and RELEC gene trees show greater similarity to the species tree than AHE or UCE gene trees. Furthermore, with fewer loci, RELEC remains computationally tractable for full Bayesian coalescent species tree analyses. We contrast RELEC to and discuss important aspects of comparable methods, and demonstrate how RELEC may be the most effective set of loci for resolving difficult nodes and rapid radiations. We provide several resources for capturing or extracting RELEC loci from other amniote groups.
Collapse
Affiliation(s)
- Benjamin R Karin
- Department of Biology, Villanova University, Villanova, PA
- Museum of Vertebrate Zoology and Department of Integrative Biology, University of California, Berkeley, CA
| | - Tony Gamble
- Department of Biological Sciences, Marquette University, Milwaukee, WI
- Milwaukee Public Museum, Milwaukee, WI
- Bell Museum of Natural History, University of Minnesota, St. Paul, MN
| | - Todd R Jackman
- Department of Biology, Villanova University, Villanova, PA
| |
Collapse
|
26
|
Bagley JC, Uribe-Convers S, Carlsen MM, Muchhala N. Utility of targeted sequence capture for phylogenomics in rapid, recent angiosperm radiations: Neotropical Burmeistera bellflowers as a case study. Mol Phylogenet Evol 2020; 152:106769. [PMID: 32081762 DOI: 10.1016/j.ympev.2020.106769] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Revised: 02/10/2020] [Accepted: 02/12/2020] [Indexed: 02/06/2023]
Abstract
Targeted sequence capture is a promising approach for large-scale phylogenomics. However, rapid evolutionary radiations pose significant challenges for phylogenetic inference (e.g. incomplete lineages sorting (ILS), phylogenetic noise), and the ability of targeted nuclear loci to resolve species trees despite such issues remains poorly studied. We test the utility of targeted sequence capture for inferring phylogenetic relationships in rapid, recent angiosperm radiations, focusing on Burmeistera bellflowers (Campanulaceae), which diversified into ~130 species over less than 3 million years. We compared phylogenies estimated from supercontig (exons plus flanking sequences), exon-only, and flanking-only datasets with 506-546 loci (~4.7 million bases) for 46 Burmeistera species/lineages and 10 outgroup taxa. Nuclear loci resolved backbone nodes and many congruent internal relationships with high support in concatenation and coalescent-based species tree analyses, and inferences were largely robust to effects of missing taxa and base composition biases. Nevertheless, species trees were incongruent between datasets, and gene trees exhibited remarkably high levels of conflict (~4-60% congruence, ~40-99% conflict) not simply driven by poor gene tree resolution. Higher gene tree heterogeneity at shorter branches suggests an important role of ILS, as expected for rapid radiations. Phylogenetic informativeness analyses also suggest this incongruence has resulted from low resolving power at short internal branches, consistent with ILS, and homoplasy at deeper nodes, with exons exhibiting much greater risk of incorrect topologies due to homoplasy than other datasets. Our findings suggest that targeted sequence capture is feasible for resolving rapid, recent angiosperm radiations, and that results based on supercontig alignments containing nuclear exons and flanking sequences have higher phylogenetic utility and accuracy than either alone. We use our results to make practical recommendations for future target capture-based studies of Burmeistera and other rapid angiosperm radiations, including that such studies should analyze supercontigs to maximize the phylogenetic information content of loci.
Collapse
Affiliation(s)
- Justin C Bagley
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA; Department of Biology, Virginia Commonwealth University, Richmond, VA 23284, USA.
| | - Simon Uribe-Convers
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA
| | - Mónica M Carlsen
- Research Department, Science and Conservation Division, Missouri Botanical Garden, St. Louis, MO 63110, USA
| | - Nathan Muchhala
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA
| |
Collapse
|
27
|
Phillips AJ, Dornburg A, Zapfe KL, Anderson FE, James SW, Erséus C, Moriarty Lemmon E, Lemmon AR, Williams BW. Phylogenomic Analysis of a Putative Missing Link Sparks Reinterpretation of Leech Evolution. Genome Biol Evol 2020; 11:3082-3093. [PMID: 31214691 PMCID: PMC6598468 DOI: 10.1093/gbe/evz120] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/12/2019] [Indexed: 12/17/2022] Open
Abstract
Leeches (Hirudinida) comprise a charismatic, yet often maligned group of worms. Despite their ecological, economic, and medical importance, a general consensus on the phylogenetic relationships of major hirudinidan lineages is lacking. This absence of a consistent, robust phylogeny of early-diverging lineages has hindered our understanding of the underlying processes that enabled evolutionary diversification of this clade. Here, we used an anchored hybrid enrichment-based phylogenomic approach, capturing hundreds of loci to investigate phylogenetic relationships among major hirudinidan lineages and their closest living relatives. We recovered Branchiobdellida as sister to a clade that includes all major lineages of hirudinidans and Acanthobdella, casting doubt on the utility of Acanthobdella as a “missing link” between hirudinidans and the clitellate group formerly known as Oligochaeta. Further, our results corroborate the reciprocal monophyly of jawed and proboscis-bearing leeches. Our phylogenomic resolution of early-diverging leeches provides a useful framework for illuminating the evolution of key adaptations and host–symbiont associations that have allowed leeches to colonize a wide diversity of habitats worldwide.
Collapse
Affiliation(s)
- Anna J Phillips
- Department of Invertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Research Laboratory, Raleigh, North Carolina
| | - Katerina L Zapfe
- North Carolina Museum of Natural Sciences, Research Laboratory, Raleigh, North Carolina.,Department of Biological Sciences, Clemson University
| | | | | | - Christer Erséus
- Department of Biological and Environmental Sciences, University of Gothenburg, Sweden
| | | | - Alan R Lemmon
- Department of Scientific Computing, Florida State University
| | - Bronwyn W Williams
- North Carolina Museum of Natural Sciences, Research Laboratory, Raleigh, North Carolina
| |
Collapse
|
28
|
Phylogenetic informativeness analyses to clarify past diversification processes in Cucurbitaceae. Sci Rep 2020; 10:488. [PMID: 31949198 PMCID: PMC6965171 DOI: 10.1038/s41598-019-57249-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 12/20/2019] [Indexed: 01/12/2023] Open
Abstract
Phylogenomic studies have so far mostly relied on genome skimming or target sequence capture, which suffer from representation bias and can fail to resolve relationships even with hundreds of loci. Here, we explored the potential of phylogenetic informativeness and tree confidence analyses to interpret phylogenomic datasets. We studied Cucurbitaceae because their small genome size allows cost-efficient genome skimming, and many relationships in the family remain controversial, preventing inferences on the evolution of characters such as sexual system or floral morphology. Genome skimming and PCR allowed us to retrieve the plastome, 57 single copy nuclear genes, and the nuclear ribosomal ITS from 29 species representing all but one tribe of Cucurbitaceae. Node support analyses revealed few inter-locus conflicts but a pervasive lack of phylogenetic signal among plastid loci, suggesting a fast divergence of Cucurbitaceae tribes. Data filtering based on phylogenetic informativeness and risk of homoplasy clarified tribe-level relationships, which support two independent evolutions of fringed petals in the family. Our study illustrates how formal analysis of phylogenomic data can increase our understanding of past diversification processes. Our data and results will facilitate the design of well-sampled phylogenomic studies in Cucurbitaceae and related families.
Collapse
|
29
|
Wei R, Zhang XC. Phylogeny of Diplazium (Athyriaceae) revisited: Resolving the backbone relationships based on plastid genomes and phylogenetic tree space analysis. Mol Phylogenet Evol 2019; 143:106699. [PMID: 31809851 DOI: 10.1016/j.ympev.2019.106699] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Revised: 12/01/2019] [Accepted: 12/01/2019] [Indexed: 11/17/2022]
Abstract
Despite progress in resolving the phylogeny of twinsorus ferns (Diplazium) based on multilocus phylogenetic studies, uncertainty remains especially for deep, or backbone relationships among closely related clades, suggesting a classic case of rapid evolutionary radiation. Here, we investigated the deep phylogenetic relationships within Diplazium by sampling all major clades and using 51 plastid genomes (plastomes), of which 38 were newly sequenced with high-throughput sequencing technology, resulting more than 127,000 informative sites. Using parsimony, maximum likelihood and Bayesian analyses of plastome sequences, we largely resolved the backbone of the phylogeny of Diplazium with strong support. However, we also detected phylogenetic incongruence among different datasets and moderately to poorly supported relationships, particularly at several extremely short internal branches. By using phylogenetic tree space and topology-clustering analyses, we provide evidence that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes, which may underlie the difficulty of systematics of Diplazium. Furthermore, our phylogenetic estimate offers more resolution over previous multilocus analyses, providing a framework for future taxonomic revisions of sectional classification of Diplazium. Our study demonstrates the advantage of a character-rich plastome dataset, combining the comparison of different phylogenetic methods, for resolving the recalcitrant lineages that have undergone rapid radiation and dramatic changes in evolutionary rates.
Collapse
Affiliation(s)
- Ran Wei
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, The Chinese Academy of Sciences, Beijing 100093, China
| | - Xian-Chun Zhang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, The Chinese Academy of Sciences, Beijing 100093, China.
| |
Collapse
|
30
|
Lamarca AP, Schrago CG. Fast speciations and slow genes: uncovering the root of living canids. Biol J Linn Soc Lond 2019. [DOI: 10.1093/biolinnean/blz181] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Abstract
Despite ongoing efforts relying on computationally intensive tree-building methods and large datasets, the deeper phylogenetic relationships between living canid genera remain controversial. We demonstrate that this issue arises fundamentally from the uncertainty of root placement as a consequence of the short length of the branch connecting the major canid clades, which probably resulted from a fast radiation during the early diversification of extant Canidae. Using both nuclear and mitochondrial genes, we investigate the position of the canid root and its consistency by using three rooting methods. We find that mitochondrial genomes consistently retrieve a root node separating the tribe Canini from the remaining canids, whereas nuclear data mostly recover a root that places the Urocyon foxes as the sister lineage of living canids. We demonstrate that, to resolve the canid root, the nuclear segments sequenced so far are significantly less informative than mitochondrial genomes. We also propose that short intervals between speciations obscure the place of the true root, because methods are susceptible to stochastic error in the presence of short internal branches near the root.
Collapse
Affiliation(s)
- Alessandra P Lamarca
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Carlos G Schrago
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| |
Collapse
|
31
|
Chen X, Dong Z, Liu G, He J, Zhao R, Wang W, Peng Y, Li X. Phylogenetic analysis provides insights into the evolution of Asian fireflies and adult bioluminescence. Mol Phylogenet Evol 2019; 140:106600. [PMID: 31445200 DOI: 10.1016/j.ympev.2019.106600] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Revised: 08/09/2019] [Accepted: 08/20/2019] [Indexed: 02/04/2023]
Abstract
Fireflies are one of the best-known examples of luminescent organisms. The limited geographic distribution and rarity of some firefly genera have hindered molecular phylogenetic analysis, resulting in uncertainty in regard to firefly phylogeny. Here, using genome skimming next-generation sequencing, we sequenced 23 Asian firefly species from 15 genera (Lampyridae: 14; Rhagophthalmidae: one) and assembled their mitochondrial genomes (mitogenomes) and nuclear ribosomal DNA (rDNA) repeat unit. The mitogenomes (including 15 mitochondrial genes: COX1-3, ATP6&8, ND1-6&4L, CYTB, 12S, and 16S) were recovered for almost all 23 species; furthermore, three regions of the nuclear rDNA repeat unit (18S, 28S, and 5.8S) were recovered for 22 out of the 23 species. The mitogenomes of 11 genera and 22 species as well as the complete rDNA from 22 species are reported here for the first time. Combined with previously published sequences of mitochondrial and rDNA coding regions, 166 species (170 populations with four overlapping in Lampyridae) were included in the current analyses. We selected different species groups and coding regions to infer phylogenies, and then employed tree certainty (TC) and internode certainty (IC) to quantify any phylogenetic incongruence. Phylogenetic analysis of 18 coding regions (15 mitochondrial genes and three regions of the nuclear rDNA repeat unit) from different species groups showed that the 144-species selection group (excluding 22 species outside Lampyridae) had relatively high TC (101.39). Further phylogenetic analysis of the 144 species using different coding regions indicated that the phylogeny of the 13 coding regions (10 mitochondrial genes: COX1-2, ATP6&8, ND1, ND4-5, CYTB, 12S and 16S; three rDNA regions: 18S, 5.8S, and 28S) demonstrated higher TC (103.02) than the phylogenies based on the 18 coding regions (TC = 101.39), conserved-regions (c-regions, i.e., 12S, 16S, COX1, 18S, and 28S) (TC = 95.11), or conserved-sites (c-sites, TC = 92.31) for the mitochondrial genes. In contrast, the c-sites strengthened the deeper nodes of the 144-species phylogeny compared to the c-regions. All of the 144-species phylogenies using different coding regions (except the c-regions) consistently recovered the monophyly of each of the three luminous families and their combination (Lampyridae, Rhagophthalmidae, and Phengodidae) with high IC support. Our phylogenetic analyses clarified the position of firefly genera Lamprigera, Vesta, Stenocladius, Pyrocoelia, Diaphanes, Abscondita, Pygoluciola, Emeia, Pristolycus, and Menghuoius. We also inferred the evolutionary pattern of adult bioluminescence in Lampyridae based on the phylogenies of 166 and 144 species. Our data suggest that the common ancestor of Lampyridae possessed adult bioluminescence, with a higher loss rate than gain rate of bioluminescence during its lineage evolution. Our results provide insight into Asian firefly phylogeny, and also enrich mitogenome and rDNA data resources for further study.
Collapse
Affiliation(s)
- Xing Chen
- CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan 666303, China
| | - Zhiwei Dong
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Guichun Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China; Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China
| | - Jinwu He
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China; Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China
| | - Ruoping Zhao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Wen Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China; Center for Excellence in Animal Evolution and Genetics, Kunming, Yunnan 650223, China; Center for Ecological and Environmental Sciences, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China.
| | - Yanqiong Peng
- CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan 666303, China.
| | - Xueyan Li
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.
| |
Collapse
|
32
|
Parker E, Dornburg A, Domínguez-Domínguez O, Piller KR. Assessing phylogenetic information to reveal uncertainty in historical data: An example using Goodeinae (Teleostei: Cyprinodontiformes: Goodeidae). Mol Phylogenet Evol 2019; 134:282-290. [DOI: 10.1016/j.ympev.2019.01.025] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Revised: 01/17/2019] [Accepted: 01/30/2019] [Indexed: 01/18/2023]
|
33
|
Digging for the spiny rat and hutia phylogeny using a gene capture approach, with the description of a new mammal subfamily. Mol Phylogenet Evol 2019; 136:241-253. [PMID: 30885830 DOI: 10.1016/j.ympev.2019.03.007] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 03/13/2019] [Accepted: 03/14/2019] [Indexed: 02/07/2023]
Abstract
Next generation sequencing (NGS) and genomic database mining allow biologists to gather and select large molecular datasets well suited to address phylogenomics and molecular evolution questions. Here we applied this approach to a mammal family, the Echimyidae, for which generic relationships have been difficult to recover and often referred to as a star phylogeny. These South-American spiny rats represent a family of caviomorph rodents exhibiting a striking diversity of species and life history traits. Using a NGS exon capture protocol, we isolated and sequenced ca. 500 nuclear DNA exons for 35 species belonging to all major echimyid and capromyid clades. Exons were carefully selected to encompass as much diversity as possible in terms of rate of evolution, heterogeneity in the distribution of site-variation and nucleotide composition. Supermatrix inferences and coalescence-based approaches were subsequently applied to infer this family's phylogeny. The inferred topologies were the same for both approaches, and support was maximal for each node, entirely resolving the ambiguous relationships of previous analyses. Fast-evolving nuclear exons tended to yield more reliable phylogenies, as slower-evolving sequences were not informative enough to disentangle the short branches of the Echimyidae radiation. Based on this resolved phylogeny and on molecular and morphological evidence, we confirm the rank of the Caribbean hutias - formerly placed in the Capromyidae family - as Capromyinae, a clade nested within Echimyidae. We also name and define Carterodontinae, a new subfamily of Echimyidae, comprising the extant monotypic genus Carterodon from Brazil, which is the closest living relative of West Indies Capromyinae.
Collapse
|
34
|
Dornburg A, Su Z, Townsend JP. Optimal Rates for Phylogenetic Inference and Experimental Design in the Era of Genome-Scale Data Sets. Syst Biol 2018; 68:145-156. [PMID: 29939341 DOI: 10.1093/sysbio/syy047] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Accepted: 06/13/2018] [Indexed: 02/02/2023] Open
Abstract
With the rise of genome-scale data sets, there has been a call for increased data scrutiny and careful selection of loci that are appropriate to use in an attempt to resolve a phylogenetic problem. Such loci should maximize phylogenetic information content while minimizing the risk of homoplasy. Theory posits the existence of characters that evolve at an optimum rate, and efforts to determine optimal rates of inference have been a cornerstone of phylogenetic experimental design for over two decades. However, both theoretical and empirical investigations of optimal rates have varied dramatically in their conclusions: spanning no relationship to a tight relationship between the rate of change and phylogenetic utility. Herein, we synthesize these apparently contradictory views, demonstrating both empirical and theoretical conditions under which each is correct. We find that optimal rates of characters-not genes-are generally robust to most experimental design decisions. Moreover, consideration of site rate heterogeneity within a given locus is critical to accurate predictions of utility. Factors such as taxon sampling or the targeted number of characters providing support for a topology are additionally critical to the predictions of phylogenetic utility based on the rate of character change. Further, optimality of rates and predictions of phylogenetic utility are not equivalent, demonstrating the need for further development of comprehensive theory of phylogenetic experimental design. [Divergence time; GC bias; homoplasy; incongruence; information content; internode length; optimal rates; phylogenetic informativeness; phylogenetic theory; phylogenetic utility; phylogenomics; signal and noise; subtending branch length; state space; taxon and character sampling.].
Collapse
Affiliation(s)
- Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, 1671 Goldstar Drive, NC 27601, USA
| | - Zhuo Su
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, 165 Prospect Street, CT 06525, USA
| | - Jeffrey P Townsend
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, 165 Prospect Street, CT 06525, USA
- Department of Biostatistics, Yale University, New Haven, 60 College Street, CT 06510, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, 300 George Street, CT 06511, USA
| |
Collapse
|
35
|
Alda F, Tagliacollo VA, Bernt MJ, Waltz BT, Ludt WB, Faircloth BC, Alfaro ME, Albert JS, Chakrabarty P. Resolving Deep Nodes in an Ancient Radiation of Neotropical Fishes in the Presence of Conflicting Signals from Incomplete Lineage Sorting. Syst Biol 2018; 68:573-593. [DOI: 10.1093/sysbio/syy085] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 11/30/2018] [Accepted: 12/03/2018] [Indexed: 12/13/2022] Open
Affiliation(s)
- Fernando Alda
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
- Department of Biology, Geology and Environmental Science, University of Tennessee at Chattanooga, Chattanooga, TN 37403, USA
| | - Victor A Tagliacollo
- Museu de Zoologia da Universidade de São Paulo (MZUSP), Ipirianga, 04263-000, São Paulo, São Paulo, Brazil
| | - Maxwell J Bernt
- Department of Biology, University of Louisiana at Lafayette, Lafayette, LA 70503, USA
| | - Brandon T Waltz
- Department of Biology, University of Louisiana at Lafayette, Lafayette, LA 70503, USA
| | - William B Ludt
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Brant C Faircloth
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Michael E Alfaro
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA
| | - James S Albert
- Department of Biology, University of Louisiana at Lafayette, Lafayette, LA 70503, USA
| | - Prosanta Chakrabarty
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| |
Collapse
|
36
|
Near TJ, MacGuigan DJ, Parker E, Struthers CD, Jones CD, Dornburg A. Phylogenetic analysis of Antarctic notothenioids illuminates the utility of RADseq for resolving Cenozoic adaptive radiations. Mol Phylogenet Evol 2018; 129:268-279. [PMID: 30195039 DOI: 10.1016/j.ympev.2018.09.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Revised: 08/31/2018] [Accepted: 09/01/2018] [Indexed: 10/28/2022]
Abstract
Notothenioids are a clade of ∼120 species of marine fishes distributed in extreme southern hemisphere temperate near-shore habitats and in the Southern Ocean surrounding Antarctica. Over the past 25 years, molecular and morphological approaches have redefined hypotheses of relationships among notothenioid lineages as well as their relationships among major lineages of percomorph teleosts. These phylogenies provide a basis for investigation of mechanisms of evolutionary diversification within the clade and have enhanced our understanding of the notothenioid adaptive radiation. Despite extensive efforts, there remain several questions concerning the phylogeny of notothenioids. In this study, we deploy DNA sequences of ∼100,000 loci obtained using RADseq to investigate the phylogenetic relationships of notothenioids and to assess the utility of RADseq loci for lineages that exhibit divergence times ranging from the Paleogene to the Quaternary. The notothenioid phylogenies inferred from the RADseq loci provide unparalleled resolution and node support for several long-standing problems including, (1) relationships among species of Trematomus, (2) resolution of Indonotothenia cyanobrancha as the sister lineage of Trematomus, (3) the deep paraphyly of Nototheniidae, (4) the paraphyly of Lepidonotothen s.l., (5) paraphyly of Artedidraco, and 6) the monophyly of the Bathydraconidae. Assessment of site rates demonstrates that RADseq loci are similar to mtDNA protein coding genes and exhibit peak phylogenetic informativeness at the time interval during which the major Antarctic notothenioid lineages originated and diversified. In addition to providing a well-resolved phylogenetic hypothesis for notothenioids, our analyses quantify the predicted utility of RADseq loci for Cenozoic phylogenetic inferences.
Collapse
Affiliation(s)
- Thomas J Near
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA; Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA.
| | - Daniel J MacGuigan
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA
| | - Elyse Parker
- Department of Ecology & Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT 06520, USA
| | - Carl D Struthers
- Museum of New Zealand Te Papa Tongarewa, Wellington, New Zealand
| | - Christopher D Jones
- Antarctic Ecosystem Research Division, NOAA Southwest Fisheries Science Center, La Jolla, CA 92037, USA
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC 27601, USA
| |
Collapse
|
37
|
Gilbert PS, Wu J, Simon MW, Sinsheimer JS, Alfaro ME. Filtering nucleotide sites by phylogenetic signal to noise ratio increases confidence in the Neoaves phylogeny generated from ultraconserved elements. Mol Phylogenet Evol 2018; 126:116-128. [PMID: 29626666 PMCID: PMC6217972 DOI: 10.1016/j.ympev.2018.03.033] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 03/20/2018] [Accepted: 03/29/2018] [Indexed: 02/06/2023]
Abstract
Despite genome scale analyses, high-level relationships among Neoaves birds remain contentious. The placements of the Neoaves superorders are notoriously difficult to resolve because they involve deep splits followed by short internodes. Using our approach, we investigate whether filtering UCE loci on their phylogenetic signal to noise ratio helps to resolve key nodes in the Neoaves tree of life. We find that our analysis of data sets filtered for high signal to noise ratio results in topologies that are inconsistent with unfiltered results but that are congruent with whole-genome analyses. These relationships include the Columbea + Passerea sister relationship and the Phaethontimorphae + Aequornithia sister relationship. We also find increased statistical support for more recent nodes (i.e. the Pelecanidae + Ardeidae sister relationship, the Eucavitaves clade, and the Otidiformes + Musophagiformes sister relationship). We also find instances where support is reduced for well-established clades, possibly due to the removal of sites with moderate signal-to-noise ratio. Our results suggest that filtering on the basis of signal to noise ratio is a useful tool for resolving problematic splits in phylogenomic data sets.
Collapse
Affiliation(s)
- Princess S Gilbert
- Department of Ecology & Evolutionary Biology, University of California, Los Angeles, CA, USA.
| | - Jing Wu
- Henry Samueli School of Engineering and Applied Science, Department of Computer Science, University of California, Los Angeles, CA, USA
| | - Margaret W Simon
- Department of Ecology & Evolutionary Biology, University of California, Los Angeles, CA, USA
| | - Janet S Sinsheimer
- Department of Biomathematics, David Geffen School of Medicine, University of California, Los Angeles, CA, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA, USA; Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, CA, USA
| | - Michael E Alfaro
- Department of Ecology & Evolutionary Biology, University of California, Los Angeles, CA, USA.
| |
Collapse
|
38
|
Mongiardino Koch N, Gauthier JA. Noise and biases in genomic data may underlie radically different hypotheses for the position of Iguania within Squamata. PLoS One 2018; 13:e0202729. [PMID: 30133514 PMCID: PMC6105018 DOI: 10.1371/journal.pone.0202729] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Accepted: 08/08/2018] [Indexed: 12/23/2022] Open
Abstract
Squamate reptiles are a major component of vertebrate biodiversity whose crown-clade traces its origin to a narrow window of time in the Mesozoic during which the main subclades diverged in rapid succession. Deciphering phylogenetic relationships among these lineages has proven challenging given the conflicting signals provided by genomic and phenomic data. Most notably, the placement of Iguania has routinely differed between data sources, with morphological evidence supporting a sister relationship to the remaining squamates (Scleroglossa hypothesis) and molecular data favoring a highly nested position alongside snakes and anguimorphs (Toxicofera hypothesis). We provide novel insights by generating an expanded morphological dataset and exploring the presence of phylogenetic signal, noise, and biases in molecular data. Our analyses confirm the presence of strong conflicting signals for the position of Iguania between morphological and molecular datasets. However, we also find that molecular data behave highly erratically when inferring the deepest branches of the squamate tree, a consequence of limited phylogenetic signal to resolve this ancient radiation with confidence. This, in turn, seems to result from a rate of evolution that is too high for historical signals to survive to the present. Finally, we detect significant systematic biases, with iguanians and snakes sharing faster rates of molecular evolution and a similarly biased nucleotide composition. A combination of scant phylogenetic signal, high levels of noise, and the presence of systematic biases could result in the misplacement of Iguania. We regard this explanation to be at least as plausible as the complex scenario of convergence and reversals required for morphological data to be misleading. We further evaluate and discuss the utility of morphological data to resolve ancient radiations, as well as its impact in combined-evidence phylogenomic analyses, with results relevant for the assessment of evidence and conflict across the Tree of Life.
Collapse
Affiliation(s)
- Nicolás Mongiardino Koch
- Department of Geology and Geophysics, Yale University, New Haven, Connecticut, United States of America
| | - Jacques A. Gauthier
- Department of Geology and Geophysics, Yale University, New Haven, Connecticut, United States of America
- Yale Peabody Museum of Natural History, New Haven, Connecticut, United States of America
| |
Collapse
|
39
|
Herrando-Moraira S. Exploring data processing strategies in NGS target enrichment to disentangle radiations in the tribe Cardueae (Compositae). Mol Phylogenet Evol 2018; 128:69-87. [PMID: 30036700 DOI: 10.1016/j.ympev.2018.07.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2018] [Revised: 07/13/2018] [Accepted: 07/14/2018] [Indexed: 12/17/2022]
Abstract
Target enrichment is a cost-effective sequencing technique that holds promise for elucidating evolutionary relationships in fast-evolving lineages. However, potential biases and impact of bioinformatic sequence treatments in phylogenetic inference have not been thoroughly explored yet. Here, we investigate this issue with an ultimate goal to shed light into a highly diversified group of Compositae (Asteraceae) constituted by four main genera: Arctium, Cousinia, Saussurea, and Jurinea. Specifically, we compared sequence data extraction methods implemented in two easy-to-use workflows, PHYLUCE and HybPiper, and assessed the impact of two filtering practices intended to reduce phylogenetic noise. In addition, we compared two phylogenetic inference methods: (1) the concatenation approach, in which all loci were concatenated in a supermatrix; and (2) the coalescence approach, in which gene trees were produced independently and then used to construct a species tree under coalescence assumptions. Here we confirm the usefulness of the set of 1061 COS targets (a nuclear conserved orthology loci set developed for the Compositae) across a variety of taxonomic levels. Intergeneric relationships were completely resolved: there are two sister groups, Arctium-Cousinia and Saussurea-Jurinea, which are in agreement with a morphological hypothesis. Intrageneric relationships among species of Arctium, Cousinia, and Saussurea are also well defined. Conversely, conflicting species relationships remain for Jurinea. Methodological choices significantly affected phylogenies in terms of topology, branch length, and support. Across all analyses, the phylogeny obtained using HybPiper and the strictest scheme of removing fast-evolving sites was estimated as the optimal. Regarding methodological choices, we conclude that: (1) trees obtained under the coalescence approach are topologically more congruent between them than those inferred using the concatenation approach; (2) refining treatments only improved support values under the concatenation approach; and (3) branch support values are maximized when fast-evolving sites are removed in the concatenation approach, and when a higher number of loci is analyzed in the coalescence approach.
Collapse
Affiliation(s)
- Sonia Herrando-Moraira
- Botanic Institute of Barcelona (IBB, CSIC-ICUB), Pg. del Migdia, s.n., 08038 Barcelona, Spain.
| | | |
Collapse
|
40
|
Bagley JC, Mayden RL, Harris PM. Phylogeny and divergence times of suckers (Cypriniformes: Catostomidae) inferred from Bayesian total-evidence analyses of molecules, morphology, and fossils. PeerJ 2018; 6:e5168. [PMID: 30013838 PMCID: PMC6035723 DOI: 10.7717/peerj.5168] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Accepted: 06/15/2018] [Indexed: 11/25/2022] Open
Abstract
Catostomidae ("suckers") is a diverse (76 species) and broadly distributed family of Holarctic freshwater fishes with a rich fossil record and a considerable number (∼35%) of threatened and imperiled species. We integrate DNA sequences (three mitochondrial genes, three nuclear genes), morphological data, and fossil information to infer sucker phylogenetic relationships and divergence times using Bayesian "total-evidence" methods, and then test hypotheses about the temporal diversification of the group. Our analyses resolved many nodes within subfamilies and clarified Catostominae relationships to be of the form ((Thoburniini, Moxostomatini), (Erimyzonini, Catostomini)). Patterns of subfamily relationships were incongruent, but mainly supported two placements of the Myxocyprininae; distinguishing these using Bayes factors lent strongest support to a model with Myxocyprininae sister to all remaining sucker lineages. We improved our Bayesian total-evidence dating analysis by excluding problematic characters, using a clock-partitioning scheme identified by Bayesian model selection, and employing a fossilized birth-death tree prior accommodating morphological data and fossils. The resulting chronogram showed that suckers evolved since the Late Cretaceous-Eocene, and that the Catostomini and Moxostomatini clades have accumulated species diversity since the early to mid-Miocene. These results agree with the fossil record and confirm previous hypotheses about dates for the origins of Catostomide and catostomine diversification, but reject previous molecular hypotheses about the timing of divergence of ictiobines, and between Asian-North American lineages. Overall, our findings from a synthesis of multiple data types enhance understanding of the phylogenetic relationships, taxonomic classification, and temporal diversification of suckers, while also highlighting practical methods for improving Bayesian divergence dating models by coupling phylogenetic informativeness profiling with relaxed-clock partitioning.
Collapse
Affiliation(s)
- Justin C. Bagley
- Department of Biology, Virginia Commonwealth University, Richmond, VA, USA
- Departamento de Zoologia, Universidade de Brasília, Brasília, Distrito Federal, Brazil
- Departamento de Zoologia e Botânica, IBiLCE, Universidade Estadual Paulista, São José do Rio Preto, São Paulo, Brazil
| | | | - Phillip M. Harris
- Department of Biological Sciences, The University of Alabama, Tuscaloosa, AL, USA
| |
Collapse
|
41
|
Rivera-Rivera CJ, Montoya-Burgos JI. Back to the roots: Reducing evolutionary rate heterogeneity among sequences gives support for the early morphological hypothesis of the root of Siluriformes (Teleostei: Ostariophysi). Mol Phylogenet Evol 2018; 127:272-279. [PMID: 29885935 DOI: 10.1016/j.ympev.2018.06.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2017] [Revised: 06/04/2018] [Accepted: 06/04/2018] [Indexed: 11/15/2022]
Abstract
Catfishes (Teleostei: Siluriformes) are a highly diverse order within Ostariophysi that is distributed worldwide. At the base of this clade emerge three lineages with well-defined monophylies: Diplomystidae, Loricarioidei, and Siluroidei. Morphological phylogeny studies place the Diplomystidae as the earliest branching of these three lineages, but studies based on molecular phylogenetics consistently find the fast-evolving Loricarioidei instead. The high lineage evolutionary rate heterogeneity in this order and the fact that the lineage placed closest to the root in the molecular phylogenies is fast evolving, including many long branches, raises the possibility that the discrepancy between morphological and molecular phylogenies may be the result of a long branch attraction inference artifact. We test this hypothesis by using a 10-gene dataset to evaluate the arrangement of the three main siluriform lineages, and apply the LS3 and LS4 taxon sequence subsampling methods to reduce evolutionary rate heterogeneity among lineages. The initial and complete dataset supports the basal branching of Loricarioidei as in all previous molecular phylogenies, but once lineage rate heterogeneity is reduced with LS3 or LS4 through the removal of sequences disrupting homogeneity, the phylogeny shows Diplomystidae as the earliest branching group, with high supports, as proposed by morphological phylogeny. The result obtained with LS3, however, introduces the misplacement of one of the species with the highest amount of missing data, Scoloplax sp. Because the sequence sub-selection criterion of LS4 has been optimized to reduce data removal, the phylogeny resulting from the LS4-processed data is in agreement with the known intra-lineage relationships in addition to supporting the morphologically-based rooting hypothesis. Our results are the first instance in which a consensus between molecular and morphological phylogeny is reached concerning the root of this order.
Collapse
Affiliation(s)
- Carlos J Rivera-Rivera
- Department of Genetics and Evolution, University of Geneva, Switzerland; Institute of Genetics and Genomics in Geneva (iGE3), University of Geneva, Switzerland
| | - Juan I Montoya-Burgos
- Department of Genetics and Evolution, University of Geneva, Switzerland; Institute of Genetics and Genomics in Geneva (iGE3), University of Geneva, Switzerland.
| |
Collapse
|
42
|
Abstract
Kimura's neutral theory argued that positive selection was not responsible for an appreciable fraction of molecular substitutions. Correspondingly, quantitative analysis reveals that the vast majority of substitutions in cancer genomes are not detectably under selection. Insights from the somatic evolution of cancer reveal that beneficial substitutions in cancer constitute a small but important fraction of the molecular variants. The molecular evolution of cancer community will benefit by incorporating the neutral theory of molecular evolution into their understanding and analysis of cancer evolution-and accepting the use of tractable, predictive models, even when there is some evidence that they are not perfect.
Collapse
Affiliation(s)
| | - Jeffrey P Townsend
- Department of Biostatistics, Yale University, New Haven, CT
- Program in Computational Biology and Bioinformatics
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT
| |
Collapse
|
43
|
Fonseca LHM, Lohmann LG. Combining high-throughput sequencing and targeted loci data to infer the phylogeny of the “Adenocalymma-Neojobertia” clade (Bignonieae, Bignoniaceae). Mol Phylogenet Evol 2018; 123:1-15. [DOI: 10.1016/j.ympev.2018.01.023] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2017] [Revised: 01/25/2018] [Accepted: 01/31/2018] [Indexed: 10/18/2022]
|
44
|
Gates DJ, Pilson D, Smith SD. Filtering of target sequence capture individuals facilitates species tree construction in the plant subtribe Iochrominae (Solanaceae). Mol Phylogenet Evol 2018; 123:26-34. [DOI: 10.1016/j.ympev.2018.02.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Revised: 01/30/2018] [Accepted: 02/01/2018] [Indexed: 10/18/2022]
|
45
|
Sayyari E, Mirarab S. Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies. Genes (Basel) 2018; 9:E132. [PMID: 29495636 PMCID: PMC5867853 DOI: 10.3390/genes9030132] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Revised: 01/30/2018] [Accepted: 02/16/2018] [Indexed: 12/23/2022] Open
Abstract
Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest.
Collapse
Affiliation(s)
- Erfan Sayyari
- Department of Electrical and Computer Engineering, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA.
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA.
| |
Collapse
|
46
|
Chakrabarty P, Faircloth BC, Alda F, Ludt WB, Mcmahan CD, Near TJ, Dornburg A, Albert JS, Arroyave J, Stiassny MLJ, Sorenson L, Alfaro ME. Phylogenomic Systematics of Ostariophysan Fishes: Ultraconserved Elements Support the Surprising Non-Monophyly of Characiformes. Syst Biol 2018; 66:881-895. [PMID: 28334176 DOI: 10.1093/sysbio/syx038] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 02/24/2016] [Indexed: 12/30/2022] Open
Abstract
Ostariophysi is a superorder of bony fishes including more than 10,300 species in 1100 genera and 70 families. This superorder is traditionally divided into five major groups (orders): Gonorynchiformes (milkfishes and sandfishes), Cypriniformes (carps and minnows), Characiformes (tetras and their allies), Siluriformes (catfishes), and Gymnotiformes (electric knifefishes). Unambiguous resolution of the relationships among these lineages remains elusive, with previous molecular and morphological analyses failing to produce a consensus phylogeny. In this study, we use over 350 ultraconserved element (UCEs) loci comprising 5 million base pairs collected across 35 representative ostariophysan species to compile one of the most data-rich phylogenies of fishes to date. We use these data to infer higher level (interordinal) relationships among ostariophysan fishes, focusing on the monophyly of the Characiformes-one of the most contentiously debated groups in fish systematics. As with most previous molecular studies, we recover a non-monophyletic Characiformes with the two monophyletic suborders, Citharinoidei and Characoidei, more closely related to other ostariophysan clades than to each other. We also explore incongruence between results from different UCE data sets, issues of orthology, and the use of morphological characters in combination with our molecular data. [Conserved sequence; ichthyology; massively parallel sequencing; morphology; next-generation sequencing; UCEs.].
Collapse
Affiliation(s)
- Prosanta Chakrabarty
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, 119 Foster Hall, Baton Rouge, LA 70803, USA
| | - Brant C Faircloth
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, 119 Foster Hall, Baton Rouge, LA 70803, USA
| | - Fernando Alda
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, 119 Foster Hall, Baton Rouge, LA 70803, USA
| | - William B Ludt
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, 119 Foster Hall, Baton Rouge, LA 70803, USA
| | - Caleb D Mcmahan
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, 119 Foster Hall, Baton Rouge, LA 70803, USA.,The Field Museum of Natural History, 1400 S Lake Shore Dr, Chicago, IL 60605, USA
| | - Thomas J Near
- Department of Ecology and Evolutionary Biology, and Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC 27601, USA
| | - James S Albert
- Department of Biology, University of Louisiana, Lafayette, LA 70504, USA
| | - Jairo Arroyave
- Instituto de Biología, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Melanie L J Stiassny
- Department of Ichthyology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA
| | - Laurie Sorenson
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, 119 Foster Hall, Baton Rouge, LA 70803, USA.,Department of Ecology and Evolutionary Biology, University of California Los Angeles, 610 Yound Drive South, Los Angeles, CA 90095, USA
| | - Michael E Alfaro
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, 610 Yound Drive South, Los Angeles, CA 90095, USA
| |
Collapse
|
47
|
Klopfstein S, Massingham T, Goldman N. More on the Best Evolutionary Rate for Phylogenetic Analysis. Syst Biol 2018; 66:769-785. [PMID: 28595363 PMCID: PMC5790136 DOI: 10.1093/sysbio/syx051] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Accepted: 05/24/2017] [Indexed: 11/13/2022] Open
Abstract
The accumulation of genome-scale molecular data sets for nonmodel taxa brings us ever closer to resolving the tree of life of all living organisms. However, despite the depth of data available, a number of studies that each used thousands of genes have reported conflicting results. The focus of phylogenomic projects must thus shift to more careful experimental design. Even though we still have a limited understanding of what are the best predictors of the phylogenetic informativeness of a gene, there is wide agreement that one key factor is its evolutionary rate; but there is no consensus as to whether the rates derived as optimal in various analytical, empirical, and simulation approaches have any general applicability. We here use simulations to infer optimal rates in a set of realistic phylogenetic scenarios with varying tree sizes, numbers of terminals, and tree shapes. Furthermore, we study the relationship between the optimal rate and rate variation among sites and among lineages. Finally, we examine how well the predictions made by a range of experimental design methods correlate with the observed performance in our simulations.We find that the optimal level of divergence is surprisingly robust to differences in taxon sampling and even to among-site and among-lineage rate variation as often encountered in empirical data sets. This finding encourages the use of methods that rely on a single optimal rate to predict a gene's utility. Focusing on correct recovery either of the most basal node in the phylogeny or of the entire topology, the optimal rate is about 0.45 substitutions from root to tip in average Yule trees and about 0.2 in difficult trees with short basal and long-apical branches, but all rates leading to divergence levels between about 0.1 and 0.5 perform reasonably well.Testing the performance of six methods that can be used to predict a gene's utility against our simulation results, we find that the probability of resolution, signal-noise analysis, and Fisher information are good predictors of phylogenetic informativeness, but they require specification of at least part of a model tree. Likelihood quartet mapping also shows very good performance but only requires sequence alignments and is thus applicable without making assumptions about the phylogeny. Despite them being the most commonly used methods for experimental design, geometric quartet mapping and the integration of phylogenetic informativeness curves perform rather poorly in our comparison. Instead of derived predictors of phylogenetic informativeness, we suggest that the number of sites in a gene that evolve at near-optimal rates (as inferred here) could be used directly to prioritize genes for phylogenetic inference. In combination with measures of model fit, especially with respect to compositional biases and among-site and among-lineage rate variation, such an approach has the potential to greatly improve marker choice and should be tested on empirical data.
Collapse
Affiliation(s)
- Seraina Klopfstein
- Naturhistorisches Museum der Burgergemeinde Bern, Bernastr. 15, CH-3005 Bern, Switzerland.,University of Bern, Institute of Ecology and Evolution, Baltzerstr. 6, CH-3012 Bern, Switzerland
| | - Tim Massingham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Nick Goldman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| |
Collapse
|
48
|
Rios L, Coronado L, Naranjo-Feliciano D, Martínez-Pérez O, Perera CL, Hernandez-Alvarez L, Díaz de Arce H, Núñez JI, Ganges L, Pérez LJ. Deciphering the emergence, genetic diversity and evolution of classical swine fever virus. Sci Rep 2017; 7:17887. [PMID: 29263428 PMCID: PMC5738429 DOI: 10.1038/s41598-017-18196-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Accepted: 12/01/2017] [Indexed: 12/15/2022] Open
Abstract
Classical swine fever (CSF) is one of the most important infectious diseases causing significant economic losses. Its causal agent, CSF virus (CSFV), is a member of the Pestivirus genus included into the Flaviviridae family. Previous molecular epidemiology studies have revealed the CSFV diversity is divided into three main genotypes and different subgenotypes. However, the classification system for CSFV has not yet been harmonized internationally. Similarly, the phylogeny and evolutionary dynamics of CSFV remain unclear. The current study provides novel and significant insights into the origin, diversification and evolutionary process of CSFV. In addition, the best phylogenetic marker for CSFV capable of reproducing the same phylogenetic and evolutionary information as the complete viral genome is characterized. Also, a reliable cut-off to accurately classify CSFV at genotype and subgenotype levels is established. Based on the time for the most recent common ancestor (tMRCA) reconstruction and cophylogenetic analysis, it was determined that CSFV emerged around 225 years ago when the Tunisian Sheep Virus jumped from its natural host to swine. CSFV emergence was followed by a genetic expansion in three main lineages, driven by the action of positive selection pressure and functional divergence, as main natural forces.
Collapse
Affiliation(s)
- Liliam Rios
- University of New Brunswick, Saint John, New Brunswick, E2L4L5, Canada
| | - Liani Coronado
- Centro Nacional de Sanidad Agropecuaria (CENSA), La Habana, 32700, Cuba
| | | | | | - Carmen L Perera
- Centro Nacional de Sanidad Agropecuaria (CENSA), La Habana, 32700, Cuba
| | | | - Heidy Díaz de Arce
- Hospital Italiano de Buenos Aires, Juan D. Perón 4190, C1181ACH, Buenos Aires, Argentina
| | - José I Núñez
- IRTA-CReSA. Centre de Recerca en Sanitat Animal, Barcelona, 08193, Spain
| | - Llilianne Ganges
- IRTA-CReSA. Centre de Recerca en Sanitat Animal, Barcelona, 08193, Spain.,OIE Reference Laboratory for Classical Swine Fever and OIE Collaborative Centre for Research and Control of Emerging and Re-emerging Swine Diseases in Europe, IRTA-CReSA, Barcelona, Spain
| | - Lester J Pérez
- Dalhousie University, Dalhousie Medicine New Brunswick, Saint John, New Brunswick, E2L4L5, Canada.
| |
Collapse
|
49
|
A pilot study applying the plant Anchored Hybrid Enrichment method to New World sages (Salvia subgenus Calosphace; Lamiaceae). Mol Phylogenet Evol 2017; 117:124-134. [DOI: 10.1016/j.ympev.2017.02.006] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 02/06/2017] [Accepted: 02/06/2017] [Indexed: 11/18/2022]
|
50
|
Dornburg A, Townsend JP, Wang Z. Maximizing Power in Phylogenetics and Phylogenomics: A Perspective Illuminated by Fungal Big Data. ADVANCES IN GENETICS 2017; 100:1-47. [PMID: 29153398 DOI: 10.1016/bs.adgen.2017.09.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Since its original inception over 150 years ago by Darwin, we have made tremendous progress toward the reconstruction of the Tree of Life. In particular, the transition from analyzing datasets comprised of small numbers of loci to those comprised of hundreds of loci, if not entire genomes, has aided in resolving some of the most vexing of evolutionary problems while giving us a new perspective on biodiversity. Correspondingly, phylogenetic trees have taken a central role in fields that span ecology, conservation, and medicine. However, the rise of big data has also presented phylogenomicists with a new set of challenges to experimental design, quantitative analyses, and computation. The sequencing of a number of very first genomes presented significant challenges to phylogenetic inference, leading fungal phylogenomicists to begin addressing pitfalls and postulating solutions to the issues that arise from genome-scale analyses relevant to any lineage across the Tree of Life. Here we highlight insights from fungal phylogenomics for topics including systematics and species delimitation, ecological and phenotypic diversification, and biogeography while providing an overview of progress made on the reconstruction of the fungal Tree of Life. Finally, we provide a review of considerations to phylogenomic experimental design for robust tree inference. We hope that this special issue of Advances in Genetics not only excites the continued progress of fungal evolutionary biology but also motivates the interdisciplinary development of new theory and methods designed to maximize the power of genomic scale data in phylogenetic analyses.
Collapse
Affiliation(s)
- Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC, United States
| | | | - Zheng Wang
- Yale University, New Haven, CT, United States.
| |
Collapse
|