1. Benedict C, Delgado A, Pen I, Vaga C, Daly M, Quattrini AM. Sea anemone (Anthozoa, Actiniaria) diversity in Mo'orea (French Polynesia). Mol Phylogenet Evol 2024; 198:108118. [PMID: 38849066 DOI: 10.1016/j.ympev.2024.108118] [Received: 01/05/2024] [Revised: 05/20/2024] [Accepted: 06/04/2024] [Indexed: 06/09/2024]
Abstract
Sea anemones (Order Actiniaria) are a diverse group of marine invertebrates ubiquitous across marine ecosystems. Despite their wide distribution and success, a knowledge gap persists in our understanding of their diversity within tropical systems, owing to a sampling bias in which larger, more charismatic species overshadow cryptic lineages. This study aims to delineate the sea anemone diversity in Mo'orea (French Polynesia) using a dataset from the Mo'orea Biocode's "BioBlitz" initiative, which prioritized the sampling of more cryptic and understudied taxa. Implementing a target enrichment approach, we integrate 71 newly sequenced samples into an expansive phylogenetic framework and contextualize Mo'orea's diversity within global distribution patterns of sea anemones. Our analysis corroborates the presence of several previously documented sea anemones in French Polynesia and identifies for the first time the occurrence of members of the genera Andvakia and Aiptasiomorpha. This research unveils the diverse sea anemone fauna in Mo'orea, spotlighting the area's ecological significance and emphasizing the need for continued exploration. Our methodology, encompassing a broad BLAST search coupled with phylogenetic analysis, proved to be a practical and effective approach for overcoming the limitations posed by the lack of comprehensive sequence data for sea anemones. We discuss the merits and limitations of current molecular methodologies and stress the importance of further research into lesser-studied marine organisms like sea anemones. Our work sets a precedent for future phylogenetic studies stemming from BioBlitz endeavors.
Affiliation(s)
- Charlotte Benedict
  - The Ohio State University, Department of Evolution, Ecology, and Organismal Biology, 1315 Kinnear Rd, Columbus, OH 43212, USA
- Alonso Delgado
  - The Ohio State University, Department of Evolution, Ecology, and Organismal Biology, 1315 Kinnear Rd, Columbus, OH 43212, USA
- Isabel Pen
  - The Ohio State University, Department of Evolution, Ecology, and Organismal Biology, 1315 Kinnear Rd, Columbus, OH 43212, USA
- Claudia Vaga
  - Department of Invertebrate Zoology, Smithsonian Institution's National Museum of Natural History, 10th and Constitution Ave NW, Washington, DC 20560, USA
- Marymegan Daly
  - The Ohio State University, Department of Evolution, Ecology, and Organismal Biology, 1315 Kinnear Rd, Columbus, OH 43212, USA
- Andrea M Quattrini
  - Department of Invertebrate Zoology, Smithsonian Institution's National Museum of Natural History, 10th and Constitution Ave NW, Washington, DC 20560, USA
2. Middlebrook EA, Katani R, Fair JM. OrthoPhyl-streamlining large-scale, orthology-based phylogenomic studies of bacteria at broad evolutionary scales. G3 (Bethesda) 2024; 14:jkae119. [PMID: 38839049 PMCID: PMC11304591 DOI: 10.1093/g3journal/jkae119] [Received: 04/03/2024] [Revised: 05/15/2024] [Accepted: 05/29/2024] [Indexed: 06/07/2024]
Abstract
There are a staggering number of publicly available bacterial genome sequences (at writing, 2.0 million assemblies in NCBI's GenBank alone), and the deposition rate continues to increase. This wealth of data begs for phylogenetic analyses to place these sequences within an evolutionary context. A phylogenetic placement not only aids in taxonomic classification but informs the evolution of novel phenotypes, targets of selection, and horizontal gene transfer. Building trees from multi-gene codon alignments is a laborious task that requires bioinformatic expertise, rigorous curation of orthologs, and heavy computation. Compounding the problem is the lack of tools that can streamline these processes for building trees from large-scale genomic data. Here we present OrthoPhyl, which takes bacterial genome assemblies and reconstructs trees from whole genome codon alignments. The analysis pipeline can analyze an arbitrarily large number of input genomes (>1200 tested here) by identifying a diversity-spanning subset of assemblies and using these genomes to build gene models to infer orthologs in the full dataset. To illustrate the versatility of OrthoPhyl, we show three use cases: E. coli/Shigella, Brucella/Ochrobactrum and the order Rickettsiales. We compare trees generated with OrthoPhyl to trees generated with kSNP3 and GToTree along with published trees using alternative methods. We show that OrthoPhyl trees are consistent with other methods while incorporating more data, allowing for greater numbers of input genomes, and more flexibility of analysis.
Affiliation(s)
- Earl A Middlebrook
  - Genomics and Bioanalytics Group, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
- Robab Katani
  - 401 Huck Life Sciences Building, Huck Institutes of Life Sciences, Pennsylvania State University, University Park, PA 16802, USA
- Jeanne M Fair
  - Genomics and Bioanalytics Group, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
3. Fleming J, Eriksen PM, Struck TH. Scoutknife: A naïve, whole genome informed phylogenetic robusticity metric. F1000Res 2024; 12:945. [PMID: 38799242 PMCID: PMC11128044 DOI: 10.12688/f1000research.139356.1] [Accepted: 07/05/2024] [Indexed: 05/29/2024] Open
Abstract
Background: The phylogenetic bootstrap, first proposed by Felsenstein in 1985, is a critically important statistical method in assessing the robusticity of phylogenetic datasets. Core to its concept was the use of pseudo sampling - assessing the data by generating new replicates derived from the initial dataset that was used to generate the phylogeny. In this way, phylogenetic support metrics could overcome the lack of perfect, infinite data. With infinite data, however, it is possible to sample smaller replicates directly from the data to obtain both the phylogeny and its statistical robusticity in the same analysis. Due to the growth of whole genome sequencing, the depth and breadth of our datasets have greatly expanded and are set to only expand further. With genome-scale datasets comprising thousands of genes, we can now obtain a proxy for infinite data. Accordingly, we can potentially abandon the notion of pseudo sampling and instead randomly sample small subsets of genes from the thousands of genes in our analyses. Methods: We introduce Scoutknife, a jackknife-style subsampling implementation that generates 100 datasets by randomly sampling a small number of genes from an initial large-gene dataset to jointly establish both a phylogenetic hypothesis and assess its robusticity. We assess its effectiveness by using 18 previously published datasets and 100 simulation studies. Results: We show that Scoutknife is conservative and informative as to conflicts and incongruence across the whole genome, without the need for subsampling based on traditional model selection criteria. Conclusions: Scoutknife reliably achieves comparable results to selecting the best genes on both real and simulation datasets, while being resistant to the potential biases caused by selecting for model fit. As the amount of genome data grows, it becomes an even more exciting option to assess the robusticity of phylogenetic hypotheses.
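The subsampling idea described in the Methods can be sketched as follows. This is a minimal illustration, not the Scoutknife implementation: the function names, the `genes_per_replicate` default, and the representation of each replicate tree as a set of splits are all assumptions made for the example. Only the figure of 100 replicates comes from the abstract.

```python
import random
from collections import Counter


def subsample_genes(genes, n_replicates=100, genes_per_replicate=50, seed=0):
    """Draw gene subsets without replacement (jackknife-style), one per replicate.

    `genes` must be a list; `genes_per_replicate` is an assumed value,
    since the abstract only says "a small number of genes".
    """
    rng = random.Random(seed)
    return [sorted(rng.sample(genes, genes_per_replicate))
            for _ in range(n_replicates)]


def split_support(replicate_splits):
    """Fraction of replicate trees that contain each split.

    `replicate_splits` is a list with one collection of splits per replicate
    tree; each split is a hashable object such as a frozenset of taxon names.
    Tree inference itself is outside this sketch.
    """
    counts = Counter(split for splits in replicate_splits for split in set(splits))
    n = len(replicate_splits)
    return {split: count / n for split, count in counts.items()}
```

In use, each gene subset would be concatenated and passed to a tree-inference program, and `split_support` would then summarise how often each grouping recurs across the replicate trees.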
Affiliation(s)
- James Fleming
  - Natural History Museum, Universitetet i Oslo, Oslo, Oslo, 0562, Norway
4. Stiller J, Feng S, Chowdhury AA, Rivas-González I, Duchêne DA, Fang Q, Deng Y, Kozlov A, Stamatakis A, Claramunt S, Nguyen JMT, Ho SYW, Faircloth BC, Haag J, Houde P, Cracraft J, Balaban M, Mai U, Chen G, Gao R, Zhou C, Xie Y, Huang Z, Cao Z, Yan Z, Ogilvie HA, Nakhleh L, Lindow B, Morel B, Fjeldså J, Hosner PA, da Fonseca RR, Petersen B, Tobias JA, Székely T, Kennedy JD, Reeve AH, Liker A, Stervander M, Antunes A, Tietze DT, Bertelsen MF, Lei F, Rahbek C, Graves GR, Schierup MH, Warnow T, Braun EL, Gilbert MTP, Jarvis ED, Mirarab S, Zhang G. Complexity of avian evolution revealed by family-level genomes. Nature 2024; 629:851-860. [PMID: 38560995 PMCID: PMC11111414 DOI: 10.1038/s41586-024-07323-1] [Received: 04/25/2023] [Accepted: 03/15/2024] [Indexed: 04/04/2024]
Abstract
Despite tremendous efforts in the past decades, relationships among main avian lineages remain heavily debated without a clear resolution. Discrepancies have been attributed to diversity of species sampled, phylogenetic method and the choice of genomic regions [1-3]. Here we address these issues by analysing the genomes of 363 bird species [4] (218 taxonomic families, 92% of total). Using intergenic regions and coalescent methods, we present a well-supported tree but also a marked degree of discordance. The tree confirms that Neoaves experienced rapid radiation at or near the Cretaceous-Palaeogene boundary. Sufficient loci rather than extensive taxon sampling were more effective in resolving difficult nodes. Remaining recalcitrant nodes involve species that are a challenge to model due to either extreme DNA composition, variable substitution rates, incomplete lineage sorting or complex evolutionary events such as ancient hybridization. Assessment of the effects of different genomic partitions showed high heterogeneity across the genome. We discovered sharp increases in effective population size, substitution rates and relative brain size following the Cretaceous-Palaeogene extinction event, supporting the hypothesis that emerging ecological opportunities catalysed the diversification of modern birds. The resulting phylogenetic estimate offers fresh insights into the rapid radiation of modern birds and provides a taxon-rich backbone tree for future comparative studies.
Affiliation(s)
- Josefin Stiller
  - Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Shaohong Feng
  - Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
  - Department of General Surgery, Sir Run-Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, China
  - Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, China
- Al-Aabid Chowdhury
  - School of Life and Environmental Sciences, University of Sydney, Sydney, New South Wales, Australia
- David A Duchêne
  - Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Qi Fang
  - BGI Research, Shenzhen, China
- Yuan Deng
  - BGI Research, Shenzhen, China
  - BGI Research, Wuhan, China
- Alexey Kozlov
  - Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Alexandros Stamatakis
  - Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
  - Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
  - Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
- Santiago Claramunt
  - Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
  - Department of Natural History, Royal Ontario Museum, Toronto, Ontario, Canada
- Jacqueline M T Nguyen
  - College of Science and Engineering, Flinders University, Adelaide, South Australia, Australia
  - Australian Museum Research Institute, Sydney, New South Wales, Australia
- Simon Y W Ho
  - School of Life and Environmental Sciences, University of Sydney, Sydney, New South Wales, Australia
- Brant C Faircloth
  - Department of Biological Sciences and Museum of Natural Science, Louisiana State University, Baton Rouge, LA, USA
- Julia Haag
  - Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Peter Houde
  - Department of Biology, New Mexico State University, Las Cruces, NM, USA
- Joel Cracraft
  - Department of Ornithology, American Museum of Natural History, New York, NY, USA
- Metin Balaban
  - Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA, USA
- Uyen Mai
  - Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
- Guangji Chen
  - BGI Research, Wuhan, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Rongsheng Gao
  - BGI Research, Wuhan, China
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Yulong Xie
  - Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Zijian Huang
  - Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Zhen Cao
  - Department of Computer Science, Rice University, Houston, TX, USA
- Zhi Yan
  - Department of Computer Science, Rice University, Houston, TX, USA
- Huw A Ogilvie
  - Department of Computer Science, Rice University, Houston, TX, USA
- Luay Nakhleh
  - Department of Computer Science, Rice University, Houston, TX, USA
- Bent Lindow
  - Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
- Benoit Morel
  - Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
  - Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
- Jon Fjeldså
  - Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
- Peter A Hosner
  - Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
  - Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Rute R da Fonseca
  - Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Bent Petersen
  - Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
  - Centre of Excellence for Omics-Driven Computational Biodiscovery, Faculty of Applied Sciences, AIMST University, Bedong, Malaysia
- Joseph A Tobias
  - Department of Life Sciences, Imperial College London, Silwood Park, Ascot, UK
- Tamás Székely
  - Milner Centre for Evolution, University of Bath, Bath, UK
  - ELKH-DE Reproductive Strategies Research Group, University of Debrecen, Debrecen, Hungary
- Jonathan David Kennedy
  - Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Andrew Hart Reeve
  - Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
- Andras Liker
  - HUN-REN-PE Evolutionary Ecology Research Group, University of Pannonia, Veszprém, Hungary
  - Behavioural Ecology Research Group, Center for Natural Sciences, University of Pannonia, Veszprém, Hungary
- Agostinho Antunes
  - CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
  - Department of Biology, Faculty of Sciences, University of Porto, Porto, Portugal
- Mads F Bertelsen
  - Centre for Zoo and Wild Animal Health, Copenhagen Zoo, Frederiksberg, Denmark
- Fumin Lei
  - Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
  - College of Life Science, University of Chinese Academy of Sciences, Beijing, China
- Carsten Rahbek
  - Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Copenhagen, Denmark
  - Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
  - Institute of Ecology, Peking University, Beijing, China
  - Danish Institute for Advanced Study, University of Southern Denmark, Odense, Denmark
- Gary R Graves
  - Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
  - Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
- Tandy Warnow
  - University of Illinois Urbana-Champaign, Champaign, IL, USA
- Edward L Braun
  - Department of Biology, University of Florida, Gainesville, FL, USA
- M Thomas P Gilbert
  - Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
  - University Museum, NTNU, Trondheim, Norway
- Erich D Jarvis
  - Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
  - Howard Hughes Medical Institute, Durham, NC, USA
- Guojie Zhang
  - Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
  - Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, China
  - BGI Research, Wuhan, China
  - Villum Center for Biodiversity Genomics, Department of Biology, University of Copenhagen, Copenhagen, Denmark
5. Wagle S, Markin A, Górecki P, Anderson TK, Eulenstein O. Asymmetric Cluster-Based Measures for Comparative Phylogenetics. J Comput Biol 2024; 31:312-327. [PMID: 38634854 PMCID: PMC11057527 DOI: 10.1089/cmb.2023.0338] [Indexed: 04/19/2024] Open
Abstract
Phylogenetic inference and reconstruction methods generate hypotheses on evolutionary history. Competing inference methods are frequently used, and the evaluation of the generated hypotheses is achieved using tree comparison costs. The Robinson-Foulds (RF) distance is a widely used cost to compare the topology of two trees, but this cost is sensitive to tree error and can overestimate tree differences. To overcome this limitation, a refined version of the RF distance called the Cluster Affinity (CA) distance was introduced. However, CA distances are symmetric and cannot compare different types of trees. These asymmetric comparisons occur when gene trees are compared with species trees, when disparate datasets are integrated into a supertree, or when tree comparison measures are used to infer a phylogenetic network. In this study, we introduce a relaxation of the original Affinity distance to compare heterogeneous trees called the asymmetric CA cost. We also develop a biologically interpretable cost, the Cluster Support cost that normalizes by cluster size across gene trees. The characteristics of these costs are similar to the symmetric CA cost. We describe efficient algorithms, derive the exact diameters, and use these to standardize the cost to be applicable in practice. These costs provide objective, fine-scale, and biologically interpretable values that can assess differences and similarities between phylogenetic trees.
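For orientation, the baseline that these costs refine can be sketched as follows: the Robinson-Foulds distance on rooted trees, computed as the number of clusters found in exactly one of the two trees. This is only the plain RF cluster count plus a one-directional variant added to show what "asymmetric" means here. It is not the Cluster Affinity or Cluster Support cost, whose definitions the abstract does not give. The nested-tuple tree encoding and all function names are assumptions made for the example.

```python
def clusters(tree):
    """Non-trivial clusters (leaf sets below internal nodes) of a rooted tree.

    The tree is a nested tuple of leaf labels, e.g. ((("A", "B"), "C"), "D").
    """
    found = set()

    def leaves(node):
        if not isinstance(node, tuple):
            return frozenset([node])
        below = frozenset().union(*(leaves(child) for child in node))
        found.add(below)
        return below

    root = leaves(tree)
    found.discard(root)  # the root cluster is shared by all trees on the same leaves
    return found


def rf_distance(tree_a, tree_b):
    """Symmetric RF distance: clusters present in exactly one of the two trees."""
    return len(clusters(tree_a) ^ clusters(tree_b))


def missing_clusters(source, target):
    """One-directional count: clusters of `source` that are absent from `target`."""
    return len(clusters(source) - clusters(target))
```

Comparing a fully resolved tree against an unresolved (star) tree shows the asymmetry: every cluster of the resolved tree is missing from the star tree, while the star tree has no clusters that could be missing from the resolved one.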
Affiliation(s)
- Sanket Wagle
  - Department of Computer Science, Iowa State University, Ames, Iowa, USA
- Alexey Markin
  - National Animal Disease Center, USDA-ARS, Ames, Iowa, USA
- Paweł Górecki
  - Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland
- Oliver Eulenstein
  - Department of Computer Science, Iowa State University, Ames, Iowa, USA
6. Thalén F, Köhne CG, Bleidorn C. Patchwork: Alignment-Based Retrieval and Concatenation of Phylogenetic Markers from Genomic Data. Genome Biol Evol 2023; 15:evad227. [PMID: 38085033 PMCID: PMC10735302 DOI: 10.1093/gbe/evad227] [Accepted: 12/06/2023] [Indexed: 12/23/2023] Open
Abstract
Low-coverage whole-genome sequencing (also known as "genome skimming") is becoming an increasingly affordable approach to large-scale phylogenetic analyses. While already routinely used to recover organellar genomes, genome skimming is rarely used for recovering single-copy nuclear markers. One reason might be that only a few tools exist to work with this data type within a phylogenomic context, especially to deal with fragmented genome assemblies. Here we present a new software tool called Patchwork for mining phylogenetic markers from highly fragmented short-read assemblies as well as directly from sequence reads. Patchwork is an alignment-based tool that utilizes the sequence aligner DIAMOND and is written in the programming language Julia. Homologous regions are obtained via a sequence similarity search, followed by a "hit stitching" phase, in which adjacent or overlapping regions are merged into a single unit. The novel sliding window algorithm trims away any noncoding regions from the resulting sequence. We demonstrate the utility of Patchwork by recovering near-universal single-copy orthologs within a benchmarking study, and we additionally assess the performance of Patchwork in comparison with other programs. We find that Patchwork allows for accurate retrieval of (putatively) single-copy genes from genome skimming data sets at different sequencing depths with high computational speed, outperforming existing software targeting similar tasks. Patchwork is released under the GNU General Public License version 3. Installation instructions, additional documentation, and the source code itself are all available via GitHub at https://github.com/fethalen/Patchwork.
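The "hit stitching" step, in which adjacent or overlapping regions are merged into a single unit, resembles a standard interval merge. The sketch below illustrates that general technique only. It is not Patchwork's code (which is written in Julia), and the function name, the half-open `(start, end)` coordinates, and the `max_gap` parameter are assumptions made for the example.

```python
def stitch_hits(hits, max_gap=0):
    """Merge overlapping or adjacent hits into single regions.

    `hits` is an iterable of half-open (start, end) coordinates on one
    reference sequence. Hits separated by at most `max_gap` positions are
    also merged; with the default of 0, only touching or overlapping hits are.
    """
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1] + max_gap:
            # Extend the previous region instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged
```

Sorting first means each hit only needs to be compared with the most recently built region, so the merge runs in a single pass after the sort.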
Affiliation(s)
- Felix Thalén
  - Department for Animal Evolution and Biodiversity, Georg-August-Universität Göttingen, Göttingen 37073, Germany
  - Cardio-CARE AG, Medizincampus Davos, Davos Wolfgang 7265, Switzerland
- Clara G Köhne
  - Department for Animal Evolution and Biodiversity, Georg-August-Universität Göttingen, Göttingen 37073, Germany
- Christoph Bleidorn
  - Department for Animal Evolution and Biodiversity, Georg-August-Universität Göttingen, Göttingen 37073, Germany
7. Ramanauskas K, Igić B. kakapo: easy extraction and annotation of genes from raw RNA-seq reads. PeerJ 2023; 11:e16456. [PMID: 38034874 PMCID: PMC10688300 DOI: 10.7717/peerj.16456] [Received: 01/20/2023] [Accepted: 10/23/2023] [Indexed: 12/02/2023] Open
Abstract
kakapo (kākāpō) is a Python-based pipeline that allows users to extract and assemble one or more specified genes or gene families. It flexibly uses original RNA-seq read or GenBank SRA accession inputs without performing global assembly of entire transcriptomes or metatranscriptomes. The pipeline identifies open reading frames in the assembled gene transcripts and annotates them. It optionally filters raw reads for ribosomal, plastid, and mitochondrial reads, or reads belonging to non-target organisms (e.g., viral, bacterial, human). kakapo can be employed for targeted assembly, to extract arbitrary loci, such as those commonly used for phylogenetic inference in systematics or candidate genes and gene families in phylogenomic and metagenomic studies. We provide example applications and discuss how its use can offset the declining value of GenBank's single-gene databases and help assemble datasets for a variety of phylogenetic analyses.
Affiliation(s)
- Karolis Ramanauskas
  - Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL, United States of America
- Boris Igić
  - Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL, United States of America
8. Mongiardino Koch N, Tilic E, Miller AK, Stiller J, Rouse GW. Confusion will be my epitaph: genome-scale discordance stifles phylogenetic resolution of Holothuroidea. Proc Biol Sci 2023; 290:20230988. [PMID: 37434530 PMCID: PMC10336381 DOI: 10.1098/rspb.2023.0988] [Received: 05/02/2023] [Accepted: 06/12/2023] [Indexed: 07/13/2023] Open
Abstract
Sea cucumbers (Holothuroidea) are a diverse clade of echinoderms found from intertidal waters to the bottom of the deepest oceanic trenches. Their reduced skeletons and limited number of phylogenetically informative traits have long obfuscated morphological classifications. Sanger-sequenced molecular datasets have also failed to constrain the position of major lineages. Notably, topological uncertainty has hindered a resolution for Neoholothuriida, a highly diverse clade of Permo-Triassic age. We perform the first phylogenomic analysis of Holothuroidea, combining existing datasets with 13 novel transcriptomes. Using a highly curated dataset of 1100 orthologues, our efforts recapitulate previous results, struggling to resolve interrelationships among neoholothuriid clades. Three approaches to phylogenetic reconstruction (concatenation under both site-homogeneous and site-heterogeneous models, and coalescent-aware inference) result in alternative resolutions, all of which are recovered with strong support and across a range of datasets filtered for phylogenetic usefulness. We explore this intriguing result using gene-wise log-likelihood scores and attempt to correlate these with a large set of gene properties. While presenting novel ways of exploring and visualizing support for alternative trees, we are unable to discover significant predictors of topological preference, and our efforts fail to favour one topology. Neoholothuriid genomes seem to retain an amalgam of signals derived from multiple phylogenetic histories.
Affiliation(s)
- Ekin Tilic
  - Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
  - Department of Marine Zoology, Senckenberg Research Institute and Museum, Frankfurt, Germany
- Allison K. Miller
  - Anatomy Department, University of Otago, Dunedin, Otago, New Zealand
- Josefin Stiller
  - Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Greg W. Rouse
  - Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
9. Talavera A, Nie ZL, Ma ZY, Johnson G, Ickert-Bond SM, Zimmer EA, Wen J. Phylogenomic analyses using a new 1013-gene Vitaceae bait-set support major groups of North American Vitis. Mol Phylogenet Evol 2023:107866. [PMID: 37354923 DOI: 10.1016/j.ympev.2023.107866] [Received: 02/10/2023] [Revised: 06/16/2023] [Accepted: 06/17/2023] [Indexed: 06/26/2023]
Abstract
A set of newly designed Vitaceae baits targeting 1013 genes was employed to explore phylogenetic relationships among North American Vitis. Eurasian Vitis taxa including Vitis vinifera were found to be nested within North American Vitis subgenus Vitis. North American Vitis subgenus Vitis can be placed into nine main groups: the Monticola group, the Occidentales group, the Californica group, the Vinifera group (introduced from Eurasia), the Mustangensis group, the Palmata group, the Aestivalis group, the Labrusca group, and the Cinerea group. Strong cytonuclear discordances were detected in North American Vitis, with many species non-monophyletic in the plastid phylogeny but monophyletic in the nuclear phylogeny. The phylogenomic analyses support recognizing four distinct species in the Vitis cinerea complex in North America: V. cinerea, V. baileyana, V. berlandieri, and V. simpsonii. Such treatment will better serve the conservation of wild Vitis diversity in North America. Yet the evolutionary history of Vitis is highly complex, with the concordance analyses indicating conflicting signals across the phylogeny. Cytonuclear discordances and analyses using the Species Networks applying Quartets (SNaQ) method support extensive hybridization in North American Vitis. The results further indicate that plastid genomes alone are insufficient for resolving the evolutionary history of plant groups that have undergone rampant hybridization, as is the case in North American Vitis. Nuclear gene data are essential for species delimitation, identification and reconstructing evolutionary relationships; therefore, they are imperative for plant phylogenomic studies.
Affiliation(s)
- Alicia Talavera
  - Department of Botany, National Museum of Natural History, MRC166, Smithsonian Institution, Washington, DC 20013-7012, USA; Departamento de Botánica y Fisiología Vegetal, Universidad de Málaga, 29071, Málaga, Spain
- Ze-Long Nie
  - Key Laboratory of Plant Resources Conservation and Utilization, College of Biology and Environmental Sciences, Jishou University, Jishou 416000, China
- Zhi-Yao Ma
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518000 China
- Gabriel Johnson
  - Department of Botany, National Museum of Natural History, MRC166, Smithsonian Institution, Washington, DC 20013-7012, USA
- Stefanie M Ickert-Bond
  - UA Museum of the North Herbarium and Department of Biology and Wildlife, University of Alaska Fairbanks, Fairbanks, AK 99775-6960, USA
- Elizabeth A Zimmer
  - Department of Botany, National Museum of Natural History, MRC166, Smithsonian Institution, Washington, DC 20013-7012, USA
- Jun Wen
  - Department of Botany, National Museum of Natural History, MRC166, Smithsonian Institution, Washington, DC 20013-7012, USA
10. Blanco-Gavaldà C, Galbany-Casals M, Susanna A, Andrés-Sánchez S, Bayer RJ, Brochmann C, Cron GV, Bergh NG, Garcia-Jacas N, Gizaw A, Kandziora M, Kolář F, López-Alvarado J, Leliaert F, Letsara R, Moreyra LD, Razafimandimbison SG, Schmickl R, Roquet C. Repeatedly Northwards and Upwards: Southern African Grasslands Fuel the Colonization of the African Sky Islands in Helichrysum (Compositae). Plants (Basel) 2023; 12:2213. [PMID: 37299192 DOI: 10.3390/plants12112213] [Received: 03/27/2023] [Revised: 05/17/2023] [Accepted: 05/30/2023] [Indexed: 06/12/2023]
Abstract
The Afromontane and Afroalpine areas constitute some of the main biodiversity hotspots of Africa. They are particularly rich in plant endemics, but the biogeographic origins and evolutionary processes leading to this outstanding diversity are poorly understood. We performed phylogenomic and biogeographic analyses of one of the most species-rich plant genera in these mountains, Helichrysum (Compositae-Gnaphalieae). Most previous studies have focused on Afroalpine elements of Eurasian origin, and the southern African origin of Helichrysum provides an interesting counterexample. We obtained a comprehensive nuclear dataset from 304 species (≈50% of the genus) using target-enrichment with the Compositae1061 probe set. Summary-coalescent and concatenation approaches combined with paralog recovery yielded congruent, well-resolved phylogenies. Ancestral range estimations revealed that Helichrysum originated in arid southern Africa, whereas the southern African grasslands were the source of most lineages that dispersed within and outside Africa. Colonization of the tropical Afromontane and Afroalpine areas occurred repeatedly throughout the Miocene-Pliocene. This timing coincides with mountain uplift and the onset of glacial cycles, which together may have facilitated both speciation and intermountain gene flow, contributing to the evolution of the Afroalpine flora.
Affiliation(s)
- Carme Blanco-Gavaldà
  - Systematics and Evolution of Vascular Plants-Associated Unit to CSIC by IBB, Department of Animal Biology, Plant Biology and Ecology, Faculty of Biosciences, Autonomous University of Barcelona, ES-08193 Bellaterra, Spain
- Mercè Galbany-Casals
  - Systematics and Evolution of Vascular Plants-Associated Unit to CSIC by IBB, Department of Animal Biology, Plant Biology and Ecology, Faculty of Biosciences, Autonomous University of Barcelona, ES-08193 Bellaterra, Spain
- Alfonso Susanna
  - Botanic Institute of Barcelona (IBB), CSIC-Ajuntament de Barcelona, Pg. Migdia s/n, ES-08038 Barcelona, Spain
- Santiago Andrés-Sánchez
  - Department of Botany and Plant Physiology and Plant DNA Biobank, DNA National Bank, University of Salamanca, Edificio I+D+i, Espejo St., ES-37007 Salamanca, Spain
- Randall J Bayer
  - Department of Biological Sciences, Center for Biodiversity, University of Memphis, Memphis, TN 38152, USA
- Christian Brochmann
  - Natural History Museum, University of Oslo, P.O. Box 1172, NO-0318 Oslo, Norway
- Glynis V Cron
  - School of Animal, Plant and Environmental Sciences, University of the Witwatersrand, Private Bag 3, Johannesburg 2050, South Africa
- Nicola G Bergh
  - Foundational Biodiversity Science, Kirstenbosch Research Centre, South African National Biodiversity Institute, Private Bag X7, Newlands, Cape Town 7735, South Africa
- Núria Garcia-Jacas
  - Botanic Institute of Barcelona (IBB), CSIC-Ajuntament de Barcelona, Pg. Migdia s/n, ES-08038 Barcelona, Spain
- Abel Gizaw
  - Natural History Museum, University of Oslo, P.O. Box 1172, NO-0318 Oslo, Norway
  - Department of Plant Biology and Biodiversity Management, Addis Ababa University, Addis Ababa P.O. Box 3434, Ethiopia
- Martha Kandziora
  - Department of Botany, Faculty of Science, Charles University in Prague, Benátská 2, CZ-12801 Prague, Czech Republic
- Filip Kolář
  - Department of Botany, Faculty of Science, Charles University in Prague, Benátská 2, CZ-12801 Prague, Czech Republic
  - Institute of Botany, Academy of Sciences of the Czech Republic, CZ-25243 Průhonice, Czech Republic
- Javier López-Alvarado
  - Systematics and Evolution of Vascular Plants-Associated Unit to CSIC by IBB, Department of Animal Biology, Plant Biology and Ecology, Faculty of Biosciences, Autonomous University of Barcelona, ES-08193 Bellaterra, Spain
- Rokiman Letsara
  - Herbarium of the Parc Botanique et Zoologique of Tsimbazaza (PBZT), Antananarivo 3G9G+V6C, Madagascar
- Lucía D Moreyra
  - Botanic Institute of Barcelona (IBB), CSIC-Ajuntament de Barcelona, Pg. Migdia s/n, ES-08038 Barcelona, Spain
- Roswitha Schmickl
  - Department of Botany, Faculty of Science, Charles University in Prague, Benátská 2, CZ-12801 Prague, Czech Republic
  - Institute of Botany, Academy of Sciences of the Czech Republic, CZ-25243 Průhonice, Czech Republic
- Cristina Roquet
  - Systematics and Evolution of Vascular Plants-Associated Unit to CSIC by IBB, Department of Animal Biology, Plant Biology and Ecology, Faculty of Biosciences, Autonomous University of Barcelona, ES-08193 Bellaterra, Spain
11
Fleming JF, Valero‐Gracia A, Struck TH. Identifying and addressing methodological incongruence in phylogenomics: A review. Evol Appl 2023; 16:1087-1104. [PMID: 37360032 PMCID: PMC10286231 DOI: 10.1111/eva.13565] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/05/2022] [Revised: 04/07/2023] [Accepted: 05/17/2023] [Indexed: 06/28/2023] Open
Abstract
The availability of phylogenetic data has greatly expanded in recent years. As a result, a new era in phylogenetic analysis is dawning: one in which the methods we use to analyse and assess our data are the bottleneck to producing valuable phylogenetic hypotheses, rather than the need to acquire more data. This makes the ability to accurately appraise and evaluate new methods of phylogenetic analysis and phylogenetic artefact identification more important than ever. Incongruence in phylogenetic reconstructions based on different datasets may be due to two major sources: biological and methodological. Biological sources comprise processes like horizontal gene transfer, hybridization and incomplete lineage sorting, while methodological ones contain falsely assigned data or violations of the assumptions of the underlying model. While the former provides interesting insights into the evolutionary history of the investigated groups, the latter should be avoided or minimized as best as possible. However, errors introduced by methodology must first be excluded or minimized to be able to conclude that biological sources are the cause. Fortunately, a variety of useful tools exist to help detect such misassignments and model violations and to apply ameliorating measurements. Still, the number of methods and their theoretical underpinning can be overwhelming and opaque. Here, we present a practical and comprehensive review of recent developments in techniques to detect artefacts arising from model violations and poorly assigned data. The advantages and disadvantages of the different methods to detect such misleading signals in phylogenetic reconstructions are also discussed. As there is no one-size-fits-all solution, this review can serve as a guide in choosing the most appropriate detection methods depending on both the actual dataset and the computational power available to the researcher. Ultimately, this informed selection will have a positive impact on the broader field, allowing us to better understand the evolutionary history of the group of interest.
12
Fleming JF, Struck TH. nRCFV: a new, dataset-size-independent metric to quantify compositional heterogeneity in nucleotide and amino acid datasets. BMC Bioinformatics 2023; 24:145. [PMID: 37046225 PMCID: PMC10099917 DOI: 10.1186/s12859-023-05270-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 12/07/2022] [Accepted: 04/04/2023] [Indexed: 04/14/2023] Open
Abstract
MOTIVATION: Compositional heterogeneity (when the proportions of nucleotides and amino acids are not broadly similar across the dataset) is a cause of a great number of phylogenetic artefacts. Whilst a variety of methods can identify it post-hoc, few metrics exist to quantify compositional heterogeneity prior to the computationally intensive task of phylogenetic tree reconstruction. Here we assess the efficacy of one such existing, widely used, metric: Relative Composition Frequency Variability (RCFV), using both real and simulated data. RESULTS: Our results show that RCFV can be biased by sequence length, the number of taxa, and the number of possible character states within the dataset. However, we also find that missing data does not appear to have an appreciable effect on RCFV. We discuss the theory behind this, the consequences of this for the future of the usage of the RCFV value and propose a new metric, nRCFV, which accounts for these biases. Alongside this, we present a new software that calculates both RCFV and nRCFV, called nRCFV_Reader. AVAILABILITY AND IMPLEMENTATION: nRCFV has been implemented in RCFV_Reader, available at: https://github.com/JFFleming/RCFV_Reader . Both our simulation and real data are available at Datadryad: https://doi.org/10.5061/dryad.wpzgmsbpn .
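As a rough illustration of the RCFV metric this abstract evaluates, the sketch below computes it in its commonly cited form: for every taxon, the absolute deviation of each character-state frequency from the across-taxon mean is summed, and the total is divided by the number of taxa. The function name `rcfv`, the dictionary input, and the choice to ignore gaps and ambiguity codes are assumptions made for illustration. This is not the RCFV_Reader implementation, and it does not apply the nRCFV normalization proposed in the paper.

```python
from collections import Counter


def rcfv(sequences, alphabet="ACGT"):
    """Relative Composition Frequency Variability for aligned sequences.

    sequences: dict mapping taxon name -> sequence string (hypothetical input
    format). Characters outside `alphabet` (gaps, ambiguity codes) are ignored,
    which is an assumption of this sketch.
    """
    n_taxa = len(sequences)
    if n_taxa == 0:
        raise ValueError("at least one sequence is required")

    # Per-taxon frequency of each character state.
    freqs = []
    for name, seq in sequences.items():
        counts = Counter(ch for ch in seq.upper() if ch in alphabet)
        total = sum(counts.values())
        if total == 0:
            raise ValueError(f"no countable characters in {name!r}")
        freqs.append({ch: counts[ch] / total for ch in alphabet})

    # Mean frequency of each character state across all taxa.
    mean = {ch: sum(f[ch] for f in freqs) / n_taxa for ch in alphabet}

    # Sum of absolute deviations from the mean, divided by the number of taxa.
    deviation = sum(abs(f[ch] - mean[ch]) for f in freqs for ch in alphabet)
    return deviation / n_taxa


if __name__ == "__main__":
    toy = {"taxon_a": "ACGTACGT", "taxon_b": "AAAATTTT", "taxon_c": "GGGGCCCC"}
    print(f"RCFV = {rcfv(toy):.4f}")
```

Passing the 20 amino acid letters as `alphabet` would cover protein alignments. A value of 0 means every taxon has the same composition; larger values indicate more heterogeneity.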
Affiliation(s)
- James F Fleming
  - University of Oslo Natural History Museum, Sars' Gata 1, Oslo, Norway
- Torsten H Struck
  - University of Oslo Natural History Museum, Sars' Gata 1, Oslo, Norway
13
Fleming JF. The wealth of shared resources: Improving molecular taxonomy using eDNA and public databases. Zool Scr 2023. [DOI: 10.1111/zsc.12591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/24/2023]