1
|
Pan D, Sun Y, Shi B, Wang R, Ng PKL, Guinot D, Cumberlidge N, Sun H. Phylogenomic analysis of brachyuran crabs using transcriptome data reveals possible sources of conflicting phylogenetic relationships within the group. Mol Phylogenet Evol 2024; 201:108201. [PMID: 39278384 DOI: 10.1016/j.ympev.2024.108201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 08/21/2024] [Accepted: 09/11/2024] [Indexed: 09/18/2024]
Abstract
Despite extensive morphological and molecular studies, the phylogenetic interrelationships within the infraorder Brachyura and the phylogenetic positions of many taxa remain uncertain. Studies that used a limited number of molecular markers have often failed to provide sufficient resolution, and may be susceptible to stochastic errors and incomplete lineage sorting (ILS). Here we reconstructed the phylogenetic relationships within the Brachyura using transcriptome data of 56 brachyuran species, including 14 newly sequenced taxa. Five supermatrices were constructed in order to exclude different sources of systematic error. The results of the phylogenetic analyses indicate that Heterotremata is non-monophyletic, and that the two Old World primary freshwater crabs (Potamidae and Gecarcinucidae) and the Hymenosomatoidea form a clade that is sister to the Thoracotremata, and outside the Heterotremata. We also found that ILS is the main cause of the gene-tree discordance of these freshwater crabs. Divergence time estimations indicate that the Brachyura has an ancient origin, probably either in the Triassic or Jurassic, and that the majority of extant families and superfamilies first appeared during the Cretaceous, with a constant increase of diversity in Post-Cretaceous-Palaeogene times. The results support the hypothesis that the two Old World freshwater crab families included in this study (Potamidae and Gecarcinucidae) diverged from their marine ancestors around 120 Ma, in the Cretaceous. In addition, this work provides new insights that may aid in the reclassification of some of the more problematic brachyuran groups.
Collapse
Affiliation(s)
- Da Pan
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Road, Nanjing 210023, PR China.
| | - Yunlong Sun
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Road, Nanjing 210023, PR China
| | - Boyang Shi
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Road, Nanjing 210023, PR China
| | - Ruxiao Wang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Road, Nanjing 210023, PR China
| | - Peter K L Ng
- Lee Kong Chian Natural History Museum, National University of Singapore, 2 Conservatory Drive, Singapore 117377, Singapore
| | - Danièle Guinot
- Muséum National d'Histoire Naturelle, CNRS, Sorbonne Université, EPHE, Institut de Systématique, Évolution, Biodiversité (ISYEB), Case Postale 53, 57 rue Cuvier, F-75231 Paris cedex 05, France
| | - Neil Cumberlidge
- Department of Biology, Northern Michigan University, Marquette, MI 49855-5376, USA
| | - Hongying Sun
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Road, Nanjing 210023, PR China.
| |
Collapse
|
2
|
Ferranti DA, Delwiche CF. Investigating the evolution of green algae with a large transcriptomic data set. JOURNAL OF PHYCOLOGY 2024; 60:1406-1419. [PMID: 39404089 DOI: 10.1111/jpy.13509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 08/29/2024] [Accepted: 09/04/2024] [Indexed: 12/28/2024]
Abstract
The colonization of land by plants approximately 450-500 million years ago (Mya) is one of the most important events in the history of life on Earth. Land plants, hereafter referred to as "embryophytes," comprise the foundation of every terrestrial biome, making them an essential lineage for the origin and maintenance of biodiversity. The embryophytes form a monophyletic clade within one of the two major phyla of the green algae (Viridiplantae), the Streptophyta. Estimates from fossil data and molecular clock analyses suggest the Streptophyte algae (Charophytes) diverged from the other main phylum of green algae, the Chlorophyta, as much as 1500 Mya. Here we present a phylogenetic analysis using transcriptomic and genomic data of 62 green algae and embryophyte operational taxonomic units, 31 of which were assembled de novo for this project. We have focused on identifying the charophyte lineage that is sister to embryophytes, and show that the Zygnematophyceae have the strongest support, followed by the Charophyceae. Furthermore, we have examined amino acid and codon usage across the tree and determined these data broadly follow the phylogenetic tree. We concluded by searching the data set for protein domains and gene families known to be important in embryophytes. Many of these domains and genes have homologous sequences in the charophyte lineages, giving insight into the processes that underlay the colonization of the land by plants. This provides new insights into green algal diversification, identifies previously unknown attributes of genome evolution within the group, and shows how functional mechanisms have evolved over time.
Collapse
Affiliation(s)
- David A Ferranti
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
| | - Charles F Delwiche
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
| |
Collapse
|
3
|
Zhao J, Huang CJ, Jiang LJ, He ZR, Yang S, Zhu ZM, Zhang L, Yu H, Zhou XM, Wang JG. Phylogenomic analyses of the pantropical Platycerium Desv. (Platycerioideae) reveal their complex evolution and historical biogeography. Mol Phylogenet Evol 2024; 201:108213. [PMID: 39393764 DOI: 10.1016/j.ympev.2024.108213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 09/28/2024] [Accepted: 10/03/2024] [Indexed: 10/13/2024]
Abstract
Platycerium is a genus of pantropical epiphytic ferns consisting of ca. 18 species and are highly sought after by horticultural enthusiasts. Although the monophyly of this genus has been well supported in previous molecular studies, as an intercontinentally disjunct genus, the origin and distribution pattern of Platycerium were elusive and controversial. This is mainly due to limited taxon sampling, a plastid representing only a single coalescent history, the lack of fossil evidence, and so on. Here, by utilizing genome-skimming sequencing, transcriptome sequencing, and flow cytometry, we integrated chloroplast genomes, data of single-copy nuclear genes, ploidy levels, morphology, and geographic distribution to understand the species phylogeny and the evolutionary and biogeographic history of Platycerium. Our major results include: (1) based on both plastid and nuclear datasets, Platycerium is consistently resolved into three fully supported clades: the Afro-American (AA) clade, the Javan-Australian (JA) clade, and the Malayan-Asian (MA) clade. The AA clade and MA clade are further divided into three and two subclades, respectively; (2) a large amount of gene tree conflict, as well as cytonuclear discordance, was found and can be explained by hybridization and incomplete lineage sorting, and most of the hybridization hypotheses represented ancient hybridization events; (3) through molecular dating, the crown age of Platycerium is determined to be at approximately 32.79 Ma based on the plastid dataset or 29.08 Ma based on the nuclear dataset in the Middle Oligocene; (4) ancestral area reconstruction analysis from different datasets showed that Platycerium most likely originated from Indochina; (5) current distribution patterns are resultant from long-distance dispersals, ancient orogeny, and an ancient climate event; and (6) species diversification was driven by polyploidization, dispersal, and hybridization. This study presented here will help understand the evolution of tropical plant flora and provide a reference for the cultivation and breeding of staghorn ferns.
Collapse
Affiliation(s)
- Jing Zhao
- School of Ecology and Environmental Science, Yunnan University, Kunming 650504, Yunnan, China
| | - Chuan-Jie Huang
- School of Ecology and Environmental Science, Yunnan University, Kunming 650504, Yunnan, China
| | - Li-Ju Jiang
- Gardening and Horticulture Center, Xishuangbanna Tropic Botanical Garden, Chinese Academy of Sciences, Mengla 666303, Yunnan, China
| | - Zhao-Rong He
- School of Life Sciences, Yunnan University, East Outer Ring Road, Chenggong District, Kunming 650500, Yunnan, China
| | - Shuai Yang
- Plant Fairyland, Boda Road, Chenggong District, Kunming 650503, Yunnan, China
| | - Zhang-Ming Zhu
- School of Ecology and Environmental Science, Yunnan University, Kunming 650504, Yunnan, China
| | - Liang Zhang
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, Yunnan, China
| | - Hong Yu
- School of Ecology and Environmental Science, Yunnan University, Kunming 650504, Yunnan, China.
| | - Xin-Mao Zhou
- School of Ecology and Environmental Science, Yunnan University, Kunming 650504, Yunnan, China.
| | - Jia-Guan Wang
- School of Ecology and Environmental Science, Yunnan University, Kunming 650504, Yunnan, China.
| |
Collapse
|
4
|
Bjornson S, Verbruggen H, Upham NS, Steenwyk JL. Reticulate evolution: Detection and utility in the phylogenomics era. Mol Phylogenet Evol 2024; 201:108197. [PMID: 39270765 DOI: 10.1016/j.ympev.2024.108197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2024] [Revised: 08/13/2024] [Accepted: 09/08/2024] [Indexed: 09/15/2024]
Abstract
Phylogenomics has enriched our understanding that the Tree of Life can have network-like or reticulate structures among some taxa and genes. Two non-vertical modes of evolution - hybridization/introgression and horizontal gene transfer - deviate from a strictly bifurcating tree model, causing non-treelike patterns. However, these reticulate processes can produce similar patterns to incomplete lineage sorting or recombination, potentially leading to ambiguity. Here, we present a brief overview of a phylogenomic workflow for inferring organismal histories and compare methods for distinguishing modes of reticulate evolution. We discuss how the timing of coalescent events can help disentangle introgression from incomplete lineage sorting and how horizontal gene transfer events can help determine the relative timing of speciation events. In doing so, we identify pitfalls of certain methods and discuss how to extend their utility across the Tree of Life. Workflows, methods, and future directions discussed herein underscore the need to embrace reticulate evolutionary patterns for understanding the timing and rates of evolutionary events, providing a clearer view of life's history.
Collapse
Affiliation(s)
- Saelin Bjornson
- School of BioSciences, University of Melbourne, Victoria, Australia
| | - Heroen Verbruggen
- School of BioSciences, University of Melbourne, Victoria, Australia; CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, 4485-661 Vairão, Portugal
| | - Nathan S Upham
- School of Life Sciences, Arizona State University, Tempe, AZ, USA.
| | - Jacob L Steenwyk
- Howards Hughes Medical Institute and the Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
5
|
Shazib SUA, Ahsan R, Leleu M, McManus GB, Katz LA, Santoferrara LF. Phylogenomic workflow for uncultivable microbial eukaryotes using single-cell RNA sequencing - A case study with planktonic ciliates (Ciliophora, Oligotrichea). Mol Phylogenet Evol 2024:108239. [PMID: 39551225 DOI: 10.1016/j.ympev.2024.108239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2024] [Revised: 10/25/2024] [Accepted: 11/10/2024] [Indexed: 11/19/2024]
Abstract
Phylogenetic analyses increasingly rely on genomic and transcriptomic data to produce better supported inferences on the evolutionary relationships among microbial eukaryotes. Such phylogenomic analyses, however, require robust workflows, bioinformatic expertise and computational power. Microbial eukaryotes pose additional challenges given the complexity of their genomes and the presence of non-target sequences (e.g., symbionts, prey) in data obtained from single cells of uncultivable lineages. To address these challenges, we developed a phylogenomic workflow based on single-cell RNA sequencing, integrating all essential steps from cell isolation to data curation and species tree inference. We assessed our workflow by using publicly available and newly generated transcriptomes (11 and 28, respectively) from the Oligotrichea, a diverse group of marine planktonic ciliates. This group's phylogenetic relationships have been relatively well-studied based on ribosomal RNA gene markers, which we reconstructed by read mapping of transcriptome sequences and compared to our phylogenomic inferences. We also compared phylogenomic analyses based on single-copy protein-coding genes (well-curated orthologs) and multi-copy genes (including paralogs) by sequence concatenation and a coalescence approach (Asteroid), respectively. Finally, using subsets of up to 1,014 gene families (GFs), we assessed the influence of missing data in our phylogenomic inferences. All our analyses yielded similar results, and most inferred relationships were consistent and well-supported. Overall, we found that Asteroid provides robust support for species tree inferences, while simplifying curation steps, minimizing the effects of missing data and maximizing the number of GFs represented in the analyses. Our workflow can be adapted for phylogenomic analyses based on single-cell RNA sequencing of other uncultivable microbial eukaryotes.
Collapse
Affiliation(s)
- Shahed U A Shazib
- Department of Biological Sciences, Smith College, Northampton, MA, USA
| | - Ragib Ahsan
- Department of Biological Sciences, Smith College, Northampton, MA, USA; University of Massachusetts Amherst, Program in Organismic and Evolutionary Biology, Amherst, MA, USA
| | - Marie Leleu
- Department of Biological Sciences, Smith College, Northampton, MA, USA
| | - George B McManus
- Department of Marine Sciences, University of Connecticut, Groton, CT, USA
| | - Laura A Katz
- Department of Biological Sciences, Smith College, Northampton, MA, USA; University of Massachusetts Amherst, Program in Organismic and Evolutionary Biology, Amherst, MA, USA.
| | | |
Collapse
|
6
|
Su ZH, Sasaki A, Minami H, Ozaki K. Arthropod Phylotranscriptomics With a Special Focus on the Basal Phylogeny of the Myriapoda. Genome Biol Evol 2024; 16:evae189. [PMID: 39219333 PMCID: PMC11436689 DOI: 10.1093/gbe/evae189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 08/08/2024] [Accepted: 08/22/2024] [Indexed: 09/04/2024] Open
Abstract
Arthropoda represents the most diverse animal phylum, but clarifying the phylogenetic relationships among arthropod taxa remains challenging given the numerous arthropod lineages that diverged over a short period of time. In order to resolve the most controversial aspects of deep arthropod phylogeny, focusing on the Myriapoda, we conducted phylogenetic analyses based on ten super-matrices comprised of 751 to 1,233 orthologous genes across 64 representative arthropod species, including 28 transcriptomes that were newly generated in this study. Our findings provide unambiguous support for the monophyly of the higher arthropod taxa, Chelicerata, Mandibulata, Myriapoda, Pancrustacea, and Hexapoda, while the Crustacea are paraphyletic, with the class Remipedia supported as the lineage most closely related to hexapods. Within the Hexapoda, our results largely affirm previously proposed phylogenetic relationships among deep hexapod lineages, except that the Paraneoptera (Hemiptera, Thysanoptera, and Psocodea) was recovered as a monophyletic lineage in some analyses. The results corroborated the recently proposed phylogenetic framework of the four myriapod classes, wherein Symphyla and Pauropoda, as well as Chilopoda and Diplopoda, are each proposed to be sister taxa. The findings provide important insights into understanding the phylogeny and evolution of arthropods.
Collapse
Affiliation(s)
- Zhi-Hui Su
- JT Biohistory Research Hall, Takatsuki, Osaka 569-1125, Japan
- Department of Biological Sciences, Graduate School of Science, Osaka University, Toyonaka, Osaka 560-0043, Japan
| | - Ayako Sasaki
- JT Biohistory Research Hall, Takatsuki, Osaka 569-1125, Japan
| | - Hiroaki Minami
- JT Biohistory Research Hall, Takatsuki, Osaka 569-1125, Japan
- Department of Biological Sciences, Graduate School of Science, Osaka University, Toyonaka, Osaka 560-0043, Japan
| | - Katsuhisa Ozaki
- JT Biohistory Research Hall, Takatsuki, Osaka 569-1125, Japan
| |
Collapse
|
7
|
Christodoulides N, Urgiles VL, Guayasamin JM, Savage AE. Selection and Gene Duplication Associated With High-Elevation Diversification in Pristimantis, the Largest Terrestrial Vertebrate Genus. Genome Biol Evol 2024; 16:evae167. [PMID: 39109890 PMCID: PMC11342244 DOI: 10.1093/gbe/evae167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/24/2024] [Indexed: 08/24/2024] Open
Abstract
The genus Pristimantis diversified in the tropical Andes mountains and is the most speciose genus of terrestrial vertebrates. Pristimantis are notable among frogs in that they thrive at high elevations (>2,000 m) and are direct developers without a tadpole stage. Despite their ecological significance, little is known about the genetic and physiological traits enabling their success. We conducted transcriptomic analysis on seven Pristimantis species sampled across elevations in the Ecuadorean Andes to explore three hypotheses for their success: (i) unique genes are under selection relative to all other frogs, (ii) common selection occurs across all direct developers, or (iii) common selection occurs across all high-elevation frog clades. Comparative analysis with 34 frog species revealed unique positive selection in Pristimantis genes related to aerobic respiration, hemostasis, signaling, cellular transportation of proteins and ions, and immunity. Additionally, we detected positive selection across all direct developers for genes associated with oxygenase activity and metal ion binding. While many genes under selection in Pristimantis were not positively selected in other high-elevation frog species, we identified some shared genes and pathways linked to lipid metabolism, innate immunity, and cellular redox processes. We observed more positive selection in duplicated- versus single-copy genes, while relaxed purifying selection was prevalent in single-copy genes. Notably, copy number of an innate immunity complement gene was positively correlated with Pristimantis species elevation. Our findings contribute novel insights into the genetic basis of adaptation in Pristimantis and provide a foundation for future studies on the evolutionary mechanisms leading to direct development and coping with high elevations.
Collapse
Affiliation(s)
| | - Veronica L Urgiles
- Department of Biology, University of Central Florida, Orlando, FL, USA
- Departamento de herpetologia, Instituto Nacional de Biodiversidad del Ecuador, Quito, Ecuador
| | - Juan M Guayasamin
- Colegio de Ciencias Biológicas y Ambientales COCIBA, Instituto Biósfera, Laboratorio de Biología Evolutiva, Universidad San Francisco de Quito USFQ, Quito, Ecuador
- Ingeniería en Biodiversidad y Recursos Genéticos, Centro de Biodiversidad y Cambio Climático BioCamb, Universidad Tecnológica Indoamérica, Quito, Ecuador
| | - Anna E Savage
- Department of Biology, University of Central Florida, Orlando, FL, USA
| |
Collapse
|
8
|
Chen Q, Deng M, Dai X, Wang W, Wang X, Chen LS, Huang GH. Phylogenomic data exploration with increased sampling provides new insights into the higher-level relationships of butterflies and moths (Lepidoptera). Mol Phylogenet Evol 2024; 197:108113. [PMID: 38796071 DOI: 10.1016/j.ympev.2024.108113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 05/13/2024] [Accepted: 05/22/2024] [Indexed: 05/28/2024]
Abstract
A robust and stable phylogenetic framework is a fundamental goal of evolutionary biology. As the third largest insect order in the world following Coleoptera and Diptera, Lepidoptera (butterflies and moths) play a central role in almost every terrestrial ecosystem as indicators of environmental change and serve as important models for biologists exploring questions related to ecology and evolutionary biology. However, for such a charismatic insect group, the higher-level phylogenetic relationships among its superfamilies are still poorly resolved. Compared to earlier phylogenomic studies, we increased taxon sampling among Lepidoptera (37 superfamilies and 68 families containing 263 taxa) and acquired a series of large amino-acid datasets from 69,680 to 400,330 for phylogenomic reconstructions. Using these datasets, we explored the effect of different taxon sampling with significant increases in the number of included genes on tree topology by considering a series of systematic errors using maximum-likelihood (ML) and Bayesian inference (BI) methods. Moreover, we also tested the effectiveness in topology robustness among the three ML-based models. The results showed that taxon sampling is an important determinant in tree robustness of accurate lepidopteran phylogenetic estimation. Long-branch attraction (LBA) caused by site-wise heterogeneity is a significant source of bias giving rise to unstable positions of ditrysian groups in phylogenomic reconstruction. Phylogenetic inference showed the most comprehensive framework to reveal the relationships among lepidopteran superfamilies, and presented some newly relationships with strong supports (Papilionoidea was sister to Gelechioidea and Immoidea was sister to Galacticoidea, respectively), but limited by taxon sampling, the relationships within the species-rich and relatively rapid radiation Ditrysia and especially Apoditrysia remain poorly resolved, which need to increase taxon sampling for further phylogenomic reconstruction. The present study demonstrates that taxon sampling is an important determinant for an accurate lepidopteran tree of life and provides some essential insights for future lepidopteran phylogenomic studies.
Collapse
Affiliation(s)
- Qi Chen
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China; Tropical Biodiversity and Bioresource Utilization Laboratory, College of Science, Qiongtai Normal University, Haikou 571127, Hainan, China
| | - Min Deng
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China; Qiannan Polytechnic for Nationality, Duyun 558022, Guizhou, China
| | - Xuan Dai
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China
| | - Wei Wang
- Research Center for Wild Animal and Plant Resource Protection and Utilization, Qiongtai Normal University, Haikou 571127, Hainan, China
| | - Xing Wang
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China; Tropical Biodiversity and Bioresource Utilization Laboratory, College of Science, Qiongtai Normal University, Haikou 571127, Hainan, China.
| | - Liu-Sheng Chen
- Guangdong Provincial Key Laboratory of Silviculture, Protection and Utilization, Guangdong Academy of Forestry, Guangzhou 510520, Guangdong, China.
| | - Guo-Hua Huang
- Yuelushan Laboratory, College of Plant Protection, Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests, Hunan Agricultural University, Changsha 410128, Hunan, China.
| |
Collapse
|
9
|
Huang YH, Sun YF, Li H, Li HS, Pang H. PhyloAln: A Convenient Reference-Based Tool to Align Sequences and High-Throughput Reads for Phylogeny and Evolution in the Omic Era. Mol Biol Evol 2024; 41:msae150. [PMID: 39041199 PMCID: PMC11287380 DOI: 10.1093/molbev/msae150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 05/15/2024] [Accepted: 07/16/2024] [Indexed: 07/24/2024] Open
Abstract
The current trend in phylogenetic and evolutionary analyses predominantly relies on omic data. However, prior to core analyses, traditional methods typically involve intricate and time-consuming procedures, including assembly from high-throughput reads, decontamination, gene prediction, homology search, orthology assignment, multiple sequence alignment, and matrix trimming. Such processes significantly impede the efficiency of research when dealing with extensive data sets. In this study, we develop PhyloAln, a convenient reference-based tool capable of directly aligning high-throughput reads or complete sequences with existing alignments as a reference for phylogenetic and evolutionary analyses. Through testing with simulated data sets of species spanning the tree of life, PhyloAln demonstrates consistently robust performance compared with other reference-based tools across different data types, sequencing technologies, coverages, and species, with percent completeness and identity at least 50 percentage points higher in the alignments. Additionally, we validate the efficacy of PhyloAln in removing a minimum of 90% foreign and 70% cross-contamination issues, which are prevalent in sequencing data but often overlooked by other tools. Moreover, we showcase the broad applicability of PhyloAln by generating alignments (completeness mostly larger than 80%, identity larger than 90%) and reconstructing robust phylogenies using real data sets of transcriptomes of ladybird beetles, plastid genes of peppers, or ultraconserved elements of turtles. With these advantages, PhyloAln is expected to facilitate phylogenetic and evolutionary analyses in the omic era. The tool is accessible at https://github.com/huangyh45/PhyloAln.
Collapse
Affiliation(s)
- Yu-Hao Huang
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Yi-Fei Sun
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Hao Li
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Hao-Sen Li
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Hong Pang
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| |
Collapse
|
10
|
Kokkonen AL, Searle PC, Shiozawa DK, Evans RP. Using de novo transcriptomes to decipher the relationships in cutthroat trout subspecies ( Oncorhynchus clarkii). Evol Appl 2024; 17:e13735. [PMID: 39006004 PMCID: PMC11239772 DOI: 10.1111/eva.13735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 05/18/2024] [Accepted: 05/27/2024] [Indexed: 07/16/2024] Open
Abstract
For almost 200 years, the taxonomy of cutthroat trout (Oncorhynchus clarkii), a salmonid native to Western North America, has been in flux as ichthyologists and fisheries biologists have tried to describe the diversity within these fishes. Starting in the 1950s, Robert Behnke reexamined the cutthroat trout and identified 14 subspecies based on morphological traits, Pleistocene events, and modern geographic ranges. His designations became instrumental in recognizing and preserving the remaining diversity of cutthroat trout. Over time, molecular techniques (i.e. karyotypes, allozymes, mitochondrial DNA, SNPs, and microsatellite arrays) have largely reinforced Behnke's phylogenies, but have also revealed that some relationships are consistently weakly supported. To further resolve these relationships, we generated de novo transcriptomes for nine cutthroat subspecies, as well as a Bear River Bonneville form and two Colorado River lineages (blue and green). We present phylogenies of these subspecies generated from multiple sets of orthologous genes extracted from our transcriptomes. We confirm many of the relationships identified in previous morphological and molecular studies, as well as discuss the importance of significant differences apparent in our phylogenies from these studies within a geological perspective. Specific findings include three distinct clades: (1) Bear River Bonneville form and Yellowstone cutthroat trout; (2) Bonneville cutthroat trout (n = 2); and (3) Greenback and Rio Grande cutthroat trout. We also identify potential gene transfer between Bonneville cutthroat trout and a population of Colorado River green lineage cutthroat trout. Using these findings, it appears that additional groups warrant species-level consideration if other recent species elevations are retained.
Collapse
Affiliation(s)
- Andrea L. Kokkonen
- Department of Microbiology and Molecular BiologyBrigham Young UniversityProvoUtahUSA
| | - Peter C. Searle
- Department of Ecology and Evolutionary BiologyCornell UniversityIthacaNew YorkUSA
| | | | - R. Paul Evans
- Department of Microbiology and Molecular BiologyBrigham Young UniversityProvoUtahUSA
| |
Collapse
|
11
|
Zhu H, Lei W, Lai Q, Sun Y, Ru D. Comparative analysis shows high level of lineage sorting in genomic regions with low recombination in the extended Picea likiangensis species complex. PLANT DIVERSITY 2024; 46:547-550. [PMID: 39280968 PMCID: PMC11390601 DOI: 10.1016/j.pld.2024.04.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 03/28/2024] [Accepted: 04/08/2024] [Indexed: 09/18/2024]
Abstract
•Phylogenomic analysis uncovers widespread discordance in the extended Picea likiangensis complex.•Introgression (54.99%) and incomplete lineage sorting (ILS; 33.12%) are key drivers of this incongruity.•Recombination rates shape ILS and introgression, with high rates correlating with elevated levels.•Genes linked to abiotic stress responses exhibit significant introgression and ILS, suggesting adaptive evolution.•Lower recombination rates improve accuracy in species relationships.
Collapse
Affiliation(s)
- Hui Zhu
- State Key Laboratory of Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, China
| | - Weixiao Lei
- Xi'an Center for Disease Control and Prevention, Xi'an 710068, China
| | - Qing Lai
- State Key Laboratory of Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, China
| | - Yongshuai Sun
- CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla 666303, China
| | - Dafu Ru
- State Key Laboratory of Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, China
| |
Collapse
|
12
|
Ning W, Meudt HM, Tate JA. A roadmap of phylogenomic methods for studying polyploid plant genera. APPLICATIONS IN PLANT SCIENCES 2024; 12:e11580. [PMID: 39184196 PMCID: PMC11342234 DOI: 10.1002/aps3.11580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/10/2023] [Accepted: 01/13/2024] [Indexed: 08/27/2024]
Abstract
Phylogenetic inference of polyploid species is the first step towards understanding their patterns of diversification. In this paper, we review the challenges and limitations of inferring species relationships of polyploid plants using traditional phylogenetic sequencing approaches, as well as the mischaracterization of the species tree from single or multiple gene trees. We provide a roadmap to infer interspecific relationships among polyploid lineages by comparing and evaluating the application of current phylogenetic, phylogenomic, transcriptomic, and whole-genome approaches using different sequencing platforms. For polyploid species tree reconstruction, we assess the following criteria: (1) the amount of prior information or tools required to capture the genetic region(s) of interest; (2) the probability of recovering homeologs for polyploid species; and (3) the time efficiency of downstream data analysis. Moreover, we discuss bioinformatic pipelines that can reconstruct networks of polyploid species relationships. In summary, although current phylogenomic approaches have improved our understanding of reticulate species relationships in polyploid-rich genera, the difficulties of recovering reliable orthologous genes and sorting all homeologous copies for allopolyploids remain a challenge. In the future, assembled long-read sequencing data will assist the recovery and identification of multiple gene copies, which can be particularly useful for reconstructing the multiple independent origins of polyploids.
Collapse
Affiliation(s)
- Weixuan Ning
- School of Natural SciencesMassey UniversityPalmerston North4442New Zealand
| | - Heidi M. Meudt
- Museum of New Zealand Te Papa TongarewaWellington6011New Zealand
| | - Jennifer A. Tate
- School of Natural SciencesMassey UniversityPalmerston North4442New Zealand
| |
Collapse
|
13
|
Wang Y, Wang H, Ye C, Wang Z, Ma C, Lin D, Jin X. Progress in systematics and biogeography of Orchidaceae. PLANT DIVERSITY 2024; 46:425-434. [PMID: 39280975 PMCID: PMC11390685 DOI: 10.1016/j.pld.2024.05.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 05/10/2024] [Accepted: 05/13/2024] [Indexed: 09/18/2024]
Abstract
Orchidaceae are one of the largest families of angiosperms in terms of species richness. In the last decade, numerous studies have delved into reconstructing the phylogenetic framework of Orchidaceae, leveraging data from plastid, mitochondrial and nuclear sources. These studies have provided new insights into the systematics, diversification and biogeography of Orchidaceae, establishing a robust foundation for future research. Nevertheless, pronounced controversies persist regarding the precise placement of certain lineages within these phylogenetic frameworks. To address these discrepancies and deepen our understanding of the phylogenetic structure of Orchidaceae, we provide a comprehensive overview and analysis of phylogenetic studies focusing on contentious groups within Orchidaceae since 2015, delving into discussions on the underlying reasons for observed topological conflicts. We also provide a novel phylogenetic framework at the subtribal level. Furthermore, we examine the tempo and mode underlying orchid species diversity from the perspective of historical biogeography, highlighting factors contributing to extensive speciation. Ultimately, we delineate avenues for future research aimed at enhancing our understanding of Orchidaceae phylogeny and diversity.
Collapse
Affiliation(s)
- Yajun Wang
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Hanchen Wang
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Chao Ye
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Zhiping Wang
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Chongbo Ma
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Dongliang Lin
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| | - Xiaohua Jin
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, the Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
| |
Collapse
|
14
|
Zhang WW, Weng ZY, Wang X, Yang Y, Li D, Wang L, Liu XC, Meng ZN. Genetic mechanism of body size variation in groupers: Insights from phylotranscriptomics. Zool Res 2024; 45:314-328. [PMID: 38485502 PMCID: PMC11017090 DOI: 10.24272/j.issn.2095-8137.2023.222] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 12/05/2023] [Indexed: 03/19/2024] Open
Abstract
Animal body size variation is of particular interest in evolutionary biology, but the genetic basis remains largely unknown. Previous studies have shown the presence of two parallel evolutionary genetic clusters within the fish genus Epinephelus with evident divergence in body size, providing an excellent opportunity to investigate the genetic basis of body size variation in vertebrates. Herein, we performed phylotranscriptomic analysis and reconstructed the phylogeny of 13 epinephelids originating from the South China Sea. Two genetic clades with an estimated divergence time of approximately 15.4 million years ago were correlated with large and small body size, respectively. A total of 180 rapidly evolving genes and two positively selected genes were identified between the two groups. Functional enrichment analyses of these candidate genes revealed distinct enrichment categories between the two groups. These pathways and genes may play important roles in body size variation in groupers through complex regulatory networks. Based on our results, we speculate that the ancestors of the two divergent groups of groupers may have adapted to different environments through habitat selection, leading to genetic variations in metabolic patterns, organ development, and lifespan, resulting in body size divergence between the two locally adapted populations. These findings provide important insights into the genetic mechanisms underlying body size variation in groupers and species differentiation.
Collapse
Affiliation(s)
- Wei-Wei Zhang
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Province Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Zhuo-Ying Weng
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Province Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Xi Wang
- Area of Ecology and Biodiversity, School of Biological Sciences, University of Hong Kong, Hong Kong SAR 999077, China
| | - Yang Yang
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Province Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Duo Li
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Province Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Le Wang
- Molecular Population Genetics Group, Temasek Life Sciences Laboratory, Singapore City 117604, Singapore
| | - Xiao-Chun Liu
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Province Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
- Southern Laboratory of Ocean Science and Engineering (Zhuhai), Zhuhai, Guangdong 519000, China
| | - Zi-Ning Meng
- State Key Laboratory of Biocontrol, Institute of Aquatic Economic Animals and Guangdong Province Key Laboratory of Aquatic Economic Animals, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
- Southern Laboratory of Ocean Science and Engineering (Zhuhai), Zhuhai, Guangdong 519000, China. E-mail:
| |
Collapse
|
15
|
Yang X, Zheng S, Wang X, Wang J, Ali Shah SB, Wang Y, Gao R, Xu Z. Advances in pharmacology, biosynthesis, and metabolic engineering of Scutellaria-specialized metabolites. Crit Rev Biotechnol 2024; 44:302-318. [PMID: 36581326 DOI: 10.1080/07388551.2022.2149386] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 10/11/2022] [Accepted: 11/02/2022] [Indexed: 12/31/2022]
Abstract
Scutellaria Linn., which belongs to the family Lamiaceae, is a commonly used medicinal plant for heat clearing and detoxification. In particular, the roots of S. baicalensis and the entire herb of S. barbata have been widely used in traditional medicine for thousands of years. The main active components of Scutellaria, including: baicalein, wogonin, norwogonin, scutellarein, and their glycosides have potential or existing drug usage. However, the wild resources of Scutellaria plants have been overexploited, and degenerated germplasm resources cannot fulfill the requirements of chemical extraction and clinical usage. Metabolic engineering and green production via microorganisms provide alternative strategies for greater efficiency in the production of natural products. Here, we review the progress of: pharmacological investigations, multi-omics, biosynthetic pathways, and metabolic engineering of various Scutellaria species and their active compounds. In addition, based on multi-omics data, we systematically analyze the phylogenetic relationships of Scutellaria and predict candidate transcription factors related to the regulation of active flavonoids. Finally, we propose the prospects of directed evolution of core enzymes and genome-assisted breeding to alleviate the shortage of plant resources of Scutellaria. This review provides important insights into the sustainable utilization and development of Scutellaria resources.
Collapse
Affiliation(s)
- Xinyi Yang
- Ministry of Education, Key Laboratory of Saline-alkali Vegetation Ecology Restoration (Northeast Forestry University), Harbin, China
- College of Life Science, Northeast Forestry University, Harbin, China
| | - Sihao Zheng
- China National Traditional Chinese Medicine Co., Ltd, Beijing, China
| | - Xiaotong Wang
- Ministry of Education, Key Laboratory of Saline-alkali Vegetation Ecology Restoration (Northeast Forestry University), Harbin, China
- College of Life Science, Northeast Forestry University, Harbin, China
| | - Jing Wang
- Ministry of Education, Key Laboratory of Saline-alkali Vegetation Ecology Restoration (Northeast Forestry University), Harbin, China
- College of Life Science, Northeast Forestry University, Harbin, China
| | - Syed Basit Ali Shah
- Ministry of Education, Key Laboratory of Saline-alkali Vegetation Ecology Restoration (Northeast Forestry University), Harbin, China
- College of Life Science, Northeast Forestry University, Harbin, China
| | - Yu Wang
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Ranran Gao
- The Artemisinin Research Center, Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
| | - Zhichao Xu
- Ministry of Education, Key Laboratory of Saline-alkali Vegetation Ecology Restoration (Northeast Forestry University), Harbin, China
- College of Life Science, Northeast Forestry University, Harbin, China
| |
Collapse
|
16
|
Ma B, Gong H, Xu Q, Gao Y, Guan A, Wang H, Hua K, Luo R, Jin H. Bases-dependent Rapid Phylogenetic Clustering (Bd-RPC) enables precise and efficient phylogenetic estimation in viruses. Virus Evol 2024; 10:veae005. [PMID: 38361823 PMCID: PMC10868571 DOI: 10.1093/ve/veae005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 01/06/2024] [Accepted: 01/22/2024] [Indexed: 02/17/2024] Open
Abstract
Understanding phylogenetic relationships among species is essential for many biological studies, which call for an accurate phylogenetic tree to understand major evolutionary transitions. The phylogenetic analyses present a major challenge in estimation accuracy and computational efficiency, especially recently facing a wave of severe emerging infectious disease outbreaks. Here, we introduced a novel, efficient framework called Bases-dependent Rapid Phylogenetic Clustering (Bd-RPC) for new sample placement for viruses. In this study, a brand-new recoding method called Frequency Vector Recoding was implemented to approximate the phylogenetic distance, and the Phylogenetic Simulated Annealing Search algorithm was developed to match the recoded distance matrix with the phylogenetic tree. Meanwhile, the indel (insertion/deletion) was heuristically introduced to foreign sequence recognition for the first time. Here, we compared the Bd-RPC with the recent placement software (PAGAN2, EPA-ng, TreeBeST) and evaluated it in Alphacoronavirus, Alphaherpesvirinae, and Betacoronavirus by using Split and Robinson-Foulds distances. The comparisons showed that Bd-RPC maintained the highest precision with great efficiency, demonstrating good performance in new sample placement on all three virus genera. Finally, a user-friendly website (http://www.bd-rpc.xyz) is available for users to classify new samples instantly and facilitate exploration of the phylogenetic research in viruses, and the Bd-RPC is available on GitHub (http://github.com/Bin-Ma/bd-rpc).
Collapse
Affiliation(s)
- Bin Ma
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Huimin Gong
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Qianshuai Xu
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Yuan Gao
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Aohan Guan
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Haoyu Wang
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Kexin Hua
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Rui Luo
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| | - Hui Jin
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
- College of Veterinary Medicine, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, Hubei 430070, China
| |
Collapse
|
17
|
Zhang Z, Liu G, Li M. Phylotranscriptomic discordance is best explained by incomplete lineage sorting within Allium subgenus Cyathophora and thus hemiplasy accounts for interspecific trait transition. PLANT DIVERSITY 2024; 46:28-38. [PMID: 38343588 PMCID: PMC10851291 DOI: 10.1016/j.pld.2023.07.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 06/30/2023] [Accepted: 07/13/2023] [Indexed: 12/20/2024]
Abstract
The transition of traits between genetically related lineages is a fascinating topic that provides clues to understanding the drivers of speciation and diversification. Much can be learned about this process from phylogeny-based trait evolution. However, such inference is often plagued by genome-wide gene-tree discordance (GTD), mostly due to incomplete lineage sorting (ILS) and/or introgressive hybridization, especially when the genes underlying the traits appear discordant. Here, by collecting transcriptomes, whole chloroplast genomes (cpDNA), and population genetic datasets, we used the coalescent model to turn GTD into a source of information for ILS and employed hemiplasy to explain specific cases of apparent "phylogenetic discordance" between different morphological traits and probable species phylogeny in the Allium subg. Cyathophora. Both concatenation and coalescence methods consistently showed the same phylogenetic topology for species tree inference based on single-copy genes (SCGs), as supported by the KS distribution. However, GTD was high across the genomes of subg. Cyathophora: ∼27%-38.9% of the SCG trees were in conflict with the species tree. Plasmid and nuclear incongruence was also present. Our coalescent simulations indicated that such GTD was mainly a product of ILS. Our hemiplasy risk factor calculations supported that random fixation of ancient polymorphisms in different populations during successive speciation events along the subg. Cyathophora phylogeny may have caused the character transition, as well as the anomalous cpDNA tree. Our study exemplifies how phylogenetic noise can be transformed into evolutionary information for understanding character state transitions along species phylogenies.
Collapse
Affiliation(s)
- Zengzhu Zhang
- State Key Laboratory of Herbage Improvement and Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, Gansu, PR China
| | - Gang Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, Gansu, PR China
| | - Minjie Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, Gansu, PR China
| |
Collapse
|
18
|
He XJ. Integrating high-volume molecular and morphological data into the evolutionary studies of Allium. PLANT DIVERSITY 2024; 46:1-2. [PMID: 38343593 PMCID: PMC10851282 DOI: 10.1016/j.pld.2023.12.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 12/11/2023] [Indexed: 10/28/2024]
Affiliation(s)
- Xing-Jin He
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, Sichuan, PR China
| |
Collapse
|
19
|
Steenwyk JL, Li Y, Zhou X, Shen XX, Rokas A. Incongruence in the phylogenomics era. Nat Rev Genet 2023; 24:834-850. [PMID: 37369847 PMCID: PMC11499941 DOI: 10.1038/s41576-023-00620-x] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/19/2023] [Indexed: 06/29/2023]
Abstract
Genome-scale data and the development of novel statistical phylogenetic approaches have greatly aided the reconstruction of a broad sketch of the tree of life and resolved many of its branches. However, incongruence - the inference of conflicting evolutionary histories - remains pervasive in phylogenomic data, hampering our ability to reconstruct and interpret the tree of life. Biological factors, such as incomplete lineage sorting, horizontal gene transfer, hybridization, introgression, recombination and convergent molecular evolution, can lead to gene phylogenies that differ from the species tree. In addition, analytical factors, including stochastic, systematic and treatment errors, can drive incongruence. Here, we review these factors, discuss methodological advances to identify and handle incongruence, and highlight avenues for future research.
Collapse
Affiliation(s)
- Jacob L Steenwyk
- Howards Hughes Medical Institute and the Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
- Vanderbilt Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN, USA
| | - Yuanning Li
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
| | - Xiaofan Zhou
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, China
| | - Xing-Xing Shen
- Key Laboratory of Biology of Crop Pathogens and Insects of Zhejiang Province, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA.
- Vanderbilt Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN, USA.
- Heidelberg Institute for Theoretical Studies, Heidelberg, Germany.
| |
Collapse
|
20
|
Yang L, Harris AJ, Wen F, Li Z, Feng C, Kong H, Kang M. Phylogenomic Analyses Reveal an Allopolyploid Origin of Core Didymocarpinae (Gesneriaceae) Followed by Rapid Radiation. Syst Biol 2023; 72:1064-1083. [PMID: 37158589 PMCID: PMC10627561 DOI: 10.1093/sysbio/syad029] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 04/15/2023] [Accepted: 05/05/2023] [Indexed: 05/10/2023] Open
Abstract
Allopolyploid plants have long been regarded as possessing genetic advantages under certain circumstances due to the combined effects of their hybrid origins and duplicated genomes. However, the evolutionary consequences of allopolyploidy in lineage diversification remain to be fully understood. Here, we investigate the evolutionary consequences of allopolyploidy using 138 transcriptomic sequences of Gesneriaceae, including 124 newly sequenced, focusing particularly on the largest subtribe Didymocarpinae. We estimated the phylogeny of Gesneriaceae using concatenated and coalescent-based methods based on five different nuclear matrices and 27 plastid genes, focusing on relationships among major clades. To better understand the evolutionary affinities in this family, we applied a range of approaches to characterize the extent and cause of phylogenetic incongruence. We found that extensive conflicts between nuclear and chloroplast genomes and among nuclear genes were caused by both incomplete lineage sorting (ILS) and reticulation, and we found evidence of widespread ancient hybridization and introgression. Using the most highly supported phylogenomic framework, we revealed multiple bursts of gene duplication throughout the evolutionary history of Gesneriaceae. By incorporating molecular dating and analyses of diversification dynamics, our study shows that an ancient allopolyploidization event occurred around the Oligocene-Miocene boundary, which may have driven the rapid radiation of core Didymocarpinae.
Collapse
Affiliation(s)
- Lihua Yang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
| | - A J Harris
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
| | - Fang Wen
- Guangxi Institute of Botany, Guangxi Zhang Autonomous Region and the Chinese Academy of Sciences, 541006 Guilin, China
| | - Zheng Li
- Department of Ecology and Evolutionary Biology, University of Arizona, 1041 E. Lowell St., Tucson, AZ 85721, USA
| | - Chao Feng
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
| | - Hanghui Kong
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
| | - Ming Kang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
| |
Collapse
|
21
|
Yang F, Ge J, Guo Y, Olmstead R, Sun W. Deciphering complex reticulate evolution of Asian Buddleja (Scrophulariaceae): insights into the taxonomy and speciation of polyploid taxa in the Sino-Himalayan region. ANNALS OF BOTANY 2023; 132:15-28. [PMID: 36722368 PMCID: PMC10550280 DOI: 10.1093/aob/mcad022] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 01/31/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND AND AIMS Species of the genus Buddleja in Asia are mainly distributed in the Sino-Himalayan region and form a challenging taxonomic group, with extensive hybridization and polyploidization. A phylogenetic approach to unravelling the history of reticulation in this lineage will deepen our understanding of the speciation in biodiversity hotspots. METHODS For this study, we obtained 80 accessions representing all the species in the Asian Buddleja clade, and the ploidy level of each taxon was determined by flow cytometry analyses. Whole plastid genomes, nuclear ribosomal DNA, single nucleotide polymorphisms and a large number of low-copy nuclear genes assembled from genome skimming data were used to investigate the reticulate evolutionary history of Asian Buddleja. Complex cytonuclear conflicts were detected through a comparison of plastid and species trees. Gene tree incongruence was also analysed to detect any reticulate events in the history of this lineage. KEY RESULTS Six hybridization events were detected, which are able to explain the cytonuclear conflict in Asian Buddleja. Furthermore, PhyloNet analysis combining species ploidy data indicated several allopolyploid speciation events. A strongly supported species tree inferred from a large number of low-copy nuclear genes not only corrected some earlier misinterpretations, but also indicated that there are many Asian Buddleja species that have been lumped mistakenly. Divergent time estimation shows two periods of rapid diversification (8-10 and 0-3 Mya) in the Asian Buddleja clade, which might coincide with the final uplift of the Hengduan Mountains and Quaternary climate fluctuations, respectively. CONCLUSIONS This study presents a well-supported phylogenetic backbone for the Asian Buddleja species, elucidates their complex and reticulate evolutionary history and suggests that tectonic activity, climate fluctuations, polyploidization and hybridization together promoted the diversification of this lineage.
Collapse
Affiliation(s)
- Fengmao Yang
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, Yunnan, China
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Kunming Institute of Botany, Chinese Academy of Sciences (CAS), Kunming 650201, Yunnan, China
| | - Jia Ge
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, Yunnan, China
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Kunming Institute of Botany, Chinese Academy of Sciences (CAS), Kunming 650201, Yunnan, China
| | - Yongjie Guo
- Germplasm Bank of Wild Species of China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, Yunnan, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Richard Olmstead
- Department of Biology and Burke Museum, University of Washington, Seattle, WA 98195, USA
| | - Weibang Sun
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, Yunnan, China
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Kunming Institute of Botany, Chinese Academy of Sciences (CAS), Kunming 650201, Yunnan, China
| |
Collapse
|
22
|
Wang S, Gao J, Li Z, Chen K, Pu W, Feng C. Phylotranscriptomics supports numerous polyploidization events and phylogenetic relationships in Nicotiana. FRONTIERS IN PLANT SCIENCE 2023; 14:1205683. [PMID: 37575947 PMCID: PMC10421670 DOI: 10.3389/fpls.2023.1205683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 07/04/2023] [Indexed: 08/15/2023]
Abstract
Introduction Nicotiana L. (Solanaceae) is of great scientific and economic importance, and polyploidization has been pivotal in shaping this genus. Despite many previous studies on the Nicotiana phylogenetic relationship and hybridization, evidence from whole genome data is still lacking. Methods In this study, we obtained 995 low-copy genes and plastid transcript fragments from the transcriptome datasets of 26 Nicotiana species, including all sections. We reconstructed the phylogenetic relationship and phylogenetic network of diploid species. Results The incongruence among gene trees showed that the formation of N. sylvestris involved incomplete lineage sorting. The nuclear-plastid discordance and nuclear introgression absence indicated that organelle capture from section Trigonophyllae was involved in forming section Petunioides. Furthermore, we analyzed the evolutionary origin of polyploid species and dated the time of hybridization events based on the analysis of PhyloNet, sequence similarity search, and phylogeny of subgenome approaches. Our results highly evidenced the hybrid origins of five polyploid sections, including sections Nicotiana, Repandae, Rusticae, Polydicliae, and Suaveolentes. Notably, we provide novel insights into the hybridization event of section Polydicliae and Suaveolentes. The section Polydicliae formed from a single hybridization event between maternal progenitor N. attenuata and paternal progenitor N. undulata; the N. sylvestris (paternal progenitor) and the N. glauca (maternal progenitor) were involved in the formation of section Suaveolentes. Discussion This study represents the first exploration of Nicotiana polyploidization events and phylogenetic relationships using the high-throughput RNA-seq approach. It will provide guidance for further studies in molecular systematics, population genetics, and ecological adaption studies in Nicotiana and other related species.
Collapse
Affiliation(s)
- Shuaibin Wang
- Tobacco Research Institute of Technology Centre, China Tobacco Hunan Industrial Corporation, Changsha, China
| | - Junping Gao
- Tobacco Research Institute of Technology Centre, China Tobacco Hunan Industrial Corporation, Changsha, China
| | - Zhaowu Li
- Puai Medical College, Shaoyang University, Shaoyang, China
| | - Kai Chen
- Tobacco Research Institute of Technology Centre, China Tobacco Hunan Industrial Corporation, Changsha, China
| | - Wenxuan Pu
- Tobacco Research Institute of Technology Centre, China Tobacco Hunan Industrial Corporation, Changsha, China
| | - Chen Feng
- Jiangxi Provincial Key Laboratory of ex-situ Plant Conservation and Utilization, Lushan Botanical Garden, Chinese Academy of Sciences, Jiujiang, China
| |
Collapse
|
23
|
Huynh S, Cloutier A, Sin SYW. Museomics and phylogenomics of lovebirds (Psittaciformes, Psittaculidae, Agapornis) using low-coverage whole-genome sequencing. Mol Phylogenet Evol 2023; 185:107822. [PMID: 37220800 DOI: 10.1016/j.ympev.2023.107822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 05/07/2023] [Accepted: 05/19/2023] [Indexed: 05/25/2023]
Abstract
Natural history collections contain specimens that provide important insights into studies of ecology and evolution. With the advancement of high-throughput sequencing, historical DNA (hDNA) from museum specimens has become a valuable source of genomic data to study the evolutionary history of organisms. Low-coverage whole genome sequencing (WGS) has been increasingly applied to museum specimens for analyzing organelle genomes, but is still uncommon for genotyping the nuclear DNA fraction. In this study, we applied low-coverage WGS to phylogenomic analyses of parrots in the genus Agapornis by including both modern samples and historical specimens of ∼100-year-old. Agapornis are small-sized African and Malagasy parrots with diverse characters. Earlier phylogenetic studies failed to resolve the positions of some key lineages, prohibiting a robust interpretation of the biogeography and evolution of these African parrots. Here, we demonstrated the use of low-coverage WGS for generating both mitochondrial and nuclear genomic data, and evaluated data quality differences between modern and historical samples. Our resolved Agapornis phylogeny indicates the ancestor of Agapornis likely colonized Madagascar from Australasia by trans-oceanic dispersal events before dispersing to the African continent. Genome-wide SNPs also allowed us to identify the parental origins of hybrid Agapornis individuals. This study demonstrates the potential of applying low-coverage WGS to phylogenomics and population genomics analyses and illustrates how including historical museum specimens can address outstanding questions regarding the evolutionary history of contemporary lineages.
Collapse
Affiliation(s)
- Stella Huynh
- School of Biological Sciences, The University of Hong Kong, Pok Fu Lam Road, Hong Kong SAR, China
| | - Alison Cloutier
- Department of Organismic and Evolutionary Biology, Mueum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
| | - Simon Yung Wa Sin
- School of Biological Sciences, The University of Hong Kong, Pok Fu Lam Road, Hong Kong SAR, China.
| |
Collapse
|
24
|
Chen C, Ruhfel BR, Li J, Wang Z, Zhang L, Zhang L, Mao X, Wang J, He D, Luo Y, Hu Q, Duan Y, Xu X, Xi Z, Liu J. Phylotranscriptomics of Swertiinae (Gentianaceae) reveals that key floral traits are not phylogenetically correlated. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023. [PMID: 36749624 DOI: 10.1111/jipb.13464] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Accepted: 02/06/2023] [Indexed: 06/18/2023]
Abstract
Establishing how lineages with similar traits are phylogenetically related remains critical for understanding the origin of biodiversity on Earth. Floral traits in plants are widely used to explore phylogenetic relationships and to delineate taxonomic groups. The subtribe Swertiinae (Gentianaceae) comprises more than 350 species with high floral diversity ranging from rotate to tubular corollas and possessing diverse nectaries. Here we performed phylogenetic analysis of 60 species from all 15 genera of the subtribe Swertiinae sensu Ho and Liu, representing the range of floral diversity, using data from the nuclear and plastid genomes. Extensive topological conflicts were present between the nuclear and plastome trees. Three of the 15 genera represented by multiple species are polyphyletic in both trees. Key floral traits including corolla type, absence or presence of lobe scales, nectary type, nectary position, and stigma type are randomly distributed in the nuclear and plastome trees without phylogenetic correlation. We also revealed the likely ancient hybrid origin of one large clade comprising 10 genera with diverse floral traits. These results highlight the complex evolutionary history of this subtribe. The phylogenies constructed here provide a basic framework for further exploring the ecological and genetic mechanisms underlying both species diversification and floral diversity.
Collapse
Affiliation(s)
- Chunlin Chen
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Brad R Ruhfel
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, 48109, USA
| | - Jialiang Li
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Zefu Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Lushui Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Lei Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Xingxing Mao
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Ji Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Dashan He
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Yue Luo
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Quanjun Hu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Yuanwen Duan
- Institute Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - Xiaoting Xu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Zhenxiang Xi
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Jianquan Liu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education & State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| |
Collapse
|
25
|
Abstract
Polyploidizations, or whole-genome duplications (WGDs), in plants have increased biological complexity, facilitated evolutionary innovation, and likely enabled adaptation under harsh conditions. Besides genomic data, transcriptome data have been widely employed to detect WGDs, due to their efficient accessibility to the gene space of a species. Age distributions based on synonymous substitutions (so-called KS age distributions) for paralogs assembled from transcriptome data have identified numerous WGDs in plants, paving the way for further studies on the importance of WGDs for the evolution of seed and flowering plants. However, it is still unclear how transcriptome-based age distributions compare to those based on genomic data. In this chapter, we implemented three different de novo transcriptome assembly pipelines with two popular assemblers, i.e., Trinity and SOAPdenovo-Trans. We selected six plant species with published genomes and transcriptomes to evaluate how assembled transcripts from different pipelines perform when using KS distributions to detect previously documented WGDs in the six species. Further, using genes predicted in each genome as references, we evaluated the effects of missing genes, gene family clustering, and de novo assembled transcripts on the transcriptome-based KS distributions. Our results show that, although the transcriptome-based KS distributions differ from the genome-based ones with respect to their shapes and scales, they are still reasonably reliable for unveiling WGDs, except in species where most duplicates originated from a recent WGD. We also discuss how to overcome some possible pitfalls when using transcriptome data to identify WGDs.
Collapse
Affiliation(s)
- Jia Li
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.,VIB Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
| | - Zhen Li
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
| |
Collapse
|
26
|
Guo C, Luo Y, Gao LM, Yi TS, Li HT, Yang JB, Li DZ. Phylogenomics and the flowering plant tree of life. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023; 65:299-323. [PMID: 36416284 DOI: 10.1111/jipb.13415] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 11/22/2022] [Indexed: 06/16/2023]
Abstract
The advances accelerated by next-generation sequencing and long-read sequencing technologies continue to provide an impetus for plant phylogenetic study. In the past decade, a large number of phylogenetic studies adopting hundreds to thousands of genes across a wealth of clades have emerged and ushered plant phylogenetics and evolution into a new era. In the meantime, a roadmap for researchers when making decisions across different approaches for their phylogenomic research design is imminent. This review focuses on the utility of genomic data (from organelle genomes, to both reduced representation sequencing and whole-genome sequencing) in phylogenetic and evolutionary investigations, describes the baseline methodology of experimental and analytical procedures, and summarizes recent progress in flowering plant phylogenomics at the ordinal, familial, tribal, and lower levels. We also discuss the challenges, such as the adverse impact on orthology inference and phylogenetic reconstruction raised from systematic errors, and underlying biological factors, such as whole-genome duplication, hybridization/introgression, and incomplete lineage sorting, together suggesting that a bifurcating tree may not be the best model for the tree of life. Finally, we discuss promising avenues for future plant phylogenomic studies.
Collapse
Affiliation(s)
- Cen Guo
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Yang Luo
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Lian-Ming Gao
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- Lijiang Forest Diversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, 674100, China
| | - Ting-Shuang Yi
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Hong-Tao Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Jun-Bo Yang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- Lijiang Forest Diversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, 674100, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, China
| |
Collapse
|
27
|
Zhang J. What Has Genomics Taught An Evolutionary Biologist? GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:1-12. [PMID: 36720382 PMCID: PMC10373158 DOI: 10.1016/j.gpb.2023.01.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 01/06/2023] [Accepted: 01/19/2023] [Indexed: 01/30/2023]
Abstract
Genomics, an interdisciplinary field of biology on the structure, function, and evolution of genomes, has revolutionized many subdisciplines of life sciences, including my field of evolutionary biology, by supplying huge data, bringing high-throughput technologies, and offering a new approach to biology. In this review, I describe what I have learned from genomics and highlight the fundamental knowledge and mechanistic insights gained. I focus on three broad topics that are central to evolutionary biology and beyond-variation, interaction, and selection-and use primarily my own research and study subjects as examples. In the next decade or two, I expect that the most important contributions of genomics to evolutionary biology will be to provide genome sequences of nearly all known species on Earth, facilitate high-throughput phenotyping of natural variants and systematically constructed mutants for mapping genotype-phenotype-fitness landscapes, and assist the determination of causality in evolutionary processes using experimental evolution.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
28
|
McCarthy CGP, Mulhair PO, Siu-Ting K, Creevey CJ, O’Connell MJ. Improving Orthologous Signal and Model Fit in Datasets Addressing the Root of the Animal Phylogeny. Mol Biol Evol 2023; 40:6989790. [PMID: 36649189 PMCID: PMC9848061 DOI: 10.1093/molbev/msac276] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 12/19/2022] [Accepted: 12/23/2022] [Indexed: 01/18/2023] Open
Abstract
There is conflicting evidence as to whether Porifera (sponges) or Ctenophora (comb jellies) comprise the root of the animal phylogeny. Support for either a Porifera-sister or Ctenophore-sister tree has been extensively examined in the context of model selection, taxon sampling, and outgroup selection. The influence of dataset construction is comparatively understudied. We re-examine five animal phylogeny datasets that have supported either root hypothesis using an approach designed to enrich orthologous signal in phylogenomic datasets. We find that many component orthogroups in animal datasets fail to recover major lineages as monophyletic with the exception of Ctenophora, regardless of the supported root. Enriching these datasets to retain orthogroups recovering ≥3 major lineages reduces dataset size by up to 50% while retaining underlying phylogenetic information and taxon sampling. Site-heterogeneous phylogenomic analysis of these enriched datasets recovers both Porifera-sister and Ctenophora-sister positions, even with additional constraints on outgroup sampling. Two datasets which previously supported Ctenophora-sister support Porifera-sister upon enrichment. All enriched datasets display improved model fitness under posterior predictive analysis. While not conclusively rooting animals at either Porifera or Ctenophora, we do see an increase in signal for Porifera-sister and a decrease in signal for Ctenophore-sister when data are filtered for orthologous signal. Our results indicate that dataset size and construction as well as model fit influence animal root inference.
Collapse
Affiliation(s)
| | | | - Karen Siu-Ting
- Institute for Global Food Security, School of Biological Sciences, Queen's University Belfast, Belfast BT9 5DL, United Kingdom
| | - Christopher J Creevey
- Institute for Global Food Security, School of Biological Sciences, Queen's University Belfast, Belfast BT9 5DL, United Kingdom
| | | |
Collapse
|
29
|
Coelho MAG, Pearson GA, Boavida JRH, Paulo D, Aurelle D, Arnaud‐Haond S, Gómez‐Gras D, Bensoussan N, López‐Sendino P, Cerrano C, Kipson S, Bakran‐Petricioli T, Ferretti E, Linares C, Garrabou J, Serrão EA, Ledoux J. Not out of the Mediterranean: Atlantic populations of the gorgonian Paramuricea clavata are a separate sister species under further lineage diversification. Ecol Evol 2023; 13:e9740. [PMID: 36789139 PMCID: PMC9912747 DOI: 10.1002/ece3.9740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 12/22/2022] [Accepted: 12/27/2022] [Indexed: 01/31/2023] Open
Abstract
The accurate delimitation of species boundaries in nonbilaterian marine taxa is notoriously difficult, with consequences for many studies in ecology and evolution. Anthozoans are a diverse group of key structural organisms worldwide, but the lack of reliable morphological characters and informative genetic markers hampers our ability to understand species diversification. We investigated population differentiation and species limits in Atlantic (Iberian Peninsula) and Mediterranean lineages of the octocoral genus Paramuricea previously identified as P. clavata. We used a diverse set of molecular markers (microsatellites, RNA-seq derived single-copy orthologues [SCO] and mt-mutS [mitochondrial barcode]) at 49 locations. Clear segregation of Atlantic and Mediterranean lineages was found with all markers. Species-tree estimations based on SCO strongly supported these two clades as distinct, recently diverged sister species with incomplete lineage sorting, P. cf. grayi and P. clavata, respectively. Furthermore, a second putative (or ongoing) speciation event was detected in the Atlantic between two P. cf. grayi color morphotypes (yellow and purple) using SCO and supported by microsatellites. While segregating P. cf. grayi lineages showed considerable geographic structure, dominating circalittoral communities in southern (yellow) and western (purple) Portugal, their occurrence in sympatry at some localities suggests a degree of reproductive isolation. Overall, our results show that previous molecular and morphological studies have underestimated species diversity in Paramuricea occurring in the Iberian Peninsula, which has important implications for conservation planning. Finally, our findings validate the usefulness of phylotranscriptomics for resolving evolutionary relationships in octocorals.
Collapse
Affiliation(s)
- Márcio A. G. Coelho
- Centre for Marine Sciences (CCMAR)University of AlgarveFaroPortugal
- MARE – Marine and Environmental Sciences CentreISPA‐Instituto UniversitárioLisboaPortugal
| | | | | | - Diogo Paulo
- Centre for Marine Sciences (CCMAR)University of AlgarveFaroPortugal
| | - Didier Aurelle
- Aix Marseille Univ., Université de Toulon, CNRS, IRD, MIOMarseilleFrance
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum National d'Histoire Naturelle, CNRSSorbonne UniversitéParisFrance
| | - Sophie Arnaud‐Haond
- MARBEC (Marine Biodiversity, Exploitation and Conservation)Univ. Montpellier, IFREMER, CNRS, IRDSète CedexFrance
| | - Daniel Gómez‐Gras
- Hawai‘i Institute of Marine BiologyUniversity of Hawai‘i at MānoaKaneoheHawaiiUSA
- Departament de Biologia Evolutiva, Ecologia i Ciències AmbientalsUniversitat de Barcelona (UB)BarcelonaSpain
- Institut de Recerca de la Biodiversitat (IRBio)Universitat de Barcelona (UB)BarcelonaSpain
| | - Nathaniel Bensoussan
- Aix Marseille Univ., Université de Toulon, CNRS, IRD, MIOMarseilleFrance
- Departament de Biologia MarinaInstitut de Ciències del Mar (CSIC)BarcelonaSpain
| | - Paula López‐Sendino
- Departament de Biologia MarinaInstitut de Ciències del Mar (CSIC)BarcelonaSpain
| | - Carlo Cerrano
- Dipartimento di Scienze della Vita e dell’Ambiente (DiSVA)Università Politecnica delle MarcheAnconaItaly
- Consorzio Nazionale Interuniversitario per le Scienze del Mare (CoNISMa)RomeItaly
- Stazione Zoologica Anton DohrnNaplesItaly
- Fano Marine CenterFanoItaly
| | - Silvija Kipson
- Department of Biology, Faculty of ScienceUniversity of ZagrebZagrebCroatia
- SEAFAN – Marine Research & ConsultancyZagrebCroatia
| | | | - Eliana Ferretti
- Studio Associato GAIA s.n.c.GenoaItaly
- Institute of Marine ScienceThe University of AucklandAucklandNew Zealand
| | - Cristina Linares
- Departament de Biologia Evolutiva, Ecologia i Ciències AmbientalsUniversitat de Barcelona (UB)BarcelonaSpain
- Institut de Recerca de la Biodiversitat (IRBio)Universitat de Barcelona (UB)BarcelonaSpain
| | - Joaquim Garrabou
- Aix Marseille Univ., Université de Toulon, CNRS, IRD, MIOMarseilleFrance
- Departament de Biologia MarinaInstitut de Ciències del Mar (CSIC)BarcelonaSpain
| | - Ester A. Serrão
- Centre for Marine Sciences (CCMAR)University of AlgarveFaroPortugal
- CIBIO/InBIO‐Centro de Investigação em Biodiversidade e Recursos GenéticosVairãoPortugal
| | - Jean‐Baptiste Ledoux
- CIIMAR/CIMAR, Centro Interdisciplinar de Investigação Marinha e AmbientalUniversidade do PortoPortoPortugal
| |
Collapse
|
30
|
Phylotranscriptomics interrogation uncovers a complex evolutionary history for the planarian genus Dugesia (Platyhelminthes, Tricladida) in the Western Mediterranean. Mol Phylogenet Evol 2023; 178:107649. [PMID: 36280167 DOI: 10.1016/j.ympev.2022.107649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 10/13/2022] [Accepted: 10/18/2022] [Indexed: 11/17/2022]
Abstract
The Mediterranean is one of the most biodiverse areas of the Paleartic region. Here, basing on large data sets of single copy orthologs obtained from transcriptomic data, we investigated the evolutionary history of the genus Dugesia in the Western Mediterranean area. The results corroborated that the complex paleogeological history of the region was an important driver of diversification for the genus, speciating as microplates and islands were forming. These processes led to the differentiation of three main biogeographic clades: Iberia-Apennines-Alps, Corsica-Sardinia, and Iberia-Africa. The internal relationships of these major clades were analysed with several representative samples per species. The use of large data sets regarding the number of loci and samples, as well as state-of-the-art phylogenomic inference methods allowed us to answer different unresolved questions about the evolution of particular groups, such as the diversification path of D. subtentaculata in the Iberian Peninsula and its colonization of Africa. Additionally, our results support the differentiation of D. benazzii in two lineages which could represent two species. Finally, we analysed here for the first time a comprehensive number of samples from several asexual Iberian populations whose assignment at the species level has been an enigma through the years. The phylogenies obtained with different inference methods showed a branching topology of asexual individuals at the base of sexual clades. We hypothesize that this unexpected topology is related to long-term asexuality. This work represents the first phylotranscriptomic analysis of Tricladida, laying the first stone of the genomic era in phylogenetic studies on this taxonomic group.
Collapse
|
31
|
Xiao J, Lyu R, He J, Li M, Ji J, Cheng J, Xie L. Genome-partitioning strategy, plastid and nuclear phylogenomic discordance, and its evolutionary implications of Clematis (Ranunculaceae). FRONTIERS IN PLANT SCIENCE 2022; 13:1059379. [PMID: 36452086 PMCID: PMC9703796 DOI: 10.3389/fpls.2022.1059379] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Accepted: 10/24/2022] [Indexed: 06/17/2023]
Abstract
Clematis is one of the largest genera of Ranunculaceae with many phylogenetic problems left to be resolved. Clematis species have considerable genome size of more than 7 Gbp, and there was no whole-genome reference sequence published in this genus. This raises difficulties in acquiring nuclear genome data for its phylogenetic analysis. Previous studies based on Sanger sequencing data, plastid genome data, and nrDNA sequences did not well resolve the phylogeny of Clematis. In this study, we used genome skimming and transcriptome data to assemble the plastid genome sequences, nuclear single nucleotide polymorphisms (SNPs) datasets, and single-copy nuclear orthologous genes (SCOGs) to reconstruct the phylogenetic backbone of Clematis, and test effectiveness of these genome partitioning methods. We also further analyzed the discordance among nuclear gene trees and between plastid and nuclear phylogenies. The results showed that the SCOGs datasets, assembled from transcriptome method, well resolved the phylogenetic backbone of Clematis. The nuclear SNPs datasets from genome skimming method can also produce similar results with the SCOGs data. In contrast to the plastid phylogeny, the phylogeny resolved by nuclear genome data is more robust and better corresponds to morphological characters. Our results suggested that rapid species radiation may have generated high level of incomplete lineage sorting, which was the major cause of nuclear gene discordance. Our simulation also showed that there may have been frequent interspecific hybridization events, which led to some of the cyto-nuclear discordances in Clematis. This study not only provides the first robust phylogenetic backbone of Clematis based on nuclear genome data, but also provides suggestions of genome partitioning strategies for the phylogenomic study of other plant taxa.
Collapse
Affiliation(s)
- Jiamin Xiao
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Rudan Lyu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Jian He
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Mingyang Li
- College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Jiaxin Ji
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Jin Cheng
- College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Lei Xie
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| |
Collapse
|
32
|
Çiftçi O, Alverson AJ, van Bodegom P, Roberts WR, Mertens A, Van de Vijver B, Trobajo R, Mann DG, Pirovano W, van Eijk I, Gravendeel B. Phylotranscriptomics reveals the reticulate evolutionary history of a widespread diatom species complex. JOURNAL OF PHYCOLOGY 2022; 58:643-656. [PMID: 35861132 PMCID: PMC9804273 DOI: 10.1111/jpy.13281] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Accepted: 06/29/2022] [Indexed: 06/15/2023]
Abstract
In contrast to surveys based on a few genes that often provide limited taxonomic resolution, transcriptomes provide a wealth of genomic loci that can resolve relationships among taxonomically challenging lineages. Diatoms are a diverse group of aquatic microalgae that includes important bioindicator species and many such lineages. One example is Nitzschia palea, a widespread species complex with several morphologically defined taxonomic varieties, some of which are critical pollution indicators. Morphological differences among the varieties are subtle and phylogenetic studies based on a few genes fail to resolve their evolutionary relationships. We conducted morphometric and transcriptome analyses of 10 Nitzschia palea strains to resolve the relationships among strains and taxonomic varieties. Nitzschia palea was resolved into three clades, one of which corresponds to a group of strains with narrow linear-lanceolate valves. The other morphological group recovered in the shape outline analysis was not monophyletic and consisted of two clades. Gene-tree concordance analyses and phylogenetic network estimations revealed patterns of incomplete lineage sorting and gene flow between intraspecific lineages. We detected reticulated evolutionary patterns among lineages with different morphologies, resulting in a putative recent hybrid. Our study shows that phylogenomic analyses of unlinked nuclear loci, complemented with morphometrics, can resolve complex evolutionary histories of recently diverged species complexes.
Collapse
Affiliation(s)
- Ozan Çiftçi
- Institute of Environmental Sciences (CML)Leiden UniversityBox 95182300 RALeidenThe Netherlands
- Naturalis Biodiversity CenterDarwinweg 22333 CRLeidenThe Netherlands
- BaseClear B.VSylviusweg 742333 BELeidenthe Netherlands
| | - Andrew J. Alverson
- Department of Biological SciencesUniversity of Arkansas, 1 University of ArkansasFayettevilleArkansas72701USA
| | - Peter van Bodegom
- Institute of Environmental Sciences (CML)Leiden UniversityBox 95182300 RALeidenThe Netherlands
| | - Wade R. Roberts
- Department of Biological SciencesUniversity of Arkansas, 1 University of ArkansasFayettevilleArkansas72701USA
| | | | - Bart Van de Vijver
- Meise Botanic Garden Meise, Research DepartmentNieuwelaan 381860MeiseBelgium
- University of Antwerp, Department of Biology – ECOBEUniversiteitsplein 1B‐2610WilrijkBelgium
| | - Rosa Trobajo
- IRTA‐Institute for Food and Agricultural Research and Technology, Marine and Continental Waters ProgrammeCtra de Poble Nou Km 5.5, E43540, La RàpitaCataloniaSpain
| | - David G. Mann
- IRTA‐Institute for Food and Agricultural Research and Technology, Marine and Continental Waters ProgrammeCtra de Poble Nou Km 5.5, E43540, La RàpitaCataloniaSpain
- Royal Botanic Garden EdinburghEdinburghEH3 5LRScotlandUK
| | | | - Iris van Eijk
- Bayer Crop ScienceLeeuwenhoekweg 522661 CZBergschenhoekThe Netherlands
| | - Barbara Gravendeel
- Naturalis Biodiversity CenterDarwinweg 22333 CRLeidenThe Netherlands
- Radboud Institute for Biological and Environmental SciencesHeyendaalseweg 1356500 GLNijmegenThe Netherlands
| |
Collapse
|
33
|
Santos R, Ástvaldsson Á, Pipaliya SV, Zumthor JP, Dacks JB, Svärd S, Hehl AB, Faso C. Combined nanometric and phylogenetic analysis of unique endocytic compartments in Giardia lamblia sheds light on the evolution of endocytosis in Metamonada. BMC Biol 2022; 20:206. [PMID: 36127707 PMCID: PMC9490929 DOI: 10.1186/s12915-022-01402-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 09/06/2022] [Indexed: 11/27/2022] Open
Abstract
Background Giardia lamblia, a parasitic protist of the Metamonada supergroup, has evolved one of the most diverged endocytic compartment systems investigated so far. Peripheral endocytic compartments, currently known as peripheral vesicles or vacuoles (PVs), perform bulk uptake of fluid phase material which is then digested and sorted either to the cell cytosol or back to the extracellular space. Results Here, we present a quantitative morphological characterization of these organelles using volumetric electron microscopy and super-resolution microscopy (SRM). We defined a morphological classification for the heterogenous population of PVs and performed a comparative analysis of PVs and endosome-like organelles in representatives of phylogenetically related taxa, Spironucleus spp. and Tritrichomonas foetus. To investigate the as-yet insufficiently understood connection between PVs and clathrin assemblies in G. lamblia, we further performed an in-depth search for two key elements of the endocytic machinery, clathrin heavy chain (CHC) and clathrin light chain (CLC), across different lineages in Metamonada. Our data point to the loss of a bona fide CLC in the last Fornicata common ancestor (LFCA) with the emergence of a protein analogous to CLC (GlACLC) in the Giardia genus. Finally, the location of clathrin in the various compartments was quantified. Conclusions Taken together, this provides the first comprehensive nanometric view of Giardia’s endocytic system architecture and sheds light on the evolution of GlACLC analogues in the Fornicata supergroup and, specific to Giardia, as a possible adaptation to the formation and maintenance of stable clathrin assemblies at PVs. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01402-3.
Collapse
Affiliation(s)
- Rui Santos
- Institute of Parasitology, University of Zürich, Winterthurerstrasse 266a, 8057, Zürich, Switzerland.,Institute of Anatomy, University of Zürich, Winterthurerstrasse 190, 8057, Zürich, Switzerland
| | - Ásgeir Ástvaldsson
- Department of Cell and Molecular Biology, University of Uppsala, Husargatan 3, 752 37, Uppsala, Sweden.,Department of Microbiology, National Veterinary Institute, 751 23, Uppsala, Sweden
| | - Shweta V Pipaliya
- Division of Infectious Diseases, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada.,School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Jon Paulin Zumthor
- Amt für Lebensmittelsicherheit und Tiergesundheit Graubünden, Chur, Switzerland
| | - Joel B Dacks
- Division of Infectious Diseases, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada.,Institute of Parasitology, Biology Centre, CAS, v.v.i., Branisovska 31, 370 05, Ceske Budejovice, Czech Republic
| | - Staffan Svärd
- Department of Cell and Molecular Biology, University of Uppsala, Husargatan 3, 752 37, Uppsala, Sweden
| | - Adrian B Hehl
- Institute of Parasitology, University of Zürich, Winterthurerstrasse 266a, 8057, Zürich, Switzerland
| | - Carmen Faso
- Institute of Cell Biology, University of Bern, Bern, Switzerland. .,Multidisciplinary Center for Infectious Diseases, Vetsuisse, University of Bern, Bern, Switzerland.
| |
Collapse
|
34
|
Phylogeny and evolution of Cupressaceae: Updates on intergeneric relationships and new insights on ancient intergeneric hybridization. Mol Phylogenet Evol 2022; 177:107606. [PMID: 35952837 DOI: 10.1016/j.ympev.2022.107606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 07/24/2022] [Accepted: 08/04/2022] [Indexed: 11/24/2022]
Abstract
After the merger of the former Taxodiaceae and Cupressaceae s.s., currently the conifer family Cupressaceae (sensu lato) comprises seven subfamilies and 32 genera, most of which are important components of temperate and mountainous forests. With the exception of a recently published genus-level phylogeny of gymnosperms inferred from sequence analysis of 790 orthologs, previous phylogenetic studies of Cupressaceae were based mainly on morphological characters or a few molecular markers, and did not completely resolve the intergeneric relationships. In this study, we reconstructed a robust and well-resolved phylogeny of Cupressaceae represented by all 32 genera, using 1944 genes (Orthogroups) generated from transcriptome sequencing. Reticulate evolution analyses detected a possible ancient hybridization that occurred between ancestors of two subclades of Cupressoideae, including Microbiota-Platycladus-Tetraclinis (MPT) and Juniperus-Cupressus-Hesperocyparis-Callitropsis-Xanthocyparis (JCHCX), although both concatenation and coalescent trees are highly supported. Moreover, divergence time estimation and ancestral area reconstruction indicate that Cupressaceae very likely originated in Asia in the Triassic, and geographic isolation caused by continental separation drove the vicariant evolution of the two subfamilies Cupressoideae and Callitroideae in the northern and southern hemispheres, respectively. Evolutionary analyses of some morphological characters suggest that helically arranged linear-acicular leaves and imbricate bract-scale complexes represent ancestral states, and the shift from linear-acicular leaves to scale-like leaves was associated with the shift from helical to decussate arrangement. Our study sheds new light on phylogeny and evolutionary history of Cupressaceae, and strongly suggests that both dichotomous phylogenetic and reticulate evolution analyses be conducted in phylogenomic studies.
Collapse
|
35
|
He J, Lyu R, Luo Y, Xiao J, Xie L, Wen J, Li W, Pei L, Cheng J. A phylotranscriptome study using silica gel-dried leaf tissues produces an updated robust phylogeny of Ranunculaceae. Mol Phylogenet Evol 2022; 174:107545. [PMID: 35690374 DOI: 10.1016/j.ympev.2022.107545] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 06/01/2022] [Accepted: 06/02/2022] [Indexed: 11/16/2022]
Abstract
The utility of transcriptome data in plant phylogenetics has gained popularity in recent years. However, because RNA degrades much more easily than DNA, the logistics of obtaining fresh tissues has become a major limiting factor for widely applying this method. Here, we used Ranunculaceae to test whether silica-dried plant tissues could be used for RNA extraction and subsequent phylogenomic studies. We sequenced 27 transcriptomes, 21 from silica gel-dried (SD-samples) and six from liquid nitrogen-preserved (LN-samples) leaf tissues, and downloaded 27 additional transcriptomes from GenBank. Our results showed that although the LN-samples produced slightly better reads than the SD-samples, there were no significant differences in RNA quality and quantity, assembled contig lengths and numbers, and BUSCO comparisons between two treatments. Using these data, we conducted phylogenomic analyses, including concatenated- and coalescent-based phylogenetic reconstruction, molecular dating, coalescent simulation, phylogenetic network estimation, and whole genome duplication (WGD) inference. The resulting phylogeny was consistent with previous studies with higher resolution and statistical support. The 11 core Ranunculaceae tribes grouped into two chromosome type clades (T- and R-types), with high support. Discordance among gene trees is likely due to hybridization and introgression, ancient genetic polymorphism and incomplete lineage sorting. Our results strongly support one ancient hybridization event within the R-type clade and three WGD events in Ranunculales. Evolution of the three Ranunculaceae chromosome types is likely not directly related to WGD events. By clearly resolving the Ranunculaceae phylogeny, we demonstrated that SD-samples can be used for RNA-seq and phylotranscriptomic studies of angiosperms.
Collapse
Affiliation(s)
- Jian He
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, PR China
| | - Rudan Lyu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, PR China
| | - Yike Luo
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, PR China
| | - Jiamin Xiao
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, PR China
| | - Lei Xie
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, PR China.
| | - Jun Wen
- Department of Botany, National Museum of Natural History, MRC 166, Smithsonian Institution, Washington, DC 20013-7012, USA.
| | - Wenhe Li
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, PR China
| | - Linying Pei
- Beijing Engineering Technology Research Center for Garden Plants, Beijing Forestry University Forest Science Co. Ltd., Beijing 100083, PR China
| | - Jin Cheng
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, PR China
| |
Collapse
|
36
|
Smith ML, Vanderpool D, Hahn MW. Using all gene families vastly expands data available for phylogenomic inference. Mol Biol Evol 2022; 39:6596367. [PMID: 35642314 PMCID: PMC9178227 DOI: 10.1093/molbev/msac112] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Traditionally, single-copy orthologs have been the gold standard in phylogenomics. Most phylogenomic studies identify putative single-copy orthologs using clustering approaches and retain families with a single sequence per species. This limits the amount of data available by excluding larger families. Recent advances have suggested several ways to include data from larger families. For instance, tree-based decomposition methods facilitate the extraction of orthologs from large families. Additionally, several methods for species tree inference are robust to the inclusion of paralogs and could use all of the data from larger families. Here, we explore the effects of using all families for phylogenetic inference by examining relationships among 26 primate species in detail and by analyzing five additional data sets. We compare single-copy families, orthologs extracted using tree-based decomposition approaches, and all families with all data. We explore several species tree inference methods, finding that identical trees are returned across nearly all subsets of the data and methods for primates. The relationships among Platyrrhini remain contentious; however, the species tree inference method matters more than the subset of data used. Using data from larger gene families drastically increases the number of genes available and leads to consistent estimates of branch lengths, nodal certainty and concordance, and inferences of introgression in primates. For the other data sets, topological inferences are consistent whether single-copy families or orthologs extracted using decomposition approaches are analyzed. Using larger gene families is a promising approach to include more data in phylogenomics without sacrificing accuracy, at least when high-quality genomes are available.
Collapse
Affiliation(s)
- Megan L Smith
- Department of Biology and Department of Computer Science, Indiana University, Bloomington, Indiana, USA
| | - Dan Vanderpool
- Department of Biology and Department of Computer Science, Indiana University, Bloomington, Indiana, USA
| | - Matthew W Hahn
- Department of Biology and Department of Computer Science, Indiana University, Bloomington, Indiana, USA
| |
Collapse
|
37
|
Willson J, Roddur MS, Liu B, Zaharias P, Warnow T. DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition. Syst Biol 2022; 71:610-629. [PMID: 34450658 PMCID: PMC9016570 DOI: 10.1093/sysbio/syab070] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 11/21/2022] Open
Abstract
Species tree inference from gene family trees is a significant problem in computational biology. However, gene tree heterogeneity, which can be caused by several factors including gene duplication and loss, makes the estimation of species trees very challenging. While there have been several species tree estimation methods introduced in recent years to specifically address gene tree heterogeneity due to gene duplication and loss (such as DupTree, FastMulRFS, ASTRAL-Pro, and SpeciesRax), many incur high cost in terms of both running time and memory. We introduce a new approach, DISCO, that decomposes the multi-copy gene family trees into many single copy trees, which allows for methods previously designed for species tree inference in a single copy gene tree context to be used. We prove that using DISCO with ASTRAL (i.e., ASTRAL-DISCO) is statistically consistent under the GDL model, provided that ASTRAL-Pro correctly roots and tags each gene family tree. We evaluate DISCO paired with different methods for estimating species trees from single copy genes (e.g., ASTRAL, ASTRID, and IQ-TREE) under a wide range of model conditions, and establish that high accuracy can be obtained even when ASTRAL-Pro is not able to correctly roots and tags the gene family trees. We also compare results using MI, an alternative decomposition strategy from Yang Y. and Smith S.A. (2014), and find that DISCO provides better accuracy, most likely as a result of covering more of the gene family tree leafset in the output decomposition. [Concatenation analysis; gene duplication and loss; species tree inference; summary method.].
Collapse
Affiliation(s)
- James Willson
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Mrinmoy Saha Roddur
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Baqiao Liu
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Paul Zaharias
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
38
|
Masonick P, Meyer A, Hulsey CD. Phylogenomic analyses show repeated evolution of hypertrophied lips among Lake Malawi cichlid fishes. Genome Biol Evol 2022; 14:6568296. [PMID: 35417557 PMCID: PMC9017819 DOI: 10.1093/gbe/evac051] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/03/2022] [Indexed: 11/27/2022] Open
Abstract
Cichlid fishes have repeatedly evolved an astounding diversity of trophic morphologies. For example, hypertrophied lips have evolved multiple times in both African and Neotropical cichlids and could have even evolved convergently within single species assemblages such as African Lake Malawi cichlids. However, the extremely high diversification rate in Lake Malawi cichlids and extensive potential for hybridization has cast doubt on whether even genome-level phylogenetic reconstructions could delineate if these types of adaptations have evolved once or multiple times. To examine the evolution of this iconic trait using protein-coding and noncoding single nucleotide polymorphisms (SNPs), we analyzed the genomes of 86 Lake Malawi cichlid species, including 33 de novo resequenced genomes. Surprisingly, genome-wide protein-coding SNPs exhibited enough phylogenetic informativeness to reconstruct interspecific and intraspecific relationships of hypertrophied lip cichlids, although noncoding SNPs provided better support. However, thinning of noncoding SNPs indicated most discrepancies come from the relatively smaller number of protein-coding sites and not from fundamental differences in their phylogenetic informativeness. Both coding and noncoding reconstructions showed that several “sand-dwelling” hypertrophied lip species, sampled intraspecifically, form a clade interspersed with a few other nonhypertrophied lip lineages. We also recovered Abactochromis labrosus within the rock-dwelling “mbuna” lineage, starkly contrasting with the affinities of other hypertrophied lip taxa found in the largely sand-dwelling “nonmbuna” component of this radiation. Comparative analyses coupled with tests for introgression indicate there is no widespread introgression between the hypertrophied lip lineages and taken together suggest this trophic phenotype has likely evolved at least twice independently within-lake Malawi.
Collapse
Affiliation(s)
- Paul Masonick
- Department of Biology, University of Konstanz, Universitätsstraße 10, 78464 Konstanz, Germany
| | - Axel Meyer
- Department of Biology, University of Konstanz, Universitätsstraße 10, 78464 Konstanz, Germany
| | - C Darrin Hulsey
- Department of Biology, University of Konstanz, Universitätsstraße 10, 78464 Konstanz, Germany.,Current Address: School of Biology and Environmental Science, University College Dublin, Belfield, Dublin 4, Ireland
| |
Collapse
|
39
|
MacLeod AI, Raval PK, Stockhorst S, Knopp MR, Frangedakis E, Gould SB. Loss of Plastid Developmental Genes Coincides With a Reversion to Monoplastidy in Hornworts. FRONTIERS IN PLANT SCIENCE 2022; 13:863076. [PMID: 35360315 PMCID: PMC8964177 DOI: 10.3389/fpls.2022.863076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Accepted: 02/04/2022] [Indexed: 06/14/2023]
Abstract
The first plastid evolved from an endosymbiotic cyanobacterium in the common ancestor of the Archaeplastida. The transformative steps from cyanobacterium to organelle included the transfer of control over developmental processes, a necessity for the host to orchestrate, for example, the fission of the organelle. The plastids of almost all embryophytes divide independently from nuclear division, leading to cells housing multiple plastids. Hornworts, however, are monoplastidic (or near-monoplastidic), and their photosynthetic organelles are a curious exception among embryophytes for reasons such as the occasional presence of pyrenoids. In this study, we screened genomic and transcriptomic data of eleven hornworts for components of plastid developmental pathways. We found intriguing differences among hornworts and specifically highlight that pathway components involved in regulating plastid development and biogenesis were differentially lost in this group of bryophytes. Our results also confirmed that hornworts underwent significant instances of gene loss, underpinning that the gene content of this group is significantly lower than other bryophytes and tracheophytes. In combination with ancestral state reconstruction, our data suggest that hornworts have reverted back to a monoplastidic phenotype due to the combined loss of two plastid division-associated genes, namely, ARC3 and FtsZ2.
Collapse
Affiliation(s)
- Alexander I. MacLeod
- Institute for Molecular Evolution, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| | - Parth K. Raval
- Institute for Molecular Evolution, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| | - Simon Stockhorst
- Institute for Molecular Evolution, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| | - Michael R. Knopp
- Institute for Molecular Evolution, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| | | | - Sven B. Gould
- Institute for Molecular Evolution, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany
| |
Collapse
|
40
|
Rivera-Vicéns RE, Garcia-Escudero CA, Conci N, Eitel M, Wörheide G. TransPi - a comprehensive TRanscriptome ANalysiS PIpeline for de novo transcriptome assembly. Mol Ecol Resour 2022; 22:2070-2086. [PMID: 35119207 DOI: 10.1111/1755-0998.13593] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 01/10/2022] [Accepted: 01/24/2022] [Indexed: 11/30/2022]
Abstract
The use of RNA-Seq data and the generation of de novo transcriptome assemblies have been pivotal for studies in ecology and evolution. This is distinctly true for non-model organisms, where no genome information is available. In such organisms, studies of differential gene expression, DNA enrichment baits design, and phylogenetics can all be accomplished with de novo transcriptome assemblies. Multiple tools are available for transcriptome assembly, however, no single tool can provide the best assembly for all datasets. Therefore, a multi assembler approach, followed by a reduction step, is often sought to generate an improved representation of the assembly. To reduce errors in these complex analyses while at the same time attaining reproducibility and scalability, automated workflows have been essential in the analysis of RNA-Seq data. However, most of these tools are designed for species where genome data is used as reference for the assembly process, limiting their use in non-model organisms. We present TransPi, a comprehensive pipeline for de novo transcriptome assembly, with minimum user input but without losing the ability of a thorough analysis. A combination of different model organisms, k-mer sets, read lengths, and read quantities were used for assessing the tool. Furthermore, a total of 49 non-model organisms, spanning different phyla, were also analysed. Compared to approaches using single assemblers only, TransPi produces higher BUSCO completeness percentages, and a concurrent significant reduction in duplication rates. TransPi is easy to configure and can be deployed seamlessly using Conda, Docker and Singularity.
Collapse
Affiliation(s)
- R E Rivera-Vicéns
- Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333, München, Germany
| | - C A Garcia-Escudero
- Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333, München, Germany.,Graduate School for Evolution, Ecology and Systematics, Faculty of Biology, Ludwig-Maximilians-Universität München, Biozentrum Großhaderner Str. 2, 82152, Planegg-Martinsried, Germany
| | - N Conci
- Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333, München, Germany
| | - M Eitel
- Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333, München, Germany
| | - G Wörheide
- Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333, München, Germany.,GeoBio-Center, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333, München, Germany.,SNSB-Bayerische Staatssammlung für Paläontologie und Geologie, Richard-Wagner-Str. 10, 80333, München, Germany
| |
Collapse
|
41
|
Lizano AM, Smolina I, Choquet M, Kopp M, Hoarau G. Insights into the species evolution of Calanus copepods in the northern seas revealed by de novo transcriptome sequencing. Ecol Evol 2022; 12:e8606. [PMID: 35228861 PMCID: PMC8861592 DOI: 10.1002/ece3.8606] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 01/14/2022] [Accepted: 01/19/2022] [Indexed: 01/07/2023] Open
Abstract
Copepods of the zooplankton genus Calanus play a key role in marine ecosystems in the northern seas. Although being among the most studied organisms on Earth, due to their ecological importance, genomic resources for Calanus spp. remain scarce, mostly due to their large genome size (from 6 to 12 Gbps). As an alternative to whole-genome sequencing in Calanus spp., we sequenced and de novo assembled transcriptomes of five Calanus species: Calanus glacialis, C. hyperboreus, C. marshallae, C. pacificus, and C. helgolandicus. Functional assignment of protein families based on clusters of orthologous genes (COG) and gene ontology (GO) annotations showed analogous patterns of protein functions across species. Phylogenetic analyses using maximum likelihood (ML) of 191 protein-coding genes mined from RNA-seq data fully resolved evolutionary relationships among seven Calanus species investigated (five species sequenced for this study and two species with published datasets), with gene and site concordance factors showing that 109 out of 191 protein-coding genes support a separation between three groups: the C. finmarchicus group (including C. finmarchicus, C. glacialis, and C. marshallae), the C. helgolandicus group (including C. helgolandicus, C. sinicus, and C. pacificus) and the monophyletic C. hyperboreus group. The tree topology obtained in ML analyses was similar to a previously proposed phylogeny based on morphological criteria and cleared certain ambiguities from past studies on evolutionary relationships among Calanus species.
Collapse
Affiliation(s)
| | - Irina Smolina
- Faculty of Biosciences and AquacultureNord UniversityBodøNorway
| | - Marvin Choquet
- Faculty of Biosciences and AquacultureNord UniversityBodøNorway
- Department of Medical Biochemistry and MicrobiologyUppsala UniversityUppsalaSweden
| | - Martina Kopp
- Faculty of Biosciences and AquacultureNord UniversityBodøNorway
| | - Galice Hoarau
- Faculty of Biosciences and AquacultureNord UniversityBodøNorway
| |
Collapse
|
42
|
Smith ML, Hahn MW. The Frequency and Topology of Pseudoorthologs. Syst Biol 2021; 71:649-659. [PMID: 34951639 DOI: 10.1093/sysbio/syab097] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 12/15/2021] [Accepted: 12/17/2021] [Indexed: 11/12/2022] Open
Abstract
Phylogenetics has long relied on the use of orthologs, or genes related through speciation events, to infer species relationships. However, identifying orthologs is difficult because gene duplication can obscure relationships among genes. Researchers have been particularly concerned with the insidious effects of pseudoorthologs-duplicated genes that are mistaken for orthologs because they are present in a single copy in each sampled species. Because gene tree topologies of pseudoorthologs may differ from the species tree topology, they have often been invoked as the cause of counterintuitive results in phylogenetics. Despite these perceived problems, no previous work has calculated the probabilities of pseudoortholog topologies, or has been able to circumscribe the regions of parameter space in which pseudoorthologs are most likely to occur. Here, we introduce a model for calculating the probabilities and branch lengths of orthologs and pseudoorthologs, including concordant and discordant pseudoortholog topologies, on a rooted three-taxon species tree. We show that the probability of orthologs is high relative to the probability of pseudoorthologs across reasonable regions of parameter space. Furthermore, the probabilities of the two discordant topologies are equal and never exceed that of the concordant topology, generally being much lower. We describe the species tree topologies most prone to generating pseudoorthologs, finding that they are likely to present problems to phylogenetic inference irrespective of the presence of pseudoorthologs. Overall, our results suggest that pseudoorthologs are unlikely to mislead inferences of species relationships under the biological scenarios considered here.
Collapse
Affiliation(s)
- Megan L Smith
- Department of Biology and Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
| | - Matthew W Hahn
- Department of Biology and Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
43
|
Chen L, Jin WT, Liu XQ, Wang XQ. New insights into the phylogeny and evolution of Podocarpaceae inferred from transcriptomic data. Mol Phylogenet Evol 2021; 166:107341. [PMID: 34740782 DOI: 10.1016/j.ympev.2021.107341] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 10/28/2021] [Accepted: 10/29/2021] [Indexed: 12/14/2022]
Abstract
Phylogenies of an increasing number of taxa have been resolved with the development of phylogenomics. However, the intergeneric relationships of Podocarpaceae, the second largest family of conifers comprising 19 genera and approximately 187 species mainly distributed in the Southern Hemisphere, have not been well disentangled in previous studies, even when genome-scale data sets were used. Here we used 993 nuclear orthologous groups (OGs) and 54 chloroplast OGs (genes), which were generated from 47 transcriptomes of Podocarpaceae and its sister group Araucariaceae, to reconstruct the phylogeny of Podocarpaceae. Our study completely resolved the intergeneric relationships of Podocarpaceae represented by all extant genera and revealed that topological conflicts among phylogenetic trees could be attributed to synonymous substitutions. Moreover, we found that two morphological traits, fleshy seed cones and flattened leaves, might be important for Podocarpaceae to adapt to angiosperm-dominated forests and thus could have promoted its species diversification. In addition, our results indicate that Podocarpaceae originated in Gondwana in the late Triassic and both vicariance and dispersal have contributed to its current biogeographic patterns. Our study provides the first robust transcriptome-based phylogeny of Podocarpaceae, an evolutionary framework important for future studies of this family.
Collapse
Affiliation(s)
- Luo Chen
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Wei-Tao Jin
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Xin-Quan Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xiao-Quan Wang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
44
|
Bucchini F, Del Cortona A, Kreft Ł, Botzki A, Van Bel M, Vandepoele K. TRAPID 2.0: a web application for taxonomic and functional analysis of de novo transcriptomes. Nucleic Acids Res 2021; 49:e101. [PMID: 34197621 PMCID: PMC8464036 DOI: 10.1093/nar/gkab565] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 06/07/2021] [Accepted: 06/16/2021] [Indexed: 12/24/2022] Open
Abstract
Advances in high-throughput sequencing have resulted in a massive increase of RNA-Seq transcriptome data. However, the promise of rapid gene expression profiling in a specific tissue, condition, unicellular organism or microbial community comes with new computational challenges. Owing to the limited availability of well-resolved reference genomes, de novo assembled (meta)transcriptomes have emerged as popular tools for investigating the gene repertoire of previously uncharacterized organisms. Yet, despite their potential, these datasets often contain fragmented or contaminant sequences, and their analysis remains difficult. To alleviate some of these challenges, we developed TRAPID 2.0, a web application for the fast and efficient processing of assembled transcriptome data. The initial processing phase performs a global characterization of the input data, providing each transcript with several layers of annotation, comprising structural, functional, and taxonomic information. The exploratory phase enables downstream analyses from the web application. Available analyses include the assessment of gene space completeness, the functional analysis and comparison of transcript subsets, and the study of transcripts in an evolutionary context. A comparison with similar tools highlights TRAPID’s unique features. Finally, analyses performed within TRAPID 2.0 are complemented by interactive data visualizations, facilitating the extraction of new biological insights, as demonstrated with diatom community metatranscriptomes.
Collapse
Affiliation(s)
- François Bucchini
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
| | - Andrea Del Cortona
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
| | - Łukasz Kreft
- VIB Bioinformatics Core, VIB, 9052 Ghent, Belgium
| | | | - Michiel Van Bel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
| | - Klaas Vandepoele
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
| |
Collapse
|
45
|
Ortiz González IC, Rivera-Vicéns RE, Schizas NV. Description of four Millepora spp. transcriptomes and their potential to delimit the Caribbean fire coral species. Mar Genomics 2021; 59:100863. [PMID: 33762174 DOI: 10.1016/j.margen.2021.100863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2020] [Revised: 03/03/2021] [Accepted: 03/04/2021] [Indexed: 10/21/2022]
Abstract
Millepora is a relatively species-rich genus of hydrocorals, with 16 species distributed around the globe. It is considered an important reef building cnidarian. The current diversity of Caribbean Millepora species consists of Millepora complanata, M. alcicornis, M. squarrosa and M. striata. Here, we report the de novo transcriptome assembly and phylotranscriptomic analysis of M. alcicornis, M. complanata, M. squarrosa and a undescribed morphotype (Millepora sp.) found in exposed Thalassia beds and mangrove areas in southwest Puerto Rico. Over 345 million sequence reads were obtained for the analysis of the Millepora transcriptomes (Illumina HiSeq4000; 2x150bp). The analysis pipeline consisted of assembly with Trinity, BUSCO, RSEM and ORFs calling for each transcriptome, followed by ontology (Blast2GO) and phylogenetic analysis. The phylogenetic analysis was performed after selecting homologous genes among the transcriptomes, resulting in 10,596 sequences. Concatenation analysis (Maximum Likelihood and Bayesian inference) and a coalescence-based analysis were performed to the dataset too. Concatenation analysis yielded a topology supporting a clade of M. complanata and M. alcicornis, with Millepora sp. outside this clade and M. squarrosa as an outgroup. The coalescence-based tree estimation analysis (ASTRAL-II), presented a different topology placing M. alcicornis and Millepora sp. as sister taxa, rather than grouping with M. alcicornis with M. complanata. Our coalescence analysis indicated that there is a high degree of incomplete lineage sorting, suggesting a very recent time of species emergence among three out of the four Caribbean Millepora species. Calculations of ABBA-BABA statistics derived from transcriptome-wide SNP data indicate the possible presence of introgression between Millepora complanata and M. alcicornis.
Collapse
Affiliation(s)
| | - Ramón E Rivera-Vicéns
- Department of Marine Sciences, University of Puerto Rico at Mayagüez, PO Box 9000, Mayagüez, PR 00680, USA; Department of Earth and Environmental Sciences, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333 München, Germany
| | - Nikolaos V Schizas
- Department of Marine Sciences, University of Puerto Rico at Mayagüez, PO Box 9000, Mayagüez, PR 00680, USA.
| |
Collapse
|
46
|
Spillane JL, LaPolice TM, MacManes MD, Plachetzki DC. Signal, bias, and the role of transcriptome assembly quality in phylogenomic inference. BMC Ecol Evol 2021; 21:43. [PMID: 33726665 PMCID: PMC7968300 DOI: 10.1186/s12862-021-01772-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 03/03/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Phylogenomic approaches have great power to reconstruct evolutionary histories, however they rely on multi-step processes in which each stage has the potential to affect the accuracy of the final result. Many studies have empirically tested and established methodology for resolving robust phylogenies, including selecting appropriate evolutionary models, identifying orthologs, or isolating partitions with strong phylogenetic signal. However, few have investigated errors that may be initiated at earlier stages of the analysis. Biases introduced during the generation of the phylogenomic dataset itself could produce downstream effects on analyses of evolutionary history. Transcriptomes are widely used in phylogenomics studies, though there is little understanding of how a poor-quality assembly of these datasets could impact the accuracy of phylogenomic hypotheses. Here we examined how transcriptome assembly quality affects phylogenomic inferences by creating independent datasets from the same input data representing high-quality and low-quality transcriptome assembly outcomes. RESULTS By studying the performance of phylogenomic datasets derived from alternative high- and low-quality assembly inputs in a controlled experiment, we show that high-quality transcriptomes produce richer phylogenomic datasets with a greater number of unique partitions than low-quality assemblies. High-quality assemblies also give rise to partitions that have lower alignment ambiguity and less compositional bias. In addition, high-quality partitions hold stronger phylogenetic signal than their low-quality transcriptome assembly counterparts in both concatenation- and coalescent-based analyses. CONCLUSIONS Our findings demonstrate the importance of transcriptome assembly quality in phylogenomic analyses and suggest that a portion of the uncertainty observed in such studies could be alleviated at the assembly stage.
Collapse
Affiliation(s)
- Jennifer L Spillane
- Molecular, Cellular, and Biomedical Sciences Department, University of New Hampshire, Durham, NH, 03824, USA.
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, 03824, USA.
| | - Troy M LaPolice
- Molecular, Cellular, and Biomedical Sciences Department, University of New Hampshire, Durham, NH, 03824, USA
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, 03824, USA
| | - Matthew D MacManes
- Molecular, Cellular, and Biomedical Sciences Department, University of New Hampshire, Durham, NH, 03824, USA
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, 03824, USA
| | - David C Plachetzki
- Molecular, Cellular, and Biomedical Sciences Department, University of New Hampshire, Durham, NH, 03824, USA.
- Hubbard Center for Genome Studies, University of New Hampshire, Durham, NH, 03824, USA.
| |
Collapse
|