1
|
Zhang G, Yang J, Zhang C, Jiao B, Panero JL, Cai J, Zhang ZR, Gao LM, Gao T, Ma H. Nuclear phylogenomics of Asteraceae with increased sampling provides new insights into convergent morphological and molecular evolution. PLANT COMMUNICATIONS 2024; 5:100851. [PMID: 38409784 PMCID: PMC11211554 DOI: 10.1016/j.xplc.2024.100851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 01/22/2024] [Accepted: 02/21/2024] [Indexed: 02/28/2024]
Abstract
Convergent morphological evolution is widespread in flowering plants, and understanding this phenomenon relies on well-resolved phylogenies. Nuclear phylogenetic reconstruction using transcriptome datasets has been successful in various angiosperm groups, but it is limited to taxa with available fresh materials. Asteraceae, which are one of the two largest angiosperm families and are important for both ecosystems and human livelihood, show multiple examples of convergent evolution. Nuclear Asteraceae phylogenies have resolved relationships among most subfamilies and many tribes, but many phylogenetic and evolutionary questions regarding subtribes and genera remain, owing to limited sampling. Here, we increased the sampling for Asteraceae phylogenetic reconstruction using transcriptomes and genome-skimming datasets and produced nuclear phylogenetic trees with 706 species representing two-thirds of recognized subtribes. Ancestral character reconstruction supports multiple convergent evolutionary events in Asteraceae, with gains and losses of bilateral floral symmetry correlated with diversification of some subfamilies and smaller groups, respectively. Presence of the calyx-related pappus may have been especially important for the success of some subtribes and genera. Molecular evolutionary analyses support the likely contribution of duplications of MADS-box and TCP floral regulatory genes to innovations in floral morphology, including capitulum inflorescences and bilaterally symmetric flowers, potentially promoting the diversification of Asteraceae. Subsequent divergences and reductions in CYC2 gene expression are related to the gain and loss of zygomorphic flowers. This phylogenomic work with greater taxon sampling through inclusion of genome-skimming datasets reveals the feasibility of expanded evolutionary analyses using DNA samples for understanding convergent evolution.
Collapse
Affiliation(s)
- Guojin Zhang
- College of Life Sciences, Hunan Normal University, Changsha, Hunan 410081, China; Department of Biology, the Huck Institute of the Life Sciences, the Pennsylvania State University, State College, PA 16801, USA; State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Junbo Yang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Caifei Zhang
- Wuhan Botanical Garden and Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
| | - Bohan Jiao
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - José L Panero
- Department of Integrative Biology, University of Texas, Austin, TX 78712, USA
| | - Jie Cai
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Zhi-Rong Zhang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Lian-Ming Gao
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China; Lijiang National Forest Biodiversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, Yunnan 674100, China.
| | - Tiangang Gao
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China.
| | - Hong Ma
- Department of Biology, the Huck Institute of the Life Sciences, the Pennsylvania State University, State College, PA 16801, USA.
| |
Collapse
|
2
|
Hu XZ, Guo C, Qin SY, Li DZ, Guo ZH. Deep genome skimming reveals the hybrid origin of Pseudosasa gracilis (Poaceae: Bambusoideae). PLANT DIVERSITY 2024; 46:344-352. [PMID: 38798728 PMCID: PMC11119509 DOI: 10.1016/j.pld.2023.06.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 05/24/2023] [Accepted: 06/02/2023] [Indexed: 05/29/2024]
Abstract
Pseudosasa gracilis (Poaceae: Bambusoideae) is a temperate woody bamboo species endemic to South-central China with a narrow distribution. Previous phylogenetic studies revealed an unexpected, isolated phylogenetic position of Ps. gracilis. Here we conducted phylogenomic analysis by sampling populations of Ps. gracilis and its sympatric species Ps. nanunica and Sinosasa polytricha reflecting different genomic signals, by deep genome skimming. Integrating molecular evidence from chloroplast genes and genome-wide SNPs, we deciphered the phylogenetic relationships of Ps. gracilis. Both plastid and nuclear data indicate that Ps. gracilis is more closely related to Sinosasa, which is discordant with the taxonomic treatment. To further explore this molecular-morphological conflict, we screened 411 "perfect-copy" syntenic genes to reconstruct phylogenies using both the concatenation and coalescent methods. We observed extensive discordance between gene trees and the putative species tree. A significant hybridization event was detected based on 411 genes from the D subgenome, showing Ps. gracilis was a hybrid descendant between Sinosasa longiligulata and Ps. nanunica, with 63.56% and 36.44% inheritance probabilities of each parent. Moreover, introgression events were detected in the C subgenome between Ps. gracilis and S. polytricha in the same distribution region. Our findings suggest that sympatric hybridization and introgression play a crucial role in the origin of Ps. gracilis. By providing an empirical example of bamboo of hybrid origin using comprehensive analyses based on genomic data from different inheritance systems and morphological characters, our study represents a step forward in understanding of reticulate evolution of bamboos.
Collapse
Affiliation(s)
- Xiang-Zhou Hu
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Cen Guo
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Sheng-Yuan Qin
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhen-Hua Guo
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
3
|
Zhang G, Ma H. Nuclear phylogenomics of angiosperms and insights into their relationships and evolution. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2024; 66:546-578. [PMID: 38289011 DOI: 10.1111/jipb.13609] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 01/03/2024] [Indexed: 02/06/2024]
Abstract
Angiosperms (flowering plants) are by far the most diverse land plant group with over 300,000 species. The sudden appearance of diverse angiosperms in the fossil record was referred to by Darwin as the "abominable mystery," hence contributing to the heightened interest in angiosperm evolution. Angiosperms display wide ranges of morphological, physiological, and ecological characters, some of which have probably influenced their species richness. The evolutionary analyses of these characteristics help to address questions of angiosperm diversification and require well resolved phylogeny. Following the great successes of phylogenetic analyses using plastid sequences, dozens to thousands of nuclear genes from next-generation sequencing have been used in angiosperm phylogenomic analyses, providing well resolved phylogenies and new insights into the evolution of angiosperms. In this review we focus on recent nuclear phylogenomic analyses of large angiosperm clades, orders, families, and subdivisions of some families and provide a summarized Nuclear Phylogenetic Tree of Angiosperm Families. The newly established nuclear phylogenetic relationships are highlighted and compared with previous phylogenetic results. The sequenced genomes of Amborella, Nymphaea, Chloranthus, Ceratophyllum, and species of monocots, Magnoliids, and basal eudicots, have facilitated the phylogenomics of relationships among five major angiosperms clades. All but one of the 64 angiosperm orders were included in nuclear phylogenomics with well resolved relationships except the placements of several orders. Most families have been included with robust and highly supported placements, especially for relationships within several large and important orders and families. Additionally, we examine the divergence time estimation and biogeographic analyses of angiosperm on the basis of the nuclear phylogenomic frameworks and discuss the differences compared with previous analyses. Furthermore, we discuss the implications of nuclear phylogenomic analyses on ancestral reconstruction of morphological, physiological, and ecological characters of angiosperm groups, limitations of current nuclear phylogenomic studies, and the taxa that require future attention.
Collapse
Affiliation(s)
- Guojin Zhang
- College of Life Sciences, Hunan Normal University, Changsha, 410081, China
- Department of Biology, 510 Mueller Laboratory, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Hong Ma
- Department of Biology, 510 Mueller Laboratory, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA
| |
Collapse
|
4
|
Pezzini FF, Ferrari G, Forrest LL, Hart ML, Nishii K, Kidner CA. Target capture and genome skimming for plant diversity studies. APPLICATIONS IN PLANT SCIENCES 2023; 11:e11537. [PMID: 37601316 PMCID: PMC10439825 DOI: 10.1002/aps3.11537] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 06/16/2023] [Accepted: 07/10/2023] [Indexed: 08/22/2023]
Abstract
Recent technological advances in long-read high-throughput sequencing and assembly methods have facilitated the generation of annotated chromosome-scale whole-genome sequence data for evolutionary studies; however, generating such data can still be difficult for many plant species. For example, obtaining high-molecular-weight DNA is typically impossible for samples in historical herbarium collections, which often have degraded DNA. The need to fast-freeze newly collected living samples to conserve high-quality DNA can be complicated when plants are only found in remote areas. Therefore, short-read reduced-genome representations, such as target capture and genome skimming, remain important for evolutionary studies. Here, we review the pros and cons of each technique for non-model plant taxa. We provide guidance related to logistics, budget, the genomic resources previously available for the target clade, and the nature of the study. Furthermore, we assess the available bioinformatic analyses, detailing best practices and pitfalls, and suggest pathways to combine newly generated data with legacy data. Finally, we explore the possible downstream analyses allowed by the type of data generated using each technique. We provide a practical guide to help researchers make the best-informed choice regarding reduced genome representation for evolutionary studies of non-model plants in cases where whole-genome sequencing remains impractical.
Collapse
Affiliation(s)
| | - Giada Ferrari
- Royal Botanic Garden Edinburgh Edinburgh United Kingdom
| | | | | | - Kanae Nishii
- Royal Botanic Garden Edinburgh Edinburgh United Kingdom
| | - Catherine A Kidner
- Royal Botanic Garden Edinburgh Edinburgh United Kingdom
- School of Biological Sciences University of Edinburgh Edinburgh United Kingdom
| |
Collapse
|
5
|
Liu SH, Hung KH, Hsu TW, Hoch PC, Peng CI, Chiang TY. New insights into polyploid evolution and dynamic nature of Ludwigia section Isnardia (Onagraceae). BOTANICAL STUDIES 2023; 64:14. [PMID: 37269434 DOI: 10.1186/s40529-023-00387-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 05/17/2023] [Indexed: 06/05/2023]
Abstract
BACKGROUND While polyploids are common in plants, the evolutionary history and natural dynamics of most polyploid groups are still unclear. Owing to plentiful earlier systematic studies, Ludwigia sect. Isnardia (comprising 22 wetland taxa) is an ideal allopolyploid complex to investigate polyploid evolution and natural dynamics within and among taxa. With a considerable sampling, we concentrated on revisiting earlier phylogenies of Isnardia, reevaluating the earlier estimated age of the most recent common ancestor (TMRCA), exploring the correlation between infraspecific genetic diversity and ploidy levels, and inspecting interspecific gene flows among taxa. RESULTS Phylogenetic trees and network concurred with earlier phylogenies and hypothesized genomes by incorporating 192 atpB-rbcL and ITS sequences representing 91% of Isnardia taxa. Moreover, we detected three multi-origin taxa. Our findings on L. repens and L. sphaerocarpa were consistent with earlier studies; L. arcuata was reported as a multi-origin taxon here, and an additional evolutionary scenario of L. sphaerocarpa was uncovered, both for the first time. Furthermore, estimated Isnardia TMRCA ages based on our data (5.9 or 8.9 million years ago) are in accordance with earlier estimates, although younger than fossil dates (Middle Miocene). Surprisingly, infraspecific genetic variations of Isnardia taxa did not increase with ploidy levels as anticipated from many other polyploid groups. In addition, the exuberant, low, and asymmetrical gene flows among Isnardia taxa indicated that the reproductive barriers may be weakened owing to allopolyploidization, which has rarely been reported. CONCLUSIONS The present research gives new perceptions of the reticulate evolution and dynamic nature of Isnardia and points to gaps in current knowledge about allopolyploid evolution.
Collapse
Affiliation(s)
- Shih-Hui Liu
- Department of Biological Sciences, National Sun Yat-Sen University, Kaohsiung, 804, Taiwan
| | - Kuo-Hsiang Hung
- Graduate Institute of Bioresources, National Pingtung University of Science and Technology, Pingtung, 912, Taiwan
| | - Tsai-Wen Hsu
- Endemic Species Research Institute, Nantou, 552, Taiwan
| | - Peter C Hoch
- Missouri Botanical Garden, St. Louis, MO, 63166, USA
| | - Ching-I Peng
- Biodiversity Research Center, Academia Sinica, Taipei, 11529, Taiwan
| | - Tzen-Yuh Chiang
- Department of Life Sciences, National Cheng Kung University, Tainan, 701, Taiwan.
| |
Collapse
|
6
|
Pouchon C, Boluda CG. REFMAKER: make your own reference to target nuclear loci in low coverage genome skimming libraries. Phylogenomic application in Sapotaceae. Mol Phylogenet Evol 2023:107826. [PMID: 37257798 DOI: 10.1016/j.ympev.2023.107826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 04/24/2023] [Accepted: 05/25/2023] [Indexed: 06/02/2023]
Abstract
Genome skimming approach is widely used in plant systematics to infer phylogenies mostly from organelle genomes. However, organelles represent only 10% of the produced libraries, and the low coverage associated with these libraries (< 3X) prevents the capture of nuclear sequences, which are not always available in non-model organisms or limited to the ribosomal regions. We developed REFMAKER, a user-friendly pipeline, to create specific sets of nuclear loci that can next be extracted directly from the genome skimming libraries. For this, a catalogue is built from the meta-assembly of each library contigs and cleaned by selecting the nuclear regions and removing duplicates from clustering steps. Libraries are next mapped onto this catalogue and consensus sequences are generated to produce a ready-to-use phylogenetic matrix following different filtering parameters aiming at removing putative errors and paralogous sequences. REFMAKER allowed us to infer a well resolved phylogeny in Capurodendron (Sapotaceae) on 67 nuclear loci from low-coverage libraries (<1X). The resulting phylogeny is concomitant with one previously inferred on 638 nuclear genes from target enrichment libraries. While it remains preliminary because of this low sequencing depth, REFMAKER therefore opens perspectives in phylogenomics by allowing nuclear phylogeny reconstructions with genome skimming datasets.
Collapse
Affiliation(s)
- Charles Pouchon
- Conservatoire et Jardin botaniques de la Ville de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland; PhyloLab, Department of Plant Sciences, Université de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland.
| | - Carlos G Boluda
- Conservatoire et Jardin botaniques de la Ville de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland; PhyloLab, Department of Plant Sciences, Université de Genève, Chemin de l'Impératrice 1, 1292 Chambésy, Geneva, Switzerland
| |
Collapse
|
7
|
Guo C, Luo Y, Gao LM, Yi TS, Li HT, Yang JB, Li DZ. Phylogenomics and the flowering plant tree of life. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023; 65:299-323. [PMID: 36416284 DOI: 10.1111/jipb.13415] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 11/22/2022] [Indexed: 06/16/2023]
Abstract
The advances accelerated by next-generation sequencing and long-read sequencing technologies continue to provide an impetus for plant phylogenetic study. In the past decade, a large number of phylogenetic studies adopting hundreds to thousands of genes across a wealth of clades have emerged and ushered plant phylogenetics and evolution into a new era. In the meantime, a roadmap for researchers when making decisions across different approaches for their phylogenomic research design is imminent. This review focuses on the utility of genomic data (from organelle genomes, to both reduced representation sequencing and whole-genome sequencing) in phylogenetic and evolutionary investigations, describes the baseline methodology of experimental and analytical procedures, and summarizes recent progress in flowering plant phylogenomics at the ordinal, familial, tribal, and lower levels. We also discuss the challenges, such as the adverse impact on orthology inference and phylogenetic reconstruction raised from systematic errors, and underlying biological factors, such as whole-genome duplication, hybridization/introgression, and incomplete lineage sorting, together suggesting that a bifurcating tree may not be the best model for the tree of life. Finally, we discuss promising avenues for future plant phylogenomic studies.
Collapse
Affiliation(s)
- Cen Guo
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Yang Luo
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Lian-Ming Gao
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- Lijiang Forest Diversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, 674100, China
| | - Ting-Shuang Yi
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Hong-Tao Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Jun-Bo Yang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- Lijiang Forest Diversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, 674100, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, China
| |
Collapse
|
8
|
Reginato M. A pipeline for assembling low copy nuclear markers from plant genome skimming data for phylogenetic use. PeerJ 2022; 10:e14525. [PMID: 36523475 PMCID: PMC9745922 DOI: 10.7717/peerj.14525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 11/15/2022] [Indexed: 12/12/2022] Open
Abstract
Background Genome skimming is a popular method in plant phylogenomics that do not include a biased enrichment step, relying on random shallow sequencing of total genomic DNA. From these data the plastome is usually readily assembled and constitutes the bulk of phylogenetic information generated in these studies. Despite a few attempts to use genome skims to recover low copy nuclear loci for direct phylogenetic use, such endeavor remains neglected. Causes might include the trade-off between libraries with few reads and species with large genomes (i.e., missing data caused by low coverage), but also might relate to the lack of pipelines for data assembling. Methods A pipeline and its companion R package designed to automate the recovery of low copy nuclear markers from genome skimming libraries are presented. Additionally, a series of analyses aiming to evaluate the impact of key assembling parameters, reference selection and missing data are presented. Results A substantial amount of putative low copy nuclear loci was assembled and proved useful to base phylogenetic inference across the libraries tested (4 to 11 times more data than previously assembled plastomes from the same libraries). Discussion Critical aspects of assembling low copy nuclear markers from genome skims include the minimum coverage and depth of a sequence to be used. More stringent values of these parameters reduces the amount of assembled data and increases the relative amount of missing data, which can compromise phylogenetic inference, in turn relaxing the same parameters might increase sequence error. These issues are discussed in the text, and parameter tuning through multiple comparisons tracking their effects on support and congruence is highly recommended when using this pipeline. The skimmingLoci pipeline (https://github.com/mreginato/skimmingLoci) might stimulate the use of genome skims to recover nuclear loci for direct phylogenetic use, increasing the power of genome skimming data to resolve phylogenetic relationships, while reducing the amount of sequenced DNA that is commonly wasted.
Collapse
|
9
|
Xiao J, Lyu R, He J, Li M, Ji J, Cheng J, Xie L. Genome-partitioning strategy, plastid and nuclear phylogenomic discordance, and its evolutionary implications of Clematis (Ranunculaceae). FRONTIERS IN PLANT SCIENCE 2022; 13:1059379. [PMID: 36452086 PMCID: PMC9703796 DOI: 10.3389/fpls.2022.1059379] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Accepted: 10/24/2022] [Indexed: 06/17/2023]
Abstract
Clematis is one of the largest genera of Ranunculaceae with many phylogenetic problems left to be resolved. Clematis species have considerable genome size of more than 7 Gbp, and there was no whole-genome reference sequence published in this genus. This raises difficulties in acquiring nuclear genome data for its phylogenetic analysis. Previous studies based on Sanger sequencing data, plastid genome data, and nrDNA sequences did not well resolve the phylogeny of Clematis. In this study, we used genome skimming and transcriptome data to assemble the plastid genome sequences, nuclear single nucleotide polymorphisms (SNPs) datasets, and single-copy nuclear orthologous genes (SCOGs) to reconstruct the phylogenetic backbone of Clematis, and test effectiveness of these genome partitioning methods. We also further analyzed the discordance among nuclear gene trees and between plastid and nuclear phylogenies. The results showed that the SCOGs datasets, assembled from transcriptome method, well resolved the phylogenetic backbone of Clematis. The nuclear SNPs datasets from genome skimming method can also produce similar results with the SCOGs data. In contrast to the plastid phylogeny, the phylogeny resolved by nuclear genome data is more robust and better corresponds to morphological characters. Our results suggested that rapid species radiation may have generated high level of incomplete lineage sorting, which was the major cause of nuclear gene discordance. Our simulation also showed that there may have been frequent interspecific hybridization events, which led to some of the cyto-nuclear discordances in Clematis. This study not only provides the first robust phylogenetic backbone of Clematis based on nuclear genome data, but also provides suggestions of genome partitioning strategies for the phylogenomic study of other plant taxa.
Collapse
Affiliation(s)
- Jiamin Xiao
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Rudan Lyu
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Jian He
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Mingyang Li
- College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Jiaxin Ji
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| | - Jin Cheng
- College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Lei Xie
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| |
Collapse
|
10
|
Pouchon C, Boyer F, Roquet C, Denoeud F, Chave J, Coissac E, Alsos IG, Lavergne S. ORTHOSKIM: in silico sequence capture from genomic and transcriptomic libraries for phylogenomic and barcoding applications. Mol Ecol Resour 2022; 22:2018-2037. [PMID: 35015377 DOI: 10.1111/1755-0998.13584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 12/08/2021] [Accepted: 01/05/2022] [Indexed: 11/29/2022]
Abstract
Low-coverage whole genome shotgun sequencing (or genome skimming) has emerged as a cost-effective method for acquiring genomic data in non-model organisms. This method provides sequence information on chloroplast genome (cpDNA), mitochondrial genome (mtDNA) and nuclear ribosomal regions (rDNA), which are over-represented within cells. However, numerous bioinformatic challenges remain to accurately and rapidly obtain such data in organisms with complex genomic structures and rearrangements, in particular for mtDNA in plants or for cpDNA in some plant families. Here we introduce the pipeline ORTHOSKIM, which performs in silico capture of targeted sequences from genomic and transcriptomic libraries without assembling whole organelle genomes. ORTHOSKIM proceeds in three steps: 1) global sequence assembly, 2) mapping against reference sequences, and 3) target sequence extraction; importantly it also includes a range of quality control tests. Different modes are implemented to capture both coding and non-coding regions of cpDNA, mtDNA and rDNA sequences, along with predefined nuclear sequences (e.g. ultra-conserved elements) or collections of single-copy ortholog genes. Moreover, aligned DNA matrices are produced for phylogenetic reconstructions, by performing multiple alignments of the captured sequences. While ORTHOSKIM is suitable for any eukaryote, a case study is presented here, using 114 genome-skimming libraries and 4 RNAseq libraries obtained for two plant families, Primulaceae and Ericaceae, the latter being a well-known problematic family for cpDNA assemblies. ORTHOSKIM recovered with high success rates cpDNA, mtDNA and rDNA sequences, well suited to accurately infer evolutionary relationships within these families. ORTHOSKIM is released under a GPL-3 license and is available at: https://github.com/cpouchon/ORTHOSKIM.
Collapse
Affiliation(s)
- Charles Pouchon
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| | - Frédéric Boyer
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| | - Cristina Roquet
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France.,Systematics and Evolution of Vascular Plants (UAB) - Associated Unit to CSIC, Departament de Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
| | - France Denoeud
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057, Evry, France
| | - Jérome Chave
- Laboratoire Évolution et Diversité Biologique (EDB), UMR CNRS-IRD-UPS 5174, 31062, Toulouse Cedex, France
| | - Eric Coissac
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| | - Inger Greve Alsos
- The Arctic University Museum of Norway, UiT - The Arctic University of Norway, NO-9037, Tromsø, Norway
| | | | | | - Sébastien Lavergne
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| |
Collapse
|
11
|
Larson DA, Vargas OM, Vicentini A, Dick CW. Admixture may be extensive among hyperdominant Amazon rainforest tree species. THE NEW PHYTOLOGIST 2021; 232:2520-2534. [PMID: 34389989 PMCID: PMC9292926 DOI: 10.1111/nph.17675] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 08/04/2021] [Indexed: 05/25/2023]
Abstract
Admixture is a mechanism by which species of long-lived plants may acquire novel alleles. However, the potential role of admixture in the origin and maintenance of tropical plant diversity is unclear. We ask whether admixture occurs in an ecologically important clade of Eschweilera (Parvifolia clade, Lecythidaceae), which includes some of the most widespread and abundant tree species in Amazonian forests. Using target capture sequencing, we conducted a detailed phylogenomic investigation of 33 species in the Parvifolia clade and investigated specific hypotheses of admixture within a robust phylogenetic framework. We found strong evidence of admixture among three ecologically dominant species, E. coriacea, E. wachenheimii and E. parviflora, but a lack of evidence for admixture among other lineages. Accepted species were largely distinguishable from one another, as was geographic structure within species. We show that hybridization may play a role in the evolution of the most widespread and ecologically variable Amazonian tree species. While admixture occurs among some species of Eschweilera, it has not led to widespread erosion of most species' genetic or morphological identities. Therefore, current morphological based species circumscriptions appear to provide a useful characterization of the clade's lineage diversity.
Collapse
Affiliation(s)
- Drew A. Larson
- Department of Ecology & Evolutionary BiologyUniversity of MichiganAnn ArborMI48109USA
| | - Oscar M. Vargas
- Department of Biological SciencesHumboldt State UniversityArcataCA95521USA
| | - Alberto Vicentini
- Instituto Nacional de Pesquisas da Amazônia (INPA)ManausAMCEP 69067‐375Brazil
| | - Christopher W. Dick
- Department of Ecology & Evolutionary BiologyUniversity of MichiganAnn ArborMI48109USA
- Smithsonian Tropical Research InstitutePanama CityRepublic of Panama
| |
Collapse
|
12
|
Schmitt S, Tysklind N, Hérault B, Heuertz M. Topography drives microgeographic adaptations of closely related species in two tropical tree species complexes. Mol Ecol 2021; 30:5080-5093. [PMID: 34387001 DOI: 10.1111/mec.16116] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 07/29/2021] [Accepted: 07/30/2021] [Indexed: 11/27/2022]
Abstract
Closely related tree species that grow in sympatry are abundant in rainforests. However, little is known of the eco-evolutionary processes that govern their niches and local coexistence. We assessed genetic species delimitation in closely related sympatric species belonging to two Neotropical tree species complexes and investigated their genomic adaptation to a fine-scale topographic gradient with associated edaphic and hydrologic features. Combining LiDAR-derived topography, tree inventories, and single nucleotide polymorphisms (SNPs) from gene capture experiments, we explored genome-wide population genetic structure, covariation of environmental variables, and genotype-environment association to assess microgeographic adaptations to topography within the species complexes Symphonia (Clusiaceae), and Eschweilera (Lecythidaceae) with three species per complex and 385 and 257 individuals genotyped, respectively. Within species complexes, closely related tree species had different realized optima for topographic niches defined through the topographic wetness index or the relative elevation, and species displayed genetic signatures of adaptations to these niches. Symphonia species were genetically differentiated along water and nutrient distribution particularly in genes responding to water deprivation, whereas Eschweilera species were genetically differentiated according to soil chemistry. Our results suggest that varied topography represents a powerful driver of processes modulating tropical forest biodiversity with differential adaptations that stabilize local coexistence of closely related tree species.
Collapse
Affiliation(s)
- Sylvain Schmitt
- CNRS, UMR EcoFoG (Agroparistech, Cirad, INRAE, Université des Antilles, Université de la Guyane), Campus Agronomique, 97310, Kourou, French Guiana, France.,Univ. Bordeaux, INRAE, BIOGECO, 69 route d'Arcachon, 33610, Cestas, France
| | - Niklas Tysklind
- INRAE, UMR EcoFoG (Agroparistech, CNRS, Cirad, Université des Antilles, Université de la Guyane), Campus Agronomique, 97310, Kourou, French Guiana, France
| | - Bruno Hérault
- CIRAD, UR Forêts et Sociétés, Yamoussoukro Côte d'Ivoire, France.,Forêts et Sociétés, Univ Montpellier, CIRAD, Montpellier, France.,Institut National Polytechnique Félix Houphouët-Boigny, INP-HB, Yamoussoukro Côte d'Ivoire, France
| | - Myriam Heuertz
- Univ. Bordeaux, INRAE, BIOGECO, 69 route d'Arcachon, 33610, Cestas, France
| |
Collapse
|
13
|
An updated phylogenetic and biogeographic analysis based on genome skimming data reveals convergent evolution of shrubby habit in Clematis in the Pliocene and Pleistocene. Mol Phylogenet Evol 2021; 164:107259. [PMID: 34303792 DOI: 10.1016/j.ympev.2021.107259] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 03/17/2021] [Accepted: 07/08/2021] [Indexed: 02/08/2023]
Abstract
Convergent evolution, often viewed as the inevitable outcome of natural selection, has received special attention since the time of Darwin. Clematis is well known for its climbing habit, but it has some shrubby species, known as sect. Fruticella s.l. The shrubby Clematis species are distributed in the dry habitats of Central Asia and adjacent areas showing possible convergent evolution. In this study, we assembled the complete plastome and nuclear ribosomal DNA (nrDNA) sequences of 56 Clematis species, representing most sections and covering most of the shrubby species, to reconstruct their evolutionary histories. Using both maximum likelihood and Bayesian methods, the plastome and nrDNA datasets generated similar, but not identical, phylogenetic relationships, which are better resolved than in previous studies. Then, molecular dating, historical range reconstruction, and character optimization analyses were conducted based on this updated phylogenetic framework. All the morphological characters widely used for taxonomy were shown to have evolved multiple times. Molecular dating inferred that Clematis diverged from its sister in the mid Miocene, and all six major clades of Clematis originated during the late Miocene, with a species radiation during the Pliocene to Pleistocene. The results clearly showed that the shrubby habit evolved independently in four lineages of Clematis in Asia. We also revealed that the shrubby lineages have emerged since the very beginning of Pliocene. Asian monsoon variation in the Pliocene and glacial period fluctuation in the Pleistocene may be the driving forces for the origin and diversification of the shrubby Clematis in Central Asia and adjacent dry areas.
Collapse
|
14
|
Vargas OM, Goldston B, Grossenbacher DL, Kay KM. Patterns of speciation are similar across mountainous and lowland regions for a Neotropical plant radiation (Costaceae: Costus). Evolution 2020; 74:2644-2661. [PMID: 33047821 DOI: 10.1111/evo.14108] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Revised: 08/28/2020] [Accepted: 09/21/2020] [Indexed: 01/05/2023]
Abstract
High species richness and endemism in tropical mountains are recognized as major contributors to the latitudinal diversity gradient. The processes underlying mountain speciation, however, are largely untested. The prevalence of steep ecogeographic gradients and the geographic isolation of populations by topographic features are predicted to promote speciation in mountains. We evaluate these processes in a species-rich Neotropical genus of understory herbs that range from the lowlands to montane forests and have higher species richness in topographically complex regions. We ask whether climatic niche divergence, geographic isolation, and pollination shifts differ between mountain-influenced and lowland Amazonian sister pairs inferred from a 756-gene phylogeny. Neotropical Costus ancestors diverged in Central America during a period of mountain formation in the last 3 million years with later colonization of Amazonia. Although climatic divergence, geographic isolation, and pollination shifts are prevalent in general, these factors do not differ between mountain-influenced and Amazonian sister pairs. Despite higher climatic niche and species diversity in the mountains, speciation modes in Costus appear similar across regions. Thus, greater species richness in tropical mountains may reflect differences in colonization history, diversification rates, or the prevalence of rapidly evolving plant life forms, rather than differences in speciation mode.
Collapse
Affiliation(s)
- Oscar M Vargas
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, California, 95060.,Department of Biological Sciences, Humboldt State University, Arcata, California, 95521
| | - Brittany Goldston
- Department of Biology, California Polytechnic State University, San Luis Obispo, California, 93401
| | - Dena L Grossenbacher
- Department of Biology, California Polytechnic State University, San Luis Obispo, California, 93401
| | - Kathleen M Kay
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, California, 95060
| |
Collapse
|
15
|
Jantzen JR, Amarasinghe P, Folk RA, Reginato M, Michelangeli FA, Soltis DE, Cellinese N, Soltis PS. A two-tier bioinformatic pipeline to develop probes for target capture of nuclear loci with applications in Melastomataceae. APPLICATIONS IN PLANT SCIENCES 2020; 8:e11345. [PMID: 32477841 PMCID: PMC7249273 DOI: 10.1002/aps3.11345] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 12/20/2019] [Indexed: 05/21/2023]
Abstract
PREMISE Putatively single-copy nuclear (SCN) loci, which are identified using genomic resources of closely related species, are ideal for phylogenomic inference. However, suitable genomic resources are not available for many clades, including Melastomataceae. We introduce a versatile approach to identify SCN loci for clades with few genomic resources and use it to develop probes for target enrichment in the distantly related Memecylon and Tibouchina (Melastomataceae). METHODS We present a two-tiered pipeline. First, we identified putatively SCN loci using MarkerMiner and transcriptomes from distantly related species in Melastomataceae. Published loci and genes of functional significance were then added (384 total loci). Second, using HybPiper, we retrieved 689 homologous template sequences for these loci using genome-skimming data from within the focal clades. RESULTS We sequenced 193 loci common to Memecylon and Tibouchina. Probes designed from 56 template sequences successfully targeted sequences in both clades. Probes designed from genome-skimming data within a focal clade were more successful than probes designed from other sources. DISCUSSION Our pipeline successfully identified and targeted SCN loci in Memecylon and Tibouchina, enabling phylogenomic studies in both clades and potentially across Melastomataceae. This pipeline could be easily applied to other clades with few genomic resources.
Collapse
Affiliation(s)
- Johanna R. Jantzen
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Prabha Amarasinghe
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Ryan A. Folk
- Department of Biological SciencesMississippi State UniversityStarkvilleMississippi39762USA
| | - Marcelo Reginato
- Institute of Systematic BotanyThe New York Botanical GardenBronxNew York10458USA
- Universidade Federal do Rio Grande do SulPorto AlegreRio Grande do Sul90040‐060Brazil
| | | | - Douglas E. Soltis
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Nico Cellinese
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Pamela S. Soltis
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| |
Collapse
|
16
|
Larson DA, Walker JF, Vargas OM, Smith SA. A consensus phylogenomic approach highlights paleopolyploid and rapid radiation in the history of Ericales. AMERICAN JOURNAL OF BOTANY 2020; 107:773-789. [PMID: 32350864 DOI: 10.1002/ajb2.1469] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Accepted: 02/12/2020] [Indexed: 05/27/2023]
Abstract
PREMISE Large genomic data sets offer the promise of resolving historically recalcitrant species relationships. However, different methodologies can yield conflicting results, especially when clades have experienced ancient, rapid diversification. Here, we analyzed the ancient radiation of Ericales and explored sources of uncertainty related to species tree inference, conflicting gene tree signal, and the inferred placement of gene and genome duplications. METHODS We used a hierarchical clustering approach, with tree-based homology and orthology detection, to generate six filtered phylogenomic matrices consisting of data from 97 transcriptomes and genomes. Support for species relationships was inferred from multiple lines of evidence including shared gene duplications, gene tree conflict, gene-wise edge-based analyses, concatenation, and coalescent-based methods, and is summarized in a consensus framework. RESULTS Our consensus approach supported a topology largely concordant with previous studies, but suggests that the data are not capable of resolving several ancient relationships because of lack of informative characters, sensitivity to methodology, and extensive gene tree conflict correlated with paleopolyploidy. We found evidence of a whole-genome duplication before the radiation of all or most ericalean families, and demonstrate that tree topology and heterogeneous evolutionary rates affect the inferred placement of genome duplications. CONCLUSIONS We provide several hypotheses regarding the history of Ericales, and confidently resolve most nodes, but demonstrate that a series of ancient divergences are unresolvable with these data. Whether paleopolyploidy is a major source of the observed phylogenetic conflict warrants further investigation.
Collapse
Affiliation(s)
- Drew A Larson
- Department of Ecology & Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Joseph F Walker
- Sainsbury Laboratory (SLCU), University of Cambridge, Cambridge, CB2 1LR, UK
| | - Oscar M Vargas
- Department of Ecology & Evolutionary Biology, University of California, Santa Cruz, CA, 95060, USA
| | - Stephen A Smith
- Department of Ecology & Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| |
Collapse
|
17
|
Diversification History of Neotropical Lecythidaceae, an Ecologically Dominant Tree Family of Amazon Rain Forest. NEOTROPICAL DIVERSIFICATION: PATTERNS AND PROCESSES 2020. [DOI: 10.1007/978-3-030-31167-4_29] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/07/2022]
|
18
|
Morais EB, Schönenberger J, Conti E, Antonelli A, Szövényi P. Orthologous nuclear markers and new transcriptomes that broadly cover the phylogenetic diversity of Acanthaceae. APPLICATIONS IN PLANT SCIENCES 2019; 7:e11290. [PMID: 31572631 PMCID: PMC6764435 DOI: 10.1002/aps3.11290] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Accepted: 08/02/2019] [Indexed: 05/23/2023]
Abstract
PREMISE Information on orthologous groups of genes, their sequence variability, and annotation is required for project design in phylogenetic reconstruction. This resource is unavailable for the flowering plant family Acanthaceae (>4000 species). METHODS We compared transcriptome sequences spanning the extant diversity of Acanthaceae in order to provide a set of orthologous low-copy nuclear genes and assess their utility for reconstructing phylogenetic relationships within this group of plants. RESULTS We present new transcriptome assemblies for eight species representing all major clades of Acanthaceae. The assemblies of five of these species are entirely based on new sequence data. Of these five species, three are from subfamilies for which no genomic resources were previously available (Nelsonioideae and Thunbergioideae). These five new transcriptomes are more complete than all others from public databases. Furthermore, we provide alignments with sequence information, annotation, and statistics for potential phylogenetic utility of 1619 orthologous low-copy nuclear markers. DISCUSSION Our method of inferring assemblies from multiple pooled tissue samples delivers more complete transcriptomes than any available ones from Acanthaceae. We make available to the community new resources (e.g., sequence information, variability, and annotation of orthologous low-copy nuclear genes) that will help phylogenetic reconstruction in Acanthaceae.
Collapse
Affiliation(s)
- Erica B. Morais
- Department of Systematic and Evolutionary BotanyUniversity of Zurich8008ZurichSwitzerland
| | - Jürg Schönenberger
- Department of Botany and Biodiversity ResearchUniversity of ViennaRennweg 14A‐1030ViennaAustria
| | - Elena Conti
- Department of Systematic and Evolutionary BotanyUniversity of Zurich8008ZurichSwitzerland
| | - Alexandre Antonelli
- Gothenburg Global Biodiversity CentreBox 461SE 40530GöteborgSweden
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
| | - Péter Szövényi
- Department of Systematic and Evolutionary BotanyUniversity of Zurich8008ZurichSwitzerland
| |
Collapse
|