1
|
Wu H, Li DZ, Ma PF. Unprecedented variation pattern of plastid genomes and the potential role in adaptive evolution in Poales. BMC Biol 2024; 22:97. [PMID: 38679718 PMCID: PMC11057118 DOI: 10.1186/s12915-024-01890-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 04/16/2024] [Indexed: 05/01/2024] Open
Abstract
BACKGROUND The plastid is the photosynthetic organelle in plant cell, and the plastid genomes (plastomes) are generally conserved in evolution. As one of the most economically and ecologically important order of angiosperms, Poales was previously documented to exhibit great plastomic variation as an order of photoautotrophic plants. RESULTS We acquired 93 plastomes, representing all the 16 families and 5 major clades of Poales to reveal the extent of their variation and evolutionary pattern. Extensive variation including the largest one in monocots with 225,293 bp in size, heterogeneous GC content, and a wide variety of gene duplication and loss were revealed. Moreover, rare occurrences of three inverted repeat (IR) copies in angiosperms and one IR loss were observed, accompanied by short IR (sIR) and small direct repeat (DR). Widespread structural heteroplasmy, diversified inversions, and unusual genomic rearrangements all appeared in Poales, occasionally within a single species. Extensive repeats in the plastomes were found to be positively correlated with the observed inversions and rearrangements. The variation all showed a "small-large-moderate" trend along the evolution of Poales, as well as for the sequence substitution rate. Finally, we found some positively selected genes, mainly in C4 lineages, while the closely related lineages of those experiencing gene loss tended to have undergone more relaxed purifying selection. CONCLUSIONS The variation of plastomes in Poales may be related to its successful diversification into diverse habitats and multiple photosynthetic pathway transitions. Our order-scale analyses revealed unusual evolutionary scenarios for plastomes in the photoautotrophic order of Poales and provided new insights into the plastome evolution in angiosperms as a whole.
Collapse
Affiliation(s)
- Hong Wu
- Germplasm Bank of Wild Species and Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species and Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Peng-Fei Ma
- Germplasm Bank of Wild Species and Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
2
|
Wang T, Tang C, Chen J, Cao Z, He H, Li Y, Li X. Phylogenetic relationship and characterization of the complete chloroplast genome of Rhododendron przewalskii subsp. przewalskii Maximowicz 1877, an ornamental and medicinal plant in China. Mitochondrial DNA B Resour 2023; 8:998-1002. [PMID: 37746034 PMCID: PMC10512896 DOI: 10.1080/23802359.2023.2259040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 09/09/2023] [Indexed: 09/26/2023] Open
Abstract
As one of the top 10 famous flowers in Chinese tradition, Rhododendron przewalskii subsp. przewalskii known as 'beauty in flowers,' which has high ornamental and medicinal value. The complete chloroplast (cp) genome of R. przewalskii subsp. przewalskii was determined in this study. The complete chloroplast genome of R. przewalskii subsp. przewalskii was 201,233 bp in length and contained a large single-copy region (LSC, 108,077 bp), and a small single-copy region (SSC, 2624 bp) and a pair of inverted repeat regions (IRa and IRb, 45,266 bp). A total of 142 functional genes were observed in this cp genome, including 91 protein-coding genes (PCGs), 43 transfer RNA genes (tRNAs), and eight ribosomal RNA genes (rRNAs). The R. przewalskii subsp. przewalskii cp genome has an A + T content of 64.06% and presents a positive AT-skew (0.53%) and a negative GC-skew (-1.56%). The maximum likelihood phylogenetic analysis based on the concatenated nucleotide sequences of 13 PCGs strongly supported the monophyletic relationship of R. przewalskii subsp. przewalskii the clade of R. henanense subsp. lingbaoense. This study provides genomic evidence for the vegetation classification of Rhododendron.
Collapse
Affiliation(s)
- Tao Wang
- Qinghai Academy of Animal and Veterinary Science, Qinghai University, Xining, China
| | - Chuyu Tang
- Qinghai Academy of Animal and Veterinary Science, Qinghai University, Xining, China
| | - Jianbo Chen
- Qinghai Academy of Animal and Veterinary Science, Qinghai University, Xining, China
| | - Zhengfei Cao
- Qinghai Academy of Animal and Veterinary Science, Qinghai University, Xining, China
| | - Hui He
- Qinghai Academy of Animal and Veterinary Science, Qinghai University, Xining, China
| | - Yuling Li
- Qinghai Academy of Animal and Veterinary Science, Qinghai University, Xining, China
| | - Xiuzhang Li
- Qinghai Academy of Animal and Veterinary Science, Qinghai University, Xining, China
| |
Collapse
|
3
|
Zhou SM, Wang F, Yan SY, Zhu ZM, Gao XF, Zhao XL. Phylogenomics and plastome evolution of Indigofera (Fabaceae). FRONTIERS IN PLANT SCIENCE 2023; 14:1186598. [PMID: 37346129 PMCID: PMC10280451 DOI: 10.3389/fpls.2023.1186598] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 05/10/2023] [Indexed: 06/23/2023]
Abstract
Introduction Indigofera L. is the third largest genus in Fabaceae and includes economically important species that are used for indigo dye-producing, medicinal, ornamental, and soil and water conservation. The genus is taxonomically difficult due to the high level of overlap in morphological characters of interspecies, fewer reliability states for classification, and extensive adaptive evolution. Previous characteristic-based taxonomy and nuclear ITS-based phylogenies have contributed to our understanding of Indigofera taxonomy and evolution. However, the lack of chloroplast genomic resources limits our comprehensive understanding of the phylogenetic relationships and evolutionary processes of Indigofera. Methods Here, we newly assembled 18 chloroplast genomes of Indigofera. We performed a series of analyses of genome structure, nucleotide diversity, phylogenetic analysis, species pairwise Ka/Ks ratios, and positive selection analysis by combining with allied species in Papilionoideae. Results and discussion The chloroplast genomes of Indigofera exhibited highly conserved structures and ranged in size from 157,918 to 160,040 bp, containing 83 protein-coding genes, 37 tRNA genes, and eight rRNA genes. Thirteen highly variable regions were identified, of which trnK-rbcL, ndhF-trnL, and ycf1 were considered as candidate DNA barcodes for species identification of Indigofera. Phylogenetic analysis using maximum likelihood (ML) and Bayesian inference (BI) methods based on complete chloroplast genome and protein-coding genes (PCGs) generated a well-resolved phylogeny of Indigofera and allied species. Indigofera monophyly was strongly supported, and four monophyletic lineages (i.e., the Pantropical, East Asian, Tethyan, and Palaeotropical clades) were resolved within the genus. The species pairwise Ka/Ks ratios showed values lower than 1, and 13 genes with significant posterior probabilities for codon sites were identified in the positive selection analysis using the branch-site model, eight of which were associated with photosynthesis. Positive selection of accD suggested that Indigofera species have experienced adaptive evolution to selection pressures imposed by their herbivores and pathogens. Our study provided insight into the structural variation of chloroplast genomes, phylogenetic relationships, and adaptive evolution in Indigofera. These results will facilitate future studies on species identification, interspecific and intraspecific delimitation, adaptive evolution, and the phylogenetic relationships of the genus Indigofera.
Collapse
Affiliation(s)
- Sheng-Mao Zhou
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, College of Forestry, Southwest Forestry University, Kunming, China
| | - Fang Wang
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, College of Forestry, Southwest Forestry University, Kunming, China
| | - Si-Yuan Yan
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, College of Forestry, Southwest Forestry University, Kunming, China
| | - Zhang-Ming Zhu
- School of Ecology and Environmental Science and Yunnan Key Laboratory for Plateau Mountain Ecology and Restoration of Degraded Environments, Yunnan University, Kunming, China
| | - Xin-Fen Gao
- Chinese Academy of Sciences (CAS) Key Laboratory of Mountain Ecological Restoration and Bioresource Utilization and Ecological Restoration and Biodiversity Conservation Key Laboratory of Sichuan Province, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | - Xue-Li Zhao
- Key Laboratory for Forest Resources Conservation and Utilization in the Southwest Mountains of China, Ministry of Education, College of Forestry, Southwest Forestry University, Kunming, China
| |
Collapse
|
4
|
Pereira R, Anjos I, Reis J, Dias C, Leitão J. Next-Generation Sequencing (NGS) Identified Species-Specific SSR and SNP Markers, Allow the Unequivocal Identification of Strawberry Tree ( Arbutus unedo L.) Germplasm Accessions and Contribute to Assess Their Genetic Relationships. PLANTS (BASEL, SWITZERLAND) 2023; 12:1517. [PMID: 37050143 PMCID: PMC10096993 DOI: 10.3390/plants12071517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Revised: 03/23/2023] [Accepted: 03/25/2023] [Indexed: 06/19/2023]
Abstract
The strawberry tree (Arbutus unedo L.), an evergreen bush to small tree of the Ericaceae family, is a main component of the natural flora of the Mediterranean basin that also grows profusely through the Iberian Peninsula, southwestern France, and Ireland. The small edible red fruits are usually used to produce preserves, jams, and liquors, as the Portuguese "aguardente de medronho". The leaves and fruits have been used for a long time in traditional medicine, and their bioactive compounds are presently the subject of intense research. A strawberry tree germplasm collection was recently established by the company Corte Velada (Odiáxere, Portugal). A set of 50 germplasm accessions was selected for a breeding program. A next-generation sequencing project was performed, resulting in the establishment of the first strawberry tree genome assembly and further identification of 500 SSR and 500 SNP loci. Individual molecular fingerprints for the unequivocal identification of the selected 50 accessions were established based on 71 markers alleles amplified by 4 SSR and 9 SNP markers. The same species-specific markers alleles combined with 61 random amplified markers amplified by 5 RAPD and 5 ISSR primers were used to assess the genetic variability and genetic relationships among the selected accessions.
Collapse
Affiliation(s)
- Ricardo Pereira
- MED-Mediterranean Institute for Agriculture, Environment and Development, CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências e Tecnologia, Campus de Gambelas, Universidade do Algarve, 8005-139 Faro, Portugal; (R.P.); (I.A.); (J.R.); (C.D.)
| | - Isabela Anjos
- MED-Mediterranean Institute for Agriculture, Environment and Development, CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências e Tecnologia, Campus de Gambelas, Universidade do Algarve, 8005-139 Faro, Portugal; (R.P.); (I.A.); (J.R.); (C.D.)
- Post-Graduation in Biotechnology and Biodiversity-Rede Pró-Centro Oeste, Universidade do Estado de Mato Grosso, Cáceres 78200-000, Brazil
| | - João Reis
- MED-Mediterranean Institute for Agriculture, Environment and Development, CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências e Tecnologia, Campus de Gambelas, Universidade do Algarve, 8005-139 Faro, Portugal; (R.P.); (I.A.); (J.R.); (C.D.)
| | - Carolina Dias
- MED-Mediterranean Institute for Agriculture, Environment and Development, CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências e Tecnologia, Campus de Gambelas, Universidade do Algarve, 8005-139 Faro, Portugal; (R.P.); (I.A.); (J.R.); (C.D.)
| | - José Leitão
- MED-Mediterranean Institute for Agriculture, Environment and Development, CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências e Tecnologia, Campus de Gambelas, Universidade do Algarve, 8005-139 Faro, Portugal; (R.P.); (I.A.); (J.R.); (C.D.)
| |
Collapse
|
5
|
Xie P, Tang L, Luo Y, Liu C, Yan H. Plastid Phylogenomic Insights into the Inter-Tribal Relationships of Plantaginaceae. BIOLOGY 2023; 12:biology12020263. [PMID: 36829541 PMCID: PMC9953724 DOI: 10.3390/biology12020263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 01/21/2023] [Accepted: 01/26/2023] [Indexed: 02/10/2023]
Abstract
Plantaginaceae, consisting of 12 tribes, is a diverse, cosmopolitan family. To date, the inter-tribal relationships of this family have been unresolved, and the plastome structure and composition within Plantaginaceae have seldom been comprehensively investigated. In this study, we compared the plastomes from 41 Plantaginaceae species (including 6 newly sequenced samples and 35 publicly representative species) representing 11 tribes. To clarify the inter-tribal relationships of Plantaginaceae, we inferred phylogenic relationships based on the concatenated and coalescent analyses of 68 plastid protein-coding genes. PhyParts analysis was performed to assess the level of concordance and conflict among gene trees across the species tree. The results indicate that most plastomes of Plantaginaceae are largely conserved in terms of genome structure and gene content. In contrast to most previous studies, a robust phylogeny was recovered using plastome data, providing new insights for better understanding the inter-tribal relationships of Plantaginaceae. Both concatenated and coalescent phylogenies favored the sister relationship between Plantagineae and Digitalideae, as well as between Veroniceae and Hemiphragmeae. Sibthorpieae diverged into a separate branch which was sister to a clade comprising the four tribes mentioned above. Furthermore, the sister relationship between Russelieae and Cheloneae is strongly supported. The results of PhyParts showed gene tree congruence and conflict to varying degrees, but most plastid genes were uninformative for phylogenetic nodes, revealing the defects of previous studies using single or multiple plastid DNA sequences to infer the phylogeny of Plantaginaceae.
Collapse
Affiliation(s)
- Pingxuan Xie
- College of Traditional Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou 510006, China
| | - Lilei Tang
- College of Traditional Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou 510006, China
| | - Yanzhen Luo
- College of Traditional Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou 510006, China
| | - Changkun Liu
- Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Hanjing Yan
- College of Traditional Chinese Medicine, Guangdong Pharmaceutical University, Guangzhou 510006, China
- Key Laboratory of State Administration of Traditional Chinese Medicine for Production and Development of Cantonese Medicinal Materials, Guangzhou 510006, China
- Correspondence:
| |
Collapse
|
6
|
Turudić A, Liber Z, Grdiša M, Jakše J, Varga F, Šatović Z. Variation in Chloroplast Genome Size: Biological Phenomena and Technological Artifacts. PLANTS (BASEL, SWITZERLAND) 2023; 12:254. [PMID: 36678967 PMCID: PMC9864865 DOI: 10.3390/plants12020254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 12/31/2022] [Accepted: 01/03/2023] [Indexed: 06/17/2023]
Abstract
The development of bioinformatic solutions is guided by biological knowledge of the subject. In some cases, we use unambiguous biological models, while in others we rely on assumptions. A commonly used assumption for genomes is that related species have similar genome sequences. This is even more obvious in the case of chloroplast genomes due to their slow evolution. We investigated whether the lengths of complete chloroplast sequences are closely related to the taxonomic proximity of the species. The study was performed using all available RefSeq sequences from the asterid and rosid clades. In general, chloroplast length distributions are narrow at both the family and genus levels. In addition, clear biological explanations have already been reported for families and genera that exhibit particularly wide distributions. The main factors responsible for the length variations are parasitic life forms, IR loss, IR expansions and contractions, and polyphyly. However, the presence of outliers in the distribution at the genus level is a strong indication of possible inaccuracies in sequence assembly.
Collapse
Affiliation(s)
- Ante Turudić
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska c. 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska c. 25, 10000 Zagreb, Croatia
| | - Zlatko Liber
- Faculty of Agriculture, University of Zagreb, Svetošimunska c. 25, 10000 Zagreb, Croatia
- Faculty of Science, University of Zagreb, Marulićev trg 9a, 10000 Zagreb, Croatia
| | - Martina Grdiša
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska c. 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska c. 25, 10000 Zagreb, Croatia
| | - Jernej Jakše
- Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, 1000 Ljubljana, Slovenia
| | - Filip Varga
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska c. 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska c. 25, 10000 Zagreb, Croatia
| | - Zlatko Šatović
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska c. 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska c. 25, 10000 Zagreb, Croatia
| |
Collapse
|
7
|
Chloroplast genome assemblies and comparative analyses of commercially important Vaccinium berry crops. Sci Rep 2022; 12:21600. [PMID: 36517490 PMCID: PMC9751094 DOI: 10.1038/s41598-022-25434-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 11/29/2022] [Indexed: 12/15/2022] Open
Abstract
Vaccinium is a large genus of shrubs that includes a handful of economically important berry crops. Given the numerous hybridizations and polyploidization events, the taxonomy of this genus has remained the subject of long debate. In addition, berries and berry-based products are liable to adulteration, either fraudulent or unintentional due to misidentification of species. The availability of more genomic information could help achieve higher phylogenetic resolution for the genus, provide molecular markers for berry crops identification, and a framework for efficient genetic engineering of chloroplasts. Therefore, in this study we assembled five Vaccinium chloroplast sequences representing the economically relevant berry types: northern highbush blueberry (V. corymbosum), southern highbush blueberry (V. corymbosum hybrids), rabbiteye blueberry (V. virgatum), lowbush blueberry (V. angustifolium), and bilberry (V. myrtillus). Comparative analyses showed that the Vaccinium chloroplast genomes exhibited an overall highly conserved synteny and sequence identity among them. Polymorphic regions included the expansion/contraction of inverted repeats, gene copy number variation, simple sequence repeats, indels, and single nucleotide polymorphisms. Based on their in silico discrimination power, we suggested variants that could be developed into molecular markers for berry crops identification. Phylogenetic analysis revealed multiple origins of highbush blueberry plastomes, likely due to the hybridization events that occurred during northern and southern highbush blueberry domestication.
Collapse
|
8
|
Turudić A, Liber Z, Grdiša M, Jakše J, Varga F, Šatović Z. Chloroplast Genome Annotation Tools: Prolegomena to the Identification of Inverted Repeats. Int J Mol Sci 2022; 23:ijms231810804. [PMID: 36142721 PMCID: PMC9503105 DOI: 10.3390/ijms231810804] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 09/01/2022] [Accepted: 09/13/2022] [Indexed: 12/31/2022] Open
Abstract
The development of next-generation sequencing technology and the increasing amount of sequencing data have brought the bioinformatic tools used in genome assembly into focus. The final step of the process is genome annotation, which works on assembled genome sequences to identify the location of genome features. In the case of organelle genomes, specialized annotation tools are used to identify organelle genes and structural features. Numerous annotation tools target chloroplast sequences. Most chloroplast DNA genomes have a quadripartite structure caused by two copies of a large inverted repeat. We investigated the strategies of six annotation tools (Chloë, Chloroplot, GeSeq, ORG.Annotate, PGA, Plann) for identifying inverted repeats and analyzed their success using publicly available complete chloroplast sequences of taxa belonging to the asterid and rosid clades. The annotation tools use two different approaches to identify inverted repeats, using existing general search tools or implementing stand-alone solutions. The chloroplast sequences studied show that there are different types of imperfections in the assembled data and that each tool performs better on some sequences than the others.
Collapse
Affiliation(s)
- Ante Turudić
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska cesta 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000 Zagreb, Croatia
- Correspondence: ; Tel.: +385-91-3141592
| | - Zlatko Liber
- Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000 Zagreb, Croatia
- Faculty of Science, University of Zagreb, Marulićev trg 9a, 10000 Zagreb, Croatia
| | - Martina Grdiša
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska cesta 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000 Zagreb, Croatia
| | - Jernej Jakše
- Biotechnical Faculty, University of Ljubljana, Jamnikarjeva 101, 1000 Ljubljana, Slovenia
| | - Filip Varga
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska cesta 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000 Zagreb, Croatia
| | - Zlatko Šatović
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CoE CroP-BioDiv), Svetošimunska cesta 25, 10000 Zagreb, Croatia
- Faculty of Agriculture, University of Zagreb, Svetošimunska cesta 25, 10000 Zagreb, Croatia
| |
Collapse
|
9
|
Pouchon C, Boyer F, Roquet C, Denoeud F, Chave J, Coissac E, Alsos IG, Lavergne S. ORTHOSKIM: in silico sequence capture from genomic and transcriptomic libraries for phylogenomic and barcoding applications. Mol Ecol Resour 2022; 22:2018-2037. [PMID: 35015377 DOI: 10.1111/1755-0998.13584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 12/08/2021] [Accepted: 01/05/2022] [Indexed: 11/29/2022]
Abstract
Low-coverage whole genome shotgun sequencing (or genome skimming) has emerged as a cost-effective method for acquiring genomic data in non-model organisms. This method provides sequence information on chloroplast genome (cpDNA), mitochondrial genome (mtDNA) and nuclear ribosomal regions (rDNA), which are over-represented within cells. However, numerous bioinformatic challenges remain to accurately and rapidly obtain such data in organisms with complex genomic structures and rearrangements, in particular for mtDNA in plants or for cpDNA in some plant families. Here we introduce the pipeline ORTHOSKIM, which performs in silico capture of targeted sequences from genomic and transcriptomic libraries without assembling whole organelle genomes. ORTHOSKIM proceeds in three steps: 1) global sequence assembly, 2) mapping against reference sequences, and 3) target sequence extraction; importantly it also includes a range of quality control tests. Different modes are implemented to capture both coding and non-coding regions of cpDNA, mtDNA and rDNA sequences, along with predefined nuclear sequences (e.g. ultra-conserved elements) or collections of single-copy ortholog genes. Moreover, aligned DNA matrices are produced for phylogenetic reconstructions, by performing multiple alignments of the captured sequences. While ORTHOSKIM is suitable for any eukaryote, a case study is presented here, using 114 genome-skimming libraries and 4 RNAseq libraries obtained for two plant families, Primulaceae and Ericaceae, the latter being a well-known problematic family for cpDNA assemblies. ORTHOSKIM recovered with high success rates cpDNA, mtDNA and rDNA sequences, well suited to accurately infer evolutionary relationships within these families. ORTHOSKIM is released under a GPL-3 license and is available at: https://github.com/cpouchon/ORTHOSKIM.
Collapse
Affiliation(s)
- Charles Pouchon
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| | - Frédéric Boyer
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| | - Cristina Roquet
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France.,Systematics and Evolution of Vascular Plants (UAB) - Associated Unit to CSIC, Departament de Biologia Animal, Biologia Vegetal i Ecologia, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
| | - France Denoeud
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 2 rue Gaston Crémieux, 91057, Evry, France
| | - Jérome Chave
- Laboratoire Évolution et Diversité Biologique (EDB), UMR CNRS-IRD-UPS 5174, 31062, Toulouse Cedex, France
| | - Eric Coissac
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| | - Inger Greve Alsos
- The Arctic University Museum of Norway, UiT - The Arctic University of Norway, NO-9037, Tromsø, Norway
| | | | | | - Sébastien Lavergne
- Univ. Grenoble Alpes, CNRS, Univ. Savoie Mont Blanc, LECA, Laboratoire d'Ecologie Alpine (LECA), 38000, Grenoble, France
| |
Collapse
|
10
|
Lee C, Ruhlman TA, Jansen RK. Unprecedented Intraindividual Structural Heteroplasmy in Eleocharis (Cyperaceae, Poales) Plastomes. Genome Biol Evol 2021; 12:641-655. [PMID: 32282915 PMCID: PMC7426004 DOI: 10.1093/gbe/evaa076] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/07/2020] [Indexed: 12/13/2022] Open
Abstract
Plastid genomes (plastomes) of land plants have a conserved quadripartite structure in a gene-dense unit genome consisting of a large inverted repeat that separates two single copy regions. Recently, alternative plastome structures were suggested in Geraniaceae and in some conifers and Medicago the coexistence of inversion isomers has been noted. In this study, plastome sequences of two Cyperaceae, Eleocharis dulcis (water chestnut) and Eleocharis cellulosa (gulf coast spikerush), were completed. Unlike the conserved plastomes in basal groups of Poales, these Eleocharis plastomes have remarkably divergent features, including large plastome sizes, high rates of sequence rearrangements, low GC content and gene density, gene duplications and losses, and increased repetitive DNA sequences. A novel finding among these features was the unprecedented level of heteroplasmy with the presence of multiple plastome structural types within a single individual. Illumina paired-end assemblies combined with PacBio single-molecule real-time sequencing, long-range polymerase chain reaction, and Sanger sequencing data identified at least four different plastome structural types in both Eleocharis species. PacBio long read data suggested that one of the four E. dulcis plastome types predominates.
Collapse
Affiliation(s)
- Chaehee Lee
- Department of Integrative Biology, University of Texas at Austin
| | - Tracey A Ruhlman
- Department of Integrative Biology, University of Texas at Austin
| | - Robert K Jansen
- Department of Integrative Biology, University of Texas at Austin.,Center of Excellence for Bionanoscience Research, King Abdulaziz University (KAU), Jeddah, Saudi Arabia
| |
Collapse
|
11
|
Seven Complete Chloroplast Genomes from Symplocos: Genome Organization and Comparative Analysis. FORESTS 2021. [DOI: 10.3390/f12050608] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
In the present study, chloroplast genome sequences of four species of Symplocos (S. chinensis for. pilosa, S. prunifolia, S. coreana, and S. tanakana) from South Korea were obtained by Ion Torrent sequencing and compared with the sequences of three previously reported Symplocos chloroplast genomes from different species. The length of the Symplocos chloroplast genome ranged from 156,961 to 157,365 bp. Overall, 132 genes including 87 functional genes, 37 tRNA genes, and eight rRNA genes were identified in all Symplocos chloroplast genomes. The gene order and contents were highly similar across the seven species. The coding regions were more conserved than the non-coding regions, and the large single-copy and small single-copy regions were less conserved than the inverted repeat regions. We identified five new hotspot regions (rbcL, ycf4, psaJ, rpl22, and ycf1) that can be used as barcodes or species-specific Symplocos molecular markers. These four novel chloroplast genomes provide basic information on the plastid genome of Symplocos and enable better taxonomic characterization of this genus.
Collapse
|
12
|
Abstract
The plastid genome (plastome ) has proved a valuable source of data for evaluating evolutionary relationships among angiosperms. Through basic and applied approaches, plastid transformation technology offers the potential to understand and improve plant productivity, providing food, fiber, energy, and medicines to meet the needs of a burgeoning global population. The growing genomic resources available to both phylogenetic and biotechnological investigations is allowing novel insights and expanding the scope of plastome research to encompass new species. In this chapter, we present an overview of some of the seminal and contemporary research that has contributed to our current understanding of plastome evolution and attempt to highlight the relationship between evolutionary mechanisms and the tools of plastid genetic engineering.
Collapse
Affiliation(s)
- Tracey A Ruhlman
- Integrative Biology, University of Texas at Austin, Austin, TX, USA.
| | - Robert K Jansen
- Integrative Biology, University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
13
|
Liu X, Liao X, Chen D, Zheng Y, Yu X, Xu X, Liu Z, Lan S. The complete chloroplast genome sequence of Monotropa uniflora (Ericaceae). MITOCHONDRIAL DNA PART B-RESOURCES 2020; 5:3168-3169. [PMID: 33458098 PMCID: PMC7782332 DOI: 10.1080/23802359.2020.1806754] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Monotropa uniflora is one of the representative plants of Ericaceae family, which was famous for entire translucent and ‘ghostly’ white. Also, unique lifestyle also attracts lots of researchers, which it obtains through fixed carbon from photosynthetic plants via a shared mycorrhizal network. In this study, the complete chloroplast (cp) genome of M. uniflora was assembled and annotated, its full-length is 26,913 bp. Plastid genome contains 31 genes, 14 protein-coding genes, 14 tRNA genes, and 3 rRNA genes. The phylogenetic analyses based on the complete chloroplast genome sequence provided solid evidence that M. uniflora has a close relationship M. odorata. The chloroplast genome presented here will help for further conservation of M. uniflora and other saprophytes.
Collapse
Affiliation(s)
- XueDie Liu
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, China.,Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| | - XingYu Liao
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, China.,Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| | - DeQiang Chen
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, China.,Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yu Zheng
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, China.,Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xia Yu
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| | - XinYu Xu
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| | - ZhongJian Liu
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, China.,Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| | - SiRen Lan
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, China.,Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
14
|
Li H, Guo Q, Li Q, Yang L. Long-reads reveal that Rhododendron delavayi plastid genome contains extensive repeat sequences, and recombination exists among plastid genomes of photosynthetic Ericaceae. PeerJ 2020; 8:e9048. [PMID: 32351791 PMCID: PMC7183307 DOI: 10.7717/peerj.9048] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 04/02/2020] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Rhododendron delavayi Franch. var. delavayi is a wild ornamental plant species in Guizhou Province, China. The lack of its plastid genome information seriously hinders the further application and conservation of the valuable resource. METHODS The complete plastid genome of R. delavayi was assembled from long sequence reads. The genome was then characterized, and compared with those of other photosynthetic Ericaceae species. RESULTS The plastid genome of R. delavayi has a typical quadripartite structure, and a length of 202,169 bp. It contains a large number of repeat sequences and shows preference for codon usage. The comparative analysis revealed the irregular recombination of gene sets, including rearrangement and inversion, in the large single copy region. The extreme expansion of the inverted repeat region shortened the small single copy, and expanded the full length of the genome. In addition, consistent with traditional taxonomy, R. delavayi with nine other species of the same family were clustered into Ericaceae based on the homologous protein-coding sequences of the plastid genomes. Thus, the long-read assembly of the plastid genome of R. delavayi would provide basic information for the further study of the evolution, genetic diversity, and conservation of R. delavayi and its relatives.
Collapse
Affiliation(s)
- Huie Li
- College of Agriculture, Guizhou University, Guiyang, Guizhou, China
| | - Qiqiang Guo
- Institute for Forest Resources & Environment of Guizhou, Guizhou University, Guiyang, Guizhou, China
| | - Qian Li
- College of Agriculture, Guizhou University, Guiyang, Guizhou, China
| | - Lan Yang
- College of Agriculture, Guizhou University, Guiyang, Guizhou, China
| |
Collapse
|
15
|
Asaf S, Khan AL, Lubna, Khan A, Khan A, Khan G, Lee IJ, Al-Harrasi A. Expanded inverted repeat region with large scale inversion in the first complete plastid genome sequence of Plantago ovata. Sci Rep 2020; 10:3881. [PMID: 32127603 PMCID: PMC7054531 DOI: 10.1038/s41598-020-60803-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Accepted: 02/17/2020] [Indexed: 11/09/2022] Open
Abstract
Plantago ovata (Plantaginaceae) is an economically and medicinally important species, however, least is known about its genomics and evolution. Here, we report the first complete plastome genome of P. ovata and comparison with previously published genomes of related species from Plantaginaceae. The results revealed that P. ovata plastome size was 162,116 bp and that it had typical quadripartite structure containing a large single copy region of 82,084 bp and small single copy region of 5,272 bp. The genome has a markedly higher inverted repeat (IR) size of 37.4 kb, suggesting large-scale inversion of 13.8 kb within the expanded IR regions. In addition, the P. ovata plastome contains 149 different genes, including 43 tRNA, 8 rRNA, and 98 protein-coding genes. The analysis revealed 139 microsatellites, of which 71 were in the non-coding regions. Approximately 32 forward, 34 tandem, and 17 palindromic repeats were detected. The complete genome sequences, 72 shared genes, matK gene, and rbcL gene from related species generated the same phylogenetic signals, and phylogenetic analysis revealed that P. ovata formed a single clade with P. maritima and P. media. The divergence time estimation as employed in BEAST revealed that P. ovata diverged from P. maritima and P. media about 11.0 million years ago (Mya; 95% highest posterior density, 10.06-12.25 Mya). In conclusion, P. ovata had significant variation in the IR region, suggesting a more stable P. ovata plastome genome than that of other Plantaginaceae species.
Collapse
Affiliation(s)
- Sajjad Asaf
- Natural and Medical Sciences Research Center, University of Nizwa, Nizwa, Oman
| | - Abdul Latif Khan
- Natural and Medical Sciences Research Center, University of Nizwa, Nizwa, Oman.
| | - Lubna
- Department of Botany, Garden Campus, Abdul Wali Khan University Mardan, Khyber Pakhtunkhwa, Pakistan
| | - Adil Khan
- Natural and Medical Sciences Research Center, University of Nizwa, Nizwa, Oman
| | - Arif Khan
- Genomics Group, Faculty of Biosciences and Aquaculture, Nord University, Bodø, 8049, Norway
| | - Gulzar Khan
- Institute for Biology and Environmental Sciences, Carl von Ossietzky University, Oldenburg, Oldenburg, Germany
| | - In-Jung Lee
- School of Applied Biosciences, Kyungpook National University, Daegu, 41566, Republic of Korea.
| | - Ahmed Al-Harrasi
- Natural and Medical Sciences Research Center, University of Nizwa, Nizwa, Oman.
| |
Collapse
|
16
|
Choi IS, Jansen R, Ruhlman T. Lost and Found: Return of the Inverted Repeat in the Legume Clade Defined by Its Absence. Genome Biol Evol 2019; 11:1321-1333. [PMID: 31046101 PMCID: PMC6496590 DOI: 10.1093/gbe/evz076] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/05/2019] [Indexed: 12/23/2022] Open
Abstract
The plant genome comprises a coevolving, integrated genetic system housed in three subcellular compartments: the nucleus, mitochondrion, and the plastid. The typical land plant plastid genome (plastome) comprises the sum of repeating units of 130–160 kb in length. The plastome inverted repeat (IR) divides each plastome monomer into large and small single copy regions, an architecture highly conserved across land plants. There have been varying degrees of expansion or contraction of the IR, and in a few distinct lineages, including the IR-lacking clade of papilionoid legumes, one copy of the IR has been lost. Completion of plastome sequencing and assembly for 19 Medicago species and Trigonella foenum-graceum and comparative analysis with other IR-lacking clade taxa revealed modest divergence with regard to structural organization overall. However, one clade contained unique variation suggesting an ancestor had experienced repeat-mediated changes in plastome structure. In Medicago minima, a novel IR of ∼9 kb was confirmed and the role of repeat-mediated, recombination-dependent replication in IR reemergence is discussed.
Collapse
Affiliation(s)
- In-Su Choi
- Department of Integrative Biology, University of Texas at Austin
| | - Robert Jansen
- Department of Integrative Biology, University of Texas at Austin.,Center of Excellence for Bionanoscience Research, King Abdulaziz University (KAU), Jeddah, Saudi Arabia
| | - Tracey Ruhlman
- Department of Integrative Biology, University of Texas at Austin
| |
Collapse
|
17
|
Plastid phylogenomics resolves infrafamilial relationships of the Styracaceae and sheds light on the backbone relationships of the Ericales. Mol Phylogenet Evol 2018; 121:198-211. [PMID: 29360618 DOI: 10.1016/j.ympev.2018.01.004] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 12/27/2017] [Accepted: 01/02/2018] [Indexed: 01/05/2023]
Abstract
Relationships among the genera of the small, woody family Styracaceae and among families of the large, diverse order Ericales have resisted complete resolution with sequences from one or a few genes. We used plastome sequencing to attempt to resolve the backbone relationships of Styracaceae and Ericales and to explore plastome structural evolution. Complete plastomes for 23 species are newly reported here, including 18 taxa of Styracaceae and five of Ericales (including species of Sapotaceae, Clethraceae, Symplocaceae, and Diapensiaceae). Combined with publicly available complete plastome data, this resulted in a data set of 60 plastomes, including 11 of the 12 genera of Styracaceae and 12 of 22 families of Ericales. Styracaceae plastomes were found to possess the quadripartite structure typical of angiosperms, with sizes ranging from 155 to 159 kb. Most of the plastomes were found to possess the full complement of typical angiosperm plastome genes. Unusual structural features were detected in plastomes of Alniphyllum and Bruinsmia, including the presence of a large 20-kb inversion (14 genes) in the Large Single-Copy region, the loss or pseudogenization of the clpP and accD genes in Bruinsmia, and the loss of the first exon of rps16 in B. styracoides. Likewise, the second intron from clpP was found to be lost in Alniphyllum and Huodendron. Phylogenomic analyses including all 79 plastid protein-coding genes provided improved resolution for relationships among the genera of Styracaceae and families of Ericales. Styracaceae was strongly supported as monophyletic, with Styrax, Huodendron, and a clade of Alniphyllum + Bruinsmia successively sister to the remainder of the family, all with strong support. All genera of Styracaceae were recovered as monophyletic, except for Halesia and Pterostyrax, which were each recovered as polyphyletic with strong support. Within Ericales, all families were recovered as monophyletic with strong support, with Balsaminaceae sister to remaining Ericales. Most relationships recovered in plastome analyses are congruent with previous analyses based on smaller data sets. Our results demonstrate the power of plastid phylogenomics to improve phylogenetic hypotheses among genera and families, and provide new insight into plastome evolution across Ericales.
Collapse
|
18
|
Braukmann TWA, Broe MB, Stefanović S, Freudenstein JV. On the brink: the highly reduced plastomes of nonphotosynthetic Ericaceae. THE NEW PHYTOLOGIST 2017; 216:254-266. [PMID: 28731202 DOI: 10.1111/nph.14681] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Accepted: 05/29/2017] [Indexed: 06/07/2023]
Abstract
Ericaceae (the heather family) is a large and diverse group of plants that forms elaborate symbiotic relationships with mycorrhizal fungi, and includes several nonphotosynthetic lineages. Using an extensive sample of fully mycoheterotrophic (MH) species, we explored inter- and intraspecific variation as well as selective constraints acting on the plastomes of these unusual plants. The plastomes of seven MH genera were analysed in a phylogenetic context with two geographically disparate individuals sequenced for Allotropa, Monotropa, and Pityopus. The plastomes of nonphotosynthetic Ericaceae are highly reduced in size (c. 33-41 kbp) and content, having lost all photosynthesis-related genes, and are reduced to encoding housekeeping genes as well as a protease subunit (clpP)-like and acetyl-CoA carboxylase subunit D (accD)-like open reading frames. Despite an increase in the rate of their nucleotide substitutions, the remaining protein-coding genes are typically under purifying selection in full MHs. We also identified ribosomal proteins under relaxed or neutral selection. These plastomes also exhibit striking structural rearrangements. Intraspecific variation within MH Ericaceae ranges from a few differences (Allotropa) to extensive population divergences (Monotropa, Hypopitys), which indicates that cryptic speciation may be occurring in several lineages. The pattern of gene loss within fully MH Ericaceae plastomes suggests an advanced state of degradation.
Collapse
Affiliation(s)
- Thomas W A Braukmann
- Department of Biology, University of Toronto Mississauga, Mississauga, ON, L5L 1C6, Canada
| | - Michael B Broe
- Department of Evolution, Ecology, and Organismal Biology, Ohio State University, Columbus, OH, 43212-1157, USA
| | - Saša Stefanović
- Department of Biology, University of Toronto Mississauga, Mississauga, ON, L5L 1C6, Canada
| | - John V Freudenstein
- Department of Evolution, Ecology, and Organismal Biology, Ohio State University, Columbus, OH, 43212-1157, USA
| |
Collapse
|
19
|
Asaf S, Khan AL, Aaqil Khan M, Muhammad Imran Q, Kang SM, Al-Hosni K, Jeong EJ, Lee KE, Lee IJ. Comparative analysis of complete plastid genomes from wild soybean (Glycine soja) and nine other Glycine species. PLoS One 2017; 12:e0182281. [PMID: 28763486 PMCID: PMC5538705 DOI: 10.1371/journal.pone.0182281] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 07/14/2017] [Indexed: 11/19/2022] Open
Abstract
The plastid genomes of different plant species exhibit significant variation, thereby providing valuable markers for exploring evolutionary relationships and population genetics. Glycine soja (wild soybean) is recognized as the wild ancestor of cultivated soybean (G. max), representing a valuable genetic resource for soybean breeding programmes. In the present study, the complete plastid genome of G. soja was sequenced using Illumina paired-end sequencing and then compared it for the first time with previously reported plastid genome sequences from nine other Glycine species. The G. soja plastid genome was 152,224 bp in length and possessed a typical quadripartite structure, consisting of a pair of inverted repeats (IRa/IRb; 25,574 bp) separated by small (178,963 bp) and large (83,181 bp) single-copy regions, with a 51-kb inversion in the large single-copy region. The genome encoded 134 genes, including 87 protein-coding genes, eight ribosomal RNA genes, and 39 transfer RNA genes, and possessed 204 randomly distributed microsatellites, including 15 forward, 25 tandem, and 34 palindromic repeats. Whole-plastid genome comparisons revealed an overall high degree of sequence similarity between G. max and G. gracilis and some divergence in the intergenic spacers of other species. Greater numbers of indels and SNP substitutions were observed compared with G. cyrtoloba. The sequence of the accD gene from G. soja was highly divergent from those of the other species except for G. max and G. gracilis. Phylogenomic analyses of the complete plastid genomes and 76 shared genes yielded an identical topology and indicated that G. soja is closely related to G. max and G. gracilis. The complete G. soja genome sequenced in the present study is a valuable resource for investigating the population and evolutionary genetics of Glycine species and can be used to identify related species.
Collapse
Affiliation(s)
- Sajjad Asaf
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | - Abdul Latif Khan
- Chair of Oman's Medicinal Plants & Marine Natural Products, University of Nizwa, Nizwa, Oman
| | - Muhammad Aaqil Khan
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | - Qari Muhammad Imran
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | - Sang-Mo Kang
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | - Khdija Al-Hosni
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | - Eun Ju Jeong
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | - Ko Eun Lee
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| | - In-Jung Lee
- School of Applied Biosciences, Kyungpook National University, Daegu, Republic of Korea
| |
Collapse
|
20
|
Park S, Ruhlman TA, Weng ML, Hajrah NH, Sabir JS, Jansen RK. Contrasting Patterns of Nucleotide Substitution Rates Provide Insight into Dynamic Evolution of Plastid and Mitochondrial Genomes of Geranium. Genome Biol Evol 2017; 9:1766-1780. [PMID: 28854633 PMCID: PMC5570028 DOI: 10.1093/gbe/evx124] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/03/2017] [Indexed: 12/25/2022] Open
Abstract
Geraniaceae have emerged as a model system for investigating the causes and consequences of variation in plastid and mitochondrial genomes. Incredible structural variation in plastid genomes (plastomes) and highly accelerated evolutionary rates have been reported in selected lineages and functional groups of genes in both plastomes and mitochondrial genomes (mitogenomes), and these phenomena have been implicated in cytonuclear incompatibility. Previous organelle genome studies have included limited sampling of Geranium, the largest genus in the family with over 400 species. This study reports on rates and patterns of nucleotide substitutions in plastomes and mitogenomes of 17 species of Geranium and representatives of other Geraniaceae. As detected across other angiosperms, substitution rates in the plastome are 3.5 times higher than the mitogenome in most Geranium. However, in the branch leading to Geranium brycei/Geranium incanum mitochondrial genes experienced significantly higher dN and dS than plastid genes, a pattern that has only been detected in one other angiosperm. Furthermore, rate accelerations differ in the two organelle genomes with plastomes having increased dN and mitogenomes with increased dS. In the Geranium phaeum/Geranium reflexum clade, duplicate copies of clpP and rpoA genes that experienced asymmetric rate divergence were detected in the single copy region of the plastome. In the case of rpoA, the branch leading to G. phaeum/G. reflexum experienced positive selection or relaxation of purifying selection. Finally, the evolution of acetyl-CoA carboxylase is unusual in Geraniaceae because it is only the second angiosperm family where both prokaryotic and eukaryotic ACCases functionally coexist in the plastid.
Collapse
Affiliation(s)
- Seongjun Park
- Department of Integrative Biology, University of Texas at Austin
| | | | - Mao-Lun Weng
- Department of Integrative Biology, University of Texas at Austin
- Department of Biology and Microbiology, South Dakota State University
| | - Nahid H. Hajrah
- Genomic and Biotechnology Research Group, Department of Biological Science, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Jamal S.M. Sabir
- Genomic and Biotechnology Research Group, Department of Biological Science, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Robert K. Jansen
- Department of Integrative Biology, University of Texas at Austin
- Genomic and Biotechnology Research Group, Department of Biological Science, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
21
|
Plastome-Wide Nucleotide Substitution Rates Reveal Accelerated Rates in Papilionoideae and Correlations with Genome Features Across Legume Subfamilies. J Mol Evol 2017; 84:187-203. [PMID: 28397003 DOI: 10.1007/s00239-017-9792-x] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 04/03/2017] [Indexed: 01/08/2023]
Abstract
This study represents the most comprehensive plastome-wide comparison of nucleotide substitution rates across the three subfamilies of Fabaceae: Caesalpinioideae, Mimosoideae, and Papilionoideae. Caesalpinioid and mimosoid legumes have large, unrearranged plastomes compared with papilionoids, which exhibit varying levels of rearrangement including the loss of the inverted repeat (IR) in the IR-lacking clade (IRLC). Using 71 genes common to 39 legume taxa representing all the three subfamilies, we show that papilionoids consistently have higher nucleotide substitution rates than caesalpinioids and mimosoids, and rates in the IRLC papilionoids are generally higher than those in the IR-containing papilionoids. Unsurprisingly, this pattern was significantly correlated with growth habit as most papilionoids are herbaceous, whereas caesalpinioids and mimosoids are largely woody. Both nonsynonymous (dN) and synonymous (dS) substitution rates were also correlated with several biological features including plastome size and plastomic rearrangements such as the number of inversions and indels. In agreement with previous reports, we found that genes in the IR exhibit between three and fourfold reductions in the substitution rates relative to genes within the large single-copy or small single-copy regions. Furthermore, former IR genes in IR-lacking taxa exhibit accelerated rates compared with genes contained in the IR.
Collapse
|
22
|
Ribeiro MM, Piotti A, Ricardo A, Gaspar D, Costa R, Parducci L, Vendramin GG. Genetic diversity and divergence at the Arbutus unedo L. (Ericaceae) westernmost distribution limit. PLoS One 2017; 12:e0175239. [PMID: 28384294 PMCID: PMC5383270 DOI: 10.1371/journal.pone.0175239] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 03/22/2017] [Indexed: 01/21/2023] Open
Abstract
Mediterranean forests are fragile ecosystems vulnerable to recent global warming and reduction of precipitation, and a long-term negative effect is expected on vegetation with increasing drought and in areas burnt by fires. We investigated the spatial distribution of genetic variation of Arbutus unedo in the western Iberia Peninsula, using plastid markers with conservation and provenance regions design purposes. This species is currently undergoing an intense domestication process in the region, and, like other species, is increasingly under the threat from climate change, habitat fragmentation and wildfires. We sampled 451 trees from 15 natural populations from different ecological conditions spanning the whole species’ distribution range in the region. We applied Bayesian analysis and identified four clusters (north, centre, south, and a single-population cluster). Hierarchical AMOVA showed higher differentiation among clusters than among populations within clusters. The relatively low within-clusters differentiation can be explained by a common postglacial history of nearby populations. The genetic structure found, supported by the few available palaeobotanical records, cannot exclude the hypothesis of two independent A. unedo refugia in western Iberia Peninsula during the Last Glacial Maximum. Based on the results we recommend a conservation strategy by selecting populations for conservation based on their allelic richness and diversity and careful seed transfer consistent with current species’ genetic structure.
Collapse
Affiliation(s)
- Maria Margarida Ribeiro
- Departamento de Recursos Naturais e Desenvolvimento Sustentável, Instituto Politécnico de Castelo Branco, Escola Superior Agrária, Castelo Branco, Portugal
- Plant Ecology/ Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18 D, Uppsala, Sweden
- Forest Research Centre, School of Agriculture, University of Lisbon, Tapada da Ajuda, Lisbon, Portugal
- * E-mail:
| | - Andrea Piotti
- Institute of Biosciences and BioResources, National Research Council, Via Madonna del Piano 10, Sesto Fiorentino (Firenze), Italy
| | - Alexandra Ricardo
- Departamento de Recursos Naturais e Desenvolvimento Sustentável, Instituto Politécnico de Castelo Branco, Escola Superior Agrária, Castelo Branco, Portugal
| | - Daniel Gaspar
- INIAV, Instituto Nacional de Investigação Agrária e Veterinária, I.P., Av. República, Quinta do Marquês Oeiras, Portugal
| | - Rita Costa
- Forest Research Centre, School of Agriculture, University of Lisbon, Tapada da Ajuda, Lisbon, Portugal
- INIAV, Instituto Nacional de Investigação Agrária e Veterinária, I.P., Av. República, Quinta do Marquês Oeiras, Portugal
| | - Laura Parducci
- Plant Ecology/ Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18 D, Uppsala, Sweden
| | - Giovanni Giuseppe Vendramin
- Institute of Biosciences and BioResources, National Research Council, Via Madonna del Piano 10, Sesto Fiorentino (Firenze), Italy
| |
Collapse
|
23
|
Kang JS, Lee BY, Kwak M. The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes. PLoS One 2017; 12:e0172924. [PMID: 28241056 PMCID: PMC5328339 DOI: 10.1371/journal.pone.0172924] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 02/10/2017] [Indexed: 11/18/2022] Open
Abstract
The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.
Collapse
Affiliation(s)
- Jong-Soo Kang
- Plant Resources Division, National Institute of Biological Resources, Incheon, Republic of Korea
| | - Byoung Yoon Lee
- Plant Resources Division, National Institute of Biological Resources, Incheon, Republic of Korea
| | - Myounghai Kwak
- Plant Resources Division, National Institute of Biological Resources, Incheon, Republic of Korea
- * E-mail:
| |
Collapse
|
24
|
Yao X, Liu YY, Tan YH, Song Y, Corlett RT. The complete chloroplast genome sequence of Helwingia himalaica (Helwingiaceae, Aquifoliales) and a chloroplast phylogenomic analysis of the Campanulidae. PeerJ 2016; 4:e2734. [PMID: 27917320 PMCID: PMC5131622 DOI: 10.7717/peerj.2734] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 10/30/2016] [Indexed: 11/29/2022] Open
Abstract
Complete chloroplast genome sequences have been very useful for understanding phylogenetic relationships in angiosperms at the family level and above, but there are currently large gaps in coverage. We report the chloroplast genome for Helwingia himalaica, the first in the distinctive family Helwingiaceae and only the second genus to be sequenced in the order Aquifoliales. We then combine this with 36 published sequences in the large (c. 35,000 species) subclass Campanulidae in order to investigate relationships at the order and family levels. The Helwingia genome consists of 158,362 bp containing a pair of inverted repeat (IR) regions of 25,996 bp separated by a large single-copy (LSC) region and a small single-copy (SSC) region which are 87,810 and 18,560 bp, respectively. There are 142 known genes, including 94 protein-coding genes, eight ribosomal RNA genes, and 40 tRNA genes. The topology of the phylogenetic relationships between Apiales, Asterales, and Dipsacales differed between analyses based on complete genome sequences and on 36 shared protein-coding genes, showing that further studies of campanulid phylogeny are needed.
Collapse
Affiliation(s)
- Xin Yao
- Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Xishuangbanna, Yunnan, China; University of Chinese Academy of Sciences, Beijing, Beijing, China
| | - Ying-Ying Liu
- Key Laboratory of Dai and Southern medicine of Xishuangbanna Dai Autonomous Prefecture, Yunnan Branch Institute of Medicinal Plant, Chinese Academy of Medical Sciences , Jinghong, Yunnan , China
| | - Yun-Hong Tan
- Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences , Xishuangbanna, Yunnan , China
| | - Yu Song
- Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Xishuangbanna, Yunnan, China; University of Chinese Academy of Sciences, Beijing, Beijing, China
| | - Richard T Corlett
- Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences , Xishuangbanna, Yunnan , China
| |
Collapse
|
25
|
Roquet C, Coissac É, Cruaud C, Boleda M, Boyer F, Alberti A, Gielly L, Taberlet P, Thuiller W, Van Es J, Lavergne S. Understanding the evolution of holoparasitic plants: the complete plastid genome of the holoparasite Cytinus hypocistis (Cytinaceae). ANNALS OF BOTANY 2016; 118:885-896. [PMID: 27443299 PMCID: PMC5055816 DOI: 10.1093/aob/mcw135] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Revised: 04/09/2016] [Accepted: 05/12/2016] [Indexed: 05/19/2023]
Abstract
Background and Aims Plant plastid genomes are highly conserved in size, gene content and structure; however, parasitic plants are a noticeable exception to this evolutionary stability. Although the evolution of parasites could help to better understand plastome evolution in general, complete plastomes of parasites have been sequenced only for some lineages so far. Here we contribute to filling this gap by providing and analysing the complete plastome sequence of Cytinus hypocistis, the first parasite sequenced for Malvales and a species suspected to have an extremely small genome. Methods We sequenced and assembled de novo the plastid genome of Cytinus hypocistis using a shotgun approach on genomic DNA. Phylogenomic analyses based on coding regions were performed on Malvidae. For each coding region present in Cytinus, we tested for relaxation or intensification of selective pressures in the Cytinus lineage compared with autotrophic Malvales. Key Results Cytinus hypocistis has an extremely divergent genome that is among the smallest sequenced to date (19·4 kb), with only 23 genes and no inverted repeat regions. Phylogenomic analysis confirmed the position of Cytinus within Malvales. All coding regions of Cytinus plastome presented very high substitution rates compared with non-parasitic Malvales. Conclusions Some regions were inferred to be under relaxed negative selection in Cytinus, suggesting that further plastome reduction is occurring due to relaxed purifying selection associated with the loss of photosynthetic activity. On the other hand, increased selection intensity and strong positive selection were detected for rpl22 in the Cytinus lineage, which might indicate an evolutionary role in the host-parasite arms race, a point that needs further research.
Collapse
Affiliation(s)
- Cristina Roquet
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
- *For correspondence. E-mail
| | - Éric Coissac
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
| | - Corinne Cruaud
- CEA-Institut de Génomique, Genoscope, Centre National de Séquençage, FR-91057 Evry Cedex, France
| | - Martí Boleda
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
| | - Frédéric Boyer
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
| | - Adriana Alberti
- CEA-Institut de Génomique, Genoscope, Centre National de Séquençage, FR-91057 Evry Cedex, France
| | - Ludovic Gielly
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
| | - Pierre Taberlet
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
| | - Wilfried Thuiller
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
| | - Jérémie Van Es
- Conservatoire Botanique National Alpin, Domaine de Charance, FR-05000 Gap, France
| | - Sébastien Lavergne
- Laboratoire d’Ecologie Alpine, Université Grenoble Alpes, BP 53, FR-38000 Grenoble, France
- Laboratoire d’Ecologie Alpine, CNRS, BP 53, FR-38000 Grenoble, France
| |
Collapse
|
26
|
Logacheva MD, Schelkunov MI, Shtratnikova VY, Matveeva MV, Penin AA. Comparative analysis of plastid genomes of non-photosynthetic Ericaceae and their photosynthetic relatives. Sci Rep 2016; 6:30042. [PMID: 27452401 PMCID: PMC4958920 DOI: 10.1038/srep30042] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 06/27/2016] [Indexed: 12/24/2022] Open
Abstract
Although plastid genomes of flowering plants are typically highly conserved regarding their size, gene content and order, there are some exceptions. Ericaceae, a large and diverse family of flowering plants, warrants special attention within the context of plastid genome evolution because it includes both non-photosynthetic and photosynthetic species with rearranged plastomes and putative losses of "essential" genes. We characterized plastid genomes of three species of Ericaceae, non-photosynthetic Monotropa uniflora and Hypopitys monotropa and photosynthetic Pyrola rotundifolia, using high-throughput sequencing. As expected for non-photosynthetic plants, M. uniflora and H. monotropa have small plastid genomes (46 kb and 35 kb, respectively) lacking genes related to photosynthesis, whereas P. rotundifolia has a larger genome (169 kb) with a gene set similar to other photosynthetic plants. The examined genomes contain an unusually high number of repeats and translocations. Comparative analysis of the expanded set of Ericaceae plastomes suggests that the genes clpP and accD that are present in the plastid genomes of almost all plants have not been lost in this family (as was previously thought) but rather persist in these genomes in unusual forms. Also we found a new gene in P. rotundifolia that emerged as a result of duplication of rps4 gene.
Collapse
Affiliation(s)
- Maria D. Logacheva
- Lomonosov Moscow State University, A.N Belozersky Institute of Physico-Chemical Biology, Moscow, Russia
- Kazan Federal University, Institute of Fundamental Biology and Medicine, Kazan, Russia
| | - Mikhail I. Schelkunov
- Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Victoria Y. Shtratnikova
- Lomonosov Moscow State University, Department of Bioengineering and Bioinformatics, Moscow, Russia
| | - Maria V. Matveeva
- Kazan Federal University, Institute of Fundamental Biology and Medicine, Kazan, Russia
| | - Aleksey A. Penin
- Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
- Lomonosov Moscow State University, Department of Genetics, Moscow, Russia
| |
Collapse
|
27
|
Liu TJ, Zhang CY, Yan HF, Zhang L, Ge XJ, Hao G. Complete plastid genome sequence of Primula sinensis (Primulaceae): structure comparison, sequence variation and evidence for accD transfer to nucleus. PeerJ 2016; 4:e2101. [PMID: 27375965 PMCID: PMC4928469 DOI: 10.7717/peerj.2101] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Accepted: 05/10/2016] [Indexed: 11/22/2022] Open
Abstract
Species-rich genus Primula L. is a typical plant group with which to understand genetic variance between species in different levels of relationships. Chloroplast genome sequences are used to be the information resource for quantifying this difference and reconstructing evolutionary history. In this study, we reported the complete chloroplast genome sequence of Primula sinensis and compared it with other related species. This genome of chloroplast showed a typical circular quadripartite structure with 150,859 bp in sequence length consisting of 37.2% GC base. Two inverted repeated regions (25,535 bp) were separated by a large single-copy region (82,064 bp) and a small single-copy region (17,725 bp). The genome consists of 112 genes, including 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Among them, seven coding genes, seven tRNA genes and four rRNA genes have two copies due to their locations in the IR regions. The accD and infA genes lacking intact open reading frames (ORF) were identified as pseudogenes. SSR and sequence variation analyses were also performed on the plastome of Primula sinensis, comparing with another available plastome of P. poissonii. The four most variable regions, rpl36–rps8, rps16–trnQ, trnH–psbA and ndhC–trnV, were identified. Phylogenetic relationship estimates using three sub-datasets extracted from a matrix of 57 protein-coding gene sequences showed the identical result that was consistent with previous studies. A transcript found from P. sinensis transcriptome showed a high similarity to plastid accD functional region and was identified as a putative plastid transit peptide at the N-terminal region. The result strongly suggested that plastid accD has been functionally transferred to the nucleus in P. sinensis.
Collapse
Affiliation(s)
- Tong-Jian Liu
- College of Life Sciences, South China Agricultural University , Guangzhou , China
| | - Cai-Yun Zhang
- College of Life Sciences, South China Agricultural University , Guangzhou , China
| | - Hai-Fei Yan
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou , China
| | - Lu Zhang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou , China
| | - Xue-Jun Ge
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences , Guangzhou , China
| | - Gang Hao
- College of Life Sciences, South China Agricultural University , Guangzhou , China
| |
Collapse
|
28
|
Blazier JC, Jansen RK, Mower JP, Govindu M, Zhang J, Weng ML, Ruhlman TA. Variable presence of the inverted repeat and plastome stability in Erodium. ANNALS OF BOTANY 2016; 117:1209-20. [PMID: 27192713 PMCID: PMC4904181 DOI: 10.1093/aob/mcw065] [Citation(s) in RCA: 70] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2015] [Revised: 01/05/2016] [Accepted: 02/22/2016] [Indexed: 05/18/2023]
Abstract
BACKGROUND AND AIMS Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR. METHODS We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus. KEY RESULTS Erodium plastomes fell into four types (Type 1-4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR. CONCLUSIONS The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts.
Collapse
Affiliation(s)
- John C Blazier
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Robert K Jansen
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA Department of Biological Science, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Jeffrey P Mower
- Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE, USA
| | - Madhu Govindu
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Jin Zhang
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Mao-Lun Weng
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| | - Tracey A Ruhlman
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
29
|
Zhang J, Ruhlman TA, Sabir JSM, Blazier JC, Weng ML, Park S, Jansen RK. Coevolution between Nuclear-Encoded DNA Replication, Recombination, and Repair Genes and Plastid Genome Complexity. Genome Biol Evol 2016; 8:622-34. [PMID: 26893456 PMCID: PMC4824065 DOI: 10.1093/gbe/evw033] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Disruption of DNA replication, recombination, and repair (DNA-RRR) systems has been hypothesized to cause highly elevated nucleotide substitution rates and genome rearrangements in the plastids of angiosperms, but this theory remains untested. To investigate nuclear–plastid genome (plastome) coevolution in Geraniaceae, four different measures of plastome complexity (rearrangements, repeats, nucleotide insertions/deletions, and substitution rates) were evaluated along with substitution rates of 12 nuclear-encoded, plastid-targeted DNA-RRR genes from 27 Geraniales species. Significant correlations were detected for nonsynonymous (dN) but not synonymous (dS) substitution rates for three DNA-RRR genes (uvrB/C, why1, and gyrA) supporting a role for these genes in accelerated plastid genome evolution in Geraniaceae. Furthermore, correlation between dN of uvrB/C and plastome complexity suggests the presence of nucleotide excision repair system in plastids. Significant correlations were also detected between plastome complexity and 13 of the 90 nuclear-encoded organelle-targeted genes investigated. Comparisons revealed significant acceleration of dN in plastid-targeted genes of Geraniales relative to Brassicales suggesting this correlation may be an artifact of elevated rates in this gene set in Geraniaceae. Correlation between dN of plastid-targeted DNA-RRR genes and plastome complexity supports the hypothesis that the aberrant patterns in angiosperm plastome evolution could be caused by dysfunction in DNA-RRR systems.
Collapse
Affiliation(s)
- Jin Zhang
- Department of Integrative Biology, University of Texas at Austin
| | - Tracey A Ruhlman
- Department of Integrative Biology, University of Texas at Austin
| | - Jamal S M Sabir
- The Biotechnology Research Group, Department of Biological Sciences, Faculty of Science, King Abdulaziz University (KAU), Jeddah, Saudi Arabia
| | | | - Mao-Lun Weng
- Department of Integrative Biology, University of Texas at Austin
| | - Seongjun Park
- Department of Integrative Biology, University of Texas at Austin
| | - Robert K Jansen
- Department of Integrative Biology, University of Texas at Austin The Biotechnology Research Group, Department of Biological Sciences, Faculty of Science, King Abdulaziz University (KAU), Jeddah, Saudi Arabia
| |
Collapse
|
30
|
Santiso X, Lopez L, Retuerto R, Barreiro R. Phylogeography of a widespread species: pre-glacial vicariance, refugia, occasional blocking straits and long-distance migrations. AOB PLANTS 2016; 8:plw003. [PMID: 26768603 DOI: 10.1093/aobpla/plwoo3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 09/17/2015] [Accepted: 12/28/2015] [Indexed: 05/26/2023]
Abstract
Phylogeographic studies give us the opportunity to reconstruct the historical migrations of species and link them with climatic and geographic variation. They are, therefore, a key tool to understanding the relationships among biology, geology and history. One of the most interesting biogeographical areas of the world is the Mediterranean region. However, in this area, the description of concordant phylogeographic patterns is quite scarce, which limits the understanding of evolutionary patterns related to climate. Species with one-dimensional distribution ranges, such as the strawberry tree (Arbutus unedo), are particularly useful to unravel these patterns. Here, we describe its phylogeographic structure and check for concordance with patterns seen in other Mediterranean plants: longitudinal/latitudinal clines of diversity, evidence for glacial refugia and the role of sea straits in dispersal. We also identify the most likely source for the disjunct Irish population. With this aim, we sequenced four chloroplast non-coding fragments of A. unedo from 23 populations covering its whole distribution. We determined the genetic diversity, population structure, haplotype genealogy and time to the most recent common ancestor. The genealogy revealed two clades that separated during the last 700 ky but before the last glacial maximum. One clade occupies Atlantic Iberia and North Africa, while the other occurs in the Western Mediterranean. The Eastern Mediterranean is inhabited by newer haplotypes derived from both clades, while the Irish population is closely related to Iberian demes. The straits of Sicily and Gibraltar partially restricted the gene flow. We concluded that a vicariance event during the Late Quaternary in the western end of the species' range followed by eastward migration seems a likely explanation for the observed phylogeographic pattern. The role of straits indicates an occasional communication between Europe and North Africa, suggesting that the latter was a novel refugia. The East-West genetic split in Iberia is consistent with the refugia-within-refugia model. Finally, the strawberry tree possibly reached Ireland from Iberia instead of throughout the maritime fringe of France as previously thought.
Collapse
Affiliation(s)
- Xabier Santiso
- Área de Ecoloxía, Facultade de Bioloxía, Campus Vida, Universidade de Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - Lúa Lopez
- Área de Ecología, Facultad de Ciencias, Campus A Zapateira, Universidad de A Coruña, 15071 A Coruña, Spain
| | - Rubén Retuerto
- Área de Ecoloxía, Facultade de Bioloxía, Campus Vida, Universidade de Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - Rodolfo Barreiro
- Área de Ecología, Facultad de Ciencias, Campus A Zapateira, Universidad de A Coruña, 15071 A Coruña, Spain
| |
Collapse
|
31
|
Santiso X, Lopez L, Retuerto R, Barreiro R. Phylogeography of a widespread species: pre-glacial vicariance, refugia, occasional blocking straits and long-distance migrations. AOB PLANTS 2016; 8:plw003. [PMID: 26768603 PMCID: PMC4768523 DOI: 10.1093/aobpla/plw003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2015] [Accepted: 12/28/2015] [Indexed: 05/26/2023]
Abstract
Phylogeographic studies give us the opportunity to reconstruct the historical migrations of species and link them with climatic and geographic variation. They are, therefore, a key tool to understanding the relationships among biology, geology and history. One of the most interesting biogeographical areas of the world is the Mediterranean region. However, in this area, the description of concordant phylogeographic patterns is quite scarce, which limits the understanding of evolutionary patterns related to climate. Species with one-dimensional distribution ranges, such as the strawberry tree (Arbutus unedo), are particularly useful to unravel these patterns. Here, we describe its phylogeographic structure and check for concordance with patterns seen in other Mediterranean plants: longitudinal/latitudinal clines of diversity, evidence for glacial refugia and the role of sea straits in dispersal. We also identify the most likely source for the disjunct Irish population. With this aim, we sequenced four chloroplast non-coding fragments of A. unedo from 23 populations covering its whole distribution. We determined the genetic diversity, population structure, haplotype genealogy and time to the most recent common ancestor. The genealogy revealed two clades that separated during the last 700 ky but before the last glacial maximum. One clade occupies Atlantic Iberia and North Africa, while the other occurs in the Western Mediterranean. The Eastern Mediterranean is inhabited by newer haplotypes derived from both clades, while the Irish population is closely related to Iberian demes. The straits of Sicily and Gibraltar partially restricted the gene flow. We concluded that a vicariance event during the Late Quaternary in the western end of the species' range followed by eastward migration seems a likely explanation for the observed phylogeographic pattern. The role of straits indicates an occasional communication between Europe and North Africa, suggesting that the latter was a novel refugia. The East-West genetic split in Iberia is consistent with the refugia-within-refugia model. Finally, the strawberry tree possibly reached Ireland from Iberia instead of throughout the maritime fringe of France as previously thought.
Collapse
Affiliation(s)
- Xabier Santiso
- Área de Ecoloxía, Facultade de Bioloxía, Campus Vida, Universidade de Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - Lúa Lopez
- Área de Ecología, Facultad de Ciencias, Campus A Zapateira, Universidad de A Coruña, 15071 A Coruña, Spain
| | - Rubén Retuerto
- Área de Ecoloxía, Facultade de Bioloxía, Campus Vida, Universidade de Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - Rodolfo Barreiro
- Área de Ecología, Facultad de Ciencias, Campus A Zapateira, Universidad de A Coruña, 15071 A Coruña, Spain
| |
Collapse
|
32
|
Naumann J, Der JP, Wafula EK, Jones SS, Wagner ST, Honaas LA, Ralph PE, Bolin JF, Maass E, Neinhuis C, Wanke S, dePamphilis CW. Detecting and Characterizing the Highly Divergent Plastid Genome of the Nonphotosynthetic Parasitic Plant Hydnora visseri (Hydnoraceae). Genome Biol Evol 2016; 8:345-63. [PMID: 26739167 PMCID: PMC4779604 DOI: 10.1093/gbe/evv256] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/15/2015] [Indexed: 11/14/2022] Open
Abstract
Plastid genomes of photosynthetic flowering plants are usually highly conserved in both structure and gene content. However, the plastomes of parasitic and mycoheterotrophic plants may be released from selective constraint due to the reduction or loss of photosynthetic ability. Here we present the greatly reduced and highly divergent, yet functional, plastome of the nonphotosynthetic holoparasite Hydnora visseri (Hydnoraceae, Piperales). The plastome is 27 kb in length, with 24 genes encoding ribosomal proteins, ribosomal RNAs, tRNAs, and a few nonbioenergetic genes, but no genes related to photosynthesis. The inverted repeat and the small single copy region are only approximately 1.5 kb, and intergenic regions have been drastically reduced. Despite extreme reduction, gene order and orientation are highly similar to the plastome of Piper cenocladum, a related photosynthetic plant in Piperales. Gene sequences in Hydnora are highly divergent and several complementary approaches using the highest possible sensitivity were required for identification and annotation of this plastome. Active transcription is detected for all of the protein-coding genes in the plastid genome, and one of two introns is appropriately spliced out of rps12 transcripts. The whole-genome shotgun read depth is 1,400× coverage for the plastome, whereas the mitochondrial genome is covered at 40× and the nuclear genome at 2×. Despite the extreme reduction of the genome and high sequence divergence, the presence of syntenic, long transcriptionally active open-reading frames with distant similarity to other plastid genomes and a high plastome stoichiometry relative to the mitochondrial and nuclear genomes suggests that the plastome remains functional in H. visseri. A four-stage model of gene reduction, including the potential for complete plastome loss, is proposed to account for the range of plastid genomes in nonphotosynthetic plants.
Collapse
Affiliation(s)
- Julia Naumann
- Institut für Botanik, Technische Universität Dresden, Germany Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University
| | - Joshua P Der
- Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University Department of Biological Science, California State University Fullerton
| | - Eric K Wafula
- Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University
| | - Samuel S Jones
- Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University Intercollege Graduate Program in Plant Biology, The Pennsylvania State University
| | - Sarah T Wagner
- Institut für Botanik, Technische Universität Dresden, Germany
| | - Loren A Honaas
- Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University
| | - Paula E Ralph
- Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University
| | | | - Erika Maass
- Department of Biological Sciences, University of Namibia, Windhoek, Namibia
| | | | - Stefan Wanke
- Institut für Botanik, Technische Universität Dresden, Germany
| | - Claude W dePamphilis
- Department of Biology and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University Intercollege Graduate Program in Plant Biology, The Pennsylvania State University
| |
Collapse
|
33
|
Dugas DV, Hernandez D, Koenen EJM, Schwarz E, Straub S, Hughes CE, Jansen RK, Nageswara-Rao M, Staats M, Trujillo JT, Hajrah NH, Alharbi NS, Al-Malki AL, Sabir JSM, Bailey CD. Mimosoid legume plastome evolution: IR expansion, tandem repeat expansions, and accelerated rate of evolution in clpP. Sci Rep 2015; 5:16958. [PMID: 26592928 PMCID: PMC4655330 DOI: 10.1038/srep16958] [Citation(s) in RCA: 96] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2015] [Accepted: 10/14/2015] [Indexed: 11/29/2022] Open
Abstract
The Leguminosae has emerged as a model for studying angiosperm plastome evolution because of its striking diversity of structural rearrangements and sequence variation. However, most of what is known about legume plastomes comes from few genera representing a subset of lineages in subfamily Papilionoideae. We investigate plastome evolution in subfamily Mimosoideae based on two newly sequenced plastomes (Inga and Leucaena) and two recently published plastomes (Acacia and Prosopis), and discuss the results in the context of other legume and rosid plastid genomes. Mimosoid plastomes have a typical angiosperm gene content and general organization as well as a generally slow rate of protein coding gene evolution, but they are the largest known among legumes. The increased length results from tandem repeat expansions and an unusual 13 kb IR-SSC boundary shift in Acacia and Inga. Mimosoid plastomes harbor additional interesting features, including loss of clpP intron1 in Inga, accelerated rates of evolution in clpP for Acacia and Inga, and dN/dS ratios consistent with neutral and positive selection for several genes. These new plastomes and results provide important resources for legume comparative genomics, plant breeding, and plastid genetic engineering, while shedding further light on the complexity of plastome evolution in legumes and angiosperms.
Collapse
Affiliation(s)
- Diana V Dugas
- Department of Biology, New Mexico State University, P.O. Box 30001, MSC 3AF, Las Cruces, NM, 88003, USA
| | - David Hernandez
- Department of Biology, New Mexico State University, P.O. Box 30001, MSC 3AF, Las Cruces, NM, 88003, USA
| | - Erik J M Koenen
- Institute of Systematic Botany, University of Zurich, Zollikerstrasse 107, 8008 Zurich, Switzerland
| | - Erika Schwarz
- Department of Integrative Biology, The University of Texas at Austin, 205 W. 24th St. Stop C0930, Austin, TX 78712, USA
| | - Shannon Straub
- Department of Biology, Hobart and William Smith Colleges, 300 Pulteney Street, Geneva, NY 14456, USA.,Oregon State University, Department Of Plant Biology, 2082 Cordley Hall, Corvallis, OR, 97331, USA
| | - Colin E Hughes
- Institute of Systematic Botany, University of Zurich, Zollikerstrasse 107, 8008 Zurich, Switzerland
| | - Robert K Jansen
- Department of Integrative Biology, The University of Texas at Austin, 205 W. 24th St. Stop C0930, Austin, TX 78712, USA.,Biotechnology Research Group, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Madhugiri Nageswara-Rao
- Department of Biology, New Mexico State University, P.O. Box 30001, MSC 3AF, Las Cruces, NM, 88003, USA
| | - Martijn Staats
- Biosystematics Group, Wageningen University, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Joshua T Trujillo
- Department of Biology, New Mexico State University, P.O. Box 30001, MSC 3AF, Las Cruces, NM, 88003, USA
| | - Nahid H Hajrah
- Biotechnology Research Group, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Njud S Alharbi
- Biotechnology Research Group, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Abdulrahman L Al-Malki
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Jamal S M Sabir
- Biotechnology Research Group, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - C Donovan Bailey
- Department of Biology, New Mexico State University, P.O. Box 30001, MSC 3AF, Las Cruces, NM, 88003, USA
| |
Collapse
|
34
|
Williams AV, Boykin LM, Howell KA, Nevill PG, Small I. The Complete Sequence of the Acacia ligulata Chloroplast Genome Reveals a Highly Divergent clpP1 Gene. PLoS One 2015; 10:e0125768. [PMID: 25955637 PMCID: PMC4425659 DOI: 10.1371/journal.pone.0125768] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2014] [Accepted: 03/26/2015] [Indexed: 11/25/2022] Open
Abstract
Legumes are a highly diverse angiosperm family that include many agriculturally important species. To date, 21 complete chloroplast genomes have been sequenced from legume crops confined to the Papilionoideae subfamily. Here we report the first chloroplast genome from the Mimosoideae, Acacia ligulata, and compare it to the previously sequenced legume genomes. The A. ligulata chloroplast genome is 158,724 bp in size, comprising inverted repeats of 25,925 bp and single-copy regions of 88,576 bp and 18,298 bp. Acacia ligulata lacks the inversion present in many of the Papilionoideae, but is not otherwise significantly different in terms of gene and repeat content. The key feature is its highly divergent clpP1 gene, normally considered essential in chloroplast genomes. In A. ligulata, although transcribed and spliced, it probably encodes a catalytically inactive protein. This study provides a significant resource for further genetic research into Acacia and the Mimosoideae. The divergent clpP1 gene suggests that Acacia will provide an interesting source of information on the evolution and functional diversity of the chloroplast Clp protease complex.
Collapse
Affiliation(s)
- Anna V. Williams
- Australian Research Council Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Western Australia, Australia
- Botanic Gardens and Parks Authority, Kings Park and Botanic Garden, Fraser Avenue, Kings Park, Western Australia, Australia
- School of Plant Biology, The University of Western Australia, Crawley, Western Australia, Australia
| | - Laura M. Boykin
- Australian Research Council Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Western Australia, Australia
- Centre of Excellence in Computational Systems Biology, The University of Western Australia, Crawley, Western Australia, Australia
| | - Katharine A. Howell
- Australian Research Council Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Western Australia, Australia
| | - Paul G. Nevill
- Botanic Gardens and Parks Authority, Kings Park and Botanic Garden, Fraser Avenue, Kings Park, Western Australia, Australia
- School of Plant Biology, The University of Western Australia, Crawley, Western Australia, Australia
| | - Ian Small
- Australian Research Council Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley, Western Australia, Australia
- Centre of Excellence in Computational Systems Biology, The University of Western Australia, Crawley, Western Australia, Australia
- * E-mail:
| |
Collapse
|
35
|
Yan L, Lai X, Li X, Wei C, Tan X, Zhang Y. Analyses of the complete genome and gene expression of chloroplast of sweet potato [Ipomoea batata]. PLoS One 2015; 10:e0124083. [PMID: 25874767 PMCID: PMC4398329 DOI: 10.1371/journal.pone.0124083] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2014] [Accepted: 03/05/2015] [Indexed: 12/31/2022] Open
Abstract
Sweet potato [Ipomoea batatas (L.) Lam] ranks among the top seven most important food crops cultivated worldwide and is hexaploid plant (2n=6x=90) in the Convolvulaceae family with a genome size between 2,200 to 3,000 Mb. The genomic resources for this crop are deficient due to its complicated genetic structure. Here, we report the complete nucleotide sequence of the chloroplast (cp) genome of sweet potato, which is a circular molecule of 161,303 bp in the typical quadripartite structure with large (LSC) and small (SSC) single-copy regions separated by a pair of inverted repeats (IRs). The chloroplast DNA contains a total of 145 genes, including 94 protein-encoding genes of which there are 72 single-copy and 11 double-copy genes. The organization and structure of the chloroplast genome (gene content and order, IR expansion/contraction, random repeating sequences, structural rearrangement) of sweet potato were compared with those of Ipomoea (L.) species and some basal important angiosperms, respectively. Some boundary gene-flow and gene gain-and-loss events were identified at intra- and inter-species levels. In addition, by comparing with the transcriptome sequences of sweet potato, the RNA editing events and differential expressions of the chloroplast functional-genes were detected. Moreover, phylogenetic analysis was conducted based on 77 protein-coding genes from 33 taxa and the result may contribute to a better understanding of the evolution progress of the genus Ipomoea (L.), including phylogenetic relationships, intraspecific differentiation and interspecific introgression.
Collapse
Affiliation(s)
- Lang Yan
- College of Life Sciences, Sichuan University, Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, Center for Functional Genomics and Bioinformatics, Chengdu, Sichuan, People's Republic of China
| | - Xianjun Lai
- Maize Research Institute of Sichuan Agriculture University, Key Laboratory of Crop Genetic Resource and Improvement, Ministry of Education, Wenjiang, Chengdu, People's Republic of China
| | - Xuedan Li
- College of Life Sciences, Sichuan University, Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, Center for Functional Genomics and Bioinformatics, Chengdu, Sichuan, People's Republic of China
| | - Changhe Wei
- College of Life Sciences, Sichuan University, Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, Center for Functional Genomics and Bioinformatics, Chengdu, Sichuan, People's Republic of China
| | - Xuemei Tan
- College of Life Sciences, Sichuan University, Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, Center for Functional Genomics and Bioinformatics, Chengdu, Sichuan, People's Republic of China
- * E-mail: (YZ); (XT)
| | - Yizheng Zhang
- College of Life Sciences, Sichuan University, Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, Sichuan Key Laboratory of Molecular Biology and Biotechnology, Center for Functional Genomics and Bioinformatics, Chengdu, Sichuan, People's Republic of China
- * E-mail: (YZ); (XT)
| |
Collapse
|
36
|
Carbonell-Caballero J, Alonso R, Ibañez V, Terol J, Talon M, Dopazo J. A Phylogenetic Analysis of 34 Chloroplast Genomes Elucidates the Relationships between Wild and Domestic Species within the Genus Citrus. Mol Biol Evol 2015; 32:2015-35. [PMID: 25873589 PMCID: PMC4833069 DOI: 10.1093/molbev/msv082] [Citation(s) in RCA: 198] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Citrus genus includes some of the most important cultivated fruit trees worldwide. Despite being extensively studied because of its commercial relevance, the origin of cultivated citrus species and the history of its domestication still remain an open question. Here, we present a phylogenetic analysis of the chloroplast genomes of 34 citrus genotypes which constitutes the most comprehensive and detailed study to date on the evolution and variability of the genus Citrus. A statistical model was used to estimate divergence times between the major citrus groups. Additionally, a complete map of the variability across the genome of different citrus species was produced, including single nucleotide variants, heteroplasmic positions, indels (insertions and deletions), and large structural variants. The distribution of all these variants provided further independent support to the phylogeny obtained. An unexpected finding was the high level of heteroplasmy found in several of the analyzed genomes. The use of the complete chloroplast DNA not only paves the way for a better understanding of the phylogenetic relationships within the Citrus genus but also provides original insights into other elusive evolutionary processes, such as chloroplast inheritance, heteroplasmy, and gene selection.
Collapse
Affiliation(s)
- Jose Carbonell-Caballero
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Roberto Alonso
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Victoria Ibañez
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias, Moncada, Valencia, Spain
| | - Javier Terol
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias, Moncada, Valencia, Spain
| | - Manuel Talon
- Centro de Genómica, Instituto Valenciano de Investigaciones Agrarias, Moncada, Valencia, Spain
| | - Joaquin Dopazo
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain Functional Genomics Node, Spanish National Institute of Bioinformatics at CIPF, Valencia, Spain
| |
Collapse
|
37
|
Curci PL, De Paola D, Danzi D, Vendramin GG, Sonnante G. Complete chloroplast genome of the multifunctional crop globe artichoke and comparison with other Asteraceae. PLoS One 2015; 10:e0120589. [PMID: 25774672 PMCID: PMC4361619 DOI: 10.1371/journal.pone.0120589] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 01/24/2015] [Indexed: 11/19/2022] Open
Abstract
With over 20,000 species, Asteraceae is the second largest plant family. High-throughput sequencing of nuclear and chloroplast genomes has allowed for a better understanding of the evolutionary relationships within large plant families. Here, the globe artichoke chloroplast (cp) genome was obtained by a combination of whole-genome and BAC clone high-throughput sequencing. The artichoke cp genome is 152,529 bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 25,155 bp, representing the longest IRs found in the Asteraceae family so far. The large (LSC) and the small (SSC) single-copy regions span 83,578 bp and 18,641 bp, respectively. The artichoke cp sequence was compared to the other eight Asteraceae complete cp genomes available, revealing an IR expansion at the SSC/IR boundary. This expansion consists of 17 bp of the ndhF gene generating an overlap between the ndhF and ycf1 genes. A total of 127 cp simple sequence repeats (cpSSRs) were identified in the artichoke cp genome, potentially suitable for future population studies in the Cynara genus. Parsimony-informative regions were evaluated and allowed to place a Cynara species within the Asteraceae family tree. The eight most informative coding regions were also considered and tested for “specific barcode” purpose in the Asteraceae family. Our results highlight the usefulness of cp genome sequencing in exploring plant genome diversity and retrieving reliable molecular resources for phylogenetic and evolutionary studies, as well as for specific barcodes in plants.
Collapse
Affiliation(s)
- Pasquale L. Curci
- Institute of Biosciences and Bioresources, National Research Council, Bari, Italy
| | - Domenico De Paola
- Institute of Biosciences and Bioresources, National Research Council, Bari, Italy
| | - Donatella Danzi
- Institute of Biosciences and Bioresources, National Research Council, Bari, Italy
| | - Giovanni G. Vendramin
- Institute of Biosciences and Bioresources, National Research Council, Sesto Fiorentino (FI), Italy
| | - Gabriella Sonnante
- Institute of Biosciences and Bioresources, National Research Council, Bari, Italy
- * E-mail:
| |
Collapse
|
38
|
Nock CJ, Baten A, King GJ. Complete chloroplast genome of Macadamia integrifolia confirms the position of the Gondwanan early-diverging eudicot family Proteaceae. BMC Genomics 2014; 15 Suppl 9:S13. [PMID: 25522147 PMCID: PMC4290595 DOI: 10.1186/1471-2164-15-s9-s13] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Sequence data from the chloroplast genome have played a central role in elucidating the evolutionary history of flowering plants, Angiospermae. In the past decade, the number of complete chloroplast genomes has burgeoned, leading to well-supported angiosperm phylogenies. However, some relationships, particulary among early-diverging lineages, remain unresolved. The diverse Southern Hemisphere plant family Proteaceae arose on the ancient supercontinent Gondwana early in angiosperm history and is a model group for adaptive radiation in response to changing climatic conditions. Genomic resources for the family are limited, and until now it is one of the few early-diverging 'basal eudicot' lineages not represented in chloroplast phylogenomic analyses. RESULTS The chloroplast genome of the Australian nut crop tree Macadamia integrifolia was assembled de novo from Illumina paired-end sequence reads. Three contigs, corresponding to a collapsed inverted repeat, a large and a small single copy region were identified, and used for genome reconstruction. The complete genome is 159,714 bp in length and was assembled at deep coverage (3.29 million reads; ~2000 x). Phylogenetic analyses based on 83-gene and inverted repeat region alignments, the largest sequence-rich datasets to include the basal eudicot family Proteaceae, provide strong support for a Proteales clade that includes Macadamia, Platanus and Nelumbo. Genome structure and content followed the ancestral angiosperm pattern and were highly conserved in the Proteales, whilst size differences were largely explained by the relative contraction of the single copy regions and expansion of the inverted repeats in Macadamia. CONCLUSIONS The Macadamia chloroplast genome presented here is the first in the Proteaceae, and confirms the placement of this family with the morphologically divergent Plantanaceae (plane tree family) and Nelumbonaceae (sacred lotus family) in the basal eudicot order Proteales. It provides a high-quality reference genome for future evolutionary studies and will be of benefit for taxon-rich phylogenomic analyses aimed at resolving relationships among early-diverging angiosperms, and more broadly across the plant tree of life.
Collapse
|
39
|
The American cranberry mitochondrial genome reveals the presence of selenocysteine (tRNA-Sec and SECIS) insertion machinery in land plants. Gene 2013; 536:336-43. [PMID: 24342657 DOI: 10.1016/j.gene.2013.11.104] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2013] [Revised: 11/25/2013] [Accepted: 11/29/2013] [Indexed: 01/06/2023]
Abstract
This is the first de novo assembly and annotation of a complete mitochondrial genome in the Ericales order from the American cranberry (Vaccinium macrocarpon Ait.). Moreover, only four complete Asterid mitochondrial genomes have been made publicly available. The cranberry mitochondrial genome was assembled and reconstructed from whole genome 454 Roche GS-FLX and Illumina shotgun sequences. Compared with other Asterids, the reconstruction of the genome revealed an average size mitochondrion (459,678 nt) with relatively little repetitive sequences and DNA of plastid origin. The complete mitochondrial genome of cranberry was annotated obtaining a total of 34 genes classified based on their putative function, plus three ribosomal RNAs, and 17 transfer RNAs. Maternal organellar cranberry inheritance was inferred by analyzing gene variation in the cranberry mitochondria and plastid genomes. The annotation of cranberry mitochondrial genome revealed the presence of two copies of tRNA-Sec and a selenocysteine insertion sequence (SECIS) element which were lost in plants during evolution. This is the first report of a land plant possessing selenocysteine insertion machinery at the sequence level.
Collapse
|