51
Wang Y, Wang S, Liu Y, Yuan Q, Sun J, Guo L. Chloroplast genome variation and phylogenetic relationships of Atractylodes species. BMC Genomics 2021; 22:103. [PMID: 33541261 PMCID: PMC7863269 DOI: 10.1186/s12864-021-07394-8] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Received: 08/12/2020] [Accepted: 01/19/2021] [Indexed: 12/21/2022] Open
Abstract
Background Atractylodes DC is the source plant of the widely used herbal medicines “Baizhu” and “Cangzhu” and a genus endemic to East Asia. Species within the genus differ only slightly in morphology, and the universal DNA barcodes can neither resolve the systematic relationships within the genus nor reliably identify its species. To address these questions, we sequenced the chloroplast genomes of all species of Atractylodes using high-throughput sequencing. Results The results indicate that the chloroplast genome of Atractylodes has a typical quadripartite structure and ranges from 152,294 bp (A. carlinoides) to 153,261 bp (A. macrocephala) in size. The genome of all species contains 113 genes, including 79 protein-coding genes, 30 transfer RNA genes and four ribosomal RNA genes. Four hotspots, rpl22-rps19-rpl2, psbM-trnD, trnR-trnT(GGU), and trnT(UGU)-trnL, and a total of 42–47 simple sequence repeats (SSR) were identified as the most promising variable markers for species delimitation and population genetic studies. Phylogenetic analyses of the whole chloroplast genomes indicate that Atractylodes is a clade within the tribe Cynareae; Atractylodes species form a monophyletic group that clearly reflects the relationships within the genus. Conclusions Our study included investigations of the sequences and structural genomic variations, phylogenetics and mutation dynamics of Atractylodes chloroplast genomes and will facilitate future studies in population genetics, taxonomy and species identification. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07394-8.
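The abstract above reports 42–47 simple sequence repeats per genome but does not say how they were detected. As an illustration only, the following Python sketch shows one common way to find perfect SSRs with regular expressions. The function name `find_ssrs` and the minimum repeat thresholds (10, 5 and 4 repeats for mono-, di- and trinucleotide motifs) are assumptions chosen for the example, not the settings used by the authors.

```python
import re


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs.

    min_repeats maps motif length to the minimum number of tandem copies.
    The default thresholds are illustrative assumptions, not values
    taken from the study.
    """
    if min_repeats is None:
        min_repeats = {1: 10, 2: 5, 3: 4}
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # One motif of unit_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs such as "AA" or "AAA", which are really
            # mononucleotide runs and are reported by the unit_len == 1 pass.
            if unit_len > 1 and len(set(motif)) == 1:
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


if __name__ == "__main__":
    toy = "GC" + "A" * 12 + "GCCG" + "AT" * 6 + "GGC"
    for start, motif, count in find_ssrs(toy):
        print(start, motif, count)
```

Dedicated SSR tools also handle compound and imperfect repeats, which this sketch deliberately ignores.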
Affiliation(s)
- Yiheng Wang
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
- Sheng Wang
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
- Yanlei Liu
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
- Qingjun Yuan
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
- Jiahui Sun
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
- Lanping Guo
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
52
Gargiulo R, Kull T, Fay MF. Effective double-digest RAD sequencing and genotyping despite large genome size. Mol Ecol Resour 2021; 21:1037-1055. [PMID: 33351289 DOI: 10.1111/1755-0998.13314] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 05/15/2020] [Revised: 12/03/2020] [Accepted: 12/14/2020] [Indexed: 11/28/2022]
Abstract
Obtaining informative data is the ambition of any genomic project, but in nonmodel species with very large genomes, pursuing such a goal requires surmounting a series of analytical challenges. Double-digest RAD sequencing is routinely used in nonmodel organisms and offers some control over the volume of data obtained. However, the volume of data recovered is not always an indication of the reliability of data sets, and quality checks are necessary to ensure that true and artefactual information is set apart. In the present study, we aim to fill the gap existing between the known applicability of RAD sequencing methods in plants with large genomes and the use of the retrieved loci for population genetic inference. By analysing two populations of Cypripedium calceolus, a nonmodel orchid species with a large genome size (1C ~ 31.6 Gbp), we provide a complete workflow from library preparation to bioinformatic filtering and inference of genetic diversity and differentiation. We show how filtering strategies to dismiss potentially misleading data need to be explored and adapted to data set-specific features. Moreover, we suggest that the occurrence of organellar sequences in libraries should not be neglected when planning the experiment and analysing the results. Finally, we explain how, in the absence of prior information about the genome of the species, seeking high standards of quality during library preparation and sequencing can provide an insurance against unpredicted technical or biological constraints.
Affiliation(s)
- Tiiu Kull
  - Estonian University of Life Sciences, Tartu, Estonia
- Michael F Fay
  - Royal Botanic Gardens, Kew, Richmond, Surrey, UK
  - School of Biological Sciences, University of Western Australia, Crawley, WA, Australia
53
The Complete Plastid Genome of Artocarpus camansi: A High Degree of Conservation of the Plastome Structure in the Family Moraceae. FORESTS 2020. [DOI: 10.3390/f11111179] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 01/31/2023]
Abstract
Understanding the plastid genome is extremely important for the interpretation of the genetic mechanisms associated with essential physiological and metabolic functions, the identification of possible marker regions for phylogenetic or phylogeographic analyses, and the elucidation of the modes through which natural selection operates in different regions of this genome. In the present study, we assembled the plastid genome of Artocarpus camansi, compared its repetitive structures with Artocarpus heterophyllus, and searched for evidence of synteny within the family Moraceae. We also constructed a phylogeny based on 56 chloroplast genes to assess the relationships among three families of the order Rosales, that is, the Moraceae, Rhamnaceae, and Cannabaceae. The plastid genome of A. camansi has 160,096 bp, and presents the typical circular quadripartite structure of the Angiosperms, comprising a large single copy (LSC) of 88,745 bp and a small single copy (SSC) of 19,883 bp, separated by a pair of inverted repeat (IR) regions each with a length of 25,734 bp. The total GC content was 36.0%, which is very similar to Artocarpus heterophyllus (36.1%) and other moraceous species. A total of 23,068 codons and 80 SSRs were identified in the A. camansi plastid genome, with the majority of the SSRs being mononucleotide (70.0%). A total of 50 repeat structures were observed in the A. camansi plastid genome, in contrast with 61 repeats in A. heterophyllus. A purifying selection signal was found in 70 of the 79 protein-coding genes, indicating that they have all been highly conserved throughout the evolutionary history of the genus. The comparative analysis of the structural characteristics of the chloroplast among different moraceous species found a high degree of similarity in the sequences, which indicates a highly conserved evolutionary model in these plastid genomes. The phylogenetic analysis also recovered a high degree of similarity between the chloroplast genes of A. camansi and A. heterophyllus, and reconfirmed the hypothesis of the intense conservation of the plastome in the family Moraceae.
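The region lengths quoted in this abstract can be checked against the reported genome size, because a quadripartite plastome consists of one LSC, one SSC and two copies of the IR. A minimal Python sketch of that arithmetic follows; the function name `plastome_length` is hypothetical, and the numbers are taken directly from the abstract.

```python
def plastome_length(lsc, ssc, ir):
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region lengths (bp) reported for Artocarpus camansi in the abstract.
total = plastome_length(lsc=88_745, ssc=19_883, ir=25_734)
print(total)  # 160096, matching the reported genome size of 160,096 bp
```

The same check is a quick sanity test for any annotated plastome before downstream comparative analyses.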
54
Freudenthal JA, Pfaff S, Terhoeven N, Korte A, Ankenbrand MJ, Förster F. A systematic comparison of chloroplast genome assembly tools. Genome Biol 2020; 21:254. [PMID: 32988404 PMCID: PMC7520963 DOI: 10.1186/s13059-020-02153-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Received: 05/20/2020] [Accepted: 08/22/2020] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Chloroplasts are intracellular organelles that enable plants to conduct photosynthesis. They arose through the symbiotic integration of a prokaryotic cell into a eukaryotic host cell and still contain their own genomes with distinct genomic information. Plastid genomes accommodate essential genes and are regularly utilized in biotechnology or phylogenetics. Different assemblers that are able to assess the plastid genome have been developed. These assemblers often use data of whole genome sequencing experiments, which usually contain reads from the complete chloroplast genome. RESULTS The performance of different assembly tools has never been systematically compared. Here, we present a benchmark of seven chloroplast assembly tools, capable of succeeding in more than 60% of known real data sets. Our results show significant differences between the tested assemblers in terms of generating whole chloroplast genome sequences and computational requirements. The examination of 105 data sets from species with unknown plastid genomes leads to the assembly of 20 novel chloroplast genomes. CONCLUSIONS We create docker images for each tested tool that are freely available for the scientific community and ensure reproducibility of the analyses. These containers allow the analysis and screening of data sets for chloroplast genomes using standard computational infrastructure. Thus, large-scale screening for chloroplasts within genomic sequencing data is feasible.
Affiliation(s)
- Jan A. Freudenthal
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074 Germany
  - AnaLife Data Science, Wiesengrund 16, Würzburg, 97295 Waldbrunn Germany
- Simon Pfaff
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074 Germany
  - Department of Bioinformatics, University of Würzburg, Biozentrum, Am Hubland, Würzburg, 97074 Germany
- Niklas Terhoeven
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074 Germany
  - AnaLife Data Science, Wiesengrund 16, Würzburg, 97295 Waldbrunn Germany
- Arthur Korte
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074 Germany
- Markus J. Ankenbrand
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074 Germany
  - AnaLife Data Science, Wiesengrund 16, Würzburg, 97295 Waldbrunn Germany
  - Chair of Cellular and Molecular Imaging, Comprehensive Heart Failure Center, University Hospital Würzburg, Josef-Schneider-Str. 2, Würzburg, 97080 Germany
- Frank Förster
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074 Germany
  - Department of Bioinformatics, University of Würzburg, Biozentrum, Am Hubland, Würzburg, 97074 Germany
  - Fraunhofer IME-BR, Ohlebergsweg 12, Gießen, 35392 Germany
  - Bioinformatics Core Facility of the University of Gießen, Heinrich-Buff-Ring 58, Gießen, 35392 Germany
55
Freudenthal JA, Pfaff S, Terhoeven N, Korte A, Ankenbrand MJ, Förster F. A systematic comparison of chloroplast genome assembly tools. Genome Biol 2020; 21:254. [PMID: 32988404 DOI: 10.1101/665869] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 05/20/2020] [Accepted: 08/22/2020] [Indexed: 05/21/2023] Open
Abstract
BACKGROUND Chloroplasts are intracellular organelles that enable plants to conduct photosynthesis. They arose through the symbiotic integration of a prokaryotic cell into a eukaryotic host cell and still contain their own genomes with distinct genomic information. Plastid genomes accommodate essential genes and are regularly utilized in biotechnology or phylogenetics. Different assemblers that are able to assess the plastid genome have been developed. These assemblers often use data of whole genome sequencing experiments, which usually contain reads from the complete chloroplast genome. RESULTS The performance of different assembly tools has never been systematically compared. Here, we present a benchmark of seven chloroplast assembly tools, capable of succeeding in more than 60% of known real data sets. Our results show significant differences between the tested assemblers in terms of generating whole chloroplast genome sequences and computational requirements. The examination of 105 data sets from species with unknown plastid genomes leads to the assembly of 20 novel chloroplast genomes. CONCLUSIONS We create docker images for each tested tool that are freely available for the scientific community and ensure reproducibility of the analyses. These containers allow the analysis and screening of data sets for chloroplast genomes using standard computational infrastructure. Thus, large-scale screening for chloroplasts within genomic sequencing data is feasible.
Affiliation(s)
- Jan A Freudenthal
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074, Germany
- Simon Pfaff
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074, Germany
  - AnaLife Data Science, Wiesengrund 16, Würzburg, 97295 Waldbrunn, Germany
  - Chair of Cellular and Molecular Imaging, Comprehensive Heart Failure Center, University Hospital Würzburg, Josef-Schneider-Str. 2, Würzburg, 97080, Germany
- Niklas Terhoeven
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074, Germany
  - AnaLife Data Science, Wiesengrund 16, Würzburg, 97295 Waldbrunn, Germany
- Arthur Korte
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074, Germany
  - Department of Bioinformatics, University of Würzburg, Biozentrum, Am Hubland, Würzburg, 97074, Germany
- Markus J Ankenbrand
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074, Germany
  - AnaLife Data Science, Wiesengrund 16, Würzburg, 97295 Waldbrunn, Germany
- Frank Förster
  - Center for Computational and Theoretical Biology, University of Würzburg, Campus Hubland Nord, Würzburg, 97074, Germany
  - Department of Bioinformatics, University of Würzburg, Biozentrum, Am Hubland, Würzburg, 97074, Germany
  - Fraunhofer IME-BR, Ohlebergsweg 12, Gießen, 35392, Germany
  - Bioinformatics Core Facility of the University of Gießen, Heinrich-Buff-Ring 58, Gießen, 35392, Germany
56
Zheng S, Poczai P, Hyvönen J, Tang J, Amiryousefi A. Chloroplot: An Online Program for the Versatile Plotting of Organelle Genomes. Front Genet 2020; 11:576124. [PMID: 33101394 PMCID: PMC7545089 DOI: 10.3389/fgene.2020.576124] [Citation(s) in RCA: 134] [Impact Index Per Article: 26.8] [Received: 06/25/2020] [Accepted: 08/28/2020] [Indexed: 11/13/2022] Open
Abstract
Understanding the complexity of genomic structures and their unique architecture is linked with the power of visualization tools used to represent these features. Such tools should be able to provide a realistic and scalable version of genomic content. Here, we present an online organelle plotting tool focused on chloroplasts, which was developed to visualize the exclusive structure of these genomes. The distinguishing features of this program include its ability to represent the small single-copy (SSC) region in reverse complement and to depict the codon usage bias index for each gene, along with the option of showing the minor mismatches between inverted repeat (IR) regions and user-specified plotting layers. The versatile color schemes and diverse functionalities of the program are specifically designed to give an accurate, scalable representation of plastid genomes. We introduce a Shiny app website for easy use of the program; a more advanced application of the tool is possible by further development and modification of the downloadable source code provided online. The software and its libraries are completely coded in R, available at https://irscope.shinyapps.io/chloroplot/.
Affiliation(s)
- Shuyu Zheng
  - Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Peter Poczai
  - Finnish Museum of Natural History (Botany), University of Helsinki, Helsinki, Finland
  - Department of Biosciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
- Jaakko Hyvönen
  - Finnish Museum of Natural History (Botany), University of Helsinki, Helsinki, Finland
  - Department of Biosciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
- Jing Tang
  - Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Ali Amiryousefi
  - Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
57
Jin JJ, Yu WB, Yang JB, Song Y, dePamphilis CW, Yi TS, Li DZ. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol 2020; 21:241. [PMID: 32912315 PMCID: PMC7488116 DOI: 10.1186/s13059-020-02154-5] [Citation(s) in RCA: 1574] [Impact Index Per Article: 314.8] [Received: 01/16/2020] [Accepted: 08/24/2020] [Indexed: 12/13/2022] Open
Abstract
GetOrganelle is a state-of-the-art toolkit to accurately assemble organelle genomes from whole genome sequencing data. It recruits organelle-associated reads using a modified "baiting and iterative mapping" approach, conducts de novo assembly, filters and disentangles the assembly graph, and produces all possible configurations of circular organelle genomes. For 50 published plant datasets, we are able to reassemble the circular plastomes from 47 datasets using GetOrganelle. GetOrganelle assemblies are more accurate than published and/or NOVOPlasty-reassembled plastomes as assessed by mapping. We also assemble complete mitochondrial genomes using GetOrganelle. GetOrganelle is freely released under a GPL-3 license (https://github.com/Kinggerm/GetOrganelle).
Affiliation(s)
- Jian-Jun Jin
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
- Wen-Bin Yu
  - Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Southeast Asia Biodiversity Research Institute, Chinese Academy of Sciences, Yezin, Nay Pyi Taw, 05282, Myanmar
- Jun-Bo Yang
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
- Yu Song
  - Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Southeast Asia Biodiversity Research Institute, Chinese Academy of Sciences, Yezin, Nay Pyi Taw, 05282, Myanmar
- Claude W dePamphilis
  - Department of Biology, The Pennsylvania State University, University Park, PA, 16801, USA
- Ting-Shuang Yi
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
- De-Zhu Li
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
58
Jin JJ, Yu WB, Yang JB, Song Y, dePamphilis CW, Yi TS, Li DZ. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol 2020. [PMID: 32912315 DOI: 10.1101/256479] [Citation(s) in RCA: 122] [Impact Index Per Article: 24.4] [Indexed: 05/17/2023] Open
Abstract
GetOrganelle is a state-of-the-art toolkit to accurately assemble organelle genomes from whole genome sequencing data. It recruits organelle-associated reads using a modified "baiting and iterative mapping" approach, conducts de novo assembly, filters and disentangles the assembly graph, and produces all possible configurations of circular organelle genomes. For 50 published plant datasets, we are able to reassemble the circular plastomes from 47 datasets using GetOrganelle. GetOrganelle assemblies are more accurate than published and/or NOVOPlasty-reassembled plastomes as assessed by mapping. We also assemble complete mitochondrial genomes using GetOrganelle. GetOrganelle is freely released under a GPL-3 license (https://github.com/Kinggerm/GetOrganelle).
Affiliation(s)
- Jian-Jun Jin
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
- Wen-Bin Yu
  - Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Southeast Asia Biodiversity Research Institute, Chinese Academy of Sciences, Yezin, Nay Pyi Taw, 05282, Myanmar
- Jun-Bo Yang
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
- Yu Song
  - Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Mengla, Yunnan, 666303, China
  - Southeast Asia Biodiversity Research Institute, Chinese Academy of Sciences, Yezin, Nay Pyi Taw, 05282, Myanmar
- Claude W dePamphilis
  - Department of Biology, The Pennsylvania State University, University Park, PA, 16801, USA
- Ting-Shuang Yi
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
- De-Zhu Li
  - Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, 650201, China
59
Sharpe RM, Williamson-Benavides B, Edwards GE, Dhingra A. Methods of analysis of chloroplast genomes of C3, Kranz type C4 and Single Cell C4 photosynthetic members of Chenopodiaceae. PLANT METHODS 2020; 16:119. [PMID: 32874195 PMCID: PMC7457496 DOI: 10.1186/s13007-020-00662-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 05/07/2020] [Accepted: 08/20/2020] [Indexed: 06/11/2023]
Abstract
BACKGROUND Chloroplast genome information is critical to understanding forms of photosynthesis in the plant kingdom. During the evolutionary process, plants have developed different photosynthetic strategies that are accompanied by complementary biochemical and anatomical features. Members of family Chenopodiaceae have species with C3 photosynthesis, and variations of C4 photosynthesis in which photorespiration is reduced by concentrating CO2 around Rubisco through dual coordinated functioning of dimorphic chloroplasts. Among dicots, the family has the largest number of C4 species, and greatest structural and biochemical diversity in forms of C4 including the canonical dual-cell Kranz anatomy, and the recently identified single cell C4 with the presence of dimorphic chloroplasts separated by a vacuole. This is the first comparative analysis of chloroplast genomes in species representative of photosynthetic types in the family. RESULTS Methodology with high throughput sequencing complemented with Sanger sequencing of selected loci provided high quality and complete chloroplast genomes of seven species in the family and one species in the closely related Amaranthaceae family, representing C3, Kranz type C4 and single cell C4 (SSC4) photosynthesis. Six of the eight chloroplast genomes are new, while two are improved versions of previously published genomes. The depth of coverage obtained using high-throughput sequencing complemented with targeted resequencing of certain loci enabled superior resolution of the border junctions, directionality and repeat region sequences. Comparison of the chloroplast genomes with previously sequenced plastid genomes revealed similar genome organization, gene order and content with a few revisions. High-quality complete chloroplast genome sequences resulted in correcting the orientation of the LSC region of the published Bienertia sinuspersici chloroplast genome, identification of stop codons in the rpl23 gene in B. sinuspersici and B. cycloptera, and identifying an instance of IR expansion in the Haloxylon ammodendron inverted repeat sequence. A rare mitochondria-to-chloroplast inter-organellar gene transfer event was identified in family Chenopodiaceae. CONCLUSIONS This study reports complete chloroplast genomes from seven Chenopodiaceae and one Amaranthaceae species. The depth of coverage obtained using high-throughput sequencing complemented with targeted resequencing of certain loci enabled superior resolution of the border junctions, directionality, and repeat region sequences. Therefore, the use of high throughput and Sanger sequencing, in a hybrid method, is reaffirmed as rapid, efficient, and reliable for chloroplast genome sequencing.
Affiliation(s)
- Richard M. Sharpe
  - Department of Horticulture, Washington State University, Pullman, WA 99164 USA
- Bruce Williamson-Benavides
  - Department of Horticulture, Washington State University, Pullman, WA 99164 USA
  - Molecular Plants Sciences, Washington State University, Pullman, WA 99164 USA
- Gerald E. Edwards
  - Molecular Plants Sciences, Washington State University, Pullman, WA 99164 USA
  - School of Biological Sciences, Washington State University, Pullman, WA 99164 USA
- Amit Dhingra
  - Department of Horticulture, Washington State University, Pullman, WA 99164 USA
  - Molecular Plants Sciences, Washington State University, Pullman, WA 99164 USA
60
Liu L, Du Y, Shen C, Li R, Lee J, Li P. The complete chloroplast genome of Papaver setigerum and comparative analyses in Papaveraceae. Genet Mol Biol 2020; 43:e20190272. [PMID: 32808964 PMCID: PMC7433754 DOI: 10.1590/1678-4685-gmb-2019-0272] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 08/08/2019] [Accepted: 05/08/2020] [Indexed: 11/22/2022] Open
Abstract
Papaver setigerum is an annual herb that is closely related to the opium poppy, P. somniferum. Genetic resources for P. setigerum are scarce. In the present study, we assembled the complete chloroplast (cp) genome of P. setigerum based on genome skimming data, and we conducted comparative cp genome analyses to study the evolutionary pattern in Papaveraceae. The cp genome of P. setigerum is 152,862 bp in length with a typical quadripartite structure. Comparative analyses revealed no gene rearrangement in the Papaveraceae family, although differences were evident in genome size, gene losses, as well as inverted repeats (IR) region expansion and contraction. The rps15 gene has been lost from the genomes of Meconopsis racemosa, Coreanomecon hylomeconoides, P. orientale, P. somniferum, and P. setigerum, and the ycf15 gene is found only in C. hylomeconoides. Moreover, 13 cpDNA markers, including psbA-trnH, rps16-trnQ, trnS-trnG, trnC-petN, trnE-trnT, trnL-trnF, trnF-ndhJ, petA-psbJ, ndhF-rpl32, rpl32-trnL, ccsA-ndhD, ndhE-ndhG, and rps15-ycf1, were identified with relatively high levels of variation within Papaver, which will be useful for species identification in this genus. Among those markers, psbA-trnH is the best one to distinguish P. somniferum and P. setigerum.
Affiliation(s)
- Luxian Liu
  - Henan University, School of Life Sciences, Key Laboratory of Plant Stress Biology, Kaifeng, China
- Yingxue Du
  - Henan University, School of Life Sciences, Key Laboratory of Plant Stress Biology, Kaifeng, China
- Cheng Shen
  - Zhejiang University, College of Life Sciences, The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, Hangzhou, China
- Rui Li
  - Food inspection and Testing Institute of Henan Province, Physical and Chemical Laboratory, Zhengzhou, China
- Joongku Lee
  - Chungnam National University, Department of Environment and Forest Resources, Daejeon, South Korea
- Pan Li
  - Zhejiang University, College of Life Sciences, The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, Hangzhou, China
61
Liu Y, Tseng YH, Yang HA, Hu AQ, Xu WB, Lin CW, Kono Y, Chang CC, Peng CI, Chung KF. Six new species of Begonia from Guangxi, China. BOTANICAL STUDIES 2020; 61:21. [PMID: 32734318 PMCID: PMC7393003 DOI: 10.1186/s40529-020-00298-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 03/25/2020] [Accepted: 07/16/2020] [Indexed: 06/11/2023]
Abstract
BACKGROUND With 1980 species currently described, the mega-diverse Begonia is now perhaps the 5th largest flowering plant genus, having expanded rapidly from ca. 900 species in 1997 to its current size in merely two decades. In continuation of our studies of Asian Begonia, we report six additional new species from Guangxi, the region/province harboring the second richest Begonia flora of China. RESULTS Based on morphological and molecular data, the new species B. aurora belongs to Begonia sect. Platycentrum, while the other five new species (viz. B. larvata, B. longiornithophylla, B. lui, B. scabrifolia, and B. zhuoyuniae) are members of sect. Coelocentrum. Somatic chromosome numbers of B. longiornithophylla and B. zhuoyuniae at metaphase were counted as 2n = 30, consistent with previous reports for sect. Coelocentrum. CONCLUSIONS With the addition of the six new species, the total number of Begonia species in Guangxi increases from 86 to 92. Detailed descriptions, line drawings, and color plates are provided to aid in identification.
Affiliation(s)
- Yan Liu
  - Guangxi Key Laboratory of Plant Conservation and Restoration Ecology in Karst Terrain, Guangxi Institute of Botany, Guangxi Zhuang Autonomous Region, Chinese Academy of Sciences, Guilin, Guangxi China
- Yu-Hsin Tseng
  - Research Museum and Herbarium (HAST), Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Hsun-An Yang
  - Research Museum and Herbarium (HAST), Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Ai-Qun Hu
  - Research Museum and Herbarium (HAST), Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Wei-Bin Xu
  - Guangxi Key Laboratory of Plant Conservation and Restoration Ecology in Karst Terrain, Guangxi Institute of Botany, Guangxi Zhuang Autonomous Region, Chinese Academy of Sciences, Guilin, Guangxi China
- Che-Wei Lin
  - Herbarium (TAIF), Taiwan Forestry Research Institute, Taipei, Taiwan
- Yoshiko Kono
  - Research Museum and Herbarium (HAST), Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
  - The Community Center for the Advancement of Education and Research, University of Kochi, Kochi, Japan
- Chiung-Chih Chang
  - Research Museum and Herbarium (HAST), Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Ching-I Peng
  - Research Museum and Herbarium (HAST), Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
- Kuo-Fang Chung
  - Research Museum and Herbarium (HAST), Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
62
|
Gruenstaeudl M, Jenke N. PACVr: plastome assembly coverage visualization in R. BMC Bioinformatics 2020; 21:207. [PMID: 32448146 PMCID: PMC7245912 DOI: 10.1186/s12859-020-3475-0] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Received: 08/06/2019] [Accepted: 03/31/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Plastid genomes typically display a circular, quadripartite structure with two inverted repeat regions, which challenges automatic assembly procedures. The correct assembly of plastid genomes is a prerequisite for the validity of subsequent analyses on genome structure and evolution. The average coverage depth of a genome assembly is often used as an indicator of assembly quality. Visualizing coverage depth across a draft genome is a critical step, which allows users to inspect the quality of the assembly and, where applicable, identify regions of reduced assembly confidence. Despite the interplay between genome structure and assembly quality, no contemporary, user-friendly software tool can visualize the coverage depth of a plastid genome assembly while taking its quadripartite genome structure into account. A software tool is needed that fills this void. RESULTS We introduce 'PACVr', an R package that visualizes the coverage depth of a plastid genome assembly in relation to the circular, quadripartite structure of the genome as well as the individual plastome genes. By using a variable window approach, the tool allows visualizations on different calculation scales. It also confirms sequence equality of, as well as visualizes gene synteny between, the inverted repeat regions of the input genome. As a tool for plastid genomics, PACVr provides the functionality to identify regions of coverage depth above or below user-defined threshold values and helps to identify non-identical IR regions. To allow easy integration into bioinformatic workflows, PACVr can be invoked from a Unix shell, facilitating its use in automated quality control. We illustrate the application of PACVr on four empirical datasets and compare visualizations generated by PACVr with those of alternative software tools. 
CONCLUSIONS PACVr provides a user-friendly tool to visualize (a) the coverage depth of a plastid genome assembly on a circular, quadripartite plastome map and in relation to individual plastome genes, and (b) gene synteny across the inverted repeat regions. It contributes to optimizing plastid genome assemblies and increasing the reliability of publicly available plastome sequences. The software, example datasets, technical documentation, and a tutorial are available with the package at https://cran.r-project.org/package=PACVr.
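The window-based coverage check described in this abstract can be illustrated with a short Python sketch. This is not PACVr's code (PACVr is an R package); the function names, the non-overlapping window scheme, and the threshold values are assumptions chosen only to show the idea of averaging per-base depth in windows and flagging windows outside user-defined bounds.

```python
from typing import List, Tuple

Window = Tuple[int, int, float]  # (start, end, mean depth), end exclusive


def window_means(depth: List[int], window: int) -> List[Window]:
    """Mean coverage depth over consecutive, non-overlapping windows.

    `depth` holds one per-base depth value per genome position
    (hypothetical input; a real workflow would derive it from a BAM file).
    """
    if window <= 0:
        raise ValueError("window must be positive")
    out: List[Window] = []
    for start in range(0, len(depth), window):
        chunk = depth[start:start + window]
        out.append((start, start + len(chunk), sum(chunk) / len(chunk)))
    return out


def flag_windows(depth: List[int], window: int,
                 low: float, high: float) -> List[Window]:
    """Return the windows whose mean depth falls below `low` or above `high`."""
    return [w for w in window_means(depth, window) if w[2] < low or w[2] > high]


if __name__ == "__main__":
    # Toy depth profile: normal, under-covered, then over-covered stretch.
    toy_depth = [10] * 4 + [2] * 4 + [50] * 4
    for start, end, mean in flag_windows(toy_depth, window=4, low=5, high=40):
        print(f"positions {start}-{end}: mean depth {mean:.1f}")
```

Changing `window` changes the calculation scale: smaller windows expose short coverage dips, larger windows smooth them out.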
Affiliation(s)
- Michael Gruenstaeudl
  - Institut für Biologie, Systematische Botanik und Pflanzengeographie, Freie Universität Berlin, Berlin, 14195 Germany
- Nils Jenke
  - Institut für Bioinformatik, Freie Universität Berlin, Berlin, 14195 Germany
|
63
|
Li H, Guo Q, Li Q, Yang L. Long-reads reveal that Rhododendron delavayi plastid genome contains extensive repeat sequences, and recombination exists among plastid genomes of photosynthetic Ericaceae. PeerJ 2020; 8:e9048. [PMID: 32351791 PMCID: PMC7183307 DOI: 10.7717/peerj.9048] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 12/18/2019] [Accepted: 04/02/2020] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Rhododendron delavayi Franch. var. delavayi is a wild ornamental plant species in Guizhou Province, China. The lack of plastid genome information for this species seriously hinders the further application and conservation of this valuable resource. METHODS The complete plastid genome of R. delavayi was assembled from long sequence reads. The genome was then characterized and compared with those of other photosynthetic Ericaceae species. RESULTS The plastid genome of R. delavayi has a typical quadripartite structure and a length of 202,169 bp. It contains a large number of repeat sequences and shows codon usage bias. The comparative analysis revealed irregular recombination of gene sets, including rearrangement and inversion, in the large single copy region. The extreme expansion of the inverted repeat region shortened the small single copy region and expanded the full length of the genome. In addition, consistent with traditional taxonomy, R. delavayi and nine other species of the same family were clustered into Ericaceae based on the homologous protein-coding sequences of the plastid genomes. Thus, the long-read assembly of the plastid genome of R. delavayi provides basic information for the further study of the evolution, genetic diversity, and conservation of R. delavayi and its relatives.
Affiliation(s)
- Huie Li
  - College of Agriculture, Guizhou University, Guiyang, Guizhou, China
- Qiqiang Guo
  - Institute for Forest Resources & Environment of Guizhou, Guizhou University, Guiyang, Guizhou, China
- Qian Li
  - College of Agriculture, Guizhou University, Guiyang, Guizhou, China
- Lan Yang
  - College of Agriculture, Guizhou University, Guiyang, Guizhou, China
|
64
|
Omelchenko DO, Makarenko MS, Kasianov AS, Schelkunov MI, Logacheva MD, Penin AA. Assembly and Analysis of the Complete Mitochondrial Genome of Capsella bursa-pastoris. PLANTS (BASEL, SWITZERLAND) 2020; 9:E469. [PMID: 32276324 PMCID: PMC7238199 DOI: 10.3390/plants9040469] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 03/04/2020] [Revised: 03/24/2020] [Accepted: 04/04/2020] [Indexed: 12/11/2022]
Abstract
Shepherd's purse (Capsella bursa-pastoris) is a cosmopolitan annual weed and a promising model plant for studying allopolyploidization in the evolution of angiosperms. Though plant mitochondrial genomes are a valuable source of genetic information, they are hard to assemble. At present, the only complete mitogenome available for the genus Capsella is that of C. rubella. In this work, we have assembled the complete mitogenome of C. bursa-pastoris using high-precision PacBio SMRT third-generation sequencing technology. It is 287,799 bp long and contains 32 protein-coding genes, 3 rRNAs, 25 tRNAs corresponding to 15 amino acids, and 8 open reading frames (ORFs) supported by RNAseq data. Though many repeat regions have been found, none of them is longer than 1 kbp, and the most frequent structural variant originating from these repeats is present in only 4% of the mitogenome copies. The mitochondrial DNA sequence of C. bursa-pastoris differs from that of C. rubella, but not from that of C. orientalis, by two long inversions, suggesting that C. orientalis could be its maternal progenitor species. In total, 377 C to U RNA editing sites have been detected. All genes except cox1 and atp8 contain RNA editing sites, and most of them lead to non-synonymous changes of amino acids. Most of the identified RNA editing sites are identical to corresponding RNA editing sites in A. thaliana.
Affiliation(s)
- Denis O. Omelchenko
  - Institute for Information Transmission Problems of the Russian Academy of Sciences, 127051 Moscow, Russia
- Maxim S. Makarenko
  - Institute for Information Transmission Problems of the Russian Academy of Sciences, 127051 Moscow, Russia
- Artem S. Kasianov
  - Institute for Information Transmission Problems of the Russian Academy of Sciences, 127051 Moscow, Russia
- Mikhail I. Schelkunov
  - Institute for Information Transmission Problems of the Russian Academy of Sciences, 127051 Moscow, Russia
  - Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
- Maria D. Logacheva
  - Institute for Information Transmission Problems of the Russian Academy of Sciences, 127051 Moscow, Russia
  - Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
- Aleksey A. Penin
  - Institute for Information Transmission Problems of the Russian Academy of Sciences, 127051 Moscow, Russia
|
65
|
Armijos Carrion AD, Hinsinger DD, Strijk JS. ECuADOR-Easy Curation of Angiosperm Duplicated Organellar Regions, a tool for cleaning and curating plastomes assembled from next generation sequencing pipelines. PeerJ 2020; 8:e8699. [PMID: 32292644 PMCID: PMC7147433 DOI: 10.7717/peerj.8699] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 10/28/2019] [Accepted: 02/06/2020] [Indexed: 11/25/2022] Open
Abstract
Background With the rapid increase in availability of genomic resources offered by Next-Generation Sequencing (NGS) and the availability of free online genomic databases, efficient and standardized metadata curation approaches have become increasingly critical for the post-processing stages of biological data. Especially in organelle-based studies using circular chloroplast genome datasets, the assembly of the main structural regions in random order and orientation represents a major limitation in our ability to easily generate “ready-to-align” datasets for phylogenetic reconstruction, at both small and large taxonomic scales. In addition, current practices discard the most variable regions of the genomes to facilitate the alignment of the remaining coding regions. Nevertheless, no software is currently available to perform curation to such a degree, through simple detection, organization and positioning of the main plastome regions, making it a time-consuming and error-prone process. Here we introduce ECuADOR, a fast and user-friendly Perl script specifically designed to automate the detection and reorganization of newly assembled plastomes obtained from any available source (NGS, Sanger sequencing or assembler output). Methods ECuADOR uses a sliding-window approach to detect long repeated sequences in draft sequences, identifies the inverted repeat regions (IRs) even in case of artifactual breaks or sequencing errors, and automates the rearrangement of the sequence to the widely used LSC–IRb–SSC–IRa order. This facilitates rapid post-editing steps such as creation of genome alignments, detection of variable regions, SNP detection and phylogenomic analyses. Results ECuADOR was successfully tested on plant families throughout the angiosperm phylogeny by curating 161 chloroplast datasets. ECuADOR first identified and reordered the central regions (LSC–IRb–SSC–IRa) for each dataset and then produced a new annotation for the chloroplast sequences. The process took less than 20 min with a maximum memory requirement of 150 MB and an accuracy of over 99%. Conclusions ECuADOR is the sole de novo one-step recognition and reordering tool that facilitates the post-processing analysis of extranuclear genomes from NGS data. The program is available at https://github.com/BiodivGenomic/ECuADOR/.
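The inverted-repeat detection step described in this abstract can be sketched in Python. This is not ECuADOR's Perl implementation; it is a toy illustration, assuming exact matches only, that seeds with k-mers shared between the sequence and its reverse complement and then extends each seed. The function names, the parameter `k`, and the toy sequence are all hypothetical, and the approach is too slow for real plastomes without further optimisation.

```python
from typing import Dict, List, Optional, Tuple


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_inverted_repeat(seq: str, k: int = 8) -> Optional[Tuple[int, int, int]]:
    """Longest exact inverted repeat found by k-mer seeding and extension.

    Returns (start_a, start_b, length) in forward-strand coordinates, where
    seq[start_a:start_a+length] equals the reverse complement of
    seq[start_b:start_b+length] and copy A lies entirely before copy B.
    """
    n = len(seq)
    rc = reverse_complement(seq)
    # Index every k-mer of the reverse-complemented sequence.
    index: Dict[str, List[int]] = {}
    for j in range(n - k + 1):
        index.setdefault(rc[j:j + k], []).append(j)

    best: Optional[Tuple[int, int, int]] = None
    for i in range(n - k + 1):
        for j in index.get(seq[i:i + k], ()):
            a, b = i, j
            # Extend the seed to the left while bases keep matching.
            while a > 0 and b > 0 and seq[a - 1] == rc[b - 1]:
                a -= 1
                b -= 1
            length = (i - a) + k
            # Extend the seed to the right.
            while (a + length < n and b + length < n
                   and seq[a + length] == rc[b + length]):
                length += 1
            # Map the reverse-complement hit back to forward coordinates.
            start_b = n - b - length
            if a + length <= start_b and (best is None or length > best[2]):
                best = (a, start_b, length)
    return best


if __name__ == "__main__":
    ir = "GATTACAGGCTCAA"  # toy repeat unit
    toy = "TTTTTTTTTT" + ir + "CCCCCCCC" + reverse_complement(ir)
    print(find_inverted_repeat(toy, k=6))
```

Once the two IR copies are located, the segments before, between, and after them correspond to the single-copy regions, which is the information needed to reorder a draft into a fixed LSC–IRb–SSC–IRa layout.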
Affiliation(s)
- Angelo D Armijos Carrion
  - Biodiversity Genomics Team, Plant Ecophysiology & Evolution Group, Guangxi Key Laboratory of Forest Ecology and Conservation, College of Forestry, Guangxi University, Nanning, Guangxi, PR China
- Damien D Hinsinger
  - Biodiversity Genomics Team, Plant Ecophysiology & Evolution Group, Guangxi Key Laboratory of Forest Ecology and Conservation, College of Forestry, Guangxi University, Nanning, Guangxi, PR China
  - Alliance for Conservation Tree Genomics, Pha Tad Ke Botanical Garden, Luang Prabang, Laos
- Joeri S Strijk
  - Biodiversity Genomics Team, Plant Ecophysiology & Evolution Group, Guangxi Key Laboratory of Forest Ecology and Conservation, College of Forestry, Guangxi University, Nanning, Guangxi, PR China
  - Alliance for Conservation Tree Genomics, Pha Tad Ke Botanical Garden, Luang Prabang, Laos
  - State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, Guangxi, PR China
|
66
|
Scheunert A, Dorfner M, Lingl T, Oberprieler C. Can we use it? On the utility of de novo and reference-based assembly of Nanopore data for plant plastome sequencing. PLoS One 2020; 15:e0226234. [PMID: 32208422 PMCID: PMC7092973 DOI: 10.1371/journal.pone.0226234] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Received: 11/20/2019] [Accepted: 02/28/2020] [Indexed: 12/13/2022] Open
Abstract
The chloroplast genome harbors plenty of valuable information for phylogenetic research. Illumina short-read data is generally used for de novo assembly of whole plastomes. PacBio or Oxford Nanopore long reads are additionally employed in hybrid approaches to enable assembly across the highly similar inverted repeats of a chloroplast genome. Unlike for PacBio, plastome assemblies based solely on Nanopore reads are rarely found, due to their high error rate and non-random error profile. However, the actual quality decline connected to their use has rarely been quantified. Furthermore, no study has employed reference-based assembly using Nanopore reads, which is common with Illumina data. Using Leucanthemum Mill. as an example, we compared the sequence quality of seven chloroplast genome assemblies of the same species, using combinations of two sequencing platforms and three analysis pipelines. In addition, we assessed the factors which might influence Nanopore assembly quality during sequence generation and bioinformatic processing. The consensus sequence derived from de novo assembly of Nanopore data had a sequence identity of 99.59% compared to Illumina short-read de novo assembly. Most of the errors detected were indels (81.5%), and a large majority of them are part of homopolymer regions. The quality of reference-based assembly is heavily dependent upon the choice of a close-enough reference. When using a reference with 0.83% sequence divergence from the studied species, mapping of Nanopore reads results in a consensus comparable to that from Nanopore de novo assembly, and of only slightly inferior quality compared to a reference-based assembly with Illumina data. For optimal de novo assembly of Nanopore data, appropriate filtering of contaminants and chimeric sequences, as well as employing moderate read coverage, is essential. 
Based on these results, we conclude that Nanopore long reads are a suitable alternative to Illumina short reads in plastome phylogenomics. Few errors remain in the finalized assembly, which can be easily masked in phylogenetic analyses without loss in analytical accuracy. The easily applicable and cost-effective technology might warrant more attention by researchers dealing with plant chloroplast genomes.
Affiliation(s)
- Agnes Scheunert
  - Evolutionary and Systematic Botany Group, Institute of Plant Sciences, University of Regensburg, Regensburg, Germany
- Marco Dorfner
  - Evolutionary and Systematic Botany Group, Institute of Plant Sciences, University of Regensburg, Regensburg, Germany
- Thomas Lingl
  - Evolutionary and Systematic Botany Group, Institute of Plant Sciences, University of Regensburg, Regensburg, Germany
- Christoph Oberprieler
  - Evolutionary and Systematic Botany Group, Institute of Plant Sciences, University of Regensburg, Regensburg, Germany
|
67
|
Siniauskaya MG, Makarevich AM, Goloenko IM, Pankratov VS, Liaudanski AD, Danilenko NG, Lukhanina NV, Shimkevich AM, Davydenko OG. The study of organelle DNA variability in alloplasmic barley lines in the NGS era. Vavilovskii Zhurnal Genet Selektsii 2020; 24:12-19. [PMID: 33659776 PMCID: PMC7716555 DOI: 10.18699/vj19.589] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 11/19/2022] Open
Abstract
Alloplasmic lines are a suitable model for studying molecular coevolution and interrelations between genetic systems of plant cells. Whole chloroplast (cp) and mitochondrial (mt) genome sequences were obtained with the MiSeq System (Illumina). Organelle DNA samples were prepared from a set of 12 alloplasmic barley lines with different cytoplasms of Hordeum vulgare ssp. spontaneum and H. vulgare ssp. vulgare, as well as from their paternal varieties. A bioinformatic approach for analysis of NGS data obtained on an organellar DNA mix has been developed and verified. A comparative study of the variability of Hordeum organelle genomes and the distribution of polymorphic loci was conducted. Eight types of chloroplast DNA and 5 types of mitochondrial DNA were distinguished for the barley sample set examined. These results were compared with the previous data of a restriction fragment length polymorphism (RFLP) study of organelle DNAs for the same material. Formerly established data about a field evaluation of alloplasmic barley lines were revised in the light of information about organelle genomes gained after NGS. In total, 17 polymorphic loci were found in exons of chloroplast genomes. Seven of the SNPs were located in the genes of the Ndh complex. Nonsynonymous nucleotide changes were detected in the matK, rpoC1, ndhK, ndhG and infA genes. Some of the SNPs detected are very similar in codon position and in the type of amino acid substitution to the places where RNA editing can occur. Thus, these results outline new perspectives for the future study of nuclear-cytoplasmic interactions in alloplasmic lines.
Affiliation(s)
- M G Siniauskaya
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- A M Makarevich
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- I M Goloenko
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- V S Pankratov
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- A D Liaudanski
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- N G Danilenko
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- N V Lukhanina
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- A M Shimkevich
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
- O G Davydenko
  - Institute of Genetics and Cytology of the National Academy of Sciences of Belarus, Minsk, Belarus
|
68
|
Zheng S, Poczai P, Hyvönen J, Tang J, Amiryousefi A. Chloroplot: An Online Program for the Versatile Plotting of Organelle Genomes. Front Genet 2020. [PMID: 33101394 DOI: 10.3389/fgene.576124] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 05/02/2023] Open
Abstract
Understanding the complexity of genomic structures and their unique architecture is linked with the power of visualization tools used to represent these features. Such tools should be able to provide a realistic and scalable version of genomic content. Here, we present an online organelle plotting tool focused on chloroplasts, which was developed to visualize the exclusive structure of these genomes. The distinguishing features of this program include its ability to represent the Small Single Copy (SSC) region in reverse complement, the depiction of the codon usage bias index for each gene, the display of minor mismatches between inverted repeat (IR) regions, and user-specified plotting layers. The versatile color schemes and diverse functionalities of the program are specifically designed to reflect the accurate scalable representation of the plastid genomes. We introduce a Shiny app website for easy use of the program; a more advanced application of the tool is possible by further development and modification of the downloadable source code provided online. The software and its libraries are completely coded in R, available at https://irscope.shinyapps.io/chloroplot/.
Affiliation(s)
- Shuyu Zheng
  - Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Peter Poczai
  - Finnish Museum of Natural History (Botany), University of Helsinki, Helsinki, Finland
  - Department of Biosciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
- Jaakko Hyvönen
  - Finnish Museum of Natural History (Botany), University of Helsinki, Helsinki, Finland
  - Department of Biosciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
- Jing Tang
  - Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Ali Amiryousefi
  - Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
|
69
|
Zhang Z, Zhang Y, Song M, Guan Y, Ma X. Species Identification of Dracaena Using the Complete Chloroplast Genome as a Super-Barcode. Front Pharmacol 2019; 10:1441. [PMID: 31849682 PMCID: PMC6901964 DOI: 10.3389/fphar.2019.01441] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Received: 07/27/2019] [Accepted: 11/12/2019] [Indexed: 01/04/2023] Open
Abstract
The taxonomy and nomenclature of Dracaena plants are much disputed, particularly for several Dracaena species in Asia. However, neither morphological features nor common DNA regions are ideal for identification of Dracaena spp. Meanwhile, although multiple Dracaena spp. are sources of the rare traditional medicine dragon's blood, the Pharmacopoeia of the People's Republic of China has defined Dracaena cochinchinensis as the only source plant. The inaccurate identification of Dracaena spp. will inevitably affect the clinical efficacy of dragon's blood. It is therefore important to find a better method to distinguish these species. Here, we report the complete chloroplast (CP) genomes of six Dracaena spp., D. cochinchinensis, D. cambodiana, D. angustifolia, D. terniflora, D. hokouensis, and D. elliptica, obtained through high-throughput Illumina sequencing. These CP genomes exhibited typical circular tetramerous structure, and their sizes ranged from 155,055 (D. elliptica) to 155,449 bp (D. cochinchinensis). The GC content of each CP genome was 37.5%. Furthermore, each CP genome contained 130 genes, including 84 protein-coding genes, 38 tRNA genes, and 8 rRNA genes. There were no potential coding or non-coding regions to distinguish these six species, but the maximum likelihood tree of the six Dracaena spp. and other related species revealed that the whole CP genome can be used as a super-barcode to identify these Dracaena spp. This study provides not only invaluable data for species identification and safe medical application of Dracaena but also an important reference and foundation for species identification and phylogeny of Liliaceae plants.
Affiliation(s)
- Zhonglian Zhang
  - Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
  - Yunnan Branch of Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Jinghong, China
- Yue Zhang
  - Yunnan Branch of Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Jinghong, China
- Meifang Song
  - Yunnan Branch of Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Jinghong, China
- Yanhong Guan
  - Yunnan Branch of Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Jinghong, China
- Xiaojun Ma
  - Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
|
70
|
Fu YB, Li P, Biligetu B. Developing Chloroplast Genomic Resources from 25 Avena Species for the Characterization of Oat Wild Relative Germplasm. PLANTS (BASEL, SWITZERLAND) 2019; 8:E438. [PMID: 31652703 PMCID: PMC6918232 DOI: 10.3390/plants8110438] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 09/04/2019] [Revised: 10/13/2019] [Accepted: 10/21/2019] [Indexed: 02/03/2023]
Abstract
Chloroplast (cp) genomics will play an important role in the characterization of crop wild relative germplasm conserved in worldwide gene banks, thanks to the advances in genome sequencing. We applied a multiplexed shotgun sequencing procedure to sequence the cp genomes of 25 Avena species with variable ploidy levels. Bioinformatics analysis of the acquired sequences generated 25 de novo genome assemblies ranging from 135,557 to 136,006 bp. The gene annotations revealed 130 genes and their duplications, along with four to six pseudogenes, for each genome. Few differences in genome structure and gene arrangement were observed across the 25 species. Polymorphism analyses identified 1313 polymorphic sites and revealed an average of 277 microsatellites per genome. Greater nucleotide diversity was observed in the small single-copy region. Genome-wide scanning of selection signals suggested that six cp genes were under positive selection on some amino acids. These research outputs allow for a better understanding of oat cp genomes and evolution, and they form an essential set of cp genomic resources for the studies of oat evolutionary biology and for oat wild relative germplasm characterization.
Affiliation(s)
- Yong-Bi Fu
  - Plant Gene Resources of Canada, Saskatoon Research and Development Centre, Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, SK S7N 0X2, Canada
- Pingchuan Li
  - Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8, Canada
- Bill Biligetu
  - Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8, Canada
|
71
|
Malakasi P, Bellot S, Dee R, Grace OM. Museomics Clarifies the Classification of Aloidendron (Asphodelaceae), the Iconic African Tree Aloes. FRONTIERS IN PLANT SCIENCE 2019; 10:1227. [PMID: 31681358 PMCID: PMC6803536 DOI: 10.3389/fpls.2019.01227] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 06/20/2019] [Accepted: 09/04/2019] [Indexed: 05/24/2023]
Abstract
Arborescent succulent plants are regarded as keystone and indicator species in desert ecosystems due to their large stature and long lifespans. Tree aloes, the genus Aloidendron, are icons of the southern African deserts yet have proved elusive subjects due to the difficulty of obtaining material of known provenance for comparative study. Consequently, evolutionary relationships among representatives of the unusual arborescent life form have remained unclear until now. We used a museomics approach to overcome this challenge. Chloroplast genomes of six Aloidendron species and 12 other members of Asphodelaceae were sequenced from modern living collections and herbarium specimens, including the type specimens of all but two Aloidendron species, the earliest of which was collected 130 years ago. Maximum-likelihood trees estimated from full chloroplast genomes and the nuclear internal transcribed spacer (ITS) region show that Aloidendron sabaeum, from the Arabian Peninsula, is nested within Aloe while the Madagascar endemic Aloestrela suzannae is most closely related to the Somalian Aloidendron eminens. We observed phylogenetic conflicts between the plastid and nuclear topologies, which may be indicative of recurrent hybridisation or incomplete lineage sorting events in Aloe and in Aloidendron. Comparing species ecology in the context provided by our phylogeny suggests that habitat preference to either xeric deserts or humid forests/thickets evolved repeatedly in Aloidendron. Our findings demonstrate the value of botanical collections for the study and classification of taxonomically challenging succulent plants.
|
72
|
The complete chloroplast genome of Stryphnodendron adstringens (Leguminosae - Caesalpinioideae): comparative analysis with related Mimosoid species. Sci Rep 2019; 9:14206. [PMID: 31578450 PMCID: PMC6775074 DOI: 10.1038/s41598-019-50620-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Received: 02/14/2019] [Accepted: 08/14/2019] [Indexed: 01/26/2023] Open
Abstract
Stryphnodendron adstringens is a medicinal plant belonging to the Leguminosae family, and it is commonly found in the southeastern savannas, endemic to the Cerrado biome. The goal of this study was to assemble and annotate the chloroplast genome of S. adstringens and to compare it with previously known genomes of the mimosoid clade within Leguminosae. The chloroplast genome was reconstructed using de novo and reference-based assembly of paired-end reads generated by shotgun sequencing of total genomic DNA. The size of the S. adstringens chloroplast genome was 162,169 bp. This genome included a large single-copy (LSC) region of 91,045 bp, a small single-copy (SSC) region of 19,014 bp and a pair of inverted repeats (IRa and IRb) of 26,055 bp each. The S. adstringens chloroplast genome contains a total of 111 functional genes, including 77 protein-coding genes, 30 transfer RNA genes, and 4 ribosomal RNA genes. A total of 137 SSRs and 42 repeat structures were identified in the S. adstringens chloroplast genome, with the highest proportion in the LSC region. A comparison of the S. adstringens chloroplast genome with those from other mimosoid species indicated that gene content and synteny are highly conserved in the clade. The phylogenetic reconstruction using 73 conserved protein-coding genes from 19 Leguminosae species was supported to be paraphyletic. Furthermore, the noncoding and coding regions with high nucleotide diversity may supply valuable markers for molecular evolutionary and phylogenetic studies at different taxonomic levels in this group.
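The region sizes reported in this abstract can be checked against the stated total, since a quadripartite plastome consists of the LSC, the SSC, and two IR copies. The short Python sketch below does that arithmetic; the function names are illustrative and not taken from the study.

```python
from typing import Dict


def plastome_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


def region_fractions(lsc: int, ssc: int, ir: int) -> Dict[str, float]:
    """Share of the genome occupied by each region (both IR copies combined)."""
    total = plastome_length(lsc, ssc, ir)
    return {"LSC": lsc / total, "SSC": ssc / total, "IR": 2 * ir / total}


if __name__ == "__main__":
    # Region sizes as reported for S. adstringens in the abstract above.
    lsc, ssc, ir = 91_045, 19_014, 26_055
    print(plastome_length(lsc, ssc, ir))  # 162169, matching the reported genome size
    for region, share in region_fractions(lsc, ssc, ir).items():
        print(f"{region}: {share:.1%}")
```

The same check is a quick sanity test for any reported plastome: if LSC + SSC + 2 × IR does not equal the stated genome size, one of the figures is likely misreported.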
|
73
|
Morales‐Briones DF, Arias T, Di Stilio VS, Tank DC. Chloroplast primers for clade-wide phylogenetic studies of Thalictrum. APPLICATIONS IN PLANT SCIENCES 2019; 7:e11294. [PMID: 31667022 PMCID: PMC6814179 DOI: 10.1002/aps3.11294] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2019] [Accepted: 08/20/2019] [Indexed: 06/02/2023]
Abstract
PREMISE Chloroplast primers were developed for phylogenetic and comparative studies in Thalictrum (Ranunculaceae). METHODS AND RESULTS We assembled and annotated the complete plastome sequence of T. thalictroides by combining multiple whole genome sequencing libraries. Using transcriptome-sequencing libraries, we also assembled a partial plastome of the related species T. hernandezii. From the newly assembled plastomes and one previously sequenced plastome, we designed and validated 28 primer pairs to target variable portions of the chloroplast genome in Thalictrum. Furthermore, we tested the validated primers in 62 species of Thalictrum. The total alignment length of the 28 regions was 15,268 bp with 2443 variable sites and 92% character occupancy. CONCLUSIONS The newly developed chloroplast primer pairs improve the phylogenetic resolution (bootstrap support and tree certainty) in Thalictrum and will be a useful resource for future phylogenetic and evolutionary studies for species in the genus and in close relatives in Thalictroideae.
Affiliation(s)
- Diego F. Morales‐Briones
- Department of Biological SciencesUniversity of Idaho875 Perimeter Dr. MS 3051MoscowIdaho83844‐3051USA
- Stillinger HerbariumUniversity of Idaho875 Perimeter Dr. MS 3026MoscowIdaho83844-3026USA
- Institute for Bioinformatics and Evolutionary Studies (IBEST)University of Idaho875 Perimeter Dr. MS 3051MoscowIdaho83844‐3051USA
- Present address:
Department of Plant and Microbial BiologyUniversity of Minnesota1479 Gortner AvenueSaint PaulMinnesota55108‐1095USA
| | - Tatiana Arias
- School of Biological SciencesThe University of Hong KongPokfulam RoadHong KongHong Kong
- Corporación para Investigaciones BiológicasCra. 72 A No. 78 B 141MedellínColombia
- Department of BiologyUniversity of WashingtonBox 351800SeattleWashington98195‐1800USA
| | - Verónica S. Di Stilio
- Department of BiologyUniversity of WashingtonBox 351800SeattleWashington98195‐1800USA
| | - David C. Tank
- Department of Biological SciencesUniversity of Idaho875 Perimeter Dr. MS 3051MoscowIdaho83844‐3051USA
- Stillinger HerbariumUniversity of Idaho875 Perimeter Dr. MS 3026MoscowIdaho83844-3026USA
- Institute for Bioinformatics and Evolutionary Studies (IBEST)University of Idaho875 Perimeter Dr. MS 3051MoscowIdaho83844‐3051USA
| |
|
74
|
Trevisan B, Alcantara DM, Machado DJ, Marques FP, Lahr DJ. Genome skimming is a low-cost and robust strategy to assemble complete mitochondrial genomes from ethanol preserved specimens in biodiversity studies. PeerJ 2019; 7:e7543. [PMID: 31565556 PMCID: PMC6746217 DOI: 10.7717/peerj.7543] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Accepted: 07/24/2019] [Indexed: 12/17/2022] Open
Abstract
Global loss of biodiversity is an ongoing process that concerns both local and global authorities. Studies of biodiversity mainly involve traditional methods using morphological characters and molecular protocols. However, conventional methods are time-consuming and resource-demanding. The development of high-throughput sequencing (HTS) techniques has reshaped the way we explore biodiversity and opened a path to new questions and novel empirical approaches. With the emergence of HTS, sequencing the complete mitochondrial genome became more accessible, and the number of genome sequences published has increased exponentially during the last decades. Despite the current state of knowledge about the potential of mitogenomics in phylogenetics, this is still a relatively under-explored area for a multitude of taxonomic groups, especially for those without commercial relevance, non-model organisms and those with preserved DNA. Here we take the first step to assemble and annotate the genomes from HTS data using a new protocol of genome skimming which will offer an opportunity to extend the field of mitogenomics to under-studied organisms. We extracted genomic DNA from specimens preserved in ethanol. We used Nextera XT DNA to prepare indexed paired-end libraries since it is a powerful tool for working with diverse samples, requiring a low amount of input DNA. We sequenced the samples on two different Illumina platforms (MiSeq or NextSeq 550). We trimmed and filtered the raw reads and tested their quality accordingly. We performed the assembly using a baiting and iterative mapping strategy, and then annotated the putative mitochondrion through a semi-automatic procedure. We applied the contiguity index to assess the completeness of each new mitogenome. Our results reveal the efficiency of the proposed method to recover the whole mitogenomes of preserved DNA from non-model organisms even if there are gene rearrangements in the specimens. Our findings suggest the potential of combining the adequate platform and library with genome skimming as an innovative approach, which opens a new range of possibilities for its use to obtain molecular data from organisms with different levels of preservation.
Affiliation(s)
- Bruna Trevisan
- Department of Zoology, Institute of Biosciences, University of São Paulo, São Paulo, São Paulo, Brazil
| | - Daniel M.C. Alcantara
- Department of Zoology, Institute of Biosciences, University of São Paulo, São Paulo, São Paulo, Brazil
| | - Denis Jacob Machado
- Department of Zoology, Institute of Biosciences, University of São Paulo, São Paulo, São Paulo, Brazil
- Department of Bioinformatics and Genomics / College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, NC, United States of America
| | - Fernando P.L. Marques
- Department of Zoology, Institute of Biosciences, University of São Paulo, São Paulo, São Paulo, Brazil
| | - Daniel J.G. Lahr
- Department of Zoology, Institute of Biosciences, University of São Paulo, São Paulo, São Paulo, Brazil
| |
|
75
|
Song E, Park S, Kim S. Primers for complete chloroplast genome sequencing in Magnolia. APPLICATIONS IN PLANT SCIENCES 2019; 7:e11286. [PMID: 31572627 PMCID: PMC6764489 DOI: 10.1002/aps3.11286] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2018] [Accepted: 05/28/2019] [Indexed: 06/10/2023]
Abstract
PREMISE A new set of primers was developed for sequencing of whole chloroplast genomes of Magnolia species and gap-filling of unfinished genomes. METHODS AND RESULTS Two hundred and fifty primers were newly designed based on two previously reported chloroplast genomes from two different genera in Magnoliaceae. A total of 134 primer pairs, including the ones developed in this study and 18 previously reported ones, were enough to cover the entire chloroplast genome sequences in Magnoliaceae. Four species from different sections of Magnolia (M. dealbata, M. fraseri var. pyramidata, M. liliiflora, and M. odora) were used to show the general application of these primers to chloroplast genome sequencing in Magnolia. CONCLUSIONS Using the developed primers, four Magnolia chloroplast genomes were successfully assembled. These results show the utility of these primers across Magnolia and their potential use for phylogenetic studies, DNA barcoding, and population genetics in this group.
Affiliation(s)
- Eunji Song
- Department of BiologySungshin UniversitySeoul01133Korea
| | - Suhyeon Park
- Department of BiologySungshin UniversitySeoul01133Korea
| | - Sangtae Kim
- Department of BiologySungshin UniversitySeoul01133Korea
| |
|
76
|
Genome Comparison Reveals Mutation Hotspots in the Chloroplast Genome and Phylogenetic Relationships of Ormosia Species. BIOMED RESEARCH INTERNATIONAL 2019; 2019:7265030. [PMID: 31531364 PMCID: PMC6720362 DOI: 10.1155/2019/7265030] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2019] [Revised: 07/13/2019] [Accepted: 07/22/2019] [Indexed: 12/04/2022]
Abstract
The papilionoid legume genus Ormosia comprises approximately 130 species, which are distributed mostly in the Neotropics, with some species in eastern Asia and northeastern Australia. The taxonomy and evolutionary history remain unclear due to the lack of a robust species-level phylogeny. Chloroplast genomes can provide important information for phylogenetic and population genetic studies. In this study, we determined the complete chloroplast genome sequences of five Ormosia species by Illumina sequencing. The Ormosia chloroplast genomes displayed the typical quadripartite structure of angiosperms, which consisted of a pair of inverted regions separated by a large single-copy region and a small single-copy region. The location and distribution of repeat sequences and microsatellites were determined. Comparative analyses highlighted a wide spectrum of variation, with trnK-rbcL, atpE-trnS-rps4, trnC-petN, trnS-psbZ-trnG, trnP-psaJ-rpl33, and clpP intron being the most variable regions. Phylogenetic analysis revealed that Ormosia is in the Papilionoideae clade and is sister to the Lupinus clade. Overall, this study, which provides Ormosia chloroplast genomic resources and a comparative analysis of Ormosia chloroplast genomes, will be beneficial for the evolutionary study and phylogenetic reconstruction of the genus Ormosia and molecular barcoding in population genetics and will provide insight into the chloroplast genome evolution of legumes.
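Microsatellite (SSR) detection of the kind mentioned in this abstract is often done by scanning a sequence for perfect tandem repeats. The sketch below is a minimal illustration using a regular expression. The minimum repeat counts in `MIN_REPEATS` and the function name `find_ssrs` are assumptions chosen for the example, not the thresholds or tools used in the study.

```python
import re

# Hypothetical minimum repeat counts per motif length (assumed, not from the study).
MIN_REPEATS = {1: 10, 2: 5, 3: 4}


def find_ssrs(seq: str, min_repeats: dict[int, int] = MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect tandem repeats."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in sorted(min_repeats.items()):
        # A motif of motif_len bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip homopolymer motifs such as "AA"; the mononucleotide pass covers
            # them. This check is sufficient for motif lengths 1 to 3.
            if motif_len > 1 and len(set(motif)) == 1:
                continue
            hits.append((m.start(), motif, len(m.group(0)) // motif_len))
    return sorted(hits)


# Toy sequence: a run of 10 A's and five AT repeats.
toy = "GGC" + "A" * 10 + "CGC" + "AT" * 5 + "GCC"
print(find_ssrs(toy))  # [(3, 'A', 10), (16, 'AT', 5)]
```

Real plastome analyses typically also handle longer motifs and compound repeats, which this sketch leaves out.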
|
77
|
Lencina F, Landau AM, Petterson ME, Pacheco MG, Kobayashi K, Prina AR. The rpl23 gene and pseudogene are hotspots of illegitimate recombination in barley chloroplast mutator seedlings. Sci Rep 2019; 9:9960. [PMID: 31292475 PMCID: PMC6620283 DOI: 10.1038/s41598-019-46321-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Accepted: 06/26/2019] [Indexed: 11/23/2022] Open
Abstract
Previously, a TILLING (Targeting Induced Local Lesions in Genomes) approach applied to barley chloroplast mutator (cpm) seedlings detected a high frequency of polymorphisms in the rpl23 gene. All the polymorphisms corresponded to five differences already known to exist in nature between the rpl23 gene located in the inverted repeats (IRs) and the rpl23 pseudogene located in the large single copy region (LSC). In this investigation, polymorphisms in the rpl23 gene were verified, and a similar situation was found for the pseudogene in cpm seedlings. On the other hand, no polymorphisms were found in any of those loci in 40 wild type barley seedlings. Those facts and the independent occurrence of polymorphisms in the gene and pseudogene in individual seedlings suggest that the detected polymorphisms initially arose from gene conversion between gene and pseudogene. Moreover, an additional recombination process involving small recombinant segments seems to occur between the two gene copies as a consequence of their location in the IRs. These and previous results support the hypothesis that the CPM protein is a component of the plastome mismatch repair (MMR) system, whose failure of the anti-recombination activity results in increased illegitimate recombination between the rpl23 gene and pseudogene.
Affiliation(s)
- F Lencina
- Instituto de Genética "Ewald A. Favret", CICVyA (Centro de Investigación en Ciencias Veterinarias y Agronómicas), INTA (Instituto Nacional de Tecnología Agropecuaria), Nicolás Repetto y de los Reseros s/n (1686), Hurlingham, Buenos Aires, Argentina
| | - A M Landau
- Instituto de Genética "Ewald A. Favret", CICVyA (Centro de Investigación en Ciencias Veterinarias y Agronómicas), INTA (Instituto Nacional de Tecnología Agropecuaria), Nicolás Repetto y de los Reseros s/n (1686), Hurlingham, Buenos Aires, Argentina
| | - M E Petterson
- Instituto de Genética "Ewald A. Favret", CICVyA (Centro de Investigación en Ciencias Veterinarias y Agronómicas), INTA (Instituto Nacional de Tecnología Agropecuaria), Nicolás Repetto y de los Reseros s/n (1686), Hurlingham, Buenos Aires, Argentina
| | - M G Pacheco
- Instituto de Genética "Ewald A. Favret", CICVyA (Centro de Investigación en Ciencias Veterinarias y Agronómicas), INTA (Instituto Nacional de Tecnología Agropecuaria), Nicolás Repetto y de los Reseros s/n (1686), Hurlingham, Buenos Aires, Argentina
| | - K Kobayashi
- Laboratorio de Agrobiotecnología, Grupo Biología Molecular Vegetal Aplicada, Instituto de Biodiversidad y Biología Experimental y Aplicada (IBBEA, CONICET-UBA), Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, UBA, Buenos Aires, Argentina
| | - A R Prina
- Instituto de Genética "Ewald A. Favret", CICVyA (Centro de Investigación en Ciencias Veterinarias y Agronómicas), INTA (Instituto Nacional de Tecnología Agropecuaria), Nicolás Repetto y de los Reseros s/n (1686), Hurlingham, Buenos Aires, Argentina.
| |
|
78
|
Jin FY, Y X, Xie DF, Li H, Yu Y, Zhou SD, He XJ. Comparative Complete Chloroplast Genome Analyses and Contribution to the Understanding of Chloroplast Phylogeny and Adaptive Evolution in Subgenus Anguinum. RUSS J GENET+ 2019. [DOI: 10.1134/s1022795419070081] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
79
|
The Complete Chloroplast Genomes of Punica granatum and a Comparison with Other Species in Lythraceae. Int J Mol Sci 2019; 20:ijms20122886. [PMID: 31200508 PMCID: PMC6627765 DOI: 10.3390/ijms20122886] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2019] [Revised: 06/02/2019] [Accepted: 06/02/2019] [Indexed: 02/06/2023] Open
Abstract
Pomegranates (Punica granatum L.) are one of the most popular fruit trees cultivated in arid and semi-arid tropics and subtropics. In this study, we determined and characterized three complete chloroplast (cp) genomes of P. granatum cultivars with different phenotypes using the genome skimming approach. The complete cp genomes of three pomegranate cultivars displayed the typical quadripartite structure of angiosperms, and their length ranged from 156,638 to 156,639 bp. They encoded 113 unique genes, of which 17 are duplicated in the inverted regions. We analyzed the sequence diversity of pomegranate cp genomes coupled with two previous reports. The results showed that the sequence diversity is extremely low and no informative sites were detected, which suggests that cp genome sequences may not be suitable for investigating the genetic diversity of pomegranate genotypes. Further, we analyzed the codon usage pattern and identified the potential RNA editing sites. A comparative cp genome analysis with other species within Lythraceae revealed that the gene content and organization are highly conserved. Based on a site-specific model, 11 genes with positively selected sites were detected, and most of them were photosynthesis-related genes and genetic system-related genes. Together with previously released cp genomes of the order Myrtales, we determined the taxonomic position of P. granatum based on the complete chloroplast genomes. Phylogenetic analysis suggested that P. granatum forms a single clade with other species from Lythraceae with a high support value. The complete cp genomes provide valuable information for understanding the phylogenetic position of P. granatum in the order Myrtales.
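The statement that no informative sites were detected can be made concrete with the usual definition of a parsimony-informative site: an alignment column in which at least two character states each occur in at least two sequences. The sketch below is a toy illustration under that definition. The function name and the example alignment are hypothetical, and the abstract does not say which software or criterion the authors used.

```python
from collections import Counter


def parsimony_informative_sites(alignment: list[str]) -> list[int]:
    """Return 0-based columns where at least two states each occur in >= 2 sequences."""
    if len({len(s) for s in alignment}) > 1:
        raise ValueError("sequences must be aligned to the same length")
    informative = []
    for col, column in enumerate(zip(*alignment)):
        # Count only unambiguous bases; gaps and ambiguity codes are ignored.
        counts = Counter(base for base in column if base in "ACGT")
        if sum(1 for n in counts.values() if n >= 2) >= 2:
            informative.append(col)
    return informative


# Toy alignment: column 1 is informative (C,C,T,T); column 4 has only a singleton.
toy = ["ACGTA", "ACGTA", "ATGTC", "ATGTA"]
print(parsimony_informative_sites(toy))  # [1]
```

An alignment of nearly identical cultivar plastomes would return an empty list here, which is the situation the abstract describes.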
|
80
|
Romeiras MM, Pena AR, Menezes T, Vasconcelos R, Monteiro F, Paulo OS, Moura M. Shortcomings of Phylogenetic Studies on Recent Radiated Insular Groups: A Meta-Analysis Using Cabo Verde Biodiversity. Int J Mol Sci 2019; 20:E2782. [PMID: 31174340 PMCID: PMC6600550 DOI: 10.3390/ijms20112782] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 05/16/2019] [Accepted: 06/04/2019] [Indexed: 12/22/2022] Open
Abstract
Over the previous decades, numerous studies focused on how oceanic islands have contributed to determine the phylogenetic relationships and times of origin and diversification of different endemic lineages. The Macaronesian Islands (i.e., Azores, Madeira, Selvagens, Canaries, and Cabo Verde) harbour biotas with exceptionally high levels of endemism. Within the region, the vascular plants and reptiles constitute two of the most important radiations. In this study we compare relevant published phylogenetic data and diversification rates retrieved within Cabo Verde endemic lineages and discuss the importance of choosing appropriate phylogeny-based methods to investigate diversification dynamics on islands. From this selective literature-based review, we summarize the software packages used in Macaronesian studies and discuss their adequacy considering the published data to obtain well-supported phylogenies in the target groups. We further debate the importance of Next Generation Sequencing (NGS) to investigate the evolutionary processes of diversification in the Macaronesian Islands. Analysis of genomic data provides phylogenetic resolution for rapidly evolving species radiations, suggesting a great potential to improve the phylogenetic signal and divergence time estimates in insular lineages. The most important Macaronesian reptile radiations provide good case-studies to compare classical phylogenetic methods with new tools, such as phylogenomics, revealing a high value for research on this hotspot area.
Affiliation(s)
- Maria M Romeiras
- LEAF, Linking Landscape, Environment, Agriculture and Food, Instituto Superior de Agronomia, Universidade de Lisboa, 1349-017 Lisbon, Portugal.
- Centre for Ecology, Evolution and Environmental Changes (cE3c), Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisbon, Portugal.
| | - Ana Rita Pena
- Centre for Ecology, Evolution and Environmental Changes (cE3c), Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisbon, Portugal.
| | - Tiago Menezes
- CIBIO, Research Centre in Biodiversity and Genetic Resources, Azores Group, InBIO Associate Laboratory, Universidade dos Açores, 9501-855 Ponta Delgada, Azores, Portugal.
| | - Raquel Vasconcelos
- CIBIO, Research Centre in Biodiversity and Genetic Resources, InBIO Associate Laboratory, Universidade do Porto, 4485-661 Vairão, Portugal.
| | - Filipa Monteiro
- LEAF, Linking Landscape, Environment, Agriculture and Food, Instituto Superior de Agronomia, Universidade de Lisboa, 1349-017 Lisbon, Portugal.
- Centre for Ecology, Evolution and Environmental Changes (cE3c), Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisbon, Portugal.
| | - Octávio S Paulo
- Centre for Ecology, Evolution and Environmental Changes (cE3c), Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisbon, Portugal.
| | - Mónica Moura
- CIBIO, Research Centre in Biodiversity and Genetic Resources, Azores Group, InBIO Associate Laboratory, Universidade dos Açores, 9501-855 Ponta Delgada, Azores, Portugal.
| |
|
81
|
Bethune K, Mariac C, Couderc M, Scarcelli N, Santoni S, Ardisson M, Martin J, Montúfar R, Klein V, Sabot F, Vigouroux Y, Couvreur TLP. Long-fragment targeted capture for long-read sequencing of plastomes. APPLICATIONS IN PLANT SCIENCES 2019; 7:e1243. [PMID: 31139509 PMCID: PMC6526642 DOI: 10.1002/aps3.1243] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 03/21/2019] [Indexed: 05/09/2023]
Abstract
PREMISE Third-generation sequencing methods generate significantly longer reads than those produced using alternative sequencing methods. This provides increased possibilities for the study of biodiversity, phylogeography, and population genetics. We developed a protocol for in-solution enrichment hybridization capture of long DNA fragments applicable to complete plastid genomes. METHODS AND RESULTS The protocol uses cost-effective in-house probes developed via long-range PCR and was used in six non-model monocot species (Poaceae: African rice, pearl millet, fonio; and three palm species). DNA was extracted from fresh and silica gel-dried leaves. Our protocol successfully captured long-read plastome fragments (3151 bp median on average), with an enrichment rate ranging from 15% to 98%. DNA extracted from silica gel-dried leaves led to low-quality plastome assemblies when compared to DNA extracted from fresh tissue. CONCLUSIONS Our protocol could also be generalized to capture long sequences from specific nuclear fragments.
Affiliation(s)
| | | | | | | | - Sylvain Santoni
- UMR AGAP, Equipe Diversité et Adaptation de la Vigne et des Espèces MéditerranéennesINRA2 Place Viala34060MontpellierFrance
| | - Morgane Ardisson
- UMR AGAP, Equipe Diversité et Adaptation de la Vigne et des Espèces MéditerranéennesINRA2 Place Viala34060MontpellierFrance
| | | | - Rommel Montúfar
- Facultad de Ciencias Exactas y NaturalesPontificia Universidad Católica del EcuadorQuitoEcuador
| | | | | | | | | |
|
82
|
Sablok G, Amiryousefi A, He X, Hyvönen J, Poczai P. Sequencing the Plastid Genome of Giant Ragweed ( Ambrosia trifida, Asteraceae) From a Herbarium Specimen. FRONTIERS IN PLANT SCIENCE 2019; 10:218. [PMID: 30873197 PMCID: PMC6403193 DOI: 10.3389/fpls.2019.00218] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Accepted: 02/08/2019] [Indexed: 05/09/2023]
Abstract
We report the first plastome sequence of giant ragweed (Ambrosia trifida); with this new genome information, we assessed the phylogeny of Asteraceae and the transcriptional profiling against glyphosate resistance in giant ragweed. Assembly and genic features show a normal angiosperm quadripartite plastome structure with no signatures of deviation in gene directionality. Comparative analysis revealed large inversions across the plastome of giant ragweed and the previously sequenced members of the plant family. Asteraceae plastid genomes contain two inversions of 22.8 and 3.3 kb; the former is located between trnS-GCU and trnG-UCC genes, and the latter between trnE-UUC and trnT-GGU genes. The plastid genome sequences of A. trifida and the related species, Ambrosia artemisiifolia, are identical in gene content and arrangement, but they differ in length. The phylogeny is well-resolved and congruent with previous hypotheses about the phylogenetic relationship of Asteraceae. Transcriptomic analysis revealed divergence in the relative expressions at the exonic and intronic levels, providing hints toward the ecological adaptation of the genus. Giant ragweed shows various levels of glyphosate resistance, with introns displaying higher expression patterns at resistant time points after the assumed herbicide treatment.
Affiliation(s)
- Gaurav Sablok
- Finnish Museum of Natural History (Botany Unit), University of Helsinki, Helsinki, Finland
- Organismal Evolution and Biology, Faculty of Biology and Environmental Sciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Ali Amiryousefi
- Finnish Museum of Natural History (Botany Unit), University of Helsinki, Helsinki, Finland
- Organismal Evolution and Biology, Faculty of Biology and Environmental Sciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Xiaolan He
- Finnish Museum of Natural History (Botany Unit), University of Helsinki, Helsinki, Finland
- Organismal Evolution and Biology, Faculty of Biology and Environmental Sciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Jaakko Hyvönen
- Finnish Museum of Natural History (Botany Unit), University of Helsinki, Helsinki, Finland
- Organismal Evolution and Biology, Faculty of Biology and Environmental Sciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Péter Poczai
- Finnish Museum of Natural History (Botany Unit), University of Helsinki, Helsinki, Finland
- Organismal Evolution and Biology, Faculty of Biology and Environmental Sciences, Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| |
|
83
|
Yan M, Zhao X, Zhao Y, Ren Y, Yuan Z. The complete chloroplast genome sequence of pomegranate ‘Bhagwa’. Mitochondrial DNA B Resour 2019. [DOI: 10.1080/23802359.2019.1617047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
Affiliation(s)
- Ming Yan
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, China
- College of Forestry, Nanjing Forestry University, Nanjing, China
| | - Xueqing Zhao
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, China
- College of Forestry, Nanjing Forestry University, Nanjing, China
| | - Yujie Zhao
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, China
- College of Forestry, Nanjing Forestry University, Nanjing, China
| | - Yuan Ren
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, China
- College of Forestry, Nanjing Forestry University, Nanjing, China
| | - Zhaohe Yuan
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, China
- College of Forestry, Nanjing Forestry University, Nanjing, China
| |
|
84
|
Zhang K, Chen Z, Liu C. The complete plastid genome of marula ( Sclerocarya birrea). Mitochondrial DNA B Resour 2019. [DOI: 10.1080/23802359.2018.1547142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022] Open
Affiliation(s)
- Kexin Zhang
- Shenzhen Nanshan foreign language senior high school, Shenzhen, China
| | - Ziqiang Chen
- Plant Science and Technology, College of Chinese Medicine Materials, Jilin Agricultural University, Changchun, China
| | - Cuijing Liu
- Plant Science and Technology, College of Chinese Medicine Materials, Jilin Agricultural University, Changchun, China
| |
|
85
|
Comparative assessment shows the reliability of chloroplast genome assembly using RNA-seq. Sci Rep 2018; 8:17404. [PMID: 30479362 PMCID: PMC6258696 DOI: 10.1038/s41598-018-35654-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 11/09/2018] [Indexed: 11/08/2022] Open
Abstract
Chloroplast genomes (cp genomes) are widely used in comparative genomics, population genetics, and phylogenetic studies. Obtaining chloroplast genomes from RNA-Seq data seems feasible due to the almost full transcription of cpDNA. However, the reliability of chloroplast genomes assembled from RNA-Seq instead of genomic DNA libraries remains to be thoroughly verified. In this study, we assembled chloroplast genomes for three Erysimum (Brassicaceae) species from three RNA-Seq replicas and from one genomic library of each species, using a streamlined bioinformatics protocol. We compared these assembled genomes, confirming that assembled cp genomes from RNA-Seq data were highly similar to each other and to those from genomic libraries in terms of overall structure, size, and composition. Although post-transcriptional modifications, such as RNA-editing, may introduce variations in the RNA-seq data, the assembly of cp genomes from RNA-seq appeared to be reliable. Moreover, RNA-Seq assembly was less sensitive to sources of error such as the recovery of nuclear plastid DNAs (NUPTs). Although some precautions should be taken when producing reference genomes in non-model plants, we conclude that assembling cp genomes from RNA-Seq data is a fast, accurate, and reliable strategy.
|
86
|
D'Agostino N, Tamburino R, Cantarella C, De Carluccio V, Sannino L, Cozzolino S, Cardi T, Scotti N. The Complete Plastome Sequences of Eleven Capsicum Genotypes: Insights into DNA Variation and Molecular Evolution. Genes (Basel) 2018; 9:E503. [PMID: 30336638 PMCID: PMC6210379 DOI: 10.3390/genes9100503] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 10/11/2018] [Accepted: 10/11/2018] [Indexed: 11/16/2022] Open
Abstract
Members of the genus Capsicum are of great economic importance, including both wild forms and cultivars of peppers and chilies. The high number of potentially informative characteristics that can be identified through next-generation sequencing technologies gave a huge boost to evolutionary and comparative genomic research in higher plants. Here, we determined the complete nucleotide sequences of the plastomes of eight Capsicum species (eleven genotypes), representing the three main taxonomic groups in the genus and estimated molecular diversity. Comparative analyses highlighted a wide spectrum of variation, ranging from point mutations to small/medium size insertions/deletions (InDels), with accD, ndhB, rpl20, ycf1, and ycf2 being the most variable genes. The global pattern of sequence variation is consistent with the phylogenetic signal. Maximum-likelihood tree estimation revealed that Capsicum chacoense is sister to the baccatum complex. Divergence and positive selection analyses unveiled that protein-coding genes were generally well conserved, but we identified 25 positive signatures distributed in six genes involved in different essential plastid functions, suggesting positive selection during evolution of Capsicum plastomes. Finally, the identified sequence variation allowed us to develop simple PCR-based markers useful in future work to discriminate species belonging to different Capsicum complexes.
Affiliation(s)
- Nunzio D'Agostino
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
| | - Rachele Tamburino
- CNR-IBBR, National Research Council of Italy, Institute of Biosciences and BioResources, Via Università 133, 80055 Portici (NA), Italy.
| | - Concita Cantarella
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
| | - Valentina De Carluccio
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
- Department of Biology, University of Naples Federico II, Via Cinthia, 80126 Naples, Italy.
| | - Lorenza Sannino
- CNR-IBBR, National Research Council of Italy, Institute of Biosciences and BioResources, Via Università 133, 80055 Portici (NA), Italy.
| | - Salvatore Cozzolino
- Department of Biology, University of Naples Federico II, Via Cinthia, 80126 Naples, Italy.
| | - Teodoro Cardi
- CREA Research Centre for Vegetable and Ornamental Crops, Via dei Cavalleggeri 25, 84098 Pontecagnano Faiano (SA), Italy.
| | - Nunzia Scotti
- CNR-IBBR, National Research Council of Italy, Institute of Biosciences and BioResources, Via Università 133, 80055 Portici (NA), Italy.
|
87
|
Feranchuk S, Belkova N, Chernogor L, Potapova U, Belikov S. The signs of adaptive mutations identified in the chloroplast genome of the algae endosymbiont of Baikal sponge. F1000Res 2018; 7:1405. [PMID: 33224472 PMCID: PMC7670478 DOI: 10.12688/f1000research.15841.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 08/29/2018] [Indexed: 03/31/2024] Open
Abstract
Background: The study of ecosystems of the great lakes is important because observations can be extended to ecosystems of larger scale. The ecological crisis of Lake Baikal calls for investigation of the molecular mechanisms involved. The disease of Baikal sponges is one of the processes resulting in the degradation of the littoral zone of the lake. Methods: The chloroplast genome fragment of the algal endosymbiont of Baikal sponge was assembled from metagenomic sequencing data. The distributions of polymorphic sites were obtained for the genome fragment, separately for samples from healthy, diseased and dead sponge tissues. Results: The comparative analysis of chloroplast genome sequences suggests that the symbiotic alga from Baikal sponge is close to the genus Choricystis of unicellular algae. The distributions of polymorphic sites also allowed detection of signs of extensive mutations in the chloroplasts isolated from the diseased sponge tissues. Conclusions: The study demonstrates a particular case of evolution at the molecular level under the conditions of a severe crisis of a whole ecosystem in Lake Baikal. The detection of adaptive mutations in the chloroplast genome is an important feature that could represent the behavior of an ecosystem in the event of a severe crisis.
Affiliation(s)
- Sergey Feranchuk
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, Irkutsk, 664033, Russian Federation
- Department of Informatics, National Research Technical University, Irkutsk, 664074, Russian Federation
| | - Natalia Belkova
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, Irkutsk, 664033, Russian Federation
- Scientific Centre for Family Health and Human Reproduction Problems, Irkutsk, 664033, Russian Federation
| | - Lubov Chernogor
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, Irkutsk, 664033, Russian Federation
| | - Ulyana Potapova
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, Irkutsk, 664033, Russian Federation
| | - Sergei Belikov
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, Irkutsk, 664033, Russian Federation
|
88
|
Wariss HM, Yi TS, Wang H, Zhang R. The chloroplast genome of a rare and an endangered species Salweenia bouffordiana (Leguminosae) in China. CONSERV GENET RESOUR 2018. [DOI: 10.1007/s12686-017-0836-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 11/28/2022]
|
89
|
Gruenstaeudl M, Gerschler N, Borsch T. Bioinformatic Workflows for Generating Complete Plastid Genome Sequences-An Example from Cabomba (Cabombaceae) in the Context of the Phylogenomic Analysis of the Water-Lily Clade. Life (Basel) 2018; 8:E25. [PMID: 29933597 PMCID: PMC6160935 DOI: 10.3390/life8030025] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Received: 05/01/2018] [Revised: 06/11/2018] [Accepted: 06/19/2018] [Indexed: 12/13/2022] Open
Abstract
The sequencing and comparison of plastid genomes are becoming a standard method in plant genomics, and many researchers are using this approach to infer plant phylogenetic relationships. Due to the widespread availability of next-generation sequencing, plastid genome sequences are being generated at breakneck pace. This trend towards massive sequencing of plastid genomes highlights the need for standardized bioinformatic workflows. In particular, documentation and dissemination of the details of genome assembly, annotation, alignment and phylogenetic tree inference are needed, as these processes are highly sensitive to the choice of software and the precise settings used. Here, we present the procedure and results of sequencing, assembling, annotating and quality-checking of three complete plastid genomes of the aquatic plant genus Cabomba as well as subsequent gene alignment and phylogenetic tree inference. We accompany our findings by a detailed description of the bioinformatic workflow employed. Importantly, we share a total of eleven software scripts for each of these bioinformatic processes, enabling other researchers to evaluate and replicate our analyses step by step. The results of our analyses illustrate that the plastid genomes of Cabomba are highly conserved in both structure and gene content.
Affiliation(s)
- Michael Gruenstaeudl
- Institut für Biologie, Systematische Botanik und Pflanzengeographie, Freie Universität Berlin, 14195 Berlin, Germany.
| | - Nico Gerschler
- Institut für Biologie, Systematische Botanik und Pflanzengeographie, Freie Universität Berlin, 14195 Berlin, Germany.
| | - Thomas Borsch
- Institut für Biologie, Systematische Botanik und Pflanzengeographie, Freie Universität Berlin, 14195 Berlin, Germany.
- Botanischer Garten und Botanisches Museum Berlin, Freie Universität Berlin, 14195 Berlin, Germany.
- Berlin Center for Genomics in Biodiversity Research (BeGenDiv), 14195 Berlin, Germany.
|
90
|
Dean GH, Asmarayani R, Ardiyani M, Santika Y, Triono T, Mathews S, Webb CO. Generating DNA sequence data with limited resources for molecular biology: Lessons from a barcoding project in Indonesia. APPLICATIONS IN PLANT SCIENCES 2018; 6:e01167. [PMID: 30131909 PMCID: PMC6055555 DOI: 10.1002/aps3.1167] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 11/18/2017] [Accepted: 05/15/2018] [Indexed: 05/29/2023]
Abstract
The advent of the DNA sequencing age has led to a revolution in biology. The rapid and cost-effective generation of high-quality sequence data has transformed many fields, including those focused on discovering species and surveying biodiversity, monitoring movement of biological materials, forensic biology, and disease diagnostics. There is a need to build capacity to generate useful sequence data in countries with limited historical access to laboratory resources, so that researchers can benefit from the advantages offered by these data. Commonly used molecular techniques such as DNA extraction, PCR, and DNA sequencing are within the reach of small laboratories in many countries, with the main obstacles to successful implementation being lack of funding and limited practical experience. Here we describe a successful approach that we developed to obtain DNA sequence data during a small DNA barcoding project in Indonesia.
Affiliation(s)
- Gillian H. Dean
- Department of Botany, University of British Columbia, Vancouver, V6T 1Z4, British Columbia, Canada
| | - Rani Asmarayani
- Herbarium Bogoriense, Botany Division, Research Center for Biology, Indonesian Institute of Sciences (LIPI), Cibinong 16911, Bogor, West Java, Indonesia
- Present address: Department of Biology, University of Missouri–St. Louis, St. Louis, Missouri 63121, USA
| | - Marlina Ardiyani
- Herbarium Bogoriense, Botany Division, Research Center for Biology, Indonesian Institute of Sciences (LIPI), Cibinong 16911, Bogor, West Java, Indonesia
| | - Yessi Santika
- Herbarium Bogoriense, Botany Division, Research Center for Biology, Indonesian Institute of Sciences (LIPI), Cibinong 16911, Bogor, West Java, Indonesia
| | - Teguh Triono
- Herbarium Bogoriense, Botany Division, Research Center for Biology, Indonesian Institute of Sciences (LIPI), Cibinong 16911, Bogor, West Java, Indonesia
- Present address: Zoological Society of London (ZSL) Indonesia Program, Bogor 16128, Indonesia
| | - Sarah Mathews
- Arnold Arboretum of Harvard University, Boston, Massachusetts 02131, USA
- Present address: CSIRO, Australian National Herbarium, Canberra, Australian Capital Territory 2601, Australia
| | - Campbell O. Webb
- Arnold Arboretum of Harvard University, Boston, Massachusetts 02131, USA
- Present address: University of Alaska Museum of the North, Fairbanks, Alaska 99775, USA
|
91
|
Manzanilla V, Kool A, Nguyen Nhat L, Nong Van H, Le Thi Thu H, de Boer HJ. Phylogenomics and barcoding of Panax: toward the identification of ginseng species. BMC Evol Biol 2018; 18:44. [PMID: 29614961 PMCID: PMC5883351 DOI: 10.1186/s12862-018-1160-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Received: 06/26/2017] [Accepted: 03/21/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The economic value of ginseng in the global medicinal plant trade is estimated to be in excess of US$2.1 billion. At the same time, the evolutionary placement of ginseng (Panax ginseng) and the complex evolutionary history of the genus are poorly understood despite several molecular phylogenetic studies. In this study, we use a full plastome phylogenomic framework to resolve relationships in Panax and to identify molecular markers for species discrimination. RESULTS We used high-throughput sequencing of MBD2-Fc fractionated Panax DNA to supplement publicly available plastid genomes to create a phylogeny based on fully assembled and annotated plastid genomes from 60 accessions of 8 species. The plastome phylogeny based on a 163 kbp matrix resolves the sister relationship of Panax ginseng with P. quinquefolius. The closely related species P. vietnamensis is supported as sister to P. japonicus. The plastome matrix also shows that the markers trnC-rps16, trnS-trnG, and trnE-trnM could be used for unambiguous molecular identification of all the represented species in the genus. CONCLUSIONS MBD2 depletion reduces the cost of plastome sequencing, which makes it a cost-effective alternative to Sanger sequencing-based DNA barcoding for molecular identification. The plastome phylogeny provides a robust framework that can be used to study the evolution of morphological characters and biosynthesis pathways of ginsengosides for phylogenetic bioprospecting. Molecular identification of ginseng species is essential for authenticating ginseng in international trade, and it provides an incentive for manufacturers to create authentic products with verified ingredients.
Affiliation(s)
- V Manzanilla
- The Natural History Museum, University of Oslo, Oslo, Norway.
| | - A Kool
- The Natural History Museum, University of Oslo, Oslo, Norway
| | - L Nguyen Nhat
- Institute of Genome Research, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet, Cau Giay, Hanoi, Vietnam
| | - H Nong Van
- Institute of Genome Research, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet, Cau Giay, Hanoi, Vietnam
| | - H Le Thi Thu
- Institute of Genome Research, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet, Cau Giay, Hanoi, Vietnam
| | - H J de Boer
- The Natural History Museum, University of Oslo, Oslo, Norway
|
92
|
Chen H, Shao J, Zhang H, Jiang M, Huang L, Zhang Z, Yang D, He M, Ronaghi M, Luo X, Sun B, Wu W, Liu C. Sequencing and Analysis of Strobilanthes cusia (Nees) Kuntze Chloroplast Genome Revealed the Rare Simultaneous Contraction and Expansion of the Inverted Repeat Region in Angiosperm. FRONTIERS IN PLANT SCIENCE 2018; 9:324. [PMID: 29593773 PMCID: PMC5861152 DOI: 10.3389/fpls.2018.00324] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Received: 11/06/2017] [Accepted: 02/27/2018] [Indexed: 05/06/2023]
Abstract
Ban-Lan-Gen, the root tissues derived from several morphologically indistinguishable plant species, have been used widely in traditional Chinese medicines for numerous years. The identification of reliable markers to distinguish various source plant species is critical for the effective and safe use of products containing Ban-Lan-Gen. Here, we analyzed and characterized the complete chloroplast (cp) genome sequence of Strobilanthes cusia (Nees) Kuntze to identify high-resolution markers for the species determination of Southern Ban-Lan-Gen. Total DNA was extracted and subjected to next-generation sequencing. The cp genome was then assembled, and the gaps were filled using PCR amplification and Sanger sequencing. Genome annotation was conducted using CpGAVAS web server. The genome was 144,133 bp in length, presenting a typical quadripartite structure of large (LSC; 91,666 bp) and small (SSC; 17,328 bp) single-copy regions separated by a pair of inverted repeats (IRs; 17,811 bp). The genome encodes 113 unique genes, including 79 protein-coding, 30 transfer RNA, and 4 ribosomal RNA genes. A total of 20 tandem, 2 forward, and 6 palindromic repeats were detected in the genome. A phylogenetic analysis based on 65 protein-coding genes showed that S. cusia was closely related to Andrographis paniculata and Ruellia breedlovei, which belong to the same family, Acanthaceae. One interesting feature is that the IR regions apparently undergo simultaneous contraction and expansion, resulting in the presence of single copies of rps19, rpl2, rpl23, and ycf2 in the LSC region and the duplication of psbA and trnH genes in the IRs. This study provides the first complete cp genome in the genus Strobilanthes, containing critical information for the classification of various Strobilanthes species in the future. This study also provides the foundation for precisely determining the plant sources of Ban-Lan-Gen.
Affiliation(s)
- Haimei Chen
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Junjie Shao
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Hui Zhang
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Mei Jiang
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Linfang Huang
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Zhao Zhang
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Dan Yang
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Molly He
- Illumina, Inc., San Diego, CA, United States
| | - Xi Luo
- Illumina, Inc., San Diego, CA, United States
| | - Botao Sun
- Illumina, Inc., San Diego, CA, United States
| | - Wuwei Wu
- Guangxi Botanical Garden of Medicinal Plants, Nanning, China
| | - Chang Liu
- Key Laboratory of Bioactive Substances and Resource Utilization of Chinese Herbal Medicine, Ministry of Education, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
|
93
|
McKain MR, Johnson MG, Uribe‐Convers S, Eaton D, Yang Y. Practical considerations for plant phylogenomics. APPLICATIONS IN PLANT SCIENCES 2018; 6:e1038. [PMID: 29732268 PMCID: PMC5895195 DOI: 10.1002/aps3.1038] [Citation(s) in RCA: 101] [Impact Index Per Article: 14.4] [Received: 01/22/2018] [Accepted: 03/13/2018] [Indexed: 05/10/2023]
Abstract
The past decade has seen a major breakthrough in our ability to easily and inexpensively sequence genome-scale data from diverse lineages. The development of high-throughput sequencing and long-read technologies has ushered in the era of phylogenomics, where hundreds to thousands of nuclear genes and whole organellar genomes are routinely used to reconstruct evolutionary relationships. As a result, understanding which options are best suited for a particular set of questions can be difficult, especially for those just starting in the field. Here, we review the most recent advances in plant phylogenomic methods and make recommendations for project-dependent best practices and considerations. We focus on the costs and benefits of different approaches in regard to the information they provide researchers and the questions they can address. We also highlight unique challenges and opportunities in plant systems, such as polyploidy, reticulate evolution, and the use of herbarium materials, identifying optimal methodologies for each. Finally, we draw attention to lingering challenges in the field of plant phylogenomics, such as reusability of data sets, and look at some up-and-coming technologies that may help propel the field even further.
Affiliation(s)
- Michael R. McKain
- Department of Biological Sciences, The University of Alabama, Box 870344, Tuscaloosa, Alabama 35487, USA
| | - Matthew G. Johnson
- Department of Biological Sciences, Texas Tech University, 2901 Main Street, Box 43131, Lubbock, Texas 79409, USA
| | - Simon Uribe‐Convers
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University, Ann Arbor, Michigan 48109, USA
| | - Deren Eaton
- Department of Ecology, Evolution, and Environmental Biology, Columbia University, 1200 Amsterdam Avenue, New York, New York 10027, USA
| | - Ya Yang
- Department of Plant and Microbial Biology, University of Minnesota–Twin Cities, 1445 Gortner Avenue, St. Paul, Minnesota 55108, USA
|
94
|
Chen KK. Characterization of the complete chloroplast genome of the Tertiary relict tree Phellodendron amurense (Sapindales: Rutaceae) using Illumina sequencing technology. CONSERV GENET RESOUR 2018. [DOI: 10.1007/s12686-017-0761-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Indexed: 11/30/2022]
|
95
|
Fu CN, Li HT, Milne R, Zhang T, Ma PF, Yang J, Li DZ, Gao LM. Comparative analyses of plastid genomes from fourteen Cornales species: inferences for phylogenetic relationships and genome evolution. BMC Genomics 2017; 18:956. [PMID: 29216844 PMCID: PMC5721659 DOI: 10.1186/s12864-017-4319-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Received: 05/08/2017] [Accepted: 11/21/2017] [Indexed: 12/03/2022] Open
Abstract
Background The Cornales is the basal lineage of the asterids, the largest angiosperm clade. Phylogenetic relationships within the order were previously not fully resolved. Fifteen plastid genomes representing 14 species, ten genera and seven families of Cornales were newly sequenced for comparative analyses of genome features, evolution, and phylogenomics based on different partitioning schemes and filtering strategies. Results All plastomes of the 14 Cornales species had the typical quadripartite structure with a genome size ranging from 156,567 bp to 158,715 bp, which included two inverted repeats (25,859–26,451 bp) separated by a large single-copy region (86,089–87,835 bp) and a small single-copy region (18,250–18,856 bp). These plastomes encoded the same set of 114 unique genes including 31 transfer RNA, 4 ribosomal RNA and 79 coding genes, with an identical gene order across all examined Cornales species. Two genes (rpl22 and ycf15) contained premature stop codons in seven and five species respectively. The phylogenetic relationships among all sampled species were fully resolved with maximum support. Different filtering strategies (none, light and strict) of sequence alignment did not have an effect on these relationships. The topology recovered from coding and noncoding data sets was the same as for the whole plastome, regardless of filtering strategy. Moreover, mutational hotspots and highly informative regions were identified. Conclusions Phylogenetic relationships among families and intergeneric relationships within families of Cornales were well resolved. Different filtering strategies and partitioning schemes do not influence the relationships. Plastid genomes have great potential to resolve deep phylogenetic relationships of plants. Electronic supplementary material The online version of this article (10.1186/s12864-017-4319-9) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Chao-Nan Fu
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Hong-Tao Li
- Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - Richard Milne
- Institute of Molecular Plant Sciences, University of Edinburgh, King's Buildings, Edinburgh, Scotland, EH9 3JH, UK
| | - Ting Zhang
- Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - Peng-Fei Ma
- Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - Jing Yang
- Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - De-Zhu Li
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Germplasm Bank of Wild Species in Southwest China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - Lian-Ming Gao
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China.
|
96
|
Affiliation(s)
- Freek T. Bakker
- Biosystematics Group, Wageningen University, Wageningen, The Netherlands
|
97
|
The complete chloroplast genome of Sinojackia xylocarpa (Ericales: Styracaceae), an endangered plant species endemic to China. CONSERV GENET RESOUR 2017. [DOI: 10.1007/s12686-017-0763-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Indexed: 01/05/2023]
|