1
|
Wu Y, Zeng MY, Wang HX, Lan S, Liu ZJ, Zhang S, Li MH, Guan Y. The Complete Chloroplast Genomes of Bulbophyllum (Orchidaceae) Species: Insight into Genome Structure Divergence and Phylogenetic Analysis. Int J Mol Sci 2024; 25:2665. [PMID: 38473912 DOI: 10.3390/ijms25052665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 02/15/2024] [Accepted: 02/19/2024] [Indexed: 03/14/2024] Open
Abstract
Bulbophyllum is one of the largest genera and presents some of the most intricate taxonomic problems in the family Orchidaceae, including species of ornamental and medical importance. The lack of knowledge regarding the characterization of Bulbophyllum chloroplast (cp) genomes has imposed current limitations on our study. Here, we report the complete cp genomes of seven Bulbophyllum species, including B. ambrosia, B. crassipes, B. farreri, B. hamatum, B. shanicum, B. triste, and B. violaceolabellum, and compared with related taxa to provide a better understanding of their genomic information on taxonomy and phylogeny. A total of 28 Bulbophyllum cp genomes exhibit typical quadripartite structures with lengths ranging from 145,092 bp to 165,812 bp and a GC content of 36.60% to 38.04%. Each genome contained 125-132 genes, encompassing 74-86 protein-coding genes, 38 tRNA genes, and eight rRNA genes. The genome arrangements, gene contents, and length were similar, with differences observed in ndh gene composition. It is worth noting that there were exogenous fragment insertions in the IR regions of B. crassipes. A total of 18-49 long repeats and 38-80 simple sequence repeats (SSRs) were detected and the single nucleotide (A/T) was dominant in Bulbophyllum cp genomes, with an obvious A/T preference. An analysis of relative synonymous codon usage (RSCU) revealed that leucine (Leu) was the most frequently used codon, while cysteine (Cys) was the least used. Six highly variable regions (rpl32-trnLUAG > trnTUGU-trnLUAA > trnFGAA-ndhJ > rps15-ycf1 > rbcL-accD > psbI-trnSGCU) and five coding sequences (ycf1 > rps12 > matK > psbK > rps15) were identified as potential DNA markers based on nucleotide diversity. Additionally, 31,641 molecular diagnostic characters (MDCs) were identified in complete cp genomes. A phylogenetic analysis based on the complete cp genome sequences and 68 protein-coding genes strongly supported that 28 Bulbophyllum species can be divided into four branches, sects. Brachyantha, Cirrhopetalum, and Leopardinae, defined by morphology, were non-monophyly. Our results enriched the genetic resources of Bulbophyllum, providing valuable information to illustrate the complicated taxonomy, phylogeny, and evolution process of the genus.
Collapse
Affiliation(s)
- Yuwei Wu
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at Landscape Architecture and Arts, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Meng-Yao Zeng
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at Landscape Architecture and Arts, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Huan-Xin Wang
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at Landscape Architecture and Arts, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Siren Lan
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at Landscape Architecture and Arts, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Fujian Colleges and Universities Engineering Research Institute of Conservation and Utilization of Natural Bioresources, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Zhong-Jian Liu
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at Landscape Architecture and Arts, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Fujian Colleges and Universities Engineering Research Institute of Conservation and Utilization of Natural Bioresources, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Shibao Zhang
- Key Laboratory of Economic Plants and Biotechnology, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - Ming-He Li
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at Landscape Architecture and Arts, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Fujian Colleges and Universities Engineering Research Institute of Conservation and Utilization of Natural Bioresources, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Yunxiao Guan
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at Landscape Architecture and Arts, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Fujian Colleges and Universities Engineering Research Institute of Conservation and Utilization of Natural Bioresources, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| |
Collapse
|
2
|
Chen XH, Ding LN, Zong XY, Xu H, Wang WB, Ding R, Qu B. The complete chloroplast genome sequences of four Liparis species (Orchidaceae) and phylogenetic implications. Gene 2023; 888:147760. [PMID: 37661026 DOI: 10.1016/j.gene.2023.147760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 08/18/2023] [Accepted: 09/01/2023] [Indexed: 09/05/2023]
Abstract
Liparis Richard (Malaxideae, Epidendroideae) is a large and diverse genus of the family Orchidaceae, the taxonomy of which is complicated and controversial. In this study, we sequenced, assembled and analyzed four complete chloroplast genomes of Liparis species including L. kumokiri, L. makinoana, L. pauliana, and L. viridiflora, and evaluated their phylogenetic relationships with related species for the first time. These four chloroplast genomes (size range 153,095 to 158,239 bp) possess typical quadripartite structures that consist of a large single copy (LSC, 83,533-86,752 bp), a small single copy (SSC, 17,938-18,156 bp) and a pair of inverted repeats (IRs, 26,421-26,933 bp). The genomes contain 133 genes, including 87 protein coding genes, 38 tRNAs and 8 rRNA genes. The genome arrangements, gene contents, gene order, long repeats and simple sequence repeats were similar with small differences observed among these four chloroplast genomes. Five highly variable regions including ycf1, ndhA, ndhF, trnQ and trnK were identified from the comparative analysis with other nine related Liparis species, which had the potential to be used as DNA markers for species identification and phylogenetic studies of Liparis species. Phylogenetic analysis based on the complete chloroplast genome sequences strongly supported the polyphyly of Liparis and its further division into three branches. These results provided valuable information to illustrate the complicated taxonomy, phylogeny and evolution process of the Liparis genus.
Collapse
Affiliation(s)
- Xu-Hui Chen
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang 110161, Liaoning, PR China
| | - Li-Na Ding
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang 110161, Liaoning, PR China
| | - Xiao-Yan Zong
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang 110161, Liaoning, PR China
| | - Hua Xu
- School of Modern Industry for Selenium Science and Engineering, Wuhan Polytechnic University, Wuhan 430048, PR China
| | - Wei-Bin Wang
- College of Plant Protection, Shenyang Agricultural University, Shenyang 110161, Liaoning, PR China
| | - Rui Ding
- College of Land and Environment, Shenyang Agricultural University, Shenyang 110161, Liaoning, PR China.
| | - Bo Qu
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang 110161, Liaoning, PR China.
| |
Collapse
|
3
|
Chloroplast Genome Evolution and Species Identification of Styrax (Styracaceae). BIOMED RESEARCH INTERNATIONAL 2022; 2022:5364094. [PMID: 35252450 PMCID: PMC8893999 DOI: 10.1155/2022/5364094] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 02/11/2022] [Indexed: 01/21/2023]
Abstract
The genus Styrax L. consists of approximately 130 species distributed in the Americas, eastern Asia, and the Mediterranean region. The phylogeny and evolutionary history of this genus are not clear. Knowledge of the phylogenetic relationships and the method for species identification will be critical for the evolution of this genus. In this study, we sequenced the chloroplast genome of 17 Styrax samples and added 17 additional chloroplast genome sequences from GenBank. The data were used to investigate chloroplast genome evolution, infer phylogenetic relationships, and access the species identification rate within Styrax. The Styrax chloroplast genome contains typical quadripartite structures, ranging from 157,641 bp to 159,333 bp. The chloroplast genome contains 114 unique genes. The P distance among the Styrax species ranged from 0.0003 to 0.00611. Seventeen small inversions and SSR sites were discovered in the Styrax chloroplast genome. By comparing with the chloroplast genome sequences, six mutation hotspots were identified, and the markers ycf1b and trnT-trnL were identified as the best Styrax-specific DNA barcodes. The specific barcodes and superbarcode exhibited higher discriminatory power than universal barcodes. Chloroplast phylogenomic results improved the resolution of the phylogenetic relationships of Styrax compared to previous analyses.
Collapse
|
4
|
Kumar S, Singh A, Shanker A. pSATdb: a database of mitochondrial common, polymorphic, and unique microsatellites. Life Sci Alliance 2022; 5:5/6/e202101307. [PMID: 35181599 PMCID: PMC8860089 DOI: 10.26508/lsa.202101307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 02/03/2022] [Accepted: 02/04/2022] [Indexed: 12/02/2022] Open
Abstract
The polymorphic microSATellites database (pSATdb) provides information on common, polymorphic, and unique mitochondrial microsatellites. Microsatellites, also termed as simple sequence repeats, are repetitive tracts in a DNA sequence, typically consisting of one to six nucleotides. These repeats are found in all genomes and play key roles in phylogeny and species identification. Microsatellites are highly polymorphic, and their length may differ from species to species. There are several online resources dedicated to mitochondria; however, comprehensive information is not available about the length variation of mitochondrial microsatellites. Therefore, to explore it between species among a genus, we have developed a database named pSATdb (polymorphic microSATellites database; https://lms.snu.edu.in/pSATdb/). pSATdb contains 28,710 perfect microsatellites identified across 5,976 mitochondrial genome (mt-genome) sequences from 1,576 genera which includes 1,535 (5,846 mt-genome) and 41 (130 mt-genome) genera of Metazoa and Viridiplantae, respectively. pSATdb is the only database which provides genus-wise information about the length variation of mitochondrial microsatellites. Because of the emerging role of microsatellites in genomics studies, the identified common, polymorphic, and unique microsatellites stored in pSATdb will be effectively useful in various studies including genetic diversity, mapping, marker-assisted selection, and comparative population studies.
Collapse
Affiliation(s)
- Sonu Kumar
- Department of Bioinformatics, Central University of South Bihar, Gaya, India
| | - Ashutosh Singh
- Translational Bioinformatics Lab, Department of Life Sciences, Shiv Nadar University, Greater Noida, India
| | - Asheesh Shanker
- Department of Bioinformatics, Central University of South Bihar, Gaya, India
| |
Collapse
|
5
|
Sun J, Wang Y, Liu Y, Xu C, Yuan Q, Guo L, Huang L. Evolutionary and phylogenetic aspects of the chloroplast genome of Chaenomeles species. Sci Rep 2020; 10:11466. [PMID: 32651417 PMCID: PMC7351712 DOI: 10.1038/s41598-020-67943-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 06/10/2020] [Indexed: 01/23/2023] Open
Abstract
Chaenomeles (family Rosaceae) is a genus of five diploid species of deciduous spiny shrubs that are native to Central Asia and Japan. It is an important horticultural crop (commonly known as flowering quinces) in Europe and Asia for its high yield in fruits that are rich in juice, aroma, and dietary fiber. Therefore, the development of effective genetic markers of Chaenomeles species is advantageous for crop improvement through breeding and selection. In this study, we successfully assembled and analyzed the chloroplast genome of five Chaenomeles species. The chloroplast genomes of the five Chaenomeles species were very similar with no structural or content rearrangements among them. The chloroplast genomes ranged from 159,436 to 160,040 bp in length and contained a total of 112 unique genes, including 78 protein-coding genes, 30 tRNAs, and 4 rRNAs. Three highly variable regions, including trnR-atpA, trnL-F, and rpl32-ccsA, were identified. Phylogenetic analysis based on the complete chloroplast genome showed that Chaenomeles forms a monophyletic clade and had a close relationship with the genera Docynia and Malus. Analyses for phylogenetic relationships and the development of available genetic markers in future could provide valuable information regarding genetics and breeding mechanisms of the Chaenomeles species.
Collapse
Affiliation(s)
- Jiahui Sun
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
| | - Yiheng Wang
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
| | - Yanlei Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Chao Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Qingjun Yuan
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China.
| | - Lanping Guo
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China.
| | - Luqi Huang
- National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China.
| |
Collapse
|
6
|
Cappa R, de Campos C, Maxwell AP, McKnight AJ. "Mitochondrial Toolbox" - A Review of Online Resources to Explore Mitochondrial Genomics. Front Genet 2020; 11:439. [PMID: 32457801 PMCID: PMC7225359 DOI: 10.3389/fgene.2020.00439] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 04/09/2020] [Indexed: 12/30/2022] Open
Abstract
Mitochondria play a significant role in many biological systems. There is emerging evidence that differences in the mitochondrial genome may contribute to multiple common diseases, leading to an increasing number of studies exploring mitochondrial genomics. There is often a large amount of complex data generated (for example via next generation sequencing), which requires optimised bioinformatics tools to efficiently and effectively generate robust outcomes from these large datasets. Twenty-four online resources dedicated to mitochondrial genomics were reviewed. This 'mitochondrial toolbox' summary resource will enable researchers to rapidly identify the resource(s) most suitable for their needs. These resources fulfil a variety of functions, with some being highly specialised. No single tool will provide all users with the resources they require; therefore, the most suitable tool will vary between users depending on the nature of the work they aim to carry out. Genetics resources are well established for phylogeny and DNA sequence changes, but further epigenetic and gene expression resources need to be developed for mitochondrial genomics.
Collapse
Affiliation(s)
- Ruaidhri Cappa
- Centre for Public Health, Institute of Clinical Sciences B, Queen's University Belfast, Royal Victoria Hospital, Belfast, United Kingdom
| | - Cassio de Campos
- School of Electronics, Electrical Engineering and Computer Science, Queen's University Belfast, Belfast, United Kingdom
| | - Alexander P Maxwell
- Centre for Public Health, Institute of Clinical Sciences B, Queen's University Belfast, Royal Victoria Hospital, Belfast, United Kingdom
| | - Amy J McKnight
- Centre for Public Health, Institute of Clinical Sciences B, Queen's University Belfast, Royal Victoria Hospital, Belfast, United Kingdom
| |
Collapse
|
7
|
Complete Chloroplast Genome Sequence of Decaisnea insignis: Genome Organization, Genomic Resources and Comparative Analysis. Sci Rep 2017; 7:10073. [PMID: 28855603 PMCID: PMC5577308 DOI: 10.1038/s41598-017-10409-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 08/08/2017] [Indexed: 11/09/2022] Open
Abstract
Decaisnea insignis is a wild resource plant and is used as an ornamental, medicinal, and fruit plant. High-throughput sequencing of chloroplast genomes has provided insight into the overall evolutionary dynamics of chloroplast genomes and has enhanced our understanding of the evolutionary relationships within plant families. In the present study, we sequenced the complete chloroplast genome of D. insignis and used the data to assess its genomic resources. The D. insignis chloroplast genome is 158,683 bp in length and includes a pair of inverted repeats of 26,167 bp that are separated by small and large single copy regions of 19,162 bp and 87,187 bp, respectively. We identified 83 simple sequence repeats and 18 pairs of large repeats. Most simple-sequence repeats were located in the noncoding sections of the large single-copy/small single-copy region and exhibited a high A/T content. The D. insignis chloroplast genome bias was skewed towards A/T on the basis of codon usage. A phylogenetic tree based on 82 protein-coding genes of 33 angiosperms showed that D. insignis was clustered with Akebia in Lardizabalaceae. Overall, the results of this study will contribute to better understanding the evolution, molecular biology and genetic improvement of D. insignis.
Collapse
|
8
|
Ivanova Z, Sablok G, Daskalova E, Zahmanova G, Apostolova E, Yahubyan G, Baev V. Chloroplast Genome Analysis of Resurrection Tertiary Relict Haberlea rhodopensis Highlights Genes Important for Desiccation Stress Response. FRONTIERS IN PLANT SCIENCE 2017; 8:204. [PMID: 28265281 PMCID: PMC5316520 DOI: 10.3389/fpls.2017.00204] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Accepted: 02/03/2017] [Indexed: 05/20/2023]
Abstract
Haberlea rhodopensis is a paleolithic tertiary relict species, best known as a resurrection plant with remarkable tolerance to desiccation. When exposed to severe drought stress, H. rhodopensis shows an ability to maintain the structural integrity of its photosynthetic apparatus, which re-activates easily upon rehydration. We present here the results from the assembly and annotation of the chloroplast (cp) genome of H. rhodopensis, which was further subjected to comparative analysis with the cp genomes of closely related species. H. rhodopensis showed a cp genome size of 153,099 bp, harboring a pair of inverted repeats (IR) of 25,415 bp separated by small and large copy regions (SSC and LSC) of 17,826 and 84,443 bp. The genome structure, gene order, GC content and codon usage are similar to those of the typical angiosperm cp genomes. The genome hosts 137 genes representing 70.66% of the plastome, which includes 86 protein-coding genes, 36 tRNAs, and 4 rRNAs. A comparative plastome analysis with other closely related Lamiales members revealed conserved gene order in the IR and LSC/SSC regions. A phylogenetic analysis based on protein-coding genes from 33 species defines this species as belonging to the Gesneriaceae family. From an evolutionary point of view, a site-specific selection analysis detected positively selected sites in 17 genes, most of which are involved in photosynthesis (e.g., rbcL, ndhF, accD, atpE, etc.). The observed codon substitutions may be interpreted as being a consequence of molecular adaptation to drought stress, which ensures an evolutionary advantage to H. rhodopensis.
Collapse
Affiliation(s)
- Zdravka Ivanova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Gaurav Sablok
- Plant Functional Biology and Climate Change Cluster, University of Technology at Sydney, SydneyNSW, Australia
| | - Evelina Daskalova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Gergana Zahmanova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Elena Apostolova
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Galina Yahubyan
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
| | - Vesselin Baev
- Department of Plant Physiology and Molecular Biology, University of PlovdivPlovdiv, Bulgaria
- *Correspondence: Vesselin Baev,
| |
Collapse
|
9
|
Sablok G, Pérez-Pulido AJ, Do T, Seong TY, Casimiro-Soriguer CS, La Porta N, Ralph PJ, Squartini A, Muñoz-Merida A, Harikrishna JA. PlantFuncSSR: Integrating First and Next Generation Transcriptomics for Mining of SSR-Functional Domains Markers. FRONTIERS IN PLANT SCIENCE 2016; 7:878. [PMID: 27446111 PMCID: PMC4922199 DOI: 10.3389/fpls.2016.00878] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 06/03/2016] [Indexed: 05/30/2023]
Abstract
Analysis of repetitive DNA sequence content and divergence among the repetitive functional classes is a well-accepted approach for estimation of inter- and intra-generic differences in plant genomes. Among these elements, microsatellites, or Simple Sequence Repeats (SSRs), have been widely demonstrated as powerful genetic markers for species and varieties discrimination. We present PlantFuncSSRs platform having more than 364 plant species with more than 2 million functional SSRs. They are provided with detailed annotations for easy functional browsing of SSRs and with information on primer pairs and associated functional domains. PlantFuncSSRs can be leveraged to identify functional-based genic variability among the species of interest, which might be of particular interest in developing functional markers in plants. This comprehensive on-line portal unifies mining of SSRs from first and next generation sequencing datasets, corresponding primer pairs and associated in-depth functional annotation such as gene ontology annotation, gene interactions and its identification from reference protein databases. PlantFuncSSRs is freely accessible at: http://www.bioinfocabd.upo.es/plantssr.
Collapse
Affiliation(s)
- Gaurav Sablok
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology, SydneyNSW, Australia
| | - Antonio J. Pérez-Pulido
- Centro Andaluz de Biología del Desarrollo (CABD-CSIC), Universidad Pablo de OlavideSevilla, Spain
| | - Thac Do
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology, SydneyNSW, Australia
| | - Tan Y. Seong
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of MalayaKuala Lumpur, Malaysia
| | | | - Nicola La Porta
- Department of Sustainable Agro-Ecosystems and Bioresources, Research and Innovation Centre, Fondazione Edmund MachTrento, Italy
- MountFOR Project Centre, European Forest InstituteTrento, Italy
- Consiglio Nazionale delle Ricerche, Istituto per la Valorizzazione del Legno e delle Specie ArboreeFlorence, Italy
| | - Peter J. Ralph
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology, SydneyNSW, Australia
| | - Andrea Squartini
- Department of Agronomy, Food, Natural Resources, Animals and Environment, University of PadovaPadova, Italy
| | - Antonio Muñoz-Merida
- CIBIO Research Centre in Biodiversity and Genetic Resources, InBIO, Universidade do PortoVila do Conde, Portugal
| | - Jennifer A. Harikrishna
- Centre for Research in Biotechnology for Agriculture and Institute of Biological Sciences, Faculty of Science, University of MalayaKuala Lumpur, Malaysia
| |
Collapse
|
10
|
Sablok G, Padma Raju GV, Mudunuri SB, Prabha R, Singh DP, Baev V, Yahubyan G, Ralph PJ, La Porta N. ChloroMitoSSRDB 2.00: more genomes, more repeats, unifying SSRs search patterns and on-the-fly repeat detection. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav084. [PMID: 26412851 PMCID: PMC4584093 DOI: 10.1093/database/bav084] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 08/17/2015] [Indexed: 01/13/2023]
Abstract
Organelle genomes evolve rapidly as compared with nuclear genomes and have been widely used for developing microsatellites or simple sequence repeats (SSRs) markers for delineating phylogenomics. In our previous reports, we have established the largest repository of organelle SSRs, ChloroMitoSSRDB, which provides access to 2161 organelle genomes (1982 mitochondrial and 179 chloroplast genomes) with a total of 5838 perfect chloroplast SSRs, 37 297 imperfect chloroplast SSRs, 5898 perfect mitochondrial SSRs and 50 355 imperfect mitochondrial SSRs across organelle genomes. In the present research, we have updated ChloroMitoSSRDB by systematically analyzing and adding additional 191 chloroplast and 2102 mitochondrial genomes. With the recent update, ChloroMitoSSRDB 2.00 provides access to a total of 4454 organelle genomes displaying a total of 40 653 IMEx Perfect SSRs (11 802 Chloroplast Perfect SSRs and 28 851 Mitochondria Perfect SSRs), 275 981 IMEx Imperfect SSRs (78 972 Chloroplast Imperfect SSRs and 197 009 Mitochondria Imperfect SSRs), 35 250 MISA (MIcroSAtellite identification tool) Perfect SSRs and 3211 MISA Compound SSRs and associated information such as location of the repeats (coding and non-coding), size of repeat, motif and length polymorphism, and primer pairs. Additionally, we have integrated and made available several in silico SSRs mining tools through a unified web-portal for in silico repeat mining for assembled organelle genomes and from next generation sequencing reads. ChloroMitoSSRDB 2.00 allows the end user to perform multiple SSRs searches and easy browsing through the SSRs using two repeat algorithms and provide primer pair information for identified SSRs for evolutionary genomics. Database URL:http://www.mcr.org.in/chloromitossrdb
Collapse
Affiliation(s)
- Gaurav Sablok
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology Sydney, PO Box 123, Broadway, NSW 2007, Australia, Environmental Biotechnology Platform, Research and Innovation Center, Fondazione Edmund Mach (FEM), IASMA Via Mach 1., 38010 San Michele all'Adige (TN), Italy,
| | - G V Padma Raju
- Department of Computer Science and Engineering, S.R.K.R Engineering College, Chinna Amiram, Bhimavaram 534204, Andhra Pradesh, India
| | - Suresh B Mudunuri
- Technology Centre, S.R.K.R. Engineering College, Chinna Amiram, Bhimavaram 534204, Andhra Pradesh, India
| | - Ratna Prabha
- National Bureau of Agriculturally Important Microorganisms (NBAIM) (Indian Council of Agricultural Research), Maunath Bhanjan 275101, Uttar Pradesh, India and
| | - Dhananjaya P Singh
- National Bureau of Agriculturally Important Microorganisms (NBAIM) (Indian Council of Agricultural Research), Maunath Bhanjan 275101, Uttar Pradesh, India and
| | - Vesselin Baev
- Department of Plant Physiology and Molecular Biology, University of Plovdiv, 24 Tsar Assen St, 4000 Plovdiv, Bulgaria
| | - Galina Yahubyan
- Department of Plant Physiology and Molecular Biology, University of Plovdiv, 24 Tsar Assen St, 4000 Plovdiv, Bulgaria
| | - Peter J Ralph
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology Sydney, PO Box 123, Broadway, NSW 2007, Australia
| | - Nicola La Porta
- Environmental Biotechnology Platform, Research and Innovation Center, Fondazione Edmund Mach (FEM), IASMA Via Mach 1., 38010 San Michele all'Adige (TN), Italy
| |
Collapse
|
11
|
Nagpure NS, Rashid I, Pathak AK, Singh M, Pati R, Singh SP, Sarkar UK. FMiR: A Curated Resource of Mitochondrial DNA Information for Fish. PLoS One 2015; 10:e0136711. [PMID: 26317619 PMCID: PMC4552752 DOI: 10.1371/journal.pone.0136711] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2014] [Accepted: 08/07/2015] [Indexed: 11/18/2022] Open
Abstract
Mitochondrial genome sequences have been widely used for evolutionary and phylogenetic studies. Among vertebrates, fish are an important, diverse group, and their mitogenome sequences are growing rapidly in public repositories. To facilitate mitochondrial genome analysis and to explore the valuable genetic information, we developed the Fish Mitogenome Resource (FMiR) database to provide a workbench for mitogenome annotation, species identification and microsatellite marker mining. The microsatellites are also known as simple sequence repeats (SSRs) and used as molecular markers in studies on population genetics, gene duplication and marker assisted selection. Here, easy-to-use tools have been implemented for mining SSRs and for designing primers to identify species/habitat specific markers. In addition, FMiR can analyze complete or partial mitochondrial genome sequence to identify species and to deduce relational distances among sequences across species. The database presently contains curated mitochondrial genomes from 1302 fish species belonging to 297 families and 47 orders reported from saltwater and freshwater ecosystems. In addition, the database covers information on fish species such as conservation status, ecosystem, family, distribution and occurrence downloaded from the FishBase and IUCN Red List databases. Those fish information have been used to browse mitogenome information for the species belonging to a particular category. The database is scalable in terms of content and inclusion of other analytical modules. The FMiR is running under Linux operating platform on high performance server accessible at URL http://mail.nbfgr.res.in/fmir.
Collapse
Affiliation(s)
- Naresh Sahebrao Nagpure
- Division of Molecular Biology and Biotechnology, National Bureau of Fish Genetic Resources, Lucknow-226002, India
| | - Iliyas Rashid
- Division of Molecular Biology and Biotechnology, National Bureau of Fish Genetic Resources, Lucknow-226002, India
| | - Ajey Kumar Pathak
- Division of Fish Taxonomy and Resources, National Bureau of Fish Genetic Resources, Lucknow-226002, India
| | - Mahender Singh
- Division of Molecular Biology and Biotechnology, National Bureau of Fish Genetic Resources, Lucknow-226002, India
- * E-mail:
| | - Rameshwar Pati
- Division of Molecular Biology and Biotechnology, National Bureau of Fish Genetic Resources, Lucknow-226002, India
| | - Shri Prakash Singh
- Division of Fish Taxonomy and Resources, National Bureau of Fish Genetic Resources, Lucknow-226002, India
| | - Uttam Kumar Sarkar
- Division of Fish Taxonomy and Resources, National Bureau of Fish Genetic Resources, Lucknow-226002, India
| |
Collapse
|
12
|
Hu S, Sablok G, Wang B, Qu D, Barbaro E, Viola R, Li M, Varotto C. Plastome organization and evolution of chloroplast genes in Cardamine species adapted to contrasting habitats. BMC Genomics 2015; 16:306. [PMID: 25887666 PMCID: PMC4446112 DOI: 10.1186/s12864-015-1498-0] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2014] [Accepted: 03/27/2015] [Indexed: 11/10/2022] Open
Abstract
Background Plastid genomes, also known as plastomes, are shaped by the selective forces acting on the fundamental cellular functions they code for and thus they are expected to preserve signatures of the adaptive path undertaken by different plant species during evolution. To identify molecular signatures of positive selection associated to adaptation to contrasting ecological niches, we sequenced with Solexa technology the plastomes of two congeneric Brassicaceae species with different habitat preference, Cardamine resedifolia and Cardamine impatiens. Results Following in-depth characterization of plastome organization, repeat patterns and gene space, the comparison of the newly sequenced plastomes between each other and with 15 fully sequenced Brassicaceae plastomes publically available in GenBank uncovered dynamic variation of the IR boundaries in the Cardamine lineage. We further detected signatures of positive selection in ten of the 75 protein-coding genes of the examined plastomes, identifying a range of chloroplast functions putatively involved in adaptive processes within the family. For instance, the three residues found to be under positive selection in RUBISCO could possibly be involved in the modulation of RUBISCO aggregation/activation and enzymatic specificty in Brassicaceae. In addition, our results points to differential evolutionary rates in Cardamine plastomes. Conclusions Overall our results support the existence of wider signatures of positive selection in the plastome of C. resedifolia, possibly as a consequence of adaptation to high altitude environments. We further provide a first characterization of the selective patterns shaping the Brassicaceae plastomes, which could help elucidate the driving forces underlying adaptation and evolution in this important plant family. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1498-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Shiliang Hu
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy.
| | - Gaurav Sablok
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy.
| | - Bo Wang
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy.
| | - Dong Qu
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy. .,College of Horticulture, Northwest Agricultural and Forest University, 712100, Yangling, Shaanxi, PR China.
| | - Enrico Barbaro
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy.
| | - Roberto Viola
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy.
| | - Mingai Li
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy.
| | - Claudio Varotto
- Ecogenomics Laboratory, Department of Biodiversity and Molecular Ecology, Research and Innovation Centre, Fondazione Edmund Mach, Via E. Mach 1, 38010 S, Michele all'Adige (TN), Italy.
| |
Collapse
|
13
|
Kapil A, Rai PK, Shanker A. ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants. Database (Oxford) 2014; 2014:bau107. [PMID: 25380781 PMCID: PMC4224265 DOI: 10.1093/database/bau107] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Revised: 09/19/2014] [Accepted: 10/08/2014] [Indexed: 01/08/2023]
Abstract
Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/
Collapse
Affiliation(s)
- Aditi Kapil
- Department of Bioinformatics, Banasthali University, Banasthali, Rajasthan 304022, India and Department of Mathematics and Statistics, Banasthali University, Banasthali, Rajasthan 304022, India
| | - Piyush Kant Rai
- Department of Bioinformatics, Banasthali University, Banasthali, Rajasthan 304022, India and Department of Mathematics and Statistics, Banasthali University, Banasthali, Rajasthan 304022, India
| | - Asheesh Shanker
- Department of Bioinformatics, Banasthali University, Banasthali, Rajasthan 304022, India and Department of Mathematics and Statistics, Banasthali University, Banasthali, Rajasthan 304022, India
| |
Collapse
|
14
|
Nagpure NS, Rashid I, Pathak AK, Singh M, Singh SP, Sarkar UK. In silico analysis of SSRs in mitochondrial genomes of fishes. ACTA ACUST UNITED AC 2014; 26:195-201. [PMID: 24660911 DOI: 10.3109/19401736.2014.892075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The availability of fish mitochondrial (mt) genomes provides an opportunity to explore the simple sequence repeats. In the present study, mt genomes of 85 fish species reported from Indian subcontinent were downloaded from NCBI and computationally analysed for finding SSRs types, frequency of occurrence, mutation and evolutionary adaptation across species. A total of 92 microsatellites in different nucleotide combinations were detected in 59 species. 26 interspersed SSRs, mostly poly (AT)n were found in the D-loop regions in the species of Cyprinidae. Fifty-six SSRs of 12 bp fixed length were observed in eight genes only. Further, identical repeat motifs were found on the same location in ATP6 and ND4 genes, which were biased towards particular habitat. The comparison of ATP6 and ND4 gene sets to other homologous sequences showed point mutations. This study explores the SSRs discovery and their utility as marker for species and population identification.
Collapse
|
15
|
MitoSatPlant: mitochondrial microsatellites database of viridiplantae. Mitochondrion 2014; 19 Pt B:334-7. [PMID: 24561221 DOI: 10.1016/j.mito.2014.02.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Revised: 02/10/2014] [Accepted: 02/11/2014] [Indexed: 01/12/2023]
Abstract
Microsatellites also known as simple sequence repeats (SSRs) consist of 1-6 nucleotide long repeating units. The importance of mitochondrial SSRs (mtSSRs) in fields like population genetics, plant phylogenetics and genome mapping motivated us to develop MitoSatPlant, a repository of plant mtSSRs. It contains information for perfect, imperfect and compound SSRs mined from 92 mitochondrial genomes of green plants, available at NCBI (as of 1 Feb 2014). A total of 72,798 SSRs were found, of which PCR primers were designed for 72,495 SSRs. Among all sequences, tetranucleotide repeats (26,802) were found to be most abundant whereas hexanucleotide repeats (2751) were detected with least frequency. MitoSatPlant was developed using SQL server 2008 and can be accessed through a front end designed in ASP.Net. It is an easy to use, user-friendly database and will prove to be a useful resource for plant scientists. To the best of our knowledge MitoSatPlant is the only database available for plant mtSSRs and can be freely accessed at http://compubio.in/mitosatplant/.
Collapse
|
16
|
Torales SL, Rivarola M, Pomponio MF, Gonzalez S, Acuña CV, Fernández P, Lauenstein DL, Verga AR, Hopp HE, Paniego NB, Poltri SNM. De novo assembly and characterization of leaf transcriptome for the development of functional molecular markers of the extremophile multipurpose tree species Prosopis alba. BMC Genomics 2013; 14:705. [PMID: 24125525 PMCID: PMC4008253 DOI: 10.1186/1471-2164-14-705] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2013] [Accepted: 10/07/2013] [Indexed: 11/10/2022] Open
Abstract
Background Prosopis alba (Fabaceae) is an important native tree adapted to arid and semiarid regions of north-western Argentina which is of great value as multipurpose species. Despite its importance, the genomic resources currently available for the entire Prosopis genus are still limited. Here we describe the development of a leaf transcriptome and the identification of new molecular markers that could support functional genetic studies in natural and domesticated populations of this genus. Results Next generation DNA pyrosequencing technology applied to P. alba transcripts produced a total of 1,103,231 raw reads with an average length of 421 bp. De novo assembling generated a set of 15,814 isotigs and 71,101 non-assembled sequences (singletons) with an average of 991 bp and 288 bp respectively. A total of 39,000 unique singletons were identified after clustering natural and artificial duplicates from pyrosequencing reads. Regarding the non-redundant sequences or unigenes, 22,095 out of 54,814 were successfully annotated with Gene Ontology terms. Moreover, simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 5,992 and 6,236 markers, respectively, throughout the genome. For the validation of the the predicted SSR markers, a subset of 87 SSRs selected through functional annotation evidence was successfully amplified from six DNA samples of seedlings. From this analysis, 11 of these 87 SSRs were identified as polymorphic. Additionally, another set of 123 nuclear polymorphic SSRs were determined in silico, of which 50% have the probability of being effectively polymorphic. Conclusions This study generated a successful global analysis of the P. alba leaf transcriptome after bioinformatic and wet laboratory validations of RNA-Seq data. The limited set of molecular markers currently available will be significantly increased with the thousands of new markers that were identified in this study. This information will strongly contribute to genomics resources for P. alba functional analysis and genetics. Finally, it will also potentially contribute to the development of population-based genome studies in the genera.
Collapse
Affiliation(s)
- Susana L Torales
- Instituto de Recursos Biológicos, IRB, Instituto Nacional de Tecnología Agropecuaria (INTA Castelar), CC 25, Castelar B1712WAA, Argentina.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|