1
|
Quantification of adulteration in traded ayurvedic raw drugs employing machine learning approaches with DNA barcode database. 3 Biotech 2021; 11:463. [PMID: 34745814 DOI: 10.1007/s13205-021-03001-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 09/26/2021] [Indexed: 10/20/2022] Open
Abstract
Adulteration of expensive raw drugs with inferior taxa has become a routine practice, conceding the quality and safety of derived herbal products. In this regard, the study addresses the development of an integrated approach encompassing DNA barcode and HPTLC fingerprinting to authenticate chiefly traded ayurvedic raw drugs in south India [viz. Saraca asoca (Roxb.) Willd., Terminalia arjuna (Roxb. ex DC.) Wight and Arn., Sida alnifolia L. and Desmodium gangeticum (L.) DC.] from its adulterants. Consortium of Barcode of Life (CBOL) recommended DNA barcode gene regions viz. nuclear ribosomal-Internal Transcribed Spacer (nrDNA-ITS), maturase K (matK), ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit (rbcL) and psbA-trnH spacer regions along with HPTLC profiling were experimented and a reference database was created. Further, an integrated analytical approach employing genetic distance-based Maximum Likelihood phylogenetic tree and Artificial Intelligence (AI)based Machine Learning Algorithms (MLA)-Waikato Environment for Knowledge Analysis (WEKA) and Barcoding with Logic (BLOG) were employed to prove efficacy of DNA barcode tool. Even though, among the four barcodes, psbA-trnH (S. alnifolia and its adulterants, T. arjuna and its adulterants) or ITS region (S. asoca and its adulterants, D. gangeticum and its adulterants) showed highest inter specific divergences in the selected Biological Reference Materials (BRMs), rbcL or matK barcode regions alone were successful for authentication of traded samples. The automated species identification techniques, WEKA and BLOG, experimented for the first time in India for raw drug validation, could achieve rapid and precise identification. A national certification agency for raw drug authentication employing an integrated approach involving a DNA barcoding tool along with standard organoleptic and analytical methods can strengthen and ensure safety and quality of herbal medicines in India. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s13205-021-03001-5.
Collapse
|
2
|
Unnikrishnan R, Dev SA, Jayaraj R. Pitfalls and promises of raw drug identification techniques in the ayurvedic industry: an overview. 3 Biotech 2020; 10:497. [PMID: 33150123 DOI: 10.1007/s13205-020-02482-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 10/13/2020] [Indexed: 11/26/2022] Open
Abstract
India, with a rich heritage of floral diversity, is well-known for its medicinal plant wealth and is the largest producer of medicinal herbs in the world. Ethnobiological Survey of Ministry of Environment and Forests (MOEF) could identify 8000 plant species utilized in various systems of medicine with approximately 25,000 effective herbal formulations. The extensive consumption to meet demand-supply ratio exerts a heavy strain on the existing resources. This subsequently led to the adulteration and substitution of medicinal plants with look-alike species. The consumer's faith on herbal medicine is in the phase of decline due to the extremities in adulteration/substitution and ensuing consequences. It is imperative to bring forth universally acceptable standard tools to authenticate raw drugs before being processed further into formulations. A vast array of techniques such as physical, chemical (analytical), biochemical, anatomical, organoleptic, and recently emerged DNA based molecular methods are widely used for plant species authentication. In recent years, DNA barcoding has made remarkable progress in the field of medicinal plants research. DNA metabarcoding is the latest development for qualitative evaluation of the herbal formulations, whereas for quantitative analysis, combination of pharmacognostic, pharmacovigilance and analytical methods are inevitable for authentication. This review addresses the overall strengths and shortcomings of the existing as well as recently emerged techniques in authenticating ayurvedic raw drugs.
Collapse
Affiliation(s)
- Remya Unnikrishnan
- Forest Genetics and Biotechnology Division, Kerala Forest Research Institute, Peechi, Thrissur, Kerala India
- Cochin University of Science & Technology, Kochi, Kerala India
| | - Suma Arun Dev
- Forest Genetics and Biotechnology Division, Kerala Forest Research Institute, Peechi, Thrissur, Kerala India
| | - R Jayaraj
- Forest Ecology and Biodiversity Division, Kerala Forest Research Institute, Peechi, Thrissur, Kerala India
| |
Collapse
|
3
|
Biswas R, Panja AS, Bandopadhyay R. In Silico Analyses of Burial Codon Bias Among the Species of Dipterocarpaceae Through Molecular and Phylogenetic Data. Evol Bioinform Online 2019; 15:1176934319834888. [PMID: 31223230 PMCID: PMC6563522 DOI: 10.1177/1176934319834888] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Accepted: 02/07/2019] [Indexed: 11/15/2022] Open
Abstract
Introduction: DNA barcode, a molecular marker, is used to distinguish among the closely
related species, and it can be applied across a broad range of taxa to
understand ecology and evolution. MaturaseK gene (matK) and
rubisco bisphosphate carboxylase/oxygenase form I gene
(rbcL) of the chloroplast are highly conserved in a
plant system, which are used as core barcode. This present endeavor entails
the comprehensive examination of the under threat plant species based on
success of discrimination on DNA barcode under selection pressure. Result: The family Dipterocarpaceae comprising of 15 genera is under threat due to
some factors, namely, deforestation, habitat alteration, poor seed, pollen
dispersal, etc. Species of this family was grouped into 6 clusters for
matK and 5 clusters and 2 sub-clusters for
rbcL in the phylogenetic tree by using neighbor-joining
method. Cluster I to cluster VI of matK and cluster I to
cluster V of rbcL genes were analyzed by various codon and
substitution bias tools. Mutational pressure guided the codon bias which was
favored by the avoidance of higher GC content and significant negative
correlation between GC12 and GC3 (in sub-cluster I of cluster I
[0.03 < P], cluster I
[0.00001 < P], and cluster II
[0.01 < P] of rbcL, and cluster IV
[0.013 < P] of matK). After
refining the results, it could be speculated that the lower null expectation
values (R = 0.5 or <0.5) were less divergent from the
evolutionary perspective. Apart from that, the higher null expectation
values (R = >0.85) also showed the same result, which
possibly could be due to the negative impact of very high and low transition
rate than transversion. Conclusion: Through the analysis of inter-generic, inter/intra-specific variation and
phylogenetic data, it was found that both selection and mutation played an
important role in synonymous codon choice in these genes, but they acted
inconsistently on the genes, both matK and
rbcL. In vitro stable proteins of both
matK and rbcL were selected through
natural selection rather than mutational selection. matK
gene had higher individual discrimination and barcode success compared with
rbcL. These discriminatory approaches may describe the
problem related to the extinction of plant species. Hence, it becomes very
imperative to identify and detect the under threat plant species in
advance.
Collapse
Affiliation(s)
- Raju Biswas
- UGC-Center of Advanced Study, Department of Botany, The University of Burdwan, Bardhaman, India
| | - Anindya Sundar Panja
- Department of Biotechnology, Oriental Institute of Science and Technology, Vidyasagar University, Midnapore, India
| | - Rajib Bandopadhyay
- UGC-Center of Advanced Study, Department of Botany, The University of Burdwan, Bardhaman, India
| |
Collapse
|
4
|
Jürges G, Sahi V, Rios Rodriguez D, Reich E, Bhamra S, Howard C, Slater A, Nick P. Product authenticity versus globalisation-The Tulsi case. PLoS One 2018; 13:e0207763. [PMID: 30475878 PMCID: PMC6261265 DOI: 10.1371/journal.pone.0207763] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2018] [Accepted: 11/06/2018] [Indexed: 11/19/2022] Open
Abstract
Using the Indian medicinal plant Tulsi (Holy Basil) as a case study, we have tested to what extent the discrepancy between vernacular and scientific nomenclature can be resolved, whether the presumed chemical diversity underlying the medicinal use of Tulsi has a genetic component, and whether it is possible to detect this genetic component using genetic barcoding markers. Based on four plastidic markers, we can define several haplotypes within Ocimum that are consistent across these markers. Haplotype II is congruent with O. tenuiflorum, while haplotype I extends over several members of the genus and cannot be resolved into genetically separate subclades. The vernacular subdivision of Tulsi into three types (Rama, Krishna, Vana) can only be partially linked with genetic differences-whereby Rama and Krishna Tulsi can be assigned to O. tenuiflorum, while Vana Tulsi belongs to haplotype I. This genetic difference is mirrored by differences in the profiles of secondary compounds. While developmental state and light quality modulate the amplitude to which the chemical profile is expressed, the profile itself seems to be linked with genetic differences. We finally develop an authentication assay that makes use of a characteristic single nucleotide polymorphism in one of the barcoding markers, establishing a differential restriction pattern that can be used to discriminate Vana Tulsi.
Collapse
Affiliation(s)
- Gabriele Jürges
- Molecular Cell Biology, Botanical Institute, Karlsruhe Institute of Technology, Germany
| | - Vaidurya Sahi
- Molecular Cell Biology, Botanical Institute, Karlsruhe Institute of Technology, Germany
| | | | - Eike Reich
- CAMAG Laboratories, Muttenz, Switzerland
| | - Sukvinder Bhamra
- Biomolecular Technology Group, School of Allied Health Sciences, The Gateway, De Montfort University, Leicester, United Kingdom
- University of Kent, Medway School of Pharmacy, Chatham, Kent, United Kingdom
| | - Caroline Howard
- Biomolecular Technology Group, School of Allied Health Sciences, The Gateway, De Montfort University, Leicester, United Kingdom
- BP-NIBSC Herbal Laboratory, National Institute for Biological Standards and Control (NIBSC), Blanche Lane, South Mimms, Potters Bar, Hertfordshire, United Kingdom
| | - Adrian Slater
- Biomolecular Technology Group, School of Allied Health Sciences, The Gateway, De Montfort University, Leicester, United Kingdom
| | - Peter Nick
- Molecular Cell Biology, Botanical Institute, Karlsruhe Institute of Technology, Germany
- * E-mail:
| |
Collapse
|
5
|
Plastome-Wide Nucleotide Substitution Rates Reveal Accelerated Rates in Papilionoideae and Correlations with Genome Features Across Legume Subfamilies. J Mol Evol 2017; 84:187-203. [PMID: 28397003 DOI: 10.1007/s00239-017-9792-x] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 04/03/2017] [Indexed: 01/08/2023]
Abstract
This study represents the most comprehensive plastome-wide comparison of nucleotide substitution rates across the three subfamilies of Fabaceae: Caesalpinioideae, Mimosoideae, and Papilionoideae. Caesalpinioid and mimosoid legumes have large, unrearranged plastomes compared with papilionoids, which exhibit varying levels of rearrangement including the loss of the inverted repeat (IR) in the IR-lacking clade (IRLC). Using 71 genes common to 39 legume taxa representing all the three subfamilies, we show that papilionoids consistently have higher nucleotide substitution rates than caesalpinioids and mimosoids, and rates in the IRLC papilionoids are generally higher than those in the IR-containing papilionoids. Unsurprisingly, this pattern was significantly correlated with growth habit as most papilionoids are herbaceous, whereas caesalpinioids and mimosoids are largely woody. Both nonsynonymous (dN) and synonymous (dS) substitution rates were also correlated with several biological features including plastome size and plastomic rearrangements such as the number of inversions and indels. In agreement with previous reports, we found that genes in the IR exhibit between three and fourfold reductions in the substitution rates relative to genes within the large single-copy or small single-copy regions. Furthermore, former IR genes in IR-lacking taxa exhibit accelerated rates compared with genes contained in the IR.
Collapse
|
6
|
Genetic Fingerprinting of Wheat and Its Progenitors by Mitochondrial Gene orf256. Biomolecules 2012; 2:228-39. [PMID: 24970134 PMCID: PMC4030846 DOI: 10.3390/biom2020228] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Revised: 03/29/2012] [Accepted: 04/04/2012] [Indexed: 11/26/2022] Open
Abstract
orf256 is a wheat mitochondrial gene associated with cytoplasmic male sterility (CMS) that has different organization in various species. This study exploited the orf256 gene as a mitochondrial DNA marker to study the genetic fingerprint of Triticum and Aegilops species. PCR followed by sequencing of common parts of the orf256 gene were employed to determine the fingerprint and molecular evolution of Triticum and Aegilops species. Although many primer pairs were used, two pairs of orf256 specific primers (5:-94/C: 482, 5:253/C: 482), amplified DNA fragments of 576 bp and 230 bp respectively in all species were tested. A common 500 bp of nine species of Triticum and Aegilops were aligned and showed consistent results with that obtained from other similar chloroplast or nuclear genes. Base alignment showed that there were various numbers of base substitutions in all species compared to S. cereal (Sc) (the outgroup species). Phylogenetic relationship revealed similar locations and proximity on phylogenetic trees established using plastid and nuclear genes. The results of this study open a good route to use unknown function genes of mitochondria in studying the molecular relationships and evolution of wheat and complex plant genomes.
Collapse
|
7
|
Paape T, Miyake T, Takebayashi N, Wolf D, Kohn JR. Evolutionary genetics of an S-like polymorphism in Papaveraceae with putative function in self-incompatibility. PLoS One 2011; 6:e23635. [PMID: 21912602 PMCID: PMC3166141 DOI: 10.1371/journal.pone.0023635] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2011] [Accepted: 07/21/2011] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Papaver rhoeas possesses a gametophytic self-incompatibility (SI) system not homologous to any other SI mechanism characterized at the molecular level. Four previously published full length stigmatic S-alleles from the genus Papaver exhibited remarkable sequence divergence, but these studies failed to amplify additional S-alleles despite crossing evidence for more than 60 S-alleles in Papaver rhoeas alone. METHODOLOGY/PRINCIPAL FINDINGS Using RT-PCR we identified 87 unique putative stigmatic S-allele sequences from the Papaveraceae Argemone munita, Papaver mcconnellii, P. nudicuale, Platystemon californicus and Romneya coulteri. Hand pollinations among two full-sib families of both A. munita and P. californicus indicate a strong correlation between the putative S-genotype and observed incompatibility phenotype. However, we also found more than two S-like sequences in some individuals of A. munita and P. californicus, with two products co-segregating in both full-sib families of P. californicus. Pairwise sequence divergence estimates within and among taxa show Papaver stigmatic S-alleles to be the most variable with lower divergence among putative S-alleles from other Papaveraceae. Genealogical analysis indicates little shared ancestral polymorphism among S-like sequences from different genera. Lack of shared ancestral polymorphism could be due to long divergence times among genera studied, reduced levels of balancing selection if some or all S-like sequences do not function in incompatibility, population bottlenecks, or different levels of recombination among taxa. Preliminary estimates of positive selection find many sites under selective constraint with a few undergoing positive selection, suggesting that self-recognition may depend on amino acid substitutions at only a few sites. CONCLUSIONS/SIGNIFICANCE Because of the strong correlation between genotype and SI phenotype, sequences reported here represent either functional stylar S-alleles, tightly linked paralogs of the S-locus or a combination of both. The considerable complexity revealed in this study shows we have much to learn about the evolutionary dynamics of self-incompatibility systems.
Collapse
Affiliation(s)
- Timothy Paape
- College of Biological Sciences, University of Minnesota, St. Paul, Minnesota, United States of America.
| | | | | | | | | |
Collapse
|
8
|
Sánchez-Ken JG, Clark LG. Phylogeny and a new tribal classification of the Panicoideae s.l. (Poaceae) based on plastid and nuclear sequence data and structural data. AMERICAN JOURNAL OF BOTANY 2010; 97:1732-48. [PMID: 21616806 DOI: 10.3732/ajb.1000024] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
PREMISE OF THE STUDY The subfamily Panicoideae (Poaceae) encompasses nearly one-third of the diversity of grass species, including important crops such as maize and sugarcane. Previous analyses recovered strong support for a Panicoideae+Centothecoideae lineage within the diverse Panicoideae+Arundinoideae+Chloridoideae+Micrairoideae+Aristidoideae+Danthonioideae (PACMAD) clade, although support for internal relationships was inconsistent. The objectives of this research were to (1) further test the monophyly of each subfamily and previously recovered clades within the Panicoideae+Centothecoideae lineage, (2) establish phylogenetic relationships among these groups, and (3) propose a new tribal classification for this lineage based explicitly on the phylogeny. • METHODS Maximum parsimony and Bayesian inference analyses of 37 taxa were based on previously published sequences (ndhF and rpl16 intron) and on new plastid and nuclear (rbcL and granule-bound starch synthase I) sequence data as well as structural data. • Key results. The Panicoideae+Centothecoideae lineage and a majority of the clades identified in previous analyses continue to be robustly supported, but resolution along the backbone of the topology remains elusive. Support for the monophyly of both subfamilies was lacking although support values for some clades increased. The tribes Centotheceae and Arundinelleae were confirmed as polyphyletic. • CONCLUSIONS Subfamily Centothecoideae is formally submerged into the Panicoideae, and a new tribal classification for the expanded Panicoideae is proposed based explicitly on the phylogeny. This classification includes 12 tribes of which Chasmanthieae and Zeugiteae are segretated from the Centotheceae; Tristachyideae is segregated from Arundinelleae, and a new tribe, Cyperochloeae, is validated to accommodate two isolated genera. A key to the tribes is provided.
Collapse
Affiliation(s)
- J Gabriel Sánchez-Ken
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Apdo. Postal 70-233, México D.F. 04510, Mexico
| | | |
Collapse
|
9
|
Yonemaru JI, Ando T, Mizubayashi T, Kasuga S, Matsumoto T, Yano M. Development of genome-wide simple sequence repeat markers using whole-genome shotgun sequences of sorghum (Sorghum bicolor (L.) Moench). DNA Res 2009; 16:187-93. [PMID: 19363056 PMCID: PMC2695772 DOI: 10.1093/dnares/dsp005] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Simple sequence repeat (SSR) markers with a high degree of polymorphism contribute to the molecular dissection of agriculturally important traits in sorghum (Sorghum bicolor (L.) Moench). We designed 5599 non-redundant SSR markers, including regions flanking the SSRs, in whole-genome shotgun sequences of sorghum line ATx623. (AT/TA)n repeats constituted 26.1% of all SSRs, followed by (AG/TC)n at 20.5%, (AC/TG)n at 13.7% and (CG/GC)n at 11.8%. The chromosomal locations of 5012 SSR markers were determined by comparing the locations identified by means of electronic PCR with the predicted positions of 34 008 gene loci. Most SSR markers had a similar distribution to the gene loci. Among 970 markers validated by fragment analysis, 67.8% (658 of 970) markers successfully provided PCR amplification in sorghum line BTx623, with a mean polymorphism rate of 45.1% (297 of 658) for all SSR loci in combinations of 11 sorghum lines and one sudangrass (Sorghum sudanense (Piper) Stapf) line. The product of 5012 and 0.678 suggests that ∼3400 SSR markers could be used to detect SSR polymorphisms and that more than 1500 (45.1% of 3400) markers could reveal SSR polymorphisms in combinations of Sorghum lines.
Collapse
Affiliation(s)
- Jun-ichi Yonemaru
- National Institute of Agrobiological Sciences, 2-1-2 Kannondai, Tsukuba, Ibaraki 305-8602, Japan.
| | | | | | | | | | | |
Collapse
|
10
|
Kim JS, Islam-Faridi MN, Klein PE, Stelly DM, Price HJ, Klein RR, Mullet JE. Comprehensive molecular cytogenetic analysis of sorghum genome architecture: distribution of euchromatin, heterochromatin, genes and recombination in comparison to rice. Genetics 2005; 171:1963-76. [PMID: 16143604 PMCID: PMC1456119 DOI: 10.1534/genetics.105.048215] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2005] [Accepted: 08/21/2005] [Indexed: 11/18/2022] Open
Abstract
Cytogenetic maps of sorghum chromosomes 3-7, 9, and 10 were constructed on the basis of the fluorescence in situ hybridization (FISH) of approximately 18-30 BAC probes mapped across each of these chromosomes. Distal regions of euchromatin and pericentromeric regions of heterochromatin were delimited for all 10 sorghum chromosomes and their DNA content quantified. Euchromatic DNA spans approximately 50% of the sorghum genome, ranging from approximately 60% of chromosome 1 (SBI-01) to approximately 33% of chromosome 7 (SBI-07). This portion of the sorghum genome is predicted to encode approximately 70% of the sorghum genes ( approximately 1 gene model/12.3 kbp), assuming that rice and sorghum encode a similar number of genes. Heterochromatin spans approximately 411 Mbp of the sorghum genome, a region characterized by a approximately 34-fold lower rate of recombination and approximately 3-fold lower gene density compared to euchromatic DNA. The sorghum and rice genomes exhibit a high degree of macrocolinearity; however, the sorghum genome is approximately 2-fold larger than the rice genome. The distal euchromatic regions of sorghum chromosomes 3-7 and 10 are approximately 1.8-fold larger overall and exhibit an approximately 1.5-fold lower average rate of recombination than the colinear regions of the homeologous rice chromosomes. By contrast, the pericentromeric heterochromatic regions of these chromosomes are on average approximately 3.6-fold larger in sorghum and recombination is suppressed approximately 15-fold compared to the colinear regions of rice chromosomes.
Collapse
Affiliation(s)
- J-S Kim
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX 77843, USA
| | | | | | | | | | | | | |
Collapse
|
11
|
Buchanan CD, Klein PE, Mullet JE. Phylogenetic analysis of 5'-noncoding regions from the ABA-responsive rab16/17 gene family of sorghum, maize and rice provides insight into the composition, organization and function of cis-regulatory modules. Genetics 2004; 168:1639-54. [PMID: 15579713 PMCID: PMC1448771 DOI: 10.1534/genetics.104.030346] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2004] [Accepted: 07/16/2004] [Indexed: 11/18/2022] Open
Abstract
Phylogenetic analysis of sequences from gene families and homologous genes from species of varying divergence can be used to identify conserved noncoding regulatory elements. In this study, phylogenetic analysis of 5'-noncoding sequences was optimized using rab17, a well-characterized ABA-responsive gene from maize, and five additional rab16/17 homologs from sorghum and rice. Conserved 5'-noncoding sequences among the maize, sorghum, and rice rab16/17 homologs were identified with the aid of the software program FootPrinter and by screening for known transcription-factor-binding sites. Searches for 7 of 8 (7/8)bp sequence matches within aligned 5'-noncoding segments of the rab genes identified many of the cis-elements previously characterized by biochemical analysis in maize rab17 plus several additional putative regulatory elements. Differences in the composition of conserved noncoding sequences among rab16/17 genes were related to variation in rab gene mRNA levels in different tissues and to response to ABA treatment using qRT-PCR. Absence of a GRA-like element in the promoter of sorghum dhn2 relative to maize rab17 was correlated with an approximately 85-fold reduction of dhn2 RNA in sorghum shoots. Overall, we conclude that phylogenetic analysis of gene families among rice, sorghum, and maize will help identify regulatory sequences in the noncoding regions of genes and contribute to our understanding of grass gene regulatory networks.
Collapse
Affiliation(s)
- Christina D Buchanan
- Institute for Plant Genomics and Biotechnology, Texas A and M University, College Station, Texas 77843, USA
| | | | | |
Collapse
|
12
|
Matsuoka Y, Yamazaki Y, Ogihara Y, Tsunewaki K. Whole chloroplast genome comparison of rice, maize, and wheat: implications for chloroplast gene diversification and phylogeny of cereals. Mol Biol Evol 2002; 19:2084-91. [PMID: 12446800 DOI: 10.1093/oxfordjournals.molbev.a004033] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The fully sequenced chloroplast genomes of maize (subfamily Panicoideae), rice (subfamily Bambusoideae), and wheat (subfamily Pooideae) provide the unique opportunity to investigate the evolution of chloroplast genes and genomes in the grass family (Poaceae) by whole-genome comparison. Analyses of nucleotide sequence variations in 106 cereal chloroplast genes with tobacco sequences as the outgroup suggested that (1) most of the genic regions of the chloroplast genomes of maize, rice, and wheat have evolved at similar rates; (2) RNA genes have highly conservative evolutionary rates relative to the other genes; (3) photosynthetic genes have been under strong purifying selection; (4) between the three cereals, 14 genes which account for about 28% of the genic region have evolved with heterogeneous nucleotide substitution rates; and (5) rice genes tend to have evolved more slowly than the others at loci where rate heterogeneity exists. Although the mechanism that underlies chloroplast gene diversification is complex, our analyses identified variation in nonsynonymous substitution rates as a genetic force that generates heterogeneity, which is evidence of selection in chloroplast gene diversification at the intrafamilial level. Phylogenetic trees constructed with the variable nucleotide sites of the chloroplast genes place maize basal to the rice-wheat clade, revealing a close relationship between the Bambusoideae and Pooideae.
Collapse
Affiliation(s)
- Yoshihiro Matsuoka
- Fukui Prefectural University, Matsuoka-cho, Yoshida-gun, Fukui 910-1195, Japan.
| | | | | | | |
Collapse
|
13
|
Klein PE, Klein RR, Cartinhour SW, Ulanch PE, Dong J, Obert JA, Morishige DT, Schlueter SD, Childs KL, Ale M, Mullet JE. A high-throughput AFLP-based method for constructing integrated genetic and physical maps: progress toward a sorghum genome map. Genome Res 2000; 10:789-807. [PMID: 10854411 PMCID: PMC310885 DOI: 10.1101/gr.10.6.789] [Citation(s) in RCA: 164] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Sorghum is an important target for plant genomic mapping because of its adaptation to harsh environments, diverse germplasm collection, and value for comparing the genomes of grass species such as corn and rice. The construction of an integrated genetic and physical map of the sorghum genome (750 Mbp) is a primary goal of our sorghum genome project. To help accomplish this task, we have developed a new high-throughput PCR-based method for building BAC contigs and locating BAC clones on the sorghum genetic map. This task involved pooling 24,576 sorghum BAC clones ( approximately 4x genome equivalents) in six different matrices to create 184 pools of BAC DNA. DNA fragments from each pool were amplified using amplified fragment length polymorphism (AFLP) technology, resolved on a LI-COR dual-dye DNA sequencing system, and analyzed using Bionumerics software. On average, each set of AFLP primers amplified 28 single-copy DNA markers that were useful for identifying overlapping BAC clones. Data from 32 different AFLP primer combinations identified approximately 2400 BACs and ordered approximately 700 BAC contigs. Analysis of a sorghum RIL mapping population using the same primer pairs located approximately 200 of the BAC contigs on the sorghum genetic map. Restriction endonuclease fingerprinting of the entire collection of sorghum BAC clones was applied to test and extend the contigs constructed using this PCR-based methodology. Analysis of the fingerprint data allowed for the identification of 3366 contigs each containing an average of 5 BACs. BACs in approximately 65% of the contigs aligned by AFLP analysis had sufficient overlap to be confirmed by DNA fingerprint analysis. In addition, 30% of the overlapping BACs aligned by AFLP analysis provided information for merging contigs and singletons that could not be joined using fingerprint data alone. Thus, the combination of fingerprinting and AFLP-based contig assembly and mapping provides a reliable, high-throughput method for building an integrated genetic and physical map of the sorghum genome.
Collapse
Affiliation(s)
- P E Klein
- Crop Biotechnology Center, Texas A & M University, College Station, Texas 77843 USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Mathews S, Tsai RC, Kellogg EA. Phylogenetic structure in the grass family (Poaceae): evidence from the nuclear gene phytochrome B. AMERICAN JOURNAL OF BOTANY 2000. [PMID: 10636833 DOI: 10.2307/2656688] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Phylogenetic analyses of partial phytochrome B (PHYB) nuclear DNA sequences provide unambiguous resolution of evolutionary relationships within Poaceae. Analysis of PHYB nucleotides from 51 taxa representing seven traditionally recognized subfamilies clearly distinguishes three early-diverging herbaceous "bambusoid" lineages. First and most basal are Anomochloa and Streptochaeta, second is Pharus, and third is Puelia. The remaining grasses occur in two principal, highly supported clades. The first comprises bambusoid, oryzoid, and pooid genera (the BOP clade); the second comprises panicoid, arundinoid, chloridoid, and centothecoid genera (the PACC clade). The PHYB phylogeny is the first nuclear gene tree to address comprehensively phylogenetic relationships among grasses. It corroborates several inferences made from chloroplast gene trees, including the PACC clade, and the basal position of the herbaceous bamboos Anomochloa, Streptochaeta, and Pharus. However, the clear resolution of the sister group relationship among bambusoids, oryzoids, and pooids in the PHYB tree is novel; the relationship is only weakly supported in ndhF trees and is nonexistent in rbcL and plastid restriction site trees. Nuclear PHYB data support Anomochlooideae, Pharoideae, Pooideae sensu lato, Oryzoideae, Panicoideae, and Chloridoideae, and concur in the polyphyly of both Arundinoideae and Bambusoideae.
Collapse
Affiliation(s)
- S Mathews
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Avenue,Cambridge, Massachusetts 02138 USA; and
| | | | | |
Collapse
|
15
|
Abstract
The grass family includes some 10,000 species, and it encompasses tremendous morphological, physiological, ecological, and genetic diversity. The phylogeny of the family is becoming increasingly well understood. There were two major radiations of grasses, an early diversification leading to the subfamilies Pooideae, Bambusoideae, and Oryzoideae, and a later one leading to Panicoideae, Chloridoideae, Centothecoideae, and Arundinoideae. The phylogeny can be used to determine the direction of changes in genome arrangement and genome size.
Collapse
Affiliation(s)
- E A Kellogg
- Harvard University Herbaria, 22 Divinity Avenue, Cambridge, MA 02138, USA
| |
Collapse
|
16
|
Bennetzen JL, SanMiguel P, Chen M, Tikhonov A, Francki M, Avramova Z. Grass genomes. Proc Natl Acad Sci U S A 1998; 95:1975-8. [PMID: 9482817 PMCID: PMC33825 DOI: 10.1073/pnas.95.5.1975] [Citation(s) in RCA: 117] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that insert between genes. These retroelements are less abundant in smaller genome plants, including rice and sorghum. Although 5- to 200-kb blocks of methylated, presumably heterochromatic, retrotransposons flank most maize genes, rice and sorghum genes are often adjacent. Similar genes are commonly found in the same relative chromosomal locations and orientations in each of these three species, although there are numerous exceptions to this collinearity (i.e., rearrangements) that can be detected at the levels of both the recombinational map and cloned DNA. Evolutionarily conserved sequences are largely confined to genes and their regulatory elements. Our results indicate that a knowledge of grass genome structure will be a useful tool for gene discovery and isolation, but the general rules and biological significance of grass genome organization remain to be determined. Moreover, the nature and frequency of exceptions to the general patterns of grass genome structure and collinearity are still largely unknown and will require extensive further investigation.
Collapse
Affiliation(s)
- J L Bennetzen
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907-1392, USA
| | | | | | | | | | | |
Collapse
|
17
|
Laroche J, Li P, Maggia L, Bousquet J. Molecular evolution of angiosperm mitochondrial introns and exons. Proc Natl Acad Sci U S A 1997; 94:5722-7. [PMID: 9159140 PMCID: PMC20846 DOI: 10.1073/pnas.94.11.5722] [Citation(s) in RCA: 116] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Numbers of substitutions per site for 15 protein-coding genes and six introns of the plant mitochondria were estimated to compare modes and tempos of evolution between exons and introns, and numbers of insertions-deletions per site also were investigated in introns. Intra-gene homogeneity of numbers of substitutions per site was assessed further among different taxa and between mitochondrial and nuclear paralogs translocated from the mitochondrial genome. Gene-to-gene differences in numbers of substitutions per site were found to be higher for nonsynonymous than synonymous sites, and this could be due to differential selection if mutation rate is assumed constant for the genome. Some mitochondrial genes have evolved as fast as chloroplast genes, thus faster than previously thought. For coxI, relative rate tests showed that woody taxa evolved slower than annuals at synonymous sites. Generation time, population size, and speciation rate are likely factors involved in this rate heterogeneity. Introns were less constrained than their adjacent exons for both overall numbers of substitutions per site and indels, but, on average, overall numbers of substitutions per site for introns were similar to numbers of synonymous substitutions per site for exons. Correlations were generally high between numbers of substitutions and numbers of indels per site for the same intron. Mitochondrial genes transferred to the nucleus had an accelerated rate of substitution per site, which was most significant at synonymous sites. These differences between paralogs in two different genomes are likely the result of different mutation rates.
Collapse
Affiliation(s)
- J Laroche
- Centre de Recherche en Biologie Forestière, Université Laval, Sainte-Foy, Québec, Canada, G1K 7P4
| | | | | | | |
Collapse
|
18
|
Chen M, SanMiguel P, de Oliveira AC, Woo SS, Zhang H, Wing RA, Bennetzen JL. Microcolinearity in sh2-homologous regions of the maize, rice, and sorghum genomes. Proc Natl Acad Sci U S A 1997; 94:3431-5. [PMID: 9096411 PMCID: PMC20387 DOI: 10.1073/pnas.94.7.3431] [Citation(s) in RCA: 168] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Large regions of genomic colinearity have been demonstrated among grass species by recombinational mapping, but the degree of chromosomal conservation at the sub-centimorgan level has not been extensively investigated. We cloned the rice and sorghum genes homologous to the sh2 locus of maize on bacterial artificial chromosomes (BACs), and observed that a homologue of the maize a1 gene was also present on each of these BACs. In sorghum, we found a direct duplication of a1 homologues separated by about 10 kb. In maize, sh2 and a1 are approximately 140 kb apart and transcribed in the same direction, with sh2 upstream of a1. In rice and sorghum, this arrangement is fully conserved. However, the sh2 and a1 homologues are separated by about 19 kb in both rice and sorghum. We found low-copy-number and repetitive DNAs between the sh2 and a1 homologues of sorghum and rice. The sh2 and a1 homologues cross-hybridized, but the repetitive DNA and most low-copy-number sequences between these genes did not. These results indicate that maize, sorghum, and rice have conserved gene order and composition in the sh2-a1 region, but have acquired extensive qualitative and quantitative differences in the sequences between these genes.
Collapse
Affiliation(s)
- M Chen
- Department of Biological Sciences, Purdue University, West Lafayette, IN 47907, USA
| | | | | | | | | | | | | |
Collapse
|
19
|
Abstract
To understand the evolutionary mechanisms and relationships of nitrate reductases (NRs), the nucleotide sequences encoding 19 nitrate reductase (NR) genes from 16 species of fungi, algae, and higher plants were analyzed. The NR genes examined show substantial sequence similarity, particularly within functional domains, and large variations in GC content at the third codon position and intron number. The intron positions were different between the fungi and plants, but conserved within these groups. The overall and nonsynonymous substitution rates among fungi, algae, and higher plants were estimated to be 4.33 x 10(-10) and 3.29 x 10(-10) substitutions per site per year. The three functional domains of NR genes evolved at about one-third of the rate of the N-terminal and the two hinge regions connecting the functional domains. Relative rate tests suggested that the nonsynonymous substitution rates were constant among different lineages, while the overall nucleotide substitution rates varied between some lineages. The phylogenetic trees based on NR genes correspond well with the phylogeny of the organisms determined from systematics and other molecular studies. Based on the nonsynonymous substitution rate, the divergence time of monocots and dicots was estimated to be about 340 Myr when the fungi-plant or algae-higher plant divergence times were used as reference points and 191 Myr when the rice-barley divergence time was used as a reference point. These two estimates are consistent with other estimates of divergence times based on these reference points. The lack of consistency between these two values appears to be due to the uncertainty of the reference times.
Collapse
Affiliation(s)
- J Zhou
- Department of Genetics and Cell Biology, Washington State University, Pullman 99164-6420, USA
| | | |
Collapse
|
20
|
MacRae AF, Huttley GA, Clegg MT. Molecular evolutionary characterization of an Activator (Ac)-like transposable element sequence from pearl millet (Pennisetum glaucum) (Poaceae). Genetica 1994; 92:77-89. [PMID: 7958939 DOI: 10.1007/bf00163756] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
We present data on the evolution of the Ac/Ds family of transposable elements in select grasses (Poaceae). A defective Ac-like element was cloned from a DNA library of the grass Pennisetum glaucum (pearl millet) and its entire 4531 bp sequence has been determined. When the pearl millet Ac-like sequence is aligned with the maize Ac sequence, it is found that there is approximately 70% DNA similarity in the central region spanning most of maize Ac exon II and all of exon III. In addition, there are two smaller regions of similarity at the Ac terminii. Besides these three major structural similarities, Pennisetum Ac has two large regions, one 5' and one 3', that show little similarity to Zea Ac. Furthermore, most of the sequences corresponding to intron II in maize Ac are absent in pearl millet Ac. Kimura's evolutionary distance between the central region of maize and pearl millet Ac sequences is estimated to be 0.429 +/- 0.020 nucleotide substitutions per site. This value is not significantly different from the average number of synonymous substitutions for coding regions of the Adh1 gene between maize and pearl millet, which is 0.395 +/- 0.051 nucleotide substitutions per site. If we assume Ac and Adh1 divergence times are equivalent between maize and pearl millet, then the above calculations suggest Ac-like sequences have probably not been strongly constrained by natural selection. Conserved DNA and amino acid sequence motifs are also examined. The level of DNA sequence divergence between maize and pearl millet Ac sequences, the estimated date when maize and pearl millet diverged (25-40 million years ago), coupled with their reproductive isolation/lack of current genetic exchange, all support the theory that Ac-like sequences have not been recently introduced into pearl millet from maize. Instead, Ac-like sequences were probably present in the progenitor of maize and pearl millet and have thus existed in the grasses for at least 25 million years.
Collapse
Affiliation(s)
- A F MacRae
- Department of Biology, University of Missouri-St. Louis 63121
| | | | | |
Collapse
|
21
|
Morton BR, Clegg MT. A chloroplast DNA mutational hotspot and gene conversion in a noncoding region near rbcL in the grass family (Poaceae). Curr Genet 1993; 24:357-65. [PMID: 8252646 DOI: 10.1007/bf00336789] [Citation(s) in RCA: 79] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The noncoding DNA region of the chloroplast genome, flanked by the genes rbcL and psaI (ORF36), has been sequenced for seven species of the grass family (Poaceae). This region had previously been observed as a hotspot area for length mutations. Sequence comparison reveals that short duplications, probably resulting from slipped-strand mispairing, account for many small length differences between sequences but that major mutational hotspots are localized in three small areas, two of which show potential secondary structure. Mutation in one of these hotspots appears to be a result of more complex recombination events. All seven species contain a pseudogene for rpl23 and evidence is presented that this pseudogene is being maintained by gene conversion with the functional gene. Different transition/transversion biases and AT contents between the pseudogene and the surrounding noncoding sequences are noted. In the subfamily Panicoideae there is a deletion in which almost 1 kb of ancestral sequence, including the 3' end of the rpl23 pseudogene, has been replaced by a non-homologous 60-base sequence of unknown origin. Two other deletions of almost the same region have occurred in the grass family. The deleted noncoding region has mutational and compositional properties similar to the rbcL coding sequence and the rpl23 pseudogene. The three independent deletions, as well as the pattern of mutation in the localized hotspots, indicate that such noncoding DNA may be misleading for studies of phylogenetic inference.
Collapse
Affiliation(s)
- B R Morton
- Department of Botany and Plant Sciences, University of California, Riverside 92521
| | | |
Collapse
|
22
|
Abstract
We present data on evolution of the Ac/Ds family of transposable elements in select grasses (Poaceae). An Ac-like element was cloned from a DNA library of the grass Pennisetum glaucum (pearl millet) and 2387 bp of it have been sequenced. When the pearl millet Ac-like sequence is aligned with the corresponding region of the maize Ac sequence, it is found that all sequences corresponding to intron II in maize Ac are absent in pearl millet Ac. Kimura's evolutionary distance between maize and pearl millet Ac sequences is estimated to be 0.429 +/- 0.020 nucleotide substitutions per site. This value is not significantly different from the average number of synonymous substitutions for coding regions of the Adh1 gene between maize and pearl millet, which is 0.395 +/- 0.051 nucleotide substitutions per site. If we can assume Ac and Adh1 divergence times are equivalent between maize and pearl millet, then the above calculations suggest Ac-like sequences have probably not been strongly constrained by natural selection. The level of DNA sequence divergence between maize and pearl millet Ac sequences, the estimated date when maize and pearl millet diverged (25-40 million years ago), coupled with their reproductive isolation/lack of current genetic exchange, all support the theory that Ac-like sequences have not been recently introduced into pearl millet from maize. Instead, Ac-like sequences were probably present in the progenitor of maize and pearl millet, and have thus existed in the grasses for at least 25 million years. Ac-like sequences may be widely distributed among the grasses. We also present the first 2 Ds1 controlling element sequences from teosinte species: Zea luxurians and Zea perennis. A total of 10 Ds1 elements had previously been sequenced from maize and a distant maize relative, Tripsacum. When a maximum likelihood network of genetic relationships is constructed for all 12 sequenced Ds1 elements, the 2 teosinte Ds1 elements are as distant from most maize Ds1 elements and from each other, as the maize Ds1 elements are from one another. Our new teosinte sequence data support the previous conclusion that Ds1 elements have been accumulating mutations independently since maize and Tripsacum diverged. We present a scenario for the origin of Ds1 elements.
Collapse
Affiliation(s)
- A F MacRae
- Department of Botany and Plant Sciences, University of California, Riverside 92521
| | | |
Collapse
|