1
|
GSCIT: smart Hash Table-based mapping equipped genome sequence coverage inspection. Funct Integr Genomics 2024; 24:36. [PMID: 38374301 DOI: 10.1007/s10142-024-01315-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 02/09/2024] [Accepted: 02/13/2024] [Indexed: 02/21/2024]
|
2
|
Allantoin improves salinity tolerance in Arabidopsis and rice through synergid activation of abscisic acid and brassinosteroid biosynthesis. PLANT MOLECULAR BIOLOGY 2023:10.1007/s11103-023-01350-8. [PMID: 37184674 DOI: 10.1007/s11103-023-01350-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 04/02/2023] [Indexed: 05/16/2023]
Abstract
Soil salinity stress is one of the major bottlenecks for crop production. Although, allantoin is known to be involved in nitrogen metabolism in plants, yet several reports in recent time indicate its involvement in various abiotic stress responses including salinity stress. However, the detail mechanism of allantoin involvement in salinity stress tolerance in plants is not studied well. Moreover, we demonstrated the role of exogenous application of allantoin as well as increased concentration of endogenous allantoin in rendering salinity tolerance in rice and Arabidopsis respectively, via., induction of abscisic acid (ABA) and brassinosteroid (BR) biosynthesis pathways. Exogenous application of allantoin (10 µM) provides salt-tolerance to salt-sensitive rice genotype (IR-29). Transcriptomic data after exogenous supplementation of allantoin under salinity stress showed induction of ABA (OsNCED1) and BR (Oscytochrome P450) biosynthesis genes in IR-29. Further, the key gene of allantoin biosynthesis pathway i.e., urate oxidase of the halophytic species Oryza coarctata was also found to induce ABA and BR biosynthesis genes when over-expressed in transgenic Arabidopsis. Thus, indicating that ABA and BR biosynthesis pathways were involved in allantoin mediated salinity tolerance in both rice and Arabidopsis. Additionally, it has been found that several physio-chemical parameters such as biomass, Na+/K+ ratio, MDA, soluble sugar, proline, allantoin and chlorophyll contents were also associated with the allantoin-mediated salinity tolerance in urate oxidase overexpressed lines of Arabidopsis. These findings depicted the functional conservation of allantoin for salinity tolerance in both plant clades.
Collapse
|
3
|
Identification and analysis of miRNAs-lncRNAs-mRNAs modules involved in stem-elongation of deepwater rice (Oryza sativa L.). PHYSIOLOGIA PLANTARUM 2022; 174:e13736. [PMID: 35716004 DOI: 10.1111/ppl.13736] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 06/06/2022] [Accepted: 06/16/2022] [Indexed: 06/15/2023]
Abstract
Deepwater is an abiotic stress that limits rice cultivation worldwide due to recurrent floods. The miRNAs and lncRNAs are two non-coding RNAs emerging as major regulators of gene expressions under different abiotic stresses. However, the regulation of these two non-coding RNAs under deepwater stress in rice is still unexplored. In this study, small RNA-seq and RNA-seq from internode and node tissues were analyzed to predict deepwater stress responsive miRNAs and lncRNAs, respectively. Additionally, a competitive endogenous RNA (ceRNA) study revealed about 69 and 25 lncRNAs acting as endogenous target mimics (eTM) with the internode and node miRNAs, respectively. In ceRNA analyses, some of the key miRNAs such as miR1850.1, miR1848, and IN-nov-miR145 were upregulated while miR159e was downregulated, and their respective eTM lncRNAs and targets were found to have opposite expressions. Moreover, we have transiently expressed one module (IN-nov-miR145-Cc-TCONS_00011544-Os11g36430.3) in tobacco leaves. The integrated analysis has identified differentially expressed (DE) miRNAs, lncRNAs and their target genes, and the complex regulatory network, which might lead to stem elongation under deepwater stress. In this novel attempt to identify and characterize miRNAs and lncRNAs under deepwater stress in rice, we have provided, probably for the first time, a reference platform to study the interactions of these two non-coding RNAs with respective target genes through transient expression analyses.
Collapse
|
4
|
A first-generation haplotype map (HapMap-1) of tea (Camellia sinensis L. O. Kuntz). Bioinformatics 2022; 38:318-324. [PMID: 34601584 DOI: 10.1093/bioinformatics/btab690] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Revised: 08/30/2021] [Accepted: 09/29/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Tea is a cross-pollinated woody perennial plant, which is why, application of conventional breeding is limited for its genetic improvement. However, lack of the genome-wide high-density SNP markers and genome-wide haplotype information has greatly hampered the utilization of tea genetic resources toward fast-track tea breeding programs. To address this challenge, we have generated a first-generation haplotype map of tea (Tea HapMap-1). Out-crossing and highly heterozygous nature of tea plants, make them more complicated for DNA-level variant discovery. RESULTS In this study, whole genome re-sequencing data of 369 tea genotypes were used to generate 2,334,564 biallelic SNPs and 1,447,985 InDels. Around 2928.04 million paired-end reads were used with an average mapping depth of ∼0.31× per accession. Identified polymorphic sites in this study will be useful in mapping the genomic regions responsible for important traits of tea. These resources lay the foundation for future research to understand the genetic diversity within tea germplasm and utilize genes that determine tea quality. This will further facilitate the understanding of tea genome evolution and tea metabolite pathways thus, offers an effective germplasm utilization for breeding the tea varieties. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
5
|
Genome-Wide Analysis of Four Pathotypes of Wheat Rust Pathogen ( Puccinia graminis) Reveals Structural Variations and Diversifying Selection. J Fungi (Basel) 2021; 7:701. [PMID: 34575739 PMCID: PMC8468629 DOI: 10.3390/jof7090701] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 08/19/2021] [Accepted: 08/21/2021] [Indexed: 12/28/2022] Open
Abstract
Diseases caused by Puccinia graminis are some of the most devastating diseases of wheat. Extensive genomic understanding of the pathogen has proven helpful not only in understanding host- pathogen interaction but also in finding appropriate control measures. In the present study, whole-genome sequencing of four diverse P. graminis pathotypes was performed to understand the genetic variation and evolution. An average of 63.5 Gb of data per pathotype with about 100× average genomic coverage was achieved with 100-base paired-end sequencing performed with Illumina Hiseq 1000. Genome structural annotations collectively predicted 9273 functional proteins including ~583 extracellular secreted proteins. Approximately 7.4% of the genes showed similarity with the PHI database which is suggestive of their significance in pathogenesis. Genome-wide analysis demonstrated pathotype 117-6 as likely distinct and descended through a different lineage. The 3-6% more SNPs in the regulatory regions and 154 genes under positive selection with their orthologs and under negative selection in the other three pathotypes further supported pathotype 117-6 to be highly diverse in nature. The genomic information generated in the present study could serve as an important source for comparative genomic studies across the genus Puccinia and lead to better rust management in wheat.
Collapse
|
6
|
TEnGExA: an R package based tool for tissue enrichment and gene expression analysis. Brief Bioinform 2020; 22:5909881. [PMID: 32960209 DOI: 10.1093/bib/bbaa221] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 08/10/2020] [Accepted: 08/18/2020] [Indexed: 12/24/2022] Open
Abstract
RNA-seq data analysis with rapidly advancing high-throughput sequencing technology, nowadays provides large number of transcripts or genes to perform downstream analysis including functional annotation and pathway analysis. However for the data from multiple tissues, downstream analysis with tissue-specific or tissue-enriched transcripts is highly preferable. However, there is still a need of tool for quickly performing tissue-enrichment and gene expression analysis irrespective of number of input genes or tissues at various fragments per kilobase of transcript per million fragments mapped (FPKM) thresholds. To fulfill this need, we presented a freely available R package and web-interface tool, TEnGExA, which allows tissue-enrichment analysis (TEA) for any number of genes or transcripts for any species provided only a read-count or FPKM-value matrix as input. Based on the different FPKM value and fold thresholds, TEnGExA classifies the user provided gene lists into tissue-enriched or tissue-specific transcripts along with other standard classes. By analyzing the published sample data from human, plant and microorganism, we signifies that TEnGExA can easily handle complex or large data from any species to provided tissue-enriched gene list for downstream analysis in quick time. In summary, TEnGExA is quick, easy to use and an efficient tool for TEA. The R package is freely available at https://github.com/ubagithub/TEnGExA/ and the GUI web interface is accessible at http://webtom.cabgrid.res.in/tissue_enrich/.
Collapse
|
7
|
TeaMiD: a comprehensive database of simple sequence repeat markers of tea. Database (Oxford) 2020; 2020:baaa013. [PMID: 32159215 PMCID: PMC7065459 DOI: 10.1093/database/baaa013] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 01/05/2020] [Accepted: 01/25/2020] [Indexed: 12/05/2022]
Abstract
Tea is a highly cross-pollinated, woody, perennial tree. High heterozygosity combined with a long gestational period makes conventional breeding a cumbersome process. Therefore, marker-assisted breeding is a better alternative approach when compared with conventional breeding. Considering the large genome size of tea (~3 Gb), information about simple sequence repeat (SSR) is scanty. Thus, we have taken advantage of the recently published tea genomes to identify large numbers of SSR markers in the tea. Besides the genomic sequences, we identified SSRs from the other publicly available sequences such as RNA-seq, GSS, ESTs and organelle genomes (chloroplasts and mitochondrial) and also searched published literature to catalog validated set of tea SSR markers. The complete exercise yielded a total of 935 547 SSRs. Out of the total, 82 SSRs were selected for validation among a diverse set of tea genotypes. Six primers (each with four to six alleles, an average of five alleles per locus) out of the total 27 polymorphic primers were used for a diversity analysis in 36 tea genotypes with mean polymorphic information content of 0.61-0.76. Finally, using all the information generated in this study, we have developed a user-friendly database (TeaMiD; http://indianteagenome.in:8080/teamid/) that hosts SSR from all the six resources including three nuclear genomes of tea and transcriptome sequences of 17 Camellia wild species. Database URL: http://indianteagenome.in:8080/teamid/.
Collapse
|
8
|
Decoding and analysis of organelle genomes of Indian tea (Camellia assamica) for phylogenetic confirmation. Genomics 2019; 112:659-668. [PMID: 31029862 DOI: 10.1016/j.ygeno.2019.04.018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Revised: 03/03/2019] [Accepted: 04/24/2019] [Indexed: 01/16/2023]
Abstract
The NCBI database has >15 chloroplast (cp) genome sequences available for different Camellia species but none for C. assamica. There is no report of any mitochondrial (mt) genome in the Camellia genus or Theaceae family. With the strong believes that these organelle genomes can play a great tool for taxonomic and phylogenetic analysis, we successfully assembled and analyzed cp and mt genome of C. assamica. We assembled the complete mt genome of C. assamica in a single circular contig of 707,441 bp length comprising of a total of 66 annotated genes, including 35 protein-coding genes, 29 tRNAs and two rRNAs. The first ever cp genome of C. assamica resulted in a circular contig of 157,353 bp length with a typical quadripartite structure. Phylogenetic analysis based on these organelle genomes showed that C. assamica was closely related to C. sinensis and C. leptophylla. It also supports Caryophyllales as Superasterids.
Collapse
|
9
|
Identification of jumonjiC domain containing gene family among the Oryza species and their expression analysis in FL478, a salt tolerant rice genotype. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2018; 130:43-53. [PMID: 29960182 DOI: 10.1016/j.plaphy.2018.06.031] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 06/20/2018] [Accepted: 06/21/2018] [Indexed: 05/26/2023]
Abstract
The jumonji (JMJ)-C domain containing proteins belong to histone demethylases family with the ability to demethylate the tri-methylated histone residues. They act as chromatin regulators to regulate many physiological functions in plants. The present study deals with the characterization of JMJ-C gene family members in wild as well as cultivated rice species and their expression analysis in salt tolerant rice genotype, FL478. The genome wide study identified 151 members belonging to JMJ-C gene family in 11 different Oryza species. We also studied their structure, genomic location, gene duplication events, phylogenetic relationship, in silico expression analysis and identified cis elements in their promoters. We also found a few JMJ-C gene family members in rice which underwent duplication before the whole genome duplication event of the rice. The qRT-PCR based expression profiling revealed that out of the total 15 rice JMJ-C members, two were highly expressed in the flag leaf stage of FL478 under salt treatment. These two candidate JMJ-C members were also found to render salinity tolerance when over-expressed in yeast cells. Thus, the present study helps in further structural as well as functional characterization of JMJ-C genes under salinity stress in Oryza species.
Collapse
|
10
|
Discovery of microRNA-target modules of African rice (Oryza glaberrima) under salinity stress. Sci Rep 2018; 8:570. [PMID: 29330361 PMCID: PMC5766505 DOI: 10.1038/s41598-017-18206-z] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Accepted: 11/22/2017] [Indexed: 11/09/2022] Open
Abstract
Oryza glaberrima is the second edible rice in the genus Oryza. It is grown in the African countries. miRNAs are regulatory molecules that are involved in every domains of gene expression including salinity stress response. Although several miRNAs have been reported from various species of Oryza, yet none of them are from this species. Salt treated (200 mM NaCl for 48 h) and control smallRNA libraries of RAM-100, a salt tolerant genotype, each with 2 replications generated 150 conserve and 348 novel miRNAs. We also used smallRNAseq data of NCBI of O. glaberrima to discover additional 246 known miRNAs. Totally, 29 known and 32 novel miRNAs were differentially regulated under salinity stress. Gene ontology and KEGG analysis indicated several targets were involved in vital biological pathways of salinity stress tolerance. Expression of selected miRNAs as indicated by Illumina data were found to be coherent with real time-PCR analysis. However, target gene expression was inversely correlated with their corresponding miRNAs. Finally based upon present results as well as existing knowledge of literature, we proposed the miRNA-target modules that were induced by salinity stress. Therefore, the present findings provide valuable information about miRNA-target networks in salinity adaption of O. glaberrima.
Collapse
|
11
|
High Quality Unigenes and Microsatellite Markers from Tissue Specific Transcriptome and Development of a Database in Clusterbean (Cyamopsis tetragonoloba, L. Taub). Genes (Basel) 2017; 8:genes8110313. [PMID: 29120386 PMCID: PMC5704226 DOI: 10.3390/genes8110313] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 10/23/2017] [Accepted: 11/06/2017] [Indexed: 12/23/2022] Open
Abstract
Clusterbean (Cyamopsis tetragonoloba L. Taub), is an important industrial, vegetable and forage crop. This crop owes its commercial importance to the presence of guar gum (galactomannans) in its endosperm which is used as a lubricant in a range of industries. Despite its relevance to agriculture and industry, genomic resources available in this crop are limited. Therefore, the present study was undertaken to generate RNA-Seq based transcriptome from leaf, shoot, and flower tissues. A total of 145 million high quality Illumina reads were assembled using Trinity into 127,706 transcripts and 48,007 non-redundant high quality (HQ) unigenes. We annotated 79% unigenes against Plant Genes from the National Center for Biotechnology Information (NCBI), Swiss-Prot, Pfam, gene ontology (GO) and KEGG databases. Among the annotated unigenes, 30,020 were assigned with 116,964 GO terms, 9984 with EC and 6111 with 137 KEGG pathways. At different fragments per kilobase of transcript per millions fragments sequenced (FPKM) levels, genes were found expressed higher in flower tissue followed by shoot and leaf. Additionally, we identified 8687 potential simple sequence repeats (SSRs) with an average frequency of one SSR per 8.75 kb. A total of 28 amplified SSRs in 21 clusterbean genotypes resulted in polymorphism in 13 markers with average polymorphic information content (PIC) of 0.21. We also constructed a database named ‘ClustergeneDB’ for easy retrieval of unigenes and the microsatellite markers. The tissue specific genes identified and the molecular marker resources developed in this study is expected to aid in genetic improvement of clusterbean for its end use.
Collapse
|
12
|
Dissection of genomic features and variations of three pathotypes of Puccinia striiformis through whole genome sequencing. Sci Rep 2017; 7:42419. [PMID: 28211474 PMCID: PMC5314344 DOI: 10.1038/srep42419] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 01/10/2017] [Indexed: 01/28/2023] Open
Abstract
Stripe rust of wheat, caused by Puccinia striiformis f. sp. tritici, is one of the important diseases of wheat. We used NGS technologies to generate a draft genome sequence of two highly virulent (46S 119 and 31) and a least virulent (K) pathotypes of P. striiformis from the Indian subcontinent. We generated ~24,000-32,000 sequence contigs (N50;7.4-9.2 kb), which accounted for ~86X-105X sequence depth coverage with an estimated genome size of these pathotypes ranging from 66.2-70.2 Mb. A genome-wide analysis revealed that pathotype 46S 119 might be highly evolved among the three pathotypes in terms of year of detection and prevalence. SNP analysis revealed that ~47% of the gene sets are affected by nonsynonymous mutations. The extracellular secreted (ES) proteins presumably are well conserved among the three pathotypes, and perhaps purifying selection has an important role in differentiating pathotype 46S 119 from pathotypes K and 31. In the present study, we decoded the genomes of three pathotypes, with 81% of the total annotated genes being successfully assigned functional roles. Besides the identification of secretory genes, genes essential for pathogen-host interactions shall prove this study as a huge genomic resource for the management of this disease using host resistance.
Collapse
|
13
|
Draft Genome of the Wheat Rust Pathogen (Puccinia triticina) Unravels Genome-Wide Structural Variations during Evolution. Genome Biol Evol 2016; 8:2702-21. [PMID: 27521814 PMCID: PMC5630921 DOI: 10.1093/gbe/evw197] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/06/2016] [Indexed: 01/02/2023] Open
Abstract
Leaf rust is one of the most important diseases of wheat and is caused by Puccinia triticina, a highly variable rust pathogen prevalent worldwide. Decoding the genome of this pathogen will help in unraveling the molecular basis of its evolution and in the identification of genes responsible for its various biological functions. We generated high quality draft genome sequences (approximately 100- 106 Mb) of two races of P. triticina; the variable and virulent Race77 and the old, avirulent Race106. The genomes of races 77 and 106 had 33X and 27X coverage, respectively. We predicted 27678 and 26384 genes, with average lengths of 1,129 and 1,086 bases in races 77 and 106, respectively and found that the genomes consisted of 37.49% and 39.99% repetitive sequences. Genome wide comparative analysis revealed that Race77 differs substantially from Race106 with regard to segmental duplication (SD), repeat element, and SNP/InDel characteristics. Comparative analyses showed that Race 77 is a recent, highly variable and adapted Race compared with Race106. Further sequence analyses of 13 additional pathotypes of Race77 clearly differentiated the recent, active and virulent, from the older pathotypes. Average densities of 2.4 SNPs and 0.32 InDels per kb were obtained for all P. triticina pathotypes. Secretome analysis demonstrated that Race77 has more virulence factors than Race 106, which may be responsible for the greater degree of adaptation of this pathogen. We also found that genes under greater selection pressure were conserved in the genomes of both races, and may affect functions crucial for the higher levels of virulence factors in Race77. This study provides insights into the genome structure, genome organization, molecular basis of variation, and pathogenicity of P. triticina The genome sequence data generated in this study have been submitted to public domain databases and will be an important resource for comparative genomics studies of the more than 4000 existing Puccinia species.
Collapse
|