1
|
Panahi B, Shahi A. Trancriptome data mining in combination with co-expression network analysis identifies the functional modules and critical regulators in Hordeum vulgare L. in response to cold stress. Biochem Biophys Rep 2024; 37:101620. [PMID: 38155945 PMCID: PMC10753052 DOI: 10.1016/j.bbrep.2023.101620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 12/13/2023] [Accepted: 12/13/2023] [Indexed: 12/30/2023] Open
Abstract
Cold stress, as an abiotic stress, is one of the most limiting factors which pose a great threat to the plant's productivity. To understand the transcriptional regulation and connectivity pattern of genes involved in barley cold stress responses, co-expression network analysis was performed based on the global transcriptome profiling. The microarray datasets related to cold stress treatments were retrieved from the Gene Expression Omnibus (GEO) and Array express databases. Four microarray datasets related to cold stress-responsive transcriptome in barley were included in our study. Gene co-expression analysis was constructed using WGCNA method. Module-Trait Relationships (MTR) analysis and hub genes determination and validation were carried out. Finally, transcription factor and kinase regulatory networks were Inferred using machine learning algorithm. The co-expression modules were determined using beta index = 10. In total 13 co-expressed modules were identified with an average size of 153 genes. Functional enrichment based on gene ontology (GO) showed that each of the stress related significant modules were enriched in different biological processes. Annotation of significant modules identifies some TFs and Kinases such as ethylene-responsive transcription factor 1-like, transcription factor PCL1-like, transcription factor MYC2, WRKY, serine/threonine-protein kinase PBL7, and receptor-like protein kinase At2g42960 were contributed in barley cold stress response. Our analysis highlighted the functional importance of ABA signaling pathway, ROS signaling, defensive and protective proteins, degrading protein, Ca2+ related signaling, ribosome-mediated translation and etc. in responding of barley to cold stress condition. The current findings add substantially to our understanding of the cold responsive underlying mechanism of barley which can serve in future studies and breeding programs.
Collapse
Affiliation(s)
- Bahman Panahi
- Department of Genomics, Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| | - Ali Shahi
- Faculty of Agriculture (Meshgin-Shahr Campus), University of Mohaghegh Ardabili, Ardabil, Iran
| |
Collapse
|
2
|
Identification of let-7f and miR-338 as plasma-based biomarkers for sporadic amyotrophic lateral sclerosis using meta-analysis and empirical validation. Sci Rep 2022; 12:1373. [PMID: 35082326 PMCID: PMC8791978 DOI: 10.1038/s41598-022-05067-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 01/06/2022] [Indexed: 12/11/2022] Open
Abstract
AbstractAmyotrophic lateral sclerosis (ALS) is a lethal neurodegenerative disease that in most cases occurs sporadic (sALS). The disease is not curable, and its pathogenesis mechanisms are not well understood yet. Given the intricacy of underlying molecular interactions and heterogeneity of ALS, the discovery of molecules contributing to disease onset and progression will open a new avenue for advancement in early diagnosis and therapeutic intervention. Here we conducted a meta-analysis of 12 circulating miRNA profiling studies using the robust rank aggregation (RRA) method, followed by enrichment analysis and experimental verification. We identified miR-451a and let-7f-5p as meta-signature miRNAs whose targets are involved in critical pathogenic pathways underlying ALS, including ‘FoxO signaling pathway’, ‘MAPK signaling pathway’, and ‘apoptosis’. A systematic review of 7 circulating gene profiling studies elucidated that 241 genes up-regulated in sALS circulation with concomitant being targets of the meta-signature miRNAs. Protein–protein interaction (PPI) network analysis of the candidate targets using MCODE algorithm revealed the main subcluster is involved in multiple cascades eventually leads apoptosis, including ‘positive regulation of neuron apoptosis. Besides, we validated the meta-analysis results using RT-qPCR. Indeed, relative expression analysis verified let-7f-5p and miR-338-3p as significantly down-regulated and up-regulated biomarkers in the plasma of sALS patients, respectively. Receiver operating characteristic (ROC) analysis also highlighted the let-7f-5p and miR-338-3p potential as robustness plasma biomarkers for diagnosis and potential therapeutic targets of sALS disease.
Collapse
|
3
|
Milne L, Bayer M, Rapazote-Flores P, Mayer CD, Waugh R, Simpson CG. EORNA, a barley gene and transcript abundance database. Sci Data 2021; 8:90. [PMID: 33767193 DOI: 10.1038/s41597-021-00872-874] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 02/22/2021] [Indexed: 05/27/2023] Open
Abstract
A high-quality, barley gene reference transcript dataset (BaRTv1.0), was used to quantify gene and transcript abundances from 22 RNA-seq experiments, covering 843 separate samples. Using the abundance data we developed a Barley Expression Database (EORNA*) to underpin a visualisation tool that displays comparative gene and transcript abundance data on demand as transcripts per million (TPM) across all samples and all the genes. EORNA provides gene and transcript models for all of the transcripts contained in BaRTV1.0, and these can be conveniently identified through either BaRT or HORVU gene names, or by direct BLAST of query sequences. Browsing the quantification data reveals cultivar, tissue and condition specific gene expression and shows changes in the proportions of individual transcripts that have arisen via alternative splicing. TPM values can be easily extracted to allow users to determine the statistical significance of observed transcript abundance variation among samples or perform meta analyses on multiple RNA-seq experiments. * Eòrna is the Scottish Gaelic word for Barley.
Collapse
Affiliation(s)
- Linda Milne
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Micha Bayer
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Paulo Rapazote-Flores
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Claus-Dieter Mayer
- Biomathematics and Statistics Scotland, University of Aberdeen, Aberdeen, AB25 2ZD, UK
| | - Robbie Waugh
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- School of Agriculture and Wine & Waite Research Institute, University of Adelaide, Waite Campus, Glen Osmond, SA, 5064, Australia
| | - Craig G Simpson
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
| |
Collapse
|
4
|
Milne L, Bayer M, Rapazote-Flores P, Mayer CD, Waugh R, Simpson CG. EORNA, a barley gene and transcript abundance database. Sci Data 2021; 8:90. [PMID: 33767193 PMCID: PMC7994555 DOI: 10.1038/s41597-021-00872-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 02/22/2021] [Indexed: 01/31/2023] Open
Abstract
A high-quality, barley gene reference transcript dataset (BaRTv1.0), was used to quantify gene and transcript abundances from 22 RNA-seq experiments, covering 843 separate samples. Using the abundance data we developed a Barley Expression Database (EORNA*) to underpin a visualisation tool that displays comparative gene and transcript abundance data on demand as transcripts per million (TPM) across all samples and all the genes. EORNA provides gene and transcript models for all of the transcripts contained in BaRTV1.0, and these can be conveniently identified through either BaRT or HORVU gene names, or by direct BLAST of query sequences. Browsing the quantification data reveals cultivar, tissue and condition specific gene expression and shows changes in the proportions of individual transcripts that have arisen via alternative splicing. TPM values can be easily extracted to allow users to determine the statistical significance of observed transcript abundance variation among samples or perform meta analyses on multiple RNA-seq experiments. * Eòrna is the Scottish Gaelic word for Barley.
Collapse
Affiliation(s)
- Linda Milne
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Micha Bayer
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Paulo Rapazote-Flores
- Information and Computational Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Claus-Dieter Mayer
- Biomathematics and Statistics Scotland, University of Aberdeen, Aberdeen, AB25 2ZD, UK
| | - Robbie Waugh
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- School of Agriculture and Wine & Waite Research Institute, University of Adelaide, Waite Campus, Glen Osmond, SA, 5064, Australia
| | - Craig G Simpson
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
| |
Collapse
|
5
|
Farhadian M, Rafat SA, Panahi B, Mayack C. Weighted gene co-expression network analysis identifies modules and functionally enriched pathways in the lactation process. Sci Rep 2021; 11:2367. [PMID: 33504890 PMCID: PMC7840764 DOI: 10.1038/s41598-021-81888-z] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 01/13/2021] [Indexed: 01/02/2023] Open
Abstract
The exponential growth in knowledge has resulted in a better understanding of the lactation process in a wide variety of animals. However, the underlying genetic mechanisms are not yet clearly known. In order to identify the mechanisms involved in the lactation process, various mehods, including meta-analysis, weighted gene co-express network analysis (WGCNA), hub genes identification, gene ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment at before peak (BP), peak (P), and after peak (AP) stages of the lactation processes have been employed. A total of 104, 85, and 26 differentially expressed genes were identified based on PB vs. P, BP vs. AP, and P vs. AP comparisons, respectively. GO and KEGG pathway enrichment analysis revealed that DEGs were significantly enriched in the "ubiquitin-dependent ERAD" and the "chaperone cofactor-dependent protein refolding" in BP vs. P and P vs. P, respectively. WGCNA identified five significant functional modules related to the lactation process. Moreover, GJA1, AP2A2, and NPAS3 were defined as hub genes in the identified modules, highlighting the importance of their regulatory impacts on the lactation process. The findings of this study provide new insights into the complex regulatory networks of the lactation process at three distinct stages, while suggesting several candidate genes that may be useful for future animal breeding programs. Furthermore, this study supports the notion that in combination with a meta-analysis, the WGCNA represents an opportunity to achieve a higher resolution analysis that can better predict the most important functional genes that might provide a more robust bio-signature for phenotypic traits, thus providing more suitable biomarker candidates for future studies.
Collapse
Affiliation(s)
- Mohammad Farhadian
- Department of Animal Science, Faculty of Agriculture, University of Tabriz, Tabriz, Iran.
| | - Seyed Abbas Rafat
- Department of Animal Science, Faculty of Agriculture, University of Tabriz, Tabriz, Iran
| | - Bahman Panahi
- Department of Genomics, Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| | - Christopher Mayack
- Molecular Biology, Genetics, and Bioengineering, Faculty of Engineering and Natural Sciences, Sabancı University, Istanbul, 34956, Turkey
| |
Collapse
|
6
|
Panahi B, Hejazi MA. Weighted gene co-expression network analysis of the salt-responsive transcriptomes reveals novel hub genes in green halophytic microalgae Dunaliella salina. Sci Rep 2021; 11:1607. [PMID: 33452393 PMCID: PMC7810892 DOI: 10.1038/s41598-020-80945-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 12/30/2020] [Indexed: 12/24/2022] Open
Abstract
Despite responses to salinity stress in Dunaliella salina, a unicellular halotolerant green alga, being subject to extensive study, but the underlying molecular mechanism remains unknown. Here, Empirical Bayes method was applied to identify the common differentially expressed genes (DEGs) between hypersaline and normal conditions. Then, using weighted gene co-expression network analysis (WGCNA), which takes advantage of a graph theoretical approach, highly correlated genes were clustered as a module. Subsequently, connectivity patterns of the identified modules in two conditions were surveyed to define preserved and non-preserved modules by combining the Zsummary and medianRank measures. Finally, common and specific hub genes in non-preserved modules were determined using Eigengene-based module connectivity or module membership (kME) measures and validation was performed by using leave-one-out cross-validation (LOOCV). In this study, the power of beta = 12 (scale-free R2 = 0.8) was selected as the soft-thresholding to ensure a scale-free network, which led to the identification of 15 co-expression modules. Results also indicate that green, blue, brown, and yellow modules are non-preserved in salinity stress conditions. Examples of enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways in non-preserved modules are Sulfur metabolism, Oxidative phosphorylation, Porphyrin and chlorophyll metabolism, Vitamin B6 metabolism. Moreover, the systems biology approach was applied here, proposed some salinity specific hub genes, such as radical-induced cell death1 protein (RCD1), mitogen-activated protein kinase kinase kinase 13 (MAP3K13), long-chain acyl-CoA synthetase (ACSL), acetyl-CoA carboxylase, biotin carboxylase subunit (AccC), and fructose-bisphosphate aldolase (ALDO), for the development of metabolites accumulating strains in D. salina.
Collapse
Affiliation(s)
- Bahman Panahi
- Department of Genomics, Branch for Northwest & West region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, 5156915-598, Iran.
| | - Mohammad Amin Hejazi
- Department of Food Biotechnology, Branch for Northwest & West region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, 5156915-598, Iran
| |
Collapse
|
7
|
Global Analysis of Cereal microProteins Suggests Diverse Roles in Crop Development and Environmental Adaptation. G3-GENES GENOMES GENETICS 2020; 10:3709-3717. [PMID: 32763954 PMCID: PMC7534434 DOI: 10.1534/g3.120.400794] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
MicroProteins are a class of small single-domain proteins that post-translationally regulate larger multidomain proteins from which they evolved or which they relate to. They disrupt the normal function of their targets by forming microProtein-target heterodimers through compatible protein-protein interaction (PPI) domains. Recent studies confirm the significance of microProteins in the fine-tuning of plant developmental processes such as shoot apical meristem maintenance and flowering time regulation. While there are a number of well-characterized microProteins in Arabidopsis thaliana, studies from more complex plant genomes are still missing. We have previously developed miPFinder, a software for identifying microProteins from annotated genomes. Here we present an improved version where we have updated the algorithm to increase its accuracy and speed, and used it to analyze five cereal crop genomes – wheat, rice, barley, maize and sorghum. We found 20,064 potential microProteins from a total of 258,029 proteins in these five organisms, of which approximately 2000 are high-confidence, i.e., likely to function as actual microProteins. Gene ontology analysis of these 2000 microProtein candidates revealed their roles in stress, light and growth responses, hormone signaling and transcriptional regulation. Using a recently developed rice gene co-expression database, we analyzed 347 potential rice microProteins that are also conserved in other cereal crops and found over 50 of these rice microProteins to be co-regulated with their identified interaction partners. Overall, our study reveals a rich source of biotechnologically interesting small proteins that regulate fundamental plant processes such a growth and stress response that could be utilized in crop bioengineering.
Collapse
|
8
|
Farhadian M, Rafat SA, Panahi B, Ebrahimie E. Transcriptome signature of two lactation stages in Ghezel sheep identifies using RNA-Sequencing. Anim Biotechnol 2020; 33:223-233. [PMID: 32633600 DOI: 10.1080/10495398.2020.1784185] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The expression of genes and their regulation during lactation in Ghezel sheep breed remains less understood. To explore the underlying molecular mechanism of the lactation process in the mammary gland, transcriptome profiles of Iranian fat-tailed Ghezel sheep breed milk at two stages, before (BF) and after peak (AF) stages of lactation were investigated. Functional impacts of differentially expressed genes (DEGs) between BF and AF stages were surveyed using Gene Ontology (GO) and Protein-Protein Interaction (PPI) network analysis. Totally, 75 DEGs were identified between BF and AF stages of lactation. The RNA-Seq results were validated by Q-RT-PCR. Gene ontology of DEGs mainly enriched in metabolic process and oxidative phosphorylation. PPI network analysis also highlighted the contribution of peroxisome proliferator-activated receptors (PPAR) signaling, oxidative phosphorylation and metabolic pathways in the lactation process. Intriguingly, the genes involved in fat metabolism dominantly down-regulated at AF stage. Our results provide new insight into transcriptional changes and add to growing body of knowledge on the lactation process in fat-tailed sheep breeds.
Collapse
Affiliation(s)
- Mohammad Farhadian
- Department of Animal Science, Faculty of Agriculture, University of Tabriz, Tabriz, Iran
| | - Seyed Aabbas Rafat
- Department of Animal Science, Faculty of Agriculture, University of Tabriz, Tabriz, Iran
| | - Bahman Panahi
- Department of Genomics, Branch for Northwest and West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| | - Esmaeil Ebrahimie
- Genomics Research Platform, School of Life Sciences, La Trobe University, Melbourne, Victoria, Australia.,School of Animal and Veterinary Sciences, The University of Adelaide, South Australia, Australia
| |
Collapse
|
9
|
Integrative analysis of gene expression and alternative splicing in microalgae grown under heterotrophic condition. PLoS One 2020; 15:e0234710. [PMID: 32555718 PMCID: PMC7299362 DOI: 10.1371/journal.pone.0234710] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Accepted: 05/31/2020] [Indexed: 02/04/2023] Open
Abstract
Heterotrophic cultures are the most effective approach to overcome low growth rate challenge in the most commercial microalgae. However, the mechanism through which heterotrophic condition regulates algae metabolism are not completely clear. Alternative Splicing (AS) is a common posttranscriptional process by which transcriptome and proteome plasticity increases at different environmental conditions. To identify and characterize of AS events in Auxenochlorella protothecoides microalga grown in autotrophic and heterotrophic, RNA-Seq data were analysed. We found that AS increased with the transition from autotrophic to heterotrophic condition. 705 and 660 differentially expressed (DEG) and spliced (DAS) genes were identified for A.protothecoides was transferred from autotrophic to heterotrophic condition, respectively. Moreover, there was slight coverage between DEG and DAS genes. Furthermore, functional analysis showed that the DAS genes are most frequently related to ion binding and stimulus response. The results also indicated that prevalence of Intron retention is associated with down-regulation of the genes involved in carotenoid biosynthesis. This study provides valuable insights into transcriptional and posttranscriptional plasticity of microalgae during growth mode change.
Collapse
|
10
|
Panahi B, Farhadian M, Hejazi MA. Systems biology approach identifies functional modules and regulatory hubs related to secondary metabolites accumulation after transition from autotrophic to heterotrophic growth condition in microalgae. PLoS One 2020; 15:e0225677. [PMID: 32084664 PMCID: PMC7035001 DOI: 10.1371/journal.pone.0225677] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2019] [Accepted: 01/21/2020] [Indexed: 11/22/2022] Open
Abstract
Heterotrophic growth mode is among the most promising strategies put forth to overcome the low biomass and secondary metabolites productivity challenge. To shedding light on the underlying molecular mechanisms, transcriptome meta-analysis was integrated with weighted gene co-expression network analysis (WGCNA), connectivity analysis, functional enrichment, and hubs identification. Meta-analysis and Functional enrichment analysis demonstrated that most of the biological processes are up-regulated at heterotrophic growth condition, which leads to change of genetic architectures and phenotypic outcomes. WGNCA analysis of meta-genes also resulted four significant functional modules across logarithmic (LG), transition (TR), and production peak (PR) phases. The expression pattern and connectivity characteristics of the brown module as a non-preserved module vary across LG, TR, and PR phases. Functional analysis identified Carotenoid biosynthesis, Fatty acid metabolism and Methane metabolism as enriched pathways in the non-preserved module. Our integrated approach was applied here, identified some hubs, such as a serine hydroxymethyltransferase (SHMT1), which is the best candidate for development of metabolites accumulating strains in microalgae. Current study provided a new insight into underlying metabolite accumulation mechanisms and opens new avenue for the future applied studies in the microalgae field.
Collapse
Affiliation(s)
- Bahman Panahi
- Department of Genomics, Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
- * E-mail: ,
| | - Mohammad Farhadian
- Department of Animal Science, Faculty of Agriculture, University of Tabriz, Tabriz, Iran
| | - Mohammad Amin Hejazi
- Department of Food Biotechnology, Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| |
Collapse
|
11
|
Rapazote-Flores P, Bayer M, Milne L, Mayer CD, Fuller J, Guo W, Hedley PE, Morris J, Halpin C, Kam J, McKim SM, Zwirek M, Casao MC, Barakate A, Schreiber M, Stephen G, Zhang R, Brown JWS, Waugh R, Simpson CG. BaRTv1.0: an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq. BMC Genomics 2019; 20:968. [PMID: 31829136 DOI: 10.1186/s12864-019-6243-6247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/29/2019] [Indexed: 05/27/2023] Open
Abstract
BACKGROUND The time required to analyse RNA-seq data varies considerably, due to discrete steps for computational assembly, quantification of gene expression and splicing analysis. Recent fast non-alignment tools such as Kallisto and Salmon overcome these problems, but these tools require a high quality, comprehensive reference transcripts dataset (RTD), which are rarely available in plants. RESULTS A high-quality, non-redundant barley gene RTD and database (Barley Reference Transcripts - BaRTv1.0) has been generated. BaRTv1.0, was constructed from a range of tissues, cultivars and abiotic treatments and transcripts assembled and aligned to the barley cv. Morex reference genome (Mascher et al. Nature; 544: 427-433, 2017). Full-length cDNAs from the barley variety Haruna nijo (Matsumoto et al. Plant Physiol; 156: 20-28, 2011) determined transcript coverage, and high-resolution RT-PCR validated alternatively spliced (AS) transcripts of 86 genes in five different organs and tissue. These methods were used as benchmarks to select an optimal barley RTD. BaRTv1.0-Quantification of Alternatively Spliced Isoforms (QUASI) was also made to overcome inaccurate quantification due to variation in 5' and 3' UTR ends of transcripts. BaRTv1.0-QUASI was used for accurate transcript quantification of RNA-seq data of five barley organs/tissues. This analysis identified 20,972 significant differentially expressed genes, 2791 differentially alternatively spliced genes and 2768 transcripts with differential transcript usage. CONCLUSION A high confidence barley reference transcript dataset consisting of 60,444 genes with 177,240 transcripts has been generated. Compared to current barley transcripts, BaRTv1.0 transcripts are generally longer, have less fragmentation and improved gene models that are well supported by splice junction reads. Precise transcript quantification using BaRTv1.0 allows routine analysis of gene expression and AS.
Collapse
Affiliation(s)
- Paulo Rapazote-Flores
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Micha Bayer
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Linda Milne
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | | | - John Fuller
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Wenbin Guo
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Pete E Hedley
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Jenny Morris
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Claire Halpin
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Jason Kam
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present address: Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Gogerddan, Aberystwyth, Ceredigion, SY23 3EB, UK
| | - Sarah M McKim
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Monika Zwirek
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present Address: MRC Protein Phosphorylation and Ubiquitylation Unit, Sir James Black Centre, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, UK
| | - M Cristina Casao
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Abdellah Barakate
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Miriam Schreiber
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Gordon Stephen
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Runxuan Zhang
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - John W S Brown
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Robbie Waugh
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Craig G Simpson
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
| |
Collapse
|
12
|
Rapazote-Flores P, Bayer M, Milne L, Mayer CD, Fuller J, Guo W, Hedley PE, Morris J, Halpin C, Kam J, McKim SM, Zwirek M, Casao MC, Barakate A, Schreiber M, Stephen G, Zhang R, Brown JWS, Waugh R, Simpson CG. BaRTv1.0: an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq. BMC Genomics 2019; 20:968. [PMID: 31829136 PMCID: PMC6907147 DOI: 10.1186/s12864-019-6243-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/29/2019] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The time required to analyse RNA-seq data varies considerably, due to discrete steps for computational assembly, quantification of gene expression and splicing analysis. Recent fast non-alignment tools such as Kallisto and Salmon overcome these problems, but these tools require a high quality, comprehensive reference transcripts dataset (RTD), which are rarely available in plants. RESULTS A high-quality, non-redundant barley gene RTD and database (Barley Reference Transcripts - BaRTv1.0) has been generated. BaRTv1.0, was constructed from a range of tissues, cultivars and abiotic treatments and transcripts assembled and aligned to the barley cv. Morex reference genome (Mascher et al. Nature; 544: 427-433, 2017). Full-length cDNAs from the barley variety Haruna nijo (Matsumoto et al. Plant Physiol; 156: 20-28, 2011) determined transcript coverage, and high-resolution RT-PCR validated alternatively spliced (AS) transcripts of 86 genes in five different organs and tissue. These methods were used as benchmarks to select an optimal barley RTD. BaRTv1.0-Quantification of Alternatively Spliced Isoforms (QUASI) was also made to overcome inaccurate quantification due to variation in 5' and 3' UTR ends of transcripts. BaRTv1.0-QUASI was used for accurate transcript quantification of RNA-seq data of five barley organs/tissues. This analysis identified 20,972 significant differentially expressed genes, 2791 differentially alternatively spliced genes and 2768 transcripts with differential transcript usage. CONCLUSION A high confidence barley reference transcript dataset consisting of 60,444 genes with 177,240 transcripts has been generated. Compared to current barley transcripts, BaRTv1.0 transcripts are generally longer, have less fragmentation and improved gene models that are well supported by splice junction reads. Precise transcript quantification using BaRTv1.0 allows routine analysis of gene expression and AS.
Collapse
Affiliation(s)
- Paulo Rapazote-Flores
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Micha Bayer
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Linda Milne
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | | | - John Fuller
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Wenbin Guo
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Pete E Hedley
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Jenny Morris
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Claire Halpin
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Jason Kam
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present address: Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Gogerddan, Aberystwyth, Ceredigion, SY23 3EB, UK
| | - Sarah M McKim
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Monika Zwirek
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present Address: MRC Protein Phosphorylation and Ubiquitylation Unit, Sir James Black Centre, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, UK
| | - M Cristina Casao
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Abdellah Barakate
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Miriam Schreiber
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Gordon Stephen
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Runxuan Zhang
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - John W S Brown
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Robbie Waugh
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Craig G Simpson
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
| |
Collapse
|
13
|
Panahi B, Frahadian M, Dums JT, Hejazi MA. Integration of Cross Species RNA-seq Meta-Analysis and Machine-Learning Models Identifies the Most Important Salt Stress-Responsive Pathways in Microalga Dunaliella. Front Genet 2019; 10:752. [PMID: 31555319 PMCID: PMC6727038 DOI: 10.3389/fgene.2019.00752] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2019] [Accepted: 07/17/2019] [Indexed: 01/12/2023] Open
Abstract
Photosynthetic microalgae are potentially yielding sources of different high-value secondary metabolites. Salinity is a complex stress that influences various metabolite-related pathways in microalgae. To obtain a clear view of the underlying metabolic pathways and resolve contradictory information concerning the transcriptional regulation of Dunaliella species in salt stress conditions, RNA-seq meta-analysis along with systems levels analysis was conducted. A p-value combination technique with Fisher method was used for cross species meta-analysis on the transcriptomes of two Dunaliella salina and Dunaliellatertiolecta species. The potential functional impacts of core meta-genes were surveyed based on gene ontology and network analysis. In the current study, the integration of supervised machine-learning algorithms with RNA-seq meta-analysis was performed. The analysis shows that the lipid and nitrogen metabolism, structural proteins of photosynthesis apparatus, chaperone-mediated autophagy, and ROS-related genes are the keys and core elements of the Dunaliella salt stress response system. Cross-talk between Ca2+ signal transduction, lipid accumulation, and ROS signaling network in salt stress conditions are also proposed. Our novel approach opens new avenues for better understanding of microalgae stress response mechanisms and for selection of candidate gene targets for metabolite production in microalgae.
Collapse
Affiliation(s)
- Bahman Panahi
- Department of Genomics, Branch for Northwest & West region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| | - Mohammad Frahadian
- Department of Animal Science, Faculty of Agriculture, University of Tabriz, Tabriz, Iran
| | - Jacob T Dums
- Department of Plant and Soil Sciences, University of Delaware, Newark, DE, USA
| | - Mohammad Amin Hejazi
- Department of Food Biotechnology, Branch for Northwest & West region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| |
Collapse
|
14
|
Sharifi S, Pakdel A, Ebrahimie E, Aryan Y, Ghaderi Zefrehee M, Reecy JM. Prediction of key regulators and downstream targets of E. coli induced mastitis. J Appl Genet 2019; 60:367-373. [DOI: 10.1007/s13353-019-00499-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 04/02/2019] [Accepted: 05/21/2019] [Indexed: 01/04/2023]
|
15
|
Panahi B, Mohammadi SA, Ruzicka K, Abbasi Holaso H, Zare Mehrjerdi M. Genome-wide identification and co-expression network analysis of nuclear factor-Y in barley revealed potential functions in salt stress. PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2019; 25:485-495. [PMID: 30956430 PMCID: PMC6419857 DOI: 10.1007/s12298-018-00637-1] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 11/05/2018] [Accepted: 12/25/2018] [Indexed: 05/25/2023]
Abstract
Nuclear factor-Ys (NF-Ys) were previously shown to have important regulatory impacts in different developmental and physiological process. However, in barley the function of the NF-Y genes at system levels is not well known. To identify barley NF-Ys, Arabidopsis and wheat NF-Y protein sequences were retrieved and the BLAST program along with the hidden Markov model were used. Multiple sequence alignments of identified NF-Ys were constructed using ClustalW. Expression patterns of the NF-Ys at different physiological and developmental conditions were also surveyed based on microarray datasets in public databases and subsequently co-expression network were constructed. Validation of in silico expression analysis was performed by real-time qPCR under salt stress condition. In total, 23 barley NF-Ys (8 NF-YA, 11 NF-YB and 4 NF-YC) were identified. Based on the sequence homology, the subunits of the NF-Y complex were divided into three to five groups. Structural analysis highlighted the conserved domains of HvNF-YA, HvNF-YB and HvNF-YC. Co-expression network analysis indicated the potential functions of HvNF-Ys in photosynthesis, starch biosynthesis and osmotic stress tolerance. The results of qRT-PCR also confirmed the HvNF-Ys roles in adaptation responses of barley to salt stress. We identified some potential candidate genes which could be used for improvements of cereals tolerance to salinity stress.
Collapse
Affiliation(s)
- Bahman Panahi
- Department of Genomics, Branch for Northwest and West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| | - Seyyed Abolghasem Mohammadi
- Department of Plant Breeding and Biotechnology, University of Tabriz, Tabriz, Iran
- Center of Excellence in Cereal Molecular Breeding, University of Tabriz, Tabriz, Iran
| | - Kamil Ruzicka
- Laboratory of Hormonal Regulations in Plants, Institute of Experimental Botany, Czech Academy of Sciences, Rozvojová 263, Prague 6, Czech Republic
| | | | | |
Collapse
|
16
|
Molecular characterization of Brassica napus stress related transcription factors, BnMYB44 and BnVIP1, selected based on comparative analysis of Arabidopsis thaliana and Eutrema salsugineum transcriptomes. Mol Biol Rep 2018; 45:1111-1124. [DOI: 10.1007/s11033-018-4262-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 07/13/2018] [Indexed: 10/28/2022]
|
17
|
Genome-wide analysis of the SPL/miR156 module and its interaction with the AP2/miR172 unit in barley. Sci Rep 2018; 8:7085. [PMID: 29728569 PMCID: PMC5935748 DOI: 10.1038/s41598-018-25349-0] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Accepted: 04/11/2018] [Indexed: 12/20/2022] Open
Abstract
The SQUAMOSA-promoter binding like (SPL) gene family encodes transcription factors that have been shown in many species to influence plant growth and development, but information about these genes in barley (Hordeum vulgare L.) is limited. This study identified 17 barley SPL genes, within eight distinct groups, that are orthologs of SPL genes described in Arabidopsis, wheat, and rice. Sixteen barley SPLs undergo alternative splicing. Seven SPLs contain a putative miR156 target site and the transcript levels of the miR156-targeted HvSPLs (HvSPL3, 13 and 23) were lower in vegetative than in reproductive phase but this was true also for some SPL genes such as HvSPL6 that were not regulated by miR156. Because SPL gene products regulate miR172, which is also involved in floral development, the expression of miR172 was studied. An antagonistic expression pattern of miR156 and miR172b during the vegetative and the reproductive phases signifies their apparent function in barley growth phase transition. Characterization of a barley mir172 mutant having an abnormal, indeterminate spikelet phenotype suggests the possible feedback role of AP2/miR172 module on HvSPL genes. This is the first comprehensive analysis of the miR156/SPL/miR172 axis in barley that provides a basis to elucidate their roles in various biological processes.
Collapse
|
18
|
Wei G, Liu K, Shen T, Shi J, Liu B, Han M, Peng M, Fu H, Song Y, Zhu J, Dong A, Ni T. Position-specific intron retention is mediated by the histone methyltransferase SDG725. BMC Biol 2018; 16:44. [PMID: 29706137 PMCID: PMC5925840 DOI: 10.1186/s12915-018-0513-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2018] [Accepted: 04/05/2018] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Intron retention (IR), the most prevalent alternative splicing form in plants, plays a critical role in gene expression during plant development and stress response. However, the molecular mechanisms underlying IR regulation remain largely unknown. RESULTS Knockdown of SDG725, a histone H3 lysine 36 (H3K36)-specific methyltransferase in rice, leads to alterations of IR in more than 4700 genes. Surprisingly, IR events are globally increased at the 5' region but decreased at the 3' region of the gene body in the SDG725-knockdown mutant. Chromatin immunoprecipitation sequencing analyses reveal that SDG725 depletion results in a genome-wide increase of the H3K36 mono-methylation (H3K36me1) but, unexpectedly, promoter-proximal shifts of H3K36 di- and tri-methylation (H3K36me2 and H3K36me3). Consistent with the results in animals, the levels of H3K36me1/me2/me3 in rice positively correlate with gene expression levels, whereas shifts of H3K36me2/me3 coincide with position-specific alterations of IR. We find that either H3K36me2 or H3K36me3 alone contributes to the positional change of IR caused by SDG725 knockdown, although IR shift is more significant when both H3K36me2 and H3K36me3 modifications are simultaneously shifted. CONCLUSIONS Our results revealed that SDG725 modulates IR in a position-specific manner, indicating that H3K36 methylation plays a role in RNA splicing, probably by marking the retained introns in plants.
Collapse
Affiliation(s)
- Gang Wei
- State Key Laboratory of Genetic Engineering and MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, People's Republic of China
| | - Kunpeng Liu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, International Associated Laboratory of CNRS-Fudan-HUNAU on Plant Epigenome Research, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China
| | - Ting Shen
- State Key Laboratory of Genetic Engineering and MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, People's Republic of China
| | - Jinlei Shi
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, International Associated Laboratory of CNRS-Fudan-HUNAU on Plant Epigenome Research, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China
| | - Bing Liu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, International Associated Laboratory of CNRS-Fudan-HUNAU on Plant Epigenome Research, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China
| | - Miao Han
- State Key Laboratory of Genetic Engineering and MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, People's Republic of China
| | - Maolin Peng
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, International Associated Laboratory of CNRS-Fudan-HUNAU on Plant Epigenome Research, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China
| | - Haihui Fu
- State Key Laboratory of Genetic Engineering and MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, People's Republic of China
| | - Yifan Song
- State Key Laboratory of Genetic Engineering and MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, People's Republic of China
| | - Jun Zhu
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
| | - Aiwu Dong
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, International Associated Laboratory of CNRS-Fudan-HUNAU on Plant Epigenome Research, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China.
| | - Ting Ni
- State Key Laboratory of Genetic Engineering and MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, Human Phenome Institute, School of Life Sciences and Huashan Hospital, Fudan University, Shanghai, 200438, People's Republic of China.
| |
Collapse
|
19
|
Baharlou Houreh M, Ghorbani Kalkhajeh P, Niazi A, Ebrahimi F, Ebrahimie E. SpliceDetector: a software for detection of alternative splicing events in human and model organisms directly from transcript IDs. Sci Rep 2018; 8:5063. [PMID: 29567976 PMCID: PMC5864913 DOI: 10.1038/s41598-018-23245-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2017] [Accepted: 03/02/2018] [Indexed: 12/14/2022] Open
Abstract
In eukaryotes, different combinations of exons lead to multiple transcripts with various functions in protein level, in a process called alternative splicing (AS). Unfolding the complexity of functional genomics through genome-wide profiling of AS and determining the altered ultimate products provide new insights for better understanding of many biological processes, disease progress as well as drug development programs to target harmful splicing variants. The current available tools of alternative splicing work with raw data and include heavy computation. In particular, there is a shortcoming in tools to discover AS events directly from transcripts. Here, we developed a Windows-based user-friendly tool for identifying AS events from transcripts without the need to any advanced computer skill or database download. Meanwhile, due to online working mode, our application employs the updated SpliceGraphs without the need to any resource updating. First, SpliceGraph forms based on the frequency of active splice sites in pre-mRNA. Then, the presented approach compares query transcript exons to SpliceGraph exons. The tool provides the possibility of statistical analysis of AS events as well as AS visualization compared to SpliceGraph. The developed application works for transcript sets in human and model organisms.
Collapse
Affiliation(s)
| | | | - Ali Niazi
- Institute of Biotechnology, Shiraz University, Shiraz, Iran
| | | | - Esmaeil Ebrahimie
- Institute of Biotechnology, Shiraz University, Shiraz, Iran. .,Adelaide Medical School, The University of Adelaide, Adelaide, Australia. .,School of Information Technology and Mathematical Sciences, Division of Information Technology, Engineering and the Environment, The University of South Australia, Adelaide, SA, Australia. .,School of Biological Sciences, Faculty of Science and Engineering, Flinders University, Adelaide, SA, Australia.
| |
Collapse
|
20
|
Mao R, Liang C, Zhang Y, Hao X, Li J. 50/50 Expressional Odds of Retention Signifies the Distinction between Retained Introns and Constitutively Spliced Introns in Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2017; 8:1728. [PMID: 29062321 PMCID: PMC5640774 DOI: 10.3389/fpls.2017.01728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Accepted: 09/21/2017] [Indexed: 05/23/2023]
Abstract
Intron retention, one of the most prevalent alternative splicing events in plants, can lead to introns retained in mature mRNAs. However, in comparison with constitutively spliced introns (CSIs), the relevantly distinguishable features for retained introns (RIs) are still poorly understood. This work proposes a computational pipeline to discover novel RIs from multiple next-generation RNA sequencing (RNA-Seq) datasets of Arabidopsis thaliana. Using this pipeline, we detected 3,472 novel RIs from 18 RNA-Seq datasets and re-confirmed 1,384 RIs which are currently annotated in the TAIR10 database. We also use the expression of intron-containing isoforms as a new feature in addition to the conventional features. Based on these features, RIs are highly distinguishable from CSIs by machine learning methods, especially when the expressional odds of retention (i.e., the expression ratio of the RI-containing isoforms relative to the isoforms without RIs for the same gene) reaches to or larger than 50/50. In this case, the RIs and CSIs can be clearly separated by the Random Forest with an outstanding performance of 0.95 on AUC (the area under a receiver operating characteristics curve). The closely related characteristics to the RIs include the low strength of splice sites, high similarity with the flanking exon sequences, low occurrence percentage of YTRAY near the acceptor site, existence of putative intronic splicing silencers (ISSs, i.e., AG/GA-rich motifs) and intronic splicing enhancers (ISEs, i.e., TTTT-containing motifs), and enrichment of Serine/Arginine-Rich (SR) proteins and heterogeneous nuclear ribonucleoparticle proteins (hnRNPs).
Collapse
Affiliation(s)
- Rui Mao
- College of Information Engineering, Northwest A&F University, Yangling, China
| | - Chun Liang
- Department of Biology, Miami University, Oxford, OH, United States
- Department of Computer Sciences and Software Engineering, Miami University, Oxford, OH, United States
| | - Yang Zhang
- College of Information Engineering, Northwest A&F University, Yangling, China
| | - Xingan Hao
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Plant Protection, Northwest A&F University, Yangling, China
| | - Jinyan Li
- Advanced Analytics Institute, University of Technology Sydney, Sydney, NSW, Australia
| |
Collapse
|
21
|
Ranwez V, Serra A, Pot D, Chantret N. Domestication reduces alternative splicing expression variations in sorghum. PLoS One 2017; 12:e0183454. [PMID: 28886042 PMCID: PMC5590825 DOI: 10.1371/journal.pone.0183454] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Accepted: 08/06/2017] [Indexed: 01/09/2023] Open
Abstract
Domestication is known to strongly reduce genomic diversity through population bottlenecks. The resulting loss of polymorphism has been thoroughly documented in numerous cultivated species. Here we investigate the impact of domestication on the diversity of alternative transcript expressions using RNAseq data obtained on cultivated and wild sorghum accessions (ten accessions for each pool). In that aim, we focus on genes expressing two isoforms in sorghum and estimate the ratio between expression levels of those isoforms in each accession. Noticeably, for a given gene, one isoform can either be overexpressed or underexpressed in some wild accessions, whereas in the cultivated accessions, the balance between the two isoforms of the same gene appears to be much more homogenous. Indeed, we observe in sorghum significantly more variation in isoform expression balance among wild accessions than among domesticated accessions. The possibility exists that the loss of nucleotide diversity due to domestication could affect regulatory elements, controlling transcription or degradation of these isoforms. Impact on the isoform expression balance is discussed. As far as we know, this is the first time that the impact of domestication on transcript isoform balance has been studied at the genomic scale. This could pave the way towards the identification of key domestication genes with finely tuned isoform expressions in domesticated accessions while being highly variable in their wild relatives.
Collapse
Affiliation(s)
| | - Audrey Serra
- Montpellier SupAgro, UMR AGAP, Montpellier, France
| | - David Pot
- CIRAD, UMR AGAP, Montpellier, France
| | | |
Collapse
|
22
|
Clavijo BJ, Venturini L, Schudoma C, Accinelli GG, Kaithakottil G, Wright J, Borrill P, Kettleborough G, Heavens D, Chapman H, Lipscombe J, Barker T, Lu FH, McKenzie N, Raats D, Ramirez-Gonzalez RH, Coince A, Peel N, Percival-Alwyn L, Duncan O, Trösch J, Yu G, Bolser DM, Namaati G, Kerhornou A, Spannagl M, Gundlach H, Haberer G, Davey RP, Fosker C, Palma FD, Phillips AL, Millar AH, Kersey PJ, Uauy C, Krasileva KV, Swarbreck D, Bevan MW, Clark MD. An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations. Genome Res 2017; 27:885-896. [PMID: 28420692 PMCID: PMC5411782 DOI: 10.1101/gr.217117.116] [Citation(s) in RCA: 244] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 03/14/2017] [Indexed: 01/16/2023]
Abstract
Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Tom Barker
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | - Fu-Hao Lu
- John Innes Centre, Norwich, NR4 7UH, United Kingdom
| | | | - Dina Raats
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | | | | | - Ned Peel
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | | | - Owen Duncan
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Josua Trösch
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Guotai Yu
- John Innes Centre, Norwich, NR4 7UH, United Kingdom
| | - Dan M Bolser
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Guy Namaati
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Arnaud Kerhornou
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Manuel Spannagl
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Heidrun Gundlach
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Georg Haberer
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Robert P Davey
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - Federica Di Palma
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - A Harvey Millar
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Paul J Kersey
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | | | - Ksenia V Krasileva
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
- The Sainsbury Laboratory, Norwich, NR4 7UH, United Kingdom
| | - David Swarbreck
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - Matthew D Clark
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| |
Collapse
|
23
|
Yi H, Li X, Lee SH, Nou IS, Lim YP, Hur Y. Natural variation in CIRCADIAN CLOCK ASSOCIATED 1 is associated with flowering time in Brassica rapa. Genome 2016; 60:402-413. [PMID: 28177832 DOI: 10.1139/gen-2016-0052] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Flowering time is a very important agronomic trait and the development of molecular markers associated with this trait can facilitate crop breeding. CIRCADIAN CLOCK ASSOCIATED 1 (CCA1), a core oscillator component of circadian rhythms that affect metabolic pathways in plants, has been implicated in flowering time control in species of Brassica. CCA1 gene sequences from three Brassica rapa inbred lines, showing either early flowering or late flowering phenotypes, were analyzed and a high level of sequence variation was identified, especially within the fourth intron. Using this information, three PCR primer sets were designed and tested using various inbred lines of B. rapa. The usage of InDel markers was further validated by evaluation of flowering time and high resolution melting (HRM) analysis. Both methods, PCR and HRM, validated the use of newly developed markers. Additional sequence analyses of Brassica plants with diploid (AA, BB, or CC) and allotetraploid genomes further confirmed a large number of sequence polymorphisms in the CCA1 gene, including insertions/deletions in the fourth intron. Our results demonstrated that sequence variations in CCA1 can be used to develop valuable trait-related molecular markers for Brassica crop breeding.
Collapse
Affiliation(s)
- Hankuil Yi
- a Department of Biological Science, College of Bioscience and Biotechnology, Chungnam National University, Daejeon 305-764, Republic of Korea
| | - Xiaonan Li
- b Department of Horticulture, Chungnam National University, Gung-Dong, Yuseong-Gu, Daejeon 305-764, Republic of Korea.,d Department of Horticulture, Shenyang Agricultural University, Shenyang, China
| | - Seong Ho Lee
- b Department of Horticulture, Chungnam National University, Gung-Dong, Yuseong-Gu, Daejeon 305-764, Republic of Korea
| | - Ill-Sup Nou
- c Department of Horticulture, Sunchon National University, Suncheon, Jeonnam, Republic of Korea
| | - Yong Pyo Lim
- b Department of Horticulture, Chungnam National University, Gung-Dong, Yuseong-Gu, Daejeon 305-764, Republic of Korea
| | - Yoonkang Hur
- a Department of Biological Science, College of Bioscience and Biotechnology, Chungnam National University, Daejeon 305-764, Republic of Korea
| |
Collapse
|
24
|
Alanazi IO, Ebrahimie E. Computational Systems Biology Approach Predicts Regulators and Targets of microRNAs and Their Genomic Hotspots in Apoptosis Process. Mol Biotechnol 2016; 58:460-79. [DOI: 10.1007/s12033-016-9938-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
25
|
Genome-wide analysis of shoot growth-associated alternative splicing in moso bamboo. Mol Genet Genomics 2016; 291:1695-714. [DOI: 10.1007/s00438-016-1212-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 04/28/2016] [Indexed: 10/21/2022]
|