1
|
Gao R, Lu Y, Wu N, Liu H, Jin X. Comprehensive study of serine/arginine-rich (SR) gene family in rice: characterization, evolution and expression analysis. PeerJ 2023; 11:e16193. [PMID: 37849832 PMCID: PMC10578304 DOI: 10.7717/peerj.16193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 09/06/2023] [Indexed: 10/19/2023] Open
Abstract
As important regulators of alternative splicing (AS) events, serine/arginine (SR)-rich proteins play indispensable roles in the growth and development of organisms. Until now, the study of SR genes has been lacking in plants. In the current study, we performed genome-wide analysis on the SR gene family in rice. A total of 24 OsSR genes were phylogenetically classified into seven groups, corresponding to seven subfamilies. The OsSR genes' structures, distribution of conserved domains, and protein tertiary structure of OsSR were conserved within each subfamily. The synteny analysis revealed that segmental duplication events were critical for the expansion of OsSR gene family. Moreover, interspecific synteny revealed the distribution of orthologous SR gene pairs between rice and Arabidopsis, sorghum, wheat, and maize. Among all OsSR genes, 14 genes exhibited NAGNAG acceptors, and only four OsSR genes had AS events on the NAGNAG acceptors. Furthermore, the distinct tissue-specific expression patterns of OsSR genes showed that these genes may function in different developmental stages in rice. The AS patterns on the same OsSR gene were variable among the root, stem, leaf, and grains at different filling stages, and some isoforms could only be detected in one or a few of tested tissues. Meanwhile, our results showed that the expression of some OsSR genes changed dramatically under ABA, GA, salt, drought, cold or heat treatment, which were related to the wide distribution of corresponding cis-elements in their promoter regions, suggesting their specific roles in stress and hormone response. This research facilitates our understanding of SR gene family in rice and provides clues for further exploration of the function of OsSR genes.
Collapse
Affiliation(s)
- Rui Gao
- Department of Agronomy, The Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou, China
| | - Yingying Lu
- Department of Agronomy, The Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou, China
| | - Nan Wu
- Department of Agronomy, The Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou, China
| | - Hui Liu
- Department of Agronomy, The Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou, China
| | - Xiaoli Jin
- Department of Agronomy, The Key Laboratory of Crop Germplasm Resource of Zhejiang Province, Zhejiang University, Hangzhou, China
| |
Collapse
|
2
|
Martín G, Márquez Y, Mantica F, Duque P, Irimia M. Alternative splicing landscapes in Arabidopsis thaliana across tissues and stress conditions highlight major functional differences with animals. Genome Biol 2021; 22:35. [PMID: 33446251 PMCID: PMC7807721 DOI: 10.1186/s13059-020-02258-y] [Citation(s) in RCA: 77] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 12/22/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Alternative splicing (AS) is a widespread regulatory mechanism in multicellular organisms. Numerous transcriptomic and single-gene studies in plants have investigated AS in response to specific conditions, especially environmental stress, unveiling substantial amounts of intron retention that modulate gene expression. However, a comprehensive study contrasting stress-response and tissue-specific AS patterns and directly comparing them with those of animal models is still missing. RESULTS We generate a massive resource for Arabidopsis thaliana, PastDB, comprising AS and gene expression quantifications across tissues, development and environmental conditions, including abiotic and biotic stresses. Harmonized analysis of these datasets reveals that A. thaliana shows high levels of AS, similar to fruitflies, and that, compared to animals, disproportionately uses AS for stress responses. We identify core sets of genes regulated specifically by either AS or transcription upon stresses or among tissues, a regulatory specialization that is tightly mirrored by the genomic features of these genes. Unexpectedly, non-intron retention events, including exon skipping, are overrepresented across regulated AS sets in A. thaliana, being also largely involved in modulating gene expression through NMD and uORF inclusion. CONCLUSIONS Non-intron retention events have likely been functionally underrated in plants. AS constitutes a distinct regulatory layer controlling gene expression upon internal and external stimuli whose target genes and master regulators are hardwired at the genomic level to specifically undergo post-transcriptional regulation. Given the higher relevance of AS in the response to different stresses when compared to animals, this molecular hardwiring is likely required for a proper environmental response in A. thaliana.
Collapse
Affiliation(s)
- Guiomar Martín
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156, Oeiras, Portugal.
| | - Yamile Márquez
- Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Dr. Aiguader, 88, Barcelona, 08003, Spain
| | - Federica Mantica
- Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Dr. Aiguader, 88, Barcelona, 08003, Spain
| | - Paula Duque
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156, Oeiras, Portugal
| | - Manuel Irimia
- Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Dr. Aiguader, 88, Barcelona, 08003, Spain. .,Universitat Pompeu Fabra, Dr. Aiguader, 88, Barcelona, 08003, Spain. .,ICREA, Passeig de Lluís Companys, 23, 08010, Barcelona, Spain.
| |
Collapse
|
3
|
Rapazote-Flores P, Bayer M, Milne L, Mayer CD, Fuller J, Guo W, Hedley PE, Morris J, Halpin C, Kam J, McKim SM, Zwirek M, Casao MC, Barakate A, Schreiber M, Stephen G, Zhang R, Brown JWS, Waugh R, Simpson CG. BaRTv1.0: an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq. BMC Genomics 2019; 20:968. [PMID: 31829136 DOI: 10.1186/s12864-019-6243-6247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/29/2019] [Indexed: 05/27/2023] Open
Abstract
BACKGROUND The time required to analyse RNA-seq data varies considerably, due to discrete steps for computational assembly, quantification of gene expression and splicing analysis. Recent fast non-alignment tools such as Kallisto and Salmon overcome these problems, but these tools require a high quality, comprehensive reference transcripts dataset (RTD), which are rarely available in plants. RESULTS A high-quality, non-redundant barley gene RTD and database (Barley Reference Transcripts - BaRTv1.0) has been generated. BaRTv1.0, was constructed from a range of tissues, cultivars and abiotic treatments and transcripts assembled and aligned to the barley cv. Morex reference genome (Mascher et al. Nature; 544: 427-433, 2017). Full-length cDNAs from the barley variety Haruna nijo (Matsumoto et al. Plant Physiol; 156: 20-28, 2011) determined transcript coverage, and high-resolution RT-PCR validated alternatively spliced (AS) transcripts of 86 genes in five different organs and tissue. These methods were used as benchmarks to select an optimal barley RTD. BaRTv1.0-Quantification of Alternatively Spliced Isoforms (QUASI) was also made to overcome inaccurate quantification due to variation in 5' and 3' UTR ends of transcripts. BaRTv1.0-QUASI was used for accurate transcript quantification of RNA-seq data of five barley organs/tissues. This analysis identified 20,972 significant differentially expressed genes, 2791 differentially alternatively spliced genes and 2768 transcripts with differential transcript usage. CONCLUSION A high confidence barley reference transcript dataset consisting of 60,444 genes with 177,240 transcripts has been generated. Compared to current barley transcripts, BaRTv1.0 transcripts are generally longer, have less fragmentation and improved gene models that are well supported by splice junction reads. Precise transcript quantification using BaRTv1.0 allows routine analysis of gene expression and AS.
Collapse
Affiliation(s)
- Paulo Rapazote-Flores
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Micha Bayer
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Linda Milne
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | | | - John Fuller
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Wenbin Guo
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Pete E Hedley
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Jenny Morris
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Claire Halpin
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Jason Kam
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present address: Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Gogerddan, Aberystwyth, Ceredigion, SY23 3EB, UK
| | - Sarah M McKim
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Monika Zwirek
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present Address: MRC Protein Phosphorylation and Ubiquitylation Unit, Sir James Black Centre, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, UK
| | - M Cristina Casao
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Abdellah Barakate
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Miriam Schreiber
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Gordon Stephen
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Runxuan Zhang
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - John W S Brown
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Robbie Waugh
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Craig G Simpson
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
| |
Collapse
|
4
|
Rapazote-Flores P, Bayer M, Milne L, Mayer CD, Fuller J, Guo W, Hedley PE, Morris J, Halpin C, Kam J, McKim SM, Zwirek M, Casao MC, Barakate A, Schreiber M, Stephen G, Zhang R, Brown JWS, Waugh R, Simpson CG. BaRTv1.0: an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq. BMC Genomics 2019; 20:968. [PMID: 31829136 PMCID: PMC6907147 DOI: 10.1186/s12864-019-6243-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/29/2019] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The time required to analyse RNA-seq data varies considerably, due to discrete steps for computational assembly, quantification of gene expression and splicing analysis. Recent fast non-alignment tools such as Kallisto and Salmon overcome these problems, but these tools require a high quality, comprehensive reference transcripts dataset (RTD), which are rarely available in plants. RESULTS A high-quality, non-redundant barley gene RTD and database (Barley Reference Transcripts - BaRTv1.0) has been generated. BaRTv1.0, was constructed from a range of tissues, cultivars and abiotic treatments and transcripts assembled and aligned to the barley cv. Morex reference genome (Mascher et al. Nature; 544: 427-433, 2017). Full-length cDNAs from the barley variety Haruna nijo (Matsumoto et al. Plant Physiol; 156: 20-28, 2011) determined transcript coverage, and high-resolution RT-PCR validated alternatively spliced (AS) transcripts of 86 genes in five different organs and tissue. These methods were used as benchmarks to select an optimal barley RTD. BaRTv1.0-Quantification of Alternatively Spliced Isoforms (QUASI) was also made to overcome inaccurate quantification due to variation in 5' and 3' UTR ends of transcripts. BaRTv1.0-QUASI was used for accurate transcript quantification of RNA-seq data of five barley organs/tissues. This analysis identified 20,972 significant differentially expressed genes, 2791 differentially alternatively spliced genes and 2768 transcripts with differential transcript usage. CONCLUSION A high confidence barley reference transcript dataset consisting of 60,444 genes with 177,240 transcripts has been generated. Compared to current barley transcripts, BaRTv1.0 transcripts are generally longer, have less fragmentation and improved gene models that are well supported by splice junction reads. Precise transcript quantification using BaRTv1.0 allows routine analysis of gene expression and AS.
Collapse
Affiliation(s)
- Paulo Rapazote-Flores
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Micha Bayer
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Linda Milne
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | | | - John Fuller
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Wenbin Guo
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Pete E Hedley
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Jenny Morris
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Claire Halpin
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Jason Kam
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present address: Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Gogerddan, Aberystwyth, Ceredigion, SY23 3EB, UK
| | - Sarah M McKim
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Monika Zwirek
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
- Present Address: MRC Protein Phosphorylation and Ubiquitylation Unit, Sir James Black Centre, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, UK
| | - M Cristina Casao
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Abdellah Barakate
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Miriam Schreiber
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Gordon Stephen
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - Runxuan Zhang
- Information and Computational Sciences, James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
| | - John W S Brown
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Robbie Waugh
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK
- Division of Plant Sciences, School of Life Sciences, University of Dundee at the James Hutton Institute, Dundee, DD2 5DA, UK
| | - Craig G Simpson
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
| |
Collapse
|
5
|
Li C, Zheng L, Zhang J, Lv Y, Liu J, Wang X, Palfalvi G, Wang G, Zhang Y. Characterization and functional analysis of four HYH splicing variants in Arabidopsis hypocotyl elongation. Gene 2017; 619:44-49. [DOI: 10.1016/j.gene.2017.04.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Revised: 03/30/2017] [Accepted: 04/03/2017] [Indexed: 11/29/2022]
|
6
|
Satyawan D, Kim MY, Lee S. Stochastic alternative splicing is prevalent in mungbean (Vigna radiata). PLANT BIOTECHNOLOGY JOURNAL 2017; 15:174-182. [PMID: 27400146 PMCID: PMC5258860 DOI: 10.1111/pbi.12600] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2016] [Revised: 06/10/2016] [Accepted: 07/05/2016] [Indexed: 05/20/2023]
Abstract
Alternative splicing (AS) can produce multiple mature mRNAs from the same primary transcript, thereby generating diverse proteins and phenotypes from the same gene. To assess the prevalence of AS in mungbean (Vigna radiata), we analysed whole-genome RNA sequencing data from root, leaf, flower and pod tissues and found that at least 37.9% of mungbean genes are subjected to AS. The number of AS transcripts exhibited a strong correlation with exon number and thus resembled a uniform probabilistic event rather than a specific regulatory function. The proportion of frameshift splicing was close to the expected frequency of random splicing. However, alternative donor and acceptor AS events tended to occur at multiples of three nucleotides (i.e. the codon length) from the main splice site. Genes with high exon number and expression level, which should have the most AS if splicing is purely stochastic, exhibited less AS, implying the existence of negative selection against excessive random AS. Functional AS is probably rare: a large proportion of AS isoforms exist at very low copy per cell on average or are expressed at much lower levels than default transcripts. Conserved AS was only detected in 629 genes (2.8% of all genes in the genome) when compared to Vigna angularis, and in 16 genes in more distant species like soya bean. These observations highlight the challenges of finding and cataloguing candidates for experimentally proven AS isoforms in a crop genome.
Collapse
Affiliation(s)
- Dani Satyawan
- Department of Plant Science and Research Institute of Agriculture and Life SciencesSeoul National UniversitySeoulKorea
- Indonesian Center for Agricultural Biotechnology and Genetic Resources Research and DevelopmentBogorIndonesia
| | - Moon Young Kim
- Department of Plant Science and Research Institute of Agriculture and Life SciencesSeoul National UniversitySeoulKorea
- Plant Genomics and Breeding InstituteSeoul National UniversitySeoulKorea
| | - Suk‐Ha Lee
- Department of Plant Science and Research Institute of Agriculture and Life SciencesSeoul National UniversitySeoulKorea
- Plant Genomics and Breeding InstituteSeoul National UniversitySeoulKorea
| |
Collapse
|
7
|
Li S, Papale LA, Zhang Q, Madrid A, Chen L, Chopra P, Keleş S, Jin P, Alisch RS. Genome-wide alterations in hippocampal 5-hydroxymethylcytosine links plasticity genes to acute stress. Neurobiol Dis 2015; 86:99-108. [PMID: 26598390 DOI: 10.1016/j.nbd.2015.11.010] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Accepted: 11/11/2015] [Indexed: 12/15/2022] Open
Abstract
Environmental stress is among the most important contributors to increased susceptibility to develop psychiatric disorders, including anxiety and post-traumatic stress disorder. While even acute stress alters gene expression, the molecular mechanisms underlying these changes remain largely unknown. 5-hydroxymethylcytosine (5hmC) is a novel environmentally sensitive DNA modification that is highly enriched in post-mitotic neurons and is associated with active transcription of neuronal genes. Recently, we found a hippocampal increase of 5hmC in the glucocorticoid receptor gene (Nr3c1) following acute stress, warranting a deeper investigation of stress-related 5hmC levels. Here we used an established chemical labeling and affinity purification method coupled with high-throughput sequencing technology to generate the first genome-wide profile of hippocampal 5hmC following exposure to acute restraint stress and a one-hour recovery. This approach found a genome-wide disruption in 5hmC associated with acute stress response, primarily in genic regions, and identified known and potentially novel stress-related targets that have a significant enrichment for neuronal ontological functions. Integration of these data with hippocampal gene expression data from these same mice found stress-related hydroxymethylation correlated to altered transcript levels and sequence motif predictions indicated that 5hmC may function by mediating transcription factor binding to these transcripts. Together, these data reveal an environmental impact on this newly discovered epigenetic mark in the brain and represent a critical step toward understanding stress-related epigenetic mechanisms that alter gene expression and can lead to the development of psychiatric disorders.
Collapse
Affiliation(s)
- Sisi Li
- Department of Psychiatry University of Wisconsin, Madison, WI, USA; Neuroscience Training Program, University of Wisconsin, Madison, WI, USA
| | - Ligia A Papale
- Department of Psychiatry University of Wisconsin, Madison, WI, USA
| | - Qi Zhang
- Department Statistics, Biostatistics, and Medical Informatics, University of Wisconsin, Madison, WI, USA
| | - Andy Madrid
- Department of Psychiatry University of Wisconsin, Madison, WI, USA; Endocrinology and Reproductive Physiology Training Program, University of Wisconsin, Madison, WI, USA
| | - Li Chen
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Pankaj Chopra
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Sündüz Keleş
- Department Statistics, Biostatistics, and Medical Informatics, University of Wisconsin, Madison, WI, USA
| | - Peng Jin
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
| | - Reid S Alisch
- Department of Psychiatry University of Wisconsin, Madison, WI, USA.
| |
Collapse
|
8
|
Yan X, Sablok G, Feng G, Ma J, Zhao H, Sun X. nagnag: Identification and quantification of NAGNAG alternative splicing using RNA-Seq data. FEBS Lett 2015; 589:1766-70. [PMID: 26028313 DOI: 10.1016/j.febslet.2015.05.029] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2015] [Revised: 05/04/2015] [Accepted: 05/12/2015] [Indexed: 12/28/2022]
Abstract
Regulation of proteome diversity by alternative splicing has been widely demonstrated in plants and animals. NAGNAG splicing, which was recently defined as a tissue specific event, results in the production of two distinct isoforms that are distinguished by three nucleotides (NAG) as a consequence of the intron proximal or distal to the splice site. Since the NAGNAG mechanism is not well characterized, tools for the identification and quantification of NAGNAG splicing events remain under-developed. Here we report nagnag, an R-based NAGNAG splicing detection tool, which accurately identifies and quantifies NAGNAG splicing events using RNA-Seq. Overall, nagnag produces user-friendly visualization reports and highlights differences between the DNA/RNA/protein across the identified isoforms of the reported gene. The package is available on https://sourceforge.net/projects/nagnag/files/; or http://genome.sdau.edu.cn/research/software/nagnag.html.
Collapse
Affiliation(s)
- Xiaoyan Yan
- Affiliated Hospital of Shandong University of Traditional Chinese Medicine, No. 42 Wenhua West Road, Jinan, Shandong 250011, China
| | - Gaurav Sablok
- Plant Functional Biology and Climate Change Cluster (C3), University of Technology Sydney, PO Box 123, Broadway, NSW 2007, Australia
| | - Gang Feng
- Preventive Medicine-Health and Biomedical Informatics, Northwestern University, Chicago, IL 60611, United States
| | - Jiaxin Ma
- Department of Plant Pathology, Nanjing Agricultural University, Weigang Road, Nanjing 210095, China
| | - Hongwei Zhao
- Department of Plant Pathology, Nanjing Agricultural University, Weigang Road, Nanjing 210095, China
| | - Xiaoyong Sun
- Agricultural Big-Data Research Center, College of Information Science and Engineering, Shandong Agricultural University, Taian, Shandong 271018, China.
| |
Collapse
|
9
|
Bao R, Huang L, Andrade J, Tan W, Kibbe WA, Jiang H, Feng G. Review of current methods, applications, and data management for the bioinformatics analysis of whole exome sequencing. Cancer Inform 2014; 13:67-82. [PMID: 25288881 PMCID: PMC4179624 DOI: 10.4137/cin.s13779] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2014] [Revised: 07/06/2014] [Accepted: 07/07/2014] [Indexed: 12/21/2022] Open
Abstract
The advent of next-generation sequencing technologies has greatly promoted advances in the study of human diseases at the genomic, transcriptomic, and epigenetic levels. Exome sequencing, where the coding region of the genome is captured and sequenced at a deep level, has proven to be a cost-effective method to detect disease-causing variants and discover gene targets. In this review, we outline the general framework of whole exome sequence data analysis. We focus on established bioinformatics tools and applications that support five analytical steps: raw data quality assessment, pre-processing, alignment, post-processing, and variant analysis (detection, annotation, and prioritization). We evaluate the performance of open-source alignment programs and variant calling tools using simulated and benchmark datasets, and highlight the challenges posed by the lack of concordance among variant detection tools. Based on these results, we recommend adopting multiple tools and resources to reduce false positives and increase the sensitivity of variant calling. In addition, we briefly discuss the current status and solutions for big data management, analysis, and summarization in the field of bioinformatics.
Collapse
Affiliation(s)
- Riyue Bao
- Center for Research Informatics, The University of Chicago, Chicago, IL, USA
| | - Lei Huang
- Center for Research Informatics, The University of Chicago, Chicago, IL, USA
| | - Jorge Andrade
- Center for Research Informatics, The University of Chicago, Chicago, IL, USA
| | - Wei Tan
- IBM Thomas J. Watson Research Center, Yorktown Heights, New York, USA
| | - Warren A Kibbe
- Biomedical Informatics Center (NUBIC), Clinical and Translational Sciences Institute (NUCATS), Northwestern University, Chicago, IL, USA
| | - Hongmei Jiang
- Department of Statistics, Northwestern University, Evanston, IL, USA
| | - Gang Feng
- Biomedical Informatics Center (NUBIC), Clinical and Translational Sciences Institute (NUCATS), Northwestern University, Chicago, IL, USA
| |
Collapse
|
10
|
Computational evidence of NAGNAG alternative splicing in human large intergenic noncoding RNA. BIOMED RESEARCH INTERNATIONAL 2014; 2014:736798. [PMID: 24995327 PMCID: PMC4068082 DOI: 10.1155/2014/736798] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 05/08/2014] [Accepted: 05/21/2014] [Indexed: 11/23/2022]
Abstract
NAGNAG alternative splicing plays an essential role in biological processes and represents a highly adaptable system for posttranslational regulation of gene function. NAGNAG alternative splicing impacts a myriad of biological processes. Previous studies of NAGNAG largely focused on messenger RNA. To the best of our knowledge, this is the first study testing the hypothesis that NAGNAG alternative splicing is also operative in large intergenic noncoding RNA (lincRNA). The RNA-seq data sets from recent deep sequencing studies were queried to test our hypothesis. NAGNAG alternative splicing of human lincRNA was identified while querying two independent RNA-seq data sets. Within these datasets, 31 NAGNAG alternative splicing sites were identified in lincRNA. Notably, most exons of lincRNA containing NAGNAG acceptors were longer than those from protein-coding genes. Furthermore, presence of CAG coding appeared to participate in the splice site selection. Finally, expression of the isoforms of NAGNAG lincRNA exhibited tissue specificity. Together, this study improves our understanding of the NAGNAG alternative splicing in lincRNA.
Collapse
|