1
|
Gill T, Gill SK, Saini DK, Chopra Y, de Koff JP, Sandhu KS. A Comprehensive Review of High Throughput Phenotyping and Machine Learning for Plant Stress Phenotyping. PHENOMICS (CHAM, SWITZERLAND) 2022; 2:156-183. [PMID: 36939773 PMCID: PMC9590503 DOI: 10.1007/s43657-022-00048-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 01/29/2022] [Accepted: 02/11/2022] [Indexed: 02/04/2023]
Abstract
During the last decade, there has been rapid adoption of ground and aerial platforms with multiple sensors for phenotyping various biotic and abiotic stresses throughout the developmental stages of the crop plant. High throughput phenotyping (HTP) involves the application of these tools to phenotype the plants and can vary from ground-based imaging to aerial phenotyping to remote sensing. Adoption of these HTP tools has tried to reduce the phenotyping bottleneck in breeding programs and help to increase the pace of genetic gain. More specifically, several root phenotyping tools are discussed to study the plant's hidden half and an area long neglected. However, the use of these HTP technologies produces big data sets that impede the inference from those datasets. Machine learning and deep learning provide an alternative opportunity for the extraction of useful information for making conclusions. These are interdisciplinary approaches for data analysis using probability, statistics, classification, regression, decision theory, data visualization, and neural networks to relate information extracted with the phenotypes obtained. These techniques use feature extraction, identification, classification, and prediction criteria to identify pertinent data for use in plant breeding and pathology activities. This review focuses on the recent findings where machine learning and deep learning approaches have been used for plant stress phenotyping with data being collected using various HTP platforms. We have provided a comprehensive overview of different machine learning and deep learning tools available with their potential advantages and pitfalls. Overall, this review provides an avenue for studying various HTP platforms with particular emphasis on using the machine learning and deep learning tools for drawing legitimate conclusions. Finally, we propose the conceptual challenges being faced and provide insights on future perspectives for managing those issues.
Collapse
Affiliation(s)
- Taqdeer Gill
- Department of Agricultural and Environmental Sciences, Tennessee State University, Nashville, TN 37209 USA
| | - Simranveer K. Gill
- College of Agriculture, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| | - Dinesh K. Saini
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| | - Yuvraj Chopra
- College of Agriculture, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| | - Jason P. de Koff
- Department of Agricultural and Environmental Sciences, Tennessee State University, Nashville, TN 37209 USA
| | - Karansher S. Sandhu
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99163 USA
| |
Collapse
|
2
|
Modern Approaches for Transcriptome Analyses in Plants. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2021; 1346:11-50. [DOI: 10.1007/978-3-030-80352-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
3
|
Sircar S, Parekh N. Meta-analysis of drought-tolerant genotypes in Oryza sativa: A network-based approach. PLoS One 2019; 14:e0216068. [PMID: 31059518 PMCID: PMC6502313 DOI: 10.1371/journal.pone.0216068] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2018] [Accepted: 04/12/2019] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Drought is a severe environmental stress. It is estimated that about 50% of the world rice production is affected mainly by drought. Apart from conventional breeding strategies to develop drought-tolerant crops, innovative computational approaches may provide insights into the underlying molecular mechanisms of stress response and identify drought-responsive markers. Here we propose a network-based computational approach involving a meta-analytic study of seven drought-tolerant rice genotypes under drought stress. RESULTS Co-expression networks enable large-scale analysis of gene-pair associations and tightly coupled clusters that may represent coordinated biological processes. Considering differentially expressed genes in the co-expressed modules and supplementing external information such as resistance/tolerance QTLs, transcription factors, network-based topological measures, we identify and prioritize drought-adaptive co-expressed gene modules and potential candidate genes. Using the candidate genes that are well-represented across the datasets as 'seed' genes, two drought-specific protein-protein interaction networks (PPINs) are constructed with up- and down-regulated genes. Cluster analysis of the up-regulated PPIN revealed ABA signalling pathway as a central process in drought response with a probable crosstalk with energy metabolic processes. Tightly coupled gene clusters representing up-regulation of core cellular respiratory processes and enhanced degradation of branched chain amino acids and cell wall metabolism are identified. Cluster analysis of down-regulated PPIN provides a snapshot of major processes associated with photosynthesis, growth, development and protein synthesis, most of which are shut down during drought. Differential regulation of phytohormones, e.g., jasmonic acid, cell wall metabolism, signalling and posttranslational modifications associated with biotic stress are elucidated. Functional characterization of topologically important, drought-responsive uncharacterized genes that may play a role in important processes such as ABA signalling, calcium signalling, photosynthesis and cell wall metabolism is discussed. Further transgenic studies on these genes may help in elucidating their biological role under stress conditions. CONCLUSION Currently, a large number of resources for rice functional genomics exist which are mostly underutilized by the scientific community. In this study, a computational approach integrating information from various resources such as gene co-expression networks, protein-protein interactions and pathway-level information is proposed to provide a systems-level view of complex drought-responsive processes across the drought-tolerant genotypes.
Collapse
Affiliation(s)
- Sanchari Sircar
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, India
| | - Nita Parekh
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, India
- * E-mail:
| |
Collapse
|
4
|
Identification of cancer subtypes by integrating multiple types of transcriptomics data with deep learning in breast cancer. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.03.072] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
5
|
Mao R, Liang C, Zhang Y, Hao X, Li J. 50/50 Expressional Odds of Retention Signifies the Distinction between Retained Introns and Constitutively Spliced Introns in Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2017; 8:1728. [PMID: 29062321 PMCID: PMC5640774 DOI: 10.3389/fpls.2017.01728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Accepted: 09/21/2017] [Indexed: 05/23/2023]
Abstract
Intron retention, one of the most prevalent alternative splicing events in plants, can lead to introns retained in mature mRNAs. However, in comparison with constitutively spliced introns (CSIs), the relevantly distinguishable features for retained introns (RIs) are still poorly understood. This work proposes a computational pipeline to discover novel RIs from multiple next-generation RNA sequencing (RNA-Seq) datasets of Arabidopsis thaliana. Using this pipeline, we detected 3,472 novel RIs from 18 RNA-Seq datasets and re-confirmed 1,384 RIs which are currently annotated in the TAIR10 database. We also use the expression of intron-containing isoforms as a new feature in addition to the conventional features. Based on these features, RIs are highly distinguishable from CSIs by machine learning methods, especially when the expressional odds of retention (i.e., the expression ratio of the RI-containing isoforms relative to the isoforms without RIs for the same gene) reaches to or larger than 50/50. In this case, the RIs and CSIs can be clearly separated by the Random Forest with an outstanding performance of 0.95 on AUC (the area under a receiver operating characteristics curve). The closely related characteristics to the RIs include the low strength of splice sites, high similarity with the flanking exon sequences, low occurrence percentage of YTRAY near the acceptor site, existence of putative intronic splicing silencers (ISSs, i.e., AG/GA-rich motifs) and intronic splicing enhancers (ISEs, i.e., TTTT-containing motifs), and enrichment of Serine/Arginine-Rich (SR) proteins and heterogeneous nuclear ribonucleoparticle proteins (hnRNPs).
Collapse
Affiliation(s)
- Rui Mao
- College of Information Engineering, Northwest A&F University, Yangling, China
| | - Chun Liang
- Department of Biology, Miami University, Oxford, OH, United States
- Department of Computer Sciences and Software Engineering, Miami University, Oxford, OH, United States
| | - Yang Zhang
- College of Information Engineering, Northwest A&F University, Yangling, China
| | - Xingan Hao
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Plant Protection, Northwest A&F University, Yangling, China
| | - Jinyan Li
- Advanced Analytics Institute, University of Technology Sydney, Sydney, NSW, Australia
| |
Collapse
|
6
|
Badr E, ElHefnawi M, Heath LS. Computational Identification of Tissue-Specific Splicing Regulatory Elements in Human Genes from RNA-Seq Data. PLoS One 2016; 11:e0166978. [PMID: 27861625 PMCID: PMC5115852 DOI: 10.1371/journal.pone.0166978] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Accepted: 11/07/2016] [Indexed: 12/24/2022] Open
Abstract
Alternative splicing is a vital process for regulating gene expression and promoting proteomic diversity. It plays a key role in tissue-specific expressed genes. This specificity is mainly regulated by splicing factors that bind to specific sequences called splicing regulatory elements (SREs). Here, we report a genome-wide analysis to study alternative splicing on multiple tissues, including brain, heart, liver, and muscle. We propose a pipeline to identify differential exons across tissues and hence tissue-specific SREs. In our pipeline, we utilize the DEXSeq package along with our previously reported algorithms. Utilizing the publicly available RNA-Seq data set from the Human BodyMap project, we identified 28,100 differentially used exons across the four tissues. We identified tissue-specific exonic splicing enhancers that overlap with various previously published experimental and computational databases. A complicated exonic enhancer regulatory network was revealed, where multiple exonic enhancers were found across multiple tissues while some were found only in specific tissues. Putative combinatorial exonic enhancers and silencers were discovered as well, which may be responsible for exon inclusion or exclusion across tissues. Some of the exonic enhancers are found to be co-occurring with multiple exonic silencers and vice versa, which demonstrates a complicated relationship between tissue-specific exonic enhancers and silencers.
Collapse
Affiliation(s)
- Eman Badr
- Department of Information Technology, Faculty of Computers and Information, Cairo University, Giza, Egypt
- * E-mail:
| | - Mahmoud ElHefnawi
- Center of Excellence for Advanced Sciences, Informatics and Systems Department, National Research Center, Cairo, Egypt
- Center for Informatics Science, Nile University, Sheikh Zayed City, Egypt
| | - Lenwood S. Heath
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia, United States of America
| |
Collapse
|
7
|
Kahles A, Ong CS, Zhong Y, Rätsch G. SplAdder: identification, quantification and testing of alternative splicing events from RNA-Seq data. Bioinformatics 2016; 32:1840-7. [PMID: 26873928 PMCID: PMC4908322 DOI: 10.1093/bioinformatics/btw076] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Revised: 12/18/2015] [Accepted: 02/04/2016] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Understanding the occurrence and regulation of alternative splicing (AS) is a key task towards explaining the regulatory processes that shape the complex transcriptomes of higher eukaryotes. With the advent of high-throughput sequencing of RNA (RNA-Seq), the diversity of AS transcripts could be measured at an unprecedented depth. Although the catalog of known AS events has grown ever since, novel transcripts are commonly observed when working with less well annotated organisms, in the context of disease, or within large populations. Whereas an identification of complete transcripts is technically challenging and computationally expensive, focusing on single splicing events as a proxy for transcriptome characteristics is fruitful and sufficient for a wide range of analyses. RESULTS We present SplAdder, an alternative splicing toolbox, that takes RNA-Seq alignments and an annotation file as input to (i) augment the annotation based on RNA-Seq evidence, (ii) identify alternative splicing events present in the augmented annotation graph, (iii) quantify and confirm these events based on the RNA-Seq data and (iv) test for significant quantitative differences between samples. Thereby, our main focus lies on performance, accuracy and usability. AVAILABILITY Source code and documentation are available for download at http://github.com/ratschlab/spladder Example data, introductory information and a small tutorial are accessible via http://bioweb.me/spladder CONTACTS : andre.kahles@ratschlab.org or gunnar.ratsch@ratschlab.org SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- André Kahles
- Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA and
| | - Cheng Soon Ong
- Canberra Research Laboratory, NICTA, Canberra, ACT 2601, Australia
| | - Yi Zhong
- Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA and
| | - Gunnar Rätsch
- Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA and
| |
Collapse
|
8
|
Badr E, Heath LS. Identifying splicing regulatory elements with de Bruijn graphs. J Comput Biol 2015; 21:880-97. [PMID: 25393830 DOI: 10.1089/cmb.2014.0183] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Splicing regulatory elements (SREs) are short, degenerate sequences on pre-mRNA molecules that enhance or inhibit the splicing process via the binding of splicing factors, proteins that regulate the functioning of the spliceosome. Existing methods for identifying SREs in a genome are either experimental or computational. Here, we propose a formalism based on de Bruijn graphs that combines genomic structure, word count enrichment analysis, and experimental evidence to identify SREs found in exons. In our approach, SREs are not restricted to a fixed length (i.e., k-mers, for a fixed k). As a result, we identify 2001 putative exonic enhancers and 3080 putative exonic silencers for human genes, with lengths varying from 6 to 15 nucleotides. Many of the predicted SREs overlap with experimentally verified binding sites. Our model provides a novel method to predict variable length putative regulatory elements computationally for further experimental investigation.
Collapse
Affiliation(s)
- Eman Badr
- Department of Computer Science, Virginia Tech , Blacksburg, Virginia
| | | |
Collapse
|
9
|
Guerra D, Crosatti C, Khoshro HH, Mastrangelo AM, Mica E, Mazzucotelli E. Post-transcriptional and post-translational regulations of drought and heat response in plants: a spider's web of mechanisms. FRONTIERS IN PLANT SCIENCE 2015; 6:57. [PMID: 25717333 PMCID: PMC4324062 DOI: 10.3389/fpls.2015.00057] [Citation(s) in RCA: 101] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Accepted: 01/22/2015] [Indexed: 05/14/2023]
Abstract
Drought and heat tolerance are complex quantitative traits. Moreover, the adaptive significance of some stress-related traits is more related to plant survival than to agronomic performance. A web of regulatory mechanisms fine-tunes the expression of stress-related traits and integrates both environmental and developmental signals. Both post-transcriptional and post-translational modifications contribute substantially to this network with a pivotal regulatory function of the transcriptional changes related to cellular and plant stress response. Alternative splicing and RNA-mediated silencing control the amount of specific transcripts, while ubiquitin and SUMO modify activity, sub-cellular localization and half-life of proteins. Interactions across these modification mechanisms ensure temporally and spatially appropriate patterns of downstream-gene expression. For key molecular components of these regulatory mechanisms, natural genetic diversity exists among genotypes with different behavior in terms of stress tolerance, with effects upon the expression of adaptive morphological and/or physiological target traits.
Collapse
Affiliation(s)
- Davide Guerra
- Genomics Research Centre, Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Fiorenzuola d’Arda, Piacenza, Italy
| | - Cristina Crosatti
- Genomics Research Centre, Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Fiorenzuola d’Arda, Piacenza, Italy
| | - Hamid H. Khoshro
- Department of Agronomy and Plant Breeding, Ilam University, Ilam, Iran
| | - Anna M. Mastrangelo
- Cereal Research Centre, Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Foggia, Italy
| | - Erica Mica
- Genomics Research Centre, Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Fiorenzuola d’Arda, Piacenza, Italy
| | - Elisabetta Mazzucotelli
- Genomics Research Centre, Consiglio per la Ricerca e la Sperimentazione in Agricoltura, Fiorenzuola d’Arda, Piacenza, Italy
| |
Collapse
|
10
|
Mao R, Raj Kumar PK, Guo C, Zhang Y, Liang C. Comparative analyses between retained introns and constitutively spliced introns in Arabidopsis thaliana using random forest and support vector machine. PLoS One 2014; 9:e104049. [PMID: 25110928 PMCID: PMC4128822 DOI: 10.1371/journal.pone.0104049] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2014] [Accepted: 07/06/2014] [Indexed: 01/04/2023] Open
Abstract
One of the important modes of pre-mRNA post-transcriptional modification is alternative splicing. Alternative splicing allows creation of many distinct mature mRNA transcripts from a single gene by utilizing different splice sites. In plants like Arabidopsis thaliana, the most common type of alternative splicing is intron retention. Many studies in the past focus on positional distribution of retained introns (RIs) among different genic regions and their expression regulations, while little systematic classification of RIs from constitutively spliced introns (CSIs) has been conducted using machine learning approaches. We used random forest and support vector machine (SVM) with radial basis kernel function (RBF) to differentiate these two types of introns in Arabidopsis. By comparing coordinates of introns of all annotated mRNAs from TAIR10, we obtained our high-quality experimental data. To distinguish RIs from CSIs, We investigated the unique characteristics of RIs in comparison with CSIs and finally extracted 37 quantitative features: local and global nucleotide sequence features of introns, frequent motifs, the signal strength of splice sites, and the similarity between sequences of introns and their flanking regions. We demonstrated that our proposed feature extraction approach was more accurate in effectively classifying RIs from CSIs in comparison with other four approaches. The optimal penalty parameter C and the RBF kernel parameter in SVM were set based on particle swarm optimization algorithm (PSOSVM). Our classification performance showed F-Measure of 80.8% (random forest) and 77.4% (PSOSVM). Not only the basic sequence features and positional distribution characteristics of RIs were obtained, but also putative regulatory motifs in intron splicing were predicted based on our feature extraction approach. Clearly, our study will facilitate a better understanding of underlying mechanisms involved in intron retention.
Collapse
Affiliation(s)
- Rui Mao
- College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling, Shaanxi, China
- College of Information Engineering, Northwest A&F University, Yangling, Shaanxi, China
- Department of Biology, Miami University, Oxford, Ohio, United States of America
| | | | - Cheng Guo
- Department of Biology, Miami University, Oxford, Ohio, United States of America
| | - Yang Zhang
- College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling, Shaanxi, China
- College of Information Engineering, Northwest A&F University, Yangling, Shaanxi, China
- * E-mail: (YZ); (CL)
| | - Chun Liang
- Department of Biology, Miami University, Oxford, Ohio, United States of America
- Department of Computer Sciences and Software Engineering, Miami University, Oxford, Ohio, United States of America
- * E-mail: (YZ); (CL)
| |
Collapse
|
11
|
Genome-wide analysis of alternative splicing of pre-mRNA under salt stress in Arabidopsis. BMC Genomics 2014; 15:431. [PMID: 24897929 PMCID: PMC4079960 DOI: 10.1186/1471-2164-15-431] [Citation(s) in RCA: 163] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2014] [Accepted: 05/29/2014] [Indexed: 02/05/2023] Open
Abstract
Background Alternative splicing (AS) of precursor mRNA (pre-mRNA) is an important gene regulation process that potentially regulates many physiological processes in plants, including the response to abiotic stresses such as salt stress. Results To analyze global changes in AS under salt stress, we obtained high-coverage (~200 times) RNA sequencing data from Arabidopsis thaliana seedlings that were treated with different concentrations of NaCl. We detected that ~49% of all intron-containing genes were alternatively spliced under salt stress, 10% of which experienced significant differential alternative splicing (DAS). Furthermore, AS increased significantly under salt stress compared with under unstressed conditions. We demonstrated that most DAS genes were not differentially regulated by salt stress, suggesting that AS may represent an independent layer of gene regulation in response to stress. Our analysis of functional categories suggested that DAS genes were associated with specific functional pathways, such as the pathways for the responses to stresses and RNA splicing. We revealed that serine/arginine-rich (SR) splicing factors were frequently and specifically regulated in AS under salt stresses, suggesting a complex loop in AS regulation for stress adaptation. We also showed that alternative splicing site selection (SS) occurred most frequently at 4 nucleotides upstream or downstream of the dominant sites and that exon skipping tended to link with alternative SS. Conclusions Our study provided a comprehensive view of AS under salt stress and revealed novel insights into the potential roles of AS in plant response to salt stress. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-431) contains supplementary material, which is available to authorized users.
Collapse
|
12
|
A novel computational method for the identification of plant alternative splice sites. Biochem Biophys Res Commun 2013; 431:221-4. [DOI: 10.1016/j.bbrc.2012.12.131] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2012] [Accepted: 12/27/2012] [Indexed: 11/23/2022]
|
13
|
Lemetre C, Zhang ZD. A brief introduction to tiling microarrays: principles, concepts, and applications. Methods Mol Biol 2013; 1067:3-19. [PMID: 23975782 DOI: 10.1007/978-1-62703-607-8_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Technological achievements have always contributed to the advancement of biomedical research. It has never been more so than in recent times, when the development and application of innovative cutting-edge technologies have transformed biology into a data-rich quantitative science. This stunning revolution in biology primarily ensued from the emergence of microarrays over two decades ago. The completion of whole-genome sequencing projects and the advance in microarray manufacturing technologies enabled the development of tiling microarrays, which gave unprecedented genomic coverage. Since their first description, several types of application of tiling arrays have emerged, each aiming to tackle a different biological problem. Although numerous algorithms have already been developed to analyze microarray data, new method development is still needed not only for better performance but also for integration of available microarray data sets, which without doubt constitute one of the largest collections of biological data ever generated. In this chapter we first introduce the principles behind the emergence and the development of tiling microarrays, and then discuss with some examples how they are used to investigate different biological problems.
Collapse
Affiliation(s)
- Christophe Lemetre
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA
| | | |
Collapse
|
14
|
Smith LM, Hartmann L, Drewe P, Bohnert R, Kahles A, Lanz C, Rätsch G. Multiple insert size paired-end sequencing for deconvolution of complex transcriptomes. RNA Biol 2012; 9:596-609. [PMID: 22614838 DOI: 10.4161/rna.19683] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Deep sequencing of transcriptomes allows quantitative and qualitative analysis of many RNA species in a sample, with parallel comparison of expression levels, splicing variants, natural antisense transcripts, RNA editing and transcriptional start and stop sites the ideal goal. By computational modeling, we show how libraries of multiple insert sizes combined with strand-specific, paired-end (SS-PE) sequencing can increase the information gained on alternative splicing, especially in higher eukaryotes. Despite the benefits of gaining SS-PE data with paired ends of varying distance, the standard Illumina protocol allows only non-strand-specific, paired-end sequencing with a single insert size. Here, we modify the Illumina RNA ligation protocol to allow SS-PE sequencing by using a custom pre-adenylated 3' adaptor. We generate parallel libraries with differing insert sizes to aid deconvolution of alternative splicing events and to characterize the extent and distribution of natural antisense transcription in C. elegans. Despite stringent requirements for detection of alternative splicing, our data increases the number of intron retention and exon skipping events annotated in the Wormbase genome annotations by 127% and 121%, respectively. We show that parallel libraries with a range of insert sizes increase transcriptomic information gained by sequencing and that by current established benchmarks our protocol gives competitive results with respect to library quality.
Collapse
Affiliation(s)
- Lisa M Smith
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | | | | | | | | | | | | |
Collapse
|
15
|
Mastrangelo AM, Marone D, Laidò G, De Leonardis AM, De Vita P. Alternative splicing: enhancing ability to cope with stress via transcriptome plasticity. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2012; 185-186:40-9. [PMID: 22325865 DOI: 10.1016/j.plantsci.2011.09.006] [Citation(s) in RCA: 164] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2011] [Revised: 09/18/2011] [Accepted: 09/20/2011] [Indexed: 05/20/2023]
Abstract
Alternative splicing is a mechanism for the regulation of gene expression that is widespread in higher eukaryotes. Genome-wide approaches, based on comparison of expressed and genomic sequences, on tiling arrays, and on next-generation sequencing, have provided growing knowledge of the extent, distribution and association of alternative splicing with stress-related genes in plants. The functional meaning of alternative splicing in response to stress has been defined for many genes, and in particular for those involved in the regulation of the stress responses, such as protein kinases, transcription factors, splicing regulators and pathogen-resistance genes. The production of proteins with diverse domain rearrangements from the same gene is the main alternative splicing mechanism for pathogen-resistance genes. The plant response to abiotic stress is also characterized by a second mechanism, which consists of the expression of alternative transcripts that are targeted to nonsense-mediated decay. These quantitatively regulate stress-related gene expression. Many alternative splicing events are well conserved among plant species, and also across kingdoms, especially those observed in response to stress, for genes encoding splicing regulators, and other classes of RNA-binding proteins. Nevertheless, non-conserved events indicate that alternative splicing represents an evolutionary strategy that rapidly increases genome plasticity and develops new gene functions, along with other mechanisms such as gene duplication. Finally, the study of the naturally occurring variability of alternative splicing and the identification of genomic regions involved in the regulation of alternative splicing in crops are proposed as strategies for selecting genotypes with superior performance under adverse environmental conditions.
Collapse
|