301
|
Liu G, Liu J, Zhang C, You X, Zhao T, Jiang J, Chen X, Zhang H, Yang H, Zhang D, Du C, Li J, Xu X. Physiological and RNA-seq analyses provide insights into the response mechanism of the Cf-10-mediated resistance to Cladosporium fulvum infection in tomato. PLANT MOLECULAR BIOLOGY 2018; 96:403-416. [PMID: 29383477 DOI: 10.1007/s11103-018-0706-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Accepted: 01/20/2018] [Indexed: 05/22/2023]
Abstract
Based on the physiological and RNA-seq analysis, some progress has been made in elucidating the Cf-10-mediated resistance responses to C. fulvum infection in tomato. GO and KEGG enrichment analysis revealed that the DEGs were significantly associated with defense-signaling pathways like oxidation-reduction processes, oxidoreductase activity and plant hormone signal transduction. Leaf mold, caused by the fungus Cladosporium fulvum, is one of the most common diseases affecting tomatoes worldwide. Cf series genes including Cf-2, Cf-4, Cf-5, Cf-9 and Cf-10 play very important roles in resisting tomato leaf mold. Understanding the molecular mechanism of Cf gene-mediated resistance is thus the key to facilitating genetic engineering of resistance to C. fulvum infection. Progress has been made in elucidating two Cf genes, Cf -19 and Cf -12, and how they mediate resistance responses to C. fulvum infection in tomato. However, the mechanism of the Cf-10- mediated resistance response is still unclear. In the present study, RNA-seq was used to analyze changes in the transcriptome at different stages of C. fulvum infection. A total of 2,242 differentially expressed genes (DEGs) responsive to C. fulvum between 0 and 16 days post infection (dpi) were identified, including 1,501 upregulated and 741 downregulated genes. The majority of DEGs were associated with defense-signaling pathways including oxidation-reduction processes, oxidoreductase activity and plant hormone signal transduction. Four DEGs associated with plant-pathogen interaction were uniquely activated in Cf-10 tomato and validated by qRT-PCR. In addition, physiological indicators including reactive oxygen species (ROS), superoxide dismutase (SOD), catalase (CAT) and peroxidase (POD) were measured at 0-21 dpi, and hormone expression [Jasmonic acid (JA) and salicylic acid (SA)] was estimated at 0 and 16 dpi to elucidate the mechanism of the Cf-10-mediated resistance response. C. fulvum infection induced the activities of POD, CAT and SOD, and decreased ROS levels. JA was determined to participate in the resistance response to C. fulvum during the initial infection period. The results of this study provide accountable evidence for the physiological and transcriptional regulation of the Cf-10-mediated resistance response to C. fulvum infection, facilitating further understanding of the molecular mechanism of Cf-10-mediated resistance to C. fulvum infection.
Collapse
Affiliation(s)
- Guan Liu
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Junfang Liu
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Chunli Zhang
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Xiaoqing You
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Tingting Zhao
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Jingbin Jiang
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Xiuling Chen
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - He Zhang
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Huanhuan Yang
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Dongye Zhang
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Chong Du
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Jingfu Li
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China
| | - Xiangyang Xu
- College of Horticulture and Landscape Architecuture, Northeast Agricultural University, Harbin, 150030, China.
| |
Collapse
|
302
|
Tardaguila M, de la Fuente L, Marti C, Pereira C, Pardo-Palacios FJ, Del Risco H, Ferrell M, Mellado M, Macchietto M, Verheggen K, Edelmann M, Ezkurdia I, Vazquez J, Tress M, Mortazavi A, Martens L, Rodriguez-Navarro S, Moreno-Manzano V, Conesa A. SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification. Genome Res 2018; 28:396-411. [PMID: 29440222 PMCID: PMC5848618 DOI: 10.1101/gr.222976.117] [Citation(s) in RCA: 224] [Impact Index Per Article: 37.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Accepted: 01/08/2018] [Indexed: 01/15/2023]
Abstract
High-throughput sequencing of full-length transcripts using long reads has paved the way for the discovery of thousands of novel transcripts, even in well-annotated mammalian species. The advances in sequencing technology have created a need for studies and tools that can characterize these novel variants. Here, we present SQANTI, an automated pipeline for the classification of long-read transcripts that can assess the quality of data and the preprocessing pipeline using 47 unique descriptors. We apply SQANTI to a neuronal mouse transcriptome using Pacific Biosciences (PacBio) long reads and illustrate how the tool is effective in characterizing and describing the composition of the full-length transcriptome. We perform extensive evaluation of ToFU PacBio transcripts by PCR to reveal that an important number of the novel transcripts are technical artifacts of the sequencing approach and that SQANTI quality descriptors can be used to engineer a filtering strategy to remove them. Most novel transcripts in this curated transcriptome are novel combinations of existing splice sites, resulting more frequently in novel ORFs than novel UTRs, and are enriched in both general metabolic and neural-specific functions. We show that these new transcripts have a major impact in the correct quantification of transcript levels by state-of-the-art short-read-based quantification algorithms. By comparing our iso-transcriptome with public proteomics databases, we find that alternative isoforms are elusive to proteogenomics detection. SQANTI allows the user to maximize the analytical outcome of long-read technologies by providing the tools to deliver quality-evaluated and curated full-length transcriptomes.
Collapse
Affiliation(s)
- Manuel Tardaguila
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, Genetics Institute, University of Florida, Gainesville, Florida 32611, USA
| | - Lorena de la Fuente
- Genomics of Gene Expression Laboratory, Centro de Investigaciones Principe Felipe (CIPF), 46012 Valencia, Spain
| | - Cristina Marti
- Genomics of Gene Expression Laboratory, Centro de Investigaciones Principe Felipe (CIPF), 46012 Valencia, Spain
| | - Cécile Pereira
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, Genetics Institute, University of Florida, Gainesville, Florida 32611, USA
| | | | - Hector Del Risco
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, Genetics Institute, University of Florida, Gainesville, Florida 32611, USA
| | - Marc Ferrell
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, Genetics Institute, University of Florida, Gainesville, Florida 32611, USA
| | | | - Marissa Macchietto
- Department of Developmental and Cell Biology, University of California, Irvine, California 92617, USA
| | - Kenneth Verheggen
- VIB-UGent Center for Medical Biotechnology, VIB, B-9000 Ghent, Belgium
- Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | - Mariola Edelmann
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, Genetics Institute, University of Florida, Gainesville, Florida 32611, USA
| | - Iakes Ezkurdia
- Centro Nacional de Investigaciones Cardiovasculares CNIC, 28029 Madrid, Spain
| | - Jesus Vazquez
- Centro Nacional de Investigaciones Cardiovasculares CNIC, 28029 Madrid, Spain
| | - Michael Tress
- Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California, Irvine, California 92617, USA
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, B-9000 Ghent, Belgium
- Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | - Susana Rodriguez-Navarro
- Gene Expression and mRNA Metabolism Laboratory, CSIC, IBV, 46010 Valencia, Spain
- Gene Expression and mRNA Metabolism Laboratory, CIPF, 46012 Valencia, Spain
| | | | - Ana Conesa
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, Genetics Institute, University of Florida, Gainesville, Florida 32611, USA
- Genomics of Gene Expression Laboratory, Centro de Investigaciones Principe Felipe (CIPF), 46012 Valencia, Spain
| |
Collapse
|
303
|
An D, Cao HX, Li C, Humbeck K, Wang W. Isoform Sequencing and State-of-Art Applications for Unravelling Complexity of Plant Transcriptomes. Genes (Basel) 2018; 9:genes9010043. [PMID: 29346292 PMCID: PMC5793194 DOI: 10.3390/genes9010043] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2017] [Revised: 12/30/2017] [Accepted: 01/15/2018] [Indexed: 12/12/2022] Open
Abstract
Single-molecule real-time (SMRT) sequencing developed by PacBio, also called third-generation sequencing (TGS), offers longer reads than the second-generation sequencing (SGS). Given its ability to obtain full-length transcripts without assembly, isoform sequencing (Iso-Seq) of transcriptomes by PacBio is advantageous for genome annotation, identification of novel genes and isoforms, as well as the discovery of long non-coding RNA (lncRNA). In addition, Iso-Seq gives access to the direct detection of alternative splicing, alternative polyadenylation (APA), gene fusion, and DNA modifications. Such applications of Iso-Seq facilitate the understanding of gene structure, post-transcriptional regulatory networks, and subsequently proteomic diversity. In this review, we summarize its applications in plant transcriptome study, specifically pointing out challenges associated with each step in the experimental design and highlight the development of bioinformatic pipelines. We aim to provide the community with an integrative overview and a comprehensive guidance to Iso-Seq, and thus to promote its applications in plant research.
Collapse
Affiliation(s)
- Dong An
- School of Agriculture and Biology, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, China.
| | - Hieu X Cao
- Institute of Biology/Plant Physiology, Martin-Luther-University of Halle-Wittenberg, Weinbergweg 10, 06120 Halle, Germany.
| | - Changsheng Li
- School of Agriculture and Biology, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, China.
| | - Klaus Humbeck
- Institute of Biology/Plant Physiology, Martin-Luther-University of Halle-Wittenberg, Weinbergweg 10, 06120 Halle, Germany.
| | - Wenqin Wang
- School of Agriculture and Biology, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, China.
| |
Collapse
|
304
|
Yeh HS, Zhang W, Yong J. Analyses of alternative polyadenylation: from old school biochemistry to high-throughput technologies. BMB Rep 2018; 50:201-207. [PMID: 28148393 PMCID: PMC5437964 DOI: 10.5483/bmbrep.2017.50.4.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Indexed: 01/08/2023] Open
Abstract
Alternations in usage of polyadenylation sites during transcription termination yield transcript isoforms from a gene. Recent findings of transcriptome-wide alternative polyadenylation (APA) as a molecular response to changes in biology position APA not only as a molecular event of early transcriptional termination but also as a cellular regulatory step affecting various biological pathways. With the development of high-throughput profiling technologies at a single nucleotide level and their applications targeted to the 3'-end of mRNAs, dynamics in the landscape of mRNA 3'-end is measureable at a global scale. In this review, methods and technologies that have been adopted to study APA events are discussed. In addition, various bioinformatics algorithms for APA isoform analysis using publicly available RNA-seq datasets are introduced. [BMB Reports 2017; 50(4): 201-207].
Collapse
Affiliation(s)
- Hsin-Sung Yeh
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Wei Zhang
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Jeongsik Yong
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
305
|
Akhter S, Kretzschmar WW, Nordal V, Delhomme N, Street NR, Nilsson O, Emanuelsson O, Sundström JF. Integrative Analysis of Three RNA Sequencing Methods Identifies Mutually Exclusive Exons of MADS-Box Isoforms During Early Bud Development in Picea abies. FRONTIERS IN PLANT SCIENCE 2018; 9:1625. [PMID: 30483285 PMCID: PMC6243048 DOI: 10.3389/fpls.2018.01625] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 10/18/2018] [Indexed: 05/06/2023]
Abstract
Recent efforts to sequence the genomes and transcriptomes of several gymnosperm species have revealed an increased complexity in certain gene families in gymnosperms as compared to angiosperms. One example of this is the gymnosperm sister clade to angiosperm TM3-like MADS-box genes, which at least in the conifer lineage has expanded in number of genes. We have previously identified a member of this sub-clade, the conifer gene DEFICIENS AGAMOUS LIKE 19 (DAL19), as being specifically upregulated in cone-setting shoots. Here, we show through Sanger sequencing of mRNA-derived cDNA and mapping to assembled conifer genomic sequences that DAL19 produces six mature mRNA splice variants in Picea abies. These splice variants use alternate first and last exons, while their four central exons constitute a core region present in all six transcripts. Thus, they are likely to be transcript isoforms. Quantitative Real-Time PCR revealed that two mutually exclusive first DAL19 exons are differentially expressed across meristems that will form either male or female cones, or vegetative shoots. Furthermore, mRNA in situ hybridization revealed that two mutually exclusive last DAL19 exons were expressed in a cell-specific pattern within bud meristems. Based on these findings in DAL19, we developed a sensitive approach to transcript isoform assembly from short-read sequencing of mRNA. We applied this method to 42 putative MADS-box core regions in P. abies, from which we assembled 1084 putative transcripts. We manually curated these transcripts to arrive at 933 assembled transcript isoforms of 38 putative MADS-box genes. 152 of these isoforms, which we assign to 28 putative MADS-box genes, were differentially expressed across eight female, male, and vegetative buds. We further provide evidence of the expression of 16 out of the 38 putative MADS-box genes by mapping PacBio Iso-Seq circular consensus reads derived from pooled sample sequencing to assembled transcripts. In summary, our analyses reveal the use of mutually exclusive exons of MADS-box gene isoforms during early bud development in P. abies, and we find that the large number of identified MADS-box transcripts in P. abies results not only from expansion of the gene family through gene duplication events but also from the generation of numerous splice variants.
Collapse
Affiliation(s)
- Shirin Akhter
- Linnean Center for Plant Biology, Uppsala BioCenter, Department of Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Warren W. Kretzschmar
- Science for Life Laboratory, Department of Gene Technology, School of Engineering Sciences in Biotechnology, Chemistry and Health, KTH Royal Institute of Technology, Solna, Sweden
| | - Veronika Nordal
- Linnean Center for Plant Biology, Uppsala BioCenter, Department of Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Nicolas Delhomme
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Nathaniel R. Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Ove Nilsson
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Olof Emanuelsson
- Science for Life Laboratory, Department of Gene Technology, School of Engineering Sciences in Biotechnology, Chemistry and Health, KTH Royal Institute of Technology, Solna, Sweden
| | - Jens F. Sundström
- Linnean Center for Plant Biology, Uppsala BioCenter, Department of Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
- *Correspondence: Jens F. Sundström,
| |
Collapse
|
306
|
McCormick RF, Truong SK, Sreedasyam A, Jenkins J, Shu S, Sims D, Kennedy M, Amirebrahimi M, Weers BD, McKinley B, Mattison A, Morishige DT, Grimwood J, Schmutz J, Mullet JE. The Sorghum bicolor reference genome: improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 93:338-354. [PMID: 29161754 DOI: 10.1111/tpj.13781] [Citation(s) in RCA: 285] [Impact Index Per Article: 47.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Revised: 11/05/2017] [Accepted: 11/14/2017] [Indexed: 05/20/2023]
Abstract
Sorghum bicolor is a drought tolerant C4 grass used for the production of grain, forage, sugar, and lignocellulosic biomass and a genetic model for C4 grasses due to its relatively small genome (approximately 800 Mbp), diploid genetics, diverse germplasm, and colinearity with other C4 grass genomes. In this study, deep sequencing, genetic linkage analysis, and transcriptome data were used to produce and annotate a high-quality reference genome sequence. Reference genome sequence order was improved, 29.6 Mbp of additional sequence was incorporated, the number of genes annotated increased 24% to 34 211, average gene length and N50 increased, and error frequency was reduced 10-fold to 1 per 100 kbp. Subtelomeric repeats with characteristics of Tandem Repeats in Miniature (TRIM) elements were identified at the termini of most chromosomes. Nucleosome occupancy predictions identified nucleosomes positioned immediately downstream of transcription start sites and at different densities across chromosomes. Alignment of more than 50 resequenced genomes from diverse sorghum genotypes to the reference genome identified approximately 7.4 M single nucleotide polymorphisms (SNPs) and 1.9 M indels. Large-scale variant features in euchromatin were identified with periodicities of approximately 25 kbp. A transcriptome atlas of gene expression was constructed from 47 RNA-seq profiles of growing and developed tissues of the major plant organs (roots, leaves, stems, panicles, and seed) collected during the juvenile, vegetative and reproductive phases. Analysis of the transcriptome data indicated that tissue type and protein kinase expression had large influences on transcriptional profile clustering. The updated assembly, annotation, and transcriptome data represent a resource for C4 grass research and crop improvement.
Collapse
Affiliation(s)
- Ryan F McCormick
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, 77843, USA
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Sandra K Truong
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, 77843, USA
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | | | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Shengqiang Shu
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - David Sims
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Megan Kennedy
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | - Brock D Weers
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Brian McKinley
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Ashley Mattison
- Interdisciplinary Program in Genetics, Texas A&M University, College Station, TX, 77843, USA
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Daryl T Morishige
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - John E Mullet
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843, USA
| |
Collapse
|
307
|
Ren P, Meng Y, Li B, Ma X, Si E, Lai Y, Wang J, Yao L, Yang K, Shang X, Wang H. Molecular Mechanisms of Acclimatization to Phosphorus Starvation and Recovery Underlying Full-Length Transcriptome Profiling in Barley ( Hordeum vulgare L.). FRONTIERS IN PLANT SCIENCE 2018; 9:500. [PMID: 29720989 PMCID: PMC5915550 DOI: 10.3389/fpls.2018.00500] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Accepted: 04/03/2018] [Indexed: 05/18/2023]
Abstract
A lack of phosphorus (P) in plants can severely constrain growth and development. Barley, one of the earliest domesticated crops, is extensively planted in poor soil around the world. To date, the molecular mechanisms of enduring low phosphorus, at the transcriptional level, in barley are still unclear. In the present study, two different barley genotypes (GN121 and GN42)-with contrasting phosphorus efficiency-were used to reveal adaptations to low phosphorus stress, at three time points, at the morphological, physiological, biochemical, and transcriptome level. GN121 growth was less affected by phosphorus starvation and recovery than that of GN42. The biomass and inorganic phosphorus concentration of GN121 and GN42 declined under the low phosphorus-induced stress and increased after recovery with normal phosphorus. However, the range of these parameters was higher in GN42 than in GN121. Subsequently, a more complete genome annotation was obtained by correcting with the data sequenced on Illumina HiSeq X 10 and PacBio RSII SMRT platform. A total of 6,182 and 5,270 differentially expressed genes (DEGs) were identified in GN121 and GN42, respectively. The majority of these DEGs were involved in phosphorus metabolism such as phospholipid degradation, hydrolysis of phosphoric enzymes, sucrose synthesis, phosphorylation/dephosphorylation and post-transcriptional regulation; expression of these genes was significantly different between GN121 and GN42. Specifically, six and seven DEGs were annotated as phosphorus transporters in roots and leaves, respectively. Furthermore, a putative model was constructed relying on key metabolic pathways related to phosphorus to illustrate the higher phosphorus efficiency of GN121 compared to GN42 under low phosphorus conditions. Results from this study provide a multi-transcriptome database and candidate genes for further study on phosphorus use efficiency (PUE).
Collapse
Affiliation(s)
- Panrong Ren
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Yaxiong Meng
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Baochun Li
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Life Sciences and Technology, Gansu Agricultural University, Lanzhou, China
| | - Xiaole Ma
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Erjing Si
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Yong Lai
- Department of Agriculture and Forestry, College of Agriculture and Animal Husbandry, Qinghai University, Xining, China
| | - Juncheng Wang
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Lirong Yao
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Ke Yang
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Xunwu Shang
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
| | - Huajun Wang
- Gansu Provincial Key Lab of Aridland Crop Science/Gansu Key Lab of Crop Improvement and Germplasm Enhancement, Gansu Agricultural University, Lanzhou, China
- College of Agronomy, Gansu Agricultural University, Lanzhou, China
- *Correspondence: Huajun Wang
| |
Collapse
|
308
|
Bayega A, Fahiminiya S, Oikonomopoulos S, Ragoussis J. Current and Future Methods for mRNA Analysis: A Drive Toward Single Molecule Sequencing. Methods Mol Biol 2018; 1783:209-241. [PMID: 29767365 DOI: 10.1007/978-1-4939-7834-2_11] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The transcriptome encompasses a range of species including messenger RNA, and other noncoding RNA such as rRNA, tRNA, and short and long noncoding RNAs. Due to the huge role played by mRNA in development and disease, several methods have been developed to sequence and characterize mRNA, with RNA sequencing (RNA-Seq) emerging as the current method of choice particularly for large high-throughput studies. Short-read RNA-Seq which involves sequencing of short cDNA fragments and computationally assembling them to reconstruct the transcriptome, or aligning them to a reference is the most widely used approach. However, due to inherent limitations of this approach in de novo transcriptome assembly and isoform quantification, long-read RNA-Seq approaches, which also happen to be single molecule sequencing approaches, are increasingly becoming the standard for de novo transcriptome assembly and isoform quantification. In this chapter, we review the technical aspects of the current methods of RNA-Seq, both short and long-read approaches, and data analysis methods available. We discuss recent advances in single-cell RNA-Seq and direct RNA-Seq approaches, which perhaps will dominate the future of RNA-Seq.
Collapse
Affiliation(s)
- Anthony Bayega
- McGill University and Genome Quebec Innovation Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | | | - Spyros Oikonomopoulos
- McGill University and Genome Quebec Innovation Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada
| | - Jiannis Ragoussis
- McGill University and Genome Quebec Innovation Centre, Department of Human Genetics, McGill University, Montréal, QC, Canada.
- Department of Bioengineering, McGill University, Montréal, QC, Canada.
- Cancer and Mutagen Unit, Department of Biochemistry, Center of Innovation in Personalized Medicine, King Fahd Center for Medical Research, King Abdulaziz University, Jeddah, Saudi Arabia.
| |
Collapse
|
309
|
Filichkin SA, Hamilton M, Dharmawardhana PD, Singh SK, Sullivan C, Ben-Hur A, Reddy ASN, Jaiswal P. Abiotic Stresses Modulate Landscape of Poplar Transcriptome via Alternative Splicing, Differential Intron Retention, and Isoform Ratio Switching. FRONTIERS IN PLANT SCIENCE 2018; 9:5. [PMID: 29483921 PMCID: PMC5816337 DOI: 10.3389/fpls.2018.00005] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Accepted: 01/03/2018] [Indexed: 05/19/2023]
Abstract
Abiotic stresses affect plant physiology, development, growth, and alter pre-mRNA splicing. Western poplar is a model woody tree and a potential bioenergy feedstock. To investigate the extent of stress-regulated alternative splicing (AS), we conducted an in-depth survey of leaf, root, and stem xylem transcriptomes under drought, salt, or temperature stress. Analysis of approximately one billion of genome-aligned RNA-Seq reads from tissue- or stress-specific libraries revealed over fifteen millions of novel splice junctions. Transcript models supported by both RNA-Seq and single molecule isoform sequencing (Iso-Seq) data revealed a broad array of novel stress- and/or tissue-specific isoforms. Analysis of Iso-Seq data also resulted in the discovery of 15,087 novel transcribed regions of which 164 show AS. Our findings demonstrate that abiotic stresses profoundly perturb transcript isoform profiles and trigger widespread intron retention (IR) events. Stress treatments often increased or decreased retention of specific introns - a phenomenon described here as differential intron retention (DIR). Many differentially retained introns were regulated in a stress- and/or tissue-specific manner. A subset of transcripts harboring super stress-responsive DIR events showed persisting fluctuations in the degree of IR across all treatments and tissue types. To investigate coordinated dynamics of intron-containing transcripts in the study we quantified absolute copy number of isoforms of two conserved transcription factors (TFs) using Droplet Digital PCR. This case study suggests that stress treatments can be associated with coordinated switches in relative ratios between fully spliced and intron-retaining isoforms and may play a role in adjusting transcriptome to abiotic stresses.
Collapse
Affiliation(s)
- Sergei A. Filichkin
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Michael Hamilton
- Department of Computer Science, Colorado State University, Fort Collins, CO, United States
| | | | - Sunil K. Singh
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
| | - Christopher Sullivan
- Center for Genome Research and Biocomputing, Oregon State University, Corvallis, OR, United States
| | - Asa Ben-Hur
- Department of Computer Science, Colorado State University, Fort Collins, CO, United States
| | - Anireddy S. N. Reddy
- Department of Biology and Program in Cell and Molecular Biology, Colorado State University, Fort Collins, CO, United States
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR, United States
- *Correspondence: Pankaj Jaiswal,
| |
Collapse
|
310
|
Wang M, Wang P, Liang F, Ye Z, Li J, Shen C, Pei L, Wang F, Hu J, Tu L, Lindsey K, He D, Zhang X. A global survey of alternative splicing in allopolyploid cotton: landscape, complexity and regulation. THE NEW PHYTOLOGIST 2018; 217:163-178. [PMID: 28892169 DOI: 10.1111/nph.14762] [Citation(s) in RCA: 113] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 07/25/2017] [Indexed: 05/21/2023]
Abstract
Alternative splicing (AS) is a crucial regulatory mechanism in eukaryotes, which acts by greatly increasing transcriptome diversity. The extent and complexity of AS has been revealed in model plants using high-throughput next-generation sequencing. However, this technique is less effective in accurately identifying transcript isoforms in polyploid species because of the high sequence similarity between coexisting subgenomes. Here we characterize AS in the polyploid species cotton. Using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq), we developed an integrated pipeline for Iso-Seq transcriptome data analysis (https://github.com/Nextomics/pipeline-for-isoseq). We identified 176 849 full-length transcript isoforms from 44 968 gene models and updated gene annotation. These data led us to identify 15 102 fibre-specific AS events and estimate that c. 51.4% of homoeologous genes produce divergent isoforms in each subgenome. We reveal that AS allows differential regulation of the same gene by miRNAs at the isoform level. We also show that nucleosome occupancy and DNA methylation play a role in defining exons at the chromatin level. This study provides new insights into the complexity and regulation of AS, and will enhance our understanding of AS in polyploid species. Our methodology for Iso-Seq data analysis will be a useful reference for the study of AS in other species.
Collapse
Affiliation(s)
- Maojun Wang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Pengcheng Wang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Fan Liang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Zhengxiu Ye
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Jianying Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Chao Shen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Liuling Pei
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Feng Wang
- Nextomics Biosciences, Wuhan, 430000, Hubei, China
| | - Jiang Hu
- Nextomics Biosciences, Wuhan, 430000, Hubei, China
| | - Lili Tu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Keith Lindsey
- Department of Biosciences, Durham University, Durham, DH1 3LE, UK
| | - Daohua He
- College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, China
| | - Xianlong Zhang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| |
Collapse
|
311
|
Dimitrakopoulos L, Prassas I, Diamandis EP, Charames GS. Onco-proteogenomics: Multi-omics level data integration for accurate phenotype prediction. Crit Rev Clin Lab Sci 2017; 54:414-432. [DOI: 10.1080/10408363.2017.1384446] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- Lampros Dimitrakopoulos
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada
| | - Ioannis Prassas
- Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, Toronto, ON, Canada
| | - Eleftherios P. Diamandis
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada
- Department of Clinical Biochemistry, University Health Network, Toronto, ON, Canada
| | - George S. Charames
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Department of Pathology and Laboratory Medicine, Mount Sinai Hospital, Joseph and Wolf Lebovic Health Complex, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada
| |
Collapse
|
312
|
De Quattro C, Pè ME, Bertolini E. Long noncoding RNAs in the model species Brachypodium distachyon. Sci Rep 2017; 7:11252. [PMID: 28900227 PMCID: PMC5595811 DOI: 10.1038/s41598-017-11206-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2017] [Accepted: 08/21/2017] [Indexed: 02/06/2023] Open
Abstract
Eukaryotic genomes are pervasively transcribed and only a small portion of the transcribed sequences belongs to protein coding genes. High-throughput sequencing technology contributed to consolidate this perspective, allowing the identification of numerous noncoding RNAs with key roles in biological processes. Long noncoding RNAs (lncRNAs) are transcripts longer than 200 nt with limited phylogenetic conservation, expressed at low levels and characterized by tissue/organ specific expression profiles. Although a large set of lncRNAs has been identified, the functional roles of lncRNAs are only beginning to be recognized and the molecular mechanism of lncRNA-mediated gene regulation remains largely unexplored, particularly in plants where their annotation and characterization are still incomplete. Using public and proprietary poly-(A)+ RNA-seq data as well as a collection of full length ESTs from several organs, developmental stages and stress conditions in three Brachypodium distachyon inbred lines, we describe the identification and the main features of thousands lncRNAs. Here we provide a genome-wide characterization of lncRNAs, highlighting their intraspecies conservation and describing their expression patterns among several organs/tissues and stress conditions. This work represents a fundamental resource to deepen our knowledge on long noncoding RNAs in C3 cereals, allowing the Brachypodium community to exploit these results in future research programs.
Collapse
Affiliation(s)
- Concetta De Quattro
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy
| | - Edoardo Bertolini
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà 33, 56127, Pisa, Italy.
- Donald Danforth Plant Science Center, 975 North Warson Road, St. Louis, MO, 63132, USA.
| |
Collapse
|
313
|
Long read reference genome-free reconstruction of a full-length transcriptome from Astragalus membranaceus reveals transcript variants involved in bioactive compound biosynthesis. Cell Discov 2017; 3:17031. [PMID: 28861277 PMCID: PMC5573880 DOI: 10.1038/celldisc.2017.31] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2017] [Accepted: 07/25/2017] [Indexed: 12/18/2022] Open
Abstract
Astragalus membranaceus, also known as Huangqi in China, is one of the most widely used medicinal herbs in Traditional Chinese Medicine. Traditional Chinese Medicine formulations from Astragalus membranaceus have been used to treat a wide range of illnesses, such as cardiovascular disease, type 2 diabetes, nephritis and cancers. Pharmacological studies have shown that immunomodulating, anti-hyperglycemic, anti-inflammatory, antioxidant and antiviral activities exist in the extract of Astragalus membranaceus. Therefore, characterising the biosynthesis of bioactive compounds in Astragalus membranaceus, such as Astragalosides, Calycosin and Calycosin-7-O-β-d-glucoside, is of particular importance for further genetic studies of Astragalus membranaceus. In this study, we reconstructed the Astragalus membranaceus full-length transcriptomes from leaf and root tissues using PacBio Iso-Seq long reads. We identified 27 975 and 22 343 full-length unique transcript models in each tissue respectively. Compared with previous studies that used short read sequencing, our reconstructed transcripts are longer, and are more likely to be full-length and include numerous transcript variants. Moreover, we also re-characterised and identified potential transcript variants of genes involved in Astragalosides, Calycosin and Calycosin-7-O-β-d-glucoside biosynthesis. In conclusion, our study provides a practical pipeline to characterise the full-length transcriptome for species without a reference genome and a useful genomic resource for exploring the biosynthesis of active compounds in Astragalus membranaceus.
Collapse
|
314
|
Evolutionarily Conserved Alternative Splicing Across Monocots. Genetics 2017; 207:465-480. [PMID: 28839042 DOI: 10.1534/genetics.117.300189] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 08/11/2017] [Indexed: 12/22/2022] Open
Abstract
One difficulty when identifying alternative splicing (AS) events in plants is distinguishing functional AS from splicing noise. One way to add confidence to the validity of a splice isoform is to observe that it is conserved across evolutionarily related species. We use a high throughput method to identify junction-based conserved AS events from RNA-Seq data across nine plant species, including five grass monocots (maize, sorghum, rice, Brachpodium, and foxtail millet), plus two nongrass monocots (banana and African oil palm), the eudicot Arabidopsis, and the basal angiosperm Amborella In total, 9804 AS events were found to be conserved between two or more species studied. In grasses containing large regions of conserved synteny, the frequency of conserved AS events is twice that observed for genes outside of conserved synteny blocks. In plant-specific RS and RS2Z subfamilies of the serine/arginine (SR) splice-factor proteins, we observe both conservation and divergence of AS events after the whole genome duplication in maize. In addition, plant-specific RS and RS2Z splice-factor subfamilies are highly connected with R2R3-MYB in STRING functional protein association networks built using genes exhibiting conserved AS. Furthermore, we discovered that functional protein association networks constructed around genes harboring conserved AS events are enriched for phosphatases, kinases, and ubiquitylation genes, which suggests that AS may participate in regulating signaling pathways. These data lay the foundation for identifying and studying conserved AS events in the monocots, particularly across grass species, and this conserved AS resource identifies an additional layer between genotype to phenotype that may impact future crop improvement efforts.
Collapse
|
315
|
Best MG, Sol N, In 't Veld SGJG, Vancura A, Muller M, Niemeijer ALN, Fejes AV, Tjon Kon Fat LA, Huis In 't Veld AE, Leurs C, Le Large TY, Meijer LL, Kooi IE, Rustenburg F, Schellen P, Verschueren H, Post E, Wedekind LE, Bracht J, Esenkbrink M, Wils L, Favaro F, Schoonhoven JD, Tannous J, Meijers-Heijboer H, Kazemier G, Giovannetti E, Reijneveld JC, Idema S, Killestein J, Heger M, de Jager SC, Urbanus RT, Hoefer IE, Pasterkamp G, Mannhalter C, Gomez-Arroyo J, Bogaard HJ, Noske DP, Vandertop WP, van den Broek D, Ylstra B, Nilsson RJA, Wesseling P, Karachaliou N, Rosell R, Lee-Lewandrowski E, Lewandrowski KB, Tannous BA, de Langen AJ, Smit EF, van den Heuvel MM, Wurdinger T. Swarm Intelligence-Enhanced Detection of Non-Small-Cell Lung Cancer Using Tumor-Educated Platelets. Cancer Cell 2017; 32:238-252.e9. [PMID: 28810146 PMCID: PMC6381325 DOI: 10.1016/j.ccell.2017.07.004] [Citation(s) in RCA: 201] [Impact Index Per Article: 28.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/03/2017] [Revised: 05/17/2017] [Accepted: 07/13/2017] [Indexed: 01/01/2023]
Abstract
Blood-based liquid biopsies, including tumor-educated blood platelets (TEPs), have emerged as promising biomarker sources for non-invasive detection of cancer. Here we demonstrate that particle-swarm optimization (PSO)-enhanced algorithms enable efficient selection of RNA biomarker panels from platelet RNA-sequencing libraries (n = 779). This resulted in accurate TEP-based detection of early- and late-stage non-small-cell lung cancer (n = 518 late-stage validation cohort, accuracy, 88%; AUC, 0.94; 95% CI, 0.92-0.96; p < 0.001; n = 106 early-stage validation cohort, accuracy, 81%; AUC, 0.89; 95% CI, 0.83-0.95; p < 0.001), independent of age of the individuals, smoking habits, whole-blood storage time, and various inflammatory conditions. PSO enabled selection of gene panels to diagnose cancer from TEPs, suggesting that swarm intelligence may also benefit the optimization of diagnostics readout of other liquid biopsy biosources.
Collapse
Affiliation(s)
- Myron G Best
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Department of Pathology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands.
| | - Nik Sol
- Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Department of Neurology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Sjors G J G In 't Veld
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Adrienne Vancura
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Mirte Muller
- Department of Thoracic Oncology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam, the Netherlands
| | - Anna-Larissa N Niemeijer
- Department of Pulmonary Diseases, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Aniko V Fejes
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Clinical Institute of Laboratory Medicine, Medical University of Vienna, Spitalgasse 23, 1090 Vienna, Austria
| | | | - Anna E Huis In 't Veld
- Department of Pulmonary Diseases, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Cyra Leurs
- Department of Neurology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; MS Center Amsterdam, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Tessa Y Le Large
- Department of Surgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Laura L Meijer
- Department of Surgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Irsan E Kooi
- Department of Clinical Genetics, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - François Rustenburg
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Department of Pathology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Pepijn Schellen
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Heleen Verschueren
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; thromboDx B.V., 1098 EA Amsterdam, the Netherlands
| | - Edward Post
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; thromboDx B.V., 1098 EA Amsterdam, the Netherlands
| | - Laurine E Wedekind
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Jillian Bracht
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Michelle Esenkbrink
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Leon Wils
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Francesca Favaro
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Jilian D Schoonhoven
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Jihane Tannous
- Department of Neurology, Massachusetts General Hospital and Neuroscience Program, Harvard Medical School, 149 13(th) Street, Charlestown, MA 02129, USA
| | - Hanne Meijers-Heijboer
- Department of Clinical Genetics, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Geert Kazemier
- Department of Surgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Elisa Giovannetti
- Department of Medical Oncology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Jaap C Reijneveld
- Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Department of Neurology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Sander Idema
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Joep Killestein
- Department of Neurology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; MS Center Amsterdam, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Michal Heger
- Department of Surgery, Amsterdam Medical Center, Meibergdreef 9, 1105 AZ Amsterdam, the Netherlands
| | - Saskia C de Jager
- Department of Experimental Cardiology, Utrecht University Medical Center, Heidelberglaan 100, 3584 CX Utrecht, the Netherlands
| | - Rolf T Urbanus
- Laboratory of Clinical Chemistry and Hematology, Utrecht University Medical Center, Heidelberglaan 100, 3584 CX Utrecht, the Netherlands
| | - Imo E Hoefer
- Laboratory of Clinical Chemistry and Hematology, Utrecht University Medical Center, Heidelberglaan 100, 3584 CX Utrecht, the Netherlands
| | - Gerard Pasterkamp
- Department of Experimental Cardiology, Utrecht University Medical Center, Heidelberglaan 100, 3584 CX Utrecht, the Netherlands
| | - Christine Mannhalter
- Clinical Institute of Laboratory Medicine, Medical University of Vienna, Spitalgasse 23, 1090 Vienna, Austria
| | - Jose Gomez-Arroyo
- Department of Pulmonary Diseases, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Harm-Jan Bogaard
- Department of Pulmonary Diseases, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - David P Noske
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - W Peter Vandertop
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Daan van den Broek
- Department of Clinical Chemistry, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam, the Netherlands
| | - Bauke Ylstra
- Department of Pathology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - R Jonas A Nilsson
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Department of Radiation Sciences, Oncology, Umeå University, 90185 Umeå, Sweden
| | - Pieter Wesseling
- Department of Pathology, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Department of Pathology, Princess Máxima Center for Pediatric Oncology and University Medical Center Utrecht, Lundlaan 6, 3584 EA Utrecht, the Netherlands
| | - Niki Karachaliou
- Translational Research Unit, Dr. Rosell Oncology Institute, Quirón Dexeus University Hospital, Calle Sabine Arana 5-19, 08028 Barcelona, Spain
| | - Rafael Rosell
- Translational Research Unit, Dr. Rosell Oncology Institute, Quirón Dexeus University Hospital, Calle Sabine Arana 5-19, 08028 Barcelona, Spain; Pangaea Biotech SL, Calle Sabine Arana 5-19, 08028 Barcelona, Spain; Catalan Institute of Oncology, Hospital Germans Trias i Pujol, Carretera de Canyet, 08916 Barcelona, Spain; Molecular Oncology Research (MORe) Foundation, Calle Sabine Arana 5-19, 08028 Barcelona, Spain
| | - Elizabeth Lee-Lewandrowski
- Department of Pathology, Massachusetts General Hospital, Harvard Medical School, 149 13(th) Street, Charlestown, MA 02129, USA
| | - Kent B Lewandrowski
- Department of Pathology, Massachusetts General Hospital, Harvard Medical School, 149 13(th) Street, Charlestown, MA 02129, USA
| | - Bakhos A Tannous
- Department of Neurology, Massachusetts General Hospital and Neuroscience Program, Harvard Medical School, 149 13(th) Street, Charlestown, MA 02129, USA
| | - Adrianus J de Langen
- Department of Pulmonary Diseases, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Egbert F Smit
- Department of Thoracic Oncology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam, the Netherlands; Department of Pulmonary Diseases, VU University Medical Center, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands
| | - Michel M van den Heuvel
- Department of Thoracic Oncology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Plesmanlaan 121, 1066 CX Amsterdam, the Netherlands; Department of Respiratory Diseases, Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands
| | - Thomas Wurdinger
- Department of Neurosurgery, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Brain Tumor Center Amsterdam, VU University Medical Center, Cancer Center Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, the Netherlands; Department of Neurology, Massachusetts General Hospital and Neuroscience Program, Harvard Medical School, 149 13(th) Street, Charlestown, MA 02129, USA.
| |
Collapse
|
316
|
A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing. Sci Rep 2017; 7:7648. [PMID: 28794490 PMCID: PMC5550469 DOI: 10.1038/s41598-017-08138-z] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2016] [Accepted: 07/07/2017] [Indexed: 01/08/2023] Open
Abstract
It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.
Collapse
|
317
|
Zhu FY, Chen MX, Ye NH, Shi L, Ma KL, Yang JF, Cao YY, Zhang Y, Yoshida T, Fernie AR, Fan GY, Wen B, Zhou R, Liu TY, Fan T, Gao B, Zhang D, Hao GF, Xiao S, Liu YG, Zhang J. Proteogenomic analysis reveals alternative splicing and translation as part of the abscisic acid response in Arabidopsis seedlings. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 91:518-533. [PMID: 28407323 DOI: 10.1111/tpj.13571] [Citation(s) in RCA: 109] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 04/05/2017] [Accepted: 04/07/2017] [Indexed: 05/19/2023]
Abstract
In eukaryotes, mechanisms such as alternative splicing (AS) and alternative translation initiation (ATI) contribute to organismal protein diversity. Specifically, splicing factors play crucial roles in responses to environment and development cues; however, the underlying mechanisms are not well investigated in plants. Here, we report the parallel employment of short-read RNA sequencing, single molecule long-read sequencing and proteomic identification to unravel AS isoforms and previously unannotated proteins in response to abscisic acid (ABA) treatment. Combining the data from the two sequencing methods, approximately 83.4% of intron-containing genes were alternatively spliced. Two AS types, which are referred to as alternative first exon (AFE) and alternative last exon (ALE), were more abundant than intron retention (IR); however, by contrast to AS events detected under normal conditions, differentially expressed AS isoforms were more likely to be translated. ABA extensively affects the AS pattern, indicated by the increasing number of non-conventional splicing sites. This work also identified thousands of unannotated peptides and proteins by ATI based on mass spectrometry and a virtual peptide library deduced from both strands of coding regions within the Arabidopsis genome. The results enhance our understanding of AS and alternative translation mechanisms under normal conditions, and in response to ABA treatment.
Collapse
Affiliation(s)
- Fu-Yuan Zhu
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, Shandong, China
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
| | - Mo-Xian Chen
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
| | - Neng-Hui Ye
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
- Southern Regional Collaborative Innovation Center for Grain and Oil Crops in China, Hunan Agricultural University, Changsha, 410128, China
| | - Lu Shi
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
- State Key Laboratory of Membrane Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | | | - Jing-Fang Yang
- College of Chemistry, Central China Normal University, Wuhan, China
| | - Yun-Ying Cao
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
- College of Life Sciences, Nantong University, Nantong, Jiangsu, China
| | - Youjun Zhang
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| | - Takuya Yoshida
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
- Laboratory of Plant Molecular Physiology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, 113-8657, Japan
| | - Alisdair R Fernie
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| | | | - Bo Wen
- BGI-Shenzhen, Shenzhen, China
| | | | - Tie-Yuan Liu
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Tao Fan
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, Shandong, China
| | - Bei Gao
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Di Zhang
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Ge-Fei Hao
- College of Chemistry, Central China Normal University, Wuhan, China
| | - Shi Xiao
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Ying-Gao Liu
- State Key Laboratory of Crop Biology, College of Life Science, Shandong Agricultural University, Taian, Shandong, China
| | - Jianhua Zhang
- School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
| |
Collapse
|
318
|
Wang T, Wang H, Cai D, Gao Y, Zhang H, Wang Y, Lin C, Ma L, Gu L. Comprehensive profiling of rhizome-associated alternative splicing and alternative polyadenylation in moso bamboo (Phyllostachys edulis). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 91:684-699. [PMID: 28493303 DOI: 10.1111/tpj.13597] [Citation(s) in RCA: 116] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2017] [Revised: 04/27/2017] [Accepted: 05/03/2017] [Indexed: 05/21/2023]
Abstract
Moso bamboo (Phyllostachys edulis) represents one of the fastest-spreading plants in the world, due in part to its well-developed rhizome system. However, the post-transcriptional mechanism for the development of the rhizome system in bamboo has not been comprehensively studied. We therefore used a combination of single-molecule long-read sequencing technology and polyadenylation site sequencing (PAS-seq) to re-annotate the bamboo genome, and identify genome-wide alternative splicing (AS) and alternative polyadenylation (APA) in the rhizome system. In total, 145 522 mapped full-length non-chimeric (FLNC) reads were analyzed, resulting in the correction of 2241 mis-annotated genes and the identification of 8091 previously unannotated loci. Notably, more than 42 280 distinct splicing isoforms were derived from 128 667 intron-containing full-length FLNC reads, including a large number of AS events associated with rhizome systems. In addition, we characterized 25 069 polyadenylation sites from 11 450 genes, 6311 of which have APA sites. Further analysis of intronic polyadenylation revealed that LTR/Gypsy and LTR/Copia were two major transposable elements within the intronic polyadenylation region. Furthermore, this study provided a quantitative atlas of poly(A) usage. Several hundred differential poly(A) sites in the rhizome-root system were identified. Taken together, these results suggest that post-transcriptional regulation may potentially have a vital role in the underground rhizome-root system.
Collapse
Affiliation(s)
- Taotao Wang
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Huiyuan Wang
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Dawei Cai
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
- College of Forestry, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Yubang Gao
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Hangxiao Zhang
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Yongsheng Wang
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Chentao Lin
- Department of Molecular, Cell and Developmental Biology, University of California, Los Angeles, CA, 90095, USA
| | - Liuyin Ma
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Lianfeng Gu
- Basic Forestry and Proteomics Research Center, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| |
Collapse
|
319
|
Liu X, Mei W, Soltis PS, Soltis DE, Barbazuk WB. Detecting alternatively spliced transcript isoforms from single-molecule long-read sequences without a reference genome. Mol Ecol Resour 2017; 17:1243-1256. [PMID: 28316149 DOI: 10.1111/1755-0998.12670] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Revised: 03/03/2017] [Accepted: 03/07/2017] [Indexed: 01/11/2023]
Abstract
Alternative splicing (AS) is a major source of transcript and proteome diversity, but examining AS in species without well-annotated reference genomes remains difficult. Research on both human and mouse has demonstrated the advantages of using Iso-Seq™ data for isoform-level transcriptome analysis, including the study of AS and gene fusion. We applied Iso-Seq™ to investigate AS in Amborella trichopoda, a phylogenetically pivotal species that is sister to all other living angiosperms. Our data show that, compared with RNA-Seq data, the Iso-Seq™ platform provides better recovery on large transcripts, new gene locus identification and gene model correction. Reference-based AS detection with Iso-Seq™ data identifies AS within a higher fraction of multi-exonic genes than observed for published RNA-Seq analysis (45.8% vs. 37.5%). These data demonstrate that the Iso-Seq™ approach is useful for detecting AS events. Using the Iso-Seq-defined transcript collection in Amborella as a reference, we further describe a pipeline for detection of AS isoforms from PacBio Iso-Seq™ without using a reference sequence (de novo). Results using this pipeline show a 66%-76% overall success rate in identifying AS events. This de novoAS detection pipeline provides a method to accurately characterize and identify bona fide alternatively spliced transcripts in any nonmodel system that lacks a reference genome sequence. Hence, our pipeline has huge potential applications and benefits to the broader biology community.
Collapse
Affiliation(s)
- Xiaoxian Liu
- Department of Biology, University of Florida, Gainesville, FL, 32611-8525, USA.,Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611-7800, USA
| | - Wenbin Mei
- Department of Biology, University of Florida, Gainesville, FL, 32611-8525, USA
| | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611-7800, USA.,Genetics Institute, University of Florida, Gainesville, FL, 32610, USA
| | - Douglas E Soltis
- Department of Biology, University of Florida, Gainesville, FL, 32611-8525, USA.,Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611-7800, USA.,Genetics Institute, University of Florida, Gainesville, FL, 32610, USA
| | - W Brad Barbazuk
- Department of Biology, University of Florida, Gainesville, FL, 32611-8525, USA.,Genetics Institute, University of Florida, Gainesville, FL, 32610, USA
| |
Collapse
|
320
|
Jiang X, Hall AB, Biedler JK, Tu Z. Single molecule RNA sequencing uncovers trans-splicing and improves annotations in Anopheles stephensi. INSECT MOLECULAR BIOLOGY 2017; 26:298-307. [PMID: 28181326 PMCID: PMC5718059 DOI: 10.1111/imb.12294] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Single molecule real-time (SMRT) sequencing has recently been used to obtain full-length cDNA sequences that improve genome annotation and reveal RNA isoforms. Here, we used one such method called isoform sequencing from Pacific Biosciences (PacBio) to sequence a cDNA library from the Asian malaria mosquito Anopheles stephensi. More than 600 000 full-length cDNAs, referred to as reads of insert, were identified. Owing to the inherently high error rate of PacBio sequencing, we tested different approaches for error correction. We found that error correction using Illumina RNA sequencing (RNA-seq) generated more data than using the default SMRT pipeline. The full-length error-corrected PacBio reads greatly improved the gene annotation of Anopheles stephensi: 4867 gene models were updated and 1785 alternatively spliced isoforms were added to the annotation. In addition, six trans-splicing events, where exons from different primary transcripts were joined together, were identified in An. stephensi. All six trans-splicing events appear to be conserved in Culicidae, as they are also found in Anopheles gambiae and Aedes aegypti. The proteins encoded by trans-splicing events are also highly conserved and the orthologues of these proteins are cis-spliced in outgroup species, indicating that trans-splicing may arise as a mechanism to rescue genes that broke up during evolution.
Collapse
Affiliation(s)
- X Jiang
- Program in Genetics Bioinformatics and Computational Biology, Virginia Tech, Blacksburg, VA, USA
- Department of Biochemistry, Virginia Tech, Blacksburg, VA, USA
- Fralin Life Science Institute, Virginia Tech, Blacksburg, VA, USA
| | - A B Hall
- Program in Genetics Bioinformatics and Computational Biology, Virginia Tech, Blacksburg, VA, USA
- Fralin Life Science Institute, Virginia Tech, Blacksburg, VA, USA
| | - J K Biedler
- Program in Genetics Bioinformatics and Computational Biology, Virginia Tech, Blacksburg, VA, USA
- Department of Biochemistry, Virginia Tech, Blacksburg, VA, USA
| | - Z Tu
- Program in Genetics Bioinformatics and Computational Biology, Virginia Tech, Blacksburg, VA, USA
- Department of Biochemistry, Virginia Tech, Blacksburg, VA, USA
- Fralin Life Science Institute, Virginia Tech, Blacksburg, VA, USA
| |
Collapse
|
321
|
Hoang NV, Furtado A, Mason PJ, Marquardt A, Kasirajan L, Thirugnanasambandam PP, Botha FC, Henry RJ. A survey of the complex transcriptome from the highly polyploid sugarcane genome using full-length isoform sequencing and de novo assembly from short read sequencing. BMC Genomics 2017; 18:395. [PMID: 28532419 PMCID: PMC5440902 DOI: 10.1186/s12864-017-3757-8] [Citation(s) in RCA: 111] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Accepted: 05/03/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Despite the economic importance of sugarcane in sugar and bioenergy production, there is not yet a reference genome available. Most of the sugarcane transcriptomic studies have been based on Saccharum officinarum gene indices (SoGI), expressed sequence tags (ESTs) and de novo assembled transcript contigs from short-reads; hence knowledge of the sugarcane transcriptome is limited in relation to transcript length and number of transcript isoforms. RESULTS The sugarcane transcriptome was sequenced using PacBio isoform sequencing (Iso-Seq) of a pooled RNA sample derived from leaf, internode and root tissues, of different developmental stages, from 22 varieties, to explore the potential for capturing full-length transcript isoforms. A total of 107,598 unique transcript isoforms were obtained, representing about 71% of the total number of predicted sugarcane genes. The majority of this dataset (92%) matched the plant protein database, while just over 2% was novel transcripts, and over 2% was putative long non-coding RNAs. About 56% and 23% of total sequences were annotated against the gene ontology and KEGG pathway databases, respectively. Comparison with de novo contigs from Illumina RNA-Sequencing (RNA-Seq) of the internode samples from the same experiment and public databases showed that the Iso-Seq method recovered more full-length transcript isoforms, had a higher N50 and average length of largest 1,000 proteins; whereas a greater representation of the gene content and RNA diversity was captured in RNA-Seq. Only 62% of PacBio transcript isoforms matched 67% of de novo contigs, while the non-matched proportions were attributed to the inclusion of leaf/root tissues and the normalization in PacBio, and the representation of more gene content and RNA classes in the de novo assembly, respectively. About 69% of PacBio transcript isoforms and 41% of de novo contigs aligned with the sorghum genome, indicating the high conservation of orthologs in the genic regions of the two genomes. CONCLUSIONS The transcriptome dataset should contribute to improved sugarcane gene models and sugarcane protein predictions; and will serve as a reference database for analysis of transcript expression in sugarcane.
Collapse
Affiliation(s)
- Nam V Hoang
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.,College of Agriculture and Forestry, Hue University, Hue, Vietnam
| | - Agnelo Furtado
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia
| | - Patrick J Mason
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia
| | - Annelie Marquardt
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.,Sugar Research Australia, Indooroopilly, QLD, 4068, Australia
| | - Lakshmi Kasirajan
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.,ICAR - Sugarcane Breeding Institute, Coimbatore, Tamil Nadu, India
| | - Prathima P Thirugnanasambandam
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.,ICAR - Sugarcane Breeding Institute, Coimbatore, Tamil Nadu, India
| | - Frederik C Botha
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.,Sugar Research Australia, Indooroopilly, QLD, 4068, Australia
| | - Robert J Henry
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Room 2.245, Level 2, The John Hay Building, Queensland Biosciences Precinct [#80], 306 Carmody Road, St. Lucia, QLD, 4072, Australia.
| |
Collapse
|
322
|
Zulkapli MM, Rosli MAF, Salleh FIM, Mohd Noor N, Aizat WM, Goh HH. Iso-Seq analysis of Nepenthes ampullaria, Nepenthes rafflesiana and Nepenthes × hookeriana for hybridisation study in pitcher plants. GENOMICS DATA 2017; 12:130-131. [PMID: 28529881 PMCID: PMC5429222 DOI: 10.1016/j.gdata.2017.05.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Revised: 04/26/2017] [Accepted: 05/02/2017] [Indexed: 12/20/2022]
Abstract
Tropical pitcher plants in the species-rich Nepenthaceae family of carnivorous plants possess unique pitcher organs. Hybridisation, natural or artificial, in this family is extensive resulting in pitchers with diverse features. The pitcher functions as a passive insect trap with digestive fluid for nutrient acquisition in nitrogen-poor habitats. This organ shows specialisation according to the dietary habit of different Nepenthes species. In this study, we performed the first single-molecule real-time isoform sequencing (Iso-Seq) analysis of full-length cDNA from Nepenthes ampullaria which can feed on leaf litter, compared to carnivorous Nepenthes rafflesiana, and their carnivorous hybrid Nepenthes × hookeriana. This allows the comparison of pitcher transcriptomes from the parents and the hybrid to understand how hybridisation could shape the evolution of dietary habit in Nepenthes. Raw reads have been deposited to SRA database with the accession numbers SRX2692198 (N. ampullaria), SRX2692197 (N. rafflesiana), and SRX2692196 (N. × hookeriana).
Collapse
Affiliation(s)
| | | | - Faris Imadi Mohd Salleh
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Normah Mohd Noor
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Wan Mohd Aizat
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Hoe-Han Goh
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| |
Collapse
|
323
|
Clavijo BJ, Venturini L, Schudoma C, Accinelli GG, Kaithakottil G, Wright J, Borrill P, Kettleborough G, Heavens D, Chapman H, Lipscombe J, Barker T, Lu FH, McKenzie N, Raats D, Ramirez-Gonzalez RH, Coince A, Peel N, Percival-Alwyn L, Duncan O, Trösch J, Yu G, Bolser DM, Namaati G, Kerhornou A, Spannagl M, Gundlach H, Haberer G, Davey RP, Fosker C, Palma FD, Phillips AL, Millar AH, Kersey PJ, Uauy C, Krasileva KV, Swarbreck D, Bevan MW, Clark MD. An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations. Genome Res 2017; 27:885-896. [PMID: 28420692 PMCID: PMC5411782 DOI: 10.1101/gr.217117.116] [Citation(s) in RCA: 244] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 03/14/2017] [Indexed: 01/16/2023]
Abstract
Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Tom Barker
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | - Fu-Hao Lu
- John Innes Centre, Norwich, NR4 7UH, United Kingdom
| | | | - Dina Raats
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | | | | | - Ned Peel
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
| | | | - Owen Duncan
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Josua Trösch
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Guotai Yu
- John Innes Centre, Norwich, NR4 7UH, United Kingdom
| | - Dan M Bolser
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Guy Namaati
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Arnaud Kerhornou
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | - Manuel Spannagl
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Heidrun Gundlach
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Georg Haberer
- Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Robert P Davey
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - Federica Di Palma
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - A Harvey Millar
- ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
| | - Paul J Kersey
- EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
| | | | - Ksenia V Krasileva
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
- The Sainsbury Laboratory, Norwich, NR4 7UH, United Kingdom
| | - David Swarbreck
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| | | | - Matthew D Clark
- Earlham Institute, Norwich, NR4 7UZ, United Kingdom
- University of East Anglia, Norwich, NR4 7TJ, United Kingdom
| |
Collapse
|
324
|
Kuo RI, Tseng E, Eory L, Paton IR, Archibald AL, Burt DW. Normalized long read RNA sequencing in chicken reveals transcriptome complexity similar to human. BMC Genomics 2017; 18:323. [PMID: 28438136 PMCID: PMC5404281 DOI: 10.1186/s12864-017-3691-9] [Citation(s) in RCA: 76] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2016] [Accepted: 04/06/2017] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND Despite the significance of chicken as a model organism, our understanding of the chicken transcriptome is limited compared to human. This issue is common to all non-human vertebrate annotations due to the difficulty in transcript identification from short read RNAseq data. While previous studies have used single molecule long read sequencing for transcript discovery, they did not perform RNA normalization and 5'-cap selection which may have resulted in lower transcriptome coverage and truncated transcript sequences. RESULTS We sequenced normalised chicken brain and embryo RNA libraries with Pacific Bioscience Iso-Seq. 5' cap selection was performed on the embryo library to provide methodological comparison. From these Iso-Seq sequencing projects, we have identified 60 k transcripts and 29 k genes within the chicken transcriptome. Of these, more than 20 k are novel lncRNA transcripts with ~3 k classified as sense exonic overlapping lncRNA, which is a class that is underrepresented in many vertebrate annotations. The relative proportion of alternative transcription events revealed striking similarities between the chicken and human transcriptomes while also providing explanations for previously observed genomic differences. CONCLUSIONS Our results indicate that the chicken transcriptome is similar in complexity compared to human, and provide insights into other vertebrate biology. Our methodology demonstrates the potential of Iso-Seq sequencing to rapidly expand our knowledge of transcriptomics.
Collapse
Affiliation(s)
- Richard I. Kuo
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG UK
| | | | - Lel Eory
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG UK
| | - Ian R. Paton
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG UK
| | - Alan L. Archibald
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG UK
| | - David W. Burt
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG UK
- The University of Queensland, St. Lucia, Canberra, QLD 4072 Australia
| |
Collapse
|
325
|
Li Y, Dai C, Hu C, Liu Z, Kang C. Global identification of alternative splicing via comparative analysis of SMRT- and Illumina-based RNA-seq in strawberry. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 90:164-176. [PMID: 27997733 DOI: 10.1111/tpj.13462] [Citation(s) in RCA: 107] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Revised: 11/23/2016] [Accepted: 12/14/2016] [Indexed: 05/21/2023]
Abstract
Alternative splicing (AS) is a key post-transcriptional regulatory mechanism, yet little information is known about its roles in fruit crops. Here, AS was globally analyzed in the wild strawberry Fragaria vesca genome with RNA-seq data derived from different stages of fruit development. The AS landscape was characterized and compared between the single-molecule, real-time (SMRT) and Illumina RNA-seq platform. While SMRT has a lower sequencing depth, it identifies more genes undergoing AS (57.67% of detected multiexon genes) when it is compared with Illumina (33.48%), illustrating the efficacy of SMRT in AS identification. We investigated different modes of AS in the context of fruit development; the percentage of intron retention (IR) is markedly reduced whereas that of alternative acceptor sites (AA) is significantly increased post-fertilization when compared with pre-fertilization. When all the identified transcripts were combined, a total of 66.43% detected multiexon genes in strawberry undergo AS, some of which lead to a gain or loss of conserved domains in the gene products. The work demonstrates that SMRT sequencing is highly powerful in AS discovery and provides a rich data resource for later functional studies of different isoforms. Further, shifting AS modes may contribute to rapid changes of gene expression during fruit set.
Collapse
Affiliation(s)
- Yongping Li
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Cheng Dai
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | - Chungen Hu
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| | - Zhongchi Liu
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, 20742, USA
| | - Chunying Kang
- Key Laboratory of Horticultural Plant Biology (Ministry of Education), College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
326
|
Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes. Sci Rep 2017; 7:44609. [PMID: 28300172 PMCID: PMC5353739 DOI: 10.1038/srep44609] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Accepted: 02/09/2017] [Indexed: 01/08/2023] Open
Abstract
Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.
Collapse
|
327
|
Weirather JL, de Cesare M, Wang Y, Piazza P, Sebastiano V, Wang XJ, Buck D, Au KF. Comprehensive comparison of Pacific Biosciences and Oxford Nanopore Technologies and their applications to transcriptome analysis. F1000Res 2017; 6:100. [PMID: 28868132 PMCID: PMC5553090 DOI: 10.12688/f1000research.10571.1] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/09/2017] [Indexed: 09/05/2023] Open
Abstract
Background: Given the demonstrated utility of Third Generation Sequencing [Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT)] long reads in many studies, a comprehensive analysis and comparison of their data quality and applications is in high demand. Methods: Based on the transcriptome sequencing data from human embryonic stem cells, we analyzed multiple data features of PacBio and ONT, including error pattern, length, mappability and technical improvements over previous platforms. We also evaluated their application to transcriptome analyses, such as isoform identification and quantification and characterization of transcriptome complexity, by comparing the performance of size-selected PacBio, non-size-selected ONT and their corresponding Hybrid-Seq strategies (PacBio+Illumina and ONT+Illumina). Results: PacBio shows overall better data quality, while ONT provides a higher yield. As with data quality, PacBio performs marginally better than ONT in most aspects for both long reads only and Hybrid-Seq strategies in transcriptome analysis. In addition, Hybrid-Seq shows superior performance over long reads only in most transcriptome analyses. Conclusions: Both PacBio and ONT sequencing are suitable for full-length single-molecule transcriptome analysis. As this first use of ONT reads in a Hybrid-Seq analysis has shown, both PacBio and ONT can benefit from a combined Illumina strategy. The tools and analytical methods developed here provide a resource for future applications and evaluations of these rapidly-changing technologies.
Collapse
Affiliation(s)
- Jason L Weirather
- Department of Internal Medicine, University of Iowa, Iowa City, IA, USA
| | - Mariateresa de Cesare
- Oxford Genomics Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Yunhao Wang
- Department of Internal Medicine, University of Iowa, Iowa City, IA, USA
- Key laboratory of Genetics Network Biology, Collaborative Innovation Center of Genetics and Development, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Paolo Piazza
- Oxford Genomics Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Vittorio Sebastiano
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA, USA
- Department of Obstetrics and Gynecology, Stanford University, Stanford, CA, USA
| | - Xiu-Jie Wang
- Key laboratory of Genetics Network Biology, Collaborative Innovation Center of Genetics and Development, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - David Buck
- Oxford Genomics Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Kin Fai Au
- Department of Internal Medicine, University of Iowa, Iowa City, IA, USA
- Department of Biostatistics, University of Iowa, Iowa City, USA
| |
Collapse
|
328
|
Weirather JL, de Cesare M, Wang Y, Piazza P, Sebastiano V, Wang XJ, Buck D, Au KF. Comprehensive comparison of Pacific Biosciences and Oxford Nanopore Technologies and their applications to transcriptome analysis. F1000Res 2017; 6:100. [PMID: 28868132 PMCID: PMC5553090 DOI: 10.12688/f1000research.10571.2] [Citation(s) in RCA: 261] [Impact Index Per Article: 37.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/09/2017] [Indexed: 12/11/2022] Open
Abstract
Background: Given the demonstrated utility of Third Generation Sequencing [Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT)] long reads in many studies, a comprehensive analysis and comparison of their data quality and applications is in high demand.
Methods: Based on the transcriptome sequencing data from human embryonic stem cells, we analyzed multiple data features of PacBio and ONT, including error pattern, length, mappability and technical improvements over previous platforms. We also evaluated their application to transcriptome analyses, such as isoform identification and quantification and characterization of transcriptome complexity, by comparing the performance of size-selected PacBio, non-size-selected ONT and their corresponding Hybrid-Seq strategies (PacBio+Illumina and ONT+Illumina).
Results: PacBio shows overall better data quality, while ONT provides a higher yield. As with data quality, PacBio performs marginally better than ONT in most aspects for both long reads only and Hybrid-Seq strategies in transcriptome analysis. In addition, Hybrid-Seq shows superior performance over long reads only in most transcriptome analyses.
Conclusions: Both PacBio and ONT sequencing are suitable for full-length single-molecule transcriptome analysis. As this first use of ONT reads in a Hybrid-Seq analysis has shown, both PacBio and ONT can benefit from a combined Illumina strategy. The tools and analytical methods developed here provide a resource for future applications and evaluations of these rapidly-changing technologies.
Collapse
Affiliation(s)
- Jason L Weirather
- Department of Internal Medicine, University of Iowa, Iowa City, IA, USA
| | - Mariateresa de Cesare
- Oxford Genomics Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Yunhao Wang
- Department of Internal Medicine, University of Iowa, Iowa City, IA, USA.,Key laboratory of Genetics Network Biology, Collaborative Innovation Center of Genetics and Development, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Paolo Piazza
- Oxford Genomics Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Vittorio Sebastiano
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA, USA.,Department of Obstetrics and Gynecology, Stanford University, Stanford, CA, USA
| | - Xiu-Jie Wang
- Key laboratory of Genetics Network Biology, Collaborative Innovation Center of Genetics and Development, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - David Buck
- Oxford Genomics Centre, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Kin Fai Au
- Department of Internal Medicine, University of Iowa, Iowa City, IA, USA.,Department of Biostatistics, University of Iowa, Iowa City, USA
| |
Collapse
|
329
|
De novo hybrid assembly of the rubber tree genome reveals evidence of paleotetraploidy in Hevea species. Sci Rep 2017; 7:41457. [PMID: 28150702 PMCID: PMC5288721 DOI: 10.1038/srep41457] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Accepted: 12/19/2016] [Indexed: 12/11/2022] Open
Abstract
Para rubber tree (Hevea brasiliensis) is an important economic species as it is the sole commercial producer of high-quality natural rubber. Here, we report a de novo hybrid assembly of BPM24 accession, which exhibits resistance to major fungal pathogens in Southeast Asia. Deep-coverage 454/Illumina short-read and Pacific Biosciences (PacBio) long-read sequence data were acquired to generate a preliminary draft, which was subsequently scaffolded using a long-range "Chicago" technique to obtain a final assembly of 1.26 Gb (N50 = 96.8 kb). The assembled genome contains 69.2% repetitive sequences and has a GC content of 34.31%. Using a high-density SNP-based genetic map, we were able to anchor 28.9% of the genome assembly (363 Mb) associated with over two thirds of the predicted protein-coding genes into rubber tree's 18 linkage groups. These genetically anchored sequences allowed comparative analyses of the intragenomic homeologous synteny, providing the first concrete evidence to demonstrate the presence of paleotetraploidy in Hevea species. Additionally, the degree of macrosynteny conservation observed between rubber tree and cassava strongly supports the hypothesis that the paleotetraploidization event took place prior to the divergence of the Hevea and Manihot species.
Collapse
|
330
|
Cheng CY, Krishnakumar V, Chan AP, Thibaud-Nissen F, Schobel S, Town CD. Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 89:789-804. [PMID: 27862469 DOI: 10.1111/tpj.13415] [Citation(s) in RCA: 632] [Impact Index Per Article: 90.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Revised: 10/27/2016] [Accepted: 11/03/2016] [Indexed: 05/20/2023]
Abstract
The flowering plant Arabidopsis thaliana is a dicot model organism for research in many aspects of plant biology. A comprehensive annotation of its genome paves the way for understanding the functions and activities of all types of transcripts, including mRNA, the various classes of non-coding RNA, and small RNA. The TAIR10 annotation update had a profound impact on Arabidopsis research but was released more than 5 years ago. Maintaining the accuracy of the annotation continues to be a prerequisite for future progress. Using an integrative annotation pipeline, we assembled tissue-specific RNA-Seq libraries from 113 datasets and constructed 48 359 transcript models of protein-coding genes in eleven tissues. In addition, we annotated various classes of non-coding RNA including microRNA, long intergenic RNA, small nucleolar RNA, natural antisense transcript, small nuclear RNA, and small RNA using published datasets and in-house analytic results. Altogether, we identified 635 novel protein-coding genes, 508 novel transcribed regions, 5178 non-coding RNAs, and 35 846 small RNA loci that were formerly unannotated. Analysis of the splicing events and RNA-Seq based expression profiles revealed the landscapes of gene structures, untranslated regions, and splicing activities to be more intricate than previously appreciated. Furthermore, we present 692 uniformly expressed housekeeping genes, 43% of whose human orthologs are also housekeeping genes. This updated Arabidopsis genome annotation with a substantially increased resolution of gene models will not only further our understanding of the biological processes of this plant model but also of other species.
Collapse
Affiliation(s)
- Chia-Yi Cheng
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Vivek Krishnakumar
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Agnes P Chan
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, US National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Seth Schobel
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Christopher D Town
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| |
Collapse
|
331
|
Kelly DE, Hansen MEB, Tishkoff SA. Global variation in gene expression and the value of diverse sampling. CURRENT OPINION IN SYSTEMS BIOLOGY 2017; 1:102-108. [PMID: 28596996 PMCID: PMC5458633 DOI: 10.1016/j.coisb.2016.12.018] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The genomics era has accelerated our understanding of how genetic and epigenetic factors influence both normal variable traits and disease risk in humans. However, the majority of "omics" studies have focused on individuals living in urban centers, primarily from Europe and Asia, neglecting much of the genetic and environmental variation that exists across worldwide populations. Comparative studies of gene regulation in ethnically diverse populations are informing our understanding of how evolutionary forces have shaped the genetic and molecular mechanisms underlying complex traits, and studying gene expression in different environmental contexts is enabling the dissection of disease-related pathways such as immune response. Such approaches are vital to the equitable application of genomics and medicine.
Collapse
Affiliation(s)
- Derek E. Kelly
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, PA 19104, USA
| | | | - Sarah A. Tishkoff
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
332
|
Deng X, Cao X. Roles of pre-mRNA splicing and polyadenylation in plant development. CURRENT OPINION IN PLANT BIOLOGY 2017; 35:45-53. [PMID: 27866125 DOI: 10.1016/j.pbi.2016.11.003] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Revised: 11/01/2016] [Accepted: 11/03/2016] [Indexed: 05/20/2023]
Abstract
Plants possess amazing plasticity of growth and development, allowing them to adjust continuously and rapidly to changes in the environment. Over the past two decades, numerous molecular studies have illuminated the role of transcriptional regulation in plant development and environmental responses. However, emerging studies in Arabidopsis have uncovered an unexpectedly widespread role for post-transcriptional regulation in development and responses to environmental changes. In this review, we summarize recent discoveries detailing the contribution of two post-transcriptional mechanisms, pre-mRNA splicing and polyadenylation, to the regulation of plant development, with an emphasis on the control of flowering time. We also discuss future directions in the field and new technological approaches.
Collapse
Affiliation(s)
- Xian Deng
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, CAS Center for Excellence in Molecular Plant Sciences, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Xiaofeng Cao
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, CAS Center for Excellence in Molecular Plant Sciences, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
333
|
Xu Q, Zhu J, Zhao S, Hou Y, Li F, Tai Y, Wan X, Wei C. Transcriptome Profiling Using Single-Molecule Direct RNA Sequencing Approach for In-depth Understanding of Genes in Secondary Metabolism Pathways of Camellia sinensis. FRONTIERS IN PLANT SCIENCE 2017; 8:1205. [PMID: 28744294 PMCID: PMC5504172 DOI: 10.3389/fpls.2017.01205] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Accepted: 06/26/2017] [Indexed: 05/22/2023]
Abstract
Characteristic secondary metabolites, including flavonoids, theanine and caffeine, are important components of Camellia sinensis, and their biosynthesis has attracted widespread interest. Previous studies on the biosynthesis of these major secondary metabolites using next-generation sequencing technologies limited the accurately prediction of full-length (FL) splice isoforms. Herein, we applied single-molecule sequencing to pooled tea plant tissues, to provide a more complete transcriptome of C. sinensis. Moreover, we identified 94 FL transcripts and four alternative splicing events for enzyme-coding genes involved in the biosynthesis of flavonoids, theanine and caffeine. According to the comparison between long-read isoforms and assemble transcripts, we improved the quality and accuracy of genes sequenced by short-read next-generation sequencing technology. The resulting FL transcripts, together with the improved assembled transcripts and identified alternative splicing events, enhance our understanding of genes involved in the biosynthesis of characteristic secondary metabolites in C. sinensis.
Collapse
|
334
|
von Arnim AG, Hellmann HA. Meeting report: processing, translation, decay - three ways to keep RNA sizzling. PLANT, CELL & ENVIRONMENT 2016; 39:2624-2628. [PMID: 27859406 DOI: 10.1111/pce.12819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Revised: 08/17/2016] [Accepted: 08/21/2016] [Indexed: 06/06/2023]
Abstract
This meeting report highlights key trends that emerged from a conference entitled Post-Transcriptional Gene Regulation in Plants, which was held 14-15 July 2016, as a satellite meeting of the annual meeting of the American Society of Plant Biologists in Austin, Texas. The molecular biology of RNA is emerging as an integral part of the framework for plants' responses to environmental challenges such as drought and heat, hypoxia, nutrient deprivation, light and pathogens. Moreover, the conference illustrated how a multitude of customized and pioneering omics-related technologies are being applied, more and more often in combination, to describe and dissect the complexities of gene expression at the post-transcriptional level.
Collapse
Affiliation(s)
- Albrecht G von Arnim
- Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN, 37996-0840, USA
| | - Hanjo A Hellmann
- School of Biological Sciences, Washington State University, PO Box 644236, Pullman, WA, 99164-4236, USA
| |
Collapse
|
335
|
Xiang B, Li X, Qian J, Wang L, Ma L, Tian X, Wang Y. The Complete Chloroplast Genome Sequence of the Medicinal Plant Swertia mussotii Using the PacBio RS II Platform. Molecules 2016; 21:molecules21081029. [PMID: 27517885 PMCID: PMC6274542 DOI: 10.3390/molecules21081029] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2016] [Revised: 07/21/2016] [Accepted: 08/04/2016] [Indexed: 11/16/2022] Open
Abstract
Swertia mussotii is an important medicinal plant that has great economic and medicinal value and is found on the Qinghai Tibetan Plateau. The complete chloroplast (cp) genome of S. mussotii is 153,431 bp in size, with a pair of inverted repeat (IR) regions of 25,761 bp each that separate an large single-copy (LSC) region of 83,567 bp and an a small single-copy (SSC) region of 18,342 bp. The S. mussotii cp genome encodes 84 protein-coding genes, 37 transfer RNA (tRNA) genes, and eight ribosomal RNA (rRNA) genes. The identity, number, and GC content of S. mussotii cp genes were similar to those in the genomes of other Gentianales species. Via analysis of the repeat structure, 11 forward repeats, eight palindromic repeats, and one reverse repeat were detected in the S. mussotii cp genome. There are 45 SSRs in the S. mussotii cp genome, the majority of which are mononucleotides found in all other Gentianales species. An entire cp genome comparison study of S. mussotii and two other species in Gentianaceae was conducted. The complete cp genome sequence provides intragenic information for the cp genetic engineering of this medicinal plant.
Collapse
Affiliation(s)
- Beibei Xiang
- School of Chinese Materia Medica, Tianjin University of Traditional Chinese Medicine, Anshan Road 312, Tianjin 300193, China.
| | - Xiaoxue Li
- College of Life Science, Nankai University, Weijin Road 94, Tianjin 300071, China.
| | - Jun Qian
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Malianwa North Road 151, Beijing 100193, China.
| | - Lizhi Wang
- School of Chinese Materia Medica, Tianjin University of Traditional Chinese Medicine, Anshan Road 312, Tianjin 300193, China.
| | - Lin Ma
- School of Chinese Materia Medica, Tianjin University of Traditional Chinese Medicine, Anshan Road 312, Tianjin 300193, China.
| | - Xiaoxuan Tian
- Tianjin State Key Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Anshan Road 312, Tianjin 300193, China.
| | - Yong Wang
- College of Life Science, Nankai University, Weijin Road 94, Tianjin 300071, China.
| |
Collapse
|