51. Wright DJ, Hall NAL, Irish N, Man AL, Glynn W, Mould A, Angeles ADL, Angiolini E, Swarbreck D, Gharbi K, Tunbridge EM, Haerty W. Long read sequencing reveals novel isoforms and insights into splicing regulation during cell state changes. BMC Genomics 2022; 23:42. [PMID: 35012468 PMCID: PMC8744310 DOI: 10.1186/s12864-021-08261-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 05/11/2021] [Accepted: 12/15/2021] [Indexed: 12/31/2022]
Abstract
BACKGROUND: Alternative splicing is a key mechanism underlying cellular differentiation and a driver of complexity in mammalian neuronal tissues. However, which isoforms are differentially used or expressed, and how this affects cellular differentiation, remains unclear. Long read sequencing allows full-length transcript recovery and quantification, enabling transcript-level analysis of alternative splicing processes and how these change with cell state. Here, we utilise Oxford Nanopore Technologies sequencing to produce a custom annotation of a well-studied human neuroblastoma cell line, SH-SY5Y, and to characterise isoform expression and usage across differentiation.
RESULTS: We identify many previously unannotated features, including a novel transcript of the voltage-gated calcium channel subunit gene CACNA2D2. We show differential expression and usage of transcripts during differentiation, identifying candidates for future research into state change regulation.
CONCLUSIONS: Our work highlights the potential of long read sequencing to uncover previously unknown transcript diversity and mechanisms influencing alternative splicing.
Affiliation(s)
- David J Wright
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
- Nicola A L Hall
  - Department of Psychiatry, Medical Sciences Division, University of Oxford, Oxfordshire, OX3 3JX, UK
  - Oxford Health, NHS Foundation Trust, Oxford, Oxfordshire, OX3 7JX, UK
- Naomi Irish
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
- Angela L Man
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
- Will Glynn
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
- Arne Mould
  - Department of Psychiatry, Medical Sciences Division, University of Oxford, Oxfordshire, OX3 3JX, UK
  - Oxford Health, NHS Foundation Trust, Oxford, Oxfordshire, OX3 7JX, UK
- Alejandro De Los Angeles
  - Department of Psychiatry, Medical Sciences Division, University of Oxford, Oxfordshire, OX3 3JX, UK
  - Oxford Health, NHS Foundation Trust, Oxford, Oxfordshire, OX3 7JX, UK
- Emily Angiolini
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
- David Swarbreck
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
- Karim Gharbi
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
- Elizabeth M Tunbridge
  - Department of Psychiatry, Medical Sciences Division, University of Oxford, Oxfordshire, OX3 3JX, UK
  - Oxford Health, NHS Foundation Trust, Oxford, Oxfordshire, OX3 7JX, UK
- Wilfried Haerty
  - Earlham Institute, Norwich Research Park, Norfolk, NR4 7UZ, UK
52. Namba S, Ueno T, Kojima S, Kobayashi K, Kawase K, Tanaka Y, Inoue S, Kishigami F, Kawashima S, Maeda N, Ogawa T, Hazama S, Togashi Y, Ando M, Shiraishi Y, Mano H, Kawazu M. Transcript-targeted analysis reveals isoform alterations and double-hop fusions in breast cancer. Commun Biol 2021; 4:1320. [PMID: 34811492 PMCID: PMC8608905 DOI: 10.1038/s42003-021-02833-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 11/16/2020] [Accepted: 11/02/2021] [Indexed: 12/22/2022]
Abstract
Although transcriptome alteration is an essential driver of carcinogenesis, the effects of chromosomal structural alterations on the cancer transcriptome are not yet fully understood. Short-read transcript sequencing has prevented researchers from directly exploring full-length transcripts, forcing them to focus on individual splice sites. Here, we develop a pipeline for Multi-Sample long-read Transcriptome Assembly (MuSTA), which enables construction of a transcriptome from long-read sequence data. Using the constructed transcriptome as a reference, we analyze RNA extracted from 22 clinical breast cancer specimens. We identify a comprehensive set of subtype-specific and differentially used isoforms, extending our knowledge of isoform regulation to unannotated isoforms, including a short form of TNS3. We also find that the exon-intron structure of fusion transcripts depends on their genomic context, and we identify double-hop fusion transcripts that are transcribed from complex structural rearrangements. For example, a double-hop fusion results in aberrant expression of an endogenous retroviral gene, ERVFRD-1, which is normally expressed exclusively in placenta and is thought to protect the fetus from maternal rejection; expression is elevated in several TCGA samples with ERVFRD-1 fusions. Our analyses provide direct evidence that full-length transcript sequencing of clinical samples can add to our understanding of cancer biology and genomics in general.
Affiliation(s)
- Shinichi Namba
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
  - Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, 565-0871, Japan
- Toshihide Ueno
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
- Shinya Kojima
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
- Kenya Kobayashi
  - Department of Head and Neck Oncology, National Cancer Center Hospital, Tokyo, 104-0045, Japan
- Katsushige Kawase
  - Division of Cell Therapy, Chiba Cancer Center, Research Institute, Chiba, 260-8717, Japan
- Yosuke Tanaka
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
- Satoshi Inoue
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
- Fumishi Kishigami
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
- Shusuke Kawashima
  - Division of Cell Therapy, Chiba Cancer Center, Research Institute, Chiba, 260-8717, Japan
- Noriko Maeda
  - Department of Gastroenterological, Breast and Endocrine Surgery, Yamaguchi University Graduate School of Medicine, Yamaguchi, 755-8505, Japan
- Tomoko Ogawa
  - Department of Breast Surgery, Mie University Hospital, Mie, 514-8507, Japan
- Shoichi Hazama
  - Department of Translational Research and Developmental Therapeutics against Cancer, Yamaguchi University Graduate School of Medicine, Yamaguchi, 755-8505, Japan
- Yosuke Togashi
  - Division of Cell Therapy, Chiba Cancer Center, Research Institute, Chiba, 260-8717, Japan
- Mizuo Ando
  - Department of Otolaryngology, Head and Neck Surgery, The University of Tokyo Hospital, Tokyo, 113-8654, Japan
- Yuichi Shiraishi
  - Division of Genome Analysis Platform Development, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
- Hiroyuki Mano
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
- Masahito Kawazu
  - Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, 104-0045, Japan
  - Division of Cell Therapy, Chiba Cancer Center, Research Institute, Chiba, 260-8717, Japan
53. Kuo MC, Liu SCH, Hsu YF, Wu RM. The role of noncoding RNAs in Parkinson's disease: biomarkers and associations with pathogenic pathways. J Biomed Sci 2021; 28:78. [PMID: 34794432 PMCID: PMC8603508 DOI: 10.1186/s12929-021-00775-x] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Received: 07/31/2021] [Accepted: 11/04/2021] [Indexed: 02/08/2023]
Abstract
The discovery of various noncoding RNAs (ncRNAs) and their biological implications is a growing area in cell biology. Increasing evidence has revealed canonical and noncanonical functions of long and small ncRNAs, including microRNAs, long ncRNAs (lncRNAs), circular RNAs, PIWI-interacting RNAs, and tRNA-derived fragments. These ncRNAs have the ability to regulate gene expression and modify metabolic pathways. Thus, they may have important roles as diagnostic biomarkers or therapeutic targets in various diseases, including neurodegenerative disorders, especially Parkinson's disease. Recently, through diverse sequencing technologies and a wide variety of bioinformatic analytical tools, such as reverse transcriptase quantitative PCR, microarrays, next-generation sequencing and long-read sequencing, numerous ncRNAs have been shown to be associated with neurodegenerative disorders, including Parkinson's disease. In this review article, we will first introduce the biogenesis of different ncRNAs, including microRNAs, PIWI-interacting RNAs, circular RNAs, long noncoding RNAs, and tRNA-derived fragments. The pros and cons of the detection platforms of ncRNAs and the reproducibility of bioinformatic analytical tools will be discussed in the second part. Finally, the recent discovery of numerous PD-associated ncRNAs and their association with the diagnosis and pathophysiology of PD are reviewed, and microRNAs and long ncRNAs that are transported by exosomes in biofluids are particularly emphasized.
Affiliation(s)
- Ming-Che Kuo
  - Department of Medicine, Section of Neurology, Cancer Center, National Taiwan University Hospital, Taipei, Taiwan
  - Department of Neurology, National Taiwan University Hospital, College of Medicine, National Taiwan University, Taipei, Taiwan
- Sam Chi-Hao Liu
  - Department of Neurology, National Taiwan University Hospital, College of Medicine, National Taiwan University, Taipei, Taiwan
- Ya-Fang Hsu
  - Graduate Institute of Brain and Mind Sciences, College of Medicine, National Taiwan University, Taipei, Taiwan
- Ruey-Meei Wu
  - Department of Neurology, National Taiwan University Hospital, College of Medicine, National Taiwan University, Taipei, Taiwan
  - Graduate Institute of Brain and Mind Sciences, College of Medicine, National Taiwan University, Taipei, Taiwan
54. Comparative Analysis of PacBio and Oxford Nanopore Sequencing Technologies for Transcriptomic Landscape Identification of Penaeus monodon. Life (Basel) 2021; 11:862. [PMID: 34440606 PMCID: PMC8399832 DOI: 10.3390/life11080862] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 07/09/2021] [Revised: 08/07/2021] [Accepted: 08/17/2021] [Indexed: 12/16/2022]
Abstract
Long-read sequencing platforms such as Pacific Biosciences (PacBio; Menlo Park, CA, USA) and Oxford Nanopore Technologies (ONT; Oxford, UK) offer advantages that research fields such as genomics and transcriptomics can exploit. Selecting an appropriate sequencing platform is crucial to the success of a study; there is therefore a need to compare these long-read platforms and evaluate them for specific research questions. This study aims to compare the performance of PacBio and ONT platforms for transcriptomic analysis by utilizing transcriptome data from three different tissues (hepatopancreas, intestine, and gonads) of the juvenile black tiger shrimp, Penaeus monodon. We compared three important features: (i) main characteristics of the sequencing libraries and their alignment with the reference genome, (ii) transcript assembly features and isoform identification, and (iii) correlation of the quantification of gene expression levels for both platforms. Our analyses suggest that read-length bias and differences in sequencing throughput are highly influential factors when using long reads in transcriptome studies. These comparisons can provide a guideline when designing a transcriptome study utilizing these two long-read sequencing technologies.
55. De Paoli-Iseppi R, Gleeson J, Clark MB. Isoform Age - Splice Isoform Profiling Using Long-Read Technologies. Front Mol Biosci 2021; 8:711733. [PMID: 34409069 PMCID: PMC8364947 DOI: 10.3389/fmolb.2021.711733] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 05/19/2021] [Accepted: 07/19/2021] [Indexed: 01/12/2023]
Abstract
Alternative splicing (AS) of RNA is a key mechanism that results in the expression of multiple transcript isoforms from single genes and leads to an increase in the complexity of both the transcriptome and proteome. Regulation of AS is critical for the correct functioning of many biological pathways, while disruption of AS can be directly pathogenic in diseases such as cancer or cause risk for complex disorders. Current short-read sequencing technologies achieve high read depth but are limited in their ability to resolve complex isoforms. In this review we examine how long-read sequencing (LRS) technologies can address this challenge by covering the entire RNA sequence in a single read and thereby distinguish isoform changes that could impact RNA regulation or protein function. Coupling LRS with technologies such as single cell sequencing, targeted sequencing and spatial transcriptomics is producing a rapidly expanding suite of technological approaches to profile alternative splicing at the isoform level with unprecedented detail. In addition, integrating LRS with genotype now allows the impact of genetic variation on isoform expression to be determined. Recent results demonstrate the potential of these techniques to elucidate the landscape of splicing, including in tissues such as the brain where AS is particularly prevalent. Finally, we also discuss how AS can impact protein function, potentially leading to novel therapeutic targets for a range of diseases.
Affiliation(s)
- Michael B. Clark
  - Centre for Stem Cell Systems, Department of Anatomy and Physiology, The University of Melbourne, Parkville, VIC, Australia
56. Naftaly AS, Pau S, White MA. Long-read RNA sequencing reveals widespread sex-specific alternative splicing in threespine stickleback fish. Genome Res 2021; 31:1486-1497. [PMID: 34131005 PMCID: PMC8327910 DOI: 10.1101/gr.274282.120] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 11/11/2020] [Accepted: 06/15/2021] [Indexed: 01/07/2023]
Abstract
Alternate isoforms are important contributors to phenotypic diversity across eukaryotes. Although short-read RNA-sequencing has increased our understanding of isoform diversity, it is challenging to accurately detect full-length transcripts, preventing the identification of many alternate isoforms. Long-read sequencing technologies have made it possible to sequence full-length alternative transcripts, accurately characterizing alternative splicing events, alternate transcription start and end sites, and differences in UTR regions. Here, we use Pacific Biosciences (PacBio) long-read RNA-sequencing (Iso-Seq) to examine the transcriptomes of five organs in threespine stickleback fish (Gasterosteus aculeatus), a widely used genetic model species. The threespine stickleback fish has a refined genome assembly in which gene annotations are based on short-read RNA sequencing and predictions from coding sequence of other species. This suggests some of the existing annotations may be inaccurate or alternative transcripts may not be fully characterized. Using Iso-Seq we detected thousands of novel isoforms, indicating many isoforms are absent in the current Ensembl gene annotations. In addition, we refined many of the existing annotations within the genome. We noted many improperly positioned transcription start sites that were refined with long-read sequencing. The Iso-Seq-predicted transcription start sites were more accurate and verified through ATAC-seq. We also detected many alternative splicing events between sexes and across organs. We found a substantial number of genes in both somatic and gonadal samples that had sex-specific isoforms. Our study highlights the power of long-read sequencing to study the complexity of transcriptomes, greatly improving genomic resources for the threespine stickleback fish.
Affiliation(s)
- Alice S Naftaly
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
- Shana Pau
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
  - Department of Biology, University of Texas Arlington, Arlington, Texas 76019, USA
- Michael A White
  - Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
57. Zea DJ, Laskina S, Baudin A, Richard H, Laine E. Assessing conservation of alternative splicing with evolutionary splicing graphs. Genome Res 2021; 31:1462-1473. [PMID: 34266979 PMCID: PMC8327911 DOI: 10.1101/gr.274696.120] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 11/27/2020] [Accepted: 06/11/2021] [Indexed: 12/29/2022]
Abstract
Understanding how protein function has evolved and diversified is of great importance for human genetics and medicine. Here, we tackle the problem of describing the whole transcript variability observed in several species by generalizing the definition of splicing graph. We provide a practical solution to construct parsimonious evolutionary splicing graphs where each node is a minimal transcript building block defined across species. We show a clear link between the functional relevance, tissue regulation, and conservation of alternative transcripts on a set of 50 genes. By scaling up to the whole human protein-coding genome, we identify a few thousand genes where alternative splicing modulates the number and composition of pseudorepeats. We have implemented our approach in ThorAxe, an efficient, versatile, robust, and freely available computational tool.
Affiliation(s)
- Diego Javier Zea
  - Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
- Sofya Laskina
  - Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany
- Alexis Baudin
  - Sorbonne Université, CNRS, LIP6, F-75005 Paris, France
- Hugues Richard
  - Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
  - Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany
- Elodie Laine
  - Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
58. Chiu YJ, Tsai FJ, Bau DT, Chang LC, Hsieh MT, Lu CC, Kuo SC, Yang JS. Next‑generation sequencing analysis reveals that MTH‑3, a novel curcuminoid derivative, suppresses the invasion of MDA‑MB‑231 triple‑negative breast adenocarcinoma cells. Oncol Rep 2021; 46:133. [PMID: 34013378 PMCID: PMC8144931 DOI: 10.3892/or.2021.8084] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 01/05/2021] [Accepted: 04/19/2021] [Indexed: 12/15/2022]
Abstract
Triple‑negative breast cancer (TNBC) behaves aggressively in the invasive and metastatic states. Our research group recently developed a novel curcumin derivative, (1E,3Z,6E)-3-hydroxy-5-oxohepta-1,3,6-triene-1,7-diyl)bis(2‑methoxy-4,1‑phenylene)bis(3-hydroxy-2-hydroxymethyl)-2‑methyl propanoate (MTH‑3), and previous studies showed that MTH‑3 inhibits TNBC proliferation and induces apoptosis in vitro and in vivo, with superior bioavailability and absorption compared with curcumin. In the present study, the effects of MTH‑3 on TNBC cell invasion were examined using invasion and migration assays, gelatin zymography, and western blot analysis. Treatment with MTH‑3 inhibited MDA‑MB‑231 cell invasion and migration, as shown by Transwell assay, 3D spheroid invasion assay, and wound healing assay. The results of the gelatin zymography experiments revealed that MTH‑3 decreased matrix metalloproteinase‑9 activity. The potential signaling pathways were revealed by next‑generation sequencing analysis, antibody microarray analysis and western blot analysis. In conclusion, the results of the present study show that MTH‑3 inhibited tumor cell invasion through the MAPK/ERK/AKT signaling pathway and cell cycle regulatory cascade, providing significant information about the potential molecular mechanisms of the effects of MTH‑3 on TNBC.
Affiliation(s)
- Yu-Jen Chiu
  - Division of Plastic and Reconstructive Surgery, Department of Surgery, Taipei Veterans General Hospital, Taipei 11217, Taiwan, R.O.C.
  - Department of Surgery, School of Medicine, National Yang Ming Chiao Tung University, Taipei 11221, Taiwan, R.O.C.
  - Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei 11221, Taiwan, R.O.C.
- Fuu-Jen Tsai
  - Human Genetic Center, China Medical University, Taichung 40402, Taiwan, R.O.C.
  - School of Chinese Medicine, China Medical University, Taichung 40402, Taiwan, R.O.C.
- Da-Tian Bau
  - Department of Medical Research, China Medical University Hospital, China Medical University, Taichung 40402, Taiwan, R.O.C.
  - Department of Bioinformatics and Medical Engineering, Asia University, Taichung 41354, Taiwan, R.O.C.
- Ling-Chu Chang
  - Chinese Medicinal Research and Development Center, China Medical University Hospital, Taichung 40402, Taiwan, R.O.C.
- Min-Tsang Hsieh
  - Chinese Medicinal Research and Development Center, China Medical University Hospital, Taichung 40402, Taiwan, R.O.C.
  - School of Pharmacy, China Medical University, Taichung 40402, Taiwan, R.O.C.
- Chi-Cheng Lu
  - Department of Sport Performance, National Taiwan University of Sport, Taichung 40402, Taiwan, R.O.C.
- Sheng-Chu Kuo
  - Chinese Medicinal Research and Development Center, China Medical University Hospital, Taichung 40402, Taiwan, R.O.C.
  - School of Pharmacy, China Medical University, Taichung 40402, Taiwan, R.O.C.
- Jai-Sing Yang
  - Department of Medical Research, China Medical University Hospital, China Medical University, Taichung 40402, Taiwan, R.O.C.
59. Massaiu I, Songia P, Chiesa M, Valerio V, Moschetta D, Alfieri V, Myasoedova VA, Schmid M, Cassetta L, Colombo GI, D’Alessandra Y, Poggio P. Evaluation of Oxford Nanopore MinION RNA-Seq Performance for Human Primary Cells. Int J Mol Sci 2021; 22:6317. [PMID: 34204756 PMCID: PMC8231517 DOI: 10.3390/ijms22126317] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 04/20/2021] [Revised: 05/17/2021] [Accepted: 06/09/2021] [Indexed: 12/12/2022]
Abstract
Transcript sequencing is a crucial tool for gaining a deep understanding of biological processes in diagnostic and clinical medicine. Given their potential to study novel complex eukaryotic transcriptomes, long-read sequencing technologies are able to overcome some limitations of short-read RNA-Seq approaches. Oxford Nanopore Technologies (ONT) offers the ability to generate long-read sequencing data in real time via portable protein nanopore USB devices. This work aimed to provide the user with the number of reads that should be sequenced, through the ONT MinION platform, to reach the desired accuracy level for a human cell RNA study. We sequenced three cDNA libraries prepared from poly-adenosine RNA of human primary cardiac fibroblasts. Since the runs were comparable, they were combined into a total dataset of 48 million reads. Synthetic datasets of different sizes were generated from the total dataset and analyzed in terms of the number of identified genes and their expression levels. As expected, sensitivity improved with increasing sequencing depth, particularly for the non-coding genes. The reliability of expression levels was assayed by (i) comparison with PCR quantifications of selected genes and (ii) implementation of a user-friendly multiplexing method in a single run.
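The subsampling design summarised in this abstract (drawing synthetic datasets of different sizes from the pooled reads and counting the genes identified at each size) can be illustrated with a minimal sketch. It assumes each read has already been assigned to a gene; the function names `genes_detected` and `saturation_curve`, the `min_reads` threshold, and the toy read list are hypothetical and are not taken from the study.

```python
import random
from collections import Counter


def genes_detected(read_genes, depth, min_reads=1, seed=0):
    """Subsample `depth` reads without replacement and count the genes
    supported by at least `min_reads` of the sampled reads."""
    if depth > len(read_genes):
        raise ValueError("depth exceeds the number of available reads")
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    sample = rng.sample(read_genes, depth)
    counts = Counter(sample)
    return sum(1 for n in counts.values() if n >= min_reads)


def saturation_curve(read_genes, depths, min_reads=1, seed=0):
    """Return (depth, genes detected) pairs, one per requested depth."""
    return [(d, genes_detected(read_genes, d, min_reads, seed)) for d in depths]


# Toy input (illustrative only): one gene label per read, with one
# abundant, one moderate, and one rare gene.
reads = ["GENE_A"] * 900 + ["GENE_B"] * 90 + ["GENE_C"] * 10
curve = saturation_curve(reads, [10, 100, 1000])
for depth, n_genes in curve:
    print(depth, n_genes)
```

Rare genes such as the hypothetical `GENE_C` tend to be missed at shallow depths, which is the kind of depth-dependent sensitivity the abstract reports for non-coding genes.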
Affiliation(s)
- Ilaria Massaiu
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
- Paola Songia
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
- Mattia Chiesa
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
- Vincenza Valerio
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
  - Dipartimento di Medicina Clinica e Chirurgia, Università degli Studi di Napoli Federico II, 80131 Napoli, Italy
- Donato Moschetta
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
  - Dipartimento di Scienze Farmacologiche e Biomolecolari, Università degli Studi di Milano, 20133 Milano, Italy
- Valentina Alfieri
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
- Veronika A. Myasoedova
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
- Michael Schmid
  - Genexa AG, Dienerstrasse 7, CH-8004 Zürich, Switzerland
- Luca Cassetta
  - The Queen’s Medical Research Council Centre for Reproductive Health, University of Edinburgh, Edinburgh EH16 4TJ, UK
- Gualtiero I. Colombo
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
- Yuri D’Alessandra
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
- Paolo Poggio
  - Centro Cardiologico Monzino IRCCS, 20131 Milan, Italy
60. Sun D, Zhang L, Yu Q, Zhang J, Li P, Zhang Y, Xing X, Ding L, Fang W, Chen F, Song A. Integrated Signals of Jasmonates, Sugars, Cytokinins and Auxin Influence the Initial Growth of the Second Buds of Chrysanthemum after Decapitation. Biology 2021; 10:440. [PMID: 34065759 PMCID: PMC8156878 DOI: 10.3390/biology10050440] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 04/12/2021] [Revised: 05/13/2021] [Accepted: 05/14/2021] [Indexed: 11/16/2022]
Abstract
Decapitation is common in horticulture for altering plant architecture. The decapitation of chrysanthemum plants breaks apical dominance and leads to more flowers on lateral branches, resulting in landscape flowers with good coverage. We performed both third- and second-generation transcriptome sequencing of the second buds of chrysanthemum. This is the first third-generation transcriptome sequenced for chrysanthemum, and it reveals alternative splicing events, lncRNAs, and transcription factors. Aside from the classic hormones, the expression of jasmonate-related genes changed in response to decapitation. Sugars also played an important role, with upregulated expression of sucrose transport-related and TPS genes. We constructed a model of the initial growth of the second buds after decapitation. Auxin export and sugar influx activated the growth of these buds, while the JA-Ile caused by wounding inhibited the expression of CycD genes from 0 h to 6 h. After wound recovery, cytokinins accumulated in the second buds and might have induced ARR12 expression to upregulate CycD gene expression from 6 h to 48 h, together with sugars. Therefore, jasmonates, cytokinins, sugars, and auxin work together to determine the fate of the buds of plants with short internodes, such as chrysanthemum.
Collapse
Affiliation(s)
- Daojin Sun
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Luyao Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Qi Yu
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Jiali Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Peiling Li
- Henan Key Laboratory of Tea Comprehensive Utilization in South Henan, Xinyang Agriculture and Forestry University, Xinyang 464000, China;
| | - Yu Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Xiaojuan Xing
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Lian Ding
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Weimin Fang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Fadi Chen
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
| | - Aiping Song
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Landscaping, Ministry of Agriculture and Rural Affairs, College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China; (D.S.); (L.Z.); (Q.Y.); (J.Z.); (Y.Z.); (X.X.); (L.D.); (W.F.); (F.C.)
- Correspondence:
| |
|
61
|
Lecluze E, Rolland AD, Filis P, Evrard B, Leverrier-Penna S, Maamar MB, Coiffec I, Lavoué V, Fowler PA, Mazaud-Guittot S, Jégou B, Chalmel F. Dynamics of the transcriptional landscape during human fetal testis and ovary development. Hum Reprod 2021; 35:1099-1119. [PMID: 32412604 DOI: 10.1093/humrep/deaa041] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 07/25/2019] [Revised: 02/10/2020] [Indexed: 12/17/2022] Open
Abstract
STUDY QUESTION Which transcriptional program triggers sex differentiation in bipotential gonads and downstream cellular events governing fetal testis and ovary development in humans? SUMMARY ANSWER The characterization of a dynamically regulated protein-coding and non-coding transcriptional landscape in developing human gonads of both sexes highlights a large number of potential key regulators that show an early sexually dimorphic expression pattern. WHAT IS KNOWN ALREADY Gonadal sex differentiation is orchestrated by a sexually dimorphic gene expression program in XX and XY developing fetal gonads. A comprehensive characterization of its non-coding counterpart offers promising perspectives for deciphering the molecular events underpinning gonad development and for a complete understanding of the etiology of disorders of sex development in humans. STUDY DESIGN, SIZE, DURATION To further investigate the protein-coding and non-coding transcriptional landscape during gonad differentiation, we used RNA-sequencing (RNA-seq) and characterized the RNA content of human fetal testis (N = 24) and ovaries (N = 24) from 6 to 17 postconceptional week (PCW), a key period in sex determination and gonad development. PARTICIPANTS/MATERIALS, SETTING, METHODS First trimester fetuses (6-12 PCW) and second trimester fetuses (13-14 and 17 PCW) were obtained from legally induced normally progressing terminations of pregnancy. Total RNA was extracted from whole human fetal gonads and sequenced as paired-end 2 × 50 base reads. Resulting sequences were mapped to the human genome, allowing for the assembly and quantification of corresponding transcripts. MAIN RESULTS AND THE ROLE OF CHANCE This RNA-seq analysis of human fetal testes and ovaries at seven key developmental stages led to the reconstruction of 22 080 transcripts differentially expressed during testicular and/or ovarian development. 
In addition to 8935 transcripts displaying sex-independent differential expression during gonad development, the comparison of testes and ovaries enabled the discrimination of 13 145 transcripts that show a sexually dimorphic expression profile. The latter include 1479 transcripts differentially expressed as early as 6 PCW, including 39 transcription factors, 40 long non-coding RNAs and 20 novel genes. Despite the use of stringent filtration criteria (expression cut-off of at least 1 fragment per kilobase of exon model per million reads mapped, fold change of at least 2 and false discovery rate adjusted P values of less than 1%), the possibility of assembly artifacts and of false-positive differentially expressed transcripts cannot be fully ruled out. LARGE-SCALE DATA Raw data files (fastq) and a searchable table (.xlsx) containing information on genomic features and expression data for all refined transcripts have been submitted to the NCBI GEO under accession number GSE116278. LIMITATIONS, REASONS FOR CAUTION The intrinsic nature of this bulk analysis, i.e. the sequencing of transcripts from whole gonads, does not allow direct identification of the cellular origin(s) of the transcripts characterized. Potential cellular dilution effects (e.g. as a result of distinct proliferation rates in XX and XY gonads) may account for a few of the expression profiles identified as being sexually dimorphic. Finally, transcriptome alterations that would result from exposure to pre-abortive drugs cannot be completely excluded. Although we demonstrated the high quality of the sorted cell populations used for experimental validations using quantitative RT-PCR, it cannot be totally excluded that some germline expression may correspond to cell contamination by, for example, macrophages.
WIDER IMPLICATIONS OF THE FINDINGS For the first time, this study has led to the identification of 1000 protein-coding and non-coding candidate genes showing an early, sexually dimorphic, expression pattern that have not previously been associated with sex differentiation. Collectively, these results increase our understanding of gonad development in humans, and contribute significantly to the identification of new candidate genes involved in fetal gonad differentiation. The results also provide a unique resource that may improve our understanding of the fetal origin of testicular and ovarian dysgenesis syndromes, including cryptorchidism and testicular cancers. STUDY FUNDING/COMPETING INTEREST(S) This work was supported by the French National Institute of Health and Medical Research (Inserm), the University of Rennes 1, the French School of Public Health (EHESP), the Swiss National Science Foundation [SNF n° CRS115_171007 to B.J.], the French National Research Agency [ANR n° 16-CE14-0017-02 and n° 18-CE14-0038-02 to F.C.], the Medical Research Council [MR/L010011/1 to P.A.F.] and the European Community's Seventh Framework Programme (FP7/2007-2013) [under grant agreement no 212885 to P.A.F.] and from the European Union's Horizon 2020 Research and Innovation Programme [under grant agreement no 825100 to P.A.F. and S.M.G.]. There are no competing interests related to this study.
Affiliation(s)
- Estelle Lecluze
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| | - Antoine D Rolland
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| | - Panagiotis Filis
- Institute of Medical Sciences, School of Medicine, Medical Sciences & Nutrition, University of Aberdeen, Foresterhill, Aberdeen, AB25 2ZD, UK
| | - Bertrand Evrard
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| | - Sabrina Leverrier-Penna
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France; Univ Poitiers, STIM, CNRS ERL7003, Poitiers Cedex 9, France
| | - Millissia Ben Maamar
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| | - Isabelle Coiffec
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| | - Vincent Lavoué
- Service Gynécologie et Obstétrique, CHU Rennes, F-35000 Rennes, France
| | - Paul A Fowler
- Institute of Medical Sciences, School of Medicine, Medical Sciences & Nutrition, University of Aberdeen, Foresterhill, Aberdeen, AB25 2ZD, UK
| | - Séverine Mazaud-Guittot
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| | - Bernard Jégou
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| | - Frédéric Chalmel
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35000 Rennes, France
| |
|
62
|
Liu S, Wu I, Yu YP, Balamotis M, Ren B, Ben Yehezkel T, Luo JH. Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer. Commun Biol 2021; 4:506. [PMID: 33907296 PMCID: PMC8079361 DOI: 10.1038/s42003-021-02024-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 10/06/2020] [Accepted: 03/09/2021] [Indexed: 02/02/2023] Open
Abstract
The characterization of human gene expression is limited by short read lengths, high error rates and large input requirements. Here, we used a synthetic long read (SLR) sequencing approach, LoopSeq, to generate accurate sequencing reads that span full length transcripts using standard short read data. LoopSeq identified isoforms from control samples with 99.4% accuracy and a 0.01% per-base error rate, exceeding the accuracy reported for other long-read technologies. Applied to targeted transcriptome sequencing from colon cancers and their metastatic counterparts, LoopSeq revealed large scale isoform redistributions from benign colon mucosa to primary colon cancer and metastatic cancer and identified several previously unknown fusion isoforms. Strikingly, single nucleotide variants (SNVs) occurred dominantly in specific isoforms and some SNVs underwent isoform switching in cancer progression. The ability to use short reads to generate accurate long-read data as the raw unit of information holds promise as a widely accessible approach in transcriptome sequencing.
Affiliation(s)
- Silvia Liu
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- Pittsburgh Liver Research Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
| | - Indira Wu
- Loop Genomics, Inc., San Jose, CA, 95138, USA
| | - Yan-Ping Yu
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- Pittsburgh Liver Research Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
| | | | - Baoguo Ren
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA
| | | | - Jian-Hua Luo
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA.
- High Throughput Genome Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA.
- Pittsburgh Liver Research Center, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15261, USA.
| |
|
63
|
Graw S, Chappell K, Washam CL, Gies A, Bird J, Robeson MS, Byrum SD. Multi-omics data integration considerations and study design for biological systems and disease. Mol Omics 2021; 17:170-185. [PMID: 33347526 PMCID: PMC8058243 DOI: 10.1039/d0mo00041h] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Indexed: 12/20/2022]
Abstract
With the advancement of next-generation sequencing and mass spectrometry, there is a growing need for the ability to merge biological features in order to study a system as a whole. Features such as the transcriptome, methylome, proteome, histone post-translational modifications and the microbiome all influence the host response to various diseases and cancers. Each of these platforms has technological limitations due to sample preparation steps, amount of material needed for sequencing, and sequencing depth requirements. These features provide a snapshot of one level of regulation in a system. The obvious next step is to integrate this information and learn how genes, proteins, and/or epigenetic factors influence the phenotype of a disease in the context of the system. In recent years, there has been a push for the development of data integration methods. Each method specifically integrates a subset of omics data using approaches such as conceptual integration, statistical integration, model-based integration, networks, and pathway data integration. In this review, we discuss considerations of the study design for each data feature, the limitations in gene and protein abundance and their rate of expression, the current data integration methods, and microbiome influences on gene and protein expression. The considerations discussed in this review should be taken into account when developing new algorithms for integrating multi-omics data.
Affiliation(s)
- Stefan Graw
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Kevin Chappell
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Charity L Washam
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA; Arkansas Children's Research Institute, 13 Children's Way, Little Rock, AR 72202, USA
| | - Allen Gies
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Jordan Bird
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA.
| | - Michael S Robeson
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA.
| | - Stephanie D Byrum
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, 4301 West Markham Street (slot 516), Little Rock, AR 72205-7199, USA; Arkansas Children's Research Institute, 13 Children's Way, Little Rock, AR 72202, USA
| |
|
64
|
Transcriptional Landscape of Vero E6 Cells during Early Swine Acute Diarrhea Syndrome Coronavirus Infection. Viruses 2021; 13:v13040674. [PMID: 33919952 PMCID: PMC8070899 DOI: 10.3390/v13040674] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 02/26/2021] [Revised: 04/07/2021] [Accepted: 04/12/2021] [Indexed: 01/17/2023] Open
Abstract
Swine acute diarrhea syndrome coronavirus (SADS-CoV) is a newly emerged and highly pathogenic virus that is associated with fatal diarrheal disease in piglets, causing significant economic losses to the pig industry. At present, research on the pathogenicity of SADS-CoV and the molecular mechanisms of its host-virus interactions is limited, and both remain poorly understood. Here, we investigated the global gene expression profiles of SADS-CoV-infected Vero E6 cells at 12, 18, and 24 h post-infection (hpi) using RNA sequencing. A total of 3324 differentially expressed genes (DEGs) were identified, most of which showed a down-regulated expression pattern. Functional enrichment analyses indicated that the DEGs are mainly involved in signal transduction, cellular transcription, immune and inflammatory response, and autophagy. Collectively, our results provide insights into the changes in the cellular transcriptome during early infection of SADS-CoV and may provide information for further study of molecular mechanisms.
|
65
|
Liu Q, Liaquat F, He Y, Munis MFH, Zhang C. Functional Annotation of a Full-Length Transcriptome and Identification of Genes Associated with Flower Development in Rhododendron simsii (Ericaceae). Plants (Basel, Switzerland) 2021; 10:649. [PMID: 33805478 PMCID: PMC8065783 DOI: 10.3390/plants10040649] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 02/27/2021] [Revised: 03/21/2021] [Accepted: 03/24/2021] [Indexed: 11/16/2022]
Abstract
Rhododendron simsii is one of the top ten famous flowers in China. Due to its historical and high aesthetic value, it is widely popular among Chinese people. Various colors are important breeding objectives in Rhododendron L. Understanding the molecular mechanism of flower color formation can provide a theoretical basis for the improvement of flower color in Rhododendron L. To generate the R. simsii transcriptome, PacBio sequencing technology was used. A total of 833,137 full-length non-chimeric reads were obtained and 726,846 high-quality full-length transcripts were found. Moreover, 40,556 total open reading frames were obtained, of which 36,018 were complete. In gene annotation analyses, 39,411, 18,565, 16,102 and 17,450 transcripts were assigned to the GO, Nr, KEGG and COG databases, respectively. To identify long non-coding RNAs (lncRNAs), we utilized four computational methods, Protein families (Pfam), Coding Potential Calculator (CPC), Coding-Potential Assessment Tool (CPAT) and Coding-Non-Coding Index (CNCI), and observed 6170, 2265, 4084 and 1240 lncRNAs, respectively. Based on the results, most genes were enriched in the flavonoid biosynthetic pathway. Eight key genes in the anthocyanin biosynthetic pathway were further selected and analyzed by qRT-PCR. F3'H and ANS showed an upward trend across the developmental stages of R. simsii. The highest expression of F3'5'H and FLS was observed during petal color formation in R. simsii. This research provided a large number of full-length transcripts, which will help advance genetic analyses of R. simsii, a native semi-deciduous shrub.
Affiliation(s)
- Qunlu Liu
- Department of Landscape Architecture, School of Design, Shanghai Jiao Tong University, Shanghai 200240, China; (Q.L.); (Y.H.)
| | - Fiza Liaquat
- School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China;
| | - Yefeng He
- Department of Landscape Architecture, School of Design, Shanghai Jiao Tong University, Shanghai 200240, China; (Q.L.); (Y.H.)
| | | | - Chunying Zhang
- Shanghai Engineering Research Center of Sustainable Plant Innovation, Shanghai Botanical Garden, Shanghai 200231, China
| |
|
66
|
Westermann AJ, Vogel J. Cross-species RNA-seq for deciphering host-microbe interactions. Nat Rev Genet 2021; 22:361-378. [PMID: 33597744 DOI: 10.1038/s41576-021-00326-y] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Accepted: 01/05/2021] [Indexed: 02/08/2023]
Abstract
The human body is constantly exposed to microorganisms, which entails manifold interactions between human cells and diverse commensal or pathogenic bacteria. The cellular states of the interacting cells are decisive for the outcome of these encounters such as whether bacterial virulence programmes and host defence or tolerance mechanisms are induced. This Review summarizes how next-generation RNA sequencing (RNA-seq) has become a primary technology to study host-microbe interactions with high resolution, improving our understanding of the physiological consequences and the mechanisms at play. We illustrate how the discriminatory power and sensitivity of RNA-seq helps to dissect increasingly complex cellular interactions in time and space down to the single-cell level. We also outline how future transcriptomics may answer currently open questions in host-microbe interactions and inform treatment schemes for microbial disorders.
Affiliation(s)
- Alexander J Westermann
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany; Institute for Molecular Infection Biology (IMIB), University of Würzburg, Würzburg, Germany.
| | - Jörg Vogel
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany; Institute for Molecular Infection Biology (IMIB), University of Würzburg, Würzburg, Germany.
| |
|
67
|
The Alter Retina: Alternative Splicing of Retinal Genes in Health and Disease. Int J Mol Sci 2021; 22:ijms22041855. [PMID: 33673358 PMCID: PMC7917623 DOI: 10.3390/ijms22041855] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Received: 01/25/2021] [Revised: 02/08/2021] [Accepted: 02/09/2021] [Indexed: 12/14/2022] Open
Abstract
Alternative splicing of mRNA is an essential mechanism to regulate and increase the diversity of the transcriptome and proteome. Alternative splicing frequently occurs in a tissue- or time-specific manner, contributing to differential gene expression between cell types during development. Neural tissues present extremely complex splicing programs and display the highest number of alternative splicing events. As an extension of the central nervous system, the retina constitutes an excellent system to illustrate the high diversity of neural transcripts. The retina expresses retinal specific splicing factors and produces a large number of alternative transcripts, including exclusive tissue-specific exons, which require an exquisite regulation. In fact, a current challenge in the genetic diagnosis of inherited retinal diseases stems from the lack of information regarding alternative splicing of retinal genes, as a considerable percentage of mutations alter splicing or the relative production of alternative transcripts. Modulation of alternative splicing in the retina is also instrumental in the design of novel therapeutic approaches for retinal dystrophies, since it enables precision medicine for specific mutations.
|
68
|
Sahlin K, Medvedev P. Error correction enables use of Oxford Nanopore technology for reference-free transcriptome analysis. Nat Commun 2021; 12:2. [PMID: 33397972 PMCID: PMC7782715 DOI: 10.1038/s41467-020-20340-8] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Received: 05/27/2020] [Accepted: 11/25/2020] [Indexed: 01/24/2023] Open
Abstract
Oxford Nanopore (ONT) is a leading long-read technology which has been revolutionizing transcriptome analysis through its capacity to sequence the majority of transcripts from end-to-end. This has greatly increased our ability to study the diversity of transcription mechanisms such as transcription initiation, termination, and alternative splicing. However, ONT still suffers from high error rates which have thus far limited its scope to reference-based analyses. When a reference is not available or is not a viable option due to reference-bias, error correction is a crucial step towards the reconstruction of the sequenced transcripts and downstream sequence analysis of transcripts. In this paper, we present a novel computational method to error correct ONT cDNA sequencing data, called isONcorrect. IsONcorrect is able to jointly use all isoforms from a gene during error correction, thereby allowing it to correct reads at low sequencing depths. We are able to obtain a median accuracy of 98.9-99.6%, demonstrating the feasibility of applying cost-effective cDNA full transcript length sequencing for reference-free transcriptome analysis.
Affiliation(s)
- Kristoffer Sahlin
- Department of Mathematics, Science for Life Laboratory, Stockholm University, 106 91, Stockholm, Sweden
| | - Paul Medvedev
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA, USA.
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA.
- Center for Computational Biology and Bioinformatics, The Pennsylvania State University, University Park, PA, USA.
| |
|
69
|
Niazi AM, Krause M, Valen E. Transcript Isoform-Specific Estimation of Poly(A) Tail Length by Nanopore Sequencing of Native RNA. Methods Mol Biol 2021; 2284:543-567. [PMID: 33835463 DOI: 10.1007/978-1-0716-1307-8_30] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 06/12/2023]
Abstract
The poly(A) tail is a homopolymeric stretch of adenosine at the 3'-end of mature RNA transcripts and its length plays an important role in nuclear export, stability, and translational regulation of mRNA. Existing techniques for genome-wide estimation of poly(A) tail length are based on short-read sequencing. These methods are limited because they sequence a synthetic DNA copy of mRNA instead of the native transcripts. Furthermore, they can identify only a short segment of the transcript proximal to the poly(A) tail, which makes it difficult to assign the measured poly(A) length uniquely to a single transcript isoform. With the introduction of native RNA sequencing by Oxford Nanopore Technologies, it is now possible to sequence full-length native RNA. A single long read contains both the transcript and the associated poly(A) tail, thereby making transcriptome-wide isoform-specific poly(A) tail length assessment feasible. We developed tailfindr, an R-based package for estimating poly(A) tail length from Oxford Nanopore sequencing data. In this chapter, we describe in detail the pipeline for transcript isoform-specific poly(A) tail profiling based on native RNA Nanopore sequencing, from library preparation to downstream data analysis with tailfindr.
Affiliation(s)
- Adnan M Niazi
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - Maximilian Krause
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway.
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway.
| | - Eivind Valen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway
| |
|
70
|
Molecular assessment and transcriptome profiling of wild fish populations of Oryzias mekongensis and O. songkhramensis (Adrianichthyidae: Beloniformes) from Thailand. PLoS One 2020; 15:e0242382. [PMID: 33211755 PMCID: PMC7676673 DOI: 10.1371/journal.pone.0242382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/22/2020] [Accepted: 11/01/2020] [Indexed: 11/19/2022] Open
Abstract
Among the fish of the genus Oryzias, two species are frequently used as model animals in biological research. In Thailand, Oryzias mekongensis is usually found in natural freshwater near the Mekong Basin in the northeast region, while O. songkhramensis inhabits the Songkhram Basin. For differential morphological identification, the coloured bands on the dorsal and ventral margins of the caudal fin are used to distinguish O. mekongensis from O. songkhramensis. However, these characteristics are insufficient to justify species differentiation, and little molecular evidence is available to supplement them. This study aimed to investigate the molecular population and transcriptome profiles of adult O. mekongensis and O. songkhramensis. In the molecular tree based on cytochrome b sequences, O. mekongensis exhibited four clades that were clearly distinguished from O. songkhramensis. Clade 1 of the O. mekongensis population was close to the Mekong River and lived in the eastern portion of the upper northeast region. Clade 2 was far from the Mekong River and inhabited the middle region of the Songkhram River. Clade 3 was positioned to the west of the Songkhram River, and clade 4 was to the south of the Songkhram River Basin. After RNA sequencing using an Illumina HiSeq 2500 platform, the gene category annotations hardly differentiated the species and were discussed in the text. Based on the present findings, population dispersal of these Oryzias species might be associated with geographic variations of the upper northeast region. Molecular genetics and transcriptome profiling might advance our understanding of the evolution of teleost fish.
|
71
|
Dahal S, Yurkovich JT, Xu H, Palsson BO, Yang L. Synthesizing Systems Biology Knowledge from Omics Using Genome-Scale Models. Proteomics 2020; 20:e1900282. [PMID: 32579720 PMCID: PMC7501203 DOI: 10.1002/pmic.201900282] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 03/08/2020] [Revised: 06/13/2020] [Indexed: 12/18/2022]
Abstract
Omic technologies have enabled the complete readout of the molecular state of a cell at different biological scales. In principle, the combination of multiple omic data types can provide an integrated view of the entire biological system. This integration requires appropriate models in a systems biology approach. Here, genome-scale models (GEMs) are focused upon as one computational systems biology approach for interpreting and integrating multi-omic data. GEMs convert the reactions (related to metabolism, transcription, and translation) that occur in an organism to a mathematical formulation that can be modeled using optimization principles. A variety of genome-scale modeling methods used to interpret multiple omic data types, including genomics, transcriptomics, proteomics, metabolomics, and meta-omics, are reviewed. The ability to interpret omics in the context of biological systems has yielded important findings for human health, environmental biotechnology, bioenergy, and metabolic engineering. The authors find that concurrent with advancements in omic technologies, genome-scale modeling methods are also expanding to enable better interpretation of omic data. Therefore, continued synthesis of valuable knowledge, through the integration of omic data with GEMs, is expected.
Affiliation(s)
- Sanjeev Dahal
- Department of Chemical Engineering, Queen’s University, Kingston, Canada
| | | | - Hao Xu
- Department of Chemical Engineering, Queen’s University, Kingston, Canada
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA, USA
| | - Laurence Yang
- Department of Chemical Engineering, Queen’s University, Kingston, Canada
| |
|
72
|
Miga KH. Centromere studies in the era of 'telomere-to-telomere' genomics. Exp Cell Res 2020; 394:112127. [PMID: 32504677 DOI: 10.1016/j.yexcr.2020.112127] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Received: 01/08/2020] [Revised: 05/23/2020] [Accepted: 05/30/2020] [Indexed: 12/17/2022]
Abstract
We are entering into an exciting era of genomics where truly complete, high-quality assemblies of human chromosomes are available end-to-end, or from 'telomere-to-telomere' (T2T). This technological advance offers a new opportunity to include endogenous human centromeric regions in high-resolution, sequence-based studies. These emerging reference maps are expected to reveal a new functional landscape in the human genome, where centromere proteins, transcriptional regulation, and spatial organization can be examined with base-level resolution across different stages of development and disease. Such studies will depend on innovative assembly methods of extremely long tandem repeats (ETRs), or satellite DNAs, paired with the development of new, orthogonal validation methods to ensure accuracy and completeness. This review reflects the progress in centromere genomics, credited by recent advancements in long-read sequencing and assembly methods. In doing so, I will discuss the challenges that remain and the promise for a new period of scientific discovery for satellite DNA biology and centromere function.
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, 95064, USA.
| |
|
73
|
Abstract
Cells are the building blocks of life, from single-celled microbes through to multi-cellular organisms. To understand a multitude of biological processes we need to understand how cells behave, how they interact with each other and how they respond to their environment. The use of new methodologies is changing the way we study cells allowing us to study them on minute scales and in unprecedented detail. These same methods are allowing researchers to begin to sample the vast diversity of microbes that dominate natural environments. The aim of this special issue is to bring together research and perspectives on the application of new approaches to understand the biological properties of cells, including how they interact with other biological entities. This article is part of a discussion meeting issue 'Single cell ecology'.
Affiliation(s)
- Thomas A Richards
- Biosciences and Living Systems Institute, University of Exeter, Stocker Road, Exeter EX4 4QD, UK
| | - Ramon Massana
- Department of Marine Biology and Oceanography, Institut de Ciències del Mar (CSIC), 08003 Barcelona, Spain
| | - Stefano Pagliara
- Biosciences and Living Systems Institute, University of Exeter, Stocker Road, Exeter EX4 4QD, UK
| | - Neil Hall
- Earlham Institute, Norwich Research Park, Norwich, NR4 7UZ, UK
- School of Biological Sciences, University of East Anglia, Norwich, NR4 7TU, UK
| |
|