1
|
Tiberi S, Meili J, Cai P, Soneson C, He D, Sarkar H, Avalos-Pacheco A, Patro R, Robinson MD. DifferentialRegulation: a Bayesian hierarchical approach to identify differentially regulated genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.17.553679. [PMID: 37645841 PMCID: PMC10462127 DOI: 10.1101/2023.08.17.553679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Motivation Although transcriptomics data is typically used to analyse mature spliced mRNA, recent attention has focused on jointly investigating spliced and unspliced (or precursor-) mRNA, which can be used to study gene regulation and changes in gene expression production. Nonetheless, most methods for spliced/unspliced inference (such as RNA velocity tools) focus on individual samples, and rarely allow comparisons between groups of samples (e.g., healthy vs. diseased). Furthermore, this kind of inference is challenging, because spliced and unspliced mRNA abundance is characterized by a high degree of quantification uncertainty, due to the prevalence of multi-mapping reads, i.e., reads compatible with multiple transcripts (or genes), and/or with both their spliced and unspliced versions. Results Here, we present DifferentialRegulation, a Bayesian hierarchical method to discover changes between experimental conditions with respect to the relative abundance of unspliced mRNA (over the total mRNA). We model the quantification uncertainty via a latent variable approach, where reads are allocated to their gene/transcript of origin, and to the respective splice version. We designed several benchmarks where our approach shows good performance, in terms of sensitivity and error control, versus state-of-the-art competitors. Importantly, our tool is flexible, and works with both bulk and single-cell RNA-sequencing data. Availability and implementation DifferentialRegulation is distributed as a Bioconductor R package.
Collapse
Affiliation(s)
- Simone Tiberi
- Department of Statistical Sciences, University of Bologna, Bologna, Italy
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Joël Meili
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Peiying Cai
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Charlotte Soneson
- Computational Biology Platform, Friedrich Miescher Institute for Biomedical Research and SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Dongze He
- Department of Cell Biology and Molecular Genetics, University of Maryland, MD, USA
- Center for Bioinformatics and Computational Biology, University of Maryland, MD, USA
| | - Hirak Sarkar
- Department of Computer Science, Princeton University, NJ, USA
| | - Alejandra Avalos-Pacheco
- Research Unit of Applied Statistics, TU Wien, Vienna, Austria
- Harvard-MIT Center for Regulatory Science, Harvard Medical School, Boston, MA, USA
| | - Rob Patro
- Department of Computer Science, University of Maryland, MD, USA
- Center for Bioinformatics and Computational Biology, University of Maryland, MD, USA
| | - Mark D Robinson
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| |
Collapse
|
2
|
Wu EY, Singh NP, Choi K, Zakeri M, Vincent M, Churchill GA, Ackert-Bicknell CL, Patro R, Love MI. SEESAW: detecting isoform-level allelic imbalance accounting for inferential uncertainty. Genome Biol 2023; 24:165. [PMID: 37438847 PMCID: PMC10337143 DOI: 10.1186/s13059-023-03003-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 06/29/2023] [Indexed: 07/14/2023] Open
Abstract
Detecting allelic imbalance at the isoform level requires accounting for inferential uncertainty, caused by multi-mapping of RNA-seq reads. Our proposed method, SEESAW, uses Salmon and Swish to offer analysis at various levels of resolution, including gene, isoform, and aggregating isoforms to groups by transcription start site. The aggregation strategies strengthen the signal for transcripts with high uncertainty. The SEESAW suite of methods is shown to have higher power than other allelic imbalance methods when there is isoform-level allelic imbalance. We also introduce a new test for detecting imbalance that varies across a covariate, such as time.
Collapse
Affiliation(s)
- Euphy Y Wu
- Department of Biostatistics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA
| | - Noor P Singh
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | | | - Mohsen Zakeri
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | | | | | - Cheryl L Ackert-Bicknell
- Department of Orthopedics, School of Medicine, University of Colorado, Anschutz Campus, Aurora, CO, USA
| | - Rob Patro
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Michael I Love
- Department of Biostatistics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA.
- Department of Genetics, University of North Carolina-Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
3
|
Borozan L, Rojas Ringeling F, Kao SY, Nikonova E, Monteagudo-Mesas P, Matijević D, Spletter ML, Canzar S. Counting pseudoalignments to novel splicing events. Bioinformatics 2023; 39:btad419. [PMID: 37432342 PMCID: PMC10348833 DOI: 10.1093/bioinformatics/btad419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 04/21/2023] [Accepted: 07/10/2023] [Indexed: 07/12/2023] Open
Abstract
MOTIVATION Alternative splicing (AS) of introns from pre-mRNA produces diverse sets of transcripts across cell types and tissues, but is also dysregulated in many diseases. Alignment-free computational methods have greatly accelerated the quantification of mRNA transcripts from short RNA-seq reads, but they inherently rely on a catalog of known transcripts and might miss novel, disease-specific splicing events. By contrast, alignment of reads to the genome can effectively identify novel exonic segments and introns. Event-based methods then count how many reads align to predefined features. However, an alignment is more expensive to compute and constitutes a bottleneck in many AS analysis methods. RESULTS Here, we propose fortuna, a method that guesses novel combinations of annotated splice sites to create transcript fragments. It then pseudoaligns reads to fragments using kallisto and efficiently derives counts of the most elementary splicing units from kallisto's equivalence classes. These counts can be directly used for AS analysis or summarized to larger units as used by other widely applied methods. In experiments on synthetic and real data, fortuna was around 7× faster than traditional align and count approaches, and was able to analyze almost 300 million reads in just 15 min when using four threads. It mapped reads containing mismatches more accurately across novel junctions and found more reads supporting aberrant splicing events in patients with autism spectrum disorder than existing methods. We further used fortuna to identify novel, tissue-specific splicing events in Drosophila. AVAILABILITY AND IMPLEMENTATION fortuna source code is available at https://github.com/canzarlab/fortuna.
Collapse
Affiliation(s)
- Luka Borozan
- Department of Mathematics, Josip Juraj Strossmayer University of Osijek, Osijek 31000, Croatia
| | - Francisca Rojas Ringeling
- Gene Center, Ludwig-Maximilians-Universität München, Munich 81377, Germany
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, United States
| | - Shao-Yen Kao
- Biomedical Center, Department of Physiological Chemistry, Ludwig-Maximilians-Universität München, Planegg-Martinsried 82152, Germany
| | - Elena Nikonova
- Biomedical Center, Department of Physiological Chemistry, Ludwig-Maximilians-Universität München, Planegg-Martinsried 82152, Germany
| | | | - Domagoj Matijević
- Department of Mathematics, Josip Juraj Strossmayer University of Osijek, Osijek 31000, Croatia
| | - Maria L Spletter
- Biomedical Center, Department of Physiological Chemistry, Ludwig-Maximilians-Universität München, Planegg-Martinsried 82152, Germany
- School of Science and Engineering, Division of Biological & Biomedical Systems, University of Missouri Kansas City, Kansas City, MO 64110, United States
| | - Stefan Canzar
- Gene Center, Ludwig-Maximilians-Universität München, Munich 81377, Germany
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, United States
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA 16802, United States
| |
Collapse
|
4
|
Ayers KL, Eggers S, Rollo BN, Smith KR, Davidson NM, Siddall NA, Zhao L, Bowles J, Weiss K, Zanni G, Burglen L, Ben-Shachar S, Rosensaft J, Raas-Rothschild A, Jørgensen A, Schittenhelm RB, Huang C, Robevska G, van den Bergen J, Casagranda F, Cyza J, Pachernegg S, Wright DK, Bahlo M, Oshlack A, O'Brien TJ, Kwan P, Koopman P, Hime GR, Girard N, Hoffmann C, Shilon Y, Zung A, Bertini E, Milh M, Ben Rhouma B, Belguith N, Bashamboo A, McElreavey K, Banne E, Weintrob N, BenZeev B, Sinclair AH. Variants in SART3 cause a spliceosomopathy characterised by failure of testis development and neuronal defects. Nat Commun 2023; 14:3403. [PMID: 37296101 PMCID: PMC10256788 DOI: 10.1038/s41467-023-39040-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 05/26/2023] [Indexed: 06/12/2023] Open
Abstract
Squamous cell carcinoma antigen recognized by T cells 3 (SART3) is an RNA-binding protein with numerous biological functions including recycling small nuclear RNAs to the spliceosome. Here, we identify recessive variants in SART3 in nine individuals presenting with intellectual disability, global developmental delay and a subset of brain anomalies, together with gonadal dysgenesis in 46,XY individuals. Knockdown of the Drosophila orthologue of SART3 reveals a conserved role in testicular and neuronal development. Human induced pluripotent stem cells carrying patient variants in SART3 show disruption to multiple signalling pathways, upregulation of spliceosome components and demonstrate aberrant gonadal and neuronal differentiation in vitro. Collectively, these findings suggest that bi-allelic SART3 variants underlie a spliceosomopathy which we tentatively propose be termed INDYGON syndrome (Intellectual disability, Neurodevelopmental defects and Developmental delay with 46,XY GONadal dysgenesis). Our findings will enable additional diagnoses and improved outcomes for individuals born with this condition.
Collapse
Affiliation(s)
- Katie L Ayers
- The Murdoch Children's Research Institute, Melbourne, Australia.
- Department of Paediatrics, The University of Melbourne, Melbourne, Australia.
| | - Stefanie Eggers
- The Victorian Clinical Genetics Services, Melbourne, Australia
| | - Ben N Rollo
- Department of Neuroscience, Central Clinical School, Monash University, Alfred Centre, Melbourne, Australia
| | - Katherine R Smith
- Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
| | - Nadia M Davidson
- Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
- School of BioSciences, Faculty of Science, University of Melbourne, Melbourne, Australia
- Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne, Australia
| | - Nicole A Siddall
- Department of Anatomy and Physiology, The University of Melbourne, Melbourne, Australia
| | - Liang Zhao
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Josephine Bowles
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
- School of Biomedical Sciences, The University of Queensland, Brisbane, QLD, Australia
| | - Karin Weiss
- Genetics Institute, Rambam Health Care Campus, Rappaport Faculty of Medicine, Institute of Technology, Haifa, Israel
| | - Ginevra Zanni
- Unit of Muscular and Neurodegenerative Disorders and Unit of Developmental Neurology, Department of Neurosciences, Bambino Gesù Children's Hospital, IRCCS, Rome, Italy
| | - Lydie Burglen
- Centre de Référence des Malformations et Maladies Congénitales du Cervelet, Et Laboratoire de Neurogénétique Moléculaire, Département de Génétique et Embryologie Médicale, APHP. Sorbonne Université, Hôpital Trousseau, Paris, France
- Developmental Brain Disorders Laboratory, Imagine Institute, INSERM UMR 1163, Paris, France
| | - Shay Ben-Shachar
- Genetic Institute, Tel Aviv Sourasky Medical Center, Tel Aviv, Israel
| | - Jenny Rosensaft
- Genetics Institute, Kaplan Medical Center, Hebrew University Hadassah Medical School, Rehovot, 76100, Israel
| | - Annick Raas-Rothschild
- Edmond and Lily Safra Children's Hospital, Chaim Sheba Medical Center, Ramat Gan, Israel
- Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Anne Jørgensen
- Department of Growth and Reproduction, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Ralf B Schittenhelm
- Monash Proteomics and Metabolomics Facility, Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, Monash University, Clayton, Australia
| | - Cheng Huang
- Monash Proteomics and Metabolomics Facility, Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, Monash University, Clayton, Australia
| | | | | | - Franca Casagranda
- Department of Anatomy and Physiology, The University of Melbourne, Melbourne, Australia
| | - Justyna Cyza
- The Murdoch Children's Research Institute, Melbourne, Australia
| | - Svenja Pachernegg
- The Murdoch Children's Research Institute, Melbourne, Australia
- Department of Paediatrics, The University of Melbourne, Melbourne, Australia
| | - David K Wright
- Department of Neuroscience, Central Clinical School, Monash University, Alfred Centre, Melbourne, Australia
| | - Melanie Bahlo
- Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
- Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne, Australia
| | - Alicia Oshlack
- The Peter MacCallum Cancer Centre, Melbourne, Australia
- School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
| | - Terrence J O'Brien
- Department of Neuroscience, Central Clinical School, Monash University, Alfred Centre, Melbourne, Australia
- Department of Medicine, The Royal Melbourne Hospital, The University of Melbourne, Melbourne, Australia
| | - Patrick Kwan
- Department of Neuroscience, Central Clinical School, Monash University, Alfred Centre, Melbourne, Australia
- Department of Medicine, The Royal Melbourne Hospital, The University of Melbourne, Melbourne, Australia
| | - Peter Koopman
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Gary R Hime
- Department of Anatomy and Physiology, The University of Melbourne, Melbourne, Australia
| | - Nadine Girard
- Aix-Marseille Université, APHM. Department of Pediatric Neurology, Timone Hospital, Marseille, France
| | - Chen Hoffmann
- Radiology Department, Sheba medical Centre, Tel Aviv, Israel
| | - Yuval Shilon
- Kaplan Medical Center, Hebrew University Hadassah Medical School, Rehovot, 76100, Israel
| | - Amnon Zung
- Pediatrics Department, Kaplan Medical Center, Rehovot, 76100, Israel
- Faculty of Medicine, Hebrew University of Jerusalem, Hadassah Medical School, Jerusalem, Israel
| | - Enrico Bertini
- Unit of Muscular and Neurodegenerative Disorders and Unit of Developmental Neurology, Department of Neurosciences, Bambino Gesù Children's Hospital, IRCCS, Rome, Italy
| | - Mathieu Milh
- Aix-Marseille Université, APHM. Department of Pediatric Neurology, Timone Hospital, Marseille, France
| | - Bochra Ben Rhouma
- Higher Institute of Nursing Sciences of Gabes, University of Gabes, Gabes, Tunisia
- Laboratory of Human Molecular Genetics, Faculty of Medicine of Sfax, Sfax University, Sfax, Tunisia
| | - Neila Belguith
- Laboratory of Human Molecular Genetics, Faculty of Medicine of Sfax, Sfax University, Sfax, Tunisia
- Department of Congenital and Hereditary Diseases, Charles Nicolle Hospital, Tunis, Tunisia
| | - Anu Bashamboo
- Institut Pasteur, Université de Paris, CNRS UMR3738, Human Developmental Genetics, 75015, Paris, France
| | - Kenneth McElreavey
- Institut Pasteur, Université de Paris, CNRS UMR3738, Human Developmental Genetics, 75015, Paris, France
| | - Ehud Banne
- Genetics Institute, Kaplan Medical Center, Hebrew University Hadassah Medical School, Rehovot, 76100, Israel
- The Rina Mor Genetic Institute, Wolfson Medical Center, Holon, 58100, Israel
| | - Naomi Weintrob
- Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
- Pediatric Endocrinology Unit, Dana-Dwek Children's Hospital, Tel Aviv Medical Center, Tel Aviv, Israel
| | | | - Andrew H Sinclair
- The Murdoch Children's Research Institute, Melbourne, Australia
- Department of Paediatrics, The University of Melbourne, Melbourne, Australia
| |
Collapse
|