1
|
Yang K, Islas N, Jewell S, Jha A, Radens CM, Pleiss JA, Lynch KW, Barash Y, Choi PS. Machine learning-optimized targeted detection of alternative splicing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.20.614162. [PMID: 39386495 PMCID: PMC11463589 DOI: 10.1101/2024.09.20.614162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/12/2024]
Abstract
RNA-sequencing (RNA-seq) is widely adopted for transcriptome analysis but has inherent biases which hinder the comprehensive detection and quantification of alternative splicing. To address this, we present an efficient targeted RNA-seq method that greatly enriches for splicing-informative junction-spanning reads. Local Splicing Variation sequencing (LSV-seq) utilizes multiplexed reverse transcription from highly scalable pools of primers anchored near splicing events of interest. Primers are designed using Optimal Prime, a novel machine learning algorithm trained on the performance of thousands of primer sequences. In experimental benchmarks, LSV-seq achieves high on-target capture rates and concordance with RNA-seq, while requiring significantly lower sequencing depth. Leveraging deep learning splicing code predictions, we used LSV-seq to target events with low coverage in GTEx RNA-seq data and newly discover hundreds of tissue-specific splicing events. Our results demonstrate the ability of LSV-seq to quantify splicing of events of interest at high-throughput and with exceptional sensitivity.
Collapse
Affiliation(s)
- Kevin Yang
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
- Department of Pathology & Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Division of Cancer Pathobiology, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Nathaniel Islas
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, USA
| | - San Jewell
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
| | - Anupama Jha
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Caleb M. Radens
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
| | - Jeffrey A. Pleiss
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Kristen W. Lynch
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, PA, USA
| | - Yoseph Barash
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Peter S. Choi
- Department of Pathology & Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
- Division of Cancer Pathobiology, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| |
Collapse
|
2
|
Connell ML, Meyer DN, Haimbaugh A, Baker TR. Status of single-cell RNA sequencing for reproductive toxicology in zebrafish and the transcriptomic trade-off. CURRENT OPINION IN TOXICOLOGY 2024; 38:100463. [PMID: 38846809 PMCID: PMC11155726 DOI: 10.1016/j.cotox.2024.100463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2024]
Abstract
The utilization of transcriptomic studies identifying profiles of gene expression, especially in toxicogenomics, has catapulted next-generation sequencing to the forefront of reproductive toxicology. An innovative yet underutilized RNA sequencing technique emerging into this field is single-cell RNA sequencing (scRNA-seq), which provides sequencing at the individual cellular level of gonad tissue. ScRNA-seq provides a novel and unique perspective for identifying distinct cellular profiles, including identification of rare cell subtypes. The specificity of scRNA-seq is a powerful tool for reproductive toxicity research, especially for translational animal models including zebrafish. Studies to date not only have focused on 'tissue atlassing' or characterizing what cell types make up different tissues but have also begun to include toxicant exposure as a factor that this review aims to explore. Future scRNA-seq studies will contribute to understanding exposure-induced outcomes; however, the trade-offs with traditional methods need to be considered.
Collapse
Affiliation(s)
- Mackenzie L Connell
- University of Florida, Department of Environmental & Global Health, 2173 Mowry Rd, Gainesville, FL 32611, USA
| | - Danielle N Meyer
- University of Florida, Department of Environmental & Global Health, 2173 Mowry Rd, Gainesville, FL 32611, USA
| | - Alex Haimbaugh
- University of Florida, Department of Environmental & Global Health, 2173 Mowry Rd, Gainesville, FL 32611, USA
| | - Tracie R Baker
- University of Florida, Department of Environmental & Global Health, 2173 Mowry Rd, Gainesville, FL 32611, USA
| |
Collapse
|
3
|
Hegarty C, Neto N, Cahill P, Floudas A. Computational approaches in rheumatic diseases - Deciphering complex spatio-temporal cell interactions. Comput Struct Biotechnol J 2023; 21:4009-4020. [PMID: 37649712 PMCID: PMC10462794 DOI: 10.1016/j.csbj.2023.08.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 08/04/2023] [Accepted: 08/04/2023] [Indexed: 09/01/2023] Open
Abstract
Inflammatory arthritis, including rheumatoid (RA), and psoriatic (PsA) arthritis, are clinically and immunologically heterogeneous diseases with no identified cure. Chronic inflammation of the synovial tissue ushers loss of function of the joint that severely impacts the patient's quality of life, eventually leading to disability and life-threatening comorbidities. The pathogenesis of synovial inflammation is the consequence of compounded immune and stromal cell interactions influenced by genetic and environmental factors. Deciphering the complexity of the synovial cellular landscape has accelerated primarily due to the utilisation of bulk and single cell RNA sequencing. Particularly the capacity to generate cell-cell interaction networks could reveal evidence of previously unappreciated processes leading to disease. However, there is currently a lack of universal nomenclature as a result of varied experimental and technological approaches that discombobulates the study of synovial inflammation. While spatial transcriptomic analysis that combines anatomical information with transcriptomic data of synovial tissue biopsies promises to provide more insights into disease pathogenesis, in vitro functional assays with single-cell resolution will be required to validate current bioinformatic applications. In order to provide a comprehensive approach and translate experimental data to clinical practice, a combination of clinical and molecular data with machine learning has the potential to enhance patient stratification and identify individuals at risk of arthritis that would benefit from early therapeutic intervention. This review aims to provide a comprehensive understanding of the effect of computational approaches in deciphering synovial inflammation pathogenesis and discuss the impact that further experimental and novel computational tools may have on therapeutic target identification and drug development.
Collapse
Affiliation(s)
- Ciara Hegarty
- Translational Immunology lab, School of Biotechnology, Dublin City University, Dublin, Ireland
| | - Nuno Neto
- Trinity Centre for Biomedical Engineering, Trinity College Dublin, Ireland
| | - Paul Cahill
- Vascular Biology lab, School of Biotechnology, Dublin City University, Dublin, Ireland
| | - Achilleas Floudas
- Translational Immunology lab, School of Biotechnology, Dublin City University, Dublin, Ireland
| |
Collapse
|
4
|
Tang W, Jørgensen ACS, Marguerat S, Thomas P, Shahrezaei V. Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics. Bioinformatics 2023; 39:btad395. [PMID: 37354494 PMCID: PMC10318389 DOI: 10.1093/bioinformatics/btad395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 05/18/2023] [Accepted: 06/22/2023] [Indexed: 06/26/2023] Open
Abstract
MOTIVATION Gene expression is characterized by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data are prone to technical variability, including low and variable capture efficiency of transcripts from individual cells. RESULTS Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and nonallele-specific scRNA-seq data. AVAILABILITY AND IMPLEMENTATION The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC, respectively.
Collapse
Affiliation(s)
- Wenhao Tang
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
| | - Andreas Christ Sølvsten Jørgensen
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
- I-X Centre for AI in Science, Imperial College London, White City Campus, London W12 0BZ, United Kingdom
| | - Samuel Marguerat
- MRC London Institute of Medical Sciences (LMS), London W12 0NN, United Kingdom
- Institute of Clinical Sciences (ICS), Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom
| | - Philipp Thomas
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
| | - Vahid Shahrezaei
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
| |
Collapse
|
5
|
Zeng Q, Mousa M, Nadukkandy AS, Franssens L, Alnaqbi H, Alshamsi FY, Safar HA, Carmeliet P. Understanding tumour endothelial cell heterogeneity and function from single-cell omics. Nat Rev Cancer 2023:10.1038/s41568-023-00591-5. [PMID: 37349410 DOI: 10.1038/s41568-023-00591-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/22/2023] [Indexed: 06/24/2023]
Abstract
Anti-angiogenic therapies (AATs) are used to treat different types of cancers. However, their success is limited owing to insufficient efficacy and resistance. Recently, single-cell omics studies of tumour endothelial cells (TECs) have provided new mechanistic insight. Here, we overview the heterogeneity of human TECs of all tumour types studied to date, at the single-cell level. Notably, most human tumour types contain varying numbers but only a small population of angiogenic TECs, the presumed targets of AATs, possibly contributing to the limited efficacy of and resistance to AATs. In general, TECs are heterogeneous within and across all tumour types, but comparing TEC phenotypes across tumours is currently challenging, owing to the lack of a uniform nomenclature for endothelial cells and consistent single-cell analysis protocols, urgently raising the need for a more consistent approach. Nonetheless, across most tumour types, universal TEC markers (ACKR1, PLVAP and IGFBP3) can be identified. Besides angiogenesis, biological processes such as immunomodulation and extracellular matrix organization are among the most commonly predicted enriched signatures of TECs across different tumour types. Although angiogenesis and extracellular matrix targets have been considered for AAT (without the hoped success), the immunomodulatory properties of TECs have not been fully considered as a novel anticancer therapeutic approach. Therefore, we also discuss progress, limitations, solutions and novel targets for AAT development.
Collapse
Affiliation(s)
- Qun Zeng
- Laboratory of Angiogenesis and Vascular Metabolism, Department of Oncology, KU Leuven and Center for Cancer Biology, VIB, Leuven, Belgium
| | - Mira Mousa
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, UAE
| | - Aisha Shigna Nadukkandy
- Laboratory of Angiogenesis and Vascular Metabolism, Department of Oncology, KU Leuven and Center for Cancer Biology, VIB, Leuven, Belgium
- Laboratory of Angiogenesis and Vascular Heterogeneity, Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Lies Franssens
- Laboratory of Angiogenesis and Vascular Metabolism, Department of Oncology, KU Leuven and Center for Cancer Biology, VIB, Leuven, Belgium
| | - Halima Alnaqbi
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, UAE
| | - Fatima Yousif Alshamsi
- Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, UAE
| | - Habiba Al Safar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, UAE.
- Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, UAE.
| | - Peter Carmeliet
- Laboratory of Angiogenesis and Vascular Metabolism, Department of Oncology, KU Leuven and Center for Cancer Biology, VIB, Leuven, Belgium.
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, UAE.
- Laboratory of Angiogenesis and Vascular Heterogeneity, Department of Biomedicine, Aarhus University, Aarhus, Denmark.
| |
Collapse
|
6
|
Stokes T, Cen HH, Kapranov P, Gallagher IJ, Pitsillides AA, Volmar C, Kraus WE, Johnson JD, Phillips SM, Wahlestedt C, Timmons JA. Transcriptomics for Clinical and Experimental Biology Research: Hang on a Seq. ADVANCED GENETICS (HOBOKEN, N.J.) 2023; 4:2200024. [PMID: 37288167 PMCID: PMC10242409 DOI: 10.1002/ggn2.202200024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Indexed: 06/09/2023]
Abstract
Sequencing the human genome empowers translational medicine, facilitating transcriptome-wide molecular diagnosis, pathway biology, and drug repositioning. Initially, microarrays are used to study the bulk transcriptome; but now short-read RNA sequencing (RNA-seq) predominates. Positioned as a superior technology, that makes the discovery of novel transcripts routine, most RNA-seq analyses are in fact modeled on the known transcriptome. Limitations of the RNA-seq methodology have emerged, while the design of, and the analysis strategies applied to, arrays have matured. An equitable comparison between these technologies is provided, highlighting advantages that modern arrays hold over RNA-seq. Array protocols more accurately quantify constitutively expressed protein coding genes across tissue replicates, and are more reliable for studying lower expressed genes. Arrays reveal long noncoding RNAs (lncRNA) are neither sparsely nor lower expressed than protein coding genes. Heterogeneous coverage of constitutively expressed genes observed with RNA-seq, undermines the validity and reproducibility of pathway analyses. The factors driving these observations, many of which are relevant to long-read or single-cell sequencing are discussed. As proposed herein, a reappreciation of bulk transcriptomic methods is required, including wider use of the modern high-density array data-to urgently revise existing anatomical RNA reference atlases and assist with more accurate study of lncRNAs.
Collapse
Affiliation(s)
- Tanner Stokes
- Faculty of ScienceMcMaster UniversityHamiltonL8S 4L8Canada
| | - Haoning Howard Cen
- Life Sciences InstituteUniversity of British ColumbiaVancouverV6T 1Z3Canada
| | | | - Iain J Gallagher
- School of Applied SciencesEdinburgh Napier UniversityEdinburghEH11 4BNUK
| | | | | | | | - James D. Johnson
- Life Sciences InstituteUniversity of British ColumbiaVancouverV6T 1Z3Canada
| | | | | | - James A. Timmons
- Miller School of MedicineUniversity of MiamiMiamiFL33136USA
- William Harvey Research InstituteQueen Mary University LondonLondonEC1M 6BQUK
- Augur Precision Medicine LTDStirlingFK9 5NFUK
| |
Collapse
|
7
|
Xiang Y, Xiang P, Zhang L, Li Y, Zhang J. A narrative review for platelets and their RNAs in cancers: New concepts and clinical perspectives. Medicine (Baltimore) 2022; 101:e32539. [PMID: 36596034 PMCID: PMC9803462 DOI: 10.1097/md.0000000000032539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Recent years have witnessed a growing body of evidence suggesting that platelets are involved in several stages of the metastatic process via direct or indirect interactions with cancer cells, contributing to the progression of neoplastic malignancies. Cancer cells can dynamically exchange components with platelets in and out of blood vessels, and directly phagocytose platelets to hijack their proteome, transcriptome, and secretome, or be remotely regulated by metabolites or microparticles released by platelets, resulting in phenotypic, genetic, and functional modifications. Moreover, platelet interactions with stromal and immune cells in the tumor microenvironment lead to alterations in their components, including the ribonucleic acid (RNA) profile, and complicate the impact of platelets on cancers. A deeper understanding of the roles of platelets and their RNAs in cancer will contribute to the development of anticancer strategies and the optimization of clinical management. Encouragingly, advances in high-throughput sequencing, bioinformatics data analysis, and machine learning have allowed scientists to explore the potential of platelet RNAs for cancer diagnosis, prognosis, and guiding treatment. However, the clinical application of this technique remains controversial and requires larger, multicenter studies with standardized protocols. Here, we integrate the latest evidence to provide a broader insight into the role of platelets in cancer progression and management, and propose standardized recommendations for the clinical utility of platelet RNAs to facilitate translation and benefit patients.
Collapse
Affiliation(s)
- Yunhui Xiang
- Department of Laboratory Medicine and Sichuan Provincial Key Laboratory for Human Disease Gene Study, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Pinpin Xiang
- Department of Laboratory Medicine, Xiping Community Health Service Center of Longquanyi District Chengdu City, Chengdu, China
| | - Liuyun Zhang
- Department of Laboratory Medicine and Sichuan Provincial Key Laboratory for Human Disease Gene Study, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Yanying Li
- Department of Laboratory Medicine and Sichuan Provincial Key Laboratory for Human Disease Gene Study, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Juan Zhang
- Department of Laboratory Medicine and Sichuan Provincial Key Laboratory for Human Disease Gene Study, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
- * Correspondence: Juan Zhang, Department of Laboratory Medicine and Sichuan Provincial Key Laboratory for Human Disease Gene Study, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, 32# West Second Section, First Ring Road, Qingyang District, Chengdu City, Sichuan Province 610072, China (e-mail: )
| |
Collapse
|
8
|
Haimbaugh A, Meyer D, Akemann C, Gurdziel K, Baker TR. Comparative Toxicotranscriptomics of Single Cell RNA-Seq and Conventional RNA-Seq in TCDD-Exposed Testicular Tissue. FRONTIERS IN TOXICOLOGY 2022; 4:821116. [PMID: 35615540 PMCID: PMC9126299 DOI: 10.3389/ftox.2022.821116] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 03/03/2022] [Indexed: 12/18/2022] Open
Abstract
In this report, we compare the outcomes and limitations of two methods of transcriptomic inquiry on adult zebrafish testes exposed to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) during sexual differentiation: conventional or bulk RNA-seq (bulk-seq) and single cell RNA sequencing (scRNA-seq) data. scRNA-seq has emerged as a valuable tool for uncovering cell type-specific transcriptome dynamics which exist in heterogeneous tissue. Our lab previously showed the toxicological value of the scRNA-seq pipeline to characterize the sequelae of TCDD exposure in testes, demonstrating that loss of spermatids and spermatozoa, but not other cell types, contributed to the pathology of infertility in adult male zebrafish exposed during sexual differentiation. To investigate the potential for technical artifacts in scRNA-seq such as cell dissociation effects and reduced transcriptome coverage, we compared bulk-sequenced and scRNA-seq-paired samples from control and TCDD-exposed samples to understand what is gained and lost in scRNA-seq vs bulk-seq, both transcriptomically and toxicologically. We hypothesized that the testes may be sensitive to tissue disruption as they contain multiple cell types under constant division and/or maturation, and that TCDD exposure may mediate the extent of sensitivity. Thus, we sought to understand the extent to which this dissociation impacts the toxicological value of data returned from scRNA-seq. We confirm that the required dissociation of individual cells from intact tissue has a significant impact on gene expression, affecting gene pathways with the potential to confound toxicogenomics studies on exposures if findings are not well-controlled and well-situated in context. Additionally, a common scRNA-seq method using cDNA amplified from the 3' end of mRNA under-detects low-expressing transcripts including transcription factors. We confirm this, and show TCDD-related genes may be overlooked by scRNA-seq, however, this under-detection effect is not mediated by TCDD exposure. Even so, scRNA-seq generally extracted toxicologically relevant information better than the bulk-seq method in the present study. This report aims to inform future experimental design for transcriptomic investigation in the growing field of toxicogenomics by demonstrating the differential information extracted from sequencing cells-despite being from the same tissue and exposure scheme-is influenced by the specific protocol used, with implications for the interpretation of exposure-induced risk.
Collapse
Affiliation(s)
- Alex Haimbaugh
- Department of Pharmacology, School of Medicine, Wayne State University, Detroit, MI, United States
| | - Danielle Meyer
- Department of Pharmacology, School of Medicine, Wayne State University, Detroit, MI, United States
- Institute of Environmental Health Sciences, Wayne State University, Detroit, MI, United States
- Department of Environmental and Global Health, University of Florida, Gainesville, FL, United States
| | - Camille Akemann
- Department of Pharmacology, School of Medicine, Wayne State University, Detroit, MI, United States
| | - Katherine Gurdziel
- Genome Sciences Core, Office of the Vice President for Research, Wayne State University, Detroit, MI, United States
| | - Tracie R. Baker
- Department of Pharmacology, School of Medicine, Wayne State University, Detroit, MI, United States
- Institute of Environmental Health Sciences, Wayne State University, Detroit, MI, United States
- Department of Environmental and Global Health, University of Florida, Gainesville, FL, United States
| |
Collapse
|