1
|
Anczukow O, Allain FHT, Angarola BL, Black DL, Brooks AN, Cheng C, Conesa A, Crosse EI, Eyras E, Guccione E, Lu SX, Neugebauer KM, Sehgal P, Song X, Tothova Z, Valcárcel J, Weeks KM, Yeo GW, Thomas-Tikhonenko A. Steering research on mRNA splicing in cancer towards clinical translation. Nat Rev Cancer 2024:10.1038/s41568-024-00750-2. [PMID: 39384951 DOI: 10.1038/s41568-024-00750-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/27/2024] [Indexed: 10/11/2024]
Abstract
Splicing factors are affected by recurrent somatic mutations and copy number variations in several types of haematologic and solid malignancies, which is often seen as prima facie evidence that splicing aberrations can drive cancer initiation and progression. However, numerous spliceosome components also 'moonlight' in DNA repair and other cellular processes, making their precise role in cancer difficult to pinpoint. Still, few would deny that dysregulated mRNA splicing is a pervasive feature of most cancers. Correctly interpreting these molecular fingerprints can reveal novel tumour vulnerabilities and untapped therapeutic opportunities. Yet multiple technological challenges, lingering misconceptions, and outstanding questions hinder clinical translation. To start with, the general landscape of splicing aberrations in cancer is not well defined, due to limitations of short-read RNA sequencing not adept at resolving complete mRNA isoforms, as well as the shallow read depth inherent in long-read RNA-sequencing, especially at single-cell level. Although individual cancer-associated isoforms are known to contribute to cancer progression, widespread splicing alterations could be an equally important and, perhaps, more readily actionable feature of human cancers. This is to say that in addition to 'repairing' mis-spliced transcripts, possible therapeutic avenues include exacerbating splicing aberration with small-molecule spliceosome inhibitors, targeting recurrent splicing aberrations with synthetic lethal approaches, and training the immune system to recognize splicing-derived neoantigens.
Collapse
Affiliation(s)
- Olga Anczukow
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| | - Frédéric H-T Allain
- Department of Biology, Eidgenössische Technische Hochschule (ETH), Zürich, Switzerland
| | | | - Douglas L Black
- Department of Microbiology, Immunology, and Molecular Genetics, University of California Los Angeles, Los Angeles, CA, USA
| | - Angela N Brooks
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Chonghui Cheng
- Department of Molecular and Human Genetics, Lester & Sue Breast Center, Baylor College of Medicine, Houston, TX, USA
| | - Ana Conesa
- Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Spain
| | - Edie I Crosse
- Basic Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Eduardo Eyras
- Shine-Dalgarno Centre for RNA Innovation, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Ernesto Guccione
- Department of Oncological Sciences, Mount Sinai School of Medicine, New York, NY, USA
| | - Sydney X Lu
- Department of Medicine, Stanford Medical School, Palo Alto, CA, USA
| | - Karla M Neugebauer
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, CT, USA
| | - Priyanka Sehgal
- Division of Cancer Pathobiology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Xiao Song
- Department of Neurology, Northwestern University, Chicago, IL, USA
| | - Zuzana Tothova
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Juan Valcárcel
- Centre for Genomic Regulation, Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
| | - Kevin M Weeks
- Department of Chemistry, University of North Carolina, Chapel Hill, NC, USA
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA, USA
| | - Andrei Thomas-Tikhonenko
- Division of Cancer Pathobiology, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- Department of Pathology & Laboratory Medicine, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
2
|
Kumari P, Kaur M, Dindhoria K, Ashford B, Amarasinghe SL, Thind AS. Advances in long-read single-cell transcriptomics. Hum Genet 2024; 143:1005-1020. [PMID: 38787419 PMCID: PMC11485027 DOI: 10.1007/s00439-024-02678-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Accepted: 05/07/2024] [Indexed: 05/25/2024]
Abstract
Long-read single-cell transcriptomics (scRNA-Seq) is revolutionizing the way we profile heterogeneity in disease. Traditional short-read scRNA-Seq methods are limited in their ability to provide complete transcript coverage, resolve isoforms, and identify novel transcripts. The scRNA-Seq protocols developed for long-read sequencing platforms overcome these limitations by enabling the characterization of full-length transcripts. Long-read scRNA-Seq techniques initially suffered from comparatively poor accuracy compared to short read scRNA-Seq. However, with improvements in accuracy, accessibility, and cost efficiency, long-reads are gaining popularity in the field of scRNA-Seq. This review details the advances in long-read scRNA-Seq, with an emphasis on library preparation protocols and downstream bioinformatics analysis tools.
Collapse
Affiliation(s)
- Pallawi Kumari
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Manmeet Kaur
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Kiran Dindhoria
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Bruce Ashford
- Illawarra Shoalhaven Local Health District (ISLHD), NSW Health, Wollongong, NSW, Australia
| | - Shanika L Amarasinghe
- Monash Biomedical Discovery Institute, Monash University, Clayton, VIC, 3800, Australia
- Walter and Eliza Hall Institute of Medical Research, 1G, Royal Parade, Parkville, VIC, 3025, Australia
| | - Amarinder Singh Thind
- Illawarra Shoalhaven Local Health District (ISLHD), NSW Health, Wollongong, NSW, Australia.
- The School of Chemistry and Molecular Bioscience (SCMB), University of Wollongong, Loftus St, Wollongong, NSW, 2500, Australia.
| |
Collapse
|
3
|
Overgaard CK, Jamy M, Radutoiu S, Burki F, Dueholm MKD. Benchmarking long-read sequencing strategies for obtaining ASV-resolved rRNA operons from environmental microeukaryotes. Mol Ecol Resour 2024; 24:e13991. [PMID: 38979877 DOI: 10.1111/1755-0998.13991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 05/31/2024] [Accepted: 06/26/2024] [Indexed: 07/10/2024]
Abstract
The use of short-read metabarcoding for classifying microeukaryotes is challenged by the lack of comprehensive 18S rRNA reference databases. While recent advances in high-throughput long-read sequencing provide the potential to greatly increase the phylogenetic coverage of these databases, the performance of different sequencing technologies and subsequent bioinformatics processing remain to be evaluated, primarily because of the absence of well-defined eukaryotic mock communities. To address this challenge, we created a eukaryotic rRNA operon clone-library and turned it into a precisely defined synthetic eukaryotic mock community. This mock community was then used to evaluate the performance of three long-read sequencing strategies (PacBio circular consensus sequencing and two Nanopore approaches using unique molecular identifiers) and three tools for resolving amplicons sequence variants (ASVs) (USEARCH, VSEARCH, and DADA2). We investigated the sensitivity of the sequencing techniques based on the number of detected mock taxa, and the accuracy of the different ASV-calling tools with a specific focus on the presence of chimera among the final rRNA operon ASVs. Based on our findings, we provide recommendations and best practice protocols for how to cost-effectively obtain essentially error-free rRNA operons in high-throughput. An agricultural soil sample was used to demonstrate that the sequencing and bioinformatic results from the mock community also translates to highly diverse natural samples, which enables us to identify previously undescribed microeukaryotic lineages.
Collapse
Affiliation(s)
| | - Mahwash Jamy
- Department of Organismal Biology (Systematic Biology), Uppsala University, Uppsala, Sweden
- Department of Aquatic Sciences and Assessment; Division of Microbial Ecology, Uppsala University, Uppsala, Sweden
| | - Simona Radutoiu
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Fabien Burki
- Department of Organismal Biology (Systematic Biology), Uppsala University, Uppsala, Sweden
| | - Morten Kam Dahl Dueholm
- Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| |
Collapse
|
4
|
Nesta A, Veiga DFT, Banchereau J, Anczukow O, Beck CR. Alternative splicing of transposable elements in human breast cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.26.615242. [PMID: 39386569 PMCID: PMC11463404 DOI: 10.1101/2024.09.26.615242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/12/2024]
Abstract
Transposable elements (TEs) drive genome evolution and can affect gene expression through diverse mechanisms. In breast cancer, disrupted regulation of TE sequences may facilitate tumor-specific transcriptomic alterations. We examine 142,514 full-length isoforms derived from long-read RNA sequencing (LR-seq) of 30 breast samples to investigate the effects of TEs on the breast cancer transcriptome. Approximately half of these isoforms contain TE sequences, and these contribute to half of the novel annotated splice junctions. We quantify splicing of these LR-seq derived isoforms in 1,135 breast tumors from The Cancer Genome Atlas (TCGA) and 1,329 healthy tissue samples from the Genotype-Tissue Expression (GTEx), and find 300 TE-overlapping tumor-specific splicing events. Some splicing events are enriched in specific breast cancer subtypes - for example, a TE-driven transcription start site upstream of ERBB2 in HER2+ tumors, and several TE-mediated splicing events are associated with patient survival and poor prognosis. The full-length sequences we capture with LR-seq reveal thousands of isoforms with signatures of RNA editing, including a novel isoform belonging to RHOA; a gene previously implicated in tumor progression. We utilize our full-length isoforms to discover polymorphic TE insertions that alter splicing and validate one of these events in breast cancer cell lines. Together, our results demonstrate the widespread effects of dysregulated TEs on breast cancer transcriptomes and highlight the advantages of long-read isoform sequencing for understanding TE biology. TE-derived isoforms may alter the expression of genes important in cancer and can potentially be used as novel, disease-specific therapeutic targets or biomarkers.
Collapse
Affiliation(s)
- Alex Nesta
- The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT 06030, USA
| | - Diogo F. T. Veiga
- Department of Translational Medicine, School of Medical Sciences, University of Campinas (UNICAMP), Campinas, SP 13083, Brazil
| | - Jacques Banchereau
- The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
- Immunoledge LLC, Montclair, NJ, 07042, USA
| | - Olga Anczukow
- The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT 06030, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
| | - Christine R. Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT 06030, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
| |
Collapse
|
5
|
Yu S, Si Y, Yu J, Jiang C, Cheng F, Xu M, Fan Z, Liu F, Liu C, Wang Y, Wang N, Liu C, Bi C, Sun H. SNRPB2 promotes triple-negative breast cancer progression by controlling alternative splicing of MDM4 pre-mRNA. Cancer Sci 2024. [PMID: 39329452 DOI: 10.1111/cas.16356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 09/03/2024] [Accepted: 09/09/2024] [Indexed: 09/28/2024] Open
Abstract
Alternative splicing generates cancer-specific transcripts and is now recognized as a hallmark of cancer. However, the critical oncogenic spliceosome-related proteins involved in triple-negative breast cancer (TNBC) remain elusive. Here, we explored the expression pattern of spliceosome-related proteins in TNBC, non-TNBC, and normal breast tissues from The Cancer Genome Atlas breast cancer (TCGA-BRCA) cohort, revealing higher expression of nearly half of spliceosome-related proteins in TNBC than their counterparts. Among these TNBC-specific spliceosome-related proteins, the expression of SNRPB2 was associated with poor prognosis in patients with TNBC. In TNBC cells, the knockdown of SNRPB2 strongly suppressed cell proliferation and invasion and induced cell cycle arrest. Mechanistically, transcriptome data showed that SNRPB2 knockdown inactivated E2F1 signaling, which regulated the cell cycle. We further validated the downregulation of several cell cycle genes in SNRPB2 knockdown cells. Moreover, the analysis showed that SNRPB2 knockdown triggered the alteration of many alternative splicing events, most of which were skipping of exon. In TNBC cells, it was found that SNRPB2 knockdown led to the skipping of exon 6 in MDM4 pre-mRNA, generating MDM4-S transcript and downregulating MDM4 protein expression. More importantly, downregulation of MDM4 decreased retinoblastoma 1 (Rb1) protein expression, which is a target of MDM4 and a regulator of E2F1 signaling. In summary, the current study revealed an SNRPB2/MDM4/Rb axis in promoting the progression of TNBC, providing novel insights and novel targets for combating TNBC.
Collapse
Affiliation(s)
- Shiyi Yu
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Yue Si
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Jianzhong Yu
- Department of Internal Medicine, Haian Hospital of Traditional Chinese Medicine Affiliated to Nanjing University of Chinese Medicine, Nantong, China
| | - Chengyang Jiang
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Fei Cheng
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Miao Xu
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Zhehao Fan
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Fangchen Liu
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Chang Liu
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Ying Wang
- Department of Thyroid and Breast Surgery, The Affiliated Hospital of Yangzhou University, Yangzhou University, Yangzhou, China
| | - Ning Wang
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Chenxu Liu
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Caili Bi
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| | - Haibo Sun
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental and Translational Non-coding RNA Research, Yangzhou University, Yangzhou, China
| |
Collapse
|
6
|
Li Z, Zhang B, Chan JJ, Tabatabaeian H, Tong QY, Chew XH, Fan X, Driguez P, Chan C, Cheong F, Wang S, Siew BE, Tan IJW, Lee KY, Lieske B, Cheong WK, Kappei D, Tan KK, Gao X, Tay Y. An isoform-resolution transcriptomic atlas of colorectal cancer from long-read single-cell sequencing. CELL GENOMICS 2024; 4:100641. [PMID: 39216476 PMCID: PMC11480860 DOI: 10.1016/j.xgen.2024.100641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 06/06/2024] [Accepted: 08/07/2024] [Indexed: 09/04/2024]
Abstract
Colorectal cancer (CRC) ranks as the second leading cause of cancer deaths globally. In recent years, short-read single-cell RNA sequencing (scRNA-seq) has been instrumental in deciphering tumor heterogeneities. However, these studies only enable gene-level quantification but neglect alterations in transcript structures arising from alternative end processing or splicing. In this study, we integrated short- and long-read scRNA-seq of CRC samples to build an isoform-resolution CRC transcriptomic atlas. We identified 394 dysregulated transcript structures in tumor epithelial cells, including 299 resulting from various combinations of splicing events. Second, we characterized genes and isoforms associated with epithelial lineages and subpopulations exhibiting distinct prognoses. Among 31,935 isoforms with novel junctions, 330 were supported by The Cancer Genome Atlas RNA-seq and mass spectrometry data. Finally, we built an algorithm that integrated novel peptides derived from open reading frames of recurrent tumor-specific transcripts with mass spectrometry data and identified recurring neoepitopes that may aid the development of cancer vaccines.
Collapse
Affiliation(s)
- Zhongxiao Li
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia; Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia; Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia
| | - Bin Zhang
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia; Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia; Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.
| | - Jia Jia Chan
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore
| | - Hossein Tabatabaeian
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore
| | - Qing Yun Tong
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore
| | - Xiao Hong Chew
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore
| | - Xiaonan Fan
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore
| | - Patrick Driguez
- Core Labs, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - Charlene Chan
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore
| | - Faith Cheong
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore
| | - Shi Wang
- Department of Pathology, National University Health System, Singapore 119228, Singapore
| | - Bei En Siew
- Department of Surgery, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
| | - Ian Jse-Wei Tan
- Department of Surgery, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore; Division of Colorectal Surgery, University Surgical Cluster, National University Health System, Singapore 119228, Singapore
| | - Kai-Yin Lee
- Department of Surgery, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore; Division of Colorectal Surgery, University Surgical Cluster, National University Health System, Singapore 119228, Singapore
| | - Bettina Lieske
- Department of Surgery, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore; Division of Colorectal Surgery, University Surgical Cluster, National University Health System, Singapore 119228, Singapore
| | - Wai-Kit Cheong
- Department of Surgery, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore; Division of Colorectal Surgery, University Surgical Cluster, National University Health System, Singapore 119228, Singapore
| | - Dennis Kappei
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore; NUS Centre for Cancer Research, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore; Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore
| | - Ker-Kan Tan
- Department of Surgery, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore; Division of Colorectal Surgery, University Surgical Cluster, National University Health System, Singapore 119228, Singapore
| | - Xin Gao
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia; Center of Excellence for Smart Health (KCSH), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia; Center of Excellence on Generative AI, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia.
| | - Yvonne Tay
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599, Singapore; NUS Centre for Cancer Research, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore; Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117597, Singapore.
| |
Collapse
|
7
|
Yu S, Wu R, Si Y, Fan Z, Wang Y, Yao C, Sun R, Xue Y, Chen Y, Wang Z, Dong S, Wang N, Ling X, Liang Z, Bi C, Yang Y, Dong W, Sun H. Alternative splicing of ALDOA confers tamoxifen resistance in breast cancer. Oncogene 2024; 43:2901-2913. [PMID: 39164523 DOI: 10.1038/s41388-024-03134-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 08/09/2024] [Accepted: 08/12/2024] [Indexed: 08/22/2024]
Abstract
The cancer-associated alternative splicing (AS) events generate cancer-related transcripts which are involved in uncontrolled cell proliferation and drug resistance. However, the key AS variants implicated in tamoxifen (TAM) resistance in breast cancer remain elusive. In the current study, we investigated the landscape of AS events in nine pairs of primary and relapse breast tumors from patients receiving TAM-based therapy. We unrevealed a notable association between the inclusion of exon 7.2 in the 5'untranslated region (5'UTR) of ALDOA mRNA and TAM resistance. Mechanistically, the inclusion of ALDOA exon 7.2 enhances the translation efficiency of the transcript, resulting in increased ALDOA protein expression, mTOR pathway activity, and the promotion of TAM resistance in breast cancer cells. Moreover, the inclusion of exon 7.2 in ALDOA mRNA is mediated by MSI1 via direct interaction. In addition, elevated inclusion of ALDOA exon 7.2 or expression of MSI1 is associated with an unfavorable prognosis in patients undergoing endocrine therapy. Notably, treatment with Aldometanib, an ALDOA inhibitor, effectively restrains the growth of TAM-resistant breast cancer cells in vitro and in vivo. The present study unveils the pivotal role of an AS event in ALDOA, under the regulation of MSI1, in driving TAM resistance in breast cancer. Therefore, this study provides a promising therapeutic avenue targeting ALDOA to combat TAM resistance.
Collapse
Affiliation(s)
- Shiyi Yu
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Rui Wu
- School of Life Science, Liaoning Normal University, Dalian, China
| | - Yue Si
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Zhehao Fan
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Ying Wang
- Department of Thyroid and Breast Surgery, The Affiliated Hospital of Yangzhou University, Yangzhou University, Yangzhou, China
| | - Chang Yao
- The Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, China
| | - Rongmao Sun
- Department of Thyroid and Breast Surgery, The Affiliated Hospital of Yangzhou University, Yangzhou University, Yangzhou, China
| | - Yaji Xue
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Yongli Chen
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Zheng Wang
- Department of Pathology, The Affiliated Hospital of Yangzhou University, Yangzhou University, Yangzhou, China
| | - Shuangshuang Dong
- Department of Pathology, Northern Jiangsu People's Hospital Affiliated to Yangzhou University/Clinical Medical College, Yangzhou University, Yangzhou, China
| | - Ning Wang
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Xinyue Ling
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Zhengyan Liang
- School of Basic Medical Science, Guangdong Medical University, Dongguan, China
| | - Caili Bi
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China
| | - Yi Yang
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China.
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China.
| | - Weibing Dong
- School of Life Science, Liaoning Normal University, Dalian, China.
| | - Haibo Sun
- Institute of Translational Medicine, Medical College, Yangzhou University, Yangzhou, China.
- Jiangsu Key Laboratory of Experimental & Translational Non-Coding RNA Research, Yangzhou University, Yangzhou, China.
- Haian Hospital of Traditional Chinese Medicine, Haian, China.
| |
Collapse
|
8
|
Niranjan V, Setlur AS, K C, Kumkum S, Dasgupta S, Singh V, Desai V, Kumar J. Exploring the Synergistic Mechanism of AP2A2 Transcription Factor Inhibition via Molecular Modeling and Simulations as a Novel Computational Approach for Combating Breast Cancer: In Silico Interpretations. Mol Biotechnol 2024; 66:2497-2521. [PMID: 37747672 DOI: 10.1007/s12033-023-00871-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 08/28/2023] [Indexed: 09/26/2023]
Abstract
Studies have shown that transcription factor AP2A2 (activator protein-2 alpha-2) is involved in the expression of DLEC1, a tumor suppressor gene, which, when mutated, will cause breast cancer and is thus an excellent target for breast cancer studies. Therefore, in the present research, a synergistic approach toward combating breast cancer is proposed by blocking AP2A2 factor, and allowing the cancer cells to be sensitive to anti-cancer drugs. The effect of AP2A2 on breast cancer was first understood via gene analysis from cBioPortal. AP2A2 was then modeled using RaptorX and its structure was validated from Ramachandran plots. Using all ligands from MolPort database, molecular docking was performed against AP2A2, from which the top three best docked ligands were studied for toxicity in humans using Protox-II. Once the ligands passed these tests, the best complexes were simulated at 200ns in Desmond Maestro, to comprehend their stabilities, followed by the computations of free energies of binding via Molecular mechanics- Generalized Born Solvent Accessibility method (MM-GBSA). The results showed that molecules MolPort-005-945-556 (sachharolipids), MolPort-001-741-124 (flavonoids), and MolPort-005-944-667 (lignan glycosides) with AP2A2 passed toxicity evaluation and belonged to toxicity classes 6, 5, and 5, respectively, with good docking energies. 200 ns simulations revealed stable complexes with slight conformational changes. Stability of ligands was confirmed via snapshots at every 20 ns of the trajectory. Radial distribution of these molecules against the protein revealed very slight deviation from binding pocket. Additionally, the free binding energies for these complexes were found to be - 54.93 ± 12.982 kcal/mol, - 44.39 ± 14.393 kcal/mol, and - 66.51 ± 13.522 kcal/mol, respectively. A preliminary computational validation of the inability of AP2A2 to bind to DLEC1 in the presence of ligands offers beneficial insights into the potential of these ligands. Therefore, this study sheds light on the potential natural molecules that could stably block AP2A2 with least deviation and act in synergy to aid anti-cancer drugs work on breast cancer cells.
Collapse
Affiliation(s)
- Vidya Niranjan
- Department of Biotechnology, RV College of Engineering, Bangalore, 560059, India.
| | - Anagha S Setlur
- Department of Biotechnology, RV College of Engineering, Bangalore, 560059, India
| | - Chandrashekar K
- Department of Biotechnology, RV College of Engineering, Bangalore, 560059, India
| | - Sneha Kumkum
- Department of Biotechnology, RV College of Engineering, Bangalore, 560059, India
| | - Sanjana Dasgupta
- Department of Biotechnology, RV College of Engineering, Bangalore, 560059, India
| | - Varsha Singh
- Department of Biotechnology, RV College of Engineering, Bangalore, 560059, India
| | - Vrushali Desai
- Department of Biotechnology, RV College of Engineering, Bangalore, 560059, India
| | - Jitendra Kumar
- Biotechnology Industry Research Assistance Council (BIRAC), CGO complex Lodhi Road, New Delhi, India.
| |
Collapse
|
9
|
Gong B, Li D, Łabaj PP, Pan B, Novoradovskaya N, Thierry-Mieg D, Thierry-Mieg J, Chen G, Bergstrom Lucas A, LoCoco JS, Richmond TA, Tseng E, Kusko R, Happe S, Mercer TR, Pabón-Peña C, Salmans M, Tilgner HU, Xiao W, Johann DJ, Jones W, Tong W, Mason CE, Kreil DP, Xu J. Targeted DNA-seq and RNA-seq of Reference Samples with Short-read and Long-read Sequencing. Sci Data 2024; 11:892. [PMID: 39152166 PMCID: PMC11329654 DOI: 10.1038/s41597-024-03741-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 08/05/2024] [Indexed: 08/19/2024] Open
Abstract
Next-generation sequencing (NGS) has revolutionized genomic research by enabling high-throughput, cost-effective genome and transcriptome sequencing accelerating personalized medicine for complex diseases, including cancer. Whole genome/transcriptome sequencing (WGS/WTS) provides comprehensive insights, while targeted sequencing is more cost-effective and sensitive. In comparison to short-read sequencing, which still dominates the field due to high speed and cost-effectiveness, long-read sequencing can overcome alignment limitations and better discriminate similar sequences from alternative transcripts or repetitive regions. Hybrid sequencing combines the best strengths of different technologies for a more comprehensive view of genomic/transcriptomic variations. Understanding each technology's strengths and limitations is critical for translating cutting-edge technologies into clinical applications. In this study, we sequenced DNA and RNA libraries of reference samples using various targeted DNA and RNA panels and the whole transcriptome on both short-read and long-read platforms. This study design enables a comprehensive analysis of sequencing technologies, targeting protocols, and library preparation methods. Our expanded profiling landscape establishes a reference point for assessing current sequencing technologies, facilitating informed decision-making in genomic research and precision medicine.
Collapse
Affiliation(s)
- Binsheng Gong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, 72079, USA
| | - Dan Li
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, 72079, USA
| | - Paweł P Łabaj
- Małopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
- Bioinformatics Research, Institute of Molecular Biotechnology, Boku University Vienna, Vienna, Austria
| | - Bohu Pan
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, 72079, USA
| | | | - Danielle Thierry-Mieg
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD, 20894, USA
| | - Jean Thierry-Mieg
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD, 20894, USA
| | - Guangchun Chen
- Department of Immunology, Genomics and Microarray Core Facility, University of Texas Southwestern Medical Center, 5323 Harry Hine Blvd., Dallas, TX, 75390, USA
| | - Anne Bergstrom Lucas
- Agilent Technologies, Inc., 5301 Stevens Creek Blvd., Santa Clara, CA, 95051, USA
| | | | - Todd A Richmond
- Market & Application Development Bioinformatics, Roche Sequencing Solutions Inc., 4300 Hacienda Dr., Pleasanton, CA, 94588, USA
| | | | - Rebecca Kusko
- Cellino Bio, 750 Main Street, Cambridge, MA, 02143, USA
| | - Scott Happe
- Agilent Technologies, Inc., 1834 State Hwy 71 West, Cedar Creek, TX, 78612, USA
| | - Timothy R Mercer
- Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St Lucia, QLD, Australia
| | - Carlos Pabón-Peña
- Agilent Technologies, Inc., 5301 Stevens Creek Blvd., Santa Clara, CA, 95051, USA
| | | | - Hagen U Tilgner
- Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
- Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
| | - Wenzhong Xiao
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA, 94304, USA
- Massachusetts General Hospital, Harvard Medical School, Boston, MA, 02114, USA
| | - Donald J Johann
- Winthrop P Rockefeller Cancer Institute, University of Arkansas for Medical Sciences, 4301W Markham St., Little Rock, AR, 72205, USA
| | - Wendell Jones
- Q squared Solutions Genomics, 2400 Elis Road, Durham, NC, 27703, USA
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, 72079, USA
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, Cornell University, New York, NY, 10065, USA.
- The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA.
- The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA.
| | - David P Kreil
- Bioinformatics Research, Institute of Molecular Biotechnology, Boku University Vienna, Vienna, Austria.
| | - Joshua Xu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, 72079, USA.
| |
Collapse
|
10
|
Haj Abdullah Alieh L, Cardoso de Toledo B, Hadarovich A, Toth-Petroczy A, Calegari F. Characterization of alternative splicing during mammalian brain development reveals the extent of isoform diversity and potential effects on protein structural changes. Biol Open 2024; 13:bio061721. [PMID: 39387301 DOI: 10.1242/bio.061721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Accepted: 09/09/2024] [Indexed: 10/15/2024] Open
Abstract
Regulation of gene expression is critical for fate commitment of stem and progenitor cells during tissue formation. In the context of mammalian brain development, a plethora of studies have described how changes in the expression of individual genes characterize cell types across ontogeny and phylogeny. However, little attention has been paid to the fact that different transcripts can arise from any given gene through alternative splicing (AS). Considered a key mechanism expanding transcriptome diversity during evolution, assessing the full potential of AS on isoform diversity and protein function has been notoriously difficult. Here, we capitalize on the use of a validated reporter mouse line to isolate neural stem cells, neurogenic progenitors and neurons during corticogenesis and combine the use of short- and long-read sequencing to reconstruct the full transcriptome diversity characterizing neurogenic commitment. Extending available transcriptional profiles of the mammalian brain by nearly 50,000 new isoforms, we found that neurogenic commitment is characterized by a progressive increase in exon inclusion resulting in the profound remodeling of the transcriptional profile of specific cortical cell types. Most importantly, we computationally infer the biological significance of AS on protein structure by using AlphaFold2, revealing how radical protein conformational changes can arise from subtle changes in isoforms sequence. Together, our study reveals that AS has a greater potential to impact protein diversity and function than previously thought, independently from changes in gene expression.
Collapse
Affiliation(s)
| | | | - Anna Hadarovich
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany
- Center for Systems Biology Dresden, 01307 Dresden, Germany
| | - Agnes Toth-Petroczy
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany
- Center for Systems Biology Dresden, 01307 Dresden, Germany
- Cluster of Excellence Physics of Life, TU Dresden, 01062 Dresden, Germany
| | - Federico Calegari
- CRTD-Center for Regenerative Therapies Dresden, School of Medicine, TU Dresden, Germany
| |
Collapse
|
11
|
Parag RR, Yamamoto T, Saito K, Zhu D, Yang L, Van Meir EG. Novel Isoforms of Adhesion G Protein-Coupled Receptor B1 (ADGRB1/BAI1) Generated from an Alternative Promoter in Intron 17. Mol Neurobiol 2024:10.1007/s12035-024-04293-3. [PMID: 38941066 DOI: 10.1007/s12035-024-04293-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2024] [Accepted: 06/06/2024] [Indexed: 06/29/2024]
Abstract
Brain-specific angiogenesis inhibitor 1 (BAI1) belongs to the adhesion G-protein-coupled receptors, which exhibit large multi-domain extracellular N termini that mediate cell-cell and cell-matrix interactions. To explore the existence of BAI1 isoforms, we queried genomic datasets for markers of active chromatin and new transcript variants in the ADGRB1 (adhesion G-protein-coupled receptor B1) gene. Two major types of mRNAs were identified in human/mouse brain, those with a start codon in exon 2 encoding a full-length protein of a predicted size of 173.5/173.3 kDa and shorter transcripts starting from alternative exons at the intron 17/exon 18 boundary with new or exon 19 start codons, predicting two shorter isoforms of 76.9/76.4 and 70.8/70.5 kDa, respectively. Immunoblots on wild-type and Adgrb1 exon 2-deleted mice, reverse transcription PCR, and promoter-luciferase reporter assay confirmed that the shorter isoforms originate from an alternative promoter in intron 17. The shorter BAI1 isoforms lack most of the N terminus and are very close in structure to the truncated BAI1 isoform generated through GPS processing from the full-length receptor. The cleaved BAI1 isoform has a 19 amino acid extracellular stalk that may serve as a receptor agonist, while the alternative transcripts generate BAI1 isoforms with extracellular N termini of 5 or 60 amino acids. Further studies are warranted to compare the functions of these isoforms and examine the distinct roles they play in different tissues and cell types.
Collapse
Affiliation(s)
- Rashed Rezwan Parag
- Laboratory of Molecular Neuro-Oncology, Department of Neurosurgery, Heersink School of Medicine, University of Alabama at Birmingham (UAB), WTI 520E, 1824 6th Avenue South, Birmingham, AL, 35233, USA
- Graduate Biomedical Sciences, University of Alabama at Birmingham (UAB), Birmingham, AL, USA
| | - Takahiro Yamamoto
- Laboratory of Molecular Neuro-Oncology, Department of Neurosurgery, Heersink School of Medicine, University of Alabama at Birmingham (UAB), WTI 520E, 1824 6th Avenue South, Birmingham, AL, 35233, USA
- Department of Neurosurgery, Kumamoto University, Kumamoto, Japan
| | - Kiyotaka Saito
- Laboratory of Molecular Neuro-Oncology, Department of Neurosurgery, Heersink School of Medicine, University of Alabama at Birmingham (UAB), WTI 520E, 1824 6th Avenue South, Birmingham, AL, 35233, USA
| | - Dan Zhu
- Department of Neurosurgery, Emory University School of Medicine, Atlanta, GA, USA
| | - Liquan Yang
- Department of Neurosurgery, Emory University School of Medicine, Atlanta, GA, USA
| | - Erwin G Van Meir
- Laboratory of Molecular Neuro-Oncology, Department of Neurosurgery, Heersink School of Medicine, University of Alabama at Birmingham (UAB), WTI 520E, 1824 6th Avenue South, Birmingham, AL, 35233, USA.
- Department of Neurosurgery, Emory University School of Medicine, Atlanta, GA, USA.
- O'Neal Comprehensive Cancer Center, University of Alabama at Birmingham (UAB), Birmingham, AL, USA.
| |
Collapse
|
12
|
Wang Y, Zou B, Zhang Y, Zhang J, Li S, Yu B, An Z, Li L, Cui S, Zhang Y, Yao J, Shi X, Liu J. Comprehensive Long-Read Sequencing Analysis Discloses the Transcriptome Features of Papillary Thyroid Microcarcinoma. J Clin Endocrinol Metab 2024; 109:1263-1274. [PMID: 38038628 DOI: 10.1210/clinem/dgad695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/14/2023] [Accepted: 11/27/2023] [Indexed: 12/02/2023]
Abstract
CONTEXT Papillary thyroid microcarcinoma (PTMC) is the most common type of thyroid cancer. It has been shown that lymph node metastasis is associated with poor prognosis in patients with PTMC. OBJECTIVE We aim to characterize the PTMC transcriptome landscape and identify the candidate transcripts that are associated with lateral neck lymph node metastasis of PTMC. METHODS We performed full-length transcriptome sequencing in 64 PTMC samples. Standard bioinformatic pipelines were applied to characterize and annotate the full-length expression profiles of 2 PTMC subtypes. Functional open reading frame (ORF) annotation of the known and novel transcripts were predicted by HMMER, DeepLoc, and DeepTMHMM tools. Candidate transcripts associated with the pN1b subtype were identified after transcript quantification and differential gene expression analyses. RESULTS We found that skipping exons accounted for the more than 27.82% of the alternative splicing events. At least 42.56% of the discovered transcripts were novel isoforms of annotated genes. A total of 39 193 ORFs in novel transcripts and 18 596 ORFs in known transcripts were identified. Distribution patterns of the characterized transcripts in functional domain, subcellular localization, and transmembrane structure were predicted. In total, 1033 and 1204 differentially expressed genes were identified in the pN0 and pN1b groups, respectively. Moreover, novel isoforms of FRMD3, NOD1, and SHROOM4 were highlighted for their association with pN1b subtype. CONCLUSION Our data provided the global transcriptome landscape of PTMC and also revealed the novel isoforms that associated with PTMC aggressiveness.
Collapse
Affiliation(s)
- Yanqiang Wang
- Key Laboratory of Cellular Physiology of the Ministry of Education (Shanxi Medical University), Translational Medicine Research Center, Department of Pathology, Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Binbin Zou
- Key Laboratory of Cellular Physiology of the Ministry of Education (Shanxi Medical University), Translational Medicine Research Center, Department of Pathology, Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Yanyan Zhang
- Department of Thyroid Surgery, First Hospital of Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Jin Zhang
- Department of Thyroid Surgery, First Hospital of Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Shujing Li
- Department of Thyroid Surgery, First Hospital of Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Bo Yu
- Department of Thyroid Surgery, First Hospital of Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Zhekun An
- Key Laboratory of Cellular Physiology of the Ministry of Education (Shanxi Medical University), Translational Medicine Research Center, Department of Pathology, Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Lei Li
- Department of Thyroid Surgery, First Hospital of Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Siqian Cui
- Department of Thyroid Surgery, First Hospital of Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Yutong Zhang
- Key Laboratory of Cellular Physiology of the Ministry of Education (Shanxi Medical University), Translational Medicine Research Center, Department of Pathology, Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Jiali Yao
- Key Laboratory of Cellular Physiology of the Ministry of Education (Shanxi Medical University), Translational Medicine Research Center, Department of Pathology, Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Xiuzhi Shi
- Key Laboratory of Cellular Physiology of the Ministry of Education (Shanxi Medical University), Translational Medicine Research Center, Department of Pathology, Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Jing Liu
- Department of Thyroid Surgery, First Hospital of Shanxi Medical University, Taiyuan, Shanxi 030001, China
| |
Collapse
|
13
|
Lambourne L, Mattioli K, Santoso C, Sheynkman G, Inukai S, Kaundal B, Berenson A, Spirohn-Fitzgerald K, Bhattacharjee A, Rothman E, Shrestha S, Laval F, Yang Z, Bisht D, Sewell JA, Li G, Prasad A, Phanor S, Lane R, Campbell DM, Hunt T, Balcha D, Gebbia M, Twizere JC, Hao T, Frankish A, Riback JA, Salomonis N, Calderwood MA, Hill DE, Sahni N, Vidal M, Bulyk ML, Fuxman Bass JI. Widespread variation in molecular interactions and regulatory properties among transcription factor isoforms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.12.584681. [PMID: 38617209 PMCID: PMC11014633 DOI: 10.1101/2024.03.12.584681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Most human Transcription factors (TFs) genes encode multiple protein isoforms differing in DNA binding domains, effector domains, or other protein regions. The global extent to which this results in functional differences between isoforms remains unknown. Here, we systematically compared 693 isoforms of 246 TF genes, assessing DNA binding, protein binding, transcriptional activation, subcellular localization, and condensate formation. Relative to reference isoforms, two-thirds of alternative TF isoforms exhibit differences in one or more molecular activities, which often could not be predicted from sequence. We observed two primary categories of alternative TF isoforms: "rewirers" and "negative regulators", both of which were associated with differentiation and cancer. Our results support a model wherein the relative expression levels of, and interactions involving, TF isoforms add an understudied layer of complexity to gene regulatory networks, demonstrating the importance of isoform-aware characterization of TF functions and providing a rich resource for further studies.
Collapse
Affiliation(s)
- Luke Lambourne
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Kaia Mattioli
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Clarissa Santoso
- Department of Biology, Boston University, Boston, MA, USA
- Bioinformatics Program, Boston University, Boston, MA, USA
| | - Gloria Sheynkman
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Sachi Inukai
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Babita Kaundal
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Anna Berenson
- Molecular Biology, Cell Biology & Biochemistry Program, Boston University, Boston, MA, USA
| | - Kerstin Spirohn-Fitzgerald
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Anukana Bhattacharjee
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Elisabeth Rothman
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | | | - Florent Laval
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
- TERRA Teaching and Research Centre, University of Liège, Gembloux, Belgium
- Laboratory of Viral Interactomes, GIGA Institute, University of Liège, Liège, Belgium
| | - Zhipeng Yang
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Deepa Bisht
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jared A Sewell
- Department of Biology, Boston University, Boston, MA, USA
| | - Guangyuan Li
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Anisa Prasad
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Harvard College, Cambridge MA, USA
| | - Sabrina Phanor
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Ryan Lane
- Department of Biology, Boston University, Boston, MA, USA
| | | | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Dawit Balcha
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Marinella Gebbia
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Lunenfeld-Tanenbaum Research Institute (LTRI), Sinai Health System, Toronto, Ontario, Canada
| | - Jean-Claude Twizere
- TERRA Teaching and Research Centre, University of Liège, Gembloux, Belgium
- Laboratory of Viral Interactomes, GIGA Institute, University of Liège, Liège, Belgium
| | - Tong Hao
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Adam Frankish
- Laboratory of Viral Interactomes, GIGA Institute, University of Liège, Liège, Belgium
| | - Josh A Riback
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, USA
| | - Nathan Salomonis
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Michael A Calderwood
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - David E Hill
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Nidhi Sahni
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Marc Vidal
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Martha L Bulyk
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Juan I Fuxman Bass
- Department of Biology, Boston University, Boston, MA, USA
- Bioinformatics Program, Boston University, Boston, MA, USA
- Molecular Biology, Cell Biology & Biochemistry Program, Boston University, Boston, MA, USA
| |
Collapse
|
14
|
Murali M, Saquing J, Lu S, Gao Z, Jordan B, Wakefield ZP, Fiszbein A, Cooper DR, Castaldi PJ, Korkin D, Sheynkman G. Biosurfer for systematic tracking of regulatory mechanisms leading to protein isoform diversity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.15.585320. [PMID: 38559226 PMCID: PMC10980011 DOI: 10.1101/2024.03.15.585320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Long-read RNA sequencing has shed light on transcriptomic complexity, but questions remain about the functionality of downstream protein products. We introduce Biosurfer, a computational approach for comparing protein isoforms, while systematically tracking the transcriptional, splicing, and translational variations that underlie differences in the sequences of the protein products. Using Biosurfer, we analyzed the differences in 32,799 pairs of GENCODE annotated protein isoforms, finding a majority (70%) of variable N-termini are due to the alternative transcription start sites, while only 9% arise from 5' UTR alternative splicing. Biosurfer's detailed tracking of nucleotide-to-residue relationships helped reveal an uncommonly tracked source of single amino acid residue changes arising from the codon splits at junctions. For 17% of internal sequence changes, such split codon patterns lead to single residue differences, termed "ragged codons". Of variable C-termini, 72% involve splice- or intron retention-induced reading frameshifts. We found an unusual pattern of reading frame changes, in which the first frameshift is closely followed by a distinct second frameshift that restores the original frame, which we term a "snapback" frameshift. We analyzed long read RNA-seq-predicted proteome of a human cell line and found similar trends as compared to our GENCODE analysis, with the exception of a higher proportion of isoforms predicted to undergo nonsense-mediated decay. Biosurfer's comprehensive characterization of long-read RNA-seq datasets should accelerate insights of the functional role of protein isoforms, providing mechanistic explanation of the origins of the proteomic diversity driven by the alternative splicing. Biosurfer is available as a Python package at https://github.com/sheynkman-lab/biosurfer.
Collapse
Affiliation(s)
- Mayank Murali
- Broad Institute of MIT and Harvard University, Cambridge, MA, USA
| | - Jamie Saquing
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Senbao Lu
- Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA
- Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA
| | - Ziyang Gao
- Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA
- Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA
| | - Ben Jordan
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Zachary Peters Wakefield
- Bioinformatics Program, Boston University, Boston, MA, USA
- Department of Biology, Boston University, Boston, MA, USA
| | - Ana Fiszbein
- Bioinformatics Program, Boston University, Boston, MA, USA
- Department of Biology, Boston University, Boston, MA, USA
| | - David R. Cooper
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - Peter J. Castaldi
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
- Division of General Medicine and Primary Care, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
| | - Dmitry Korkin
- Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA
- Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA
| | - Gloria Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- UVA Cancer Center, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
15
|
Shi Q, Li X, Liu Y, Chen Z, He X. FLIBase: a comprehensive repository of full-length isoforms across human cancers and tissues. Nucleic Acids Res 2024; 52:D124-D133. [PMID: 37697439 PMCID: PMC10767943 DOI: 10.1093/nar/gkad745] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 08/14/2023] [Accepted: 08/31/2023] [Indexed: 09/13/2023] Open
Abstract
Regulatory processes at the RNA transcript level play a crucial role in generating transcriptome diversity and proteome composition in human cells, impacting both physiological and pathological states. This study introduces FLIBase (www.FLIBase.org), a specialized database that focuses on annotating full-length isoforms using long-read sequencing techniques. We collected and integrated long-read (351 samples) and short-read (12 469 samples) RNA sequencing data from diverse normal and cancerous human tissues and cells. The current version of FLIBase comprises a total of 983 789 full-length spliced isoforms, identified through long-read sequences and verified using short-read exon-exon splice junctions. Of these, 188 248 isoforms have been annotated, while 795 541 isoforms remain unannotated. By overcoming the limitations of short-read RNA sequencing methods, FLIBase provides an accurate and comprehensive representation of full-length transcripts. These comprehensive annotations empower researchers to undertake various downstream analyses and investigations. Importantly, FLIBase exhibits a significant advantage in identifying a substantial number of previously unannotated isoforms and tumor-specific RNA transcripts. These tumor-specific RNA transcripts have the potential to serve as a source of immunogenic recurrent neoantigens. This remarkable discovery holds tremendous promise for advancing the development of tailored RNA-based diagnostic and therapeutic strategies for various types of human cancer.
Collapse
Affiliation(s)
- Qili Shi
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Xinrong Li
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Yizhe Liu
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Zhiao Chen
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
- Key Laboratory of Breast Cancer in Shanghai, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
- Shanghai Key Laboratory of Radiation Oncology, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
| | - Xianghuo He
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
- Key Laboratory of Breast Cancer in Shanghai, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
- Shanghai Key Laboratory of Radiation Oncology, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
| |
Collapse
|
16
|
Mestre-Tomás J, Liu T, Pardo-Palacios F, Conesa A. SQANTI-SIM: a simulator of controlled transcript novelty for lrRNA-seq benchmark. Genome Biol 2023; 24:286. [PMID: 38082294 PMCID: PMC10712166 DOI: 10.1186/s13059-023-03127-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 11/27/2023] [Indexed: 12/18/2023] Open
Abstract
Long-read RNA sequencing has emerged as a powerful tool for transcript discovery, even in well-annotated organisms. However, assessing the accuracy of different methods in identifying annotated and novel transcripts remains a challenge. Here, we present SQANTI-SIM, a versatile tool that wraps around popular long-read simulators to allow precise management of transcript novelty based on the structural categories defined by SQANTI3. By selectively excluding specific transcripts from the reference dataset, SQANTI-SIM effectively emulates scenarios involving unannotated transcripts. Furthermore, the tool provides customizable features and supports the simulation of additional types of data, representing the first multi-omics simulation tool for the lrRNA-seq field.
Collapse
Affiliation(s)
- Jorge Mestre-Tomás
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedrátic Agustín Escardino Benlloch, Paterna, 46980, Spain
- Department of Applied Statistics, Operations Research and Quality, Universitat Politècnica de València, Camino de Vera, Valencia, 46022, Spain
| | - Tianyuan Liu
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedrátic Agustín Escardino Benlloch, Paterna, 46980, Spain
| | - Francisco Pardo-Palacios
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedrátic Agustín Escardino Benlloch, Paterna, 46980, Spain
| | - Ana Conesa
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedrátic Agustín Escardino Benlloch, Paterna, 46980, Spain.
| |
Collapse
|
17
|
Dondi A, Lischetti U, Jacob F, Singer F, Borgsmüller N, Coelho R, Heinzelmann-Schwarz V, Beisel C, Beerenwinkel N. Detection of isoforms and genomic alterations by high-throughput full-length single-cell RNA sequencing in ovarian cancer. Nat Commun 2023; 14:7780. [PMID: 38012143 PMCID: PMC10682465 DOI: 10.1038/s41467-023-43387-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 11/07/2023] [Indexed: 11/29/2023] Open
Abstract
Understanding the complex background of cancer requires genotype-phenotype information in single-cell resolution. Here, we perform long-read single-cell RNA sequencing (scRNA-seq) on clinical samples from three ovarian cancer patients presenting with omental metastasis and increase the PacBio sequencing depth to 12,000 reads per cell. Our approach captures 152,000 isoforms, of which over 52,000 were not previously reported. Isoform-level analysis accounting for non-coding isoforms reveals 20% overestimation of protein-coding gene expression on average. We also detect cell type-specific isoform and poly-adenylation site usage in tumor and mesothelial cells, and find that mesothelial cells transition into cancer-associated fibroblasts in the metastasis, partly through the TGF-β/miR-29/Collagen axis. Furthermore, we identify gene fusions, including an experimentally validated IGF2BP2::TESPA1 fusion, which is misclassified as high TESPA1 expression in matched short-read data, and call mutations confirmed by targeted NGS cancer gene panel results. With these findings, we envision long-read scRNA-seq to become increasingly relevant in oncology and personalized medicine.
Collapse
Affiliation(s)
- Arthur Dondi
- ETH Zurich, Department of Biosystems Science and Engineering, Mattenstrasse 26, 4058, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Mattenstrasse 26, 4058, Basel, Switzerland
| | - Ulrike Lischetti
- ETH Zurich, Department of Biosystems Science and Engineering, Mattenstrasse 26, 4058, Basel, Switzerland.
- University Hospital Basel and University of Basel, Ovarian Cancer Research, Department of Biomedicine, Hebelstrasse 20, 4031, Basel, Switzerland.
| | - Francis Jacob
- University Hospital Basel and University of Basel, Ovarian Cancer Research, Department of Biomedicine, Hebelstrasse 20, 4031, Basel, Switzerland
| | - Franziska Singer
- SIB Swiss Institute of Bioinformatics, Mattenstrasse 26, 4058, Basel, Switzerland
- ETH Zurich, NEXUS Personalized Health Technologies, Wagistrasse 18, 8952, Schlieren, Switzerland
| | - Nico Borgsmüller
- ETH Zurich, Department of Biosystems Science and Engineering, Mattenstrasse 26, 4058, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Mattenstrasse 26, 4058, Basel, Switzerland
| | - Ricardo Coelho
- University Hospital Basel and University of Basel, Ovarian Cancer Research, Department of Biomedicine, Hebelstrasse 20, 4031, Basel, Switzerland
| | - Viola Heinzelmann-Schwarz
- University Hospital Basel and University of Basel, Ovarian Cancer Research, Department of Biomedicine, Hebelstrasse 20, 4031, Basel, Switzerland
- University Hospital Basel, Gynecological Cancer Center, Spitalstrasse 21, 4031, Basel, Switzerland
| | - Christian Beisel
- ETH Zurich, Department of Biosystems Science and Engineering, Mattenstrasse 26, 4058, Basel, Switzerland.
| | - Niko Beerenwinkel
- ETH Zurich, Department of Biosystems Science and Engineering, Mattenstrasse 26, 4058, Basel, Switzerland.
- SIB Swiss Institute of Bioinformatics, Mattenstrasse 26, 4058, Basel, Switzerland.
| |
Collapse
|
18
|
Wu H, Lu Y, Duan Z, Wu J, Lin M, Wu Y, Han S, Li T, Fan Y, Hu X, Xiao H, Feng J, Lu Z, Kong D, Li S. Nanopore long-read RNA sequencing reveals functional alternative splicing variants in human vascular smooth muscle cells. Commun Biol 2023; 6:1104. [PMID: 37907652 PMCID: PMC10618188 DOI: 10.1038/s42003-023-05481-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 10/18/2023] [Indexed: 11/02/2023] Open
Abstract
Vascular smooth muscle cells (VSMCs) are the major contributor to vascular repair and remodeling, which showed high level of phenotypic plasticity. Abnormalities in VSMC plasticity can lead to multiple cardiovascular diseases, wherein alternative splicing plays important roles. However, alternative splicing variants in VSMC plasticity are not fully understood. Here we systematically characterized the long-read transcriptome and their dysregulation in human aortic smooth muscle cells (HASMCs) by employing the Oxford Nanopore Technologies long-read RNA sequencing in HASMCs that are separately treated with platelet-derived growth factor, transforming growth factor, and hsa-miR-221-3P transfection. Our analysis reveals frequent alternative splicing events and thousands of unannotated transcripts generated from alternative splicing. HASMCs treated with different factors exhibit distinct transcriptional reprogramming modulated by alternative splicing. We also found that unannotated transcripts produce different open reading frames compared to the annotated transcripts. Finally, we experimentally validated the unannotated transcript derived from gene CISD1, namely CISD1-u, which plays a role in the phenotypic switch of HASMCs. Our study characterizes the phenotypic modulation of HASMCs from an insight of long-read transcriptome, which would promote the understanding and the manipulation of HASMC plasticity in cardiovascular diseases.
Collapse
Affiliation(s)
- Hao Wu
- Department of Cardiovascular Surgery, Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yicheng Lu
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhenzhen Duan
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Jingni Wu
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Minghui Lin
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yangjun Wu
- Department of Gynecological Oncology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Siyang Han
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Tongqi Li
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yuqi Fan
- North Cross School Shanghai, Shanghai, China
| | - Xiaoyuan Hu
- H. Milton Stewart School of Industrial and Systems Engineering, College of Engineering, Geogia Institute of Technology, Atlanta, GA, USA
| | - Hongyan Xiao
- Department of Cardiac Surgery, Wuhan Asia Heart Hospital, Wuhan University of Science and Technology, Wuhan, China
| | - Jiaxuan Feng
- Department of Vascular Surgery and Intervention Center, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhiqian Lu
- Department of Cardiovascular Surgery, Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| | - Deping Kong
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| | - Shengli Li
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| |
Collapse
|
19
|
Mestre-Tomás J, Liu T, Pardo-Palacios F, Conesa A. SQANTI-SIM: a simulator of controlled transcript novelty for lrRNA-seq benchmark. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.23.554392. [PMID: 37662216 PMCID: PMC10473693 DOI: 10.1101/2023.08.23.554392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Long-read RNA-seq has emerged as a powerful tool for transcript discovery, even in well-annotated organisms. However, assessing the accuracy of different methods in identifying annotated and novel transcripts remains a challenge. Here, we present SQANTI-SIM, a versatile utility that wraps around popular long-read simulators to allow precise management of transcript novelty based on the structural categories defined by SQANTI3. By selectively excluding specific transcripts from the reference dataset, SQANTI-SIM effectively emulates scenarios involving unannotated transcripts. Furthermore, the tool provides customizable features and supports the simulation of additional types of data, representing the first multi-omics simulation tool for the lrRNA-seq field. We demonstrate the effectiveness of SQANTI-SIM by benchmarking five transcriptome reconstruction pipelines using the simulated data.
Collapse
Affiliation(s)
- Jorge Mestre-Tomás
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedràtic Agustín Escardino Benlloch, Paterna, 46980, Spain
| | - Tianyuan Liu
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedràtic Agustín Escardino Benlloch, Paterna, 46980, Spain
| | - Francisco Pardo-Palacios
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedràtic Agustín Escardino Benlloch, Paterna, 46980, Spain
| | - Ana Conesa
- Institute for Integrative Systems Biology, Spanish National Research Council, Catedràtic Agustín Escardino Benlloch, Paterna, 46980, Spain
| |
Collapse
|
20
|
Wang F, Xu Y, Wang R, Zhang B, Smith N, Notaro A, Gaerlan S, Kutschera E, Kadash-Edmondson KE, Xing Y, Lin L. TEQUILA-seq: a versatile and low-cost method for targeted long-read RNA sequencing. Nat Commun 2023; 14:4760. [PMID: 37553321 PMCID: PMC10409798 DOI: 10.1038/s41467-023-40083-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Accepted: 07/11/2023] [Indexed: 08/10/2023] Open
Abstract
Long-read RNA sequencing (RNA-seq) is a powerful technology for transcriptome analysis, but the relatively low throughput of current long-read sequencing platforms limits transcript coverage. One strategy for overcoming this bottleneck is targeted long-read RNA-seq for preselected gene panels. We present TEQUILA-seq, a versatile, easy-to-implement, and low-cost method for targeted long-read RNA-seq utilizing isothermally linear-amplified capture probes. When performed on the Oxford nanopore platform with multiple gene panels of varying sizes, TEQUILA-seq consistently and substantially enriches transcript coverage while preserving transcript quantification. We profile full-length transcript isoforms of 468 actionable cancer genes across 40 representative breast cancer cell lines. We identify transcript isoforms enriched in specific subtypes and discover novel transcript isoforms in extensively studied cancer genes such as TP53. Among cancer genes, tumor suppressor genes (TSGs) are significantly enriched for aberrant transcript isoforms targeted for degradation via mRNA nonsense-mediated decay, revealing a common RNA-associated mechanism for TSG inactivation. TEQUILA-seq reduces the per-reaction cost of targeted capture by 2-3 orders of magnitude, as compared to a standard commercial solution. TEQUILA-seq can be broadly used for targeted sequencing of full-length transcripts in diverse biomedical research settings.
Collapse
Affiliation(s)
- Feng Wang
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yang Xu
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Graduate Group in Genomics and Computational Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Robert Wang
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Graduate Group in Genomics and Computational Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Beatrice Zhang
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Noah Smith
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Amber Notaro
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Samantha Gaerlan
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Eric Kutschera
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Kathryn E Kadash-Edmondson
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yi Xing
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
| | - Lan Lin
- Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
| |
Collapse
|
21
|
Shiau CK, Lu L, Kieser R, Fukumura K, Pan T, Lin HY, Yang J, Tong EL, Lee G, Yan Y, Huse JT, Gao R. High throughput single cell long-read sequencing analyses of same-cell genotypes and phenotypes in human tumors. Nat Commun 2023; 14:4124. [PMID: 37433798 PMCID: PMC10336110 DOI: 10.1038/s41467-023-39813-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 06/27/2023] [Indexed: 07/13/2023] Open
Abstract
Single-cell nanopore sequencing of full-length mRNAs transforms single-cell multi-omics studies. However, challenges include high sequencing errors and dependence on short-reads and/or barcode whitelists. To address these, we develop scNanoGPS to calculate same-cell genotypes (mutations) and phenotypes (gene/isoform expressions) without short-read nor whitelist guidance. We apply scNanoGPS onto 23,587 long-read transcriptomes from 4 tumors and 2 cell-lines. Standalone, scNanoGPS deconvolutes error-prone long-reads into single-cells and single-molecules, and simultaneously accesses both phenotypes and genotypes of individual cells. Our analyses reveal that tumor and stroma/immune cells express distinct combination of isoforms (DCIs). In a kidney tumor, we identify 924 DCI genes involved in cell-type-specific functions such as PDE10A in tumor cells and CCL3 in lymphocytes. Transcriptome-wide mutation analyses identify many cell-type-specific mutations including VEGFA mutations in tumor cells and HLA-A mutations in immune cells, highlighting the critical roles of different mutant populations in tumors. Together, scNanoGPS facilitates applications of single-cell long-read sequencing technologies.
Collapse
Affiliation(s)
- Cheng-Kai Shiau
- Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
- Center for Cancer Genomics, Robert H. Lurie Cancer Center, Northwestern University, Chicago, IL, 60611, USA
| | - Lina Lu
- Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
- Center for Cancer Genomics, Robert H. Lurie Cancer Center, Northwestern University, Chicago, IL, 60611, USA
| | - Rachel Kieser
- Center for RNA Therapeutics, Houston Methodist Research Institute, Houston, TX, 77030, USA
| | - Kazutaka Fukumura
- Department of Translational Molecular Pathology, University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Timothy Pan
- Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
- Center for Cancer Genomics, Robert H. Lurie Cancer Center, Northwestern University, Chicago, IL, 60611, USA
- The Driskill Graduate Program, Northwestern University, Chicago, IL, 60611, USA
| | - Hsiao-Yun Lin
- Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
- Center for Cancer Genomics, Robert H. Lurie Cancer Center, Northwestern University, Chicago, IL, 60611, USA
| | - Jie Yang
- Department of Radiation Oncology, New York University Langone School of Medicine, New York, NY, 100167, USA
| | - Eric L Tong
- School of Engineering, University of Texas at Austin, Austin, TX, 78712, USA
| | - GaHyun Lee
- Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
| | - Yuanqing Yan
- Department of Surgery, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA
| | - Jason T Huse
- Department of Pathology and Translational Molecular Pathology, University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Ruli Gao
- Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA.
- Center for Cancer Genomics, Robert H. Lurie Cancer Center, Northwestern University, Chicago, IL, 60611, USA.
- The Driskill Graduate Program, Northwestern University, Chicago, IL, 60611, USA.
| |
Collapse
|
22
|
Mastrorosa FK, Miller DE, Eichler EE. Applications of long-read sequencing to Mendelian genetics. Genome Med 2023; 15:42. [PMID: 37316925 PMCID: PMC10266321 DOI: 10.1186/s13073-023-01194-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 05/18/2023] [Indexed: 06/16/2023] Open
Abstract
Advances in clinical genetic testing, including the introduction of exome sequencing, have uncovered the molecular etiology for many rare and previously unsolved genetic disorders, yet more than half of individuals with a suspected genetic disorder remain unsolved after complete clinical evaluation. A precise genetic diagnosis may guide clinical treatment plans, allow families to make informed care decisions, and permit individuals to participate in N-of-1 trials; thus, there is high interest in developing new tools and techniques to increase the solve rate. Long-read sequencing (LRS) is a promising technology for both increasing the solve rate and decreasing the amount of time required to make a precise genetic diagnosis. Here, we summarize current LRS technologies, give examples of how they have been used to evaluate complex genetic variation and identify missing variants, and discuss future clinical applications of LRS. As costs continue to decrease, LRS will find additional utility in the clinical space fundamentally changing how pathological variants are discovered and eventually acting as a single-data source that can be interrogated multiple times for clinical service.
Collapse
Affiliation(s)
| | - Danny E Miller
- Division of Genetic Medicine, Department of Pediatrics, University of Washington and Seattle Children's Hospital, Seattle, WA, 98195, USA
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, WA, 98195, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA.
| |
Collapse
|
23
|
Wu S, Schmitz U. Single-cell and long-read sequencing to enhance modelling of splicing and cell-fate determination. Comput Struct Biotechnol J 2023; 21:2373-2380. [PMID: 37066125 PMCID: PMC10091034 DOI: 10.1016/j.csbj.2023.03.023] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 03/13/2023] [Accepted: 03/13/2023] [Indexed: 04/03/2023] Open
Abstract
Single-cell sequencing technologies have revolutionised the life sciences and biomedical research. Single-cell sequencing provides high-resolution data on cell heterogeneity, allowing high-fidelity cell type identification, and lineage tracking. Computational algorithms and mathematical models have been developed to make sense of the data, compensate for errors and simulate the biological processes, which has led to breakthroughs in our understanding of cell differentiation, cell-fate determination and tissue cell composition. The development of long-read (a.k.a. third-generation) sequencing technologies has produced powerful tools for investigating alternative splicing, isoform expression (at the RNA level), genome assembly and the detection of complex structural variants (at the DNA level). In this review, we provide an overview of the recent advancements in single-cell and long-read sequencing technologies, with a particular focus on the computational algorithms that help in correcting, analysing, and interpreting the resulting data. Additionally, we review some mathematical models that use single-cell and long-read sequencing data to study cell-fate determination and alternative splicing, respectively. Moreover, we highlight the emerging opportunities in modelling cell-fate determination that result from the combination of single-cell and long-read sequencing technologies.
Collapse
Affiliation(s)
- Siyuan Wu
- Department of Molecular & Cell Biology, James Cook University, Townsville 4811, Queensland, Australia
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Cairns 4870, Queensland, Australia
- School of Mathematics, Monash University, Melbourne 3800, Victoria, Australia
| | - Ulf Schmitz
- Department of Molecular & Cell Biology, James Cook University, Townsville 4811, Queensland, Australia
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Cairns 4870, Queensland, Australia
| |
Collapse
|
24
|
Abstract
Dysregulated RNA splicing is a molecular feature that characterizes almost all tumour types. Cancer-associated splicing alterations arise from both recurrent mutations and altered expression of trans-acting factors governing splicing catalysis and regulation. Cancer-associated splicing dysregulation can promote tumorigenesis via diverse mechanisms, contributing to increased cell proliferation, decreased apoptosis, enhanced migration and metastatic potential, resistance to chemotherapy and evasion of immune surveillance. Recent studies have identified specific cancer-associated isoforms that play critical roles in cancer cell transformation and growth and demonstrated the therapeutic benefits of correcting or otherwise antagonizing such cancer-associated mRNA isoforms. Clinical-grade small molecules that modulate or inhibit RNA splicing have similarly been developed as promising anticancer therapeutics. Here, we review splicing alterations characteristic of cancer cell transcriptomes, dysregulated splicing's contributions to tumour initiation and progression, and existing and emerging approaches for targeting splicing for cancer therapy. Finally, we discuss the outstanding questions and challenges that must be addressed to translate these findings into the clinic.
Collapse
Affiliation(s)
- Robert K Bradley
- Computational Biology Program, Public Health Sciences Division and Basic Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA, USA.
| | - Olga Anczuków
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
- Department of Genetics and Genome Sciences, UConn Health, Farmington, CT, USA.
| |
Collapse
|
25
|
Li J, Guan D, Halstead MM, Islas-Trejo AD, Goszczynski DE, Ernst CW, Cheng H, Ross P, Zhou H. Transcriptome annotation of 17 porcine tissues using nanopore sequencing technology. Anim Genet 2023; 54:35-44. [PMID: 36385508 DOI: 10.1111/age.13274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 10/20/2022] [Accepted: 11/01/2022] [Indexed: 11/18/2022]
Abstract
The annotation of animal genomes plays an important role in elucidating molecular mechanisms behind the genetic control of economically important traits. Here, we employed long-read sequencing technology, Oxford Nanopore Technology, to annotate the pig transcriptome across 17 tissues from two Yorkshire littermate pigs. More than 9.8 million reads were obtained from a single flow cell, and 69 781 unique transcripts at 50 108 loci were identified. Of these transcripts, 16 255 were found to be novel isoforms, and 22 344 were found at loci that were novel and unannotated in the Ensembl (release 102) and NCBI (release 106) annotations. Novel transcripts were mostly expressed in cerebellum, followed by lung, liver, spleen, and hypothalamus. By comparing the unannotated transcripts to existing databases, there were 21 285 (95.3%) transcripts matched to the NT database (v5) and 13 676 (61.2%) matched to the NR database (v5). Moreover, there were 4324 (19.4%) transcripts matched to the SwissProt database (v5), corresponding to 11 356 proteins. Tissue-specific gene expression analyses showed that 9749 transcripts were highly tissue-specific, and cerebellum contained the most tissue-specific transcripts. As the same samples were used for the annotation of cis-regulatory elements in the pig genome, the transcriptome annotation generated by this study provides an additional and complementary annotation resource for the Functional Annotation of Animal Genomes effort to comprehensively annotate the pig genome.
Collapse
Affiliation(s)
- Jinghui Li
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Dailu Guan
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Michelle M Halstead
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Alma D Islas-Trejo
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Daniel E Goszczynski
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Catherine W Ernst
- Department of Animal Science, Michigan State University, East Lansing, Michigan, USA
| | - Hao Cheng
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Pablo Ross
- Department of Animal Science, University of California Davis, Davis, California, USA
| | - Huaijun Zhou
- Department of Animal Science, University of California Davis, Davis, California, USA
| |
Collapse
|
26
|
Balachandran P, Walawalkar IA, Flores JI, Dayton JN, Audano PA, Beck CR. Transposable element-mediated rearrangements are prevalent in human genomes. Nat Commun 2022; 13:7115. [PMID: 36402840 PMCID: PMC9675761 DOI: 10.1038/s41467-022-34810-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 11/08/2022] [Indexed: 11/21/2022] Open
Abstract
Transposable elements constitute about half of human genomes, and their role in generating human variation through retrotransposition is broadly studied and appreciated. Structural variants mediated by transposons, which we call transposable element-mediated rearrangements (TEMRs), are less well studied, and the mechanisms leading to their formation as well as their broader impact on human diversity are poorly understood. Here, we identify 493 unique TEMRs across the genomes of three individuals. While homology directed repair is the dominant driver of TEMRs, our sequence-resolved TEMR resource allows us to identify complex inversion breakpoints, triplications or other high copy number polymorphisms, and additional complexities. TEMRs are enriched in genic loci and can create potentially important risk alleles such as a deletion in TRIM65, a known cancer biomarker and therapeutic target. These findings expand our understanding of this important class of structural variation, the mechanisms responsible for their formation, and establish them as an important driver of human diversity.
Collapse
Affiliation(s)
| | | | - Jacob I Flores
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Jacob N Dayton
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Peter A Audano
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT, USA.
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA.
| |
Collapse
|
27
|
Urbanski L, Brugiolo M, Park S, Angarola BL, Leclair NK, Yurieva M, Palmer P, Sahu SK, Anczuków O. MYC regulates a pan-cancer network of co-expressed oncogenic splicing factors. Cell Rep 2022; 41:111704. [DOI: 10.1016/j.celrep.2022.111704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 05/16/2022] [Accepted: 11/01/2022] [Indexed: 11/23/2022] Open
|
28
|
Guan D, Halstead MM, Islas-Trejo AD, Goszczynski DE, Cheng HH, Ross PJ, Zhou H. Prediction of transcript isoforms in 19 chicken tissues by Oxford Nanopore long-read sequencing. Front Genet 2022; 13:997460. [PMID: 36246588 PMCID: PMC9561881 DOI: 10.3389/fgene.2022.997460] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 08/30/2022] [Indexed: 11/22/2022] Open
Abstract
To identify and annotate transcript isoforms in the chicken genome, we generated Nanopore long-read sequencing data from 68 samples that encompassed 19 diverse tissues collected from experimental adult male and female White Leghorn chickens. More than 23.8 million reads with mean read length of 790 bases and average quality of 18.2 were generated. The annotation and subsequent filtering resulted in the identification of 55,382 transcripts at 40,547 loci with mean length of 1,700 bases. We predicted 30,967 coding transcripts at 19,461 loci, and 16,495 lncRNA transcripts at 15,512 loci. Compared to existing reference annotations, we found ∼52% of annotated transcripts could be partially or fully matched while ∼47% were novel. Seventy percent of novel transcripts were potentially transcribed from lncRNA loci. Based on our annotation, we quantified transcript expression across tissues and found two brain tissues (i.e., cerebellum and cortex) expressed the highest number of transcripts and loci. Furthermore, ∼22% of the transcripts displayed tissue specificity with the reproductive tissues (i.e., testis and ovary) exhibiting the most tissue-specific transcripts. Despite our wide sampling, ∼20% of Ensembl reference loci were not detected. This suggests that deeper sequencing and additional samples that include different breeds, cell types, developmental stages, and physiological conditions, are needed to fully annotate the chicken genome. The application of Nanopore sequencing in this study demonstrates the usefulness of long-read data in discovering additional novel loci (e.g., lncRNA loci) and resolving complex transcripts (e.g., the longest transcript for the TTN locus).
Collapse
Affiliation(s)
- Dailu Guan
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Michelle M. Halstead
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Alma D. Islas-Trejo
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Daniel E. Goszczynski
- Department of Animal Science, University of California Davis, Davis, CA, United States
| | - Hans H. Cheng
- USDA, ARS, USNPRC, Avian Disease and Oncology Laboratory, East Lansing, MI, United States
| | - Pablo J. Ross
- Department of Animal Science, University of California Davis, Davis, CA, United States
- *Correspondence: Pablo J. Ross, ; Huaijun Zhou,
| | - Huaijun Zhou
- Department of Animal Science, University of California Davis, Davis, CA, United States
- *Correspondence: Pablo J. Ross, ; Huaijun Zhou,
| |
Collapse
|
29
|
Zhang S, Mao M, Lv Y, Yang Y, He W, Song Y, Wang Y, Yang Y, Al Abo M, Freedman JA, Patierno SR, Wang Y, Wang Z. A widespread length-dependent splicing dysregulation in cancer. SCIENCE ADVANCES 2022; 8:eabn9232. [PMID: 35977015 PMCID: PMC9385142 DOI: 10.1126/sciadv.abn9232] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 07/01/2022] [Indexed: 06/15/2023]
Abstract
Dysregulation of alternative splicing is a key molecular hallmark of cancer. However, the common features and underlying mechanisms remain unclear. Here, we report an intriguing length-dependent splicing regulation in cancers. By systematically analyzing the transcriptome of thousands of cancer patients, we found that short exons are more likely to be mis-spliced and preferentially excluded in cancers. Compared to other exons, cancer-associated short exons (CASEs) are more conserved and likely to encode in-frame low-complexity peptides, with functional enrichment in GTPase regulators and cell adhesion. We developed a CASE-based panel as reliable cancer stratification markers and strong predictors for survival, which is clinically useful because the detection of short exon splicing is practical. Mechanistically, mis-splicing of CASEs is regulated by elevated transcription and alteration of certain RNA binding proteins in cancers. Our findings uncover a common feature of cancer-specific splicing dysregulation with important clinical implications in cancer diagnosis and therapies.
Collapse
Affiliation(s)
- Sirui Zhang
- CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Miaowei Mao
- CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yuesheng Lv
- Institute of Cancer Stem Cell, Dalian Medical University, Dalian 116044, China
| | - Yingqun Yang
- CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Shanghai Tech University, Shanghai 200031, China
| | - Weijing He
- Department of Colorectal Surgery, Fudan University Shanghai Cancer Center, Shanghai 200032, China
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Yongmei Song
- State Key Laboratory of Molecular Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100021, China
| | - Yongbo Wang
- Department of Cellular and Genetic Medicine, School of Basic Medical Sciences, Fudan University, Shanghai 200032, China
| | - Yun Yang
- CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Muthana Al Abo
- Duke Cancer Institute, Duke University School of Medicine, Durham, NC 27710, USA
| | - Jennifer A Freedman
- Duke Cancer Institute, Duke University School of Medicine, Durham, NC 27710, USA
- Division of Medical Oncology, Department of Medicine, Duke University Medical Center, Durham, NC 27710, USA
| | - Steven R Patierno
- Duke Cancer Institute, Duke University School of Medicine, Durham, NC 27710, USA
- Division of Medical Oncology, Department of Medicine, Duke University Medical Center, Durham, NC 27710, USA
| | - Yang Wang
- Institute of Cancer Stem Cell, Dalian Medical University, Dalian 116044, China
| | - Zefeng Wang
- CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| |
Collapse
|