1
|
Surana P, Dutta P, Davuluri RV. TransTEx: novel tissue-specificity scoring method for grouping human transcriptome into different expression groups. Bioinformatics 2024; 40:btae475. [PMID: 39120880 PMCID: PMC11319638 DOI: 10.1093/bioinformatics/btae475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 06/12/2024] [Accepted: 08/08/2024] [Indexed: 08/10/2024] Open
Abstract
MOTIVATION Although human tissues carry out common molecular processes, gene expression patterns can distinguish different tissues. Traditional informatics methods, primarily at the gene level, overlook the complexity of alternative transcript variants and protein isoforms produced by most genes, changes in which are linked to disease prognosis and drug resistance. RESULTS We developed TransTEx (Transcript-level Tissue Expression), a novel tissue-specificity scoring method, for grouping transcripts into four expression groups. TransTEx applies sequential cut-offs to tissue-wise transcript probability estimates, subsampling-based P-values and fold-change estimates. Application of TransTEx on GTEx mRNA-seq data divided 199 166 human transcripts into different groups as 17 999 tissue-specific (TSp), 7436 tissue-enhanced, 36 783 widely expressed (Wide), 79 191 lowly expressed (Low), and 57 757 no expression (Null) transcripts. Testis has the most (13 466) TSp isoforms followed by liver (890), brain (701), pituitary (435), and muscle (420). We found that the tissue specificity of alternative transcripts of a gene is predominantly influenced by alternate promoter usage. By overlapping brain-specific transcripts with the cell-type gene-markers in scBrainMap database, we found that 63% of the brain-specific transcripts were enriched in nonneuronal cell types, predominantly astrocytes followed by endothelial cells and oligodendrocytes. In addition, we found 61 brain cell-type marker genes encoding a total of 176 alternative transcripts as brain-specific and 22 alternative transcripts as testis-specific, highlighting the complex TSp and cell-type specific gene regulation and expression at isoform-level. TransTEx can be adopted to the analysis of bulk RNA-seq or scRNA-seq datasets to find tissue- and/or cell-type specific isoform-level gene markers. AVAILABILITY AND IMPLEMENTATION TransTEx database: https://bmi.cewit.stonybrook.edu/transtexdb/ and the R package is available via GitHub: https://github.com/pallavisurana1/TransTEx.
Collapse
Affiliation(s)
- Pallavi Surana
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Pratik Dutta
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Ramana V Davuluri
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY 11794, USA
| |
Collapse
|
2
|
Chen Z, Shi Q, Zhao Y, Xu M, Liu Y, Li X, Liu L, Sun M, Wu X, Shao Z, Xu Y, Wang L, He X. Long-read transcriptome landscapes of primary and metastatic liver cancers at transcript resolution. Biomark Res 2024; 12:4. [PMID: 38185659 PMCID: PMC10773130 DOI: 10.1186/s40364-023-00554-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 12/29/2023] [Indexed: 01/09/2024] Open
Abstract
BACKGROUND The liver ranks as the sixth most prevalent site of primary cancer in humans, and it frequently experiences metastases from cancers originating in other organs. To facilitate the development of effective treatments and improve survival rates, it is crucial to comprehend the intricate and diverse transcriptome landscape of primary and metastatic liver cancers. METHODS We conducted long-read isoform sequencing and short-read RNA sequencing using a cohort of 95 patients with primary and secondary liver cancer who underwent hepatic resection. We compared the transcriptome landscapes of primary and metastatic liver cancers and systematically investigated hepatocellular carcinoma (HCC), paired primary tumours and liver metastases, and matched nontumour liver tissues. RESULTS We elucidated the full-length isoform-level transcriptome of primary and metastatic liver cancers in humans. Our analysis revealed isoform-level diversity in HCC and identified transcriptome variations associated with liver metastatis. Specific RNA transcripts and isoform switching events with clinical implications were profound in liver cancer. Moreover, we defined metastasis-specific transcripts that may serve as predictors of risk of metastasis. Additionally, we observed abnormalities in adjacent paracancerous liver tissues and characterized the immunological and metabolic alterations occurring in the liver. CONCLUSIONS Our findings underscore the power of full-length transcriptome profiling in providing novel biological insights into the molecular mechanisms underlying tumourigenesis. These insights will further contribute to improving treatment strategies for primary and metastatic liver cancers.
Collapse
Affiliation(s)
- Zhiao Chen
- Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University Shanghai Cancer Center, Fudan University, 302 Rm., 7# Bldg., 270 Dong An Road, 200032, Shanghai, China
- Key Laboratory of Breast Cancer in Shanghai, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China
- Shanghai Key Laboratory of Radiation Oncology, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China
| | - Qili Shi
- Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University Shanghai Cancer Center, Fudan University, 302 Rm., 7# Bldg., 270 Dong An Road, 200032, Shanghai, China
| | - Yiming Zhao
- Department of Hepatic Surgery, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China
| | - Midie Xu
- Department of Pathology, biobank, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Yizhe Liu
- Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University Shanghai Cancer Center, Fudan University, 302 Rm., 7# Bldg., 270 Dong An Road, 200032, Shanghai, China
| | - Xinrong Li
- Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University Shanghai Cancer Center, Fudan University, 302 Rm., 7# Bldg., 270 Dong An Road, 200032, Shanghai, China
| | - Li Liu
- Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University Shanghai Cancer Center, Fudan University, 302 Rm., 7# Bldg., 270 Dong An Road, 200032, Shanghai, China
| | - Menghong Sun
- Department of Pathology, biobank, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Xiaohua Wu
- Department of Gynecologic Oncology, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China
| | - Zhimin Shao
- Key Laboratory of Breast Cancer in Shanghai, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China
- Department of Breast Surgery, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China
| | - Ye Xu
- Department of Colorectal Surgery, Fudan University Shanghai Cancer Center, 200032, Shanghai, China.
| | - Lu Wang
- Department of Hepatic Surgery, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China.
| | - Xianghuo He
- Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University Shanghai Cancer Center, Fudan University, 302 Rm., 7# Bldg., 270 Dong An Road, 200032, Shanghai, China.
- Key Laboratory of Breast Cancer in Shanghai, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China.
- Shanghai Key Laboratory of Radiation Oncology, Fudan University Shanghai Cancer Center, Fudan University, 200032, Shanghai, China.
| |
Collapse
|
3
|
Shi Q, Li X, Liu Y, Chen Z, He X. FLIBase: a comprehensive repository of full-length isoforms across human cancers and tissues. Nucleic Acids Res 2024; 52:D124-D133. [PMID: 37697439 PMCID: PMC10767943 DOI: 10.1093/nar/gkad745] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 08/14/2023] [Accepted: 08/31/2023] [Indexed: 09/13/2023] Open
Abstract
Regulatory processes at the RNA transcript level play a crucial role in generating transcriptome diversity and proteome composition in human cells, impacting both physiological and pathological states. This study introduces FLIBase (www.FLIBase.org), a specialized database that focuses on annotating full-length isoforms using long-read sequencing techniques. We collected and integrated long-read (351 samples) and short-read (12 469 samples) RNA sequencing data from diverse normal and cancerous human tissues and cells. The current version of FLIBase comprises a total of 983 789 full-length spliced isoforms, identified through long-read sequences and verified using short-read exon-exon splice junctions. Of these, 188 248 isoforms have been annotated, while 795 541 isoforms remain unannotated. By overcoming the limitations of short-read RNA sequencing methods, FLIBase provides an accurate and comprehensive representation of full-length transcripts. These comprehensive annotations empower researchers to undertake various downstream analyses and investigations. Importantly, FLIBase exhibits a significant advantage in identifying a substantial number of previously unannotated isoforms and tumor-specific RNA transcripts. These tumor-specific RNA transcripts have the potential to serve as a source of immunogenic recurrent neoantigens. This remarkable discovery holds tremendous promise for advancing the development of tailored RNA-based diagnostic and therapeutic strategies for various types of human cancer.
Collapse
Affiliation(s)
- Qili Shi
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Xinrong Li
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Yizhe Liu
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Zhiao Chen
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
- Key Laboratory of Breast Cancer in Shanghai, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
- Shanghai Key Laboratory of Radiation Oncology, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
| | - Xianghuo He
- Fudan University Shanghai Cancer Center and Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai 200032, China
- Key Laboratory of Breast Cancer in Shanghai, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
- Shanghai Key Laboratory of Radiation Oncology, Fudan University Shanghai Cancer Center, Fudan University, Shanghai 200032, China
| |
Collapse
|
4
|
Wu H, Lu Y, Duan Z, Wu J, Lin M, Wu Y, Han S, Li T, Fan Y, Hu X, Xiao H, Feng J, Lu Z, Kong D, Li S. Nanopore long-read RNA sequencing reveals functional alternative splicing variants in human vascular smooth muscle cells. Commun Biol 2023; 6:1104. [PMID: 37907652 PMCID: PMC10618188 DOI: 10.1038/s42003-023-05481-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 10/18/2023] [Indexed: 11/02/2023] Open
Abstract
Vascular smooth muscle cells (VSMCs) are the major contributor to vascular repair and remodeling, which showed high level of phenotypic plasticity. Abnormalities in VSMC plasticity can lead to multiple cardiovascular diseases, wherein alternative splicing plays important roles. However, alternative splicing variants in VSMC plasticity are not fully understood. Here we systematically characterized the long-read transcriptome and their dysregulation in human aortic smooth muscle cells (HASMCs) by employing the Oxford Nanopore Technologies long-read RNA sequencing in HASMCs that are separately treated with platelet-derived growth factor, transforming growth factor, and hsa-miR-221-3P transfection. Our analysis reveals frequent alternative splicing events and thousands of unannotated transcripts generated from alternative splicing. HASMCs treated with different factors exhibit distinct transcriptional reprogramming modulated by alternative splicing. We also found that unannotated transcripts produce different open reading frames compared to the annotated transcripts. Finally, we experimentally validated the unannotated transcript derived from gene CISD1, namely CISD1-u, which plays a role in the phenotypic switch of HASMCs. Our study characterizes the phenotypic modulation of HASMCs from an insight of long-read transcriptome, which would promote the understanding and the manipulation of HASMC plasticity in cardiovascular diseases.
Collapse
Affiliation(s)
- Hao Wu
- Department of Cardiovascular Surgery, Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yicheng Lu
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhenzhen Duan
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Jingni Wu
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Minghui Lin
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yangjun Wu
- Department of Gynecological Oncology, Fudan University Shanghai Cancer Center, Shanghai, China
| | - Siyang Han
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Tongqi Li
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yuqi Fan
- North Cross School Shanghai, Shanghai, China
| | - Xiaoyuan Hu
- H. Milton Stewart School of Industrial and Systems Engineering, College of Engineering, Geogia Institute of Technology, Atlanta, GA, USA
| | - Hongyan Xiao
- Department of Cardiac Surgery, Wuhan Asia Heart Hospital, Wuhan University of Science and Technology, Wuhan, China
| | - Jiaxuan Feng
- Department of Vascular Surgery and Intervention Center, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhiqian Lu
- Department of Cardiovascular Surgery, Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| | - Deping Kong
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| | - Shengli Li
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| |
Collapse
|
5
|
Long-read transcriptome sequencing reveals allele-specific variants at high resolution. Trends Genet 2023; 39:31-33. [PMID: 36207147 DOI: 10.1016/j.tig.2022.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Revised: 09/11/2022] [Accepted: 09/21/2022] [Indexed: 11/05/2022]
Abstract
Disturbance in the regulation of transcript structure plays a crucial role in human disease. In a recent study, Glinos et al. characterized allele-specific transcript alterations in long-read RNA sequencing (RNA-seq) data derived from multiple human tissues and provide a high-resolution view of how disease-associated genetic variants affect transcript structure.
Collapse
|