1
|
Xu D, Tang L, Zhou J, Wang F, Cao H, Huang Y, Kapranov P. Evidence for widespread existence of functional novel and non-canonical human transcripts. BMC Biol 2023; 21:271. [PMID: 38001496 PMCID: PMC10675921 DOI: 10.1186/s12915-023-01753-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 10/31/2023] [Indexed: 11/26/2023] Open
Abstract
BACKGROUND Fraction of functional sequence in the human genome remains a key unresolved question in Biology and the subject of vigorous debate. While a plethora of studies have connected a significant fraction of human DNA to various biochemical processes, the classical definition of function requires evidence of effects on cellular or organismal fitness that such studies do not provide. Although multiple high-throughput reverse genetics screens have been developed to address this issue, they are limited to annotated genomic elements and suffer from non-specific effects, arguing for a strong need to develop additional functional genomics approaches. RESULTS In this work, we established a high-throughput lentivirus-based insertional mutagenesis strategy as a forward genetics screen tool in aneuploid cells. Application of this approach to human cell lines in multiple phenotypic screens suggested the presence of many yet uncharacterized functional elements in the human genome, represented at least in part by novel exons of known and novel genes. The novel transcripts containing these exons can be massively, up to thousands-fold, induced by specific stresses, and at least some can represent bi-cistronic protein-coding mRNAs. CONCLUSIONS Altogether, these results argue that many unannotated and non-canonical human transcripts, including those that appear as aberrant splice products, have biological relevance under specific biological conditions.
Collapse
Affiliation(s)
- Dongyang Xu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Lu Tang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Junjun Zhou
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Fang Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Huifen Cao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Yu Huang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, 361102, China.
| |
Collapse
|
2
|
Alternative splicing of ACE2 possibly generates variants that may limit the entry of SARS-CoV-2: a potential therapeutic approach using SSOs. Clin Sci (Lond) 2020; 134:1143-1150. [PMID: 32442315 DOI: 10.1042/cs20200419] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 05/04/2020] [Accepted: 05/15/2020] [Indexed: 02/06/2023]
Abstract
Angiotensin-converting enzyme 2 (ACE2) plays an essential role in maintaining the balance of the renin-angiotensin system and also serves as a receptor for the SARS-CoV-2, SARS-CoV, and HCoV-NL63. Following the recent outbreak of SARS-CoV-2 infection, there has been an urgent need to develop therapeutic interventions. ACE2 is a potential target for many treatment approaches for the SARS-CoV-2. With the help of bioinformatics, we have predicted several novel exons of the human ACE2 gene. The inclusion of novel exons located in the 5'UTR/intronic region in the mature transcript may remove the critical ACE2 residues responsible for the interaction with the receptor-binding domain (RBD) of SARS-CoV-2, thus preventing their binding and entry into the cell. Additionally, inclusion of a novel predicted exons located in the 3'UTR by alternative splicing may remove the C-terminal transmembrane domain of ACE2 and generate soluble ACE2 isoforms. Splice-switching antisense oligonucleotides (SSOs) have been employed effectively as a therapeutic strategy in several disease conditions. Alternative splicing of the ACE2 gene could similarly be modulated using SSOs to exclude critical domains required for the entry of SARS-CoV-2. Strategies can also be designed to deliver these SSOs directly to the lungs in order to minimize the damage caused by SARS-CoV-2 pathogenesis.
Collapse
|
3
|
Abstract
Alternative splicing of precursor mRNA is a key mediator of gene expression regulation leading to greater diversity of the proteome in complex organisms. Systematic sequencing of the human genome and transcriptome has led to our understanding of how alternative splicing of critical genes leads to multiple pathological conditions such as cancer. For many years, proteases were known only for their roles as proteolytic enzymes, acting to regulate/process proteins associated with diverse cellular functions. However, the differential expression and altered function of various protease isoforms, such as (i) anti-apoptotic activities, (ii) mediating intercellular adhesion, and (iii) modifying the extracellular matrix, are evidence of their specific contribution towards shaping the tumor microenvironment. Revealing the alternative splicing of protease genes and characterization of their protein products/isoforms with distinct and opposing functions creates a platform to understand how protease isoforms contribute to specific cancer hallmarks. Here, in this review, we address cancer-specific isoforms produced by the alternative splicing of proteases and their distinctive roles in the tumor microenvironment.
Collapse
Affiliation(s)
- Chamikara Liyanage
- School of Biomedical Sciences, Faculty of Health, Institute of Health and Biomedical Innovation, Queensland University of Technology, Brisbane, Australia
- Australian Prostate Cancer Research Centre-Queensland (APCRC-Q), Translational Research Institute, Queensland University of Technology, Brisbane, Australia
| | - Achala Fernando
- School of Biomedical Sciences, Faculty of Health, Institute of Health and Biomedical Innovation, Queensland University of Technology, Brisbane, Australia
- Australian Prostate Cancer Research Centre-Queensland (APCRC-Q), Translational Research Institute, Queensland University of Technology, Brisbane, Australia
| | - Jyotsna Batra
- School of Biomedical Sciences, Faculty of Health, Institute of Health and Biomedical Innovation, Queensland University of Technology, Brisbane, Australia.
- Australian Prostate Cancer Research Centre-Queensland (APCRC-Q), Translational Research Institute, Queensland University of Technology, Brisbane, Australia.
| |
Collapse
|
4
|
Rehman SU, Ashraf S, Ahamad S, Sarwar T, Husain MA, Ahmad P, Tabish M, Jairajpuri MA. Identification of a novel alternatively spliced isoform of antithrombin containing an additional RCL-like loop. Biochem Biophys Res Commun 2019; 517:421-426. [PMID: 31378371 DOI: 10.1016/j.bbrc.2019.07.113] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 07/28/2019] [Indexed: 11/22/2022]
Abstract
Antithrombin (AT3) is one of the most important inhibitors of blood coagulation proteases that belong to the serpin family of protease inhibitors. In this study, a novel alternatively spliced isoform of AT3 was identified, both at transcript and protein level. This novel transcript contains an additional region in the continuation of exon 3b that was included in the transcript due to use of an alternate 5' splice site. The existence of the novel transcript was confirmed in human brain and liver through RT-PCR. An analysis of the complete transcript indicated that the native reactive centre loop (RCL) of AT3 is maintained; however the novel amino acid sequence projects out as an additional loop as evident from MD simulation studies. A unique amino acid sequence present in the novel isoform was used for the development of polyclonal antibody. The expression of novel isoform was confirmed in human brain and liver tissue using Western blot analysis. Interestingly an alignment of RCL like domain with other inhibitory serpins showed significant similarity with the neuroserpin RCL. To the best of our knowledge, this is the first evidence of alternatively spliced AT3 sequence containing an additional loop and could have physiological relevance.
Collapse
Affiliation(s)
- Sayeed Ur Rehman
- Department of Biosciences, Jamia Millia Islamia, New Delhi, India; Department of Biochemistry, SCLS, Jamia Hamdard, New Delhi, India
| | - Shazia Ashraf
- Department of Biosciences, Jamia Millia Islamia, New Delhi, India; Department of Medicine, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Shahzaib Ahamad
- Department of Biotechnology, School of Engineering & Technology, IFTM University, Lodhipur-Rajput, Delhi Road, Moradabad, India
| | - Tarique Sarwar
- Department of Biosciences, Jamia Millia Islamia, New Delhi, India
| | | | - Parvez Ahmad
- Department of Biosciences, Jamia Millia Islamia, New Delhi, India
| | - Mohammad Tabish
- Department of Biochemistry, Faculty of Life Sciences, A.M.U, Aligarh, U.P, India
| | | |
Collapse
|
5
|
Kwon M, Yim S, Kim G, Lee S, Jeong C, Lee D. CODA-ML: context-specific biological knowledge representation for systemic physiology analysis. BMC Bioinformatics 2019; 20:248. [PMID: 31138123 PMCID: PMC6538558 DOI: 10.1186/s12859-019-2812-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Computational analysis of complex diseases involving multiple organs requires the integration of multiple different models into a unified model. Different models are often constructed in heterogeneous formats. Thus, the integration of the models requires a standard language format that can effectively represent essential biological information. However, the previously introduced formats have limitations that prevent from adequately representing essential biological information, particularly specifications of bio-molecules and biological contexts. Results We defined an XML-based markup language called context-oriented directed association markup language (CODA-ML), which better represents essential biological information. The CODA-ML has two major strengths in designating molecular specifications and biological contexts. It can cover heterogeneous entity types involved in biological events (e.g. gene/protein, compound, cellular function, disease). Molecular types of entities can have molecular specifications which include detailed information of a molecule from isoforms to modifications, enabling high-resolution representation of molecules. In addition, it can distinguish biological events that vary depending on different biological contexts such as cell types or disease conditions. Especially representation of inter-cellular events as well as intra-cellular events is available. These two major strengths can resolve contradictory associations when different models are integrated into one unified model, which improves the accuracy of the model. Conclusions With the CODA-ML, diverse models such as signaling pathways, metabolic pathways, and gene regulatory pathways can be represented in a unified language format. Heterogeneous entity types can be covered by the CODA-ML, thus it enables detailed description for the mechanisms of diseases or drugs from multiple perspectives (e.g., molecule, function or disease). The CODA-ML is expected to help integrate different models into one systemic model in an efficient and effective. The unified model can be used to perform computational analysis not only for cancer but also for other complex diseases involving multiple organs beyond a single cell. Electronic supplementary material The online version of this article (10.1186/s12859-019-2812-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mijin Kwon
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Soorin Yim
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Gwangmin Kim
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Saehwan Lee
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Chungsun Jeong
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea
| | - Doheon Lee
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea. .,Bio-Synergy Research Center, 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.
| |
Collapse
|
6
|
Identification and expression of alternatively spliced novel isoforms of cancer associated MYD88 lacking death domain in mouse. Mol Biol Rep 2018; 45:699-711. [DOI: 10.1007/s11033-018-4209-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Accepted: 06/04/2018] [Indexed: 12/13/2022]
|
7
|
Differentially expressed novel alternatively spliced transcript variant of tumor suppressor Stk11 gene in mouse. Gene 2018; 668:146-154. [PMID: 29777910 DOI: 10.1016/j.gene.2018.05.053] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Revised: 04/26/2018] [Accepted: 05/15/2018] [Indexed: 11/22/2022]
Abstract
Serine/threonine kinase 11 (STK11) is a protein kinase that is encoded by Stk11 gene located on chromosome 19 and 10 in humans and mouse respectively. It acts as a master kinase of adenine monophosphate-activated protein kinase (AMPK) pathway that coordinates the regulation of cellular energy metabolism and cell division. STK11 exerts effect by activating more than 14 kinases including AMPK and AMPK-related kinases. It is also known to regulate cell polarity and acts as tumor suppressor. Alternative splicing of pre-mRNA is a mechanism which results in multiple transcript variants of a single gene. In human, two STK11 isoforms have been reported, an alternatively spliced isoform which has variation at its C-terminal and mostly expressed in testis (LKB1S). Another isoform exhibiting oncogenic properties lacks few residues at its N-terminal (ΔN-LKB1). In the present study, we report the identification of a new transcript variant Stk11N which is generated through alternative splicing. The new variant was found to have differential and tissue specific expression at Postnatal-7 and adult stages of mouse. As compared to the known variant Stk11C, the conceptually translated amino acid sequences of the new variant differ from exon-E2 onwards. In silico post translational studies of the new and published variant show similarity in some of the properties while differ in properties like nuclear export signals, phosphorylation, glycosylation, etc. Thus, alternative splicing of Stk11 gene generating new variant with heterogeneous properties suggests for complex regulation of these variants in controlling the AMPK pathway and other functions.
Collapse
|
8
|
Rehman SU, Sarwar T, Husain MA, Ishqi HM, Tabish M. Identification of two novel isoforms of mouse NUR77 lacking N-terminal domains. IUBMB Life 2017; 69:106-114. [PMID: 28111880 DOI: 10.1002/iub.1605] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Accepted: 12/18/2016] [Indexed: 12/21/2022]
Abstract
Nur77 is a member of nuclear receptor superfamily that acts as a transcription factor and regulates expression of multiple genes. Subcellular localization of Nur77 protein plays an important role in the survival and cell death. In this study, we have predicted and confirmed alternatively spliced two new transcripts of Nur77 gene in mouse. The newly identified transcripts have their alternatively spliced first exon located upstream of published 5'-UTR of the gene. Transcription factor binding sites in the possible promoter regions of these transcripts were also analyzed. Expression of novel transcript variants was found to be significantly lower than the already published transcript. New transcript variants encode for NUR77 protein isoforms which are significantly smaller in size due to lack of transactivation domain and a part of DNA binding domain. Western blot analysis using NUR77 specific antibody confirmed the existence of these smaller variants in mouse. Localization of these new isoforms was predicted to be majorly outside the nucleus. In silico analysis of the conceptually translated proteins was performed using different bioinformatics tools. The results obtained in this study offer further insight into novel area of research on extensively studied Nur77. © 2017 IUBMB Life, 69(2):106-114, 2017.
Collapse
Affiliation(s)
- Sayeed Ur Rehman
- Department of Biochemistry, Faculty of Life Sciences, A.M. University, Aligarh, Uttar Pradesh, India.,Department of Biosciences, Jamia Millia Islamia, New Delhi, Delhi, India
| | - Tarique Sarwar
- Department of Biochemistry, Faculty of Life Sciences, A.M. University, Aligarh, Uttar Pradesh, India.,Department of Biosciences, Jamia Millia Islamia, New Delhi, Delhi, India
| | - Mohammed Amir Husain
- Department of Biochemistry, Faculty of Life Sciences, A.M. University, Aligarh, Uttar Pradesh, India.,Department of Biosciences, Jamia Millia Islamia, New Delhi, Delhi, India
| | - Hassan Mubarak Ishqi
- Department of Biochemistry, Faculty of Life Sciences, A.M. University, Aligarh, Uttar Pradesh, India
| | - Mohammad Tabish
- Department of Biochemistry, Faculty of Life Sciences, A.M. University, Aligarh, Uttar Pradesh, India
| |
Collapse
|
9
|
A novel exon generates ubiquitously expressed alternatively spliced new transcript of mouse Abcc4 gene. Gene 2016; 594:131-137. [PMID: 27613143 DOI: 10.1016/j.gene.2016.08.058] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2016] [Revised: 07/30/2016] [Accepted: 08/31/2016] [Indexed: 11/20/2022]
Abstract
Abcc4 gene codes for a protein (ABCC4) involved in the transportation of different classes of drugs outside the cells. Various important drugs transported by ABCC4 include antiviral and anticancer drugs as well as endogenous molecules such as bile acids, cyclic nucleotides, folates, prostaglandins and steroids. Alternative splicing generates multiple mRNAs that encode protein isoforms having diverse functions. In this study, we have identified a novel transcript of mouse Abcc4 gene using a combination of bioinformatics and molecular biology techniques. This transcript was found to be different from the reported transcript in having a different first exon that was found to be located on previously identified first intron. Newly identified transcript was found to be expressed across different tissues we studied and in different developmental stages. Expression level of novel and reported transcripts was studied using quantitative real-time PCR. After conceptually translating the novel transcript, various post-translational modifications were studied. Translation efficiency and predicted half life of encoded protein isoforms were analysed in silico. Molecular modelling was performed to compare the structural differences in both isoforms. The diversity at N-termini in these protein isoforms explains the diverse function of ABCC4 in mouse.
Collapse
|
10
|
Ishqi HM, Ur Rehman S, Sarwar T, Husain MA, Tabish M. Identification of differentially expressed three novel transcript variants of mouse ARNT gene. IUBMB Life 2015; 68:122-35. [DOI: 10.1002/iub.1464] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 11/25/2015] [Indexed: 11/09/2022]
Affiliation(s)
- Hassan Mubarak Ishqi
- Department of Biochemistry, Faculty of Life Sciences; A.M. University; Aligarh Uttar Pradesh India
| | - Sayeed Ur Rehman
- Department of Biochemistry, Faculty of Life Sciences; A.M. University; Aligarh Uttar Pradesh India
| | - Tarique Sarwar
- Department of Biochemistry, Faculty of Life Sciences; A.M. University; Aligarh Uttar Pradesh India
| | - Mohammed Amir Husain
- Department of Biochemistry, Faculty of Life Sciences; A.M. University; Aligarh Uttar Pradesh India
| | - Mohammad Tabish
- Department of Biochemistry, Faculty of Life Sciences; A.M. University; Aligarh Uttar Pradesh India
| |
Collapse
|