1
|
Rohlfes N, Radhakrishnan R, Singh PK, Bedwell GJ, Engelman AN, Dharan A, Campbell EM. The nuclear localization signal of CPSF6 governs post-nuclear import steps of HIV-1 infection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.20.599834. [PMID: 38979149 PMCID: PMC11230232 DOI: 10.1101/2024.06.20.599834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
The early stages of HIV-1 infection include the trafficking of the viral core into the nucleus of infected cells. However, much remains to be understood about how HIV-1 accomplishes nuclear import and the consequences of the import pathways utilized on nuclear events. The host factor cleavage and polyadenylation specificity factor 6 (CPSF6) assists HIV-1 nuclear localization and post-entry integration targeting. Here, we used a CPSF6 truncation mutant lacking a functional nuclear localization signal (NLS), CPSF6-358, and appended heterologous NLSs to rescue nuclear localization. We show that some, but not all, NLSs drive CPSF6-358 into the nucleus. Interestingly, we found that some nuclear localized CPSF6-NLS chimeras supported inefficient HIV-1 infection. We found that HIV-1 still enters the nucleus in these cell lines but fails to traffic to speckle-associated domains (SPADs). Additionally, we show that HIV-1 fails to efficiently integrate in these cell lines. Collectively, our results demonstrate that the NLS of CPSF6 facilitates steps of HIV-1 infection subsequent to nuclear import and additionally identify the ability of canonical NLS sequences to influence cargo localization in the nucleus following nuclear import. Author Summary During HIV-1 infection, the viral capsid, which encloses the viral genome and accessory proteins required for reverse transcription (RT) and integration, traffics towards the nucleus and enters through the nuclear pore complex (NPC). Following entry into the nucleus, RT is completed and viral capsid disassembles releasing the preintegration complex (PIC) to integrate with the host chromosome. In this study, we investigated the early HIV-1 host factor CPSF6, and specifically focused on the C-terminal short amino acid nuclear localization signal (NLS) in CPSF6, in mediating viral nuclear entry and subsequent gene expression. Altering the NLS in CPSF6 with NLS from other proteins, significantly impacted HIV-1's ability to infect those cells. We further showed this defect in infection occurred at the level of viral integration. This study highlights the importance of the NLS in CPSF6 in dictating the NPC it associates with and its effect on HIV-1 infection. Moreover, our study emphasizes the function of NLS in targeting host cargos to different nuclear entry pathways.
Collapse
|
2
|
Jang S, Engelman AN. Capsid-host interactions for HIV-1 ingress. Microbiol Mol Biol Rev 2023; 87:e0004822. [PMID: 37750702 PMCID: PMC10732038 DOI: 10.1128/mmbr.00048-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023] Open
Abstract
The HIV-1 capsid, composed of approximately 1,200 copies of the capsid protein, encases genomic RNA alongside viral nucleocapsid, reverse transcriptase, and integrase proteins. After cell entry, the capsid interacts with a myriad of host factors to traverse the cell cytoplasm, pass through the nuclear pore complex (NPC), and then traffic to chromosomal sites for viral DNA integration. Integration may very well require the dissolution of the capsid, but where and when this uncoating event occurs remains hotly debated. Based on size constraints, a long-prevailing view was that uncoating preceded nuclear transport, but recent research has indicated that the capsid may remain largely intact during nuclear import, with perhaps some structural remodeling required for NPC traversal. Completion of reverse transcription in the nucleus may further aid capsid uncoating. One canonical type of host factor, typified by CPSF6, leverages a Phe-Gly (FG) motif to bind capsid. Recent research has shown these peptides reside amid prion-like domains (PrLDs), which are stretches of protein sequence devoid of charged residues. Intermolecular PrLD interactions along the exterior of the capsid shell impart avid host factor binding for productive HIV-1 infection. Herein we overview capsid-host interactions implicated in HIV-1 ingress and discuss important research questions moving forward. Highlighting clinical relevance, the long-acting ultrapotent inhibitor lenacapavir, which engages the same capsid binding pocket as FG host factors, was recently approved to treat people living with HIV.
Collapse
Affiliation(s)
- Sooin Jang
- Department of Cancer Immunology and Virology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
| | - Alan N. Engelman
- Department of Cancer Immunology and Virology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
3
|
Joudaki A, Takeda JI, Masuda A, Ode R, Fujiwara K, Ohno K. FexSplice: A LightGBM-Based Model for Predicting the Splicing Effect of a Single Nucleotide Variant Affecting the First Nucleotide G of an Exon. Genes (Basel) 2023; 14:1765. [PMID: 37761905 PMCID: PMC10531444 DOI: 10.3390/genes14091765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 08/30/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open
Abstract
Single nucleotide variants (SNVs) affecting the first nucleotide G of an exon (Fex-SNVs) identified in various diseases are mostly recognized as missense or nonsense variants. Their effect on pre-mRNA splicing has been seldom analyzed, and no curated database is available. We previously reported that Fex-SNVs affect splicing when the length of the polypyrimidine tract is short or degenerate. However, we cannot readily predict the splicing effects of Fex-SNVs. We here scrutinized the available literature and identified 106 splicing-affecting Fex-SNVs based on experimental evidence. We similarly identified 106 neutral Fex-SNVs in the dbSNP database with a global minor allele frequency (MAF) of more than 0.01 and less than 0.50. We extracted 115 features representing the strength of splicing cis-elements and developed machine-learning models with support vector machine, random forest, and gradient boosting to discriminate splicing-affecting and neutral Fex-SNVs. Gradient boosting-based LightGBM outperformed the other two models, and the length and nucleotide compositions of the polypyrimidine tract played critical roles in the discrimination. Recursive feature elimination showed that the LightGBM model using 15 features achieved the best performance with an accuracy of 0.80 ± 0.12 (mean and SD), a Matthews Correlation Coefficient (MCC) of 0.57 ± 0.15, an area under the curve of the receiver operating characteristics curve (AUROC) of 0.86 ± 0.08, and an area under the curve of the precision-recall curve (AUPRC) of 0.87 ± 0.09 using a 10-fold cross-validation. We developed a web service program, named FexSplice that accepts a genomic coordinate either on GRCh37/hg19 or GRCh38/hg38 and returns a predicted probability of aberrant splicing of A, C, and T variants.
Collapse
Affiliation(s)
- Atefeh Joudaki
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan; (A.J.); (J.-i.T.); (A.M.)
| | - Jun-ichi Takeda
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan; (A.J.); (J.-i.T.); (A.M.)
| | - Akio Masuda
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan; (A.J.); (J.-i.T.); (A.M.)
| | - Rikumo Ode
- Department of Materials Science and Engineering, Nagoya University Graduate School of Engineering, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan; (R.O.); (K.F.)
| | - Koichi Fujiwara
- Department of Materials Science and Engineering, Nagoya University Graduate School of Engineering, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan; (R.O.); (K.F.)
| | - Kinji Ohno
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, 65 Tsurumai, Showa-ku, Nagoya 466-8550, Japan; (A.J.); (J.-i.T.); (A.M.)
| |
Collapse
|
4
|
Guo G, Wang X, Zhang Y, Li T. Sequence variations of phase-separating proteins and resources for studying biomolecular condensates. Acta Biochim Biophys Sin (Shanghai) 2023; 55:1119-1132. [PMID: 37464880 PMCID: PMC10423696 DOI: 10.3724/abbs.2023131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 06/06/2023] [Indexed: 07/20/2023] Open
Abstract
Phase separation (PS) is an important mechanism underlying the formation of biomolecular condensates. Physiological condensates are associated with numerous biological processes, such as transcription, immunity, signaling, and synaptic transmission. Changes in particular amino acids or segments can disturb the protein's phase behavior and interactions with other biomolecules in condensates. It is thus presumed that variations in the phase-separating-prone domains can significantly impact the properties and functions of condensates. The dysfunction of condensates contributes to a number of pathological processes. Pharmacological perturbation of these condensates is proposed as a promising way to restore physiological states. In this review, we characterize the variations observed in PS proteins that lead to aberrant biomolecular compartmentalization. We also showcase recent advancements in bioinformatics of membraneless organelles (MLOs), focusing on available databases useful for screening PS proteins and describing endogenous condensates, guiding researchers to seek the underlying pathogenic mechanisms of biomolecular condensates.
Collapse
Affiliation(s)
- Gaigai Guo
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Xinxin Wang
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Yi Zhang
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Tingting Li
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
- Key Laboratory for NeuroscienceMinistry of Education/National Health Commission of ChinaPeking UniversityBeijing100191China
| |
Collapse
|
5
|
Fukuchi S, Noguchi T, Anbo H, Homma K. Exon Elongation Added Intrinsically Disordered Regions to the Encoded Proteins and Facilitated the Emergence of the Last Eukaryotic Common Ancestor. Mol Biol Evol 2022; 40:6931801. [PMID: 36529689 PMCID: PMC9825244 DOI: 10.1093/molbev/msac272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 11/06/2022] [Accepted: 12/13/2022] [Indexed: 12/23/2022] Open
Abstract
Most prokaryotic proteins consist of a single structural domain (SD) with little intrinsically disordered regions (IDRs) that by themselves do not adopt stable structures, whereas the typical eukaryotic protein comprises multiple SDs and IDRs. How eukaryotic proteins evolved to differ from prokaryotic proteins has not been fully elucidated. Here, we found that the longer the internal exons are, the more frequently they encode IDRs in eight eukaryotes including vertebrates, invertebrates, a fungus, and plants. Based on this observation, we propose the "small bang" model from the proteomic viewpoint: the protoeukaryotic genes had no introns and mostly encoded one SD each, but a majority of them were subsequently divided into multiple exons (step 1). Many exons unconstrained by SDs elongated to encode IDRs (step 2). The elongated exons encoding IDRs frequently facilitated the acquisition of multiple SDs to make the last common ancestor of eukaryotes (step 3). One prediction of the model is that long internal exons are mostly unconstrained exons. Analytical results of the eight eukaryotes are consistent with this prediction. In support of the model, we identified cases of internal exons that elongated after the rat-mouse divergence and discovered that the expanded sections are mostly in unconstrained exons and preferentially encode IDRs. The model also predicts that SDs followed by long internal exons tend to have other SDs downstream. This prediction was also verified in all the eukaryotic species analyzed. Our model accounts for the dichotomy between prokaryotic and eukaryotic proteins and proposes a selective advantage conferred by IDRs.
Collapse
Affiliation(s)
- Satoshi Fukuchi
- Program for Information Systems, Division of Informatics, Bioengineering and Bioscience, Maebashi Institute of Technology, Maebashi-shi, Japan
| | - Tamotsu Noguchi
- Pharmaceutical Education Research Center, Meiji Pharmaceutical University, Kiyose, Tokyo, Japan
| | - Hiroto Anbo
- Program for Information Systems, Division of Informatics, Bioengineering and Bioscience, Maebashi Institute of Technology, Maebashi-shi, Japan
| | | |
Collapse
|
6
|
Nsengimana B, Khan FA, Awan UA, Wang D, Fang N, Wei W, Zhang W, Ji S. Pseudogenes and Liquid Phase Separation in Epigenetic Expression. Front Oncol 2022; 12:912282. [PMID: 35875144 PMCID: PMC9305658 DOI: 10.3389/fonc.2022.912282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 06/13/2022] [Indexed: 11/24/2022] Open
Abstract
Pseudogenes have been considered as non-functional genes. However, peptides and long non-coding RNAs produced by pseudogenes are expressed in different tumors. Moreover, the dysregulation of pseudogenes is associated with cancer, and their expressions are higher in tumors compared to normal tissues. Recent studies show that pseudogenes can influence the liquid phase condensates formation. Liquid phase separation involves regulating different epigenetic stages, including transcription, chromatin organization, 3D DNA structure, splicing, and post-transcription modifications like m6A. Several membrane-less organelles, formed through the liquid phase separate, are also involved in the epigenetic regulation, and their defects are associated with cancer development. However, the association between pseudogenes and liquid phase separation remains unrevealed. The current study sought to investigate the relationship between pseudogenes and liquid phase separation in cancer development, as well as their therapeutic implications.
Collapse
Affiliation(s)
- Bernard Nsengimana
- Laboratory of Cell Signal Transduction, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Henan University, Kaifeng, China
| | - Faiz Ali Khan
- Laboratory of Cell Signal Transduction, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Henan University, Kaifeng, China
- School of Life Sciences, Henan University, Kaifeng, China
- Department of Basic Sciences Research, Shaukat Khanum Memorial Cancer Hospital and Research Centre (SKMCH&RC), Lahore, Pakistan
| | - Usman Ayub Awan
- Department of Medical Laboratory Technology, The University of Haripur, Haripur, Pakistan
| | - Dandan Wang
- Laboratory of Cell Signal Transduction, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Henan University, Kaifeng, China
| | - Na Fang
- Laboratory of Cell Signal Transduction, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Henan University, Kaifeng, China
| | - Wenqiang Wei
- Laboratory of Cell Signal Transduction, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Henan University, Kaifeng, China
- *Correspondence: Wenqiang Wei, ; Weijuan Zhang, ; Shaoping Ji,
| | - Weijuan Zhang
- Laboratory of Cell Signal Transduction, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Henan University, Kaifeng, China
- *Correspondence: Wenqiang Wei, ; Weijuan Zhang, ; Shaoping Ji,
| | - Shaoping Ji
- Laboratory of Cell Signal Transduction, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Henan University, Kaifeng, China
- *Correspondence: Wenqiang Wei, ; Weijuan Zhang, ; Shaoping Ji,
| |
Collapse
|
7
|
Kawachi T, Masuda A, Yamashita Y, Takeda JI, Ohkawara B, Ito M, Ohno K. Regulated splicing of large exons is linked to phase-separation of vertebrate transcription factors. EMBO J 2021; 40:e107485. [PMID: 34605568 DOI: 10.15252/embj.2020107485] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 09/06/2021] [Accepted: 09/14/2021] [Indexed: 12/30/2022] Open
Abstract
Although large exons cannot be readily recognized by the spliceosome, many are evolutionarily conserved and constitutively spliced for inclusion in the processed transcript. Furthermore, whether large exons may be enriched in a certain subset of proteins, or mediate specific functions, has remained unclear. Here, we identify a set of nearly 3,000 SRSF3-dependent large constitutive exons (S3-LCEs) in human and mouse cells. These exons are enriched for cytidine-rich sequence motifs, which bind and recruit the splicing factors hnRNP K and SRSF3. We find that hnRNP K suppresses S3-LCE splicing, an effect that is mitigated by SRSF3 to thus achieve constitutive splicing of S3-LCEs. S3-LCEs are enriched in genes for components of transcription machineries, including mediator and BAF complexes, and frequently contain intrinsically disordered regions (IDRs). In a subset of analyzed S3-LCE-containing transcription factors, SRSF3 depletion leads to deletion of the IDRs due to S3-LCE exon skipping, thereby disrupting phase-separated assemblies of these factors. Cytidine enrichment in large exons introduces proline/serine codon bias in intrinsically disordered regions and appears to have been evolutionarily acquired in vertebrates. We propose that layered splicing regulation by hnRNP K and SRSF3 ensures proper phase-separation of these S3-LCE-containing transcription factors in vertebrates.
Collapse
Affiliation(s)
- Toshihiko Kawachi
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Akio Masuda
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Yoshihiro Yamashita
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Jun-Ichi Takeda
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Bisei Ohkawara
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Mikako Ito
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Kinji Ohno
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| |
Collapse
|