1
Polańska O, Szulc N, Stottko R, Olek M, Nadwodna J, Gąsior-Głogowska M, Szefczyk M. Challenges in Peptide Solubilization - Amyloids Case Study. Chem Rec 2024:e202400053. [PMID: 39023378 DOI: 10.1002/tcr.202400053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/18/2024] [Revised: 05/23/2024] [Indexed: 07/20/2024]
Abstract
Peptide science has been a rapidly growing research field because of the enormous potential applications of these biocompatible and bioactive molecules. However, many factors limit the widespread use of peptides in medicine, and low solubility is among the most common problems that hamper drug development in the early stages of research. Solubility is a crucial, albeit poorly understood, feature that determines peptide behavior. Several different solubility predictors have been proposed, and many strategies and protocols have been reported to dissolve peptides, but none of them is a one-size-fits-all method for solubilization of even the same peptide. In this review, we look for the reasons behind the difficulties in dissolving peptides, analyze the factors influencing peptide aggregation, conduct a critical analysis of solubilization strategies and protocols available in the literature, and give some tips on how to deal with the so-called difficult sequences. We focus on amyloids that are particularly difficult to dissolve and handle, such as amyloid beta (Aβ), insulin, and phenol-soluble modulins (PSMs).
Affiliation(s)
- Oliwia Polańska
  - Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wybrzeze Wyspianskiego 27, 50-370, Wroclaw, Poland
- Natalia Szulc
  - Department of Physics and Biophysics, Wroclaw University of Environmental and Life Sciences, Norwida 25, 50-375, Wrocław, Poland
- Rafał Stottko
  - Faculty of Chemistry, Wrocław University of Science and Technology, Gdanska 7/9, 50-344, Wrocław, Poland
- Mateusz Olek
  - Faculty of Medical Sciences in Zabrze, Medical University of Silesia in Katowice, Traugutta 2, 41-800 Zabrze, Poland
- Julita Nadwodna
  - Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wybrzeze Wyspianskiego 27, 50-370, Wroclaw, Poland
- Marlena Gąsior-Głogowska
  - Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wybrzeze Wyspianskiego 27, 50-370, Wroclaw, Poland
- Monika Szefczyk
  - Department of Bioorganic Chemistry, Faculty of Chemistry, Wroclaw University of Science and Technology, Wybrzeze Wyspianskiego 27, 50-370, Wroclaw, Poland
2
Han CW, Jeong MS, Jang SB. Influence of the interaction between p53 and ZNF568 on mitochondrial oxidative phosphorylation. Int J Biol Macromol 2024; 275:133314. [PMID: 38944084 DOI: 10.1016/j.ijbiomac.2024.133314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/05/2024] [Revised: 06/10/2024] [Accepted: 06/19/2024] [Indexed: 07/01/2024]
Abstract
The tumor suppressor p53 plays important roles in suppressing the development and progression of cancer by responding to various stress signals. In addition, p53 can regulate the metabolic pathways of cancer cells by regulating energy metabolism and oxidative phosphorylation. Here, we present a mechanism for the interaction between p53 and ZNF568. Initially, we used X-ray crystallography to determine the irregular loop structure of the ZNF568 KRAB domain; this loop plays an important role in the interaction between p53 and ZNF568. In addition, Cryo-EM was used to examine how the p53 DBD and ZNF568 KRAB domains bind together. The function of ZNF568 on p53-mediated mitochondrial respiration was confirmed by measuring glucose consumption and lactate production. These findings show that ZNF568 can reduce p53-mediated mitochondrial respiratory activity by binding to p53 and inhibiting the transcription of SCO2. SIGNIFICANCE: ZNF568 can directly bind to the p53 DBD and transcriptionally regulate the SCO2 gene. SCO2 transcriptional regulation by interaction between ZNF568 and p53 may regulate the balance between mitochondrial respiration and glycolysis.
Affiliation(s)
- Chang Woo Han
  - Institute of Systems Biology, Pusan National University, Jangjeon-dong, Geumjeong-gu, Busan 46241, Republic of Korea
- Mi Suk Jeong
  - Institute of Systems Biology, Pusan National University, Jangjeon-dong, Geumjeong-gu, Busan 46241, Republic of Korea
- Se Bok Jang
  - Department of Molecular Biology, College of Natural Sciences, Pusan National University, 2, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
3
Waszkiewicz R, Michaś A, Białobrzewski MK, Klepka BP, Cieplak-Rotowska MK, Staszałek Z, Cichocki B, Lisicki M, Szymczak P, Niedzwiecka A. Hydrodynamic Radii of Intrinsically Disordered Proteins: Fast Prediction by Minimum Dissipation Approximation and Experimental Validation. J Phys Chem Lett 2024; 15:5024-5033. [PMID: 38696815 PMCID: PMC11103702 DOI: 10.1021/acs.jpclett.4c00312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2024] [Revised: 04/12/2024] [Accepted: 04/26/2024] [Indexed: 05/04/2024]
Abstract
The diffusion coefficients of globular and fully unfolded proteins can be predicted with high accuracy solely from their mass or chain length. However, this approach fails for intrinsically disordered proteins (IDPs) containing structural domains. We propose a rapid predictive methodology for estimating the diffusion coefficients of IDPs. The methodology uses accelerated conformational sampling based on self-avoiding random walks and includes hydrodynamic interactions between coarse-grained protein subunits, modeled using the generalized Rotne-Prager-Yamakawa approximation. To estimate the hydrodynamic radius, we rely on the minimum dissipation approximation recently introduced by Cichocki et al. Using a large set of experimentally measured hydrodynamic radii of IDPs over a wide range of chain lengths and domain contributions, we demonstrate that our predictions are more accurate than the Kirkwood approximation and phenomenological approaches. Our technique may prove to be valuable in predicting the hydrodynamic properties of both fully unstructured and multidomain disordered proteins.
Affiliation(s)
- Radost Waszkiewicz
  - Institute of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
- Agnieszka Michaś
  - Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
- Michał K. Białobrzewski
  - Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
- Barbara P. Klepka
  - Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
- Zuzanna Staszałek
  - Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
- Bogdan Cichocki
  - Institute of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
- Maciej Lisicki
  - Institute of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
- Piotr Szymczak
  - Institute of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
- Anna Niedzwiecka
  - Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
4
Walseng E, Wang B, Yang C, Patel P, Zhao C, Zhang H, Zhao P, Mazor Y. Conformation-selective rather than avidity-based binding to tumor associated antigen derived peptide-MHC enables targeting of WT1-pMHC low expressing cancer cells by anti-WT1-pMHC/CD3 T cell engagers. Front Immunol 2023; 14:1275304. [PMID: 38022650 PMCID: PMC10667733 DOI: 10.3389/fimmu.2023.1275304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/09/2023] [Accepted: 10/25/2023] [Indexed: 12/01/2023] Open
Abstract
T cell engagers, a category of T cell-retargeting immunotherapy, are rapidly transforming clinical cancer care. However, the lack of tumor-specific targets poses a significant roadblock for broad adaptation of this therapeutic modality in many indications, often resulting in systemic on-target off-tumor toxicity. Though various tumor-derived intracellular mutations provide a massive pool of potential tumor-specific antigens, targeting them is extremely challenging, partly due to the low copy number of tumor associated antigen (TAA)-derived pMHC on tumor cell surface. Further, the interplay of binding geometry and format valency in relation to the capacity of a T cell engager to efficiently target low density cell-surface pMHC is not well understood. Using the Wilms' tumor 1 (WT1) oncoprotein as a proof-of-principle TAA, combined with an array of IgG-like T cell engager modalities that differ in their anti-TAA valency and binding geometry, we show that the ability to induce an immunological synapse formation, resulting in potent killing of WT1 positive cancer cell lines is primarily dependent on the distinct geometrical conformations between the Fab arms of anti-WT1-HLA-A*02:01 and anti-CD3. The augmented avidity conferred by the binding of two anti-WT1-HLA-A*02:01 Fab arms has only minimal influence on cell killing potency. These findings demonstrate the need for careful examination of key design parameters for the development of next-generation T cell engagers targeting low density TAA-pMHCs on tumor cells.
Affiliation(s)
- Yariv Mazor
  - Biologics Engineering, Biopharmaceutical R&D, AstraZeneca, Gaithersburg, MD, United States
5
Oh C, Buckley PM, Choi J, Hierro A, DiMaio D. Sequence-independent activity of a predicted long disordered segment of the human papillomavirus type 16 L2 capsid protein during virus entry. Proc Natl Acad Sci U S A 2023; 120:e2307721120. [PMID: 37819982 PMCID: PMC10589650 DOI: 10.1073/pnas.2307721120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/13/2023] [Accepted: 08/28/2023] [Indexed: 10/13/2023] Open
Abstract
The activity of proteins is thought to be invariably determined by their amino acid sequence or composition, but we show that a long segment of a viral protein can support infection independent of its sequence or composition. During virus entry, the papillomavirus L2 capsid protein protrudes through the endosome membrane into the cytoplasm to bind cellular factors such as retromer required for intracellular virus trafficking. Here, we show that an ~110 amino acid segment of L2 is predicted to be disordered and that large deletions in this segment abolish infectivity of HPV16 pseudoviruses by inhibiting cytoplasmic protrusion of L2, association with retromer, and proper virus trafficking. The activity of these mutants can be restored by insertion of protein segments with diverse sequences, compositions, and chemical properties, including scrambled amino acid sequences, a tandem array of a short sequence, and the intrinsically disordered region of an unrelated cellular protein. The infectivity of mutants with small in-frame deletions in this segment directly correlates with the size of the segment. These results indicate that the length of the disordered segment, not its sequence or composition, determines its activity during HPV16 pseudovirus infection. We propose that a minimal length of L2 is required for it to protrude far enough into the cytoplasm to bind cytoplasmic trafficking factors, but the sequence of this segment is largely irrelevant. Thus, protein segments can carry out complex biological functions such as Human papillomavirus pseudovirus infection in a sequence-independent manner. This finding has important implications for protein function and evolution.
Affiliation(s)
- Changin Oh
  - Department of Genetics, Yale School of Medicine, New Haven, CT 06520-8005
- Patrick M. Buckley
  - Department of Microbial Pathogenesis, Yale School of Medicine, New Haven, CT 06536-0812
- Jeongjoon Choi
  - Department of Genetics, Yale School of Medicine, New Haven, CT 06520-8005
- Aitor Hierro
  - Center for Cooperative Research in Biosciences, Bilbao, Derio 48160, Spain
  - Basque Foundation for Science, Bilbao 48009, Spain
- Daniel DiMaio
  - Department of Genetics, Yale School of Medicine, New Haven, CT 06520-8005
  - Department of Therapeutic Radiology, Yale School of Medicine, New Haven, CT 06520-8040
  - Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, CT 06520-8024
  - Yale Cancer Center, New Haven, CT 06520-8028
6
Corbella M, Pinto GP, Kamerlin SCL. Loop dynamics and the evolution of enzyme activity. Nat Rev Chem 2023; 7:536-547. [PMID: 37225920 DOI: 10.1038/s41570-023-00495-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Accepted: 04/06/2023] [Indexed: 05/26/2023]
Abstract
In the early 2000s, Tawfik presented his 'New View' on enzyme evolution, highlighting the role of conformational plasticity in expanding the functional diversity of limited repertoires of sequences. This view is gaining increasing traction with increasing evidence of the importance of conformational dynamics in both natural and laboratory evolution of enzymes. The past years have seen several elegant examples of harnessing conformational (particularly loop) dynamics to successfully manipulate protein function. This Review revisits flexible loops as critical participants in regulating enzyme activity. We showcase several systems of particular interest: triosephosphate isomerase barrel proteins, protein tyrosine phosphatases and β-lactamases, while briefly discussing other systems in which loop dynamics are important for selectivity and turnover. We then discuss the implications for engineering, presenting examples of successful loop manipulation in either improving catalytic efficiency, or changing selectivity completely. Overall, it is becoming clearer that mimicking nature by manipulating the conformational dynamics of key protein loops is a powerful method of tailoring enzyme activity, without needing to target active-site residues.
Affiliation(s)
- Marina Corbella
  - Department of Chemistry, Uppsala University, Uppsala, Sweden
- Gaspar P Pinto
  - Department of Chemistry, Uppsala University, Uppsala, Sweden
  - Cortex Discovery GmbH, Regensburg, Germany
- Shina C L Kamerlin
  - Department of Chemistry, Uppsala University, Uppsala, Sweden
  - School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA
7
Sarabandi S, Pourtaghi H. Whole genome sequence analysis of CPV-2 isolates from 1998 to 2020. Virol J 2023; 20:138. [PMID: 37400901 DOI: 10.1186/s12985-023-02102-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/30/2023] [Accepted: 06/14/2023] [Indexed: 07/05/2023] Open
Abstract
Canine parvovirus-2 (CPV-2) is a virus with worldwide spread causing canine gastroenteritis. New strains of this virus have unique characteristics and are resistant to some vaccine strains. Therefore, understanding the root causes of resistance has proven to be of increasing concern to many scientists. This study collected 126 whole genome sequences of CPV-2 subtypes with specific collection dates from the NCBI data bank. The whole genome sequences of CPV-2 collected from different countries were analyzed to detect the new substitutions and update these mutations. The results indicated 12, 7, and 10 mutations in NS1, VP1, and VP2, respectively. Moreover, the A5G and Q370R mutations of VP2 are the most common changes in the recent isolates of the CPV-2C subtype, and the new N93K residue of VP2 is speculated to be the cause of vaccine failure. To summarize, the observed mutations, which are increasing over time, cause several changes in viral characteristics. A comprehensive understanding of these mutations can lead us to control potential future epidemics associated with this virus more efficiently.
Affiliation(s)
- Sajed Sarabandi
  - Department of Pathobiology, Islamic Azad University, Karaj Branch, Karaj, Iran
- Hadi Pourtaghi
  - Department of Microbiology, Islamic Azad University, Karaj Branch, Karaj, Iran
8
Oh C, Buckley PM, Choi J, Hierro A, DiMaio D. Sequence independent activity of a predicted long disordered segment of the human papillomavirus L2 capsid protein during virus entry. bioRxiv 2023:2023.03.21.533711. [PMID: 36993745 PMCID: PMC10055320 DOI: 10.1101/2023.03.21.533711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/31/2023]
Abstract
The papillomavirus L2 capsid protein protrudes through the endosome membrane into the cytoplasm during virus entry to bind cellular factors required for intracellular virus trafficking. Cytoplasmic protrusion of HPV16 L2, virus trafficking, and infectivity are inhibited by large deletions in an ∼110 amino acid segment of L2 that is predicted to be disordered. The activity of these mutants can be restored by inserting protein segments with diverse compositions and chemical properties into this region, including scrambled sequences, a tandem array of a short sequence, and the intrinsically disordered region of a cellular protein. The infectivity of mutants with small in-frame insertions and deletions in this segment directly correlates with the size of the segment. These results indicate that the length of the disordered segment, not its sequence or its composition, determines its activity during virus entry. Sequence independent but length dependent activity has important implications for protein function and evolution.
9
He J, Huang F, Zhang J, Chen Q, Zheng Z, Zhou Q, Chen D, Li J, Chen J. Vaccine design based on 16 epitopes of SARS-CoV-2 spike protein. J Med Virol 2020; 93:2115-2131. [PMID: 33091154 PMCID: PMC7675516 DOI: 10.1002/jmv.26596] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Received: 07/26/2020] [Revised: 09/18/2020] [Accepted: 10/07/2020] [Indexed: 12/21/2022]
Abstract
The global outbreak of severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) urgently requires an effective vaccine for prevention. In this study, 66 epitopes containing pentapeptides of SARS‐CoV‐2 spike protein in the IEDB database were compared with the amino acid sequence of SARS‐CoV‐2 spike protein, and 66 potentially immune‐related peptides of SARS‐CoV‐2 spike protein were obtained. Based on the single‐nucleotide polymorphisms analysis of spike protein of 1218 SARS‐CoV‐2 isolates, 52 easily mutated sites were identified and used for vaccine epitope screening. The best vaccine candidate epitopes in the 66 peptides of SARS‐CoV‐2 spike protein were screened out through mutation and immunoinformatics analysis. The best candidate epitopes were connected by different linkers in silico to obtain vaccine candidate sequences. The results showed that 16 epitopes were relatively conservative, immunological, nontoxic, and nonallergenic, could induce the secretion of cytokines, and were more likely to be exposed on the surface of the spike protein. They were both B‐ and T‐cell epitopes, and could recognize a certain number of HLA molecules and had high coverage rates in different populations. Moreover, epitopes 897‐913 were predicted to have possible cross‐immunoprotection for SARS‐CoV and SARS‐CoV‐2. The results of vaccine candidate sequences screening suggested that sequences (without linker, with linker GGGSGGG, EAAAK, GPGPG, and KK, respectively) were the best. The proteins translated by these sequences were relatively stable, with a high antigenic index and good biological activity. Our study provided vaccine candidate epitopes and sequences for the research of the SARS‐CoV‐2 vaccine.
Affiliation(s)
- Jinlei He
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
- Fan Huang
  - Department of First Surgical, Chengdu Shuangliu Hospital of Traditional Chinese Medicine, Chengdu, China
- Jianhui Zhang
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
- Qiwei Chen
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
- Zhiwan Zheng
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
- Qi Zhou
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
- Dali Chen
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
- Jiao Li
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
- Jianping Chen
  - Department of Pathogenic Biology, West China School of Basic Medical Sciences and Forensic Medicine, Sichuan University, Chengdu, China
  - Animal Disease Prevention and Food Safety Key Laboratory of Sichuan Province, Chengdu, China
10
Abbass J, Nebel JC. Enhancing fragment-based protein structure prediction by customising fragment cardinality according to local secondary structure. BMC Bioinformatics 2020; 21:170. [PMID: 32357827 PMCID: PMC7195757 DOI: 10.1186/s12859-020-3491-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 09/30/2019] [Accepted: 04/13/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Whenever suitable template structures are not available, usage of fragment-based protein structure prediction becomes the only practical alternative as pure ab initio techniques require massive computational resources even for very small proteins. However, inaccuracy of their energy functions and their stochastic nature imposes generation of a large number of decoys to explore adequately the solution space, limiting their usage to small proteins. Taking advantage of the uneven complexity of the sequence-structure relationship of short fragments, we adjusted the fragment insertion process by customising the number of available fragment templates according to the expected complexity of the predicted local secondary structure. Whereas the number of fragments is kept to its default value for coil regions, important and dramatic reductions are proposed for beta sheet and alpha helical regions, respectively. RESULTS The evaluation of our fragment selection approach was conducted using an enhanced version of the popular Rosetta fragment-based protein structure prediction tool. It was modified so that the number of fragment candidates used in Rosetta could be adjusted based on the local secondary structure. Compared to Rosetta's standard predictions, our strategy delivered improved first models, + 24% and + 6% in terms of GDT, when using 2000 and 20,000 decoys, respectively, while reducing significantly the number of fragment candidates. Furthermore, our enhanced version of Rosetta is able to deliver with 2000 decoys a performance equivalent to that produced by standard Rosetta while using 20,000 decoys. We hypothesise that, as the fragment insertion process focuses on the most challenging regions, such as coils, fewer decoys are needed to explore satisfactorily conformation spaces. 
CONCLUSIONS Taking advantage of the high accuracy of sequence-based secondary structure predictions, we showed the value of that information to customise the number of candidates used during the fragment insertion process of fragment-based protein structure prediction. Experiments conducted using standard Rosetta showed that, when using the recommended number of decoys, i.e. 20,000, our strategy produces better results. Alternatively, similar results can be achieved using only 2000 decoys. Consequently, we recommend the adoption of this strategy to either improve significantly model quality or reduce processing times by a factor of 10.
Affiliation(s)
- Jad Abbass
  - Faculty of Science, Engineering and Computing, Kingston University, London, KT1 2EE UK
  - Department of Computer Science, Lebanese International University, Bekaa, Lebanon
- Jean-Christophe Nebel
  - Faculty of Science, Engineering and Computing, Kingston University, London, KT1 2EE UK
11
Mitusińska K, Skalski T, Góra A. Simple Selection Procedure to Distinguish between Static and Flexible Loops. Int J Mol Sci 2020; 21:ijms21072293. [PMID: 32225102 PMCID: PMC7177474 DOI: 10.3390/ijms21072293] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/10/2020] [Revised: 03/22/2020] [Accepted: 03/24/2020] [Indexed: 12/02/2022] Open
Abstract
Loops are the most variable and unorganized elements of the secondary structure of proteins. Their ability to shift their shape can play a role in the binding of small ligands, enzymatic catalysis, or protein–protein interactions. Due to the loop flexibility, the positions of their residues in solved structures show the largest B-factors, or in a worst-case scenario can be unknown. Based on the timescale of their movements, loops can be divided into slow (static) and fast (flexible). Although most of the loops that are missing in experimental structures belong to the flexible loops group, the computational tools for loop reconstruction use a set of static loop conformations to predict the missing part of the structure and evaluate the model. We believe that these two loop types can adopt different conformations and that using scoring functions appropriate for static loops is not sufficient for flexible loops. We showed that common model evaluation methods are insufficient in the case of flexible solvent-exposed loops. Instead, we recommend using the potential energy to evaluate such loop models. We provide a novel model selection method based on a set of geometrical parameters to distinguish between flexible and static loops without the use of molecular dynamics simulations. We have also pointed out the importance of the water network and interactions with the solvent for flexible loop modeling.
Affiliation(s)
- Karolina Mitusińska
  - Tunneling Group, Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland
- Tomasz Skalski
  - Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland
- Artur Góra
  - Tunneling Group, Biotechnology Centre, Silesian University of Technology, ul. Krzywoustego 8, 44-100 Gliwice, Poland
  - Correspondence: Tel.: +48-322371659
12
Marks C, Shi J, Deane CM. Predicting loop conformational ensembles. Bioinformatics 2018; 34:949-956. [PMID: 29136084 DOI: 10.1093/bioinformatics/btx718] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Received: 07/13/2017] [Accepted: 11/09/2017] [Indexed: 12/23/2022] Open
Abstract
Motivation Protein function is often facilitated by the existence of multiple stable conformations. Structure prediction algorithms need to be able to model these different conformations accurately and produce an ensemble of structures that represent a target's conformational diversity rather than just a single state. Here, we investigate whether current loop prediction algorithms are capable of this. We use the algorithms to predict the structures of loops with multiple experimentally determined conformations, and the structures of loops with only one conformation, and assess their ability to generate and select decoys that are close to any, or all, of the observed structures. Results We find that while loops with only one known conformation are predicted well, conformationally diverse loops are modelled poorly, and in most cases the predictions returned by the methods do not resemble any of the known conformers. Our results contradict the often-held assumption that multiple native conformations will be present in the decoy set, making the production of accurate conformational ensembles impossible, and hence indicating that current methodologies are not well suited to prediction of conformationally diverse, often functionally important protein regions. Contact marks@stats.ox.ac.uk. Supplementary information Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Claire Marks
  - Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
- Jiye Shi
  - Department of Chemistry, UCB Pharma, Slough SL1 3WE, UK
13
Kyeong HH, Choi Y, Kim HS. GradDock: rapid simulation and tailored ranking functions for peptide-MHC Class I docking. Bioinformatics 2018; 34:469-476. [PMID: 28968726 DOI: 10.1093/bioinformatics/btx589] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 04/08/2017] [Accepted: 09/15/2017] [Indexed: 01/16/2023] Open
Abstract
Motivation The identification of T-cell epitopes has many profound translational applications in the areas of transplantation, disease diagnosis, vaccine/therapeutic protein development and personalized immunotherapy. While data-driven methods have been widely used for the prediction of peptide binders with notable successes, the structural modeling of peptide binding to MHC molecules is crucial for understanding the underlying molecular mechanism of the immunological processes. Results We developed GradDock, a structure-based method for the rapid and accurate modeling of peptide binding to MHC Class I (pMHC-I). GradDock explicitly models diverse unbound peptides in vacuo and inserts them into the MHC-I groove through a steered gradient descent with a topological correction process. The simulation process yields diverse structural conformations including native-like peptides. We completely revised the Rosetta score terms and developed a new ranking function specifically for pMHC-I. Using the diverse peptides, a linear programming approach is applied to find the optimal weights for the individual Rosetta score terms. Our examination revealed that a refinement of the dihedral angles and a modification of the repulsion can dramatically improve the modeling quality. GradDock is five-times faster than a Rosetta-based docking approach for pMHC-I. We also demonstrate that the predictive capability of GradDock with the re-weighted Rosetta ranking function is consistently more accurate than the Rosetta-based method with the standard Rosetta score (approximately three-times better for a cross-docking set). Availability and implementation GradDock is freely available for academic purposes. The program and the ranking score weights for Rosetta are available at http://bel.kaist.ac.kr/research/GradDock. Contact hskim76@kaist.ac.kr. Supplementary information Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Hyun-Ho Kyeong
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
- Yoonjoo Choi
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
- Hak-Sung Kim
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
|
14
|
The soluble loop BC region guides, but not dictates, the assembly of the transmembrane cytochrome b6. PLoS One 2017; 12:e0189532. [PMID: 29240839 PMCID: PMC5730185 DOI: 10.1371/journal.pone.0189532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/06/2017] [Accepted: 11/27/2017] [Indexed: 11/19/2022] [Open access]
Abstract
Studying folding and assembly of naturally occurring α-helical transmembrane proteins can inspire the design of membrane proteins with defined functions. Thus far, most studies have focused on the role of membrane-integrated protein regions. However, to fully understand folding pathways and stabilization of α-helical membrane proteins, it is vital to also include the role of soluble loops. We have analyzed the impact of interhelical loops on folding, assembly and stability of the heme-containing four-helix bundle transmembrane protein cytochrome b6 that is involved in charge transfer across biomembranes. Cytochrome b6 consists of two transmembrane helical hairpins that sandwich two heme molecules. Our analyses strongly suggest that the loop connecting the helical hairpins is not crucial for positioning the two protein “halves” for proper folding and assembly of the holo-protein. Furthermore, proteolytic removal of either of the remaining two loops, which connect the two transmembrane helices of a hairpin structure, also appears not to crucially affect folding and assembly. Overall, the transmembrane four-helix bundle appears to be mainly stabilized via interhelical interactions in the transmembrane regions, while the soluble loop regions guide assembly and stabilize the holo-protein. The results of this study might steer future strategies aiming at designing heme-binding four-helix bundle structures involved in transmembrane charge transfer reactions.
|
15
|
Joo H, Chavan AG, Fraga KJ, Tsai J. An amino acid code for irregular and mixed protein packing. Proteins 2015; 83:2147-61. [PMID: 26370334 DOI: 10.1002/prot.24929] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 05/04/2015] [Revised: 09/01/2015] [Accepted: 09/02/2015] [Indexed: 11/10/2022]
Abstract
To advance our understanding of protein tertiary structure, the development of the knob-socket model is completed with an analysis of packing in irregular coil and turn secondary structure as well as between mixed secondary structures. The knob-socket model simplifies packing based on repeated patterns of two motifs: a three-residue socket for packing within secondary (2°) structure and a four-residue knob-socket for tertiary (3°) packing. For coil and turn secondary structure, knob-sockets allow identification of a correlation between amino acid composition and tertiary arrangements in space. Coil contributes almost as much as α-helices to tertiary packing. In irregular sockets, Gly, Pro, Asp, and Ser are favored, while in irregular knobs, the preference order is Arg, Asp, Pro, Asn, Thr, Leu, and Gly. Cys, His, Met, and Trp are not favored in either. In mixed packing, the knob amino acid preferences are a function of the socket that they are packing into, whereas the amino acid composition of the sockets does not depend on the secondary structure of the knob. A unique motif of a coil knob with an XYZ β-sheet socket may potentially function to inhibit β-sheet extension. In addition, analysis of the preferred crossing angles for strands within a β-sheet and for mixed α-helix/β-sheet packing identifies canonical packing patterns useful in protein design. Lastly, the knob-socket model abstracts the complexity of protein tertiary structure into an intuitive packing surface topology map.
Affiliation(s)
- Hyun Joo
- Department of Chemistry, University of the Pacific, Stockton, California, 95211
- Archana G Chavan
- Department of Chemistry, University of the Pacific, Stockton, California, 95211
- Keith J Fraga
- Department of Chemistry, University of the Pacific, Stockton, California, 95211
- Jerry Tsai
- Department of Chemistry, University of the Pacific, Stockton, California, 95211
|
16
|
de Oliveira SHP, Shi J, Deane CM. Building a better fragment library for de novo protein structure prediction. PLoS One 2015; 10:e0123998. [PMID: 25901595 PMCID: PMC4406757 DOI: 10.1371/journal.pone.0123998] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Received: 11/17/2014] [Accepted: 02/25/2015] [Indexed: 01/11/2023] [Open access]
Abstract
Fragment-based approaches are the current standard for de novo protein structure prediction. These approaches rely on accurate and reliable fragment libraries to generate good structural models. In this work, we describe a novel method for structure fragment library generation and its application in fragment-based de novo protein structure prediction. The importance of correct testing procedures in assessing the quality of fragment libraries is demonstrated. In particular, homologs of the target must be excluded from the libraries to correctly simulate a de novo protein structure prediction scenario, something which, surprisingly, is not always done. We demonstrate that fragments presenting different predominant predicted secondary structures should be treated differently during the fragment library generation step and that exhaustive and random search strategies should both be used. This information was used to develop a novel method, Flib. On a validation set of 41 structurally diverse proteins, Flib libraries present both higher precision and higher coverage than two state-of-the-art methods, NNMake and HHFrag. Flib also achieves better precision and coverage on the set of 275 protein domains used in the two previous experiments of the Critical Assessment of Structure Prediction (CASP9 and CASP10). We compared Flib libraries against NNMake libraries in a structure prediction context. Of the 13 cases in which a correct answer was generated, Flib models were more accurate than NNMake models for 10. Flib is available for download at: http://www.stats.ox.ac.uk/research/proteins/resources.
Affiliation(s)
- Jiye Shi
- Department of Informatics, UCB Pharma, Slough, United Kingdom
- Shanghai Institute of Applied Physics, Chinese Academy of Sciences, Shanghai, China
- Charlotte M. Deane
- Department of Statistics, Oxford University, Oxford, Oxfordshire, United Kingdom
|