26
|
Choi YK, Cao Y, Frank M, Woo H, Park SJ, Yeom MS, Croll TI, Seok C, Im W. Structure, Dynamics, Receptor Binding, and Antibody Binding of the Fully Glycosylated Full-Length SARS-CoV-2 Spike Protein in a Viral Membrane. J Chem Theory Comput 2021; 17:2479-2487. [PMID: 33689337 PMCID: PMC8047829 DOI: 10.1021/acs.jctc.0c01144] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
![]()
The spike (S) protein of severe acute
respiratory syndrome coronavirus
2 (SARS-CoV-2) mediates host cell entry by binding to angiotensin-converting
enzyme 2 (ACE2) and is considered the major target for drug and vaccine
development. We previously built fully glycosylated full-length SARS-CoV-2
S protein models in a viral membrane including both open and closed
conformations of the receptor-binding domain (RBD) and different templates
for the stalk region. In this work, multiple μs-long all-atom
molecular dynamics simulations were performed to provide deeper insights
into the structure and dynamics of S protein and glycan functions.
Our simulations reveal that the highly flexible stalk is composed
of two independent joints and most probable S protein orientations
are competent for ACE2 binding. We identify multiple glycans stabilizing
the open and/or closed states of the RBD and demonstrate that the
exposure of antibody epitopes can be captured by detailed antibody–glycan
clash analysis instead of commonly used accessible surface area analysis
that tends to overestimate the impact of glycan shielding and neglect
possible detailed interactions between glycan and antibodies. Overall,
our observations offer structural and dynamic insights into the SARS-CoV-2
S protein and potentialize for guiding the design of effective antiviral
therapeutics.
Collapse
|
27
|
Yu J, Yang J, Seok C, Song WJ. Symmetry-related residues as promising hotspots for the evolution of de novo oligomeric enzymes. Chem Sci 2021; 12:5091-5101. [PMID: 34168770 PMCID: PMC8179601 DOI: 10.1039/d0sc06823c] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Directed evolution has provided us with great opportunities and prospects in the synthesis of tailor-made proteins. It, however, often requires at least mid to high throughput screening, necessitating more effective strategies for laboratory evolution. We herein demonstrate that protein symmetry can be a versatile criterion for searching for promising hotspots for the directed evolution of de novo oligomeric enzymes. The randomization of symmetry-related residues located at the rotational axes of artificial metallo-β-lactamase yields drastic effects on catalytic activities, whereas that of non-symmetry-related, yet, proximal residues to the active site results in negligible perturbations. Structural and biochemical analysis of the positive hits indicates that seemingly trivial mutations at symmetry-related spots yield significant alterations in overall structures, metal-coordination geometry, and chemical environments of active sites. Our work implicates that numerous artificially designed and natural oligomeric proteins might have evolutionary advantages of propagating beneficial mutations using their global symmetry. Symmetry-related residues located at the rotational axes can be promising hotspots for the evolution of de novo oligomeric enzymes even though they are distantly located from the active site pocket.![]()
Collapse
|
28
|
Abstract
Fluorescent molecules, fluorophores or dyes, play essential roles in bioimaging. Effective bioimaging requires fluorophores with diverse colors and high quantum yields for better resolution. An essential computational component to design novel dye molecules is an accurate model that predicts the electronic properties of molecules. Here, we present statistical machines that predict the excitation energies and associated oscillator strengths of a given molecule using the random forest algorithm. The excitation energies and oscillator strengths of a molecule are closely related to the emission spectrum and the quantum yields of fluorophores, respectively. In this study, we identified specific molecular substructures that induce high oscillator strengths of molecules. The results of our study are expected to serve as new design principles for designing novel fluorophores.
Collapse
|
29
|
Kim J, Han W, Park T, Kim EJ, Bang I, Lee HS, Jeong Y, Roh K, Kim J, Kim JS, Kang C, Seok C, Han JK, Choi HJ. Sclerostin inhibits Wnt signaling through tandem interaction with two LRP6 ectodomains. Nat Commun 2020; 11:5357. [PMID: 33097721 PMCID: PMC7585440 DOI: 10.1038/s41467-020-19155-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Accepted: 09/30/2020] [Indexed: 12/21/2022] Open
Abstract
Low-density lipoprotein receptor-related protein 6 (LRP6) is a coreceptor of the β-catenin-dependent Wnt signaling pathway. The LRP6 ectodomain binds Wnt proteins, as well as Wnt inhibitors such as sclerostin (SOST), which negatively regulates Wnt signaling in osteocytes. Although LRP6 ectodomain 1 (E1) is known to interact with SOST, several unresolved questions remain, such as the reason why SOST binds to LRP6 E1E2 with higher affinity than to the E1 domain alone. Here, we present the crystal structure of the LRP6 E1E2–SOST complex with two interaction sites in tandem. The unexpected additional binding site was identified between the C-terminus of SOST and the LRP6 E2 domain. This interaction was confirmed by in vitro binding and cell-based signaling assays. Its functional significance was further demonstrated in vivo using Xenopus laevis embryos. Our results provide insights into the inhibitory mechanism of SOST on Wnt signaling. The low-density lipoprotein receptor-related protein 6 (LRP6) is a co-receptor of the β-catenin-dependent Wnt signaling pathway and interacts with the Wnt inhibitor sclerostin (SOST). Here the authors present the crystal structure of SOST in complex with the LRP6 E1E2 ectodomain construct, which reveals that the SOST C-terminus binds to the LRP6 E2 domain, and further validate this binding site with in vitro and in vivo experiments.
Collapse
|
30
|
Kang B, Seok C, Lee J. Prediction of Molecular Electronic Transitions Using Random Forests. J Chem Inf Model 2020; 60:5984-5994. [PMID: 33090804 DOI: 10.1021/acs.jcim.0c00698] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Fluorescent molecules, fluorophores or dyes, play essential roles in bioimaging. Effective bioimaging requires fluorophores with diverse colors and high quantum yields for better resolution. An essential computational component to design novel dye molecules is an accurate model that predicts the electronic properties of molecules. Here, we present statistical machines that predict the excitation energies and associated oscillator strengths of a given molecule using the random forest algorithm. The excitation energies and oscillator strengths of a molecule are closely related to the emission spectrum and the quantum yields of fluorophores, respectively. In this study, we identified specific molecular substructures that induce high oscillator strengths of molecules. The results of our study are expected to serve as new design principles for designing novel fluorophores.
Collapse
|
31
|
Woo H, Park SJ, Choi YK, Park T, Tanveer M, Cao Y, Kern NR, Lee J, Yeom MS, Croll TI, Seok C, Im W. Developing a Fully Glycosylated Full-Length SARS-CoV-2 Spike Protein Model in a Viral Membrane. J Phys Chem B 2020; 124:7128-7137. [PMID: 32559081 PMCID: PMC7341691 DOI: 10.1021/acs.jpcb.0c04553] [Citation(s) in RCA: 183] [Impact Index Per Article: 45.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Revised: 06/18/2020] [Indexed: 12/27/2022]
Abstract
This technical study describes all-atom modeling and simulation of a fully glycosylated full-length SARS-CoV-2 spike (S) protein in a viral membrane. First, starting from PDB: 6VSB and 6VXX, full-length S protein structures were modeled using template-based modeling, de-novo protein structure prediction, and loop modeling techniques in GALAXY modeling suite. Then, using the recently determined most occupied glycoforms, 22 N-glycans and 1 O-glycan of each monomer were modeled using Glycan Reader & Modeler in CHARMM-GUI. These fully glycosylated full-length S protein model structures were assessed and further refined against the low-resolution data in their respective experimental maps using ISOLDE. We then used CHARMM-GUI Membrane Builder to place the S proteins in a viral membrane and performed all-atom molecular dynamics simulations. All structures are available in CHARMM-GUI COVID-19 Archive (http://www.charmm-gui.org/docs/archive/covid19) so that researchers can use these models to carry out innovative and novel modeling and simulation research for the prevention and treatment of COVID-19.
Collapse
|
32
|
Yang J, Kwon S, Bae SH, Park KM, Yoon C, Lee JH, Seok C. GalaxySagittarius: Structure- and Similarity-Based Prediction of Protein Targets for Druglike Compounds. J Chem Inf Model 2020; 60:3246-3254. [PMID: 32401021 DOI: 10.1021/acs.jcim.0c00104] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Computational techniques for predicting interactions of proteins and druglike molecules have often been used to search for compounds that bind a given protein with high affinity. More recently, such tools have also been applied to the reverse procedure of searching protein targets for a given compound. Among methods for predicting protein-ligand interactions, ligand-based methods relying on similarity to ligands of known interactions are effective only when similar protein-ligand interactions are known. Receptor-based methods predicting protein-ligand interactions by molecular docking are effective only when high-accuracy receptor structures and binding sites are available. Moreover, the computational cost of molecular docking tends to be too high to be applied to the entire protein structure database. In this paper, an effective target prediction method, which combines ligand similarity-based and receptor structure-based approaches, is introduced. In this method, protein-ligand docking is performed after efficient structure- and similarity-based screening. The enriched protein target database by predicted binding ligands and sites allows detection of protein targets with previously unknown ligand interactions. The method, called GalaxySagittarius, is freely available as a web server at http://galaxy.seoklab.org/sagittarius.
Collapse
|
33
|
Woo H, Park SJ, Choi YK, Park T, Tanveer M, Cao Y, Kern NR, Lee J, Yeom MS, Croll TI, Seok C, Im W. Developing a Fully-glycosylated Full-length SARS-CoV-2 Spike Protein Model in a Viral Membrane. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.05.20.103325. [PMID: 32511389 PMCID: PMC7263518 DOI: 10.1101/2020.05.20.103325] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
This technical study describes all-atom modeling and simulation of a fully-glycosylated full-length SARS-CoV-2 spike (S) protein in a viral membrane. First, starting from PDB:6VSB and 6VXX, full-length S protein structures were modeled using template-based modeling, de-novo protein structure prediction, and loop modeling techniques in GALAXY modeling suite. Then, using the recently-determined most occupied glycoforms, 22 N-glycans and 1 O-glycan of each monomer were modeled using Glycan Reader & Modeler in CHARMM-GUI. These fully-glycosylated full-length S protein model structures were assessed and further refined against the low-resolution data in their respective experimental maps using ISOLDE. We then used CHARMM-GUI Membrane Builder to place the S proteins in a viral membrane and performed all-atom molecular dynamics simulations. All structures are available in CHARMM-GUI COVID-19 Archive (http://www.charmm-gui.org/docs/archive/covid19), so researchers can use these models to carry out innovative and novel modeling and simulation research for the prevention and treatment of COVID-19.
Collapse
|
34
|
Lee GR, Won J, Heo L, Seok C. GalaxyRefine2: simultaneous refinement of inaccurate local regions and overall protein structure. Nucleic Acids Res 2020; 47:W451-W455. [PMID: 31001635 PMCID: PMC6602442 DOI: 10.1093/nar/gkz288] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 04/01/2019] [Accepted: 04/11/2019] [Indexed: 11/12/2022] Open
Abstract
The 3D structure of a protein can be predicted from its amino acid sequence with high accuracy for a large fraction of cases because of the availability of large quantities of experimental data and the advance of computational algorithms. Recently, deep learning methods exploiting the coevolution information obtained by comparing related protein sequences have been successfully used to generate highly accurate model structures even in the absence of template structure information. However, structures predicted based on either template structures or related sequences require further improvement in regions for which information is missing. Refining a predicted protein structure with insufficient information on certain regions is critical because these regions may be connected to functional specificity that is not conserved among related proteins. The GalaxyRefine2 web server, freely available via http://galaxy.seoklab.org/refine2, is an upgraded version of the GalaxyRefine protein structure refinement server and reflects recent developments successfully tested through CASP blind prediction experiments. This method adopts an iterative optimization approach involving various structure move sets to refine both local and global structures. The estimation of local error and hybridization of available homolog structures are also employed for effective conformation search.
Collapse
|
35
|
Baek M, Park T, Heo L, Seok C. Modeling Protein Homo-Oligomer Structures with GalaxyHomomer Web Server. Methods Mol Biol 2020; 2165:127-137. [PMID: 32621222 DOI: 10.1007/978-1-0716-0708-4_7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Cellular processes, such as metabolism, signal transduction, or immunity, often depend on the homo-oligomerization of proteins. Detailed structural knowledge of the homo-oligomer structure is therefore crucial for molecular-level understanding of protein functions and their regulation. In this chapter, we introduce the GalaxyHomomer server, which supports easy-to-use web interfaces for general users. It is freely accessible at http://galaxy.seoklab.org/homomer . GalaxyHomomer carries out template-based modeling, ab initio docking or both depending on the availability of proper oligomer templates. It also incorporates recently developed model refinement methods that can consistently improve model quality by performing symmetric loop modeling and overall structure refinement. Moreover, the server provides additional options that can be chosen by the user depending on the availability of information on the monomer structure, oligomeric state, and locations of unreliable/flexible loops or termini.
Collapse
|
36
|
Park T, Woo H, Baek M, Yang J, Seok C. Structure prediction of biological assemblies using GALAXY in CAPRI rounds 38-45. Proteins 2019; 88:1009-1017. [PMID: 31774573 DOI: 10.1002/prot.25859] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Revised: 11/11/2019] [Accepted: 11/23/2019] [Indexed: 12/12/2022]
Abstract
We participated in CARPI rounds 38-45 both as a server predictor and a human predictor. These CAPRI rounds provided excellent opportunities for testing prediction methods for three classes of protein interactions, that is, protein-protein, protein-peptide, and protein-oligosaccharide interactions. Both template-based methods (GalaxyTBM for monomer protein, GalaxyHomomer for homo-oligomer protein, GalaxyPepDock for protein-peptide complex) and ab initio docking methods (GalaxyTongDock and GalaxyPPDock for protein oligomer, GalaxyPepDock-ab-initio for protein-peptide complex, GalaxyDock2 and Galaxy7TM for protein-oligosaccharide complex) have been tested. Template-based methods depend heavily on the availability of proper templates and template-target similarity, and template-target difference is responsible for inaccuracy of template-based models. Inaccurate template-based models could be improved by our structure refinement and loop modeling methods based on physics-based energy optimization (GalaxyRefineComplex and GalaxyLoop) for several CAPRI targets. Current ab initio docking methods require accurate protein structures as input. Small conformational changes from input structure could be accounted for by our docking methods, producing one of the best models for several CAPRI targets. However, predicting large conformational changes involving protein backbone is still challenging, and full exploration of physics-based methods for such problems is still to come.
Collapse
|
37
|
Lensink MF, Brysbaert G, Nadzirin N, Velankar S, Chaleil RAG, Gerguri T, Bates PA, Laine E, Carbone A, Grudinin S, Kong R, Liu RR, Xu XM, Shi H, Chang S, Eisenstein M, Karczynska A, Czaplewski C, Lubecka E, Lipska A, Krupa P, Mozolewska M, Golon Ł, Samsonov S, Liwo A, Crivelli S, Pagès G, Karasikov M, Kadukova M, Yan Y, Huang SY, Rosell M, Rodríguez-Lumbreras LA, Romero-Durana M, Díaz-Bueno L, Fernandez-Recio J, Christoffer C, Terashi G, Shin WH, Aderinwale T, Subraman SRMV, Kihara D, Kozakov D, Vajda S, Porter K, Padhorny D, Desta I, Beglov D, Ignatov M, Kotelnikov S, Moal IH, Ritchie DW, de Beauchêne IC, Maigret B, Devignes MD, Echartea MER, Barradas-Bautista D, Cao Z, Cavallo L, Oliva R, Cao Y, Shen Y, Baek M, Park T, Woo H, Seok C, Braitbard M, Bitton L, Scheidman-Duhovny D, Dapkūnas J, Olechnovič K, Venclovas Č, Kundrotas PJ, Belkin S, Chakravarty D, Badal VD, Vakser IA, Vreven T, Vangaveti S, Borrman T, Weng Z, Guest JD, Gowthaman R, Pierce BG, Xu X, Duan R, Qiu L, Hou J, Merideth BR, Ma Z, Cheng J, Zou X, Koukos PI, Roel-Touris J, Ambrosetti F, Geng C, Schaarschmidt J, Trellet ME, Melquiond ASJ, Xue L, Jiménez-García B, van Noort CW, Honorato RV, Bonvin AMJJ, Wodak SJ. Blind prediction of homo- and hetero-protein complexes: The CASP13-CAPRI experiment. Proteins 2019; 87:1200-1221. [PMID: 31612567 PMCID: PMC7274794 DOI: 10.1002/prot.25838] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Revised: 09/26/2019] [Accepted: 09/27/2019] [Indexed: 12/28/2022]
Abstract
We present the results for CAPRI Round 46, the third joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of 20 targets including 14 homo-oligomers and 6 heterocomplexes. Eight of the homo-oligomer targets and one heterodimer comprised proteins that could be readily modeled using templates from the Protein Data Bank, often available for the full assembly. The remaining 11 targets comprised 5 homodimers, 3 heterodimers, and two higher-order assemblies. These were more difficult to model, as their prediction mainly involved "ab-initio" docking of subunit models derived from distantly related templates. A total of ~30 CAPRI groups, including 9 automatic servers, submitted on average ~2000 models per target. About 17 groups participated in the CAPRI scoring rounds, offered for most targets, submitting ~170 models per target. The prediction performance, measured by the fraction of models of acceptable quality or higher submitted across all predictors groups, was very good to excellent for the nine easy targets. Poorer performance was achieved by predictors for the 11 difficult targets, with medium and high quality models submitted for only 3 of these targets. A similar performance "gap" was displayed by scorer groups, highlighting yet again the unmet challenge of modeling the conformational changes of the protein components that occur upon binding or that must be accounted for in template-based modeling. Our analysis also indicates that residues in binding interfaces were less well predicted in this set of targets than in previous Rounds, providing useful insights for directions of future improvements.
Collapse
|
38
|
Baek M, Park T, Woo H, Seok C. Prediction of protein oligomer structures using GALAXY in CASP13. Proteins 2019; 87:1233-1240. [PMID: 31509276 DOI: 10.1002/prot.25814] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Revised: 08/30/2019] [Accepted: 09/07/2019] [Indexed: 01/24/2023]
Abstract
Many proteins need to form oligomers to be functional, so oligomer structures provide important clues to biological roles of proteins. Prediction of oligomer structures therefore can be a useful tool in the absence of experimentally resolved structures. In this article, we describe the server and human methods that we used to predict oligomer structures in the CASP13 experiment. Performances of the methods on the 42 CASP13 oligomer targets consisting of 30 homo-oligomers and 12 hetero-oligomers are discussed. Our server method, Seok-assembly, generated models with interface contact similarity measure greater than 0.2 as model 1 for 11 homo-oligomer targets when proper templates existed in the database. Model refinement methods such as loop modeling and molecular dynamics (MD)-based overall refinement failed to improve model qualities when target proteins have domains not covered by templates or when chains have very small interfaces. In human predictions, additional experimental data such as low-resolution electron microscopy (EM) map were utilized. EM data could assist oligomer structure prediction by providing a global shape of the complex structure.
Collapse
|
39
|
Won J, Baek M, Monastyrskyy B, Kryshtafovych A, Seok C. Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning. Proteins 2019; 87:1351-1360. [PMID: 31436360 DOI: 10.1002/prot.25804] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Revised: 08/08/2019] [Accepted: 08/19/2019] [Indexed: 12/20/2022]
Abstract
Scoring model structure is an essential component of protein structure prediction that can affect the prediction accuracy tremendously. Users of protein structure prediction results also need to score models to select the best models for their application studies. In Critical Assessment of techniques for protein Structure Prediction (CASP), model accuracy estimation methods have been tested in a blind fashion by providing models submitted by the tertiary structure prediction servers for scoring. In CASP13, model accuracy estimation results were evaluated in terms of both global and local structure accuracy. Global structure accuracy estimation was evaluated by the quality of the models selected by the global structure scores and by the absolute estimates of the global scores. Residue-wise, local structure accuracy estimations were evaluated by three different measures. A new measure introduced in CASP13 evaluates the ability to predict inaccurately modeled regions that may be improved by refinement. An intensive comparative analysis on CASP13 and the previous CASPs revealed that the tertiary structure models generated by the CASP13 servers show very distinct features. Higher consensus toward models of higher global accuracy appeared even for free modeling targets, and many models of high global accuracy were not well optimized at the atomic level. This is related to the new technology in CASP13, deep learning for tertiary contact prediction. The tertiary model structures generated by deep learning pose a new challenge for EMA (estimation of model accuracy) method developers. Model accuracy estimation itself is also an area where deep learning can potentially have an impact, although current EMA methods have not fully explored that direction.
Collapse
|
40
|
Yang J, Baek M, Seok C. GalaxyDock3: Protein–ligand docking that considers the full ligand conformational flexibility. J Comput Chem 2019; 40:2739-2748. [DOI: 10.1002/jcc.26050] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Revised: 06/18/2019] [Accepted: 07/29/2019] [Indexed: 12/27/2022]
|
41
|
Baek M, Park T, Heo L, Park C, Seok C. GalaxyHomomer: a web server for protein homo-oligomer structure prediction from a monomer sequence or structure. Nucleic Acids Res 2019; 45:W320-W324. [PMID: 28387820 PMCID: PMC5570155 DOI: 10.1093/nar/gkx246] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Accepted: 04/05/2017] [Indexed: 11/18/2022] Open
Abstract
Homo-oligomerization of proteins is abundant in nature, and is often intimately related with the physiological functions of proteins, such as in metabolism, signal transduction or immunity. Information on the homo-oligomer structure is therefore important to obtain a molecular-level understanding of protein functions and their regulation. Currently available web servers predict protein homo-oligomer structures either by template-based modeling using homo-oligomer templates selected from the protein structure database or by ab initio docking of monomer structures resolved by experiment or predicted by computation. The GalaxyHomomer server, freely accessible at http://galaxy.seoklab.org/homomer, carries out template-based modeling, ab initio docking or both depending on the availability of proper oligomer templates. It also incorporates recently developed model refinement methods that can consistently improve model quality. Moreover, the server provides additional options that can be chosen by the user depending on the availability of information on the monomer structure, oligomeric state and locations of unreliable/flexible loops or termini. The performance of the server was better than or comparable to that of other available methods when tested on benchmark sets and in a recent CASP performed in a blind fashion.
Collapse
|
42
|
Park T, Baek M, Lee H, Seok C. GalaxyTongDock: Symmetric and asymmetric ab initio protein-protein docking web server with improved energy parameters. J Comput Chem 2019; 40:2413-2417. [PMID: 31173387 DOI: 10.1002/jcc.25874] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2018] [Revised: 04/27/2019] [Accepted: 05/20/2019] [Indexed: 12/21/2022]
Abstract
Protein-protein docking methods are spotlighted for their roles in providing insights into protein-protein interactions in the absence of full structural information by experiment. GalaxyTongDock is an ab initio protein-protein docking web server that performs rigid-body docking just like ZDOCK but with improved energy parameters. The energy parameters were trained by iterative docking and parameter search so that more native-like structures are selected as top rankers. GalaxyTongDock performs asymmetric docking of two different proteins (GalaxyTongDock_A) and symmetric docking of homo-oligomeric proteins with Cn and Dn symmetries (GalaxyTongDock_C and GalaxyTongDock_D). Performance tests on an unbound docking benchmark set for asymmetric docking and a model docking benchmark set for symmetric docking showed that GalaxyTongDock is better or comparable to other state-of-the-art methods. Experimental and/or evolutionary information on binding interfaces can be easily incorporated by using block and interface options. GalaxyTongDock web server is freely available at http://galaxy.seoklab.org/tongdock. © 2019 Wiley Periodicals, Inc.
Collapse
|
43
|
Oh SH, Sung YH, Kim I, Sim CK, Lee JH, Baek M, Pack CG, Seok C, Seo EJ, Lee MS, Kim KM. Novel Compound Heterozygote Mutation in IL10RA in a Patient With Very Early-Onset Inflammatory Bowel Disease. Inflamm Bowel Dis 2019; 25:498-509. [PMID: 30462267 DOI: 10.1093/ibd/izy353] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Indexed: 12/25/2022]
Abstract
BACKGROUND Very early-onset inflammatory bowel disease (VEO-IBD) is often associated with monogenetic disorders. IL-10RA deficiency is one of the major causal mutations in VEO-IBD. Here, we aimed to identify the causal mutation associated with severe IBD in a 1-year-old patient, validate the pathogenicity of the mutation, and characterize the mutant protein. METHODS To identify the causal mutation, targeted exome sequencing (ES) was performed using the genomic DNA from the patient. To validate the pathogenicity, IL-10RA functional tests were performed using the patient's peripheral blood mononuclear cells (PBMCs). Additionally, flow cytometry analysis, confocal microscopy on overexpressed green fluorescent protein-fused mutants, and computational analysis on the structures of IL-10RA proteins were performed. RESULTS We identified a novel compound heterozygote mutation p.[Tyr91Cys];[Pro146Alafs*40] in the IL10RA gene of the patient. The missense variant p.Tyr91Cys was previously identified but not functionally tested, and a frameshift variant, p.Pro146Alafs*40, is novel and nonfunctional. PBMCs from the patient showed defective signal transducer and activator of transcription 3 activation. The p.Tyr91Cys mutant protein failed to properly localize on the plasma membrane. The p.Tyr91Cys mutation seems to disrupt the hydrophobic core structure surrounding the tyrosine 91 residue, causing structural instability. CONCLUSIONS Targeted ES and linkage analysis identified novel compound heterozygous mutations p.[Tyr91Cys];[Pro146Alafs*40] in the IL10RA gene of a child with severe VEO-IBD. p.Tyr91Cys proteins were functionally defective in IL-10RA signaling and failed to properly localize on the plasma membrane, probably due to its structural instability.
Collapse
|
44
|
Ngo TD, Oh C, Mizar P, Baek M, Park KS, Nguyen L, Byeon H, Yoon S, Ryu Y, Ryu BH, Kim TD, Yang JW, Seok C, Lee SS, Kim KK. Structural Basis for the Enantioselectivity of Esterase Est-Y29 toward (S)-Ketoprofen. ACS Catal 2018. [DOI: 10.1021/acscatal.8b02797] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
45
|
Keasar C, McGuffin LJ, Wallner B, Chopra G, Adhikari B, Bhattacharya D, Blake L, Bortot LO, Cao R, Dhanasekaran BK, Dimas I, Faccioli RA, Faraggi E, Ganzynkowicz R, Ghosh S, Ghosh S, Giełdoń A, Golon L, He Y, Heo L, Hou J, Khan M, Khatib F, Khoury GA, Kieslich C, Kim DE, Krupa P, Lee GR, Li H, Li J, Lipska A, Liwo A, Maghrabi AHA, Mirdita M, Mirzaei S, Mozolewska MA, Onel M, Ovchinnikov S, Shah A, Shah U, Sidi T, Sieradzan AK, Ślusarz M, Ślusarz R, Smadbeck J, Tamamis P, Trieber N, Wirecki T, Yin Y, Zhang Y, Bacardit J, Baranowski M, Chapman N, Cooper S, Defelicibus A, Flatten J, Koepnick B, Popović Z, Zaborowski B, Baker D, Cheng J, Czaplewski C, Delbem ACB, Floudas C, Kloczkowski A, Ołdziej S, Levitt M, Scheraga H, Seok C, Söding J, Vishveshwara S, Xu D, Crivelli SN. An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12. Sci Rep 2018; 8:9939. [PMID: 29967418 PMCID: PMC6028396 DOI: 10.1038/s41598-018-26812-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 05/17/2018] [Indexed: 01/14/2023] Open
Abstract
Every two years groups worldwide participate in the Critical Assessment of Protein Structure Prediction (CASP) experiment to blindly test the strengths and weaknesses of their computational methods. CASP has significantly advanced the field but many hurdles still remain, which may require new ideas and collaborations. In 2012 a web-based effort called WeFold, was initiated to promote collaboration within the CASP community and attract researchers from other fields to contribute new ideas to CASP. Members of the WeFold coopetition (cooperation and competition) participated in CASP as individual teams, but also shared components of their methods to create hybrid pipelines and actively contributed to this effort. We assert that the scale and diversity of integrative prediction pipelines could not have been achieved by any individual lab or even by any collaboration among a few partners. The models contributed by the participating groups and generated by the pipelines are publicly available at the WeFold website providing a wealth of data that remains to be tapped. Here, we analyze the results of the 2014 and 2016 pipelines showing improvements according to the CASP assessment as well as areas that require further adjustments and research.
Collapse
|
46
|
Hribar-Lee B, Seok C, Coutsias E, Lukšič M. Tribute to Ken A. Dill. J Phys Chem B 2018; 122:5261-5262. [DOI: 10.1021/acs.jpcb.8b02470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
47
|
Won J, Lee GR, Park H, Seok C. GalaxyGPCRloop: Template-Based and Ab Initio Structure Sampling of the Extracellular Loops of G-Protein-Coupled Receptors. J Chem Inf Model 2018; 58:1234-1243. [DOI: 10.1021/acs.jcim.8b00148] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
48
|
Rie Lee G, Seok C, Buck M. From Single Structures to Ensembles: Application of the Galaxy Program Suite to Ubiquitin, Cyclophilin a and PTP1B. Biophys J 2018. [DOI: 10.1016/j.bpj.2017.11.3144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
|
49
|
Baek M, Seok C. Improving Docking Performance of Large Flexible Ligands using Hot Spot Information Predicted by Fragment Docking. Biophys J 2018. [DOI: 10.1016/j.bpj.2017.11.354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
50
|
Lensink MF, Velankar S, Baek M, Heo L, Seok C, Wodak SJ. The challenge of modeling protein assemblies: the CASP12-CAPRI experiment. Proteins 2017; 86 Suppl 1:257-273. [PMID: 29127686 DOI: 10.1002/prot.25419] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 10/31/2017] [Accepted: 11/07/2017] [Indexed: 12/18/2022]
Abstract
We present the quality assessment of 5613 models submitted by predictor groups from both CAPRI and CASP for the total of 15 most tractable targets from the second joint CASP-CAPRI protein assembly prediction experiment. These targets comprised 12 homo-oligomers and 3 hetero-complexes. The bulk of the analysis focuses on 10 targets (of CAPRI Round 37), which included all 3 hetero-complexes, and whose protein chains or the full assembly could be readily modeled from structural templates in the PDB. On average, 28 CAPRI groups and 10 CASP groups (including automatic servers), submitted models for each of these 10 targets. Additionally, about 16 groups participated in the CAPRI scoring experiments. A range of acceptable to high quality models were obtained for 6 of the 10 Round 37 targets, for which templates were available for the full assembly. Poorer results were achieved for the remaining targets due to the lower quality of the templates available for the full complex or the individual protein chains, highlighting the unmet challenge of modeling the structural adjustments of the protein components that occur upon binding or which must be accounted for in template-based modeling. On the other hand, our analysis indicated that residues in binding interfaces were correctly predicted in a sizable fraction of otherwise poorly modeled assemblies and this with higher accuracy than published methods that do not use information on the binding partner. Lastly, the strengths and weaknesses of the assessment methods are evaluated and improvements suggested.
Collapse
|