1
|
Jiang H, Jude KM, Wu K, Fallas J, Ueda G, Brunette TJ, Hicks DR, Pyles H, Yang A, Carter L, Lamb M, Li X, Levine PM, Stewart L, Garcia KC, Baker D. De novo design of buttressed loops for sculpting protein functions. Nat Chem Biol 2024:10.1038/s41589-024-01632-2. [PMID: 38816644 DOI: 10.1038/s41589-024-01632-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Accepted: 04/29/2024] [Indexed: 06/01/2024]
Abstract
In natural proteins, structured loops have central roles in molecular recognition, signal transduction and enzyme catalysis. However, because of the intrinsic flexibility and irregularity of loop regions, organizing multiple structured loops at protein functional sites has been very difficult to achieve by de novo protein design. Here we describe a solution to this problem that designs tandem repeat proteins with structured loops (9-14 residues) buttressed by extensive hydrogen bonding interactions. Experimental characterization shows that the designs are monodisperse, highly soluble, folded and thermally stable. Crystal structures are in close agreement with the design models, with the loops structured and buttressed as designed. We demonstrate the functionality afforded by loop buttressing by designing and characterizing binders for extended peptides in which the loops form one side of an extended binding pocket. The ability to design multiple structured loops should contribute generally to efforts to design new protein functions.
Collapse
Affiliation(s)
- Hanlun Jiang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Kevin M Jude
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA, USA
| | - Kejia Wu
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Biological Physics, Structure and Design Graduate Program, University of Washington, Seattle, WA, USA
| | - Jorge Fallas
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - George Ueda
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - T J Brunette
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Derrick R Hicks
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Harley Pyles
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Aerin Yang
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA, USA
| | - Lauren Carter
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Mila Lamb
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Xinting Li
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Paul M Levine
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lance Stewart
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - K Christopher Garcia
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA, USA.
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
2
|
Crean RM, Corbella M, Calixto AR, Hengge AC, Kamerlin SCL. Sequence - dynamics - function relationships in protein tyrosine phosphatases. QRB DISCOVERY 2024; 5:e4. [PMID: 38689874 PMCID: PMC11058592 DOI: 10.1017/qrd.2024.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 09/20/2023] [Accepted: 10/24/2023] [Indexed: 05/02/2024] Open
Abstract
Protein tyrosine phosphatases (PTPs) are crucial regulators of cellular signaling. Their activity is regulated by the motion of a conserved loop, the WPD-loop, from a catalytically inactive open to a catalytically active closed conformation. WPD-loop motion optimally positions a catalytically critical residue into the active site, and is directly linked to the turnover number of these enzymes. Crystal structures of chimeric PTPs constructed by grafting parts of the WPD-loop sequence of PTP1B onto the scaffold of YopH showed WPD-loops in a wide-open conformation never previously observed in either parent enzyme. This wide-open conformation has, however, been observed upon binding of small molecule inhibitors to other PTPs, suggesting the potential of targeting it for drug discovery efforts. Here, we have performed simulations of both enzymes and show that there are negligible energetic differences in the chemical step of catalysis, but significant differences in the dynamical properties of the WPD-loop. Detailed interaction network analysis provides insight into the molecular basis for this population shift to a wide-open conformation. Taken together, our study provides insight into the links between loop dynamics and chemistry in these YopH variants specifically, and how WPD-loop dynamic can be engineered through modification of the internal protein interaction network.
Collapse
Affiliation(s)
- Rory M. Crean
- Department of Chemistry – BMC, Uppsala University, Uppsala, Sweden
| | - Marina Corbella
- Department of Chemistry – BMC, Uppsala University, Uppsala, Sweden
- Departament de Química Inorgànica i Orgànica (Secció de Química Orgànica) & Institut de Química Teòrica i Computacional (IQTCUB), Universitat de Barcelona, Barcelona, Spain
| | - Ana R. Calixto
- Department of Chemistry – BMC, Uppsala University, Uppsala, Sweden
- LAQV, REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
| | - Alvan C. Hengge
- Department of Chemistry and Biochemistry, Utah State University, Logan, UT, USA
| | - Shina C. L. Kamerlin
- Department of Chemistry – BMC, Uppsala University, Uppsala, Sweden
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
3
|
Wang T, Wang L, Zhang X, Shen C, Zhang O, Wang J, Wu J, Jin R, Zhou D, Chen S, Liu L, Wang X, Hsieh CY, Chen G, Pan P, Kang Y, Hou T. Comprehensive assessment of protein loop modeling programs on large-scale datasets: prediction accuracy and efficiency. Brief Bioinform 2023; 25:bbad486. [PMID: 38171930 PMCID: PMC10764206 DOI: 10.1093/bib/bbad486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 12/04/2023] [Accepted: 12/05/2023] [Indexed: 01/05/2024] Open
Abstract
Protein loops play a critical role in the dynamics of proteins and are essential for numerous biological functions, and various computational approaches to loop modeling have been proposed over the past decades. However, a comprehensive understanding of the strengths and weaknesses of each method is lacking. In this work, we constructed two high-quality datasets (i.e. the General dataset and the CASP dataset) and systematically evaluated the accuracy and efficiency of 13 commonly used loop modeling approaches from the perspective of loop lengths, protein classes and residue types. The results indicate that the knowledge-based method FREAD generally outperforms the other tested programs in most cases, but encountered challenges when predicting loops longer than 15 and 30 residues on the CASP and General datasets, respectively. The ab initio method Rosetta NGK demonstrated exceptional modeling accuracy for short loops with four to eight residues and achieved the highest success rate on the CASP dataset. The well-known AlphaFold2 and RoseTTAFold require more resources for better performance, but they exhibit promise for predicting loops longer than 16 and 30 residues in the CASP and General datasets. These observations can provide valuable insights for selecting suitable methods for specific loop modeling tasks and contribute to future advancements in the field.
Collapse
Affiliation(s)
- Tianyue Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Langcheng Wang
- Department of Pathology, New York University Medical Center, 550 First Avenue, New York, NY 10016, USA
| | - Xujun Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Chao Shen
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Odin Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Jike Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Jialu Wu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Ruofan Jin
- College of Life Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Donghao Zhou
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, Guangdong, China
| | - Shicheng Chen
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Liwei Liu
- Advanced Computing and Storage Laboratory, Central Research Institute, 2012 Laboratories, Huawei Technologies Co., Ltd., Shenzhen 518129, Guangdong, China
| | - Xiaorui Wang
- State Key Laboratory of Quality Research in Chinese Medicines, Macau University of Science and Technology, Macao, China
| | - Chang-Yu Hsieh
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Guangyong Chen
- Zhejiang Lab, Zhejiang University, Hangzhou 311121, Zhejiang, China
| | - Peichen Pan
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Yu Kang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Tingjun Hou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| |
Collapse
|
4
|
Yasuzawa Y, Shibuya R, Senga Y, Miyafusa T, Honda S. Determination of the optimal connector length to enhance stability of backbone-circularized granulocyte colony-stimulating factor. FEBS Open Bio 2023; 13:1910-1921. [PMID: 37574212 PMCID: PMC10549227 DOI: 10.1002/2211-5463.13692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 07/19/2023] [Accepted: 08/11/2023] [Indexed: 08/15/2023] Open
Abstract
Improving protein stability is important for industrial applications, and one promising method for achieving this is backbone circularization. As connector length affects stability, predicting and elucidating a more stable connector length is necessary for development of the backbone circularization method. However, the relationship between connector length and protein stability has not been completely elucidated. Here, we determined the most stable connector length for granulocyte colony-stimulating factor by changing one residue at a time to produce connector length variants and then measuring their thermal stability. Analysis of the local structures obtained from the predicted structures of the circularized variants revealed that an approach using helix length, dihedral backbone angle, and number of unbonded hydrogen bond donors and acceptors is suitable for identifying connector lengths with higher stability.
Collapse
Affiliation(s)
- Yosuke Yasuzawa
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier SciencesThe University of TokyoJapan
| | - Risa Shibuya
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier SciencesThe University of TokyoJapan
- Biomedical Research InstituteNational Institute of Advanced Industrial Science and Technology (AIST)TsukubaJapan
| | - Yukako Senga
- Biomedical Research InstituteNational Institute of Advanced Industrial Science and Technology (AIST)TsukubaJapan
| | - Takamitsu Miyafusa
- Biomedical Research InstituteNational Institute of Advanced Industrial Science and Technology (AIST)TsukubaJapan
- Bioproduction Research InstituteNational Institute of Advanced Industrial Science and Technology (AIST)TsukubaJapan
| | - Shinya Honda
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier SciencesThe University of TokyoJapan
- Biomedical Research InstituteNational Institute of Advanced Industrial Science and Technology (AIST)TsukubaJapan
| |
Collapse
|
5
|
Jiang H, Jude KM, Wu K, Fallas J, Ueda G, Brunette TJ, Hicks D, Pyles H, Yang A, Carter L, Lamb M, Li X, Levine PM, Stewart L, Garcia KC, Baker D. De novo design of buttressed loops for sculpting protein functions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.22.554384. [PMID: 37662224 PMCID: PMC10473674 DOI: 10.1101/2023.08.22.554384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
In natural proteins, structured loops play central roles in molecular recognition, signal transduction and enzyme catalysis. However, because of the intrinsic flexibility and irregularity of loop regions, organizing multiple structured loops at protein functional sites has been very difficult to achieve by de novo protein design. Here we describe a solution to this problem that generates structured loops buttressed by extensive hydrogen bonding interactions with two neighboring loops and with secondary structure elements. We use this approach to design tandem repeat proteins with buttressed loops ranging from 9 to 14 residues in length. Experimental characterization shows the designs are folded and monodisperse, highly soluble, and thermally stable. Crystal structures are in close agreement with the computational design models, with the loops structured and buttressed by their neighbors as designed. We demonstrate the functionality afforded by loop buttressing by designing and characterizing binders for extended peptides in which the loops form one side of an extended binding pocket. The ability to design multiple structured loops should contribute quite generally to efforts to design new protein functions.
Collapse
Affiliation(s)
- Hanlun Jiang
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Kevin M Jude
- Howard Hughes Medical Institute, Stanford University School of Medicine
| | - Kejia Wu
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
- Biological Physics, Structure and Design Graduate Program, University of Washington
| | - Jorge Fallas
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - George Ueda
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - T J Brunette
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Derrick Hicks
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Harley Pyles
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Aerin Yang
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine
| | - Lauren Carter
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Mila Lamb
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Xinting Li
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Paul M Levine
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - Lance Stewart
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
| | - K Christopher Garcia
- Howard Hughes Medical Institute, Stanford University School of Medicine
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine
- Department of Structural Biology, Stanford University School of Medicine
| | - David Baker
- Department of Biochemistry, University of Washington
- Institute for Protein Design, University of Washington
- Howard Hughes Medical Institute, Stanford University School of Medicine
- Howard Hughes Medical Institute, University of Washington
| |
Collapse
|
6
|
Kalita P, Tripathi T, Padhi AK. Computational Protein Design for COVID-19 Research and Emerging Therapeutics. ACS CENTRAL SCIENCE 2023; 9:602-613. [PMID: 37122454 PMCID: PMC10042144 DOI: 10.1021/acscentsci.2c01513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Indexed: 05/03/2023]
Abstract
As the world struggles with the ongoing COVID-19 pandemic, unprecedented obstacles have continuously been traversed as new SARS-CoV-2 variants continually emerge. Infectious disease outbreaks are unavoidable, but the knowledge gained from the successes and failures will help create a robust health management system to deal with such pandemics. Previously, scientists required years to develop diagnostics, therapeutics, or vaccines; however, we have seen that, with the rapid deployment of high-throughput technologies and unprecedented scientific collaboration worldwide, breakthrough discoveries can be accelerated and insights broadened. Computational protein design (CPD) is a game-changing new technology that has provided alternative therapeutic strategies for pandemic management. In addition to the development of peptide-based inhibitors, miniprotein binders, decoys, biosensors, nanobodies, and monoclonal antibodies, CPD has also been used to redesign native SARS-CoV-2 proteins and human ACE2 receptors. We discuss how novel CPD strategies have been exploited to develop rationally designed and robust COVID-19 treatment strategies.
Collapse
Affiliation(s)
- Parismita Kalita
- Molecular
and Structural Biophysics Laboratory, Department of Biochemistry, North-Eastern Hill University, Shillong 793022, India
| | - Timir Tripathi
- Molecular
and Structural Biophysics Laboratory, Department of Biochemistry, North-Eastern Hill University, Shillong 793022, India
- Regional
Director’s Office, Indira Gandhi
National Open University, Regional Centre Kohima, Kenuozou, Kohima 797001, India
| | - Aditya K. Padhi
- Laboratory
for Computational Biology & Biomolecular Design, School of Biochemical
Engineering, Indian Institute of Technology
(BHU), Varanasi 221005, Uttar Pradesh, India
| |
Collapse
|
7
|
Krivacic C, Kundert K, Pan X, Pache RA, Liu L, Conchúir SO, Jeliazkov JR, Gray JJ, Thompson MC, Fraser JS, Kortemme T. Accurate positioning of functional residues with robotics-inspired computational protein design. Proc Natl Acad Sci U S A 2022; 119:e2115480119. [PMID: 35254891 PMCID: PMC8931229 DOI: 10.1073/pnas.2115480119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 01/27/2022] [Indexed: 11/18/2022] Open
Abstract
SignificanceComputational protein design promises to advance applications in medicine and biotechnology by creating proteins with many new and useful functions. However, new functions require the design of specific and often irregular atom-level geometries, which remains a major challenge. Here, we develop computational methods that design and predict local protein geometries with greater accuracy than existing methods. Then, as a proof of concept, we leverage these methods to design new protein conformations in the enzyme ketosteroid isomerase that change the protein's preference for a key functional residue. Our computational methods are openly accessible and can be applied to the design of other intricate geometries customized for new user-defined protein functions.
Collapse
Affiliation(s)
- Cody Krivacic
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | - Kale Kundert
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Biophysics Graduate Program, University of California, San Francisco, CA 94158
| | - Xingjie Pan
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | - Roland A. Pache
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | - Lin Liu
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | - Shane O Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | | | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218
| | - Michael C. Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | - James S. Fraser
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Biophysics Graduate Program, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
| | - Tanja Kortemme
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA 94158
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Biophysics Graduate Program, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
| |
Collapse
|
8
|
Barozet A, Molloy K, Vaisset M, Zanon C, Fauret P, Siméon T, Cortés J. MoMA-LoopSampler: a web server to exhaustively sample protein loop conformations. Bioinformatics 2022; 38:552-553. [PMID: 34432000 DOI: 10.1093/bioinformatics/btab584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 06/08/2021] [Accepted: 08/23/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY MoMA-LoopSampler is a sampling method that globally explores the conformational space of flexible protein loops. It combines a large structural library of three-residue fragments and a novel reinforcement-learning-based approach to accelerate the sampling process while maintaining diversity. The method generates a set of statistically likely loop states satisfying geometric constraints, and its ability to sample experimentally observed conformations has been demonstrated. This paper presents a web user interface to MoMA-LoopSampler through the illustration of a typical use-case. AVAILABILITY AND IMPLEMENTATION MoMA-LoopSampler is freely available at: https://moma.laas.fr/applications/LoopSampler/. We recommend users to create an account, but anonymous access is possible. In most cases, jobs are completed within a few minutes. The waiting time may increase depending on the server load, but it very rarely exceeds an hour. For users requiring more intensive use, binaries can be provided upon request. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Amélie Barozet
- LAAS-CNRS, Université de Toulouse, CNRS, F-31400 Toulouse, France
| | - Kevin Molloy
- Computer Science, James Madison University, Harrisonburg, VA 22181, USA.,School of Biology, James Madison University, Harrisonburg, VA 22181, USA
| | - Marc Vaisset
- LAAS-CNRS, Université de Toulouse, CNRS, F-31400 Toulouse, France
| | - Christophe Zanon
- LAAS-CNRS, Université de Toulouse, CNRS, F-31400 Toulouse, France
| | - Pierre Fauret
- LAAS-CNRS, Université de Toulouse, CNRS, F-31400 Toulouse, France
| | - Thierry Siméon
- LAAS-CNRS, Université de Toulouse, CNRS, F-31400 Toulouse, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, F-31400 Toulouse, France
| |
Collapse
|
9
|
Barozet A, Chacón P, Cortés J. Current approaches to flexible loop modeling. Curr Res Struct Biol 2021; 3:187-191. [PMID: 34409304 PMCID: PMC8361254 DOI: 10.1016/j.crstbi.2021.07.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 06/30/2021] [Accepted: 07/25/2021] [Indexed: 01/14/2023] Open
Abstract
Loops are key components of protein structures, involved in many biological functions. Due to their conformational variability, the structural investigation of loops is a difficult topic, requiring a combination of experimental and computational methods. This paper provides a brief overview of current computational approaches to flexible loop modeling, and presents the main ingredients of the most standard protocols. Despite great progress in recent years, accurately modeling the conformational variability of long flexible loops remains a challenging problem. Future advances in this field will likely come from a tight coupling of experimental and computational techniques, which would enable a better understanding of the relationships between loop sequence, structural flexibility, and functional roles. In fine, accurate loop modeling will open the road to loop design problems of interest for applications in biomedicine and biotechnology.
Collapse
Affiliation(s)
- Amélie Barozet
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Pablo Chacón
- Department of Biological Physical Chemistry, Rocasolano Physical Chemistry Institute C.S.I.C., Madrid, Spain
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| |
Collapse
|
10
|
Clerc I, Sagar A, Barducci A, Sibille N, Bernadó P, Cortés J. The diversity of molecular interactions involving intrinsically disordered proteins: A molecular modeling perspective. Comput Struct Biotechnol J 2021; 19:3817-3828. [PMID: 34285781 PMCID: PMC8273358 DOI: 10.1016/j.csbj.2021.06.031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 06/17/2021] [Accepted: 06/21/2021] [Indexed: 01/15/2023] Open
Abstract
Intrinsically Disordered Proteins and Regions (IDPs/IDRs) are key components of a multitude of biological processes. Conformational malleability enables IDPs/IDRs to perform very specialized functions that cannot be accomplished by globular proteins. The functional role for most of these proteins is related to the recognition of other biomolecules to regulate biological processes or as a part of signaling pathways. Depending on the extent of disorder, the number of interacting sites and the type of partner, very different architectures for the resulting assemblies are possible. More recently, molecular condensates with liquid-like properties composed of multiple copies of IDPs and nucleic acids have been proven to regulate key processes in eukaryotic cells. The structural and kinetic details of disordered biomolecular complexes are difficult to unveil experimentally due to their inherent conformational heterogeneity. Computational approaches, alone or in combination with experimental data, have emerged as unavoidable tools to understand the functional mechanisms of this elusive type of assemblies. The level of description used, all-atom or coarse-grained, strongly depends on the size of the molecular systems and on the timescale of the investigated mechanism. In this mini-review, we describe the most relevant architectures found for molecular interactions involving IDPs/IDRs and the computational strategies applied for their investigation.
Collapse
Affiliation(s)
- Ilinka Clerc
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Amin Sagar
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Alessandro Barducci
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Nathalie Sibille
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Pau Bernadó
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| |
Collapse
|
11
|
Wiese JG, Shanmugaratnam S, Höcker B. Extension of a de novo TIM barrel with a rationally designed secondary structure element. Protein Sci 2021; 30:982-989. [PMID: 33723882 PMCID: PMC8040861 DOI: 10.1002/pro.4064] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2020] [Revised: 02/02/2021] [Accepted: 03/09/2021] [Indexed: 11/12/2022]
Abstract
The ability to construct novel enzymes is a major aim in de novo protein design. A popular enzyme fold for design attempts is the TIM barrel. This fold is a common topology for enzymes and can harbor many diverse reactions. The recent de novo design of a four-fold symmetric TIM barrel provides a well understood minimal scaffold for potential enzyme designs. Here we explore opportunities to extend and diversify this scaffold by adding a short de novo helix on top of the barrel. Due to the size of the protein, we developed a design pipeline based on computational ab initio folding that solves a less complex sub-problem focused around the helix and its vicinity and adapt it to the entire protein. We provide biochemical characterization and a high-resolution X-ray structure for one variant and compare it to our design model. The successful extension of this robust TIM-barrel scaffold opens opportunities to diversify it towards more pocket like arrangements and as such can be considered a building block for future design of binding or catalytic sites.
Collapse
Affiliation(s)
- Jonas Gregor Wiese
- Max Planck Institute for Developmental BiologyTübingenGermany
- Present address:
Technical University of MunichMunichGermany
| | - Sooruban Shanmugaratnam
- Max Planck Institute for Developmental BiologyTübingenGermany
- University of Bayreuth, Department for BiochemistryBayreuthGermany
| | - Birte Höcker
- Max Planck Institute for Developmental BiologyTübingenGermany
- University of Bayreuth, Department for BiochemistryBayreuthGermany
| |
Collapse
|
12
|
Mahajan SP, Srinivasan Y, Labonte JW, DeLisa MP, Gray JJ. Structural basis for peptide substrate specificities of glycosyltransferase GalNAc-T2. ACS Catal 2021; 11:2977-2991. [PMID: 34322281 DOI: 10.1021/acscatal.0c04609] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The polypeptide N-acetylgalactosaminyl transferase (GalNAc-T) enzyme family initiates O-linked mucin-type glycosylation. The family constitutes 20 isoenzymes in humans. GalNAc-Ts exhibit both redundancy and finely tuned specificity for a wide range of peptide substrates. In this work, we deciphered the sequence and structural motifs that determine the peptide substrate preferences for the GalNAc-T2 isoform. Our approach involved sampling and characterization of peptide-enzyme conformations obtained from Rosetta Monte Carlo-minimization-based flexible docking. We computationally scanned 19 amino acid residues at positions -1 and +1 of an eight-residue peptide substrate, which comprised a dataset of 361 (19x19) peptides with previously characterized experimental GalNAc-T2 glycosylation efficiencies. The calculations recapitulated experimental specificity data, successfully discriminating between glycosylatable and non-glycosylatable peptides with a probability of 96.5% (ROC-AUC score), a balanced accuracy of 85.5% and a false positive rate of 7.3%. The glycosylatable peptide substrates viz. peptides with proline, serine, threonine, and alanine at the -1 position of the peptide preferentially exhibited cognate sequon-like conformations. The preference for specific residues at the -1 position of the peptide was regulated by enzyme residues R362, K363, Q364, H365 and W331, which modulate the pocket size and specific enzyme-peptide interactions. For the +1 position of the peptide, enzyme residues K281 and K363 formed gating interactions with aromatics and glutamines at the +1 position of the peptide, leading to modes of peptide-binding sub-optimal for catalysis. Overall, our work revealed enzyme features that lead to the finely tuned specificity observed for a broad range of peptide substrates for the GalNAc-T2 enzyme. We anticipate that the key sequence and structural motifs can be extended to analyze specificities of other isoforms of the GalNAc-T family and can be used to guide design of variants with tailored specificity.
Collapse
Affiliation(s)
- Sai Pooja Mahajan
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Yashes Srinivasan
- Department of Bioengineering, University of California—Los Angeles, Los Angeles, California 90095, United States
| | - Jason W. Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Department of Chemistry, Franklin & Marshall College, Lancaster, Pennsylvania 17604, United States
| | - Matthew P. DeLisa
- Robert Frederick Smith School of Chemical and Biomolecular Engineering, Department of Microbiology, and Nancy E. and Peter C. Meinig School of Biomedical Engineering, Biochemistry, Molecular and Cell Biology, Cornell University, Ithaca, New York 14853, United States
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, Maryland 21224, United States
| |
Collapse
|
13
|
Timonina D, Sharapova Y, Švedas V, Suplatov D. Bioinformatic analysis of subfamily-specific regions in 3D-structures of homologs to study functional diversity and conformational plasticity in protein superfamilies. Comput Struct Biotechnol J 2021; 19:1302-1311. [PMID: 33738079 PMCID: PMC7933735 DOI: 10.1016/j.csbj.2021.02.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Revised: 02/08/2021] [Accepted: 02/09/2021] [Indexed: 02/07/2023] Open
Abstract
Local 3D-structural differences in homologous proteins contribute to functional diversity observed in a superfamily, but so far received little attention as bioinformatic analysis was usually carried out at the level of amino acid sequences. We have developed Zebra3D - the first-of-its-kind bioinformatic software for systematic analysis of 3D-alignments of protein families using machine learning. The new tool identifies subfamily-specific regions (SSRs) - patterns of local 3D-structure (i.e. single residues, loops, or secondary structure fragments) that are spatially equivalent within families/subfamilies, but are different among them, and thus can be associated with functional diversity and function-related conformational plasticity. Bioinformatic analysis of protein superfamilies by Zebra3D can be used to study 3D-determinants of catalytic activity and specific accommodation of ligands, help to prepare focused libraries for directed evolution or assist development of chimeric enzymes with novel properties by exchange of equivalent regions between homologs, and to characterize plasticity in binding sites. A companion Mustguseal web-server is available to automatically construct a 3D-alignment of functionally diverse proteins, thus reducing the minimal input required to operate Zebra3D to a single PDB code. The Zebra3D + Mustguseal combined approach provides the opportunity to systematically explore the value of SSRs in superfamilies and to use this information for protein design and drug discovery. The software is available open-access at https://biokinet.belozersky.msu.ru/Zebra3D.
Collapse
Affiliation(s)
- Daria Timonina
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
| | - Yana Sharapova
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
| | - Vytas Švedas
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
| | - Dmitry Suplatov
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
- Corresponding author.
| |
Collapse
|
14
|
Planas-Iglesias J, Marques SM, Pinto GP, Musil M, Stourac J, Damborsky J, Bednar D. Computational design of enzymes for biotechnological applications. Biotechnol Adv 2021; 47:107696. [PMID: 33513434 DOI: 10.1016/j.biotechadv.2021.107696] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/14/2022]
Abstract
Enzymes are the natural catalysts that execute biochemical reactions upholding life. Their natural effectiveness has been fine-tuned as a result of millions of years of natural evolution. Such catalytic effectiveness has prompted the use of biocatalysts from multiple sources on different applications, including the industrial production of goods (food and beverages, detergents, textile, and pharmaceutics), environmental protection, and biomedical applications. Natural enzymes often need to be improved by protein engineering to optimize their function in non-native environments. Recent technological advances have greatly facilitated this process by providing the experimental approaches of directed evolution or by enabling computer-assisted applications. Directed evolution mimics the natural selection process in a highly accelerated fashion at the expense of arduous laboratory work and economic resources. Theoretical methods provide predictions and represent an attractive complement to such experiments by waiving their inherent costs. Computational techniques can be used to engineer enzymatic reactivity, substrate specificity and ligand binding, access pathways and ligand transport, and global properties like protein stability, solubility, and flexibility. Theoretical approaches can also identify hotspots on the protein sequence for mutagenesis and predict suitable alternatives for selected positions with expected outcomes. This review covers the latest advances in computational methods for enzyme engineering and presents many successful case studies.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Sérgio M Marques
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Gaspar P Pinto
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic; IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 61266 Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic.
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic.
| |
Collapse
|
15
|
Lucas JE, Kortemme T. New computational protein design methods for de novo small molecule binding sites. PLoS Comput Biol 2020; 16:e1008178. [PMID: 33017412 PMCID: PMC7575090 DOI: 10.1371/journal.pcbi.1008178] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 10/20/2020] [Accepted: 07/22/2020] [Indexed: 11/19/2022] Open
Abstract
Protein binding to small molecules is fundamental to many biological processes, yet it remains challenging to predictively design this functionality de novo. Current state-of-the-art computational design methods typically rely on existing small molecule binding sites or protein scaffolds with existing shape complementarity for a target ligand. Here we introduce new methods that utilize pools of discrete contacts between protein side chains and defined small molecule ligand substructures (ligand fragments) observed in the Protein Data Bank. We use the Rosetta Molecular Modeling Suite to recombine protein side chains in these contact pools to generate hundreds of thousands of energetically favorable binding sites for a target ligand. These composite binding sites are built into existing scaffold proteins matching the intended binding site geometry with high accuracy. In addition, we apply pools of side chain rotamers interacting with the target ligand to augment Rosetta's conventional design machinery and improve key metrics known to be predictive of design success. We demonstrate that our method reliably builds diverse binding sites into different scaffold proteins for a variety of target molecules. Our generalizable de novo ligand binding site design method provides a foundation for versatile design of protein to interface previously unattainable molecules for applications in medical diagnostics and synthetic biology.
Collapse
Affiliation(s)
- James E. Lucas
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, United States of America
| | - Tanja Kortemme
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, United States of America
| |
Collapse
|
16
|
Pan X, Thompson MC, Zhang Y, Liu L, Fraser JS, Kelly MJS, Kortemme T. Expanding the space of protein geometries by computational design of de novo fold families. Science 2020; 369:1132-1136. [PMID: 32855341 DOI: 10.1126/science.abc0881] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 07/14/2020] [Indexed: 01/03/2023]
Abstract
Naturally occurring proteins vary the precise geometries of structural elements to create distinct shapes optimal for function. We present a computational design method, loop-helix-loop unit combinatorial sampling (LUCS), that mimics nature's ability to create families of proteins with the same overall fold but precisely tunable geometries. Through near-exhaustive sampling of loop-helix-loop elements, LUCS generates highly diverse geometries encompassing those found in nature but also surpassing known structure space. Biophysical characterization showed that 17 (38%) of 45 tested LUCS designs encompassing two different structural topologies were well folded, including 16 with designed non-native geometries. Four experimentally solved structures closely matched the designs. LUCS greatly expands the designable structure space and offers a new paradigm for designing proteins with tunable geometries that may be customizable for novel functions.
Collapse
Affiliation(s)
- Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA. .,UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA, USA
| | - Michael C Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Yang Zhang
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Lin Liu
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA.,Quantitative Biosciences Institute, University of California, San Francisco, CA, USA
| | - Mark J S Kelly
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA. .,UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA, USA.,Quantitative Biosciences Institute, University of California, San Francisco, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
17
|
Barozet A, Bianciotto M, Vaisset M, Siméon T, Minoux H, Cortés J. Protein loops with multiple meta-stable conformations: A challenge for sampling and scoring methods. Proteins 2020; 89:218-231. [PMID: 32920900 DOI: 10.1002/prot.26008] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2019] [Revised: 08/10/2020] [Accepted: 08/25/2020] [Indexed: 12/25/2022]
Abstract
Flexible regions in proteins, such as loops, cannot be represented by a single conformation. Instead, conformational ensembles are needed to provide a more global picture. In this context, identifying statistically meaningful conformations within an ensemble generated by loop sampling techniques remains an open problem. The difficulty is primarily related to the lack of structural data about these flexible regions. With the majority of structural data coming from x-ray crystallography and ignoring plasticity, the conception and evaluation of loop scoring methods is challenging. In this work, we compare the performance of various scoring methods on a set of eight protein loops that are known to be flexible. The ability of each method to identify and select all of the known conformations is assessed, and the underlying energy landscapes are produced and projected to visualize the qualitative differences obtained when using the methods. Statistical potentials are found to provide considerable reliability despite their being designed to tradeoff accuracy for lower computational cost. On a large pool of loop models, they are capable of filtering out statistically improbable states while retaining those that resemble known (and thus likely) conformations. However, computationally expensive methods are still required for more precise assessment and structural refinement. The results also highlight the importance of employing several scaffolds for the protein, due to the high influence of small structural rearrangements in the rest of the protein over the modeled energy landscape for the loop.
Collapse
Affiliation(s)
- Amélie Barozet
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France.,Sanofi Recherche & Développement, Integrated Drug Discovery, Molecular Design Sciences, Vitry-sur-Seine, France
| | - Marc Bianciotto
- Sanofi Recherche & Développement, Integrated Drug Discovery, Molecular Design Sciences, Vitry-sur-Seine, France
| | - Marc Vaisset
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Thierry Siméon
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Hervé Minoux
- Sanofi Recherche & Développement, Integrated Drug Discovery, Molecular Design Sciences, Vitry-sur-Seine, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| |
Collapse
|
18
|
Chowdhury R, Grisewood MJ, Boorla VS, Yan Q, Pfleger BF, Maranas CD. IPRO+/-: Computational Protein Design Tool Allowing for Insertions and Deletions. Structure 2020; 28:1344-1357.e4. [PMID: 32857964 DOI: 10.1016/j.str.2020.08.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 07/01/2020] [Accepted: 08/07/2020] [Indexed: 12/30/2022]
Abstract
Insertions and deletions (indels) in protein sequences alter the residue spacing along the polypeptide backbone and consequently open up possibilities for tuning protein function in a way that is inaccessible by amino acid substitution alone. We describe an optimization-based computational protein redesign approach centered around predicting beneficial combinations of indels along with substitutions and also obtain putative substrate-docked structures for these protein variants. This modified algorithmic capability would be of interest for enzyme engineering and broadly inform other protein design tasks. We highlight this capability by (1) identifying active variants of a bacterial thioesterase enzyme ('TesA) with experimental corroboration, (2) recapitulating existing active TEM-1 β-Lactamase sequences of different sizes, and (3) identifying shorter 4-Coumarate:CoA ligases with enhanced in vitro activities toward non-native substrates. A separate PyRosetta-based open-source tool, Indel-Maker (http://www.maranasgroup.com/software.htm), has also been created to construct computational models of user-defined protein variants with specific indels and substitutions.
Collapse
Affiliation(s)
- Ratul Chowdhury
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Matthew J Grisewood
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Veda Sheersh Boorla
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Qiang Yan
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Brian F Pfleger
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Costas D Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA.
| |
Collapse
|
19
|
Cohan MC, Ruff KM, Pappu RV. Information theoretic measures for quantifying sequence-ensemble relationships of intrinsically disordered proteins. Protein Eng Des Sel 2020; 32:191-202. [PMID: 31375817 PMCID: PMC7462041 DOI: 10.1093/protein/gzz014] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 06/19/2019] [Indexed: 01/26/2023] Open
Abstract
Intrinsically disordered proteins (IDPs) contribute to a multitude of functions. De novo design of IDPs should open the door to modulating functions and phenotypes controlled by these systems. Recent design efforts have focused on compositional biases and specific sequence patterns as the design features. Analysis of the impact of these designs on sequence-function relationships indicates that individual sequence/compositional parameters are insufficient for describing sequence-function relationships in IDPs. To remedy this problem, we have developed information theoretic measures for sequence–ensemble relationships (SERs) of IDPs. These measures rely on prior availability of statistically robust conformational ensembles derived from all atom simulations. We show that the measures we have developed are useful for comparing sequence-ensemble relationships even when sequence is poorly conserved. Based on our results, we propose that de novo designs of IDPs, guided by knowledge of their SERs, should provide improved insights into their sequence–ensemble–function relationships.
Collapse
Affiliation(s)
- Megan C Cohan
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS) Washington University in St. Louis, One Brookings Drive, Campus Box 1097, St. Louis MO, USA
| | - Kiersten M Ruff
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS) Washington University in St. Louis, One Brookings Drive, Campus Box 1097, St. Louis MO, USA
| | - Rohit V Pappu
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS) Washington University in St. Louis, One Brookings Drive, Campus Box 1097, St. Louis MO, USA
| |
Collapse
|
20
|
Kuhlman B, Bradley P. Advances in protein structure prediction and design. Nat Rev Mol Cell Biol 2019; 20:681-697. [PMID: 31417196 PMCID: PMC7032036 DOI: 10.1038/s41580-019-0163-x] [Citation(s) in RCA: 373] [Impact Index Per Article: 74.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2019] [Indexed: 12/18/2022]
Abstract
The prediction of protein three-dimensional structure from amino acid sequence has been a grand challenge problem in computational biophysics for decades, owing to its intrinsic scientific interest and also to the many potential applications for robust protein structure prediction algorithms, from genome interpretation to protein function prediction. More recently, the inverse problem - designing an amino acid sequence that will fold into a specified three-dimensional structure - has attracted growing attention as a potential route to the rational engineering of proteins with functions useful in biotechnology and medicine. Methods for the prediction and design of protein structures have advanced dramatically in the past decade. Increases in computing power and the rapid growth in protein sequence and structure databases have fuelled the development of new data-intensive and computationally demanding approaches for structure prediction. New algorithms for designing protein folds and protein-protein interfaces have been used to engineer novel high-order assemblies and to design from scratch fluorescent proteins with novel or enhanced properties, as well as signalling proteins with therapeutic potential. In this Review, we describe current approaches for protein structure prediction and design and highlight a selection of the successful applications they have enabled.
Collapse
Affiliation(s)
- Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, USA.
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, USA.
| | - Philip Bradley
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
| |
Collapse
|
21
|
Gong L, Geng H, Zhang X, Gao P. Comparison of the structure and function of a chimeric peptide modified titanium surface. RSC Adv 2019; 9:26276-26282. [PMID: 35530988 PMCID: PMC9070349 DOI: 10.1039/c9ra05127a] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2019] [Accepted: 08/12/2019] [Indexed: 01/02/2023] Open
Abstract
Peri-implantitis is a plaque-initiating infectious disease that can be prevented by interfering with the initial bacterial attachment. At present, surface modification of implants using antimicrobial peptides can interfere with the adhesion of streptococci. In this study, the structure and function of chimeric peptides were compared to get a strategy to modify a Ti surface. Compared to the antimicrobial activity with a fragment of hBD-3, the bifunctional and multifunctional chimeric peptides retain their antimicrobial function. All peptides showed antimicrobial activity against streptococcus in biofilm and planktonic conditions. The results demonstrate significant improvement in reducing bacterial colonization onto titanium surfaces. According to the results of structure analysis, the antimicrobial activity of tyrosine in hBD3-3 was stronger than that of the alpha helix in bifunctional or multifunctional chimeric peptides. Rigid connections were proved to avoid functional domain changes due to the interaction of charges. These results indicated that the endogenous peptide fragments modifying the Ti surface could provide an environmentally friendly approach to reduce or prevent the occurrence of peri-implant diseases. The antimicrobial activity of Tyr structure in hBD3-3 is stronger than that of the α-helix structure in multifunctional chimeric peptides. Rigid connections avoid functional domain changes. Endogenous peptide fragments on a Ti surface could reduce peri-implant diseases.![]()
Collapse
Affiliation(s)
- Lei Gong
- Department of Esophageal Cancer, Tianjin's Clinical Research Center for Cancer, Key Laboratory of Cancer Prevention and Therapy, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital Tianjin 300070 PR China
| | - Hongjuan Geng
- Department of Stomatology Tianjin Hospital, 406 Jiefang South Road, Hexi District Tianjin 300211 PR China
| | - Xi Zhang
- School and Hospital of Stomatology, Tianjin Medical University 12 Observatory Road Tianjin 300070 PR China
| | - Ping Gao
- School and Hospital of Stomatology, Tianjin Medical University 12 Observatory Road Tianjin 300070 PR China
| |
Collapse
|
22
|
Loshbaugh AL, Kortemme T. Comparison of Rosetta flexible-backbone computational protein design methods on binding interactions. Proteins 2019; 88:206-226. [PMID: 31344278 DOI: 10.1002/prot.25790] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Revised: 07/15/2019] [Accepted: 07/19/2019] [Indexed: 01/03/2023]
Abstract
Computational design of binding sites in proteins remains difficult, in part due to limitations in our current ability to sample backbone conformations that enable precise and accurate geometric positioning of side chains during sequence design. Here we present a benchmark framework for comparison between flexible-backbone design methods applied to binding interactions. We quantify the ability of different flexible backbone design methods in the widely used protein design software Rosetta to recapitulate observed protein sequence profiles assumed to represent functional protein/protein and protein/small molecule binding interactions. The CoupledMoves method, which combines backbone flexibility and sequence exploration into a single acceptance step during the sampling trajectory, better recapitulates observed sequence profiles than the BackrubEnsemble and FastDesign methods, which separate backbone flexibility and sequence design into separate acceptance steps during the sampling trajectory. Flexible-backbone design with the CoupledMoves method is a powerful strategy for reducing sequence space to generate targeted libraries for experimental screening and selection.
Collapse
Affiliation(s)
- Amanda L Loshbaugh
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California.,Biophysics Graduate Program, University of California San Francisco, San Francisco, California
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California.,Biophysics Graduate Program, University of California San Francisco, San Francisco, California.,Quantitative Biosciences Institute, University of California San Francisco, San Francisco, California.,Chan Zuckerberg Biohub, San Francisco, California
| |
Collapse
|