1
|
Talluri S. Algorithms for protein design. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:1-38. [PMID: 35534105 DOI: 10.1016/bs.apcsb.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Computational Protein Design has the potential to contribute to major advances in enzyme technology, vaccine design, receptor-ligand engineering, biomaterials, nanosensors, and synthetic biology. Although Protein Design is a challenging problem, proteins can be designed by experts in Protein Design, as well as by non-experts whose primary interests are in the applications of Protein Design. The increased accessibility of Protein Design technology is attributable to the accumulated knowledge and experience with Protein Design as well as to the availability of software and online resources. The objective of this review is to serve as a guide to the relevant literature with a focus on the novel methods and algorithms that have been developed or applied for Protein Design, and to assist in the selection of algorithms for Protein Design. Novel algorithms and models that have been introduced to utilize the enormous amount of experimental data and novel computational hardware have the potential for producing substantial increases in the accuracy, reliability and range of applications of designed proteins.
Collapse
Affiliation(s)
- Sekhar Talluri
- Department of Biotechnology, GITAM, Visakhapatnam, India.
| |
Collapse
|
2
|
Grayson KJ, Anderson JLR. Designed for life: biocompatible de novo designed proteins and components. J R Soc Interface 2019; 15:rsif.2018.0472. [PMID: 30158186 PMCID: PMC6127164 DOI: 10.1098/rsif.2018.0472] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 08/01/2018] [Indexed: 12/30/2022] Open
Abstract
A principal goal of synthetic biology is the de novo design or redesign of biomolecular components. In addition to revealing fundamentally important information regarding natural biomolecular engineering and biochemistry, functional building blocks will ultimately be provided for applications including the manufacture of valuable products and therapeutics. To fully realize this ambitious goal, the designed components must be biocompatible, working in concert with natural biochemical processes and pathways, while not adversely affecting cellular function. For example, de novo protein design has provided us with a wide repertoire of structures and functions, including those that can be assembled and function in vivo. Here we discuss such biocompatible designs, as well as others that have the potential to become biocompatible, including non-protein molecules, and routes to achieving full biological integration.
Collapse
Affiliation(s)
- Katie J Grayson
- School of Biochemistry, University of Bristol, Biomedical Sciences Building, Bristol BS8 1TD, UK
| | - J L Ross Anderson
- School of Biochemistry, University of Bristol, Biomedical Sciences Building, Bristol BS8 1TD, UK .,BrisSynBio Synthetic Biology Research Centre, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK
| |
Collapse
|
3
|
Kundert K, Kortemme T. Computational design of structured loops for new protein functions. Biol Chem 2019; 400:275-288. [PMID: 30676995 PMCID: PMC6530579 DOI: 10.1515/hsz-2018-0348] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 12/18/2018] [Indexed: 12/20/2022]
Abstract
The ability to engineer the precise geometries, fine-tuned energetics and subtle dynamics that are characteristic of functional proteins is a major unsolved challenge in the field of computational protein design. In natural proteins, functional sites exhibiting these properties often feature structured loops. However, unlike the elements of secondary structures that comprise idealized protein folds, structured loops have been difficult to design computationally. Addressing this shortcoming in a general way is a necessary first step towards the routine design of protein function. In this perspective, we will describe the progress that has been made on this problem and discuss how recent advances in the field of loop structure prediction can be harnessed and applied to the inverse problem of computational loop design.
Collapse
Affiliation(s)
- Kale Kundert
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
| | - Tanja Kortemme
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, 499 Illinois St, San Francisco, CA 94158, USA
| |
Collapse
|
4
|
Tubiana L, Jurásek M, Coluzza I. Implementing efficient concerted rotations using Mathematica and C code ⋆. THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER 2018; 41:87. [PMID: 30022359 DOI: 10.1140/epje/i2018-11694-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 06/28/2018] [Indexed: 06/08/2023]
Abstract
In this article we demonstrate a general and efficient metaprogramming implementation of concerted rotations using Mathematica. Concerted rotations allow the movement of a fixed portion of a polymer backbone with fixed bending angles, like a protein, while maintaining the correct geometry of the backbone and the initial and final points of the portion fixed. Our implementation uses Mathematica to generate a C code which is then wrapped in a library by a Python script. The user can modify the Mathematica notebook to generate a set of concerted rotations suited for a particular backbone geometry, without having to write the C code himself. The resulting code is highly optimized, performing on the order of thousands of operations per second.
Collapse
Affiliation(s)
- Luca Tubiana
- Computational Physics Department, University of Vienna, Sensengasse 8/10, 1090, Vienna, Austria.
| | - Miroslav Jurásek
- Faculty of Science, Masaryk University, Kotlářská 2, 602 00, Brno, Czech Republic
- CEITEC - Central European Institute of Technology, Kamenice 5, 625 00, Brno, Czech Republic
| | - Ivan Coluzza
- CIC biomaGUNE Parque Cientfico y Tecnolgico de Gipuzkoa, Paseo Miramn 182, 20014, Donostia / San Sebastin, Gipuzkoa, Spain
| |
Collapse
|
5
|
Chu H, Liu H. TetraBASE: A Side Chain-Independent Statistical Energy for Designing Realistically Packed Protein Backbones. J Chem Inf Model 2018; 58:430-442. [PMID: 29314837 DOI: 10.1021/acs.jcim.7b00677] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
To construct backbone structures of high designability is a primary aspect of computational protein design. We report here a side chain-independent statistical energy that aims at realistic modeling of through-space packing of polypeptide backbones. To mitigate the lack of explicit amino acid side chains, the model treats the interbackbone site packing as being dependent on peptide local conformation. In addition, new variables suitable for statistical analysis, one for relative orientation and another for distance, have been introduced to represent the intersite geometry based on the asymmetrical tetrahedron organization of distinct chemical groups surrounding the Cα-carbon atoms. The resulting tetrahedron-based backbone statistical energy (tetraBASE) model has been used to optimize the tertiary organizations of secondary structure elements (SSEs) of designated types with Monte Caro simulated annealing, starting from artificial initial configurations. The tetraBASE minimum energy structures can reproduce SSE packing frequently observed in native proteins with atomic root-mean-square deviations of 1-2 Å. The model has also been tested by examining the stability of native SSE arrangements under tetraBASE. The results suggest that tetraBASE model can be used to effectively represent interbackbone packing when designing backbone structures without explicitly knowing side chain types.
Collapse
Affiliation(s)
- Huanyu Chu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China
| | - Haiyan Liu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China.,Collaborative Innovation Center of Chemistry for Life Sciences , 230027 Hefei, Anhui China
| |
Collapse
|
6
|
Salih O, He S, Planamente S, Stach L, MacDonald JT, Manoli E, Scheres SHW, Filloux A, Freemont PS. Atomic Structure of Type VI Contractile Sheath from Pseudomonas aeruginosa. Structure 2018; 26:329-336.e3. [PMID: 29307484 PMCID: PMC5807055 DOI: 10.1016/j.str.2017.12.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2017] [Revised: 09/28/2017] [Accepted: 12/06/2017] [Indexed: 01/01/2023]
Abstract
Pseudomonas aeruginosa has three type VI secretion systems (T6SSs), H1-, H2-, and H3-T6SS, each belonging to a distinct group. The two T6SS components, TssB/VipA and TssC/VipB, assemble to form tubules that conserve structural/functional homology with tail sheaths of contractile bacteriophages and pyocins. Here, we used cryoelectron microscopy to solve the structure of the H1-T6SS P. aeruginosa TssB1C1 sheath at 3.3 Å resolution. Our structure allowed us to resolve some features of the T6SS sheath that were not resolved in the Vibrio cholerae VipAB and Francisella tularensis IglAB structures. Comparison with sheath structures from other contractile machines, including T4 phage and R-type pyocins, provides a better understanding of how these systems have conserved similar functions/mechanisms despite evolution. We used the P. aeruginosa R2 pyocin as a structural template to build an atomic model of the TssB1C1 sheath in its extended conformation, allowing us to propose a coiled-spring-like mechanism for T6SS sheath contraction. We solved a T6SS sheath structure from Pseudomonas aeruginosa (group 3 T6SSi) Comparisons between T6SS groups suggest a conserved sheath contraction mechanism Extended-state model led to proposal of a spring-like sheath contraction mechanism
Collapse
Affiliation(s)
- Osman Salih
- Section of Structural Biology, Department of Medicine, Imperial College London, London SW7 2AZ, UK
| | - Shaoda He
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| | - Sara Planamente
- MRC Centre for Molecular Bacteriology and Infection (CMBI), Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Lasse Stach
- Section of Structural Biology, Department of Medicine, Imperial College London, London SW7 2AZ, UK
| | - James T MacDonald
- Section of Structural Biology, Department of Medicine, Imperial College London, London SW7 2AZ, UK
| | - Eleni Manoli
- MRC Centre for Molecular Bacteriology and Infection (CMBI), Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | | | - Alain Filloux
- MRC Centre for Molecular Bacteriology and Infection (CMBI), Department of Life Sciences, Imperial College London, London SW7 2AZ, UK.
| | - Paul S Freemont
- Section of Structural Biology, Department of Medicine, Imperial College London, London SW7 2AZ, UK.
| |
Collapse
|
7
|
Computational protein design with backbone plasticity. Biochem Soc Trans 2016; 44:1523-1529. [PMID: 27911735 PMCID: PMC5264498 DOI: 10.1042/bst20160155] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2016] [Revised: 08/01/2016] [Accepted: 08/03/2016] [Indexed: 11/17/2022]
Abstract
The computational algorithms used in the design of artificial proteins have become increasingly sophisticated in recent years, producing a series of remarkable successes. The most dramatic of these is the de novo design of artificial enzymes. The majority of these designs have reused naturally occurring protein structures as ‘scaffolds’ onto which novel functionality can be grafted without having to redesign the backbone structure. The incorporation of backbone flexibility into protein design is a much more computationally challenging problem due to the greatly increased search space, but promises to remove the limitations of reusing natural protein scaffolds. In this review, we outline the principles of computational protein design methods and discuss recent efforts to consider backbone plasticity in the design process.
Collapse
|
8
|
Synthetic beta-solenoid proteins with the fragment-free computational design of a beta-hairpin extension. Proc Natl Acad Sci U S A 2016; 113:10346-51. [PMID: 27573845 DOI: 10.1073/pnas.1525308113] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The ability to design and construct structures with atomic level precision is one of the key goals of nanotechnology. Proteins offer an attractive target for atomic design because they can be synthesized chemically or biologically and can self-assemble. However, the generalized protein folding and design problem is unsolved. One approach to simplifying the problem is to use a repetitive protein as a scaffold. Repeat proteins are intrinsically modular, and their folding and structures are better understood than large globular domains. Here, we have developed a class of synthetic repeat proteins based on the pentapeptide repeat family of beta-solenoid proteins. We have constructed length variants of the basic scaffold and computationally designed de novo loops projecting from the scaffold core. The experimentally solved 3.56-Å resolution crystal structure of one designed loop matches closely the designed hairpin structure, showing the computational design of a backbone extension onto a synthetic protein core without the use of backbone fragments from known structures. Two other loop designs were not clearly resolved in the crystal structures, and one loop appeared to be in an incorrect conformation. We have also shown that the repeat unit can accommodate whole-domain insertions by inserting a domain into one of the designed loops.
Collapse
|
9
|
López-Blanco JR, Canosa-Valls AJ, Li Y, Chacón P. RCD+: Fast loop modeling server. Nucleic Acids Res 2016; 44:W395-400. [PMID: 27151199 PMCID: PMC4987936 DOI: 10.1093/nar/gkw395] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Accepted: 04/28/2016] [Indexed: 11/12/2022] Open
Abstract
Modeling loops is a critical and challenging step in protein modeling and prediction. We have developed a quick online service (http://rcd.chaconlab.org) for ab initio loop modeling combining a coarse-grained conformational search with a full-atom refinement. Our original Random Coordinate Descent (RCD) loop closure algorithm has been greatly improved to enrich the sampling distribution towards near-native conformations. These improvements include a new workflow optimization, MPI-parallelization and fast backbone angle sampling based on neighbor-dependent Ramachandran probability distributions. The server starts by efficiently searching the vast conformational space from only the loop sequence information and the environment atomic coordinates. The generated closed loop models are subsequently ranked using a fast distance-orientation dependent energy filter. Top ranked loops are refined with the Rosetta energy function to obtain accurate all-atom predictions that can be interactively inspected in an user-friendly web interface. Using standard benchmarks, the average root mean squared deviation (RMSD) is 0.8 and 1.4 Å for 8 and 12 residues loops, respectively, in the challenging modeling scenario in where the side chains of the loop environment are fully remodeled. These results are not only very competitive compared to those obtained with public state of the art methods, but also they are obtained ∼10-fold faster.
Collapse
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Chemical Physics, Rocasolano Physical Chemistry Institute C.S.I.C., Serrano 119, 28006 Madrid, Spain
| | - Alejandro Jesús Canosa-Valls
- Department of Biological Chemical Physics, Rocasolano Physical Chemistry Institute C.S.I.C., Serrano 119, 28006 Madrid, Spain
| | - Yaohang Li
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA
| | - Pablo Chacón
- Department of Biological Chemical Physics, Rocasolano Physical Chemistry Institute C.S.I.C., Serrano 119, 28006 Madrid, Spain
| |
Collapse
|
10
|
Affiliation(s)
- Ivan Coluzza
- Department of Computational Physics, Faculty of Physics, University of Vienna , Vienna, Austria
| |
Collapse
|
11
|
Hierarchical particle swarm optimizer for minimizing the non-convex potential energy of molecular structure. J Mol Graph Model 2014; 54:114-22. [PMID: 25459763 DOI: 10.1016/j.jmgm.2014.10.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2014] [Revised: 09/22/2014] [Accepted: 10/08/2014] [Indexed: 11/23/2022]
Abstract
The stable conformation of a molecule is greatly important to uncover the secret of its properties and functions. Generally, the conformation of a molecule will be the most stable when it is of the minimum potential energy. Accordingly, the determination of the conformation can be solved in the optimization framework. It is, however, not an easy task to achieve the only conformation with the lowest energy among all the potential ones because of the high complexity of the energy landscape and the exponential computation increasing with molecular size. In this paper, we develop a hierarchical and heterogeneous particle swarm optimizer (HHPSO) to deal with the problem in the minimization of the potential energy. The proposed method is evaluated over a scalable simplified molecular potential energy function with up to 200 degrees of freedom and a realistic energy function of pseudo-ethane molecule. The experimental results are compared with other six PSO variants and four genetic algorithms. The results show HHPSO is significantly better than the compared PSOs with p-value less than 0.01277 over molecular potential energy function.
Collapse
|