1
|
Capponi S, Wang S. AI in cellular engineering and reprogramming. Biophys J 2024:S0006-3495(24)00245-5. [PMID: 38576162 DOI: 10.1016/j.bpj.2024.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/19/2024] [Accepted: 04/01/2024] [Indexed: 04/06/2024] Open
Abstract
During the last decade, artificial intelligence (AI) has increasingly been applied in biophysics and related fields, including cellular engineering and reprogramming, offering novel approaches to understand, manipulate, and control cellular function. The potential of AI lies in its ability to analyze complex datasets and generate predictive models. AI algorithms can process large amounts of data from single-cell genomics and multiomic technologies, allowing researchers to gain mechanistic insights into the control of cell identity and function. By integrating and interpreting these complex datasets, AI can help identify key molecular events and regulatory pathways involved in cellular reprogramming. This knowledge can inform the design of precision engineering strategies, such as the development of new transcription factor and signaling molecule cocktails, to manipulate cell identity and drive authentic cell fate across lineage boundaries. Furthermore, when used in combination with computational methods, AI can accelerate and improve the analysis and understanding of the intricate relationships between genes, proteins, and cellular processes. In this review article, we explore the current state of AI applications in biophysics with a specific focus on cellular engineering and reprogramming. Then, we showcase a couple of recent applications where we combined machine learning with experimental and computational techniques. Finally, we briefly discuss the challenges and prospects of AI in cellular engineering and reprogramming, emphasizing the potential of these technologies to revolutionize our ability to engineer cells for a variety of applications, from disease modeling and drug discovery to regenerative medicine and biomanufacturing.
Collapse
Affiliation(s)
- Sara Capponi
- IBM Almaden Research Center, San Jose, California; Center for Cellular Construction, San Francisco, California.
| | - Shangying Wang
- Bay Area Institute of Science, Altos Labs, Redwood City, California.
| |
Collapse
|
2
|
Weinberg ZY, Soliman SS, Kim MS, Chen IP, Ott M, El-Samad H. De novo-designed minibinders expand the synthetic biology sensing repertoire. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.12.575267. [PMID: 38293112 PMCID: PMC10827046 DOI: 10.1101/2024.01.12.575267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
Synthetic and chimeric receptors capable of recognizing and responding to user-defined antigens have enabled "smart" therapeutics based on engineered cells. These cell engineering tools depend on antigen sensors which are most often derived from antibodies. Advances in the de novo design of proteins have enabled the design of protein binders with the potential to target epitopes with unique properties and faster production timelines compared to antibodies. Building upon our previous work combining a de novo-designed minibinder of the Spike protein of SARS-CoV-2 with the synthetic receptor synNotch (SARSNotch), we investigated whether minibinders can be readily adapted to a diversity of cell engineering tools. We show that the Spike minibinder LCB1 easily generalizes to a next-generation proteolytic receptor SNIPR that performs similarly to our previously reported SARSNotch. LCB1-SNIPR successfully enables the detection of live SARS-CoV-2, an improvement over SARSNotch which can only detect cell-expressed Spike. To test the generalizability of minibinders to diverse applications, we tested LCB1 as an antigen sensor for a chimeric antigen receptor (CAR). LCB1-CAR enabled CD8+ T cells to cytotoxically target Spike-expressing cells. Our findings suggest that minibinders represent a novel class of antigen sensors that have the potential to dramatically expand the sensing repertoire of cell engineering tools.
Collapse
Affiliation(s)
| | | | - Matthew S. Kim
- Tetrad Gradudate Program, UCSF, San Francisco CA
- Cell Design Institute, San Francisco CA
| | - Irene P. Chen
- Gladstone Institutes, San Francisco CA
- Department of Medicine, UCSF, San Francisco CA
| | - Melanie Ott
- Gladstone Institutes, San Francisco CA
- Department of Medicine, UCSF, San Francisco CA
- Chan Zuckerberg Biohub–San Francisco, San Francisco CA
| | - Hana El-Samad
- Department of Biochemistry & Biophysics, UCSF, San Francisco CA
- Cell Design Institute, San Francisco CA
- Chan Zuckerberg Biohub–San Francisco, San Francisco CA
- Altos Labs, San Francisco CA
| |
Collapse
|
3
|
Islam S, Pantazes RJ. Developing similarity matrices for antibody-protein binding interactions. PLoS One 2023; 18:e0293606. [PMID: 37883504 PMCID: PMC10602319 DOI: 10.1371/journal.pone.0293606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 10/17/2023] [Indexed: 10/28/2023] Open
Abstract
The inventions of AlphaFold and RoseTTAFold are revolutionizing computational protein science due to their abilities to reliably predict protein structures. Their unprecedented successes are due to the parallel consideration of several types of information, one of which is protein sequence similarity information. Sequence homology has been studied for many decades and depends on similarity matrices to define how similar or different protein sequences are to one another. A natural extension of predicting protein structures is predicting the interactions between proteins, but similarity matrices for protein-protein interactions do not exist. This study conducted a mutational analysis of 384 non-redundant antibody-protein antigen complexes to calculate antibody-protein interaction similarity matrices. Every important residue in each antibody and each antigen was mutated to each of the other 19 commonly occurring amino acids and the percentage changes in interaction energies were calculated using three force fields: CHARMM, Amber, and Rosetta. The data were used to construct six interaction similarity matrices, one for antibodies and another for antigens using each force field. The matrices exhibited both commonalities, such as mutations of aromatic and charged residues being the most detrimental, and differences, such as Rosetta predicting mutations of serines to be better tolerated than either Amber or CHARMM. A comparison to nine previously published similarity matrices for protein sequences revealed that the new interaction matrices are more similar to one another than they are to any of the previous matrices. The created similarity matrices can be used in force field specific applications to help guide decisions regarding mutations in protein-protein binding interfaces.
Collapse
Affiliation(s)
- Sumaiya Islam
- Department of Chemical Engineering, Auburn University, Auburn, Alabama, United States of America
| | - Robert J. Pantazes
- Department of Chemical Engineering, Auburn University, Auburn, Alabama, United States of America
| |
Collapse
|
4
|
Chen X, Morehead A, Liu J, Cheng J. A gated graph transformer for protein complex structure quality assessment and its performance in CASP15. Bioinformatics 2023; 39:i308-i317. [PMID: 37387159 PMCID: PMC10311325 DOI: 10.1093/bioinformatics/btad203] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open
Abstract
MOTIVATION Proteins interact to form complexes to carry out essential biological functions. Computational methods such as AlphaFold-multimer have been developed to predict the quaternary structures of protein complexes. An important yet largely unsolved challenge in protein complex structure prediction is to accurately estimate the quality of predicted protein complex structures without any knowledge of the corresponding native structures. Such estimations can then be used to select high-quality predicted complex structures to facilitate biomedical research such as protein function analysis and drug discovery. RESULTS In this work, we introduce a new gated neighborhood-modulating graph transformer to predict the quality of 3D protein complex structures. It incorporates node and edge gates within a graph transformer framework to control information flow during graph message passing. We trained, evaluated and tested the method (called DProQA) on newly-curated protein complex datasets before the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) and then blindly tested it in the 2022 CASP15 experiment. The method was ranked 3rd among the single-model quality assessment methods in CASP15 in terms of the ranking loss of TM-score on 36 complex targets. The rigorous internal and external experiments demonstrate that DProQA is effective in ranking protein complex structures. AVAILABILITY AND IMPLEMENTATION The source code, data, and pre-trained models are available at https://github.com/jianlin-cheng/DProQA.
Collapse
Affiliation(s)
- Xiao Chen
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65201, United States
| | - Alex Morehead
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65201, United States
| | - Jian Liu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65201, United States
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65201, United States
| |
Collapse
|
5
|
Ali M, Khramushin A, Yadav VK, Schueler-Furman O, Ivarsson Y. Elucidation of Short Linear Motif-Based Interactions of the FERM Domains of Ezrin, Radixin, Moesin, and Merlin. Biochemistry 2023. [PMID: 37224425 DOI: 10.1021/acs.biochem.3c00096] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
The ERM (ezrin, radixin, and moesin) family of proteins and the related protein merlin participate in scaffolding and signaling events at the cell cortex. The proteins share an N-terminal FERM [band four-point-one (4.1) ERM] domain composed of three subdomains (F1, F2, and F3) with binding sites for short linear peptide motifs. By screening the FERM domains of the ERMs and merlin against a phage library that displays peptides representing the intrinsically disordered regions of the human proteome, we identified a large number of novel ligands. We determined the affinities for the ERM and merlin FERM domains interacting with 18 peptides and validated interactions with full-length proteins through pull-down experiments. The majority of the peptides contained an apparent Yx[FILV] motif; others show alternative motifs. We defined distinct binding sites for two types of similar but distinct binding motifs (YxV and FYDF) using a combination of Rosetta FlexPepDock computational peptide docking protocols and mutational analysis. We provide a detailed molecular understanding of how the two types of peptides with distinct motifs bind to different sites on the moesin FERM phosphotyrosine binding-like subdomain and uncover interdependencies between the different types of ligands. The study expands the motif-based interactomes of the ERMs and merlin and suggests that the FERM domain acts as a switchable interaction hub.
Collapse
Affiliation(s)
- Muhammad Ali
- Department of Chemistry - BMC, Uppsala University, Husargatan 3, 751 23 Uppsala, Sweden
| | - Alisa Khramushin
- Department of Microbiology and Molecular Genetics, Institute for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem 9112102, Israel
| | - Vikash K Yadav
- Department of Chemistry - BMC, Uppsala University, Husargatan 3, 751 23 Uppsala, Sweden
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Institute for Medical Research Israel-Canada, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem 9112102, Israel
| | - Ylva Ivarsson
- Department of Chemistry - BMC, Uppsala University, Husargatan 3, 751 23 Uppsala, Sweden
| |
Collapse
|
6
|
Himiyama T, Hamaguchi T, Yonekura K, Nakamura T. Unnaturally Distorted Hexagonal Protein Ring Alternatingly Reorganized from Two Distinct Chemically Modified Proteins. Bioconjug Chem 2023. [PMID: 36888722 DOI: 10.1021/acs.bioconjchem.3c00057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/10/2023]
Abstract
In this study, we constructed a semiartificial protein assembly of alternating ring type, which was modified from the natural assembly state via incorporation of a synthetic component at the protein interface. For the redesign of a natural protein assembly, a scrap-and-build approach employing chemical modification was used. Two different protein dimer units were designed based on peroxiredoxin from Thermococcus kodakaraensis, which originally forms a dodecameric hexagonal ring with six homodimers. The two dimeric mutants were reorganized into a ring by reconstructing the protein-protein interactions via synthetic naphthalene moieties introduced by chemical modification. Cryo-electron microscopy revealed the formation of a uniquely shaped dodecameric hexagonal protein ring with broken symmetry, distorted from the regular hexagon of the wild-type protein. The artificially installed naphthalene moieties were arranged at the interfaces of dimer units, forming two distinct protein-protein interactions, one of which is highly unnatural. This study deciphered the potential of the chemical modification technique that constructs semiartificial protein structures and assembly hardly accessible by conventional amino acid mutations.
Collapse
Affiliation(s)
- Tomoki Himiyama
- Biomedical Research Institute, National Institute of Advanced Industrial Science and Technology, 1-8-31, Ikeda, Osaka 563-8577, Japan
| | - Tasuku Hamaguchi
- Biostructural Mechanism Laboratory, RIKEN SPring-8 Center, 1-1-1, Sayo, Hyogo 679-5148, Japan
- Institute of Multidisciplinary Research for Advanced Materials, Tohoku University, 2-1-1 Katahira, Sendai, Miyagi 980-8577, Japan
| | - Koji Yonekura
- Biostructural Mechanism Laboratory, RIKEN SPring-8 Center, 1-1-1, Sayo, Hyogo 679-5148, Japan
- Institute of Multidisciplinary Research for Advanced Materials, Tohoku University, 2-1-1 Katahira, Sendai, Miyagi 980-8577, Japan
- Advanced Electron Microscope Development Unit, RIKEN-JEOL Collaboration Center, RIKEN Baton Zone Program, 1-1-1, Sayo, Hyogo 679-5148, Japan
| | - Tsutomu Nakamura
- Biomedical Research Institute, National Institute of Advanced Industrial Science and Technology, 1-8-31, Ikeda, Osaka 563-8577, Japan
| |
Collapse
|
7
|
Qing R, Hao S, Smorodina E, Jin D, Zalevsky A, Zhang S. Protein Design: From the Aspect of Water Solubility and Stability. Chem Rev 2022; 122:14085-14179. [PMID: 35921495 PMCID: PMC9523718 DOI: 10.1021/acs.chemrev.1c00757] [Citation(s) in RCA: 54] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Indexed: 12/13/2022]
Abstract
Water solubility and structural stability are key merits for proteins defined by the primary sequence and 3D-conformation. Their manipulation represents important aspects of the protein design field that relies on the accurate placement of amino acids and molecular interactions, guided by underlying physiochemical principles. Emulated designer proteins with well-defined properties both fuel the knowledge-base for more precise computational design models and are used in various biomedical and nanotechnological applications. The continuous developments in protein science, increasing computing power, new algorithms, and characterization techniques provide sophisticated toolkits for solubility design beyond guess work. In this review, we summarize recent advances in the protein design field with respect to water solubility and structural stability. After introducing fundamental design rules, we discuss the transmembrane protein solubilization and de novo transmembrane protein design. Traditional strategies to enhance protein solubility and structural stability are introduced. The designs of stable protein complexes and high-order assemblies are covered. Computational methodologies behind these endeavors, including structure prediction programs, machine learning algorithms, and specialty software dedicated to the evaluation of protein solubility and aggregation, are discussed. The findings and opportunities for Cryo-EM are presented. This review provides an overview of significant progress and prospects in accurate protein design for solubility and stability.
Collapse
Affiliation(s)
- Rui Qing
- State
Key Laboratory of Microbial Metabolism, School of Life Sciences and
Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- The
David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Shilei Hao
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Key
Laboratory of Biorheological Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400030, China
| | - Eva Smorodina
- Department
of Immunology, University of Oslo and Oslo
University Hospital, Oslo 0424, Norway
| | - David Jin
- Avalon GloboCare
Corp., Freehold, New Jersey 07728, United States
| | - Arthur Zalevsky
- Laboratory
of Bioinformatics Approaches in Combinatorial Chemistry and Biology, Shemyakin−Ovchinnikov Institute of Bioorganic
Chemistry RAS, Moscow 117997, Russia
| | - Shuguang Zhang
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
8
|
Pollet L, Lambourne L, Xia Y. Structural Determinants of Yeast Protein-Protein Interaction Interface Evolution at the Residue Level. J Mol Biol 2022; 434:167750. [PMID: 35850298 DOI: 10.1016/j.jmb.2022.167750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 06/09/2022] [Accepted: 07/12/2022] [Indexed: 12/01/2022]
Abstract
Interfaces of contact between proteins play important roles in determining the proper structure and function of protein-protein interactions (PPIs). Therefore, to fully understand PPIs, we need to better understand the evolutionary design principles of PPI interfaces. Previous studies have uncovered that interfacial sites are more evolutionarily conserved than other surface protein sites. Yet, little is known about the nature and relative importance of evolutionary constraints in PPI interfaces. Here, we explore constraints imposed by the structure of the microenvironment surrounding interfacial residues on residue evolutionary rate using a large dataset of over 700 structural models of baker's yeast PPIs. We find that interfacial residues are, on average, systematically more conserved than all other residues with a similar degree of total burial as measured by relative solvent accessibility (RSA). Besides, we find that RSA of the residue when the PPI is formed is a better predictor of interfacial residue evolutionary rate than RSA in the monomer state. Furthermore, we investigate four structure-based measures of residue interfacial involvement, including change in RSA upon binding (ΔRSA), number of residue-residue contacts across the interface, and distance from the center or the periphery of the interface. Integrated modeling for evolutionary rate prediction in interfaces shows that ΔRSA plays a dominant role among the four measures of interfacial involvement, with minor, but independent contributions from other measures. These results yield insight into the evolutionary design of interfaces, improving our understanding of the role that structure plays in the molecular evolution of PPIs at the residue level.
Collapse
Affiliation(s)
- Léah Pollet
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
| | - Luke Lambourne
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA; Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA; Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada.
| |
Collapse
|
9
|
Magi Meconi G, Sasselli IR, Bianco V, Onuchic JN, Coluzza I. Key aspects of the past 30 years of protein design. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2022; 85:086601. [PMID: 35704983 DOI: 10.1088/1361-6633/ac78ef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 06/15/2022] [Indexed: 06/15/2023]
Abstract
Proteins are the workhorse of life. They are the building infrastructure of living systems; they are the most efficient molecular machines known, and their enzymatic activity is still unmatched in versatility by any artificial system. Perhaps proteins' most remarkable feature is their modularity. The large amount of information required to specify each protein's function is analogically encoded with an alphabet of just ∼20 letters. The protein folding problem is how to encode all such information in a sequence of 20 letters. In this review, we go through the last 30 years of research to summarize the state of the art and highlight some applications related to fundamental problems of protein evolution.
Collapse
Affiliation(s)
- Giulia Magi Meconi
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | - Ivan R Sasselli
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | | | - Jose N Onuchic
- Center for Theoretical Biological Physics, Department of Physics & Astronomy, Department of Chemistry, Department of Biosciences, Rice University, Houston, TX 77251, United States of America
| | - Ivan Coluzza
- BCMaterials, Basque Center for Materials, Applications and Nanostructures, Bld. Martina Casiano, UPV/EHU Science Park, Barrio Sarriena s/n, 48940 Leioa, Spain
- Basque Foundation for Science, Ikerbasque, 48009, Bilbao, Spain
| |
Collapse
|
10
|
Maity B, Taher M, Mazumdar S, Ueno T. Artificial metalloenzymes based on protein assembly. Coord Chem Rev 2022. [DOI: 10.1016/j.ccr.2022.214593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
11
|
Zacharias M. Match_Motif: A rapid computational tool to assist in protein-protein interaction design. Protein Sci 2022; 31:147-157. [PMID: 34648221 PMCID: PMC8740833 DOI: 10.1002/pro.4208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 10/06/2021] [Accepted: 10/12/2021] [Indexed: 11/12/2022]
Abstract
In order to generate protein assemblies with a desired function, the rational design of protein-protein binding interfaces is of significant interest. Approaches based on random mutagenesis or directed evolution may involve complex experimental selection procedures. Also, molecular modeling approaches to design entirely new proteins and interactions with partner molecules can involve large computational efforts and screening steps. In order to simplify at least the initial effort for designing a putative binding interface between two proteins the Match_Motif approach has been developed. It employs the large collection of known protein-protein complex structures to suggest interface modifications that may lead to improved binding for a desired input interaction geometry. The approach extracts interaction motifs based on the backbone structure of short (four residues) segments and the relative arrangement with respect to short segments on the partner protein. The interaction geometry is used to search through a database of such motifs in known stable bound complexes. All matches are rapidly identified (within a few seconds) and collected and can be used to guide changes in the interface that may lead to improved binding. In the output, an alternative interface structure is also proposed based on the frequency of occurrence of side chains at a given interface position in all matches and based on sterical considerations. Applications of the procedure to known complex structures and alternative arrangements are presented and discussed. The program, data files, and example applications can be downloaded from https://www.groups.ph.tum.de/t38/downloads/.
Collapse
Affiliation(s)
- Martin Zacharias
- Center of Functional Protein AssembliesTechnical University of MunichGarchingGermany
| |
Collapse
|
12
|
Shimizu K, Mijiddorj B, Usami M, Mizoguchi I, Yoshida S, Akayama S, Hamada Y, Ohyama A, Usui K, Kawamura I, Kawano R. De novo design of a nanopore for single-molecule detection that incorporates a β-hairpin peptide. NATURE NANOTECHNOLOGY 2022; 17:67-75. [PMID: 34811552 PMCID: PMC8770118 DOI: 10.1038/s41565-021-01008-w] [Citation(s) in RCA: 40] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Accepted: 09/13/2021] [Indexed: 05/11/2023]
Abstract
The amino-acid sequence of a protein encodes information on its three-dimensional structure and specific functionality. De novo design has emerged as a method to manipulate the primary structure for the development of artificial proteins and peptides with desired functionality. This paper describes the de novo design of a pore-forming peptide, named SV28, that has a β-hairpin structure and assembles to form a stable nanopore in a bilayer lipid membrane. This large synthetic nanopore is an entirely artificial device for practical applications. The peptide forms multidispersely sized nanopore structures ranging from 1.7 to 6.3 nm in diameter and can detect DNAs. To form a monodispersely sized nanopore, we redesigned the SV28 by introducing a glycine-kink mutation. The resulting redesigned peptide forms a monodisperse pore with a diameter of 1.7 nm leading to detection of a single polypeptide chain. Such de novo design of a β-hairpin peptide has the potential to create artificial nanopores, which can be size adjusted to a target molecule.
Collapse
Affiliation(s)
- Keisuke Shimizu
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology (TUAT), Tokyo, Japan
| | - Batsaikhan Mijiddorj
- Graduate School of Engineering, Yokohama National University, Yokohama, Japan
- School of Engineering and Applied Sciences, National University of Mongolia, Ulaanbaatar, Mongolia
| | - Masataka Usami
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology (TUAT), Tokyo, Japan
| | - Ikuro Mizoguchi
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology (TUAT), Tokyo, Japan
| | - Shuhei Yoshida
- Faculty of Frontiers of Innovative Research in Science and Technology (FIRST), Konan University, Kobe, Japan
| | - Shiori Akayama
- Faculty of Frontiers of Innovative Research in Science and Technology (FIRST), Konan University, Kobe, Japan
| | - Yoshio Hamada
- Faculty of Frontiers of Innovative Research in Science and Technology (FIRST), Konan University, Kobe, Japan
| | - Akifumi Ohyama
- Graduate School of Engineering Science, Yokohama National University, Yokohama, Japan
| | - Kenji Usui
- Faculty of Frontiers of Innovative Research in Science and Technology (FIRST), Konan University, Kobe, Japan
| | - Izuru Kawamura
- Graduate School of Engineering, Yokohama National University, Yokohama, Japan
- Graduate School of Engineering Science, Yokohama National University, Yokohama, Japan
| | - Ryuji Kawano
- Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology (TUAT), Tokyo, Japan.
| |
Collapse
|
13
|
Sinha NJ, Langenstein MG, Pochan DJ, Kloxin CJ, Saven JG. Peptide Design and Self-assembly into Targeted Nanostructure and Functional Materials. Chem Rev 2021; 121:13915-13935. [PMID: 34709798 DOI: 10.1021/acs.chemrev.1c00712] [Citation(s) in RCA: 88] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Peptides have been extensively utilized to construct nanomaterials that display targeted structure through hierarchical assembly. The self-assembly of both rationally designed peptides derived from naturally occurring domains in proteins as well as intuitively or computationally designed peptides that form β-sheets and helical secondary structures have been widely successful in constructing nanoscale morphologies with well-defined 1-d, 2-d, and 3-d architectures. In this review, we discuss these successes of peptide self-assembly, especially in the context of designing hierarchical materials. In particular, we emphasize the differences in the level of peptide design as an indicator of complexity within the targeted self-assembled materials and highlight future avenues for scientific and technological advances in this field.
Collapse
Affiliation(s)
- Nairiti J Sinha
- Department of Materials Science and Engineering, University of Delaware, Newark, Delaware 19716, United States
| | - Matthew G Langenstein
- Department of Materials Science and Engineering, University of Delaware, Newark, Delaware 19716, United States
| | - Darrin J Pochan
- Department of Materials Science and Engineering, University of Delaware, Newark, Delaware 19716, United States
| | - Christopher J Kloxin
- Department of Materials Science and Engineering, University of Delaware, Newark, Delaware 19716, United States.,Department of Chemical and Biomolecular Engineering, University of Delaware, Newark, Delaware 19716, United States
| | - Jeffery G Saven
- Department of Chemistry, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
14
|
Synthetic Protein Circuits and Devices Based on Reversible Protein-Protein Interactions: An Overview. Life (Basel) 2021; 11:life11111171. [PMID: 34833047 PMCID: PMC8623019 DOI: 10.3390/life11111171] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 10/25/2021] [Accepted: 10/26/2021] [Indexed: 12/30/2022] Open
Abstract
Protein-protein interactions (PPIs) contribute to regulate many aspects of cell physiology and metabolism. Protein domains involved in PPIs are important building blocks for engineering genetic circuits through synthetic biology. These domains can be obtained from known proteins and rationally engineered to produce orthogonal scaffolds, or computationally designed de novo thanks to recent advances in structural biology and molecular dynamics prediction. Such circuits based on PPIs (or protein circuits) appear of particular interest, as they can directly affect transcriptional outputs, as well as induce behavioral/adaptational changes in cell metabolism, without the need for further protein synthesis. This last example was highlighted in recent works to enable the production of fast-responding circuits which can be exploited for biosensing and diagnostics. Notably, PPIs can also be engineered to develop new drugs able to bind specific intra- and extra-cellular targets. In this review, we summarize recent findings in the field of protein circuit design, with particular focus on the use of peptides as scaffolds to engineer these circuits.
Collapse
|
15
|
Zhang X, Liu Y, Zheng B, Zang J, Lv C, Zhang T, Wang H, Zhao G. Protein interface redesign facilitates the transformation of nanocage building blocks to 1D and 2D nanomaterials. Nat Commun 2021; 12:4849. [PMID: 34381032 PMCID: PMC8357837 DOI: 10.1038/s41467-021-25199-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 07/28/2021] [Indexed: 01/09/2023] Open
Abstract
Although various artificial protein nanoarchitectures have been constructed, controlling the transformation between different protein assemblies has largely been unexplored. Here, we describe an approach to realize the self-assembly transformation of dimeric building blocks by adjusting their geometric arrangement. Thermotoga maritima ferritin (TmFtn) naturally occurs as a dimer; twelve of these dimers interact with each other in a head-to-side manner to generate 24-meric hollow protein nanocage in the presence of Ca2+ or PEG. By tuning two contiguous dimeric proteins to interact in a fully or partially side-by-side fashion through protein interface redesign, we can render the self-assembly transformation of such dimeric building blocks from the protein nanocage to filament, nanorod and nanoribbon in response to multiple external stimuli. We show similar dimeric protein building blocks can generate three kinds of protein materials in a manner that highly resembles natural pentamer building blocks from viral capsids that form different protein assemblies.
Collapse
Affiliation(s)
- Xiaorong Zhang
- grid.22935.3f0000 0004 0530 8290College of Food Science & Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing, 100083 China
| | - Yu Liu
- grid.22935.3f0000 0004 0530 8290College of Food Science & Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing, 100083 China
| | - Bowen Zheng
- grid.22935.3f0000 0004 0530 8290College of Food Science & Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing, 100083 China
| | - Jiachen Zang
- grid.22935.3f0000 0004 0530 8290College of Food Science & Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing, 100083 China
| | - Chenyan Lv
- grid.22935.3f0000 0004 0530 8290College of Food Science & Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing, 100083 China
| | - Tuo Zhang
- grid.22935.3f0000 0004 0530 8290College of Food Science & Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing, 100083 China
| | - Hongfei Wang
- grid.163032.50000 0004 1760 2008Key Laboratory of Chemical Biology and Molecular Engineering of Education Ministry, Key Laboratory of Energy Conversion and Storage Materials of Shanxi Province, Institute of Molecular Science, Shanxi University, Taiyuan, China
| | - Guanghua Zhao
- grid.22935.3f0000 0004 0530 8290College of Food Science & Nutritional Engineering, China Agricultural University, Beijing Key Laboratory of Functional Food from Plant Resources, Beijing, 100083 China
| |
Collapse
|
16
|
Emami N, Ferdousi R. AptaNet as a deep learning approach for aptamer-protein interaction prediction. Sci Rep 2021; 11:6074. [PMID: 33727685 PMCID: PMC7971039 DOI: 10.1038/s41598-021-85629-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Accepted: 03/03/2021] [Indexed: 02/08/2023] Open
Abstract
Aptamers are short oligonucleotides (DNA/RNA) or peptide molecules that can selectively bind to their specific targets with high specificity and affinity. As a powerful new class of amino acid ligands, aptamers have high potentials in biosensing, therapeutic, and diagnostic fields. Here, we present AptaNet-a new deep neural network-to predict the aptamer-protein interaction pairs by integrating features derived from both aptamers and the target proteins. Aptamers were encoded by using two different strategies, including k-mer and reverse complement k-mer frequency. Amino acid composition (AAC) and pseudo amino acid composition (PseAAC) were applied to represent target information using 24 physicochemical and conformational properties of the proteins. To handle the imbalance problem in the data, we applied a neighborhood cleaning algorithm. The predictor was constructed based on a deep neural network, and optimal features were selected using the random forest algorithm. As a result, 99.79% accuracy was achieved for the training dataset, and 91.38% accuracy was obtained for the testing dataset. AptaNet achieved high performance on our constructed aptamer-protein benchmark dataset. The results indicate that AptaNet can help identify novel aptamer-protein interacting pairs and build more-efficient insights into the relationship between aptamers and proteins. Our benchmark dataset and the source codes for AptaNet are available in: https://github.com/nedaemami/AptaNet .
Collapse
Affiliation(s)
- Neda Emami
- Department of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Reza Ferdousi
- Department of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical Sciences, Tabriz, Iran.
- Research Center for Pharmaceutical Nanotechnology, Biomedicine Institute, Tabriz University of Medical Sciences, Tabriz, Iran.
| |
Collapse
|
17
|
Pham PN, Huličiak M, Biedermannová L, Černý J, Charnavets T, Fuertes G, Herynek Š, Kolářová L, Kolenko P, Pavlíček J, Zahradník J, Mikulecky P, Schneider B. Protein Binder (ProBi) as a New Class of Structurally Robust Non-Antibody Protein Scaffold for Directed Evolution. Viruses 2021; 13:v13020190. [PMID: 33514045 PMCID: PMC7911045 DOI: 10.3390/v13020190] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Revised: 01/15/2021] [Accepted: 01/23/2021] [Indexed: 12/13/2022] Open
Abstract
Engineered small non-antibody protein scaffolds are a promising alternative to antibodies and are especially attractive for use in protein therapeutics and diagnostics. The advantages include smaller size and a more robust, single-domain structural framework with a defined binding surface amenable to mutation. This calls for a more systematic approach in designing new scaffolds suitable for use in one or more methods of directed evolution. We hereby describe a process based on an analysis of protein structures from the Protein Data Bank and their experimental examination. The candidate protein scaffolds were subjected to a thorough screening including computational evaluation of the mutability, and experimental determination of their expression yield in E. coli, solubility, and thermostability. In the next step, we examined several variants of the candidate scaffolds including their wild types and alanine mutants. We proved the applicability of this systematic procedure by selecting a monomeric single-domain human protein with a fold different from previously known scaffolds. The newly developed scaffold, called ProBi (Protein Binder), contains two independently mutable surface patches. We demonstrated its functionality by training it as a binder against human interleukin-10, a medically important cytokine. The procedure yielded scaffold-related variants with nanomolar affinity.
Collapse
|
18
|
Zhang W, Mo S, Liu M, Liu L, Yu L, Wang C. Rationally Designed Protein Building Blocks for Programmable Hierarchical Architectures. Front Chem 2020; 8:587975. [PMID: 33195088 PMCID: PMC7658299 DOI: 10.3389/fchem.2020.587975] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 10/05/2020] [Indexed: 01/23/2023] Open
Abstract
Diverse natural/artificial proteins have been used as building blocks to construct a variety of well-ordered nanoscale structures over the past couple of decades. Sophisticated protein self-assemblies have attracted great scientific interests due to their potential applications in disease diagnosis, illness treatment, biomechanics, bio-optics and bio-electronics, etc. This review outlines recent efforts directed to the creation of structurally defined protein assemblies including one-dimensional (1D) strings/rings/tubules, two-dimensional (2D) planar sheets and three-dimensional (3D) polyhedral scaffolds. We elucidate various innovative strategies for manipulating proteins to self-assemble into desired architectures. The emergent applications of protein assemblies as versatile platforms in medicine and material science with improved performances have also been discussed.
Collapse
Affiliation(s)
- Wenbo Zhang
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Department of Biophysics and Structural Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Shanshan Mo
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Department of Biophysics and Structural Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Mingwei Liu
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Department of Biophysics and Structural Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Lei Liu
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, United States
| | - Lanlan Yu
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Department of Biophysics and Structural Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Chenxuan Wang
- State Key Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Department of Biophysics and Structural Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| |
Collapse
|
19
|
Bailey JB, Tezcan FA. Tunable and Cooperative Thermomechanical Properties of Protein-Metal-Organic Frameworks. J Am Chem Soc 2020; 142:17265-17270. [PMID: 32972136 DOI: 10.1021/jacs.0c07835] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
We recently introduced protein-metal-organic frameworks (protein-MOFs) as chemically designed protein crystals, composed of ferritin nodes that predictably assemble into 3D lattices upon coordination of various metal ions and ditopic, hydroxamate-based linkers. Owing to their unique tripartite construction, protein-MOFs possess extremely sparse lattice connectivity, suggesting that they might display unusual thermomechanical properties. Leveraging the synthetic modularity of ferritin-MOFs, we investigated the temperature-dependent structural dynamics of six distinct frameworks. Our results show that the thermostabilities of ferritin-MOFs can be tuned through the metal component or the presence of crowding agents. Our studies also reveal a framework that undergoes a reversible and isotropic first-order phase transition near-room temperature, corresponding to a 4% volumetric change within 1 °C and a hysteresis window of ∼10 °C. This highly cooperative crystal-to-crystal transformation, which stems from the soft crystallinity of ferritin-MOFs, illustrates the advantage of modular construction strategies in discovering tunable-and unpredictable-material properties.
Collapse
Affiliation(s)
- Jake B Bailey
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California 92093, United States
| | - F Akif Tezcan
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California 92093, United States.,Materials Science and Engineering, University of California, San Diego, La Jolla, California 92093, United States
| |
Collapse
|
20
|
Jasaitis L, Silver CD, Rawlings AE, Peters DT, Whelan F, Regan L, Pasquina-Lemonche L, Potts JR, Johnson SD, Staniland SS. Rational Design and Self-Assembly of Coiled-Coil Linked SasG Protein Fibrils. ACS Synth Biol 2020; 9:1599-1607. [PMID: 32551507 DOI: 10.1021/acssynbio.0c00156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Protein engineering is an attractive approach for the self-assembly of nanometer-scale architectures for a range of potential nanotechnologies. Using the versatile chemistry provided by protein folding and assembly, coupled with amino acid side-chain functionality, allows for the construction of precise molecular "protein origami" hierarchical patterned structures for a range of nanoapplications such as stand-alone enzymatic pathways and molecular machines. The Staphyloccocus aureus surface protein SasG is a rigid, rod-like structure shown to have high mechanical strength due to "clamp-like" intradomain features and a stabilizing interface between the G5 and E domains, making it an excellent building block for molecular self-assembly. Here we characterize a new two subunit system composed of the SasG rod protein genetically conjugated with de novo designed coiled-coils, resulting in the self-assembly of fibrils. Circular dichroism (CD) and quartz-crystal microbalance with dissipation (QCM-D) are used to show the specific, alternating binding between the two subunits. Furthermore, we use atomic force microscopy (AFM) to study the extent of subunit polymerization in a liquid environment, demonstrating self-assembly culminating in the formation of linear macromolecular fibrils.
Collapse
Affiliation(s)
- Lukas Jasaitis
- Department of Chemistry, University of Sheffield, Sheffield S3 7HF, United Kingdom
| | - Callum D. Silver
- Department of Electronic Engineering, University of York, York YO10 5DD, United Kingdom
| | - Andrea E. Rawlings
- Department of Chemistry, University of Sheffield, Sheffield S3 7HF, United Kingdom
| | - Daniel T. Peters
- Biosciences Institute, Newcastle University, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - Fiona Whelan
- School of Biological Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Lynne Regan
- Institute for Quantitative Biology, Biochemistry and Biotechnology, University of Edinburgh, Edinburgh EH9 3JU, Scotland
| | - Laia Pasquina-Lemonche
- Department of Physics and Astronomy, University of Sheffield, Sheffield S3 7HF, United Kingdom
| | - Jennifer R. Potts
- School of Life and Environmental Science, University of Sydney, Sydney, NSW 2006, Australia
- Department of Biology, University of York, York YO10 5DD, United Kingdom
| | - Steven D. Johnson
- Department of Electronic Engineering, University of York, York YO10 5DD, United Kingdom
| | - Sarah S. Staniland
- Department of Chemistry, University of Sheffield, Sheffield S3 7HF, United Kingdom
| |
Collapse
|
21
|
Roy AA, Dhawanjewar AS, Sharma P, Singh G, Madhusudhan MS. Protein Interaction Z Score Assessment (PIZSA): an empirical scoring scheme for evaluation of protein-protein interactions. Nucleic Acids Res 2020; 47:W331-W337. [PMID: 31114890 PMCID: PMC6602501 DOI: 10.1093/nar/gkz368] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Revised: 04/24/2019] [Accepted: 05/15/2019] [Indexed: 11/24/2022] Open
Abstract
Our web server, PIZSA (http://cospi.iiserpune.ac.in/pizsa), assesses the likelihood of protein–protein interactions by assigning a Z Score computed from interface residue contacts. Our score takes into account the optimal number of atoms that mediate the interaction between pairs of residues and whether these contacts emanate from the main chain or side chain. We tested the score on 174 native interactions for which 100 decoys each were constructed using ZDOCK. The native structure scored better than any of the decoys in 146 cases and was able to rank within the 95th percentile in 162 cases. This easily outperforms a competing method, CIPS. We also benchmarked our scoring scheme on 15 targets from the CAPRI dataset and found that our method had results comparable to that of CIPS. Further, our method is able to analyse higher order protein complexes without the need to explicitly identify chains as receptors or ligands. The PIZSA server is easy to use and could be used to score any input three-dimensional structure and provide a residue pair-wise break up of the results. Attractively, our server offers a platform for users to upload their own potentials and could serve as an ideal testing ground for this class of scoring schemes.
Collapse
Affiliation(s)
- Ankit A Roy
- Indian Institute of Science Education and Research, Pune, Dr Homi Bhabha Road, Pashan, Pune 411008, India
| | - Abhilesh S Dhawanjewar
- Indian Institute of Science Education and Research, Pune, Dr Homi Bhabha Road, Pashan, Pune 411008, India.,presently at School of Biological Sciences, University of Nebraska, Lincoln, NE 68588, USA
| | - Parichit Sharma
- Indian Institute of Science Education and Research, Pune, Dr Homi Bhabha Road, Pashan, Pune 411008, India.,presently at School of Informatics, Computing & Engineering, Department of Computer Science, Indiana University, Bloomington, IN 47408, USA
| | - Gulzar Singh
- Indian Institute of Science Education and Research, Pune, Dr Homi Bhabha Road, Pashan, Pune 411008, India
| | - M S Madhusudhan
- Indian Institute of Science Education and Research, Pune, Dr Homi Bhabha Road, Pashan, Pune 411008, India
| |
Collapse
|
22
|
Nerattini F, Figliuzzi M, Cardelli C, Tubiana L, Bianco V, Dellago C, Coluzza I. Identification of Protein Functional Regions. Chemphyschem 2020; 21:335-347. [PMID: 31944517 DOI: 10.1002/cphc.201900898] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 11/01/2019] [Indexed: 11/12/2022]
Abstract
Protein sequence stores the information relative to both functionality and stability, thus making it difficult to disentangle the two contributions. However, the identification of critical residues for function and stability has important implications for the mapping of the proteome interactions, as well as for many pharmaceutical applications, e. g. the identification of ligand binding regions for targeted pharmaceutical protein design. In this work, we propose a computational method to identify critical residues for protein functionality and stability and to further categorise them in strictly functional, structural and intermediate. We evaluate single site conservation and use Direct Coupling Analysis (DCA) to identify co-evolved residues both in natural and artificial evolution processes. We reproduce artificial evolution using protein design and base our approach on the hypothesis that artificial evolution in the absence of any functional constraint would exclusively lead to site conservation and co-evolution events of the structural type. Conversely, natural evolution intrinsically embeds both functional and structural information. By comparing the lists of conserved and co-evolved residues, outcomes of the analysis on natural and artificial evolution, we identify the functional residues without the need of any a priori knowledge of the biological role of the analysed protein.
Collapse
Affiliation(s)
- Francesca Nerattini
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Matteo Figliuzzi
- Sorbonne Universites, UPMC, Institut de Biologie Paris-Seine, CNRS, Laboratoire de Biologie Computationnelle et Quantitative UMR, 7238, Paris, France
| | - Chiara Cardelli
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Luca Tubiana
- Physics Department, Universitá degli studi di Trento, via Sommarive 14, 38123, Trento, IT
| | - Valentino Bianco
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria.,Faculty of Chemistry, Chemical Physics Department, Universidad Complutense de Madrid, Plaza de las Ciencias, Ciudad Universitaria, Madrid, 28040, Spain
| | - Christoph Dellago
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Ivan Coluzza
- CIC biomaGUNE, Paseo Miramon 182, 20014 San Sebastian, Spain, and IKERBASQUE, Basque Foundation for Science, 48013, Bilbao, Spain
| |
Collapse
|
23
|
Nerattini F, Tubiana L, Cardelli C, Bianco V, Dellago C, Coluzza I. Protein design under competing conditions for the availability of amino acids. Sci Rep 2020; 10:2684. [PMID: 32060385 PMCID: PMC7021711 DOI: 10.1038/s41598-020-59401-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Accepted: 12/08/2019] [Indexed: 11/09/2022] Open
Abstract
Isolating the properties of proteins that allow them to convert sequence into the structure is a long-lasting biophysical problem. In particular, studies focused extensively on the effect of a reduced alphabet size on the folding properties. However, the natural alphabet is a compromise between versatility and optimisation of the available resources. Here, for the first time, we include the impact of the relative availability of the amino acids to extract from the 20 letters the core necessary for protein stability. We present a computational protein design scheme that involves the competition for resources between a protein and a potential interaction partner that, additionally, gives us the chance to investigate the effect of the reduced alphabet on protein-protein interactions. We devise a scheme that automatically identifies the optimal reduced set of letters for the design of the protein, and we observe that even alphabets reduced down to 4 letters allow for single protein folding. However, it is only with 6 letters that we achieve optimal folding, thus recovering experimental observations. Additionally, we notice that the binding between the protein and a potential interaction partner could not be avoided with the investigated reduced alphabets. Therefore, we suggest that aggregation could have been a driving force in the evolution of the large protein alphabet.
Collapse
Affiliation(s)
- Francesca Nerattini
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Luca Tubiana
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Chiara Cardelli
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Valentino Bianco
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Christoph Dellago
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Ivan Coluzza
- Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo Miramon 182, 20014, San Sebastian, Spain. .,IKERBASQUE, Basque Foundation for Science, 48013, Bilbao, Spain.
| |
Collapse
|
24
|
Kaushik AC, Mehmood A, Khan MT, Kumar A, Dai X, Wei DQ. RETRACTED ARTICLE: Protein blueprint and their interactions while approachability struggle for amino acids. J Biomol Struct Dyn 2020; 39:i-ix. [PMID: 31914855 DOI: 10.1080/07391102.2020.1713894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
| | - Aamir Mehmood
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Muhammad Tahir Khan
- Department of Bioinformatics and Biosciences, Capital University of Science and Technology, Islamabad, Pakistan
| | - Ajay Kumar
- Institute of Biomedical Sciences, National Sun Yat-Sen University, Kaohsiung City, Taiwan
| | - Xiaofeng Dai
- Wuxi School of Medicine, Jiangnan University, Wuxi, China
| | - Dong-Qing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
25
|
Abstract
Background Protein-protein docking is a valuable computational approach for investigating protein-protein interactions. Shape complementarity is the most basic component of a scoring function and plays an important role in protein-protein docking. Despite significant progresses, shape representation remains an open question in the development of protein-protein docking algorithms, especially for grid-based docking approaches. Results We have proposed a new pairwise shape-based scoring function (LSC) for protein-protein docking which adopts an exponential form to take into account long-range interactions between protein atoms. The LSC scoring function was incorporated into our FFT-based docking program and evaluated for both bound and unbound docking on the protein docking benchmark 4.0. It was shown that our LSC achieved a significantly better performance than four other similar docking methods, ZDOCK 2.1, MolFit/G, GRAMM, and FTDock/G, in both success rate and number of hits. When considering the top 10 predictions, LSC obtained a success rate of 51.71% and 6.82% for bound and unbound docking, respectively, compared to 42.61% and 4.55% for the second-best program ZDOCK 2.1. LSC also yielded an average of 8.38 and 3.94 hits per complex in the top 1000 predictions for bound and unbound docking, respectively, followed by 6.38 and 2.96 hits for the second-best ZDOCK 2.1. Conclusions The present LSC method will not only provide an initial-stage docking approach for post-docking processes but also have a general implementation for accurate representation of other energy terms on grids in protein-protein docking. The software has been implemented in our HDOCK web server at http://hdock.phys.hust.edu.cn/.
Collapse
|
26
|
Sinha NJ, Wu D, Kloxin CJ, Saven JG, Jensen GV, Pochan DJ. Polyelectrolyte character of rigid rod peptide bundlemer chains constructed via hierarchical self-assembly. SOFT MATTER 2019; 15:9858-9870. [PMID: 31738361 DOI: 10.1039/c9sm01894h] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Short α-helical peptides were computationally designed to self-assemble into robust coiled coils that are antiparallel, homotetrameric bundles. These peptide bundle units, or 'bundlemers', have been utilized as anisotropic building blocks to construct bundlemer-based polymers via a hierarchical, hybrid physical-covalent assembly pathway. The bundlemer chains were constructed using short linker connections via 'click' chemistry reactions between the N-termini of bundlemer constituent peptides. The resulting bundlemer chains appear as extremely rigid, cylindrical rods in transmission electron microscopy (TEM) images. Small angle neutron scattering (SANS) shows that these bundlemer chains exist as individual rods in solution with a cross-section that is equal to that of a single coiled coil bundlemer building block of ≈20 Å. SANS further confirms that the interparticle solution structure of the rigid rod bundlemer chains is heterogeneous and responsive to solution conditions, such as ionic-strength and pH. Due to their peptidic constitution, the bundlemer assemblies behave like polyelectrolytes that carry an average charge density of approximately 3 charges per bundlemer as determined from SANS structure factor data fitting, which describes the repulsion between charged rods in solution. This repulsion manifests as a correlation hole in the scattering profile that is suppressed by dilution or addition of salt. Presence of rod cluster aggregates with a mass fractal dimension of ≈2.5 is also confirmed across all samples. The formation of such dense, fractal-like cluster aggregates in a solution of net repulsive rods is a unique example of the subtle balance between short-range attraction and long-rage repulsion interactions in proteins and other biomaterials. With computational control of constituent peptide sequences, it is further possible to deconvolute the underlying sequence driven structure-property relationships in the modular bundlemer chains.
Collapse
Affiliation(s)
- Nairiti J Sinha
- Department of Materials Science and Engineering, University of Delaware, Newark, DE, USA.
| | | | | | | | | | | |
Collapse
|
27
|
Siebenmorgen T, Zacharias M. Computational prediction of protein–protein binding affinities. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1448] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
- Till Siebenmorgen
- Physics Department T38 Technical University of Munich Garching Germany
| | - Martin Zacharias
- Physics Department T38 Technical University of Munich Garching Germany
| |
Collapse
|
28
|
Choi Y, Furlon JM, Amos RB, Griswold KE, Bailey-Kellogg C. DisruPPI: structure-based computational redesign algorithm for protein binding disruption. Bioinformatics 2019; 34:i245-i253. [PMID: 29949961 PMCID: PMC6022686 DOI: 10.1093/bioinformatics/bty274] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Motivation Disruption of protein–protein interactions can mitigate antibody recognition of therapeutic proteins, yield monomeric forms of oligomeric proteins, and elucidate signaling mechanisms, among other applications. While designing affinity-enhancing mutations remains generally quite challenging, both statistically and physically based computational methods can precisely identify affinity-reducing mutations. In order to leverage this ability to design variants of a target protein with disrupted interactions, we developed the DisruPPI protein design method (DISRUpting Protein–Protein Interactions) to optimize combinations of mutations simultaneously for both disruption and stability, so that incorporated disruptive mutations do not inadvertently affect the target protein adversely. Results Two existing methods for predicting mutational effects on binding, FoldX and INT5, were demonstrated to be quite precise in selecting disruptive mutations from the SKEMPI and AB-Bind databases of experimentally determined changes in binding free energy. DisruPPI was implemented to use an INT5-based disruption score integrated with an AMBER-based stability assessment and was applied to disrupt protein interactions in a set of different targets representing diverse applications. In retrospective evaluation with three different case studies, comparison of DisruPPI-designed variants to published experimental data showed that DisruPPI was able to identify more diverse interaction-disrupting and stability-preserving variants more efficiently and effectively than previous approaches. In prospective application to an interaction between enhanced green fluorescent protein (EGFP) and a nanobody, DisruPPI was used to design five EGFP variants, all of which were shown to have significantly reduced nanobody binding while maintaining function and thermostability. This demonstrates that DisruPPI may be readily utilized for effective removal of known epitopes of therapeutically relevant proteins. Availability and implementation DisruPPI is implemented in the EpiSweep package, freely available under an academic use license. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yoonjoo Choi
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
| | - Jacob M Furlon
- Thayer School of Engineering, Dartmouth, Hanover, NH, USA
| | - Ryan B Amos
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Karl E Griswold
- Thayer School of Engineering, Dartmouth, Hanover, NH, USA.,Norris Cotton Cancer Center at Dartmouth, Lebanon, NH, USA.,Department of Biological Sciences, Dartmouth, Hanover, NH, USA
| | | |
Collapse
|
29
|
Abstract
Proteins are a class of nanoscale building block with remarkable chemical complexity and sophistication: their diverse functions, shapes, and symmetry as well as atomically monodisperse structures far surpass the range of conventional nanoparticles that can be accessed synthetically. The chemical topologies of proteins that drive their assembly into materials are central to their functions in nature. However, despite the importance of protein materials in biology, efforts to harness these building blocks synthetically to engineer new materials have been impeded by the chemical complexity of protein surfaces, making it difficult to reliably design protein building blocks that can be robustly transformed into targeted materials. Here we describe our work aimed at exploiting a simple but important concept: if one could exchange complex protein-protein interactions with well-defined and programmable DNA-DNA interactions, one could control the assembly of proteins into structurally well-defined oligomeric and polymeric materials and three-dimensional crystals. As a class of nanoscale building block, proteins with surface DNA modifications have a vast design space that exceeds what is practically and conceptually possible with their inorganic counterparts: the sequences of the DNA and protein and the chemical nature and position of DNA attachment all play roles in dictating the assembly behavior of protein-DNA conjugates. We summarize how each of these design parameters can influence structural outcome, beginning with proteins with a single surface DNA modification, where energy barriers between protein monomers can be tuned through the sequence and secondary structure of the oligonucleotide. We then explore challenges and progress in designing directional interactions and valency on protein surfaces. The directional binding properties of protein-DNA conjugates are ultimately imposed by the amino acid sequence of the protein, which defines the spatial distribution of DNA modification sites on the protein. Through careful design and mutagenesis, bivalent building blocks that bind directionally to form one-dimensional assemblies can be realized. Finally, we discuss the assembly of proteins densely modified with DNA into crystalline superlattices. At first glance, these protein building blocks display crystallization behavior remarkably similar to that of their DNA-functionalized inorganic nanoparticle counterparts, which allows design principles elucidated for DNA-guided nanoparticle crystallization to be used as predictive tools in determining structural outcomes in protein systems. Proteins additionally offer design handles that nanoparticles do not: unlike nanoparticles, the number and spatial distribution of DNA can be controlled through the protein sequence and DNA modification chemistry. Changing the spatial distributions of DNA can drive otherwise identical proteins down distinct crystallization pathways and yield building blocks with exotic distributions of DNA that crystallize into structures that are not yet attainable using isotropically functionalized particles. We highlight challenges in accessing other classes of architectures and establishing general design rules for DNA-mediated protein assembly. Harnessing surface DNA modifications to build protein materials creates many opportunities to realize new architectures and answer fundamental questions about DNA-modified nanostructures in both materials and biological contexts. Proteins with surface DNA modifications are a powerful class of nanomaterial building blocks for which the DNA and protein sequences and the nature of their conjugation dictate the material structure.
Collapse
|
30
|
Nerattini F, Tubiana L, Cardelli C, Bianco V, Dellago C, Coluzza I. Design of Protein–Protein Binding Sites Suggests a Rationale for Naturally Occurring Contact Areas. J Chem Theory Comput 2018; 15:1383-1392. [DOI: 10.1021/acs.jctc.8b00667] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Francesca Nerattini
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090 Vienna, Austria
| | - Luca Tubiana
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090 Vienna, Austria
| | - Chiara Cardelli
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090 Vienna, Austria
| | - Valentino Bianco
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090 Vienna, Austria
| | - Christoph Dellago
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090 Vienna, Austria
| | - Ivan Coluzza
- CIC biomaGUNE, Paseo Miramon 182, 20014 San Sebastian, Spain
- IKERBASQUE,
Basque Foundation for Science, 48013 Bilbao, Spain
| |
Collapse
|
31
|
Blacklock KM, Yang L, Mulligan VK, Khare SD. A computational method for the design of nested proteins by loop-directed domain insertion. Proteins 2018; 86:354-369. [PMID: 29250820 DOI: 10.1002/prot.25445] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 12/04/2017] [Accepted: 12/15/2017] [Indexed: 12/23/2022]
Abstract
The computational design of novel nested proteins-in which the primary structure of one protein domain (insert) is flanked by the primary structure segments of another (parent)-would enable the generation of multifunctional proteins. Here we present a new algorithm, called Loop-Directed Domain Insertion (LooDo), implemented within the Rosetta software suite, for the purpose of designing nested protein domain combinations connected by flexible linker regions. Conformational space for the insert domain is sampled using large libraries of linker fragments for linker-to-parent domain superimposition followed by insert-to-linker superimposition. The relative positioning of the two domains (treated as rigid bodies) is sampled efficiently by a grid-based, mutual placement compatibility search. The conformations of the loop residues, and the identities of loop as well as interface residues, are simultaneously optimized using a generalized kinematic loop closure algorithm and Rosetta EnzymeDesign, respectively, to minimize interface energy. The algorithm was found to consistently sample near-native conformations and interface sequences for a benchmark set of structurally similar but functionally divergent domain-inserted enzymes from the α/β hydrolase superfamily, and discriminates well between native and nonnative conformations and sequences, although loop conformations tended to deviate from the native conformations. Furthermore, in cross-domain placement tests, native insert-parent domain combinations were ranked as the best-scoring structures compared to nonnative domain combinations. This algorithm should be broadly applicable to the design of multi-domain protein complexes with any combination of inserted or tandem domain connections.
Collapse
Affiliation(s)
- Kristin M Blacklock
- Institute for Quantitative Biomedicine, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Department of Chemistry and Chemical Biology, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Center for Integrative Proteomics Research, Rutgers The State University of New Jersey, Piscataway, New Jersey
| | - Lu Yang
- Department of Chemistry and Chemical Biology, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Center for Integrative Proteomics Research, Rutgers The State University of New Jersey, Piscataway, New Jersey
| | - Vikram K Mulligan
- Institute for Protein Design and Department of Biochemistry, University of Washington, Seattle, Washington
| | - Sagar D Khare
- Institute for Quantitative Biomedicine, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Department of Chemistry and Chemical Biology, Rutgers The State University of New Jersey, Piscataway, New Jersey.,Center for Integrative Proteomics Research, Rutgers The State University of New Jersey, Piscataway, New Jersey
| |
Collapse
|
32
|
Coluzza I. Computational protein design: a review. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2017; 29:143001. [PMID: 28140371 DOI: 10.1088/1361-648x/aa5c76] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Proteins are one of the most versatile modular assembling systems in nature. Experimentally, more than 110 000 protein structures have been identified and more are deposited every day in the Protein Data Bank. Such an enormous structural variety is to a first approximation controlled by the sequence of amino acids along the peptide chain of each protein. Understanding how the structural and functional properties of the target can be encoded in this sequence is the main objective of protein design. Unfortunately, rational protein design remains one of the major challenges across the disciplines of biology, physics and chemistry. The implications of solving this problem are enormous and branch into materials science, drug design, evolution and even cryptography. For instance, in the field of drug design an effective computational method to design protein-based ligands for biological targets such as viruses, bacteria or tumour cells, could give a significant boost to the development of new therapies with reduced side effects. In materials science, self-assembly is a highly desired property and soon artificial proteins could represent a new class of designable self-assembling materials. The scope of this review is to describe the state of the art in computational protein design methods and give the reader an outline of what developments could be expected in the near future.
Collapse
Affiliation(s)
- Ivan Coluzza
- Computational Physics, Faculty of Physics, University of Vienna, Vienna, Austria
| |
Collapse
|
33
|
Hasani HJ, Barakat KH. Protein-Protein Docking. PHARMACEUTICAL SCIENCES 2017. [DOI: 10.4018/978-1-5225-1762-7.ch042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Protein-protein docking algorithms are powerful computational tools, capable of analyzing the protein-protein interactions at the atomic-level. In this chapter, we will review the theoretical concepts behind different protein-protein docking algorithms, highlighting their strengths as well as their limitations and pointing to important case studies for each method. The methods we intend to cover in this chapter include various search strategies and scoring techniques. This includes exhaustive global search, fast Fourier transform search, spherical Fourier transform-based search, direct search in Cartesian space, local shape feature matching, geometric hashing, genetic algorithm, randomized search, and Monte Carlo search. We will also discuss the different ways that have been used to incorporate protein flexibility within the docking procedure and some other future directions in this field, suggesting possible ways to improve the different methods.
Collapse
|
34
|
Abstract
Computational protein design (CPD) has established itself as a leading field in basic and applied science with a strong coupling between the two. Proteins are computationally designed from the level of amino acids to the level of a functional protein complex. Design targets range from increased thermo- (or other) stability to specific requested reactions such as protein-protein binding, enzymatic reactions, or nanotechnology applications. The design scheme may encompass small regions of the proteins or the entire protein. In either case, the design may aim at the side-chains or at the full backbone conformation. Herein, the main framework for the process is outlined highlighting key elements in the CPD iterative cycle. These include the very definition of CPD, the diverse goals of CPD, components of the CPD protocol, methods for searching sequence and structure space, scoring functions, and augmenting the CPD with other optimization tools. Taken together, this chapter aims to introduce the framework of CPD.
Collapse
Affiliation(s)
- Ilan Samish
- Department of Plants and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel.
- Department of Biotechnology Engineering, Braude Academic College of Engineering, Karmiel, Israel.
- Amai Proteins Ltd., Ashdod, Israel.
| |
Collapse
|
35
|
Luo Q, Hou C, Bai Y, Wang R, Liu J. Protein Assembly: Versatile Approaches to Construct Highly Ordered Nanostructures. Chem Rev 2016; 116:13571-13632. [PMID: 27587089 DOI: 10.1021/acs.chemrev.6b00228] [Citation(s) in RCA: 357] [Impact Index Per Article: 44.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Nature endows life with a wide variety of sophisticated, synergistic, and highly functional protein assemblies. Following Nature's inspiration to assemble protein building blocks into exquisite nanostructures is emerging as a fascinating research field. Dictating protein assembly to obtain highly ordered nanostructures and sophisticated functions not only provides a powerful tool to understand the natural protein assembly process but also offers access to advanced biomaterials. Over the past couple of decades, the field of protein assembly has undergone unexpected and rapid developments, and various innovative strategies have been proposed. This Review outlines recent advances in the field of protein assembly and summarizes several strategies, including biotechnological strategies, chemical strategies, and combinations of these approaches, for manipulating proteins to self-assemble into desired nanostructures. The emergent applications of protein assemblies as versatile platforms to design a wide variety of attractive functional materials with improved performances have also been discussed. The goal of this Review is to highlight the importance of this highly interdisciplinary field and to promote its growth in a diverse variety of research fields ranging from nanoscience and material science to synthetic biology.
Collapse
Affiliation(s)
- Quan Luo
- State Key Laboratory of Supramolecular Structure and Materials, College of Chemistry, Jilin University , 2699 Qianjin Street, Changchun 130012, P. R. China
| | - Chunxi Hou
- State Key Laboratory of Supramolecular Structure and Materials, College of Chemistry, Jilin University , 2699 Qianjin Street, Changchun 130012, P. R. China
| | - Yushi Bai
- State Key Laboratory of Supramolecular Structure and Materials, College of Chemistry, Jilin University , 2699 Qianjin Street, Changchun 130012, P. R. China
| | - Ruibing Wang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau , Taipa, Macau SAR 999078, China
| | - Junqiu Liu
- State Key Laboratory of Supramolecular Structure and Materials, College of Chemistry, Jilin University , 2699 Qianjin Street, Changchun 130012, P. R. China
| |
Collapse
|
36
|
Franceus J, Verhaeghe T, Desmet T. Correlated positions in protein evolution and engineering. J Ind Microbiol Biotechnol 2016; 44:687-695. [PMID: 27514664 DOI: 10.1007/s10295-016-1811-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Accepted: 07/30/2016] [Indexed: 12/22/2022]
Abstract
Statistical analysis of a protein multiple sequence alignment can reveal groups of positions that undergo interdependent mutations throughout evolution. At these so-called correlated positions, only certain combinations of amino acids appear to be viable for maintaining proper folding, stability, catalytic activity or specificity. Therefore, it is often speculated that they could be interesting guides for semi-rational protein engineering purposes. Because they are a fingerprint from protein evolution, their analysis may provide valuable insight into a protein's structure or function and furthermore, they may also be suitable target positions for mutagenesis. Unfortunately, little is currently known about the properties of these correlation networks and how they should be used in practice. This review summarises the recent findings, opportunities and pitfalls of the concept.
Collapse
Affiliation(s)
- Jorick Franceus
- Department of Biochemical and Microbial Technology, Centre for Industrial Biotechnology and Biocatalysis, Ghent University, Coupure Links 653, 9000, Ghent, Belgium
| | - Tom Verhaeghe
- Department of Biochemical and Microbial Technology, Centre for Industrial Biotechnology and Biocatalysis, Ghent University, Coupure Links 653, 9000, Ghent, Belgium
| | - Tom Desmet
- Department of Biochemical and Microbial Technology, Centre for Industrial Biotechnology and Biocatalysis, Ghent University, Coupure Links 653, 9000, Ghent, Belgium.
| |
Collapse
|
37
|
Surya S, Abhilash J, Geethanandan K, Sadasivan C, Haridas M. A profile of protein-protein interaction: Crystal structure of a lectin-lectin complex. Int J Biol Macromol 2016; 87:529-36. [DOI: 10.1016/j.ijbiomac.2016.02.081] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2016] [Revised: 02/29/2016] [Accepted: 02/29/2016] [Indexed: 10/22/2022]
|
38
|
Hosseinzadeh P, Lu Y. Design and fine-tuning redox potentials of metalloproteins involved in electron transfer in bioenergetics. BIOCHIMICA ET BIOPHYSICA ACTA 2016; 1857:557-581. [PMID: 26301482 PMCID: PMC4761536 DOI: 10.1016/j.bbabio.2015.08.006] [Citation(s) in RCA: 114] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2015] [Accepted: 08/20/2015] [Indexed: 12/25/2022]
Abstract
Redox potentials are a major contributor in controlling the electron transfer (ET) rates and thus regulating the ET processes in the bioenergetics. To maximize the efficiency of the ET process, one needs to master the art of tuning the redox potential, especially in metalloproteins, as they represent major classes of ET proteins. In this review, we first describe the importance of tuning the redox potential of ET centers and its role in regulating the ET in bioenergetic processes including photosynthesis and respiration. The main focus of this review is to summarize recent work in designing the ET centers, namely cupredoxins, cytochromes, and iron-sulfur proteins, and examples in design of protein networks involved these ET centers. We then discuss the factors that affect redox potentials of these ET centers including metal ion, the ligands to metal center and interactions beyond the primary ligand, especially non-covalent secondary coordination sphere interactions. We provide examples of strategies to fine-tune the redox potential using both natural and unnatural amino acids and native and nonnative cofactors. Several case studies are used to illustrate recent successes in this area. Outlooks for future endeavors are also provided. This article is part of a Special Issue entitled Biodesign for Bioenergetics--the design and engineering of electronic transfer cofactors, proteins and protein networks, edited by Ronald L. Koder and J.L. Ross Anderson.
Collapse
Affiliation(s)
- Parisa Hosseinzadeh
- Department of Chemistry and Department of Biochemistry, University of Illinois at Urbana-Champaign, 600 S. Mathews St., Urbana, IL, 61801, USA
| | - Yi Lu
- Department of Chemistry and Department of Biochemistry, University of Illinois at Urbana-Champaign, 600 S. Mathews St., Urbana, IL, 61801, USA.
| |
Collapse
|
39
|
Moreira IS, Fernandes PA, Ramos MJ. Computational alanine scanning mutagenesis--an improved methodological approach. J Comput Chem 2016; 28:644-54. [PMID: 17195156 DOI: 10.1002/jcc.20566] [Citation(s) in RCA: 185] [Impact Index Per Article: 23.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Alanine scanning mutagenesis of protein-protein interfacial residues can be applied to a wide variety of protein complexes to understand the structural and energetic characteristics of the hot-spots. Binding free energies have been estimated with reasonable accuracy with empirical methods, such as Molecular Mechanics/Poisson-Boltzmann surface area (MM-PBSA), and with more rigorous computational approaches like Free Energy Perturbation (FEP) and Thermodynamic Integration (TI). The main objective of this work is the development of an improved methodological approach, with less computational cost, that predicts accurately differences in binding free energies between the wild-type and alanine mutated complexes (DeltaDeltaG(binding)). The method was applied to three complexes, and a mean unsigned error of 0.80 kcal/mol was obtained in a set of 46 mutations. The computational method presented here achieved an overall success rate of 80% and an 82% success rate in residues for which alanine mutation causes an increase in the binding free energy > 2.0 kcal/mol (warm- and hot-spots). This fully atomistic computational methodological approach consists in a computational Molecular Dynamics simulation protocol performed in a continuum medium using the Generalized Born model. A set of three different internal dielectric constants, to mimic the different degree of relaxation of the interface when different types of amino acids are mutated for alanine, have to be used for the proteins, depending on the type of amino acid that is mutated. This method permits a systematic scanning mutagenesis of protein-protein interfaces and it is capable of anticipating the experimental results of mutagenesis, thus guiding new experimental investigations.
Collapse
Affiliation(s)
- Irina S Moreira
- REQUIMTE/Departamento de Química, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre 687, 4169-007 Porto, Portugal
| | | | | |
Collapse
|
40
|
Islam MA, Bhayye S, Adeniyi AA, Soliman ME, Pillay TS. Diabetes mellitus caused by mutations in human insulin: analysis of impaired receptor binding of insulins Wakayama, Los Angeles and Chicago using pharmacoinformatics. J Biomol Struct Dyn 2016; 35:724-737. [DOI: 10.1080/07391102.2016.1160258] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Affiliation(s)
- Md Ataul Islam
- Faculty of Health Sciences, Department of Chemical Pathology, & Institute of Cellular & Molecular Medicine, University of Pretoria and National Health Laboratory Service Tshwane Academic Division, Pretoria, South Africa
| | - Sagar Bhayye
- Department of Chemical Technology, University of Calcutta, 92, A. P. C. Road, Kolkata 700009, India
| | - Adebayo A. Adeniyi
- Discipline of Pharmaceutical Sciences, School of Health Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| | - Mahmoud E.S. Soliman
- Discipline of Pharmaceutical Sciences, School of Health Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| | - Tahir S. Pillay
- Faculty of Health Sciences, Department of Chemical Pathology, & Institute of Cellular & Molecular Medicine, University of Pretoria and National Health Laboratory Service Tshwane Academic Division, Pretoria, South Africa
- Division of Chemical Pathology, University of Cape Town, Cape Town, South Africa
| |
Collapse
|
41
|
Pakulska MM, Miersch S, Shoichet MS. Designer protein delivery: From natural to engineered affinity-controlled release systems. Science 2016; 351:aac4750. [PMID: 26989257 DOI: 10.1126/science.aac4750] [Citation(s) in RCA: 107] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Exploiting binding affinities between molecules is an established practice in many fields, including biochemical separations, diagnostics, and drug development; however, using these affinities to control biomolecule release is a more recent strategy. Affinity-controlled release takes advantage of the reversible nature of noncovalent interactions between a therapeutic protein and a binding partner to slow the diffusive release of the protein from a vehicle. This process, in contrast to degradation-controlled sustained-release formulations such as poly(lactic-co-glycolic acid) microspheres, is controlled through the strength of the binding interaction, the binding kinetics, and the concentration of binding partners. In the context of affinity-controlled release--and specifically the discovery or design of binding partners--we review advances in in vitro selection and directed evolution of proteins, peptides, and oligonucleotides (aptamers), aided by computational design.
Collapse
Affiliation(s)
- Malgosia M Pakulska
- Department of Chemical Engineering and Applied Chemistry, Institute of Biomaterials and Biomedical Engineering, and Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Shane Miersch
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Molly S Shoichet
- Department of Chemical Engineering and Applied Chemistry, Institute of Biomaterials and Biomedical Engineering, and Donnelly Centre, University of Toronto, Toronto, Ontario, Canada. Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
42
|
Brunk E, Mih N, Monk J, Zhang Z, O’Brien EJ, Bliven SE, Chen K, Chang RL, Bourne PE, Palsson BO. Systems biology of the structural proteome. BMC SYSTEMS BIOLOGY 2016; 10:26. [PMID: 26969117 PMCID: PMC4787049 DOI: 10.1186/s12918-016-0271-6] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2015] [Accepted: 02/16/2016] [Indexed: 12/19/2022]
Abstract
BACKGROUND The success of genome-scale models (GEMs) can be attributed to the high-quality, bottom-up reconstructions of metabolic, protein synthesis, and transcriptional regulatory networks on an organism-specific basis. Such reconstructions are biochemically, genetically, and genomically structured knowledge bases that can be converted into a mathematical format to enable a myriad of computational biological studies. In recent years, genome-scale reconstructions have been extended to include protein structural information, which has opened up new vistas in systems biology research and empowered applications in structural systems biology and systems pharmacology. RESULTS Here, we present the generation, application, and dissemination of genome-scale models with protein structures (GEM-PRO) for Escherichia coli and Thermotoga maritima. We show the utility of integrating molecular scale analyses with systems biology approaches by discussing several comparative analyses on the temperature dependence of growth, the distribution of protein fold families, substrate specificity, and characteristic features of whole cell proteomes. Finally, to aid in the grand challenge of big data to knowledge, we provide several explicit tutorials of how protein-related information can be linked to genome-scale models in a public GitHub repository ( https://github.com/SBRG/GEMPro/tree/master/GEMPro_recon/). CONCLUSIONS Translating genome-scale, protein-related information to structured data in the format of a GEM provides a direct mapping of gene to gene-product to protein structure to biochemical reaction to network states to phenotypic function. Integration of molecular-level details of individual proteins, such as their physical, chemical, and structural properties, further expands the description of biochemical network-level properties, and can ultimately influence how to model and predict whole cell phenotypes as well as perform comparative systems biology approaches to study differences between organisms. GEM-PRO offers insight into the physical embodiment of an organism's genotype, and its use in this comparative framework enables exploration of adaptive strategies for these organisms, opening the door to many new lines of research. With these provided tools, tutorials, and background, the reader will be in a position to run GEM-PRO for their own purposes.
Collapse
Affiliation(s)
- Elizabeth Brunk
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
- />Joint BioEnergy Institute, Emeryville, CA 94608 USA
| | - Nathan Mih
- />Bioinformatics and Systems Biology Program, University of California, La Jolla, San Diego, CA 92093 USA
| | - Jonathan Monk
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Zhen Zhang
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Edward J. O’Brien
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Spencer E. Bliven
- />Bioinformatics and Systems Biology Program, University of California, La Jolla, San Diego, CA 92093 USA
- />National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 USA
| | - Ke Chen
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Roger L. Chang
- />Department of Systems Biology, Harvard Medical School, Boston, MA 02115 USA
| | - Philip E. Bourne
- />Office of the Director, National Institutes of Health, Bethesda, MD 20894 USA
| | - Bernhard O. Palsson
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| |
Collapse
|
43
|
Sirin S, Apgar JR, Bennett EM, Keating AE. AB-Bind: Antibody binding mutational database for computational affinity predictions. Protein Sci 2016; 25:393-409. [PMID: 26473627 PMCID: PMC4815335 DOI: 10.1002/pro.2829] [Citation(s) in RCA: 87] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2015] [Revised: 10/09/2015] [Accepted: 10/12/2015] [Indexed: 12/26/2022]
Abstract
Antibodies (Abs) are a crucial component of the immune system and are often used as diagnostic and therapeutic agents. The need for high-affinity and high-specificity antibodies in research and medicine is driving the development of computational tools for accelerating antibody design and discovery. We report a diverse set of antibody binding data with accompanying structures that can be used to evaluate methods for modeling antibody interactions. Our Antibody-Bind (AB-Bind) database includes 1101 mutants with experimentally determined changes in binding free energies (ΔΔG) across 32 complexes. Using the AB-Bind data set, we evaluated the performance of protein scoring potentials in their ability to predict changes in binding free energies upon mutagenesis. Numerical correlations between computed and observed ΔΔG values were low (r = 0.16-0.45), but the potentials exhibited predictive power for classifying variants as improved vs weakened binders. Performance was evaluated using the area under the curve (AUC) for receiver operator characteristic (ROC) curves; the highest AUC values for 527 mutants with |ΔΔG| > 1.0 kcal/mol were 0.81, 0.87, and 0.88 using STATIUM, FoldX, and Discovery Studio scoring potentials, respectively. Some methods could also enrich for variants with improved binding affinity; FoldX and Discovery Studio were able to correctly rank 42% and 30%, respectively, of the 80 most improved binders (those with ΔΔG < -1.0 kcal/mol) in the top 5% of the database. This modest predictive performance has value but demonstrates the continuing need to develop and improve protein energy functions for affinity prediction.
Collapse
Affiliation(s)
- Sarah Sirin
- Department of BiologyMassachusetts Institute of TechnologyCambridgeMassachusetts02139
| | - James R. Apgar
- Global Biotherapeutics Technologies, Pfizer Inc610 Main StreetCambridgeMassachusetts02139
| | - Eric M. Bennett
- Global Biotherapeutics Technologies, Pfizer Inc610 Main StreetCambridgeMassachusetts02139
| | - Amy E. Keating
- Department of BiologyMassachusetts Institute of TechnologyCambridgeMassachusetts02139
- Department of Biological EngineeringMassachusetts Institute of TechnologyCambridgeMassachusetts02139
| |
Collapse
|
44
|
Polydorides S, Michael E, Mignon D, Druart K, Archontis G, Simonson T. Proteus and the Design of Ligand Binding Sites. Methods Mol Biol 2016; 1414:77-97. [PMID: 27094287 DOI: 10.1007/978-1-4939-3569-7_6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
This chapter describes the organization and use of Proteus, a multitool computational suite for the optimization of protein and ligand conformations and sequences, and the calculation of pK α shifts and relative binding affinities. The software offers the use of several molecular mechanics force fields and solvent models, including two generalized Born variants, and a large range of scoring functions, which can combine protein stability, ligand affinity, and ligand specificity terms, for positive and negative design. We present in detail the steps for structure preparation, system setup, construction of the interaction energy matrix, protein sequence and structure optimizations, pK α calculations, and ligand titration calculations. We discuss illustrative examples, including the chemical/structural optimization of a complex between the MHC class II protein HLA-DQ8 and the vinculin epitope, and the chemical optimization of the compstatin analog Ac-Val4Trp/His9Ala, which regulates the function of protein C3 of the complement system.
Collapse
Affiliation(s)
- Savvas Polydorides
- Theoretical and Computational Biophysics Group, Department of Physics, University of Cyprus, 1678, Nicosia, Cyprus
| | - Eleni Michael
- Theoretical and Computational Biophysics Group, Department of Physics, University of Cyprus, 1678, Nicosia, Cyprus
| | - David Mignon
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, 91128, Palaiseau, France
| | - Karen Druart
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, 91128, Palaiseau, France
| | - Georgios Archontis
- Theoretical and Computational Biophysics Group, Department of Physics, University of Cyprus, 1678, Nicosia, Cyprus.
| | - Thomas Simonson
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, 91128, Palaiseau, France.
| |
Collapse
|
45
|
Mozo-Villarías A, Cedano J, Querol E. Vector description of electric and hydrophobic interactions in protein homodimers. EUROPEAN BIOPHYSICS JOURNAL: EBJ 2015; 45:341-6. [DOI: 10.1007/s00249-015-1100-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Revised: 11/01/2015] [Accepted: 11/11/2015] [Indexed: 11/24/2022]
|
46
|
Brender JR, Zhang Y. Predicting the Effect of Mutations on Protein-Protein Binding Interactions through Structure-Based Interface Profiles. PLoS Comput Biol 2015; 11:e1004494. [PMID: 26506533 PMCID: PMC4624718 DOI: 10.1371/journal.pcbi.1004494] [Citation(s) in RCA: 99] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 08/06/2015] [Indexed: 11/18/2022] Open
Abstract
The formation of protein-protein complexes is essential for proteins to perform their physiological functions in the cell. Mutations that prevent the proper formation of the correct complexes can have serious consequences for the associated cellular processes. Since experimental determination of protein-protein binding affinity remains difficult when performed on a large scale, computational methods for predicting the consequences of mutations on binding affinity are highly desirable. We show that a scoring function based on interface structure profiles collected from analogous protein-protein interactions in the PDB is a powerful predictor of protein binding affinity changes upon mutation. As a standalone feature, the differences between the interface profile score of the mutant and wild-type proteins has an accuracy equivalent to the best all-atom potentials, despite being two orders of magnitude faster once the profile has been constructed. Due to its unique sensitivity in collecting the evolutionary profiles of analogous binding interactions and the high speed of calculation, the interface profile score has additional advantages as a complementary feature to combine with physics-based potentials for improving the accuracy of composite scoring approaches. By incorporating the sequence-derived and residue-level coarse-grained potentials with the interface structure profile score, a composite model was constructed through the random forest training, which generates a Pearson correlation coefficient >0.8 between the predicted and observed binding free-energy changes upon mutation. This accuracy is comparable to, or outperforms in most cases, the current best methods, but does not require high-resolution full-atomic models of the mutant structures. The binding interface profiling approach should find useful application in human-disease mutation recognition and protein interface design studies. Few proteins carry out their tasks in isolation. Instead, proteins combine with each other in complicated ways that can be affected by either the natural genetic variation that occurs among people or by disease causing mutations such as those that occur in cancer or in genetic disorders. To understand how these mutations affect our health, it is necessary to understand how mutations can affect the strength of the interactions that bind proteins together. This is a difficult task to do in a laboratory on a large scale and scientists are increasingly turning to computational methods to predict these effects in advance. We show that by looking at the multiple alignments of similar protein-protein complex structures at the interface regions, new constraints based on the evolution of the three dimensional structures of proteins can be made to predict which mutations are compatible with two proteins interacting and which are not.
Collapse
Affiliation(s)
- Jeffrey R. Brender
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
47
|
Shen Y, Radhakrishnan ML, Tidor B. Molecular mechanisms and design principles for promiscuous inhibitors to avoid drug resistance: lessons learned from HIV-1 protease inhibition. Proteins 2015; 83:351-72. [PMID: 25410041 PMCID: PMC4829108 DOI: 10.1002/prot.24730] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Revised: 10/14/2014] [Accepted: 11/06/2014] [Indexed: 11/16/2022]
Abstract
Molecular recognition is central to biology and ranges from highly selective to broadly promiscuous. The ability to modulate specificity at will is particularly important for drug development, and discovery of mechanisms contributing to binding specificity is crucial for our basic understanding of biology and for applications in health care. In this study, we used computational molecular design to create a large dataset of diverse small molecules with a range of binding specificities. We then performed structural, energetic, and statistical analysis on the dataset to study molecular mechanisms of achieving specificity goals. The work was done in the context of HIV‐1 protease inhibition and the molecular designs targeted a panel of wild‐type and drug‐resistant mutant HIV‐1 protease structures. The analysis focused on mechanisms for promiscuous binding to bind robustly even to resistance mutants. Broadly binding inhibitors tended to be smaller in size, more flexible in chemical structure, and more hydrophobic in nature compared to highly selective ones. Furthermore, structural and energetic analyses illustrated mechanisms by which flexible inhibitors achieved binding; we found ligand conformational adaptation near mutation sites and structural plasticity in targets through torsional flips of asymmetric functional groups to form alternative, compensatory packing interactions or hydrogen bonds. As no inhibitor bound to all variants, we designed small cocktails of inhibitors to do so and discovered that they often jointly covered the target set through mechanistic complementarity. Furthermore, using structural plasticity observed in experiments, and potentially in simulations, is suggested to be a viable means of designing adaptive inhibitors that are promiscuous binders. Proteins 2015; 83:351–372. © 2014 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yang Shen
- Department of Biological EngineeringMassachusetts Institute of TechnologyCambridgeMassachusetts02139
- Department of Electrical Engineering and Computer ScienceMassachusetts Institute of TechnologyCambridgeMassachusetts02139
- Computer Science and Artificial Intelligence LaboratoryMassachusetts Institute of TechnologyCambridgeMassachusetts02139
- Present address:
Center for Bioinformatics and Genomic Systems EngineeringDepartment of Electrical and Computer EngineeringTexas A&M UniversityCollege StationTexas77843
| | | | - Bruce Tidor
- Department of Biological EngineeringMassachusetts Institute of TechnologyCambridgeMassachusetts02139
- Department of Electrical Engineering and Computer ScienceMassachusetts Institute of TechnologyCambridgeMassachusetts02139
- Computer Science and Artificial Intelligence LaboratoryMassachusetts Institute of TechnologyCambridgeMassachusetts02139
| |
Collapse
|
48
|
Zhang W, Zhang H, Zhang T, Fan H, Hao Q. Protein-complex structure completion using IPCAS (Iterative Protein Crystal structure Automatic Solution). ACTA ACUST UNITED AC 2015; 71:1487-92. [PMID: 26143920 DOI: 10.1107/s1399004715008597] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 05/02/2015] [Indexed: 11/10/2022]
Abstract
Protein complexes are essential components in many cellular processes. In this study, a procedure to determine the protein-complex structure from a partial molecular-replacement (MR) solution is demonstrated using a direct-method-aided dual-space iterative phasing and model-building program suite, IPCAS (Iterative Protein Crystal structure Automatic Solution). The IPCAS iteration procedure involves (i) real-space model building and refinement, (ii) direct-method-aided reciprocal-space phase refinement and (iii) phase improvement through density modification. The procedure has been tested with four protein complexes, including two previously unknown structures. It was possible to use IPCAS to build the whole complex structure from one or less than one subunit once the molecular-replacement method was able to give a partial solution. In the most challenging case, IPCAS was able to extend to the full length starting from less than 30% of the complex structure, while conventional model-building procedures were unsuccessful.
Collapse
Affiliation(s)
- Weizhe Zhang
- Department of Physiology, University of Hong Kong, Hong Kong
| | - Hongmin Zhang
- Department of Physiology, University of Hong Kong, Hong Kong
| | - Tao Zhang
- Institute of Physics, Chinese Academy of Sciences, Beijing 100080, People's Republic of China
| | - Haifu Fan
- Institute of Physics, Chinese Academy of Sciences, Beijing 100080, People's Republic of China
| | - Quan Hao
- Department of Physiology, University of Hong Kong, Hong Kong
| |
Collapse
|
49
|
Zhan YA, Ytreberg FM. The cis conformation of proline leads to weaker binding of a p53 peptide to MDM2 compared to trans. Arch Biochem Biophys 2015; 575:22-9. [PMID: 25840370 PMCID: PMC5444545 DOI: 10.1016/j.abb.2015.03.021] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 03/24/2015] [Accepted: 03/25/2015] [Indexed: 12/11/2022]
Abstract
The cis and trans conformations of the Xaa-Pro (Xaa: any amino acid) peptide bond are thermodynamically stable while other peptide bonds strongly prefer trans. The effect of proline cis-trans isomerization on protein binding has not been thoroughly investigated. In this study, computer simulations were used to calculate the absolute binding affinity for a p53 peptide (residues 17-29) to MDM2 for both cis and trans isomers of the p53 proline in position 27. Results show that the cis isomer of p53(17-29) binds more weakly to MDM2 than the trans isomer, and that this is primarily due to the difference in the free energy cost associated with the loss of conformational entropy of p53(17-29) when it binds to MDM2. The population of cis p53(17-29) was estimated to be 0.8% of the total population in the bound state. The stronger binding of trans p53(17-29) to MDM2 compared to cis may leave a minimal level of p53 available to respond to cellular stress. This study demonstrates that it is feasible to estimate the absolute binding affinity for an intrinsically disordered protein fragment binding to an ordered protein that are in good agreement with experimental results.
Collapse
Affiliation(s)
- Yingqian Ada Zhan
- Institute for Bioinformatics and Evolutionary Studies, University of Idaho, Moscow, ID, United States
| | - F Marty Ytreberg
- Institute for Bioinformatics and Evolutionary Studies, University of Idaho, Moscow, ID, United States; Department of Physics, University of Idaho, Moscow, ID, United States.
| |
Collapse
|
50
|
Abstract
Faced with a protein engineering challenge, a contemporary researcher can choose from myriad design strategies. Library-scale computational protein design (LCPD) is a hybrid method suitable for the engineering of improved protein variants with diverse sequences. This chapter discusses the background and merits of several practical LCPD techniques. First, LCPD methods suitable for delocalized protein design are presented in the context of example design calculations for cellobiohydrolase II. Second, localized design methods are discussed in the context of an example design calculation intended to shift the substrate specificity of a ketol-acid reductoisomerase Rossmann domain from NADPH to NADH.
Collapse
|