1
|
Satalkar V, Degaga GD, Li W, Pang YT, McShan AC, Gumbart JC, Mitchell JC, Torres MP. Generative β-hairpin design using a residue-based physicochemical property landscape. Biophys J 2024; 123:2790-2806. [PMID: 38297834 PMCID: PMC11393682 DOI: 10.1016/j.bpj.2024.01.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 12/20/2023] [Accepted: 01/25/2024] [Indexed: 02/02/2024] Open
Abstract
De novo peptide design is a new frontier that has broad application potential in the biological and biomedical fields. Most existing models for de novo peptide design are largely based on sequence homology that can be restricted based on evolutionarily derived protein sequences and lack the physicochemical context essential in protein folding. Generative machine learning for de novo peptide design is a promising way to synthesize theoretical data that are based on, but unique from, the observable universe. In this study, we created and tested a custom peptide generative adversarial network intended to design peptide sequences that can fold into the β-hairpin secondary structure. This deep neural network model is designed to establish a preliminary foundation of the generative approach based on physicochemical and conformational properties of 20 canonical amino acids, for example, hydrophobicity and residue volume, using extant structure-specific sequence data from the PDB. The beta generative adversarial network model robustly distinguishes secondary structures of β hairpin from α helix and intrinsically disordered peptides with an accuracy of up to 96% and generates artificial β-hairpin peptide sequences with minimum sequence identities around 31% and 50% when compared against the current NCBI PDB and nonredundant databases, respectively. These results highlight the potential of generative models specifically anchored by physicochemical and conformational property features of amino acids to expand the sequence-to-structure landscape of proteins beyond evolutionary limits.
Collapse
Affiliation(s)
- Vardhan Satalkar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | - Gemechis D Degaga
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee
| | - Wei Li
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | - Yui Tik Pang
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia
| | - Andrew C McShan
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia
| | - James C Gumbart
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia; School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia
| | - Julie C Mitchell
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee.
| | - Matthew P Torres
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia; School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia.
| |
Collapse
|
2
|
Peña-Guerrero J, Fernández-Rubio C, García-Sosa AT, Nguewa PA. BRCT Domains: Structure, Functions, and Implications in Disease-New Therapeutic Targets for Innovative Drug Discovery against Infections. Pharmaceutics 2023; 15:1839. [PMID: 37514027 PMCID: PMC10386641 DOI: 10.3390/pharmaceutics15071839] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 06/12/2023] [Accepted: 06/22/2023] [Indexed: 07/30/2023] Open
Abstract
The search for new therapeutic targets and their implications in drug development remains an emerging scientific topic. BRCT-bearing proteins are found in Archaea, Bacteria, Eukarya, and viruses. They are traditionally involved in DNA repair, recombination, and cell cycle control. To carry out these functions, BRCT domains are able to interact with DNA and proteins. Moreover, such domains are also implicated in several pathogenic processes and malignancies including breast, ovarian, and lung cancer. Although these domains exhibit moderately conserved folding, their sequences show very low conservation. Interestingly, sequence variations among species are considered positive traits in the search for suitable therapeutic targets, since non-specific drug interactions might be reduced. These main characteristics of BRCT, as well as its critical implications in key biological processes in the cell, have prompted the study of these domains as therapeutic targets. This review explores the possible roles of BRCT domains as therapeutic targets for drug discovery. We describe their common structural features and relevant interactions and pathways, as well as their implications in pathologic processes. Drugs commonly used to target these domains are also presented. Finally, based on their structures, we describe new drug design possibilities using modern and innovative techniques.
Collapse
Affiliation(s)
- José Peña-Guerrero
- ISTUN Institute of Tropical Health, Department of Microbiology and Parasitology, University of Navarra, IdiSNA (Navarra Institute for Health Research), E-31008 Pamplona, Navarra, Spain
| | - Celia Fernández-Rubio
- ISTUN Institute of Tropical Health, Department of Microbiology and Parasitology, University of Navarra, IdiSNA (Navarra Institute for Health Research), E-31008 Pamplona, Navarra, Spain
| | - Alfonso T García-Sosa
- Chair of Molecular Technology, Institute of Chemistry, University of Tartu, Ravila 14a, 50411 Tartu, Estonia
| | - Paul A Nguewa
- ISTUN Institute of Tropical Health, Department of Microbiology and Parasitology, University of Navarra, IdiSNA (Navarra Institute for Health Research), E-31008 Pamplona, Navarra, Spain
| |
Collapse
|
3
|
Costa CFS, Barbosa AJM, Dias AMGC, Roque ACA. Native, engineered and de novo designed ligands targeting the SARS-CoV-2 spike protein. Biotechnol Adv 2022; 59:107986. [PMID: 35598822 PMCID: PMC9119173 DOI: 10.1016/j.biotechadv.2022.107986] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 04/29/2022] [Accepted: 05/16/2022] [Indexed: 01/27/2023]
Abstract
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the deadly coronavirus disease 2019 (Covid-19) and is a concerning hazard to public health. This virus infects cells by establishing a contact between its spike protein (S-protein) and host human angiotensin-converting enzyme 2 (hACE2) receptor, subsequently initiating viral fusion. The inhibition of the interaction between the S-protein and hACE2 has immediately drawn attention amongst the scientific community, and the S-protein was considered the prime target to design vaccines and to develop affinity ligands for diagnostics and therapy. Several S-protein binders have been reported at a fast pace, ranging from antibodies isolated from immunised patients to de novo designed ligands, with some binders already yielding promising in vivo results in protecting against SARS-CoV-2. Natural, engineered and designed affinity ligands targeting the S-protein are herein summarised, focusing on molecular recognition aspects, whilst identifying preferred hot spots for ligand binding. This review serves as inspiration for the improvement of already existing ligands or for the design of new affinity ligands towards SARS-CoV-2 proteins. Lessons learnt from the Covid-19 pandemic are also important to consolidate tools and processes in protein engineering to enable the fast discovery, production and delivery of diagnostic, prophylactic, and therapeutic solutions in future pandemics.
Collapse
Affiliation(s)
- Carlos F S Costa
- Associate Laboratory i4HB - Institute for Health and Bioeconomy, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal; UCIBIO - Applied Molecular Biosciences Unit, Department of Chemistry, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal
| | - Arménio J M Barbosa
- Associate Laboratory i4HB - Institute for Health and Bioeconomy, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal; UCIBIO - Applied Molecular Biosciences Unit, Department of Chemistry, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal
| | - Ana Margarida G C Dias
- Associate Laboratory i4HB - Institute for Health and Bioeconomy, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal; UCIBIO - Applied Molecular Biosciences Unit, Department of Chemistry, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal
| | - Ana Cecília A Roque
- Associate Laboratory i4HB - Institute for Health and Bioeconomy, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal; UCIBIO - Applied Molecular Biosciences Unit, Department of Chemistry, School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal.
| |
Collapse
|
4
|
Sun XY, Zhong Y, Li YH, Miller DP, Buttan S, Wu XX, Zhang Y, Tang Q, Tan HW, Zhu J, Liu R, Zurek E, Lu ZL, Gong B. Reliable folding of hybrid tetrapeptides into short β-hairpins. CHINESE CHEM LETT 2022. [DOI: 10.1016/j.cclet.2021.06.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
5
|
Rudnev VR, Kulikova LI, Nikolsky KS, Malsagova KA, Kopylov AT, Kaysheva AL. Current Approaches in Supersecondary Structures Investigation. Int J Mol Sci 2021; 22:11879. [PMID: 34769310 PMCID: PMC8584461 DOI: 10.3390/ijms222111879] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 10/27/2021] [Accepted: 10/29/2021] [Indexed: 11/16/2022] Open
Abstract
Proteins expressed during the cell cycle determine cell function, topology, and responses to environmental influences. The development and improvement of experimental methods in the field of structural biology provide valuable information about the structure and functions of individual proteins. This work is devoted to the study of supersecondary structures of proteins and determination of their structural motifs, description of experimental methods for their detection, databases, and repositories for storage, as well as methods of molecular dynamics research. The interest in the study of supersecondary structures in proteins is due to their autonomous stability outside the protein globule, which makes it possible to study folding processes, conformational changes in protein isoforms, and aberrant proteins with high productivity.
Collapse
Affiliation(s)
- Vladimir R. Rudnev
- Biobanking Group, Branch of Institute of Biomedical Chemistry “Scientific and Education Center”, 109028 Moscow, Russia; (V.R.R.); (L.I.K.); (K.S.N.); (A.T.K.); (A.L.K.)
- Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, 142290 Pushchino, Russia
| | - Liudmila I. Kulikova
- Biobanking Group, Branch of Institute of Biomedical Chemistry “Scientific and Education Center”, 109028 Moscow, Russia; (V.R.R.); (L.I.K.); (K.S.N.); (A.T.K.); (A.L.K.)
- Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, 142290 Pushchino, Russia
- Institute of Mathematical Problems of Biology RAS—The Branch of Keldysh Institute of Applied Mathematics of Russian Academy of Sciences, 142290 Pushchino, Russia
| | - Kirill S. Nikolsky
- Biobanking Group, Branch of Institute of Biomedical Chemistry “Scientific and Education Center”, 109028 Moscow, Russia; (V.R.R.); (L.I.K.); (K.S.N.); (A.T.K.); (A.L.K.)
| | - Kristina A. Malsagova
- Biobanking Group, Branch of Institute of Biomedical Chemistry “Scientific and Education Center”, 109028 Moscow, Russia; (V.R.R.); (L.I.K.); (K.S.N.); (A.T.K.); (A.L.K.)
| | - Arthur T. Kopylov
- Biobanking Group, Branch of Institute of Biomedical Chemistry “Scientific and Education Center”, 109028 Moscow, Russia; (V.R.R.); (L.I.K.); (K.S.N.); (A.T.K.); (A.L.K.)
| | - Anna L. Kaysheva
- Biobanking Group, Branch of Institute of Biomedical Chemistry “Scientific and Education Center”, 109028 Moscow, Russia; (V.R.R.); (L.I.K.); (K.S.N.); (A.T.K.); (A.L.K.)
| |
Collapse
|
6
|
DuPai CD, Davies BW, Wilke CO. A systematic analysis of the beta hairpin motif in the Protein Data Bank. Protein Sci 2021; 30:613-623. [PMID: 33389765 DOI: 10.1002/pro.4020] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 12/29/2020] [Accepted: 12/29/2020] [Indexed: 12/31/2022]
Abstract
The beta hairpin motif is a ubiquitous protein structural motif that can be found in molecules across the tree of life. This motif, which is also popular in synthetically designed proteins and peptides, is known for its stability and adaptability to broad functions. Here, we systematically probe all 49,000 unique beta hairpin substructures contained within the Protein Data Bank (PDB) to uncover key characteristics correlated with stable beta hairpin structure, including amino acid biases and enriched interstrand contacts. We find that position specific amino acid preferences, while seen throughout the beta hairpin structure, are most evident within the turn region, where they depend on subtle turn dynamics associated with turn length and secondary structure. We also establish a set of broad design principles, such as the inclusion of aspartic acid residues at a specific position and the careful consideration of desired secondary structure when selecting residues for the turn region, that can be applied to the generation of libraries encoding proteins or peptides containing beta hairpin structures.
Collapse
Affiliation(s)
- Cory D DuPai
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas, USA.,Department of Integrative Biology, University of Texas at Austin, Austin, Texas, USA
| | - Bryan W Davies
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas, USA.,Center for Systems and Synthetic Biology, John Ring LaMontagne Center for Infectious Diseases, Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas, USA
| | - Claus O Wilke
- Department of Integrative Biology, University of Texas at Austin, Austin, Texas, USA
| |
Collapse
|
7
|
Abstract
The reversible interaction between an affinity ligand and a complementary receptor has been widely explored in purification systems for several biomolecules. The development of tailored affinity ligands highly specific toward particular target biomolecules is one of the options in affinity purification systems. However, both genetic and chemical modifications in proteins and peptides widen the application of affinity ligand-tag receptors pairs toward universal capture and purification strategies. In particular, this chapter will focus on two case studies highly relevant for biotechnology and biomedical areas, namely the affinity tags and receptors employed on the production of recombinant fusion proteins, and the chemical modification of phosphate groups on proteins and peptides and the subsequent specific capture and enrichment, a mandatory step before further proteomic analysis.
Collapse
|
8
|
Wen J, Liao H, Stachowski K, Hempfling JP, Qian Z, Yuan C, Foster MP, Pei D. Rational design of cell-permeable cyclic peptides containing a d-Pro-l-Pro motif. Bioorg Med Chem 2020; 28:115711. [PMID: 33069067 DOI: 10.1016/j.bmc.2020.115711] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 08/12/2020] [Accepted: 08/12/2020] [Indexed: 12/01/2022]
Abstract
Cyclic peptides are capable of binding to challenging targets (e.g., proteins involved in protein-protein interactions) with high affinity and specificity, but generally cannot gain access to intracellular targets because of poor membrane permeability. In this work, we discovered a conformationally constrained cyclic cell-penetrating peptide (CPP) containing a d-Pro-l-Pro motif, cyclo(AFΦrpPRRFQ) (where Φ is l-naphthylalanine, r is d-arginine, and p is d-proline). The structural constraints provided by cyclization and the d-Pro-l-Pro motif permitted the rational design of cell-permeable cyclic peptides of large ring sizes (up to 16 amino acids). This strategy was applied to design a potent, cell-permeable, and biologically active cyclic peptidyl inhibitor, cyclo(YpVNFΦrpPRR) (where Yp is l-phosphotyrosine), against the Grb2 SH2 domain. Multidimensional NMR spectroscopic and circular dichroism analyses revealed that the cyclic CPP as well as the Grb2 SH2 inhibitor assume a predominantly random coil structure but have significant β-hairpin character surrounding the d-Pro-l-Pro motif. These results demonstrate cyclo(AFΦrpPRRFQ) as an effective CPP for endocyclic (insertion of cargo into the CPP ring) or exocyclic delivery of biological cargos (attachment of cargo to the Gln side chain).
Collapse
Affiliation(s)
- Jin Wen
- Department of Chemistry and Biochemistry and Ohio State Biochemistry Program, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA
| | - Hui Liao
- Department of Chemistry and Biochemistry and Ohio State Biochemistry Program, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA
| | - Kye Stachowski
- Department of Chemistry and Biochemistry and Ohio State Biochemistry Program, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA
| | - Jordan P Hempfling
- Department of Chemistry and Biochemistry and Ohio State Biochemistry Program, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA
| | - Ziqing Qian
- Department of Chemistry and Biochemistry and Ohio State Biochemistry Program, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA
| | - Chunhua Yuan
- Campus Chemical Instrument Center, The Ohio State University, 460 West 12(th) Avenue, Columbus, OH 43210, USA
| | - Mark P Foster
- Department of Chemistry and Biochemistry and Ohio State Biochemistry Program, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA.
| | - Dehua Pei
- Department of Chemistry and Biochemistry and Ohio State Biochemistry Program, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA.
| |
Collapse
|