1
|
Nandi S, Bhaduri S, Das D, Ghosh P, Mandal M, Mitra P. Deciphering the Lexicon of Protein Targets: A Review on Multifaceted Drug Discovery in the Era of Artificial Intelligence. Mol Pharm 2024; 21:1563-1590. [PMID: 38466810 DOI: 10.1021/acs.molpharmaceut.3c01161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Understanding protein sequence and structure is essential for understanding protein-protein interactions (PPIs), which are essential for many biological processes and diseases. Targeting protein binding hot spots, which regulate signaling and growth, with rational drug design is promising. Rational drug design uses structural data and computational tools to study protein binding sites and protein interfaces to design inhibitors that can change these interactions, thereby potentially leading to therapeutic approaches. Artificial intelligence (AI), such as machine learning (ML) and deep learning (DL), has advanced drug discovery and design by providing computational resources and methods. Quantum chemistry is essential for drug reactivity, toxicology, drug screening, and quantitative structure-activity relationship (QSAR) properties. This review discusses the methodologies and challenges of identifying and characterizing hot spots and binding sites. It also explores the strategies and applications of artificial-intelligence-based rational drug design technologies that target proteins and protein-protein interaction (PPI) binding hot spots. It provides valuable insights for drug design with therapeutic implications. We have also demonstrated the pathological conditions of heat shock protein 27 (HSP27) and matrix metallopoproteinases (MMP2 and MMP9) and designed inhibitors of these proteins using the drug discovery paradigm in a case study on the discovery of drug molecules for cancer treatment. Additionally, the implications of benzothiazole derivatives for anticancer drug design and discovery are deliberated.
Collapse
Affiliation(s)
- Suvendu Nandi
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Soumyadeep Bhaduri
- Centre for Computational and Data Sciences, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Debraj Das
- Centre for Computational and Data Sciences, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Priya Ghosh
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Mahitosh Mandal
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Pralay Mitra
- Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| |
Collapse
|
2
|
Vila JA. Protein folding rate evolution upon mutations. Biophys Rev 2023; 15:661-669. [PMID: 37681091 PMCID: PMC10480377 DOI: 10.1007/s12551-023-01088-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 06/24/2023] [Indexed: 09/09/2023] Open
Abstract
Despite the spectacular success of cutting-edge protein fold prediction methods, many critical questions remain unanswered, including why proteins can reach their native state in a biologically reasonable time. A satisfactory answer to this simple question could shed light on the slowest folding rate of proteins as well as how mutations-amino-acid substitutions and/or post-translational modifications-might affect it. Preliminary results indicate that (i) Anfinsen's dogma validity ensures that proteins reach their native state on a reasonable timescale regardless of their sequence or length, and (ii) it is feasible to determine the evolution of protein folding rates without accounting for epistasis effects or the mutational trajectories between the starting and target sequences. These results have direct implications for evolutionary biology because they lay the groundwork for a better understanding of why, and to what extent, mutations-a crucial element of evolution and a factor influencing it-affect protein evolvability. Furthermore, they may spur significant progress in our efforts to solve crucial structural biology problems, such as how a sequence encodes its folding.
Collapse
Affiliation(s)
- Jorge A. Vila
- IMASL-CONICET, Universidad Nacional de San Luis, Ejército de Los Andes 950, 5700 San Luis, Argentina
| |
Collapse
|
3
|
Luzuriaga-Neira AR, Ritchie AM, Payne BL, Carrillo-Parramon O, Liberles DA, Alvarez-Ponce D. Highly Abundant Proteins Are Highly Thermostable. Genome Biol Evol 2023; 15:evad112. [PMID: 37399326 DOI: 10.1093/gbe/evad112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2023] [Indexed: 07/05/2023] Open
Abstract
Highly abundant proteins tend to evolve slowly (a trend called E-R anticorrelation), and a number of hypotheses have been proposed to explain this phenomenon. The misfolding avoidance hypothesis attributes the E-R anticorrelation to the abundance-dependent toxic effects of protein misfolding. To avoid these toxic effects, protein sequences (particularly those of highly expressed proteins) would be under selection to fold properly. One prediction of the misfolding avoidance hypothesis is that highly abundant proteins should exhibit high thermostability (i.e., a highly negative free energy of folding, ΔG). Thus far, only a handful of analyses have tested for a relationship between protein abundance and thermostability, producing contradictory results. These analyses have been limited by 1) the scarcity of ΔG data, 2) the fact that these data have been obtained by different laboratories and under different experimental conditions, 3) the problems associated with using proteins' melting energy (Tm) as a proxy for ΔG, and 4) the difficulty of controlling for potentially confounding variables. Here, we use computational methods to compare the free energy of folding of pairs of human-mouse orthologous proteins with different expression levels. Even though the effect size is limited, the most highly expressed ortholog is often the one with a more negative ΔG of folding, indicating that highly expressed proteins are often more thermostable.
Collapse
Affiliation(s)
| | - Andrew M Ritchie
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, USA
| | | | | | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, USA
| | | |
Collapse
|
4
|
Kasavajhala K, Simmerling C. Exploring the Transferability of Replica Exchange Structure Reservoirs to Accelerate Generation of Ensembles for Alternate Hamiltonians or Protein Mutations. J Chem Theory Comput 2023; 19:1931-1944. [PMID: 36861842 PMCID: PMC10658647 DOI: 10.1021/acs.jctc.3c00005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/03/2023]
Abstract
Generating precise ensembles is commonly a prerequisite to understand the energetics of biological processes using Molecular Dynamics (MD) simulations. Previously, we have shown how unweighted reservoirs built from high temperature MD simulations can accelerate convergence of Boltzmann-weighted ensembles by at least 10× with the Reservoir Replica Exchange MD (RREMD) method. Therefore, in this work, we explore whether an unweighted structure reservoir generated with one Hamiltonian (solute force field plus solvent model) can be reused to quickly generate accurately weighted ensembles for Hamiltonians other than the one that was used to generate the reservoir. We also extended this methodology to rapidly estimate the effects of mutations on peptide stability by using a reservoir of diverse structures obtained from wild-type simulations. These results suggest that structures generated via fast methods such as coarse-grained models or structures predicted by Rosetta or deep learning approaches could be integrated into a reservoir to accelerate generation of ensembles using more accurate representations.
Collapse
Affiliation(s)
- Koushik Kasavajhala
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Carlos Simmerling
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
5
|
Vila JA. Proteins' Evolution upon Point Mutations. ACS OMEGA 2022; 7:14371-14376. [PMID: 35573218 PMCID: PMC9089682 DOI: 10.1021/acsomega.2c01407] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 04/05/2022] [Indexed: 05/03/2023]
Abstract
As the reader must be already aware, state-of-the-art protein folding prediction methods have reached a smashing success in their goal of accurately determining the three-dimensional structures of proteins. Yet, a solution to simple problems such as the effects of protein point mutations on their (i) native conformation; (ii) marginal stability; (iii) ensemble of high-energy nativelike conformations; and (iv) metamorphism propensity and, hence, their evolvability, remains as an unsolved problem. As a plausible solution to the latter, some properties of the amide hydrogen-deuterium exchange, a highly sensitive probe of the structure, stability, and folding of proteins, are assessed from a new perspective. The preliminary results indicate that the protein marginal stability change upon point mutations provides the necessary and sufficient information to estimate, through a Boltzmann factor, the evolution of the amide hydrogen exchange protection factors and, consequently, that of the ensemble of folded conformations coexisting with the native state. This work contributes to our general understanding of the effects of point mutations on proteins and may spur significant progress in our efforts to develop methods to determine the appearance of new folds and functions accurately.
Collapse
|
6
|
Dultz G, Srikakulam SK, Konetschnik M, Shimakami T, Doncheva NT, Dietz J, Sarrazin C, Biondi RM, Zeuzem S, Tampé R, Kalinina OV, Welsch C. Epistatic interactions promote persistence of NS3-Q80K in HCV infection by compensating for protein folding instability. J Biol Chem 2021; 297:101031. [PMID: 34339738 PMCID: PMC8405986 DOI: 10.1016/j.jbc.2021.101031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 07/27/2021] [Accepted: 07/29/2021] [Indexed: 11/28/2022] Open
Abstract
The Q80K polymorphism in the NS3-4A protease of the hepatitis C virus is associated with treatment failure of direct-acting antiviral agents. This polymorphism is highly prevalent in genotype 1a infections and stably transmitted between hosts. Here, we investigated the underlying molecular mechanisms of evolutionarily conserved coevolving amino acids in NS3-Q80K and revealed potential implications of epistatic interactions in immune escape and variants persistence. Using purified protein, we characterized the impact of epistatic amino acid substitutions on the physicochemical properties and peptide cleavage kinetics of the NS3-Q80K protease. We found that Q80K destabilized the protease protein fold (p < 0.0001). Although NS3-Q80K showed reduced peptide substrate turnover (p < 0.0002), replicative fitness in an H77S.3 cell culture model of infection was not significantly inferior to the WT virus. Epistatic substitutions at residues 91 and 174 in NS3-Q80K stabilized the protein fold (p < 0.0001) and leveraged the WT protease stability. However, changes in protease stability inversely correlated with enzymatic activity. In infectious cell culture, these secondary substitutions were not associated with a gain of replicative fitness in NS3-Q80K variants. Using molecular dynamics, we observed that the total number of residue contacts in NS3-Q80K mutants correlated with protein folding stability. Changes in the number of contacts reflected the compensatory effect on protein folding instability by epistatic substitutions. In summary, epistatic substitutions in NS3-Q80K contribute to viral fitness by mechanisms not directly related to RNA replication. By compensating for protein-folding instability, epistatic interactions likely protect NS3-Q80K variants from immune cell recognition.
Collapse
Affiliation(s)
- Georg Dultz
- Department of Internal Medicine 1, Goethe University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Sanjay K Srikakulam
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research, Saarland University Campus, Saarbrücken, Germany; Graduate School of Computer Science, Saarland University, Saarbrücken, Germany; Interdisciplinary Graduate School of Natural Product Research, Saarland University, Saarbrücken, Germany
| | - Michael Konetschnik
- Department of Internal Medicine 1, Goethe University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Tetsuro Shimakami
- Department of Gastroenterology, Kanazawa University Hospital, Kanazawa, Japan
| | - Nadezhda T Doncheva
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Julia Dietz
- Department of Internal Medicine 1, Goethe University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Christoph Sarrazin
- Department of Internal Medicine 1, Goethe University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Ricardo M Biondi
- Molecular Targeting, Instituto de Investigación en Biomedicina de Buenos Aires (IBioBA) - CONICET - Partner Institute of the Max Planck Society, Buenos Aires, Argentina
| | - Stefan Zeuzem
- Department of Internal Medicine 1, Goethe University Hospital Frankfurt, Frankfurt am Main, Germany; University Center for Infectious Diseases, University Hospital Frankfurt, Frankfurt am Main, Germany
| | - Robert Tampé
- Institute of Biochemistry, Biocenter, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Olga V Kalinina
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research, Saarland University Campus, Saarbrücken, Germany; Medical Faculty, Saarland University, Homburg, Germany; Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| | - Christoph Welsch
- Department of Internal Medicine 1, Goethe University Hospital Frankfurt, Frankfurt am Main, Germany; University Center for Infectious Diseases, University Hospital Frankfurt, Frankfurt am Main, Germany.
| |
Collapse
|
7
|
Blanquart S, Groussin M, Le Roy A, Szöllosi GJ, Girard E, Franzetti B, Gouy M, Madern D. Resurrection of Ancestral Malate Dehydrogenases Reveals the Evolutionary History of Halobacterial Proteins : Deciphering Gene Trajectories and Changes in Biochemical Properties. Mol Biol Evol 2021; 38:3754-3774. [PMID: 33974066 PMCID: PMC8382911 DOI: 10.1093/molbev/msab146] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Extreme halophilic Archaea thrive in high salt, where, through proteomic adaptation, they cope with the strong osmolarity and extreme ionic conditions of their environment. In spite of wide fundamental interest, however, studies providing insights into this adaptation are scarce, because of practical difficulties inherent to the purification and characterization of halophilic enzymes. In this work, we describe the evolutionary history of malate dehydrogenases (MalDH) within Halobacteria (a class of the Euryarchaeota phylum). We resurrected nine ancestors along the inferred halobacterial MalDH phylogeny, including the Last Common Ancestral MalDH of Halobacteria (LCAHa) and compared their biochemical properties with those of five modern halobacterial MalDHs. We monitored the stability of these various MalDHs, their oligomeric states and enzymatic properties, as a function of concentration for different salts in the solvent. We found that a variety of evolutionary processes such as amino acid replacement, gene duplication, loss of MalDH gene and replacement owing to horizontal transfer resulted in significant differences in solubility, stability and catalytic properties between these enzymes in the three Halobacteriales, Haloferacales and Natrialbales orders since the LCAHa MalDH.We also showed how a stability trade-off might favor the emergence of new properties during adaptation to diverse environmental conditions. Altogether, our results suggest a new view of halophilic protein adaptation in Archaea.
Collapse
Affiliation(s)
| | - Mathieu Groussin
- Université Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Évolutive, 43 bd du 11 novembre 1918, Villeurbanne, F-69622, France.,Center for Microbiome Informatics and Therapeutics, Massachusetts Institute of Technology, Cambridge, Massachusetts, 02139, USA
| | - Aline Le Roy
- Univ Grenoble Alpes, CNRS, CEA, IBS, Grenoble, F-38000, France
| | - Gergely J Szöllosi
- Université Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Évolutive, 43 bd du 11 novembre 1918, Villeurbanne, F-69622, France.,MTA-ELTE "Lendulet" Evolutionary Genomics Research Group, Budapest, H-1117, Hungary
| | - Eric Girard
- Univ Grenoble Alpes, CNRS, CEA, IBS, Grenoble, F-38000, France
| | - Bruno Franzetti
- Univ Grenoble Alpes, CNRS, CEA, IBS, Grenoble, F-38000, France
| | - Manolo Gouy
- Université Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Évolutive, 43 bd du 11 novembre 1918, Villeurbanne, F-69622, France
| | | |
Collapse
|
8
|
Affiliation(s)
- Lavi S. Bigman
- Department of Structural BiologyWeizmann Institute of Science Rehovot 76100 Israel
| | - Yaakov Levy
- Department of Structural BiologyWeizmann Institute of Science Rehovot 76100 Israel
| |
Collapse
|
9
|
Bigman LS, Levy Y. Proteins: molecules defined by their trade-offs. Curr Opin Struct Biol 2020; 60:50-56. [DOI: 10.1016/j.sbi.2019.11.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Revised: 10/07/2019] [Accepted: 11/11/2019] [Indexed: 12/30/2022]
|
10
|
Genome-Wide Mutagenesis of Hepatitis C Virus Reveals Ability of Genome To Overcome Detrimental Mutations. J Virol 2020; 94:JVI.01327-19. [PMID: 31723027 DOI: 10.1128/jvi.01327-19] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 10/17/2019] [Indexed: 01/10/2023] Open
Abstract
To gain insight into the impact of mutations on the viability of the hepatitis C virus (HCV) genome, we created a set of full-genome mutant libraries, differing from the parent sequence as well as each other, by using a random mutagenesis approach; the proportion of mutations increased across these libraries with declining template amount or dATP concentration. The replication efficiencies of full-genome mutant libraries ranged between 71 and 329 focus-forming units (FFU) per 105 Huh7.5 cells. Mutant libraries with low proportions of mutations demonstrated low replication capabilities, whereas those with high proportions of mutations had their replication capabilities restored. Hepatoma cells transfected with selected mutant libraries, with low (4 mutations per 10,000 bp copied), moderate (33 mutations), and high (66 mutations) proportions of mutations, and their progeny were subjected to serial passage. Predominant virus variants (mutants) from these mutant libraries (Mutantl, Mutantm, and Mutanth, respectively) were evaluated for changes in growth kinetics and particle-to-FFU unit ratio, virus protein expression, and modulation of host cell protein synthesis. Mutantm and Mutantl variants produced >3.0-log-higher extracellular progeny per ml than the parent, and Mutanth produced progeny at a rate 1.0-log lower. More than 80% of the mutations were in a nonstructural part of the mutant genomes, the majority were nonsynonymous, and a moderate to large proportion were in the conserved regions. Our results suggest that the HCV genome has the ability to overcome lethal/deleterious mutations because of the high reproduction rate but highly selects for random, beneficial mutations.IMPORTANCE Hepatitis C virus (HCV) in vivo displays high genetic heterogeneity, which is partly due to the high reproduction and random substitutions during error-prone genome replication. It is difficult to introduce random substitutions in vitro because of limitations in inducing mutagenesis from the 5' end to the 3' end of the genome. Our study has overcome this limitation. We synthesized full-length genomes with few to several random mutations in the background of an HCV clone that can recapitulate all steps of the life cycle. Our study provides evidence of the capability of the HCV genome to overcome deleterious mutations and remain viable. Mutants that emerged from the libraries had diverse phenotype profiles compared to the parent, and putative adaptive mutations mapped to segments of the conserved nonstructural genome. We demonstrate the potential utility of our system for the study of sequence variation that ensures the survival and adaptation of HCV.
Collapse
|
11
|
Wang F, Diesendruck CE. Effect of disulphide loop length on mechanochemical structural stability of macromolecules. Chem Commun (Camb) 2020; 56:2143-2146. [DOI: 10.1039/c9cc07439b] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Polymer chains folded with a single disulphide loop are shown to present distinct rates of mechanochemical fragmentation.
Collapse
Affiliation(s)
- Feng Wang
- Schulich Faculty of Chemistry and Russell-Berrie Nanotechnology Institute
- Technion – Israel Institute of Technology
- Haifa
- Israel
- School of Chemical Engineering
| | - Charles E. Diesendruck
- Schulich Faculty of Chemistry and Russell-Berrie Nanotechnology Institute
- Technion – Israel Institute of Technology
- Haifa
- Israel
| |
Collapse
|