1
|
Mapping major SARS-CoV-2 drug targets and assessment of druggability using computational fragment screening: Identification of an allosteric small-molecule binding site on the Nsp13 helicase. PLoS One 2021; 16:e0246181. [PMID: 33596235 PMCID: PMC7888625 DOI: 10.1371/journal.pone.0246181] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 01/14/2021] [Indexed: 01/18/2023] Open
Abstract
The 2019 emergence of, SARS-CoV-2 has tragically taken an immense toll on human life and far reaching impacts on society. There is a need to identify effective antivirals with diverse mechanisms of action in order to accelerate preclinical development. This study focused on five of the most established drug target proteins for direct acting small molecule antivirals: Nsp5 Main Protease, Nsp12 RNA-dependent RNA polymerase, Nsp13 Helicase, Nsp16 2'-O methyltransferase and the S2 subunit of the Spike protein. A workflow of solvent mapping and free energy calculations was used to identify and characterize favorable small-molecule binding sites for an aromatic pharmacophore (benzene). After identifying the most favorable sites, calculated ligand efficiencies were compared utilizing computational fragment screening. The most favorable sites overall were located on Nsp12 and Nsp16, whereas the most favorable sites for Nsp13 and S2 Spike had comparatively lower ligand efficiencies relative to Nsp12 and Nsp16. Utilizing fragment screening on numerous possible sites on Nsp13 helicase, we identified a favorable allosteric site on the N-terminal zinc binding domain (ZBD) that may be amenable to virtual or biophysical fragment screening efforts. Recent structural studies of the Nsp12:Nsp13 replication-transcription complex experimentally corroborates ligand binding at this site, which is revealed to be a functional Nsp8:Nsp13 protein-protein interaction site in the complex. Detailed structural analysis of Nsp13 ZBD conformations show the role of induced-fit flexibility in this ligand binding site and identify which conformational states are associated with efficient ligand binding. We hope that this map of over 200 possible small-molecule binding sites for these drug targets may be of use for ongoing discovery, design, and drug repurposing efforts. This information may be used to prioritize screening efforts or aid in the process of deciphering how a screening hit may bind to a specific target protein.
Collapse
|
2
|
Yousafi Q, Kanwal S, Rashid H, Khan MS, Saleem S, Aslam M. In silico structural and functional characterization and phylogenetic study of alkaline phosphatase in bacterium, Rhizobium leguminosarum (Frank 1879). Comput Biol Chem 2019; 83:107142. [PMID: 31698161 DOI: 10.1016/j.compbiolchem.2019.107142] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Revised: 08/28/2019] [Accepted: 10/01/2019] [Indexed: 01/17/2023]
Abstract
Phosphorus is one of the primary macronutrient of plants, which is present in soil. It is essential for normal growth and development of plants. Plants use inorganic form of phosphate but organic form can also be assimilated with the help of soil inhabiting bacteria. Alkaline phosphatase is an enzyme present in Rizobium bacteria. This enzyme is responsible for solubilization and mineralization of organic phosphate and makes it readily available for plants. In the present study, nine different strains of Rhizobium leguminosarum were selected for a detailed computational structural and functional characterization and phylogenetic studies of alkaline phosphatase. Amino acid sequences were retrieved from UniProt and saved in FASTA format for use in analysis. Phylogenetic analysis of these strains was done by using MEGA7. 3D structure prediction was performed by using online server I-Tasser. Galaxy Web and 3D Refine were used for structure refinement. The refined structures were evaluated using two validation servers, QMEAN and SAVES. Protein-protein interaction analysis was done by using STRING. For detailed functional characterization, Cofactor, Coach, RaptorX, PSORT and MEME were used. Overall quality of predicted protein models was above 80%. Refined and validated models were submitted into PMDB. Seven out of nine strains were closely related and other two were distantly related. Protein-Protein interaction showed no significant co-expression among the interaction partners.
Collapse
Affiliation(s)
- Qudsia Yousafi
- COMSATS University Islamabad, Sahiwal campus, Pakistan instead of COMSATS Institute of Information Technology COMSATS, Institute of Information Technology GT Road COMSATS Road, 57000, Sahiwal, Pakistan.
| | - Saba Kanwal
- COMSATS University Islamabad, Sahiwal campus, Pakistan instead of COMSATS Institute of Information Technology COMSATS, Institute of Information Technology GT Road COMSATS Road, 57000, Sahiwal, Pakistan
| | - Hamid Rashid
- COMSATS University Islamabad, Sahiwal campus, Pakistan instead of COMSATS Institute of Information Technology COMSATS, Institute of Information Technology GT Road COMSATS Road, 57000, Sahiwal, Pakistan
| | - Muhammad Saad Khan
- COMSATS University Islamabad, Sahiwal campus, Pakistan instead of COMSATS Institute of Information Technology COMSATS, Institute of Information Technology GT Road COMSATS Road, 57000, Sahiwal, Pakistan
| | - Shahzad Saleem
- COMSATS University Islamabad, Sahiwal campus, Pakistan instead of COMSATS Institute of Information Technology COMSATS, Institute of Information Technology GT Road COMSATS Road, 57000, Sahiwal, Pakistan
| | | |
Collapse
|
3
|
Lu HC, Herrera Braga J, Fraternali F. PinSnps: structural and functional analysis of SNPs in the context of protein interaction networks. ACTA ACUST UNITED AC 2016; 32:2534-6. [PMID: 27153707 PMCID: PMC4978923 DOI: 10.1093/bioinformatics/btw153] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2015] [Accepted: 03/15/2016] [Indexed: 12/24/2022]
Abstract
Summary: We present a practical computational pipeline to readily perform data analyses of protein–protein interaction networks by using genetic and functional information mapped onto protein structures. We provide a 3D representation of the available protein structure and its regions (surface, interface, core and disordered) for the selected genetic variants and/or SNPs, and a prediction of the mutants’ impact on the protein as measured by a range of methods. We have mapped in total 2587 genetic disorder-related SNPs from OMIM, 587 873 cancer-related variants from COSMIC, and 1 484 045 SNPs from dbSNP. All result data can be downloaded by the user together with an R-script to compute the enrichment of SNPs/variants in selected structural regions. Availability and Implementation: PinSnps is available as open-access service at http://fraternalilab.kcl.ac.uk/PinSnps/ Contact:franca.fraternali@kcl.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hui-Chun Lu
- Randall Division of Cell and Molecular Biophysics, King's College London, London SE1 1UL, UK
| | - Julián Herrera Braga
- Randall Division of Cell and Molecular Biophysics, King's College London, London SE1 1UL, UK
| | - Franca Fraternali
- Randall Division of Cell and Molecular Biophysics, King's College London, London SE1 1UL, UK
| |
Collapse
|
4
|
Boelt SG, Houen G, Højrup P. Agarose gel shift assay reveals that calreticulin favors substrates with a quaternary structure in solution. Anal Biochem 2015; 481:33-42. [PMID: 25908558 DOI: 10.1016/j.ab.2015.04.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2015] [Revised: 04/13/2015] [Accepted: 04/13/2015] [Indexed: 10/23/2022]
Abstract
Here we present an agarose gel shift assay that, in contrast to other electrophoresis approaches, is loaded in the center of the gel. This allows proteins to migrate in either direction according to their isoelectric points. Therefore, the presented assay enables a direct visualization, separation, and prefractionation of protein interactions in solution independent of isoelectric point. We demonstrate that this assay is compatible with immunochemical methods and mass spectrometry. The assay was used to investigate interactions with several potential substrates for calreticulin, a chaperone that is involved in different biological aspects through interaction with other proteins. The current analytical assays used to investigate these interactions are mainly spectroscopic aggregation assays or solid phase assays that do not provide a direct visualization of the stable protein complex but rather provide an indirect measure of interactions. Therefore, no interaction studies between calreticulin and substrates in solution have been investigated previously. The results presented here indicate that calreticulin has a preference for substrates with a quaternary structure and primarily β-sheets in their secondary structure. It is also demonstrated that the agarose gel shift assay is useful in the study of other protein interactions and can be used as an alternative method to native polyacrylamide gel electrophoresis.
Collapse
Affiliation(s)
- Sanne Grundvad Boelt
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense, Denmark; Department of Autoimmunology and Biomarkers, Statens Serum Institut, DK-2300 Copenhagen, Denmark
| | - Gunnar Houen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense, Denmark; Department of Autoimmunology and Biomarkers, Statens Serum Institut, DK-2300 Copenhagen, Denmark
| | - Peter Højrup
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense, Denmark.
| |
Collapse
|
5
|
Clancy T, Hovig E. From proteomes to complexomes in the era of systems biology. Proteomics 2014; 14:24-41. [PMID: 24243660 DOI: 10.1002/pmic.201300230] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2013] [Revised: 10/22/2013] [Accepted: 11/06/2013] [Indexed: 01/16/2023]
Abstract
Protein complexes carry out almost the entire signaling and functional processes in the cell. The protein complex complement of a cell, and its network of complex-complex interactions, is referred to here as the complexome. Computational methods to predict protein complexes from proteomics data, resulting in network representations of complexomes, have recently being developed. In addition, key advances have been made toward understanding the network and structural organization of complexomes. We review these bioinformatics advances, and their discovery-potential, as well as the merits of integrating proteomics data with emerging methods in systems biology to study protein complex signaling. It is envisioned that improved integration of proteomics and systems biology, incorporating the dynamics of protein complexes in space and time, may lead to more predictive models of cell signaling networks for effective modulation.
Collapse
Affiliation(s)
- Trevor Clancy
- Department of Tumor Biology, Institute for Cancer Research, The Norwegian Radium Hospital, Oslo University Hospital, Oslo, Norway
| | | |
Collapse
|
6
|
Abstract
The past decade has seen a dramatic expansion in the number and range of techniques available to obtain genome-wide information and to analyze this information so as to infer both the functions of individual molecules and how they interact to modulate the behavior of biological systems. Here, we review these techniques, focusing on the construction of physical protein-protein interaction networks, and highlighting approaches that incorporate protein structure, which is becoming an increasingly important component of systems-level computational techniques. We also discuss how network analyses are being applied to enhance our basic understanding of biological systems and their disregulation, as well as how these networks are being used in drug development.
Collapse
Affiliation(s)
- Donald Petrey
- Center for Computational Biology and Bioinformatics, Department of Systems Biology
| | | |
Collapse
|
7
|
Andreani J, Guerois R. Evolution of protein interactions: From interactomes to interfaces. Arch Biochem Biophys 2014; 554:65-75. [DOI: 10.1016/j.abb.2014.05.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/28/2014] [Accepted: 05/12/2014] [Indexed: 12/16/2022]
|
8
|
Lu HC, Fornili A, Fraternali F. Protein-protein interaction networks studies and importance of 3D structure knowledge. Expert Rev Proteomics 2014; 10:511-20. [PMID: 24206225 DOI: 10.1586/14789450.2013.856764] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Protein-protein interaction networks (PPINs) are a powerful tool to study biological processes in living cells. In this review, we present the progress of PPIN studies from abstract to more detailed representations. We will focus on 3D interactome networks, which offer detailed information at the atomic level. This information can be exploited in understanding not only the underlying cellular mechanisms, but also how human variants and disease-causing mutations affect protein functions and complexes' stability. Recent studies have used structural information on PPINs to also understand the molecular mechanisms of binding partner selection. We will address the challenges in generating 3D PPINs due to the restricted number of solved protein structures. Finally, some of the current use of 3D PPINs will be discussed, highlighting their contribution to the studies in genotype-phenotype relationships and in the optimization of targeted studies to design novel chemical compounds for medical treatments.
Collapse
Affiliation(s)
- Hui-Chun Lu
- Randall Division of Cell and Molecular Biophysics, King's College London, New Hunt's House, London SE1 1UL, UK
| | | | | |
Collapse
|
9
|
Kelchtermans P, Bittremieux W, De Grave K, Degroeve S, Ramon J, Laukens K, Valkenborg D, Barsnes H, Martens L. Machine learning applications in proteomics research: how the past can boost the future. Proteomics 2014; 14:353-66. [PMID: 24323524 DOI: 10.1002/pmic.201300289] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Revised: 09/24/2013] [Accepted: 10/14/2013] [Indexed: 01/22/2023]
Abstract
Machine learning is a subdiscipline within artificial intelligence that focuses on algorithms that allow computers to learn solving a (complex) problem from existing data. This ability can be used to generate a solution to a particularly intractable problem, given that enough data are available to train and subsequently evaluate an algorithm on. Since MS-based proteomics has no shortage of complex problems, and since publicly available data are becoming available in ever growing amounts, machine learning is fast becoming a very popular tool in the field. We here therefore present an overview of the different applications of machine learning in proteomics that together cover nearly the entire wet- and dry-lab workflow, and that address key bottlenecks in experiment planning and design, as well as in data processing and analysis.
Collapse
Affiliation(s)
- Pieter Kelchtermans
- Department of Medical Protein Research, VIB, Ghent, Belgium; Faculty of Medicine and Health Sciences, Department of Biochemistry, Ghent University, Ghent, Belgium; Flemish Institute for Technological Research (VITO), Boeretang, Mol, Belgium
| | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Prieto G, Fullaondo A, Rodriguez JA. Prediction of nuclear export signals using weighted regular expressions (Wregex). ACTA ACUST UNITED AC 2014; 30:1220-7. [PMID: 24413524 DOI: 10.1093/bioinformatics/btu016] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
MOTIVATION Leucine-rich nuclear export signals (NESs) are short amino acid motifs that mediate binding of cargo proteins to the nuclear export receptor CRM1, and thus contribute to regulate the localization and function of many cellular proteins. Computational prediction of NES motifs is of great interest, but remains a significant challenge. RESULTS We have developed a novel approach for amino acid motif searching that can be used for NES prediction. This approach, termed Wregex (weighted regular expression), combines regular expressions with a position-specific scoring matrix (PSSM), and has been implemented in a web-based, freely available, software tool. By making use of a PSSM, Wregex provides a score to prioritize candidates for experimental testing. Key features of Wregex include its flexibility, which makes it useful for searching other types of protein motifs, and its fast execution time, which makes it suitable for large-scale analysis. In comparative tests with previously available prediction tools, Wregex is shown to offer a good rate of true-positive motifs, while keeping a smaller number of potential candidates.
Collapse
Affiliation(s)
- Gorka Prieto
- Department of Communications Engineering, University of the Basque Country (UPV/EHU), Alda. Urquijo s/n Bilbao, 48013 and Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country (UPV/EHU), Barrio Sarriena s/n Leioa, 48940, Spain
| | | | | |
Collapse
|
11
|
Interface-resolved network of protein-protein interactions. PLoS Comput Biol 2013; 9:e1003065. [PMID: 23696724 PMCID: PMC3656101 DOI: 10.1371/journal.pcbi.1003065] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2012] [Accepted: 04/08/2013] [Indexed: 12/01/2022] Open
Abstract
We define an interface-interaction network (IIN) to capture the specificity and competition between protein-protein interactions (PPI). This new type of network represents interactions between individual interfaces used in functional protein binding and thereby contains the detail necessary to describe the competition and cooperation between any pair of binding partners. Here we establish a general framework for the construction of IINs that merges computational structure-based interface assignment with careful curation of available literature. To complement limited structural data, the inclusion of biochemical data is critical for achieving the accuracy and completeness necessary to analyze the specificity and competition between the protein interactions. Firstly, this procedure provides a means to clarify the information content of existing data on purported protein interactions and to remove indirect and spurious interactions. Secondly, the IIN we have constructed here for proteins involved in clathrin-mediated endocytosis (CME) exhibits distinctive topological properties. In contrast to PPI networks with their global and relatively dense connectivity, the fragmentation of the IIN into distinctive network modules suggests that different functional pressures act on the evolution of its topology. Large modules in the IIN are formed by interfaces sharing specificity for certain domain types, such as SH3 domains distributed across different proteins. The shared and distinct specificity of an interface is necessary for effective negative and positive design of highly selective binding targets. Lastly, the organization of detailed structural data in a network format allows one to identify pathways of specific binding interactions and thereby predict effects of mutations at specific surfaces on a protein and of specific binding inhibitors, as we explore in several examples. Overall, the endocytosis IIN is remarkably complex and rich in features masked in the coarser PPI, and collects relevant detail of protein association in a readily interpretable format. Much of the work inside the cell is carried out by proteins interacting with other proteins. Each edge in a protein-protein interaction network reflects these functional interactions and each node a separate protein, creating a complex structure that nevertheless follows well-established global and local patterns related to robust protein function. However, this network is not detailed enough to assess whether a particular protein can bind multiple interaction partners simultaneously through distinct interfaces, or whether the partners targeting a specific interface share similar structural or chemical properties. By breaking each protein node into its constituent interface nodes, we generate and assess such a detailed new network. To sample protein binding interactions broadly and accurately beyond those seen in crystal structures, our method combines computational interface assignment with data from biochemical studies. Using this approach we are able to assign interfaces to the majority of known interactions between proteins involved in the clathrin-mediated endocytosis pathway in yeast. Analysis of this interface-interaction network provides novel insights into the functional specificity of protein interactions, and highlights elements of cooperativity and competition among the proteins. By identifying diverse multi-protein complexes, interface-interaction networks also provide a map for targeted drug development.
Collapse
|
12
|
Dunham WH, Mullin M, Gingras AC. Affinity-purification coupled to mass spectrometry: basic principles and strategies. Proteomics 2012; 12:1576-90. [PMID: 22611051 DOI: 10.1002/pmic.201100523] [Citation(s) in RCA: 230] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Identifying the interactions established by a protein of interest can be a critical step in understanding its function. This is especially true when an unknown protein of interest is demonstrated to physically interact with proteins of known function. While many techniques have been developed to characterize protein-protein interactions, one strategy that has gained considerable momentum over the past decade for identification and quantification of protein-protein interactions, is affinity-purification followed by mass spectrometry (AP-MS). Here, we briefly review the basic principles used in affinity-purification coupled to mass spectrometry, with an emphasis on tools (both biochemical and computational), which enable the discovery and reporting of high quality protein-protein interactions.
Collapse
Affiliation(s)
- Wade H Dunham
- Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, ON, Canada
| | | | | |
Collapse
|