1
|
Yin G, Lv G, Zhang J, Jiang H, Lai T, Yang Y, Ren Y, Wang J, Yi C, Chen H, Huang Y, Xiao C. Early-stage structure-based drug discovery for small GTPases by NMR spectroscopy. Pharmacol Ther 2022; 236:108110. [PMID: 35007659 DOI: 10.1016/j.pharmthera.2022.108110] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 01/04/2022] [Accepted: 01/05/2022] [Indexed: 12/13/2022]
Abstract
Small GTPase or Ras superfamily, including Ras, Rho, Rab, Ran and Arf, are fundamental in regulating a wide range of cellular processes such as growth, differentiation, migration and apoptosis. They share structural and functional similarities for binding guanine nucleotides and hydrolyzing GTP. Dysregulations of Ras proteins are involved in the pathophysiology of multiple human diseases, however there is still a stringent need for effective treatments targeting these proteins. For decades, small GTPases were recognized as 'undruggable' targets due to their complex regulatory mechanisms and lack of deep pockets for ligand binding. NMR has been critical in deciphering the structural and dynamic properties of the switch regions that are underpinning molecular switch functions of small GTPases, which pave the way for developing new effective inhibitors. The recent progress of drug or lead molecule development made for small GTPases profoundly delineated how modern NMR techniques reshape the field of drug discovery. In this review, we will summarize the progress of structural and dynamic studies of small GTPases, the NMR techniques developed for structure-based drug screening and their applications in early-stage drug discovery for small GTPases.
Collapse
Affiliation(s)
- Guowei Yin
- The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen 518107, China.
| | - Guohua Lv
- Division of Histology & Embryology, Medical College, Jinan University, Guangzhou 511486, Guangdong, China
| | - Jerry Zhang
- University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC 27516, USA
| | - Hongmei Jiang
- The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen 518107, China
| | - Tianqi Lai
- Division of Histology & Embryology, Medical College, Jinan University, Guangzhou 511486, Guangdong, China
| | - Yushan Yang
- The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen 518107, China
| | - Yong Ren
- The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen 518107, China
| | - Jing Wang
- College of Life Sciences, Northwest University, Xi'an 710069, Shaanxi, China
| | - Chenju Yi
- The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen 518107, China
| | - Hao Chen
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi Province 710049, PR China; Research Institute of Xi'an Jiaotong University, Zhejiang, Hangzhou, Zhejiang Province 311215, PR China
| | - Yun Huang
- Howard Hughes Medical Institute, Chevy Chase 20815, MD, USA; Department of Physiology & Biophysics, Weill Cornell Medicine, New York 10065, NY, USA.
| | - Chaoni Xiao
- College of Life Sciences, Northwest University, Xi'an 710069, Shaanxi, China.
| |
Collapse
|
2
|
Li Q, Kang C. A Practical Perspective on the Roles of Solution NMR Spectroscopy in Drug Discovery. Molecules 2020; 25:molecules25132974. [PMID: 32605297 PMCID: PMC7411973 DOI: 10.3390/molecules25132974] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Revised: 06/21/2020] [Accepted: 06/26/2020] [Indexed: 11/26/2022] Open
Abstract
Solution nuclear magnetic resonance (NMR) spectroscopy is a powerful tool to study structures and dynamics of biomolecules under physiological conditions. As there are numerous NMR-derived methods applicable to probe protein–ligand interactions, NMR has been widely utilized in drug discovery, especially in such steps as hit identification and lead optimization. NMR is frequently used to locate ligand-binding sites on a target protein and to determine ligand binding modes. NMR spectroscopy is also a unique tool in fragment-based drug design (FBDD), as it is able to investigate target-ligand interactions with diverse binding affinities. NMR spectroscopy is able to identify fragments that bind weakly to a target, making it valuable for identifying hits targeting undruggable sites. In this review, we summarize the roles of solution NMR spectroscopy in drug discovery. We describe some methods that are used in identifying fragments, understanding the mechanism of action for a ligand, and monitoring the conformational changes of a target induced by ligand binding. A number of studies have proven that 19F-NMR is very powerful in screening fragments and detecting protein conformational changes. In-cell NMR will also play important roles in drug discovery by elucidating protein-ligand interactions in living cells.
Collapse
Affiliation(s)
- Qingxin Li
- Guangdong Provincial Engineering Laboratory of Biomass High Value Utilization, Guangdong Provincial Bioengineering Institute (Guangzhou Sugarcane Industry Research Institute), Guangzhou 510316, China
- Correspondence: (Q.L.); (C.K.); Tel.: +86-020-84168436 (Q.L.); +65-64070602 (C.K.)
| | - CongBao Kang
- Experimental Drug Development Centre (EDDC), Agency for Science, Technology and Research (A*STAR), 10 Biopolis Road, Chromos, #05-01, Singapore 138670, Singapore
- Correspondence: (Q.L.); (C.K.); Tel.: +86-020-84168436 (Q.L.); +65-64070602 (C.K.)
| |
Collapse
|
3
|
Abstract
Drug discovery is an extremely difficult and challenging endeavor with a very high failure rate. The task of identifying a drug that is safe, selective, and effective is a daunting proposition because disease biology is complex and highly variable across patients. Metabolomics enables the discovery of disease biomarkers, which provides insights into the molecular and metabolic basis of disease and may be used to assess treatment prognosis and outcome. In this regard, metabolomics has evolved to become an important component of the drug discovery process to resolve efficacy and toxicity issues and as a tool for precision medicine. A detailed description of an experimental protocol is presented that outlines the application of NMR metabolomics to the drug discovery pipeline. This includes (1) target identification by understanding the metabolic dysregulation in diseases, (2) predicting the mechanism of action of newly discovered or existing drug therapies, (3) and using metabolomics to screen a chemical lead to assess biological activity. Unlike other OMICS approaches, the metabolome is "fragile" and may be negatively impacted by improper sample collection, storage, and extraction procedures. Similarly, biologically irrelevant conclusions may result from incorrect data collection, preprocessing or processing procedures, or the erroneous use of univariate and multivariate statistical methods. These critical concerns are also addressed in the protocol.
Collapse
|
4
|
Suravajhala P, Kogelman LJA, Kadarmideen HN. Multi-omic data integration and analysis using systems genomics approaches: methods and applications in animal production, health and welfare. Genet Sel Evol 2016; 48:38. [PMID: 27130220 PMCID: PMC4850674 DOI: 10.1186/s12711-016-0217-x] [Citation(s) in RCA: 100] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2015] [Accepted: 04/16/2016] [Indexed: 02/06/2023] Open
Abstract
In the past years, there has been a remarkable development of high-throughput omics (HTO) technologies such as genomics, epigenomics, transcriptomics, proteomics and metabolomics across all facets of biology. This has spearheaded the progress of the systems biology era, including applications on animal production and health traits. However, notwithstanding these new HTO technologies, there remains an emerging challenge in data analysis. On the one hand, different HTO technologies judged on their own merit are appropriate for the identification of disease-causing genes, biomarkers for prevention and drug targets for the treatment of diseases and for individualized genomic predictions of performance or disease risks. On the other hand, integration of multi-omic data and joint modelling and analyses are very powerful and accurate to understand the systems biology of healthy and sustainable production of animals. We present an overview of current and emerging HTO technologies each with a focus on their applications in animal and veterinary sciences before introducing an integrative systems genomics framework for analysing and integrating multi-omic data towards improved animal production, health and welfare. We conclude that there are big challenges in multi-omic data integration, modelling and systems-level analyses, particularly with the fast emerging HTO technologies. We highlight existing and emerging systems genomics approaches and discuss how they contribute to our understanding of the biology of complex traits or diseases and holistic improvement of production performance, disease resistance and welfare.
Collapse
Affiliation(s)
- Prashanth Suravajhala
- Department of Large Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Grønnegårdsvej 7, 1870, Frederiksberg C, Denmark
| | - Lisette J A Kogelman
- Department of Large Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Grønnegårdsvej 7, 1870, Frederiksberg C, Denmark
| | - Haja N Kadarmideen
- Department of Large Animal Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Grønnegårdsvej 7, 1870, Frederiksberg C, Denmark.
| |
Collapse
|
5
|
Stark JL, Eghbalnia HR, Lee W, Westler WM, Markley JL. NMRmix: A Tool for the Optimization of Compound Mixtures in 1D (1)H NMR Ligand Affinity Screens. J Proteome Res 2016; 15:1360-8. [PMID: 26965640 PMCID: PMC4820789 DOI: 10.1021/acs.jproteome.6b00121] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
![]()
NMR ligand affinity screening is
a powerful technique that is routinely
used in drug discovery or functional genomics to directly detect protein–ligand
binding events. Binding events can be identified by monitoring differences
in the 1D 1H NMR spectrum of a compound with and without
protein. Although a single NMR spectrum can be collected within a
short period (2—10 min per sample), one-by-one screening of
a protein against a library of hundreds or thousands of compounds
requires a large amount of spectrometer time and a large quantity
of protein. Therefore, compounds are usually evaluated in mixtures
ranging in size from 3 to 20 compounds to improve the efficiency of
these screens in both time and material. Ideally, the NMR signals
from individual compounds in the mixture should not overlap so that
spectral changes can be associated with a particular compound. We
have developed a software tool, NMRmix, to assist in creating ideal
mixtures from a large panel of compounds with known chemical shifts.
Input to NMRmix consists of an 1H NMR peak list for each
compound, a user-defined overlap threshold, and additional user-defined
parameters if default settings are not used. NMRmix utilizes a simulated
annealing algorithm to optimize the composition of the mixtures to
minimize spectral peak overlaps so that each compound in the mixture
is represented by a maximum number of nonoverlapping chemical shifts.
A built-in graphical user interface simplifies data import and visual
evaluation of the results.
Collapse
Affiliation(s)
- Jaime L Stark
- National Magnetic Resonance Facility at Madison, University of Wisconsin , Madison, Wisconsin 53706, United States
| | - Hamid R Eghbalnia
- National Magnetic Resonance Facility at Madison, University of Wisconsin , Madison, Wisconsin 53706, United States
| | - Woonghee Lee
- National Magnetic Resonance Facility at Madison, University of Wisconsin , Madison, Wisconsin 53706, United States
| | - William M Westler
- National Magnetic Resonance Facility at Madison, University of Wisconsin , Madison, Wisconsin 53706, United States
| | - John L Markley
- National Magnetic Resonance Facility at Madison, University of Wisconsin , Madison, Wisconsin 53706, United States
| |
Collapse
|
6
|
Worley B, Sisco NJ, Powers R. Statistical removal of background signals from high-throughput (1)H NMR line-broadening ligand-affinity screens. JOURNAL OF BIOMOLECULAR NMR 2015; 63:53-58. [PMID: 26156049 PMCID: PMC4577452 DOI: 10.1007/s10858-015-9962-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Accepted: 06/30/2015] [Indexed: 06/04/2023]
Abstract
NMR ligand-affinity screens are vital to drug discovery, are routinely used to screen fragment-based libraries, and used to verify chemical leads from high-throughput assays and virtual screens. NMR ligand-affinity screens are also a highly informative first step towards identifying functional epitopes of unknown proteins, as well as elucidating the biochemical functions of protein-ligand interaction at their binding interfaces. While simple one-dimensional (1)H NMR experiments are capable of indicating binding through a change in ligand line shape, they are plagued by broad, ill-defined background signals from protein (1)H resonances. We present an uncomplicated method for subtraction of protein background in high-throughput ligand-based affinity screens, and show that its performance is maximized when phase-scatter correction is applied prior to subtraction.
Collapse
Affiliation(s)
- Bradley Worley
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, NE, 68588-0304, USA
| | - Nicholas J Sisco
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, NE, 68588-0304, USA
- Department of Chemistry and Biochemistry, Arizona State University, Tempe, AZ, 85287, USA
- Magnetic Resonance Research Center, Arizona State University, Tempe, AZ, 85287, USA
| | - Robert Powers
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, NE, 68588-0304, USA.
| |
Collapse
|
7
|
Copeland JC, Zehr LJ, Cerny RL, Powers R. The applicability of molecular descriptors for designing an electrospray ionization mass spectrometry compatible library for drug discovery. Comb Chem High Throughput Screen 2014; 15:806-15. [PMID: 22708878 DOI: 10.2174/138620712803901180] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2012] [Revised: 05/25/2012] [Accepted: 06/08/2012] [Indexed: 11/22/2022]
Abstract
Detecting a small molecular-weight compound by electrospray ionization mass spectrometry (ESI-MS) requires the compound to obtain a charge. Factors such as gas-phase proton affinities and analyte surface activity are correlated with a positive ESI-MS response, but unfortunately it is extremely challenging to predict from a chemical structure alone if a compound is likely to yield an observable molecular-ion peak in an ESI-MS spectrum. Thus, the design of a chemical library for an ESI-MS ligand-affinity screen is particularly daunting. Only 56.9% of the compounds from our FAST-NMR functional library [1] were detectable by ESI-MS. An analysis of ~1,600 molecular descriptors did not identify any correlation with a positive ESI-MS response that cannot be attributed to a skewed population distribution. Unfortunately, our results suggest that molecular descriptors are not a valuable approach for designing a chemical library for an MS-based ligand affinity screen.
Collapse
Affiliation(s)
- Jennifer C Copeland
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, NE 68588-0304, USA
| | | | | | | |
Collapse
|
8
|
Stark JL, Mehla K, Chaika N, Acton TB, Xiao R, Singh PK, Montelione GT, Powers R. Structure and function of human DnaJ homologue subfamily a member 1 (DNAJA1) and its relationship to pancreatic cancer. Biochemistry 2014; 53:1360-72. [PMID: 24512202 PMCID: PMC3985919 DOI: 10.1021/bi401329a] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Pancreatic cancer has a dismal 5 year survival rate of 5.5% that has not been improved over the past 25 years despite an enormous amount of effort. Thus, there is an urgent need to identify truly novel yet druggable protein targets for drug discovery. The human protein DnaJ homologue subfamily A member 1 (DNAJA1) was previously shown to be downregulated 5-fold in pancreatic cancer cells and has been targeted as a biomarker for pancreatic cancer, but little is known about the specific biological function for DNAJA1 or the other members of the DnaJ family encoded in the human genome. Our results suggest the overexpression of DNAJA1 suppresses the stress response capabilities of the oncogenic transcription factor, c-Jun, and results in the diminution of cell survival. DNAJA1 likely activates a DnaK protein by forming a complex that suppresses the JNK pathway, the hyperphosphorylation of c-Jun, and the anti-apoptosis state found in pancreatic cancer cells. A high-quality nuclear magnetic resonance solution structure of the J-domain of DNAJA1 combined with a bioinformatics analysis and a ligand affinity screen identifies a potential DnaK binding site, which is also predicted to overlap with an inhibitory binding site, suggesting DNAJA1 activity is highly regulated.
Collapse
Affiliation(s)
- Jaime L Stark
- Department of Chemistry, University of Nebraska-Lincoln , Lincoln, Nebraska 68588, United States
| | | | | | | | | | | | | | | |
Collapse
|
9
|
Jang R, Gao X, Li M. Towards fully automated structure-based NMR resonance assignment of ¹⁵N-labeled proteins from automatically picked peaks. J Comput Biol 2011; 18:347-63. [PMID: 21385039 DOI: 10.1089/cmb.2010.0251] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In NMR resonance assignment, an indispensable step in NMR protein studies, manually processed peaks from both N-labeled and C-labeled spectra are typically used as inputs. However, the use of homologous structures can allow one to use only N-labeled NMR data and avoid the added expense of using C-labeled data. We propose a novel integer programming framework for structure-based backbone resonance assignment using N-labeled data. The core consists of a pair of integer programming models: one for spin system forming and amino acid typing, and the other for backbone resonance assignment. The goal is to perform the assignment directly from spectra without any manual intervention via automatically picked peaks, which are much noisier than manually picked peaks, so methods must be error-tolerant. In the case of semi-automated/manually processed peak data, we compare our system with the Xiong-Pandurangan-Bailey-Kellogg's contact replacement (CR) method, which is the most error-tolerant method for structure-based resonance assignment. Our system, on average, reduces the error rate of the CR method by five folds on their data set. In addition, by using an iterative algorithm, our system has the added capability of using the NOESY data to correct assignment errors due to errors in predicting the amino acid and secondary structure type of each spin system. On a publicly available data set for human ubiquitin, where the typing accuracy is 83%, we achieve 91% accuracy, compared to the 59% accuracy obtained without correcting for such errors. In the case of automatically picked peaks, using assignment information from yeast ubiquitin, we achieve a fully automatic assignment with 97% accuracy. To our knowledge, this is the first system that can achieve fully automatic structure-based assignment directly from spectra. This has implications in NMR protein mutant studies, where the assignment step is repeated for each mutant.
Collapse
Affiliation(s)
- Richard Jang
- David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, Ontario, Canada
| | | | | |
Collapse
|
10
|
Research Spotlight: Research in bioanalysis and separations at the University of Nebraska – Lincoln. Bioanalysis 2011; 3:1065-76. [DOI: 10.4155/bio.11.64] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The Chemistry Department at the University of Nebraska – Lincoln (UNL) is located in Hamilton Hall on the main campus of UNL in Lincoln, NE, USA. This department houses the primary graduate and research program in chemistry in the state of Nebraska. This program includes the traditional fields of analytical chemistry, biochemistry, inorganic chemistry, organic chemistry and physical chemistry. However, this program also contains a great deal of multidisciplinary research in fields that range from bioanalytical and biophysical chemistry to nanomaterials, energy research, catalysis and computational chemistry. Current research in bioanalytical and biophysical chemistry at UNL includes work with separation methods such as HPLC and CE, as well as with techniques such as MS and LC–MS, NMR spectroscopy, electrochemical biosensors, scanning probe microscopy and laser spectroscopy. This article will discuss several of these areas, with an emphasis being placed on research in bioanalytical separations, binding assays and related fields.
Collapse
|
11
|
Shortridge MD, Bokemper M, Copeland JC, Stark JL, Powers R. Correlation between protein function and ligand binding profiles. J Proteome Res 2011; 10:2538-45. [PMID: 21366353 DOI: 10.1021/pr200015d] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We report that proteins with the same function bind the same set of small molecules from a standardized chemical library. This observation led to a quantifiable and rapidly adaptable method for protein functional analysis using experimentally derived ligand binding profiles. Ligand binding is measured using a high-throughput NMR ligand affinity screen with a structurally diverse chemical library. The method was demonstrated using a set of 19 proteins with a range of functions. A statistically significant similarity in ligand binding profiles was only observed between the two functionally identical albumins and between the five functionally similar amylases. This new approach is independent of sequence, structure, or evolutionary information and, therefore, extends our ability to analyze and functionally annotate novel genes.
Collapse
Affiliation(s)
- Matthew D Shortridge
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, Nebraska 68588-0304, United States
| | | | | | | | | |
Collapse
|
12
|
Stark JL, Mercier KA, Mueller GA, Acton TB, Xiao R, Montelione GT, Powers R. Solution structure and function of YndB, an AHSA1 protein from Bacillus subtilis. Proteins 2011; 78:3328-40. [PMID: 20818668 DOI: 10.1002/prot.22840] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The solution structure of the Bacillus subtilis protein YndB has been solved using NMR to investigate proposed biological functions. The YndB structure exhibits the helix-grip fold, which consists of a β-sheet with two small and one long α-helix, forming a hydrophobic cavity that preferentially binds lipid-like molecules. Sequence and structure comparisons with proteins from eukaryotes, prokaryotes, and archaea suggest that YndB is very similar to the eukaryote protein Aha1, which binds to the middle domain of Hsp90 and induces ATPase activity. On the basis of these similarities, YndB has been classified as a member of the activator of Hsp90 ATPase homolog 1-like protein (AHSA1) family with a function that appears to be related to stress response. An in silico screen of a compound library of ∼ 18,500 lipids was used to identify classes of lipids that preferentially bind YndB. The in silico screen identified, in order of affinity, the chalcone/hydroxychalcone, flavanone, and flavone/flavonol classes of lipids, which was further verified by 2D (1) H-(15) N HSQC NMR titration experiments with trans-chalcone, flavanone, flavone, and flavonol. All of these compounds are typically found in plants as precursors to various flavonoid antibiotics and signaling molecules. The sum of the data suggests an involvement of YndB with the stress response of B. subtilis to chalcone-like flavonoids released by plants due to a pathogen infection. The observed binding of chalcone-like molecules by YndB is likely related to the symbiotic relationship between B. subtilis and plants.
Collapse
Affiliation(s)
- Jaime L Stark
- Department of Chemistry, University of Nebraska Lincoln, Lincoln, Nebraska 68588-0304, USA
| | | | | | | | | | | | | |
Collapse
|
13
|
Searching the protein structure database for ligand-binding site similarities using CPASS v.2. BMC Res Notes 2011; 4:17. [PMID: 21269480 PMCID: PMC3057182 DOI: 10.1186/1756-0500-4-17] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2010] [Accepted: 01/26/2011] [Indexed: 11/17/2022] Open
Abstract
Background A recent analysis of protein sequences deposited in the NCBI RefSeq database indicates that ~8.5 million protein sequences are encoded in prokaryotic and eukaryotic genomes, where ~30% are explicitly annotated as "hypothetical" or "uncharacterized" protein. Our Comparison of Protein Active-Site Structures (CPASS v.2) database and software compares the sequence and structural characteristics of experimentally determined ligand binding sites to infer a functional relationship in the absence of global sequence or structure similarity. CPASS is an important component of our Functional Annotation Screening Technology by NMR (FAST-NMR) protocol and has been successfully applied to aid the annotation of a number of proteins of unknown function. Findings We report a major upgrade to our CPASS software and database that significantly improves its broad utility. CPASS v.2 is designed with a layered architecture to increase flexibility and portability that also enables job distribution over the Open Science Grid (OSG) to increase speed. Similarly, the CPASS interface was enhanced to provide more user flexibility in submitting a CPASS query. CPASS v.2 now allows for both automatic and manual definition of ligand-binding sites and permits pair-wise, one versus all, one versus list, or list versus list comparisons. Solvent accessible surface area, ligand root-mean square difference, and Cβ distances have been incorporated into the CPASS similarity function to improve the quality of the results. The CPASS database has also been updated. Conclusions CPASS v.2 is more than an order of magnitude faster than the original implementation, and allows for multiple simultaneous job submissions. Similarly, the CPASS database of ligand-defined binding sites has increased in size by ~ 38%, dramatically increasing the likelihood of a positive search result. The modification to the CPASS similarity function is effective in reducing CPASS similarity scores for false positives by ~30%, while leaving true positives unaffected. Importantly, receiver operating characteristics (ROC) curves demonstrate the high correlation between CPASS similarity scores and an accurate functional assignment. As indicated by distribution curves, scores ≥ 30% infer a functional similarity. Software URL: http://cpass.unl.edu.
Collapse
|
14
|
Stark JL, Powers R. Application of NMR and molecular docking in structure-based drug discovery. Top Curr Chem (Cham) 2011; 326:1-34. [PMID: 21915777 DOI: 10.1007/128_2011_213] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Drug discovery is a complex and costly endeavor, where few drugs that reach the clinical testing phase make it to market. High-throughput screening (HTS) is the primary method used by the pharmaceutical industry to identify initial lead compounds. Unfortunately, HTS has a high failure rate and is not particularly efficient at identifying viable drug leads. These shortcomings have encouraged the development of alternative methods to drive the drug discovery process. Specifically, nuclear magnetic resonance (NMR) spectroscopy and molecular docking are routinely being employed as important components of drug discovery research. Molecular docking provides an extremely rapid way to evaluate likely binders from a large chemical library with minimal cost. NMR ligand-affinity screens can directly detect a protein-ligand interaction, can measure a corresponding dissociation constant, and can reliably identify the ligand binding site and generate a co-structure. Furthermore, NMR ligand affinity screens and molecular docking are perfectly complementary techniques, where the combination of the two has the potential to improve the efficiency and success rate of drug discovery. This review will highlight the use of NMR ligand affinity screens and molecular docking in drug discovery and describe recent examples where the two techniques were combined to identify new and effective therapeutic drugs.
Collapse
Affiliation(s)
- Jaime L Stark
- Department of Chemistry, University of Nebraska, Lincoln, NE 68588-0304, USA
| | | |
Collapse
|
15
|
Triplet T, Shortridge MD, Griep MA, Stark JL, Powers R, Revesz P. PROFESS: a PROtein function, evolution, structure and sequence database. Database (Oxford) 2010; 2010:baq011. [PMID: 20624718 PMCID: PMC2911846 DOI: 10.1093/database/baq011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2009] [Revised: 06/03/2010] [Accepted: 06/06/2010] [Indexed: 11/13/2022]
Abstract
The proliferation of biological databases and the easy access enabled by the Internet is having a beneficial impact on biological sciences and transforming the way research is conducted. There are approximately 1100 molecular biology databases dispersed throughout the Internet. To assist in the functional, structural and evolutionary analysis of the abundant number of novel proteins continually identified from whole-genome sequencing, we introduce the PROFESS (PROtein Function, Evolution, Structure and Sequence) database. Our database is designed to be versatile and expandable and will not confine analysis to a pre-existing set of data relationships. A fundamental component of this approach is the development of an intuitive query system that incorporates a variety of similarity functions capable of generating data relationships not conceived during the creation of the database. The utility of PROFESS is demonstrated by the analysis of the structural drift of homologous proteins and the identification of potential pancreatic cancer therapeutic targets based on the observation of protein-protein interaction networks. Database URL: http://cse.unl.edu/~profess/
Collapse
Affiliation(s)
- Thomas Triplet
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588-0115 and Department of Chemistry, University of Nebraska-Lincoln, Lincoln NE 68588-0304, USA
| | - Matthew D. Shortridge
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588-0115 and Department of Chemistry, University of Nebraska-Lincoln, Lincoln NE 68588-0304, USA
| | - Mark A. Griep
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588-0115 and Department of Chemistry, University of Nebraska-Lincoln, Lincoln NE 68588-0304, USA
| | - Jaime L. Stark
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588-0115 and Department of Chemistry, University of Nebraska-Lincoln, Lincoln NE 68588-0304, USA
| | - Robert Powers
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588-0115 and Department of Chemistry, University of Nebraska-Lincoln, Lincoln NE 68588-0304, USA
| | - Peter Revesz
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588-0115 and Department of Chemistry, University of Nebraska-Lincoln, Lincoln NE 68588-0304, USA
| |
Collapse
|
16
|
Tang Z, Wu H, Cort JR, Buchko GW, Zhang Y, Shao Y, Aksay IA, Liu J, Lin Y. Constraint of DNA on functionalized graphene improves its biostability and specificity. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2010; 6:1205-9. [PMID: 20461727 DOI: 10.1002/smll.201000024] [Citation(s) in RCA: 235] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Affiliation(s)
- Zhiwen Tang
- Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Shortridge MD, Powers R. Structural and functional similarity between the bacterial type III secretion system needle protein PrgI and the eukaryotic apoptosis Bcl-2 proteins. PLoS One 2009; 4:e7442. [PMID: 19823588 PMCID: PMC2757720 DOI: 10.1371/journal.pone.0007442] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2009] [Accepted: 09/15/2009] [Indexed: 11/25/2022] Open
Abstract
Background Functional similarity is challenging to identify when global sequence and structure similarity is low. Active-sites or functionally relevant regions are evolutionarily more stable relative to the remainder of a protein structure and provide an alternative means to identify potential functional similarity between proteins. We recently developed the FAST-NMR methodology to discover biochemical functions or functional hypotheses of proteins of unknown function by experimentally identifying ligand binding sites. FAST-NMR utilizes our CPASS software and database to assign a function based on a similarity in the structure and sequence of ligand binding sites between proteins of known and unknown function. Methodology/Principal Findings The PrgI protein from Salmonella typhimurium forms the needle complex in the type III secretion system (T3SS). A FAST-NMR screen identified a similarity between the ligand binding sites of PrgI and the Bcl-2 apoptosis protein Bcl-xL. These ligand binding sites correlate with known protein-protein binding interfaces required for oligomerization. Both proteins form membrane pores through this oligomerization to release effector proteins to stimulate cell death. Structural analysis indicates an overlap between the PrgI structure and the pore forming motif of Bcl-xL. A sequence alignment indicates conservation between the PrgI and Bcl-xL ligand binding sites and pore formation regions. This active-site similarity was then used to verify that chelerythrine, a known Bcl-xL inhibitor, also binds PrgI. Conclusions/Significance A structural and functional relationship between the bacterial T3SS and eukaryotic apoptosis was identified using our FAST-NMR ligand affinity screen in combination with a bioinformatic analysis based on our CPASS program. A similarity between PrgI and Bcl-xL is not readily apparent using traditional global sequence and structure analysis, but was only identified because of conservation in ligand binding sites. These results demonstrate the unique opportunity that ligand-binding sites provide for the identification of functional relationships when global sequence and structural information is limited.
Collapse
Affiliation(s)
- Matthew D. Shortridge
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
| | - Robert Powers
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
- * E-mail:
| |
Collapse
|
18
|
Abstract
BACKGROUND: Drug discovery is a complex and unpredictable endeavor with a high failure rate. Current trends in the pharmaceutical industry have exasperated these challenges and are contributing to the dramatic decline in productivity observed over the last decade. The industrialization of science by forcing the drug discovery process to adhere to assembly-line protocols is imposing unnecessary restrictions, such as short project time-lines. Recent advances in nuclear magnetic resonance are responding to these self-imposed limitations and are providing opportunities to increase the success rate of drug discovery. OBJECTIVE/METHOD: A review of recent advancements in NMR technology that have the potential of significantly impacting and benefiting the drug discovery process will be presented. These include fast NMR data collection protocols and high-throughput protein structure determination, rapid protein-ligand co-structure determination, lead discovery using fragment-based NMR affinity screens, NMR metabolomics to monitor in vivo efficacy and toxicity for lead compounds, and the identification of new therapeutic targets through the functional annotation of proteins by FAST-NMR. CONCLUSION: NMR is a critical component of the drug discovery process, where the versatility of the technique enables it to continually expand and evolve its role. NMR is expected to maintain this growth over the next decade with advancements in automation, speed of structure calculation, in-cell imaging techniques, and the expansion of NMR amenable targets.
Collapse
Affiliation(s)
- Robert Powers
- Department of Chemistry, University of Nebraska Lincoln, Lincoln, NE 68588
| |
Collapse
|
19
|
Mercier KA, Cort JR, Kennedy MA, Lockert EE, Ni S, Shortridge MD, Powers R. Structure and function of Pseudomonas aeruginosa protein PA1324 (21-170). Protein Sci 2009; 18:606-18. [PMID: 19241370 DOI: 10.1002/pro.62] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Pseudomonas aeruginosa is the prototypical biofilm-forming gram-negative opportunistic human pathogen. P. aeruginosa is causatively associated with nosocomial infections and with cystic fibrosis. Antibiotic resistance in some strains adds to the inherent difficulties that result from biofilm formation when treating P. aeruginosa infections. Transcriptional profiling studies suggest widespread changes in the proteome during quorum sensing and biofilm development. Many of the proteins found to be upregulated during these processes are poorly characterized from a functional standpoint. Here, we report the solution NMR structure of PA1324, a protein of unknown function identified in these studies, and provide a putative biological functional assignment based on the observed prealbumin-like fold and FAST-NMR ligand screening studies. PA1324 is postulated to be involved in the binding and transport of sugars or polysaccharides associated with the peptidoglycan matrix during biofilm formation.
Collapse
Affiliation(s)
- Kelly A Mercier
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, Nebraska 68588, USA
| | | | | | | | | | | | | |
Collapse
|
20
|
Sim DW, Lee YS, Kim JH, Seo MD, Lee BJ, Won HS. HP0902 from Helicobacter pylori is a thermostable, dimeric protein belonging to an all-β topology of the cupin superfamily. BMB Rep 2009; 42:387-92. [PMID: 19558799 DOI: 10.5483/bmbrep.2009.42.6.387] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Dae-Won Sim
- Department of Biotechnology, College of Biomedical and Health Science, Konkuk University, Chungju, 380-701, Korea
| | | | | | | | | | | |
Collapse
|
21
|
Targeting protein–protein interactions for therapeutic intervention: a challenge for the future. Future Med Chem 2009; 1:65-93. [DOI: 10.4155/fmc.09.12] [Citation(s) in RCA: 187] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Background: Over the last two decades, an increasing research effort in academia and industry has focused on the modulation (both inhibition and stabilization) of protein–protein interactions (PPIs) in order to develop novel therapeutic approaches and target-selective agents in drug discovery. Discussion: The diversity and complexity of highly dynamic systems such as PPIs present many challenges for the identification of drug-like molecules with the ability to modulate the PPI with the necessary selectivity and potency. In this review, a number of these strategies will be presented along with a critical overview of the challenges and potential solutions relating to the exploitation of PPIs as molecular targets. Conclusions: Both traditional drug discovery approaches and some more recently developed innovative strategies have already provided valuable tools for the discovery of PPI modulators, and a number of successful examples have highlighted the potential of targeting PPIs for therapeutic intervention, especially in the oncology area.
Collapse
|
22
|
Panula-Perälä J, Siurkus J, Vasala A, Wilmanowski R, Casteleijn MG, Neubauer P. Enzyme controlled glucose auto-delivery for high cell density cultivations in microplates and shake flasks. Microb Cell Fact 2008; 7:31. [PMID: 19017379 PMCID: PMC2588551 DOI: 10.1186/1475-2859-7-31] [Citation(s) in RCA: 117] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2008] [Accepted: 11/18/2008] [Indexed: 11/10/2022] Open
Abstract
Background Here we describe a novel cultivation method, called EnBase™, or enzyme-based-substrate-delivery, for the growth of microorganisms in millilitre and sub-millilitre scale which yields 5 to 20 times higher cell densities compared to standard methods. The novel method can be directly applied in microwell plates and shake flasks without any requirements for additional sensors or liquid supply systems. EnBase is therefore readily applicable for many high throughput applications, such as DNA production for genome sequencing, optimisation of protein expression, production of proteins for structural genomics, bioprocess development, and screening of enzyme and metagenomic libraries. Results High cell densities with EnBase are obtained by applying the concept of glucose-limited fed-batch cultivation which is commonly used in industrial processes. The major difference of the novel method is that no external glucose feed is required, but glucose is released into the growth medium by enzymatic degradation of starch. To cope with the high levels of starch necessary for high cell density cultivation, starch is supplied to the growing culture suspension by continuous diffusion from a storage gel. Our results show that the controlled enzyme-based supply of glucose allows a glucose-limited growth to high cell densities of OD600 = 20 to 30 (corresponding to 6 to 9 g l-1 cell dry weight) without the external feed of additional compounds in shake flasks and 96-well plates. The final cell density can be further increased by addition of extra nitrogen during the cultivation. Production of a heterologous triosphosphate isomerase in E. coli BL21(DE3) resulted in 10 times higher volumetric product yield and a higher ratio of soluble to insoluble product when compared to the conventional production method. Conclusion The novel EnBase method is robust and simple-to-apply for high cell density cultivation in shake flasks and microwell plates. The potential of the system is that the microbial growth rate and oxygen consumption can be simply controlled by the amount (and principally also by the activity) of the starch-degrading enzyme. This solves the problems of uncontrolled growth, oxygen limitation, and severe pH drop in shaken cultures. In parallel the method provides the basis for enhanced cell densities. The feasibility of the new method has been shown for 96-well plates and shake flasks and we believe that it can easily be adapted to different microwell and deepwell plate formats and shake flasks. Therefore EnBase will be a helpful tool especially in high throughput applications.
Collapse
Affiliation(s)
- Johanna Panula-Perälä
- Bioprocess Engineering Laboratory, Department of Process and Environmental Engineering and Biocenter Oulu, University of Oulu, P,O, Box 4300, FI-90014 Oulu, Finland.
| | | | | | | | | | | |
Collapse
|