1
|
Xu Q, Zheng L, Huang M, Zhao M. Collagen derived Gly-Pro-type DPP-IV inhibitory peptides: Structure-activity relationship, inhibition kinetics and inhibition mechanism. Food Chem 2024; 441:138370. [PMID: 38199113 DOI: 10.1016/j.foodchem.2024.138370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/20/2023] [Accepted: 01/04/2024] [Indexed: 01/12/2024]
Abstract
Our previous study has demonstrated that both the amino acid at N3 position and peptide length affected the DPP-IV inhibitory activity of Gly-Pro-type peptides. To further elucidate their molecular mechanism, a combined approach of QSAR modeling, enzymatic kinetics and molecular docking was used. Results showed that the QSAR models of Gly-Pro-type tripeptides and Gly-Pro-type peptides containing 3-12 residues were successfully constructed by 5z-scale descriptor with R2 of 0.830 and 0.797, respectively. The lower values of electrophilicity, polarity, and side-chain bulk of amino acid at N3 position caused higher DPP-IV inhibitory activity of Gly-Pro-type peptides. Moreover, an appropriate increase in the length of Gly-Pro-type peptides did not change their competitive inhibition mode, but decreased their inhibition constants (Ki values) and increased interactions with DPP-IV. More importantly, the interactions between the residues at C-terminal of Gly-Pro-type peptides containing 5 ∼ 6 residues with S2 extensive subsites (Ser209, Phe357, Arg358) of DPP-IV increased the interactions of Gly residue at N1 position with the S2 subsites (Glu205, Glu206, Asn710, Arg125, Tyr662) and decreased the acylation level of DPP-IV-peptide complex, and thereby increasing peptides' DPP-IV inhibitory activity.
Collapse
Affiliation(s)
- Qiongyao Xu
- School of Food Science and Engineering, South China University of Technology, Guangzhou 510640, China; Guangdong Food Green Processing and Nutrition Regulation Technologies Research Center, Guangzhou 510650, China
| | - Lin Zheng
- School of Food Science and Engineering, South China University of Technology, Guangzhou 510640, China; Guangdong Food Green Processing and Nutrition Regulation Technologies Research Center, Guangzhou 510650, China.
| | - Mingtao Huang
- School of Food Science and Engineering, South China University of Technology, Guangzhou 510640, China; Chaozhou Branch of Chemistry and Chemical Engineering Guangdong Laboratory, Chaozhou, 521000, China; Guangdong Food Green Processing and Nutrition Regulation Technologies Research Center, Guangzhou 510650, China
| | - Mouming Zhao
- School of Food Science and Engineering, South China University of Technology, Guangzhou 510640, China; Chaozhou Branch of Chemistry and Chemical Engineering Guangdong Laboratory, Chaozhou, 521000, China; Guangdong Food Green Processing and Nutrition Regulation Technologies Research Center, Guangzhou 510650, China.
| |
Collapse
|
2
|
Yadav DK, Kumar S, Teli MK, Kim MH. Ligand-based pharmacophore modeling and docking studies on vitamin D receptor inhibitors. J Cell Biochem 2020; 121:3570-3583. [PMID: 31904142 DOI: 10.1002/jcb.29640] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Accepted: 12/09/2019] [Indexed: 12/21/2022]
Abstract
In recent years, pharmacophore modeling and molecular docking approaches have been extensively used to characterize the structural requirements and explore the conformational space of a ligand in the binding pocket of the selected target protein. Herein, we report a pharmacophore modeling and molecular docking of 45 compounds comprising of the indole scaffold as vitamin D receptor (VDR) inhibitors. Based on the selected best hypothesis (DRRRR.61), an atom-based three-dimensional quantitative structure-activity relationships model was developed to rationalize the structural requirement of biological activity modulating components. The developed model predicted the binding affinity for the training set and test set with R2 (training) = 0.8869 and R2 (test) = 0.8139, respectively. Furthermore, molecular docking and dynamics simulation were performed to understand the underpinning of binding interaction and stability of selected VDR inhibitors in the binding pocket. In conclusion, the results presented here, in the form of functional and structural data, agreed well with the proposed pharmacophores and provide further insights into the development of novel VDR inhibitors with better activity.
Collapse
Affiliation(s)
- Dharmendra Kumar Yadav
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Incheon, Republic of Korea
| | - Surendra Kumar
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Incheon, Republic of Korea
| | - Mahesh Kumar Teli
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Incheon, Republic of Korea
| | - Mi-Hyun Kim
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Incheon, Republic of Korea
| |
Collapse
|
3
|
Fontaine NT, Cadet XF, Vetrivel I. Novel Descriptors and Digital Signal Processing- Based Method for Protein Sequence Activity Relationship Study. Int J Mol Sci 2019; 20:ijms20225640. [PMID: 31718061 PMCID: PMC6888668 DOI: 10.3390/ijms20225640] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2019] [Revised: 11/04/2019] [Accepted: 11/07/2019] [Indexed: 12/18/2022] Open
Abstract
The work aiming to unravel the correlation between protein sequence and function in the absence of structural information can be highly rewarding. We present a new way of considering descriptors from the amino acids index database for modeling and predicting the fitness value of a polypeptide chain. This approach includes the following steps: (i) Calculating Q elementary numerical sequences (Ele_SEQ) depending on the encoding of the amino acid residues, (ii) determining an extended numerical sequence (Ext_SEQ) by concatenating the Q elementary numerical sequences, wherein at least one elementary numerical sequence is a protein spectrum obtained by applying fast Fourier transformation (FFT), and (iii) predicting a value of fitness for polypeptide variants (train and/or validation set). These new descriptors were tested on four sets of proteins of different lengths (GLP-2, TNF alpha, cytochrome P450, and epoxide hydrolase) and activities (cAMP activation, binding affinity, thermostability and enantioselectivity). We show that the use of multiple physicochemical descriptors coupled with the implementation of the FFT, taking into account the interactions between residues of amino acids within the protein sequence, could lead to very significant improvement in the quality of models and predictions. The choice of the descriptor or of the combination of descriptors and/or FFT is dependent on the couple protein/fitness. This approach can provide potential users with value added to existing mutant libraries where screening efforts have so far been unsuccessful in finding improved polypeptide mutants for useful applications.
Collapse
Affiliation(s)
- Nicolas T Fontaine
- PEACCEL, Protein Engineering ACCELerator, 6 Square Albin Cachot, box 42, 75013 Paris, France
| | - Xavier F Cadet
- PEACCEL, Protein Engineering ACCELerator, 6 Square Albin Cachot, box 42, 75013 Paris, France
| | - Iyanar Vetrivel
- PEACCEL, Protein Engineering ACCELerator, 6 Square Albin Cachot, box 42, 75013 Paris, France
| |
Collapse
|
4
|
Gill RT, Halweg-Edwards AL, Clauset A, Way SF. Synthesis aided design: The biological design-build-test engineering paradigm? Biotechnol Bioeng 2015; 113:7-10. [PMID: 26580431 DOI: 10.1002/bit.25857] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 10/14/2015] [Indexed: 11/09/2022]
Affiliation(s)
- Ryan T Gill
- Department of Chemical and Biological Engineering, University of Colorado, Boulder 80309, CO
| | - Andrea L Halweg-Edwards
- Department of Chemical and Biological Engineering, University of Colorado, Boulder 80309, CO
| | - Aaron Clauset
- Department of Computer Science, University of Colorado, Boulder, CO
| | - Sam F Way
- Department of Computer Science, University of Colorado, Boulder, CO
| |
Collapse
|
5
|
Padrón-García JA, Alonso-Tarajano M, Alonso-Becerra E, Winterburn TJ, Ruiz Y, Kay J, Berry C. Quantitative structure activity relationship of IA3-like peptides as aspartic proteinase inhibitors. Proteins 2009; 75:859-69. [DOI: 10.1002/prot.22295] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
6
|
The role of the N-terminal and mid-region residues of substance P in regulating functional selectivity at the tachykinin NK1 receptor. Eur J Pharmacol 2008; 592:1-6. [DOI: 10.1016/j.ejphar.2008.06.097] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2008] [Revised: 06/18/2008] [Accepted: 06/22/2008] [Indexed: 11/17/2022]
|
7
|
Chaparro-Riggers JF, Polizzi KM, Bommarius AS. Better library design: data-driven protein engineering. Biotechnol J 2007; 2:180-91. [PMID: 17183506 DOI: 10.1002/biot.200600170] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Data-driven protein engineering is increasingly used as an alternative to rational design and combinatorial engineering because it uses available knowledge to limit library size, while still allowing for the identification of unpredictable substitutions that lead to large effects. Recent advances in computational modeling and bioinformatics, as well as an increasing databank of experiments on functional variants, have led to new strategies to choose particular amino acid residues to vary in order to increase the chances of obtaining a variant protein with the desired property. Strategies for limiting diversity at each position, design of small sub-libraries, and the performance of scouting experiments, have also been developed or even automated, further reducing the library size.
Collapse
Affiliation(s)
- Javier F Chaparro-Riggers
- School of Chemical and Biomolecular Engineering, Parker H. Petit Institute of Bioengineering and Bioscience, Atlanta, GA, USA
| | | | | |
Collapse
|
8
|
Liao J, Warmuth MK, Govindarajan S, Ness JE, Wang RP, Gustafsson C, Minshull J. Engineering proteinase K using machine learning and synthetic genes. BMC Biotechnol 2007; 7:16. [PMID: 17386103 PMCID: PMC1847811 DOI: 10.1186/1472-6750-7-16] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2006] [Accepted: 03/26/2007] [Indexed: 11/10/2022] Open
Abstract
Background Altering a protein's function by changing its sequence allows natural proteins to be converted into useful molecular tools. Current protein engineering methods are limited by a lack of high throughput physical or computational tests that can accurately predict protein activity under conditions relevant to its final application. Here we describe a new synthetic biology approach to protein engineering that avoids these limitations by combining high throughput gene synthesis with machine learning-based design algorithms. Results We selected 24 amino acid substitutions to make in proteinase K from alignments of homologous sequences. We then designed and synthesized 59 specific proteinase K variants containing different combinations of the selected substitutions. The 59 variants were tested for their ability to hydrolyze a tetrapeptide substrate after the enzyme was first heated to 68°C for 5 minutes. Sequence and activity data was analyzed using machine learning algorithms. This analysis was used to design a new set of variants predicted to have increased activity over the training set, that were then synthesized and tested. By performing two cycles of machine learning analysis and variant design we obtained 20-fold improved proteinase K variants while only testing a total of 95 variant enzymes. Conclusion The number of protein variants that must be tested to obtain significant functional improvements determines the type of tests that can be performed. Protein engineers wishing to modify the property of a protein to shrink tumours or catalyze chemical reactions under industrial conditions have until now been forced to accept high throughput surrogate screens to measure protein properties that they hope will correlate with the functionalities that they intend to modify. By reducing the number of variants that must be tested to fewer than 100, machine learning algorithms make it possible to use more complex and expensive tests so that only protein properties that are directly relevant to the desired application need to be measured. Protein design algorithms that only require the testing of a small number of variants represent a significant step towards a generic, resource-optimized protein engineering process.
Collapse
Affiliation(s)
- Jun Liao
- Department of Computer Science, University of California, Santa Cruz, CA 95064 USA
| | - Manfred K Warmuth
- Department of Computer Science, University of California, Santa Cruz, CA 95064 USA
| | | | - Jon E Ness
- DNA 2.0, 1430 O'Brien Drive, Suite E, Menlo Park, CA 94025, USA
| | - Rebecca P Wang
- DNA 2.0, 1430 O'Brien Drive, Suite E, Menlo Park, CA 94025, USA
| | | | - Jeremy Minshull
- DNA 2.0, 1430 O'Brien Drive, Suite E, Menlo Park, CA 94025, USA
| |
Collapse
|
9
|
Ivanisenko VA, Eroshkin AM, Kolchanov NA. WebProAnalyst: an interactive tool for analysis of quantitative structure-activity relationships in protein families. Nucleic Acids Res 2005; 33:W99-104. [PMID: 15980590 PMCID: PMC1160182 DOI: 10.1093/nar/gki421] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
WebProAnalyst is a web-accessible analysis tool () designed for scanning quantitative structure–activity relationships in protein families. The tool allows users to search correlations between protein activity and physicochemical characteristics (i.e. hydrophobicity or alpha-helical amphipathicity) in queried sequences. WebProAnalyst uses aligned amino acid sequences and data on protein activity (pK, Km, ED50, among others). WebProAnalyst implements methods of the known ProAnalyst package, including the multiple linear regression analysis and the sequence–activity correlation coefficient. In addition, WebProAnalyst incorporates a method based on neural networks. The WebProAnalyst reports a list of sites in protein family, the regression analysis parameters (including correlation values) for the relationships between the amino acid physicochemical characteristics in the site and the protein activity values. WebProAnalyst is useful in search of the amino acid residues that are important for protein function/activity. Furthermore, WebProAnalyst may be helpful in designing the protein-engineering experiments.
Collapse
Affiliation(s)
- Vladimir A. Ivanisenko
- Institute of Cytology and Genetics SBRASLavrentyev avenue 10, Novosibirsk, 630090, Russia
- The Novosibirsk State University, Novosibirsk630090, Russia
- To whom correspondence should be addressed. Tel: +7 3832332971; Fax: +7 3832331278;
| | - Alexey M. Eroshkin
- State Research Center of Virology and Biotechnology VECTORKoltsovo, Novosibirsk Region, 630559, Russia
| | - Nickolay A. Kolchanov
- Institute of Cytology and Genetics SBRASLavrentyev avenue 10, Novosibirsk, 630090, Russia
| |
Collapse
|
10
|
Minshull J, Ness JE, Gustafsson C, Govindarajan S. Predicting enzyme function from protein sequence. Curr Opin Chem Biol 2005; 9:202-9. [PMID: 15811806 DOI: 10.1016/j.cbpa.2005.02.003] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
There are two main reasons to try to predict an enzyme's function from its sequence. The first is to identify the components and thus the functional capabilities of an organism, the second is to create enzymes with specific properties. Genomics, expression analysis, proteomics and metabonomics are largely directed towards understanding how information flows from DNA sequence to protein functions within an organism. This review focuses on information flow in the opposite direction: the applicability of what is being learned from natural enzymes to improve methods for catalyst design.
Collapse
|
11
|
Minshull J, Govindarajan S, Cox T, Ness JE, Gustafsson C. Engineered protein function by selective amino acid diversification. Methods 2005; 32:416-27. [PMID: 15003604 DOI: 10.1016/j.ymeth.2003.10.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/06/2003] [Indexed: 11/16/2022] Open
Abstract
Almost all protein engineering methods rely upon making changes to naturally occurring proteins that already possess some of the desired properties. This will probably remain the case as long as we lack a complete understanding of the way that an amino acid sequence gives rise to a protein with a precisely defined biological function. Common to all methods for altering an existing protein is the selection of a subset of amino acids in the protein for variation and a choice of which substitutions to make at each position. Variants are then tested empirically and further variants are created based upon their performance. Differences between protein engineering methods are the ways in which amino acids are chosen for variation, the protocols followed for creating the variants, and how information regarding variant properties is used in creating subsequent variants. In this article, we describe these differences and provide examples of how the experimental parameters of specific projects determine which method is most suitable.
Collapse
|
12
|
Kim HJ, Chae CH, Yi KY, Park KL, Yoo SE. Computational studies of COX-2 inhibitors: 3D-QSAR and docking. Bioorg Med Chem 2004; 12:1629-41. [PMID: 15028256 DOI: 10.1016/j.bmc.2004.01.027] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2003] [Revised: 01/17/2004] [Accepted: 01/19/2004] [Indexed: 11/29/2022]
Abstract
The 3D-QSAR (three-dimensional quantitative structure-activity relationships) studies for 88 selective COX-2 (cyclooxygenase-2) inhibitors belonging to three chemical classes (triaryl rings, diaryl cycloalkanopyrazoles, and diphenyl hydrazides) were conducted using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA). Partial least squares analysis produced statistically significant models with q(2) values of 0.84 and 0.79 for CoMFA and CoMSIA, respectively. The binding energies calculated from flexible docking were correlated with inhibitory activities by the least-squares fit method. The three chemical classes of inhibitors showed reasonable internal predictability (r(2)=0.51, 0.49, and 0.54), but the sulfonyl-containing inhibitors demonstrated distinctively low binding energy compared to the others. The electrostatic interaction energy between the Arg513 of the COX-2 active site and sulfonyl group of the triaryl rings seemed to have the responsibility for difference in binding energy. Comparative binding energy (COMBINE) analyses gave q(2) values of 0.64, 0.63, and 0.50 for triaryl rings, diaryl cycloalkanopyrazoles, and diphenyl hydrazides, respectively. In this COMBINE model, some protein residues were highlighted as particularly important for inhibitory activity. The combination of ligand-based and structure-based models provided an improved understanding in the interaction between the three chemical classes and the COX-2.
Collapse
Affiliation(s)
- Hye-Jung Kim
- Korea Research Institute of Chemical Technology, PO Box 107, Yusung-gu, Taejeon 305-343, South Korea
| | | | | | | | | |
Collapse
|
13
|
Varga JL, Schally AV, Horvath JE, Kovacs M, Halmos G, Groot K, Toller GL, Rekasi Z, Zarandi M. Increased activity of antagonists of growth hormone-releasing hormone substituted at positions 8, 9, and 10. Proc Natl Acad Sci U S A 2004; 101:1708-13. [PMID: 14755056 PMCID: PMC341828 DOI: 10.1073/pnas.0307288101] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Antagonists of human growth hormone-releasing hormone (hGHRH) with increased potency and improved enzymatic and chemical stability are needed for potential clinical applications. We synthesized 21 antagonistic analogs of hGHRH(1-29)NH(2), substituted at positions 8, 9, and 10 of the common core sequence [phenylacetyl-Tyr(1), d-Arg(2,28), para-chloro-phenylalanine 6, Arg(9)/homoarginine 9, Tyr(10)/O-methyltyrosine 10, alpha-aminobutyric acid 15, norleucine 27, Har(29)] hGHRH(1-29)NH(2). Inhibitory effects on hGHRH-induced GH release were evaluated in vitro in a superfused rat pituitary system, as well as in vivo after i.v. injection into rats. The binding affinities of the peptides to pituitary GHRH receptors were also determined. Introduction of para-amidinophenylalanine 10 yielded antagonists JV-1-62 and -63 with the highest activities in vitro and lowest receptor dissociation constants (K(i) = 0.057-0.062 nM). Antagonists JV-1-62 and -63 also exhibited the strongest effect in vivo, significantly (P < 0.05-0.001) inhibiting hGHRH-induced GH release for at least 1 h. Para-aminophenylalanine 10 and O-ethyltyrosine 10 substitutions yielded antagonists potent in vitro, but His(10), 3,3'-diphenylalanine 10, 2-naphthylalanine 10, and cyclohexylalanine 10 modifications were detrimental. Antagonists containing citrulline 9 (in MZ-J-7-72), amidinophenylalanine 9 (in JV-1-65), His(9), d-Arg(9), citrulline 8, Ala(8), d-Ala(8), or alpha-aminobutyric acid 8 substituents also had high activity and receptor affinity in vitro. However, in vitro potencies of analogs with substitution in position 9 correlated poorly with acute endocrine effects in vivo, as exemplified by the weak and/or short inhibitory actions of antagonists JV-1-65 and MZ-J-7-72 on GH release in vivo. Nevertheless, antagonist JV-1-65 was more potent than JV-1-63 in tests on inhibition of the growth of human prostatic and lung cancer lines xenografted into nude mice. This indicates that oncological activity may be based on several mechanisms. hGHRH antagonists with improved efficacy could be useful for treatment of cancers that depend on insulin-like growth factors or GHRH.
Collapse
Affiliation(s)
- Jozsef L Varga
- Endocrine, Polypeptide, and Cancer Institute, Veterans Affairs Medical Center, Tulane University School of Medicine, New Orleans, LA 70112-2699, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Gustafsson C, Govindarajan S, Minshull J. Putting engineering back into protein engineering: bioinformatic approaches to catalyst design. Curr Opin Biotechnol 2003; 14:366-70. [PMID: 12943844 DOI: 10.1016/s0958-1669(03)00101-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Complex multivariate engineering problems are commonplace and not unique to protein engineering. Mathematical and data-mining tools developed in other fields of engineering have now been applied to analyze sequence-activity relationships of peptides and proteins and to assist in the design of proteins and peptides with specified properties. Decreasing costs of DNA sequencing in conjunction with methods to quickly synthesize statistically representative sets of proteins allow modern heuristic statistics to be applied to protein engineering. This provides an alternative approach to expensive assays or unreliable high-throughput surrogate screens.
Collapse
|
15
|
Liu SS, Liu HL, Yin CS, Wang LS. VSMP: a novel variable selection and modeling method based on the prediction. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2003; 43:964-9. [PMID: 12767155 DOI: 10.1021/ci020377j] [Citation(s) in RCA: 81] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The use of numerous descriptors that are indicative of molecular structure and topology is becoming more common in quantitative structure-activity relationship (QSAR). How to choose the adequate descriptors for QSAR studies is important but difficult because there are no absolute rules to govern this choice. A variety of variable selection techniques including stepwise, partial least squares/principal component analysis (PLS/PCA), neural network, and evolutionary algorithm such as genetic algorithm have been applied to this common problem. All-subsets regression (ASR) is capable of finding out the best variable subset from among a large pool. In this paper, a novel variable selection and modeling method based on the prediction, for short VSMP, has been developed. Here two controllable parameters, the interrelation coefficient between the pairs of the independent variables (r(int)) and the correlation coefficient (q(2)) obtained using the leave-one-out (LOO) cross-validation technique, are introduced into the ASR to improve its performances. This technique differs from the other variable selection procedures related to the ASR by two main features: (1) The search of various optimal subset search is controlled by the statistic q(2) or root-mean-square error (RMSEP) in the LOO cross-validation step rather than the correlation coefficient obtained in the modeling step (r(2)). (2) The searching speed of all optimal subsets is expedited by the statistic r(int) together with q(2). A comparison of the results of the VSMP applied to the Selwood data set (n = 31 compounds, m = 53 descriptors) with those obtained from alternative algorithms shows the good performance of the technique.
Collapse
Affiliation(s)
- Shu-Shen Liu
- State Key Laboratory of Pollution Control and Resources Reuse, Department of Environmental Science & Engineering, Nanjing University, Nanjing 210093, P. R. China.
| | | | | | | |
Collapse
|
16
|
Choulier L, Andersson K, Hämäläinen MD, van Regenmortel MHV, Malmqvist M, Altschuh D. QSAR studies applied to the prediction of antigen-antibody interaction kinetics as measured by BIACORE. Protein Eng Des Sel 2002; 15:373-82. [PMID: 12034857 DOI: 10.1093/protein/15.5.373] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The objective of this work was to investigate the potential of the quantitative structure-activity relationships (QSAR) approach for predictive modulation of molecular interaction kinetics. A multivariate QSAR approach involving modifications in peptide sequence and buffer composition was recently used in an attempt to predict the kinetics of peptide-antibody interactions as measured by BIACORE. Quantitative buffer-kinetics relationships (QBKR) and quantitative sequence-kinetics relationships (QSKR) models were developed. Their predictive capacity was investigated in this study by comparing predicted and observed kinetic dissociation parameters (k(d)) for new antigenic peptides, or in new buffers. The range of experimentally measured k(d) variations was small (300-fold), limiting the practical value of the approach for this particular interaction. However, the models were validated from a statistical point of view. In QSKR, the leave-one-out cross validation gave Q(2) = 0.71 for 24 peptides (all but one outlier), compared to 0.81 for 17 training peptides. A more precise model (Q(2) = 0.92) could be developed when removing sets of peptides sharing distinctive structural features, suggesting that different peptides use slightly different binding modes. All models share the most important factor and are informative for structure-kinetics relationships. In QBKR, the measured effect on k(d) of individual additives in the buffers was consistent with the effect predicted from multivariate buffers. Our results open new perspectives for the predictive optimization of interaction kinetics, with important implications in pharmacology and biotechnology.
Collapse
Affiliation(s)
- Laurence Choulier
- UMR7100-CNRS, ESBS, Bld Sébastien Brandt, 67400 Illkirch Cedex, France and Biacore AB, Rapsgatan 7, SE754 50 Uppsala, Sweden
| | | | | | | | | | | |
Collapse
|
17
|
Lee MJ, de Jong S, Gäde G, Poulos C, Goldsworthy GJ. Mathematical modelling of insect neuropeptide potencies. Are quantitatively predictive models possible? INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2000; 30:899-907. [PMID: 10899456 DOI: 10.1016/s0965-1748(00)00078-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The potencies of natural adipokinetic hormones and synthetic variants have been determined in Locusta migratoria using the lipid mobilisation assay in vivo, and/or the acetate uptake assay in vitro. These data are combinations of previously published and unpublished data (a total of sixty-nine analogues), and form data sets for the construction of mathematical models of the hormone potencies. The sequence variations of amino acids in both natural and artificial adipokinetic hormone analogues were described using continuous descriptor scales z(1)', z(2)', and z(3)', each previously published scale being derived from various properties of the amino acids. By means of these z'-scales and partial least squares regression we attempted to model the potencies in Locusta migratoria of adipokinetic hormones in the two assays. Correlations (r(2) values) between predicted and actual potencies of the different peptides were up to 0.73. We discuss the potential of the partial least squares method for formulating quantitative relationships between different hormone structures and their potencies, and describe how the procedure might be used in structure-activity prediction with the construction of an optimised peptide data set.
Collapse
Affiliation(s)
- M J Lee
- Biotechnology, Unilever Research Vlaardingen, Olivier van Noortlaan 120, 3133 AT, Vlaardingen, The Netherlands.
| | | | | | | | | |
Collapse
|
18
|
Varga JL, Schally AV, Csernus VJ, Zarándi M, Halmos G, Groot K, Rékási Z. Synthesis and biological evaluation of antagonists of growth hormone-releasing hormone with high and protracted in vivo activities. Proc Natl Acad Sci U S A 1999; 96:692-7. [PMID: 9892695 PMCID: PMC15198 DOI: 10.1073/pnas.96.2.692] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Some antagonists of human growth hormone-releasing hormone (hGH-RH) synthesized previously were shown to inhibit in vivo proliferation of various human cancers in nude mice. However, the activity of these analogs requires an increase to assure clinical efficacy. In an attempt to prepare hGH-RH antagonists with a high and protracted activity, we synthesized and biologically tested 22 antagonistic analogs of hGH-RH(1-29)NH2. The ability of the antagonists to inhibit hGH-RH-induced GH release was evaluated in vitro in a superfused rat pituitary system, as well as in vivo after i.v. injection into rats. The binding affinity of the peptides to GH-RH receptors also was determined. All antagonistic analogs had the common core sequence [PhAc-Tyr1,D-Arg2, Phe(4-Cl)6 (para-chlorophenylalanine), Abu15 (alpha-aminobutyric acid), Nle27]hGH-RH(1-29)NH2 and contained Arg, D-Arg, homoarginine (Har), norleucine (Nle), and other substitutions. The following analogs were determined to have a high and/or protracted antagonistic activity: [PhAc-Tyr1,D-Arg2,Phe(4-Cl)6,Arg9,Abu15,Nle27, D-Arg29]hGH-RH(1-29)NH2 (JV-1-10), [PhAc-Tyr1,D-Arg2,Phe(4-Cl)6, Abu15,Nle27,D-Arg28,Har29]hGH-RH(1-29)NH2 (MZ-6-55), [PhAc-Tyr1, D-Arg2,Phe(4-Cl)6,Arg9,Abu15,Nle27,D-Arg28,Har29 ]hGH-RH(1-29)NH2 (JV-1-36), and [PhAc-Tyr1,D-Arg2,Phe(4-Cl)6,Har9,Tyr(Me)10,Abu15, Nle27,D-Arg28,Har29]hGH-RH(1-29)NH2 (JV-1-38). Among the peptides tested, analog JV-1-36 showed the highest GH-RH antagonistic activity in vitro and also induced a strong and prolonged inhibition of GH release in vivo for at least 30 min. The antagonist JV-1-38 was slightly less potent than JV-1-36 both in vitro and in vivo but proved to be very long-acting in vivo, suppressing the GH-RH-induced GH release even after 60 min. High and protracted in vivo activities of these antagonists indicate an improvement over earlier GH-RH analogs. Some of these hGH-RH antagonists could find clinical applications in the treatment of cancers dependent on insulin-like growth factors I and II.
Collapse
Affiliation(s)
- J L Varga
- Endocrine, Polypeptide and Cancer Institute, Veterans Affairs Medical Center, Tulane University School of Medicine, New Orleans, LA 70112, USA
| | | | | | | | | | | | | |
Collapse
|
19
|
Matter H. A validation study of molecular descriptors for the rational design of peptide libraries. THE JOURNAL OF PEPTIDE RESEARCH : OFFICIAL JOURNAL OF THE AMERICAN PEPTIDE SOCIETY 1998; 52:305-14. [PMID: 9832309 DOI: 10.1111/j.1399-3011.1998.tb01245.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Important molecular descriptors used for establishing quantitative structure-activity relationships are investigated to classify similar versus dissimilar peptides. When searching new lead structures, synthesizing and testing compounds which are too similar wastes time and resources. In contrast, any lead optimization program requires the investigation of similar compounds to that lead. Thus, it is important to maximize or minimize the structural diversity of peptides to design useful compound libraries for lead finding or lead refinement projects. If a molecular descriptor is a useful measure of similarity for the design of peptide libraries, small differences in this descriptor for a pair of molecules should only translate into small biological differences. Using this paradigm as a basis for descriptor validation, it was possible to rank different molecular descriptors. Those physicochemical descriptors are 2D fingerprints and five experimentally or theoretically derived principal property scales. Some theoretically derived metrics are obtained by computing interaction energies or similarity indices on predefined 3D grid points using canonical conformations for individual amino acids. The resulting 3D data matrices are analyzed using a principal component analysis leading to three principal properties for CoMFA (Comparative Molecular Field Analysis) or CoMSIA (Comparative Molecular Similarity Index Analysis) derived molecular fields. The descriptor validation results reveal the applicability of design tools on peptide data sets. Experimentally derived descriptors, in general, are more acceptable than computationally derived metrics, while the latter provide a statistically valid alternative to characterize novel building blocks. The CoMSIA metrics perform slightly better than the CoMFA-based principal properties, while GRID-based descriptors are always less acceptable.
Collapse
Affiliation(s)
- H Matter
- Hoechst Marion Roussel AG, Computational Chemistry, Core Research Functions, Frankfurt am Main, Germany.
| |
Collapse
|