1
|
Johnston RC, Yao K, Kaplan Z, Chelliah M, Leswing K, Seekins S, Watts S, Calkins D, Chief Elk J, Jerome SV, Repasky MP, Shelley JC. Epik: p Ka and Protonation State Prediction through Machine Learning. J Chem Theory Comput 2023; 19:2380-2388. [PMID: 37023332 DOI: 10.1021/acs.jctc.3c00044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023]
Abstract
Epik version 7 is a software program that uses machine learning for predicting the pKa values and protonation state distribution of complex, druglike molecules. Using an ensemble of atomic graph convolutional neural networks (GCNNs) trained on over 42,000 pKa values across broad chemical space from both experimental and computed origins, the model predicts pKa values with 0.42 and 0.72 pKa unit median absolute and root mean square errors, respectively, across seven test sets. Epik version 7 also generates protonation states and recovers 95% of the most populated protonation states compared to previous versions. Requiring on average only 47 ms per ligand, Epik version 7 is rapid and accurate enough to evaluate protonation states for crucial molecules and prepare ultra-large libraries of compounds to explore vast regions of chemical space. The simplicity and time required for the training allow for the generation of highly accurate models customized to a program's specific chemistry.
Collapse
Affiliation(s)
- Ryne C Johnston
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Kun Yao
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Zachary Kaplan
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Monica Chelliah
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Karl Leswing
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Sean Seekins
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Shawn Watts
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - David Calkins
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Jackson Chief Elk
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Steven V Jerome
- Schrödinger, Inc., 9171 Towne Centre Drive, San Diego, California 92122, United States
| | - Matthew P Repasky
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - John C Shelley
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| |
Collapse
|
2
|
Zhang Y, Vass M, Shi D, Abualrous E, Chambers JM, Chopra N, Higgs C, Kasavajhala K, Li H, Nandekar P, Sato H, Miller EB, Repasky MP, Jerome SV. Benchmarking Refined and Unrefined AlphaFold2 Structures for Hit Discovery. J Chem Inf Model 2023; 63:1656-1667. [PMID: 36897766 DOI: 10.1021/acs.jcim.2c01219] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023]
Abstract
The recently developed AlphaFold2 (AF2) algorithm predicts proteins' 3D structures from amino acid sequences. The open AlphaFold protein structure database covers the complete human proteome. Using an industry-leading molecular docking method (Glide), we investigated the virtual screening performance of 37 common drug targets, each with an AF2 structure and known holo and apo structures from the DUD-E data set. In a subset of 27 targets where the AF2 structures are suitable for refinement, the AF2 structures show comparable early enrichment of known active compounds (avg. EF 1%: 13.0) to apo structures (avg. EF 1%: 11.4) while falling behind early enrichment of the holo structures (avg. EF 1%: 24.2). With an induced-fit protocol (IFD-MD), we can refine the AF2 structures using an aligned known binding ligand as the template to improve the performance in structure-based virtual screening (avg. EF 1%: 18.9). Glide-generated docking poses of known binding ligands can also be used as templates for IFD-MD, achieving similar improvements (avg. EF 1% 18.0). Thus, with proper preparation and refinement, AF2 structures show considerable promise for in silico hit identification.
Collapse
Affiliation(s)
- Yuqi Zhang
- Schrödinger Inc., 9868 Scranton Road, Suite3200, San Diego, California 92121, United States
| | - Marton Vass
- Schrödinger Technologies Limited, Nine Hills Road, Cambridge CB2 1GE, United Kingdom
| | - Da Shi
- Schrödinger Inc., 9868 Scranton Road, Suite3200, San Diego, California 92121, United States
| | - Esam Abualrous
- Schrödinger GmbH, Glücksteinallee 25, Mannheim 68159, Germany
| | - Jennifer M Chambers
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Nikita Chopra
- Schrödinger Inc., 9868 Scranton Road, Suite3200, San Diego, California 92121, United States
| | - Christopher Higgs
- Schrödinger Inc., 9868 Scranton Road, Suite3200, San Diego, California 92121, United States
| | - Koushik Kasavajhala
- Schrödinger India Private Limited, No: #147, 3rd Floor, Jawaharlal Nehru Main Road, Channasandra 5th Stage Rajarajeswari Nagar, Above State Bank of India, Bengaluru, India 560098
| | - Hubert Li
- Schrödinger Inc., 9868 Scranton Road, Suite3200, San Diego, California 92121, United States
| | - Prajwal Nandekar
- Schrödinger India Private Limited, No: #147, 3rd Floor, Jawaharlal Nehru Main Road, Channasandra 5th Stage Rajarajeswari Nagar, Above State Bank of India, Bengaluru, India 560098
| | - Hideyuki Sato
- Schrödinger KK, 13F Marunouchi Trust Tower North, 1-8-1 Marunouchi, Chiyoda-ku, Tokyo 100-0005, Japan
| | - Edward B Miller
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036, United States
| | - Matthew P Repasky
- Schrödinger Inc., 101 SW Main Street, #1300, Portland, Oregon 97239, United States
| | - Steven V Jerome
- Schrödinger Inc., 9868 Scranton Road, Suite3200, San Diego, California 92121, United States
| |
Collapse
|
3
|
Yang Y, Yao K, Repasky MP, Leswing K, Abel R, Shoichet BK, Jerome SV. Efficient Exploration of Chemical Space with Docking and Deep Learning. J Chem Theory Comput 2021; 17:7106-7119. [PMID: 34592101 DOI: 10.1021/acs.jctc.1c00810] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
With the advent of make-on-demand commercial libraries, the number of purchasable compounds available for virtual screening and assay has grown explosively in recent years, with several libraries eclipsing one billion compounds. Today's screening libraries are larger and more diverse, enabling the discovery of more-potent hit compounds and unlocking new areas of chemical space, represented by new core scaffolds. Applying physics-based in silico screening methods in an exhaustive manner, where every molecule in the library must be enumerated and evaluated independently, is increasingly cost-prohibitive. Here, we introduce a protocol for machine learning-enhanced molecular docking based on active learning to dramatically increase throughput over traditional docking. We leverage a novel selection protocol that strikes a balance between two objectives: (1) identifying the best scoring compounds and (2) exploring a large region of chemical space, demonstrating superior performance compared to a purely greedy approach. Together with automated redocking of the top compounds, this method captures almost all the high scoring scaffolds in the library found by exhaustive docking. This protocol is applied to our recent virtual screening campaigns against the D4 and AMPC targets that produced dozens of highly potent, novel inhibitors, and a blind test against the MT1 target. Our protocol recovers more than 80% of the experimentally confirmed hits with a 14-fold reduction in compute cost, and more than 90% of the hit scaffolds in the top 5% of model predictions, preserving the diversity of the experimentally confirmed hit compounds.
Collapse
Affiliation(s)
- Ying Yang
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Kun Yao
- Schrödinger, Inc., 120 West 45th Street, 17th Floor, New York, New York 10036, United States
| | - Matthew P Repasky
- Schrödinger, Inc., 101 SW Main Street, #1300, Portland, Oregon 97239, United States
| | - Karl Leswing
- Schrödinger, Inc., 120 West 45th Street, 17th Floor, New York, New York 10036, United States
| | - Robert Abel
- Schrödinger, Inc., 120 West 45th Street, 17th Floor, New York, New York 10036, United States
| | - Brian K Shoichet
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Steven V Jerome
- Schrödinger, Inc., 10201 Wateridge Cir Suite 220, San Diego, California 92121, United States
| |
Collapse
|
4
|
Murphy RB, Repasky MP, Greenwood JR, Tubert-Brohman I, Jerome S, Annabhimoju R, Boyles NA, Schmitz CD, Abel R, Farid R, Friesner RA. WScore: A Flexible and Accurate Treatment of Explicit Water Molecules in Ligand–Receptor Docking. J Med Chem 2016; 59:4364-84. [DOI: 10.1021/acs.jmedchem.6b00131] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Robert B. Murphy
- Schrödinger, Inc., 101 SW Main Street, Portland Oregon 97204, United States
| | - Matthew P. Repasky
- Schrödinger, Inc., 101 SW Main Street, Portland Oregon 97204, United States
| | - Jeremy R. Greenwood
- Schrödinger, Inc., 120 West 45th Street, New York, New York 10036, United States
| | - Ivan Tubert-Brohman
- Schrödinger, Inc., 120 West 45th Street, New York, New York 10036, United States
| | - Steven Jerome
- Schrödinger, Inc., 120 West 45th Street, New York, New York 10036, United States
| | | | - Nicholas A. Boyles
- Schrödinger, Inc., 101 SW Main Street, Portland Oregon 97204, United States
| | | | - Robert Abel
- Schrödinger, Inc., 120 West 45th Street, New York, New York 10036, United States
| | - Ramy Farid
- Schrödinger, Inc., 120 West 45th Street, New York, New York 10036, United States
| | - Richard A. Friesner
- Department of
Chemistry, Columbia University, New York, 3000 Broadway,
MC 3110, New York 10036, United States
| |
Collapse
|
5
|
Rao S, Sanschagrin PC, Greenwood JR, Repasky MP, Sherman W, Farid R. Improving database enrichment through ensemble docking. J Comput Aided Mol Des 2008; 22:621-7. [PMID: 18253700 DOI: 10.1007/s10822-008-9182-y] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2007] [Accepted: 01/15/2008] [Indexed: 10/22/2022]
Abstract
While it may seem intuitive that using an ensemble of multiple conformations of a receptor in structure-based virtual screening experiments would necessarily yield improved enrichment of actives relative to using just a single receptor, it turns out that at least in the p38 MAP kinase model system studied here, a very large majority of all possible ensembles do not yield improved enrichment of actives. However, there are combinations of receptor structures that do lead to improved enrichment results. We present here a method to select the ensembles that produce the best enrichments that does not rely on knowledge of active compounds or sophisticated analyses of the 3D receptor structures. In the system studied here, the small fraction of ensembles of up to 3 receptors that do yield good enrichments of actives were identified by selecting ensembles that have the best mean GlideScore for the top 1% of the docked ligands in a database screen of actives and drug-like "decoy" ligands. Ensembles of two receptors identified using this mean GlideScore metric generally outperform single receptors, while ensembles of three receptors identified using this metric consistently give optimal enrichment factors in which, for example, 40% of the known actives outrank all the other ligands in the database.
Collapse
Affiliation(s)
- Shashidhar Rao
- Schrödinger, Inc, 120 West 45th Street, New York, NY, 10036, USA
| | | | | | | | | | | |
Collapse
|
6
|
Abstract
Glide is a ligand docking program for predicting protein-ligand binding modes and ranking ligands via high-throughput virtual screening. Glide utilizes two different scoring functions, SP and XP GlideScore, to rank-order compounds. Three modes of sampling ligand conformational and positional degrees of freedom are available to determine the optimal ligand orientation relative to a rigid protein receptor geometry. This unit presents protocols for flexible ligand docking with Glide, optionally including ligand constraints or ligand molecular similarities.
Collapse
|
7
|
Friesner RA, Murphy RB, Repasky MP, Frye LL, Greenwood JR, Halgren TA, Sanschagrin PC, Mainz DT. Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. J Med Chem 2006; 49:6177-96. [PMID: 17034125 DOI: 10.1021/jm051256o] [Citation(s) in RCA: 4486] [Impact Index Per Article: 249.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A novel scoring function to estimate protein-ligand binding affinities has been developed and implemented as the Glide 4.0 XP scoring function and docking protocol. In addition to unique water desolvation energy terms, protein-ligand structural motifs leading to enhanced binding affinity are included: (1) hydrophobic enclosure where groups of lipophilic ligand atoms are enclosed on opposite faces by lipophilic protein atoms, (2) neutral-neutral single or correlated hydrogen bonds in a hydrophobically enclosed environment, and (3) five categories of charged-charged hydrogen bonds. The XP scoring function and docking protocol have been developed to reproduce experimental binding affinities for a set of 198 complexes (RMSDs of 2.26 and 1.73 kcal/mol over all and well-docked ligands, respectively) and to yield quality enrichments for a set of fifteen screens of pharmaceutical importance. Enrichment results demonstrate the importance of the novel XP molecular recognition and water scoring in separating active and inactive ligands and avoiding false positives.
Collapse
Affiliation(s)
- Richard A Friesner
- Department of Chemistry, Columbia University, New York, New York 10027, USA.
| | | | | | | | | | | | | | | |
Collapse
|
8
|
Banks JL, Beard HS, Cao Y, Cho AE, Damm W, Farid R, Felts AK, Halgren TA, Mainz DT, Maple JR, Murphy R, Philipp DM, Repasky MP, Zhang LY, Berne BJ, Friesner RA, Gallicchio E, Levy RM. Integrated Modeling Program, Applied Chemical Theory (IMPACT). J Comput Chem 2005; 26:1752-80. [PMID: 16211539 PMCID: PMC2742605 DOI: 10.1002/jcc.20292] [Citation(s) in RCA: 1012] [Impact Index Per Article: 53.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
We provide an overview of the IMPACT molecular mechanics program with an emphasis on recent developments and a description of its current functionality. With respect to core molecular mechanics technologies we include a status report for the fixed charge and polarizable force fields that can be used with the program and illustrate how the force fields, when used together with new atom typing and parameter assignment modules, have greatly expanded the coverage of organic compounds and medicinally relevant ligands. As we discuss in this review, explicit solvent simulations have been used to guide our design of implicit solvent models based on the generalized Born framework and a novel nonpolar estimator that have recently been incorporated into the program. With IMPACT it is possible to use several different advanced conformational sampling algorithms based on combining features of molecular dynamics and Monte Carlo simulations. The program includes two specialized molecular mechanics modules: Glide, a high-throughput docking program, and QSite, a mixed quantum mechanics/molecular mechanics module. These modules employ the IMPACT infrastructure as a starting point for the construction of the protein model and assignment of molecular mechanics parameters, but have then been developed to meet specialized objectives with respect to sampling and the energy function.
Collapse
|
9
|
Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, Repasky MP, Knoll EH, Shelley M, Perry JK, Shaw DE, Francis P, Shenkin PS. Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J Med Chem 2004; 47:1739-49. [PMID: 15027865 DOI: 10.1021/jm0306430] [Citation(s) in RCA: 6252] [Impact Index Per Article: 312.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Unlike other methods for docking ligands to the rigid 3D structure of a known protein receptor, Glide approximates a complete systematic search of the conformational, orientational, and positional space of the docked ligand. In this search, an initial rough positioning and scoring phase that dramatically narrows the search space is followed by torsionally flexible energy optimization on an OPLS-AA nonbonded potential grid for a few hundred surviving candidate poses. The very best candidates are further refined via a Monte Carlo sampling of pose conformation; in some cases, this is crucial to obtaining an accurate docked pose. Selection of the best docked pose uses a model energy function that combines empirical and force-field-based terms. Docking accuracy is assessed by redocking ligands from 282 cocrystallized PDB complexes starting from conformationally optimized ligand geometries that bear no memory of the correctly docked pose. Errors in geometry for the top-ranked pose are less than 1 A in nearly half of the cases and are greater than 2 A in only about one-third of them. Comparisons to published data on rms deviations show that Glide is nearly twice as accurate as GOLD and more than twice as accurate as FlexX for ligands having up to 20 rotatable bonds. Glide is also found to be more accurate than the recently described Surflex method.
Collapse
Affiliation(s)
- Richard A Friesner
- Department of Chemistry, Columbia University, New York, New York 10036, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Casadesús R, Moreno M, González-Lafont A, Lluch JM, Repasky MP. Testing electronic structure methods for describing intermolecular H...H interactions in supramolecular chemistry. J Comput Chem 2004; 25:99-105. [PMID: 14634997 DOI: 10.1002/jcc.10371] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this article a wide variety of computational approaches (molecular mechanics force fields, semiempirical formalisms, and hybrid methods, namely ONIOM calculations) have been used to calculate the energy and geometry of the supramolecular system 2-(2'-hydroxyphenyl)-4-methyloxazole (HPMO) encapsulated in beta-cyclodextrin (beta-CD). The main objective of the present study has been to examine the performance of these computational methods when describing the short range H. H intermolecular interactions between guest (HPMO) and host (beta-CD) molecules. The analyzed molecular mechanics methods do not provide unphysical short H...H contacts, but it is obvious that their applicability to the study of supramolecular systems is rather limited. For the semiempirical methods, MNDO is found to generate more reliable geometries than AM1, PM3 and the two recently developed schemes PDDG/MNDO and PDDG/PM3. MNDO results only give one slightly short H...H distance, whereas the NDDO formalisms with modifications of the Core Repulsion Function (CRF) via Gaussians exhibit a large number of short to very short and unphysical H...H intermolecular distances. In contrast, the PM5 method, which is the successor to PM3, gives very promising results. Our ONIOM calculations indicate that the unphysical optimized geometries from PM3 are retained when this semiempirical method is used as the low level layer in a QM:QM formulation. On the other hand, ab initio methods involving good enough basis sets, at least for the high level layer in a hybrid ONIOM calculation, behave well, but they may be too expensive in practice for most supramolecular chemistry applications. Finally, the performance of the evaluated computational methods has also been tested by evaluating the energetic difference between the two most stable conformations of the host(beta-CD)-guest(HPMO) system.
Collapse
Affiliation(s)
- Ricard Casadesús
- Departament de Química, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | | | | | | | | |
Collapse
|
11
|
Tubert-Brohman I, Guimarães CRW, Repasky MP, Jorgensen WL. Extension of the PDDG/PM3 and PDDG/MNDO semiempirical molecular orbital methods to the halogens. J Comput Chem 2004; 25:138-50. [PMID: 14635001 DOI: 10.1002/jcc.10356] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
The new semiempirical methods, PDDG/PM3 and PDDG/MNDO, have been parameterized for halogens. For comparison, the original MNDO and PM3 were also reoptimized for the halogens using the same training set; these modified methods are referred to as MNDO' and PM3'. For 442 halogen-containing molecules, the smallest mean absolute error (MAE) in heats of formation is obtained with PDDG/PM3 (5.6 kcal/mol), followed by PM3' (6.1 kcal/mol), PDDG/MNDO (6.6 kcal/mol), PM3 (8.1 kcal/mol), MNDO' (8.5 kcal/mol), AM1 (11.1 kcal/mol), and MNDO (14.0 kcal/mol). For normal-valent halogen-containing molecules, the PDDG methods also provide improved heats of formation over MNDO/d. Hypervalent compounds were not included in the training set and improvements over the standard NDDO methods with sp basis sets were not obtained. For small haloalkanes, the PDDG methods yield more accurate heats of formation than are obtained from density functional theory (DFT) with the B3LYP and B3PW91 functionals using large basis sets. PDDG/PM3 and PM3' also give improved binding energies over the standard NDDO methods for complexes involving halide anions, and they are competitive with B3LYP/6-311++G(d,p) results including thermal corrections. Among the semiempirical methods studied, PDDG/PM3 also generates the best agreement with high-level ab initio G2 and CCSD(T) intrinsic activation energies for S(N)2 reactions involving methyl halides and halide anions. Finally, the MAEs in ionization potentials, dipole moments, and molecular geometries show that the parameter sets for the PDDG and reoptimized NDDO methods reduce the MAEs in heats of formation without compromising the other important QM observables.
Collapse
Affiliation(s)
- Ivan Tubert-Brohman
- Department of Chemistry, Yale University, 225 Prospect St., New Haven, Connecticut 06520, USA
| | | | | | | |
Collapse
|
12
|
Guimarães CRW, Repasky MP, Chandrasekhar J, Tirado-Rives J, Jorgensen WL. Contributions of conformational compression and preferential transition state stabilization to the rate enhancement by chorismate mutase. J Am Chem Soc 2003; 125:6892-9. [PMID: 12783541 DOI: 10.1021/ja021424r] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The rate enhancement provided by the chorismate mutase (CM) enzyme for the Claisen rearrangement of chorismate to prephenate has been investigated by application of the concept of near attack conformations (NACs). Using a combined QM/MM Monte Carlo/free-energy perturbation (MC/FEP) method, 82% and 100% of chorismate conformers were found to be NAC structures in water and in the CM active site, respectively. Consequently, the conversion of non-NACs to NACs does not contribute to the free energy of activation from preorganization of the substrate into NACs. The FEP calculations yielded differences in free energies of activation that well reproduce the experimental data. Additional calculations indicate that the rate enhancement by CM over the aqueous phase results primarily from conformational compression of NACs by the enzyme and that this process is enthalpically controlled. This suggests that preferential stabilization of the transition state in the enzyme environment relative to water plays a secondary role in the catalysis by CM.
Collapse
|
13
|
Repasky MP, Guimarães CRW, Chandrasekhar J, Tirado-Rives J, Jorgensen WL. Investigation of solvent effects for the Claisen rearrangement of chorismate to prephenate: mechanistic interpretation via near attack conformations. J Am Chem Soc 2003; 125:6663-72. [PMID: 12769575 DOI: 10.1021/ja021423z] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Solvent effects on the rate of the Claisen rearrangement of chorismate to prephenate have been examined in water and methanol. The preequilibrium free-energy differences between diaxial and diequatorial conformers of chorismate, which had previously been implicated as the sole basis for the observed 100-fold rate increase in water over methanol, have been reframed using the near attack conformation (NAC) concept of Bruice and co-workers. Using a combined QM/MM Monte Carlo/free-energy perturbation (MC/FEP) method, 82%, 57%, and 1% of chorismate conformers were found to be NAC structures (NACs) in water, methanol, and the gas phase, respectively. As a consequence, the conversion of non-NACs to NACs provides no free-energy contributions to the overall relative reaction rates in water versus methanol. Free-energy perturbation calculations yielded differences in free energies of activation for the two polar protic solvents and the gas phase. The rate enhancement in water over the gas phase arises from preferential hydration of the transition state (TS) relative to the reactants via increased hydrogen bonding and long-range electrostatic interactions, which accompany bringing the two negatively charged carboxylates into closer proximity. More specifically, there is an increase of 1.3 and 0.6 hydrogen bonds to the carboxylate groups and the ether oxygen, respectively, in going from the reactant to the TS in water. In methanol, the corresponding changes in hydrogen bonding with first shell solvent molecules are small; the rate enhancement arises primarily from the enhanced long-range interactions with solvent molecules. Thus, the reaction occurs faster in water than in methanol due to greater stabilization of the TS in water by specific interactions with first shell solvent molecules.
Collapse
Affiliation(s)
- Matthew P Repasky
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, CT 06520-8107, USA
| | | | | | | | | |
Collapse
|
14
|
Abstract
Two new semiempirical methods employing a Pairwise Distance Directed Gaussian modification have been developed: PDDG/PM3 and PDDG/MNDO; they are easily implemented in existing software, and yield heats of formation for compounds containing C, H, N, and O atoms with significantly improved accuracy over the standard NDDO schemes, PM5, PM3, AM1, and MNDO. The PDDG/PM3 results for heats of formation also show substantial improvement over density functional theory with large basis sets. The PDDG modifications consist of a single function, which is added to the existing pairwise core repulsion functions within PM3 and MNDO, a reparameterized semiempirical parameter set, and modified computation of the energy of formation of a gaseous atom. The PDDG addition introduces functional group information via pairwise atomic interactions using only atom-based parameters. For 622 diverse molecules containing C, H, N, and O atoms, mean absolute errors in calculated heats of formation are reduced from 4.4 to 3.2 kcal/mol and from 8.4 to 5.2 kcal/mol using the PDDG modified versions of PM3 and MNDO over the standard versions, respectively. Several specific problems are overcome, including the relative stability of hydrocarbon isomers, and energetics of small rings and molecules containing multiple heteroatoms. The internal consistency of PDDG energies is also significantly improved, enabling more reliable analysis of isomerization energies and trends across series of molecules; PDDG isomerization energies show significant improvement over B3LYP/6-31G* results. Comparison of heats of formation, ionization potentials, dipole moments, isomer, and conformer energetics, intermolecular interaction energies, activation energies, and molecular geometries from the PDDG techniques is made to experimental data and values from other semiempirical and ab initio methods.
Collapse
Affiliation(s)
- Matthew P Repasky
- Department of Chemistry, Yale University, 225 Prospect St., New Haven, Connecticut 06520-8107, USA
| | | | | |
Collapse
|
15
|
Repasky MP, Chandrasekhar J, Jorgensen WL. Improved semiempirical heats of formation through the use of bond and group equivalents. J Comput Chem 2002; 23:498-510. [PMID: 11908087 DOI: 10.1002/jcc.10023] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Deficiencies in energetics obtained using the common semiempirical methods, AM1, PM3, and MNDO, may partly be traced to the use of pseudoatomic equivalents for conversion of molecular energies to heats of formation at 298 K. We present an alternative scheme based on the use of bond and group equivalents. Values for the 61 bond and group equivalents necessary for treatment of molecules containing the common organic elements, hydrogen, carbon, nitrogen, and oxygen have been derived. For a set of 583 neutral, closed-shell molecules mean absolute errors in AM1, PM3, and MNDO heats of formation are reduced from 6.6, 4.2, and 8.2 kcal/mol to 2.3, 2.2, and 3.0 kcal/mol, respectively. Several systematic problems are overcome in the present scheme including relative stabilities of branched hydrocarbons, energetics of conjugated systems, heats of formation of long chain hydrocarbons, and enthalpies of molecules containing multiple heteroatoms. Although the approach is restricted to molecules with well-defined functional groups, the equivalents are easy to incorporate and are chemically relevant. This revised procedure allows semiempirical methods to be used for far more reliable evaluations of heats of reactions. Estimates are made of the errors inherent in these semiempirical formalisms, arising from integral approximations and the neglect of explicit treatment of electron correlation effects, while excluding those from inadequate parameterization.
Collapse
Affiliation(s)
- Matthew P Repasky
- Department of Chemistry, Yale University, 225 Prospect St., New Haven, Connecticut 06520-8107, USA
| | | | | |
Collapse
|
16
|
Wesolowski SS, Brinkmann NR, Valeev EF, Schaefer HF, Repasky MP, Jorgensen WL. Three- versus four-coordinate phosphorus in the gas phase and in solution: Treacherous relative energies for phosphine oxide and phosphinous acid. J Chem Phys 2002. [DOI: 10.1063/1.1418440] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
17
|
Affiliation(s)
- Dongchul Lim
- Department of Chemistry, Yale University, New Haven, CT 06520-8107
| | - Corky Jenson
- Department of Chemistry, Yale University, New Haven, CT 06520-8107
| | | | | |
Collapse
|
18
|
|