Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles

16
(from Reference Citation Analysis)

Article PDFs (6)

Cited by > 0 (13)

Searched Name

Christoffer Norn

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Statistics

Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Category

Show more Refine

Number	Citation Analysis
1	De novo design of diverse small molecule binders and sensors using Shape Complementary Pseudocycles. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.20.572602. [PMID: 38187589 PMCID: PMC10769206 DOI: 10.1101/2023.12.20.572602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024] Abstract A general method for designing proteins to bind and sense any small molecule of interest would be widely useful. Due to the small number of atoms to interact with, binding to small molecules with high affinity requires highly shape complementary pockets, and transducing binding events into signals is challenging. Here we describe an integrated deep learning and energy based approach for designing high shape complementarity binders to small molecules that are poised for downstream sensing applications. We employ deep learning generated psuedocycles with repeating structural units surrounding central pockets; depending on the geometry of the structural unit and repeat number, these pockets span wide ranges of sizes and shapes. For a small molecule target of interest, we extensively sample high shape complementarity pseudocycles to generate large numbers of customized potential binding pockets; the ligand binding poses and the interacting interfaces are then optimized for high affinity binding. We computationally design binders to four diverse molecules, including for the first time polar flexible molecules such as methotrexate and thyroxine, which are expressed at high levels and have nanomolar affinities straight out of the computer. Co-crystal structures are nearly identical to the design models. Taking advantage of the modular repeating structure of pseudocycles and central location of the binding pockets, we constructed low noise nanopore sensors and chemically induced dimerization systems by splitting the binders into domains which assemble into the original pseudocycle pocket upon target molecule addition. Collapse Key Words Collapse MESH Headings Collapse Grants P30 GM133893 NIGMS NIH HHS U19 AG065156 NIA NIH HHS R01 AG063845 NIA NIH HHS HHSN272201700059C NIAID NIH HHS P30 GM124165 NIGMS NIH HHS R01 AI160052 NIAID NIH HHS 75N93022C00036 NIAID NIH HHS Collapse
2	Small-molecule binding and sensing with a designed protein family. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.01.565201. [PMID: 37961294 PMCID: PMC10635051 DOI: 10.1101/2023.11.01.565201] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023] Abstract Despite transformative advances in protein design with deep learning, the design of small-molecule-binding proteins and sensors for arbitrary ligands remains a grand challenge. Here we combine deep learning and physics-based methods to generate a family of proteins with diverse and designable pocket geometries, which we employ to computationally design binders for six chemically and structurally distinct small-molecule targets. Biophysical characterization of the designed binders revealed nanomolar to low micromolar binding affinities and atomic-level design accuracy. The bound ligands are exposed at one edge of the binding pocket, enabling the de novo design of chemically induced dimerization (CID) systems; we take advantage of this to create a biosensor with nanomolar sensitivity for cortisol. Our approach provides a general method to design proteins that bind and sense small molecules for a wide range of analytical, environmental, and biomedical applications. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
3	Computational design of sequence-specific DNA-binding proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.20.558720. [PMID: 37790440 PMCID: PMC10542524 DOI: 10.1101/2023.09.20.558720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023] Abstract Sequence-specific DNA-binding proteins (DBPs) play critical roles in biology and biotechnology, and there has been considerable interest in the engineering of DBPs with new or altered specificities for genome editing and other applications. While there has been some success in reprogramming naturally occurring DBPs using selection methods, the computational design of new DBPs that recognize arbitrary target sites remains an outstanding challenge. We describe a computational method for the design of small DBPs that recognize specific target sequences through interactions with bases in the major groove, and employ this method in conjunction with experimental screening to generate binders for 5 distinct DNA targets. These binders exhibit specificity closely matching the computational models for the target DNA sequences at as many as 6 base positions and affinities as low as 30-100 nM. The crystal structure of a designed DBP-target site complex is in close agreement with the design model, highlighting the accuracy of the design method. The designed DBPs function in both Escherichia coli and mammalian cells to repress and activate transcription of neighboring genes. Our method is a substantial step towards a general route to small and hence readily deliverable sequence-specific DBPs for gene regulation and editing. Collapse Key Words Collapse MESH Headings Collapse Grants P30 GM124169 NIGMS NIH HHS R01 GM135658 NIGMS NIH HHS S10 OD021832 NIH HHS S10 OD028581 NIH HHS Collapse
4	Top-down design of protein architectures with reinforcement learning. Science 2023;380:266-273. [PMID: 37079676 DOI: 10.1126/science.adf6591] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 03/21/2023] [Indexed: 04/22/2023] Abstract As a result of evolutionary selection, the subunits of naturally occurring protein assemblies often fit together with substantial shape complementarity to generate architectures optimal for function in a manner not achievable by current design approaches. We describe a "top-down" reinforcement learning-based design approach that solves this problem using Monte Carlo tree search to sample protein conformers in the context of an overall architecture and specified functional constraints. Cryo-electron microscopy structures of the designed disk-shaped nanopores and ultracompact icosahedra are very close to the computational models. The icosohedra enable very-high-density display of immunogens and signaling molecules, which potentiates vaccine response and angiogenesis induction. Our approach enables the top-down design of complex protein nanomaterials with desired system properties and demonstrates the power of reinforcement learning in protein design. Collapse Key Words Collapse MESH Headings Cryoelectron Microscopy Nanostructures Proteins/chemistry Protein Engineering Machine Learning Collapse Grants T90 DE021984 NIDCR NIH HHS Collapse
5	Atomistic simulation of protein evolution reveals sequence covariation and time-dependent fluctuations of site-specific substitution rates. PLoS Comput Biol 2023;19:e1010262. [PMID: 36961827 PMCID: PMC10075473 DOI: 10.1371/journal.pcbi.1010262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 04/05/2023] [Accepted: 03/01/2023] [Indexed: 03/25/2023] Open Abstract Thermodynamic stability is a crucial fitness constraint in protein evolution and is a central factor in shaping the sequence landscapes of proteins. The correlation between stability and molecular fitness depends on the mechanism that relates the biophysical property with biological function. In the simplest case, stability and fitness are related by the amount of folded protein. However, when proteins are toxic in the unfolded state, the fitness function shifts, resulting in higher stability under mutation-selection balance. Likewise, a higher population size results in a similar change in protein stability, as it magnifies the effect of the selection pressure in evolutionary dynamics. This study investigates how such factors affect the evolution of protein stability, site-specific mutation rates, and residue-residue covariation. To simulate evolutionary trajectories with realistic modeling of protein energetics, we develop an all-atom simulator of protein evolution, RosettaEvolve. By evolving proteins under different fitness functions, we can study how the fitness function affects the distribution of proposed and accepted mutations, site-specific rates, and the prevalence of correlated amino acid substitutions. We demonstrate that fitness pressure affects the proposal distribution of mutational effects, that changes in stability can largely explain variations in site-specific substitution rates in evolutionary trajectories, and that increased fitness pressure results in a stronger covariation signal. Our results give mechanistic insight into the evolutionary consequences of variation in protein stability and provide a basis to rationalize the strong covariation signal observed in natural sequence alignments. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
6	De novo design of luciferases using deep learning. Nature 2023;614:774-780. [PMID: 36813896 PMCID: PMC9946828 DOI: 10.1038/s41586-023-05696-3] [Citation(s) in RCA: 79] [Impact Index Per Article: 79.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 01/03/2023] [Indexed: 02/24/2023] Abstract De novo enzyme design has sought to introduce active sites and substrate-binding pockets that are predicted to catalyse a reaction of interest into geometrically compatible native scaffolds1,2, but has been limited by a lack of suitable protein structures and the complexity of native protein sequence-structure relationships. Here we describe a deep-learning-based 'family-wide hallucination' approach that generates large numbers of idealized protein structures containing diverse pocket shapes and designed sequences that encode them. We use these scaffolds to design artificial luciferases that selectively catalyse the oxidative chemiluminescence of the synthetic luciferin substrates diphenylterazine3 and 2-deoxycoelenterazine. The designed active sites position an arginine guanidinium group adjacent to an anion that develops during the reaction in a binding pocket with high shape complementarity. For both luciferin substrates, we obtain designed luciferases with high selectivity; the most active of these is a small (13.9 kDa) and thermostable (with a melting temperature higher than 95 °C) enzyme that has a catalytic efficiency on diphenylterazine (kcat/Km = 106 M-1 s-1) comparable to that of native luciferases, but a much higher substrate specificity. The creation of highly active and specific biocatalysts from scratch with broad applications in biomedicine is a key milestone for computational enzyme design, and our approach should enable generation of a wide range of luciferases and other enzymes. Collapse Key Words sensors and probes protein design enzymes Collapse MESH Headings Biocatalysis Catalytic Domain Deep Learning Enzyme Stability Hot Temperature Luciferases/chemistry Luciferases/metabolism Luciferins/metabolism Luminescence Oxidation-Reduction Substrate Specificity Collapse Grants K99 EB031913 NIBIB NIH HHS U01 AI151698 NIAID NIH HHS Collapse
7	A thermodynamic model of protein structure evolution explains empirical amino acid substitution matrices. Protein Sci 2021;30:2057-2068. [PMID: 34218472 PMCID: PMC8442976 DOI: 10.1002/pro.4155] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 06/25/2021] [Accepted: 06/29/2021] [Indexed: 12/30/2022] Abstract Proteins evolve under a myriad of biophysical selection pressures that collectively control the patterns of amino acid substitutions. These evolutionary pressures are sufficiently consistent over time and across protein families to produce substitution patterns, summarized in global amino acid substitution matrices such as BLOSUM, JTT, WAG, and LG, which can be used to successfully detect homologs, infer phylogenies, and reconstruct ancestral sequences. Although the factors that govern the variation of amino acid substitution rates have received much attention, the influence of thermodynamic stability constraints remains unresolved. Here we develop a simple model to calculate amino acid substitution matrices from evolutionary dynamics controlled by a fitness function that reports on the thermodynamic effects of amino acid mutations in protein structures. This hybrid biophysical and evolutionary model accounts for nucleotide transition/transversion rate bias, multi‐nucleotide codon changes, the number of codons per amino acid, and thermodynamic protein stability. We find that our theoretical model accurately recapitulates the complex yet universal pattern observed in common global amino acid substitution matrices used in phylogenetics. These results suggest that selection for thermodynamically stable proteins, coupled with nucleotide mutation bias filtered by the structure of the genetic code, is the primary driver behind the global amino acid substitution patterns observed in proteins throughout the tree of life. Collapse Key Words amino acid substitution exchangeabilities protein evolution protein stability replacement matrices Collapse MESH Headings Collapse Grants Collapse
8	Protein sequence design by conformational landscape optimization. Proc Natl Acad Sci U S A 2021;118:e2017228118. [PMID: 33712545 PMCID: PMC7980421 DOI: 10.1073/pnas.2017228118] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open Abstract The protein design problem is to identify an amino acid sequence that folds to a desired structure. Given Anfinsen's thermodynamic hypothesis of folding, this can be recast as finding an amino acid sequence for which the desired structure is the lowest energy state. As this calculation involves not only all possible amino acid sequences but also, all possible structures, most current approaches focus instead on the more tractable problem of finding the lowest-energy amino acid sequence for the desired structure, often checking by protein structure prediction in a second step that the desired structure is indeed the lowest-energy conformation for the designed sequence, and typically discarding a large fraction of designed sequences for which this is not the case. Here, we show that by backpropagating gradients through the transform-restrained Rosetta (trRosetta) structure prediction network from the desired structure to the input amino acid sequence, we can directly optimize over all possible amino acid sequences and all possible structures in a single calculation. We find that trRosetta calculations, which consider the full conformational landscape, can be more effective than Rosetta single-point energy estimations in predicting folding and stability of de novo designed proteins. We compare sequence design by conformational landscape optimization with the standard energy-based sequence design methodology in Rosetta and show that the former can result in energy landscapes with fewer alternative energy minima. We show further that more funneled energy landscapes can be designed by combining the strengths of the two approaches: the low-resolution trRosetta model serves to disfavor alternative states, and the high-resolution Rosetta model serves to create a deep energy minimum at the design target structure. Collapse Key Words energy landscape machine learning protein design sequence optimization stability prediction Collapse MESH Headings Models, Molecular Neural Networks, Computer Protein Conformation Protein Folding Proteins/chemistry Thermodynamics Collapse Grants DP5 OD026389 NIH HHS Howard Hughes Medical Institute Collapse
9	An enumerative algorithm for de novo design of proteins with diverse pocket structures. Proc Natl Acad Sci U S A 2020;117:22135-22145. [PMID: 32839327 PMCID: PMC7486743 DOI: 10.1073/pnas.2005412117] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open Abstract To create new enzymes and biosensors from scratch, precise control over the structure of small-molecule binding sites is of paramount importance, but systematically designing arbitrary protein pocket shapes and sizes remains an outstanding challenge. Using the NTF2-like structural superfamily as a model system, we developed an enumerative algorithm for creating a virtually unlimited number of de novo proteins supporting diverse pocket structures. The enumerative algorithm was tested and refined through feedback from two rounds of large-scale experimental testing, involving in total the assembly of synthetic genes encoding 7,896 designs and assessment of their stability on yeast cell surface, detailed biophysical characterization of 64 designs, and crystal structures of 5 designs. The refined algorithm generates proteins that remain folded at high temperatures and exhibit more pocket diversity than naturally occurring NTF2-like proteins. We expect this approach to transform the design of small-molecule sensors and enzymes by enabling the creation of binding and active site geometries much more optimal for specific design challenges than is accessible by repurposing the limited number of naturally occurring NTF2-like proteins. Collapse Key Words NTF2-like proteins high-throughput screening protein design protein pockets Collapse MESH Headings Algorithms Binding Sites Computer Simulation High-Throughput Screening Assays Models, Molecular Nucleocytoplasmic Transport Proteins/chemistry Protein Conformation Protein Engineering Protein Stability Collapse Grants Collapse
10	Macromolecular modeling and design in Rosetta: recent methods and frameworks. Nat Methods 2020;17:665-680. [PMID: 32483333 PMCID: PMC7603796 DOI: 10.1038/s41592-020-0848-2] [Citation(s) in RCA: 373] [Impact Index Per Article: 93.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 04/22/2020] [Indexed: 12/12/2022] Abstract The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org. Collapse Key Words Collapse MESH Headings Macromolecular Substances/chemistry Models, Molecular Molecular Docking Simulation Peptidomimetics/chemistry Protein Conformation Proteins/chemistry Software Collapse Grants R01 GM099827 NIGMS NIH HHS 18POST34080422 American Heart Association-American Stroke Association R01 DK097376 NIDDK NIH HHS R01 GM117189 NIGMS NIH HHS T32 GM135141 NIGMS NIH HHS Howard Hughes Medical Institute RL1 CA133832 NCI NIH HHS R01 GM126299 NIGMS NIH HHS R01 GM117968 NIGMS NIH HHS R01 GM084453 NIGMS NIH HHS F31 CA243353 NCI NIH HHS R21 GM102716 NIGMS NIH HHS R35 GM122517 NIGMS NIH HHS P30 CA006927 NCI NIH HHS F32 GM110899 NIGMS NIH HHS T32 GM007628 NIGMS NIH HHS P41 RR012408 NCRR NIH HHS R01 GM097207 NIGMS NIH HHS R01 GM099842 NIGMS NIH HHS R01 GM080403 NIGMS NIH HHS R01 GM092802 NIGMS NIH HHS R01 GM073151 NIGMS NIH HHS R35 GM125034 NIGMS NIH HHS R01 AI113867 NIAID NIH HHS R35 GM131923 NIGMS NIH HHS R01 GM127578 NIGMS NIH HHS R21 AI121799 NIAID NIH HHS R01 GM076324 NIGMS NIH HHS R01 GM088277 NIGMS NIH HHS R01 AI143997 NIAID NIH HHS R01 GM078221 NIGMS NIH HHS R01 GM123089 NIGMS NIH HHS R35 ES030443 NIEHS NIH HHS R01 GM132565 NIGMS NIH HHS P42 ES004699 NIEHS NIH HHS R35 GM122579 NIGMS NIH HHS T32 AI007244 NIAID NIH HHS R01 GM098101 NIGMS NIH HHS R01 GM099959 NIGMS NIH HHS F32 CA189246 NCI NIH HHS R01 GM110089 NIGMS NIH HHS F31 GM123616 NIGMS NIH HHS R01 HL122010 NHLBI NIH HHS R01 GM121487 NIGMS NIH HHS U19 AI117905 NIAID NIH HHS R00 GM120388 NIGMS NIH HHS UH2 CA203780 NCI NIH HHS R21 CA219847 NCI NIH HHS T32 GM008268 NIGMS NIH HHS R01 GM073960 NIGMS NIH HHS Collapse
11	A combined computational-experimental approach to define the structural origin of antibody recognition of sialyl-Tn, a tumor-associated carbohydrate antigen. Sci Rep 2018;8:10786. [PMID: 30018351 PMCID: PMC6050261 DOI: 10.1038/s41598-018-29209-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 07/06/2018] [Indexed: 12/16/2022] Open Abstract Anti-carbohydrate monoclonal antibodies (mAbs) hold great promise as cancer therapeutics and diagnostics. However, their specificity can be mixed, and detailed characterization is problematic, because antibody-glycan complexes are challenging to crystallize. Here, we developed a generalizable approach employing high-throughput techniques for characterizing the structure and specificity of such mAbs, and applied it to the mAb TKH2 developed against the tumor-associated carbohydrate antigen sialyl-Tn (STn). The mAb specificity was defined by apparent K_D values determined by quantitative glycan microarray screening. Key residues in the antibody combining site were identified by site-directed mutagenesis, and the glycan-antigen contact surface was defined using saturation transfer difference NMR (STD-NMR). These features were then employed as metrics for selecting the optimal 3D-model of the antibody-glycan complex, out of thousands plausible options generated by automated docking and molecular dynamics simulation. STn-specificity was further validated by computationally screening of the selected antibody 3D-model against the human sialyl-Tn-glycome. This computational-experimental approach would allow rational design of potent antibodies targeting carbohydrates. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
12	Mapping the Ca(2+) induced structural change in calreticulin. J Proteomics 2016;142:138-48. [PMID: 27195812 DOI: 10.1016/j.jprot.2016.05.015] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2016] [Revised: 05/09/2016] [Accepted: 05/14/2016] [Indexed: 01/04/2023] Abstract UNLABELLED Calreticulin is a highly conserved multifunctional protein implicated in many different biological systems and has therefore been the subject of intensive research. It is primarily present in the endoplasmatic reticulum where its main functions are to regulate Ca(2+) homeostasis, act as a chaperone and stabilize the MHC class I peptide-loading complex. Although several high-resolution structures of calreticulin exist, these only cover three-quarters of the entire protein leaving the extended structures unsolved. Additionally, the structure of calreticulin is influenced by the presence of Ca(2+). The conformational changes induced by Ca(2+) have not been determined yet as they are hard to study with traditional approaches. Here, we investigated the Ca(2+)-induced conformational changes with a combination of chemical cross-linking, mass spectrometry, bioinformatics analysis and modelling in Rosetta. Using a bifunctional linker, we found a large Ca(2+)-induced change to the cross-linking pattern in calreticulin. Our results are consistent with a high flexibility in the P-loop, a stabilization of the acidic C-terminal and a relatively close interaction of the P-loop and the acidic C-terminal. BIOLOGICAL SIGNIFICANCE The function of calreticulin, an endoplasmatic reticulin chaperone, is affected by fluctuations in Ca(2+)concentration, but the structural mechanism is unknown. The present work suggests that Ca(2+)-dependent regulation is caused by different conformations of a long proline-rich loop that changes the accessibility to the peptide/lectin-binding site. Our results indicate that the binding of Ca(2+) to calreticulin may thus not only just be a question of Ca(2+) storage but is likely to have an impact on the chaperone activity. Collapse Key Words BS(3) d(0) BS(3) d(4) Calreticulin Chemical cross-linking Mass spectrometry MassAI software Protein structure Rosetta modelling Collapse MESH Headings Collapse Grants Collapse
13	AbDesign: An algorithm for combinatorial backbone design guided by natural conformations and sequences. Proteins 2015;83:1385-406. [PMID: 25670500 PMCID: PMC4881815 DOI: 10.1002/prot.24779] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Revised: 01/13/2015] [Accepted: 01/26/2015] [Indexed: 12/20/2022] Abstract Computational design of protein function has made substantial progress, generating new enzymes, binders, inhibitors, and nanomaterials not previously seen in nature. However, the ability to design new protein backbones for function--essential to exert control over all polypeptide degrees of freedom--remains a critical challenge. Most previous attempts to design new backbones computed the mainchain from scratch. Here, instead, we describe a combinatorial backbone and sequence optimization algorithm called AbDesign, which leverages the large number of sequences and experimentally determined molecular structures of antibodies to construct new antibody models, dock them against target surfaces and optimize their sequence and backbone conformation for high stability and binding affinity. We used the algorithm to produce antibody designs that target the same molecular surfaces as nine natural, high-affinity antibodies; in five cases interface sequence identity is above 30%, and in four of those the backbone conformation at the core of the antibody binding surface is within 1 Å root-mean square deviation from the natural antibodies. Designs recapitulate polar interaction networks observed in natural complexes, and amino acid sidechain rigidity at the designed binding surface, which is likely important for affinity and specificity, is high compared to previous design studies. In designed anti-lysozyme antibodies, complementarity-determining regions (CDRs) at the periphery of the interface, such as L1 and H2, show greater backbone conformation diversity than the CDRs at the core of the interface, and increase the binding surface area compared to the natural antibody, potentially enhancing affinity and specificity. Collapse Key Words CDRs Rosetta V(D)J recombination canonical conformations computational protein design conformation-sequence optimization fuzzy-logic design modular segments Collapse MESH Headings Algorithms Amino Acid Sequence Complementarity Determining Regions/chemistry Computational Biology/methods Fuzzy Logic Humans Molecular Sequence Data Protein Conformation Protein Engineering/methods Sequence Analysis, Protein/methods Collapse Grants 335439 European Research Council Collapse
14	Structural basis for constitutive activity and agonist-induced activation of the enteroendocrine fat sensor GPR119. Br J Pharmacol 2015;171:5774-89. [PMID: 25117266 DOI: 10.1111/bph.12877] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Revised: 08/04/2014] [Accepted: 08/06/2014] [Indexed: 12/25/2022] Open Abstract BACKGROUND AND PURPOSE GPR119 is a Gαs-coupled 7TM receptor activated by endogenous lipids such as oleoylethanolamide (OEA) and by the dietary triglyceride metabolite 2-monoacylglycerol. GPR119 stimulates enteroendocrine hormone and insulin secretion. But despite massive drug discovery efforts in the field, very little is known about the basic molecular pharmacology of GPR119. EXPERIMENTAL APPROACH GPR119 receptor signalling was studied in transfected cells. Mutational mapping (30 mutations in 23 positions) was performed on residues required for ligand-independent and agonist-induced GPR119 activation (AR231453 and OEA). Novel Rosetta-based receptor modelling was applied, using a composite template approach with segments from different X-ray structures and fully flexible ligand docking. KEY RESULTS The increased signalling induced by increasing the cell surface expression of GPR119 in the absence of agonist and the inhibitory effect of two synthetic inverse agonists demonstrated that GRP119 signals with a high degree of constitutive activity through the Gαs pathway. The mutational maps for AR231453 and OEA were very similar and, surprisingly, also similar to the mutational map for residues affecting the constitutive signalling - albeit with key differences. Surprisingly, almost all residues in extracellular loop-2b were important for the constitutive activity. The molecular modelling and docking demonstrated that AR231453 binds in a 'vertical' pocket in between mutational hits reaching from the centre of the receptor out to extracellular loop-2b. CONCLUSIONS AND IMPLICATIONS The high constitutive activity of GPR119 should be taken into account in future drug discovery efforts, which can now be guided by the detailed knowledge of the physiochemical properties of the extended ligand-binding pocket. Collapse Key Words Collapse MESH Headings Collapse Grants Collapse
15	Assessment and challenges of ligand docking into comparative models of G-protein coupled receptors. PLoS One 2013;8:e67302. [PMID: 23844000 PMCID: PMC3699586 DOI: 10.1371/journal.pone.0067302] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2013] [Accepted: 05/16/2013] [Indexed: 01/09/2023] Open Abstract The rapidly increasing number of high-resolution X-ray structures of G-protein coupled receptors (GPCRs) creates a unique opportunity to employ comparative modeling and docking to provide valuable insight into the function and ligand binding determinants of novel receptors, to assist in virtual screening and to design and optimize drug candidates. However, low sequence identity between receptors, conformational flexibility, and chemical diversity of ligands present an enormous challenge to molecular modeling approaches. It is our hypothesis that rapid Monte-Carlo sampling of protein backbone and side-chain conformational space with Rosetta can be leveraged to meet this challenge. This study performs unbiased comparative modeling and docking methodologies using 14 distinct high-resolution GPCRs and proposes knowledge-based filtering methods for improvement of sampling performance and identification of correct ligand-receptor interactions. On average, top ranked receptor models built on template structures over 50% sequence identity are within 2.9 Å of the experimental structure, with an average root mean square deviation (RMSD) of 2.2 Å for the transmembrane region and 5 Å for the second extracellular loop. Furthermore, these models are consistently correlated with low Rosetta energy score. To predict their binding modes, ligand conformers of the 14 ligands co-crystalized with the GPCRs were docked against the top ranked comparative models. In contrast to the comparative models themselves, however, it remains difficult to unambiguously identify correct binding modes by score alone. On average, sampling performance was improved by 10³ fold over random using knowledge-based and energy-based filters. In assessing the applicability of experimental constraints, we found that sampling performance is increased by one order of magnitude for every 10 residues known to contact the ligand. Additionally, in the case of DOR, knowledge of a single specific ligand-protein contact improved sampling efficiency 7 fold. These findings offer specific guidelines which may lead to increased success in determining receptor-ligand complexes. Collapse Key Words Collapse MESH Headings Amino Acid Sequence Binding Sites Databases, Protein Humans Ligands Molecular Docking Simulation Molecular Sequence Data Monte Carlo Method Protein Binding Protein Structure, Secondary Receptors, G-Protein-Coupled/chemistry Software Structural Homology, Protein Thermodynamics Collapse Grants R01 GM099842 NIGMS NIH HHS R01 GM080403 NIGMS NIH HHS R01 GM080403, NIGMS NIH HHS T32 GM007347 NIGMS NIH HHS R01 MH090192 NIMH NIH HHS Collapse
16	Molecular characterization of oxysterol binding to the Epstein-Barr virus-induced gene 2 (GPR183). J Biol Chem 2012;287:35470-35483. [PMID: 22875855 PMCID: PMC3471686 DOI: 10.1074/jbc.m112.387894] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2012] [Indexed: 11/06/2022] Open Abstract Oxysterols are oxygenated cholesterol derivates that are emerging as a physiologically important group of molecules. Although they regulate a range of cellular processes, only few oxysterol-binding effector proteins have been identified, and the knowledge of their binding mode is limited. Recently, the family of G protein-coupled seven transmembrane-spanning receptors (7TM receptors) was added to this group. Specifically, the Epstein-Barr virus-induced gene 2 (EBI2 or GPR183) was shown to be activated by several oxysterols, most potently by 7α,25-dihydroxycholesterol (7α,25-OHC). Nothing is known about the binding mode, however. Using mutational analysis, we identify here four key residues for 7α,25-OHC binding: Arg-87 in TM-II (position II:20/2.60), Tyr-112 and Tyr-116 (positions III:09/3.33 and III:13/3.37) in TM-III, and Tyr-260 in TM-VI (position VI:16/6.51). Substituting these residues with Ala and/or Phe results in a severe decrease in agonist binding and receptor activation. Docking simulations suggest that Tyr-116 interacts with the 3β-OH group in the agonist, Tyr-260 with the 7α-OH group, and Arg-87, either directly or indirectly, with the 25-OH group, although nearby residues likely also contribute. In addition, Tyr-112 is involved in 7α,25-OHC binding but via hydrophobic interactions. Finally, we show that II:20/2.60 constitutes an important residue for ligand binding in receptors carrying a positively charged residue at this position. This group is dominated by lipid- and nucleotide-activated receptors, here exemplified by the CysLTs, P2Y12, and P2Y14. In conclusion, we present the first molecular characterization of oxysterol binding to a 7TM receptor and identify position II:20/2.60 as a generally important residue for ligand binding in certain 7TM receptors. Collapse Key Words 7-helix receptor lipids membrane proteins molecular docking mutagenesis site-specific 7tm receptor cyslt receptors ebi2 p2y receptors oxysterol Collapse MESH Headings Amino Acid Substitution Catalytic Domain HEK293 Cells Humans Hydroxycholesterols/chemistry Hydroxycholesterols/metabolism Molecular Docking Simulation Mutation, Missense Protein Binding Receptors, G-Protein-Coupled/chemistry Receptors, G-Protein-Coupled/genetics Receptors, G-Protein-Coupled/metabolism Collapse Grants Collapse