Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hellinga HW. Rational protein design: combining theory and experiment. Proc Natl Acad Sci U S A 1997;94:10015-7. [PMID: 9294154 PMCID: PMC33767 DOI: 10.1073/pnas.94.19.10015] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Cited by Other Article(s)

Zhou B, Zheng L, Wu B, Tan Y, Lv O, Yi K, Fan G, Hong L. Protein Engineering with Lightweight Graph Denoising Neural Networks. J Chem Inf Model 2024;64:3650-3661. [PMID: 38630581 DOI: 10.1021/acs.jcim.4c00036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]

Mardikoraem M, Woldring D. Protein Fitness Prediction Is Impacted by the Interplay of Language Models, Ensemble Learning, and Sampling Methods. Pharmaceutics 2023;15:1337. [PMID: 37242577 PMCID: PMC10224321 DOI: 10.3390/pharmaceutics15051337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 04/19/2023] [Accepted: 04/21/2023] [Indexed: 05/28/2023] Open

Abstract

Advances in machine learning (ML) and the availability of protein sequences via high-throughput sequencing techniques have transformed the ability to design novel diagnostic and therapeutic proteins. ML allows protein engineers to capture complex trends hidden within protein sequences that would otherwise be difficult to identify in the context of the immense and rugged protein fitness landscape. Despite this potential, there persists a need for guidance during the training and evaluation of ML methods over sequencing data. Two key challenges for training discriminative models and evaluating their performance include handling severely imbalanced datasets (e.g., few high-fitness proteins among an abundance of non-functional proteins) and selecting appropriate protein sequence representations (numerical encodings). Here, we present a framework for applying ML over assay-labeled datasets to elucidate the capacity of sampling techniques and protein encoding methods to improve binding affinity and thermal stability prediction tasks. For protein sequence representations, we incorporate two widely used methods (One-Hot encoding and physicochemical encoding) and two language-based methods (next-token prediction, UniRep; masked-token prediction, ESM). Elaboration on performance is provided over protein fitness, protein size, and sampling techniques. In addition, an ensemble of protein representation methods is generated to discover the contribution of distinct representations and improve the final prediction score. We then implement multiple criteria decision analysis (MCDA; TOPSIS with entropy weighting), using multiple metrics well-suited for imbalanced data, to ensure statistical rigor in ranking our methods. Within the context of these datasets, the synthetic minority oversampling technique (SMOTE) outperformed undersampling while encoding sequences with One-Hot, UniRep, and ESM representations.
Moreover, ensemble learning increased the predictive performance of the affinity-based dataset by 4% compared to the best single-encoding candidate (F1-score = 97%), while ESM alone was rigorous enough in stability prediction (F1-score = 92%).

Seo K, Hagino K, Ichihashi N. Progresses in Cell-Free In Vitro Evolution. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2023;186:121-140. [PMID: 37306699 DOI: 10.1007/10_2023_219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

E C, Dai L, Yu J. Switching promotor recognition of phage RNA polymerase in silico along lab-directed evolution path. Biophys J 2022;121:582-595. [PMID: 35031277 PMCID: PMC8874028 DOI: 10.1016/j.bpj.2022.01.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 12/01/2021] [Accepted: 01/10/2022] [Indexed: 11/16/2022] Open

George A, Ravi R, Tiwari PB, Srivastava SR, Jain V, Mahalakshmi R. Engineering a Hyperstable Yersinia pestis Outer Membrane Protein Ail Using Thermodynamic Design. J Am Chem Soc 2022;144:1545-1555. [DOI: 10.1021/jacs.1c05964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Siedhoff NE, Illig AM, Schwaneberg U, Davari MD. PyPEF-An Integrated Framework for Data-Driven Protein Engineering. J Chem Inf Model 2021;61:3463-3476. [PMID: 34260225 DOI: 10.1021/acs.jcim.1c00099] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Abstract

Data-driven strategies are gaining increased attention in protein engineering due to recent advances in access to large experimental databanks of proteins, next-generation sequencing (NGS), high-throughput screening (HTS) methods, and the development of artificial intelligence algorithms. However, the reliable prediction of beneficial amino acid substitutions, their combination, and the effect on functional properties remain the most significant challenges in protein engineering, which is applied to develop proteins and enzymes for biocatalysis, biomedicine, and life sciences. Here, we present a general-purpose framework (PyPEF: pythonic protein engineering framework) for performing data-driven protein engineering using machine learning methods combined with techniques from signal processing and statistical physics. PyPEF guides the identification and selection of beneficial proteins of a defined sequence space by systematically or randomly exploring the fitness of variants and by sampling random evolution pathways. The performance of PyPEF was evaluated concerning its predictive accuracy and throughput on four public protein and enzyme data sets using common regression models. It was proved that the program could efficiently predict the fitness of protein sequences for different target properties (predictive models with coefficient of determination values ranging from 0.58 to 0.92). By combining machine learning and protein evolution, PyPEF enabled the screening of proteins with various functions, reaching a screening capacity of more than 500,000 protein sequence variants in the timeframe of only a few minutes on a personal computer. 
PyPEF displayed significant accuracies on four public data sets (different proteins and properties) and underlined the potential of integrating data-driven technologies for covering different philosophies by either predicting the fitness of the variants to the highest accuracy accounting for epistatic effects or capturing the general trend of introduced mutations on the fitness in directed protein evolution campaigns. In essence, PyPEF can provide a powerful solution to current sequence exploration and combinatorial problems faced in protein engineering through exhaustive in silico screening of the sequence space.

Ngo K, Bruno da Silva F, Leite VBP, Contessoto VG, Onuchic JN. Improving the Thermostability of Xylanase A from Bacillus subtilis by Combining Bioinformatics and Electrostatic Interactions Optimization. J Phys Chem B 2021;125:4359-4367. [PMID: 33887137 DOI: 10.1021/acs.jpcb.1c01253] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Pannecoucke E, Van Trimpont M, Desmet J, Pieters T, Reunes L, Demoen L, Vuylsteke M, Loverix S, Vandenbroucke K, Alard P, Henderikx P, Deroo S, Baatz F, Lorent E, Thiolloy S, Somers K, McGrath Y, Van Vlierberghe P, Lasters I, Savvides SN. Cell-penetrating Alphabody protein scaffolds for intracellular drug targeting. SCIENCE ADVANCES 2021;7:7/13/eabe1682. [PMID: 33771865 PMCID: PMC7997521 DOI: 10.1126/sciadv.abe1682] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 02/05/2021] [Indexed: 05/02/2023]

Phylogeny and Structure of Fatty Acid Photodecarboxylases and Glucose-Methanol-Choline Oxidoreductases. Catalysts 2020. [DOI: 10.3390/catal10091072] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Halder R, Jana B. Exploring the role of hydrophilic amino acids in unfolding of protein in aqueous ethanol solution. Proteins 2020;89:116-125. [PMID: 32860277 DOI: 10.1002/prot.25999] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Revised: 08/07/2020] [Accepted: 08/25/2020] [Indexed: 12/14/2022]

Zhang L, Xiao WH, Wang Y, Yao MD, Jiang GZ, Zeng BX, Zhang RS, Yuan YJ. Chassis and key enzymes engineering for monoterpenes production. Biotechnol Adv 2017;35:1022-1031. [DOI: 10.1016/j.biotechadv.2017.09.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Revised: 09/02/2017] [Accepted: 09/04/2017] [Indexed: 02/07/2023]

Application of conventional molecular dynamics simulation in evaluating the stability of apomyoglobin in urea solution. Sci Rep 2017;7:44651. [PMID: 28300210 PMCID: PMC5353640 DOI: 10.1038/srep44651] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Accepted: 02/09/2017] [Indexed: 01/02/2023] Open

Childers MC, Daggett V. Insights from molecular dynamics simulations for computational protein design. MOLECULAR SYSTEMS DESIGN & ENGINEERING 2017;2:9-33. [PMID: 28239489 PMCID: PMC5321087 DOI: 10.1039/c6me00083e] [Citation(s) in RCA: 127] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Abstract

A grand challenge in the field of structural biology is to design and engineer proteins that exhibit targeted functions. Although much success on this front has been achieved, design success rates remain low, an ever-present reminder of our limited understanding of the relationship between amino acid sequences and the structures they adopt. In addition to experimental techniques and rational design strategies, computational methods have been employed to aid in the design and engineering of proteins. Molecular dynamics (MD) is one such method that simulates the motions of proteins according to classical dynamics. Here, we review how insights into protein dynamics derived from MD simulations have influenced the design of proteins. One of the greatest strengths of MD is its capacity to reveal information beyond what is available in the static structures deposited in the Protein Data Bank. In this regard, simulations can be used to directly guide protein design by providing atomistic details of the dynamic molecular interactions contributing to protein stability and function. MD simulations can also be used as a virtual screening tool to rank, select, identify, and assess potential designs. MD is uniquely poised to inform protein design efforts where the application requires realistic models of protein dynamics and atomic-level descriptions of the relationship between dynamics and function. Here, we review cases where MD simulations were used to modulate protein stability and protein function by providing information regarding the conformation(s), conformational transitions, interactions, and dynamics that govern stability and function. In addition, we discuss cases where conformations from protein folding/unfolding simulations have been exploited for protein design, yielding novel outcomes that could not be obtained from static structures.

Heinemann J, Deng K, Shih SCC, Gao J, Adams PD, Singh AK, Northen TR. On-chip integration of droplet microfluidics and nanostructure-initiator mass spectrometry for enzyme screening. LAB ON A CHIP 2017;17:323-331. [PMID: 27957569 DOI: 10.1039/c6lc01182a] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]

Bayram Akcapinar G, Venturini A, Martelli PL, Casadio R, Sezerman UO. Modulating the thermostability of Endoglucanase I from Trichoderma reesei using computational approaches. Protein Eng Des Sel 2015;28:127-35. [DOI: 10.1093/protein/gzv012] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2014] [Accepted: 02/04/2015] [Indexed: 11/12/2022] Open

van den Berg BA, Reinders MJ, van der Laan JM, Roubos JA, de Ridder D. Protein redesign by learning from data. Protein Eng Des Sel 2014;27:281-8. [DOI: 10.1093/protein/gzu031] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open

Sijenyi F, Saro P, Ouyang Z, Damm-Ganamet K, Wood M, Jiang J, SantaLucia J. The RNA Folding Problems: Different Levels of sRNA Structure Prediction. NUCLEIC ACIDS AND MOLECULAR BIOLOGY 2012. [DOI: 10.1007/978-3-642-25740-7_6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Huggins DJ, Tidor B. Systematic placement of structural water molecules for improved scoring of protein-ligand interactions. Protein Eng Des Sel 2011;24:777-89. [PMID: 21771870 PMCID: PMC3170077 DOI: 10.1093/protein/gzr036] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2011] [Revised: 06/03/2011] [Accepted: 06/15/2011] [Indexed: 11/13/2022] Open

Samish I, MacDermaid CM, Perez-Aguilar JM, Saven JG. Theoretical and Computational Protein Design. Annu Rev Phys Chem 2011;62:129-49. [DOI: 10.1146/annurev-physchem-032210-103509] [Citation(s) in RCA: 119] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Balaraman GS, Bhattacharya S, Vaidehi N. Structural insights into conformational stability of wild-type and mutant beta1-adrenergic receptor. Biophys J 2010;99:568-77. [PMID: 20643076 DOI: 10.1016/j.bpj.2010.04.075] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2010] [Revised: 04/09/2010] [Accepted: 04/16/2010] [Indexed: 11/26/2022] Open

Abstract

Recent experiments to derive a thermally stable mutant of turkey beta-1-adrenergic receptor (beta1AR) have shown that a combination of six single point mutations resulted in a 20 °C increase in thermal stability in mutant beta1AR. Here we have used the all-atom force-field energy function to calculate a stability score to detect stabilizing point mutations in G-protein coupled receptors. The calculated stability score shows good correlation with the measured thermal stability for 76 single point mutations and 22 multiple mutants in beta1AR. We have demonstrated that conformational sampling of the receptor for various mutants improves the prediction of thermal stability by 50%. Point mutations Y227A(5.58), V230A(5.61), and F338M(7.48) in the thermally stable mutant m23-beta1AR stabilize key microdomains of the receptor in the inactive conformation. The Y227A(5.58) and V230A(5.61) mutations stabilize the ionic lock between R139(3.50) on transmembrane helix 3 and E285(6.30) on transmembrane helix 6. The mutation F338M(7.48) on TM7 alters the interaction of the conserved motif NPxxY(x)5,6F with helix 8 and hence modulates the interaction of the TM2-TM7-helix 8 microdomain. The D186-R317 salt bridge (in extracellular loops 2 and 3) is stabilized in the cyanopindolol-bound wild-type beta1AR, whereas the salt bridge between D184-R317 is preferred in the mutant m23. We propose that this could be a surrogate for a similar salt bridge found between the extracellular loop 2 and TM7 in beta2AR reported recently. We show that the binding energy difference between the inactive and active states is smaller in m23 than in the wild-type, which explains the activation of m23 at higher norepinephrine concentration compared to the wild-type. Results from this work shed light on the mechanism behind stabilizing mutations. The computational scheme proposed in this work could be used to design stabilizing mutations for other G-protein coupled receptors.

Barakat NH, Barakat NH, Love JJ. Combined use of experimental and computational screens to characterize protein stability. Protein Eng Des Sel 2010;23:799-807. [PMID: 20805093 DOI: 10.1093/protein/gzq052] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Noivirt-Brik O, Horovitz A, Unger R. Trade-off between positive and negative design of protein stability: from lattice models to real proteins. PLoS Comput Biol 2009;5:e1000592. [PMID: 20011105 PMCID: PMC2781108 DOI: 10.1371/journal.pcbi.1000592] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2009] [Accepted: 11/03/2009] [Indexed: 11/18/2022] Open

Huggins DJ, Altman MD, Tidor B. Evaluation of an inverse molecular design algorithm in a model binding site. Proteins 2009;75:168-86. [PMID: 18831031 DOI: 10.1002/prot.22226] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Noivirt-Brik O, Unger R, Horovitz A. Analysing the origin of long-range interactions in proteins using lattice models. BMC STRUCTURAL BIOLOGY 2009;9:4. [PMID: 19178726 PMCID: PMC2670300 DOI: 10.1186/1472-6807-9-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2008] [Accepted: 01/29/2009] [Indexed: 11/10/2022]

Suárez M, Tortosa P, Carrera J, Jaramillo A. Pareto optimization in computational protein design with multiple objectives. J Comput Chem 2008;29:2704-11. [DOI: 10.1002/jcc.20981] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Using a strategy based on the concept of convergent evolution to identify residue substitutions responsible for thermal adaptation. Proteins 2008;73:53-62. [DOI: 10.1002/prot.22049] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Wilson CJ, Zhan H, Swint-Kruse L, Matthews KS. Ligand interactions with lactose repressor protein and the repressor-operator complex: the effects of ionization and oligomerization on binding. Biophys Chem 2006;126:94-105. [PMID: 16860458 DOI: 10.1016/j.bpc.2006.06.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2006] [Revised: 06/09/2006] [Accepted: 06/10/2006] [Indexed: 10/24/2022]

Mattanovich D, Borth N. Applications of cell sorting in biotechnology. Microb Cell Fact 2006;5:12. [PMID: 16551353 PMCID: PMC1435767 DOI: 10.1186/1475-2859-5-12] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2005] [Accepted: 03/21/2006] [Indexed: 01/28/2023] Open

Woolfson DN. The design of coiled-coil structures and assemblies. ADVANCES IN PROTEIN CHEMISTRY 2005;70:79-112. [PMID: 15837514 DOI: 10.1016/s0065-3233(05)70004-8] [Citation(s) in RCA: 434] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]

Abstract

Protein design allows sequence-to-structure relationships in proteins to be examined and, potentially, new protein structures and functions to be made to order. To succeed, however, the protein-design process requires reliable rules that link protein sequence to structure–function. Although our present understanding of coiled-coil folding and assembly is not complete, through numerous bioinformatics and experimental studies there are now sufficient rules to allow confident design attempts of naturally observed and even novel coiled-coil motifs. This review summarizes the current design rules for coiled coils, and describes some of the key successful coiled-coil designs that have been created to date. The designs range from those for relatively straightforward, naturally observed structures (including parallel and antiparallel dimers, trimers and tetramers, all of which have been made as homomers and heteromers) to more exotic structures that expand the repertoire of Nature's coiled-coil structures. Examples in the second bracket include a probe that binds a cancer-associated coiled-coil protein; a tetramer with a right-handed supercoil; sticky-ended coiled coils that self-assemble to form fibers; coiled coils that switch conformational state; a three-component two-stranded coiled coil; and an antiparallel dimer that directs fragment complementation of larger proteins. Some of the more recent examples show an important development in the field; namely, new designs are being created with function as well as structure in mind. This will remain one of the key challenges in coiled-coil design in the next few years. Other challenges that lie ahead include the need to discover more rules for coiled-coil prediction and design, and to implement these in prediction and design algorithms. The considerable success of coiled-coil design so far bodes well for this, however. It is likely that these challenges will be met and surpassed.

Pandya MJ, Cerasoli E, Joseph A, Stoneman RG, Waite E, Woolfson DN. Sequence and Structural Duality: Designing Peptides to Adopt Two Stable Conformations. J Am Chem Soc 2004;126:17016-24. [PMID: 15612740 DOI: 10.1021/ja045568c] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Sear RP. Highly specific protein–protein interactions, evolution and negative design. Phys Biol 2004;1:166-72. [PMID: 16204836 DOI: 10.1088/1478-3967/1/3/004] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Jones DD, Barker PD. Design and Characterisation of an Artificial DNA-Binding Cytochrome. Chembiochem 2004;5:964-71. [PMID: 15239054 DOI: 10.1002/cbic.200300569] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Abstract

We aim to design novel proteins that link specific biochemical binding events, such as DNA recognition, with electron transfer functionality. We want these proteins to form the basis of new molecules that can be used for templated assembly of conducting cofactors or for thermodynamically linking DNA binding with cofactor chemistry for nanodevice applications. The first examples of our new proteins recruit the DNA-binding basic helix region of the leucine zipper protein GCN4. This basic helix region was attached to the N and C termini of cytochrome b(562) (cyt b(562)) to produce new, monomeric, multifunctional polypeptides. We have fully characterised the DNA and haem-binding properties of these proteins, which is a prerequisite for future application of the new molecules. Attachment of a single basic helix of GCN4 to either the N or C terminus of the cytochrome does not result in specific DNA binding but the presence of DNA-binding domains at both termini converts the cytochrome into a specific DNA-binding protein. Upon binding haem, this chimeric protein attains the spectral characteristics of wild-type cyt b(562). The three forms of the protein, apo, oxidised holo and reduced holo, all bind the designed (ATGAcgATGA) target DNA sequence with a dissociation constant, K(D), of approximately 90 nM. The protein has a lower affinity (K(D) ca. 370 nM) for the wild-type GCN4 recognition sequence (ATGAcTCAT). The presence of only half the consensus DNA sequence (ATGAcgGGCC) shifts the K(D) value to more than 2500 nM and the chimera does not bind specifically to DNA sequences with no target recognition sites. Ultracentrifugation revealed that the holoprotein-DNA complex is formed with a 1:1 stoichiometry, which indicates that a higher-order protein aggregate is not responsible for DNA binding. Mutagenesis of a loop linking helices 2 and 3 of the cytochrome results in a chimera with a haem-dependent DNA binding affinity. 
This is the first demonstration that binding of a haem group to a designed monomeric protein can allosterically modulate the DNA binding affinity.

Khatun J, Khare SD, Dokholyan NV. Can Contact Potentials Reliably Predict Stability of Proteins? J Mol Biol 2004;336:1223-38. [PMID: 15037081 DOI: 10.1016/j.jmb.2004.01.002] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2003] [Revised: 01/08/2004] [Accepted: 01/08/2004] [Indexed: 11/17/2022]

Doye JPK, Louis AA, Vendruscolo M. Inhibition of protein crystallization by evolutionary negative design. Phys Biol 2004;1:P9-13. [PMID: 16204814 DOI: 10.1088/1478-3967/1/1/p02] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Vinkers HM, de Jonge MR, Daeyaert FFD, Heeres J, Koymans LMH, van Lenthe JH, Lewi PJ, Timmerman H, Van Aken K, Janssen PAJ. SYNOPSIS: SYNthesize and OPtimize System in Silico. J Med Chem 2003;46:2765-73. [PMID: 12801239 DOI: 10.1021/jm030809x] [Citation(s) in RCA: 130] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Jin W, Kambara O, Sasakawa H, Tamura A, Takada S. De novo design of foldable proteins with smooth folding funnel: automated negative design and experimental verification. Structure 2003;11:581-90. [PMID: 12737823 DOI: 10.1016/s0969-2126(03)00075-3] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Turk JA, Smithrud DB. Synthesis and physical properties of protein core mimetics. J Org Chem 2001;66:8328-35. [PMID: 11735510 DOI: 10.1021/jo0106849] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Marshall SA, Mayo SL. Achieving stability and conformational specificity in designed proteins via binary patterning. J Mol Biol 2001;305:619-31. [PMID: 11152617 DOI: 10.1006/jmbi.2000.4319] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Shahrezaei V, Ejtehadi MR. Geometry selects highly designable structures. J Chem Phys 2000. [DOI: 10.1063/1.1308514] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Grell D, Richardson JS, Richardson DC, Mutter M. SymROP: ROP protein with identical helices redesigned by all-atom contact analysis and molecular dynamics. J Mol Graph Model 2000;18:290-8, 309-10. [PMID: 11021545 DOI: 10.1016/s1093-3263(00)00049-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Raha K, Wollacott AM, Italia MJ, Desjarlais JR. Prediction of amino acid sequence from structure. Protein Sci 2000;9:1106-19. [PMID: 10892804 PMCID: PMC2144664 DOI: 10.1110/ps.9.6.1106] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Street AG, Datta D, Gordon DB, Mayo SL. Designing protein beta-sheet surfaces by Z-score optimization. PHYSICAL REVIEW LETTERS 2000;84:5010-5013. [PMID: 10990854 DOI: 10.1103/physrevlett.84.5010] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/1999] [Indexed: 05/23/2023]

Hill RB, DeGrado WF. A polar, solvent-exposed residue can be essential for native protein structure. Structure 2000;8:471-9. [PMID: 10801493 PMCID: PMC3050062 DOI: 10.1016/s0969-2126(00)00130-1] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Petrosian SA, Makhatadze GI. Contribution of proton linkage to the thermodynamic stability of the major cold-shock protein of Escherichia coli CspA. Protein Sci 2000;9:387-94. [PMID: 10716191 PMCID: PMC2144560 DOI: 10.1110/ps.9.2.387] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Skalicky JJ, Gibney BR, Rabanal F, Bieber Urbauer RJ, Dutton PL, Wand AJ. Solution Structure of a Designed Four-α-Helix Bundle Maquette Scaffold. J Am Chem Soc 1999. [DOI: 10.1021/ja983309f] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Hegyi H, Gerstein M. The relationship between protein structure and function: a comprehensive survey with application to the yeast genome. J Mol Biol 1999;288:147-64. [PMID: 10329133 DOI: 10.1006/jmbi.1999.2661] [Citation(s) in RCA: 269] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Abstract

For most proteins in the genome databases, function is predicted via sequence comparison. In spite of the popularity of this approach, the extent to which it can be reliably applied is unknown. We address this issue by systematically investigating the relationship between protein function and structure. We focus initially on enzymes functionally classified by the Enzyme Commission (EC) and relate these to the structurally classified domains in the SCOP database. We find that the major SCOP fold classes have different propensities to carry out certain broad categories of functions. For instance, alpha/beta folds are disproportionately associated with enzymes, especially transferases and hydrolases, and all-alpha and small folds with non-enzymes, while alpha+beta folds have an equal tendency either way. These observations for the database overall are largely true for specific genomes. We focus, in particular, on yeast, analyzing it with many classifications in addition to SCOP and EC (i.e. COGs, CATH, MIPS), and find clear tendencies for fold-function association, across a broad spectrum of functions. Analysis with the COGs scheme also suggests that the functions of the most ancient proteins are more evenly distributed among different structural classes than those of more modern ones. For the database overall, we identify the most versatile functions, i.e. those that are associated with the most folds, and the most versatile folds, associated with the most functions. The two most versatile enzymatic functions (hydro-lyases and O-glycosyl glucosidases) are associated with seven folds each. The five most versatile folds (TIM-barrel, Rossmann, ferredoxin, alpha-beta hydrolase, and P-loop NTP hydrolase) are all mixed alpha-beta structures. They stand out as generic scaffolds, accommodating from six to as many as 16 functions (for the exceptional TIM-barrel).
At the conclusion of our analysis we are able to construct a graph giving the chance that a functional annotation can be reliably transferred at different degrees of sequence and structural similarity. Supplemental information is available from http://bioinfo.mbb.yale.edu/genome/foldfunc.

Buchler NE, Goldstein RA. Effect of alphabet size and foldability requirements on protein structure designability. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(19990101)34:1<113::aid-prot9>3.0.co;2-j] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Hellinga HW. Construction of a Blue Copper Analogue through Iterative Rational Protein Design Cycles Demonstrates Principles of Molecular Recognition in Metal Center Formation. J Am Chem Soc 1998. [DOI: 10.1021/ja980054x] [Citation(s) in RCA: 48] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

The development of new biotechnologies using metalloprotein design. Curr Opin Biotechnol 1998;9:370-376. [PMID: 9751639 DOI: 10.1016/s0958-1669(98)80010-4] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Hellinga HW, Marvin JS. Protein engineering and the development of generic biosensors. Trends Biotechnol 1998;16:183-9. [PMID: 9586241 DOI: 10.1016/s0167-7799(98)01174-3] [Citation(s) in RCA: 84] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]