1
|
Abstract
How did specific useful protein sequences arise from simpler molecules at the origin of life? This seemingly needle-in-a-haystack problem has remarkably close resemblance to the old Protein Folding Problem, for which the solution is now known from statistical physics. Based on the logic that Origins must have come only after there was an operative evolution mechanism-which selects on phenotype, not genotype-we give a perspective that proteins and their folding processes are likely to have been the primary driver of the early stages of the origin of life.
Collapse
Affiliation(s)
- Charles D. Kocher
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY11794
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY11794
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY11794
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY11794
- Department of Chemistry, Stony Brook University, Stony Brook, NY11794
| |
Collapse
|
2
|
Saha US, Vendruscolo M, Carpenter AE, Singh S, Bender A, Seal S. Step Forward Cross Validation for Bioactivity Prediction: Out of Distribution Validation in Drug Discovery. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.02.601740. [PMID: 39005404 PMCID: PMC11245006 DOI: 10.1101/2024.07.02.601740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Recent advances in machine learning methods for materials science have significantly enhanced accurate predictions of the properties of novel materials. Here, we explore whether these advances can be adapted to drug discovery by addressing the problem of prospective validation - the assessment of the performance of a method on out-of-distribution data. First, we tested whether k-fold n-step forward cross-validation could improve the accuracy of out-of-distribution small molecule bioactivity predictions. We found that it is more helpful than conventional random split cross-validation in describing the accuracy of a model in real-world drug discovery settings. We also analyzed discovery yield and novelty error, finding that these two metrics provide an understanding of the applicability domain of models and an assessment of their ability to predict molecules with desirable bioactivity compared to other small molecules. Based on these results, we recommend incorporating a k-fold n-step forward cross-validation and these metrics when building state-of-the-art models for bioactivity prediction in drug discovery.
Collapse
Affiliation(s)
| | | | | | | | - Andreas Bender
- Department of Chemistry, University of Cambridge, UK
- STAR-UBB Institute, Babeş-Bolyai University, Cluj-Napoca, Romania
| | - Srijit Seal
- Department of Chemistry, University of Cambridge, UK
- Broad Institute of MIT and Harvard, Cambridge, MA, US
| |
Collapse
|
3
|
Fersht AR. From covalent transition states in chemistry to noncovalent in biology: from β- to Φ-value analysis of protein folding. Q Rev Biophys 2024; 57:e4. [PMID: 38597675 DOI: 10.1017/s0033583523000045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/11/2024]
Abstract
Solving the mechanism of a chemical reaction requires determining the structures of all the ground states on the pathway and the elusive transition states linking them. 2024 is the centenary of Brønsted's landmark paper that introduced the β-value and structure-activity studies as the only experimental means to infer the structures of transition states. It involves making systematic small changes in the covalent structure of the reactants and analysing changes in activation and equilibrium-free energies. Protein engineering was introduced for an analogous procedure, Φ-value analysis, to analyse the noncovalent interactions in proteins central to biological chemistry. The methodology was developed first by analysing noncovalent interactions in transition states in enzyme catalysis. The mature procedure was then applied to study transition states in the pathway of protein folding - 'part (b) of the protein folding problem'. This review describes the development of Φ-value analysis of transition states and compares and contrasts the interpretation of β- and Φ-values and their limitations. Φ-analysis afforded the first description of transition states in protein folding at the level of individual residues. It revealed the nucleation-condensation folding mechanism of protein domains with the transition state as an expanded, distorted native structure, containing little fully formed secondary structure but many weak tertiary interactions. A spectrum of transition states with various degrees of structural polarisation was then uncovered that spanned from nucleation-condensation to the framework mechanism of fully formed secondary structure. Φ-analysis revealed how movement of the expanded transition state on an energy landscape accommodates the transition from framework to nucleation-condensation mechanisms with a malleability of structure as a unifying feature of folding mechanisms. Such movement follows the rubric of analysis of classical covalent chemical mechanisms that began with Brønsted. Φ-values are used to benchmark computer simulation, and Φ and simulation combine to describe folding pathways at atomic resolution.
Collapse
Affiliation(s)
- Alan R Fersht
- MRC Laboratory of Molecular Biology, Cambridge, UK
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
- Gonville and Caius College, University of Cambridge, Cambridge, UK
| |
Collapse
|
4
|
Sun Y, Zhang F, Ouyang Q, Luo C. The dynamic-process characterization and prediction of synthetic gene circuits by dynamic delay model. iScience 2024; 27:109142. [PMID: 38384832 PMCID: PMC10879701 DOI: 10.1016/j.isci.2024.109142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 01/17/2024] [Accepted: 02/01/2024] [Indexed: 02/23/2024] Open
Abstract
Differential equation models are widely used to describe genetic regulations, predict multicomponent regulatory circuits, and provide quantitative insights. However, it is still challenging to quantitatively link the dynamic behaviors with measured parameters in synthetic circuits. Here, we propose a dynamic delay model (DDM) which includes two simple parts: the dynamic determining part and the doses-related steady-state-determining part. The dynamic determining part is usually supposed as the delay time but without a clear formula. For the first time, we give the detail formula of the dynamic determining function and provide a method for measuring all parameters of synthetic elements (include 8 activators and 5 repressors) by microfluidic system. Three synthetic circuits were built to show that the DDM can notably improve the prediction accuracy and can be used in various synthetic biology applications.
Collapse
Affiliation(s)
- Yanhong Sun
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing 100871, China
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Fengyu Zhang
- Wenzhou Institute University of Chinese Academy of Sciences, Wenzhou, Zhejiang 325001, China
| | - Qi Ouyang
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing 100871, China
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Chunxiong Luo
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing 100871, China
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Wenzhou Institute University of Chinese Academy of Sciences, Wenzhou, Zhejiang 325001, China
| |
Collapse
|
5
|
Jahn R, Cafiso DC, Tamm LK. Mechanisms of SNARE proteins in membrane fusion. Nat Rev Mol Cell Biol 2024; 25:101-118. [PMID: 37848589 DOI: 10.1038/s41580-023-00668-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/06/2023] [Indexed: 10/19/2023]
Abstract
Soluble N-ethylmaleimide-sensitive factor attachment protein receptors (SNAREs) are a family of small conserved eukaryotic proteins that mediate membrane fusion between organelles and with the plasma membrane. SNAREs are directly or indirectly anchored to membranes. Prior to fusion, complementary SNAREs assemble between membranes with the aid of accessory proteins that provide a scaffold to initiate SNARE zippering, pulling the membranes together and mediating fusion. Recent advances have enabled the construction of detailed models describing bilayer transitions and energy barriers along the fusion pathway and have elucidated the structures of SNAREs complexed in various states with regulatory proteins. In this Review, we discuss how these advances are yielding an increasingly detailed picture of the SNARE-mediated fusion pathway, leading from first contact between the membranes via metastable non-bilayer intermediates towards the opening and expansion of a fusion pore. We describe how SNARE proteins assemble into complexes, how this assembly is regulated by accessory proteins and how SNARE complexes overcome the free energy barriers that prevent spontaneous membrane fusion.
Collapse
Affiliation(s)
- Reinhard Jahn
- Laboratory of Neurobiology, Max-Planck Institute for Multidisciplinary Sciences, Göttingen, Germany.
| | - David C Cafiso
- Department of Chemistry, University of Virginia, Charlottesville, VA, USA
| | - Lukas K Tamm
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
6
|
Turina P, Fariselli P, Capriotti E. K-Pro: Kinetics Data on Proteins and Mutants. J Mol Biol 2023; 435:168245. [PMID: 37625584 DOI: 10.1016/j.jmb.2023.168245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 08/16/2023] [Accepted: 08/17/2023] [Indexed: 08/27/2023]
Abstract
The study of protein folding plays a crucial role in improving our understanding of protein function and of the relationship between genetics and phenotypes. In particular, understanding the thermodynamics and kinetics of the folding process is important for uncovering the mechanisms behind human disorders caused by protein misfolding. To address this issue, it is essential to collect and curate experimental kinetic and thermodynamic data on protein folding. K-Pro is a new database designed for collecting and storing experimental kinetic data on monomeric proteins, with a two-state folding mechanism. With 1,529 records from 62 proteins corresponding to 65 structures, K-Pro contains various kinetic parameters such as the logarithm of the folding and unfolding rates, Tanford's β and the ϕ values. When available, the database also includes thermodynamic parameters associated with the kinetic data. K-Pro features a user-friendly interface that allows browsing and downloading kinetic data of interest. The graphical interface provides a visual representation of the protein and mutants, and it is cross-linked to key databases such as PDB, UniProt, and PubMed. K-Pro is open and freely accessible through https://folding.biofold.org/k-pro and supports the latest versions of popular browsers.
Collapse
Affiliation(s)
- Paola Turina
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via F. Selmi 3, 40126 Bologna, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Via Santena 19, 10126 Torino, Italy
| | - Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via F. Selmi 3, 40126 Bologna, Italy.
| |
Collapse
|
7
|
Oda M. Analysis of the Structural Dynamics of Proteins in the Ligand-Unbound and -Bound States by Diffracted X-ray Tracking. Int J Mol Sci 2023; 24:13717. [PMID: 37762021 PMCID: PMC10531450 DOI: 10.3390/ijms241813717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 09/03/2023] [Accepted: 09/05/2023] [Indexed: 09/29/2023] Open
Abstract
Although many protein structures have been determined at atomic resolution, the majority of them are static and represent only the most stable or averaged structures in solution. When a protein binds to its ligand, it usually undergoes fluctuation and changes its conformation. One attractive method for obtaining an accurate view of proteins in solution, which is required for applications such as the rational design of proteins and structure-based drug design, is diffracted X-ray tracking (DXT). DXT can detect the protein structural dynamics on a timeline via gold nanocrystals attached to the protein. Here, the structure dynamics of single-chain Fv antibodies, helix bundle-forming de novo designed proteins, and DNA-binding proteins in both ligand-unbound and ligand-bound states were analyzed using the DXT method. The resultant mean square angular displacements (MSD) curves in both the tilting and twisting directions clearly demonstrated that structural fluctuations were suppressed upon ligand binding, and the binding energies determined using the angular diffusion coefficients from the MSD agreed well with the binding thermodynamics determined using isothermal titration calorimetry. In addition, the size of gold nanocrystals is discussed, which is one of the technical concerns of DXT.
Collapse
Affiliation(s)
- Masayuki Oda
- Graduate School of Life and Environmental Sciences, Kyoto Prefectural University, 1-5 Hangi-cho, Shimogamo, Sakyo-ku, Kyoto 606-8522, Japan
| |
Collapse
|
8
|
Vila JA. Protein folding rate evolution upon mutations. Biophys Rev 2023; 15:661-669. [PMID: 37681091 PMCID: PMC10480377 DOI: 10.1007/s12551-023-01088-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 06/24/2023] [Indexed: 09/09/2023] Open
Abstract
Despite the spectacular success of cutting-edge protein fold prediction methods, many critical questions remain unanswered, including why proteins can reach their native state in a biologically reasonable time. A satisfactory answer to this simple question could shed light on the slowest folding rate of proteins as well as how mutations-amino-acid substitutions and/or post-translational modifications-might affect it. Preliminary results indicate that (i) Anfinsen's dogma validity ensures that proteins reach their native state on a reasonable timescale regardless of their sequence or length, and (ii) it is feasible to determine the evolution of protein folding rates without accounting for epistasis effects or the mutational trajectories between the starting and target sequences. These results have direct implications for evolutionary biology because they lay the groundwork for a better understanding of why, and to what extent, mutations-a crucial element of evolution and a factor influencing it-affect protein evolvability. Furthermore, they may spur significant progress in our efforts to solve crucial structural biology problems, such as how a sequence encodes its folding.
Collapse
Affiliation(s)
- Jorge A. Vila
- IMASL-CONICET, Universidad Nacional de San Luis, Ejército de Los Andes 950, 5700 San Luis, Argentina
| |
Collapse
|
9
|
Sahakyan H, Nazaryan K, Mushegian A, Sorokina I. A Study of a Protein-Folding Machine: Transient Rotation of the Polypeptide Backbone Facilitates Rapid Folding of Protein Domains in All-Atom Molecular Dynamics Simulations. Int J Mol Sci 2023; 24:10049. [PMID: 37373197 DOI: 10.3390/ijms241210049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/07/2023] [Accepted: 06/09/2023] [Indexed: 06/29/2023] Open
Abstract
Molecular dynamics simulations of protein folding typically consider the polypeptide chain at equilibrium and in isolation from the cellular components. We argue that in order to understand protein folding as it occurs in vivo, it should be modeled as an active, energy-dependent process, in which the cellular protein-folding machine directly manipulates the polypeptide. We conducted all-atom molecular dynamics simulations of four protein domains, whose folding from the extended state was augmented by the application of rotational force to the C-terminal amino acid, while the movement of the N-terminal amino acid was restrained. We have shown earlier that such a simple manipulation of peptide backbone facilitated the formation of native structures in diverse α-helical peptides. In this study, the simulation protocol was modified, to apply the backbone rotation and movement restriction only for a short time at the start of simulation. This transient application of a mechanical force to the peptide is sufficient to accelerate, by at least an order of magnitude, the folding of four protein domains from different structural classes to their native or native-like conformations. Our in silico experiments show that a compact stable fold may be attained more readily when the motions of the polypeptide are biased by external forces and constraints.
Collapse
Affiliation(s)
- Harutyun Sahakyan
- Institute of Molecular Biology, Academy of Sciences of Republic of Armenia, Yerevan 0014, Armenia
| | - Karen Nazaryan
- Institute of Molecular Biology, Academy of Sciences of Republic of Armenia, Yerevan 0014, Armenia
| | - Arcady Mushegian
- Division of Molecular and Cellular Biosciences, National Science Foundation, Alexandria, VA 22314, USA
| | | |
Collapse
|
10
|
Finkelstein AV, Bogatyreva NS, Ivankov DN, Garbuzynskiy SO. Clarification to "Protein folding problem: enigma, paradox, solution". Biophys Rev 2023; 15:161. [PMID: 37124919 PMCID: PMC10133415 DOI: 10.1007/s12551-023-01058-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/07/2023] Open
Affiliation(s)
- Alexei V. Finkelstein
- Institute of Protein Research of the Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
- Biotechnology Department of the Lomonosov Moscow State University, 142290 Pushchino, Moscow Region, Russia
- Biology Department of the Lomonosov Moscow State University, 1-12 Leninskie Gory, 119991 Moscow, Russia
| | - Natalya S. Bogatyreva
- Institute of Protein Research of the Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
| | - Dmitry N. Ivankov
- Center of Life Sciences, Skolkovo Institute of Science and Technology, 121205 Moscow, Russia
| | - Sergiy O. Garbuzynskiy
- Institute of Protein Research of the Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
| |
Collapse
|
11
|
Abstract
This Editorial (Vol. 15 Issue 2-Regular Issue) first announces the winner of the 2023 Michèle Auger Award for Young Scientists' Independent Research before then going on to describe the contents of the current Issue. The Editorial closes with a discussion of the pros and cons of writing in the formulation of scientific ideas.
Collapse
Affiliation(s)
- Damien Hall
- WPI Nano Life Science Institute, Kanazawa University, Kakumamachi, Kanazawa, Ishikawa 920-1164 Japan
| |
Collapse
|
12
|
Komar AA. Molecular Peptide Grafting as a Tool to Create Novel Protein Therapeutics. Molecules 2023; 28:2383. [PMID: 36903628 PMCID: PMC10005171 DOI: 10.3390/molecules28052383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 02/26/2023] [Accepted: 03/03/2023] [Indexed: 03/08/2023] Open
Abstract
The study of peptides (synthetic or corresponding to discrete regions of proteins) has facilitated the understanding of protein structure-activity relationships. Short peptides can also be used as powerful therapeutic agents. However, the functional activity of many short peptides is usually substantially lower than that of their parental proteins. This is (as a rule) due to their diminished structural organization, stability, and solubility often leading to an enhanced propensity for aggregation. Several approaches have emerged to overcome these limitations, which are aimed at imposing structural constraints into the backbone and/or sidechains of the therapeutic peptides (such as molecular stapling, peptide backbone circularization and molecular grafting), therefore enforcing their biologically active conformation and thus improving their solubility, stability, and functional activity. This review provides a short summary of approaches aimed at enhancing the biological activity of short functional peptides with a particular focus on the peptide grafting approach, whereby a functional peptide is inserted into a scaffold molecule. Intra-backbone insertions of short therapeutic peptides into scaffold proteins have been shown to enhance their activity and render them a more stable and biologically active conformation.
Collapse
Affiliation(s)
- Anton A. Komar
- Center for Gene Regulation in Health and Disease, Department of Biological, Geological and Environmental Sciences, Cleveland State University, 2121 Euclid Avenue, Cleveland, OH 44115, USA; ; Tel.: +1-216-687-2516
- Department of Biochemistry and Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| |
Collapse
|
13
|
Hall D. Biophysical Reviews: Turning the page from 2022 to 2023. Biophys Rev 2023; 15:1-11. [PMID: 36909962 PMCID: PMC9995637 DOI: 10.1007/s12551-023-01049-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/27/2023] [Indexed: 02/25/2023] Open
Abstract
This Editorial (vol. 15 issue 1-Regular Issue featuring an Issue Focus on the "100th Anniversary of Har Gobind Khorana") first describes the issue contents before providing both, a look back at some journal highlights from 2022, and a look forward to what we can expect from 2023. The Editorial closes with a roundup of new journal access features and an acknowledgement of those supporting the journal.
Collapse
Affiliation(s)
- Damien Hall
- WPI Nano Life Science Institute, Kanazawa University, Kakumamachi, Kanazawa, Ishikawa 920-1164 Japan
- Department of Applied Physics, Aalto University, 00076 Aalto, Finland
| |
Collapse
|
14
|
Hall D, Basu G, Ito N. Computational biophysics and structural biology of proteins-a Special Issue in honor of Prof. Haruki Nakamura's 70th birthday. Biophys Rev 2022; 14:1211-1222. [PMID: 36620377 PMCID: PMC9809522 DOI: 10.1007/s12551-022-01039-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/08/2022] [Indexed: 01/05/2023] Open
Abstract
Receiving his initial training jointly in theoretical and applied physics at the University of Tokyo, Professor Haruki Nakamura has had a long and eventful scientific career, along the way helping to shape the way that biophysics is carried out in Japan. Concentrating his research efforts on the simulation of protein structure and function, he has, over his career arc, acted as director of the Institute for Protein Research (Osaka, Japan), director of the Protein Data Bank of Japan (PDBj), president of the Biophysical Society of Japan (BSJ), president of the Protein Science Society of Japan (PSSJ), and group leader and professor of Bioinformatics and Computational Structural Biology at Osaka University. In 2022, Prof. Haruki Nakamura turned 70 years old, and to mark this occasion, his scientific colleagues from around the world have combined their efforts to produce this Festschrift Issue of the IUPAB Biophysical Reviews journal around the theme of the computational biophysics and structural biology of proteins.
Collapse
Affiliation(s)
- Damien Hall
- WPI Nano Life Science Institute, Kanazawa University, Kakumamachi, Kanazawa, Ishikawa 920-1164 Japan
- Department of Applied Physics, Aalto University, 00076 Aalto, Finland
| | - Gautam Basu
- Department of Biophysics, Bose Institute, Centenary Campus, P-1/12 C.I.T. Scheme VII-M, Kolkata, 700054 India
| | - Nobutoshi Ito
- Medical Research Institute, Tokyo Medical and Dental University (TMDU), Yushima, Bunkyo-Ku, Tokyo, 113-8510 Japan
| |
Collapse
|