1
|
Zhao K, Zhao P, Wang S, Xia Y, Zhang G. FoldPAthreader: predicting protein folding pathway using a novel folding force field model derived from known protein universe. Genome Biol 2024; 25:152. [PMID: 38862984 PMCID: PMC11167914 DOI: 10.1186/s13059-024-03291-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 05/29/2024] [Indexed: 06/13/2024] Open
Abstract
Protein folding has become a tractable problem with the significant advances in deep learning-driven protein structure prediction. Here we propose FoldPAthreader, a protein folding pathway prediction method that uses a novel folding force field model by exploring the intrinsic relationship between protein evolution and folding from the known protein universe. Further, the folding force field is used to guide Monte Carlo conformational sampling, driving the protein chain fold into its native state by exploring potential intermediates. On 30 example targets, FoldPAthreader successfully predicts 70% of the proteins whose folding pathway is consistent with biological experimental data.
Collapse
Affiliation(s)
- Kailong Zhao
- College of Information Engineering, Zhejiang University of Technology, HangZhou, 310023, China
| | - Pengxin Zhao
- College of Information Engineering, Zhejiang University of Technology, HangZhou, 310023, China
| | - Suhui Wang
- College of Information Engineering, Zhejiang University of Technology, HangZhou, 310023, China
| | - Yuhao Xia
- College of Information Engineering, Zhejiang University of Technology, HangZhou, 310023, China
| | - Guijun Zhang
- College of Information Engineering, Zhejiang University of Technology, HangZhou, 310023, China.
| |
Collapse
|
2
|
Peng CX, Liang F, Xia YH, Zhao KL, Hou MH, Zhang GJ. Recent Advances and Challenges in Protein Structure Prediction. J Chem Inf Model 2024; 64:76-95. [PMID: 38109487 DOI: 10.1021/acs.jcim.3c01324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Artificial intelligence has made significant advances in the field of protein structure prediction in recent years. In particular, DeepMind's end-to-end model, AlphaFold2, has demonstrated the capability to predict three-dimensional structures of numerous unknown proteins with accuracy levels comparable to those of experimental methods. This breakthrough has opened up new possibilities for understanding protein structure and function as well as accelerating drug discovery and other applications in the field of biology and medicine. Despite the remarkable achievements of artificial intelligence in the field, there are still some challenges and limitations. In this Review, we discuss the recent progress and some of the challenges in protein structure prediction. These challenges include predicting multidomain protein structures, protein complex structures, multiple conformational states of proteins, and protein folding pathways. Furthermore, we highlight directions in which further improvements can be conducted.
Collapse
Affiliation(s)
- Chun-Xiang Peng
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Fang Liang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu-Hao Xia
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Kai-Long Zhao
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Ming-Hua Hou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
3
|
Kulkarni P, Leite VBP, Roy S, Bhattacharyya S, Mohanty A, Achuthan S, Singh D, Appadurai R, Rangarajan G, Weninger K, Orban J, Srivastava A, Jolly MK, Onuchic JN, Uversky VN, Salgia R. Intrinsically disordered proteins: Ensembles at the limits of Anfinsen's dogma. BIOPHYSICS REVIEWS 2022; 3:011306. [PMID: 38505224 PMCID: PMC10903413 DOI: 10.1063/5.0080512] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 02/17/2022] [Indexed: 03/21/2024]
Abstract
Intrinsically disordered proteins (IDPs) are proteins that lack rigid 3D structure. Hence, they are often misconceived to present a challenge to Anfinsen's dogma. However, IDPs exist as ensembles that sample a quasi-continuum of rapidly interconverting conformations and, as such, may represent proteins at the extreme limit of the Anfinsen postulate. IDPs play important biological roles and are key components of the cellular protein interaction network (PIN). Many IDPs can interconvert between disordered and ordered states as they bind to appropriate partners. Conformational dynamics of IDPs contribute to conformational noise in the cell. Thus, the dysregulation of IDPs contributes to increased noise and "promiscuous" interactions. This leads to PIN rewiring to output an appropriate response underscoring the critical role of IDPs in cellular decision making. Nonetheless, IDPs are not easily tractable experimentally. Furthermore, in the absence of a reference conformation, discerning the energy landscape representation of the weakly funneled IDPs in terms of reaction coordinates is challenging. To understand conformational dynamics in real time and decipher how IDPs recognize multiple binding partners with high specificity, several sophisticated knowledge-based and physics-based in silico sampling techniques have been developed. Here, using specific examples, we highlight recent advances in energy landscape visualization and molecular dynamics simulations to discern conformational dynamics and discuss how the conformational preferences of IDPs modulate their function, especially in phenotypic switching. Finally, we discuss recent progress in identifying small molecules targeting IDPs underscoring the potential therapeutic value of IDPs. Understanding structure and function of IDPs can not only provide new insight on cellular decision making but may also help to refine and extend Anfinsen's structure/function paradigm.
Collapse
Affiliation(s)
- Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Vitor B. P. Leite
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista (UNESP), São José do Rio Preto, São Paulo 15054-000, Brazil
| | - Susmita Roy
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, West Bengal 741246, India
| | - Supriyo Bhattacharyya
- Translational Bioinformatics, Center for Informatics, Department of Computational and Quantitative Medicine, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Srisairam Achuthan
- Center for Informatics, Division of Research Informatics, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Divyoj Singh
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Rajeswari Appadurai
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India
| | - Keith Weninger
- Department of Physics, North Carolina State University, Raleigh, North Carolina 27695, USA
| | | | - Anand Srivastava
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Mohit Kumar Jolly
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Jose N. Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005-1892, USA
| | | | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| |
Collapse
|
4
|
Kolimi N, Pabbathi A, Saikia N, Ding F, Sanabria H, Alper J. Out-of-Equilibrium Biophysical Chemistry: The Case for Multidimensional, Integrated Single-Molecule Approaches. J Phys Chem B 2021; 125:10404-10418. [PMID: 34506140 PMCID: PMC8474109 DOI: 10.1021/acs.jpcb.1c02424] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
![]()
Out-of-equilibrium
processes are ubiquitous across living organisms
and all structural hierarchies of life. At the molecular scale, out-of-equilibrium
processes (for example, enzyme catalysis, gene regulation, and motor
protein functions) cause biological macromolecules to sample an ensemble
of conformations over a wide range of time scales. Quantifying and
conceptualizing the structure–dynamics to function relationship
is challenging because continuously evolving multidimensional energy
landscapes are necessary to describe nonequilibrium biological processes
in biological macromolecules. In this perspective, we explore the
challenges associated with state-of-the-art experimental techniques
to understanding biological macromolecular function. We argue that
it is time to revisit how we probe and model functional out-of-equilibrium
biomolecular dynamics. We suggest that developing integrated single-molecule
multiparametric force–fluorescence instruments and using advanced
molecular dynamics simulations to study out-of-equilibrium biomolecules
will provide a path towards understanding the principles of and mechanisms
behind the structure–dynamics to function paradigm in biological
macromolecules.
Collapse
Affiliation(s)
- Narendar Kolimi
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, United States
| | - Ashok Pabbathi
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, United States
| | - Nabanita Saikia
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, United States
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, United States
| | - Hugo Sanabria
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, United States
| | - Joshua Alper
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, United States.,Department of Biological Sciences, Clemson University, Clemson, South Carolina 29634, United States
| |
Collapse
|
5
|
Chekmarev SF. First-passage times in protein folding: exploring the native-like states vs. overcoming the free energy barrier. Phys Chem Chem Phys 2021; 23:17856-17865. [PMID: 34378547 DOI: 10.1039/d0cp06560a] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Using a model β-hairpin protein as a representative example of simple two-state folders whose kinetics are uncomplicated by the presence of on- and off-pathway intermediates, it is studied how the search for the protein's native state among native-like states affects the folding kinetics. It is revealed that the first-passage time (FPT) distributions are essentially single-exponential not only for the times to overcome the free energy barrier between the unfolded and native-like states but also for the times to find the native state among the native-like ones. The FPT distributions of this type are observed through all studied two-state-like regimes of protein folding, varying from a regime close to two-state folding to a regime close to downhill folding. If the protein explores native-like states for a time much longer than the time to overcome the free energy barrier, which is characteristic of high temperatures, the resulting FPT distribution to reach the native state remains close to exponential but the mean FPT (MFPT) is determined not by the height of the free energy barrier but by the time to explore native-like states. In particular, the mean time to overcome the free energy barrier is in reasonable agreement with the Kramers rate formula and generally far shorter than the overall MFPT to reach the native state. The observed increase of the overall MFPT, as a result of longer exploration of native-like states, may lead to an overestimate of the height of the free energy barrier between the unfolded and folded states when it is calculated from the overall MFPT.
Collapse
|
6
|
Abstract
Chemists visualize chemical reactions as motion along one-dimensional "reaction coordinates" over free energy barriers. Various rate theories, such as transition state theory and the Kramers theory of diffusive barrier crossing, differ in their assumptions regarding the mathematical specifics of this motion. Direct experimental observation of the motion along reaction coordinates requires single-molecule experiments performed with unprecedented time resolution. Toward this goal, recent single-molecule studies achieved time resolution sufficient to catch biomolecules in the act of crossing free energy barriers as they fold, bind to their targets, or undergo other large structural changes, offering a window into the elusive reaction "mechanisms". This Perspective describes what we can learn (and what we have already learned) about barrier crossing dynamics through synergy of single-molecule experiments, theory, and molecular simulations. In particular, I will discuss how emerging experimental data can be used to answer several questions of principle. For example, is motion along the reaction coordinate diffusive, is there conformational memory, and is reduction to just one degree of freedom to represent the reaction mechanism justified? It turns out that these questions can be formulated as experimentally testable mathematical inequalities, and their application to experimental and simulated data has already led to a number of insights. I will also discuss open issues and current challenges in this fast evolving field of research.
Collapse
Affiliation(s)
- Dmitrii E Makarov
- Department of Chemistry and Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
7
|
Broad distributions of transition-path times are fingerprints of multidimensionality of the underlying free energy landscapes. Proc Natl Acad Sci U S A 2020; 117:27116-27123. [PMID: 33087575 DOI: 10.1073/pnas.2008307117] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Recent single-molecule experiments have observed transition paths, i.e., brief events where molecules (particularly biomolecules) are caught in the act of surmounting activation barriers. Such measurements offer unprecedented mechanistic insights into the dynamics of biomolecular folding and binding, molecular machines, and biological membrane channels. A key challenge to these studies is to infer the complex details of the multidimensional energy landscape traversed by the transition paths from inherently low-dimensional experimental signals. A common minimalist model attempting to do so is that of one-dimensional diffusion along a reaction coordinate, yet its validity has been called into question. Here, we show that the distribution of the transition path time, which is a common experimental observable, can be used to differentiate between the dynamics described by models of one-dimensional diffusion from the dynamics in which multidimensionality is essential. Specifically, we prove that the coefficient of variation obtained from this distribution cannot possibly exceed 1 for any one-dimensional diffusive model, no matter how rugged its underlying free energy landscape is: In other words, this distribution cannot be broader than the single-exponential one. Thus, a coefficient of variation exceeding 1 is a fingerprint of multidimensional dynamics. Analysis of transition paths in atomistic simulations of proteins shows that this coefficient often exceeds 1, signifying essential multidimensionality of those systems.
Collapse
|
8
|
Trotter D, Wallin S. Effects of Topology and Sequence in Protein Folding Linked via Conformational Fluctuations. Biophys J 2020; 118:1370-1380. [PMID: 32061276 DOI: 10.1016/j.bpj.2020.01.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Revised: 12/17/2019] [Accepted: 01/13/2020] [Indexed: 01/18/2023] Open
Abstract
Experiments have compared the folding of proteins with different amino acid sequences but the same basic structure, or fold. Results indicate that folding is robust to sequence variations for proteins with some nonlocal folds, such as all-β, whereas the folding of more local, all-α proteins typically exhibits a stronger sequence dependence. Here, we use a coarse-grained model to systematically study how variations in sequence perturb the folding energy landscapes of three model sequences with 3α, 4β + α, and β-barrel folds, respectively. These three proteins exhibit folding features in line with experiments, including expected rank order in the cooperativity of the folding transition and stability-dependent shifts in the location of the free-energy barrier to folding. Using a generalized-ensemble simulation approach, we determine the thermodynamics of around 2000 sequence variants representing all possible hydrophobic or polar single- and double-point mutations. From an analysis of the subset of stability-neutral mutations, we find that folding is perturbed in a topology-dependent manner, with the β-barrel protein being the most robust. Our analysis shows, in particular, that the magnitude of mutational perturbations of the transition state is controlled in part by the size or "width" of the underlying conformational ensemble. This result suggests that the mutational robustness of the folding of the β-barrel protein is underpinned by its conformationally restricted transition state ensemble, revealing a link between sequence and topological effects in protein folding.
Collapse
Affiliation(s)
- Daniel Trotter
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John's, Newfoundland, Canada
| | - Stefan Wallin
- Department of Physics and Physical Oceanography, Memorial University of Newfoundland, St. John's, Newfoundland, Canada.
| |
Collapse
|
9
|
Oliveira AB, Yang H, Whitford PC, Leite VBP. Distinguishing Biomolecular Pathways and Metastable States. J Chem Theory Comput 2019; 15:6482-6490. [DOI: 10.1021/acs.jctc.9b00704] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Antonio B. Oliveira
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista, São José do Rio Preto, São Paulo 15054-000, Brazil
| | - Huan Yang
- Department of Physics, Northeastern University, Boston, Massachusetts 02115, United States
| | - Paul C. Whitford
- Department of Physics, Northeastern University, Boston, Massachusetts 02115, United States
| | - Vitor B. P. Leite
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista, São José do Rio Preto, São Paulo 15054-000, Brazil
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, United States
| |
Collapse
|
10
|
Hoffer NQ, Woodside MT. Probing microscopic conformational dynamics in folding reactions by measuring transition paths. Curr Opin Chem Biol 2019; 53:68-74. [PMID: 31479831 DOI: 10.1016/j.cbpa.2019.07.006] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 07/08/2019] [Accepted: 07/20/2019] [Indexed: 12/20/2022]
Abstract
Transition paths comprise those parts of a folding trajectory where the molecule passes through the high-energy transition states separating folded and unfolded conformations. The transition states determine the folding kinetics and mechanism but are difficult to observe because of their brief duration. Single-molecule experiments have in recent years begun to characterize transition paths in folding reactions, allowing the microscopic conformational dynamics that occur as a molecule traverses the energy barriers to be probed directly. Here we review single-molecule fluorescence and force spectroscopy measurements of transition-path properties, including the time taken to traverse the paths, the local velocity along them, the path shapes, and the variability within these measurements reflecting differences between individual barrier crossings. We discuss how these measurements have been related to theories of folding as diffusion over an energy landscape to deduce properties such as the diffusion coefficient, and how they are being combined with simulations to obtain enhanced atomistic understanding of folding. The richly detailed information available from transition path measurements holds great promise for improved understanding of microscopic mechanisms in folding.
Collapse
Affiliation(s)
- Noel Q Hoffer
- Department of Physics, University of Alberta, Edmonton, AB T6G 2E1, Canada
| | - Michael T Woodside
- Department of Physics, University of Alberta, Edmonton, AB T6G 2E1, Canada.
| |
Collapse
|