1
|
Dhar M, Berg MA. Efficient, nonparametric removal of noise and recovery of probability distributions from time series using nonlinear-correlation functions: Photon and photon-counting noise. J Chem Phys 2024; 161:034116. [PMID: 39028845 DOI: 10.1063/5.0212157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 06/28/2024] [Indexed: 07/21/2024] Open
Abstract
A preceding paper [M. Dhar, J. A. Dickinson, and M. A. Berg, J. Chem. Phys. 159, 054110 (2023)] shows how to remove additive noise from an experimental time series, allowing both the equilibrium distribution of the system and its Green's function to be recovered. The approach is based on nonlinear-correlation functions and is fully nonparametric: no initial model of the system or of the noise is needed. However, single-molecule spectroscopy often produces time series with either photon or photon-counting noise. Unlike additive noise, photon noise is signal-size correlated and quantized. Photon counting adds the potential for bias. This paper extends noise-corrected-correlation methods to these cases and tests them on synthetic datasets. Neither signal-size correlation nor quantization is a significant complication. Analysis of the sampling error yields guidelines for the data quality needed to recover the properties of a system with a given complexity. We show that bias in photon-counting data can be corrected, even at the high count rates needed to optimize the time resolution. Using all these results, we discuss the factors that limit the time resolution of single-molecule spectroscopy and the conditions that would be needed to push measurements into the submicrosecond region.
Collapse
Affiliation(s)
- Mainak Dhar
- Department of Chemistry and Biochemistry, University of South Carolina, Columbia, South Carolina 29208, USA
| | - Mark A Berg
- Department of Chemistry and Biochemistry, University of South Carolina, Columbia, South Carolina 29208, USA
| |
Collapse
|
2
|
Chen YT, Yang H, Chu JW. Trajectory Statistical Learning of the Potential Mean of Force and Diffusion Coefficient from Molecular Dynamics Simulations. J Phys Chem B 2024; 128:56-66. [PMID: 38165090 DOI: 10.1021/acs.jpcb.3c05245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]
Abstract
Central to studying the conformational changes of a complex protein is understanding the dynamics and energetics involved. Phenomenologically, structural dynamics can be formulated using an overdamped Langevin model along an observable, e.g., the distance between two residues in the protein. The Langevin model is specified by the deterministic force (the potential of mean force, PMF) and stochastic force (characterized by the diffusion coefficient, D). It is therefore of great interest to be able to extract both PMF and D from an observable time series but under the same computational framework. Here, we approach this challenge in molecular dynamics (MD) simulations by treating it as a missing-data Bayesian estimation problem. An important distinction in our methodology is that the entire MD trajectory, as opposed to the individual data elements, is used as the statistical variable in Bayesian imputation. This idea is implemented through an eigen-decomposition procedure for a time-symmetrized Fokker-Planck equation, followed by maximizing the likelihood for parameter estimation. The mathematical expressions for the functional derivatives used in learning PMF and D also provide new physical insights for the manner by which the information on both the deterministic and stochastic forces is encoded in the dynamics data. An all-atom MD simulation of a nontrivial biomolecule case is used to illustrate the application of this approach. We show that, interestingly, the results of trajectory statistical learning can motivate new order parameters for an improved description of the kinetic bottlenecks in conformational changes. Complementing purely data-driven or black-box methods, this work underscores the advantages of physics-based machine learning in gaining chemical insights from quantitative parameter estimation.
Collapse
Affiliation(s)
- Yi-Tsao Chen
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan, Republic of China
| | - Haw Yang
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| | - Jhih-Wei Chu
- Institute of Bioinformatics and Systems Biology, Department of Biological Science and Technology, Institute of Molecular Medicine and Bioengineering, and Center for Intelligent Drug Systems and Smart Bio-Devices (IDS2B), National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan, Republic of China
| |
Collapse
|
3
|
Gopich IV, Chung HS. Theory and Analysis of Single-Molecule FRET Experiments. Methods Mol Biol 2022; 2376:247-282. [PMID: 34845614 DOI: 10.1007/978-1-0716-1716-8_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Inter-dye distances and conformational dynamics can be studied using single-molecule FRET measurements. We consider two approaches to analyze sequences of photons with recorded photon colors and arrival times. The first approach is based on FRET efficiency histograms obtained from binned photon sequences. The experimental histograms are compared with the theoretical histograms obtained using the joint distribution of acceptor and donor photons or the Gaussian approximation. In the second approach, a photon sequence is analyzed without binning. The parameters of a model describing conformational dynamics are found by maximizing the appropriate likelihood function. The first approach is simpler, while the second one is more accurate, especially when the population of species is small and transition rates are fast. The likelihood-based analysis as well as the recoloring method has the advantage that diffusion of molecules through the laser focus can be rigorously handled.
Collapse
Affiliation(s)
- Irina V Gopich
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA.
| | - Hoi Sung Chung
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
4
|
Genkin M, Hughes O, Engel TA. Learning non-stationary Langevin dynamics from stochastic observations of latent trajectories. Nat Commun 2021; 12:5986. [PMID: 34645828 PMCID: PMC8514604 DOI: 10.1038/s41467-021-26202-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 09/22/2021] [Indexed: 11/09/2022] Open
Abstract
Many complex systems operating far from the equilibrium exhibit stochastic dynamics that can be described by a Langevin equation. Inferring Langevin equations from data can reveal how transient dynamics of such systems give rise to their function. However, dynamics are often inaccessible directly and can be only gleaned through a stochastic observation process, which makes the inference challenging. Here we present a non-parametric framework for inferring the Langevin equation, which explicitly models the stochastic observation process and non-stationary latent dynamics. The framework accounts for the non-equilibrium initial and final states of the observed system and for the possibility that the system's dynamics define the duration of observations. Omitting any of these non-stationary components results in incorrect inference, in which erroneous features arise in the dynamics due to non-stationary data distribution. We illustrate the framework using models of neural dynamics underlying decision making in the brain.
Collapse
Affiliation(s)
- Mikhail Genkin
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | | | |
Collapse
|
5
|
Genkin M, Engel TA. Moving beyond generalization to accurate interpretation of flexible models. NAT MACH INTELL 2020; 2:674-683. [DOI: 10.1038/s42256-020-00242-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
6
|
Okazaki KI, Nakamura A, Iino R. Chemical-State-Dependent Free Energy Profile from Single-Molecule Trajectories of Biomolecular Motors: Application to Processive Chitinase. J Phys Chem B 2020; 124:6475-6487. [DOI: 10.1021/acs.jpcb.0c02698] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- Kei-ichi Okazaki
- Department of Theoretical and Computational Molecular Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki, 444-8585, Japan
| | - Akihiko Nakamura
- Department of Life and Coordination-Complex Molecular Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki, 444-8787, Japan
- Department of Applied Life Sciences, Faculty of Agriculture, Shizuoka University, Shizuoka, 422-8529, Japan
| | - Ryota Iino
- Department of Life and Coordination-Complex Molecular Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki, 444-8787, Japan
| |
Collapse
|
7
|
Yin S, Tien M, Yang H. Prior-Apprised Unsupervised Learning of Subpixel Curvilinear Features in Low Signal/Noise Images. Biophys J 2020; 118:2458-2469. [PMID: 32359407 PMCID: PMC7231927 DOI: 10.1016/j.bpj.2020.04.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 03/07/2020] [Accepted: 04/09/2020] [Indexed: 11/16/2022] Open
Abstract
Many biophysical problems involve molecular and nanoscale targets moving next to a curvilinear track, e.g., a cytosolic cargo transported by motor proteins moving along a microtubule. For this type of problem, fluorescence imaging is usually the primary tool of choice. There is, however, an ∼20-fold mismatch between target-localization precision and track-imaging resolution such that questions requiring high-fidelity definition of the target's track remain inaccessible. On the other hand, if the contextual image of the tracks can be refined to a level comparable to that of the target, many intuitive yet mechanistically important issues can begin to be addressed. This work demonstrates that it is possible to statistically infer, to subpixel precision, curvilinear features in a low signal/noise image. This is achieved by a framework that consists of three stages: the Hessian-based feature enhancement, the subimage feature sampling and registration, and the statistical learning of the underlying curvilinear structure using a new, to our knowledge, method developed here for inferring the principal curves. In each stage, the descriptive prior information that the features come from curvilinear elements is explicitly taken into account. It is fully automated without user supervision, which is distinctly different from approaches that require user seeding or well-defined training data sets. Computer simulations of realistic images are used to investigate the performance of the framework and its implementation. The characterization results suggest that curvilinear features are refined to the same order of precision as that of the target and that the bootstrap confidence intervals from the analysis allow an estimate for the statistical bounds of the simulated "true" curve. Also shown are analyses of experimental images from three different microscopy modalities: two-photon laser-scanning microscopy, epifluorescence microscopy, and total internal reflection fluorescence microscopy. The practical application of this prior-apprised unsupervised learning framework as well as its potential outlook are discussed.
Collapse
Affiliation(s)
- Shuhui Yin
- Department of Chemistry, Princeton University, Princeton, New Jersey
| | - Ming Tien
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, Pennsylvania
| | - Haw Yang
- Department of Chemistry, Princeton University, Princeton, New Jersey.
| |
Collapse
|
8
|
Matsunaga Y, Sugita Y. Use of single-molecule time-series data for refining conformational dynamics in molecular simulations. Curr Opin Struct Biol 2020; 61:153-159. [DOI: 10.1016/j.sbi.2019.12.022] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 12/24/2019] [Accepted: 12/27/2019] [Indexed: 12/18/2022]
|
9
|
Tsai MY, Zheng W, Chen M, Wolynes PG. Multiple Binding Configurations of Fis Protein Pairs on DNA: Facilitated Dissociation versus Cooperative Dissociation. J Am Chem Soc 2019; 141:18113-18126. [DOI: 10.1021/jacs.9b08287] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Affiliation(s)
- Min-Yeh Tsai
- Department of Chemistry, Tamkang University, New Taipei City 25137, Taiwan (R.O.C.)
| | | | | | | |
Collapse
|
10
|
Abstract
Time series obtained from time-dependent experiments contain rich information on kinetics and dynamics of the system under investigation. This work describes an unsupervised learning framework, along with the derivation of the necessary analytical expressions, for the analysis of Gaussian-distributed time series that exhibit discrete states. After the time series has been partitioned into segments in a model-free manner using the previously developed change-point (CP) method, this protocol starts with an agglomerative hierarchical clustering algorithm to classify the detected segments into possible states. The initial state clustering is further refined using an expectation-maximization (EM) procedure, and the number of states is determined by a Bayesian information criterion (BIC). Also introduced here is an achievement scalarization function, usually seen in artificial intelligence literature, for quantitatively assessing the performance of state determination. The statistical learning framework, which is comprised of three stages, detection of signal change, clustering, and number-of-state determination, was thoroughly characterized using simulated trajectories with random intensity segments that have no underlying kinetics, and its performance was critically evaluated. The application to experimental data is also demonstrated. The results suggested that this general framework, the implementation of which is based on firm theoretical foundations and does not require the imposition of any kinetics model, is powerful in determining the number of states, the parameters contained in each state, as well as the associated statistical significance.
Collapse
Affiliation(s)
- Hao Li
- Department of Chemistry , Princeton University , Princeton , New Jersey 08544 , United States
| | - Haw Yang
- Department of Chemistry , Princeton University , Princeton , New Jersey 08544 , United States
| |
Collapse
|
11
|
Morrell TE, Rafalska-Metcalf IU, Yang H, Chu JW. Compound Molecular Logic in Accessing the Active Site of Mycobacterium tuberculosis Protein Tyrosine Phosphatase B. J Am Chem Soc 2018; 140:14747-14752. [PMID: 30301350 DOI: 10.1021/jacs.8b08070] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Protein tyrosine phosphatase B (PtpB) from Mycobacterium tuberculosis (Mtb) extends the bacteria's survival in hosts and hence is a potential target for Mtb-specific drugs. To study how Mtb-specific sequence insertions in PtpB may regulate access to its active site through large-amplitude conformational changes, we performed free-energy calculations using an all-atom explicit solvent model. Corroborated by biochemical assays, the results show that PtpB's active site is controlled via an "either/or" compound conformational gating mechanism, an unexpected discovery that Mtb has evolved to bestow a single enzyme with such intricate logical operations. In addition to providing unprecedented insights for its active-site surroundings, the findings also suggest new ways of inactivating PtpB.
Collapse
Affiliation(s)
- Thomas E Morrell
- Department of Chemistry , Princeton University , Princeton , New Jersey 08544 , United States
| | | | - Haw Yang
- Department of Chemistry , Princeton University , Princeton , New Jersey 08544 , United States
| | - Jhih-Wei Chu
- Institute of Bioinformatics and Systems Biology, Department of Biological Science and Technology, and Institute of Molecular Medicine and Bioengineering , National Chiao Tung University , Hsinchu , Taiwan 30068 , ROC
| |
Collapse
|
12
|
Matsunaga Y, Sugita Y. Linking time-series of single-molecule experiments with molecular dynamics simulations by machine learning. eLife 2018; 7:32668. [PMID: 29723137 PMCID: PMC5933924 DOI: 10.7554/elife.32668] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Accepted: 04/23/2018] [Indexed: 12/27/2022] Open
Abstract
Single-molecule experiments and molecular dynamics (MD) simulations are indispensable tools for investigating protein conformational dynamics. The former provide time-series data, such as donor-acceptor distances, whereas the latter give atomistic information, although this information is often biased by model parameters. Here, we devise a machine-learning method to combine the complementary information from the two approaches and construct a consistent model of conformational dynamics. It is applied to the folding dynamics of the formin-binding protein WW domain. MD simulations over 400 μs led to an initial Markov state model (MSM), which was then "refined" using single-molecule Förster resonance energy transfer (FRET) data through hidden Markov modeling. The refined or data-assimilated MSM reproduces the FRET data and features hairpin one in the transition-state ensemble, consistent with mutation experiments. The folding pathway in the data-assimilated MSM suggests interplay between hydrophobic contacts and turn formation. Our method provides a general framework for investigating conformational transitions in other proteins.
Collapse
Affiliation(s)
- Yasuhiro Matsunaga
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Japan.,JST PRESTO, Kawaguchi, Japan
| | - Yuji Sugita
- Computational Biophysics Research Team, RIKEN Center for Computational Science, Kobe, Japan.,Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, Wako, Japan.,Laboratory for Biomolecular Function Simulation, RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| |
Collapse
|
13
|
Abstract
In this study, we overview resonance energy transfer between molecules in the presence of plasmonic structures and derive an explicit Förster-type expression for the rate of plasmon-coupled resonance energy transfer (PC-RET). The proposed theory is general for energy transfer in the presence of materials with any space-dependent, frequency-dependent, or complex dielectric functions. Furthermore, the theory allows us to develop the concept of a generalized spectral overlap (GSO) J̃ (the integral of the molecular absorption coefficient, normalized emission spectrum, and the plasmon coupling factor) for understanding the wavelength dependence of PC-RET and to estimate the rate of PC-RET WET. Indeed, WET = (8.785 × 10-25 mol) ϕDτD-1J̃, where ϕD is donor fluorescence quantum yield and τD is the emission lifetime. Simulations of the GSO for PC-RET show that the most important spectral region for PC-RET is not necessarily near the maximum overlap of donor emission and acceptor absorption. Instead a significant plasmonic contribution can involve a different spectral region from the extinction maximum of the plasmonic structure. This study opens a promising direction for exploring exciton transport in plasmonic nanostructures, with possible applications in spectroscopy, photonics, biosensing, and energy devices.
Collapse
Affiliation(s)
- Liang-Yan Hsu
- Department of Chemistry, Northwestern University , 2145 Sheridan Road, Evanston, Illinois 60208-3113, United States
| | - Wendu Ding
- Department of Chemistry, Northwestern University , 2145 Sheridan Road, Evanston, Illinois 60208-3113, United States
| | - George C Schatz
- Department of Chemistry, Northwestern University , 2145 Sheridan Road, Evanston, Illinois 60208-3113, United States
| |
Collapse
|
14
|
Chu JW, Yang H. Identifying the structural and kinetic elements in protein large-amplitude conformational motions. INT REV PHYS CHEM 2017. [DOI: 10.1080/0144235x.2017.1283885] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
15
|
Sun X, Morrell TE, Yang H. Extraction of Protein Conformational Modes from Distance Distributions Using Structurally Imputed Bayesian Data Augmentation. J Phys Chem B 2016; 120:10469-10482. [PMID: 27642672 DOI: 10.1021/acs.jpcb.6b07767] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Protein conformational changes are known to play important roles in assorted biochemical and biological processes. Driven by thermal motions of surrounding solvent molecules, such a structural remodeling often occurs stochastically. Yet, regardless of how random the conformational reconfiguration may appear, it could in principle be described by a linear combination of a set of orthogonal modes which, in turn, are contained in the intramolecular distance distributions. The central challenge is how to obtain the distribution. This contribution proposes a Bayesian data-augmentation scheme to extract the predominant modes from only few distance distributions, be they from computational sampling or directly from experiments such as single-molecule Förster-type resonance energy transfer (smFRET). The inference of the complete protein structure from insufficient data was recognized as isomorphic to the missing-data problem in Bayesian statistical learning. Using smFRET data as an example, the missing coordinates were deduced, given protein structural constraints and multiple but limited number of smFRET distances; the Boltzmann weighing of each inferred protein structure was then evaluated using computational modeling to numerically construct the posterior density for the global protein conformation. The conformational modes were then determined from the iteratively converged overall conformational distribution using principal component analysis. Two examples were presented to illustrate these basic ideas as well as their practical implementation. The scheme described herein was based on the theory behind the powerful Tanner-Wang algorithm that guarantees convergence to the true posterior density. However, instead of assuming a mathematical model to calculate the likelihood as in conventional statistical inference, here the protein structure was treated as a statistical parameter and was imputed from the numerical likelihood function based on structural information, a probability model-free method. The framework put forth here is anticipated to be generally applicable, offering a new way to articulate protein conformational changes in a quantifiable manner.
Collapse
Affiliation(s)
- Xun Sun
- Department of Chemistry, Princeton University , Princeton, New Jersey 08544, United States
| | - Thomas E Morrell
- Department of Chemistry, Princeton University , Princeton, New Jersey 08544, United States
| | - Haw Yang
- Department of Chemistry, Princeton University , Princeton, New Jersey 08544, United States
| |
Collapse
|
16
|
Wang J, Ferguson AL. Nonlinear reconstruction of single-molecule free-energy surfaces from univariate time series. Phys Rev E 2016; 93:032412. [PMID: 27078395 DOI: 10.1103/physreve.93.032412] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2015] [Indexed: 01/27/2023]
Abstract
The stable conformations and dynamical fluctuations of polymers and macromolecules are governed by the underlying single-molecule free energy surface. By integrating ideas from dynamical systems theory with nonlinear manifold learning, we have recovered single-molecule free energy surfaces from univariate time series in a single coarse-grained system observable. Using Takens' Delay Embedding Theorem, we expand the univariate time series into a high dimensional space in which the dynamics are equivalent to those of the molecular motions in real space. We then apply the diffusion map nonlinear manifold learning algorithm to extract a low-dimensional representation of the free energy surface that is diffeomorphic to that computed from a complete knowledge of all system degrees of freedom. We validate our approach in molecular dynamics simulations of a C(24)H(50) n-alkane chain to demonstrate that the two-dimensional free energy surface extracted from the atomistic simulation trajectory is - subject to spatial and temporal symmetries - geometrically and topologically equivalent to that recovered from a knowledge of only the head-to-tail distance of the chain. Our approach lays the foundations to extract empirical single-molecule free energy surfaces directly from experimental measurements.
Collapse
Affiliation(s)
- Jiang Wang
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Andrew L Ferguson
- Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.,Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| |
Collapse
|
17
|
Chung HS, Louis JM, Gopich IV. Analysis of Fluorescence Lifetime and Energy Transfer Efficiency in Single-Molecule Photon Trajectories of Fast-Folding Proteins. J Phys Chem B 2016; 120:680-99. [PMID: 26812046 DOI: 10.1021/acs.jpcb.5b11351] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
In single-molecule Förster resonance energy transfer (FRET) spectroscopy, the dynamics of molecular processes are usually determined by analyzing the fluorescence intensity of donor and acceptor dyes. Since FRET efficiency is related to fluorescence lifetimes, additional information can be extracted by analyzing fluorescence intensity and lifetime together. For fast processes where individual states are not well separated in a trajectory, it is not easy to obtain the lifetime information. Here, we present analysis methods to utilize fluorescence lifetime information from single-molecule FRET experiments, and apply these methods to three fast-folding, two-state proteins. By constructing 2D FRET efficiency-lifetime histograms, the correlation can be visualized between the FRET efficiency and fluorescence lifetimes in the presence of the submicrosecond to millisecond dynamics. We extend the previously developed method for analyzing delay times of donor photons to include acceptor delay times. To determine the kinetics and lifetime parameters accurately, we used a maximum likelihood method. We found that acceptor blinking can lead to inaccurate parameters in the donor delay time analysis. This problem can be solved by incorporating acceptor blinking into a model. While the analysis of acceptor delay times is not affected by acceptor blinking, it is more sensitive to the shape of the delay time distribution resulting from a broad conformational distribution in the unfolded state.
Collapse
Affiliation(s)
- Hoi Sung Chung
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health , Bethesda, Maryland 20892-0520, United States
| | - John M Louis
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health , Bethesda, Maryland 20892-0520, United States
| | - Irina V Gopich
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health , Bethesda, Maryland 20892-0520, United States
| |
Collapse
|
18
|
Gopich IV. Accuracy of maximum likelihood estimates of a two-state model in single-molecule FRET. J Chem Phys 2015; 142:034110. [PMID: 25612692 DOI: 10.1063/1.4904381] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Photon sequences from single-molecule Förster resonance energy transfer (FRET) experiments can be analyzed using a maximum likelihood method. Parameters of the underlying kinetic model (FRET efficiencies of the states and transition rates between conformational states) are obtained by maximizing the appropriate likelihood function. In addition, the errors (uncertainties) of the extracted parameters can be obtained from the curvature of the likelihood function at the maximum. We study the standard deviations of the parameters of a two-state model obtained from photon sequences with recorded colors and arrival times. The standard deviations can be obtained analytically in a special case when the FRET efficiencies of the states are 0 and 1 and in the limiting cases of fast and slow conformational dynamics. These results are compared with the results of numerical simulations. The accuracy and, therefore, the ability to predict model parameters depend on how fast the transition rates are compared to the photon count rate. In the limit of slow transitions, the key parameters that determine the accuracy are the number of transitions between the states and the number of independent photon sequences. In the fast transition limit, the accuracy is determined by the small fraction of photons that are correlated with their neighbors. The relative standard deviation of the relaxation rate has a "chevron" shape as a function of the transition rate in the log-log scale. The location of the minimum of this function dramatically depends on how well the FRET efficiencies of the states are separated.
Collapse
Affiliation(s)
- Irina V Gopich
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| |
Collapse
|
19
|
Ramanathan R, Muñoz V. A Method for Extracting the Free Energy Surface and Conformational Dynamics of Fast-Folding Proteins from Single Molecule Photon Trajectories. J Phys Chem B 2015; 119:7944-56. [PMID: 25988351 PMCID: PMC4718529 DOI: 10.1021/acs.jpcb.5b03176] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 05/13/2015] [Indexed: 12/11/2022]
Abstract
Single molecule fluorescence spectroscopy holds the promise of providing direct measurements of protein folding free energy landscapes and conformational motions. However, fulfilling this promise has been prevented by technical limitations, most notably, the difficulty in analyzing the small packets of photons per millisecond that are typically recorded from individual biomolecules. Such limitation impairs the ability to accurately determine conformational distributions and resolve sub-millisecond processes. Here we develop an analytical procedure for extracting the conformational distribution and dynamics of fast-folding proteins directly from time-stamped photon arrival trajectories produced by single molecule FRET experiments. Our procedure combines the maximum likelihood analysis originally developed by Gopich and Szabo with a statistical mechanical model that describes protein folding as diffusion on a one-dimensional free energy surface. Using stochastic kinetic simulations, we thoroughly tested the performance of the method in identifying diverse fast-folding scenarios, ranging from two-state to one-state downhill folding, as a function of relevant experimental variables such as photon count rate, amount of input data, and background noise. The tests demonstrate that the analysis can accurately retrieve the original one-dimensional free energy surface and microsecond folding dynamics in spite of the sub-megahertz photon count rates and significant background noise levels of current single molecule fluorescence experiments. Therefore, our approach provides a powerful tool for the quantitative analysis of single molecule FRET experiments of fast protein folding that is also potentially extensible to the analysis of any other biomolecular process governed by sub-millisecond conformational dynamics.
Collapse
Affiliation(s)
- Ravishankar Ramanathan
- Centro
Nacional de Biotecnología, Consejo
Superior de Investigaciones Científicas, 28049 Madrid, Spain
| | - Victor Muñoz
- Centro
Nacional de Biotecnología, Consejo
Superior de Investigaciones Científicas, 28049 Madrid, Spain
- School
of Engineering, University of California
Merced, Merced, California 95343, United States
| |
Collapse
|
20
|
Abstract
Single-molecule spectroscopy is widely used to study macromolecular dynamics. Although this technique provides unique information that cannot be obtained at the ensemble level, the possibility of studying fast molecular dynamics is limited by the number of photons detected per unit time (photon count rate), which is proportional to the illumination intensity. However, simply increasing the illumination intensity often does not help because of various photophysical and photochemical problems. In this Perspective, we show how to improve the dynamic range of single-molecule fluorescence spectroscopy at a given photon count rate by considering each and every photon and using a maximum likelihood method. For a photon trajectory with recorded photon colors and inter-photon times, the parameters of a model describing molecular dynamics are obtained by maximizing the appropriate likelihood function. We discuss various likelihood functions, their applicability, and the accuracy of the extracted parameters. The maximum likelihood method has been applied to analyze the experiments on fast two-state protein folding and to measure transition path times. Utilizing other information such as fluorescence lifetimes is discussed in the framework of two-dimensional FRET efficiency-lifetime histograms.
Collapse
Affiliation(s)
- Hoi Sung Chung
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health (NIH), Bethesda, MD, 20892-0520
| | - Irina V. Gopich
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health (NIH), Bethesda, MD, 20892-0520
| |
Collapse
|
21
|
Shuang B, Cooper D, Taylor JN, Kisley L, Chen J, Wang W, Li CB, Komatsuzaki T, Landes CF. Fast Step Transition and State Identification (STaSI) for Discrete Single-Molecule Data Analysis. J Phys Chem Lett 2014; 5:3157-3161. [PMID: 25247055 PMCID: PMC4167035 DOI: 10.1021/jz501435p] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 08/28/2014] [Indexed: 05/26/2023]
Abstract
We introduce a step transition and state identification (STaSI) method for piecewise constant single-molecule data with a newly derived minimum description length equation as the objective function. We detect the step transitions using the Student's t test and group the segments into states by hierarchical clustering. The optimum number of states is determined based on the minimum description length equation. This method provides comprehensive, objective analysis of multiple traces requiring few user inputs about the underlying physical models and is faster and more precise in determining the number of states than established and cutting-edge methods for single-molecule data analysis. Perhaps most importantly, the method does not require either time-tagged photon counting or photon counting in general and thus can be applied to a broad range of experimental setups and analytes.
Collapse
Affiliation(s)
- Bo Shuang
- Department
of Chemistry, Rice University, MS 60, Houston, Texas 77251-1892, United States
| | - David Cooper
- Department
of Chemistry, Rice University, MS 60, Houston, Texas 77251-1892, United States
| | - J. Nick Taylor
- Molecule
& Life Nonlinear Sciences Laboratory, Research Institute for Electronic
Science, Hokkaido University, Sapporo 001-0020, Japan
| | - Lydia Kisley
- Department
of Chemistry, Rice University, MS 60, Houston, Texas 77251-1892, United States
| | - Jixin Chen
- Department
of Chemistry, Rice University, MS 60, Houston, Texas 77251-1892, United States
| | - Wenxiao Wang
- Department
of Electrical and Computer Engineering, Rice Quantum Institute, Rice University, MS 60, Houston, Texas 77251-1892, United States
| | - Chun Biu Li
- Molecule
& Life Nonlinear Sciences Laboratory, Research Institute for Electronic
Science, Hokkaido University, Sapporo 001-0020, Japan
| | - Tamiki Komatsuzaki
- Molecule
& Life Nonlinear Sciences Laboratory, Research Institute for Electronic
Science, Hokkaido University, Sapporo 001-0020, Japan
| | - Christy F. Landes
- Department
of Chemistry, Rice University, MS 60, Houston, Texas 77251-1892, United States
- Department
of Electrical and Computer Engineering, Rice Quantum Institute, Rice University, MS 60, Houston, Texas 77251-1892, United States
| |
Collapse
|
22
|
Haas KR, Yang H, Chu JW. Analysis of Trajectory Entropy for Continuous Stochastic Processes at Equilibrium. J Phys Chem B 2014; 118:8099-107. [DOI: 10.1021/jp501133w] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Kevin R. Haas
- Department
of Chemical and Biomolecular Engineering, University of California—Berkeley, Berkeley, California 94720, United States
| | - Haw Yang
- Department
of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| | - Jhih-Wei Chu
- Department
of Biological Science and Technology, National Chiao Tung University, Hsinchu 30068, Taiwan
- Institute
of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 30068, Taiwan
| |
Collapse
|