1
Li J, Zhang L, Johnson-Buck A, Walter NG. Foundation model for efficient biological discovery in single-molecule data. bioRxiv 2024:2024.08.26.609721. [PMID: 39253410 PMCID: PMC11383305 DOI: 10.1101/2024.08.26.609721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/11/2024]
Abstract
Modern data-intensive techniques offer ever deeper insights into biology, but render the process of discovery increasingly complex. For example, exploiting the unique ability of single-molecule fluorescence microscopy (SMFM) [1-5] to uncover rare but critical intermediates often demands manual inspection of time traces and iterative ad hoc approaches that are difficult to systematize. To facilitate systematic and efficient discovery from SMFM data, we introduce META-SiM, a transformer-based foundation model pre-trained on diverse SMFM analysis tasks. META-SiM achieves high performance, rivaling best-in-class algorithms, on a broad range of analysis tasks including trace selection, classification, segmentation, idealization, and stepwise photobleaching analysis. Additionally, the model produces high-dimensional embedding vectors that encapsulate detailed information about each trace, which the web-based META-SiM Projector (https://www.simol-projector.org) casts into lower-dimensional space for efficient whole-dataset visualization, labeling, comparison, and sharing. Combining this Projector with the objective metric of Local Shannon Entropy enables rapid identification of condition-specific behaviors, even if rare or subtle. As a result, by applying META-SiM to an existing single-molecule Förster resonance energy transfer (smFRET) dataset [6], we discover a previously unobserved intermediate state in pre-mRNA splicing. META-SiM thus removes bottlenecks, improves objectivity, and both systematizes and accelerates biological discovery in complex single-molecule data.
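The abstract names Local Shannon Entropy but does not define it. As a rough illustration only, the sketch below assumes one plausible reading: for each embedded trace, take its k nearest neighbors in the projected space and compute the Shannon entropy of their experimental-condition labels, so that low entropy flags a neighborhood dominated by a single condition. The function name `local_shannon_entropy`, the choice of k, and the toy coordinates are all hypothetical and are not taken from META-SiM.

```python
from collections import Counter
from math import dist, log2


def local_shannon_entropy(points, labels, k=3):
    """Entropy (in bits) of condition labels among each point's k nearest neighbors.

    Assumption: this neighborhood-based definition is illustrative and may
    differ from the metric used by the META-SiM authors.
    """
    entropies = []
    for i, p in enumerate(points):
        # Rank all other points by Euclidean distance and keep the k closest.
        others = [j for j in range(len(points)) if j != i]
        neighbors = sorted(others, key=lambda j: dist(p, points[j]))[:k]
        counts = Counter(labels[j] for j in neighbors)
        n = len(neighbors)
        # H = sum over labels of p * log2(1 / p); zero when one label dominates.
        entropies.append(sum((c / n) * log2(n / c) for c in counts.values()))
    return entropies


# Toy 2-D "embedding": one cluster seen only in the control condition,
# and one cluster where control and mutant traces are mixed.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11)]
labels = ["ctrl", "ctrl", "ctrl", "ctrl",
          "ctrl", "mut", "ctrl", "mut"]

entropies = local_shannon_entropy(points, labels, k=3)
for point, label, h in zip(points, labels, entropies):
    print(point, label, round(h, 3))
```

Under this reading, regions with entropy near zero are candidates for condition-specific behavior, while mixed regions score higher.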
Affiliation(s)
- Jieming Li
- Bristol Myers Squibb, New Brunswick, NJ, USA
- Alexander Johnson-Buck
- Single Molecule Analysis Group, Department of Chemistry, The University of Michigan, Ann Arbor, MI, USA
- Nils G. Walter
- Single Molecule Analysis Group, Department of Chemistry, The University of Michigan, Ann Arbor, MI, USA
2
Guo W, Liu Y, Han Y, Tang H, Fan X, Wang C, Chen PR. Amplifiable protein identification via residue-resolved barcoding and composition code counting. Natl Sci Rev 2024; 11:nwae183. [PMID: 39055168 PMCID: PMC11272068 DOI: 10.1093/nsr/nwae183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/20/2024] [Revised: 05/06/2024] [Accepted: 05/08/2024] [Indexed: 07/27/2024] [Open Access]
Abstract
Ultrasensitive protein identification is of paramount importance in basic research and clinical diagnostics but remains extremely challenging. A key bottleneck preventing single-molecule protein sequencing is that, unlike the revolutionary nucleic acid sequencing methods that rely on the polymerase chain reaction (PCR) to amplify DNA and RNA molecules, protein molecules cannot be directly amplified. Decoding the proteins via amplification of certain fingerprints rather than the intact protein sequence thus represents an appealing alternative to address this formidable challenge. Herein, we report a proof-of-concept method that relies on residue-resolved DNA barcoding and composition code counting for amplifiable protein fingerprinting (AmproCode). In AmproCode, selected types of residues on peptides or proteins are chemically labeled with a DNA barcode, which can be amplified and quantified via quantitative PCR. The operation generates a relative ratio as the residue-resolved 'composition code' for each target protein that can be utilized as the fingerprint to determine its identity from the proteome database. We developed a database searching algorithm and applied it to assess the coverage of the whole proteome and secretome via computational simulations, proving the theoretical feasibility of AmproCode. We then designed the residue-specific DNA barcoding and amplification workflow, and identified different synthetic model peptides found in the secretome at as low as the fmol/L level for demonstration. These results build the foundation for an unprecedented amplifiable protein fingerprinting method. We believe that, in the future, AmproCode could ultimately realize single-molecule amplifiable identification of trace complex samples without further purification, and it may open a new avenue in the development of next-generation protein sequencing techniques.
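To make the 'composition code' idea concrete, the following is a minimal sketch, not the authors' database searching algorithm. It assumes the code is the relative ratio of counts of a few labeled residue types, and that identification amounts to ranking database sequences by distance to the measured ratio. The residue set (`K`, `C`, `Y`), the Euclidean distance, the function names, and the toy sequences are all hypothetical choices made for illustration.

```python
from collections import Counter
from math import sqrt

# Hypothetical set of labeled residue types; the abstract does not list them.
LABELED = ("K", "C", "Y")


def composition_code(sequence, residues=LABELED):
    """Relative ratio of the selected residue types in a sequence."""
    counts = Counter(sequence.upper())
    picked = [counts[r] for r in residues]
    total = sum(picked)
    if total == 0:
        return None  # no labelable residues, so no fingerprint
    return tuple(c / total for c in picked)


def rank_candidates(observed, database, residues=LABELED):
    """Sort database entries by Euclidean distance to the observed code."""
    scored = []
    for name, seq in database.items():
        code = composition_code(seq, residues)
        if code is None:
            continue
        distance = sqrt(sum((o - c) ** 2 for o, c in zip(observed, code)))
        scored.append((distance, name))
    return [name for _, name in sorted(scored)]


# Toy "proteome" with made-up peptide sequences.
toy_db = {
    "pepA": "KKCY",    # K:2, C:1, Y:1
    "pepB": "KCCCYY",  # K:1, C:3, Y:2
    "pepC": "AAGG",    # contains none of the labeled residues
}

# A noisy measured ratio, e.g. derived from qPCR barcode quantification.
observed = (0.48, 0.27, 0.25)
ranking = rank_candidates(observed, toy_db)
print(ranking)
```

In this toy setup the measured ratio sits closest to `pepA`, and `pepC` is excluded because it yields no code at all; a real search would also need to handle measurement noise and proteins that share the same ratio.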
Affiliation(s)
- Weiming Guo
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Yuan Liu
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Yu Han
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Huan Tang
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Xinyuan Fan
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Chu Wang
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Peng R Chen
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
3
Seabury AG, Khodabocus AJ, Kogan IM, Hoy GR, DeSalvo GA, Wustholz KL. Blinking characteristics of organic fluorophores for blink-based multiplexing. Commun Chem 2024; 7:18. [PMID: 38280979 PMCID: PMC10821931 DOI: 10.1038/s42004-024-01106-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/20/2023] [Accepted: 01/12/2024] [Indexed: 01/29/2024] [Open Access]
Abstract
Single-molecule fluorescence experiments have transformed our understanding of complex materials and biological systems. Whether single molecules are used to report on their nano-environment or provide for localization, understanding their blinking dynamics (i.e., stochastic fluctuations in emission intensity under continuous illumination) is paramount. We recently demonstrated another use for blinking dynamics called blink-based multiplexing (BBM), where individual emitters are classified using a single excitation laser based on blinking dynamics, rather than color. This study elucidates the structure-activity relationships governing BBM performance in a series of model rhodamine, BODIPY, and anthraquinone fluorophores that undergo different photophysical and photochemical processes during blinking. Change point detection and multinomial logistic regression analyses show that BBM can leverage spectral fluctuations, electron and proton transfer kinetics, as well as photostability for molecular classification, even within the context of a shared blinking mechanism. In doing so, we demonstrate two- and three-color BBM with ≥ 93% accuracy using spectrally-overlapped fluorophores.
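As a small illustration of what change point detection does to a blinking trace, the sketch below locates a single intensity change by least squares: it tries every split and keeps the one that minimizes the summed squared deviation of each side from its own mean. This is a generic textbook approach chosen for brevity, not the algorithm used in the study; the function names and the toy trace are hypothetical.

```python
def sum_squared_error(segment):
    """Squared deviation of a segment from its own mean."""
    mean = sum(segment) / len(segment)
    return sum((x - mean) ** 2 for x in segment)


def best_change_point(trace):
    """Index that best splits the trace into two constant-intensity segments."""
    best_index, best_cost = None, float("inf")
    for split in range(1, len(trace)):
        cost = sum_squared_error(trace[:split]) + sum_squared_error(trace[split:])
        if cost < best_cost:
            best_index, best_cost = split, cost
    return best_index


# Toy emission trace (arbitrary units): an emissive "on" period
# followed by a dark "off" period.
trace = [10.2, 9.8, 10.1, 9.9, 10.0, 1.1, 0.9, 1.0, 1.2, 0.8]

split = best_change_point(trace)
on_frames = split                 # frames before the change
off_frames = len(trace) - split   # frames after the change
print(split, on_frames, off_frames)
```

Quantities such as on and off durations extracted this way are the kind of per-molecule features that a classifier, for example multinomial logistic regression, could then use to separate emitters; real traces contain many change points and would need a multi-segment method.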
Affiliation(s)
- Grayson R Hoy
- Chemistry Department, William & Mary, Williamsburg, VA, USA
4
Chi W, Tan D, Qiao Q, Xu Z, Liu X. Spontaneously Blinking Rhodamine Dyes for Single-Molecule Localization Microscopy. Angew Chem Int Ed Engl 2023; 62:e202306061. [PMID: 37246144 DOI: 10.1002/anie.202306061] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Received: 04/30/2023] [Revised: 05/25/2023] [Accepted: 05/26/2023] [Indexed: 05/30/2023]
Abstract
Single-molecule localization microscopy (SMLM) has found extensive applications in various fields of biology and chemistry. As a vital component of SMLM, fluorophores play an essential role in obtaining super-resolution fluorescence images. Recent research on spontaneously blinking fluorophores has greatly simplified the experimental setups and extended the imaging duration of SMLM. To support this crucial development, this review provides a comprehensive overview of the development of spontaneously blinking rhodamines from 2014 to 2023, as well as the key mechanistic aspects of intramolecular spirocyclization reactions. We hope that by offering insightful design guidelines, this review will contribute to accelerating the advancement of super-resolution imaging technologies.
Affiliation(s)
- Weijie Chi
- Collaborative Innovation Center of One Health, School of Science, Hainan University, Renmin Road 58, Haikou, 570228, P. R. China
- Fluorescence Research Group, Singapore University of Technology and Design, 8 Somapah Road, 487372, Singapore, Singapore
- Davin Tan
- Fluorescence Research Group, Singapore University of Technology and Design, 8 Somapah Road, 487372, Singapore, Singapore
- Qinglong Qiao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 457 Zhongshan Road, Dalian, 116023, China
- Zhaochao Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, 457 Zhongshan Road, Dalian, 116023, China
- Xiaogang Liu
- Fluorescence Research Group, Singapore University of Technology and Design, 8 Somapah Road, 487372, Singapore, Singapore
5
Wei X, Penkauskas T, Reiner JE, Kennard C, Uline MJ, Wang Q, Li S, Aksimentiev A, Robertson JW, Liu C. Engineering Biological Nanopore Approaches toward Protein Sequencing. ACS Nano 2023; 17:16369-16395. [PMID: 37490313 PMCID: PMC10676712 DOI: 10.1021/acsnano.3c05628] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Indexed: 07/26/2023]
Abstract
Biotechnological innovations have vastly improved the capacity to perform large-scale protein studies, while the methods we have for identifying and quantifying individual proteins are still inadequate to perform protein sequencing at the single-molecule level. Nanopore-inspired systems devoted to understanding how single molecules behave have been extensively developed for applications in genome sequencing. These nanopore systems are emerging as prominent tools for protein identification, detection, and analysis, suggesting realistic prospects for novel protein sequencing. This review summarizes recent advances in biological nanopore sensors toward protein sequencing, from the identification of individual amino acids to the controlled translocation of peptides and proteins, with attention focused on device and algorithm development and the delineation of molecular mechanisms with the aid of simulations. Specifically, the review aims to offer recommendations for the advancement of nanopore-based protein sequencing from an engineering perspective, highlighting the need for collaborative efforts across multiple disciplines. These efforts should include chemical conjugation, protein engineering, molecular simulation, machine-learning-assisted identification, and electronic device fabrication to enable practical implementation in real-world scenarios.
Affiliation(s)
- Xiaojun Wei
- Biomedical Engineering Program, University of South Carolina, Columbia, SC 29208, United States
- Department of Chemical Engineering, University of South Carolina, Columbia, SC 29208, United States
- Tadas Penkauskas
- Biophysics and Biomedical Measurement Group, Microsystems and Nanotechnology Division, National Institute of Standards and Technology, Gaithersburg, MD 20899, United States
- School of Engineering, Brown University, Providence, RI 02912, United States
- Joseph E. Reiner
- Department of Physics, Virginia Commonwealth University, Richmond, VA 23284, United States
- Celeste Kennard
- Biomedical Engineering Program, University of South Carolina, Columbia, SC 29208, United States
- Mark J. Uline
- Biomedical Engineering Program, University of South Carolina, Columbia, SC 29208, United States
- Department of Chemical Engineering, University of South Carolina, Columbia, SC 29208, United States
- Qian Wang
- Department of Chemistry and Biochemistry, University of South Carolina, Columbia, SC 29208, United States
- Sheng Li
- School of Data Science, University of Virginia, Charlottesville, VA 22903, United States
- Aleksei Aksimentiev
- Department of Physics and Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, United States
- Joseph W.F. Robertson
- Biophysics and Biomedical Measurement Group, Microsystems and Nanotechnology Division, National Institute of Standards and Technology, Gaithersburg, MD 20899, United States
- Chang Liu
- Biomedical Engineering Program, University of South Carolina, Columbia, SC 29208, United States
- Department of Chemical Engineering, University of South Carolina, Columbia, SC 29208, United States