1
|
Fischetti G, Schmid N, Bruderer S, Heitmann B, Henrici A, Scarso A, Caldarelli G, Wilhelm D. A deep learning framework for multiplet splitting classification in 1H NMR. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2025; 373:107851. [PMID: 39978294 DOI: 10.1016/j.jmr.2025.107851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2024] [Revised: 02/06/2025] [Accepted: 02/07/2025] [Indexed: 02/22/2025]
Abstract
One-dimensional 1H Nuclear Magnetic Resonance (NMR) stands out as the quickest and simplest among various NMR experimental setups. Unfortunately, it suffers from lengthy annotation times and does not always have a clear and unique interpretation. From NMR discovery, efforts have been dedicated to introducing an automated approach to streamline the characterization of chemical compounds while ensuring consistency of the results across the scientific community. Nonetheless, this remains an ongoing challenge that has garnered renewed interest with the emergence of deep learning techniques. Here, we present MuSe Net, a novel supervised probabilistic deep learning framework that can emulate the tasks performed by an expert spectroscopist in annotating one-dimensional NMR spectra generated by small molecules. Considering only the spectrum, MuSe Net detects and classifies multiplets with up to four coupling constants for their splitting phenotype, providing a segmentation of the spectral range. We exploit uncertainty quantification to produce a confidence score to both assess classification reliability and to detect signals that do not fit into any other phenotype class. The results of the evaluation against 48 experimental 1H NMR spectra of small molecules annotated by experts demonstrate that MuSe Net can deal with anomalies and unclear signals while correctly classifying multiplets and detecting overlapping peaks.
Collapse
Affiliation(s)
- Giulia Fischetti
- School of Engineering, Zurich University of Applied Sciences (ZHAW), Technikumstrasse 9, Winterthur, 8401, Zurich, Switzerland; Dipartimento di Scienze Molecolari e Nanosistemi, Ca' Foscari Università Venezia, Via Torino 155, Mestre, 30172, Italy.
| | - Nicolas Schmid
- School of Engineering, Zurich University of Applied Sciences (ZHAW), Technikumstrasse 9, Winterthur, 8401, Zurich, Switzerland; EcoVision Lab, Department of Mathematical Modeling and Machine Learning (DM3L), University of Zurich (UZH), Winterthurerstrasse 190, Zurich, 8057, Zurich, Switzerland
| | - Simon Bruderer
- Bruker Switzerland AG, Industriestrasse 26, Fällanden, 8117, Zurich, Switzerland
| | - Björn Heitmann
- Bruker Switzerland AG, Industriestrasse 26, Fällanden, 8117, Zurich, Switzerland
| | - Andreas Henrici
- School of Engineering, Zurich University of Applied Sciences (ZHAW), Technikumstrasse 9, Winterthur, 8401, Zurich, Switzerland
| | - Alessandro Scarso
- Dipartimento di Scienze Molecolari e Nanosistemi, Ca' Foscari Università Venezia, Via Torino 155, Mestre, 30172, Italy
| | - Guido Caldarelli
- Dipartimento di Scienze Molecolari e Nanosistemi, Ca' Foscari Università Venezia, Via Torino 155, Mestre, 30172, Italy; Istituto dei Sistemi Complessi (ISC), Consiglio Nazionale delle Ricerche (CNR), Via dei Taurini 19, Roma, 00185, Italy; European Center for Living Technology (ECLT), Sestiere Dorsoduro 3911, Venezia, 30123, Italy; London Institute for Mathematical Science, Royal Institution, 21 Albemarle St, London, United Kingdom.
| | - Dirk Wilhelm
- School of Engineering, Zurich University of Applied Sciences (ZHAW), Technikumstrasse 9, Winterthur, 8401, Zurich, Switzerland.
| |
Collapse
|
2
|
Fischetti G, Schmid N, Bruderer S, Caldarelli G, Scarso A, Henrici A, Wilhelm D. Automatic classification of signal regions in 1H Nuclear Magnetic Resonance spectra. Front Artif Intell 2023; 5:1116416. [PMID: 36714208 PMCID: PMC9874632 DOI: 10.3389/frai.2022.1116416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 12/22/2022] [Indexed: 01/12/2023] Open
Abstract
The identification and characterization of signal regions in Nuclear Magnetic Resonance (NMR) spectra is a challenging but crucial phase in the analysis and determination of complex chemical compounds. Here, we present a novel supervised deep learning approach to perform automatic detection and classification of multiplets in 1H NMR spectra. Our deep neural network was trained on a large number of synthetic spectra, with complete control over the features represented in the samples. We show that our model can detect signal regions effectively and minimize classification errors between different types of resonance patterns. We demonstrate that the network generalizes remarkably well on real experimental 1H NMR spectra.
Collapse
Affiliation(s)
- Giulia Fischetti
- Dipartimento di Scienze Molecolari e Nanosistemi, Ca' Foscari Università di Venezia, Venice, Italy
| | - Nicolas Schmid
- Zürcher Hochschule für Angewandte Wissenschaften (ZHAW), Zurich, Switzerland
- Institute for Computational Science, Universität Zürich (UZH), Zurich, Switzerland
| | | | - Guido Caldarelli
- Dipartimento di Scienze Molecolari e Nanosistemi, Ca' Foscari Università di Venezia, Venice, Italy
| | - Alessandro Scarso
- Dipartimento di Scienze Molecolari e Nanosistemi, Ca' Foscari Università di Venezia, Venice, Italy
| | - Andreas Henrici
- Zürcher Hochschule für Angewandte Wissenschaften (ZHAW), Zurich, Switzerland
| | - Dirk Wilhelm
- Zürcher Hochschule für Angewandte Wissenschaften (ZHAW), Zurich, Switzerland
| |
Collapse
|
3
|
Lane D, Soong R, Bermel W, Ning P, Dutta Majumdar R, Tabatabaei-Anaraki M, Heumann H, Gundy M, Bönisch H, Liaghati Mobarhan Y, Simpson MJ, Simpson AJ. Selective Amino Acid-Only in Vivo NMR: A Powerful Tool To Follow Stress Processes. ACS OMEGA 2019; 4:9017-9028. [PMID: 31459990 PMCID: PMC6648361 DOI: 10.1021/acsomega.9b00931] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Accepted: 05/09/2019] [Indexed: 05/24/2023]
Abstract
In vivo NMR of small 13C-enriched aquatic organisms is developing as a powerful tool to detect and explain toxic stress at the biochemical level. Amino acids are a very important category of metabolites for stress detection as they are involved in the vast majority of stress response pathways. As such, they are a useful proxy for stress detection in general, which could then be a trigger for more in-depth analysis of the metabolome. 1H-13C heteronuclear single quantum coherence (HSQC) is commonly used to provide additional spectral dispersion in vivo and permit metabolite assignment. While some amino acids can be assigned from HSQC, spectral overlap makes monitoring them in vivo challenging. Here, an experiment typically used to study protein structures is adapted for the selective detection of amino acids inside living Daphnia magna (water fleas). All 20 common amino acids can be selectively detected in both extracts and in vivo. By monitoring bisphenol-A exposure, the in vivo amino acid-only approach identified larger fluxes in a greater number of amino acids when compared to published works using extracts from whole organism homogenates. This suggests that amino acid-only NMR of living organisms may be a very sensitive tool in the detection of stress in vivo and is highly complementary to more traditional metabolomics-based methods. The ability of selective NMR experiments to help researchers to "look inside" living organisms and only detect specific molecules of interest is quite profound and paves the way for the future development of additional targeted experiments for in vivo research and monitoring.
Collapse
Affiliation(s)
- Daniel Lane
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
| | - Ronald Soong
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
| | - Wolfgang Bermel
- Bruker
BioSpin GmbH, Silberstreifen 4, Rheinstetten, Germany
| | - Paris Ning
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
| | - Rudraksha Dutta Majumdar
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
- Bruker
Canada Ltd, 2800 High
Point Drive, Milton, Ontario, Canada L9T 6P4
| | - Maryam Tabatabaei-Anaraki
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
| | | | | | | | - Yalda Liaghati Mobarhan
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
| | - Myrna J. Simpson
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
| | - André J. Simpson
- Environmental
NMR Centre, Department of Physical and Environmental Science, University of Toronto, 1265 Military Trail, Toronto, ON, Canada M1C 1A4
| |
Collapse
|
4
|
Fenwick M, Weatherby G, Vyas J, Sesanker C, Martyn TO, Ellis HJ, Gryk MR. CONNJUR Workflow Builder: a software integration environment for spectral reconstruction. JOURNAL OF BIOMOLECULAR NMR 2015; 62:313-326. [PMID: 26066803 PMCID: PMC4864993 DOI: 10.1007/s10858-015-9946-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 05/09/2015] [Indexed: 06/04/2023]
Abstract
CONNJUR Workflow Builder (WB) is an open-source software integration environment that leverages existing spectral reconstruction tools to create a synergistic, coherent platform for converting biomolecular NMR data from the time domain to the frequency domain. WB provides data integration of primary data and metadata using a relational database, and includes a library of pre-built workflows for processing time domain data. WB simplifies maximum entropy reconstruction, facilitating the processing of non-uniformly sampled time domain data. As will be shown in the paper, the unique features of WB provide it with novel abilities to enhance the quality, accuracy, and fidelity of the spectral reconstruction process. WB also provides features which promote collaboration, education, parameterization, and non-uniform data sets along with processing integrated with the Rowland NMR Toolkit (RNMRTK) and NMRPipe software packages. WB is available free of charge in perpetuity, dual-licensed under the MIT and GPL open source licenses.
Collapse
Affiliation(s)
- Matthew Fenwick
- Department of Molecular Biology and Biophysics, UConn Health, Farmington, Connecticut 06030-3305
| | - Gerard Weatherby
- Department of Molecular Biology and Biophysics, UConn Health, Farmington, Connecticut 06030-3305
| | - Jay Vyas
- Department of Molecular Biology and Biophysics, UConn Health, Farmington, Connecticut 06030-3305
| | - Colbert Sesanker
- Department of Molecular Biology and Biophysics, UConn Health, Farmington, Connecticut 06030-3305
| | - Timothy O. Martyn
- retired from Department of Engineering and Science, Rensselaer at Hartford, Hartford, Connecticut 06120
| | - Heidi J.C. Ellis
- Department of Computer Science and Information Technology Western New England College, Springfield, MA 01119
| | - Michael R. Gryk
- Department of Molecular Biology and Biophysics, UConn Health, Farmington, Connecticut 06030-3305
| |
Collapse
|
5
|
Tramesel D, Catherinot V, Delsuc MA. Modeling of NMR processing, toward efficient unattended processing of NMR experiments. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2007; 188:56-67. [PMID: 17616410 DOI: 10.1016/j.jmr.2007.05.023] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2006] [Revised: 04/27/2007] [Accepted: 05/09/2007] [Indexed: 05/16/2023]
Abstract
Many alternative processing techniques have recently been proposed in the literature. Most of these techniques rely on specific acquisition protocols as well as on specific data processing techniques, the need for an efficient versatile and expandable NMR processing tool would be a particularly timely addition to the modern NMR spectroscopy laboratory. The work presented here consists in a modeling of the various possible NMR data processing approaches. This modeling presents a common working frame for most of the modern acquisition/processing protocols. Two different data modeling approaches are presented, strong modeling and weak modeling, depending whether the system under study or the measurement is modeled. The emphasis is placed on the weak modeling approach. This modeling is implemented in a computer program developed in python and called NPK standing (standing for NMR Processing Kernel), organized in four logical layers (i) mathematical kernel; (ii) elementary actions; (iii) processing phases; (iv) processing strategies. This organisation, along with default values for most processing parameters allows the use of the program in an unattended manner, producing close to optimal spectra. Examples are shown for 1D and 2D processing, and liquid and solid NMR spectroscopy. NPK is available from the site: http://abcis.cbs.cnrs.fr/NPK.
Collapse
Affiliation(s)
- Dominique Tramesel
- Centre de Biochimie Structurale, 29 rue de Navacelles, CNRS UMR5048, INSERM U554, Université Montpellier 1 & 2, F34090 Montpellier, France
| | | | | |
Collapse
|
6
|
Delsuc MA, Tramesel D. Application du traitement par entropie maximale aux données RMN multidimensionnelles ; cas de l'échantillonnage partiel. CR CHIM 2006. [DOI: 10.1016/j.crci.2005.06.013] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
7
|
Shimba N, Stern AS, Craik CS, Hoch JC, Dötsch V. Elimination of 13Calpha splitting in protein NMR spectra by deconvolution with maximum entropy reconstruction. J Am Chem Soc 2003; 125:2382-3. [PMID: 12603112 DOI: 10.1021/ja027973e] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Homonuclear 13C-13C couplings can significantly reduce the sensitivity and resolution of multidimensional NMR experiments. The most important of these couplings is the 13Calpha-13Cbeta coupling, and several different methods have been developed to eliminate its effect from spectra used for backbone assignment, including short or constant-time evolution periods, selectively labeled amino acids, and multiple-band decoupling sequences. In this communication we show that postacquisition deconvolution of the spectra with a maximum entropy algorithm can be superior to experimental decoupling. The method is very robust, does not introduce shifts of the resonance positions, and simplifies the measurement of the most important NMR experiments for protein backbone assignment.
Collapse
Affiliation(s)
- Nobuhisa Shimba
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA 94143, USA
| | | | | | | | | |
Collapse
|