1
|
Metz TO, Chang CH, Gautam V, Anjum A, Tian S, Wang F, Colby SM, Nunez JR, Blumer MR, Edison AS, Fiehn O, Jones DP, Li S, Morgan ET, Patti GJ, Ross DH, Shapiro MR, Williams AJ, Wishart DS. Introducing "Identification Probability" for Automated and Transferable Assessment of Metabolite Identification Confidence in Metabolomics and Related Studies. Anal Chem 2024. [PMID: 39699939 DOI: 10.1021/acs.analchem.4c04060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2024]
Abstract
Methods for assessing compound identification confidence in metabolomics and related studies have been debated and actively researched for the past two decades. The earliest effort in 2007 focused primarily on mass spectrometry and nuclear magnetic resonance spectroscopy and resulted in four recommended levels of metabolite identification confidence─the Metabolite Standards Initiative (MSI) Levels. In 2014, the original MSI Levels were expanded to five levels (including two sublevels) to facilitate communication of compound identification confidence in high resolution mass spectrometry studies. Further refinement in identification levels have occurred, for example to accommodate use of ion mobility spectrometry in metabolomics workflows, and alternate approaches to communicate compound identification confidence also have been developed based on identification points schema. However, neither qualitative levels of identification confidence nor quantitative scoring systems address the degree of ambiguity in compound identifications in the context of the chemical space being considered. Neither are they easily automated nor transferable between analytical platforms. In this perspective, we propose that the metabolomics and related communities consider identification probability as an approach for automated and transferable assessment of compound identification and ambiguity in metabolomics and related studies. Identification probability is defined simply as 1/N, where N is the number of compounds in a database that matches an experimentally measured molecule within user-defined measurement precision(s), for example mass measurement or retention time accuracy, etc. We demonstrate the utility of identification probability in an in silico analysis of multiproperty reference libraries constructed from a subset of the Human Metabolome Database and computational property predictions, provide guidance to the community in transparent implementation of the concept, and invite the community to further evaluate this concept in parallel with their current preferred methods for assessing metabolite identification confidence.
Collapse
Affiliation(s)
- Thomas O Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Christine H Chang
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Vasuk Gautam
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
| | - Afia Anjum
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
| | - Siyang Tian
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
| | - Fei Wang
- Department of Computing Science, University of Alberta, Edmonton, Alberta T6G 2E8, Canada
- Alberta Machine Intelligence Institute, Edmonton, Alberta T5J 1S5, Canada
| | - Sean M Colby
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Jamie R Nunez
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Madison R Blumer
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Arthur S Edison
- Department of Biochemistry & Molecular Biology, Complex Carbohydrate Research Center and Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, United States
| | - Oliver Fiehn
- West Coast Metabolomics Center, University of California Davis, Davis, California 95616, United States
| | - Dean P Jones
- Clinical Biomarkers Laboratory, Department of Medicine, Emory University, Atlanta, Georgia 30322, United States
| | - Shuzhao Li
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, United States
| | - Edward T Morgan
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Atlanta, Georgia 30322, United States
| | - Gary J Patti
- Center for Mass Spectrometry and Metabolic Tracing, Department of Chemistry, Department of Medicine, Washington University, Saint Louis, Missouri 63105, United States
| | - Dylan H Ross
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Madelyn R Shapiro
- Artificial Intelligence & Data Analytics Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Antony J Williams
- U.S. Environmental Protection Agency, Office of Research & Development, Center for Computational Toxicology & Exposure (CCTE), Research Triangle Park, North Carolina 27711, United States
| | - David S Wishart
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
| |
Collapse
|
2
|
Metz TO, Chang CH, Gautam V, Anjum A, Tian S, Wang F, Colby SM, Nunez JR, Blumer MR, Edison AS, Fiehn O, Jones DP, Li S, Morgan ET, Patti GJ, Ross DH, Shapiro MR, Williams AJ, Wishart DS. Introducing 'identification probability' for automated and transferable assessment of metabolite identification confidence in metabolomics and related studies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.30.605945. [PMID: 39131324 PMCID: PMC11312557 DOI: 10.1101/2024.07.30.605945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Methods for assessing compound identification confidence in metabolomics and related studies have been debated and actively researched for the past two decades. The earliest effort in 2007 focused primarily on mass spectrometry and nuclear magnetic resonance spectroscopy and resulted in four recommended levels of metabolite identification confidence - the Metabolite Standards Initiative (MSI) Levels. In 2014, the original MSI Levels were expanded to five levels (including two sublevels) to facilitate communication of compound identification confidence in high resolution mass spectrometry studies. Further refinement in identification levels have occurred, for example to accommodate use of ion mobility spectrometry in metabolomics workflows, and alternate approaches to communicate compound identification confidence also have been developed based on identification points schema. However, neither qualitative levels of identification confidence nor quantitative scoring systems address the degree of ambiguity in compound identifications in context of the chemical space being considered, are easily automated, or are transferable between analytical platforms. In this perspective, we propose that the metabolomics and related communities consider identification probability as an approach for automated and transferable assessment of compound identification and ambiguity in metabolomics and related studies. Identification probability is defined simply as 1/N, where N is the number of compounds in a reference library or chemical space that match to an experimentally measured molecule within user-defined measurement precision(s), for example mass measurement or retention time accuracy, etc. We demonstrate the utility of identification probability in an in silico analysis of multi-property reference libraries constructed from the Human Metabolome Database and computational property predictions, provide guidance to the community in transparent implementation of the concept, and invite the community to further evaluate this concept in parallel with their current preferred methods for assessing metabolite identification confidence.
Collapse
Affiliation(s)
- Thomas O. Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Christine H. Chang
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Vasuk Gautam
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Afia Anjum
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Siyang Tian
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Fei Wang
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
- Alberta Machine Intelligence Institute, Edmonton, AB, Canada
| | - Sean M. Colby
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Jamie R. Nunez
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Madison R. Blumer
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Arthur S. Edison
- Department of Biochemistry & Molecular Biology, Complex Carbohydrate Research Center and Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| | - Oliver Fiehn
- West Coast Metabolomics Center, University of California Davis, Davis, CA, USA
| | - Dean P. Jones
- Clinical Biomarkers Laboratory, Department of Medicine, Emory University, Atlanta, Georgia, USA
| | - Shuzhao Li
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Edward T. Morgan
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Gary J. Patti
- Center for Mass Spectrometry and Metabolic Tracing, Department of Chemistry, Department of Medicine, Washington University, Saint Louis, Missouri, USA
| | - Dylan H. Ross
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Madelyn R. Shapiro
- Artificial Intelligence & Data Analytics Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Antony J. Williams
- U.S. Environmental Protection Agency, Office of Research & Development, Center for Computational Toxicology & Exposure (CCTE), Research Triangle Park, NC USA
| | - David S. Wishart
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
3
|
Shen ZJ, Xu SX, Huang QY, Li ZY, Xu YD, Lin CS, Huang YJ. TMT proteomics analysis of a pseudocereal crop, quinoa ( Chenopodium quinoa Willd.), during seed maturation. FRONTIERS IN PLANT SCIENCE 2022; 13:975073. [PMID: 36426144 PMCID: PMC9678934 DOI: 10.3389/fpls.2022.975073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Accepted: 09/20/2022] [Indexed: 06/16/2023]
Abstract
Quinoa (Chenopodium quinoa Willd.), an Andean native crop, is increasingly popular around the world due to its high nutritional content and stress tolerance. The production and the popularity of this strategic global food are greatly restricted by many limiting factors, such as seed pre-harvest sprouting, bitter saponin, etc. To solve these problems, the underlying mechanism of seed maturation in quinoa needs to be investigated. In this study, based on the investigation of morphological characteristics, a quantitative analysis of its global proteome was conducted using the combinational proteomics of tandem mass tag (TMT) labeling and parallel reaction monitoring (PRM). The proteome changes related to quinoa seed maturation conversion were monitored to aid its genetic improvement. Typical changes of morphological characteristics were discovered during seed maturation, including mean grain diameter, mean grain thickness, mean hundred-grain weight, palea, episperm color, etc. With TMT proteomics analysis, 581 differentially accumulated proteins (DAPs) were identified. Functional classification analysis and Gene Ontology enrichment analysis showed that most DAPs involved in photosynthesis were downregulated, indicating low levels of photosynthesis. DAPs that participated in glycolysis, such as glyceraldehyde-3-phosphate dehydrogenase, pyruvate decarboxylase, and alcohol dehydrogenase, were upregulated to fulfill the increasing requirement of energy consumption during maturation conversion. The storage proteins, such as globulins, legumins, vicilins, and oleosin, were also increased significantly during maturation conversion. Protein-protein interaction analysis and function annotation revealed that the upregulation of oleosin, oil body-associated proteins, and acyl-coenzyme A oxidase 2 resulted in the accumulation of oil in quinoa seeds. The downregulation of β-amyrin 28-oxidase was observed, indicating the decreasing saponin content, during maturation, which makes the quinoa "sweet". By the PRM and qRT-PCR analysis, the expression patterns of most selected DAPs were consistent with the result of TMT proteomics. Our study enhanced the understanding of the maturation conversion in quinoa. This might be the first and most important step toward the genetic improvement of quinoa.
Collapse
Affiliation(s)
- Zhi-Jun Shen
- Fujian Key Laboratory of Subtropical Plant Physiology and Biochemistry, Fujian Institute of Subtropical Botany, Xiamen, China
| | - Su-Xia Xu
- Fujian Key Laboratory of Subtropical Plant Physiology and Biochemistry, Fujian Institute of Subtropical Botany, Xiamen, China
| | - Qing-Yun Huang
- Fujian Key Laboratory of Subtropical Plant Physiology and Biochemistry, Fujian Institute of Subtropical Botany, Xiamen, China
| | - Zi-Yang Li
- Institute of Gene Science for Bamboo and Rattan Resources, International Center for Bamboo and Rattan, Beijing, China
| | - Yi-Ding Xu
- Landscape Architecture and Landscape Research Branch, China Academy of Urban Planning and Design, Beijing, China
| | - Chun-Song Lin
- Fujian Key Laboratory of Subtropical Plant Physiology and Biochemistry, Fujian Institute of Subtropical Botany, Xiamen, China
| | - Yi-Jin Huang
- Department of Dermatology, The First Affiliated Hospital of Xiamen University, Xiamen, China
| |
Collapse
|
4
|
Borges R, Colby SM, Das S, Edison AS, Fiehn O, Kind T, Lee J, Merrill AT, Merz KM, Metz TO, Nunez JR, Tantillo DJ, Wang LP, Wang S, Renslow RS. Quantum Chemistry Calculations for Metabolomics. Chem Rev 2021; 121:5633-5670. [PMID: 33979149 PMCID: PMC8161423 DOI: 10.1021/acs.chemrev.0c00901] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Indexed: 02/07/2023]
Abstract
A primary goal of metabolomics studies is to fully characterize the small-molecule composition of complex biological and environmental samples. However, despite advances in analytical technologies over the past two decades, the majority of small molecules in complex samples are not readily identifiable due to the immense structural and chemical diversity present within the metabolome. Current gold-standard identification methods rely on reference libraries built using authentic chemical materials ("standards"), which are not available for most molecules. Computational quantum chemistry methods, which can be used to calculate chemical properties that are then measured by analytical platforms, offer an alternative route for building reference libraries, i.e., in silico libraries for "standards-free" identification. In this review, we cover the major roadblocks currently facing metabolomics and discuss applications where quantum chemistry calculations offer a solution. Several successful examples for nuclear magnetic resonance spectroscopy, ion mobility spectrometry, infrared spectroscopy, and mass spectrometry methods are reviewed. Finally, we consider current best practices, sources of error, and provide an outlook for quantum chemistry calculations in metabolomics studies. We expect this review will inspire researchers in the field of small-molecule identification to accelerate adoption of in silico methods for generation of reference libraries and to add quantum chemistry calculations as another tool at their disposal to characterize complex samples.
Collapse
Affiliation(s)
- Ricardo
M. Borges
- Walter
Mors Institute of Research on Natural Products, Federal University of Rio de Janeiro, Rio de Janeiro 21941-901, Brazil
| | - Sean M. Colby
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| | - Susanta Das
- Department
of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
| | - Arthur S. Edison
- Departments
of Genetics and Biochemistry and Molecular Biology, Complex Carbohydrate
Research Center and Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, United States
| | - Oliver Fiehn
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
| | - Tobias Kind
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
| | - Jesi Lee
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Amy T. Merrill
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Kenneth M. Merz
- Department
of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
| | - Thomas O. Metz
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| | - Jamie R. Nunez
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| | - Dean J. Tantillo
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Lee-Ping Wang
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Shunyang Wang
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Ryan S. Renslow
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| |
Collapse
|
5
|
Cunningham CM, Bellipanni G, Habas R, Balciunas D. Deletion of morpholino binding sites (DeMOBS) to assess specificity of morphant phenotypes. Sci Rep 2020; 10:15366. [PMID: 32958829 PMCID: PMC7506532 DOI: 10.1038/s41598-020-71708-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 08/13/2020] [Indexed: 01/05/2023] Open
Abstract
Two complimentary approaches are widely used to study gene function in zebrafish: induction of genetic mutations, usually using targeted nucleases such as CRISPR/Cas9, and suppression of gene expression, typically using Morpholino oligomers. Neither method is perfect. Morpholinos (MOs) sometimes produce off-target or toxicity-related effects that can be mistaken for true phenotypes. Conversely, genetic mutants can be subject to compensation, or may fail to yield a null phenotype due to leakiness (e.g. use of cryptic splice sites or downstream AUGs). When discrepancy between mutant and morpholino-induced (morphant) phenotypes is observed, experimental validation of such phenotypes becomes very labor intensive. We have developed a simple genetic method to differentiate between genuine morphant phenotypes and those produced due to off-target effects. We speculated that indels within 5' untranslated regions would be unlikely to have a significant negative effect on gene expression. Mutations induced within a MO target site would result in a Morpholino-refractive allele thus suppressing true MO phenotypes whilst non-specific phenotypes would remain. We tested this hypothesis on one gene with an exclusively zygotic function, tbx5a, and one gene with strong maternal effect, ctnnb2. We found that indels within the Morpholino binding site are indeed able to suppress both zygotic and maternal morphant phenotypes. We also observed that the ability of such indels to suppress morpholino phenotypes does depend on the size and the location of the deletion. Nonetheless, mutating the morpholino binding sites in both maternal and zygotic genes can ascertain the specificity of morphant phenotypes.
Collapse
Affiliation(s)
| | - Gianfranco Bellipanni
- Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA, 19122, USA
| | - Raymond Habas
- Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA, 19122, USA
| | - Darius Balciunas
- Department of Biology, College of Science and Technology, Temple University, Philadelphia, PA, 19122, USA.
| |
Collapse
|