1
|
Jin H, Moseley HNB. md_harmonize: A Python Package for Atom-Level Harmonization of Public Metabolic Databases. Metabolites 2023; 13:1199. [PMID: 38132881 PMCID: PMC10744849 DOI: 10.3390/metabo13121199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 12/13/2023] [Accepted: 12/15/2023] [Indexed: 12/23/2023] Open
Abstract
A major challenge to integrating public metabolic resources is the use of different nomenclatures by individual databases. This paper presents md_harmonize, an open-source Python package for harmonizing compounds and metabolic reactions across various metabolic databases. The md_harmonize package utilizes a neighborhood-specific graph coloring method for generating a unique identifier for each compound via atom identifiers based on a compound's chemical structure. The resulting harmonized compounds and reactions can be used for various downstream analyses, including the construction of atom-resolved metabolic networks and models for metabolic flux analysis. Parts of the md_harmonize package have been optimized using a variety of computational techniques to allow certain NP-complete problems handled by the software to be tractable for these specific use-cases. The software is available on GitHub and through the Python Package Index, with end-user documentation hosted on GitHub Pages.
Collapse
Affiliation(s)
- Huan Jin
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA;
| | - Hunter N. B. Moseley
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA;
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40536, USA
| |
Collapse
|
2
|
Huckvale ED, Powell CD, Jin H, Moseley HNB. Benchmark Dataset for Training Machine Learning Models to Predict the Pathway Involvement of Metabolites. Metabolites 2023; 13:1120. [PMID: 37999216 PMCID: PMC10673125 DOI: 10.3390/metabo13111120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 10/25/2023] [Accepted: 10/30/2023] [Indexed: 11/25/2023] Open
Abstract
Metabolic pathways are a human-defined grouping of life sustaining biochemical reactions, metabolites being both the reactants and products of these reactions. But many public datasets include identified metabolites whose pathway involvement is unknown, hindering metabolic interpretation. To address these shortcomings, various machine learning models, including those trained on data from the Kyoto Encyclopedia of Genes and Genomes (KEGG), have been developed to predict the pathway involvement of metabolites based on their chemical descriptions; however, these prior models are based on old metabolite KEGG-based datasets, including one benchmark dataset that is invalid due to the presence of over 1500 duplicate entries. Therefore, we have developed a new benchmark dataset derived from the KEGG following optimal standards of scientific computational reproducibility and including all source code needed to update the benchmark dataset as KEGG changes. We have used this new benchmark dataset with our atom coloring methodology to develop and compare the performance of Random Forest, XGBoost, and multilayer perceptron with autoencoder models generated from our new benchmark dataset. Best overall weighted average performance across 1000 unique folds was an F1 score of 0.8180 and a Matthews correlation coefficient of 0.7933, which was provided by XGBoost binary classification models for 11 KEGG-defined pathway categories.
Collapse
Affiliation(s)
- Erik D. Huckvale
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
| | - Christian D. Powell
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Department of Computer Science (Data Science Program), University of Kentucky, Lexington, KY 40506, USA
| | - Huan Jin
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N. B. Moseley
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY 40506, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40506, USA
| |
Collapse
|
3
|
Huckvale ED, Powell CD, Jin H, Moseley HN. Benchmark dataset for training machine learning models to predict the pathway involvement of metabolites. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.03.560715. [PMID: 37873272 PMCID: PMC10592640 DOI: 10.1101/2023.10.03.560715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Metabolic pathways are a human-defined grouping of life sustaining biochemical reactions, metabolites being both the reactants and products of these reactions. But many public datasets include identified metabolites whose pathway involvement is unknown, hindering metabolic interpretation. To address these shortcomings, various machine learning models, including those trained on data from the Kyoto Encyclopedia of Genes and Genomes (KEGG), have been developed to predict the pathway involvement of metabolites based on their chemical descriptions; however, these prior models are based on old metabolite KEGG-based datasets, including one benchmark dataset that is invalid due to the presence of over 1500 duplicate entries. Therefore, we have developed a new benchmark dataset derived from the KEGG following optimal standards of scientific computational reproducibility and including all source code needed to update the benchmark dataset as KEGG changes. We have used this new benchmark dataset with our atom coloring methodology to develop and compare the performance of Random Forest, XGBoost, and multilayer perceptron with autoencoder models generated from our new benchmark dataset. Best overall weighted average performance across 1000 unique folds was an F1-score of 0.8180 and Matthews correlation coefficient of 0.7933, which was provided by XGBoost binary classification models for 11 KEGG-defined pathway categories.
Collapse
Affiliation(s)
- Erik D. Huckvale
- Department of Computer Science (Data Science Program), University of Kentucky, Lexington, KY 40506, USA
| | - Christian D. Powell
- Department of Computer Science (Data Science Program), University of Kentucky, Lexington, KY 40506, USA
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
| | - Huan Jin
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N.B. Moseley
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY 40506, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40506, USA
| |
Collapse
|
4
|
Mitchell JM, Flight RM, Moseley HNB. Untargeted Lipidomics of Non-Small Cell Lung Carcinoma Demonstrates Differentially Abundant Lipid Classes in Cancer vs. Non-Cancer Tissue. Metabolites 2021; 11:740. [PMID: 34822397 PMCID: PMC8622625 DOI: 10.3390/metabo11110740] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 10/24/2021] [Accepted: 10/25/2021] [Indexed: 01/25/2023] Open
Abstract
Lung cancer remains the leading cause of cancer death worldwide and non-small cell lung carcinoma (NSCLC) represents 85% of newly diagnosed lung cancers. In this study, we utilized our untargeted assignment tool Small Molecule Isotope Resolved Formula Enumerator (SMIRFE) and ultra-high-resolution Fourier transform mass spectrometry to examine lipid profile differences between paired cancerous and non-cancerous lung tissue samples from 86 patients with suspected stage I or IIA primary NSCLC. Correlation and co-occurrence analysis revealed significant lipid profile differences between cancer and non-cancer samples. Further analysis of machine-learned lipid categories for the differentially abundant molecular formulas identified a high abundance sterol, high abundance and high m/z sphingolipid, and low abundance glycerophospholipid metabolic phenotype across the NSCLC samples. At the class level, higher abundances of sterol esters and lower abundances of cardiolipins were observed suggesting altered stearoyl-CoA desaturase 1 (SCD1) or acetyl-CoA acetyltransferase (ACAT1) activity and altered human cardiolipin synthase 1 or lysocardiolipin acyltransferase activity respectively, the latter of which is known to confer apoptotic resistance. The presence of a shared metabolic phenotype across a variety of genetically distinct NSCLC subtypes suggests that this phenotype is necessary for NSCLC development and may result from multiple distinct genetic lesions. Thus, targeting the shared affected pathways may be beneficial for a variety of genetically distinct NSCLC subtypes.
Collapse
Affiliation(s)
- Joshua M. Mitchell
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA;
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA;
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
| | - Robert M. Flight
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA;
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N. B. Moseley
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA;
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA;
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40536, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
| |
Collapse
|
5
|
Jin H, Moseley HNB. Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases. Metabolites 2021; 11:metabo11070431. [PMID: 34209357 PMCID: PMC8307411 DOI: 10.3390/metabo11070431] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 06/26/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022] Open
Abstract
Metabolic models have been proven to be useful tools in system biology and have been successfully applied to various research fields in a wide range of organisms. A relatively complete metabolic network is a prerequisite for deriving reliable metabolic models. The first step in constructing metabolic network is to harmonize compounds and reactions across different metabolic databases. However, effectively integrating data from various sources still remains a big challenge. Incomplete and inconsistent atomistic details in compound representations across databases is a very important limiting factor. Here, we optimized a subgraph isomorphism detection algorithm to validate generic compound pairs. Moreover, we defined a set of harmonization relationship types between compounds to deal with inconsistent chemical details while successfully capturing atom-level characteristics, enabling a more complete enabling compound harmonization across metabolic databases. In total, 15,704 compound pairs across KEGG (Kyoto Encyclopedia of Genes and Genomes) and MetaCyc databases were detected. Furthermore, utilizing the classification of compound pairs and EC (Enzyme Commission) numbers of reactions, we established hierarchical relationships between metabolic reactions, enabling the harmonization of 3856 reaction pairs. In addition, we created and used atom-specific identifiers to evaluate the consistency of atom mappings within and between harmonized reactions, detecting some consistency issues between the reaction and compound descriptions in these metabolic databases.
Collapse
Affiliation(s)
- Huan Jin
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA;
| | - Hunter N. B. Moseley
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40536, USA
- Correspondence: ; Tel.: +1-859-218-2964
| |
Collapse
|
6
|
Atom Identifiers Generated by a Neighborhood-Specific Graph Coloring Method Enable Compound Harmonization across Metabolic Databases. Metabolites 2020; 10:metabo10090368. [PMID: 32933023 PMCID: PMC7570338 DOI: 10.3390/metabo10090368] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Revised: 09/04/2020] [Accepted: 09/08/2020] [Indexed: 02/06/2023] Open
Abstract
Metabolic flux analysis requires both a reliable metabolic model and reliable metabolic profiles in characterizing metabolic reprogramming. Advances in analytic methodologies enable production of high-quality metabolomics datasets capturing isotopic flux. However, useful metabolic models can be difficult to derive due to the lack of relatively complete atom-resolved metabolic networks for a variety of organisms, including human. Here, we developed a neighborhood-specific graph coloring method that creates unique identifiers for each atom in a compound facilitating construction of an atom-resolved metabolic network. What is more, this method is guaranteed to generate the same identifier for symmetric atoms, enabling automatic identification of possible additional mappings caused by molecular symmetry. Furthermore, a compound coloring identifier derived from the corresponding atom coloring identifiers can be used for compound harmonization across various metabolic network databases, which is an essential first step in network integration. With the compound coloring identifiers, 8865 correspondences between KEGG (Kyoto Encyclopedia of Genes and Genomes) and MetaCyc compounds are detected, with 5451 of them confirmed by other identifiers provided by the two databases. In addition, we found that the Enzyme Commission numbers (EC) of reactions can be used to validate possible correspondence pairs, with 1848 unconfirmed pairs validated by commonality in reaction ECs. Moreover, we were able to detect various issues and errors with compound representation in KEGG and MetaCyc databases by compound coloring identifiers, demonstrating the usefulness of this methodology for database curation.
Collapse
|
7
|
Mitchell JM, Flight RM, Moseley HN. Deriving Lipid Classification Based on Molecular Formulas. Metabolites 2020; 10:E122. [PMID: 32214009 PMCID: PMC7143220 DOI: 10.3390/metabo10030122] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Revised: 03/02/2020] [Accepted: 03/21/2020] [Indexed: 12/20/2022] Open
Abstract
Despite instrument and algorithmic improvements, the untargeted and accurate assignment of metabolites remains an unsolved problem in metabolomics. New assignment methods such as our SMIRFE algorithm can assign elemental molecular formulas to observed spectral features in a highly untargeted manner without orthogonal information from tandem MS or chromatography. However, for many lipidomics applications, it is necessary to know at least the lipid category or class that is associated with a detected spectral feature to derive a biochemical interpretation. Our goal is to develop a method for robustly classifying elemental molecular formula assignments into lipid categories for an application to SMIRFE-generated assignments. Using a Random Forest machine learning approach, we developed a method that can predict lipid category and class from SMIRFE non-adducted molecular formula assignments. Our methods achieve high average predictive accuracy (>90%) and precision (>83%) across all eight of the lipid categories in the LIPIDMAPS database. Classification performance was evaluated using sets of theoretical, data-derived, and artifactual molecular formulas. Our methods enable the lipid classification of non-adducted molecular formula assignments generated by SMIRFE without orthogonal information, facilitating the biochemical interpretation of untargeted lipidomics experiments. This lipid classification appears insufficient for validating single-spectrum assignments, but could be useful in cross-spectrum assignment validation.
Collapse
Affiliation(s)
- Joshua M. Mitchell
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA; (J.M.M.); (R.M.F.)
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
| | - Robert M. Flight
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA; (J.M.M.); (R.M.F.)
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N.B. Moseley
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA; (J.M.M.); (R.M.F.)
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40536, USA
- Center for Clinical and Translational Science, University of Kentucky, Lexington, KY 40536, USA
| |
Collapse
|
8
|
Mitchell JM, Flight RM, Moseley HNB. Small Molecule Isotope Resolved Formula Enumeration: A Methodology for Assigning Isotopologues and Metabolite Formulas in Fourier Transform Mass Spectra. Anal Chem 2019; 91:8933-8940. [PMID: 31260262 DOI: 10.1021/acs.analchem.9b00748] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Improvements in Fourier transform mass spectrometry (FT-MS) enable increasingly more complex experiments in the field of metabolomics. What is directly detected in FT-MS spectra are spectral features (peaks) that correspond to sets of adducted and charged forms of specific molecules in the sample. The robust assignment of these features is an essential step for MS-based metabolomics experiments, but the sheer complexity of what is detected and a variety of analytically introduced variance, errors, and artifacts has hindered the systematic analysis of complex patterns of observed peaks with respect to isotope content. We have developed a method called SMIRFE that detects small biomolecules and determines their elemental molecular formula (EMF) using detected sets of isotopologue peaks sharing the same EMF. SMIRFE does not use a database of known metabolite formulas; instead a nearly comprehensive search space of all isotopologues within a mass range is constructed and used for assignment. This search space can be tailored for different isotope labeling patterns expected in different stable isotope tracing experiments. Using consumer-level computing equipment, a large search space of 2000 Da was constructed, and assignment performance was evaluated and validated using verified assignments on a pair of peak lists derived from spectra containing unlabeled and 15N-labeled versions of amino acids derivatized using ethylchloroformate. SMIRFE identified 18 of 18 predicted derivatized EMFs, and each assignment was evaluated statistically and assigned an e-value representing the probability to occur by chance.
Collapse
|
9
|
Hein DW, Kidd LCR. Design and Success of a 21st Century Cancer Education Program at the University of Louisville. JOURNAL OF CANCER EDUCATION : THE OFFICIAL JOURNAL OF THE AMERICAN ASSOCIATION FOR CANCER EDUCATION 2018; 33:298-308. [PMID: 27474114 PMCID: PMC5280580 DOI: 10.1007/s13187-016-1083-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Cancer incidence, morbidity, and mortality in the Commonwealth of Kentucky are among the highest in the nation. The University of Louisville was the recipient of a National Cancer Institute (NCI)-funded cancer education program grant in 1975 under the leadership of Dr. Norbert Burzynski. A new and totally redesigned performance-based University of Louisville Cancer Education Program was funded by NCI in 2011 to recruit and motivate outstanding undergraduate and health professional students to pursue further training and careers in cancer research. Here, we describe the strategy, design, methods, implementation, and accomplishments of our twenty-first century performance-based cancer education program. Our program will meet or exceed all of its 5-year performance goals, including the total number students (n = 156) and under-represented minorities (n = 53) who successfully completed the program under the mentorship of cancer research-intensive faculty members of the James Graham Brown Cancer Center (JGBCC). The mentored research program is complemented with professional development and enhancement activities, including cancer research seminars presented by faculty members actively engaged in research centered on the diagnosis, treatment or prevention of cancer, creation of individual career development plans, exploration of cancer research careers, and acquisition of professionalism skills. Student interests towards cancer research significantly increased after completion of the program compared to baseline (P = 0.02). Based on quantitative and qualitative analysis of various components of the curricula, the trainees favor practical, engaging, and interactive activities aligned within professional career goals and objectives. For instance, the trainees prefer two 30-min small group discussions on "Navigating Careers in Cancer Research" with faculty, professional students, and program alumni. Future updates to the program include new activities that capitalize on the cross-disciplinary background of our mentors and trainees as well as a team-based approach to professional development. Our cancer education program will continue to enhance the professional development of the next generation of cancer scientists and clinicians.
Collapse
Affiliation(s)
- David W Hein
- Department of Pharmacology & Toxicology and James Graham Brown Cancer Center, University of Louisville, Kosair Charities Clinical and Translational Research, 505 South Hancock Street, Louisville, KY, USA.
| | - La Creis R Kidd
- Department of Pharmacology & Toxicology and James Graham Brown Cancer Center, University of Louisville, Kosair Charities Clinical and Translational Research, 505 South Hancock Street, Louisville, KY, USA
| |
Collapse
|
10
|
Yao S, Flight RM, Rouchka EC, Moseley HNB. Perspectives and expectations in structural bioinformatics of metalloproteins. Proteins 2017; 85:938-944. [PMID: 28168746 PMCID: PMC5389925 DOI: 10.1002/prot.25263] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Revised: 01/16/2017] [Accepted: 01/31/2017] [Indexed: 01/21/2023]
Abstract
Recent papers highlight the presence of large numbers of compressed angles in metal ion coordination geometries for metalloprotein entries in the worldwide Protein Data Bank, due mainly to multidentate coordination. The prevalence of these compressed angles has raised the controversial idea that significantly populated aberrant or even novel coordination geometries may exist. Some of these papers have undergone severe criticism, apparently due to views held that only canonical coordination geometries exist in significant numbers. While criticism of controversial ideas is warranted and to be expected, we believe that a line was crossed where unfair criticism was put forth to discredit an inconvenient result that compressed angles exist in large numbers, which does not support the dogmatic canonical coordination geometry view. We present a review of the major controversial results and their criticisms, pointing out both good suggestions that have been incorporated in new analyses, but also unfair criticism that was put forth to support a particular view. We also suggest that better science is enabled through: (i) a more collegial and collaborative approach in future critical reviews and (ii) the requirement for a description of methods and data including source code and visualizations that enables full reproducibility of results. Proteins 2017; 85:938-944. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Sen Yao
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, Kentucky, 40292
- Department of Computer Engineering and Computer Science, University of Louisville, Louisville, Kentucky, 40292
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, 40356
- Markey Cancer Center, University of Kentucky, Lexington, Kentucky, 40356
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, Kentucky, 40356
| | - Robert M Flight
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, 40356
- Markey Cancer Center, University of Kentucky, Lexington, Kentucky, 40356
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, Kentucky, 40356
| | - Eric C Rouchka
- School of Interdisciplinary and Graduate Studies, University of Louisville, Louisville, Kentucky, 40292
- Department of Computer Engineering and Computer Science, University of Louisville, Louisville, Kentucky, 40292
| | - Hunter N B Moseley
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, Kentucky, 40356
- Markey Cancer Center, University of Kentucky, Lexington, Kentucky, 40356
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, Kentucky, 40356
| |
Collapse
|
11
|
Lane AN, Higashi RM, Fan TWM. Preclinical models for interrogating drug action in human cancers using Stable Isotope Resolved Metabolomics (SIRM). Metabolomics 2016; 12:118. [PMID: 27489532 PMCID: PMC4968890 DOI: 10.1007/s11306-016-1065-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
AIMS In this review we compare the advantages and disadvantages of different model biological systems for determining the metabolic functions of cells in complex environments, how they may change in different disease states, and respond to therapeutic interventions. BACKGROUND All preclinical drug-testing models have advantages and drawbacks. We compare and contrast established cell, organoid and animal models with ex vivo organ or tissue culture and in vivo human experiments in the context of metabolic readout of drug efficacy. As metabolism reports directly on the biochemical state of cells and tissues, it can be very sensitive to drugs and/or other environmental changes. This is especially so when metabolic activities are probed by stable isotope tracing methods, which can also provide detailed mechanistic information on drug action. We have developed and been applying Stable Isotope-Resolved Metabolomics (SIRM) to examine metabolic reprogramming of human lung cancer cells in monoculture, in mouse xenograft/explant models, and in lung cancer patients in situ (Lane et al. 2011; T. W. Fan et al. 2011; T. W-M. Fan et al. 2012; T. W. Fan et al. 2012; Xie et al. 2014b; Ren et al. 2014a; Sellers et al. 2015b). We are able to determine the influence of the tumor microenvironment using these models. We have now extended the range of models to fresh human tissue slices, similar to those originally described by O. Warburg (Warburg 1923), which retain the native tissue architecture and heterogeneity with a paired benign versus cancer design under defined cell culture conditions. This platform offers an unprecedented human tissue model for preclinical studies on metabolic reprogramming of human cancer cells in their tissue context, and response to drug treatment (Xie et al. 2014a). As the microenvironment of the target human tissue is retained and individual patient's response to drugs is obtained, this platform promises to transcend current limitations of drug selection for clinical trials or treatments. CONCLUSIONS AND FUTURE WORK Development of ex vivo human tissue and animal models with humanized organs including bone marrow and liver show considerable promise for analyzing drug responses that are more relevant to humans. Similarly using stable isotope tracer methods with these improved models in advanced stages of the drug development pipeline, in conjunction with tissue biopsy is expected significantly to reduce the high failure rate of experimental drugs in Phase II and III clinical trials.
Collapse
Affiliation(s)
- Andrew N Lane
- Center for Environmental and Systems Biochemistry, University of Kentucky
| | - Richard M Higashi
- Center for Environmental and Systems Biochemistry, University of Kentucky
| | - Teresa W-M Fan
- Center for Environmental and Systems Biochemistry, University of Kentucky
| |
Collapse
|
12
|
Yamanishi Y, Tabei Y, Kotera M. Metabolome-scale de novo pathway reconstruction using regioisomer-sensitive graph alignments. Bioinformatics 2015; 31:i161-70. [PMID: 26072478 PMCID: PMC4765868 DOI: 10.1093/bioinformatics/btv224] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Recent advances in mass spectrometry and related metabolomics technologies have enabled the rapid and comprehensive analysis of numerous metabolites. However, biosynthetic and biodegradation pathways are only known for a small portion of metabolites, with most metabolic pathways remaining uncharacterized. RESULTS In this study, we developed a novel method for supervised de novo metabolic pathway reconstruction with an improved graph alignment-based approach in the reaction-filling framework. We proposed a novel chemical graph alignment algorithm, which we called PACHA (Pairwise Chemical Aligner), to detect the regioisomer-sensitive connectivities between the aligned substructures of two compounds. Unlike other existing graph alignment methods, PACHA can efficiently detect only one common subgraph between two compounds. Our results show that the proposed method outperforms previous descriptor-based methods or existing graph alignment-based methods in the enzymatic reaction-likeness prediction for isomer-enriched reactions. It is also useful for reaction annotation that assigns potential reaction characteristics such as EC (Enzyme Commission) numbers and PIERO (Enzymatic Reaction Ontology for Partial Information) terms to substrate-product pairs. Finally, we conducted a comprehensive enzymatic reaction-likeness prediction for all possible uncharacterized compound pairs, suggesting potential metabolic pathways for newly predicted substrate-product pairs.
Collapse
Affiliation(s)
- Yoshihiro Yamanishi
- Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan
| | - Yasuo Tabei
- Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan
| | - Masaaki Kotera
- Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan
| |
Collapse
|
13
|
Vaniya A, Fiehn O. Using fragmentation trees and mass spectral trees for identifying unknown compounds in metabolomics. Trends Analyt Chem 2015; 69:52-61. [PMID: 26213431 PMCID: PMC4509603 DOI: 10.1016/j.trac.2015.04.002] [Citation(s) in RCA: 97] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Identification of unknown metabolites is the bottleneck in advancing metabolomics, leaving interpretation of metabolomics results ambiguous. The chemical diversity of metabolism is vast, making structure identification arduous and time consuming. Currently, comprehensive analysis of mass spectra in metabolomics is limited to library matching, but tandem mass spectral libraries are small compared to the large number of compounds found in the biosphere, including xenobiotics. Resolving this bottleneck requires richer data acquisition and better computational tools. Multi-stage mass spectrometry (MSn) trees show promise to aid in this regard. Fragmentation trees explore the fragmentation process, generate fragmentation rules and aid in sub-structure identification, while mass spectral trees delineate the dependencies in multi-stage MS of collision-induced dissociations. This review covers advancements over the past 10 years as a tool for metabolite identification, including algorithms, software and databases used to build and to implement fragmentation trees and mass spectral annotations.
Collapse
Affiliation(s)
- Arpana Vaniya
- University of California Davis, Department of Chemistry, One Shields Avenue, Davis, CA 95616, USA
- University of California Davis, West Coast Metabolomics Center, Genome Center, 451 Health Sciences Drive, Davis, CA 95616, USA
| | - Oliver Fiehn
- University of California Davis, West Coast Metabolomics Center, Genome Center, 451 Health Sciences Drive, Davis, CA 95616, USA
- King Abdulaziz University, Biochemistry Department, Jeddah, Saudi Arabia
| |
Collapse
|
14
|
Johnson AR, Makowski L. Nutrition and metabolic correlates of obesity and inflammation: clinical considerations. J Nutr 2015; 145:1131S-1136S. [PMID: 25833891 PMCID: PMC4410497 DOI: 10.3945/jn.114.200758] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Accepted: 12/08/2014] [Indexed: 11/14/2022] Open
Abstract
Since 1980, the global prevalence of obesity has doubled; in the United States, it has almost tripled. Billions of people are overweight and obese; the WHO reports that >65% of the world's population die of diseases related to overweight rather than underweight. Obesity is a complex disease that can be studied from "metropolis to metabolite"—that is, beginning at the policy and the population level through epidemiology and intervention studies; to bench work including preclinical models, tissue, and cell culture studies; to biochemical assays; and to metabolomics. Metabolomics is the next research frontier because it provides a real-time snapshot of biochemical building blocks and products of cellular processes. This report comments on practical considerations when conducting metabolomics research. The pros and cons and important study design concerns are addressed to aid in increasing metabolomics research in the United States. The link between metabolism and inflammation is an understudied phenomenon that has great potential to transform our understanding of immunometabolism in obesity, diabetes, cancer, and other diseases; metabolomics promises to be an important tool in understanding the complex relations between factors contributing to such diseases.
Collapse
Affiliation(s)
- Amy R Johnson
- Department of Nutrition, Gillings School of Global Public Health, and
| | - Liza Makowski
- Department of Nutrition, Gillings School of Global Public Health, and School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC
| |
Collapse
|
15
|
Abram F. Systems-based approaches to unravel multi-species microbial community functioning. Comput Struct Biotechnol J 2014; 13:24-32. [PMID: 25750697 PMCID: PMC4348430 DOI: 10.1016/j.csbj.2014.11.009] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Revised: 11/25/2014] [Accepted: 11/26/2014] [Indexed: 01/24/2023] Open
Abstract
Some of the most transformative discoveries promising to enable the resolution of this century's grand societal challenges will most likely arise from environmental science and particularly environmental microbiology and biotechnology. Understanding how microbes interact in situ, and how microbial communities respond to environmental changes remains an enormous challenge for science. Systems biology offers a powerful experimental strategy to tackle the exciting task of deciphering microbial interactions. In this framework, entire microbial communities are considered as metaorganisms and each level of biological information (DNA, RNA, proteins and metabolites) is investigated along with in situ environmental characteristics. In this way, systems biology can help unravel the interactions between the different parts of an ecosystem ultimately responsible for its emergent properties. Indeed each level of biological information provides a different level of characterisation of the microbial communities. Metagenomics, metatranscriptomics, metaproteomics, metabolomics and SIP-omics can be employed to investigate collectively microbial community structure, potential, function, activity and interactions. Omics approaches are enabled by high-throughput 21st century technologies and this review will discuss how their implementation has revolutionised our understanding of microbial communities.
Collapse
Affiliation(s)
- Florence Abram
- Functional Environmental Microbiology, School of Natural Sciences, National University of Ireland Galway, University Road, Galway, Ireland
| |
Collapse
|