1
|
Nguyen QH, Nguyen H, Oh EC, Nguyen T. Current approaches and outstanding challenges of functional annotation of metabolites: a comprehensive review. Brief Bioinform 2024; 25:bbae498. [PMID: 39397425 PMCID: PMC11471905 DOI: 10.1093/bib/bbae498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 09/03/2024] [Accepted: 10/02/2024] [Indexed: 10/15/2024] Open
Abstract
Metabolite profiling is a powerful approach for the clinical diagnosis of complex diseases, ranging from cardiometabolic diseases, cancer, and cognitive disorders to respiratory pathologies and conditions that involve dysregulated metabolism. Because of the importance of systems-level interpretation, many methods have been developed to identify biologically significant pathways using metabolomics data. In this review, we first describe a complete metabolomics workflow (sample preparation, data acquisition, pre-processing, downstream analysis, etc.). We then comprehensively review 24 approaches capable of performing functional analysis, including those that combine metabolomics data with other types of data to investigate the disease-relevant changes at multiple omics layers. We discuss their availability, implementation, capability for pre-processing and quality control, supported omics types, embedded databases, pathway analysis methodologies, and integration techniques. We also provide a rating and evaluation of each software, focusing on their key technique, software accessibility, documentation, and user-friendliness. Following our guideline, life scientists can easily choose a suitable method depending on method rating, available data, input format, and method category. More importantly, we highlight outstanding challenges and potential solutions that need to be addressed by future research. To further assist users in executing the reviewed methods, we provide wrappers of the software packages at https://github.com/tinnlab/metabolite-pathway-review-docker.
Collapse
Affiliation(s)
- Quang-Huy Nguyen
- Department of Computer Science and Software Engineering, Auburn University, Auburn, AL 36849, United States
| | - Ha Nguyen
- Department of Computer Science and Software Engineering, Auburn University, Auburn, AL 36849, United States
| | - Edwin C Oh
- Department of Internal Medicine, UNLV School of Medicine, University of Nevada, Las Vegas, NV 89154, United States
| | - Tin Nguyen
- Department of Computer Science and Software Engineering, Auburn University, Auburn, AL 36849, United States
| |
Collapse
|
2
|
Pakkir Shah AK, Walter A, Ottosson F, Russo F, Navarro-Diaz M, Boldt J, Kalinski JCJ, Kontou EE, Elofson J, Polyzois A, González-Marín C, Farrell S, Aggerbeck MR, Pruksatrakul T, Chan N, Wang Y, Pöchhacker M, Brungs C, Cámara B, Caraballo-Rodríguez AM, Cumsille A, de Oliveira F, Dührkop K, El Abiead Y, Geibel C, Graves LG, Hansen M, Heuckeroth S, Knoblauch S, Kostenko A, Kuijpers MCM, Mildau K, Papadopoulos Lambidis S, Portal Gomes PW, Schramm T, Steuer-Lodd K, Stincone P, Tayyab S, Vitale GA, Wagner BC, Xing S, Yazzie MT, Zuffa S, de Kruijff M, Beemelmanns C, Link H, Mayer C, van der Hooft JJJ, Damiani T, Pluskal T, Dorrestein P, Stanstrup J, Schmid R, Wang M, Aron A, Ernst M, Petras D. Statistical analysis of feature-based molecular networking results from non-targeted metabolomics data. Nat Protoc 2024:10.1038/s41596-024-01046-3. [PMID: 39304763 DOI: 10.1038/s41596-024-01046-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 07/02/2024] [Indexed: 09/22/2024]
Abstract
Feature-based molecular networking (FBMN) is a popular analysis approach for liquid chromatography-tandem mass spectrometry-based non-targeted metabolomics data. While processing liquid chromatography-tandem mass spectrometry data through FBMN is fairly streamlined, downstream data handling and statistical interrogation are often a key bottleneck. Especially users new to statistical analysis struggle to effectively handle and analyze complex data matrices. Here we provide a comprehensive guide for the statistical analysis of FBMN results, focusing on the downstream analysis of the FBMN output table. We explain the data structure and principles of data cleanup and normalization, as well as uni- and multivariate statistical analysis of FBMN results. We provide explanations and code in two scripting languages (R and Python) as well as the QIIME2 framework for all protocol steps, from data clean-up to statistical analysis. All code is shared in the form of Jupyter Notebooks ( https://github.com/Functional-Metabolomics-Lab/FBMN-STATS ). Additionally, the protocol is accompanied by a web application with a graphical user interface ( https://fbmn-statsguide.gnps2.org/ ) to lower the barrier of entry for new users and for educational purposes. Finally, we also show users how to integrate their statistical results into the molecular network using the Cytoscape visualization tool. Throughout the protocol, we use a previously published environmental metabolomics dataset for demonstration purposes. Together, the protocol, code and web application provide a complete guide and toolbox for FBMN data integration, cleanup and advanced statistical analysis, enabling new users to uncover molecular insights from their non-targeted metabolomics data. Our protocol is tailored for the seamless analysis of FBMN results from Global Natural Products Social Molecular Networking and can be easily adapted to other mass spectrometry feature detection, annotation and networking tools.
Collapse
Affiliation(s)
- Abzer K Pakkir Shah
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Axel Walter
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Filip Ottosson
- Section for Clinical Mass Spectrometry, Danish Center for Neonatal Screening, Department of Congenital Disorders, Statens Serum Institut, Copenhagen S, Denmark
| | - Francesco Russo
- Section for Clinical Mass Spectrometry, Danish Center for Neonatal Screening, Department of Congenital Disorders, Statens Serum Institut, Copenhagen S, Denmark
| | - Marcelo Navarro-Diaz
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Judith Boldt
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
- German Center for Infection Research, Partner Site Braunschweig-Hannover, Braunschweig, Germany
| | - Jarmo-Charles J Kalinski
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa
| | - Eftychia Eva Kontou
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- The Novo Nordisk Foundation for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | - James Elofson
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Alexandros Polyzois
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Boyce Thompson Institute and Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, USA
| | - Carolina González-Marín
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Universidad EAFIT, Medellín, Antioquia, Colombia
| | - Shane Farrell
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, USA
- School of Marine Sciences, Darling Marine Center, University of Maine, Walpole, ME, USA
| | - Marie R Aggerbeck
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Environmental Science, Aarhus University, Roskilde, Denmark
| | - Thapanee Pruksatrakul
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- National Center for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, Thailand Science Park, Pathum Thani, Thailand
| | - Nathan Chan
- Department of Computer Science, University of California Riverside, Riverside, CA, USA
| | - Yunshu Wang
- Department of Computer Science, University of California Riverside, Riverside, CA, USA
| | - Magdalena Pöchhacker
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Food Chemistry and Toxicology, University of Vienna, Vienna, Austria
| | - Corinna Brungs
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Beatriz Cámara
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Centro de Biotecnología DAL, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | | | - Andres Cumsille
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Centro de Biotecnología DAL, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Fernanda de Oliveira
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
- Department of Biotechnology, Engineering School of Lorena, University of São Paulo, Lorena, São Paulo, Brazil
| | - Kai Dührkop
- Department of Bioinformatics, University of Jena, Jena, Germany
| | - Yasin El Abiead
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Christian Geibel
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Lana G Graves
- Department of Environmental Systems Analysis, University of Tübingen, Tübingen, Germany
- Leibniz Institute of Freshwater Ecology and Inland Fisheries, Berlin, Germany
| | - Martin Hansen
- Department of Environmental Science, Aarhus University, Roskilde, Denmark
| | - Steffen Heuckeroth
- Institute of Inorganic and Analytical Chemistry, University of Münster, Münster, Germany
| | - Simon Knoblauch
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Anastasiia Kostenko
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Mirte C M Kuijpers
- Department of Ecology, Behavior and Evolution, University of California San Diego, San Diego, CA, USA
| | - Kevin Mildau
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Analytical Chemistry, University of Vienna, Vienna, Austria
- Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands
| | | | - Paulo Wender Portal Gomes
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Tilman Schramm
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
- Department of Biochemistry, University of California Riverside, Riverside, CA, USA
| | - Karoline Steuer-Lodd
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
- Department of Biochemistry, University of California Riverside, Riverside, CA, USA
| | - Paolo Stincone
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Sibgha Tayyab
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Giovanni Andrea Vitale
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Berenike C Wagner
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Shipei Xing
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Marquis T Yazzie
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Simone Zuffa
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Martinus de Kruijff
- Helmholtz Institute for Pharmaceutical Research Saarland, Helmholtz Centre for Infection Research, Saarbrücken, Germany
| | - Christine Beemelmanns
- Helmholtz Institute for Pharmaceutical Research Saarland, Helmholtz Centre for Infection Research, Saarbrücken, Germany
- Saarland University, Saarbrücken, Germany
| | - Hannes Link
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Christoph Mayer
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Justin J J van der Hooft
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
| | - Tito Damiani
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Tomáš Pluskal
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Pieter Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Jan Stanstrup
- Department of Nutrition, Exercise and Sports, University of Copenhagen, Frederiksberg C, Denmark
| | - Robin Schmid
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Mingxun Wang
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Computer Science, University of California Riverside, Riverside, CA, USA
| | - Allegra Aron
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Madeleine Ernst
- Section for Clinical Mass Spectrometry, Danish Center for Neonatal Screening, Department of Congenital Disorders, Statens Serum Institut, Copenhagen S, Denmark.
| | - Daniel Petras
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA.
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany.
- Department of Biochemistry, University of California Riverside, Riverside, CA, USA.
| |
Collapse
|
3
|
Anwardeen NR, Diboun I, Mokrab Y, Althani AA, Elrayess MA. Statistical methods and resources for biomarker discovery using metabolomics. BMC Bioinformatics 2023; 24:250. [PMID: 37322419 DOI: 10.1186/s12859-023-05383-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Accepted: 06/09/2023] [Indexed: 06/17/2023] Open
Abstract
Metabolomics is a dynamic tool for elucidating biochemical changes in human health and disease. Metabolic profiles provide a close insight into physiological states and are highly volatile to genetic and environmental perturbations. Variation in metabolic profiles can inform mechanisms of pathology, providing potential biomarkers for diagnosis and assessment of the risk of contracting a disease. With the advancement of high-throughput technologies, large-scale metabolomics data sources have become abundant. As such, careful statistical analysis of intricate metabolomics data is essential for deriving relevant and robust results that can be deployed in real-life clinical settings. Multiple tools have been developed for both data analysis and interpretations. In this review, we survey statistical approaches and corresponding statistical tools that are available for discovery of biomarkers using metabolomics.
Collapse
Affiliation(s)
- Najeha R Anwardeen
- Research and Graduate Studies, Biomedical Research Center, Qatar University, P.O. Box 2713, Doha, Qatar
| | - Ilhame Diboun
- Department of Human Genetics, Sidra Medicine, Doha, Qatar
| | - Younes Mokrab
- Department of Human Genetics, Sidra Medicine, Doha, Qatar
| | - Asma A Althani
- Research and Graduate Studies, Biomedical Research Center, Qatar University, P.O. Box 2713, Doha, Qatar
- QU Health, Qatar University, Doha, Qatar
| | - Mohamed A Elrayess
- Research and Graduate Studies, Biomedical Research Center, Qatar University, P.O. Box 2713, Doha, Qatar.
- QU Health, Qatar University, Doha, Qatar.
| |
Collapse
|
4
|
Chaiyachat P, Kaewseekhao B, Chaiprasert A, Kamolwat P, Nonghanphithak D, Phetcharaburanin J, Sirichoat A, Ong RTH, Faksri K. Metabolomic analysis of Mycobacterium tuberculosis reveals metabolic profiles for identification of drug-resistant tuberculosis. Sci Rep 2023; 13:8655. [PMID: 37244948 DOI: 10.1038/s41598-023-35882-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 05/25/2023] [Indexed: 05/29/2023] Open
Abstract
The detection of pre-extensively (pre-XDR) and extensively drug-resistant tuberculosis (XDR-TB) is challenging. Drug-susceptibility tests for some anti-TB drugs, especially ethambutol (ETH) and ethionamide (ETO), are problematic due to overlapping thresholds to differentiate between susceptible and resistant phenotypes. We aimed to identify possible metabolomic markers to detect Mycobacterium tuberculosis (Mtb) strains causing pre-XDR and XDR-TB. The metabolic patterns of ETH- and ETO-resistant Mtb isolates were also investigated. Metabolomics of 150 Mtb isolates (54 pre-XDR, 63 XDR-TB and 33 pan-susceptible; pan-S) were investigated. Metabolomics of ETH and ETO phenotypically resistant subgroups were analyzed using UHPLC-ESI-QTOF-MS/MS. Orthogonal partial least-squares discriminant analysis revealed distinct separation in all pairwise comparisons among groups. Two metabolites (meso-hydroxyheme and itaconic anhydride) were able to differentiate the pre-XDR and XDR-TB groups from the pan-S group with 100% sensitivity and 100% specificity. In comparisons of the ETH and ETO phenotypically resistant subsets, sets of increased (ETH = 15, ETO = 7) and decreased (ETH = 1, ETO = 6) metabolites specific for the resistance phenotype of each drug were found. We demonstrated the potential for metabolomics of Mtb to differentiate among types of DR-TB as well as between isolates that were phenotypically resistant to ETO and ETH. Thus, metabolomics might be further applied for DR-TB diagnosis and patient management.
Collapse
Affiliation(s)
- Pratchakan Chaiyachat
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Benjawan Kaewseekhao
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Angkana Chaiprasert
- Office for Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Phalin Kamolwat
- Bureau of Tuberculosis, Department of Disease Control, Ministry of Public Health, Nonthaburi, Thailand
| | - Ditthawat Nonghanphithak
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Jutarop Phetcharaburanin
- Department of Systems Biosciences and Computational Medicine, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
| | - Auttawit Sirichoat
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand
| | - Rick Twee-Hee Ong
- Saw Swee Hock School of Public Health, National University of Singapore and National University Health System, Singapore, Singapore
| | - Kiatichai Faksri
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand.
- Research and Diagnostic Center for Emerging Infectious Diseases (RCEID), Khon Kaen University, Khon Kaen, Thailand.
| |
Collapse
|
5
|
Paul I, Bolzan D, Youssef A, Gagnon KA, Hook H, Karemore G, Oliphant MUJ, Lin W, Liu Q, Phanse S, White C, Padhorny D, Kotelnikov S, Chen CS, Hu P, Denis GV, Kozakov D, Raught B, Siggers T, Wuchty S, Muthuswamy SK, Emili A. Parallelized multidimensional analytic framework applied to mammary epithelial cells uncovers regulatory principles in EMT. Nat Commun 2023; 14:688. [PMID: 36755019 PMCID: PMC9908882 DOI: 10.1038/s41467-023-36122-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 01/17/2023] [Indexed: 02/10/2023] Open
Abstract
A proper understanding of disease etiology will require longitudinal systems-scale reconstruction of the multitiered architecture of eukaryotic signaling. Here we combine state-of-the-art data acquisition platforms and bioinformatics tools to devise PAMAF, a workflow that simultaneously examines twelve omics modalities, i.e., protein abundance from whole-cells, nucleus, exosomes, secretome and membrane; N-glycosylation, phosphorylation; metabolites; mRNA, miRNA; and, in parallel, single-cell transcriptomes. We apply PAMAF in an established in vitro model of TGFβ-induced epithelial to mesenchymal transition (EMT) to quantify >61,000 molecules from 12 omics and 10 timepoints over 12 days. Bioinformatics analysis of this EMT-ExMap resource allowed us to identify; -topological coupling between omics, -four distinct cell states during EMT, -omics-specific kinetic paths, -stage-specific multi-omics characteristics, -distinct regulatory classes of genes, -ligand-receptor mediated intercellular crosstalk by integrating scRNAseq and subcellular proteomics, and -combinatorial drug targets (e.g., Hedgehog signaling and CAMK-II) to inhibit EMT, which we validate using a 3D mammary duct-on-a-chip platform. Overall, this study provides a resource on TGFβ signaling and EMT.
Collapse
Affiliation(s)
- Indranil Paul
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Dante Bolzan
- Department of Computer Science, University of Miami, 1356 Memorial Drive, Coral Gables, FL, 33146, USA
| | - Ahmed Youssef
- Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, MA, 02215, USA
| | - Keith A Gagnon
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, MA, 02215, USA
| | - Heather Hook
- Department of Biology, Boston University, 24 Cummington Mall, Boston, MA, 02115, USA
- Biological Design Center, Boston University, 610 Commonwealth Avenue, Boston, MA, 02215, USA
| | - Gopal Karemore
- Advanced Analytics, Novo Nordisk A/S, 2760, Måløv, Denmark
| | - Michael U J Oliphant
- Cancer Research Institute, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, 02115, USA
| | - Weiwei Lin
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Qian Liu
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Manitoba, R3E 0J9, Canada
| | - Sadhna Phanse
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Carl White
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794, Stony Brook, NY, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Sergei Kotelnikov
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794, Stony Brook, NY, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Christopher S Chen
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, MA, 02215, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, 3 Blackfan Circle, Boston, MA, 02115, USA
| | - Pingzhao Hu
- Department of Biochemistry, Western University, London, ON, N6A 5C1, Canada
| | - Gerald V Denis
- Boston Medical Center Cancer Center, Boston University, Boston University, 72 East Concord Street, Boston, MA, 02118, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794, Stony Brook, NY, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Brian Raught
- Discovery Tower (TMDT), 101 College St, Rm. 9-701A, University of Toronto, Toronto, ON, M5G 1L7, Canada
| | - Trevor Siggers
- Department of Biology, Boston University, 24 Cummington Mall, Boston, MA, 02115, USA
- Biological Design Center, Boston University, 610 Commonwealth Avenue, Boston, MA, 02215, USA
| | - Stefan Wuchty
- Department of Computer Science, University of Miami, 1356 Memorial Drive, Coral Gables, FL, 33146, USA
| | - Senthil K Muthuswamy
- Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD, USA
| | - Andrew Emili
- Department of Biochemistry, Boston University School of Medicine, Boston University, 71 East Concord Street, Boston, MA, 02118, USA.
- Department of Biology, Charles River Campus, Boston University, Life Science & Engineering (LSEB-602), 24 Cummington Mall, Boston, MA, 02215, USA.
- Division of Oncological Sciences, Knight Cancer Institute, Oregon Health and Science University, Portland, USA.
| |
Collapse
|
6
|
Peralbo-Molina Á, Solà-Santos P, Perera-Lluna A, Chicano-Gálvez E. Data Processing and Analysis in Mass Spectrometry-Based Metabolomics. Methods Mol Biol 2023; 2571:207-239. [PMID: 36152164 DOI: 10.1007/978-1-0716-2699-3_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Metabolomics is the latest of the omics sciences. It attempts to measure and characterize metabolites-small chemical compounds <1500 Da-on cells, tissue, or biofluids, which are usually products of biological reactions. As metabolic reactions are closer to the phenotype, metabolomics has emerged as an attractive science for various areas of research, including personalized medicine. However, due to the complexity of data obtained and the absence of curated databases for metabolite identification, data processing is the major bottleneck in this area since most technicians lack the required bioinformatics expertise to process datasets in a reliable and fast manner. The aim of this chapter is to describe the available tools for data processing that makes an inexperienced researcher capable of obtaining reliable results without having to undergo through huge parametrization steps.
Collapse
Affiliation(s)
- Ángela Peralbo-Molina
- IMIBIC Mass Spectrometry and Molecular Imaging Unit, Maimonides, Biomedical Research Institute of Cordoba (IMIBIC), Reina Sofia University Hospital, University of Cordoba (UCO), Córdoba, Spain.
| | - Pol Solà-Santos
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain
- Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain
- Institut de Recerca Sant Joan de Déu, Barcelona, Spain
| | - Alexandre Perera-Lluna
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain
- Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain
- Institut de Recerca Sant Joan de Déu, Barcelona, Spain
| | - Eduardo Chicano-Gálvez
- IMIBIC Mass Spectrometry and Molecular Imaging Unit, Maimonides, Biomedical Research Institute of Cordoba (IMIBIC), Reina Sofia University Hospital, University of Cordoba (UCO), Córdoba, Spain
| |
Collapse
|
7
|
Ma A, Qi X. Mining plant metabolomes: Methods, applications, and perspectives. PLANT COMMUNICATIONS 2021; 2:100238. [PMID: 34746766 PMCID: PMC8554038 DOI: 10.1016/j.xplc.2021.100238] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2021] [Revised: 07/31/2021] [Accepted: 09/02/2021] [Indexed: 06/13/2023]
Abstract
Plants produce a variety of metabolites that are essential for plant growth and human health. To fully understand the diversity of metabolites in certain plants, lots of methods have been developed for metabolites detection and data processing. In the data-processing procedure, how to effectively reduce false-positive peaks, analyze large-scale metabolic data, and annotate plant metabolites remains challenging. In this review, we introduce and discuss some prominent methods that could be exploited to solve these problems, including a five-step filtering method for reducing false-positive signals in LC-MS analysis, QPMASS for analyzing ultra-large GC-MS data, and MetDNA for annotating metabolites. The main applications of plant metabolomics in species discrimination, metabolic pathway dissection, population genetic studies, and some other aspects are also highlighted. To further promote the development of plant metabolomics, more effective and integrated methods/platforms for metabolite detection and comprehensive databases for metabolite identification are highly needed. With the improvement of these technologies and the development of genomics and transcriptomics, plant metabolomics will be widely used in many fields.
Collapse
Affiliation(s)
- Aimin Ma
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100049, China
| | - Xiaoquan Qi
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
8
|
Barranco-Altirriba M, Solà-Santos P, Picart-Armada S, Kanaan-Izquierdo S, Fonollosa J, Perera-Lluna A. mWISE: An Algorithm for Context-Based Annotation of Liquid Chromatography-Mass Spectrometry Features through Diffusion in Graphs. Anal Chem 2021; 93:10772-10778. [PMID: 34320315 DOI: 10.1021/acs.analchem.1c00238] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Untargeted metabolomics using liquid chromatography coupled to mass spectrometry (LC-MS) allows the detection of thousands of metabolites in biological samples. However, LC-MS data annotation is still considered a major bottleneck in the metabolomics pipeline since only a small fraction of the metabolites present in the sample can be annotated with the required confidence level. Here, we introduce mWISE (metabolomics wise inference of speck entities), an R package for context-based annotation of LC-MS data. The algorithm consists of three main steps aimed at (i) matching mass-to-charge ratio values to the Kyoto Encyclopedia of Genes and Genomes (KEGG) database, (ii) clustering and filtering the potential KEGG candidates, and (iii) building a final prioritized list using diffusion in graphs. The algorithm performance is evaluated with three publicly available studies using both positive and negative ionization modes. We have also compared mWISE to other available annotation algorithms in terms of their performance and computation time. In particular, we explored four different configurations for mWISE, and all four of them outperform xMSannotator (a state-of-the-art annotator) in terms of both performance and computation time. Using a diffusion configuration that combines the biological network obtained from the FELLA R package and raw scores, mWISE shows a sensitivity mean (standard deviation) across data sets of 0.63 (0.07), while xMSannotator achieves a sensitivity of 0.55 (0.19). We have also shown that the chemical structures of the compounds proposed by mWISE are closer to the original compounds than those proposed by xMSannotator. Finally, we explore the diffusion prioritization separately, showing its key role in the annotation process. mWISE is freely available on GitHub (https://github.com/b2slab/mWISE) under a GPL license.
Collapse
Affiliation(s)
- Maria Barranco-Altirriba
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Av. Diagonal 647, 08028 Barcelona, Spain.,Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), 28029 Madrid, Spain.,Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, 08950 Barcelona, Spain.,Department of Endocrinology and Nutrition, Hospital de la Santa Creu i Sant Pau and Institut d'Investigació Biomèdica Sant Pau (IIB Sant Pau), 08041 Barcelona, Spain
| | - Pol Solà-Santos
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Av. Diagonal 647, 08028 Barcelona, Spain.,Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), 28029 Madrid, Spain.,Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, 08950 Barcelona, Spain
| | - Sergio Picart-Armada
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Av. Diagonal 647, 08028 Barcelona, Spain.,Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), 28029 Madrid, Spain.,Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, 08950 Barcelona, Spain
| | - Samir Kanaan-Izquierdo
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Av. Diagonal 647, 08028 Barcelona, Spain.,Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), 28029 Madrid, Spain.,Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, 08950 Barcelona, Spain
| | - Jordi Fonollosa
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Av. Diagonal 647, 08028 Barcelona, Spain.,Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), 28029 Madrid, Spain.,Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, 08950 Barcelona, Spain
| | - Alexandre Perera-Lluna
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Av. Diagonal 647, 08028 Barcelona, Spain.,Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), 28029 Madrid, Spain.,Institut de Recerca Sant Joan de Déu, Esplugues de Llobregat, 08950 Barcelona, Spain
| |
Collapse
|
9
|
Ahmad Azam A, Ismail IS, Shaikh MF, Abas F, Shaari K. Multi-Platform Metabolomics Analyses Revealed the Complexity of Serum Metabolites in LPS-Induced Neuroinflammed Rats Treated with Clinacanthus nutans Aqueous Extract. Front Pharmacol 2021; 12:629561. [PMID: 34177565 PMCID: PMC8220158 DOI: 10.3389/fphar.2021.629561] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 04/26/2021] [Indexed: 11/13/2022] Open
Abstract
The use of metabolomics as a comprehensive tool in the analysis of metabolic profiles in disease progression and therapeutic intervention is rapidly advancing. Yet, a single analytical platform could not be applied to cover the entire spectrum of a biological sample’s metabolome. In the present paper, multi-platform metabolomics approaches were explored to determine the diverse rat sera metabolites extracted from intracerebroventricular lipopolysaccharides (LPS)-induced neuroinflammed rats treated with oral therapeutic interventions of positive drug (dextromethorphan, 5 mg/kg BW); with Clinacanthus nutans (CN) aqueous extract (CNE, 500 mg/kg BW); and with phosphate buffer saline (PBS) as the control group for 14 days. Analyzed by nuclear magnetic resonance (NMR) and liquid chromatography-mass spectrometry (LC-MS) techniques, this study depicted the potential of metabolites associated with neuroinflammation and verified by MetDisease. The key observations in the perturbed metabolic pathways that showed ameliorative effects were linked to the class of amino acid and peptide metabolism involving valine, leucine, and isoleucine biosynthesis; phenylalanine, tyrosine, and tryptophan biosynthesis; and phenylalanine metabolism. Lipid metabolism of arachidonic acid metabolism, glycerophospholipid metabolism, terpenoid backbone biosynthesis, and glycosphingolipid metabolism were also affected. Current findings suggested that the putative biomarkers, especially lysophosphatidic acid (LPA) and 5-diphosphomevalonic acid from glycerophospholipid and squalene/terpenoid and cholesterol biosynthesis, respectively, showed the ameliorative effects of the drug and CN treatments by controlling cell differentiation and proliferation. Our study proved that the complex and dynamic sera profiling affected during the CN treatment was greatly influenced by the analytical platform selection as integration between the two data yielded a more holistic summary of the metabolite pattern changes. Hence, an evidence-based herb, such as CN, can be used for novel diagnostic tools in the quest for ethnopharmacological studies.
Collapse
Affiliation(s)
- Amalina Ahmad Azam
- Laboratory of Natural Products, Institute of Bioscience, Universiti Putra Malaysia, Serdang, Malaysia
| | - Intan Safinar Ismail
- Laboratory of Natural Products, Institute of Bioscience, Universiti Putra Malaysia, Serdang, Malaysia
| | - Mohd Farooq Shaikh
- Neuropharmacology Research Laboratory, Jeffrey Cheah, School of Medicine and Health Sciences, Monash University Malaysia, Subang Jaya, Malaysia
| | - Faridah Abas
- Laboratory of Natural Products, Institute of Bioscience, Universiti Putra Malaysia, Serdang, Malaysia
| | - Khozirah Shaari
- Laboratory of Natural Products, Institute of Bioscience, Universiti Putra Malaysia, Serdang, Malaysia
| |
Collapse
|
10
|
DBnorm as an R package for the comparison and selection of appropriate statistical methods for batch effect correction in metabolomic studies. Sci Rep 2021; 11:5657. [PMID: 33707505 PMCID: PMC7952378 DOI: 10.1038/s41598-021-84824-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 02/19/2021] [Indexed: 02/07/2023] Open
Abstract
As a powerful phenotyping technology, metabolomics provides new opportunities in biomarker discovery through metabolome-wide association studies (MWAS) and the identification of metabolites having a regulatory effect in various biological processes. While mass spectrometry-based (MS) metabolomics assays are endowed with high throughput and sensitivity, MWAS are doomed to long-term data acquisition generating an overtime-analytical signal drift that can hinder the uncovering of real biologically relevant changes. We developed “dbnorm”, a package in the R environment, which allows for an easy comparison of the model performance of advanced statistical tools commonly used in metabolomics to remove batch effects from large metabolomics datasets. “dbnorm” integrates advanced statistical tools to inspect the dataset structure not only at the macroscopic (sample batches) scale, but also at the microscopic (metabolic features) level. To compare the model performance on data correction, “dbnorm” assigns a score that help users identify the best fitting model for each dataset. In this study, we applied “dbnorm” to two large-scale metabolomics datasets as a proof of concept. We demonstrate that “dbnorm” allows for the accurate selection of the most appropriate statistical tool to efficiently remove the overtime signal drift and to focus on the relevant biological components of complex datasets.
Collapse
|
11
|
Choudhury R, Beezley J, Davis B, Tomeck J, Gratzl S, Golzarri-Arroyo L, Wan J, Raftery D, Baumes J, O'Connell TM. Viime: Visualization and Integration of Metabolomics Experiments. JOURNAL OF OPEN SOURCE SOFTWARE 2020; 5:2410. [PMID: 33768193 PMCID: PMC7990241 DOI: 10.21105/joss.02410] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Metabolomics involves the comprehensive measurement of metabolites from a biological system. The resulting metabolite profiles are influenced by genetics, lifestyle, biological stresses, disease, diet and the environment and therefore provides a more holistic biological readout of the pathological condition of the organism (Beger et al., 2016; Wishart, 2016). The challenge for metabolomics is that no single analytical platform can provide a truly comprehensive coverage of the metabolome. The most commonly used platforms are based on mass-spectrometry (MS) and nuclear magnetic resonance (NMR). Investigators are increasingly using both methods to increase the metabolite coverage. The challenge for this type of multi-platform approach is that the data structure may be very different in these two platforms. For example, NMR data may be reported as a list of spectral features, e.g., bins or peaks with arbitrary intensity units or more directly with named metabolites reported in concentration units ranging from micromolar to millimolar. Some MS approaches can also provide data in the form of identified metabolite concentrations, but given the superior sensitivity of MS, the concentrations can be several orders of magnitude lower than for NMR. Other MS approaches yield data in the form of arbitrary response units where the dynamic range can be more than 6 orders of magnitude. Importantly, the variability and reproducibility of the data may differ across platforms. Given the diversity of data structures (i.e., magnitude and dynamic range) integrating the data from multiple platforms can be challenging. This often leads investigators to analyze the datasets separately, which prevents the observation of potentially interesting relationships and correlations between metabolites detected on different platforms. Viime (VIsualization and Integration of Metabolomics Experiments) is an open-source, web-based application designed to integrate metabolomics data from multiple platforms. The workflow of Viime for data integration and visualization is shown in Figure 1.
Collapse
Affiliation(s)
| | | | | | | | | | - Lilian Golzarri-Arroyo
- Department of Epidemiology and Biostatistics, Indiana University School of Public Health
| | - Jun Wan
- Department of Medical and Molecular Genetics, Indiana University School of Medicine
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine
- Department of BioHealth Informatics, Indiana University School of Informatics and Computing
| | - Daniel Raftery
- Department of Anesthesiology and Pain Medicine, University of Washington
| | | | - Thomas M O'Connell
- Department of Otolaryngology-Head and Neck Surgery, Indiana University School of Medicine
| |
Collapse
|
12
|
Liang D, Liu Q, Zhou K, Jia W, Xie G, Chen T. IP4M: an integrated platform for mass spectrometry-based metabolomics data mining. BMC Bioinformatics 2020; 21:444. [PMID: 33028191 PMCID: PMC7542974 DOI: 10.1186/s12859-020-03786-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 09/28/2020] [Indexed: 12/15/2022] Open
Abstract
Background Metabolomics data analyses rely on the use of bioinformatics tools. Many integrated multi-functional tools have been developed for untargeted metabolomics data processing and have been widely used. More alternative platforms are expected for both basic and advanced users. Results Integrated mass spectrometry-based untargeted metabolomics data mining (IP4M) software was designed and developed. The IP4M, has 62 functions categorized into 8 modules, covering all the steps of metabolomics data mining, including raw data preprocessing (alignment, peak de-convolution, peak picking, and isotope filtering), peak annotation, peak table preprocessing, basic statistical description, classification and biomarker detection, correlation analysis, cluster and sub-cluster analysis, regression analysis, ROC analysis, pathway and enrichment analysis, and sample size and power analysis. Additionally, a KEGG-derived metabolic reaction database was embedded and a series of ratio variables (product/substrate) can be generated with enlarged information on enzyme activity. A new method, GRaMM, for correlation analysis between metabolome and microbiome data was also provided. IP4M provides both a number of parameters for customized and refined analysis (for expert users), as well as 4 simplified workflows with few key parameters (for beginners who are unfamiliar with computational metabolomics). The performance of IP4M was evaluated and compared with existing computational platforms using 2 data sets derived from standards mixture and 2 data sets derived from serum samples, from GC–MS and LC–MS respectively. Conclusion IP4M is powerful, modularized, customizable and easy-to-use. It is a good choice for metabolomics data processing and analysis. Free versions for Windows, MAC OS, and Linux systems are provided.
Collapse
Affiliation(s)
- Dandan Liang
- Shanghai Key Laboratory of Diabetes Mellitus and Center for Translational Medicine, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, 200233, China
| | - Quan Liu
- Human Metabolomics Institute, Inc., Shenzhen, 518109, Guangdong, China
| | - Kejun Zhou
- Human Metabolomics Institute, Inc., Shenzhen, 518109, Guangdong, China
| | - Wei Jia
- Shanghai Key Laboratory of Diabetes Mellitus and Center for Translational Medicine, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, 200233, China.
| | - Guoxiang Xie
- Human Metabolomics Institute, Inc., Shenzhen, 518109, Guangdong, China.
| | - Tianlu Chen
- Shanghai Key Laboratory of Diabetes Mellitus and Center for Translational Medicine, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, 200233, China.
| |
Collapse
|
13
|
Li D, Liu B, Zheng H, Xiao X, Li Z, Luan E, Li W, Yang Y, Wang Y, Long Q, Song J, Zhang G. XY-Meta: A High-Efficiency Search Engine for Large-Scale Metabolome Annotation with Accurate FDR Estimation. Anal Chem 2020; 92:5701-5707. [PMID: 32212716 DOI: 10.1021/acs.analchem.9b03355] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
FDR control has been a huge challenge for large-scale metabolome annotation. Although recent research indicated that the target-decoy strategy could be implemented to estimate FDR, it is hard to perform FDR control due to the difficulty of getting a reliable decoy database because of the complex fragmentation mechanism of metabolites and ubiquitous isomers. To tackle this problem, we developed a decoy generation method, which generates forged spectra from the reference target database by preserving the original reference signals to simulate the presence of isomers of metabolites. Benchmarks on GNPS data sets in Passatutto showed that the decoy database generated by our method is closer to the actual FDR than other methods, especially in the low FDR range (0-0.05). Large-scale metabolite annotation on 35 data sets showed that strict FDR reduced the number of annotated metabolites but increased the spectral efficiency, indicating the necessity of quality control. We recommended that the FDR threshold should be set to 0.01 in large-scale metabolite annotation. We implemented decoy generation, database search, and FDR control into a search engine called XY-Meta. It facilitates large-scale metabolome annotation applications.
Collapse
Affiliation(s)
- Dehua Li
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University, Guangzhou 510632, China.,Shenzhen Digital Life Institute, Shenzhen 518110, China
| | - Binghang Liu
- Shenzhen Digital Life Institute, Shenzhen 518110, China
| | - Hancheng Zheng
- Shenzhen Digital Life Institute, Shenzhen 518110, China.,iCarbonX (Shenzhen) Co., Ltd., Shenzhen 518053, China
| | - Xu Xiao
- Shenzhen Digital Life Institute, Shenzhen 518110, China
| | - Zhenyu Li
- Shenzhen Digital Life Institute, Shenzhen 518110, China
| | - Enhui Luan
- Shenzhen Digital Life Institute, Shenzhen 518110, China
| | - Wei Li
- Shenzhen Webio Co., Ltd., Shenzhen 518110, China
| | - Yaling Yang
- Shenzhen Digital Life Institute, Shenzhen 518110, China
| | - Yalan Wang
- Shenzhen Webio Co., Ltd., Shenzhen 518110, China
| | - Qiaoyun Long
- Shenzhen Webio Co., Ltd., Shenzhen 518110, China
| | - Jiaping Song
- Shenzhen Digital Life Institute, Shenzhen 518110, China
| | - Gong Zhang
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| |
Collapse
|
14
|
Stuart KA, Welsh K, Walker MC, Edrada-Ebel R. Metabolomic tools used in marine natural product drug discovery. Expert Opin Drug Discov 2020; 15:499-522. [PMID: 32026730 DOI: 10.1080/17460441.2020.1722636] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Introduction: The marine environment is a very promising resource for natural product research, with many of these reaching the market as new drugs, especially in the field of cancer therapy as well as the drug discovery pipeline for new antimicrobials. Exploitation for bioactive marine compounds with unique structures and novel bioactivity such as the isoquinoline alkaloid; trabectedin, the polyether macrolide; halichondrin B, and the peptide; dolastatin 10, requires the use of analytical techniques, which can generate unbiased, quantitative, and qualitative data to benefit the biodiscovery process. Metabolomics has shown to bridge this understanding and facilitate the development of new potential drugs from marine sources and particularly their microbial symbionts.Areas covered: In this review, articles on applied secondary metabolomics ranging from 1990-2018 as well as to the last quarter of 2019 were probed to investigate the impact of metabolomics on drug discovery for new antibiotics and cancer treatment.Expert opinion: The current literature review highlighted the effectiveness of metabolomics in the study of targeting biologically active secondary metabolites from marine sources for optimized discovery of potential new natural products to be made accessible to a R&D pipeline.
Collapse
Affiliation(s)
- Kevin Andrew Stuart
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
| | - Keira Welsh
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
| | - Molly Clare Walker
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
| | - RuAngelie Edrada-Ebel
- Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
| |
Collapse
|
15
|
Ali A, Abouleila Y, Shimizu Y, Hiyama E, Emara S, Mashaghi A, Hankemeier T. Single-cell metabolomics by mass spectrometry: Advances, challenges, and future applications. Trends Analyt Chem 2019. [DOI: 10.1016/j.trac.2019.02.033] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
|
16
|
Cardoso S, Afonso T, Maraschin M, Rocha M. WebSpecmine: A Website for Metabolomics Data Analysis and Mining. Metabolites 2019; 9:metabo9100237. [PMID: 31635085 PMCID: PMC6835413 DOI: 10.3390/metabo9100237] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 10/09/2019] [Accepted: 10/15/2019] [Indexed: 11/16/2022] Open
Abstract
Metabolomics data analysis is an important task in biomedical research. The available tools do not provide a wide variety of methods and data types, nor ways to store and share data and results generated. Thus, we have developed WebSpecmine to overcome the aforementioned limitations. WebSpecmine is a web-based application designed to perform the analysis of metabolomics data based on spectroscopic and chromatographic techniques (NMR, Infrared, UV-visible, and Raman, and LC/GC-MS) and compound concentrations. Users, even those not possessing programming skills, can access several analysis methods including univariate, unsupervised and supervised multivariate statistical analysis, as well as metabolite identification and pathway analysis, also being able to create accounts to store their data and results, either privately or publicly. The tool's implementation is based in the R project, including its shiny web-based framework. Webspecmine is freely available, supporting all major browsers. We provide abundant documentation, including tutorials and a user guide with case studies.
Collapse
Affiliation(s)
- Sara Cardoso
- CEB-Centre Biological Engineering, University of Minho, 4710-057 Braga, Portugal.
| | - Telma Afonso
- CEB-Centre Biological Engineering, University of Minho, 4710-057 Braga, Portugal.
| | - Marcelo Maraschin
- Plant Morphogenesis and Biochemistry Laboratory, Federal University of Santa Catarina, Florianópolis SC 88040-900, Brazil.
| | - Miguel Rocha
- CEB-Centre Biological Engineering, University of Minho, 4710-057 Braga, Portugal.
| |
Collapse
|
17
|
Stanstrup J, Broeckling CD, Helmus R, Hoffmann N, Mathé E, Naake T, Nicolotti L, Peters K, Rainer J, Salek RM, Schulze T, Schymanski EL, Stravs MA, Thévenot EA, Treutler H, Weber RJM, Willighagen E, Witting M, Neumann S. The metaRbolomics Toolbox in Bioconductor and beyond. Metabolites 2019; 9:E200. [PMID: 31548506 PMCID: PMC6835268 DOI: 10.3390/metabo9100200] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2019] [Revised: 09/16/2019] [Accepted: 09/17/2019] [Indexed: 11/17/2022] Open
Abstract
Metabolomics aims to measure and characterise the complex composition of metabolites in a biological system. Metabolomics studies involve sophisticated analytical techniques such as mass spectrometry and nuclear magnetic resonance spectroscopy, and generate large amounts of high-dimensional and complex experimental data. Open source processing and analysis tools are of major interest in light of innovative, open and reproducible science. The scientific community has developed a wide range of open source software, providing freely available advanced processing and analysis approaches. The programming and statistics environment R has emerged as one of the most popular environments to process and analyse Metabolomics datasets. A major benefit of such an environment is the possibility of connecting different tools into more complex workflows. Combining reusable data processing R scripts with the experimental data thus allows for open, reproducible research. This review provides an extensive overview of existing packages in R for different steps in a typical computational metabolomics workflow, including data processing, biostatistics, metabolite annotation and identification, and biochemical network and pathway analysis. Multifunctional workflows, possible user interfaces and integration into workflow management systems are also reviewed. In total, this review summarises more than two hundred metabolomics specific packages primarily available on CRAN, Bioconductor and GitHub.
Collapse
Affiliation(s)
- Jan Stanstrup
- Preventive and Clinical Nutrition, University of Copenhagen, Rolighedsvej 30, 1958 Frederiksberg C, Denmark.
| | - Corey D Broeckling
- Proteomics and Metabolomics Facility, Colorado State University, Fort Collins, CO 80523, USA.
| | - Rick Helmus
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, 1098 XH Amsterdam, The Netherlands.
| | - Nils Hoffmann
- Leibniz-Institut für Analytische Wissenschaften-ISAS-e.V., Otto-Hahn-Straße 6b, 44227 Dortmund, Germany.
| | - Ewy Mathé
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, USA.
| | - Thomas Naake
- Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany.
| | - Luca Nicolotti
- The Australian Wine Research Institute, Metabolomics Australia, PO Box 197, Adelaide SA 5064, Australia.
| | - Kristian Peters
- Leibniz Institute of Plant Biochemistry (IPB Halle), Bioinformatics and Scientific Data, 06120 Halle, Germany.
| | - Johannes Rainer
- Institute for Biomedicine, Eurac Research, Affiliated Institute of the University of Lübeck, 39100 Bolzano, Italy.
| | - Reza M Salek
- The International Agency for Research on Cancer, 150 cours Albert Thomas, CEDEX 08, 69372 Lyon, France.
| | - Tobias Schulze
- Department of Effect-Directed Analysis, Helmholtz Centre for Environmental Research-UFZ, Permoserstraße 15, 04318 Leipzig, Germany.
| | - Emma L Schymanski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6 avenue du Swing, L-4367 Belvaux, Luxembourg.
| | - Michael A Stravs
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, Überlandstrasse 133, 8600 Dubendorf, Switzerland.
| | - Etienne A Thévenot
- CEA, LIST, Laboratory for Data Sciences and Decision, MetaboHUB, Gif-Sur-Yvette F-91191, France.
| | - Hendrik Treutler
- Leibniz Institute of Plant Biochemistry (IPB Halle), Bioinformatics and Scientific Data, 06120 Halle, Germany.
| | - Ralf J M Weber
- Phenome Centre Birmingham and School of Biosciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK.
| | - Egon Willighagen
- Department of Bioinformatics-BiGCaT, NUTRIM, Maastricht University, 6229 ER Maastricht, The Netherlands.
| | - Michael Witting
- Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, 85764 Neuherberg, Germany.
- Chair of Analytical Food Chemistry, Technische Universität München, 85354 Weihenstephan, Germany.
| | - Steffen Neumann
- Leibniz Institute of Plant Biochemistry (IPB Halle), Bioinformatics and Scientific Data, 06120 Halle, Germany.
- German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig Deutscher, Platz 5e, 04103 Leipzig, Germany.
| |
Collapse
|
18
|
Llorach R, Favari C, Alonso D, Garcia-Aloy M, Andres-Lacueva C, Urpi-Sarda M. Comparative metabolite fingerprinting of legumes using LC-MS-based untargeted metabolomics. Food Res Int 2019; 126:108666. [PMID: 31732019 DOI: 10.1016/j.foodres.2019.108666] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 04/14/2019] [Accepted: 09/09/2019] [Indexed: 01/08/2023]
Abstract
Legumes are a well-known source of phytochemicals and are commonly believed to have similar composition between different genera. To date, there are no studies evaluating changes in legumes to discover those compounds that help to discriminate for food quality and authenticity. The aim of this work was to characterize and make a comparative analysis of the composition of bioactive compounds between Cicer arietinum L. (chickpea), Lens culinaris L. (lentil) and Phaseolus vulgaris L. (white bean) through an LC-MS-Orbitrap metabolomic approach to establish which compounds discriminate between the three studied legumes. Untargeted metabolomic analysis was carried out by LC-MS-Orbitrap from extracts of freeze-dried legumes prepared from pre-cooked canned legumes. The metabolomic data treatment and statistical analysis were realized by using MAIT R's package, and final identification and characterization was done using MSn experiments. Fold-change evaluation was made through Metaboanalyst 4.0. Results showed 43 identified and characterized compounds displaying differences between the three legumes. Polyphenols, mainly flavonol and flavanol compounds, were the main group with 30 identified compounds, followed by α-galactosides (n = 5). Fatty acyls, prenol lipids, a nucleoside and organic compounds were also characterized. The fold-change analysis showed flavanols as the wider class of discriminative compounds of lentils compared to the other legumes; prenol lipids and eucomic acids were the most discriminative compounds of beans versus other legumes and several phenolic acids (such as primeveroside salycilic), kaempferol derivatives, coumesterol and α-galactosides were the most discriminative compounds of chickpeas. This study highlights the applicability of metabolomics for evaluating which are the characteristic compounds of the different legumes. In addition, it describes the future application of metabolomics as tool for the quality control of foods and authentication of different kinds of legumes.
Collapse
Affiliation(s)
- Rafael Llorach
- Biomarkers and Nutrimetabolomics Laboratory, Department of Nutrition, Food Science and Gastronomy, Food Technology Reference Net (XaRTA), Nutrition and Food Safety Research Institute (INSA), Faculty of Pharmacy and Food Science, Campus Torribera, University of Barcelona, 08028 Barcelona, Spain; CIBER Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 08028 Barcelona, Spain
| | - Claudia Favari
- Biomarkers and Nutrimetabolomics Laboratory, Department of Nutrition, Food Science and Gastronomy, Food Technology Reference Net (XaRTA), Nutrition and Food Safety Research Institute (INSA), Faculty of Pharmacy and Food Science, Campus Torribera, University of Barcelona, 08028 Barcelona, Spain
| | - David Alonso
- Biomarkers and Nutrimetabolomics Laboratory, Department of Nutrition, Food Science and Gastronomy, Food Technology Reference Net (XaRTA), Nutrition and Food Safety Research Institute (INSA), Faculty of Pharmacy and Food Science, Campus Torribera, University of Barcelona, 08028 Barcelona, Spain
| | - Mar Garcia-Aloy
- Biomarkers and Nutrimetabolomics Laboratory, Department of Nutrition, Food Science and Gastronomy, Food Technology Reference Net (XaRTA), Nutrition and Food Safety Research Institute (INSA), Faculty of Pharmacy and Food Science, Campus Torribera, University of Barcelona, 08028 Barcelona, Spain; CIBER Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 08028 Barcelona, Spain
| | - Cristina Andres-Lacueva
- Biomarkers and Nutrimetabolomics Laboratory, Department of Nutrition, Food Science and Gastronomy, Food Technology Reference Net (XaRTA), Nutrition and Food Safety Research Institute (INSA), Faculty of Pharmacy and Food Science, Campus Torribera, University of Barcelona, 08028 Barcelona, Spain; CIBER Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 08028 Barcelona, Spain
| | - Mireia Urpi-Sarda
- Biomarkers and Nutrimetabolomics Laboratory, Department of Nutrition, Food Science and Gastronomy, Food Technology Reference Net (XaRTA), Nutrition and Food Safety Research Institute (INSA), Faculty of Pharmacy and Food Science, Campus Torribera, University of Barcelona, 08028 Barcelona, Spain; CIBER Fragilidad y Envejecimiento Saludable (CIBERFES), Instituto de Salud Carlos III, 08028 Barcelona, Spain.
| |
Collapse
|
19
|
Bell M, Blais JM. "-Omics" workflow for paleolimnological and geological archives: A review. THE SCIENCE OF THE TOTAL ENVIRONMENT 2019; 672:438-455. [PMID: 30965259 DOI: 10.1016/j.scitotenv.2019.03.477] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Revised: 03/29/2019] [Accepted: 03/30/2019] [Indexed: 06/09/2023]
Abstract
"-Omics" is a powerful screening method with applications in molecular biology, toxicology, wildlife biology, natural product discovery, and many other fields. Genomics, proteomics, metabolomics, and lipidomics are common examples included under the "-omics" umbrella. This screening method uses combinations of untargeted, semi-targeted, and targeted analyses paired with data mining to facilitate researchers' understanding of the genome, proteins, and small organic molecules in biological systems. Recently, however, the use of "-omics" has expanded into the fields of geology, specifically petrology, and paleolimnology. Specifically, untargeted analyses stand to transform these fields as petroleomics, and sediment-"omics" become more prevalent. "-Omics" facilitates the visualization of small molecule profiles from environmental matrices (i.e. oil and sediment). Small molecule profiles can provide improved understanding of small molecules distributions throughout the environment, and how those compositions can change depending on conditions (i.e. climate change, weathering, etc.). "-Omics" also facilities discovery of next-generation biomarkers that can be used for oil source identification and as proxies for reconstructing past environmental changes. Untargeted analyses paired with data mining and multivariate statistical analyses represents a powerful suite of tools for hypothesis generation, and new method development for environmental reconstructions. Here we present an introduction to "-omics" methodology, technical terms, and examples of applications to paleolimnology and petrology. The purpose of this review is to highlight the important considerations at each step in the "-omics" workflow to produce high quality and statistically powerful data for petrological and paleolimnological applications.
Collapse
Affiliation(s)
- Madison Bell
- Laboratory for the Analysis of Natural and Synthetic Environmental Toxicants, Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | - Jules M Blais
- Laboratory for the Analysis of Natural and Synthetic Environmental Toxicants, Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
| |
Collapse
|
20
|
Gorrochategui E, Jaumot J, Tauler R. ROIMCR: a powerful analysis strategy for LC-MS metabolomic datasets. BMC Bioinformatics 2019; 20:256. [PMID: 31101001 PMCID: PMC6525397 DOI: 10.1186/s12859-019-2848-8] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 04/25/2019] [Indexed: 01/29/2023] Open
Abstract
BACKGROUND The analysis of LC-MS metabolomic datasets appears to be a challenging task in a wide range of disciplines since it demands the highly extensive processing of a vast amount of data. Different LC-MS data analysis packages have been developed in the last few years to facilitate this analysis. However, most of these strategies involve chromatographic alignment and peak shaping and often associate each "feature" (i.e., chromatographic peak) with a unique m/z measurement. Thus, the development of an alternative data analysis strategy that is applicable to most types of MS datasets and properly addresses these issues is still a challenge in the metabolomics field. RESULTS Here, we present an alternative approach called ROIMCR to: i) filter and compress massive LC-MS datasets while transforming their original structure into a data matrix of features without losing relevant information through the search of regions of interest (ROIs) in the m/z domain and ii) resolve compressed data to identify their contributing pure components without previous alignment or peak shaping by applying a Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) analysis. In this study, the basics of the ROIMCR method are presented in detail and a detailed description of its implementation is also provided. Data were analyzed using the MATLAB (The MathWorks, Inc., www.mathworks.com ) programming and computing environment. The application of the ROIMCR methodology is described in detail, with an example of LC-MS data generated in a lipidomic study and with other examples of recent applications. CONCLUSIONS The methodology presented here combines the benefits of data filtering and compression based on the searching of ROI features, without the loss of spectral accuracy. The method has the benefits of the application of the powerful MCR-ALS data resolution method without the necessity of performing chromatographic peak alignment or modelling. The presented method is a powerful alternative to other existing data analysis approaches that do not use the MCR-ALS method to resolve LC-MS data. The ROIMCR method also represents an improved strategy compared to the direct applications of the MCR-ALS method that use less-powerful data compression strategies such as binning and windowing. Overall, the strategy presented here confirms the usefulness of the ROIMCR chemometrics method for analyzing LC-MS untargeted metabolomics data.
Collapse
Affiliation(s)
- Eva Gorrochategui
- Department of Environmental Chemistry, Institute of Environmental Assessment and Water Research (IDAEA), Consejo Superior de Investigaciones Científicas (CSIC), Jorsi Girona 18-25, Barcelona, 08034, Catalonia, Spain
| | - Joaquim Jaumot
- Department of Environmental Chemistry, Institute of Environmental Assessment and Water Research (IDAEA), Consejo Superior de Investigaciones Científicas (CSIC), Jorsi Girona 18-25, Barcelona, 08034, Catalonia, Spain
| | - Romà Tauler
- Department of Environmental Chemistry, Institute of Environmental Assessment and Water Research (IDAEA), Consejo Superior de Investigaciones Científicas (CSIC), Jorsi Girona 18-25, Barcelona, 08034, Catalonia, Spain.
| |
Collapse
|
21
|
MetaboAnalystR 2.0: From Raw Spectra to Biological Insights. Metabolites 2019; 9:metabo9030057. [PMID: 30909447 PMCID: PMC6468840 DOI: 10.3390/metabo9030057] [Citation(s) in RCA: 207] [Impact Index Per Article: 41.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 03/20/2019] [Accepted: 03/21/2019] [Indexed: 02/07/2023] Open
Abstract
Global metabolomics based on high-resolution liquid chromatography mass spectrometry (LC-MS) has been increasingly employed in recent large-scale multi-omics studies. Processing and interpretation of these complex metabolomics datasets have become a key challenge in current computational metabolomics. Here, we introduce MetaboAnalystR 2.0 for comprehensive LC-MS data processing, statistical analysis, and functional interpretation. Compared to the previous version, this new release seamlessly integrates XCMS and CAMERA to support raw spectral processing and peak annotation, and also features high-performance implementations of mummichog and GSEA approaches for predictions of pathway activities. The application and utility of the MetaboAnalystR 2.0 workflow were demonstrated using a synthetic benchmark dataset and a clinical dataset. In summary, MetaboAnalystR 2.0 offers a unified and flexible workflow that enables end-to-end analysis of LC-MS metabolomics data within the open-source R environment.
Collapse
|
22
|
Bae SH, Jo A, Park JH, Lim CW, Choi Y, Oh J, Park JM, Kong T, Weissleder R, Lee H, Moon J. Bioassay for monitoring the anti-aging effect of cord blood treatment. Theranostics 2019; 9:1-10. [PMID: 30662549 PMCID: PMC6332798 DOI: 10.7150/thno.30422] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Accepted: 11/18/2018] [Indexed: 12/31/2022] Open
Abstract
Background: Treating aged animals with plasma of an early developmental stage (e.g, umbilical cord plasma) showed an impressive potential to slow age-associated degradation of neuronal and cognitive functions. Translating such findings to clinical realities, however, requires effective ways for assessing treatment efficacy; ideal methods should be minimally invasive, amenable for serial assays, cost-effective, and quantitative. Methods: We developed a new biosensor approach to monitor anti-aging therapy. We advanced two key sensor components: i) a blood-borne metabolite was identified as a surrogate aging-marker; and ii) a compact and cost-effective assay system was developed for on-site applications. We treated aged mice either with human umbilical cord plasma or saline; unbiased metabolite profiling on mouse plasma revealed arachidonic acid (AA) as a potent indicator associated with anti-aging effect. We next implemented a competitive magneto-electrochemical sensor (cMES) optimized for AA detection directly from plasma. The developed platform could detect AA directly from small volumes of plasma (0.5 µL) within 1.5 hour. Results: cMES assays confirmed a strong correlation between AA levels and anti-aging effect: AA levels, while decreasing with aging, increased in the plasma-treated aged mice which also showed improved learning and memory performance. Conclusions: The cMES platform will empower both pre- and clinical anti-aging research by enabling minimally invasive, longitudinal treatment surveillance; these capacities will accelerate the development of anti-aging therapies, improving the quality of individual lives.
Collapse
Affiliation(s)
- Sang-Hun Bae
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
| | - Ala Jo
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
- Center for Systems Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Jae Hyun Park
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
| | - Chul-Woo Lim
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
| | - Yuri Choi
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
| | - Juhyun Oh
- Center for Systems Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Ji-Min Park
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
| | - TaeHo Kong
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
| | - Ralph Weissleder
- Center for Systems Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Hakho Lee
- Center for Systems Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Jisook Moon
- Department of Biotechnology, College of Life Science, CHA University, Gyeonggi-do 13488, Republic of Korea
| |
Collapse
|
23
|
Picart-Armada S, Fernández-Albert F, Vinaixa M, Yanes O, Perera-Lluna A. FELLA: an R package to enrich metabolomics data. BMC Bioinformatics 2018; 19:538. [PMID: 30577788 PMCID: PMC6303911 DOI: 10.1186/s12859-018-2487-5] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 11/12/2018] [Indexed: 12/27/2022] Open
Abstract
Background Pathway enrichment techniques are useful for understanding experimental metabolomics data. Their purpose is to give context to the affected metabolites in terms of the prior knowledge contained in metabolic pathways. However, the interpretation of a prioritized pathway list is still challenging, as pathways show overlap and cross talk effects. Results We introduce FELLA, an R package to perform a network-based enrichment of a list of affected metabolites. FELLA builds a hierarchical representation of an organism biochemistry from the Kyoto Encyclopedia of Genes and Genomes (KEGG), containing pathways, modules, enzymes, reactions and metabolites. In addition to providing a list of pathways, FELLA reports intermediate entities (modules, enzymes, reactions) that link the input metabolites to them. This sheds light on pathway cross talk and potential enzymes or metabolites as targets for the condition under study. FELLA has been applied to six public datasets –three from Homo sapiens, two from Danio rerio and one from Mus musculus– and has reproduced findings from the original studies and from independent literature. Conclusions The R package FELLA offers an innovative enrichment concept starting from a list of metabolites, based on a knowledge graph representation of the KEGG database that focuses on interpretability. Besides reporting a list of pathways, FELLA suggests intermediate entities that are of interest per se. Its usefulness has been shown at several molecular levels on six public datasets, including human and animal models. The user can run the enrichment analysis through a simple interactive graphical interface or programmatically. FELLA is publicly available in Bioconductor under the GPL-3 license. Electronic supplementary material The online version of this article (10.1186/s12859-018-2487-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sergio Picart-Armada
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, 08028, Spain. .,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, 28029, Spain. .,Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Barcelona, 08950, Spain.
| | - Francesc Fernández-Albert
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, 08028, Spain.,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, 28029, Spain.,Takeda Cambridge Ltd, Cambridge, CB4 0PZ, UK
| | - Maria Vinaixa
- Metabolomics Platform, IISPV, Department of Electronic Engineering (DEEEA), Universitat Rovira i Virgili, Tarragona, 43003, Spain.,CIBER of Diabetes and Associated Metabolic Diseases (CIBERDEM), Madrid, 28029, Spain
| | - Oscar Yanes
- Metabolomics Platform, IISPV, Department of Electronic Engineering (DEEEA), Universitat Rovira i Virgili, Tarragona, 43003, Spain.,CIBER of Diabetes and Associated Metabolic Diseases (CIBERDEM), Madrid, 28029, Spain
| | - Alexandre Perera-Lluna
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, 08028, Spain.,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, 28029, Spain.,Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Barcelona, 08950, Spain
| |
Collapse
|
24
|
Baira E, Dagla I, Siapi E, Zoumpoulakis P, Simitzis P, Goliomytis M, Deligeorgis SG, Skaltsounis AL, Gikas E. UHPLC–HRMS-based tissue untargeted metabolomics study of naringin and hesperidin after dietary supplementation in chickens. Food Chem 2018; 269:276-285. [DOI: 10.1016/j.foodchem.2018.06.146] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2018] [Revised: 06/26/2018] [Accepted: 06/30/2018] [Indexed: 01/06/2023]
|
25
|
Degradation of cocoa proteins into oligopeptides during spontaneous fermentation of cocoa beans. Food Res Int 2018; 109:506-516. [DOI: 10.1016/j.foodres.2018.04.068] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Revised: 04/27/2018] [Accepted: 04/29/2018] [Indexed: 12/11/2022]
|
26
|
Untargeted Profiling of Concordant/Discordant Phenotypes of High Insulin Resistance and Obesity To Predict the Risk of Developing Diabetes. J Proteome Res 2018; 17:2307-2317. [PMID: 29905079 DOI: 10.1021/acs.jproteome.7b00855] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
This study explores the metabolic profiles of concordant/discordant phenotypes of high insulin resistance (IR) and obesity. Through untargeted metabolomics (LC-ESI-QTOF-MS), we analyzed the fasting serum of subjects with high IR and/or obesity ( n = 64). An partial least-squares discriminant analysis with orthogonal signal correction followed by univariate statistics and enrichment analysis allowed exploration of these metabolic profiles. A multivariate regression method (LASSO) was used for variable selection and a predictive biomarker model to identify subjects with high IR regardless of obesity was built. Adrenic acid and a dyglyceride (DG) were shared by high IR and obesity. Uric and margaric acids, 14 DGs, ketocholesterol, and hydroxycorticosterone were unique to high IR, while arachidonic, hydroxyeicosatetraenoic (HETE), palmitoleic, triHETE, and glycocholic acids, HETE lactone, leukotriene B4, and two glutamyl-peptides to obesity. DGs and adrenic acid differed in concordant/discordant phenotypes, thereby revealing protective mechanisms against high IR also in obesity. A biomarker model formed by DGs, uric and adrenic acids presented a high predictive power to identify subjects with high IR [AUC 80.1% (68.9-91.4)]. These findings could become relevant for diabetes risk detection and unveil new potential targets in therapeutic treatments of IR, diabetes, and obesity. An independent validated cohort is needed to confirm these results.
Collapse
|
27
|
Domingo-Almenara X, Montenegro-Burke JR, Benton HP, Siuzdak G. Annotation: A Computational Solution for Streamlining Metabolomics Analysis. Anal Chem 2018; 90:480-489. [PMID: 29039932 PMCID: PMC5750104 DOI: 10.1021/acs.analchem.7b03929] [Citation(s) in RCA: 105] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Metabolite identification is still considered an imposing bottleneck in liquid chromatography mass spectrometry (LC/MS) untargeted metabolomics. The identification workflow usually begins with detecting relevant LC/MS peaks via peak-picking algorithms and retrieving putative identities based on accurate mass searching. However, accurate mass search alone provides poor evidence for metabolite identification. For this reason, computational annotation is used to reveal the underlying metabolites monoisotopic masses, improving putative identification in addition to confirmation with tandem mass spectrometry. This review examines LC/MS data from a computational and analytical perspective, focusing on the occurrence of neutral losses and in-source fragments, to understand the challenges in computational annotation methodologies. Herein, we examine the state-of-the-art strategies for computational annotation including: (i) peak grouping or full scan (MS1) pseudo-spectra extraction, i.e., clustering all mass spectral signals stemming from each metabolite; (ii) annotation using ion adduction and mass distance among ion peaks; (iii) incorporation of biological knowledge such as biotransformations or pathways; (iv) tandem MS data; and (v) metabolite retention time calibration, usually achieved by prediction from molecular descriptors. Advantages and pitfalls of each of these strategies are discussed, as well as expected future trends in computational annotation.
Collapse
Affiliation(s)
- Xavier Domingo-Almenara
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - J Rafael Montenegro-Burke
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - H Paul Benton
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Gary Siuzdak
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| |
Collapse
|
28
|
Blum BC, Mousavi F, Emili A. Single-platform ‘multi-omic’ profiling: unified mass spectrometry and computational workflows for integrative proteomics–metabolomics analysis. Mol Omics 2018; 14:307-319. [DOI: 10.1039/c8mo00136g] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Advances in instrumentation and analysis tools are permitting evermore comprehensive interrogation of diverse biomolecules and allowing investigators to move from linear signaling cascades to network models, which more accurately reflect the molecular basis of biological systems and processes.
Collapse
Affiliation(s)
- Benjamin C. Blum
- Center for Network Systems Biology
- Boston University School of Medicine
- Boston
- USA
- Department of Biochemistry
| | - Fatemeh Mousavi
- Donnelly Centre
- Department of Molecular Genetics
- University of Toronto
- Toronto
- Canada
| | - Andrew Emili
- Center for Network Systems Biology
- Boston University School of Medicine
- Boston
- USA
- Department of Biochemistry
| |
Collapse
|
29
|
Picart-Armada S, Fernández-Albert F, Vinaixa M, Rodríguez MA, Aivio S, Stracker TH, Yanes O, Perera-Lluna A. Null diffusion-based enrichment for metabolomics data. PLoS One 2017; 12:e0189012. [PMID: 29211807 PMCID: PMC5718512 DOI: 10.1371/journal.pone.0189012] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Accepted: 11/16/2017] [Indexed: 12/11/2022] Open
Abstract
Metabolomics experiments identify metabolites whose abundance varies as the conditions under study change. Pathway enrichment tools help in the identification of key metabolic processes and in building a plausible biological explanation for these variations. Although several methods are available for pathway enrichment using experimental evidence, metabolomics does not yet have a comprehensive overview in a network layout at multiple molecular levels. We propose a novel pathway enrichment procedure for analysing summary metabolomics data based on sub-network analysis in a graph representation of a reference database. Relevant entries are extracted from the database according to statistical measures over a null diffusive process that accounts for network topology and pathway crosstalk. Entries are reported as a sub-pathway network, including not only pathways, but also modules, enzymes, reactions and possibly other compound candidates for further analyses. This provides a richer biological context, suitable for generating new study hypotheses and potential enzymatic targets. Using this method, we report results from cells depleted for an uncharacterised mitochondrial gene using GC and LC-MS data and employing KEGG as a knowledge base. Partial validation is provided with NMR-based tracking of 13C glucose labelling of these cells.
Collapse
Affiliation(s)
- Sergio Picart-Armada
- Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain.,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain.,Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Barcelona, Spain
| | - Francesc Fernández-Albert
- Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain.,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain.,Takeda Cambridge Ltd, Cambridge, United Kingdom
| | - Maria Vinaixa
- Centre for Omic Sciences, Rovira i Virgili University, Reus, Spain.,Department of Electronic Engineering, Rovira i Virgili University, Tarragona, Spain.,Metabolomics Platform, Spanish Biomedical Research Centre in Diabetes and Associated Metabolic Disorders, Madrid, Spain
| | | | - Suvi Aivio
- Institute for Research in Biomedicine, Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Travis H Stracker
- Institute for Research in Biomedicine, Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Oscar Yanes
- Centre for Omic Sciences, Rovira i Virgili University, Reus, Spain.,Department of Electronic Engineering, Rovira i Virgili University, Tarragona, Spain.,Metabolomics Platform, Spanish Biomedical Research Centre in Diabetes and Associated Metabolic Disorders, Madrid, Spain
| | - Alexandre Perera-Lluna
- Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain.,Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain.,Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Barcelona, Spain
| |
Collapse
|
30
|
Bowler RP, Wendt CH, Fessler MB, Foster MW, Kelly RS, Lasky-Su J, Rogers AJ, Stringer KA, Winston BW. New Strategies and Challenges in Lung Proteomics and Metabolomics. An Official American Thoracic Society Workshop Report. Ann Am Thorac Soc 2017; 14:1721-1743. [PMID: 29192815 PMCID: PMC5946579 DOI: 10.1513/annalsats.201710-770ws] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
This document presents the proceedings from the workshop entitled, "New Strategies and Challenges in Lung Proteomics and Metabolomics" held February 4th-5th, 2016, in Denver, Colorado. It was sponsored by the National Heart Lung Blood Institute, the American Thoracic Society, the Colorado Biological Mass Spectrometry Society, and National Jewish Health. The goal of this workshop was to convene, for the first time, relevant experts in lung proteomics and metabolomics to discuss and overcome specific challenges in these fields that are unique to the lung. The main objectives of this workshop were to identify, review, and/or understand: (1) emerging technologies in metabolomics and proteomics as applied to the study of the lung; (2) the unique composition and challenges of lung-specific biological specimens for metabolomic and proteomic analysis; (3) the diverse informatics approaches and databases unique to metabolomics and proteomics, with special emphasis on the lung; (4) integrative platforms across genetic and genomic databases that can be applied to lung-related metabolomic and proteomic studies; and (5) the clinical applications of proteomics and metabolomics. The major findings and conclusions of this workshop are summarized at the end of the report, and outline the progress and challenges that face these rapidly advancing fields.
Collapse
|
31
|
Evaluation of intensity drift correction strategies using MetaboDrift, a normalization tool for multi-batch metabolomics data. J Chromatogr A 2017; 1523:265-274. [PMID: 28927937 DOI: 10.1016/j.chroma.2017.09.023] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Revised: 09/07/2017] [Accepted: 09/08/2017] [Indexed: 12/18/2022]
Abstract
In recent years, mass spectrometry-based metabolomics has increasingly been applied to large-scale epidemiological studies of human subjects. However, the successful use of metabolomics in this context is subject to the challenge of detecting biologically significant effects despite substantial intensity drift that often occurs when data are acquired over a long period or in multiple batches. Numerous computational strategies and software tools have been developed to aid in correcting for intensity drift in metabolomics data, but most of these techniques are implemented using command-line driven software and custom scripts which are not accessible to all end users of metabolomics data. Further, it has not yet become routine practice to assess the quantitative accuracy of drift correction against techniques which enable true absolute quantitation such as isotope dilution mass spectrometry. We developed an Excel-based tool, MetaboDrift, to visually evaluate and correct for intensity drift in a multi-batch liquid chromatography - mass spectrometry (LC-MS) metabolomics dataset. The tool enables drift correction based on either quality control (QC) samples analyzed throughout the batches or using QC-sample independent methods. We applied MetaboDrift to an original set of clinical metabolomics data from a mixed-meal tolerance test (MMTT). The performance of the method was evaluated for multiple classes of metabolites by comparison with normalization using isotope-labeled internal standards. QC sample-based intensity drift correction significantly improved correlation with IS-normalized data, and resulted in detection of additional metabolites with significant physiological response to the MMTT. The relative merits of different QC-sample curve fitting strategies are discussed in the context of batch size and drift pattern complexity. Our drift correction tool offers a practical, simplified approach to drift correction and batch combination in large metabolomics studies.
Collapse
|
32
|
Hakeem Said I, Rezk A, Hussain I, Grimbs A, Shrestha A, Schepker H, Brix K, Ullrich MS, Kuhnert N. Metabolome Comparison of Bioactive and Inactive Rhododendron Extracts and Identification of an Antibacterial Cannabinoid(s) from Rhododendron collettianum. PHYTOCHEMICAL ANALYSIS : PCA 2017; 28:454-464. [PMID: 28612345 DOI: 10.1002/pca.2694] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 04/18/2017] [Indexed: 06/07/2023]
Abstract
INTRODUCTION The science of metabolomics offers the possibility to measure full secondary plant metabolomes with limited experimental effort to allow identification of metabolome differences using principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) of liquid chromatography mass spectrometry (LC-MS) data. OBJECTIVE To demonstrate a bioinformatics driven hypothesis generator for identification of biologically active compounds in plant crude extracts, which is validated by activity guided fractionation. METHODOLOGY Crude extracts of Rhododendron leaves were tested for their antibacterial activity using agar diffusion and minimum inhibitory concentration assays. Extracts were profiled by LC-MS. PCA and PLS-DA were used for differentiation of bioactive and inactive extracts and their metabolites. Preparative-high performance liquid chromatography (HPLC) and nuclear magnetic resonance (NMR) spectroscopy were used for separation and structure elucidation of pure compound(s) respectively. RESULTS An antibacterial Rhododendron collettianum was compared to a series of inactive extracts. Three metabolites were found to distinguish R. collettianum from other species indicating the ability of PCA and PLS-DA to suggest potential bioactive substances. An activity-guided fractionation of R. collettianum extracts was carried out and cannabiorcichromenic acid (CCA) was identified as antibacterial compound thereby validating the PCA and PLS-DA generated hypothesis. Four mammalian cell lines were used to estimate possible cytotoxicity of CCA. CONCLUSION It was shown that bioinformatics tools facilitate early stage identification of a biologically active compound(s) using LC-MS data, which reduce complexity and number of separation steps in bioactive screening. Copyright © 2017 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Inamullah Hakeem Said
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| | - Ahmed Rezk
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| | - Ishtiaq Hussain
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| | - Anne Grimbs
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| | - Abhinandan Shrestha
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| | - Hartwig Schepker
- Stiftung Bremer Rhododendronpark, Deliusweg 40, 28359, Bremen, Germany
| | - Klaudia Brix
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| | - Matthias S Ullrich
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| | - Nikolai Kuhnert
- Department of Life Sciences and Chemistry, Jacobs University Bremen, 28759, Bremen, Germany
| |
Collapse
|
33
|
Perez de Souza L, Naake T, Tohge T, Fernie AR. From chromatogram to analyte to metabolite. How to pick horses for courses from the massive web resources for mass spectral plant metabolomics. Gigascience 2017; 6:1-20. [PMID: 28520864 PMCID: PMC5499862 DOI: 10.1093/gigascience/gix037] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Revised: 05/08/2017] [Accepted: 05/12/2017] [Indexed: 01/19/2023] Open
Abstract
The grand challenge currently facing metabolomics is the expansion of the coverage of the metabolome from a minor percentage of the metabolic complement of the cell toward the level of coverage afforded by other post-genomic technologies such as transcriptomics and proteomics. In plants, this problem is exacerbated by the sheer diversity of chemicals that constitute the metabolome, with the number of metabolites in the plant kingdom generally considered to be in excess of 200 000. In this review, we focus on web resources that can be exploited in order to improve analyte and ultimately metabolite identification and quantification. There is a wide range of available software that not only aids in this but also in the related area of peak alignment; however, for the uninitiated, choosing which program to use is a daunting task. For this reason, we provide an overview of the pros and cons of the software as well as comments regarding the level of programing skills required to effectively exploit their basic functions. In addition, the torrent of available genome and transcriptome sequences that followed the advent of next-generation sequencing has opened up further valuable resources for metabolite identification. All things considered, we posit that only via a continued communal sharing of information such as that deposited in the databases described within the article are we likely to be able to make significant headway toward improving our coverage of the plant metabolome.
Collapse
Affiliation(s)
- Leonardo Perez de Souza
- Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Thomas Naake
- Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Takayuki Tohge
- Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Alisdair R Fernie
- Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| |
Collapse
|
34
|
Origin-based polyphenolic fingerprinting of Theobroma cacao in unfermented and fermented beans. Food Res Int 2017; 99:550-559. [PMID: 28784516 DOI: 10.1016/j.foodres.2017.06.007] [Citation(s) in RCA: 58] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Revised: 05/31/2017] [Accepted: 06/02/2017] [Indexed: 01/03/2023]
Abstract
A comprehensive analysis of cocoa polyphenols from unfermented and fermented cocoa beans from a wide range of geographic origins was carried out to catalogue systematic differences based on their origin as well as fermentation status. This study identifies previously unknown compounds with the goal to ascertain, which of these are responsible for the largest differences between bean types. UHPLC coupled with ultra-high resolution time-of-flight mass spectrometry was employed to identify and relatively quantify various oligomeric proanthocyanidins and their glycosides amongst several other unreported compounds. A series of biomarkers allowing a clear distinction between unfermented and fermented cocoa beans and for beans of different origins were identified. The large sample set employed allowed comparison of statistically significant variations of key cocoa constituents.
Collapse
|
35
|
Wen B, Mei Z, Zeng C, Liu S. metaX: a flexible and comprehensive software for processing metabolomics data. BMC Bioinformatics 2017; 18:183. [PMID: 28327092 PMCID: PMC5361702 DOI: 10.1186/s12859-017-1579-y] [Citation(s) in RCA: 431] [Impact Index Per Article: 61.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Accepted: 03/03/2017] [Indexed: 12/14/2022] Open
Abstract
Background Non-targeted metabolomics based on mass spectrometry enables high-throughput profiling of the metabolites in a biological sample. The large amount of data generated from mass spectrometry requires intensive computational processing for annotation of mass spectra and identification of metabolites. Computational analysis tools that are fully integrated with multiple functions and are easily operated by users who lack extensive knowledge in programing are needed in this research field. Results We herein developed an R package, metaX, that is capable of end-to-end metabolomics data analysis through a set of interchangeable modules. Specifically, metaX provides several functions, such as peak picking and annotation, data quality assessment, missing value imputation, data normalization, univariate and multivariate statistics, power analysis and sample size estimation, receiver operating characteristic analysis, biomarker selection, pathway annotation, correlation network analysis, and metabolite identification. In addition, metaX offers a web-based interface (http://metax.genomics.cn) for data quality assessment and normalization method evaluation, and it generates an HTML-based report with a visualized interface. The metaX utilities were demonstrated with a published metabolomics dataset on a large scale. The software is available for operation as either a web-based graphical user interface (GUI) or in the form of command line functions. The package and the example reports are available at http://metax.genomics.cn/. Conclusions The pipeline of metaX is platform-independent and is easy to use for analysis of metabolomics data generated from mass spectrometry. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1579-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Bo Wen
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank-Shenzhen, BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
| | - Zhanlong Mei
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank-Shenzhen, BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
| | - Chunwei Zeng
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank-Shenzhen, BGI-Shenzhen, Shenzhen, Guangdong, 518083, China
| | - Siqi Liu
- BGI-Shenzhen, Shenzhen, 518083, China. .,China National GeneBank-Shenzhen, BGI-Shenzhen, Shenzhen, Guangdong, 518083, China.
| |
Collapse
|
36
|
Grimbs A, Shrestha A, Rezk ASD, Grimbs S, Hakeem Said I, Schepker H, Hütt MT, Albach DC, Brix K, Kuhnert N, Ullrich MS. Bioactivity in Rhododendron: A Systemic Analysis of Antimicrobial and Cytotoxic Activities and Their Phylogenetic and Phytochemical Origins. FRONTIERS IN PLANT SCIENCE 2017; 8:551. [PMID: 28450876 PMCID: PMC5390042 DOI: 10.3389/fpls.2017.00551] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2017] [Accepted: 03/27/2017] [Indexed: 05/20/2023]
Abstract
The exceptional diversity of the genus Rhododendron has a strong potential for identification, characterization, and production of bioactive lead compounds for health purposes. A particularly relevant field of application is the search for new antibiotics. Here, we present a comparative analysis of nearly 90 Rhododendron species targeted toward the search for such candidate substances. Through a combination of phytochemical profiles with antimicrobial susceptibility and cytotoxicity, complemented by phylogenetic analyses, we identify seven potentially antimicrobial active but non-cytotoxic compounds in terms of mass-to-charge ratios and retention times. Exemplary bioactivity-guided fractionation for a promising Rhododendron species experimentally supports in fact one of these candidate lead compounds. By combining categorical correlation analysis with Boolean operations, we have been able to investigate the origin of bioactive effects in further detail. Intriguingly, we discovered clear indications of systems effects (synergistic interactions and functional redundancies of compounds) in the manifestation of antimicrobial activities in this plant genus.
Collapse
Affiliation(s)
- Anne Grimbs
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
| | - Abhinandan Shrestha
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
| | - Ahmed S. D. Rezk
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
| | - Sergio Grimbs
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
| | | | | | - Marc-Thorsten Hütt
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
| | - Dirk C. Albach
- Institute for Biology and Environmental Sciences, Carl von Ossietzky University OldenburgOldenburg, Germany
| | - Klaudia Brix
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
| | - Nikolai Kuhnert
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
| | - Matthias S. Ullrich
- Department for Life Sciences and Chemistry, Jacobs University BremenBremen, Germany
- *Correspondence: Matthias S. Ullrich
| |
Collapse
|
37
|
Spicer R, Salek RM, Moreno P, Cañueto D, Steinbeck C. Navigating freely-available software tools for metabolomics analysis. Metabolomics 2017; 13:106. [PMID: 28890673 PMCID: PMC5550549 DOI: 10.1007/s11306-017-1242-7] [Citation(s) in RCA: 142] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Accepted: 07/25/2017] [Indexed: 12/21/2022]
Abstract
INTRODUCTION The field of metabolomics has expanded greatly over the past two decades, both as an experimental science with applications in many areas, as well as in regards to data standards and bioinformatics software tools. The diversity of experimental designs and instrumental technologies used for metabolomics has led to the need for distinct data analysis methods and the development of many software tools. OBJECTIVES To compile a comprehensive list of the most widely used freely available software and tools that are used primarily in metabolomics. METHODS The most widely used tools were selected for inclusion in the review by either ≥ 50 citations on Web of Science (as of 08/09/16) or the use of the tool being reported in the recent Metabolomics Society survey. Tools were then categorised by the type of instrumental data (i.e. LC-MS, GC-MS or NMR) and the functionality (i.e. pre- and post-processing, statistical analysis, workflow and other functions) they are designed for. RESULTS A comprehensive list of the most used tools was compiled. Each tool is discussed within the context of its application domain and in relation to comparable tools of the same domain. An extended list including additional tools is available at https://github.com/RASpicer/MetabolomicsTools which is classified and searchable via a simple controlled vocabulary. CONCLUSION This review presents the most widely used tools for metabolomics analysis, categorised based on their main functionality. As future work, we suggest a direct comparison of tools' abilities to perform specific data analysis tasks e.g. peak picking.
Collapse
Affiliation(s)
- Rachel Spicer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Reza M. Salek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Pablo Moreno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Daniel Cañueto
- Metabolomics Platform, IISPV, DEEEA, Universitat Rovira i Virgili, Campus Sescelades, Carretera de Valls, s/n, 43007 Tarragona, Catalonia Spain
| | - Christoph Steinbeck
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
- Friedrich-Schiller-University Jena, Lessingstr. 8, Jena, 07743 Germany
| |
Collapse
|
38
|
Post-acquisition spectral stitching. An alternative approach for data processing in untargeted metabolomics by UHPLC-ESI(-)-HRMS. J Chromatogr B Analyt Technol Biomed Life Sci 2016; 1047:106-114. [PMID: 27825627 DOI: 10.1016/j.jchromb.2016.10.030] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Revised: 10/11/2016] [Accepted: 10/23/2016] [Indexed: 11/24/2022]
Abstract
INTRODUCTION In the case of the MS-based metabolomics, the large number of false positives remains a fundamental issue. OBJECTIVE The aim of this study was to develop a new strategy, which highlights the number of the reliable features i.e. the detected features that correspond to a consistent peak according to chromatographic and mass spectrometric criteria. METHOD For the analysis blood samples from 20 chickens, which were administrated with naringin and 9 samples from control, were analyzed by UHPLC-HRMS (Orbitrap Velos). Two methodologies have been compared for data processing. In the first one (classical approach), all data in the 100-900 m/z mass-to charge range were included for the data processing procedure whereas for the newly developed methodology, the data were shred in 100Da slices generating 8 datasets, which have been then subjected to the downstream MS data processing. Each dataset was treated separately and the m/z_tR features obtained by either VIP's or t-test values were merged and used as the input for the construction of the general model. RESULTS The new methodology resulted to a 4-fold increase of the peaks that could be considered chromatographically and mass spectrometrically valid. CONCLUSION A new strategy was reported on the detection of chromatographically reliable features during a metabolomic approach. The shredding of the LC-MS chromatograms into multiple m/z ranges increased the number of the identified chromatographically reliable features.
Collapse
|
39
|
Mahieu NG, Spalding JL, Gelman SJ, Patti GJ. Defining and Detecting Complex Peak Relationships in Mass Spectral Data: The Mz.unity Algorithm. Anal Chem 2016; 88:9037-46. [PMID: 27513885 PMCID: PMC6427821 DOI: 10.1021/acs.analchem.6b01702] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Analysis of a single analyte by mass spectrometry can result in the detection of more than 100 degenerate peaks. These degenerate peaks complicate spectral interpretation and are challenging to annotate. In mass spectrometry-based metabolomics, this degeneracy leads to inflated false discovery rates, data sets containing an order of magnitude more features than analytes, and an inefficient use of resources during data analysis. Although software has been introduced to annotate spectral degeneracy, current approaches are unable to represent several important classes of peak relationships. These include heterodimers and higher complex adducts, distal fragments, relationships between peaks in different polarities, and complex adducts between features and background peaks. Here we outline sources of peak degeneracy in mass spectra that are not annotated by current approaches and introduce a software package called mz.unity to detect these relationships in accurate mass data. Using mz.unity, we find that data sets contain many more complex relationships than we anticipated. Examples include the adduct of glutamate and nicotinamide adenine dinucleotide (NAD), fragments of NAD detected in the same or opposite polarities, and the adduct of glutamate and a background peak. Further, the complex relationships we identify show that several assumptions commonly made when interpreting mass spectral degeneracy do not hold in general. These contributions provide new tools and insight to aid in the annotation of complex spectral relationships and provide a foundation for improved data set identification. Mz.unity is an R package and is freely available at https://github.com/nathaniel-mahieu/mz.unity as well as our laboratory Web site http://pattilab.wustl.edu/software/ .
Collapse
Affiliation(s)
- Nathaniel G. Mahieu
- Department of Chemistry, Washington University, St. Louis, Missouri 63130, United States
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, United States
| | - Jonathan L. Spalding
- Department of Chemistry, Washington University, St. Louis, Missouri 63130, United States
- Department of Genetics
| | - Susan J. Gelman
- Department of Chemistry, Washington University, St. Louis, Missouri 63130, United States
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, United States
| | - Gary J. Patti
- Department of Chemistry, Washington University, St. Louis, Missouri 63130, United States
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110, United States
| |
Collapse
|
40
|
Costa C, Maraschin M, Rocha M. An R package for the integrated analysis of metabolomics and spectral data. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2016; 129:117-124. [PMID: 26853041 DOI: 10.1016/j.cmpb.2016.01.008] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Revised: 01/03/2016] [Accepted: 01/05/2016] [Indexed: 06/05/2023]
Abstract
Recently, there has been a growing interest in the field of metabolomics, materialized by a remarkable growth in experimental techniques, available data and related biological applications. Indeed, techniques as nuclear magnetic resonance, gas or liquid chromatography, mass spectrometry, infrared and UV-visible spectroscopies have provided extensive datasets that can help in tasks as biological and biomedical discovery, biotechnology and drug development. However, as it happens with other omics data, the analysis of metabolomics datasets provides multiple challenges, both in terms of methodologies and in the development of appropriate computational tools. Indeed, from the available software tools, none addresses the multiplicity of existing techniques and data analysis tasks. In this work, we make available a novel R package, named specmine, which provides a set of methods for metabolomics data analysis, including data loading in different formats, pre-processing, metabolite identification, univariate and multivariate data analysis, machine learning, and feature selection. Importantly, the implemented methods provide adequate support for the analysis of data from diverse experimental techniques, integrating a large set of functions from several R packages in a powerful, yet simple to use environment. The package, already available in CRAN, is accompanied by a web site where users can deposit datasets, scripts and analysis reports to be shared with the community, promoting the efficient sharing of metabolomics data analysis pipelines.
Collapse
Affiliation(s)
- Christopher Costa
- CEB - Centre Biological Engineering, University of Minho, Braga, Portugal
| | - Marcelo Maraschin
- Plant Morphogenesis and Biochemistry Laboratory, Federal University of Santa Catarina, Florianopolis, Brazil
| | - Miguel Rocha
- CEB - Centre Biological Engineering, University of Minho, Braga, Portugal.
| |
Collapse
|
41
|
Khymenets O, Andres-Lacueva C, Urpi-Sarda M, Vazquez-Fresno R, Mart MM, Reglero G, Torres M, Llorach R. Metabolic fingerprint after acute and under sustained consumption of a functional beverage based on grape skin extract in healthy human subjects. Food Funct 2016; 6:1288-98. [PMID: 25761658 DOI: 10.1039/c4fo00684d] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Grape-derived polyphenols are considered to be one of the most promising ingredients for functional foods due to their health-promoting activities. We applied a HPLC-MS-based untargeted metabolomic approach in order to evaluate the impact of a functional food based on grape skin polyphenols on the urinary metabolome of healthy subjects. Thirty-one volunteers participated in two dietary crossover randomized intervention studies: with a single-dose intake (187 mL) and with a 15-day sustained consumption (twice per day, 187 mL per day in total) of a functional beverage (FB). Postprandial (4-hour) and 24-hour urine samples collected after acute consumption and on the last day of sustained FB consumption, respectively, were analysed using an untargeted HPLC-qTOF-MS approach. Multivariate modelling with subsequent application of an S-plot revealed differential mass features related to acute and prolonged consumption of FB. More than half of the mass features were shared between the two types of samples, among which several phase II metabolites of grape-derived polyphenols were identified at confidence level II. Prolonged consumption of FB was specifically reflected in urine metabolome by the presence of first-stage microbial metabolites of flavanols: hydroxyvaleric acid and hydroxyvalerolactone derivatives. Overall, several epicatechin and phenolic acid metabolites both of tissular and microbiota origin were the most representative markers of FB consumption. To our knowledge, this is one of the first studies where an untargeted LC-MS metabolomic approach has been applied in nutrition research on a grape-derived FB.
Collapse
Affiliation(s)
- Olha Khymenets
- Biomarkers and Nutrimetabolomic Lab., Department of Nutrition and Food Science, XaRTA, INSA, Campus Torribera, Pharmacy School, University of Barcelona, Barcelona, Spain.
| | | | | | | | | | | | | | | |
Collapse
|
42
|
Yi L, Dong N, Yun Y, Deng B, Ren D, Liu S, Liang Y. Chemometric methods in data processing of mass spectrometry-based metabolomics: A review. Anal Chim Acta 2016; 914:17-34. [PMID: 26965324 DOI: 10.1016/j.aca.2016.02.001] [Citation(s) in RCA: 159] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Revised: 01/28/2016] [Accepted: 02/01/2016] [Indexed: 01/03/2023]
Abstract
This review focuses on recent and potential advances in chemometric methods in relation to data processing in metabolomics, especially for data generated from mass spectrometric techniques. Metabolomics is gradually being regarded a valuable and promising biotechnology rather than an ambitious advancement. Herein, we outline significant developments in metabolomics, especially in the combination with modern chemical analysis techniques, and dedicated statistical, and chemometric data analytical strategies. Advanced skills in the preprocessing of raw data, identification of metabolites, variable selection, and modeling are illustrated. We believe that insights from these developments will help narrow the gap between the original dataset and current biological knowledge. We also discuss the limitations and perspectives of extracting information from high-throughput datasets.
Collapse
Affiliation(s)
- Lunzhao Yi
- Yunnan Food Safety Research Institute, Kunming University of Science and Technology, Kunming, 650500, China.
| | - Naiping Dong
- Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Yonghuan Yun
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China
| | - Baichuan Deng
- College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Dabing Ren
- Yunnan Food Safety Research Institute, Kunming University of Science and Technology, Kunming, 650500, China
| | - Shao Liu
- Xiangya Hospital, Central South University, Changsha, 410008, China
| | - Yizeng Liang
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China
| |
Collapse
|
43
|
Misra BB, van der Hooft JJJ. Updates in metabolomics tools and resources: 2014-2015. Electrophoresis 2015; 37:86-110. [DOI: 10.1002/elps.201500417] [Citation(s) in RCA: 100] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2015] [Revised: 10/04/2015] [Accepted: 10/05/2015] [Indexed: 12/12/2022]
Affiliation(s)
- Biswapriya B. Misra
- Department of Biology, Genetics Institute; University of Florida; Gainesville FL USA
| | | |
Collapse
|
44
|
Mora-Cubillos X, Tulipani S, Garcia-Aloy M, Bulló M, Tinahones FJ, Andres-Lacueva C. Plasma metabolomic biomarkers of mixed nuts exposure inversely correlate with severity of metabolic syndrome. Mol Nutr Food Res 2015; 59:2480-90. [DOI: 10.1002/mnfr.201500549] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Revised: 09/08/2015] [Accepted: 09/10/2015] [Indexed: 01/30/2023]
Affiliation(s)
- Ximena Mora-Cubillos
- Biomarkers & Nutrimetabolomic Lab; Department of Nutrition and Food Science, XaRTA, INSA, Campus Torribera, Faculty of Pharmacy; University of Barcelona; Barcelona Spain
| | - Sara Tulipani
- Biomarkers & Nutrimetabolomic Lab; Department of Nutrition and Food Science, XaRTA, INSA, Campus Torribera, Faculty of Pharmacy; University of Barcelona; Barcelona Spain
- Biomedical Research Institute (IBIMA); Service of Endocrinology and Nutrition; Málaga Hospital Complex (Virgen de la Victoria), Campus de Teatinos s/n; University of Málaga; Málaga Spain
| | - Mar Garcia-Aloy
- Biomarkers & Nutrimetabolomic Lab; Department of Nutrition and Food Science, XaRTA, INSA, Campus Torribera, Faculty of Pharmacy; University of Barcelona; Barcelona Spain
| | - Mònica Bulló
- Human Nutrition Unit; Faculty of Medicine and Health Sciences; IISPV; Universitat Rovira i Virgili; Reus Spain
- CIBER Fisiopatología de la Obesidad y Nutrición (CIBERobn); Instituto de Salud Carlos III (ISCIII); Madrid Spain
| | - Francisco J Tinahones
- Biomedical Research Institute (IBIMA); Service of Endocrinology and Nutrition; Málaga Hospital Complex (Virgen de la Victoria), Campus de Teatinos s/n; University of Málaga; Málaga Spain
- CIBER Fisiopatología de la Obesidad y Nutrición (CIBERobn); Instituto de Salud Carlos III (ISCIII); Madrid Spain
| | - Cristina Andres-Lacueva
- Biomarkers & Nutrimetabolomic Lab; Department of Nutrition and Food Science, XaRTA, INSA, Campus Torribera, Faculty of Pharmacy; University of Barcelona; Barcelona Spain
| |
Collapse
|
45
|
Godzien J, Armitage EG, Angulo S, Martinez-Alcazar MP, Alonso-Herranz V, Otero A, Lopez-Gonzalvez A, Barbas C. In-source fragmentation and correlation analysis as tools for metabolite identification exemplified with CE-TOF untargeted metabolomics. Electrophoresis 2015; 36:2188-2195. [DOI: 10.1002/elps.201500016] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Revised: 02/08/2015] [Accepted: 02/08/2015] [Indexed: 01/16/2023]
Affiliation(s)
- Joanna Godzien
- Centre for Metabolomics and Bioanalysis (CEMBIO); Facultad de Farmacia; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| | - Emily G. Armitage
- Centre for Metabolomics and Bioanalysis (CEMBIO); Facultad de Farmacia; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| | - Santiago Angulo
- Centre for Metabolomics and Bioanalysis (CEMBIO); Facultad de Farmacia; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| | - Mari Paz Martinez-Alcazar
- Centre for Metabolomics and Bioanalysis (CEMBIO); Facultad de Farmacia; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| | - Vanesa Alonso-Herranz
- Centre for Metabolomics and Bioanalysis (CEMBIO); Facultad de Farmacia; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| | - Abraham Otero
- Department of Information and Communications Systems Engineering; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| | - Angeles Lopez-Gonzalvez
- Centre for Metabolomics and Bioanalysis (CEMBIO); Facultad de Farmacia; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| | - Coral Barbas
- Centre for Metabolomics and Bioanalysis (CEMBIO); Facultad de Farmacia; Universidad CEU San Pablo, Campus Montepríncipe; Boadilla del Monte Madrid Spain
| |
Collapse
|
46
|
Alonso A, Marsal S, Julià A. Analytical methods in untargeted metabolomics: state of the art in 2015. Front Bioeng Biotechnol 2015; 3:23. [PMID: 25798438 PMCID: PMC4350445 DOI: 10.3389/fbioe.2015.00023] [Citation(s) in RCA: 395] [Impact Index Per Article: 43.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Accepted: 02/18/2015] [Indexed: 12/20/2022] Open
Abstract
Metabolomics comprises the methods and techniques that are used to measure the small molecule composition of biofluids and tissues, and is actually one of the most rapidly evolving research fields. The determination of the metabolomic profile - the metabolome - has multiple applications in many biological sciences, including the developing of new diagnostic tools in medicine. Recent technological advances in nuclear magnetic resonance and mass spectrometry are significantly improving our capacity to obtain more data from each biological sample. Consequently, there is a need for fast and accurate statistical and bioinformatic tools that can deal with the complexity and volume of the data generated in metabolomic studies. In this review, we provide an update of the most commonly used analytical methods in metabolomics, starting from raw data processing and ending with pathway analysis and biomarker identification. Finally, the integration of metabolomic profiles with molecular data from other high-throughput biotechnologies is also reviewed.
Collapse
Affiliation(s)
- Arnald Alonso
- Rheumatology Research Group, Vall d’Hebron Research Institute, Barcelona, Spain
- Department of Automatic Control (ESAII), Polytechnic University of Catalonia, Barcelona, Spain
| | - Sara Marsal
- Rheumatology Research Group, Vall d’Hebron Research Institute, Barcelona, Spain
| | - Antonio Julià
- Rheumatology Research Group, Vall d’Hebron Research Institute, Barcelona, Spain
| |
Collapse
|
47
|
Dhanasekaran AR, Pearson JL, Ganesan B, Weimer BC. Metabolome searcher: a high throughput tool for metabolite identification and metabolic pathway mapping directly from mass spectrometry and using genome restriction. BMC Bioinformatics 2015; 16:62. [PMID: 25887958 PMCID: PMC4347650 DOI: 10.1186/s12859-015-0462-y] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2014] [Accepted: 01/13/2015] [Indexed: 01/19/2023] Open
Abstract
Background Mass spectrometric analysis of microbial metabolism provides a long list of possible compounds. Restricting the identification of the possible compounds to those produced by the specific organism would benefit the identification process. Currently, identification of mass spectrometry (MS) data is commonly done using empirically derived compound databases. Unfortunately, most databases contain relatively few compounds, leaving long lists of unidentified molecules. Incorporating genome-encoded metabolism enables MS output identification that may not be included in databases. Using an organism’s genome as a database restricts metabolite identification to only those compounds that the organism can produce. Results To address the challenge of metabolomic analysis from MS data, a web-based application to directly search genome-constructed metabolic databases was developed. The user query returns a genome-restricted list of possible compound identifications along with the putative metabolic pathways based on the name, formula, SMILES structure, and the compound mass as defined by the user. Multiple queries can be done simultaneously by submitting a text file created by the user or obtained from the MS analysis software. The user can also provide parameters specific to the experiment’s MS analysis conditions, such as mass deviation, adducts, and detection mode during the query so as to provide additional levels of evidence to produce the tentative identification. The query results are provided as an HTML page and downloadable text file of possible compounds that are restricted to a specific genome. Hyperlinks provided in the HTML file connect the user to the curated metabolic databases housed in ProCyc, a Pathway Tools platform, as well as the KEGG Pathway database for visualization and metabolic pathway analysis. Conclusions Metabolome Searcher, a web-based tool, facilitates putative compound identification of MS output based on genome-restricted metabolic capability. This enables researchers to rapidly extend the possible identifications of large data sets for metabolites that are not in compound databases. Putative compound names with their associated metabolic pathways from metabolomics data sets are returned to the user for additional biological interpretation and visualization. This novel approach enables compound identification by restricting the possible masses to those encoded in the genome.
Collapse
Affiliation(s)
- A Ranjitha Dhanasekaran
- Center for Integrated BioSystems, Computer Science Department, Utah State University, Logan, 84322-8700, USA. .,Linda Crnic Institute for Down Syndrome, Department of Pediatrics, School of Medicine, University of Colorado Denver, 12700 E 19th Avenue, Aurora, CO, 80045, USA.
| | - Jon L Pearson
- Center for Integrated BioSystems, Computer Science Department, Utah State University, Logan, 84322-8700, USA. .,Spillman Technologies, 4625 West Lake Park Blvd, Salt Lake City, UT, 84120, USA.
| | - Balasubramanian Ganesan
- Center for Integrated BioSystems, Computer Science Department, Utah State University, Logan, 84322-8700, USA. .,Western Dairy Center, Department of Nutrition, Dietetics, and Food Sciences, Utah State University, Logan, 84322-8700, USA.
| | - Bart C Weimer
- University of California, Davis, School of Veterinary Medicine, 1089 Veterinary Medicine Dr., VM3B, Room 4023, Davis, CA, 95616, USA.
| |
Collapse
|
48
|
Tulipani S, Mora-Cubillos X, Jáuregui O, Llorach R, García-Fuentes E, Tinahones FJ, Andres-Lacueva C. New and Vintage Solutions To Enhance the Plasma Metabolome Coverage by LC-ESI-MS Untargeted Metabolomics: The Not-So-Simple Process of Method Performance Evaluation. Anal Chem 2015; 87:2639-47. [DOI: 10.1021/ac503031d] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Sara Tulipani
- Biomarkers & Nutrimetabolomic Lab., Nutrition and Food Science Department, XaRTA, INSA, Campus Torribera, Pharmacy Faculty, University of Barcelona, 08028 Barcelona, Spain
- Biomedical Research
Institute (IBIMA), Service of Endocrinology and Nutrition, Malaga
Hospital Complex (Virgen de la Victoria), Campus de Teatinos s/n, University of Malaga, 29071 Malaga, Spain
| | - Ximena Mora-Cubillos
- Biomarkers & Nutrimetabolomic Lab., Nutrition and Food Science Department, XaRTA, INSA, Campus Torribera, Pharmacy Faculty, University of Barcelona, 08028 Barcelona, Spain
| | - Olga Jáuregui
- Scientific and Technological Centres of the University of Barcelona (CCIT-UB), 08028 Barcelona, Spain
| | - Rafael Llorach
- Biomarkers & Nutrimetabolomic Lab., Nutrition and Food Science Department, XaRTA, INSA, Campus Torribera, Pharmacy Faculty, University of Barcelona, 08028 Barcelona, Spain
| | - Eduardo García-Fuentes
- Biomedical Research
Institute (IBIMA), Service of Endocrinology and Nutrition, Hospital
Regional Universitario, Plaza del Hospital Civil s/n, University of Malaga, 29071 Malaga, Spain
- CIBER Fisiopatología
de la Obesidad y Nutrición (CIBERobn), Instituto de Salud Carlos III (ISCIII), 28029 Madrid, Spain
| | - Francisco J Tinahones
- Biomedical Research
Institute (IBIMA), Service of Endocrinology and Nutrition, Malaga
Hospital Complex (Virgen de la Victoria), Campus de Teatinos s/n, University of Malaga, 29071 Malaga, Spain
- CIBER Fisiopatología
de la Obesidad y Nutrición (CIBERobn), Instituto de Salud Carlos III (ISCIII), 28029 Madrid, Spain
| | - Cristina Andres-Lacueva
- Biomarkers & Nutrimetabolomic Lab., Nutrition and Food Science Department, XaRTA, INSA, Campus Torribera, Pharmacy Faculty, University of Barcelona, 08028 Barcelona, Spain
| |
Collapse
|
49
|
Yi L, Dong N, Yun Y, Deng B, Liu S, Zhang Y, Liang Y. WITHDRAWN: Recent advances in chemometric methods for plant metabolomics: A review. Biotechnol Adv 2014:S0734-9750(14)00183-9. [PMID: 25461504 DOI: 10.1016/j.biotechadv.2014.11.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 11/17/2014] [Accepted: 11/18/2014] [Indexed: 12/17/2022]
Abstract
This article has been withdrawn at the request of the author(s) and/or editor. The Publisher apologizes for any inconvenience this may cause. The full Elsevier Policy on Article Withdrawal can be found at http://www.elsevier.com/locate/withdrawalpolicy.
Collapse
Affiliation(s)
- Lunzhao Yi
- Yunnan Food Safety Research Institute, Kunming University of Science and Technology, Kunming 650500, China.
| | - Naiping Dong
- Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong 999077, Hong Kong, China
| | - Yonghuan Yun
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Baichuan Deng
- Department of Chemistry, University of Bergen, Bergen N-5007, Norway
| | - Shao Liu
- Xiangya Hospital, Central South University, Changsha 410008, China
| | - Yi Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Yizeng Liang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| |
Collapse
|
50
|
Garcia-Aloy M, Llorach R, Urpi-Sarda M, Jáuregui O, Corella D, Ruiz-Canela M, Salas-Salvadó J, Fitó M, Ros E, Estruch R, Andres-Lacueva C. A metabolomics-driven approach to predict cocoa product consumption by designing a multimetabolite biomarker model in free-living subjects from the PREDIMED study. Mol Nutr Food Res 2014; 59:212-20. [PMID: 25298021 DOI: 10.1002/mnfr.201400434] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Revised: 09/27/2014] [Accepted: 09/30/2014] [Indexed: 01/05/2023]
Abstract
SCOPE The aim of the current study was to apply an untargeted metabolomics strategy to characterize a model of cocoa intake biomarkers in a free-living population. METHODS AND RESULTS An untargeted HPLC-q-ToF-MS based metabolomics approach was applied to human urine from 32 consumers of cocoa or derived products (CC) and 32 matched control subjects with no consumption of cocoa products (NC). The multivariate statistical analysis (OSC-PLS-DA) showed clear differences between CC and NC groups. The discriminant biomarkers identified were mainly related to the metabolic pathways of theobromine and polyphenols, as well as to cocoa processing. Consumption of cocoa products was also associated with reduced urinary excretions of methylglutarylcarnitine, which could be related to effects of cocoa exposure on insulin resistance. To improve the prediction of cocoa consumption, a combined urinary metabolite model was constructed. ROC curves were performed to evaluate the model and individual metabolites. The AUC values (95% CI) for the model were 95.7% (89.8-100%) and 92.6% (81.9-100%) in training and validation sets, respectively, whereas the AUCs for individual metabolites were <90%. CONCLUSIONS The metabolic signature of cocoa consumption in free-living subjects reveals that combining different metabolites as biomarker models improves prediction of dietary exposure to cocoa.
Collapse
Affiliation(s)
- Mar Garcia-Aloy
- Biomarkers & Nutrimetabolomic Lab, Nutrition and Food Science Department, XaRTA, INSA, Campus Torribera, Pharmacy Faculty, University of Barcelona, Spain**; INGENIO-CONSOLIDER Program, Fun-C-Food CSD2007-063, Spain
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|