1
|
Sriram V, Conard AM, Rosenberg I, Kim D, Saponas TS, Hall AK. Addressing biomedical data challenges and opportunities to inform a large-scale data lifecycle for enhanced data sharing, interoperability, analysis, and collaboration across stakeholders. Sci Rep 2025; 15:6291. [PMID: 39984563 PMCID: PMC11845626 DOI: 10.1038/s41598-025-90453-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Accepted: 02/13/2025] [Indexed: 02/23/2025] Open
Abstract
Biomedical discovery is fraught with challenges stemming from diverse data types and siloed analysis. In this study, we explored common biomedical data tasks and pain points that could be addressed to elevate data quality, enhance sharing, streamline analysis, and foster collaboration across stakeholders. We recruited fifteen professionals from various biomedical roles and industries to participate in sixty-minute semi-structured interviews, which involved an assessment of their challenges, needs, and tasks as well as a brainstorm exercise to validate each professional's research process. We applied a qualitative analysis of individual interviews using an inductive-deductive thematic coding approach for emerging themes. We identified a common set of challenges related to procuring and validating data, applying new analysis techniques and navigating varied computational environments, distributing results effectively and reproducibly, and managing the flow of data across phases of the data lifecycle. Our findings emphasize the importance of secure data sharing and facilities for collaboration throughout the discovery process. Our identified pain points provide researchers with an opportunity to align workstreams and enhance research data lifecycles to conduct biomedical discovery. We conclude our study with a summary of key actionable recommendations to tackle multiomic data challenges across the stages and phases of biomedical discovery.
Collapse
Affiliation(s)
- Vivek Sriram
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Ashley Mae Conard
- Health Futures, Microsoft Research, Microsoft Building 99, 14820 NE 36Th Street, Redmond, Washington, 98052, USA
| | - Ilyana Rosenberg
- Health Futures, Microsoft Research, Microsoft Building 99, 14820 NE 36Th Street, Redmond, Washington, 98052, USA
| | - Dokyoon Kim
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - T Scott Saponas
- Health Futures, Microsoft Research, Microsoft Building 99, 14820 NE 36Th Street, Redmond, Washington, 98052, USA
| | - Amanda K Hall
- Health Futures, Microsoft Research, Microsoft Building 99, 14820 NE 36Th Street, Redmond, Washington, 98052, USA.
- Department of Biomedical Informatics and Medical Education, University of Washington School of Medicine, Seattle, WA, 98195, USA.
| |
Collapse
|
2
|
Tomisova K, Jarosova V, Marsik P, Bergo AM, Cinek O, Hlinakova L, Kloucek P, Janousek V, Valentová K, Havlik J. Mutual Interactions of Silymarin and Colon Microbiota in Healthy Young and Healthy Elder Subjects. Mol Nutr Food Res 2024; 68:e2400500. [PMID: 39473280 PMCID: PMC11605779 DOI: 10.1002/mnfr.202400500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 10/01/2024] [Indexed: 11/30/2024]
Abstract
SCOPE This multi-omic study investigates the bidirectional interactions between gut microbiota and silymarin metabolism, highlighting the differential effects across various age groups. Silymarin, the extract from Silybum marianum (milk thistle), is commonly used for its hepatoprotective effects. METHODS AND RESULTS An in vitro fermentation colon model was used with microbiota from 20 stool samples obtained from healthy donors divided into two age groups. A combination of three analytical advanced techniques, namely proton nuclear magnetic resonance (1H NMR), next-generation sequencing (NGS), and liquid chromatography-mass spectrometry (LC-MS) was used to determine silymarin microbial metabolites over 24 h, overall metabolome, and microbiota composition. Silymarin at a low diet-relevant dose of 50 µg mL-1 significantly altered gut microbiota metabolism, reducing short-chain fatty acid (acetate, butyrate, propionate) production, glucose utilization, and increasing alpha-diversity. Notably, the study reveals age-related differences in silymarin catabolism. Healthy elderly donors (70-80 years) exhibited a significant increase in a specific catabolite associated with Oscillibacter sp., whereas healthy young donors (12-45 years) showed a faster breakdown of silymarin components, particularly isosilybin B, which is associated with higher abundance of Faecalibacterium and Erysipelotrichaceae UCG-003. CONCLUSION This study provides insights into microbiome functionality in metabolizing dietary flavonolignans, highlighting implications for age-specific nutritional strategies, and advancing our understanding of dietary (poly)phenol metabolism.
Collapse
Affiliation(s)
- Katerina Tomisova
- Department of Food Science, Faculty of Agrobiology, Food and Natural ResourcesCzech University of Life Sciences PragueKamycka 129Prague Suchdol165 00Czech Republic
| | - Veronika Jarosova
- Department of Food Science, Faculty of Agrobiology, Food and Natural ResourcesCzech University of Life Sciences PragueKamycka 129Prague Suchdol165 00Czech Republic
| | - Petr Marsik
- Department of Food Science, Faculty of Agrobiology, Food and Natural ResourcesCzech University of Life Sciences PragueKamycka 129Prague Suchdol165 00Czech Republic
| | - Anna Mascellani Bergo
- Department of Food Science, Faculty of Agrobiology, Food and Natural ResourcesCzech University of Life Sciences PragueKamycka 129Prague Suchdol165 00Czech Republic
| | - Ondrej Cinek
- Department of PediatricsCharles University and University Hospital MotolV Uvalu 84Prague150 06Czech Republic
| | - Lucie Hlinakova
- Department of PediatricsCharles University and University Hospital MotolV Uvalu 84Prague150 06Czech Republic
| | - Pavel Kloucek
- Department of Food Science, Faculty of Agrobiology, Food and Natural ResourcesCzech University of Life Sciences PragueKamycka 129Prague Suchdol165 00Czech Republic
| | | | - Kateřina Valentová
- Institute of Microbiology of the Czech Academy of SciencesVidenska 1083Prague142 00Czech Republic
| | - Jaroslav Havlik
- Department of Food Science, Faculty of Agrobiology, Food and Natural ResourcesCzech University of Life Sciences PragueKamycka 129Prague Suchdol165 00Czech Republic
| |
Collapse
|
3
|
Chi J, Shu J, Li M, Mudappathi R, Jin Y, Lewis F, Boon A, Qin X, Liu L, Gu H. Artificial Intelligence in Metabolomics: A Current Review. Trends Analyt Chem 2024; 178:117852. [PMID: 39071116 PMCID: PMC11271759 DOI: 10.1016/j.trac.2024.117852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Metabolomics and artificial intelligence (AI) form a synergistic partnership. Metabolomics generates large datasets comprising hundreds to thousands of metabolites with complex relationships. AI, aiming to mimic human intelligence through computational modeling, possesses extraordinary capabilities for big data analysis. In this review, we provide a recent overview of the methodologies and applications of AI in metabolomics studies in the context of systems biology and human health. We first introduce the AI concept, history, and key algorithms for machine learning and deep learning, summarizing their strengths and weaknesses. We then discuss studies that have successfully used AI across different aspects of metabolomic analysis, including analytical detection, data preprocessing, biomarker discovery, predictive modeling, and multi-omics data integration. Lastly, we discuss the existing challenges and future perspectives in this rapidly evolving field. Despite limitations and challenges, the combination of metabolomics and AI holds great promises for revolutionary advancements in enhancing human health.
Collapse
Affiliation(s)
- Jinhua Chi
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Jingmin Shu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Ming Li
- Phoenix VA Health Care System, Phoenix, AZ 85012, USA
- University of Arizona College of Medicine, Phoenix, AZ 85004, USA
| | - Rekha Mudappathi
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Yan Jin
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Freeman Lewis
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Alexandria Boon
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Xiaoyan Qin
- College of Liberal Arts and Sciences, Arizona State University, Tempe, AZ 85281, USA
| | - Li Liu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Haiwei Gu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| |
Collapse
|
4
|
Balasubramaniam NK, Penberthy S, Fenyo D, Viessmann N, Russmann C, Borchers CH. Digitalomics - digital transformation leading to omics insights. Expert Rev Proteomics 2024; 21:337-344. [PMID: 39364775 DOI: 10.1080/14789450.2024.2413107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 09/02/2024] [Accepted: 09/23/2024] [Indexed: 10/05/2024]
Abstract
INTRODUCTION Biomarker discovery is increasingly moving from single omics to multiomics, as well as from multi-cell omics to single-cell omics. These transitions have increasingly adopted digital transformation technologies to accelerate the progression from data to insight. Here, we will discuss the concept of 'digitalomics' and how digital transformation directly impacts biomarker discovery. This will ultimately assist clinicians in personalized therapy and precision-medicine treatment decisions. AREAS COVERED Genotype-to-phenotype-based insight generation involves integrating large amounts of complex multiomic data. This data integration and analysis is aided through digital transformation, leading to better clinical outcomes. We also highlight the challenges and opportunities of Digitalomics, and provide examples of the application of Artificial Intelligence, cloud- and high-performance computing, and use of tensors for multiomic analysis workflows. EXPERT OPINION Biomarker discovery, aided by digital transformation, is having a significant impact on cancer, cardiovascular, infectious, immunological, and neurological diseases, among others. Data insights garnered from multiomic analyses, combined with patient meta data, aids patient stratification and targeted treatment across a broad spectrum of diseases. Digital transformation offers time and cost savings while leading to improved patent healthcare. Here, we highlight the impact of digital transformation on multiomics- based biomarker discovery with specific applications related to oncology.
Collapse
Affiliation(s)
- Nandha Kumar Balasubramaniam
- PromptBio Inc, Pleasanton, CA, USA
- Health Campus Goettingen/University of Applied Sciences and Arts (HAWK), Göttingen, Germany
| | | | - David Fenyo
- New York University Grossman School of Medicine, New York, NY, USA
| | - Nina Viessmann
- Health Campus Goettingen/University of Applied Sciences and Arts (HAWK), Göttingen, Germany
| | - Christoph Russmann
- Health Campus Goettingen/University of Applied Sciences and Arts (HAWK), Göttingen, Germany
- Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Christoph H Borchers
- Segal Cancer Proteomics Center, Lady Davis Institute for Medical Research, Jewish General Hospital and McGill University, Montreal, QC, Canada
- Gerald Bronfman Department of Oncology, Lady Davis Institute for Medical Research, Jewish General Hospital, Montreal, QC, Canada
- Division of Experimental Medicine, McGill University, Montreal, QC, Canada
- Department of Pathology, McGill University, Montreal, QC, Canada
| |
Collapse
|
5
|
Anazco D, Acosta A. Precision medicine for obesity: current evidence and insights for personalization of obesity pharmacotherapy. Int J Obes (Lond) 2024:10.1038/s41366-024-01599-z. [PMID: 39127792 DOI: 10.1038/s41366-024-01599-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 06/17/2024] [Accepted: 07/29/2024] [Indexed: 08/12/2024]
Abstract
Obesity is a chronic and complex disease associated with increased morbidity, mortality, and financial burden. It is expected that by 2030 one of two people in the United States will have obesity. The backbone for obesity management continues to be lifestyle interventions, consisting of calorie deficit diets and increased physical activity levels, however, these interventions are often insufficient to achieve sufficient and maintained weight loss. As a result, multiple patients require additional interventions such as antiobesity medications or bariatric interventions in order to achieve clinically significant weight loss and improvement or resolution of obesity-associated comorbidities. Despite the recent advances in the field of obesity pharmacotherapy that have resulted in never-before-seen weight loss outcomes, comorbidity improvement, and even reduction in cardiovascular mortality, there is still a significant interindividual variability in terms of response to antiobesity medications, with a subset of patients not achieving a clinically significant weight loss. Currently, the trial-and-error paradigm for the selection of antiobesity medications results in increased costs and risks for developing side effects, while also reduces engagement in weight management programs for patients with obesity. The implementation of a precision medicine framework to the selection of antiobesity medications might help reduce heterogeneity and optimize weight loss outcomes by identifying unique subsets of patients, or phenotypes, that have a better response to a specific intervention. The detailed study of energy balance regulation holds promise, as actionable behavioral and physiologic traits could help guide antiobesity medication selection based on previous mechanistic studies. Moreover, the rapid advances in genotyping, multi-omics, and big data analysis might hold the key to discover additional signatures or phenotypes that might respond better to a certain intervention and might permit the widespread adoption of a precision medicine approach for obesity management.
Collapse
Affiliation(s)
- Diego Anazco
- Precision Medicine for Obesity Program, Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN, USA
| | - Andres Acosta
- Precision Medicine for Obesity Program, Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
6
|
Yadegar A, Bar-Yoseph H, Monaghan TM, Pakpour S, Severino A, Kuijper EJ, Smits WK, Terveer EM, Neupane S, Nabavi-Rad A, Sadeghi J, Cammarota G, Ianiro G, Nap-Hill E, Leung D, Wong K, Kao D. Fecal microbiota transplantation: current challenges and future landscapes. Clin Microbiol Rev 2024; 37:e0006022. [PMID: 38717124 PMCID: PMC11325845 DOI: 10.1128/cmr.00060-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2024] Open
Abstract
SUMMARYGiven the importance of gut microbial homeostasis in maintaining health, there has been considerable interest in developing innovative therapeutic strategies for restoring gut microbiota. One such approach, fecal microbiota transplantation (FMT), is the main "whole gut microbiome replacement" strategy and has been integrated into clinical practice guidelines for treating recurrent Clostridioides difficile infection (rCDI). Furthermore, the potential application of FMT in other indications such as inflammatory bowel disease (IBD), metabolic syndrome, and solid tumor malignancies is an area of intense interest and active research. However, the complex and variable nature of FMT makes it challenging to address its precise functionality and to assess clinical efficacy and safety in different disease contexts. In this review, we outline clinical applications, efficacy, durability, and safety of FMT and provide a comprehensive assessment of its procedural and administration aspects. The clinical applications of FMT in children and cancer immunotherapy are also described. We focus on data from human studies in IBD in contrast with rCDI to delineate the putative mechanisms of this treatment in IBD as a model, including colonization resistance and functional restoration through bacterial engraftment, modulating effects of virome/phageome, gut metabolome and host interactions, and immunoregulatory actions of FMT. Furthermore, we comprehensively review omics technologies, metagenomic approaches, and bioinformatics pipelines to characterize complex microbial communities and discuss their limitations. FMT regulatory challenges, ethical considerations, and pharmacomicrobiomics are also highlighted to shed light on future development of tailored microbiome-based therapeutics.
Collapse
Affiliation(s)
- Abbas Yadegar
- Foodborne and Waterborne Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Haggai Bar-Yoseph
- Department of Gastroenterology, Rambam Health Care Campus, Haifa, Israel
- Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
| | - Tanya Marie Monaghan
- National Institute for Health Research Nottingham Biomedical Research Centre, University of Nottingham, Nottingham, United Kingdom
- Nottingham Digestive Diseases Centre, School of Medicine, University of Nottingham, Nottingham, United Kingdom
| | - Sepideh Pakpour
- School of Engineering, Faculty of Applied Sciences, UBC, Okanagan Campus, Kelowna, British Columbia, Canada
| | - Andrea Severino
- Department of Translational Medicine and Surgery, Università Cattolica del Sacro Cuore, Rome, Italy
- Department of Medical and Surgical Sciences, UOC CEMAD Centro Malattie dell'Apparato Digerente, Medicina Interna e Gastroenterologia, Fondazione Policlinico Universitario Gemelli IRCCS, Rome, Italy
- Department of Medical and Surgical Sciences, UOC Gastroenterologia, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy
| | - Ed J Kuijper
- Center for Microbiota Analysis and Therapeutics (CMAT), Leiden University Center for Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands
| | - Wiep Klaas Smits
- Center for Microbiota Analysis and Therapeutics (CMAT), Leiden University Center for Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands
| | - Elisabeth M Terveer
- Center for Microbiota Analysis and Therapeutics (CMAT), Leiden University Center for Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands
| | - Sukanya Neupane
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Ali Nabavi-Rad
- Foodborne and Waterborne Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Javad Sadeghi
- School of Engineering, Faculty of Applied Sciences, UBC, Okanagan Campus, Kelowna, British Columbia, Canada
| | - Giovanni Cammarota
- Department of Translational Medicine and Surgery, Università Cattolica del Sacro Cuore, Rome, Italy
- Department of Medical and Surgical Sciences, UOC CEMAD Centro Malattie dell'Apparato Digerente, Medicina Interna e Gastroenterologia, Fondazione Policlinico Universitario Gemelli IRCCS, Rome, Italy
- Department of Medical and Surgical Sciences, UOC Gastroenterologia, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy
| | - Gianluca Ianiro
- Department of Translational Medicine and Surgery, Università Cattolica del Sacro Cuore, Rome, Italy
- Department of Medical and Surgical Sciences, UOC CEMAD Centro Malattie dell'Apparato Digerente, Medicina Interna e Gastroenterologia, Fondazione Policlinico Universitario Gemelli IRCCS, Rome, Italy
- Department of Medical and Surgical Sciences, UOC Gastroenterologia, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy
| | - Estello Nap-Hill
- Department of Medicine, Division of Gastroenterology, St Paul's Hospital, University of British Columbia, Vancouver, British Columbia, Canada
| | - Dickson Leung
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Karen Wong
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Dina Kao
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
7
|
Liu Z, Zhou Y, Wang H, Liu C, Wang L. Recent advances in understanding the fitness and survival mechanisms of Vibrio parahaemolyticus. Int J Food Microbiol 2024; 417:110691. [PMID: 38631283 DOI: 10.1016/j.ijfoodmicro.2024.110691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 03/14/2024] [Accepted: 04/02/2024] [Indexed: 04/19/2024]
Abstract
The presence of Vibrio parahaemolyticus (Vp) in different production stages of seafood has generated negative impacts on both public health and the sustainability of the industry. To further better investigate the fitness of Vp at the phenotypical level, a great number of studies have been conducted in recent years using plate counting methods. In the meantime, with the increasing accessibility of the next generation sequencing and the advances in analytical chemistry techniques, omics-oriented biotechnologies have further advanced our knowledge in the survival and virulence mechanisms of Vp at various molecular levels. These observations provide insights to guide the development of novel prevention and control strategies and benefit the monitoring and mitigation of food safety risks associated with Vp contamination. To timely capture these recent advances, this review firstly summarizes the most recent phenotypical level studies and provide insights about the survival of Vp under important in vitro stresses and on aquatic products. After that, molecular survival mechanisms of Vp at transcriptomic and proteomic levels are summarized and discussed. Looking forward, other newer omics-biotechnology such as metabolomics and secretomics show great potential to be used for confirming the cellular responses of Vp. Powerful data mining tools from the field of machine learning and artificial intelligence, that can better utilize the omics data and solve complex problems in the processing, analysis, and interpretation of omics data, will further improve our mechanistic understanding of Vp.
Collapse
Affiliation(s)
- Zhuosheng Liu
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA
| | - Yi Zhou
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA
| | - Hongye Wang
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA
| | - Chengchu Liu
- University of Maryland Sea Grant Extension Program, UMES Center for Food Science and Technology, Princess Anne, MD, United States
| | - Luxin Wang
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA.
| |
Collapse
|
8
|
Lac L, Leung CK, Hu P. Computational frameworks integrating deep learning and statistical models in mining multimodal omics data. J Biomed Inform 2024; 152:104629. [PMID: 38552994 DOI: 10.1016/j.jbi.2024.104629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 02/26/2024] [Accepted: 03/25/2024] [Indexed: 04/04/2024]
Abstract
BACKGROUND In health research, multimodal omics data analysis is widely used to address important clinical and biological questions. Traditional statistical methods rely on the strong assumptions of distribution. Statistical methods such as testing and differential expression are commonly used in omics analysis. Deep learning, on the other hand, is an advanced computer science technique that is powerful in mining high-dimensional omics data for prediction tasks. Recently, integrative frameworks or methods have been developed for omics studies that combine statistical models and deep learning algorithms. METHODS AND RESULTS The aim of these integrative frameworks is to combine the strengths of both statistical methods and deep learning algorithms to improve prediction accuracy while also providing interpretability and explainability. This review report discusses the current state-of-the-art integrative frameworks, their limitations, and potential future directions in survival and time-to-event longitudinal analysis, dimension reduction and clustering, regression and classification, feature selection, and causal and transfer learning.
Collapse
Affiliation(s)
- Leann Lac
- Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada; Department of Statistics, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Carson K Leung
- Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Pingzhao Hu
- Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada; Department of Biochemistry, Western University, London, Ontario, Canada; Department of Computer Science, Western University, London, Ontario, Canada; Department of Oncology, Western University, London, Ontario, Canada; Department of Epidemiology and Biostatistics, Western University, London, Ontario, Canada; The Children's Health Research Institute, Lawson Health Research Institute, London, Ontario, Canada.
| |
Collapse
|
9
|
Banerjee M, Srivastava S, Rai SN, States JC. Chronic arsenic exposure induces malignant transformation of human HaCaT cells through both deterministic and stochastic changes in transcriptome expression. Toxicol Appl Pharmacol 2024; 484:116865. [PMID: 38373578 PMCID: PMC10994602 DOI: 10.1016/j.taap.2024.116865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 02/11/2024] [Accepted: 02/13/2024] [Indexed: 02/21/2024]
Abstract
Biological processes are inherently stochastic, i.e., are partially driven by hard to predict random probabilistic processes. Carcinogenesis is driven both by stochastic and deterministic (predictable non-random) changes. However, very few studies systematically examine the contribution of stochastic events leading to cancer development. In differential gene expression studies, the established data analysis paradigms incentivize expression changes that are uniformly different across the experimental versus control groups, introducing preferential inclusion of deterministic changes at the expense of stochastic processes that might also play a crucial role in the process of carcinogenesis. In this study, we applied simple computational techniques to quantify: (i) The impact of chronic arsenic (iAs) exposure as well as passaging time on stochastic gene expression and (ii) Which genes were expressed deterministically and which were expressed stochastically at each of the three stages of cancer development. Using biological coefficient of variation as an empirical measure of stochasticity we demonstrate that chronic iAs exposure consistently suppressed passaging related stochastic gene expression at multiple time points tested, selecting for a homogenous cell population that undergo transformation. Employing multiple balanced removal of outlier data, we show that chronic iAs exposure induced deterministic and stochastic changes in the expression of unique set of genes, that populate largely unique biological pathways. Together, our data unequivocally demonstrate that both deterministic and stochastic changes in transcriptome-wide expression are critical in driving biological processes, pathways and networks towards clonal selection, carcinogenesis, and tumor heterogeneity.
Collapse
Affiliation(s)
- Mayukh Banerjee
- Department of Pharmacology and Toxicology, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - Sudhir Srivastava
- Department of Bioinformatics and Biostatistics, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - Shesh N Rai
- Department of Bioinformatics and Biostatistics, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Biostatistics and Bioinformatics Facility, James Graham Brown Cancer Center, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Biostatistics and Informatics Facility Core, Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - J Christopher States
- Department of Pharmacology and Toxicology, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA.
| |
Collapse
|
10
|
Chen SF, Loguercio S, Chen KY, Lee SE, Park JB, Liu S, Sadaei HJ, Torkamani A. Artificial Intelligence for Risk Assessment on Primary Prevention of Coronary Artery Disease. CURRENT CARDIOVASCULAR RISK REPORTS 2023; 17:215-231. [DOI: 10.1007/s12170-023-00731-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/09/2023] [Indexed: 01/04/2025]
Abstract
Abstract
Purpose of Review
Coronary artery disease (CAD) is a common and etiologically complex disease worldwide. Current guidelines for primary prevention, or the prevention of a first acute event, include relatively simple risk assessment and leave substantial room for improvement both for risk ascertainment and selection of prevention strategies. Here, we review how advances in big data and predictive modeling foreshadow a promising future of improved risk assessment and precision medicine for CAD.
Recent Findings
Artificial intelligence (AI) has improved the utility of high dimensional data, providing an opportunity to better understand the interplay between numerous CAD risk factors. Beyond applications of AI in cardiac imaging, the vanguard application of AI in healthcare, recent translational research is also revealing a promising path for AI in multi-modal risk prediction using standard biomarkers, genetic and other omics technologies, a variety of biosensors, and unstructured data from electronic health records (EHRs). However, gaps remain in clinical validation of AI models, most notably in the actionability of complex risk prediction for more precise therapeutic interventions.
Summary
The recent availability of nation-scale biobank datasets has provided a tremendous opportunity to richly characterize longitudinal health trajectories using health data collected at home, at laboratories, and through clinic visits. The ever-growing availability of deep genotype-phenotype data is poised to drive a transition from simple risk prediction algorithms to complex, “data-hungry,” AI models in clinical decision-making. While AI models provide the means to incorporate essentially all risk factors into comprehensive risk prediction frameworks, there remains a need to wrap these predictions in interpretable frameworks that map to our understanding of underlying biological mechanisms and associated personalized intervention. This review explores recent advances in the role of machine learning and AI in CAD primary prevention and highlights current strengths as well as limitations mediating potential future applications.
Collapse
|
11
|
Brunet TA, Ayciriex S, Arquier D, Lemoine J, Randon J, Salvador A. Scout triggered multiple reaction monitoring mass spectrometry for the rapid transfer of large multiplexed targeted methods in metabolomics. J Chromatogr B Analyt Technol Biomed Life Sci 2023; 1228:123849. [PMID: 37634392 DOI: 10.1016/j.jchromb.2023.123849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 07/20/2023] [Accepted: 08/11/2023] [Indexed: 08/29/2023]
Abstract
The field of metabolomics based on mass spectrometry has grown considerably in recent years due to the need to detect and, above all, quantify a very large number of metabolites, simultaneously. Up to now, targeted multiplexed analysis on complex samples by Liquid Chromatography coupled with tandem Mass Spectrometry (LC-MS/MS) has relied almost exclusively on compound detection based on absolute retention times, as in the Scheduled-MRM (sMRM) approach. Those methods turn out to be poorly transferable from one instrument to another and result in a time-consuming and tedious method development involving a significant number of critical parameters that need specific re-optimisation. To address this challenge, we introduce a novel acquisition mode called scout-triggered MRM (stMRM). In stMRM, a marker transition is used to trigger MS analysis for a group of dependent target analytes. These marker transitions are strategically distributed throughout the chromatographic run, and the dependent analytes are associated based on their retention times. The result is a targeted assay that remains robust even in the presence of retention time shifts. A 3 to 5-fold increase in the number of detected transitions associated to plasma metabolites was obtained when transferring from a direct application of a published sMRM to a stMRM method. This significant improvement highlights the universal applicability of the stMRM method, as it can be implemented on any LC system without the need for extensive method development. We subsequently illustrate the robustness of stMRM in modified chromatographic elution conditions. Despite a large change in metabolite's selectivity, the multiplexed assay successfully recovered 70% of the monitored transitions when consequently modifying the gradient method. These findings demonstrate the versatility and adaptability of stMRM, opening new avenues for the development of highly multiplexed LC-MS/MS methods in metabolomics. These methods are characterized by their analytical transparency and straightforward implementation using existing literature data.
Collapse
Affiliation(s)
- Thomas Alexandre Brunet
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Sophie Ayciriex
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Delphine Arquier
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Jérôme Lemoine
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Jérôme Randon
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Arnaud Salvador
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France.
| |
Collapse
|
12
|
Rathnakumar S, Kambhampati NSV, Saiswaroop R, Pradhan SS, Ramkumar G, Beeraka N, Muddu GK, Kumar S, Javvaji SK, Parangoankar A, Sivaramakrishnan V, Ramamurthy SS. Integrated clinical and metabolomic analysis of dengue infection shows molecular signatures associated with host-pathogen interaction in different phases of the disease. Metabolomics 2023; 19:47. [PMID: 37130982 DOI: 10.1007/s11306-023-02011-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 04/20/2023] [Indexed: 05/04/2023]
Abstract
PURPOSE Dengue is a mosquito vector-borne disease caused by the dengue virus, which affects 125 million people globally. The disease causes considerable morbidity. The disease, based on symptoms, is classified into three characteristic phases, which can further lead to complications in the second phase. Molecular signatures that are associated with the three phases have not been well characterized. We performed an integrated clinical and metabolomic analysis of our patient cohort and compared it with omics data from the literature to identify signatures unique to the different phases. METHODS The dengue patients are recruited by clinicians after standard-of-care diagnostic tests and evaluation of symptoms. Blood from the patients was collected. NS1 antigen, IgM, IgG antibodies, and cytokines in serum were analyzed using ELISA. Targeted metabolomics was performed using LC-MS triple quad. The results were compared with analyzed transcriptomic data from the GEO database and metabolomic data sets from the literature. RESULTS The dengue patients displayed characteristic features of the disease, including elevated NS1 levels. TNF-α was found to be elevated in all three phases compared to healthy controls. The metabolic pathways were found to be deregulated compared to healthy controls only in phases I and II of dengue patients. The pathways represent viral replication and host response mediated pathways. The major pathways include nucleotide metabolism of various amino acids and fatty acids, biotin, etc. CONCLUSION: The results show elevated TNF-α and metabolites that are characteristic of viral infection and host response. IL10 and IFN-γ were not significant, consistent with the absence of any complications.
Collapse
Affiliation(s)
- Sriram Rathnakumar
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - Naga Sai Visweswar Kambhampati
- STAR Laboratory, Central Research Instruments Facility, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - R Saiswaroop
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - Sai Sanwid Pradhan
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - G Ramkumar
- Department of General Medicine, Sri Sathya Sai General Hospital, Sri Sathya Sai Institute of Higher Medical Sciences Campus, Whitefield, Bengaluru, Karnataka, 560066, India
| | - Nirmala Beeraka
- Department of General Medicine, Sri Sathya Sai General Hospital, Sri Sathya Sai Institute of Higher Medical Sciences Campus, Whitefield, Bengaluru, Karnataka, 560066, India
| | - Gopi Krishna Muddu
- Department of Pediatrics, Sri Sathya Sai General Hospital, Puttaparthi, Andhra Pradesh, 515134, India
| | - Sandeep Kumar
- Department of General Medicine, Sri Sathya Sai General Hospital, Puttaparthi, Andhra Pradesh, 515134, India
| | - Sai Kiran Javvaji
- Department of Laboratory Medicine and Cardiology, Sri Sathya Sai Institute of Higher Medical Sciences, Whitefield, Bengaluru, Karnataka, 560066, India
| | | | - Venketesh Sivaramakrishnan
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India.
| | - Sai Sathish Ramamurthy
- STAR Laboratory, Central Research Instruments Facility, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India.
| |
Collapse
|
13
|
Sun C, Schuman E. A multi-omics view of neuronal subcellular protein synthesis. Curr Opin Neurobiol 2023; 80:102705. [PMID: 36913750 DOI: 10.1016/j.conb.2023.102705] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 02/14/2023] [Accepted: 02/15/2023] [Indexed: 03/13/2023]
Abstract
While it has long been known that protein synthesis is necessary for long-term memory in the brain, the logistics of neuronal protein synthesis is complicated by the extensive subcellular compartmentalization of the neuron. Local protein synthesis solves many of the logistic problems posed by the extreme complexity of dendritic and axonal arbors and the huge number of synapses. Here we review recent multi-omic and quantitative studies that elaborate a systems view of decentralized neuronal protein synthesis. We highlight recent insights from the transcriptomic, translatomic, and proteomic levels, discuss the nuanced logic of local protein synthesis for different protein features, and list the missing information needed to build a comprehensive logistic model for neuronal protein supply.
Collapse
Affiliation(s)
- Chao Sun
- Max Planck Institute for Brain Research, Frankfurt, Germany; Danish Research Institute of Translational Neuroscience - DANDRITE, Nordic-EMBL Partnership for Molecular Medicine, Denmark; Aarhus University, Department of Molecular Biology and Genetics, Universitetsbyen 81, 8000 Aarhus C, Denmark. https://twitter.com/LukeChaoSun
| | - Erin Schuman
- Max Planck Institute for Brain Research, Frankfurt, Germany.
| |
Collapse
|
14
|
Big Data Analysis and Application of Liver Cancer Gene Sequence Based on Second-Generation Sequencing Technology. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:4004130. [PMID: 36017150 PMCID: PMC9398858 DOI: 10.1155/2022/4004130] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 07/03/2022] [Accepted: 07/14/2022] [Indexed: 12/04/2022]
Abstract
In big data analysis with the rapid improvement of computer storage capacity and the rapid development of complex algorithms, the exponential growth of massive data has also made science and technology progress with each passing day. Based on omics data such as mRNA data, microRNA data, or DNA methylation data, this study uses traditional clustering methods such as kmeans, K-nearest neighbors, hierarchical clustering, affinity propagation, and nonnegative matrix decomposition to classify samples into categories, obtained: (1) The assumption that the attributes are independent of each other reduces the classification effect of the algorithm to a certain extent. According to the idea of multilevel grid, there is a one-to-one mapping from high-dimensional space to one-dimensional. The complexity is greatly simplified by encoding the one-dimensional grid of the hierarchical grid. The logic of the algorithm is relatively simple, and it also has a very stable classification efficiency. (2) Convert the two-dimensional representation of the data into the one-dimensional representation of the binary, realize the dimensionality reduction processing of the data, and improve the organization and storage efficiency of the data. The grid coding expresses the spatial position of the data, maintains the original organization method of the data, and does not make the abstract expression of the data object. (3) The data processing of nondiscrete and missing values provides a new opportunity for the identification of protein targets of small molecule therapy and obtains a better classification effect. (4) The comparison of the three models shows that Naive Bayes is the optimal model. Each iteration is composed of alternately expected steps and maximal steps and then identified and quantified by MS.
Collapse
|
15
|
Hill C, Avila-Palencia I, Maxwell AP, Hunter RF, McKnight AJ. Harnessing the Full Potential of Multi-Omic Analyses to Advance the Study and Treatment of Chronic Kidney Disease. FRONTIERS IN NEPHROLOGY 2022; 2:923068. [PMID: 37674991 PMCID: PMC10479694 DOI: 10.3389/fneph.2022.923068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 05/30/2022] [Indexed: 09/08/2023]
Abstract
Chronic kidney disease (CKD) was the 12th leading cause of death globally in 2017 with the prevalence of CKD estimated at ~9%. Early detection and intervention for CKD may improve patient outcomes, but standard testing approaches even in developed countries do not facilitate identification of patients at high risk of developing CKD, nor those progressing to end-stage kidney disease (ESKD). Recent advances in CKD research are moving towards a more personalised approach for CKD. Heritability for CKD ranges from 30% to 75%, yet identified genetic risk factors account for only a small proportion of the inherited contribution to CKD. More in depth analysis of genomic sequencing data in large cohorts is revealing new genetic risk factors for common diagnoses of CKD and providing novel diagnoses for rare forms of CKD. Multi-omic approaches are now being harnessed to improve our understanding of CKD and explain some of the so-called 'missing heritability'. The most common omic analyses employed for CKD are genomics, epigenomics, transcriptomics, metabolomics, proteomics and phenomics. While each of these omics have been reviewed individually, considering integrated multi-omic analysis offers considerable scope to improve our understanding and treatment of CKD. This narrative review summarises current understanding of multi-omic research alongside recent experimental and analytical approaches, discusses current challenges and future perspectives, and offers new insights for CKD.
Collapse
Affiliation(s)
| | | | | | | | - Amy Jayne McKnight
- Centre for Public Health, Queen’s University Belfast, Belfast, United Kingdom
| |
Collapse
|
16
|
Roell K, Koval LE, Boyles R, Patlewicz G, Ring C, Rider CV, Ward-Caviness C, Reif DM, Jaspers I, Fry RC, Rager JE. Development of the InTelligence And Machine LEarning (TAME) Toolkit for Introductory Data Science, Chemical-Biological Analyses, Predictive Modeling, and Database Mining for Environmental Health Research. FRONTIERS IN TOXICOLOGY 2022; 4:893924. [PMID: 35812168 PMCID: PMC9257219 DOI: 10.3389/ftox.2022.893924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 05/30/2022] [Indexed: 01/09/2023] Open
Abstract
Research in environmental health is becoming increasingly reliant upon data science and computational methods that can more efficiently extract information from complex datasets. Data science and computational methods can be leveraged to better identify relationships between exposures to stressors in the environment and human disease outcomes, representing critical information needed to protect and improve global public health. Still, there remains a critical gap surrounding the training of researchers on these in silico methods. We aimed to address this gap by developing the inTelligence And Machine lEarning (TAME) Toolkit, promoting trainee-driven data generation, management, and analysis methods to “TAME” data in environmental health studies. Training modules were developed to provide applications-driven examples of data organization and analysis methods that can be used to address environmental health questions. Target audiences for these modules include students, post-baccalaureate and post-doctorate trainees, and professionals that are interested in expanding their skillset to include recent advances in data analysis methods relevant to environmental health, toxicology, exposure science, epidemiology, and bioinformatics/cheminformatics. Modules were developed by study coauthors using annotated script and were organized into three chapters within a GitHub Bookdown site. The first chapter of modules focuses on introductory data science, which includes the following topics: setting up R/RStudio and coding in the R environment; data organization basics; finding and visualizing data trends; high-dimensional data visualizations; and Findability, Accessibility, Interoperability, and Reusability (FAIR) data management practices. The second chapter of modules incorporates chemical-biological analyses and predictive modeling, spanning the following methods: dose-response modeling; machine learning and predictive modeling; mixtures analyses; -omics analyses; toxicokinetic modeling; and read-across toxicity predictions. The last chapter of modules was organized to provide examples on environmental health database mining and integration, including chemical exposure, health outcome, and environmental justice indicators. Training modules and associated data are publicly available online (https://uncsrp.github.io/Data-Analysis-Training-Modules/). Together, this resource provides unique opportunities to obtain introductory-level training on current data analysis methods applicable to 21st century science and environmental health.
Collapse
Affiliation(s)
- Kyle Roell
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Lauren E. Koval
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Rebecca Boyles
- Research Computing, RTI International, Durham, NC, United States
| | - Grace Patlewicz
- Center for Computational Toxicology and Exposure, US Environmental Protection Agency, Durham, NC, United States
| | - Caroline Ring
- Center for Computational Toxicology and Exposure, US Environmental Protection Agency, Durham, NC, United States
| | - Cynthia V. Rider
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, Durham, NC, United States
| | - Cavin Ward-Caviness
- Center for Public Health and Environmental Assessment, US Environmental Protection Agency, Chapel Hill, NC, United States
| | - David M. Reif
- Bioinformatics Research Center, Department of Biological Sciences, North Carolina State University, Raleigh, NC, United States
| | - Ilona Jaspers
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- Department of Pediatrics, Microbiology and Immunology, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
| | - Rebecca C. Fry
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
| | - Julia E. Rager
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- *Correspondence: Julia E. Rager,
| |
Collapse
|
17
|
Heacock ML, Lopez AR, Amolegbe SM, Carlin DJ, Henry HF, Trottier BA, Velasco ML, Suk WA. Enhancing Data Integration, Interoperability, and Reuse to Address Complex and Emerging Environmental Health Problems. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:7544-7552. [PMID: 35549252 PMCID: PMC9227711 DOI: 10.1021/acs.est.1c08383] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Indexed: 05/21/2023]
Abstract
Environmental health sciences (EHS) span many diverse disciplines. Within the EHS community, the National Institute of Environmental Health Sciences Superfund Research Program (SRP) funds multidisciplinary research aimed to address pressing and complex issues on how people are exposed to hazardous substances and their related health consequences with the goal of identifying strategies to reduce exposures and protect human health. While disentangling the interrelationships that contribute to environmental exposures and their effects on human health over the course of life remains difficult, advances in data science and data sharing offer a path forward to explore data across disciplines to reveal new insights. Multidisciplinary SRP-funded teams are well-positioned to examine how to best integrate EHS data across diverse research domains to address multifaceted environmental health problems. As such, SRP supported collaborative research projects designed to foster and enhance the interoperability and reuse of diverse and complex data streams. This perspective synthesizes those experiences as a landscape view of the challenges identified while working to increase the FAIR-ness (Findable, Accessible, Interoperable, and Reusable) of EHS data and opportunities to address them.
Collapse
Affiliation(s)
- Michelle L. Heacock
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
- . Tel: 984-287-3267
| | | | - Sara M. Amolegbe
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | - Danielle J. Carlin
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | - Heather F. Henry
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | - Brittany A. Trottier
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | | | - William A. Suk
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| |
Collapse
|
18
|
Jiang W, Jones JC, Shankavaram U, Sproull M, Camphausen K, Krauze AV. Analytical Considerations of Large-Scale Aptamer-Based Datasets for Translational Applications. Cancers (Basel) 2022; 14:2227. [PMID: 35565358 PMCID: PMC9105298 DOI: 10.3390/cancers14092227] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/15/2022] [Accepted: 04/18/2022] [Indexed: 11/17/2022] Open
Abstract
The development and advancement of aptamer technology has opened a new realm of possibilities for unlocking the biocomplexity available within proteomics. With ultra-high-throughput and multiplexing, alongside remarkable specificity and sensitivity, aptamers could represent a powerful tool in disease-specific research, such as supporting the discovery and validation of clinically relevant biomarkers. One of the fundamental challenges underlying past and current proteomic technology has been the difficulty of translating proteomic datasets into standards of practice. Aptamers provide the capacity to generate single panels that span over 7000 different proteins from a singular sample. However, as a recent technology, they also present unique challenges, as the field of translational aptamer-based proteomics still lacks a standardizing methodology for analyzing these large datasets and the novel considerations that must be made in response to the differentiation amongst current proteomic platforms and aptamers. We address these analytical considerations with respect to surveying initial data, deploying proper statistical methodologies to identify differential protein expressions, and applying datasets to discover multimarker and pathway-level findings. Additionally, we present aptamer datasets within the multi-omics landscape by exploring the intersectionality of aptamer-based proteomics amongst genomics, transcriptomics, and metabolomics, alongside pre-existing proteomic platforms. Understanding the broader applications of aptamer datasets will substantially enhance current efforts to generate translatable findings for the clinic.
Collapse
Affiliation(s)
- Will Jiang
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Jennifer C. Jones
- Translational Nanobiology Section, Laboratory of Pathology, NIH/NCI/CCR, Bethesda, MD 20892, USA;
| | - Uma Shankavaram
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Mary Sproull
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Kevin Camphausen
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Andra V. Krauze
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| |
Collapse
|
19
|
Li H, Uittenbogaard M, Navarro R, Ahmed M, Gropman A, Chiaramello A, Hao L. Integrated proteomic and metabolomic analyses of the mitochondrial neurodegenerative disease MELAS. Mol Omics 2022; 18:196-205. [PMID: 34982085 PMCID: PMC11334596 DOI: 10.1039/d1mo00416f] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
MELAS (mitochondrial encephalomyopathy, lactic acidosis, stroke-like episodes) is a progressive neurodegenerative disease caused by pathogenic mitochondrial DNA variants. The pathogenic mechanism of MELAS remains enigmatic due to the exceptional clinical heterogeneity and the obscure genotype-phenotype correlation among MELAS patients. To gain insights into the pathogenic signature of MELAS, we designed a comprehensive strategy integrating proteomics and metabolomics in patient-derived dermal fibroblasts harboring the ultra-rare MELAS pathogenic variant m.14453G>A, specifically affecting the mitochondrial respiratory complex I. Global proteomics was achieved by data-dependent acquisition (DDA) and verified by data-independent acquisition (DIA) using both Spectronaut and the recently launched MaxDIA platforms. Comprehensive metabolite coverage was achieved for both polar and nonpolar metabolites in both reverse phase and HILIC LC-MS/MS analyses. Our proof-of-principle MELAS study with multi-omics integration revealed OXPHOS dysregulation with a predominant deficiency of complex I subunits, as well as alterations in key bioenergetic pathways, glycolysis, tricarboxylic acid cycle, and fatty acid β-oxidation. The most clinically relevant discovery is the downregulation of the arginine biosynthesis pathway, likely due to blocked argininosuccinate synthase, which is congruent with the MELAS cardinal symptom of stroke-like episodes and its current treatment by arginine infusion. In conclusion, we demonstrated an integrated proteomic and metabolomic strategy for patient-derived fibroblasts, which has great clinical potential to discover therapeutic targets and design personalized interventions after validation with a larger patient cohort in the future.
Collapse
Affiliation(s)
- Haorong Li
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Martine Uittenbogaard
- Department of Anatomy and Cell Biology, George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA
| | - Ryan Navarro
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Mustafa Ahmed
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Andrea Gropman
- Division of Neurogenetics and Neurodevelopmental Pediatrics, Children's National Medical Center, Washington, DC 20010, USA
| | - Anne Chiaramello
- Department of Anatomy and Cell Biology, George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA
| | - Ling Hao
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| |
Collapse
|
20
|
Diaz-Flores E, Meyer T, Giorkallos A. Evolution of Artificial Intelligence-Powered Technologies in Biomedical Research and Healthcare. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2022; 182:23-60. [DOI: 10.1007/10_2021_189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|