1
|
Yadegar A, Bar-Yoseph H, Monaghan TM, Pakpour S, Severino A, Kuijper EJ, Smits WK, Terveer EM, Neupane S, Nabavi-Rad A, Sadeghi J, Cammarota G, Ianiro G, Nap-Hill E, Leung D, Wong K, Kao D. Fecal microbiota transplantation: current challenges and future landscapes. Clin Microbiol Rev 2024; 37:e0006022. [PMID: 38717124 DOI: 10.1128/cmr.00060-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2024] Open
Abstract
SUMMARYGiven the importance of gut microbial homeostasis in maintaining health, there has been considerable interest in developing innovative therapeutic strategies for restoring gut microbiota. One such approach, fecal microbiota transplantation (FMT), is the main "whole gut microbiome replacement" strategy and has been integrated into clinical practice guidelines for treating recurrent Clostridioides difficile infection (rCDI). Furthermore, the potential application of FMT in other indications such as inflammatory bowel disease (IBD), metabolic syndrome, and solid tumor malignancies is an area of intense interest and active research. However, the complex and variable nature of FMT makes it challenging to address its precise functionality and to assess clinical efficacy and safety in different disease contexts. In this review, we outline clinical applications, efficacy, durability, and safety of FMT and provide a comprehensive assessment of its procedural and administration aspects. The clinical applications of FMT in children and cancer immunotherapy are also described. We focus on data from human studies in IBD in contrast with rCDI to delineate the putative mechanisms of this treatment in IBD as a model, including colonization resistance and functional restoration through bacterial engraftment, modulating effects of virome/phageome, gut metabolome and host interactions, and immunoregulatory actions of FMT. Furthermore, we comprehensively review omics technologies, metagenomic approaches, and bioinformatics pipelines to characterize complex microbial communities and discuss their limitations. FMT regulatory challenges, ethical considerations, and pharmacomicrobiomics are also highlighted to shed light on future development of tailored microbiome-based therapeutics.
Collapse
Affiliation(s)
- Abbas Yadegar
- Foodborne and Waterborne Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Haggai Bar-Yoseph
- Department of Gastroenterology, Rambam Health Care Campus, Haifa, Israel
- Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
| | - Tanya Marie Monaghan
- National Institute for Health Research Nottingham Biomedical Research Centre, University of Nottingham, Nottingham, United Kingdom
- Nottingham Digestive Diseases Centre, School of Medicine, University of Nottingham, Nottingham, United Kingdom
| | - Sepideh Pakpour
- School of Engineering, Faculty of Applied Sciences, UBC, Okanagan Campus, Kelowna, British Columbia, Canada
| | - Andrea Severino
- Department of Translational Medicine and Surgery, Università Cattolica del Sacro Cuore, Rome, Italy
- Department of Medical and Surgical Sciences, UOC CEMAD Centro Malattie dell'Apparato Digerente, Medicina Interna e Gastroenterologia, Fondazione Policlinico Universitario Gemelli IRCCS, Rome, Italy
- Department of Medical and Surgical Sciences, UOC Gastroenterologia, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy
| | - Ed J Kuijper
- Center for Microbiota Analysis and Therapeutics (CMAT), Leiden University Center for Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands
| | - Wiep Klaas Smits
- Center for Microbiota Analysis and Therapeutics (CMAT), Leiden University Center for Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands
| | - Elisabeth M Terveer
- Center for Microbiota Analysis and Therapeutics (CMAT), Leiden University Center for Infectious Diseases, Leiden University Medical Center, Leiden, The Netherlands
| | - Sukanya Neupane
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Ali Nabavi-Rad
- Foodborne and Waterborne Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Javad Sadeghi
- School of Engineering, Faculty of Applied Sciences, UBC, Okanagan Campus, Kelowna, British Columbia, Canada
| | - Giovanni Cammarota
- Department of Translational Medicine and Surgery, Università Cattolica del Sacro Cuore, Rome, Italy
- Department of Medical and Surgical Sciences, UOC CEMAD Centro Malattie dell'Apparato Digerente, Medicina Interna e Gastroenterologia, Fondazione Policlinico Universitario Gemelli IRCCS, Rome, Italy
- Department of Medical and Surgical Sciences, UOC Gastroenterologia, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy
| | - Gianluca Ianiro
- Department of Translational Medicine and Surgery, Università Cattolica del Sacro Cuore, Rome, Italy
- Department of Medical and Surgical Sciences, UOC CEMAD Centro Malattie dell'Apparato Digerente, Medicina Interna e Gastroenterologia, Fondazione Policlinico Universitario Gemelli IRCCS, Rome, Italy
- Department of Medical and Surgical Sciences, UOC Gastroenterologia, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy
| | - Estello Nap-Hill
- Department of Medicine, Division of Gastroenterology, St Paul's Hospital, University of British Columbia, Vancouver, British Columbia, Canada
| | - Dickson Leung
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Karen Wong
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Dina Kao
- Division of Gastroenterology, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
2
|
Liu Z, Zhou Y, Wang H, Liu C, Wang L. Recent advances in understanding the fitness and survival mechanisms of Vibrio parahaemolyticus. Int J Food Microbiol 2024; 417:110691. [PMID: 38631283 DOI: 10.1016/j.ijfoodmicro.2024.110691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 03/14/2024] [Accepted: 04/02/2024] [Indexed: 04/19/2024]
Abstract
The presence of Vibrio parahaemolyticus (Vp) in different production stages of seafood has generated negative impacts on both public health and the sustainability of the industry. To further better investigate the fitness of Vp at the phenotypical level, a great number of studies have been conducted in recent years using plate counting methods. In the meantime, with the increasing accessibility of the next generation sequencing and the advances in analytical chemistry techniques, omics-oriented biotechnologies have further advanced our knowledge in the survival and virulence mechanisms of Vp at various molecular levels. These observations provide insights to guide the development of novel prevention and control strategies and benefit the monitoring and mitigation of food safety risks associated with Vp contamination. To timely capture these recent advances, this review firstly summarizes the most recent phenotypical level studies and provide insights about the survival of Vp under important in vitro stresses and on aquatic products. After that, molecular survival mechanisms of Vp at transcriptomic and proteomic levels are summarized and discussed. Looking forward, other newer omics-biotechnology such as metabolomics and secretomics show great potential to be used for confirming the cellular responses of Vp. Powerful data mining tools from the field of machine learning and artificial intelligence, that can better utilize the omics data and solve complex problems in the processing, analysis, and interpretation of omics data, will further improve our mechanistic understanding of Vp.
Collapse
Affiliation(s)
- Zhuosheng Liu
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA
| | - Yi Zhou
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA
| | - Hongye Wang
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA
| | - Chengchu Liu
- University of Maryland Sea Grant Extension Program, UMES Center for Food Science and Technology, Princess Anne, MD, United States
| | - Luxin Wang
- Department of Food Science and Technology, University of California Davis, Davis, CA 95618, USA.
| |
Collapse
|
3
|
Lac L, Leung CK, Hu P. Computational frameworks integrating deep learning and statistical models in mining multimodal omics data. J Biomed Inform 2024; 152:104629. [PMID: 38552994 DOI: 10.1016/j.jbi.2024.104629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 02/26/2024] [Accepted: 03/25/2024] [Indexed: 04/04/2024]
Abstract
BACKGROUND In health research, multimodal omics data analysis is widely used to address important clinical and biological questions. Traditional statistical methods rely on the strong assumptions of distribution. Statistical methods such as testing and differential expression are commonly used in omics analysis. Deep learning, on the other hand, is an advanced computer science technique that is powerful in mining high-dimensional omics data for prediction tasks. Recently, integrative frameworks or methods have been developed for omics studies that combine statistical models and deep learning algorithms. METHODS AND RESULTS The aim of these integrative frameworks is to combine the strengths of both statistical methods and deep learning algorithms to improve prediction accuracy while also providing interpretability and explainability. This review report discusses the current state-of-the-art integrative frameworks, their limitations, and potential future directions in survival and time-to-event longitudinal analysis, dimension reduction and clustering, regression and classification, feature selection, and causal and transfer learning.
Collapse
Affiliation(s)
- Leann Lac
- Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada; Department of Statistics, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Carson K Leung
- Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada
| | - Pingzhao Hu
- Department of Computer Science, University of Manitoba, Winnipeg, Manitoba, Canada; Department of Biochemistry, Western University, London, Ontario, Canada; Department of Computer Science, Western University, London, Ontario, Canada; Department of Oncology, Western University, London, Ontario, Canada; Department of Epidemiology and Biostatistics, Western University, London, Ontario, Canada; The Children's Health Research Institute, Lawson Health Research Institute, London, Ontario, Canada.
| |
Collapse
|
4
|
Banerjee M, Srivastava S, Rai SN, States JC. Chronic arsenic exposure induces malignant transformation of human HaCaT cells through both deterministic and stochastic changes in transcriptome expression. Toxicol Appl Pharmacol 2024; 484:116865. [PMID: 38373578 PMCID: PMC10994602 DOI: 10.1016/j.taap.2024.116865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 02/11/2024] [Accepted: 02/13/2024] [Indexed: 02/21/2024]
Abstract
Biological processes are inherently stochastic, i.e., are partially driven by hard to predict random probabilistic processes. Carcinogenesis is driven both by stochastic and deterministic (predictable non-random) changes. However, very few studies systematically examine the contribution of stochastic events leading to cancer development. In differential gene expression studies, the established data analysis paradigms incentivize expression changes that are uniformly different across the experimental versus control groups, introducing preferential inclusion of deterministic changes at the expense of stochastic processes that might also play a crucial role in the process of carcinogenesis. In this study, we applied simple computational techniques to quantify: (i) The impact of chronic arsenic (iAs) exposure as well as passaging time on stochastic gene expression and (ii) Which genes were expressed deterministically and which were expressed stochastically at each of the three stages of cancer development. Using biological coefficient of variation as an empirical measure of stochasticity we demonstrate that chronic iAs exposure consistently suppressed passaging related stochastic gene expression at multiple time points tested, selecting for a homogenous cell population that undergo transformation. Employing multiple balanced removal of outlier data, we show that chronic iAs exposure induced deterministic and stochastic changes in the expression of unique set of genes, that populate largely unique biological pathways. Together, our data unequivocally demonstrate that both deterministic and stochastic changes in transcriptome-wide expression are critical in driving biological processes, pathways and networks towards clonal selection, carcinogenesis, and tumor heterogeneity.
Collapse
Affiliation(s)
- Mayukh Banerjee
- Department of Pharmacology and Toxicology, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - Sudhir Srivastava
- Department of Bioinformatics and Biostatistics, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - Shesh N Rai
- Department of Bioinformatics and Biostatistics, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Biostatistics and Bioinformatics Facility, James Graham Brown Cancer Center, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Biostatistics and Informatics Facility Core, Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - J Christopher States
- Department of Pharmacology and Toxicology, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA.
| |
Collapse
|
5
|
Brunet TA, Ayciriex S, Arquier D, Lemoine J, Randon J, Salvador A. Scout triggered multiple reaction monitoring mass spectrometry for the rapid transfer of large multiplexed targeted methods in metabolomics. J Chromatogr B Analyt Technol Biomed Life Sci 2023; 1228:123849. [PMID: 37634392 DOI: 10.1016/j.jchromb.2023.123849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 07/20/2023] [Accepted: 08/11/2023] [Indexed: 08/29/2023]
Abstract
The field of metabolomics based on mass spectrometry has grown considerably in recent years due to the need to detect and, above all, quantify a very large number of metabolites, simultaneously. Up to now, targeted multiplexed analysis on complex samples by Liquid Chromatography coupled with tandem Mass Spectrometry (LC-MS/MS) has relied almost exclusively on compound detection based on absolute retention times, as in the Scheduled-MRM (sMRM) approach. Those methods turn out to be poorly transferable from one instrument to another and result in a time-consuming and tedious method development involving a significant number of critical parameters that need specific re-optimisation. To address this challenge, we introduce a novel acquisition mode called scout-triggered MRM (stMRM). In stMRM, a marker transition is used to trigger MS analysis for a group of dependent target analytes. These marker transitions are strategically distributed throughout the chromatographic run, and the dependent analytes are associated based on their retention times. The result is a targeted assay that remains robust even in the presence of retention time shifts. A 3 to 5-fold increase in the number of detected transitions associated to plasma metabolites was obtained when transferring from a direct application of a published sMRM to a stMRM method. This significant improvement highlights the universal applicability of the stMRM method, as it can be implemented on any LC system without the need for extensive method development. We subsequently illustrate the robustness of stMRM in modified chromatographic elution conditions. Despite a large change in metabolite's selectivity, the multiplexed assay successfully recovered 70% of the monitored transitions when consequently modifying the gradient method. These findings demonstrate the versatility and adaptability of stMRM, opening new avenues for the development of highly multiplexed LC-MS/MS methods in metabolomics. These methods are characterized by their analytical transparency and straightforward implementation using existing literature data.
Collapse
Affiliation(s)
- Thomas Alexandre Brunet
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Sophie Ayciriex
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Delphine Arquier
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Jérôme Lemoine
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Jérôme Randon
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France
| | - Arnaud Salvador
- Univ Lyon, CNRS, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR 5280, 5 rue de la Doua, F-69100 Villeurbanne, France.
| |
Collapse
|
6
|
Rathnakumar S, Kambhampati NSV, Saiswaroop R, Pradhan SS, Ramkumar G, Beeraka N, Muddu GK, Kumar S, Javvaji SK, Parangoankar A, Sivaramakrishnan V, Ramamurthy SS. Integrated clinical and metabolomic analysis of dengue infection shows molecular signatures associated with host-pathogen interaction in different phases of the disease. Metabolomics 2023; 19:47. [PMID: 37130982 DOI: 10.1007/s11306-023-02011-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 04/20/2023] [Indexed: 05/04/2023]
Abstract
PURPOSE Dengue is a mosquito vector-borne disease caused by the dengue virus, which affects 125 million people globally. The disease causes considerable morbidity. The disease, based on symptoms, is classified into three characteristic phases, which can further lead to complications in the second phase. Molecular signatures that are associated with the three phases have not been well characterized. We performed an integrated clinical and metabolomic analysis of our patient cohort and compared it with omics data from the literature to identify signatures unique to the different phases. METHODS The dengue patients are recruited by clinicians after standard-of-care diagnostic tests and evaluation of symptoms. Blood from the patients was collected. NS1 antigen, IgM, IgG antibodies, and cytokines in serum were analyzed using ELISA. Targeted metabolomics was performed using LC-MS triple quad. The results were compared with analyzed transcriptomic data from the GEO database and metabolomic data sets from the literature. RESULTS The dengue patients displayed characteristic features of the disease, including elevated NS1 levels. TNF-α was found to be elevated in all three phases compared to healthy controls. The metabolic pathways were found to be deregulated compared to healthy controls only in phases I and II of dengue patients. The pathways represent viral replication and host response mediated pathways. The major pathways include nucleotide metabolism of various amino acids and fatty acids, biotin, etc. CONCLUSION: The results show elevated TNF-α and metabolites that are characteristic of viral infection and host response. IL10 and IFN-γ were not significant, consistent with the absence of any complications.
Collapse
Affiliation(s)
- Sriram Rathnakumar
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - Naga Sai Visweswar Kambhampati
- STAR Laboratory, Central Research Instruments Facility, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - R Saiswaroop
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - Sai Sanwid Pradhan
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India
| | - G Ramkumar
- Department of General Medicine, Sri Sathya Sai General Hospital, Sri Sathya Sai Institute of Higher Medical Sciences Campus, Whitefield, Bengaluru, Karnataka, 560066, India
| | - Nirmala Beeraka
- Department of General Medicine, Sri Sathya Sai General Hospital, Sri Sathya Sai Institute of Higher Medical Sciences Campus, Whitefield, Bengaluru, Karnataka, 560066, India
| | - Gopi Krishna Muddu
- Department of Pediatrics, Sri Sathya Sai General Hospital, Puttaparthi, Andhra Pradesh, 515134, India
| | - Sandeep Kumar
- Department of General Medicine, Sri Sathya Sai General Hospital, Puttaparthi, Andhra Pradesh, 515134, India
| | - Sai Kiran Javvaji
- Department of Laboratory Medicine and Cardiology, Sri Sathya Sai Institute of Higher Medical Sciences, Whitefield, Bengaluru, Karnataka, 560066, India
| | | | - Venketesh Sivaramakrishnan
- Disease Biology Lab, Department of Biosciences, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India.
| | - Sai Sathish Ramamurthy
- STAR Laboratory, Central Research Instruments Facility, Sri Sathya Sai Institute of Higher Learning, Prasanthi Nilayam, Puttaparthi, Andhra Pradesh, 515134, India.
| |
Collapse
|
7
|
Sun C, Schuman E. A multi-omics view of neuronal subcellular protein synthesis. Curr Opin Neurobiol 2023; 80:102705. [PMID: 36913750 DOI: 10.1016/j.conb.2023.102705] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 02/14/2023] [Accepted: 02/15/2023] [Indexed: 03/13/2023]
Abstract
While it has long been known that protein synthesis is necessary for long-term memory in the brain, the logistics of neuronal protein synthesis is complicated by the extensive subcellular compartmentalization of the neuron. Local protein synthesis solves many of the logistic problems posed by the extreme complexity of dendritic and axonal arbors and the huge number of synapses. Here we review recent multi-omic and quantitative studies that elaborate a systems view of decentralized neuronal protein synthesis. We highlight recent insights from the transcriptomic, translatomic, and proteomic levels, discuss the nuanced logic of local protein synthesis for different protein features, and list the missing information needed to build a comprehensive logistic model for neuronal protein supply.
Collapse
Affiliation(s)
- Chao Sun
- Max Planck Institute for Brain Research, Frankfurt, Germany; Danish Research Institute of Translational Neuroscience - DANDRITE, Nordic-EMBL Partnership for Molecular Medicine, Denmark; Aarhus University, Department of Molecular Biology and Genetics, Universitetsbyen 81, 8000 Aarhus C, Denmark. https://twitter.com/LukeChaoSun
| | - Erin Schuman
- Max Planck Institute for Brain Research, Frankfurt, Germany.
| |
Collapse
|
8
|
Big Data Analysis and Application of Liver Cancer Gene Sequence Based on Second-Generation Sequencing Technology. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:4004130. [PMID: 36017150 PMCID: PMC9398858 DOI: 10.1155/2022/4004130] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 07/03/2022] [Accepted: 07/14/2022] [Indexed: 12/04/2022]
Abstract
In big data analysis with the rapid improvement of computer storage capacity and the rapid development of complex algorithms, the exponential growth of massive data has also made science and technology progress with each passing day. Based on omics data such as mRNA data, microRNA data, or DNA methylation data, this study uses traditional clustering methods such as kmeans, K-nearest neighbors, hierarchical clustering, affinity propagation, and nonnegative matrix decomposition to classify samples into categories, obtained: (1) The assumption that the attributes are independent of each other reduces the classification effect of the algorithm to a certain extent. According to the idea of multilevel grid, there is a one-to-one mapping from high-dimensional space to one-dimensional. The complexity is greatly simplified by encoding the one-dimensional grid of the hierarchical grid. The logic of the algorithm is relatively simple, and it also has a very stable classification efficiency. (2) Convert the two-dimensional representation of the data into the one-dimensional representation of the binary, realize the dimensionality reduction processing of the data, and improve the organization and storage efficiency of the data. The grid coding expresses the spatial position of the data, maintains the original organization method of the data, and does not make the abstract expression of the data object. (3) The data processing of nondiscrete and missing values provides a new opportunity for the identification of protein targets of small molecule therapy and obtains a better classification effect. (4) The comparison of the three models shows that Naive Bayes is the optimal model. Each iteration is composed of alternately expected steps and maximal steps and then identified and quantified by MS.
Collapse
|
9
|
Hill C, Avila-Palencia I, Maxwell AP, Hunter RF, McKnight AJ. Harnessing the Full Potential of Multi-Omic Analyses to Advance the Study and Treatment of Chronic Kidney Disease. FRONTIERS IN NEPHROLOGY 2022; 2:923068. [PMID: 37674991 PMCID: PMC10479694 DOI: 10.3389/fneph.2022.923068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 05/30/2022] [Indexed: 09/08/2023]
Abstract
Chronic kidney disease (CKD) was the 12th leading cause of death globally in 2017 with the prevalence of CKD estimated at ~9%. Early detection and intervention for CKD may improve patient outcomes, but standard testing approaches even in developed countries do not facilitate identification of patients at high risk of developing CKD, nor those progressing to end-stage kidney disease (ESKD). Recent advances in CKD research are moving towards a more personalised approach for CKD. Heritability for CKD ranges from 30% to 75%, yet identified genetic risk factors account for only a small proportion of the inherited contribution to CKD. More in depth analysis of genomic sequencing data in large cohorts is revealing new genetic risk factors for common diagnoses of CKD and providing novel diagnoses for rare forms of CKD. Multi-omic approaches are now being harnessed to improve our understanding of CKD and explain some of the so-called 'missing heritability'. The most common omic analyses employed for CKD are genomics, epigenomics, transcriptomics, metabolomics, proteomics and phenomics. While each of these omics have been reviewed individually, considering integrated multi-omic analysis offers considerable scope to improve our understanding and treatment of CKD. This narrative review summarises current understanding of multi-omic research alongside recent experimental and analytical approaches, discusses current challenges and future perspectives, and offers new insights for CKD.
Collapse
Affiliation(s)
| | | | | | | | - Amy Jayne McKnight
- Centre for Public Health, Queen’s University Belfast, Belfast, United Kingdom
| |
Collapse
|
10
|
Roell K, Koval LE, Boyles R, Patlewicz G, Ring C, Rider CV, Ward-Caviness C, Reif DM, Jaspers I, Fry RC, Rager JE. Development of the InTelligence And Machine LEarning (TAME) Toolkit for Introductory Data Science, Chemical-Biological Analyses, Predictive Modeling, and Database Mining for Environmental Health Research. FRONTIERS IN TOXICOLOGY 2022; 4:893924. [PMID: 35812168 PMCID: PMC9257219 DOI: 10.3389/ftox.2022.893924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 05/30/2022] [Indexed: 01/09/2023] Open
Abstract
Research in environmental health is becoming increasingly reliant upon data science and computational methods that can more efficiently extract information from complex datasets. Data science and computational methods can be leveraged to better identify relationships between exposures to stressors in the environment and human disease outcomes, representing critical information needed to protect and improve global public health. Still, there remains a critical gap surrounding the training of researchers on these in silico methods. We aimed to address this gap by developing the inTelligence And Machine lEarning (TAME) Toolkit, promoting trainee-driven data generation, management, and analysis methods to “TAME” data in environmental health studies. Training modules were developed to provide applications-driven examples of data organization and analysis methods that can be used to address environmental health questions. Target audiences for these modules include students, post-baccalaureate and post-doctorate trainees, and professionals that are interested in expanding their skillset to include recent advances in data analysis methods relevant to environmental health, toxicology, exposure science, epidemiology, and bioinformatics/cheminformatics. Modules were developed by study coauthors using annotated script and were organized into three chapters within a GitHub Bookdown site. The first chapter of modules focuses on introductory data science, which includes the following topics: setting up R/RStudio and coding in the R environment; data organization basics; finding and visualizing data trends; high-dimensional data visualizations; and Findability, Accessibility, Interoperability, and Reusability (FAIR) data management practices. The second chapter of modules incorporates chemical-biological analyses and predictive modeling, spanning the following methods: dose-response modeling; machine learning and predictive modeling; mixtures analyses; -omics analyses; toxicokinetic modeling; and read-across toxicity predictions. The last chapter of modules was organized to provide examples on environmental health database mining and integration, including chemical exposure, health outcome, and environmental justice indicators. Training modules and associated data are publicly available online (https://uncsrp.github.io/Data-Analysis-Training-Modules/). Together, this resource provides unique opportunities to obtain introductory-level training on current data analysis methods applicable to 21st century science and environmental health.
Collapse
Affiliation(s)
- Kyle Roell
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Lauren E. Koval
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Rebecca Boyles
- Research Computing, RTI International, Durham, NC, United States
| | - Grace Patlewicz
- Center for Computational Toxicology and Exposure, US Environmental Protection Agency, Durham, NC, United States
| | - Caroline Ring
- Center for Computational Toxicology and Exposure, US Environmental Protection Agency, Durham, NC, United States
| | - Cynthia V. Rider
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, Durham, NC, United States
| | - Cavin Ward-Caviness
- Center for Public Health and Environmental Assessment, US Environmental Protection Agency, Chapel Hill, NC, United States
| | - David M. Reif
- Bioinformatics Research Center, Department of Biological Sciences, North Carolina State University, Raleigh, NC, United States
| | - Ilona Jaspers
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- Department of Pediatrics, Microbiology and Immunology, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
| | - Rebecca C. Fry
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
| | - Julia E. Rager
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina, Chapel Hill, NC, United States
- *Correspondence: Julia E. Rager,
| |
Collapse
|
11
|
Heacock ML, Lopez AR, Amolegbe SM, Carlin DJ, Henry HF, Trottier BA, Velasco ML, Suk WA. Enhancing Data Integration, Interoperability, and Reuse to Address Complex and Emerging Environmental Health Problems. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:7544-7552. [PMID: 35549252 PMCID: PMC9227711 DOI: 10.1021/acs.est.1c08383] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Indexed: 05/21/2023]
Abstract
Environmental health sciences (EHS) span many diverse disciplines. Within the EHS community, the National Institute of Environmental Health Sciences Superfund Research Program (SRP) funds multidisciplinary research aimed to address pressing and complex issues on how people are exposed to hazardous substances and their related health consequences with the goal of identifying strategies to reduce exposures and protect human health. While disentangling the interrelationships that contribute to environmental exposures and their effects on human health over the course of life remains difficult, advances in data science and data sharing offer a path forward to explore data across disciplines to reveal new insights. Multidisciplinary SRP-funded teams are well-positioned to examine how to best integrate EHS data across diverse research domains to address multifaceted environmental health problems. As such, SRP supported collaborative research projects designed to foster and enhance the interoperability and reuse of diverse and complex data streams. This perspective synthesizes those experiences as a landscape view of the challenges identified while working to increase the FAIR-ness (Findable, Accessible, Interoperable, and Reusable) of EHS data and opportunities to address them.
Collapse
Affiliation(s)
- Michelle L. Heacock
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
- . Tel: 984-287-3267
| | | | - Sara M. Amolegbe
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | - Danielle J. Carlin
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | - Heather F. Henry
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | - Brittany A. Trottier
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| | | | - William A. Suk
- Superfund
Research Program, National Institute of Environmental Health Sciences
(NIEHS), National Institutes
of Health (NIH), Department of Health and Human Services (DHHS), Research Triangle Park, North Carolina 27709, United States
| |
Collapse
|
12
|
Jiang W, Jones JC, Shankavaram U, Sproull M, Camphausen K, Krauze AV. Analytical Considerations of Large-Scale Aptamer-Based Datasets for Translational Applications. Cancers (Basel) 2022; 14:2227. [PMID: 35565358 PMCID: PMC9105298 DOI: 10.3390/cancers14092227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/15/2022] [Accepted: 04/18/2022] [Indexed: 11/17/2022] Open
Abstract
The development and advancement of aptamer technology has opened a new realm of possibilities for unlocking the biocomplexity available within proteomics. With ultra-high-throughput and multiplexing, alongside remarkable specificity and sensitivity, aptamers could represent a powerful tool in disease-specific research, such as supporting the discovery and validation of clinically relevant biomarkers. One of the fundamental challenges underlying past and current proteomic technology has been the difficulty of translating proteomic datasets into standards of practice. Aptamers provide the capacity to generate single panels that span over 7000 different proteins from a singular sample. However, as a recent technology, they also present unique challenges, as the field of translational aptamer-based proteomics still lacks a standardizing methodology for analyzing these large datasets and the novel considerations that must be made in response to the differentiation amongst current proteomic platforms and aptamers. We address these analytical considerations with respect to surveying initial data, deploying proper statistical methodologies to identify differential protein expressions, and applying datasets to discover multimarker and pathway-level findings. Additionally, we present aptamer datasets within the multi-omics landscape by exploring the intersectionality of aptamer-based proteomics amongst genomics, transcriptomics, and metabolomics, alongside pre-existing proteomic platforms. Understanding the broader applications of aptamer datasets will substantially enhance current efforts to generate translatable findings for the clinic.
Collapse
Affiliation(s)
- Will Jiang
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Jennifer C. Jones
- Translational Nanobiology Section, Laboratory of Pathology, NIH/NCI/CCR, Bethesda, MD 20892, USA;
| | - Uma Shankavaram
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Mary Sproull
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Kevin Camphausen
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| | - Andra V. Krauze
- Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Building 10, CRC, Bethesda, MD 20892, USA; (W.J.); (U.S.); (M.S.); (K.C.)
| |
Collapse
|
13
|
Li H, Uittenbogaard M, Navarro R, Ahmed M, Gropman A, Chiaramello A, Hao L. Integrated proteomic and metabolomic analyses of the mitochondrial neurodegenerative disease MELAS. Mol Omics 2022; 18:196-205. [PMID: 34982085 DOI: 10.1039/d1mo00416f] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
MELAS (mitochondrial encephalomyopathy, lactic acidosis, stroke-like episodes) is a progressive neurodegenerative disease caused by pathogenic mitochondrial DNA variants. The pathogenic mechanism of MELAS remains enigmatic due to the exceptional clinical heterogeneity and the obscure genotype-phenotype correlation among MELAS patients. To gain insights into the pathogenic signature of MELAS, we designed a comprehensive strategy integrating proteomics and metabolomics in patient-derived dermal fibroblasts harboring the ultra-rare MELAS pathogenic variant m.14453G>A, specifically affecting the mitochondrial respiratory complex I. Global proteomics was achieved by data-dependent acquisition (DDA) and verified by data-independent acquisition (DIA) using both Spectronaut and the recently launched MaxDIA platforms. Comprehensive metabolite coverage was achieved for both polar and nonpolar metabolites in both reverse phase and HILIC LC-MS/MS analyses. Our proof-of-principle MELAS study with multi-omics integration revealed OXPHOS dysregulation with a predominant deficiency of complex I subunits, as well as alterations in key bioenergetic pathways, glycolysis, tricarboxylic acid cycle, and fatty acid β-oxidation. The most clinically relevant discovery is the downregulation of the arginine biosynthesis pathway, likely due to blocked argininosuccinate synthase, which is congruent with the MELAS cardinal symptom of stroke-like episodes and its current treatment by arginine infusion. In conclusion, we demonstrated an integrated proteomic and metabolomic strategy for patient-derived fibroblasts, which has great clinical potential to discover therapeutic targets and design personalized interventions after validation with a larger patient cohort in the future.
Collapse
Affiliation(s)
- Haorong Li
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Martine Uittenbogaard
- Department of Anatomy and Cell Biology, George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA
| | - Ryan Navarro
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Mustafa Ahmed
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| | - Andrea Gropman
- Division of Neurogenetics and Neurodevelopmental Pediatrics, Children's National Medical Center, Washington, DC 20010, USA
| | - Anne Chiaramello
- Department of Anatomy and Cell Biology, George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA
| | - Ling Hao
- Department of Chemistry, The George Washington University, Science and Engineering Hall, 800 22nd St., NW, Washington, DC 20052, USA.
| |
Collapse
|
14
|
Diaz-Flores E, Meyer T, Giorkallos A. Evolution of Artificial Intelligence-Powered Technologies in Biomedical Research and Healthcare. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2022; 182:23-60. [DOI: 10.1007/10_2021_189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|