1
|
Liu W, Liu T, Si X, Liang J, Yan X, Zhang J, Pang B, Luo W, Liu J, Yang H, Shi P. Multi-omic characterization of air pollution effects: Applications of AirSigOmniTWP Hub. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2024; 284:116939. [PMID: 39191137 DOI: 10.1016/j.ecoenv.2024.116939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 08/19/2024] [Accepted: 08/23/2024] [Indexed: 08/29/2024]
Abstract
Air pollution, particularly fine particulate matter and gaseous pollutants including NO2 and NOx, presents significant public health challenges. While the harmful effects of these pollutants are well-documented, the molecular mechanisms underlying their impact on health remain incompletely understood. In this study, we utilized genome-wide association study (GWAS) data from the UK Biobank, expression quantitative trait loci (eQTL) data from the Genotype-Tissue Expression (GTEx) project, and protein quantitative trait loci (pQTL) data from the Atherosclerosis Risk in Communities (ARIC) study to conduct comprehensive analyses using the Unified Test for Molecular Signatures (UTMOST), Transcriptome-wide Association Studies (TWAS), and Proteome-wide Association Studies (PWAS). To integrate and synthesize these analyses, we developed the AirSigOmniTWP Hub, a specialized platform designed to consolidate and interpret the results from UTMOST, TWAS, and PWAS. TWAS analysis identified a significant association between PM10 exposure and the gene INO80E in females (P = 4.37×10⁻⁵, FDR = 0.0383), suggesting a potential role in chromatin remodeling. PWAS analysis revealed a significant association between NOx exposure and the gene PIP in females (P = 2.28×10⁻⁵, FDR = 0.0299), implicating its involvement in inflammatory pathways. Additionally, UTMOST analyses uncovered significant associations between various pollutants and genes including NCOA4P3 and SPATS2L with PM2.5 exposure, indicating potential mechanisms related to transcriptional regulation and gene-environment interactions.
Collapse
Affiliation(s)
- Wei Liu
- Department of Biomedical-Engineering, School of Intelligent Medicine, China Medical University, Shenyang 110122, PR China
| | - Tong Liu
- Department of Clinical Medicine, the Fourth Clinical Medical School, China Medical University, Shenyang 110122, PR China
| | - Xinxin Si
- Department of Clinical Medicine, the Fourth Clinical Medical School, China Medical University, Shenyang 110122, PR China
| | - Jiaxing Liang
- Department of Clinical Medicine, Shengjing Hospital, China Medical University, Shenyang 110122, PR China
| | - Xia Yan
- Department of Clinical Medicine, the First Clinical College, China Medical University, Shenyang 110122, PR China
| | - Juexin Zhang
- Department of Clinical Medicine, the Fourth Clinical Medical School, China Medical University, Shenyang 110122, PR China
| | - Bing Pang
- Department of Clinical Medicine, the Fourth Clinical Medical School, China Medical University, Shenyang 110122, PR China
| | - Wenmin Luo
- Department of Clinical Medicine, the Fourth Clinical Medical School, China Medical University, Shenyang 110122, PR China
| | - Junhong Liu
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang 110122, PR China
| | - Huazhe Yang
- Department of Biophysics, School of Intelligent Medicine, China Medical University, Shenyang 110122, PR China
| | - Peng Shi
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang 110122, PR China.
| |
Collapse
|
2
|
Abstract
Pathway enrichment analysis (PEA) is a computational biology method that identifies biological functions that are overrepresented in a group of genes more than would be expected by chance and ranks these functions by relevance. The relative abundance of genes pertinent to specific pathways is measured through statistical methods, and associated functional pathways are retrieved from online bioinformatics databases. In the last decade, along with the spread of the internet, higher availability of computational resources made PEA software tools easy to access and to use for bioinformatics practitioners worldwide. Although it became easier to use these tools, it also became easier to make mistakes that could generate inflated or misleading results, especially for beginners and inexperienced computational biologists. With this article, we propose nine quick tips to avoid common mistakes and to out a complete, sound, thorough PEA, which can produce relevant and robust results. We describe our nine guidelines in a simple way, so that they can be understood and used by anyone, including students and beginners. Some tips explain what to do before starting a PEA, others are suggestions of how to correctly generate meaningful results, and some final guidelines indicate some useful steps to properly interpret PEA results. Our nine tips can help users perform better pathway enrichment analyses and eventually contribute to a better understanding of current biology.
Collapse
|
3
|
Sun Q, Nematbakhsh A, Kuntala PK, Kellogg G, Pugh BF, Lai WKM. STENCIL: A web templating engine for visualizing and sharing life science datasets. PLoS Comput Biol 2022; 18:e1009859. [PMID: 35139076 PMCID: PMC8863220 DOI: 10.1371/journal.pcbi.1009859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 02/22/2022] [Accepted: 01/24/2022] [Indexed: 11/25/2022] Open
Abstract
The ability to aggregate experimental data analysis and results into a concise and interpretable format is a key step in evaluating the success of an experiment. This critical step determines baselines for reproducibility and is a key requirement for data dissemination. However, in practice it can be difficult to consolidate data analyses that encapsulates the broad range of datatypes available in the life sciences. We present STENCIL, a web templating engine designed to organize, visualize, and enable the sharing of interactive data visualizations. STENCIL leverages a flexible web framework for creating templates to render highly customizable visual front ends. This flexibility enables researchers to render small or large sets of experimental outcomes, producing high-quality downloadable and editable figures that retain their original relationship to the source data. REST API based back ends provide programmatic data access and supports easy data sharing. STENCIL is a lightweight tool that can stream data from Galaxy, a popular bioinformatic analysis web platform. STENCIL has been used to support the analysis and dissemination of two large scale genomic projects containing the complete data analysis for over 2,400 distinct datasets. Code and implementation details are available on GitHub: https://github.com/CEGRcode/stencil.
Collapse
Affiliation(s)
- Qi Sun
- Cornell Institute of Biotechnology, Cornell University, Ithaca, New York, United States of America
| | - Ali Nematbakhsh
- Cornell Institute of Biotechnology, Cornell University, Ithaca, New York, United States of America
| | - Prashant K. Kuntala
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Gretta Kellogg
- Cornell Institute of Biotechnology, Cornell University, Ithaca, New York, United States of America
| | - B. Franklin Pugh
- Department of Molecular Biology and Genetics, Cornell University, New York, United States of America
| | - William K. M. Lai
- Department of Molecular Biology and Genetics, Cornell University, New York, United States of America
- Department of Computational Biology, Cornell University, New York, United States of America
| |
Collapse
|
4
|
Marini F, Ludt A, Linke J, Strauch K. GeneTonic: an R/Bioconductor package for streamlining the interpretation of RNA-seq data. BMC Bioinformatics 2021; 22:610. [PMID: 34949163 PMCID: PMC8697502 DOI: 10.1186/s12859-021-04461-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 10/26/2021] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND The interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task, where the essential information is distributed among different tabular and list formats-normalized expression values, results from differential expression analysis, and results from functional enrichment analyses. A number of tools and databases are widely used for the purpose of identification of relevant functional patterns, yet often their contextualization within the data and results at hand is not straightforward, especially if these analytic components are not combined together efficiently. RESULTS We developed the GeneTonic software package, which serves as a comprehensive toolkit for streamlining the interpretation of functional enrichment analyses, by fully leveraging the information of expression values in a differential expression context. GeneTonic is implemented in R and Shiny, leveraging packages that enable HTML-based interactive visualizations for executing drilldown tasks seamlessly, viewing the data at a level of increased detail. GeneTonic is integrated with the core classes of existing Bioconductor workflows, and can accept the output of many widely used tools for pathway analysis, making this approach applicable to a wide range of use cases. Users can effectively navigate interlinked components (otherwise available as flat text or spreadsheet tables), bookmark features of interest during the exploration sessions, and obtain at the end a tailored HTML report, thus combining the benefits of both interactivity and reproducibility. CONCLUSION GeneTonic is distributed as an R package in the Bioconductor project ( https://bioconductor.org/packages/GeneTonic/ ) under the MIT license. Offering both bird's-eye views of the components of transcriptome data analysis and the detailed inspection of single genes, individual signatures, and their relationships, GeneTonic aims at simplifying the process of interpretation of complex and compelling RNA-seq datasets for many researchers with different expertise profiles.
Collapse
Affiliation(s)
- Federico Marini
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
| | - Annekathrin Ludt
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
| | - Jan Linke
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
- Center for Thrombosis and Hemostasis (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Langenbeckstr. 1, 55131 Mainz, Germany
| | - Konstantin Strauch
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg University Mainz, Obere Zahlbacher Str. 69, 55131 Mainz, Germany
| |
Collapse
|
5
|
Aichem M, Czauderna T, Zhu Y, Zhao J, Klapperstück M, Klein K, Li J, Schreiber F. Visual Exploration of Large Metabolic Models. Bioinformatics 2021; 37:4460-4468. [PMID: 33970212 DOI: 10.1093/bioinformatics/btab335] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 03/01/2021] [Accepted: 04/30/2021] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION Large metabolic models, including genome-scale metabolic models (GSMMs), are nowadays common in systems biology, biotechnology and pharmacology. They typically contain thousands of metabolites and reactions and therefore methods for their automatic visualisation and interactive exploration can facilitate a better understanding of these models. RESULTS We developed a novel method for the visual exploration of large metabolic models and implemented it in LMME (Large Metabolic Model Explorer), an add-on for the biological network analysis tool VANTED. The underlying idea of our method is to analyse a large model as follows. Starting from a decomposition into several subsystems, relationships between these subsystems are identified and an overview is computed and visualised. From this overview, detailed subviews may be constructed and visualised in order to explore subsystems and relationships in greater detail. Decompositions may either be predefined or computed, using built-in or self-implemented methods. Realised as add-on for VANTED, LMME is embedded in a domain-specific environment, allowing for further related analysis at any stage during the exploration. We describe the method, provide a use case, and discuss the strengths and weaknesses of different decomposition methods. AVAILABILITY The methods and algorithms presented here are implemented in LMME, an open-source add-on for VANTED. LMME can be downloaded from www.cls.uni-konstanz.de/software/lmme and VANTED can be downloaded from www.vanted.org. The source code of LMME is available from GitHub, at https://github.com/LSI-UniKonstanz/lmme.
Collapse
Affiliation(s)
- Michael Aichem
- Department of Computer and Information Science, University of Konstanz, Konstanz, Germany
| | - Tobias Czauderna
- Faculty of Information Technology, Monash University, Melbourne, Australia
| | - Yan Zhu
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, Australia
| | - Jinxin Zhao
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, Australia
| | | | - Karsten Klein
- Department of Computer and Information Science, University of Konstanz, Konstanz, Germany
| | - Jian Li
- Biomedicine Discovery Institute, Infection & Immunity Program and Department of Microbiology, Monash University, Melbourne, Australia
| | - Falk Schreiber
- Department of Computer and Information Science, University of Konstanz, Konstanz, Germany.,Faculty of Information Technology, Monash University, Melbourne, Australia
| |
Collapse
|