1
|
Wang H, Qiu Y, Guo H, Yin Y, Liu P. Information-incorporated gene network construction with FDR control. Bioinformatics 2024; 40:btae125. [PMID: 38430463 PMCID: PMC10937901 DOI: 10.1093/bioinformatics/btae125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 02/08/2024] [Accepted: 02/29/2024] [Indexed: 03/03/2024] Open
Abstract
MOTIVATION Large-scale gene expression studies allow gene network construction to uncover associations among genes. To study direct associations among genes, partial correlation-based networks are preferred over marginal correlations. However, FDR control for partial correlation-based network construction is not well-studied. In addition, currently available partial correlation-based methods cannot take existing biological knowledge to help network construction while controlling FDR. RESULTS In this paper, we propose a method called Partial Correlation Graph with Information Incorporation (PCGII). PCGII estimates partial correlations between each pair of genes by regularized node-wise regression that can incorporate prior knowledge while controlling the effects of all other genes. It handles high-dimensional data where the number of genes can be much larger than the sample size and controls FDR at the same time. We compare PCGII with several existing approaches through extensive simulation studies and demonstrate that PCGII has better FDR control and higher power. We apply PCGII to a plant gene expression dataset where it recovers confirmed regulatory relationships and a hub node, as well as several direct associations that shed light on potential functional relationships in the system. We also introduce a method to supplement observed data with a pseudogene to apply PCGII when no prior information is available, which also allows checking FDR control and power for real data analysis. AVAILABILITY AND IMPLEMENTATION R package is freely available for download at https://cran.r-project.org/package=PCGII.
Collapse
Affiliation(s)
- Hao Wang
- Department of Statistics, Iowa State University, Ames, IA 50010, United States
| | - Yumou Qiu
- Department of Statistics, Iowa State University, Ames, IA 50010, United States
| | - Hongqing Guo
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50010, United States
| | - Yanhai Yin
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50010, United States
| | - Peng Liu
- Department of Statistics, Iowa State University, Ames, IA 50010, United States
| |
Collapse
|
2
|
Fabbri L, Garlantézec R, Audouze K, Bustamante M, Carracedo Á, Chatzi L, Ramón González J, Gražulevičienė R, Keun H, Lau CHE, Sabidó E, Siskos AP, Slama R, Thomsen C, Wright J, Lun Yuan W, Casas M, Vrijheid M, Maitre L. Childhood exposure to non-persistent endocrine disrupting chemicals and multi-omic profiles: A panel study. ENVIRONMENT INTERNATIONAL 2023; 173:107856. [PMID: 36867994 DOI: 10.1016/j.envint.2023.107856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 02/23/2023] [Accepted: 02/23/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND Individuals are exposed to environmental pollutants with endocrine disrupting activity (endocrine disruptors, EDCs) and the early stages of life are particularly susceptible to these exposures. Previous studies have focused on identifying molecular signatures associated with EDCs, but none have used repeated sampling strategy and integrated multiple omics. We aimed to identify multi-omic signatures associated with childhood exposure to non-persistent EDCs. METHODS We used data from the HELIX Child Panel Study, which included 156 children aged 6 to 11. Children were followed for one week, in two time periods. Twenty-two non-persistent EDCs (10 phthalate, 7 phenol, and 5 organophosphate pesticide metabolites) were measured in two weekly pools of 15 urine samples each. Multi-omic profiles (methylome, serum and urinary metabolome, proteome) were measured in blood and in a pool urine samples. We developed visit-specific Gaussian Graphical Models based on pairwise partial correlations. The visit-specific networks were then merged to identify reproducible associations. Independent biological evidence was systematically sought to confirm some of these associations and assess their potential health implications. RESULTS 950 reproducible associations were found among which 23 were direct associations between EDCs and omics. For 9 of them, we were able to find corroborating evidence from previous literature: DEP - serotonin, OXBE - cg27466129, OXBE - dimethylamine, triclosan - leptin, triclosan - serotonin, MBzP - Neu5AC, MEHP - cg20080548, oh-MiNP - kynurenine, oxo-MiNP - 5-oxoproline. We used these associations to explore possible mechanisms between EDCs and health outcomes, and found links to health outcomes for 3 analytes: serotonin and kynurenine in relation to neuro-behavioural development, and leptin in relation to obesity and insulin resistance. CONCLUSIONS This multi-omics network analysis at two time points identified biologically relevant molecular signatures related to non-persistent EDC exposure in childhood, suggesting pathways related to neurological and metabolic outcomes.
Collapse
Affiliation(s)
- Lorenzo Fabbri
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Ronan Garlantézec
- Univ Rennes, CHU Rennes, Inserm, EHESP, Irset (Institut de recherche en santé environnement et travail), UMR_S 1085, Rennes, France
| | - Karine Audouze
- Université Paris Cité, T3S, INSERM UMR-S 1124, 45 rue des Saints Pères, Paris, France
| | - Mariona Bustamante
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain; CIBER Epidemiologa y Salud Pública (CIBERESP), Madrid, Spain
| | - Ángel Carracedo
- Medicine Genomics Group, Centro de Investigación Biomédica en Red Enfermedades Raras (CIBERER), University of Santiago de Compostela, CEGEN-PRB3, Santiago de Compostela, Spain; Galician Foundation of Genomic Medicine, Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Servicio Gallego de Salud (SERGAS), Santiago de Compostela, Spain
| | - Leda Chatzi
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, USA
| | - Juan Ramón González
- ISGlobal, Barcelona, Spain; CIBER Epidemiologa y Salud Pública (CIBERESP), Madrid, Spain; Department of Mathematics, Universitat Autònoma de Barcelona, Bellaterra, Spain
| | | | - Hector Keun
- Cancer Metabolism & Systems Toxicology Group, Division of Cancer, Department of Surgery and Cancer & Division of Systems Medicine, Department of Metabolism, Digestion & Reproduction, Imperial College London, Hammersmith Hospital Campus, London, UK
| | - Chung-Ho E Lau
- MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK; Division of Systems Medicine, Department of Metabolism, Digestion & Reproduction, Imperial College, South Kensington, London, UK
| | - Eduard Sabidó
- Universitat Pompeu Fabra (UPF), Barcelona, Spain; Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Alexandros P Siskos
- Cancer Metabolism & Systems Toxicology Group, Division of Cancer, Department of Surgery and Cancer & Division of Systems Medicine, Department of Metabolism, Digestion & Reproduction, Imperial College London, Hammersmith Hospital Campus, London, UK
| | - Rémy Slama
- Team of Environmental Epidemiology Applied to Reproduction and Respiratory Health, Institute for Advanced Biosciences (IAB), Inserm, CNRS, Université Grenoble Alpes, Grenoble, France
| | - Cathrine Thomsen
- Department of Environmental Health, Norwegian Institute of Public Health, Oslo, Norway
| | - John Wright
- Bradford Institute for Health Research, Bradford Teaching Hospitals NHS Foundation Trust, Bradford, UK
| | - Wen Lun Yuan
- Université de Paris, Centre for Research in Epidemiology and Statistics (CRESS), INSERM, INRAE, Paris, France; Singapore Institute for Clinical Sciences (SICS), Agency for Science, Technology, and Research (A*STAR), Singapore, Singapore
| | - Maribel Casas
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiologa y Salud Pública (CIBERESP), Madrid, Spain
| | - Martine Vrijheid
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiologa y Salud Pública (CIBERESP), Madrid, Spain
| | - Léa Maitre
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiologa y Salud Pública (CIBERESP), Madrid, Spain.
| |
Collapse
|
3
|
Shutta KH, Weighill D, Burkholz R, Guebila M, DeMeo DL, Zacharias HU, Quackenbush J, Altenbuchinger M. DRAGON: Determining Regulatory Associations using Graphical models on multi-Omic Networks. Nucleic Acids Res 2022; 51:e15. [PMID: 36533448 PMCID: PMC9943674 DOI: 10.1093/nar/gkac1157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 11/08/2022] [Accepted: 11/23/2022] [Indexed: 12/23/2022] Open
Abstract
The increasing quantity of multi-omic data, such as methylomic and transcriptomic profiles collected on the same specimen or even on the same cell, provides a unique opportunity to explore the complex interactions that define cell phenotype and govern cellular responses to perturbations. We propose a network approach based on Gaussian Graphical Models (GGMs) that facilitates the joint analysis of paired omics data. This method, called DRAGON (Determining Regulatory Associations using Graphical models on multi-Omic Networks), calibrates its parameters to achieve an optimal trade-off between the network's complexity and estimation accuracy, while explicitly accounting for the characteristics of each of the assessed omics 'layers.' In simulation studies, we show that DRAGON adapts to edge density and feature size differences between omics layers, improving model inference and edge recovery compared to state-of-the-art methods. We further demonstrate in an analysis of joint transcriptome - methylome data from TCGA breast cancer specimens that DRAGON can identify key molecular mechanisms such as gene regulation via promoter methylation. In particular, we identify Transcription Factor AP-2 Beta (TFAP2B) as a potential multi-omic biomarker for basal-type breast cancer. DRAGON is available as open-source code in Python through the Network Zoo package (netZooPy v0.8; netzoo.github.io).
Collapse
Affiliation(s)
| | | | - Rebekka Burkholz
- CISPA Helmholtz Center for Information Security, Saarbrücken, Germany
| | - Marouen Ben Guebila
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Brigham and Women’s Hospital, and Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Helena U Zacharias
- Department of Internal Medicine I, University Medical Center Schleswig-Holstein, Campus Kiel, Kiel, Germany,Institute of Clinical Molecular Biology, Kiel University and University Medical Center Schleswig-Holstein, Campus Kiel, Kiel, Germany,Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Hannover, Germany
| | | | - Michael Altenbuchinger
- To whom correspondence should be addressed. Tel: +49 551 39 61788; Fax: +49 551 39 61783;
| |
Collapse
|
4
|
Bernal V, Soancatl-Aguilar V, Bulthuis J, Guryev V, Horvatovich P, Grzegorczyk M. GeneNetTools: tests for Gaussian graphical models with shrinkage. Bioinformatics 2022; 38:5049-5054. [PMID: 36179082 PMCID: PMC9665865 DOI: 10.1093/bioinformatics/btac657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 09/14/2022] [Accepted: 09/29/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Gaussian graphical models (GGMs) are network representations of random variables (as nodes) and their partial correlations (as edges). GGMs overcome the challenges of high-dimensional data analysis by using shrinkage methodologies. Therefore, they have become useful to reconstruct gene regulatory networks from gene-expression profiles. However, it is often ignored that the partial correlations are 'shrunk' and that they cannot be compared/assessed directly. Therefore, accurate (differential) network analyses need to account for the number of variables, the sample size, and also the shrinkage value, otherwise, the analysis and its biological interpretation would turn biased. To date, there are no appropriate methods to account for these factors and address these issues. RESULTS We derive the statistical properties of the partial correlation obtained with the Ledoit-Wolf shrinkage. Our result provides a toolbox for (differential) network analyses as (i) confidence intervals, (ii) a test for zero partial correlation (null-effects) and (iii) a test to compare partial correlations. Our novel (parametric) methods account for the number of variables, the sample size and the shrinkage values. Additionally, they are computationally fast, simple to implement and require only basic statistical knowledge. Our simulations show that the novel tests perform better than DiffNetFDR-a recently published alternative-in terms of the trade-off between true and false positives. The methods are demonstrated on synthetic data and two gene-expression datasets from Escherichia coli and Mus musculus. AVAILABILITY AND IMPLEMENTATION The R package with the methods and the R script with the analysis are available in https://github.com/V-Bernal/GeneNetTools. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Victor Bernal
- Center of Information Technology, University of Groningen, Groningen 9747 AJ, The Netherlands,Department of Mathematics, Bernoulli Institute, University of Groningen, Groningen 9747 AG, The Netherlands
| | | | - Jonas Bulthuis
- Center of Information Technology, University of Groningen, Groningen 9747 AJ, The Netherlands
| | - Victor Guryev
- European Research Institute for the Biology of Ageing, University Medical Center Groningen, University of Groningen, Groningen 9713 AV, The Netherlands
| | | | | |
Collapse
|
5
|
Bernal V, Bischoff R, Horvatovich P, Guryev V, Grzegorczyk M. The 'un-shrunk' partial correlation in Gaussian graphical models. BMC Bioinformatics 2021; 22:424. [PMID: 34493207 PMCID: PMC8424921 DOI: 10.1186/s12859-021-04313-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Accepted: 08/02/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes ('high dimensional problem'). Shrinkage methods address this issue by learning a regularized GGM. However, it remains open to study how the shrinkage affects the final result and its interpretation. RESULTS We show that the shrinkage biases the partial correlation in a non-linear way. This bias does not only change the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as 'un-shrinking' the partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. This is demonstrated on two gene expression datasets from Escherichia coli and Mus musculus. CONCLUSIONS GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the 'high-dimensional problem'. Besides it advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effects caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.
Collapse
Affiliation(s)
- Victor Bernal
- Bernoulli Institute, University of Groningen, Groningen, 9747 AG, The Netherlands.,Department of Analytical Biochemistry, Groningen Research Institute of Pharmacy, University of Groningen, Groningen, 9713 AV, The Netherlands
| | - Rainer Bischoff
- Department of Analytical Biochemistry, Groningen Research Institute of Pharmacy, University of Groningen, Groningen, 9713 AV, The Netherlands
| | - Peter Horvatovich
- Department of Analytical Biochemistry, Groningen Research Institute of Pharmacy, University of Groningen, Groningen, 9713 AV, The Netherlands.
| | - Victor Guryev
- European Research Institute for the Biology of Ageing, University Medical Center Groningen, University of Groningen, Groningen, 9713 AV, The Netherlands.
| | - Marco Grzegorczyk
- Bernoulli Institute, University of Groningen, Groningen, 9747 AG, The Netherlands.
| |
Collapse
|
6
|
Tu JJ, Ou-Yang L, Yan H, Zhang XF, Qin H. Joint reconstruction of multiple gene networks by simultaneously capturing inter-tumor and intra-tumor heterogeneity. Bioinformatics 2020; 36:2755-2762. [PMID: 31971577 DOI: 10.1093/bioinformatics/btaa014] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 12/22/2019] [Accepted: 01/18/2020] [Indexed: 12/27/2022] Open
Abstract
MOTIVATION Reconstruction of cancer gene networks from gene expression data is important for understanding the mechanisms underlying human cancer. Due to heterogeneity, the tumor tissue samples for a single cancer type can be divided into multiple distinct subtypes (inter-tumor heterogeneity) and are composed of non-cancerous and cancerous cells (intra-tumor heterogeneity). If tumor heterogeneity is ignored when inferring gene networks, the edges specific to individual cancer subtypes and cell types cannot be characterized. However, most existing network reconstruction methods do not simultaneously take inter-tumor and intra-tumor heterogeneity into account. RESULTS In this article, we propose a new Gaussian graphical model-based method for jointly estimating multiple cancer gene networks by simultaneously capturing inter-tumor and intra-tumor heterogeneity. Given gene expression data of heterogeneous samples for different cancer subtypes, a non-cancerous network shared across different cancer subtypes and multiple subtype-specific cancerous networks are estimated jointly. Tumor heterogeneity can be revealed by the difference in the estimated networks. The performance of our method is first evaluated using simulated data, and the results indicate that our method outperforms other state-of-the-art methods. We also apply our method to The Cancer Genome Atlas breast cancer data to reconstruct non-cancerous and subtype-specific cancerous gene networks. Hub nodes in the networks estimated by our method perform important biological functions associated with breast cancer development and subtype classification. AVAILABILITY AND IMPLEMENTATION The source code is available at https://github.com/Zhangxf-ccnu/NETI2. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jia-Juan Tu
- Department of Statistics, Hubei Key Laboratory of Mathematical Sciences, School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China
| | - Le Ou-Yang
- College of Electronics and Information Engineering, Shenzhen University, Shenzhen 518060, China
| | - Hong Yan
- Department of Electrical Engineering, City University of Hong Kong, Hong Kong 999077, China
| | - Xiao-Fei Zhang
- Department of Statistics, Hubei Key Laboratory of Mathematical Sciences, School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China.,Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, Shanghai 200433, China
| | - Hong Qin
- Department of Statistics, Hubei Key Laboratory of Mathematical Sciences, School of Mathematics and Statistics, Central China Normal University, Wuhan 430079, China.,Department of Statistics, Zhongnan University of Economics and Law, Wuhan 430073, China
| |
Collapse
|
7
|
Blencowe M, Karunanayake T, Wier J, Hsu N, Yang X. Network Modeling Approaches and Applications to Unravelling Non-Alcoholic Fatty Liver Disease. Genes (Basel) 2019; 10:E966. [PMID: 31771247 PMCID: PMC6947017 DOI: 10.3390/genes10120966] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Revised: 11/18/2019] [Accepted: 11/22/2019] [Indexed: 12/12/2022] Open
Abstract
Non-alcoholic fatty liver disease (NAFLD) is a progressive condition of the liver encompassing a range of pathologies including steatosis, non-alcoholic steatohepatitis (NASH), cirrhosis, and hepatocellular carcinoma. Research into this disease is imperative due to its rapid growth in prevalence, economic burden, and current lack of FDA approved therapies. NAFLD involves a highly complex etiology that calls for multi-tissue multi-omics network approaches to uncover the pathogenic genes and processes, diagnostic biomarkers, and potential therapeutic strategies. In this review, we first present a basic overview of disease pathogenesis, risk factors, and remaining knowledge gaps, followed by discussions of the need and concepts of multi-tissue multi-omics approaches, various network methodologies and application examples in NAFLD research. We highlight the findings that have been uncovered thus far including novel biomarkers, genes, and biological pathways involved in different stages of NAFLD, molecular connections between NAFLD and its comorbidities, mechanisms underpinning sex differences, and druggable targets. Lastly, we outline the future directions of implementing network approaches to further improve our understanding of NAFLD in order to guide diagnosis and therapeutics.
Collapse
Affiliation(s)
- Montgomery Blencowe
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA; (M.B.); (T.K.); (J.W.); (N.H.)
- Molecular, Cellular, and Integrative Physiology Interdepartmental Program, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA
| | - Tilan Karunanayake
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA; (M.B.); (T.K.); (J.W.); (N.H.)
| | - Julian Wier
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA; (M.B.); (T.K.); (J.W.); (N.H.)
| | - Neil Hsu
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA; (M.B.); (T.K.); (J.W.); (N.H.)
| | - Xia Yang
- Department of Integrative Biology and Physiology, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA; (M.B.); (T.K.); (J.W.); (N.H.)
- Molecular, Cellular, and Integrative Physiology Interdepartmental Program, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA
- Interdepartmental Program of Bioinformatics, University of California, Los Angeles, 610 Charles E. Young Drive East, Los Angeles, CA 90095, USA
| |
Collapse
|
8
|
Imkamp K, Bernal V, Grzegorzcyk M, Horvatovich P, Vermeulen CJ, Heijink IH, Guryev V, Kerstjens HAM, van den Berge M, Faiz A. Gene network approach reveals co-expression patterns in nasal and bronchial epithelium. Sci Rep 2019; 9:15835. [PMID: 31676779 PMCID: PMC6825243 DOI: 10.1038/s41598-019-50963-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Accepted: 09/13/2019] [Indexed: 12/20/2022] Open
Abstract
Nasal gene expression profiling is a new approach to investigate the airway epithelium as a biomarker to study the activity and treatment responses of obstructive pulmonary diseases. We investigated to what extent gene expression profiling of nasal brushings is similar to that of bronchial brushings. We performed genome wide gene expression profiling on matched nasal and bronchial epithelial brushes from 77 respiratory healthy individuals. To investigate differences and similarities among regulatory modules, network analysis was performed on correlated, differentially expressed and smoking-related genes using Gaussian Graphical Models. Between nasal and bronchial brushes, 619 genes were correlated and 1692 genes were differentially expressed (false discovery rate <0.05, |Fold-change|>2). Network analysis of correlated genes showed pro-inflammatory pathways to be similar between the two locations. Focusing on smoking-related genes, cytochrome-P450 pathway related genes were found to be similar, supporting the concept of a detoxifying response to tobacco exposure throughout the airways. In contrast, cilia-related pathways were decreased in nasal compared to bronchial brushes when focusing on differentially expressed genes. Collectively, while there are substantial differences in gene expression between nasal and bronchial brushes, we also found similarities, especially in the response to the external factors such as smoking.
Collapse
Affiliation(s)
- Kai Imkamp
- University of Groningen, University Medical Center Groningen, Department of Pulmonology, Groningen, The Netherlands. .,University of Groningen, University Medical Center Groningen, GRIAC (Groningen Research Institute for Asthma and COPD), Groningen, The Netherlands.
| | - Victor Bernal
- University of Groningen, Bernoulli Institute (JBI), Groningen, The Netherlands.,University of Groningen, Department of Pharmacy, Analytical Biochemistry, Groningen, The Netherlands
| | - Marco Grzegorzcyk
- University of Groningen, Bernoulli Institute (JBI), Groningen, The Netherlands
| | - Peter Horvatovich
- University of Groningen, Department of Pharmacy, Analytical Biochemistry, Groningen, The Netherlands
| | - Cornelis J Vermeulen
- University of Groningen, University Medical Center Groningen, Department of Pulmonology, Groningen, The Netherlands.,University of Groningen, University Medical Center Groningen, GRIAC (Groningen Research Institute for Asthma and COPD), Groningen, The Netherlands
| | - Irene H Heijink
- University of Groningen, University Medical Center Groningen, Department of Pulmonology, Groningen, The Netherlands.,University of Groningen, University Medical Center Groningen, GRIAC (Groningen Research Institute for Asthma and COPD), Groningen, The Netherlands.,University of Groningen, University Medical Center Groningen, Department of Pathology & Medical Biology, section Medical Biology, Groningen, The Netherlands
| | - Victor Guryev
- University of Groningen, University Medical Center Groningen, GRIAC (Groningen Research Institute for Asthma and COPD), Groningen, The Netherlands.,European Research Institute for the Biology of Ageing, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Huib A M Kerstjens
- University of Groningen, University Medical Center Groningen, Department of Pulmonology, Groningen, The Netherlands.,University of Groningen, University Medical Center Groningen, GRIAC (Groningen Research Institute for Asthma and COPD), Groningen, The Netherlands
| | - Maarten van den Berge
- University of Groningen, University Medical Center Groningen, Department of Pulmonology, Groningen, The Netherlands.,University of Groningen, University Medical Center Groningen, GRIAC (Groningen Research Institute for Asthma and COPD), Groningen, The Netherlands
| | - Alen Faiz
- University of Groningen, University Medical Center Groningen, Department of Pulmonology, Groningen, The Netherlands.,University of Groningen, University Medical Center Groningen, GRIAC (Groningen Research Institute for Asthma and COPD), Groningen, The Netherlands.,University of Groningen, University Medical Center Groningen, Department of Pathology & Medical Biology, section Medical Biology, Groningen, The Netherlands.,University of Technology Sydney, Respiratory Bioinformatics and Molecular Biology (RBMB), School of life sciences, Sydney, Australia.,Woolcock Emphysema Centre, Woolcock Institute of Medical Research, University of Sydney, Sydney, NSW, Australia
| |
Collapse
|