1
|
Chen L, Li Z, Wu H. CeDAR: incorporating cell type hierarchy improves cell type-specific differential analyses in bulk omics data. Genome Biol 2023; 24:37. [PMID: 36855165 PMCID: PMC9972684 DOI: 10.1186/s13059-023-02857-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 01/17/2023] [Indexed: 03/02/2023] Open
Abstract
Bulk high-throughput omics data contain signals from a mixture of cell types. Recent developments of deconvolution methods facilitate cell type-specific inferences from bulk data. Our real data exploration suggests that differential expression or methylation status is often correlated among cell types. Based on this observation, we develop a novel statistical method named CeDAR to incorporate the cell type hierarchy in cell type-specific differential analyses of bulk data. Extensive simulation and real data analyses demonstrate that this approach significantly improves the accuracy and power in detecting cell type-specific differential signals compared with existing methods, especially in low-abundance cell types.
Collapse
Affiliation(s)
- Luxiao Chen
- Department of Biostatistics and Bioinformatics, Emory University, GA 30322 Atlanta, USA
| | - Ziyi Li
- Department of Biostatistics, The University of MD Anderson Cancer Center, 77030 Houston, TX, USA
| | - Hao Wu
- Faculty of Computer Science and Control Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, 1068 Xueyuan Avenue, Shenzhen University Town, Shenzhen, 518055 P.R. China
| |
Collapse
|
2
|
Haftorn KL, Denault WRP, Lee Y, Page CM, Romanowska J, Lyle R, Næss ØE, Kristjansson D, Magnus PM, Håberg SE, Bohlin J, Jugessur A. Nucleated red blood cells explain most of the association between DNA methylation and gestational age. Commun Biol 2023; 6:224. [PMID: 36849614 PMCID: PMC9971030 DOI: 10.1038/s42003-023-04584-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 02/13/2023] [Indexed: 03/01/2023] Open
Abstract
Determining if specific cell type(s) are responsible for an association between DNA methylation (DNAm) and a given phenotype is important for understanding the biological mechanisms underlying the association. Our EWAS of gestational age (GA) in 953 newborns from the Norwegian MoBa study identified 13,660 CpGs significantly associated with GA (pBonferroni<0.05) after adjustment for cell type composition. When the CellDMC algorithm was applied to explore cell-type specific effects, 2,330 CpGs were significantly associated with GA, mostly in nucleated red blood cells [nRBCs; n = 2,030 (87%)]. Similar patterns were found in another dataset based on a different array and when applying an alternative algorithm to CellDMC called Tensor Composition Analysis (TCA). Our findings point to nRBCs as the main cell type driving the DNAm-GA association, implicating an epigenetic signature of erythropoiesis as a likely mechanism. They also explain the poor correlation observed between epigenetic age clocks for newborns and those for adults.
Collapse
Affiliation(s)
- Kristine L Haftorn
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway.
- Institute of Health and Society, University of Oslo, Oslo, Norway.
| | - William R P Denault
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Human Genetics, University of Chicago, Chicago, IL, 60637, USA
| | - Yunsung Lee
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Christian M Page
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Physical Health and Ageing, Division of Mental and Physical Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Julia Romanowska
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Global Public Health and Primary Care, , University of Bergen, Bergen, Norway
| | - Robert Lyle
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Medical Genetics, Oslo University Hospital and University of Oslo, Oslo, Norway
| | - Øyvind E Næss
- Institute of Health and Society, University of Oslo, Oslo, Norway
- Division of Mental and Physical Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Dana Kristjansson
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Genetics and Bioinformatics, Norwegian Institute of Public Health, Oslo, Norway
| | - Per M Magnus
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Siri E Håberg
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
| | - Jon Bohlin
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Division for Infection Control and Environmental Health, Department of Infectious Disease Epidemiology and Modelling, Norwegian Institute of Public Health, Oslo, Norway
| | - Astanand Jugessur
- Centre for Fertility and Health, Norwegian Institute of Public Health, Oslo, Norway
- Department of Global Public Health and Primary Care, , University of Bergen, Bergen, Norway
| |
Collapse
|
3
|
Song X, Ji J, Rothstein JH, Alexeeff SE, Sakoda LC, Sistig A, Achacoso N, Jorgenson E, Whittemore AS, Klein RJ, Habel LA, Wang P, Sieh W. MiXcan: a framework for cell-type-aware transcriptome-wide association studies with an application to breast cancer. Nat Commun 2023; 14:377. [PMID: 36690614 PMCID: PMC9871010 DOI: 10.1038/s41467-023-35888-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 01/05/2023] [Indexed: 01/25/2023] Open
Abstract
Human bulk tissue samples comprise multiple cell types with diverse roles in disease etiology. Conventional transcriptome-wide association study approaches predict genetically regulated gene expression at the tissue level, without considering cell-type heterogeneity, and test associations of predicted tissue-level expression with disease. Here we develop MiXcan, a cell-type-aware transcriptome-wide association study approach that predicts cell-type-level expression, identifies disease-associated genes via combination of cell-type-level association signals for multiple cell types, and provides insight into the disease-critical cell type. As a proof of concept, we conducted cell-type-aware analyses of breast cancer in 58,648 women and identified 12 transcriptome-wide significant genes using MiXcan compared with only eight genes using conventional approaches. Importantly, MiXcan identified genes with distinct associations in mammary epithelial versus stromal cells, including three new breast cancer susceptibility genes. These findings demonstrate that cell-type-aware transcriptome-wide analyses can reveal new insights into the genetic and cellular etiology of breast cancer and other diseases.
Collapse
Affiliation(s)
- Xiaoyu Song
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| | - Jiayi Ji
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Joseph H Rothstein
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Stacey E Alexeeff
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Lori C Sakoda
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Adriana Sistig
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ninah Achacoso
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Eric Jorgenson
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Alice S Whittemore
- Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
| | - Robert J Klein
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Laurel A Habel
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Pei Wang
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| | - Weiva Sieh
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
4
|
Meng G, Tang W, Huang E, Li Z, Feng H. A comprehensive assessment of cell type-specific differential expression methods in bulk data. Brief Bioinform 2023; 24:bbac516. [PMID: 36472568 PMCID: PMC9851321 DOI: 10.1093/bib/bbac516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 10/08/2022] [Accepted: 10/29/2022] [Indexed: 12/12/2022] Open
Abstract
Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. Although several computational methods have been developed to identify cell type-specific differentially expressed genes (csDEG) from RNA-seq data, a systematic evaluation is yet to be performed. Here, we thoroughly benchmark six recently published methods: CellDMC, CARseq, TOAST, LRCDE, CeDAR and TCA, together with two classical methods, csSAM and DESeq2, for a comprehensive comparison. We aim to systematically evaluate the performance of popular csDEG detection methods and provide guidance to researchers. In simulation studies, we benchmark available methods under various scenarios of baseline expression levels, sample sizes, cell type compositions, expression level alterations, technical noises and biological dispersions. Real data analyses of three large datasets on inflammatory bowel disease, lung cancer and autism provide evaluation in both the gene level and the pathway level. We find that csDEG calling is strongly affected by effect size, baseline expression level and cell type compositions. Results imply that csDEG discovery is a challenging task itself, with room to improvements on handling low signal-to-noise ratio and low expression genes.
Collapse
Affiliation(s)
- Guanqun Meng
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, 44106, Ohio, USA
| | - Wen Tang
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, 44106, Ohio, USA
| | - Emina Huang
- Department of Surgery, The University of Texas Southwestern Medical Center, Dallas, 75390, Texas, USA
| | - Ziyi Li
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, 77030, Texas, USA
| | - Hao Feng
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, 44106, Ohio, USA
| |
Collapse
|
5
|
Shi C, Zhu J, Shen Y, Luo S, Zhu H, Song R. Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2110876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
Affiliation(s)
| | | | - Ye Shen
- North Carolina State University
| | | | - Hongtu Zhu
- University of North Carolina at Chapel Hill
| | | |
Collapse
|
6
|
Yu J, Luo X. Identification of cell-type-specific spatially variable genes accounting for excess zeros. Bioinformatics 2022; 38:4135-4144. [PMID: 35792822 PMCID: PMC9438960 DOI: 10.1093/bioinformatics/btac457] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 05/27/2022] [Accepted: 07/05/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Spatial transcriptomic techniques can profile gene expressions while retaining the spatial information, thus offering unprecedented opportunities to explore the relationship between gene expression and spatial locations. The spatial relationship may vary across cell types, but there is a lack of statistical methods to identify cell-type-specific spatially variable (SV) genes by simultaneously modeling excess zeros and cell-type proportions. RESULTS We develop a statistical approach CTSV to detect cell-type-specific SV genes. CTSV directly models spatial raw count data and considers zero-inflation as well as overdispersion using a zero-inflated negative binomial distribution. It then incorporates cell-type proportions and spatial effect functions in the zero-inflated negative binomial regression framework. The R package pscl is employed to fit the model. For robustness, a Cauchy combination rule is applied to integrate P-values from multiple choices of spatial effect functions. Simulation studies show that CTSV not only outperforms competing methods at the aggregated level but also achieves more power at the cell-type level. By analyzing pancreatic ductal adenocarcinoma spatial transcriptomic data, SV genes identified by CTSV reveal biological insights at the cell-type level. AVAILABILITY AND IMPLEMENTATION The R package of CTSV is available at https://bioconductor.org/packages/devel/bioc/html/CTSV.html. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jinge Yu
- Institute of Statistics and Big Data, Renmin University of China, Beijing 100872, China
| | | |
Collapse
|
7
|
Qi L, Teschendorff AE. Cell-type heterogeneity: Why we should adjust for it in epigenome and biomarker studies. Clin Epigenetics 2022; 14:31. [PMID: 35227298 PMCID: PMC8887190 DOI: 10.1186/s13148-022-01253-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 02/21/2022] [Indexed: 12/18/2022] Open
Abstract
Most studies aiming to identify epigenetic biomarkers do so from complex tissues that are composed of many different cell-types. By definition, these cell-types vary substantially in terms of their epigenetic profiles. This cell-type specific variation among healthy cells is completely independent of the variation associated with disease, yet it dominates the epigenetic variability landscape. While cell-type composition of tissues can change in disease and this may provide accurate and reproducible biomarkers, not adjusting for the underlying cell-type heterogeneity may seriously limit the sensitivity and precision to detect disease-relevant biomarkers or hamper our understanding of such biomarkers. Given that computational and experimental tools for tackling cell-type heterogeneity are available, we here stress that future epigenetic biomarker studies should aim to provide estimates of underlying cell-type fractions for all samples in the study, and to identify biomarkers before and after adjustment for cell-type heterogeneity, in order to obtain a more complete and unbiased picture of the biomarker-landscape. This is critical, not only to improve reproducibility and for the eventual clinical application of such biomarkers, but importantly, to also improve our molecular understanding of disease itself.
Collapse
Affiliation(s)
- Luo Qi
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Andrew E Teschendorff
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China. .,UCL Cancer Institute, University College London, London, WC1E 8BT, UK.
| |
Collapse
|
8
|
Rahmani E, Jew B, Halperin E. The Effect of Model Directionality on Cell-Type-Specific Differential DNA Methylation Analysis. FRONTIERS IN BIOINFORMATICS 2022; 1:792605. [PMID: 36303752 PMCID: PMC9580934 DOI: 10.3389/fbinf.2021.792605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2021] [Accepted: 12/21/2021] [Indexed: 11/29/2022] Open
Abstract
Calling differential methylation at a cell-type level from tissue-level bulk data is a fundamental challenge in genomics that has recently received more attention. These studies most often aim at identifying statistical associations rather than causal effects. However, existing methods typically make an implicit assumption about the direction of effects, and thus far, little to no attention has been given to the fact that this directionality assumption may not hold and can consequently affect statistical power and control for false positives. We demonstrate that misspecification of the model directionality can lead to a drastic decrease in performance and increase in risk of spurious findings in cell-type-specific differential methylation analysis, and we discuss the need to carefully consider model directionality before choosing a statistical method for analysis.
Collapse
Affiliation(s)
- Elior Rahmani
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, United States
| | - Brandon Jew
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, United States
| | - Eran Halperin
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, United States
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, United States
- Department of Anesthesiology and Perioperative Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA, United States
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, United States
- *Correspondence: Eran Halperin,
| |
Collapse
|
9
|
Luo X, Schwartz J, Baccarelli A, Liu Z. Testing cell-type-specific mediation effects in genome-wide epigenetic studies. Brief Bioinform 2021; 22:bbaa131. [PMID: 32632436 PMCID: PMC8138838 DOI: 10.1093/bib/bbaa131] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Revised: 05/19/2020] [Accepted: 05/27/2020] [Indexed: 12/12/2022] Open
Abstract
Epigenome-wide mediation analysis aims to identify DNA methylation CpG sites that mediate the causal effects of genetic/environmental exposures on health outcomes. However, DNA methylations in the peripheral blood tissues are usually measured at the bulk level based on a heterogeneous population of white blood cells. Using the bulk level DNA methylation data in mediation analysis might cause confounding bias and reduce study power. Therefore, it is crucial to get fine-grained results by detecting mediation CpG sites in a cell-type-specific way. However, there is a lack of methods and software to achieve this goal. We propose a novel method (Mediation In a Cell-type-Specific fashion, MICS) to identify cell-type-specific mediation effects in genome-wide epigenetic studies using only the bulk-level DNA methylation data. MICS follows the standard mediation analysis paradigm and consists of three key steps. In step1, we assess the exposure-mediator association for each cell type; in step 2, we assess the mediator-outcome association for each cell type; in step 3, we combine the cell-type-specific exposure-mediator and mediator-outcome associations using a multiple testing procedure named MultiMed [Sampson JN, Boca SM, Moore SC, et al. FWER and FDR control when testing multiple mediators. Bioinformatics 2018;34:2418-24] to identify significant CpGs with cell-type-specific mediation effects. We conduct simulation studies to demonstrate that our method has correct FDR control. We also apply the MICS procedure to the Normative Aging Study and identify nine DNA methylation CpG sites in the lymphocytes that might mediate the effect of cigarette smoking on the lung function.
Collapse
Affiliation(s)
- Xiangyu Luo
- Institute of Statistics and Big Data, Renmin University of China, Beijing, China
| | - Joel Schwartz
- Department of Environmental Health, Harvard University, Boston, MA, USA
| | - Andrea Baccarelli
- Leon Hess Professor in the Department of Environmental Health Sciences, Columbia University, New York City, NY, USA
| | - Zhonghua Liu
- Department of Statistics and Actuarial Science, University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
10
|
Jin C, Chen M, Lin D, Sun W. Cell type-aware analysis of RNA-seq data. NATURE COMPUTATIONAL SCIENCE 2021; 1:253-261. [PMID: 34957416 PMCID: PMC8697413 DOI: 10.1038/s43588-021-00055-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 03/10/2021] [Indexed: 12/13/2022]
Abstract
Most tissue samples are composed of different cell types. Differential expression analysis without accounting for cell type composition cannot separate the changes due to cell type composition or cell type-specific expression. We propose a computational framework to address these limitations: Cell Type Aware analysis of RNA-seq (CARseq). CARseq employs a negative binomial distribution that appropriately models the count data from RNA-seq experiments. Simulation studies show that CARseq has substantially higher power than a linear model-based approach and it also provides more accurate estimate of the rankings of differentially expressed genes. We have applied CARseq to compare gene expression of schizophrenia/autism subjects versus controls, and identified the cell types underlying the difference and similarities of these two neuron-developmental diseases. Our results are consistent with the results from differential expression analysis using single cell RNA-seq data.
Collapse
Affiliation(s)
- Chong Jin
- Department of Biostatistics, University of North Carolina at Chapel Hill
| | | | - Danyu Lin
- Department of Biostatistics, University of North Carolina at Chapel Hill
| | - Wei Sun
- Department of Biostatistics, University of North Carolina at Chapel Hill
- Public Health Science Division, Fred Hutchinson Cancer Research Center
- Department of Biostatistics, University of Washington
| |
Collapse
|
11
|
You C, Wu S, Zheng SC, Zhu T, Jing H, Flagg K, Wang G, Jin L, Wang S, Teschendorff AE. A cell-type deconvolution meta-analysis of whole blood EWAS reveals lineage-specific smoking-associated DNA methylation changes. Nat Commun 2020; 11:4779. [PMID: 32963246 PMCID: PMC7508850 DOI: 10.1038/s41467-020-18618-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Accepted: 08/31/2020] [Indexed: 02/06/2023] Open
Abstract
Highly reproducible smoking-associated DNA methylation changes in whole blood have been reported by many Epigenome-Wide-Association Studies (EWAS). These epigenetic alterations could have important implications for understanding and predicting the risk of smoking-related diseases. To this end, it is important to establish if these DNA methylation changes happen in all blood cell subtypes or if they are cell-type specific. Here, we apply a cell-type deconvolution algorithm to identify cell-type specific DNA methylation signals in seven large EWAS. We find that most of the highly reproducible smoking-associated hypomethylation signatures are more prominent in the myeloid lineage. A meta-analysis further identifies a myeloid-specific smoking-associated hypermethylation signature enriched for DNase Hypersensitive Sites in acute myeloid leukemia. These results may guide the design of future smoking EWAS and have important implications for our understanding of how smoking affects immune-cell subtypes and how this may influence the risk of smoking related diseases. Smoking-associated DNA methylation changes in whole blood have been reported by many EWAS. Here, the authors use a cell-type deconvolution algorithm to identify cell-type specific DNA methylation signals in seven EWAS, identifying lineage-specific smoking-associated DNA methylation changes.
Collapse
Affiliation(s)
- Chenglong You
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
| | - Sijie Wu
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China.,Human Phenome Institute, Fudan University, 825 Zhangheng Road, Shanghai, China.,State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Shijie C Zheng
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China.,Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Tianyu Zhu
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
| | - Han Jing
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
| | - Ken Flagg
- Guangzhou Regenerative Medicine Guangdong Laboratory, Guangzhou, China
| | - Guangyu Wang
- Department of Computer Science and Technology, Tsinghua University, Beijing, China
| | - Li Jin
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China.,Human Phenome Institute, Fudan University, 825 Zhangheng Road, Shanghai, China.,State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Sijia Wang
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
| | - Andrew E Teschendorff
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China. .,UCL Cancer Institute, Paul O'Gorman Building, University College London, 72 Huntley Street, London, WC1E 6BT, UK.
| |
Collapse
|
12
|
Hanson HE, Koussayer B, Kilvitis HJ, Schrey AW, Maddox JD, Martin LB. Epigenetic Potential in Native and Introduced Populations of House Sparrows (Passer domesticus). Integr Comp Biol 2020; 60:1458-1468. [PMID: 32497186 DOI: 10.1093/icb/icaa060] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Epigenetic potential, defined as the capacity for epigenetically-mediated phenotypic plasticity, may play an important role during range expansions. During range expansions, populations may encounter relatively novel challenges while experiencing lower genetic diversity. Phenotypic plasticity via epigenetic potential might be selectively advantageous at the time of initial introduction or during spread into new areas, enabling introduced organisms to cope rapidly with novel challenges. Here, we asked whether one form of epigenetic potential (i.e., the abundance of CpG sites) in three microbial surveillance genes: Toll-like receptors (TLRs) 1B (TLR1B), 2A (TLR2A), and 4 (TLR4) varied between native and introduced house sparrows (Passer domesticus). Using an opportunistic approach based on samples collected from sparrow populations around the world, we found that introduced birds had more CpG sites in TLR2A and TLR4, but not TLR1B, than native ones. Introduced birds also lost more CpG sites in TLR1B, gained more CpG sites in TLR2A, and lost fewer CpG sites in TLR4 compared to native birds. These results were not driven by differences in genetic diversity or population genetic structure, and many CpG sites fell within predicted transcription factor binding sites (TFBS), with losses and gains of CpG sites altering predicted TFBS. Although we lacked statistical power to conduct the most rigorous possible analyses, these results suggest that epigenetic potential may play a role in house sparrow range expansions, but additional work will be critical to elucidating how epigenetic potential affects gene expression and hence phenotypic plasticity at the individual, population, and species levels.
Collapse
Affiliation(s)
- Haley E Hanson
- Global and Planetary Health, University of South Florida, 3720 Spectrum Blvd, Suite 304, Tampa, FL 33620, USA
| | - Bilal Koussayer
- Global and Planetary Health, University of South Florida, 3720 Spectrum Blvd, Suite 304, Tampa, FL 33620, USA
| | - Holly J Kilvitis
- Department of Integrative Biology, University of South Florida, 4202 E. Fowler Ave, SCA110, Tampa, FL 33620, USA
| | - Aaron W Schrey
- Department of Biology, Georgia Southern University, Armstrong Campus, 11935 Abercorn St, SC1010, Savannah, GA 31419, USA
| | - J Dylan Maddox
- Field Museum of Natural History, 1400 S. Lake Shore Drive, Chicago, IL 60605, USA.,Laboratorio de Biotecnología y Bioenergética, Universidad Científica del Perú, Iquitos, Perú.,American Public University System, Environmental Sciences, Charles Town, WV 25414, USA
| | - Lynn B Martin
- Global and Planetary Health, University of South Florida, 3720 Spectrum Blvd, Suite 304, Tampa, FL 33620, USA
| |
Collapse
|