1
|
Hua X, Zhou H, Wu HC, Furnari J, Kotidis CP, Rabadan R, Genkinger JM, Bruce JN, Canoll P, Santella RM, Zhang Z. Tumor detection by analysis of both symmetric- and hemi-methylation of plasma cell-free DNA. Nat Commun 2024; 15:6113. [PMID: 39030196 PMCID: PMC11271492 DOI: 10.1038/s41467-024-50471-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 07/08/2024] [Indexed: 07/21/2024] Open
Abstract
Aberrant DNA methylation patterns have been used for cancer detection. However, DNA hemi-methylation, present at about 10% CpG dinucleotides, has been less well studied. Here we show that a majority of differentially hemi-methylated regions (DHMRs) in liver tumor DNA or plasma cells free (cf) DNA do not overlap with differentially methylated regions (DMRs) of the same samples, indicating that DHMRs could serve as independent biomarkers. Furthermore, we analyzed the cfDNA methylomes of 215 samples from individuals with liver or brain cancer and individuals without cancer (controls), and trained machine learning models using DMRs, DHMRs or both. The models incorporated with both DMRs and DHMRs show a superior performance compared to models trained with DMRs or DHMRs, with AUROC being 0.978, 0.990, and 0.983 in distinguishing control, liver and brain cancer, respectively, in a validation cohort. This study supports the potential of utilizing both DMRs and DHMRs for multi-cancer detection.
Collapse
Affiliation(s)
- Xu Hua
- Institute for Cancer Genetics, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Pediatrics, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Hui Zhou
- Institute for Cancer Genetics, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Pediatrics, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Hui-Chen Wu
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA
| | - Julia Furnari
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Neurological Surgery, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Corina P Kotidis
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Pathology and Cell Biology, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Raul Rabadan
- Program for Mathematical Genomics and Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Jeanine M Genkinger
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Epidemiology, Columbia University Mailman School of Public Health, New York, NY, USA
| | - Jeffrey N Bruce
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Neurological Surgery, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Peter Canoll
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Pathology and Cell Biology, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Regina M Santella
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, 10032, USA
| | - Zhiguo Zhang
- Institute for Cancer Genetics, Columbia University Irving Medical Center, New York, NY, 10032, USA.
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, 10032, USA.
- Department of Pediatrics, Columbia University Irving Medical Center, New York, NY, 10032, USA.
- Department of Genetics and Development, Columbia University Irving Medical Center, New York, NY, 10032, USA.
| |
Collapse
|
2
|
Videla Rodriguez EA, Pértille F, Guerrero-Bosagna C, Mitchell JBO, Jensen P, Smith VA. Practical application of a Bayesian network approach to poultry epigenetics and stress. BMC Bioinformatics 2022; 23:261. [PMID: 35778683 PMCID: PMC9250184 DOI: 10.1186/s12859-022-04800-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 06/14/2022] [Indexed: 11/23/2022] Open
Abstract
Background Relationships among genetic or epigenetic features can be explored by learning probabilistic networks and unravelling the dependencies among a set of given genetic/epigenetic features. Bayesian networks (BNs) consist of nodes that represent the variables and arcs that represent the probabilistic relationships between the variables. However, practical guidance on how to make choices among the wide array of possibilities in Bayesian network analysis is limited. Our study aimed to apply a BN approach, while clearly laying out our analysis choices as an example for future researchers, in order to provide further insights into the relationships among epigenetic features and a stressful condition in chickens (Gallus gallus). Results Chickens raised under control conditions (n = 22) and chickens exposed to a social isolation protocol (n = 24) were used to identify differentially methylated regions (DMRs). A total of 60 DMRs were selected by a threshold, after bioinformatic pre-processing and analysis. The treatment was included as a binary variable (control = 0; stress = 1). Thereafter, a BN approach was applied: initially, a pre-filtering test was used for identifying pairs of features that must not be included in the process of learning the structure of the network; then, the average probability values for each arc of being part of the network were calculated; and finally, the arcs that were part of the consensus network were selected. The structure of the BN consisted of 47 out of 61 features (60 DMRs and the stressful condition), displaying 43 functional relationships. The stress condition was connected to two DMRs, one of them playing a role in tight and adhesive intracellular junctions in organs such as ovary, intestine, and brain. Conclusions We clearly explain our steps in making each analysis choice, from discrete BN models to final generation of a consensus network from multiple model averaging searches. The epigenetic BN unravelled functional relationships among the DMRs, as well as epigenetic features in close association with the stressful condition the chickens were exposed to. The DMRs interacting with the stress condition could be further explored in future studies as possible biomarkers of stress in poultry species. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04800-0.
Collapse
Affiliation(s)
| | - Fábio Pértille
- Environmental Toxicology Program, Institute of Organismal Biology, Uppsala University, Uppsala, Sweden.,Department of Biomedical & Clinical Sciences (BKV), Linköping University, 58183, Linköping, Sweden.,AVIAN Behavioural Genomics and Physiology Group, Department of Physics, Chemistry and Biology, Linköping University, 58183, Linköping, Sweden
| | - Carlos Guerrero-Bosagna
- Environmental Toxicology Program, Institute of Organismal Biology, Uppsala University, Uppsala, Sweden.,AVIAN Behavioural Genomics and Physiology Group, Department of Physics, Chemistry and Biology, Linköping University, 58183, Linköping, Sweden
| | - John B O Mitchell
- EaStCHEM School of Chemistry, University of St Andrews, St Andrews, Fife, KY16 9ST, UK
| | - Per Jensen
- AVIAN Behavioural Genomics and Physiology Group, Department of Physics, Chemistry and Biology, Linköping University, 58183, Linköping, Sweden
| | - V Anne Smith
- School of Biology, University of St Andrews, St Andrews, Fife, KY16 9TH, UK.
| |
Collapse
|
3
|
Nayan V, Singh K, Iquebal MA, Jaiswal S, Bhardwaj A, Singh C, Bhatia T, Kumar S, Singh R, Swaroop MN, Kumar R, Phulia SK, Bharadwaj A, Datta TK, Rai A, Kumar D. Genome-Wide DNA Methylation and Its Effect on Gene Expression During Subclinical Mastitis in Water Buffalo. Front Genet 2022; 13:828292. [PMID: 35368672 PMCID: PMC8965078 DOI: 10.3389/fgene.2022.828292] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 01/31/2022] [Indexed: 11/20/2022] Open
Abstract
Subclinical mastitis (SCM) in buffalo is one of the most challenging paradoxes for the dairy sector with very significant milk production losses and poses an imminent danger to milch animal’s milk-producing ability. We present here the genome-wide methylation specific to SCM in water buffalo and its consequential effect on the gene expression landscape for the first time. Whole-genome DNA methylation profiles from peripheral blood lymphocytes and gene expression profiles from milk somatic cells of healthy and SCM cases were catalogued from the MeDIP-Seq and RNA-Seq data. The average methylation in healthy buffaloes was found to be higher than that in the SCM-infected buffaloes. DNA methylation was abundant in the intergenic region followed by the intronic region in both healthy control and SCM groups. A total of 3,950 differentially methylated regions (DMRs) were identified and annotated to 370 differentially methylated genes (DMGs), most of which were enriched in the promoter region. Several important pathways were activated due to hypomethylation and belonged to the Staphylococcus aureus infection, Th17 cell differentiation, and antigen processing and presentation pathways along with others of defense responses. DNA methylome was compared with transcriptome to understand the regulatory role of DNA methylation on gene expression specific to SCM in buffaloes. A total of 4,778 significant differentially expressed genes (DEGs) were extracted in response to SCM, out of which 67 DMGs were also found to be differentially expressed, suggesting that during SCM, DNA methylation could be one of the epigenetic regulatory mechanisms of gene expression. Genes like CSF2RB, LOC102408349, C3 and PZP like, and CPAMD8 were found to be downregulated in our study, which are known to be involved in the immune response to SCM. Association of DNA methylation with transposable elements, miRNAs, and lncRNAs was also studied. The present study reports a buffalo SCM web resource (BSCM2TDb) available at http://webtom.cabgrid.res.in/BSCM2TDb that catalogues all the mastitis-related information of the analyses results of this study in a single place. This will be of immense use to buffalo researchers to understand the host–pathogen interaction involving SCM, which is required in endeavors of mastitis control and management.
Collapse
Affiliation(s)
- Varij Nayan
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - Kalpana Singh
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistical Research Institute, New Delhi, India
| | - Mir Asif Iquebal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistical Research Institute, New Delhi, India
| | - Sarika Jaiswal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistical Research Institute, New Delhi, India
| | | | - Chhama Singh
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - Tanvi Bhatia
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - Sunil Kumar
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - Rakshita Singh
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - M. N. Swaroop
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - Rajesh Kumar
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - S. K. Phulia
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | | | - T. K. Datta
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - Anil Rai
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistical Research Institute, New Delhi, India
| | - Dinesh Kumar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistical Research Institute, New Delhi, India
- *Correspondence: Dinesh Kumar,
| |
Collapse
|
4
|
Sperm Methylome Profiling Can Discern Fertility Levels in the Porcine Biomedical Model. Int J Mol Sci 2021; 22:ijms22052679. [PMID: 33800945 PMCID: PMC7961483 DOI: 10.3390/ijms22052679] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 03/01/2021] [Accepted: 03/04/2021] [Indexed: 12/20/2022] Open
Abstract
A combined Genotyping By Sequencing (GBS) and methylated DNA immunoprecipitation (MeDIP) protocol was used to identify—in parallel—genetic variation (Genomic-Wide Association Studies (GWAS) and epigenetic differences of Differentially Methylated Regions (DMR) in the genome of spermatozoa from the porcine animal model. Breeding boars with good semen quality (n = 11) and specific and well-documented differences in fertility (farrowing rate, FR) and prolificacy (litter size, LS) (n = 7) in artificial insemination programs, using combined FR and LS, were categorized as High Fertile (HF, n = 4) or Low Fertile (LF, n = 3), and boars with Unknown Fertility (UF, n = 4) were tested for eventual epigenetical similarity with those fertility-proven. We identified 165,944 Single Nucleotide Polymorphisms (SNPs) that explained 14–15% of variance among selection lines. Between HF and LF individuals (n = 7, 4 HF and 3 LF), we identified 169 SNPs with p ≤ 0.00015, which explained 58% of the variance. For the epigenetic analyses, we considered fertility and period of ejaculate collection (late-summer and mid-autumn). Approximately three times more DMRs were observed in HF than in LF boars across these periods. Interestingly, UF boars were clearly clustered with one of the other HF or LF groups. The highest differences in DMRs between HF and LF experimental groups across the pig genome were located in the chr 3, 9, 13, and 16, with most DMRs being hypermethylated in LF boars. In both HF and LF boars, DMRs were mostly hypermethylated in late-summer compared to mid-autumn. Three overlaps were detected between SNPs (p ≤ 0.0005, n = 1318) and CpG sites within DMRs. In conclusion, fertility levels in breeding males including FR and LS can be discerned using methylome analyses. The findings in this biomedical animal model ought to be applied besides sire selection for andrological diagnosis of idiopathic sub/infertility.
Collapse
|
5
|
Pértille F, Ibelli AMG, Sharif ME, Poleti MD, Fröhlich AS, Rezaei S, Ledur MC, Jensen P, Guerrero-Bosagna C, Coutinho LL. Putative Epigenetic Biomarkers of Stress in Red Blood Cells of Chickens Reared Across Different Biomes. Front Genet 2020; 11:508809. [PMID: 33240310 PMCID: PMC7667380 DOI: 10.3389/fgene.2020.508809] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 09/11/2020] [Indexed: 12/19/2022] Open
Abstract
Production animals are constantly subjected to early adverse environmental conditions that influence the adult phenotype and produce epigenetic effects. CpG dinucleotide methylation in red blood cells (RBC) could be a useful epigenetic biomarker to identify animals subjected to chronic stress in the production environment. Here we compared a reduced fraction of the RBC methylome of chickens exposed to social isolation to non-exposed. These experiments were performed in two different locations: Brazil and Sweden. The aim was to identify stress-associated DNA methylation profiles in RBC across these populations, in spite of the variable conditions to which birds are exposed in each facility and their different lineages. Birds were increasingly exposed to a social isolation treatment, combined with food and water deprivation, at random periods of the day from weeks 1-4 after hatching. We then collected the RBC DNA from individuals and compared a reduced fraction of their methylome between the experimental groups using two bioinformatic approaches to identify differentially methylated regions (DMRs): one using fixed-size windows and another that preselected differential peaks with MACS2. Three levels of significance were used (P ≤ 0.05, P ≤ 0.005, and P ≤ 0.0005) to identify DMRs between experimental groups, which were then used for different analyses. With both of the approaches more DMRs reached the defined significance thresholds in BR individuals compared to SW. However, more DMRs had higher fold change values in SW compared to BR individuals. Interestingly, ChrZ was enriched above expectancy for the presence of DMRs. Additionally, when analyzing the locations of these DMRs in relation to the transcription starting site (TSS), we found three peaks with high DMR presence: 10 kb upstream, the TSS itself, and 20-40 kb downstream. Interestingly, these peaks had DMRs with a high presence (>50%) of specific transcription factor binding sites. Three overlapping DMRs were found between the BR and SW population using the most relaxed p-value (P ≤ 0.05). With the most stringent p-value (P ≤ 0.0005), we found 7 and 4 DMRs between treatments in the BR and SW populations, respectively. This study is the first approximation to identify epigenetic biomarkers of long-term exposure to stress in different lineages of production animals.
Collapse
Affiliation(s)
- Fábio Pértille
- Animal Biotechnology Laboratory, Animal Science and Pastures Department, University of São Paulo (USP)/"Luiz de Queiroz" College of Agriculture (ESALQ), Piracicaba, Brazil.,Avian Behavioural Genomics and Physiology Group, IFM Biology, Linköping University, Linköping, Sweden
| | | | - Maj El Sharif
- Avian Behavioural Genomics and Physiology Group, IFM Biology, Linköping University, Linköping, Sweden
| | - Mirele Daiana Poleti
- Animal Science Program, Faculty of Animal Science and Food Engineering (FZEA), University of São Paulo (USP), Pirassununga, Brazil
| | - Anna Sophie Fröhlich
- Avian Behavioural Genomics and Physiology Group, IFM Biology, Linköping University, Linköping, Sweden
| | - Shiva Rezaei
- Avian Behavioural Genomics and Physiology Group, IFM Biology, Linköping University, Linköping, Sweden
| | | | - Per Jensen
- Avian Behavioural Genomics and Physiology Group, IFM Biology, Linköping University, Linköping, Sweden
| | - Carlos Guerrero-Bosagna
- Avian Behavioural Genomics and Physiology Group, IFM Biology, Linköping University, Linköping, Sweden.,Evolutionary Biology Centre, Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - Luiz Lehmann Coutinho
- Animal Biotechnology Laboratory, Animal Science and Pastures Department, University of São Paulo (USP)/"Luiz de Queiroz" College of Agriculture (ESALQ), Piracicaba, Brazil
| |
Collapse
|
6
|
Welsh L, Maleszka R, Foret S. Detecting rare asymmetrically methylated cytosines and decoding methylation patterns in the honeybee genome. ROYAL SOCIETY OPEN SCIENCE 2017; 4:170248. [PMID: 28989734 PMCID: PMC5627074 DOI: 10.1098/rsos.170248] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 08/07/2017] [Indexed: 05/12/2023]
Abstract
Context-dependent gene expression in eukaryotes is controlled by several mechanisms including cytosine methylation that primarily occurs in the CG dinucleotides (CpGs). However, less frequent non-CpG asymmetric methylation has been found in various cell types, such as mammalian neurons, and recent results suggest that these sites can repress transcription independently of CpG contexts. In addition, an emerging view is that CpG hemimethylation may arise not only from deregulation of cellular processes but also be a standard feature of the methylome. Here, we have applied a novel approach to examine whether asymmetric CpG methylation is present in a sparsely methylated genome of the honeybee, a social insect with a high level of epigenetically driven phenotypic plasticity. By combining strand-specific ultra-deep amplicon sequencing of illustrator genes with whole-genome methylomics and bioinformatics, we show that rare asymmetrically methylated CpGs can be unambiguously detected in the honeybee genome. Additionally, we confirm differential methylation between two phenotypically and reproductively distinct castes, queens and workers, and offer new insight into the heterogeneity of brain methylation patterns. In particular, we challenge the assumption that symmetrical methylation levels reflect symmetry in the underlying methylation patterns and conclude that hemimethylation may occur more frequently than indicated by methylation levels. Finally, we question the validity of a prior study in which most of cytosine methylation in this species was reported to be asymmetric.
Collapse
|
7
|
Welsh L, Maleszka R, Foret S. Detecting rare asymmetrically methylated cytosines and decoding methylation patterns in the honeybee genome. ROYAL SOCIETY OPEN SCIENCE 2017; 4:170248. [PMID: 28989734 DOI: 10.5061/dryad.7nb8q] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 08/07/2017] [Indexed: 05/26/2023]
Abstract
Context-dependent gene expression in eukaryotes is controlled by several mechanisms including cytosine methylation that primarily occurs in the CG dinucleotides (CpGs). However, less frequent non-CpG asymmetric methylation has been found in various cell types, such as mammalian neurons, and recent results suggest that these sites can repress transcription independently of CpG contexts. In addition, an emerging view is that CpG hemimethylation may arise not only from deregulation of cellular processes but also be a standard feature of the methylome. Here, we have applied a novel approach to examine whether asymmetric CpG methylation is present in a sparsely methylated genome of the honeybee, a social insect with a high level of epigenetically driven phenotypic plasticity. By combining strand-specific ultra-deep amplicon sequencing of illustrator genes with whole-genome methylomics and bioinformatics, we show that rare asymmetrically methylated CpGs can be unambiguously detected in the honeybee genome. Additionally, we confirm differential methylation between two phenotypically and reproductively distinct castes, queens and workers, and offer new insight into the heterogeneity of brain methylation patterns. In particular, we challenge the assumption that symmetrical methylation levels reflect symmetry in the underlying methylation patterns and conclude that hemimethylation may occur more frequently than indicated by methylation levels. Finally, we question the validity of a prior study in which most of cytosine methylation in this species was reported to be asymmetric.
Collapse
Affiliation(s)
- Laura Welsh
- Research School of Biology, The Australian National University, Canberra, Australian Capital Territory 2601, Australia
| | - Ryszard Maleszka
- Research School of Biology, The Australian National University, Canberra, Australian Capital Territory 2601, Australia
| | - Sylvain Foret
- Research School of Biology, The Australian National University, Canberra, Australian Capital Territory 2601, Australia
| |
Collapse
|