Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Price EM, Robinson WP. Adjusting for Batch Effects in DNA Methylation Microarray Data, a Lesson Learned. Front Genet 2018;9:83. [PMID: 29616078 PMCID: PMC5864890 DOI: 10.3389/fgene.2018.00083] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2017] [Accepted: 02/27/2018] [Indexed: 11/15/2022] Open

For:	Price EM, Robinson WP. Adjusting for Batch Effects in DNA Methylation Microarray Data, a Lesson Learned. Front Genet 2018;9:83. [PMID: 29616078 PMCID: PMC5864890 DOI: 10.3389/fgene.2018.00083] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2017] [Accepted: 02/27/2018] [Indexed: 11/15/2022] Open

Number

Cited by Other Article(s)

Pospiech M, Beckford J, Kumar AMS, Tamizharasan M, Brito J, Liang G, Mangul S, Alachkar H. The DNA methylation landscape across the TCR loci in patients with acute myeloid leukemia. Int Immunopharmacol 2024;138:112376. [PMID: 38917523 DOI: 10.1016/j.intimp.2024.112376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 05/09/2024] [Accepted: 05/28/2024] [Indexed: 06/27/2024]

Rahman ML, Breeze CE, Shu XO, Wong JYY, Blechter B, Cardenas A, Wang X, Ji BT, Hu W, Cai Q, Hosgood HD, Yang G, Shi J, Long J, Gao YT, Bell DA, Zheng W, Rothman N, Lan Q. Epigenome-wide association study of lung cancer among never smokers in two prospective cohorts in Shanghai, China. Thorax 2024;79:735-744. [PMID: 38702190 PMCID: PMC11251856 DOI: 10.1136/thorax-2023-220352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 02/17/2024] [Indexed: 05/06/2024]

Abstract

BACKGROUND

The aetiology of lung cancer among individuals who never smoked remains elusive, despite 15% of lung cancer cases in men and 53% in women worldwide being unrelated to smoking. Epigenetic alterations, particularly DNA methylation (DNAm) changes, have emerged as potential drivers. Yet, few prospective epigenome-wide association studies (EWAS), primarily focusing on peripheral blood DNAm with limited representation of never smokers, have been conducted.

METHODS

We conducted a nested case-control study of 80 never-smoking incident lung cancer cases and 83 never-smoking controls within the Shanghai Women's Health Study and Shanghai Men's Health Study. DNAm was measured in prediagnostic oral rinse samples using Illumina MethylationEPIC array. Initially, we conducted an EWAS to identify differentially methylated positions (DMPs) associated with lung cancer in the discovery sample of 101 subjects. The top 50 DMPs were further evaluated in a replication sample of 62 subjects, and results were pooled using fixed-effect meta-analysis.

RESULTS

Our study identified three DMPs significantly associated with lung cancer at the epigenome-wide significance level of p<8.22×10-8. These DMPs were identified as cg09198866 (MYH9; TXN2), cg01411366 (SLC9A10) and cg12787323. Furthermore, examination of the top 1000 DMPs indicated significant enrichment in epithelial regulatory regions and their involvement in small GTPase-mediated signal transduction pathways. Additionally, GrimAge acceleration was identified as a risk factor for lung cancer (OR=1.19 per year; 95% CI 1.06 to 1.34).

CONCLUSIONS

While replication in a larger sample size is necessary, our findings suggest that DNAm patterns in prediagnostic oral rinse samples could provide novel insights into the underlying mechanisms of lung cancer in never smokers.

Collapse

Affiliation(s)

Mohammad L Rahman Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Charles E Breeze Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Xiao-Ou Shu Vanderbilt University Medical Center, Nashville, Tennessee, USA
Jason Y Y Wong Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Batel Blechter Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Andres Cardenas Department of Epidemiology and Population Health, Stanford University, Stanford, California, USA
Xuting Wang Immunity, Inflammation and Diseases Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
Bu-Tian Ji Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Wei Hu Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Qiuyin Cai Vanderbilt University, Nashville, Tennessee, USA
H Dean Hosgood Albert Einstein College of Medicine, Bronx, New York, USA
Gong Yang Department of Medicine, Vanderbilt-Ingram Cancer Center, Nashville, Tennessee, USA
Jianxin Shi Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Jirong Long Department of Medicine, Vanderbilt-Ingram Cancer Center, Nashville, Tennessee, USA
Yu-Tang Gao Shanghai Cancer Institute, Shanghai, China
Douglas A Bell Immunity, Inflammation and Diseases Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
Wei Zheng Vanderbilt University Medical Center, Nashville, Tennessee, USA
Nathaniel Rothman Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
Qing Lan Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA

Collapse

Hannon ER, Marsit CJ, Dent AE, Embury P, Ogolla S, Midem D, Williams SM, Kazura JW. Transcriptome- and DNA methylation-based cell-type deconvolutions produce similar estimates of differential gene expression and differential methylation. BioData Min 2024;17:21. [PMID: 38992677 PMCID: PMC11241886 DOI: 10.1186/s13040-024-00374-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 07/01/2024] [Indexed: 07/13/2024] Open

Zhuang BC, Jude MS, Konwar C, Ryan CP, Whitehead J, Engelbrecht HR, MacIsaac JL, Dever K, Toan TK, Korinek K, Zimmer Z, Huffman KM, Lee NR, McDade TW, Kuzawa CW, Belsky DW, Kobor MS. Comparison of Infinium MethylationEPIC v2.0 to v1.0 for human population epigenetics: considerations for addressing EPIC version differences in DNA methylation-based tools. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.02.600461. [PMID: 39005299 PMCID: PMC11245009 DOI: 10.1101/2024.07.02.600461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]

Affiliation(s)

Beryl C Zhuang BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Marcia Smiti Jude BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Chaini Konwar BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Calen P Ryan Robert N. Butler Columbia Aging Center, Mailman School of Public Health, Columbia University, New York, NY 10032, USA
Joanne Whitehead BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Hannah-Ruth Engelbrecht BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Julia L MacIsaac BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Kristy Dever BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Tran Khanh Toan Family Medicine Department, Hanoi Medical University, Hanoi, Vietnam
Kim Korinek Department of Sociology, University of Utah, Salt Lake City, USA
Zachary Zimmer Department of Family Studies and Gerontology, Mount Saint Vincent University, Halifax, Canada Canada Research Chair, Global Aging and Community Initiative
Kim M Huffman Duke University School of Medicine, Durham, NC, 27701, USA
Nanette R Lee USC-Office of Population Studies Foundation, Inc., University of San Carlos, Cebu City, Philippines
Thomas W McDade Department of Anthropology, Northwestern University, Evanston, Illinois, USA Program in Child and Brain Development, CIFAR, Toronto, Ontario, Canada
Christopher W Kuzawa Department of Anthropology and Institute for Policy Research, Northwestern University, Evanston, IL 60208, USA
Daniel W Belsky Butler Columbia Aging Center, Columbia University Mailman School of Public Health, New York, NY, USA Department of Epidemiology, Columbia University Mailman School of Public Health, New York, NY, USA
Michael S Kobor BC Children's Hospital Research Institute, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada Department of Medical Genetics, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada Program in Child and Brain Development, CIFAR, Toronto, Ontario, Canada The Edwin S.H. Leong UBC Healthy Aging Chair-A UBC President's Excellence Chair, University of British Columbia, Canada

Collapse

Deng WQ, Pigeyre M, Azab SM, Wilson SL, Campbell N, Cawte N, Morrison KM, Atkinson SA, Subbarao P, Turvey SE, Moraes TJ, Mandhane P, Azad MB, Simons E, Pare G, Anand SS. Consistent cord blood DNA methylation signatures of gestational age between South Asian and white European cohorts. Clin Epigenetics 2024;16:74. [PMID: 38840168 PMCID: PMC11155053 DOI: 10.1186/s13148-024-01684-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 05/23/2024] [Indexed: 06/07/2024] Open

Abstract

BACKGROUND

Epigenetic modifications, particularly DNA methylation (DNAm) in cord blood, are an important biological marker of how external exposures during gestation can influence the in-utero environment and subsequent offspring development. Despite the recognized importance of DNAm during gestation, comparative studies to determine the consistency of these epigenetic signals across different ethnic groups are largely absent. To address this gap, we first performed epigenome-wide association studies (EWAS) of gestational age (GA) using newborn cord blood DNAm comparatively in a white European (n = 342) and a South Asian (n = 490) birth cohort living in Canada. Then, we capitalized on established cord blood epigenetic GA clocks to examine the associations between maternal exposures, offspring characteristics and epigenetic GA, as well as GA acceleration, defined as the residual difference between epigenetic and chronological GA at birth.

RESULTS

Individual EWASs confirmed 1,211 and 1,543 differentially methylated CpGs previously reported to be associated with GA, in white European and South Asian cohorts, respectively, with a similar distribution of effects. We confirmed that Bohlin's cord blood GA clock was robustly correlated with GA in white Europeans (r = 0.71; p = 6.0 × 10-54) and South Asians (r = 0.66; p = 6.9 × 10-64). In both cohorts, Bohlin's clock was positively associated with newborn weight and length and negatively associated with parity, newborn female sex, and gestational diabetes. Exclusive to South Asians, the GA clock was positively associated with the newborn ponderal index, while pre-pregnancy weight and gestational weight gain were strongly predictive of increased epigenetic GA in white Europeans. Important predictors of GA acceleration included gestational diabetes mellitus, newborn sex, and parity in both cohorts.

CONCLUSIONS

These results demonstrate the consistent DNAm signatures of GA and the utility of Bohlin's GA clock across the two populations. Although the overall pattern of DNAm is similar, its connections with the mother's environment and the baby's anthropometrics can differ between the two groups. Further research is needed to understand these unique relationships.

Collapse

Affiliation(s)

Wei Q Deng Peter Boris Centre for Addictions Research, St. Joseph's Healthcare Hamilton, Hamilton, Canada. Department of Psychiatry and Behavioural Neurosciences, McMaster University, Hamilton, Canada. Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada.
Marie Pigeyre Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, Canada Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON, Canada
Sandi M Azab Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada
Samantha L Wilson Department of Obstetrics and Gynecology, McMaster University, Hamilton, Canada
Natalie Campbell Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada
Nathan Cawte Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, Canada
Katherine M Morrison Department of Pediatrics, McMaster University, Hamilton, Canada
Stephanie A Atkinson Department of Pediatrics, McMaster University, Hamilton, Canada
Padmaja Subbarao Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada Hospital for Sick Children, Department of Pediatrics, University of Toronto, Toronto, Canada Program in Translational Medicine, SickKids Research Institute, Toronto, Canada
Stuart E Turvey Department of Pediatrics, BC Children's Hospital, The University of British Columbia, Vancouver, Canada
Theo J Moraes Hospital for Sick Children, Department of Pediatrics, University of Toronto, Toronto, Canada Program in Translational Medicine, SickKids Research Institute, Toronto, Canada
Piush Mandhane Department of Pediatrics, University of Alberta, Edmonton, Canada
Meghan B Azad Department of Pediatrics and Child Health, Children's Hospital Research Institute of Manitoba, University of Manitoba, Winnipeg, Canada
Elinor Simons Section of Allergy and Immunology, Department of Pediatrics and Child Health, University of Manitoba, Winnipeg, Canada
Guillaume Pare Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, Canada Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON, Canada Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada Department of Pathology and Molecular Medicine, Michael G. DeGroote School of Medicine, McMaster University, Hamilton, Canada
Sonia S Anand Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada. Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, Canada. Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada.

Collapse

Hannon ER, Marsit CJ, Dent AE, Embury P, Ogolla S, Midem D, Williams SM, Kazura JW. Transcriptome- and DNA methylation-based cell-type deconvolutions produce similar estimates of differential gene expression and differential methylation. RESEARCH SQUARE 2024:rs.3.rs-3992113. [PMID: 38645047 PMCID: PMC11030537 DOI: 10.21203/rs.3.rs-3992113/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]

Bunyavanich S, Becker PM, Altman MC, Lasky-Su J, Ober C, Zengler K, Berdyshev E, Bonneau R, Chatila T, Chatterjee N, Chung KF, Cutcliffe C, Davidson W, Dong G, Fang G, Fulkerson P, Himes BE, Liang L, Mathias RA, Ogino S, Petrosino J, Price ND, Schadt E, Schofield J, Seibold MA, Steen H, Wheatley L, Zhang H, Togias A, Hasegawa K. Analytical challenges in omics research on asthma and allergy: A National Institute of Allergy and Infectious Diseases workshop. J Allergy Clin Immunol 2024;153:954-968. [PMID: 38295882 PMCID: PMC10999353 DOI: 10.1016/j.jaci.2024.01.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 01/19/2024] [Accepted: 01/24/2024] [Indexed: 02/29/2024]

Affiliation(s)

Supinda Bunyavanich Icahn School of Medicine at Mount Sinai, New York, NY.
Patrice M Becker National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Md
Matthew C Altman University of Washington, Seattle, Wash
Jessica Lasky-Su Brigham & Women's Hospital and Harvard Medical School, Boston, Mass
Carole Ober University of Chicago, Chicago, Ill
Karsten Zengler University of California, San Diego, Calif
Evgeny Berdyshev National Jewish Health, Denver, Colo
Richard Bonneau Genentech, New York, NY
Talal Chatila Boston Children's Hospital and Harvard Medical School, Boston, Mass
Nilanjan Chatterjee Johns Hopkins University, Baltimore, Md
Kian Fan Chung Imperial College, London, United Kingdom
Colleen Cutcliffe Pendulum Therapeutics, San Francisco, Calif
Wendy Davidson National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Md
Gang Dong National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Md
Gang Fang Icahn School of Medicine at Mount Sinai, New York, NY
Patricia Fulkerson National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Md
Blanca E Himes University of Pennsylvania, Philadelphia, Pa
Liming Liang Harvard T. H. Chan School of Public Health, Boston, Mass
Rasika A Mathias Johns Hopkins University, Baltimore, Md
Shuji Ogino Brigham & Women's Hospital and Harvard Medical School, Boston, Mass; Harvard T. H. Chan School of Public Health, Boston, Mass; Broad Institute of MIT and Harvard, Boston, Mass
Joseph Petrosino Baylor College of Medicine, Houston, Tex
Nathan D Price Thorne HealthTech, New York, NY
Eric Schadt Icahn School of Medicine at Mount Sinai, New York, NY
James Schofield University of Southampton, Southampton, United Kingdom
Max A Seibold National Jewish Health, Denver, Colo; University of Colorado School of Medicine, Aurora, Colo
Hanno Steen Boston Children's Hospital and Harvard Medical School, Boston, Mass
Lisa Wheatley National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Md
Hongmei Zhang School of Public Health, University of Memphis, Memphis, Tenn
Alkis Togias National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, Md
Kohei Hasegawa Massachusetts General Hospital and Harvard Medical School, Boston, Mass

Collapse

Kreitmaier P, Park YC, Swift D, Gilly A, Wilkinson JM, Zeggini E. Epigenomic profiling of the infrapatellar fat pad in osteoarthritis. Hum Mol Genet 2024;33:501-509. [PMID: 37975894 PMCID: PMC10939427 DOI: 10.1093/hmg/ddad198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 10/13/2023] [Accepted: 11/07/2023] [Indexed: 11/19/2023] Open

Casazza W, Inkster AM, Del Gobbo GF, Yuan V, Delahaye F, Marsit C, Park YP, Robinson WP, Mostafavi S, Dennis JK. Sex-dependent placental methylation quantitative trait loci provide insight into the prenatal origins of childhood onset traits and conditions. iScience 2024;27:109047. [PMID: 38357671 PMCID: PMC10865402 DOI: 10.1016/j.isci.2024.109047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 06/19/2023] [Accepted: 01/23/2024] [Indexed: 02/16/2024] Open

Affiliation(s)

William Casazza Centre for Molecular Medicine and Therapeutics, BC Children’s Hospital, Vancouver, BC, Canada Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada BC Children’s Hospital Research Institute, Vancouver, BC, Canada
Amy M. Inkster BC Children’s Hospital Research Institute, Vancouver, BC, Canada Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
Giulia F. Del Gobbo BC Children’s Hospital Research Institute, Vancouver, BC, Canada Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada Children’s Hospital of Eastern Ontario Research Institute, University of Ottawa, Ottawa, ON, Canada
Victor Yuan BC Children’s Hospital Research Institute, Vancouver, BC, Canada Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
Fabien Delahaye Albert Einstein College of Medicine, The Bronx, NY, USA
Carmen Marsit Rollins School of Public Health, Emory University, Atlanta, GA, USA
Yongjin P. Park Department of Statistics, University of British Columbia, Vancouver, BC, Canada Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada
Wendy P. Robinson BC Children’s Hospital Research Institute, Vancouver, BC, Canada Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
Sara Mostafavi Centre for Molecular Medicine and Therapeutics, BC Children’s Hospital, Vancouver, BC, Canada Paul Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
Jessica K. Dennis Centre for Molecular Medicine and Therapeutics, BC Children’s Hospital, Vancouver, BC, Canada Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada BC Children’s Hospital Research Institute, Vancouver, BC, Canada Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada

Collapse

Stone TC, Ward V, Hogan A, Alexander Ho KM, Wilson A, McBain H, Duku M, Wolfson P, Cheung S, Rosenfeld A, Lovat LB. Using saliva epigenetic data to develop and validate a multivariable predictor of esophageal cancer status. Epigenomics 2024;16:109-125. [PMID: 38226541 PMCID: PMC10825730 DOI: 10.2217/epi-2023-0248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 01/04/2024] [Indexed: 01/17/2024] Open

Affiliation(s)

Timothy C Stone Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Vanessa Ward Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Aine Hogan Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Kai Man Alexander Ho Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK Wellcome/EPSRC Centre for Interventional & Surgical Sciences (WEISS), University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Ash Wilson Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Hazel McBain Wellcome/EPSRC Centre for Interventional & Surgical Sciences (WEISS), University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Margaret Duku Wellcome/EPSRC Centre for Interventional & Surgical Sciences (WEISS), University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Paul Wolfson Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Sharon Cheung Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK
Avi Rosenfeld Department of Computer Science, Jerusalem College of Technology, Havaad Haleumi 21, Givat Mordechai, 91160, Jerusalem, Israel
Laurence B Lovat Division of Surgery & Interventional Science, University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK Wellcome/EPSRC Centre for Interventional & Surgical Sciences (WEISS), University College London, Charles Bell House, 43-45 Foley Street, London, W1W 7TY, UK Department of Gastrointestinal Services, University College London Hospital, 235 Euston Road, London, NW1 2BU, UK

Collapse

Al-Chalabi N, Qian J, Gerretsen P, Chaudhary Z, Fischer C, Graff A, Remington G, De Luca V. Dynamic change in genome-wide methylation in response to increased suicidal ideation in schizophrenia spectrum disorders. J Neural Transm (Vienna) 2023;130:1303-1313. [PMID: 37584690 DOI: 10.1007/s00702-023-02661-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 06/01/2023] [Indexed: 08/17/2023]

Landen S, Jacques M, Hiam D, Alvarez-Romero J, Schittenhelm RB, Shah AD, Huang C, Steele JR, Harvey NR, Haupt LM, Griffiths LR, Ashton KJ, Lamon S, Voisin S, Eynon N. Sex differences in muscle protein expression and DNA methylation in response to exercise training. Biol Sex Differ 2023;14:56. [PMID: 37670389 PMCID: PMC10478435 DOI: 10.1186/s13293-023-00539-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 08/18/2023] [Indexed: 09/07/2023] Open

Abstract

BACKGROUND

Exercise training elicits changes in muscle physiology, epigenomics, transcriptomics, and proteomics, with males and females exhibiting differing physiological responses to exercise training. However, the molecular mechanisms contributing to the differing adaptations between the sexes are poorly understood.

METHODS

We performed a meta-analysis for sex differences in skeletal muscle DNA methylation following an endurance training intervention (Gene SMART cohort and E-MTAB-11282 cohort). We investigated for sex differences in the skeletal muscle proteome following an endurance training intervention (Gene SMART cohort). Lastly, we investigated whether the methylome and proteome are associated with baseline cardiorespiratory fitness (maximal oxygen consumption; VO2max) in a sex-specific manner.

RESULTS

Here, we investigated for the first time, DNA methylome and proteome sex differences in response to exercise training in human skeletal muscle (n = 78; 50 males, 28 females). We identified 92 DNA methylation sites (CpGs) associated with exercise training; however, no CpGs changed in a sex-dependent manner. In contrast, we identified 189 proteins that are differentially expressed between the sexes following training, with 82 proteins differentially expressed between the sexes at baseline. Proteins showing the most robust sex-specific response to exercise include SIRT3, MRPL41, and MBP. Irrespective of sex, cardiorespiratory fitness was associated with robust methylome changes (19,257 CpGs) and no proteomic changes. We did not observe sex differences in the association between cardiorespiratory fitness and the DNA methylome. Integrative multi-omic analysis identified sex-specific mitochondrial metabolism pathways associated with exercise responses. Lastly, exercise training and cardiorespiratory fitness shifted the DNA methylomes to be more similar between the sexes.

CONCLUSIONS

We identified sex differences in protein expression changes, but not DNA methylation changes, following an endurance exercise training intervention; whereas we identified no sex differences in the DNA methylome or proteome response to lifelong training. Given the delicate interaction between sex and training as well as the limitations of the current study, more studies are required to elucidate whether there is a sex-specific training effect on the DNA methylome. We found that genes involved in mitochondrial metabolism pathways are differentially modulated between the sexes following endurance exercise training. These results shed light on sex differences in molecular adaptations to exercise training in skeletal muscle.

Collapse

Affiliation(s)

Shanie Landen Institute for Health and Sport (iHeS), Victoria University, Melbourne, Australia Centre for Endocrinology and Metabolism, Hudson Institute of Medical Research, Melbourne, VIC, Australia
Macsue Jacques Institute for Health and Sport (iHeS), Victoria University, Melbourne, Australia
Danielle Hiam Institute for Health and Sport (iHeS), Victoria University, Melbourne, Australia Institute for Physical Activity and Nutrition, School of Exercise and Nutrition Sciences, Deakin University, Geelong, Australia
Javier Alvarez-Romero Institute for Health and Sport (iHeS), Victoria University, Melbourne, Australia
Ralf B Schittenhelm Monash Proteomics and Metabolomics Facility, Monash University, Melbourne, Australia
Anup D Shah Monash Proteomics and Metabolomics Facility, Monash University, Melbourne, Australia
Cheng Huang Monash Proteomics and Metabolomics Facility, Monash University, Melbourne, Australia
Joel R Steele Monash Proteomics and Metabolomics Facility, Monash University, Melbourne, Australia
Nicholas R Harvey Faculty of Health Sciences and Medicine, Bond University, Gold Coast, QLD, 4226, Australia Centre for Genomics and Personalised Health, Genomics Research Centre, School of Biomedical Sciences, Queensland University of Technology (QUT), 60 Musk Ave., Kelvin Grove, QLD, 4059, Australia
Larisa M Haupt Centre for Genomics and Personalised Health, Genomics Research Centre, School of Biomedical Sciences, Queensland University of Technology (QUT), 60 Musk Ave., Kelvin Grove, QLD, 4059, Australia
Lyn R Griffiths Centre for Genomics and Personalised Health, Genomics Research Centre, School of Biomedical Sciences, Queensland University of Technology (QUT), 60 Musk Ave., Kelvin Grove, QLD, 4059, Australia
Kevin J Ashton Faculty of Health Sciences and Medicine, Bond University, Gold Coast, QLD, 4226, Australia
Séverine Lamon Institute for Physical Activity and Nutrition, School of Exercise and Nutrition Sciences, Deakin University, Geelong, Australia
Sarah Voisin Institute for Health and Sport (iHeS), Victoria University, Melbourne, Australia Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Nir Eynon Institute for Health and Sport (iHeS), Victoria University, Melbourne, Australia. Australian Regenerative Medicine Institute (ARMI), Monash University, Clayton, VIC, 3800, Australia.

Collapse

Fransquet PD, Macdonald JA, Ryan J, Greenwood CJ, Olsson CA. Exploring perinatal biopsychosocial factors and epigenetic age in 1-year-old offspring. Epigenomics 2023;15:927-939. [PMID: 37905426 DOI: 10.2217/epi-2023-0284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023] Open

Nishitani S, Fujisawa TX, Yao A, Takiguchi S, Tomoda A. Evaluation of the pooled sample method in Infinium MethylationEPIC BeadChip array by comparison with individual samples. Clin Epigenetics 2023;15:138. [PMID: 37641110 PMCID: PMC10463626 DOI: 10.1186/s13148-023-01544-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 07/29/2023] [Indexed: 08/31/2023] Open

Abstract

BACKGROUND

The pooled sample method is used in epigenomic research and expression analysis and is a cost-effective screening approach for small amounts of DNA. Evaluation of the pooled sample method in epigenomic studies is performed using the Illumina Infinium Methylation 450K BeadChip array; however, subsequent reports on the updated 850K array are lacking. A previous study demonstrated that the methylation levels obtained from individual samples were accurately replicated using pooled samples but did not address epigenome-wide association study (EWAS) statistics. The DNA quantification method, which is important for the homogeneous mixing of DNA in the pooled sample method, has since become fluorescence-based, and additional factors need to be considered including the resolution of batch effects of microarray chips and the heterogeneity of the cellular proportions from which the DNA samples are derived. In this study, four pooled samples were created from 44 individual samples, and EWAS statistics for differentially methylated positions (DMPs) and regions (DMRs) were conducted for individual samples and compared with the statistics obtained from the pooled samples.

RESULTS

The methylation levels could be reproduced fairly well in the pooled samples. This was the case for the entire dataset and when limited to the top 100 CpG sites, consistent with a previous study using the 450K BeadChip array. However, the statistical results of the EWAS for the DMP by individual samples were not replicated in pooled samples. Qualitative analyses highlighting methylation within an arbitrary candidate gene were replicable. Focusing on chr 20, the statistical results of EWAS for DMR from individual samples showed replicability in the pooled samples as long as they were limited to regions with a sufficient effect size.

CONCLUSIONS

The pooled sample method replicated the methylation values well and can be used for EWAS in DMR. This method is sample amount-effective and cost-effective and can be utilized for screening by carefully understanding the effective features and disadvantages of the pooled sample method and combining it with candidate gene analyses.

Collapse

Ye H, Zhang X, Wang C, Goode EL, Chen J. Batch-effect correction with sample remeasurement in highly confounded case-control studies. NATURE COMPUTATIONAL SCIENCE 2023;3:709-719. [PMID: 38177326 PMCID: PMC10993308 DOI: 10.1038/s43588-023-00500-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 07/11/2023] [Indexed: 01/06/2024]

Fajarda O, Almeida JR, Duarte-Pereira S, Silva RM, Oliveira JL. Methodology to identify a gene expression signature by merging microarray datasets. Comput Biol Med 2023;159:106867. [PMID: 37060770 DOI: 10.1016/j.compbiomed.2023.106867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 03/01/2023] [Accepted: 03/30/2023] [Indexed: 04/17/2023]

Perini S, Filosi M, Domenici E. Candidate biomarkers from the integration of methylation and gene expression in discordant autistic sibling pairs. Transl Psychiatry 2023;13:109. [PMID: 37012247 PMCID: PMC10070641 DOI: 10.1038/s41398-023-02407-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Revised: 03/18/2023] [Accepted: 03/21/2023] [Indexed: 04/05/2023] Open

Yosef A, Shnaider E, Schneider M, Gurevich M. Heuristic normalization procedure for batch effect correction. Soft comput 2023. [DOI: 10.1007/s00500-023-08049-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/31/2023]

Brockway HM, Wilson SL, Kallapur SG, Buhimschi CS, Muglia LJ, Jones HN. Characterization of methylation profiles in spontaneous preterm birth placental villous tissue. PLoS One 2023;18:e0279991. [PMID: 36952446 PMCID: PMC10035933 DOI: 10.1371/journal.pone.0279991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Indexed: 03/25/2023] Open

Abstract

Preterm birth is a global public health crisis which results in significant neonatal and maternal mortality. Yet little is known regarding the molecular mechanisms of idiopathic spontaneous preterm birth, and we have few diagnostic markers for adequate assessment of placental development and function. Previous studies of placental pathology and our transcriptomics studies suggest a role for placental maturity in idiopathic spontaneous preterm birth. It is known that placental DNA methylation changes over gestation. We hypothesized that if placental hypermaturity is present in our samples, we would observe a unique idiopathic spontaneous preterm birth DNA methylation profile potentially driving the gene expression differences we previously identified in our placental samples. Our results indicate the idiopathic spontaneous preterm birth DNA methylation pattern mimics the term birth methylation pattern suggesting hypermaturity. Only seven significant differentially methylated regions fitting the idiopathic spontaneous preterm birth specific (relative to the controls) profile were identified, indicating unusually high similarity in DNA methylation between idiopathic spontaneous preterm birth and term birth samples. We identified an additional 1,718 significantly methylated regions in our gestational age matched controls where the idiopathic spontaneous preterm birth DNA methylation pattern mimics the term birth methylation pattern, again indicating a striking level of similarity between the idiopathic spontaneous preterm birth and term birth samples. Pathway analysis of these regions revealed differences in genes within the WNT and Cadherin signaling pathways, both of which are essential in placental development and maturation. Taken together, these data demonstrate that the idiopathic spontaneous preterm birth samples display a hypermature methylation signature than expected given their respective gestational age which likely impacts birth timing.

Collapse

Gim JA. Integrative Approaches of DNA Methylation Patterns According to Age, Sex and Longitudinal Changes. Curr Genomics 2023;23:385-399. [PMID: 37920553 PMCID: PMC10173416 DOI: 10.2174/1389202924666221207100513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 10/04/2022] [Accepted: 11/04/2022] [Indexed: 12/12/2022] Open

Louise J, Deussen AR, Dodd JM. Data processing choices can affect findings in differential methylation analyses: an investigation using data from the LIMIT RCT. PeerJ 2023;11:e14786. [PMID: 36755865 PMCID: PMC9901304 DOI: 10.7717/peerj.14786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 01/03/2023] [Indexed: 02/05/2023] Open

Inkster AM, Wong MT, Matthews AM, Brown CJ, Robinson WP. Who's afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data. Epigenetics Chromatin 2023;16:1. [PMID: 36609459 PMCID: PMC9825011 DOI: 10.1186/s13072-022-00477-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 12/27/2022] [Indexed: 01/09/2023] Open

Abstract

BACKGROUND

Many human disease phenotypes manifest differently by sex, making the development of methods for incorporating X and Y-chromosome data into analyses vital. Unfortunately, X and Y chromosome data are frequently excluded from large-scale analyses of the human genome and epigenome due to analytical complexity associated with sex chromosome dosage differences between XX and XY individuals, and the impact of X-chromosome inactivation (XCI) on the epigenome. As such, little attention has been given to considering the methods by which sex chromosome data may be included in analyses of DNA methylation (DNAme) array data.

RESULTS

With Illumina Infinium HumanMethylation450 DNAme array data from 634 placental samples, we investigated the effects of probe filtering, normalization, and batch correction on DNAme data from the X and Y chromosomes. Processing steps were evaluated in both mixed-sex and sex-stratified subsets of the analysis cohort to identify whether including both sexes impacted processing results. We found that identification of probes that have a high detection p-value, or that are non-variable, should be performed in sex-stratified data subsets to avoid over- and under-estimation of the quantity of probes eligible for removal, respectively. All normalization techniques investigated returned X and Y DNAme data that were highly correlated with the raw data from the same samples. We found no difference in batch correction results after application to mixed-sex or sex-stratified cohorts. Additionally, we identify two analytical methods suitable for XY chromosome data, the choice between which should be guided by the research question of interest, and we performed a proof-of-concept analysis studying differential DNAme on the X and Y chromosome in the context of placental acute chorioamnionitis. Finally, we provide an annotation of probe types that may be desirable to filter in X and Y chromosome analyses, including probes in repetitive elements, the X-transposed region, and cancer-testis gene promoters.

CONCLUSION

While there may be no single "best" approach for analyzing DNAme array data from the X and Y chromosome, analysts must consider key factors during processing and analysis of sex chromosome data to accommodate the underlying biology of these chromosomes, and the technical limitations of DNA methylation arrays.

Collapse

Yosef A, Shnaider E, Schneider M, Gurevich M. Normalization of Large-Scale Transcriptome Data Using Heuristic Methods. Bioinform Biol Insights 2023;17:11779322231160397. [PMID: 37020503 PMCID: PMC10068970 DOI: 10.1177/11779322231160397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 02/09/2023] [Indexed: 04/03/2023] Open

Chicco D, Oneto L, Tavazzi E. Eleven quick tips for data cleaning and feature engineering. PLoS Comput Biol 2022;18:e1010718. [PMID: 36520712 PMCID: PMC9754225 DOI: 10.1371/journal.pcbi.1010718] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open

Kalyakulina A, Yusipov I, Bacalini MG, Franceschi C, Vedunova M, Ivanchenko M. Disease classification for whole-blood DNA methylation: Meta-analysis, missing values imputation, and XAI. Gigascience 2022;11:giac097. [PMID: 36259657 PMCID: PMC9718659 DOI: 10.1093/gigascience/giac097] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 08/01/2022] [Accepted: 09/15/2022] [Indexed: 07/25/2023] Open

Keshawarz A, Joehanes R, Guan W, Huan T, DeMeo DL, Grove ML, Fornage M, Levy D, O’Connor G. Longitudinal change in blood DNA epigenetic signature after smoking cessation. Epigenetics 2022;17:1098-1109. [PMID: 34570667 PMCID: PMC9542417 DOI: 10.1080/15592294.2021.1985301] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 08/20/2021] [Accepted: 09/21/2021] [Indexed: 12/14/2022] Open

Chu S, Avery A, Yoshimoto J, Bryan JN. Genome wide exploration of the methylome in aggressive B-cell lymphoma in Golden Retrievers reveals a conserved hypermethylome. Epigenetics 2022;17:2022-2038. [PMID: 35912844 PMCID: PMC9665123 DOI: 10.1080/15592294.2022.2105033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Derakhshan M, Kessler NJ, Ishida M, Demetriou C, Brucato N, Moore G, Fall CHD, Chandak GR, Ricaut FX, Prentice A, Hellenthal G, Silver M. Tissue- and ethnicity-independent hypervariable DNA methylation states show evidence of establishment in the early human embryo. Nucleic Acids Res 2022;50:6735-6752. [PMID: 35713545 PMCID: PMC9749461 DOI: 10.1093/nar/gkac503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 05/06/2022] [Accepted: 05/27/2022] [Indexed: 12/24/2022] Open

HarmonizR enables data harmonization across independent proteomic datasets with appropriate handling of missing values. Nat Commun 2022;13:3523. [PMID: 35725563 PMCID: PMC9209422 DOI: 10.1038/s41467-022-31007-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 05/25/2022] [Indexed: 01/01/2023] Open

Ross JP, van Dijk S, Phang M, Skilton MR, Molloy PL, Oytam Y. Batch-effect detection, correction and characterisation in Illumina HumanMethylation450 and MethylationEPIC BeadChip array data. Clin Epigenetics 2022;14:58. [PMID: 35488315 PMCID: PMC9055778 DOI: 10.1186/s13148-022-01277-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 04/10/2022] [Indexed: 11/20/2022] Open

Abstract

Background

Genomic technologies can be subject to significant batch-effects which are known to reduce experimental power and to potentially create false positive results. The Illumina Infinium Methylation BeadChip is a popular technology choice for epigenome-wide association studies (EWAS), but presently, little is known about the nature of batch-effects on these designs. Given the subtlety of biological phenotypes in many EWAS, control for batch-effects should be a consideration.

Results

Using the batch-effect removal approaches in the ComBat and Harman software, we examined two in-house datasets and compared results with three large publicly available datasets, (1214 HumanMethylation450 and 1094 MethylationEPIC BeadChips in total), and find that despite various forms of preprocessing, some batch-effects persist. This residual batch-effect is associated with the day of processing, the individual glass slide and the position of the array on the slide. Consistently across all datasets, 4649 probes required high amounts of correction. To understand the impact of this set to EWAS studies, we explored the literature and found three instances where persistently batch-effect prone probes have been reported in abstracts as key sites of differential methylation. As well as batch-effect susceptible probes, we also discover a set of probes which are erroneously corrected. We provide batch-effect workflows for Infinium Methylation data and provide reference matrices of batch-effect prone and erroneously corrected features across the five datasets spanning regionally diverse populations and three commonly collected biosamples (blood, buccal and saliva).

Conclusions

Batch-effects are ever present, even in high-quality data, and a strategy to deal with them should be part of experimental design, particularly for EWAS. Batch-effect removal tools are useful to reduce technical variance in Infinium Methylation data, but they need to be applied with care and make use of post hoc diagnostic measures.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13148-022-01277-9.

Collapse

Dayon L, Cominetti O, Affolter M. Proteomics of Human Biological Fluids for Biomarker Discoveries: Technical Advances and Recent Applications. Expert Rev Proteomics 2022;19:131-151. [PMID: 35466824 DOI: 10.1080/14789450.2022.2070477] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Cao X, Li W, Wang T, Ran D, Davalos V, Planas-Serra L, Pujol A, Esteller M, Wang X, Yu H. Accelerated biological aging in COVID-19 patients. Nat Commun 2022;13:2135. [PMID: 35440567 PMCID: PMC9018863 DOI: 10.1038/s41467-022-29801-8] [Citation(s) in RCA: 82] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 03/30/2022] [Indexed: 01/01/2023] Open

Affiliation(s)

Xue Cao Department of Oncology, Guizhou Provincial People's Hospital, Guiyang, Guizhou, China.,Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China.,Department of Colorectal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China
Wenjuan Li Department of Pulmonary and Critical Care Medicine, The Third Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China
Ting Wang Research & Development, Thermo Fisher Scientific Inc., Los Angeles, CA, USA
Dongzhi Ran Department of Pharmacology, College of Medicine, University of Arizona, Tucson, AZ, USA.,Key Laboratory of Biochemistry and Molecular Pharmacology, Department of Pharmacology, Chongqing Medical University, Chongqing, China
Veronica Davalos Josep Carreras Leukaemia Research Institute (IJC), Barcelona, Catalonia, Spain
Laura Planas-Serra Neurometabolic Diseases Laboratory, Bellvitge Biomedical Research Institute (IDIBELL), Barcelona, Catalonia, Spain.,Center for Biomedical Research on Rare Diseases (CIBERER), ISCIII, Madrid, Spain
Aurora Pujol Neurometabolic Diseases Laboratory, Bellvitge Biomedical Research Institute (IDIBELL), Barcelona, Catalonia, Spain.,Center for Biomedical Research on Rare Diseases (CIBERER), ISCIII, Madrid, Spain.,Institucio Catalana de Recerca i Estudis Avancats (ICREA), Barcelona, Catalonia, Spain
Manel Esteller Josep Carreras Leukaemia Research Institute (IJC), Barcelona, Catalonia, Spain.,Institucio Catalana de Recerca i Estudis Avancats (ICREA), Barcelona, Catalonia, Spain.,Centro de Investigación Biomédica en Red de Cancer (CIBERONC), Madrid, Spain.,Physiological Sciences Department, School of Medicine and Health Sciences, University of Barcelona (UB), Barcelona, Catalonia, Spain
Xiaolin Wang Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China
Huichuan Yu Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China. .,Department of Colorectal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China.

Collapse

Noble AJ, Purcell RV, Adams AT, Lam YK, Ring PM, Anderson JR, Osborne AJ. A Final Frontier in Environment-Genome Interactions? Integrated, Multi-Omic Approaches to Predictions of Non-Communicable Disease Risk. Front Genet 2022;13:831866. [PMID: 35211161 PMCID: PMC8861380 DOI: 10.3389/fgene.2022.831866] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Accepted: 01/19/2022] [Indexed: 12/26/2022] Open

Vandenbon A. Evaluation of critical data processing steps for reliable prediction of gene co-expression from large collections of RNA-seq data. PLoS One 2022;17:e0263344. [PMID: 35089979 PMCID: PMC8797241 DOI: 10.1371/journal.pone.0263344] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 01/16/2022] [Indexed: 11/19/2022] Open

Fujisawa TX, Nishitani S, Makita K, Yao A, Takiguchi S, Hamamura S, Shimada K, Okazawa H, Matsuzaki H, Tomoda A. Association of Epigenetic Differences Screened in a Few Cases of Monozygotic Twins Discordant for Attention-Deficit Hyperactivity Disorder With Brain Structures. Front Neurosci 2022;15:799761. [PMID: 35145374 PMCID: PMC8823258 DOI: 10.3389/fnins.2021.799761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 12/16/2021] [Indexed: 11/13/2022] Open

Affiliation(s)

Takashi X. Fujisawa Research Center for Child Mental Development, University of Fukui, Fukui, Japan Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan *Correspondence: Takashi X. Fujisawa,
Shota Nishitani Research Center for Child Mental Development, University of Fukui, Fukui, Japan Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan
Kai Makita Research Center for Child Mental Development, University of Fukui, Fukui, Japan
Akiko Yao Research Center for Child Mental Development, University of Fukui, Fukui, Japan Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan
Shinichiro Takiguchi Department of Child and Adolescent Psychological Medicine, University of Fukui Hospital, Fukui, Japan
Shoko Hamamura Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan Department of Child and Adolescent Psychological Medicine, University of Fukui Hospital, Fukui, Japan
Koji Shimada Research Center for Child Mental Development, University of Fukui, Fukui, Japan Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan Biomedical Imaging Research Center, University of Fukui, Fukui, Japan
Hidehiko Okazawa Research Center for Child Mental Development, University of Fukui, Fukui, Japan Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan Biomedical Imaging Research Center, University of Fukui, Fukui, Japan
Hideo Matsuzaki Research Center for Child Mental Development, University of Fukui, Fukui, Japan Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan Department of Child and Adolescent Psychological Medicine, University of Fukui Hospital, Fukui, Japan
Akemi Tomoda Research Center for Child Mental Development, University of Fukui, Fukui, Japan Division of Developmental Higher Brain Functions, United Graduate School of Child Development, Osaka University, Kanazawa University, Hamamatsu University School of Medicine, Chiba University, and University of Fukui, Osaka, Japan Department of Child and Adolescent Psychological Medicine, University of Fukui Hospital, Fukui, Japan *Correspondence: Takashi X. Fujisawa,

Collapse

Xia Q, Thompson JA, Koestler DC. Batch effect reduction of microarray data with dependent samples using an empirical Bayes approach (BRIDGE). Stat Appl Genet Mol Biol 2021;20:101-119. [PMID: 34905304 PMCID: PMC9617207 DOI: 10.1515/sagmb-2021-0020] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 10/29/2021] [Indexed: 11/15/2022]

Campagna MP, Xavier A, Lechner-Scott J, Maltby V, Scott RJ, Butzkueven H, Jokubaitis VG, Lea RA. Epigenome-wide association studies: current knowledge, strategies and recommendations. Clin Epigenetics 2021;13:214. [PMID: 34863305 PMCID: PMC8645110 DOI: 10.1186/s13148-021-01200-8] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 11/19/2021] [Indexed: 02/06/2023] Open

Abstract

The aetiology and pathophysiology of complex diseases are driven by the interaction between genetic and environmental factors. The variability in risk and outcomes in these diseases are incompletely explained by genetics or environmental risk factors individually. Therefore, researchers are now exploring the epigenome, a biological interface at which genetics and the environment can interact. There is a growing body of evidence supporting the role of epigenetic mechanisms in complex disease pathophysiology. Epigenome-wide association studies (EWASes) investigate the association between a phenotype and epigenetic variants, most commonly DNA methylation. The decreasing cost of measuring epigenome-wide methylation and the increasing accessibility of bioinformatic pipelines have contributed to the rise in EWASes published in recent years. Here, we review the current literature on these EWASes and provide further recommendations and strategies for successfully conducting them. We have constrained our review to studies using methylation data as this is the most studied epigenetic mechanism; microarray-based data as whole-genome bisulphite sequencing remains prohibitively expensive for most laboratories; and blood-based studies due to the non-invasiveness of peripheral blood collection and availability of archived DNA, as well as the accessibility of publicly available blood-cell-based methylation data. Further, we address multiple novel areas of EWAS analysis that have not been covered in previous reviews: (1) longitudinal study designs, (2) the chip analysis methylation pipeline (ChAMP), (3) differentially methylated region (DMR) identification paradigms, (4) methylation quantitative trait loci (methQTL) analysis, (5) methylation age analysis and (6) identifying cell-specific differential methylation from mixed cell data using statistical deconvolution.

Collapse

Zou Q, Wang X, Ren D, Hu B, Tang G, Zhang Y, Huang M, Pai RK, Buchanan DD, Win AK, Newcomb PA, Grady WM, Yu H, Luo Y. DNA methylation-based signature of CD8+ tumor-infiltrating lymphocytes enables evaluation of immune response and prognosis in colorectal cancer. J Immunother Cancer 2021;9:jitc-2021-002671. [PMID: 34548385 PMCID: PMC8458312 DOI: 10.1136/jitc-2021-002671] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/11/2021] [Indexed: 01/12/2023] Open

Affiliation(s)

Qi Zou Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Department of Colorectal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Department of Colorectal and Anal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Xiaolin Wang Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Donglin Ren Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Department of Colorectal and Anal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Bang Hu Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Department of Colorectal and Anal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Guannan Tang Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Yu Zhang Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Meijin Huang Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China.,Department of Colorectal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Rish K Pai Department of laboratory Medicine and Pathology, Mayo Clinic Arizona, Scottsdale, Arizona, USA
Daniel D Buchanan Colorectal Oncogenomics Group, Department of Clinical Pathology, The University of Melbourne, Parkville, Victoria, Australia.,University of Melbourne Centre for Cancer Research, Victorian Comprehensive Cancer Centre, Parkville, Victoria, Australia.,Genomic Medicine and Familial Cancer Centre, The Royal Melbourne Hospital, Parkville, Victoria, Australia
Aung Ko Win Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Parkville, Victoria, Australia
Polly A Newcomb Department of Epidemiology, University of Washington School of Public Health, Seattle, Washington, USA.,Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
William M Grady Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA.,Department of Medicine, University of Washington School of Medicine, Seattle, Washington, USA
Huichuan Yu Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China
Yanxin Luo Guangdong Institute of Gastroenterology, Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Disease, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China .,Department of Colorectal Surgery, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, China

Collapse

Vanderlinden LA, Johnson RK, Carry PM, Dong F, DeMeo DL, Yang IV, Norris JM, Kechris K. An effective processing pipeline for harmonizing DNA methylation data from Illumina's 450K and EPIC platforms for epidemiological studies. BMC Res Notes 2021;14:352. [PMID: 34496950 PMCID: PMC8424820 DOI: 10.1186/s13104-021-05741-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 08/16/2021] [Indexed: 02/06/2023] Open

Fang ZY, Lin CX, Xu YP, Li HD, Xu QS. REBET: a method to determine the number of cell clusters based on batch effect removal. Brief Bioinform 2021;22:6299206. [PMID: 34131702 DOI: 10.1093/bib/bbab204] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 04/20/2021] [Accepted: 03/12/2021] [Indexed: 01/01/2023] Open

Inkster AM, Yuan V, Konwar C, Matthews AM, Brown CJ, Robinson WP. A cross-cohort analysis of autosomal DNA methylation sex differences in the term placenta. Biol Sex Differ 2021;12:38. [PMID: 34044884 PMCID: PMC8162041 DOI: 10.1186/s13293-021-00381-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 05/17/2021] [Indexed: 12/14/2022] Open

Non AL. Social epigenomics: are we at an impasse? Epigenomics 2021;13:1747-1759. [PMID: 33749316 DOI: 10.2217/epi-2020-0136] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open

Hettegger P, Vierlinger K, Weinhaeusel A. Random rotation for identifying differentially expressed genes with linear models following batch effect correction. Bioinformatics 2021;37:2142-2149. [PMID: 33523104 DOI: 10.1093/bioinformatics/btab063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 01/11/2021] [Accepted: 01/27/2021] [Indexed: 11/13/2022] Open

Transcriptome Profiling Analyses in Psoriasis: A Dynamic Contribution of Keratinocytes to the Pathogenesis. Genes (Basel) 2020;11:genes11101155. [PMID: 33007857 PMCID: PMC7600703 DOI: 10.3390/genes11101155] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Revised: 09/28/2020] [Accepted: 09/29/2020] [Indexed: 02/08/2023] Open

Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers. BIOMED RESEARCH INTERNATIONAL 2020;2020:1363827. [PMID: 32832541 PMCID: PMC7428878 DOI: 10.1155/2020/1363827] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Revised: 03/30/2020] [Accepted: 05/11/2020] [Indexed: 11/21/2022]

Zindler T, Frieling H, Neyazi A, Bleich S, Friedel E. Simulating ComBat: how batch correction can lead to the systematic introduction of false positive results in DNA methylation microarray studies. BMC Bioinformatics 2020;21:271. [PMID: 32605541 PMCID: PMC7328269 DOI: 10.1186/s12859-020-03559-6] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Accepted: 05/26/2020] [Indexed: 12/04/2022] Open

Abstract

Background

Systematic technical effects—also called batch effects—are a considerable challenge when analyzing DNA methylation (DNAm) microarray data, because they can lead to false results when confounded with the variable of interest. Methods to correct these batch effects are error-prone, as previous findings have shown.

Results

Here, we demonstrate how using the R function ComBat to correct simulated Infinium HumanMethylation450 BeadChip (450 K) and Infinium MethylationEPIC BeadChip Kit (EPIC) DNAm data can lead to a large number of false positive results under certain conditions. We further provide a detailed assessment of the consequences for the highly relevant problem of p-value inflation with subsequent false positive findings after application of the frequently used ComBat method. Using ComBat to correct for batch effects in randomly generated samples produced alarming numbers of false discovery rate (FDR) and Bonferroni-corrected (BF) false positive results in unbalanced as well as in balanced sample distributions in terms of the relation between the outcome of interest variable and the technical position of the sample during the probe measurement. Both sample size and number of batch factors (e.g. number of chips) were systematically simulated to assess the probability of false positive findings. The effect of sample size was simulated using n = 48 up to n = 768 randomly generated samples. Increasing the number of corrected factors led to an exponential increase in the number of false positive signals. Increasing the number of samples reduced, but did not completely prevent, this effect.

Conclusions

Using the approach described, we demonstrate, that using ComBat for batch correction in DNAm data can lead to false positive results under certain conditions and sample distributions. Our results are thus contrary to previous publications, considering a balanced sample distribution as unproblematic when using ComBat. We do not claim completeness in terms of reporting all technical conditions and possible solutions of the occurring problems as we approach the problem from a clinician’s perspective and not from that of a computer scientist. With our approach of simulating data, we provide readers with a simple method to assess the probability of false positive findings in DNAm microarray data analysis pipelines.

Collapse

Judge M, Parker E, Naniche D, Le Souëf P. Gene Expression: the Key to Understanding HIV-1 Infection? Microbiol Mol Biol Rev 2020;84:e00080-19. [PMID: 32404327 PMCID: PMC7233484 DOI: 10.1128/mmbr.00080-19] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

Salgado C, Gruis N, Heijmans BT, Oosting J, van Doorn R. Genome-wide analysis of constitutional DNA methylation in familial melanoma. Clin Epigenetics 2020;12:43. [PMID: 32143689 PMCID: PMC7060565 DOI: 10.1186/s13148-020-00831-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 02/20/2020] [Indexed: 12/26/2022] Open

Yu F, Qiu C, Xu C, Tian Q, Zhao LJ, Wu L, Deng HW, Shen H. Mendelian Randomization Identifies CpG Methylation Sites With Mediation Effects for Genetic Influences on BMD in Peripheral Blood Monocytes. Front Genet 2020;11:60. [PMID: 32180791 PMCID: PMC7059767 DOI: 10.3389/fgene.2020.00060] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Accepted: 01/17/2020] [Indexed: 12/18/2022] Open

Machine learning workflows to estimate class probabilities for precision cancer diagnostics on DNA methylation microarray data. Nat Protoc 2020;15:479-512. [PMID: 31932775 DOI: 10.1038/s41596-019-0251-6] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Accepted: 10/04/2019] [Indexed: 01/01/2023]

Abstract

DNA methylation data-based precision cancer diagnostics is emerging as the state of the art for molecular tumor classification. Standards for choosing statistical methods with regard to well-calibrated probability estimates for these typically highly multiclass classification tasks are still lacking. To support this choice, we evaluated well-established machine learning (ML) classifiers including random forests (RFs), elastic net (ELNET), support vector machines (SVMs) and boosted trees in combination with post-processing algorithms and developed ML workflows that allow for unbiased class probability (CP) estimation. Calibrators included ridge-penalized multinomial logistic regression (MR) and Platt scaling by fitting logistic regression (LR) and Firth's penalized LR. We compared these workflows on a recently published brain tumor 450k DNA methylation cohort of 2,801 samples with 91 diagnostic categories using a 5 × 5-fold nested cross-validation scheme and demonstrated their generalizability on external data from The Cancer Genome Atlas. ELNET was the top stand-alone classifier with the best calibration profiles. The best overall two-stage workflow was MR-calibrated SVM with linear kernels closely followed by ridge-calibrated tuned RF. For calibration, MR was the most effective regardless of the primary classifier. The protocols developed as a result of these comparisons provide valuable guidance on choosing ML workflows and their tuning to generate well-calibrated CP estimates for precision diagnostics using DNA methylation data. Computation times vary depending on the ML algorithm from <15 min to 5 d using multi-core desktop PCs. Detailed scripts in the open-source R language are freely available on GitHub, targeting users with intermediate experience in bioinformatics and statistics and using R with Bioconductor extensions.

Collapse