201
|
Liu C, Wang M, Wei X, Wu L, Xu J, Dai X, Xia J, Cheng M, Yuan Y, Zhang P, Li J, Feng T, Chen A, Zhang W, Chen F, Shang Z, Zhang X, Peters BA, Liu L. An ATAC-seq atlas of chromatin accessibility in mouse tissues. Sci Data 2019; 6:65. [PMID: 31110271 PMCID: PMC6527694 DOI: 10.1038/s41597-019-0071-0] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Accepted: 04/10/2019] [Indexed: 01/06/2023] Open
Abstract
The Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) is a fundamental epigenomics approach and has been widely used in profiling the chromatin accessibility dynamics in multiple species. A comprehensive reference of ATAC-seq datasets for mammalian tissues is important for the understanding of regulatory specificity and developmental abnormality caused by genetic or environmental alterations. Here, we report an adult mouse ATAC-seq atlas by producing a total of 66 ATAC-seq profiles from 20 primary tissues of both male and female mice. The ATAC-seq read enrichment, fragment size distribution, and reproducibility between replicates demonstrated the high quality of the full dataset. We identified a total of 296,574 accessible elements, of which 26,916 showed tissue-specific accessibility. Further, we identified key transcription factors specific to distinct tissues and found that the enrichment of each motif reflects the developmental similarities across tissues. In summary, our study provides an important resource on the mouse epigenome and will be of great importance to various scientific disciplines such as development, cell reprogramming, and genetic disease.
Collapse
Affiliation(s)
- Chuanyu Liu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Mingyue Wang
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Xiaoyu Wei
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Liang Wu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Jiangshan Xu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Xi Dai
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Jun Xia
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Mengnan Cheng
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Yue Yuan
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Pengfan Zhang
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Jiguang Li
- BGI-Shenzhen, Shenzhen, 518083, China
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Taiqing Feng
- BGI-Shenzhen, Shenzhen, 518083, China
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Ao Chen
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Wenwei Zhang
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Fang Chen
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
- Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, 2100, Denmark
| | - Zhouchun Shang
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Xiuqing Zhang
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, 518083, China
- BGI-Shenzhen, Shenzhen, 518083, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Brock A Peters
- BGI-Shenzhen, Shenzhen, 518083, China.
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.
- MGI, BGI-Shenzhen, Shenzhen, 518083, China.
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, California, 95134, USA.
| | - Longqi Liu
- BGI-Shenzhen, Shenzhen, 518083, China.
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.
| |
Collapse
|
202
|
Brinkman AB, Nik-Zainal S, Simmer F, Rodríguez-González FG, Smid M, Alexandrov LB, Butler A, Martin S, Davies H, Glodzik D, Zou X, Ramakrishna M, Staaf J, Ringnér M, Sieuwerts A, Ferrari A, Morganella S, Fleischer T, Kristensen V, Gut M, van de Vijver MJ, Børresen-Dale AL, Richardson AL, Thomas G, Gut IG, Martens JWM, Foekens JA, Stratton MR, Stunnenberg HG. Partially methylated domains are hypervariable in breast cancer and fuel widespread CpG island hypermethylation. Nat Commun 2019; 10:1749. [PMID: 30988298 PMCID: PMC6465362 DOI: 10.1038/s41467-019-09828-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 03/27/2019] [Indexed: 12/21/2022] Open
Abstract
Global loss of DNA methylation and CpG island (CGI) hypermethylation are key epigenomic aberrations in cancer. Global loss manifests itself in partially methylated domains (PMDs) which extend up to megabases. However, the distribution of PMDs within and between tumor types, and their effects on key functional genomic elements including CGIs are poorly defined. We comprehensively show that loss of methylation in PMDs occurs in a large fraction of the genome and represents the prime source of DNA methylation variation. PMDs are hypervariable in methylation level, size and distribution, and display elevated mutation rates. They impose intermediate DNA methylation levels incognizant of functional genomic elements including CGIs, underpinning a CGI methylator phenotype (CIMP). Repression effects on tumor suppressor genes are negligible as they are generally excluded from PMDs. The genomic distribution of PMDs reports tissue-of-origin and may represent tissue-specific silent regions which tolerate instability at the epigenetic, transcriptomic and genetic level.
Collapse
Affiliation(s)
- Arie B Brinkman
- Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University, PO Box 9101, Nijmegen, 6500 HB, The Netherlands.
| | - Serena Nik-Zainal
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
- Academic Department of Medical Genetics, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Femke Simmer
- Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University, PO Box 9101, Nijmegen, 6500 HB, The Netherlands
- Department of Pathology, Radboud University Nijmegen Medical Centre, P.O. Box 9101, Nijmegen, 6500 HB, The Netherlands
| | - F Germán Rodríguez-González
- Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Department of Medical Oncology, Erasmus University Medical Center, Rotterdam, 3015 GD, The Netherlands
| | - Marcel Smid
- Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Department of Medical Oncology, Erasmus University Medical Center, Rotterdam, 3015 GD, The Netherlands
| | - Ludmil B Alexandrov
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
- Theoretical Biology and Biophysics (T-6), Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | - Adam Butler
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Sancha Martin
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Helen Davies
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Dominik Glodzik
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | - Xueqing Zou
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK
| | | | - Johan Staaf
- Division of Oncology and Pathology, Department of Clinical Sciences Lund, Lund University, Lund, SE-223 81, Sweden
| | - Markus Ringnér
- Division of Oncology and Pathology, Department of Clinical Sciences Lund, Lund University, Lund, SE-223 81, Sweden
| | - Anieta Sieuwerts
- Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Department of Medical Oncology, Erasmus University Medical Center, Rotterdam, 3015 GD, The Netherlands
| | - Anthony Ferrari
- Synergie Lyon Cancer, Centre Léon Bérard, 28 rue Laënnec, Lyon Cedex 08, France
| | - Sandro Morganella
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Thomas Fleischer
- Department of Genetics, Institute for Cancer Research, Oslo University Hospital, The Norwegian Radium Hospital, Oslo, 0310, Norway
| | - Vessela Kristensen
- Department of Genetics, Institute for Cancer Research, Oslo University Hospital, The Norwegian Radium Hospital, Oslo, 0310, Norway
- K.G. Jebsen Centre for Breast Cancer Research, Institute for Clinical Medicine, University of Oslo, Oslo, 0316, Norway
- Department of Clinical Molecular Biology and Laboratory Science (EpiGen), Division of Medicine, Akershus University Hospital, Lørenskog, 1478, Norway
| | - Marta Gut
- Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Barcelona, 08028, Spain
| | - Marc J van de Vijver
- Department of Pathology, Academic Medical Center, Meibergdreef 9, Amsterdam, AZ 1105, The Netherlands
| | - Anne-Lise Børresen-Dale
- Department of Genetics, Institute for Cancer Research, Oslo University Hospital, The Norwegian Radium Hospital, Oslo, 0310, Norway
- K.G. Jebsen Centre for Breast Cancer Research, Institute for Clinical Medicine, University of Oslo, Oslo, 0316, Norway
| | - Andrea L Richardson
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, 02115, USA
- Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Gilles Thomas
- Synergie Lyon Cancer, Centre Léon Bérard, 28 rue Laënnec, Lyon Cedex 08, France
| | - Ivo G Gut
- Centro Nacional de Análisis Genómico (CNAG), Parc Científic de Barcelona, Barcelona, 08028, Spain
| | - John W M Martens
- Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Department of Medical Oncology, Erasmus University Medical Center, Rotterdam, 3015 GD, The Netherlands
| | - John A Foekens
- Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Department of Medical Oncology, Erasmus University Medical Center, Rotterdam, 3015 GD, The Netherlands
| | | | - Hendrik G Stunnenberg
- Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University, PO Box 9101, Nijmegen, 6500 HB, The Netherlands.
| |
Collapse
|
203
|
La Russa F, Lopes DM, Hobbs C, Argunhan F, Brain S, Bevan S, Bennett DLH, McMahon SB. Disruption of the Sensory System Affects Sterile Cutaneous Inflammation In Vivo. J Invest Dermatol 2019; 139:1936-1945.e3. [PMID: 30974165 DOI: 10.1016/j.jid.2019.01.037] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 01/11/2019] [Accepted: 01/12/2019] [Indexed: 01/09/2023]
Abstract
Increasing evidence suggests that nerve fibers responding to noxious stimuli (nociceptors) modulate immunity in a variety of tissues, including the skin. Yet, the role of nociceptors in regulating sterile cutaneous inflammation remains unexplored. To address this question, we have developed a detailed description of the sterile inflammation caused by overexposure to UVB irradiation (i.e., sunburn) in the mouse plantar skin. Using this model, we observed that chemical depletion of nociceptor terminals did not alter the early phase of the inflammatory response to UVB, but it caused a significant increase in the number of dendritic cells and αβ+ T cells as well as enhanced extravasation during the later stages of inflammation. Finally, we showed that such regulation was driven by the nociceptive neuropeptide calcitonin gene-related peptide. In conclusion, we propose that nociceptors not only play a crucial role in inflammation through avoidance reflexes and behaviors, but can also regulate sterile cutaneous immunity in vivo.
Collapse
Affiliation(s)
- Federica La Russa
- Wolfson Centre for Age-Related Diseases, King's College London, London, United Kingdom.
| | - Douglas M Lopes
- Wolfson Centre for Age-Related Diseases, King's College London, London, United Kingdom
| | - Carl Hobbs
- Wolfson Centre for Age-Related Diseases, King's College London, London, United Kingdom
| | - Fulye Argunhan
- School of Cardiovascular Medicine and Sciences, King's College London, London, United Kingdom
| | - Susan Brain
- School of Cardiovascular Medicine and Sciences, King's College London, London, United Kingdom
| | - Stuart Bevan
- Wolfson Centre for Age-Related Diseases, King's College London, London, United Kingdom
| | - David L H Bennett
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
| | - Stephen B McMahon
- Wolfson Centre for Age-Related Diseases, King's College London, London, United Kingdom
| |
Collapse
|
204
|
Lannes R, Rizzon C, Lerat E. Does the Presence of Transposable Elements Impact the Epigenetic Environment of Human Duplicated Genes? Genes (Basel) 2019; 10:genes10030249. [PMID: 30917603 PMCID: PMC6470583 DOI: 10.3390/genes10030249] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Revised: 03/22/2019] [Accepted: 03/22/2019] [Indexed: 02/07/2023] Open
Abstract
Epigenetic modifications have an important role to explain part of the intra- and inter-species variation in gene expression. They also have a role in the control of transposable elements (TEs) whose activity may have a significant impact on genome evolution by promoting various mutations, which are expected to be mostly deleterious. A change in the local epigenetic landscape associated with the presence of TEs is expected to affect the expression of neighboring genes since these modifications occurring at TE sequences can spread to neighboring sequences. In this work, we have studied how the epigenetic modifications of genes are conserved and what the role of TEs is in this conservation. For that, we have compared the conservation of the epigenome associated with human duplicated genes and the differential presence of TEs near these genes. Our results show higher epigenome conservation of duplicated genes from the same family when they share similar TE environment, suggesting a role for the differential presence of TEs in the evolutionary divergence of duplicates through variation in the epigenetic landscape.
Collapse
Affiliation(s)
- Romain Lannes
- Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Université de Lyon, Université Lyon 1, CNRS, F-69622 Villeurbanne, France.
| | - Carène Rizzon
- Laboratoire de Mathématiques et Modélisation d'Evry (LaMME), Université d'Evry Val d'Essonne, UMR CNRS 8071, ENSIIE, USC INRA, 23 bvd de France, 91037, Evry CEDEX Paris, France.
| | - Emmanuelle Lerat
- Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Université de Lyon, Université Lyon 1, CNRS, F-69622 Villeurbanne, France.
| |
Collapse
|
205
|
Müller F, Scherer M, Assenov Y, Lutsik P, Walter J, Lengauer T, Bock C. RnBeads 2.0: comprehensive analysis of DNA methylation data. Genome Biol 2019; 20:55. [PMID: 30871603 PMCID: PMC6419383 DOI: 10.1186/s13059-019-1664-9] [Citation(s) in RCA: 180] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 02/28/2019] [Indexed: 12/25/2022] Open
Abstract
DNA methylation is a widely investigated epigenetic mark with important roles in development and disease. High-throughput assays enable genome-scale DNA methylation analysis in large numbers of samples. Here, we describe a new version of our RnBeads software - an R/Bioconductor package that implements start-to-finish analysis workflows for Infinium microarrays and various types of bisulfite sequencing. RnBeads 2.0 (https://rnbeads.org/) provides additional data types and analysis methods, new functionality for interpreting DNA methylation differences, improved usability with a novel graphical user interface, and better use of computational resources. We demonstrate RnBeads 2.0 in four re-runnable use cases focusing on cell differentiation and cancer.
Collapse
Affiliation(s)
- Fabian Müller
- Max Planck Institute for Informatics, Saarland Informatics Campus, 66123, Saarbrücken, Germany. .,Present Address: Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA.
| | - Michael Scherer
- Max Planck Institute for Informatics, Saarland Informatics Campus, 66123, Saarbrücken, Germany. .,Graduate School of Computer Science, Saarland University, Saarland Informatics Campus, 66123, Saarbrücken, Germany.
| | - Yassen Assenov
- Division of Epigenomics and Cancer Risk Factors, German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany.
| | - Pavlo Lutsik
- Division of Epigenomics and Cancer Risk Factors, German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany.
| | - Jörn Walter
- Department of Genetics/Epigenetics, Saarland University, 66123, Saarbrücken, Germany
| | - Thomas Lengauer
- Max Planck Institute for Informatics, Saarland Informatics Campus, 66123, Saarbrücken, Germany
| | - Christoph Bock
- Max Planck Institute for Informatics, Saarland Informatics Campus, 66123, Saarbrücken, Germany. .,CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, 1090, Vienna, Austria. .,Department of Laboratory Medicine, Medical University of Vienna, 1090, Vienna, Austria.
| |
Collapse
|
206
|
Ding J, Orozco G. Identification of rheumatoid arthritis causal genes using functional genomics. Scand J Immunol 2019; 89:e12753. [PMID: 30710386 DOI: 10.1111/sji.12753] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 01/18/2019] [Accepted: 01/29/2019] [Indexed: 12/14/2022]
Abstract
Over the past decade, genome-wide association studies have contributed a wealth of knowledge to our understanding of polygenic disorders such as rheumatoid arthritis. As the size of sample cohorts has improved so too have the computational and experimental methods used to robustly define variants associated with disease susceptibility. The challenge now remains to translate these findings into improved understanding of disease aetiology and patient care. Whilst much of the focus of translating the findings of genome-wide association studies has been on global analysis of all variants identified, careful functional study of individual disease susceptibility loci will be required in order to refine our understanding of how individual variants contribute to disease risk. Here, we present the argument behind such an approach and describe some of the novel tools being used to investigate risk loci. This includes the use of chromosomal conformation capture techniques and modifications of the CRISPR-Cas9 system, with several examples of their implementation being described.
Collapse
Affiliation(s)
- James Ding
- Arthritis Research UK Centre for Genetics and Genomics, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, UK
| | - Gisela Orozco
- Arthritis Research UK Centre for Genetics and Genomics, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, UK.,NIHR Manchester Biomedical Research Centre, Manchester University NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
| |
Collapse
|
207
|
Aranda S, Alcaine-Colet A, Blanco E, Borràs E, Caillot C, Sabidó E, Di Croce L. Chromatin capture links the metabolic enzyme AHCY to stem cell proliferation. SCIENCE ADVANCES 2019; 5:eaav2448. [PMID: 30854431 PMCID: PMC6402848 DOI: 10.1126/sciadv.aav2448] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Accepted: 01/28/2019] [Indexed: 05/19/2023]
Abstract
Profiling the chromatin-bound proteome (chromatome) in a simple, direct, and reliable manner might be key to uncovering the role of yet uncharacterized chromatin factors in physiology and disease. Here, we have designed an experimental strategy to survey the chromatome of proliferating cells by using the DNA-mediated chromatin pull-down (Dm-ChP) technology. Our approach provides a global view of cellular chromatome under normal physiological conditions and enables the identification of chromatin-bound proteins de novo. Integrating Dm-ChP with genomic and functional data, we have discovered an unexpected chromatin function for adenosylhomocysteinase, a major one-carbon pathway metabolic enzyme, in gene activation. Our study reveals a new regulatory axis between the metabolic state of pluripotent cells, ribosomal protein production, and cell division during the early phase of embryo development, in which the metabolic flux of methylation reactions is favored in a local milieu.
Collapse
Affiliation(s)
- Sergi Aranda
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Anna Alcaine-Colet
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Enrique Blanco
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Eva Borràs
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Claire Caillot
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
| | - Eduard Sabidó
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Luciano Di Croce
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23, Barcelona 08010, Spain
| |
Collapse
|
208
|
NCBoost classifies pathogenic non-coding variants in Mendelian diseases through supervised learning on purifying selection signals in humans. Genome Biol 2019; 20:32. [PMID: 30744685 PMCID: PMC6371618 DOI: 10.1186/s13059-019-1634-2] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Accepted: 01/17/2019] [Indexed: 02/07/2023] Open
Abstract
State-of-the-art methods assessing pathogenic non-coding variants have mostly been characterized on common disease-associated polymorphisms, yet with modest accuracy and strong positional biases. In this study, we curated 737 high-confidence pathogenic non-coding variants associated with monogenic Mendelian diseases. In addition to interspecies conservation, a comprehensive set of recent and ongoing purifying selection signals in humans is explored, accounting for lineage-specific regulatory elements. Supervised learning using gradient tree boosting on such features achieves a high predictive performance and overcomes positional bias. NCBoost performs consistently across diverse learning and independent testing data sets and outperforms other existing reference methods.
Collapse
|
209
|
Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat Immunol 2019; 20:163-172. [PMID: 30643263 PMCID: PMC6340744 DOI: 10.1038/s41590-018-0276-y] [Citation(s) in RCA: 2048] [Impact Index Per Article: 409.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Accepted: 11/09/2018] [Indexed: 02/07/2023]
Abstract
Tissue fibrosis is a major cause of mortality that results from the deposition of matrix proteins by an activated mesenchyme. Macrophages accumulate in fibrosis, but the role of specific subgroups in supporting fibrogenesis has not been investigated in vivo. Here we used single-cell RNA sequencing (scRNA-seq) to characterize the heterogeneity of macrophages in bleomycin-induced lung fibrosis in mice. A novel computational framework for the annotation of scRNA-seq by reference to bulk transcriptomes (SingleR) enabled the subclustering of macrophages and revealed a disease-associated subgroup with a transitional gene expression profile intermediate between monocyte-derived and alveolar macrophages. These CX3CR1+SiglecF+ transitional macrophages localized to the fibrotic niche and had a profibrotic effect in vivo. Human orthologues of genes expressed by the transitional macrophages were upregulated in samples from patients with idiopathic pulmonary fibrosis. Thus, we have identified a pathological subgroup of transitional macrophages that are required for the fibrotic response to injury.
Collapse
|
210
|
Erkek S, Johann PD, Finetti MA, Drosos Y, Chou HC, Zapatka M, Sturm D, Jones DTW, Korshunov A, Rhyzova M, Wolf S, Mallm JP, Beck K, Witt O, Kulozik AE, Frühwald MC, Northcott PA, Korbel JO, Lichter P, Eils R, Gajjar A, Roberts CWM, Williamson D, Hasselblatt M, Chavez L, Pfister SM, Kool M. Comprehensive Analysis of Chromatin States in Atypical Teratoid/Rhabdoid Tumor Identifies Diverging Roles for SWI/SNF and Polycomb in Gene Regulation. Cancer Cell 2019; 35:95-110.e8. [PMID: 30595504 PMCID: PMC6341227 DOI: 10.1016/j.ccell.2018.11.014] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Revised: 05/30/2018] [Accepted: 11/21/2018] [Indexed: 12/27/2022]
Abstract
Biallelic inactivation of SMARCB1, encoding a member of the SWI/SNF chromatin remodeling complex, is the hallmark genetic aberration of atypical teratoid rhabdoid tumors (ATRT). Here, we report how loss of SMARCB1 affects the epigenome in these tumors. Using chromatin immunoprecipitation sequencing (ChIP-seq) on primary tumors for a series of active and repressive histone marks, we identified the chromatin states differentially represented in ATRTs compared with other brain tumors and non-neoplastic brain. Re-expression of SMARCB1 in ATRT cell lines enabled confirmation of our genome-wide findings for the chromatin states. Additional generation of ChIP-seq data for SWI/SNF and Polycomb group proteins and the transcriptional repressor protein REST determined differential dependencies of SWI/SNF and Polycomb complexes in regulation of diverse gene sets in ATRTs.
Collapse
Affiliation(s)
- Serap Erkek
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany; European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany; Izmir Biomedicine and Genome Center, 35340 Izmir, Turkey
| | - Pascal D Johann
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany; Department of Pediatric Hematology and Oncology, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Martina A Finetti
- Wolfson Childhood Cancer Research Centre, Northern Institute for Cancer Research, Newcastle University, NE2 Newcastle Upon Tyne, UK
| | - Yiannis Drosos
- Department of Oncology, St Jude Children's Research Hospital, 38105 Memphis, USA
| | - Hsien-Chao Chou
- Department of Oncology, St Jude Children's Research Hospital, 38105 Memphis, USA
| | - Marc Zapatka
- Division of Molecular Genetics, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), 69120 Heidelberg, Germany
| | - Dominik Sturm
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany; Department of Pediatric Hematology and Oncology, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - David T W Jones
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
| | - Andrey Korshunov
- Department of Neuropathology, University Hospital Heidelberg, 69120 Heidelberg, Germany; Clinical Cooperation Unit Neuropathology, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Marina Rhyzova
- Department of Neuropathology, Burdenko Neurosurgical Institute, 125047 Moscow, Russia
| | - Stephan Wolf
- Genomics and Proteomics Core Facility, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Jan-Philipp Mallm
- Genome Organization & Function Research Group, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany; Heidelberg Center for Personalized Oncology, DKFZ-HIPO, DKFZ, 69120 Heidelberg, Germany
| | - Katja Beck
- Heidelberg Center for Personalized Oncology, DKFZ-HIPO, DKFZ, 69120 Heidelberg, Germany
| | - Olaf Witt
- Department of Pediatric Hematology and Oncology, University Hospital Heidelberg, 69120 Heidelberg, Germany; Clinical Cooperation Unit Pediatric Oncology, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Andreas E Kulozik
- Department of Pediatric Hematology and Oncology, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Michael C Frühwald
- University Children's Hospital Augsburg, Swabian Children's Cancer Center, 86156 Augsburg, Germany; EU-RHAB Registry Center, 86156 Augsburg, Germany
| | - Paul A Northcott
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany; Department of Developmental Neurobiology, St. Jude Children's Research Hospital, 38105 Memphis, TN, USA
| | - Jan O Korbel
- European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany
| | - Peter Lichter
- Division of Molecular Genetics, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), 69120 Heidelberg, Germany; Heidelberg Center for Personalized Oncology, DKFZ-HIPO, DKFZ, 69120 Heidelberg, Germany
| | - Roland Eils
- Genome Organization & Function Research Group, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany; Division of Theoretical Bioinformatics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Amar Gajjar
- Department of Oncology, St Jude Children's Research Hospital, 38105 Memphis, USA
| | - Charles W M Roberts
- Department of Oncology, St Jude Children's Research Hospital, 38105 Memphis, USA
| | - Daniel Williamson
- Wolfson Childhood Cancer Research Centre, Northern Institute for Cancer Research, Newcastle University, NE2 Newcastle Upon Tyne, UK
| | - Martin Hasselblatt
- Institute of Neuropathology, University Hospital Münster, 48149 Münster, Germany
| | - Lukas Chavez
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
| | - Stefan M Pfister
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany; Department of Pediatric Hematology and Oncology, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Marcel Kool
- Hopp-Children's Cancer Center at the NCT Heidelberg (KiTZ), 69120 Heidelberg, Germany; Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ) and German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany.
| |
Collapse
|
211
|
Frankish A, Diekhans M, Ferreira AM, Johnson R, Jungreis I, Loveland J, Mudge JM, Sisu C, Wright J, Armstrong J, Barnes I, Berry A, Bignell A, Carbonell Sala S, Chrast J, Cunningham F, Di Domenico T, Donaldson S, Fiddes IT, García Girón C, Gonzalez JM, Grego T, Hardy M, Hourlier T, Hunt T, Izuogu OG, Lagarde J, Martin FJ, Martínez L, Mohanan S, Muir P, Navarro FC, Parker A, Pei B, Pozo F, Ruffier M, Schmitt BM, Stapleton E, Suner MM, Sycheva I, Uszczynska-Ratajczak B, Xu J, Yates A, Zerbino D, Zhang Y, Aken B, Choudhary JS, Gerstein M, Guigó R, Hubbard TJ, Kellis M, Paten B, Reymond A, Tress ML, Flicek P. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res 2019; 47:D766-D773. [PMID: 30357393 PMCID: PMC6323946 DOI: 10.1093/nar/gky955] [Citation(s) in RCA: 1810] [Impact Index Per Article: 362.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 09/20/2018] [Accepted: 10/08/2018] [Indexed: 02/06/2023] Open
Abstract
The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation. Specifically, we generate primary data, create bioinformatics tools and provide analysis to support the work of expert manual gene annotators and automated gene annotation pipelines. In addition, manual and computational annotation workflows use any and all publicly available data and analysis, along with the research literature to identify and characterise gene loci to the highest standard. GENCODE gene annotations are accessible via the Ensembl and UCSC Genome Browsers, the Ensembl FTP site, Ensembl Biomart, Ensembl Perl and REST APIs as well as https://www.gencodegenes.org.
Collapse
Affiliation(s)
- Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Anne-Maud Ferreira
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Rory Johnson
- Department of Medical Oncology, Inselspital, University Hospital, University of Bern, Bern, Switzerland
- Department of Biomedical Research (DBMR), University of Bern, Bern, Switzerland
| | - Irwin Jungreis
- MIT Computer Science and Artificial Intelligence Laboratory, 32 Vasser St, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - Jane Loveland
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Cristina Sisu
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Department of Bioscience, Brunel University London, Uxbridge UB8 3PH, UK
| | - James Wright
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, 123 Old Brompton Road, London SW7 3RP, UK
| | - Joel Armstrong
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - If Barnes
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrew Berry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alexandra Bignell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Silvia Carbonell Sala
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, Barcelona, E-08003 Catalonia, Spain
| | - Jacqueline Chrast
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tomás Di Domenico
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Sarah Donaldson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ian T Fiddes
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Carlos García Girón
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jose Manuel Gonzalez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tiago Grego
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthew Hardy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Osagie G Izuogu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Julien Lagarde
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, Barcelona, E-08003 Catalonia, Spain
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Laura Martínez
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Shamika Mohanan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Muir
- Department of Molecular, Cellular & Developmental Biology, Yale University, New Haven, CT 06520, USA
- Systems Biology Institute, Yale University, West Haven, CT 06516, USA
| | - Fabio C P Navarro
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Anne Parker
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Baikang Pei
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Fernando Pozo
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Magali Ruffier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Bianca M Schmitt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Eloise Stapleton
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Marie-Marthe Suner
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Irina Sycheva
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Jinuri Xu
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | - Andrew Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Yan Zhang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, USA
| | - Bronwen Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jyoti S Choudhary
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, 123 Old Brompton Road, London SW7 3RP, UK
| | - Mark Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Program in Computational Biology & Bioinformatics, Yale University, Bass 432, 266 Whitney Avenue, New Haven, CT 06520, USA
- Department of Computer Science, Yale University, Bass 432, 266 Whitney Avenue, New Haven, CT 06520, USA
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, Barcelona, E-08003 Catalonia, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, E-08003 Catalonia, Spain
| | - Tim J P Hubbard
- Department of Medical and Molecular Genetics, King's College London, Guys Hospital, Great Maze Pond, London SE1 9RT, UK
| | - Manolis Kellis
- MIT Computer Science and Artificial Intelligence Laboratory, 32 Vasser St, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Alexandre Reymond
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Michael L Tress
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
212
|
Wang C, Zhang S. Reveal cell type-specific regulatory elements and their characterized histone code classes via a hidden Markov model. BMC Genomics 2018; 19:903. [PMID: 30598107 PMCID: PMC6311906 DOI: 10.1186/s12864-018-5274-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND With the maturity of next generation sequencing technology, a huge amount of epigenomic data have been generated by several large consortia in the last decade. These plenty resources leave us the opportunity about sufficiently utilizing those data to explore biological problems. RESULTS Here we developed an integrative and comparative method, CsreHMM, which is based on a hidden Markov model, to systematically reveal cell type-specific regulatory elements (CSREs) along the whole genome, and simultaneously recognize the histone codes (mark combinations) charactering them. This method also reveals the subclasses of CSREs and explicitly label those shared by a few cell types. We applied this method to a data set of 9 cell types and 9 chromatin marks to demonstrate its effectiveness and found that the revealed CSREs relates to different kinds of functional regulatory regions significantly. Their proximal genes have consistent expression and are likely to participate in cell type-specific biological functions. CONCLUSIONS These results suggest CsreHMM has the potential to help understand cell identity and the diverse mechanisms of gene regulation.
Collapse
Affiliation(s)
- Can Wang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Shihua Zhang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China.
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
- Center for Excel-lence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
| |
Collapse
|
213
|
Lu J, Cao X, Zhong S. A likelihood approach to testing hypotheses on the co-evolution of epigenome and genome. PLoS Comput Biol 2018; 14:e1006673. [PMID: 30586383 PMCID: PMC6324829 DOI: 10.1371/journal.pcbi.1006673] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2018] [Revised: 01/08/2019] [Accepted: 11/26/2018] [Indexed: 01/03/2023] Open
Abstract
Central questions to epigenome evolution include whether interspecies changes of histone modifications are independent of evolutionary changes of DNA, and if there is dependence whether they depend on any specific types of DNA sequence changes. Here, we present a likelihood approach for testing hypotheses on the co-evolution of genome and histone modifications. The gist of this approach is to convert evolutionary biology hypotheses into probabilistic forms, by explicitly expressing the joint probability of multispecies DNA sequences and histone modifications, which we refer to as a class of Joint Evolutionary Model for the Genome and the Epigenome (JEMGE). JEMGE can be summarized as a mixture model of four components representing four evolutionary hypotheses, namely dependence and independence of interspecies epigenomic variations to underlying sequence substitutions and to underlying sequence insertions and deletions (indels). We implemented a maximum likelihood method to fit the models to the data. Based on comparison of likelihoods, we inferred whether interspecies epigenomic variations depended on substitution or indels in local genomic sequences based on DNase hypersensitivity and spermatid H3K4me3 ChIP-seq data from human and rhesus macaque. Approximately 5.5% of homologous regions in the genomes exhibited H3K4me3 modification in either species, among which approximately 67% homologous regions exhibited local-sequence-dependent interspecies H3K4me3 variations. Substitutions accounted for less local-sequence-dependent H3K4me3 variations than indels. Among transposon-mediated indels, ERV1 insertions and L1 insertions were most strongly associated with H3K4me3 gains and losses, respectively. By initiating probabilistic formulation on the co-evolution of genomes and epigenomes, JEMGE helps to bring evolutionary biology principles to comparative epigenomic studies. Epigenetic modifications play a significant role in gene regulations and thus heavily influence phenotypic outcomes. Whereas cross-species epigenomic comparisons have been fruitful in revealing the function of epigenetic modifications, it still remains unclear how the epigenome changes across species. A central question in epigenome evolution studies is whether interspecies epigenomic variations rely on genomic changes in cis and, if partially yes, whether different genomic changes have distinct impacts. To tackle this question, we initiated a likelihood-based approach, in which different hypotheses related to the co-evolution of the genome and the epigenome could be converted into probabilistic models. By fitting the models to actual data, each model yielded a likelihood, and the hypothesis corresponded to the largest likelihood was selected as most supported by observed data. In this work, we focused on the influence of two types of underlying sequence changes: substitutions, and insertions and deletions (indels). We quantitatively assessed the dependence of H3K4me3 variations on substitutions and indels between human and rhesus, and separated their relative impacts within each genomic region with H3K4me3. The methodology presented here provides a framework for modeling the epigenome together with the genome and a quantitative approach to test different evolutionary hypotheses.
Collapse
Affiliation(s)
- Jia Lu
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Xiaoyi Cao
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Sheng Zhong
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- * E-mail:
| |
Collapse
|
214
|
Wang T, Pehrsson EC, Purushotham D, Li D, Zhuo X, Zhang B, Lawson HA, Province MA, Krapp C, Lan Y, Coarfa C, Katz TA, Tang WY, Wang Z, Biswal S, Rajagopalan S, Colacino JA, Tsai ZTY, Sartor MA, Neier K, Dolinoy DC, Pinto J, Hamanaka RB, Mutlu GM, Patisaul HB, Aylor DL, Crawford GE, Wiltshire T, Chadwick LH, Duncan CG, Garton AE, McAllister KA, Bartolomei MS, Walker CL, Tyson FL. The NIEHS TaRGET II Consortium and environmental epigenomics. Nat Biotechnol 2018; 36:225-227. [PMID: 29509741 DOI: 10.1038/nbt.4099] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Affiliation(s)
- Ting Wang
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA.,The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Erica C Pehrsson
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA.,The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Deepak Purushotham
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA.,The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Daofeng Li
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA.,The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Xiaoyu Zhuo
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA.,The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Bo Zhang
- Center for Regenerative Medicine, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Heather A Lawson
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Michael A Province
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Christopher Krapp
- Epigenetics Institute, Center for Excellence in Environmental Toxicology, Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Yemin Lan
- Epigenetics Institute, Center for Excellence in Environmental Toxicology, Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Cristian Coarfa
- Center for Precision Environmental Health, Departments of Molecular & Cellular Biology and Medicine, Baylor College of Medicine, Houston, Texas, USA
| | - Tiffany A Katz
- Center for Precision Environmental Health, Departments of Molecular & Cellular Biology and Medicine, Baylor College of Medicine, Houston, Texas, USA
| | - Wan Yee Tang
- Department of Environmental Health Sciences, Johns Hopkins School of Public Health, Baltimore, Maryland, USA
| | - Zhibin Wang
- Department of Environmental Health Sciences, Johns Hopkins School of Public Health, Baltimore, Maryland, USA
| | - Shyam Biswal
- Department of Environmental Health Sciences, Johns Hopkins School of Public Health, Baltimore, Maryland, USA
| | - Sanjay Rajagopalan
- Department of Medicine, Case Western Reserve University, Cleveland, Ohio, USA
| | - Justin A Colacino
- Department of Environmental Health Sciences and Nutritional Sciences, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Zing Tsung-Yeh Tsai
- Department of Environmental Health Sciences and Nutritional Sciences, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Maureen A Sartor
- Department of Environmental Health Sciences and Nutritional Sciences, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Kari Neier
- Department of Environmental Health Sciences and Nutritional Sciences, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Dana C Dolinoy
- Department of Environmental Health Sciences and Nutritional Sciences, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Jayant Pinto
- Department of Medicine, The University of Chicago, Chicago, Illinois, USA
| | - Robert B Hamanaka
- Department of Medicine, The University of Chicago, Chicago, Illinois, USA
| | - Gokhan M Mutlu
- Department of Medicine, The University of Chicago, Chicago, Illinois, USA
| | - Heather B Patisaul
- Department of Biological Sciences, Center for Human Health and the Environment, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
| | - David L Aylor
- Department of Biological Sciences, Center for Human Health and the Environment, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
| | - Gregory E Crawford
- Center for Genomic & Computational Biology, Division of Medical Genetics, Department of Pediatrics, Duke University, Durham, North Carolina, USA
| | - Tim Wiltshire
- Division of Pharmacotherapy and Experimental Therapeutics, UNC Eshelman School of Pharmacy, Chapel Hill, North Carolina, USA
| | - Lisa H Chadwick
- Genes Environment and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
| | - Christopher G Duncan
- Genes Environment and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
| | - Amanda E Garton
- Genes Environment and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
| | - Kimberly A McAllister
- Genes Environment and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
| | | | - Marisa S Bartolomei
- Epigenetics Institute, Center for Excellence in Environmental Toxicology, Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Cheryl L Walker
- Center for Precision Environmental Health, Departments of Molecular & Cellular Biology and Medicine, Baylor College of Medicine, Houston, Texas, USA
| | - Frederick L Tyson
- Genes Environment and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina, USA
| |
Collapse
|
215
|
Single-cell and single-molecule epigenomics to uncover genome regulation at unprecedented resolution. Nat Genet 2018; 51:19-25. [DOI: 10.1038/s41588-018-0290-x] [Citation(s) in RCA: 115] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2017] [Accepted: 08/30/2018] [Indexed: 12/19/2022]
|
216
|
Identification of differentially methylated cell types in epigenome-wide association studies. Nat Methods 2018; 15:1059-1066. [PMID: 30504870 PMCID: PMC6277016 DOI: 10.1038/s41592-018-0213-x] [Citation(s) in RCA: 133] [Impact Index Per Article: 22.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Accepted: 10/09/2018] [Indexed: 12/22/2022]
Abstract
An outstanding challenge of epigenome-wide association studies (EWASs) performed in complex tissues is the identification of the specific cell type(s) responsible for the observed differential DNA methylation. Here we present a statistical algorithm called CellDMC ( https://github.com/sjczheng/EpiDISH ), which can identify differentially methylated positions and the specific cell type(s) driving the differential methylation. We validated CellDMC on in silico mixtures of DNA methylation data generated with different technologies, as well as on real mixtures from epigenome-wide association and cancer epigenome studies. CellDMC achieved over 90% sensitivity and specificity in scenarios where current state-of-the-art methods did not identify differential methylation. By applying CellDMC to an EWAS performed in buccal swabs, we identified smoking-associated differentially methylated positions occurring in the epithelial compartment, which we validated in smoking-related lung cancer. CellDMC may be useful in the identification of causal DNA-methylation alterations in disease.
Collapse
|
217
|
Behjati Ardakani F, Kattler K, Nordström K, Gasparoni N, Gasparoni G, Fuchs S, Sinha A, Barann M, Ebert P, Fischer J, Hutter B, Zipprich G, Imbusch CD, Felder B, Eils J, Brors B, Lengauer T, Manke T, Rosenstiel P, Walter J, Schulz MH. Integrative analysis of single-cell expression data reveals distinct regulatory states in bidirectional promoters. Epigenetics Chromatin 2018; 11:66. [PMID: 30414612 PMCID: PMC6230222 DOI: 10.1186/s13072-018-0236-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Accepted: 10/26/2018] [Indexed: 01/12/2023] Open
Abstract
Background Bidirectional promoters (BPs) are prevalent in eukaryotic genomes. However, it is poorly understood how the cell integrates different epigenomic information, such as transcription factor (TF) binding and chromatin marks, to drive gene expression at BPs. Single-cell sequencing technologies are revolutionizing the field of genome biology. Therefore, this study focuses on the integration of single-cell RNA-seq data with bulk ChIP-seq and other epigenetics data, for which single-cell technologies are not yet established, in the context of BPs. Results We performed integrative analyses of novel human single-cell RNA-seq (scRNA-seq) data with bulk ChIP-seq and other epigenetics data. scRNA-seq data revealed distinct transcription states of BPs that were previously not recognized. We find associations between these transcription states to distinct patterns in structural gene features, DNA accessibility, histone modification, DNA methylation and TF binding profiles. Conclusions Our results suggest that a complex interplay of all of these elements is required to achieve BP-specific transcriptional output in this specialized promoter configuration. Further, our study implies that novel statistical methods can be developed to deconvolute masked subpopulations of cells measured with different bulk epigenomic assays using scRNA-seq data. Electronic supplementary material The online version of this article (10.1186/s13072-018-0236-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Fatemeh Behjati Ardakani
- Excellence Cluster for Multimodal Computing and Interaction, Saarland Informatics Campus, Saarland University, Campus E1 7, Saarbrücken, 66123, Germany.,Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarland Informatics, Campus E 4, Saarbrücken, 66123, Germany.,Graduate School of Computer Science, Saarland University, Campus E1 3, Saarbrücken, 66123, Germany
| | - Kathrin Kattler
- Department of Genetics, University of Saarland, Campus A2 4, Saarbrücken, 66123, Germany
| | - Karl Nordström
- Department of Genetics, University of Saarland, Campus A2 4, Saarbrücken, 66123, Germany
| | - Nina Gasparoni
- Department of Genetics, University of Saarland, Campus A2 4, Saarbrücken, 66123, Germany
| | - Gilles Gasparoni
- Department of Genetics, University of Saarland, Campus A2 4, Saarbrücken, 66123, Germany
| | - Sarah Fuchs
- Department of Genetics, University of Saarland, Campus A2 4, Saarbrücken, 66123, Germany
| | - Anupam Sinha
- Institute of Clinical Molecular Biology, Christian-Albrechts-University, Rosalind-Franklin-Str. 12, Kiel, 24105, Germany
| | - Matthias Barann
- Institute of Clinical Molecular Biology, Christian-Albrechts-University, Rosalind-Franklin-Str. 12, Kiel, 24105, Germany
| | - Peter Ebert
- Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarland Informatics, Campus E 4, Saarbrücken, 66123, Germany.,Graduate School of Computer Science, Saarland University, Campus E1 3, Saarbrücken, 66123, Germany
| | - Jonas Fischer
- Excellence Cluster for Multimodal Computing and Interaction, Saarland Informatics Campus, Saarland University, Campus E1 7, Saarbrücken, 66123, Germany.,Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarland Informatics, Campus E 4, Saarbrücken, 66123, Germany.,Graduate School of Computer Science, Saarland University, Campus E1 3, Saarbrücken, 66123, Germany
| | - Barbara Hutter
- Applied Bioinformatics, Deutsches Krebsforschungszentrum, Berliner-Str. 41, Heidelberg, 69120, Germany
| | - Gideon Zipprich
- Data Management and Genomics IT, Deutsches Krebsforschungszentrum, Im Neuenheimer Feld 280, Heidelberg, 69120, Germany
| | - Charles D Imbusch
- Applied Bioinformatics, Deutsches Krebsforschungszentrum, Berliner-Str. 41, Heidelberg, 69120, Germany
| | - Bärbel Felder
- Data Management and Genomics IT, Deutsches Krebsforschungszentrum, Im Neuenheimer Feld 280, Heidelberg, 69120, Germany
| | - Jürgen Eils
- Data Management and Genomics IT, Deutsches Krebsforschungszentrum, Im Neuenheimer Feld 280, Heidelberg, 69120, Germany
| | - Benedikt Brors
- Applied Bioinformatics, Deutsches Krebsforschungszentrum, Berliner-Str. 41, Heidelberg, 69120, Germany
| | - Thomas Lengauer
- Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarland Informatics, Campus E 4, Saarbrücken, 66123, Germany
| | - Thomas Manke
- Max Planck Institute of Immunobiology and Epigenetics, Stübeweg 51, Freiburg, 79108, Germany
| | - Philip Rosenstiel
- Institute of Clinical Molecular Biology, Christian-Albrechts-University, Rosalind-Franklin-Str. 12, Kiel, 24105, Germany
| | - Jörn Walter
- Department of Genetics, University of Saarland, Campus A2 4, Saarbrücken, 66123, Germany
| | - Marcel H Schulz
- Excellence Cluster for Multimodal Computing and Interaction, Saarland Informatics Campus, Saarland University, Campus E1 7, Saarbrücken, 66123, Germany. .,Department of Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarland Informatics, Campus E 4, Saarbrücken, 66123, Germany. .,Institute for Cardiovascular Regeneration, Goethe University, Theodor-Stern-Kai 7, Frankfurt am Main, 60590, Germany. .,German Center for Cardiovascular Research, Partner site Rhein-Main, Frankfurt am Main, 60590, Germany.
| |
Collapse
|
218
|
Sud A, Thomsen H, Orlando G, Försti A, Law PJ, Broderick P, Cooke R, Hariri F, Pastinen T, Easton DF, Pharoah PDP, Dunning AM, Peto J, Canzian F, Eeles R, Kote-Jarai ZS, Muir K, Pashayan N, Campa D, Hoffmann P, Nöthen MM, Jöckel KH, von Strandmann EP, Swerdlow AJ, Engert A, Orr N, Hemminki K, Houlston RS. Genome-wide association study implicates immune dysfunction in the development of Hodgkin lymphoma. Blood 2018; 132:2040-2052. [PMID: 30194254 PMCID: PMC6236462 DOI: 10.1182/blood-2018-06-855296] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Accepted: 08/19/2018] [Indexed: 02/08/2023] Open
Abstract
To further our understanding of inherited susceptibility to Hodgkin lymphoma (HL), we performed a meta-analysis of 7 genome-wide association studies totaling 5325 HL cases and 22 423 control patients. We identify 5 new HL risk loci at 6p21.31 (rs649775; P = 2.11 × 10-10), 6q23.3 (rs1002658; P = 2.97 × 10-8), 11q23.1 (rs7111520; P = 1.44 × 10-11), 16p11.2 (rs6565176; P = 4.00 × 10-8), and 20q13.12 (rs2425752; P = 2.01 × 10-8). Integration of gene expression, histone modification, and in situ promoter capture Hi-C data at the 5 new and 13 known risk loci implicates dysfunction of the germinal center reaction, disrupted T-cell differentiation and function, and constitutive NF-κB activation as mechanisms of predisposition. These data provide further insights into the genetic susceptibility and biology of HL.
Collapse
Affiliation(s)
- Amit Sud
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
| | - Hauke Thomsen
- Division of Molecular Genetic Epidemiology, German Cancer Research Center, Heidelberg, Germany
| | - Giulia Orlando
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
| | - Asta Försti
- Division of Molecular Genetic Epidemiology, German Cancer Research Center, Heidelberg, Germany
- Center for Primary Health Care Research, Lund University, Malmö, Sweden
| | - Philip J Law
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
| | - Peter Broderick
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
| | - Rosie Cooke
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
| | - Fadi Hariri
- Department of Human Genetics, McGill University and Genome Quebec Innovation Centre, McGill University, Montreal, QC, Canada
| | - Tomi Pastinen
- Department of Human Genetics, McGill University and Genome Quebec Innovation Centre, McGill University, Montreal, QC, Canada
| | - Douglas F Easton
- Centre for Cancer Genetic Epidemiology, Department of Oncology, and
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
| | - Paul D P Pharoah
- Centre for Cancer Genetic Epidemiology, Department of Oncology, and
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
| | - Alison M Dunning
- Centre for Cancer Genetic Epidemiology, Department of Oncology, and
| | - Julian Peto
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Federico Canzian
- Genomic Epidemiology Group, German Cancer Research Center, Heidelberg, Germany
| | - Rosalind Eeles
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
- Royal Marsden NHS Foundation Trust, London, United Kingdom
| | - ZSofia Kote-Jarai
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
| | - Kenneth Muir
- Institute of Population Health, University of Manchester, Manchester, United Kingdom
- Division of Health Sciences, Warwick Medical School, Warwick University, Coventry, United Kingdom
| | - Nora Pashayan
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
- Department of Applied Health Research, University College London, London, United Kingdom
| | - Daniele Campa
- Department of Biology, University of Pisa, Pisa, Italy
| | - Per Hoffmann
- Human Genomic Research Group, Department of Biomedicine, University of Basel, Basel, Switzerland
- Institute of Human Genetics and
- Department of Genomics, Life & Brain Center, University of Bonn, Bonn, Germany
| | - Markus M Nöthen
- Institute of Human Genetics and
- Department of Genomics, Life & Brain Center, University of Bonn, Bonn, Germany
| | | | - Elke Pogge von Strandmann
- Experimental Tumor Research, Center for Tumor Biology and Immunology, Clinic for Hematology, Oncology and Immunology, Philipps University, Marburg, Germany
| | - Anthony J Swerdlow
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
- Division of Breast Cancer Research, The Institute of Cancer Research, London, United Kingdom; and
| | - Andreas Engert
- Department of Internal Medicine, University Hospital of Cologne, Cologne, Germany
| | - Nick Orr
- Division of Breast Cancer Research, The Institute of Cancer Research, London, United Kingdom; and
| | - Kari Hemminki
- Division of Molecular Genetic Epidemiology, German Cancer Research Center, Heidelberg, Germany
- Center for Primary Health Care Research, Lund University, Malmö, Sweden
| | - Richard S Houlston
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
| |
Collapse
|
219
|
Abstract
Genome-wide association studies (GWASs) have identified thousands of loci associated with hundreds of complex diseases and traits, and progress is being made toward elucidating the causal variants and genes underlying these associations. Functional characterization of mechanisms at GWAS loci is a multi-faceted challenge. Challenges include linkage disequilibrium and allelic heterogeneity at each locus, the noncoding nature of most loci, and the time and cost needed for experimentally evaluating the potential mechanistic contributions of genes and variants. As GWAS sample sizes increase, more loci are identified, and the complexities of individual loci emerge. Loci can consist of multiple association signals, each of which can reflect the influence of multiple variants, inseparable by association analyses. Each signal within a locus can influence the same or different target genes. Experimental studies of genes and variants can differ on the basis of cell type, cellular environment, or other context-specific variables. In this review, we describe the complexity of mechanisms at GWAS loci-including multiple signals, multiple variants, and/or multiple genes-and the implications these complexities hold for experimental study design and interpretation of GWAS mechanisms.
Collapse
Affiliation(s)
- Maren E Cannon
- Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Karen L Mohlke
- Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA.
| |
Collapse
|
220
|
Abstract
Complexity in genome architecture determines how gene expression programs are established, maintained, and modified from early developmental stages to normal adult phenotypes. Large scale and hierarchical organization of the genome impacts various aspects of cell functions, ranging from X-chromosome inactivation, stem-cell fate determination to transcription, DNA replication, and cellular repair. While chromatin loops and topologically-associated domains represent a basic structural or fundamental unit of chromatin organization, spatio-temporal organization of the genome further creates a complex network of interacting genome patterns, forming chromosomal compartments and chromosome territories. The understanding of human diseases, including cancers, auto-immune disorders, Alzheimer's, and cardiovascular diseases, relies on the associated molecular and epigenetic mechanisms. There is a growing interest in the impact of three-dimensional chromatin folding upon the genome structure and function, which gives rise to the question "What's in the fold?" and is the main focus of this review. Here we discuss the principles determining the spatial and regulatory relationships between gene regulation and three-dimensional chromatin landscapes, and how changes in chromatin-folding could influence the outcome of genome function in healthy and disease states.
Collapse
|
221
|
Suzuki Y, Wang Y, Au KF, Morishita S. A Statistical Method for Observing Personal Diploid Methylomes and Transcriptomes with Single-Molecule Real-Time Sequencing. Genes (Basel) 2018; 9:E460. [PMID: 30235838 PMCID: PMC6162384 DOI: 10.3390/genes9090460] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 09/12/2018] [Accepted: 09/12/2018] [Indexed: 11/16/2022] Open
Abstract
We address the problem of observing personal diploid methylomes, CpG methylome pairs of homologous chromosomes that are distinguishable with respect to phased heterozygous variants (PHVs), which is challenging due to scarcity of PHVs in personal genomes. Single molecule real-time (SMRT) sequencing is promising as it outputs long reads with CpG methylation information, but a serious concern is whether reliable PHVs are available in erroneous SMRT reads with an error rate of ∼15%. To overcome the issue, we propose a statistical model that reduces the error rate of phasing CpG site to 1%, thereby calling CpG hypomethylation in each haplotype with >90% precision and sensitivity. Using our statistical model, we examined GNAS complex locus known for a combination of maternally, paternally, or biallelically expressed isoforms, and observed allele-specific methylation pattern almost perfectly reflecting their respective allele-specific expression status, demonstrating the merit of elucidating comprehensive personal diploid methylomes and transcriptomes.
Collapse
Affiliation(s)
- Yuta Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo 277-8561, Japan.
| | - Yunhao Wang
- Department of Internal Medicine, University of Iowa, Iowa City, IA 52242, USA.
| | - Kin Fai Au
- Department of Internal Medicine, University of Iowa, Iowa City, IA 52242, USA.
- Department of Biomedical Informatics, Ohio State University, Columbus, OH 43210, USA.
| | - Shinichi Morishita
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo 277-8561, Japan.
| |
Collapse
|
222
|
Zeng J, Li G. TFmapper: A Tool for Searching Putative Factors Regulating Gene Expression Using ChIP-seq Data. Int J Biol Sci 2018; 14:1724-1731. [PMID: 30416387 PMCID: PMC6216026 DOI: 10.7150/ijbs.28850] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2018] [Accepted: 07/30/2018] [Indexed: 02/06/2023] Open
Abstract
Background: Next-generation sequencing coupled to chromatin immunoprecipitation (ChIP-seq), DNase I hypersensitivity (DNase-seq) and the transposase-accessible chromatin assay (ATAC-seq) has generated enormous amounts of data, markedly improved our understanding of the transcriptional and epigenetic control of gene expression. To take advantage of the availability of such datasets and provide clues on what factors, including transcription factors, epigenetic regulators and histone modifications, potentially regulates the expression of a gene of interest, a tool for simultaneous queries of multiple datasets using symbols or genomic coordinates as search terms is needed. Results: In this study, we annotated the peaks of thousands of ChIP-seq datasets generated by ENCODE project, or ChIP-seq/DNase-seq/ATAC-seq datasets deposited in Gene Expression Omnibus (GEO) and curated by Cistrome project; We built a MySQL database called TFmapper containing the annotations and associated metadata, allowing users without bioinformatics expertise to search across thousands of datasets to identify factors targeting a genomic region/gene of interest in a specified sample through a web interface. Users can also visualize multiple peaks in genome browsers and download the corresponding sequences. Conclusion: TFmapper will help users explore the vast amount of publicly available ChIP-seq/DNase-seq/ATAC-seq data and perform integrative analyses to understand the regulation of a gene of interest. The web server is freely accessible at http://www.tfmapper.org/.
Collapse
Affiliation(s)
- Jianming Zeng
- Faculty of Health Sciences, University of Macau, Taipa, Macau, China
| | - Gang Li
- Faculty of Health Sciences, University of Macau, Taipa, Macau, China
| |
Collapse
|
223
|
Cai S, Georgakilas GK, Johnson JL, Vahedi G. A Cosine Similarity-Based Method to Infer Variability of Chromatin Accessibility at the Single-Cell Level. Front Genet 2018; 9:319. [PMID: 30158954 PMCID: PMC6103536 DOI: 10.3389/fgene.2018.00319] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Accepted: 07/26/2018] [Indexed: 12/23/2022] Open
Abstract
Cellular identity between generations of developing cells is propagated through the epigenome particularly via the accessible parts of the chromatin. It is now possible to measure chromatin accessibility at single-cell resolution using single-cell assay for transposase accessible chromatin (scATAC-seq), which can reveal the regulatory variation behind the phenotypic variation. However, single-cell chromatin accessibility data are sparse, binary, and high dimensional, leading to unique computational challenges. To overcome these difficulties, we developed PRISM, a computational workflow that quantifies cell-to-cell chromatin accessibility variation while controlling for technical biases. PRISM is a novel multidimensional scaling-based method using angular cosine distance metrics coupled with distance from the spatial centroid. PRISM takes differences in accessibility at each genomic region between single cells into account. Using data generated in our lab and publicly available, we showed that PRISM outperforms an existing algorithm, which relies on the aggregate of signal across a set of genomic regions. PRISM showed robustness to noise in cells with low coverage for measuring chromatin accessibility. Our approach revealed the previously undetected accessibility variation where accessible sites differ between cells but the total number of accessible sites is constant. We also showed that PRISM, but not an existing algorithm, can find suppressed heterogeneity of accessibility at CTCF binding sites. Our updated approach uncovers new biological results with profound implications on the cellular heterogeneity of chromatin architecture.
Collapse
Affiliation(s)
- Stanley Cai
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Georgios K Georgakilas
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - John L Johnson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Golnaz Vahedi
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States.,Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
224
|
Epigenetic prediction of response to anti-PD-1 treatment in non-small-cell lung cancer: a multicentre, retrospective analysis. THE LANCET RESPIRATORY MEDICINE 2018; 6:771-781. [PMID: 30100403 DOI: 10.1016/s2213-2600(18)30284-4] [Citation(s) in RCA: 137] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 06/20/2018] [Accepted: 06/21/2018] [Indexed: 12/16/2022]
Abstract
BACKGROUND Anti-programmed death-1 (PD-1) treatment for advanced non-small-cell lung cancer (NSCLC) has improved the survival of patients. However, a substantial percentage of patients do not respond to this treatment. We examined the use of DNA methylation profiles to determine the efficacy of anti-PD-1 treatment in patients recruited with current stage IV NSCLC. METHODS In this multicentre study, we recruited adult patients from 15 hospitals in France, Spain, and Italy who had histologically proven stage IV NSCLC and had been exposed to PD-1 blockade during the course of the disease. The study structure comprised a discovery cohort to assess the correlation between epigenetic features and clinical benefit with PD-1 blockade and two validation cohorts to assess the validity of our assumptions. We first established an epigenomic profile based on a microarray DNA methylation signature (EPIMMUNE) in a discovery set of tumour samples from patients treated with nivolumab or pembrolizumab. The EPIMMUNE signature was validated in an independent set of patients. A derived DNA methylation marker was validated by a single-methylation assay in a validation cohort of patients. The main study outcomes were progression-free survival and overall survival. We used the Kaplan-Meier method to estimate progression-free and overall survival, and calculated the differences between the groups with the log-rank test. We constructed a multivariate Cox model to identify the variables independently associated with progression-free and overall survival. FINDINGS Between June 23, 2014, and May 18, 2017, we obtained samples from 142 patients: 34 in the discovery cohort, 47 in the EPIMMUNE validation cohort, and 61 in the derived methylation marker cohort (the T-cell differentiation factor forkhead box P1 [FOXP1]). The EPIMMUNE signature in patients with stage IV NSCLC treated with anti-PD-1 agents was associated with improved progression-free survival (hazard ratio [HR] 0·010, 95% CI 3·29 × 10-4-0·0282; p=0·0067) and overall survival (0·080, 0·017-0·373; p=0·0012). The EPIMMUNE-positive signature was not associated with PD-L1 expression, the presence of CD8+ cells, or mutational load. EPIMMUNE-negative tumours were enriched in tumour-associated macrophages and neutrophils, cancer-associated fibroblasts, and senescent endothelial cells. The EPIMMUNE-positive signature was associated with improved progression-free survival in the EPIMMUNE validation cohort (0·330, 0·149-0·727; p=0·0064). The unmethylated status of FOXP1 was associated with improved progression-free survival (0·415, 0·209-0·802; p=0·0063) and overall survival (0·409, 0·220-0·780; p=0·0094) in the FOXP1 validation cohort. The EPIMMUNE signature and unmethylated FOXP1 were not associated with clinical benefit in lung tumours that did not receive immunotherapy. INTERPRETATION Our study shows that the epigenetic milieu of NSCLC tumours indicates which patients are most likely to benefit from nivolumab or pembrolizumab treatments. The methylation status of FOXP1 could be associated with validated predictive biomarkers such as PD-L1 staining and mutational load to better select patients who will experience clinical benefit with PD-1 blockade, and its predictive value should be evaluated in prospective studies. FUNDING "Obra Social" La Caixa, Cellex Foundation, and the Health and Science Departments of the Generalitat de Catalunya.
Collapse
|
225
|
Poller W, Dimmeler S, Heymans S, Zeller T, Haas J, Karakas M, Leistner DM, Jakob P, Nakagawa S, Blankenberg S, Engelhardt S, Thum T, Weber C, Meder B, Hajjar R, Landmesser U. Non-coding RNAs in cardiovascular diseases: diagnostic and therapeutic perspectives. Eur Heart J 2018; 39:2704-2716. [PMID: 28430919 PMCID: PMC6454570 DOI: 10.1093/eurheartj/ehx165] [Citation(s) in RCA: 273] [Impact Index Per Article: 45.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/10/2016] [Revised: 01/14/2017] [Accepted: 03/15/2017] [Indexed: 02/06/2023] Open
Abstract
Recent research has demonstrated that the non-coding genome plays a key role in genetic programming and gene regulation during development as well as in health and cardiovascular disease. About 99% of the human genome do not encode proteins, but are transcriptionally active representing a broad spectrum of non-coding RNAs (ncRNAs) with important regulatory and structural functions. Non-coding RNAs have been identified as critical novel regulators of cardiovascular risk factors and cell functions and are thus important candidates to improve diagnostics and prognosis assessment. Beyond this, ncRNAs are rapidly emgerging as fundamentally novel therapeutics. On a first level, ncRNAs provide novel therapeutic targets some of which are entering assessment in clinical trials. On a second level, new therapeutic tools were developed from endogenous ncRNAs serving as blueprints. Particularly advanced is the development of RNA interference (RNAi) drugs which use recently discovered pathways of endogenous short interfering RNAs and are becoming versatile tools for efficient silencing of protein expression. Pioneering clinical studies include RNAi drugs targeting liver synthesis of PCSK9 resulting in highly significant lowering of LDL cholesterol or targeting liver transthyretin (TTR) synthesis for treatment of cardiac TTR amyloidosis. Further novel drugs mimicking actions of endogenous ncRNAs may arise from exploitation of molecular interactions not accessible to conventional pharmacology. We provide an update on recent developments and perspectives for diagnostic and therapeutic use of ncRNAs in cardiovascular diseases, including atherosclerosis/coronary disease, post-myocardial infarction remodelling, and heart failure.
Collapse
Affiliation(s)
- Wolfgang Poller
- Department of Cardiology, CBF, CC11, Charite Universitätsmedizin Berlin, Campus Benjamin Franklin, Charite Centrum 11 (Cardiovascular Medicine), Hindenburgdamm 20, Berlin, Germany
- German Center for Cardiovascular Research (DZHK), Site Berlin, Berlin, Germany
| | - Stefanie Dimmeler
- Institute for Cardiovascular Regeneration, Center of Molecular Medicine, Johann Wolfgang Goethe Universität, Theodor-Stern-Kai 7, Frankfurt am Main, Germany
- DZHK, Site Rhein-Main, Frankfurt, Germany
| | - Stephane Heymans
- Center for Heart Failure Research, Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, Netherlands
| | - Tanja Zeller
- Clinic for General and Interventional Cardiology, University Heart Center Hamburg, Martinistrasse 52, Hamburg, Germany
- DZHK, Site Hamburg/Kiel/Lübeck, Hamburg, Germany
| | - Jan Haas
- Institute for Cardiomyopathies Heidelberg (ICH), Universitätsklinikum Heidelberg, Im Neuenheimer Feld 669, Heidelberg, Germany
- DZHK, Site Heidelberg/Mannheim, Heidelberg, Germany
| | - Mahir Karakas
- Clinic for General and Interventional Cardiology, University Heart Center Hamburg, Martinistrasse 52, Hamburg, Germany
- DZHK, Site Hamburg/Kiel/Lübeck, Hamburg, Germany
| | - David-Manuel Leistner
- Department of Cardiology, CBF, CC11, Charite Universitätsmedizin Berlin, Campus Benjamin Franklin, Charite Centrum 11 (Cardiovascular Medicine), Hindenburgdamm 20, Berlin, Germany
- German Center for Cardiovascular Research (DZHK), Site Berlin, Berlin, Germany
| | - Philipp Jakob
- Department of Cardiology, CBF, CC11, Charite Universitätsmedizin Berlin, Campus Benjamin Franklin, Charite Centrum 11 (Cardiovascular Medicine), Hindenburgdamm 20, Berlin, Germany
- German Center for Cardiovascular Research (DZHK), Site Berlin, Berlin, Germany
| | - Shinichi Nakagawa
- RNA Biology Laboratory, RIKEN Advanced Research Institute, Wako, Saitama, Japan
- RNA Biology Laboratory, Faculty of Pharmaceutical Sciences, Hokkaido University, Kita 12-jo Nishi 6-chome, Kita-ku, Sapporo, Japan
| | - Stefan Blankenberg
- Clinic for General and Interventional Cardiology, University Heart Center Hamburg, Martinistrasse 52, Hamburg, Germany
- DZHK, Site Hamburg/Kiel/Lübeck, Hamburg, Germany
| | - Stefan Engelhardt
- Institute for Pharmacology and Toxikology, Technische Universität München, Biedersteiner Strasse 29, München, Germany
- DZHK, Site Munich, Munich, Germany
| | - Thomas Thum
- Institute of Molecular and Translational Therapeutic Strategies (IMTTS), Hannover Medical School, Hannover, Germany
| | - Christian Weber
- DZHK, Site Munich, Munich, Germany
- Institute for Cardiovascular Prevention (IPEK), Ludwig-Maximilians-Universität, Pettenkoferstrasse 8a/9, Munich, Germany
| | - Benjamin Meder
- Institute for Cardiomyopathies Heidelberg (ICH), Universitätsklinikum Heidelberg, Im Neuenheimer Feld 669, Heidelberg, Germany
- DZHK, Site Heidelberg/Mannheim, Heidelberg, Germany
| | - Roger Hajjar
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ulf Landmesser
- Department of Cardiology, CBF, CC11, Charite Universitätsmedizin Berlin, Campus Benjamin Franklin, Charite Centrum 11 (Cardiovascular Medicine), Hindenburgdamm 20, Berlin, Germany
- German Center for Cardiovascular Research (DZHK), Site Berlin, Berlin, Germany
- Berlin Institute of Health, Kapelle-Ufer 2, Berlin, Germany
| |
Collapse
|
226
|
Dozmorov MG. Epigenomic annotation-based interpretation of genomic data: from enrichment analysis to machine learning. Bioinformatics 2018; 33:3323-3330. [PMID: 29028263 DOI: 10.1093/bioinformatics/btx414] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 06/22/2017] [Indexed: 12/12/2022] Open
Abstract
Motivation One of the goals of functional genomics is to understand the regulatory implications of experimentally obtained genomic regions of interest (ROIs). Most sequencing technologies now generate ROIs distributed across the whole genome. The interpretation of these genome-wide ROIs represents a challenge as the majority of them lie outside of functionally well-defined protein coding regions. Recent efforts by the members of the International Human Epigenome Consortium have generated volumes of functional/regulatory data (reference epigenomic datasets), effectively annotating the genome with epigenomic properties. Consequently, a wide variety of computational tools has been developed utilizing these epigenomic datasets for the interpretation of genomic data. Results The purpose of this review is to provide a structured overview of practical solutions for the interpretation of ROIs with the help of epigenomic data. Starting with epigenomic enrichment analysis, we discuss leading tools and machine learning methods utilizing epigenomic and 3D genome structure data. The hierarchy of tools and methods reviewed here presents a practical guide for the interpretation of genome-wide ROIs within an epigenomic context. Contact mikhail.dozmorov@vcuhealth.org. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mikhail G Dozmorov
- Department of Biostatistics, Virginia Commonwealth University, Richmond, VA 23298, USA
| |
Collapse
|
227
|
Zheng SC, Webster AP, Dong D, Feber A, Graham DG, Sullivan R, Jevons S, Lovat LB, Beck S, Widschwendter M, Teschendorff AE. A novel cell-type deconvolution algorithm reveals substantial contamination by immune cells in saliva, buccal and cervix. Epigenomics 2018; 10:925-940. [DOI: 10.2217/epi-2018-0037] [Citation(s) in RCA: 82] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Aim: An outstanding challenge in epigenome studies is the estimation of cell-type proportions in complex epithelial tissues. Materials & methods: Here, we construct and validate a DNA methylation reference and algorithm for complex tissues that contain epithelial, immune and nonimmune stromal cells. Results: Using this reference, we show that easily accessible tissues such as saliva, buccal and cervix exhibit substantial variation in immune cell (IC) contamination. We further validate our reference in the context of oral cancer, where it correctly predicts an increased IC infiltration in cancer but suppressed in patients with highest smoking exposure. Finally, our method can improve the specificity of differentially methylated CpG calls in epithelial cancer. Conclusion: The degree and variation of IC contamination in complex epithelial tissues is substantial. We provide a valuable resource and tool for assessing the epithelial purity and IC contamination of samples and for identifying differential methylation in such complex tissues.
Collapse
Affiliation(s)
- Shijie C Zheng
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, 320 Yue Yang Road, Shanghai 200031, PR China
- University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing 100049, PR China
| | - Amy P Webster
- UCL Cancer Institute, Paul O'Gorman Building, University College London, 72 Huntley Street, London WC1E 6BT, UK
| | - Danyue Dong
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, 320 Yue Yang Road, Shanghai 200031, PR China
- University of Chinese Academy of Sciences, 19A Yuquan Road, Beijing 100049, PR China
| | - Andy Feber
- Division of Surgery & Interventional Science, UCL, London WC1E 6BT, UK
| | - David G Graham
- Division of Surgery & Interventional Science, UCL, London WC1E 6BT, UK
| | - Roisin Sullivan
- Division of Surgery & Interventional Science, UCL, London WC1E 6BT, UK
| | - Sarah Jevons
- Division of Surgery & Interventional Science, UCL, London WC1E 6BT, UK
| | - Laurence B Lovat
- Division of Surgery & Interventional Science, UCL, London WC1E 6BT, UK
| | - Stephan Beck
- UCL Cancer Institute, Paul O'Gorman Building, University College London, 72 Huntley Street, London WC1E 6BT, UK
| | - Martin Widschwendter
- Department of Women's Cancer, University College London, 74 Huntley Street, London WC1E 6AU, UK
| | - Andrew E Teschendorff
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, 320 Yue Yang Road, Shanghai 200031, PR China
- Department of Women's Cancer, University College London, 74 Huntley Street, London WC1E 6AU, UK
| |
Collapse
|
228
|
Okla K, Wertel I, Wawruszak A, Bobiński M, Kotarski J. Blood-based analyses of cancer: Circulating myeloid-derived suppressor cells - is a new era coming? Crit Rev Clin Lab Sci 2018; 55:376-407. [PMID: 29927668 DOI: 10.1080/10408363.2018.1477729] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Progress in cancer treatment made by the beginning of the 21st century has shifted the paradigm from one-size-fits-all to tailor-made treatment. The popular vision, to study solid tumors through the relatively noninvasive sampling of blood, is one of the most thrilling and rapidly advancing fields in global cancer diagnostics. From this perspective, immune-cell analysis in cancer could play a pivotal role in oncology practice. This approach is driven both by rapid technological developments, including the analysis of circulating myeloid-derived suppressor cells (cMDSCs), and by the increasing application of (immune) therapies, the success or failure of which may depend on effective and timely measurements of relevant biomarkers. Although the implementation of these powerful noninvasive diagnostic capabilities in guiding precision cancer treatment is poised to change the ways in which we select and monitor cancer therapy, challenges remain. Here, we discuss the challenges associated with the analysis and clinical aspects of cMDSCs and assess whether the problems in implementing tumor-evolution monitoring as a global tool in personalized oncology can be overcome.
Collapse
Affiliation(s)
- Karolina Okla
- a 1st Chair and Department of Oncological Gynaecology and Gynaecology, Tumor Immunology Laboratory , Medical University of Lublin , Lublin , Poland
| | - Iwona Wertel
- a 1st Chair and Department of Oncological Gynaecology and Gynaecology, Tumor Immunology Laboratory , Medical University of Lublin , Lublin , Poland
| | - Anna Wawruszak
- b Department of Biochemistry and Molecular Biology , Medical University of Lublin , Lublin , Poland
| | - Marcin Bobiński
- a 1st Chair and Department of Oncological Gynaecology and Gynaecology, Tumor Immunology Laboratory , Medical University of Lublin , Lublin , Poland
| | - Jan Kotarski
- a 1st Chair and Department of Oncological Gynaecology and Gynaecology, Tumor Immunology Laboratory , Medical University of Lublin , Lublin , Poland
| |
Collapse
|
229
|
Casey AE, Sinha A, Singhania R, Livingstone J, Waterhouse P, Tharmapalan P, Cruickshank J, Shehata M, Drysdale E, Fang H, Kim H, Isserlin R, Bailey S, Medina T, Deblois G, Shiah YJ, Barsyte-Lovejoy D, Hofer S, Bader G, Lupien M, Arrowsmith C, Knapp S, De Carvalho D, Berman H, Boutros PC, Kislinger T, Khokha R. Mammary molecular portraits reveal lineage-specific features and progenitor cell vulnerabilities. J Cell Biol 2018; 217:2951-2974. [PMID: 29921600 PMCID: PMC6080920 DOI: 10.1083/jcb.201804042] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Revised: 05/14/2018] [Accepted: 05/17/2018] [Indexed: 12/15/2022] Open
Abstract
Casey et al. integrate epigenomic, transcriptomic, and proteomic profiling of primary basal and luminal mammary cells to identify master epigenetic regulators of the mammary epithelium and uncover stem and progenitor cell vulnerabilities. They develop a pipeline to identify drugs that abrogate progenitor cell activity in normal and high-risk breast cancer patient samples in vitro and in vivo. The mammary epithelium depends on specific lineages and their stem and progenitor function to accommodate hormone-triggered physiological demands in the adult female. Perturbations of these lineages underpin breast cancer risk, yet our understanding of normal mammary cell composition is incomplete. Here, we build a multimodal resource for the adult gland through comprehensive profiling of primary cell epigenomes, transcriptomes, and proteomes. We define systems-level relationships between chromatin–DNA–RNA–protein states, identify lineage-specific DNA methylation of transcription factor binding sites, and pinpoint proteins underlying progesterone responsiveness. Comparative proteomics of estrogen and progesterone receptor–positive and –negative cell populations, extensive target validation, and drug testing lead to discovery of stem and progenitor cell vulnerabilities. Top epigenetic drugs exert cytostatic effects; prevent adult mammary cell expansion, clonogenicity, and mammopoiesis; and deplete stem cell frequency. Select drugs also abrogate human breast progenitor cell activity in normal and high-risk patient samples. This integrative computational and functional study provides fundamental insight into mammary lineage and stem cell biology.
Collapse
Affiliation(s)
| | - Ankit Sinha
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | | | - Julie Livingstone
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | | | | | - Mona Shehata
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Erik Drysdale
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Hui Fang
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Hyeyeon Kim
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Ruth Isserlin
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | - Swneke Bailey
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Tiago Medina
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | | | - Yu-Jia Shiah
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Stefan Hofer
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Gary Bader
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | - Mathieu Lupien
- Princess Margaret Cancer Centre, Toronto, ON, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Cheryl Arrowsmith
- Princess Margaret Cancer Centre, Toronto, ON, Canada.,Structural Genomics Consortium, University of Toronto, Toronto, ON, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Stefan Knapp
- Nuffield Department of Clinical Medicine, Structural Genomics Consortium, University of Oxford, Oxford, UK
| | - Daniel De Carvalho
- Princess Margaret Cancer Centre, Toronto, ON, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Hal Berman
- Princess Margaret Cancer Centre, Toronto, ON, Canada.,Department of Laboratory Medicine and Pathobiology, University of Toronto, ON, Canada
| | - Paul C Boutros
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, ON, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
| | - Thomas Kislinger
- Princess Margaret Cancer Centre, Toronto, ON, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| | - Rama Khokha
- Princess Margaret Cancer Centre, Toronto, ON, Canada .,Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
230
|
Han X, Chen S, Flynn E, Wu S, Wintner D, Shen Y. Distinct epigenomic patterns are associated with haploinsufficiency and predict risk genes of developmental disorders. Nat Commun 2018; 9:2138. [PMID: 29849042 PMCID: PMC5976622 DOI: 10.1038/s41467-018-04552-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 05/08/2018] [Indexed: 12/21/2022] Open
Abstract
Haploinsufficiency is a major mechanism of genetic risk in developmental disorders. Accurate prediction of haploinsufficient genes is essential for prioritizing and interpreting deleterious variants in genetic studies. Current methods based on mutation intolerance in population data suffer from inadequate power for genes with short transcripts. Here we show haploinsufficiency is strongly associated with epigenomic patterns, and develop a computational method (Episcore) to predict haploinsufficiency leveraging epigenomic data from a broad range of tissue and cell types by machine learning methods. Based on data from recent exome sequencing studies on developmental disorders, Episcore achieves better performance in prioritizing likely-gene-disrupting (LGD) de novo variants than current methods. We further show that Episcore is less-biased by gene size, and complementary to mutation intolerance metrics for prioritizing LGD variants. Our approach enables new applications of epigenomic data and facilitates discovery and interpretation of novel risk variants implicated in developmental disorders. Predicting haploinsufficient genes helps to understand the genetic risk underlying developmental disorders. Here, the authors develop a Random Forest-based method that uses epigenomic data to predict haploinsufficiency, Episcore, which is complementary to methods based on mutation intolerance scores.
Collapse
Affiliation(s)
- Xinwei Han
- Department of Systems Biology, Columbia University, New York, NY, 10032, USA.,Department of Pediatrics, Columbia University, New York, NY, 10032, USA.,Constellation Pharmaceuticals, 215 First Street, Cambridge, MA, 02142, USA
| | - Siying Chen
- Department of Systems Biology, Columbia University, New York, NY, 10032, USA.,The Integrated Program in Cellular, Molecular and Biomedical Studies, Columbia University, New York, NY, 10032, USA
| | - Elise Flynn
- Department of Systems Biology, Columbia University, New York, NY, 10032, USA.,The Integrated Program in Cellular, Molecular and Biomedical Studies, Columbia University, New York, NY, 10032, USA
| | - Shuang Wu
- Department of Biostatistics, Columbia University, New York, NY, 10032, USA
| | - Dana Wintner
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Yufeng Shen
- Department of Systems Biology, Columbia University, New York, NY, 10032, USA. .,Department of Biomedical Informatics, Columbia University, New York, NY, 10032, USA. .,JP Sulzberger Columbia Genome Center, Columbia University, New York, NY, 10032, USA.
| |
Collapse
|
231
|
Albrecht F, List M, Bock C, Lengauer T. DeepBlueR: large-scale epigenomic analysis in R. Bioinformatics 2018; 33:2063-2064. [PMID: 28334349 PMCID: PMC5870546 DOI: 10.1093/bioinformatics/btx099] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2016] [Accepted: 02/21/2017] [Indexed: 12/21/2022] Open
Abstract
Motivation While large amounts of epigenomic data are publicly available, their retrieval in a form suitable for downstream analysis is a bottleneck in current research. The DeepBlue Epigenomic Data Server provides a powerful interface and API for filtering, transforming, aggregating and downloading data from several epigenomic consortia. Results To make public epigenomic data conveniently available for analysis in R, we developed an R/Bioconductor package that connects to the DeepBlue Epigenomic Data Server, enabling users to quickly gather and transform epigenomic data from selected experiments for analysis in the Bioconductor ecosystem. Availability and Implementation http://deepblue.mpi-inf.mpg.de/R. Requirements R 3.3, Bioconductor 3.4. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Felipe Albrecht
- Max Planck Institute for Informatics.,Graduate School of Computer Science, Saarland Informatics Campus, 66123 Saarbrücken, Germany
| | | | - Christoph Bock
- Max Planck Institute for Informatics.,CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences.,Department of Laboratory Medicine, Medical University of Vienna, 1090 Vienna, Austria
| | | |
Collapse
|
232
|
Gao Y, Widschwendter M, Teschendorff AE. DNA Methylation Patterns in Normal Tissue Correlate more Strongly with Breast Cancer Status than Copy-Number Variants. EBioMedicine 2018; 31:243-252. [PMID: 29735413 PMCID: PMC6013931 DOI: 10.1016/j.ebiom.2018.04.025] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2018] [Revised: 04/25/2018] [Accepted: 04/27/2018] [Indexed: 02/07/2023] Open
Abstract
Normal tissue at risk of neoplastic transformation is characterized by somatic mutations, copy-number variation and DNA methylation changes. It is unclear however, which type of alteration may be more informative of cancer risk. We analyzed genome-wide DNA methylation and copy-number calls from the same DNA assay in a cohort of healthy breast samples and age-matched normal samples collected adjacent to breast cancer. Using statistical methods to adjust for cell type heterogeneity, we show that DNA methylation changes can discriminate normal-adjacent from normal samples better than somatic copy-number variants. We validate this important finding in an independent dataset. These results suggest that DNA methylation alterations in the normal cell of origin may offer better cancer risk prediction and early detection markers than copy-number changes.
Collapse
Affiliation(s)
- Yang Gao
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai 200031, China
| | - Martin Widschwendter
- Department of Women's Cancer, University College London, 74 Huntley Street, London WC1E 6AU, United Kingdom
| | - Andrew E Teschendorff
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai 200031, China; Department of Women's Cancer, University College London, 74 Huntley Street, London WC1E 6AU, United Kingdom; UCL Cancer Institute, Paul O'Gorman Building, University College London, 72 Huntley Street, London WC1E 6BT, United Kingdom.
| |
Collapse
|
233
|
Kindt AS, Fuerst RW, Knoop J, Laimighofer M, Telieps T, Hippich M, Woerheide MA, Wahl S, Wilson R, Sedlmeier EM, Hommel A, Todd JA, Krumsiek J, Ziegler AG, Bonifacio E. Allele-specific methylation of type 1 diabetes susceptibility genes. J Autoimmun 2018; 89:63-74. [DOI: 10.1016/j.jaut.2017.11.008] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Revised: 11/23/2017] [Accepted: 11/25/2017] [Indexed: 01/09/2023]
|
234
|
Cotter JA, Judkins AR. Fitting the epigenome into the picture: methylation classification for paediatric brain tumours. Neuropathol Appl Neurobiol 2018; 44:543-547. [PMID: 29679371 DOI: 10.1111/nan.12488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- J A Cotter
- Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Keck School of Medicine of the University of Southern California, Los Angeles, California, USA
| | - A R Judkins
- Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Keck School of Medicine of the University of Southern California, Los Angeles, California, USA
| |
Collapse
|
235
|
Angrish MM, Allard P, McCullough SD, Druwe IL, Helbling Chadwick L, Hines E, Chorley BN. Epigenetic Applications in Adverse Outcome Pathways and Environmental Risk Evaluation. ENVIRONMENTAL HEALTH PERSPECTIVES 2018; 126:045001. [PMID: 29669403 PMCID: PMC6071815 DOI: 10.1289/ehp2322] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 02/15/2018] [Accepted: 03/01/2018] [Indexed: 05/07/2023]
Abstract
BACKGROUND The epigenome may be an important interface between environmental chemical exposures and human health. However, the links between epigenetic modifications and health outcomes are often correlative and do not distinguish between cause and effect or common-cause relationships. The Adverse Outcome Pathway (AOP) framework has the potential to demonstrate, by way of an inference- and science-based analysis, the causal relationship between chemical exposures, epigenome, and adverse health outcomes. OBJECTIVE The objective of this work is to discuss the epigenome as a modifier of exposure effects and risk, perspectives for integrating toxicoepigenetic data into an AOP framework, tools for the exploration of epigenetic toxicity, and integration of AOP-guided epigenetic information into science and risk-assessment processes. DISCUSSION Organizing epigenetic information into the topology of a qualitative AOP network may help describe how a system will respond to epigenetic modifications caused by environmental chemical exposures. However, understanding the biological plausibility, linking epigenetic effects to short- and long-term health outcomes, and including epigenetic studies in the risk assessment process is met by substantive challenges. These obstacles include understanding the complex range of epigenetic modifications and their combinatorial effects, the large number of environmental chemicals to be tested, and the lack of data that quantitatively evaluate the epigenetic effects of environmental exposure. CONCLUSION We anticipate that epigenetic information organized into AOP frameworks can be consistently used to support biological plausibility and to identify data gaps that will accelerate the pace at which epigenetic information is applied in chemical evaluation and risk-assessment paradigms. https://doi.org/10.1289/EHP2322.
Collapse
Affiliation(s)
- Michelle M Angrish
- National Center for Environmental Assessment, Office of Research and Development (ORD), U.S. Environmental Protection Agency (EPA), Research Triangle Park, North Carolina, USA
| | - Patrick Allard
- University of California Los Angeles Institute for Society and Genetics, Los Angeles, California, USA
| | - Shaun D McCullough
- National Health and Environmental Effects Research Laboratory, ORD, U.S. EPA, Research Triangle Park, North Carolina, USA
| | - Ingrid L Druwe
- National Center for Environmental Assessment, Office of Research and Development (ORD), U.S. Environmental Protection Agency (EPA), Research Triangle Park, North Carolina, USA
| | - Lisa Helbling Chadwick
- Division of Extramural Research and Training, National Institute of Environmental Health Sciences, National Institutes of Health, Department of Health and Human Services, Research Triangle Park, North Carolina, USA
| | - Erin Hines
- National Center for Environmental Assessment, Office of Research and Development (ORD), U.S. Environmental Protection Agency (EPA), Research Triangle Park, North Carolina, USA
| | - Brian N Chorley
- University of California Los Angeles Institute for Society and Genetics, Los Angeles, California, USA
| |
Collapse
|
236
|
Izzi B, Noro F, Cludts K, Freson K, Hoylaerts MF. Cell-Specific PEAR1 Methylation Studies Reveal a Locus that Coordinates Expression of Multiple Genes. Int J Mol Sci 2018; 19:ijms19041069. [PMID: 29614055 PMCID: PMC5979289 DOI: 10.3390/ijms19041069] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Revised: 03/19/2018] [Accepted: 03/28/2018] [Indexed: 02/07/2023] Open
Abstract
Chromosomal interactions connect distant enhancers and promoters on the same chromosome, activating or repressing gene expression. PEAR1 encodes the Platelet-Endothelial Aggregation Receptor 1, a contact receptor involved in platelet function and megakaryocyte and endothelial cell proliferation. PEAR1 expression during megakaryocyte differentiation is controlled by DNA methylation at its first CpG island. We identified a PEAR1 cell-specific methylation sensitive region in endothelial cells and megakaryocytes that showed strong chromosomal interactions with ISGL20L2, RRNAD1, MRLP24, HDGF and PRCC, using available promoter capture Hi-C datasets. These genes are involved in ribosome processing, protein synthesis, cell cycle and cell proliferation. We next studied the methylation and expression profile of these five genes in Human Umbilical Vein Endothelial Cells (HUVECs) and megakaryocyte precursors. While cell-specific PEAR1 methylation corresponded to variability in expression for four out of five genes, no methylation change was observed in their promoter regions across cell types. Our data suggest that PEAR1 cell-type specific methylation changes may control long distance interactions with other genes. Further studies are needed to show whether such interaction data might be relevant for the genome-wide association data that showed a role for non-coding PEAR1 variants in the same region and platelet function, platelet count and cardiovascular risk.
Collapse
Affiliation(s)
- Benedetta Izzi
- Department of Cardiovascular Sciences, Center for Molecular and Vascular Biology, University of Leuven, 3000 Leuven, Belgium.
| | - Fabrizia Noro
- Department of Epidemiology and Prevention, IRCCS Istituto Neurologico Mediterraneo Neuromed, Via dell'Elettronica, 86077 Pozzilli (IS), Italy.
| | - Katrien Cludts
- Department of Cardiovascular Sciences, Center for Molecular and Vascular Biology, University of Leuven, 3000 Leuven, Belgium.
| | - Kathleen Freson
- Department of Cardiovascular Sciences, Center for Molecular and Vascular Biology, University of Leuven, 3000 Leuven, Belgium.
| | - Marc F Hoylaerts
- Department of Cardiovascular Sciences, Center for Molecular and Vascular Biology, University of Leuven, 3000 Leuven, Belgium.
| |
Collapse
|
237
|
Allyn-Feuer A, Ade A, Luzum JA, Higgins GA, Athey BD. The pharmacoepigenomics informatics pipeline defines a pathway of novel and known warfarin pharmacogenomics variants. Pharmacogenomics 2018; 19:413-434. [PMID: 29400612 PMCID: PMC6021929 DOI: 10.2217/pgs-2017-0186] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 01/16/2018] [Indexed: 12/21/2022] Open
Abstract
AIM 'Pharmacoepigenomics' methods informed by omics datasets and pre-existing knowledge have yielded discoveries in neuropsychiatric pharmacogenomics. Now we evaluate the generality of these methods by discovering an extended warfarin pharmacogenomics pathway. MATERIALS & METHODS We developed the pharmacoepigenomics informatics pipeline, a scalable multi-omics variant screening pipeline for pharmacogenomics, and conducted an experiment in the genomics of warfarin. RESULTS We discovered known and novel pharmacogenomics variants and genes, both coding and regulatory, for warfarin response, including adverse events. Such genes and variants cluster in a warfarin response pathway consolidating known and novel warfarin response variants and genes. CONCLUSION These results can inform a new warfarin test. The pharmacoepigenomics informatics pipeline may be able to discover new pharmacogenomics markers in other drug-disease systems.
Collapse
Affiliation(s)
- Ari Allyn-Feuer
- Department of Computational Medicine & Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Alex Ade
- Department of Computational Medicine & Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Jasmine A Luzum
- Department of Clinical Pharmacy, College of Pharmacy, University of Michigan, Ann Arbor, MI 48109, USA
| | - Gerald A Higgins
- Department of Computational Medicine & Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Brian D Athey
- Department of Computational Medicine & Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Department of Internal Medicine, University of Michigan Health System, Ann Arbor, MI 48109, USA
- Department of Psychiatry, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Michigan Institute for Data Science, University of Michigan Office of Research, Ann Arbor, MI 48109, USA
| |
Collapse
|
238
|
Cibrián Uhalte E, Wilkinson JM, Southam L, Zeggini E. Pathways to understanding the genomic aetiology of osteoarthritis. Hum Mol Genet 2018; 26:R193-R201. [PMID: 28977450 PMCID: PMC5886472 DOI: 10.1093/hmg/ddx302] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Accepted: 07/25/2017] [Indexed: 02/07/2023] Open
Abstract
Osteoarthritis is a common, complex disease with no curative therapy. In this review, we summarize current knowledge on disease aetiopathogenesis and outline genetics and genomics approaches that are helping catalyse a much-needed improved understanding of the biological underpinning of disease development and progression.
Collapse
Affiliation(s)
- Elena Cibrián Uhalte
- Human Genetics and Cellular Genetics, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, UK
| | - Jeremy Mark Wilkinson
- Department of Oncology and Metabolism, University of Sheffield, Sheffield, S10 2RX, UK
| | - Lorraine Southam
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK.,Human Genetics, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, UK
| | | |
Collapse
|
239
|
Marinov GK, Kundaje A. ChIP-ping the branches of the tree: functional genomics and the evolution of eukaryotic gene regulation. Brief Funct Genomics 2018; 17:116-137. [PMID: 29529131 PMCID: PMC5889016 DOI: 10.1093/bfgp/ely004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Advances in the methods for detecting protein-DNA interactions have played a key role in determining the directions of research into the mechanisms of transcriptional regulation. The most recent major technological transformation happened a decade ago, with the move from using tiling arrays [chromatin immunoprecipitation (ChIP)-on-Chip] to high-throughput sequencing (ChIP-seq) as a readout for ChIP assays. In addition to the numerous other ways in which it is superior to arrays, by eliminating the need to design and manufacture them, sequencing also opened the door to carrying out comparative analyses of genome-wide transcription factor occupancy across species and studying chromatin biology in previously less accessible model and nonmodel organisms, thus allowing us to understand the evolution and diversity of regulatory mechanisms in unprecedented detail. Here, we review the biological insights obtained from such studies in recent years and discuss anticipated future developments in the field.
Collapse
Affiliation(s)
- Georgi K Marinov
- Corresponding author: Georgi K. Marinov, Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA. E-mail:
| | | |
Collapse
|
240
|
Widschwendter M, Jones A, Evans I, Reisel D, Dillner J, Sundström K, Steyerberg EW, Vergouwe Y, Wegwarth O, Rebitschek FG, Siebert U, Sroczynski G, de Beaufort ID, Bolt I, Cibula D, Zikan M, Bjørge L, Colombo N, Harbeck N, Dudbridge F, Tasse AM, Knoppers BM, Joly Y, Teschendorff AE, Pashayan N. Epigenome-based cancer risk prediction: rationale, opportunities and challenges. Nat Rev Clin Oncol 2018; 15:292-309. [PMID: 29485132 DOI: 10.1038/nrclinonc.2018.30] [Citation(s) in RCA: 103] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The incidence of cancer is continuing to rise and risk-tailored early diagnostic and/or primary prevention strategies are urgently required. The ideal risk-predictive test should: integrate the effects of both genetic and nongenetic factors and aim to capture these effects using an approach that is both biologically stable and technically reproducible; derive a score from easily accessible biological samples that acts as a surrogate for the organ in question; and enable the effectiveness of risk-reducing measures to be monitored. Substantial evidence has accumulated suggesting that the epigenome and, in particular, DNA methylation-based tests meet all of these requirements. However, the development and implementation of DNA methylation-based risk-prediction tests poses considerable challenges. In particular, the cell type specificity of DNA methylation and the extensive cellular heterogeneity of the easily accessible surrogate cells that might contain information relevant to less accessible tissues necessitates the use of novel methods in order to account for these confounding issues. Furthermore, the engagement of the scientific community with health-care professionals, policymakers and the public is required in order to identify and address the organizational, ethical, legal, social and economic challenges associated with the routine use of epigenetic testing.
Collapse
Affiliation(s)
- Martin Widschwendter
- Department of Women's Cancer, Institute for Women's Health, University College London, London, UK
| | - Allison Jones
- Department of Women's Cancer, Institute for Women's Health, University College London, London, UK
| | - Iona Evans
- Department of Women's Cancer, Institute for Women's Health, University College London, London, UK
| | - Daniel Reisel
- Department of Women's Cancer, Institute for Women's Health, University College London, London, UK
| | - Joakim Dillner
- Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden.,Karolinska University Laboratory, Karolinska University Hospital, Stockholm, Sweden
| | - Karin Sundström
- Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden.,Karolinska University Laboratory, Karolinska University Hospital, Stockholm, Sweden
| | - Ewout W Steyerberg
- Center for Medical Decision Sciences, Department of Public Health, Erasmus MC, Rotterdam, Netherlands.,Department of Biomedical Data Sciences, LUMC, Leiden, Netherlands
| | - Yvonne Vergouwe
- Center for Medical Decision Sciences, Department of Public Health, Erasmus MC, Rotterdam, Netherlands
| | - Odette Wegwarth
- Max Planck Institute for Human Development, Harding Center for Risk Literacy, Berlin, Germany.,Max Planck Institute for Human Development, Center for Adaptive Rationality, Berlin, Germany
| | - Felix G Rebitschek
- Max Planck Institute for Human Development, Harding Center for Risk Literacy, Berlin, Germany
| | - Uwe Siebert
- Institute of Public Health, Medical Decision Making and Health Technology Assessment, Department of Public Health, Health Services Research, and HTA, UMIT-University for Health Sciences, Medical Informatics and Technology, Hall in Tirol, Austria.,Harvard T. C. Chan School of Public Health, Center for Health Decision Science, Department of Health Policy and Management, Boston, MA, USA.,Oncotyrol: Center for Personalized Medicine, Innsbruck, Austria
| | - Gaby Sroczynski
- Institute of Public Health, Medical Decision Making and Health Technology Assessment, Department of Public Health, Health Services Research, and HTA, UMIT-University for Health Sciences, Medical Informatics and Technology, Hall in Tirol, Austria
| | - Inez D de Beaufort
- Department of Medical Ethics and Philosophy of Medicine, Erasmus Medical Center, Rotterdam, Netherlands
| | - Ineke Bolt
- Department of Medical Ethics and Philosophy of Medicine, Erasmus Medical Center, Rotterdam, Netherlands
| | - David Cibula
- Department of Obstetrics and Gynaecology, First Medical Faculty of the Charles University and General Faculty Hospital, Prague, Czech Republic
| | - Michal Zikan
- Department of Obstetrics and Gynaecology, First Medical Faculty of the Charles University and General Faculty Hospital, Prague, Czech Republic
| | - Line Bjørge
- Department of Obstetrics and Gynecology, Haukeland University Hospital, and Centre for Cancer Biomarkers, Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Nicoletta Colombo
- European Institute of Oncology and University Milan-Bicocca, Milan, Italy
| | - Nadia Harbeck
- Breast Center, Department of Gynaecology and Obstetrics, University of Munich (LMU), Munich, Germany
| | - Frank Dudbridge
- Department of Non-communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK.,Department of Health Sciences, University of Leicester, Leicester, UK
| | - Anne-Marie Tasse
- Public Population Project in Genomics and Society, McGill University and Genome Quebec Innovation Centre, Montreal, Canada
| | | | - Yann Joly
- Centre of Genomics and Policy, McGill University, Montreal, Canada
| | - Andrew E Teschendorff
- Department of Women's Cancer, Institute for Women's Health, University College London, London, UK
| | - Nora Pashayan
- Department of Applied Health Research, Institute of Epidemiology and Healthcare, University College London, UK
| | | |
Collapse
|
241
|
Dekker J, Belmont AS, Guttman M, Leshyk VO, Lis JT, Lomvardas S, Mirny LA, O'Shea CC, Park PJ, Ren B, Politz JCR, Shendure J, Zhong S. The 4D nucleome project. Nature 2018; 549:219-226. [PMID: 28905911 DOI: 10.1038/nature23884] [Citation(s) in RCA: 437] [Impact Index Per Article: 72.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Accepted: 07/27/2017] [Indexed: 12/19/2022]
Abstract
The 4D Nucleome Network aims to develop and apply approaches to map the structure and dynamics of the human and mouse genomes in space and time with the goal of gaining deeper mechanistic insights into how the nucleus is organized and functions. The project will develop and benchmark experimental and computational approaches for measuring genome conformation and nuclear organization, and investigate how these contribute to gene regulation and other genome functions. Validated experimental technologies will be combined with biophysical approaches to generate quantitative models of spatial genome organization in different biological states, both in cell populations and in single cells.
Collapse
Affiliation(s)
- Job Dekker
- Program in Systems Biology, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Howard Hughes Medical Institute, Worcester, Massachusetts 01605, USA
| | - Andrew S Belmont
- Department of Cell and Developmental Biology, University of Illinois, Urbana-Champaign, Illinois 61801, USA
| | - Mitchell Guttman
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125, USA
| | - Victor O Leshyk
- Department of Bioengineering, University of California San Diego, La Jolla, California 92093, USA
| | - John T Lis
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Stavros Lomvardas
- Department of Biochemistry and Molecular Biophysics, Mortimer B. Zuckerman Mind Brain and Behavior Institute, Columbia University, New York, New York 10027, USA
| | - Leonid A Mirny
- Institute for Medical Engineering and Science, and Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Clodagh C O'Shea
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | - Peter J Park
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, Institute of Genomic Medicine, Moores Cancer Center, University of California San Diego, La Jolla California 92093, USA
| | - Joan C Ritland Politz
- Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, USA
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Howard Hughes Medical Institute, Seattle, Washington 98109, USA
| | - Sheng Zhong
- Department of Bioengineering, University of California San Diego, La Jolla, California 92093, USA
| | | |
Collapse
|
242
|
Venuto D, Bourque G. Identifying co-opted transposable elements using comparative epigenomics. Dev Growth Differ 2018; 60:53-62. [PMID: 29363107 DOI: 10.1111/dgd.12423] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 12/08/2017] [Indexed: 12/19/2022]
Abstract
The human genome gives rise to different epigenomic landscapes that define each cell type and can be deregulated in disease. Recent efforts by ENCODE, the NIH Roadmap and the International Human Epigenome Consortium (IHEC) have made significant advances towards assembling reference epigenomic maps of various tissues. Notably, these projects have found that approximately 80% of human DNA was biochemically active in at least one epigenomic assay while only approximately 10% of the sequence displayed signs of purifying selection. Given that transposable elements (TEs) make up at least 50% of the human genome and can be actively transcribed or act as regulatory elements either for their own purposes or be co-opted for the benefit of their host; we are interested in exploring their overall contribution to the "functional" genome. Traditional methods used to identify functional DNA have relied on comparative genomics, conservation analysis and low throughput validation assays. To discover co-opted TEs, and distinguish them from noisy genomic elements, we argue that comparative epigenomic methods will also be important.
Collapse
Affiliation(s)
- David Venuto
- Department of Human Genetics, McGill University, Montréal, H3A 1B1, Québec, Canada
| | - Guillaume Bourque
- Department of Human Genetics, McGill University, Montréal, H3A 1B1, Québec, Canada.,Canadian Center for Computational Genomics, Montréal, H3A 0G1, Québec, Canada.,McGill University and Génome Québec Innovation Center, Montréal, H3A 0G1, Québec, Canada
| |
Collapse
|
243
|
Cook CE, Bergman MT, Cochrane G, Apweiler R, Birney E. The European Bioinformatics Institute in 2017: data coordination and integration. Nucleic Acids Res 2018; 46:D21-D29. [PMID: 29186510 PMCID: PMC5753251 DOI: 10.1093/nar/gkx1154] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Revised: 10/30/2017] [Accepted: 11/20/2017] [Indexed: 12/14/2022] Open
Abstract
The European Bioinformatics Institute (EMBL-EBI) supports life-science research throughout the world by providing open data, open-source software and analytical tools, and technical infrastructure (https://www.ebi.ac.uk). We accommodate an increasingly diverse range of data types and integrate them, so that biologists in all disciplines can explore life in ever-increasing detail. We maintain over 40 data resources, many of which are run collaboratively with partners in 16 countries (https://www.ebi.ac.uk/services). Submissions continue to increase exponentially: our data storage has doubled in less than two years to 120 petabytes. Recent advances in cellular imaging and single-cell sequencing techniques are generating a vast amount of high-dimensional data, bringing to light new cell types and new perspectives on anatomy. Accordingly, one of our main focus areas is integrating high-quality information from bioimaging, biobanking and other types of molecular data. This is reflected in our deep involvement in Open Targets, stewarding of plant phenotyping standards (MIAPPE) and partnership in the Human Cell Atlas data coordination platform, as well as the 2017 launch of the Omics Discovery Index. This update gives a birds-eye view of EMBL-EBI's approach to data integration and service development as genomics begins to enter the clinic.
Collapse
Affiliation(s)
- Charles E Cook
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mary T Bergman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Guy Cochrane
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rolf Apweiler
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
244
|
Chen Y, Widschwendter M, Teschendorff AE. Systems-epigenomics inference of transcription factor activity implicates aryl-hydrocarbon-receptor inactivation as a key event in lung cancer development. Genome Biol 2017; 18:236. [PMID: 29262847 PMCID: PMC5738803 DOI: 10.1186/s13059-017-1366-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2017] [Accepted: 11/27/2017] [Indexed: 12/25/2022] Open
Abstract
Background Diverse molecular alterations associated with smoking in normal and precursor lung cancer cells have been reported, yet their role in lung cancer etiology remains unclear. A prominent example is hypomethylation of the aryl hydrocarbon-receptor repressor (AHRR) locus, which is observed in blood and squamous epithelial cells of smokers, but not in lung cancer. Results Using a novel systems-epigenomics algorithm, called SEPIRA, which leverages the power of a large RNA-sequencing expression compendium to infer regulatory activity from messenger RNA expression or DNA methylation (DNAm) profiles, we infer the landscape of binding activity of lung-specific transcription factors (TFs) in lung carcinogenesis. We show that lung-specific TFs become preferentially inactivated in lung cancer and precursor lung cancer lesions and further demonstrate that these results can be derived using only DNAm data. We identify subsets of TFs which become inactivated in precursor cells. Among these regulatory factors, we identify AHR, the aryl hydrocarbon-receptor which controls a healthy immune response in the lung epithelium and whose repressor, AHRR, has recently been implicated in smoking-mediated lung cancer. In addition, we identify FOXJ1, a TF which promotes growth of airway cilia and effective clearance of the lung airway epithelium from carcinogens. Conclusions We identify TFs, such as AHR, which become inactivated in the earliest stages of lung cancer and which, unlike AHRR hypomethylation, are also inactivated in lung cancer itself. The novel systems-epigenomics algorithm SEPIRA will be useful to the wider epigenome-wide association study community as a means of inferring regulatory activity. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1366-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yuting Chen
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, 320 Yue Yang Road, Shanghai, 200031, China
| | - Martin Widschwendter
- Department of Women's Cancer, University College London, 74 Huntley Street, London, WC1E 6AU, UK
| | - Andrew E Teschendorff
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, 320 Yue Yang Road, Shanghai, 200031, China. .,Department of Women's Cancer, University College London, 74 Huntley Street, London, WC1E 6AU, UK. .,UCL Cancer Institute, University College London, Paul O'Gorman Building, 72 Huntley Street, London, WC1E 6BT, UK.
| |
Collapse
|
245
|
Ecker S, Pancaldi V, Valencia A, Beck S, Paul DS. Epigenetic and Transcriptional Variability Shape Phenotypic Plasticity. Bioessays 2017; 40. [PMID: 29251357 DOI: 10.1002/bies.201700148] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Revised: 10/31/2017] [Indexed: 12/15/2022]
Abstract
Epigenetic and transcriptional variability contribute to the vast diversity of cellular and organismal phenotypes and are key in human health and disease. In this review, we describe different types, sources, and determinants of epigenetic and transcriptional variability, enabling cells and organisms to adapt and evolve to a changing environment. We highlight the latest research and hypotheses on how chromatin structure and the epigenome influence gene expression variability. Further, we provide an overview of challenges in the analysis of biological variability. An improved understanding of the molecular mechanisms underlying epigenetic and transcriptional variability, at both the intra- and inter-individual level, provides great opportunity for disease prevention, better therapeutic approaches, and personalized medicine.
Collapse
Affiliation(s)
- Simone Ecker
- UCL Cancer Institute, University College London, 72 Huntley Street, London, WC1E 6BT, UK
| | - Vera Pancaldi
- Barcelona Supercomputing Center (BSC), C/ Jordi Girona 39-31, 08034, Barcelona, Spain
| | - Alfonso Valencia
- Barcelona Supercomputing Center (BSC), C/ Jordi Girona 39-31, 08034, Barcelona, Spain.,ICREA, Pg. Lluís Companys 23, 08010, Barcelona, Spain
| | - Stephan Beck
- UCL Cancer Institute, University College London, 72 Huntley Street, London, WC1E 6BT, UK
| | - Dirk S Paul
- MRC/BHF Cardiovascular Epidemiology Unit Department of Public Health and Primary Care, University of Cambridge, Cambridge, CB1 8RN, UK.,Department of Human Genetics Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1HH, UK
| |
Collapse
|
246
|
Baranzini SE, Oksenberg JR. The Genetics of Multiple Sclerosis: From 0 to 200 in 50 Years. Trends Genet 2017; 33:960-970. [PMID: 28987266 PMCID: PMC5701819 DOI: 10.1016/j.tig.2017.09.004] [Citation(s) in RCA: 127] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 08/26/2017] [Accepted: 09/11/2017] [Indexed: 12/12/2022]
Abstract
Multiple sclerosis (MS) is a common autoimmune disease that targets myelin in the central nervous system (CNS). Multiple genome-wide association studies (GWAS) over the past 10 years have uncovered more than 200 loci that independently contribute to disease pathogenesis. As with many other complex diseases, risk of developing MS is driven by multiple common variants whose biological effects are not immediately clear. Here, we present a historical perspective on the progress made in MS genetics and discuss current work geared towards creating a more complete model that accurately represents the genetic landscape of MS susceptibility. Such a model necessarily includes a better understanding of the individual contributions of each common variant to the cellular phenotypes, and interactions with other genes and with the environment. Future genetic studies in MS will likely focus on the role of rare variants and endophenotypes.
Collapse
Affiliation(s)
- Sergio E Baranzini
- Weill Institute for Neurosciences. Department of Neurology, University of California San Francisco, San Francisco, CA, USA; Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA; Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA.
| | - Jorge R Oksenberg
- Weill Institute for Neurosciences. Department of Neurology, University of California San Francisco, San Francisco, CA, USA; Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
247
|
Statistical and integrative system-level analysis of DNA methylation data. Nat Rev Genet 2017; 19:129-147. [PMID: 29129922 DOI: 10.1038/nrg.2017.86] [Citation(s) in RCA: 168] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Epigenetics plays a key role in cellular development and function. Alterations to the epigenome are thought to capture and mediate the effects of genetic and environmental risk factors on complex disease. Currently, DNA methylation is the only epigenetic mark that can be measured reliably and genome-wide in large numbers of samples. This Review discusses some of the key statistical challenges and algorithms associated with drawing inferences from DNA methylation data, including cell-type heterogeneity, feature selection, reverse causation and system-level analyses that require integration with other data types such as gene expression, genotype, transcription factor binding and other epigenetic information.
Collapse
|
248
|
Siu C, Wiseman S, Gakkhar S, Heravi-Moussavi A, Bilenky M, Carles A, Sierocinski T, Tam A, Zhao E, Kasaian K, Moore RA, Mungall AJ, Walker B, Thomson T, Marra MA, Hirst M, Jones SJM. Characterization of the human thyroid epigenome. J Endocrinol 2017; 235:153-165. [PMID: 28808080 DOI: 10.1530/joe-17-0145] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Accepted: 08/14/2017] [Indexed: 12/15/2022]
Abstract
The thyroid gland, necessary for normal human growth and development, functions as an essential regulator of metabolism by the production and secretion of appropriate levels of thyroid hormone. However, assessment of abnormal thyroid function may be challenging suggesting a more fundamental understanding of normal function is needed. One way to characterize normal gland function is to study the epigenome and resulting transcriptome within its constituent cells. This study generates the first published reference epigenomes for human thyroid from four individuals using ChIP-seq and RNA-seq. We profiled six histone modifications (H3K4me1, H3K4me3, H3K27ac, H3K36me3, H3K9me3, H3K27me3), identified chromatin states using a hidden Markov model, produced a novel quantitative metric for model selection and established epigenomic maps of 19 chromatin states. We found that epigenetic features characterizing promoters and transcription elongation tend to be more consistent than regions characterizing enhancers or Polycomb-repressed regions and that epigenetically active genes consistent across all epigenomes tend to have higher expression than those not marked as epigenetically active in all epigenomes. We also identified a set of 18 genes epigenetically active and consistently expressed in the thyroid that are likely highly relevant to thyroid function. Altogether, these epigenomes represent a powerful resource to develop a deeper understanding of the underlying molecular biology of thyroid function and provide contextual information of thyroid and human epigenomic data for comparison and integration into future studies.
Collapse
Affiliation(s)
- Celia Siu
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
- Department of SciencesUniversity of British Columbia, Vancouver, Canada
| | - Sam Wiseman
- Department of SurgerySt. Paul's Hospital & University of British Columbia, Vancouver, Canada
| | - Sitanshu Gakkhar
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
| | | | - Misha Bilenky
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
| | - Annaick Carles
- Department of Microbiology & ImmunologyMichael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Thomas Sierocinski
- Department of Microbiology & ImmunologyMichael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Angela Tam
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
| | - Eric Zhao
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
| | - Katayoon Kasaian
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
| | - Richard A Moore
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
| | - Andrew J Mungall
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
| | - Blair Walker
- Department of Pathology and Laboratory MedicineSt. Paul's Hospital & University of British Columbia, Vancouver, Canada
| | - Thomas Thomson
- Department of Pathology and Laboratory MedicineBC Cancer Agency & University of British Columbia, Vancouver, Canada
| | - Marco A Marra
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
- Department of Medical GeneticsUniversity of British Columbia, Vancouver, Canada
| | - Martin Hirst
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
- Department of Microbiology & ImmunologyMichael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Steven J M Jones
- Canada's Michael Smith Genome Sciences CentreBC Cancer Agency, Vancouver, Canada
- Department of Medical GeneticsUniversity of British Columbia, Vancouver, Canada
- Department of Molecular Biology & BiochemistrySimon Fraser University, Burnaby, Canada
| |
Collapse
|
249
|
Sud A, Kinnersley B, Houlston RS. Genome-wide association studies of cancer: current insights and future perspectives. Nat Rev Cancer 2017; 17:692-704. [PMID: 29026206 DOI: 10.1038/nrc.2017.82] [Citation(s) in RCA: 224] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Genome-wide association studies (GWAS) provide an agnostic approach for investigating the genetic basis of complex diseases. In oncology, GWAS of nearly all common malignancies have been performed, and over 450 genetic variants associated with increased risks have been identified. As well as revealing novel pathways important in carcinogenesis, these studies have shown that common genetic variation contributes substantially to the heritable risk of many common cancers. The clinical application of GWAS is starting to provide opportunities for drug discovery and repositioning as well as for cancer prevention. However, deciphering the functional and biological basis of associations is challenging and is in part a barrier to fully unlocking the potential of GWAS.
Collapse
Affiliation(s)
- Amit Sud
- Division of Genetics and Epidemiology, The Institute of Cancer Research
| | - Ben Kinnersley
- Division of Genetics and Epidemiology, The Institute of Cancer Research
| | - Richard S Houlston
- Division of Genetics and Epidemiology, The Institute of Cancer Research
- Division of Molecular Pathology, The Institute of Cancer Research, 15 Cotswold Road, Sutton, London SM2 5NG, UK
| |
Collapse
|
250
|
Zhang Y, Hardison RC. Accurate and reproducible functional maps in 127 human cell types via 2D genome segmentation. Nucleic Acids Res 2017; 45:9823-9836. [PMID: 28973456 PMCID: PMC5622376 DOI: 10.1093/nar/gkx659] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2017] [Accepted: 07/25/2017] [Indexed: 12/20/2022] Open
Abstract
The Roadmap Epigenomics Consortium has published whole-genome functional annotation maps in 127 human cell types by integrating data from studies of multiple epigenetic marks. These maps have been widely used for studying gene regulation in cell type-specific contexts and predicting the functional impact of DNA mutations on disease. Here, we present a new map of functional elements produced by applying a method called IDEAS on the same data. The method has several unique advantages and outperforms existing methods, including that used by the Roadmap Epigenomics Consortium. Using five categories of independent experimental datasets, we compared the IDEAS and Roadmap Epigenomics maps. While the overall concordance between the two maps is high, the maps differ substantially in the prediction details and in their consistency of annotation of a given genomic position across cell types. The annotation from IDEAS is uniformly more accurate than the Roadmap Epigenomics annotation and the improvement is substantial based on several criteria. We further introduce a pipeline that improves the reproducibility of functional annotation maps. Thus, we provide a high-quality map of candidate functional regions across 127 human cell types and compare the quality of different annotation methods in order to facilitate biomedical research in epigenomics.
Collapse
Affiliation(s)
- Yu Zhang
- Department of Statistics, the Pennsylvania State University, University Park, PA 16802, USA
| | - Ross C Hardison
- Department of Biochemistry and Molecular Biology, the Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|