251
|
Initial high-resolution microscopic mapping of active and inactive regulatory sequences proves non-random 3D arrangements in chromatin domain clusters. Epigenetics Chromatin 2017; 10:39. [PMID: 28784182 PMCID: PMC5547466 DOI: 10.1186/s13072-017-0146-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Accepted: 07/31/2017] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND The association of active transcription regulatory elements (TREs) with DNAse I hypersensitivity (DHS[+]) and an 'open' local chromatin configuration has long been known. However, the 3D topography of TREs within the nuclear landscape of individual cells in relation to their active or inactive status has remained elusive. Here, we explored the 3D nuclear topography of active and inactive TREs in the context of a recently proposed model for a functionally defined nuclear architecture, where an active and an inactive nuclear compartment (ANC-INC) form two spatially co-aligned and functionally interacting networks. RESULTS Using 3D structured illumination microscopy, we performed 3D FISH with differently labeled DNA probe sets targeting either sites with DHS[+], apparently active TREs, or DHS[-] sites harboring inactive TREs. Using an in-house image analysis tool, DNA targets were quantitatively mapped on chromatin compaction shaped 3D nuclear landscapes. Our analyses present evidence for a radial 3D organization of chromatin domain clusters (CDCs) with layers of increasing chromatin compaction from the periphery to the CDC core. Segments harboring active TREs are significantly enriched at the decondensed periphery of CDCs with loops penetrating into interchromatin compartment channels, constituting the ANC. In contrast, segments lacking active TREs (DHS[-]) are enriched toward the compacted interior of CDCs (INC). CONCLUSIONS Our results add further evidence in support of the ANC-INC network model. The different 3D topographies of DHS[+] and DHS[-] sites suggest positional changes of TREs between the ANC and INC depending on their functional state, which might provide additional protection against an inappropriate activation. Our finding of a structural organization of CDCs based on radially arranged layers of different chromatin compaction levels indicates a complex higher-order chromatin organization beyond a dichotomic classification of chromatin into an 'open,' active and 'closed,' inactive state.
Collapse
|
252
|
Chuang TJ, Tseng YH, Chen CY, Wang YD. Assessment of imprinting- and genetic variation-dependent monoallelic expression using reciprocal allele descendants between human family trios. Sci Rep 2017; 7:7038. [PMID: 28765567 PMCID: PMC5539102 DOI: 10.1038/s41598-017-07514-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Accepted: 06/23/2017] [Indexed: 11/23/2022] Open
Abstract
Genomic imprinting is an important epigenetic process that silences one of the parentally-inherited alleles of a gene and thereby exhibits allelic-specific expression (ASE). Detection of human imprinting events is hampered by the infeasibility of the reciprocal mating system in humans and the removal of ASE events arising from non-imprinting factors. Here, we describe a pipeline with the pattern of reciprocal allele descendants (RADs) through genotyping and transcriptome sequencing data across independent parent-offspring trios to discriminate between varied types of ASE (e.g., imprinting, genetic variation-dependent ASE, and random monoallelic expression (RME)). We show that the vast majority of ASE events are due to sequence-dependent genetic variant, which are evolutionarily conserved and may themselves play a cis-regulatory role. Particularly, 74% of non-RAD ASE events, even though they exhibit ASE biases toward the same parentally-inherited allele across different individuals, are derived from genetic variation but not imprinting. We further show that the RME effect may affect the effectiveness of the population-based method for detecting imprinting events and our pipeline can help to distinguish between these two ASE types. Taken together, this study provides a good indicator for categorization of different types of ASE, opening up this widespread and complex mechanism for comprehensive characterization.
Collapse
Affiliation(s)
| | | | - Chia-Ying Chen
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Yi-Da Wang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
253
|
Litchfield K, Levy M, Orlando G, Loveday C, Law P, Migliorini G, Holroyd A, Broderick P, Karlsson R, Haugen TB, Kristiansen W, Nsengimana J, Fenwick K, Assiotis I, Kote-Jarai ZS, Dunning AM, Muir K, Peto J, Eeles R, Easton DF, Dudakia D, Orr N, Pashayan N, Bishop DT, Reid A, Huddart RA, Shipley J, Grotmol T, Wiklund F, Houlston RS, Turnbull C. Identification of 19 new risk loci and potential regulatory mechanisms influencing susceptibility to testicular germ cell tumor. Nat Genet 2017; 49:1133-1140. [PMID: 28604728 PMCID: PMC6016736 DOI: 10.1038/ng.3896] [Citation(s) in RCA: 99] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 05/16/2017] [Indexed: 12/29/2022]
Abstract
Genome-wide association studies (GWAS) have transformed understanding of susceptibility to testicular germ cell tumors (TGCTs), but much of the heritability remains unexplained. Here we report a new GWAS, a meta-analysis with previous GWAS and a replication series, totaling 7,319 TGCT cases and 23,082 controls. We identify 19 new TGCT risk loci, roughly doubling the number of known TGCT risk loci to 44. By performing in situ Hi-C in TGCT cells, we provide evidence for a network of physical interactions among all 44 TGCT risk SNPs and candidate causal genes. Our findings implicate widespread disruption of developmental transcriptional regulators as a basis of TGCT susceptibility, consistent with failed primordial germ cell differentiation as an initiating step in oncogenesis. Defective microtubule assembly and dysregulation of KIT-MAPK signaling also feature as recurrently disrupted pathways. Our findings support a polygenic model of risk and provide insight into the biological basis of TGCT.
Collapse
Affiliation(s)
- Kevin Litchfield
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Max Levy
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Giulia Orlando
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Chey Loveday
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Philip Law
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Gabriele Migliorini
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Amy Holroyd
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Peter Broderick
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Robert Karlsson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, 171 77, Sweden
| | - Trine B Haugen
- Faculty of Health Sciences, Oslo and Akershus University College of Applied Sciences, Oslo, Norway
| | - Wenche Kristiansen
- Faculty of Health Sciences, Oslo and Akershus University College of Applied Sciences, Oslo, Norway
| | - Jérémie Nsengimana
- Section of Epidemiology & Biostatistics, Leeds Institute of Cancer and Pathology, Leeds, LS9 7TF, UK
| | - Kerry Fenwick
- Tumour Profiling Unit, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Ioannis Assiotis
- Tumour Profiling Unit, The Institute of Cancer Research, London, SM2 5NG, UK
| | - ZSofia Kote-Jarai
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Alison M. Dunning
- Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Cambridge, CB1 8RN, UK
| | - Kenneth Muir
- Division of Health Sciences, Warwick Medical School, Warwick University, CV4 7AL, UK
- Institute of Population Health, University of Manchester, M1 3BB, UK
| | - Julian Peto
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Rosalind Eeles
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
- Royal Marsden NHS Foundation Trust, London, SM2 5NG, UK
| | - Douglas F Easton
- Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Cambridge, CB1 8RN, UK
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, CB1 8RN, UK
| | - Darshna Dudakia
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Nick Orr
- The Breast Cancer Now Toby Robins Research Centre, The Institute of Cancer Research, 237 Fulham Road, London SW3 6JB, UK
| | - Nora Pashayan
- Department of Applied Health Research, University College London, London, WC1E 6BT, UK
| | | | | | - D. Timothy Bishop
- Section of Epidemiology & Biostatistics, Leeds Institute of Cancer and Pathology, Leeds, LS9 7TF, UK
| | - Alison Reid
- Academic Radiotherapy Unit, Institute of Cancer Research, Sutton, Surrey, SM2 5NG, UK
| | - Robert A Huddart
- Academic Radiotherapy Unit, Institute of Cancer Research, Sutton, Surrey, SM2 5NG, UK
| | - Janet Shipley
- Division of Molecular Pathology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Tom Grotmol
- Department of Research, Cancer Registry of Norway, Oslo, 0369, Norway
| | - Fredrik Wiklund
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, 171 77, Sweden
| | - Richard S Houlston
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
| | - Clare Turnbull
- Division of Genetics & Epidemiology, The Institute of Cancer Research, London, SM2 5NG, UK
- William Harvey Research Institute, Queen Mary University, London, EC1M 6BQ, UK
| |
Collapse
|
254
|
Mamrut S, Avidan N, Truffault F, Staun-Ram E, Sharshar T, Eymard B, Frenkian M, Pitha J, de Baets M, Servais L, Berrih-Aknin S, Miller A. Methylome and transcriptome profiling in Myasthenia Gravis monozygotic twins. J Autoimmun 2017; 82:62-73. [PMID: 28549776 DOI: 10.1016/j.jaut.2017.05.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Revised: 05/11/2017] [Accepted: 05/15/2017] [Indexed: 01/07/2023]
Abstract
OBJECTIVE To identify novel genetic and epigenetic factors associated with Myasthenia gravis (MG) using an identical twins experimental study design. METHODS The transcriptome and methylome of peripheral monocytes were compared between monozygotic (MZ) twins discordant and concordant for MG, as well as with MG singletons and healthy controls, all females. Sets of differentially expressed genes and differentially methylated CpGs were validated using RT-PCR for expression and target bisulfite sequencing for methylation on additional samples. RESULTS >100 differentially expressed genes and ∼1800 differentially methylated CpGs were detected in peripheral monocytes between MG patients and controls. Several transcripts associated with immune homeostasis and inflammation resolution were reduced in MG patients. Only a relatively few genes differed between the discordant healthy and MG co-twins, and both their expression and methylation profiles demonstrated very high similarity. INTERPRETATION This is the first study to characterize the DNA methylation profile in MG, and the expression profile of immune cells in MZ twins with MG. Results suggest that numerous small changes in gene expression or methylation might together contribute to disease. Impaired monocyte function in MG and decreased expression of genes associated with inflammation resolution could contribute to the chronicity of the disease. Findings may serve as potential new predictive biomarkers for disease and disease activity, as well as potential future targets for therapy development. The high similarity between the healthy and the MG discordant twins, suggests that a molecular signature might precede a clinical phenotype, and that genetic predisposition may have a stronger contribution to disease than previously assumed.
Collapse
Affiliation(s)
- Shimrat Mamrut
- Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, 31096, Israel
| | - Nili Avidan
- Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, 31096, Israel
| | - Frédérique Truffault
- INSERM - U974/CNRS UMR7215//UPMC UM76/AIM, Institute of Myology Pitie-Salpetriere, Paris, 73013, France
| | - Elsebeth Staun-Ram
- Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, 31096, Israel
| | - Tarek Sharshar
- General Intensive Care Medicine, Assistance Publique Hôpitaux de Paris, Raymond Poincaré Hospital, University of Versailles Saint-Quentin en Yvelines, 92380, Garches, France
| | - Bruno Eymard
- Department of Neuromuscular Disorders, CHU Salpêtrière, Paris, 75013, France
| | - Mélinée Frenkian
- INSERM - U974/CNRS UMR7215//UPMC UM76/AIM, Institute of Myology Pitie-Salpetriere, Paris, 73013, France
| | - Jiri Pitha
- Department of Neurology and Clinical Neuroscience Center, 1st Faculty of Medicine, Charles University and General Teaching Hospital, Prague, Czech Republic
| | - Marc de Baets
- Neuroimmunology Group, Division of Neuroscience, School for Mental Health and Neuroscience, Maastricht University, Maastricht, The Netherlands
| | - Laurent Servais
- Institute of Myology, Groupe hospitalier Pitié-Salpêtrière, AP-HP, Sorbonne Universités, UPMC Universités Paris 06, INSERM, Paris, 75013, France
| | - Sonia Berrih-Aknin
- INSERM - U974/CNRS UMR7215//UPMC UM76/AIM, Institute of Myology Pitie-Salpetriere, Paris, 73013, France
| | - Ariel Miller
- Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, 31096, Israel; Division of Neuroimmunology, Lady Davis Carmel Medical Center, Haifa, 34362, Israel.
| |
Collapse
|
255
|
Prognostic value of CA20, a score based on centrosome amplification-associated genes, in breast tumors. Sci Rep 2017; 7:262. [PMID: 28325915 PMCID: PMC5428291 DOI: 10.1038/s41598-017-00363-w] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Accepted: 02/20/2017] [Indexed: 11/08/2022] Open
Abstract
Centrosome amplification (CA) is a hallmark of cancer, observable in ≥75% of breast tumors. CA drives aggressive cellular phenotypes such as chromosomal instability (CIN) and invasiveness. Thus, assessment of CA may offer insights into the prognosis of breast cancer and identify patients who might benefit from centrosome declustering agents. However, it remains unclear whether CA is correlated with clinical outcomes after adjusting for confounding factors. To gain insights, we developed a signature, “CA20”, comprising centrosome structural genes and genes whose dysregulation is implicated in inducing CA. We found that CA20 was a significant independent predictor of worse survival in two large independent datasets after adjusting for potentially confounding factors. In multivariable analyses including both CA20 and CIN25 (a gene expression-based score that correlates with aneuploidy and has prognostic value in many types of cancer), only CA20 was significant, suggesting CA20 captures the risk-predictive information of CIN25 and offers information beyond it. CA20 correlated strongly with CIN25, so a high CA20 score may reflect tumors with high CIN and potentially other aggressive features that may require more aggressive treatment. Finally, we identified processes and pathways differing between CA20-low and high groups that may be valuable therapeutic targets.
Collapse
|
256
|
Huang YF, Gulko B, Siepel A. Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data. Nat Genet 2017; 49:618-624. [PMID: 28288115 PMCID: PMC5395419 DOI: 10.1038/ng.3810] [Citation(s) in RCA: 221] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 02/13/2017] [Indexed: 12/17/2022]
Abstract
Many genetic variants that influence phenotypes of interest are located outside of protein-coding genes, yet existing methods for identifying such variants have poor predictive power. Here we introduce a new computational method, called LINSIGHT, that substantially improves the prediction of noncoding nucleotide sites at which mutations are likely to have deleterious fitness consequences, and which, therefore, are likely to be phenotypically important. LINSIGHT combines a generalized linear model for functional genomic data with a probabilistic model of molecular evolution. The method is fast and highly scalable, enabling it to exploit the 'big data' available in modern genomics. We show that LINSIGHT outperforms the best available methods in identifying human noncoding variants associated with inherited diseases. In addition, we apply LINSIGHT to an atlas of human enhancers and show that the fitness consequences at enhancers depend on cell type, tissue specificity, and constraints at associated promoters.
Collapse
Affiliation(s)
- Yi-Fei Huang
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| | - Brad Gulko
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA.,Graduate Field of Computer Science, Cornell University, Ithaca, New York, USA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, USA
| |
Collapse
|
257
|
Glodzik D, Morganella S, Davies H, Simpson PT, Li Y, Zou X, Diez-Perez J, Staaf J, Alexandrov LB, Smid M, Brinkman AB, Rye IH, Russnes H, Raine K, Purdie CA, Lakhani SR, Thompson AM, Birney E, Stunnenberg HG, van de Vijver MJ, Martens JWM, Børresen-Dale AL, Richardson AL, Kong G, Viari A, Easton D, Evan G, Campbell PJ, Stratton MR, Nik-Zainal S. A somatic-mutational process recurrently duplicates germline susceptibility loci and tissue-specific super-enhancers in breast cancers. Nat Genet 2017; 49:341-348. [PMID: 28112740 PMCID: PMC5988034 DOI: 10.1038/ng.3771] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2016] [Accepted: 12/16/2016] [Indexed: 12/18/2022]
Abstract
Somatic rearrangements contribute to the mutagenized landscape of cancer genomes. Here, we systematically interrogated rearrangements in 560 breast cancers by using a piecewise constant fitting approach. We identified 33 hotspots of large (>100 kb) tandem duplications, a mutational signature associated with homologous-recombination-repair deficiency. Notably, these tandem-duplication hotspots were enriched in breast cancer germline susceptibility loci (odds ratio (OR) = 4.28) and breast-specific 'super-enhancer' regulatory elements (OR = 3.54). These hotspots may be sites of selective susceptibility to double-strand-break damage due to high transcriptional activity or, through incrementally increasing copy number, may be sites of secondary selective pressure. The transcriptomic consequences ranged from strong individual oncogene effects to weak but quantifiable multigene expression effects. We thus present a somatic-rearrangement mutational process affecting coding sequences and noncoding regulatory elements and contributing a continuum of driver consequences, from modest to strong effects, thereby supporting a polygenic model of cancer development.
Collapse
Affiliation(s)
| | | | | | - Peter T Simpson
- The University of Queensland: UQ Centre for Clinical Research and School of Medicine, Brisbane, Queensland, Australia
| | - Yilong Li
- Wellcome Trust Sanger Institute, Cambridge, UK
| | - Xueqing Zou
- Wellcome Trust Sanger Institute, Cambridge, UK
| | | | - Johan Staaf
- Department of Clinical Sciences Lund, Division of Oncology and Pathology, Lund University, Lund, Sweden
| | - Ludmil B Alexandrov
- Wellcome Trust Sanger Institute, Cambridge, UK
- Theoretical Biology and Biophysics (T-6), Los Alamos National Laboratory, Los Alamos, New Mexico, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico, USA
| | - Marcel Smid
- Department of Medical Oncology, Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Arie B Brinkman
- Department of Molecular Biology, Faculties of Science and Medicine, Radboud University, Nijmegen, the Netherlands
| | - Inga Hansine Rye
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Norwegian Radiumhospital, Oslo, Norway
- K.G. Jebsen Centre for Breast Cancer Research, Institute for Clinical Medicine, University of Oslo, Oslo, Norway
| | - Hege Russnes
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Norwegian Radiumhospital, Oslo, Norway
- K.G. Jebsen Centre for Breast Cancer Research, Institute for Clinical Medicine, University of Oslo, Oslo, Norway
| | | | - Colin A Purdie
- Department of Pathology, Ninewells Hospital &Medical School, Dundee, UK
| | - Sunil R Lakhani
- The University of Queensland: UQ Centre for Clinical Research and School of Medicine, Brisbane, Queensland, Australia
- Pathology Queensland, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia
| | - Alastair M Thompson
- Department of Pathology, Ninewells Hospital &Medical School, Dundee, UK
- Department of Breast Surgical Oncology, University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridgeshire, UK
| | - Hendrik G Stunnenberg
- Department of Medical Oncology, Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Erasmus University Medical Center, Rotterdam, the Netherlands
| | | | - John W M Martens
- Department of Medical Oncology, Erasmus MC Cancer Institute and Cancer Genomics Netherlands, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Anne-Lise Børresen-Dale
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Norwegian Radiumhospital, Oslo, Norway
- K.G. Jebsen Centre for Breast Cancer Research, Institute for Clinical Medicine, University of Oslo, Oslo, Norway
| | - Andrea L Richardson
- Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts, USA
- Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Gu Kong
- Department of Pathology, College of Medicine, Hanyang University, Seoul, South Korea
| | - Alain Viari
- Equipe Erable, INRIA Grenoble-Rhône-Alpes, Montbonnot-Saint Martin, France
- Synergie Lyon Cancer, Centre Léon Bérard, Lyon, France
| | - Douglas Easton
- Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Strangeways Research Laboratory, Cambridge, UK
| | - Gerard Evan
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | | | - Serena Nik-Zainal
- Wellcome Trust Sanger Institute, Cambridge, UK
- East Anglian Medical Genetics Service, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| |
Collapse
|
258
|
Zacher B, Michel M, Schwalb B, Cramer P, Tresch A, Gagneur J. Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN. PLoS One 2017; 12:e0169249. [PMID: 28056037 PMCID: PMC5215863 DOI: 10.1371/journal.pone.0169249] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2016] [Accepted: 12/14/2016] [Indexed: 12/22/2022] Open
Abstract
Accurate maps of promoters and enhancers are required for understanding transcriptional regulation. Promoters and enhancers are usually mapped by integration of chromatin assays charting histone modifications, DNA accessibility, and transcription factor binding. However, current algorithms are limited by unrealistic data distribution assumptions. Here we propose GenoSTAN (Genomic STate ANnotation), a hidden Markov model overcoming these limitations. We map promoters and enhancers for 127 cell types and tissues from the ENCODE and Roadmap Epigenomics projects, today’s largest compendium of chromatin assays. Extensive benchmarks demonstrate that GenoSTAN generally identifies promoters and enhancers with significantly higher accuracy than previous methods. Moreover, GenoSTAN-derived promoters and enhancers showed significantly higher enrichment of complex trait-associated genetic variants than current annotations. Altogether, GenoSTAN provides an easy-to-use tool to define promoters and enhancers in any system, and our annotation of human transcriptional cis-regulatory elements constitutes a rich resource for future research in biology and medicine.
Collapse
Affiliation(s)
- Benedikt Zacher
- Gene Center and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität Munich, Germany
- * E-mail: (BZ); (AT); (JG)
| | - Margaux Michel
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Björn Schwalb
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Patrick Cramer
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Achim Tresch
- Department of Biology, University of Cologne, Cologne, Germany
- Max Planck Institute for Plant Breeding Research, Cologne, Germany
- * E-mail: (BZ); (AT); (JG)
| | - Julien Gagneur
- Gene Center and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität Munich, Germany
- * E-mail: (BZ); (AT); (JG)
| |
Collapse
|
259
|
Fishilevich S, Nudel R, Rappaport N, Hadar R, Plaschkes I, Iny Stein T, Rosen N, Kohn A, Twik M, Safran M, Lancet D, Cohen D. GeneHancer: genome-wide integration of enhancers and target genes in GeneCards. Database (Oxford) 2017; 2017:3737828. [PMID: 28605766 DOI: 10.1093/database/bax028/3737828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 03/10/2017] [Indexed: 05/26/2023]
Abstract
UNLABELLED A major challenge in understanding gene regulation is the unequivocal identification of enhancer elements and uncovering their connections to genes. We present GeneHancer, a novel database of human enhancers and their inferred target genes, in the framework of GeneCards. First, we integrated a total of 434 000 reported enhancers from four different genome-wide databases: the Encyclopedia of DNA Elements (ENCODE), the Ensembl regulatory build, the functional annotation of the mammalian genome (FANTOM) project and the VISTA Enhancer Browser. Employing an integration algorithm that aims to remove redundancy, GeneHancer portrays 285 000 integrated candidate enhancers (covering 12.4% of the genome), 94 000 of which are derived from more than one source, and each assigned an annotation-derived confidence score. GeneHancer subsequently links enhancers to genes, using: tissue co-expression correlation between genes and enhancer RNAs, as well as enhancer-targeted transcription factor genes; expression quantitative trait loci for variants within enhancers; and capture Hi-C, a promoter-specific genome conformation assay. The individual scores based on each of these four methods, along with gene–enhancer genomic distances, form the basis for GeneHancer’s combinatorial likelihood-based scores for enhancer–gene pairing. Finally, we define ‘elite’ enhancer–gene relations reflecting both a high-likelihood enhancer definition and a strong enhancer–gene association. GeneHancer predictions are fully integrated in the widely used GeneCards Suite, whereby candidate enhancers and their annotations are displayed on every relevant GeneCard. This assists in the mapping of non-coding variants to enhancers, and via the linked genes, forms a basis for variant–phenotype interpretation of whole-genome sequences in health and disease. DATABASE URL http://www.genecards.org/.
Collapse
Affiliation(s)
- Simon Fishilevich
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Ron Nudel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Noa Rappaport
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Rotem Hadar
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Inbar Plaschkes
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Tsippi Iny Stein
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Naomi Rosen
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Asher Kohn
- LifeMap Sciences Inc, Marshfield, MA 02050, USA
| | - Michal Twik
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Marilyn Safran
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Doron Lancet
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Dana Cohen
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
260
|
Chadaeva IV, Ponomarenko MP, Rasskazov DA, Sharypova EB, Kashina EV, Matveeva MY, Arshinova TV, Ponomarenko PM, Arkova OV, Bondar NP, Savinkova LK, Kolchanov NA. Candidate SNP markers of aggressiveness-related complications and comorbidities of genetic diseases are predicted by a significant change in the affinity of TATA-binding protein for human gene promoters. BMC Genomics 2016; 17:995. [PMID: 28105927 PMCID: PMC5249025 DOI: 10.1186/s12864-016-3353-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Aggressiveness in humans is a hereditary behavioral trait that mobilizes all systems of the body-first of all, the nervous and endocrine systems, and then the respiratory, vascular, muscular, and others-e.g., for the defense of oneself, children, family, shelter, territory, and other possessions as well as personal interests. The level of aggressiveness of a person determines many other characteristics of quality of life and lifespan, acting as a stress factor. Aggressive behavior depends on many parameters such as age, gender, diseases and treatment, diet, and environmental conditions. Among them, genetic factors are believed to be the main parameters that are well-studied at the factual level, but in actuality, genome-wide studies of aggressive behavior appeared relatively recently. One of the biggest projects of the modern science-1000 Genomes-involves identification of single nucleotide polymorphisms (SNPs), i.e., differences of individual genomes from the reference genome. SNPs can be associated with hereditary diseases, their complications, comorbidities, and responses to stress or a drug. Clinical comparisons between cohorts of patients and healthy volunteers (as a control) allow for identifying SNPs whose allele frequencies significantly separate them from one another as markers of the above conditions. Computer-based preliminary analysis of millions of SNPs detected by the 1000 Genomes project can accelerate clinical search for SNP markers due to preliminary whole-genome search for the most meaningful candidate SNP markers and discarding of neutral and poorly substantiated SNPs. RESULTS Here, we combine two computer-based search methods for SNPs (that alter gene expression) {i} Web service SNP_TATA_Comparator (DNA sequence analysis) and {ii} PubMed-based manual search for articles on aggressiveness using heuristic keywords. Near the known binding sites for TATA-binding protein (TBP) in human gene promoters, we found aggressiveness-related candidate SNP markers, including rs1143627 (associated with higher aggressiveness in patients undergoing cytokine immunotherapy), rs544850971 (higher aggressiveness in old women taking lipid-lowering medication), and rs10895068 (childhood aggressiveness-related obesity in adolescence with cardiovascular complications in adulthood). CONCLUSIONS After validation of these candidate markers by clinical protocols, these SNPs may become useful for physicians (may help to improve treatment of patients) and for the general population (a lifestyle choice preventing aggressiveness-related complications).
Collapse
Affiliation(s)
- Irina V. Chadaeva
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
- Novosibirsk State University, 2 Pirogova Street, Novosibirsk, 630090 Russia
| | - Mikhail P. Ponomarenko
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
- Novosibirsk State University, 2 Pirogova Street, Novosibirsk, 630090 Russia
| | - Dmitry A. Rasskazov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
| | - Ekaterina B. Sharypova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
| | - Elena V. Kashina
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
| | - Marina Yu Matveeva
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
| | - Tatjana V. Arshinova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
| | - Petr M. Ponomarenko
- Children’s Hospital Los Angeles, 4640 Hollywood Boulevard, University of Southern California, Los Angeles, CA 90027 USA
| | - Olga V. Arkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
- Vector-Best Inc, Koltsovo, Novosibirsk Region 630559 Russia
| | - Natalia P. Bondar
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
| | - Ludmila K. Savinkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
| | - Nikolay A. Kolchanov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk, 630090 Russia
- Novosibirsk State University, 2 Pirogova Street, Novosibirsk, 630090 Russia
| |
Collapse
|
261
|
Genetic Adaptation and Neandertal Admixture Shaped the Immune System of Human Populations. Cell 2016; 167:643-656.e17. [PMID: 27768888 PMCID: PMC5075285 DOI: 10.1016/j.cell.2016.09.024] [Citation(s) in RCA: 283] [Impact Index Per Article: 35.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Revised: 07/14/2016] [Accepted: 09/15/2016] [Indexed: 12/30/2022]
Abstract
Humans differ in the outcome that follows exposure to life-threatening pathogens, yet the extent of population differences in immune responses and their genetic and evolutionary determinants remain undefined. Here, we characterized, using RNA sequencing, the transcriptional response of primary monocytes from Africans and Europeans to bacterial and viral stimuli-ligands activating Toll-like receptor pathways (TLR1/2, TLR4, and TLR7/8) and influenza virus-and mapped expression quantitative trait loci (eQTLs). We identify numerous cis-eQTLs that contribute to the marked differences in immune responses detected within and between populations and a strong trans-eQTL hotspot at TLR1 that decreases expression of pro-inflammatory genes in Europeans only. We find that immune-responsive regulatory variants are enriched in population-specific signals of natural selection and show that admixture with Neandertals introduced regulatory variants into European genomes, affecting preferentially responses to viral challenges. Together, our study uncovers evolutionarily important determinants of differences in host immune responsiveness between human populations.
Collapse
|
262
|
Aken BL, Achuthan P, Akanni W, Amode MR, Bernsdorff F, Bhai J, Billis K, Carvalho-Silva D, Cummins C, Clapham P, Gil L, Girón CG, Gordon L, Hourlier T, Hunt SE, Janacek SH, Juettemann T, Keenan S, Laird MR, Lavidas I, Maurel T, McLaren W, Moore B, Murphy DN, Nag R, Newman V, Nuhn M, Ong CK, Parker A, Patricio M, Riat HS, Sheppard D, Sparrow H, Taylor K, Thormann A, Vullo A, Walts B, Wilder SP, Zadissa A, Kostadima M, Martin FJ, Muffato M, Perry E, Ruffier M, Staines DM, Trevanion SJ, Cunningham F, Yates A, Zerbino DR, Flicek P. Ensembl 2017. Nucleic Acids Res 2016; 45:D635-D642. [PMID: 27899575 PMCID: PMC5210575 DOI: 10.1093/nar/gkw1104] [Citation(s) in RCA: 409] [Impact Index Per Article: 51.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Revised: 10/25/2016] [Accepted: 10/28/2016] [Indexed: 12/12/2022] Open
Abstract
Ensembl (www.ensembl.org) is a database and genome browser for enabling research on vertebrate genomes. We import, analyse, curate and integrate a diverse collection of large-scale reference data to create a more comprehensive view of genome biology than would be possible from any individual dataset. Our extensive data resources include evidence-based gene and regulatory region annotation, genome variation and gene trees. An accompanying suite of tools, infrastructure and programmatic access methods ensure uniform data analysis and distribution for all supported species. Together, these provide a comprehensive solution for large-scale and targeted genomics applications alike. Among many other developments over the past year, we have improved our resources for gene regulation and comparative genomics, and added CRISPR/Cas9 target sites. We released new browser functionality and tools, including improved filtering and prioritization of genome variation, Manhattan plot visualization for linkage disequilibrium and eQTL data, and an ontology search for phenotypes, traits and disease. We have also enhanced data discovery and access with a track hub registry and a selection of new REST end points. All Ensembl data are freely released to the scientific community and our source code is available via the open source Apache 2.0 license.
Collapse
Affiliation(s)
- Bronwen L Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Premanand Achuthan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Wasiu Akanni
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - M Ridwan Amode
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Friederike Bernsdorff
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jyothish Bhai
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Denise Carvalho-Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Carla Cummins
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Peter Clapham
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Laurent Gil
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Carlos García Girón
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Leo Gordon
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sophie H Janacek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thomas Juettemann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen Keenan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthew R Laird
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ilias Lavidas
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thomas Maurel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - William McLaren
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Benjamin Moore
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel N Murphy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rishi Nag
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Victoria Newman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Michael Nuhn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Chuang Kee Ong
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Anne Parker
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mateus Patricio
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Harpreet Singh Riat
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel Sheppard
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Helen Sparrow
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kieron Taylor
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Anja Thormann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alessandro Vullo
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Brandon Walts
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Steven P Wilder
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Amonida Zadissa
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Myrto Kostadima
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthieu Muffato
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Emily Perry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Magali Ruffier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel M Staines
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen J Trevanion
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Andrew Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel R Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK .,Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
263
|
Javierre BM, Burren OS, Wilder SP, Kreuzhuber R, Hill SM, Sewitz S, Cairns J, Wingett SW, Várnai C, Thiecke MJ, Burden F, Farrow S, Cutler AJ, Rehnström K, Downes K, Grassi L, Kostadima M, Freire-Pritchett P, Wang F, Stunnenberg HG, Todd JA, Zerbino DR, Stegle O, Ouwehand WH, Frontini M, Wallace C, Spivakov M, Fraser P. Lineage-Specific Genome Architecture Links Enhancers and Non-coding Disease Variants to Target Gene Promoters. Cell 2016; 167:1369-1384.e19. [PMID: 27863249 PMCID: PMC5123897 DOI: 10.1016/j.cell.2016.09.037] [Citation(s) in RCA: 648] [Impact Index Per Article: 81.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2016] [Revised: 09/06/2016] [Accepted: 09/22/2016] [Indexed: 12/20/2022]
Abstract
Long-range interactions between regulatory elements and gene promoters play key roles in transcriptional regulation. The vast majority of interactions are uncharted, constituting a major missing link in understanding genome control. Here, we use promoter capture Hi-C to identify interacting regions of 31,253 promoters in 17 human primary hematopoietic cell types. We show that promoter interactions are highly cell type specific and enriched for links between active promoters and epigenetically marked enhancers. Promoter interactomes reflect lineage relationships of the hematopoietic tree, consistent with dynamic remodeling of nuclear architecture during differentiation. Interacting regions are enriched in genetic variants linked with altered expression of genes they contact, highlighting their functional role. We exploit this rich resource to connect non-coding disease variants to putative target promoters, prioritizing thousands of disease-candidate genes and implicating disease pathways. Our results demonstrate the power of primary cell promoter interactomes to reveal insights into genomic regulatory mechanisms underlying common diseases.
Collapse
Affiliation(s)
- Biola M Javierre
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Oliver S Burren
- JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, Department of Medical Genetics, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, UK
| | - Steven P Wilder
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK
| | - Roman Kreuzhuber
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK; Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK
| | - Steven M Hill
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge Biomedical Campus, Cambridge CB2 0SR, UK
| | - Sven Sewitz
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Jonathan Cairns
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Steven W Wingett
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Csilla Várnai
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Michiel J Thiecke
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Frances Burden
- Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK
| | - Samantha Farrow
- Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK
| | - Antony J Cutler
- JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, Department of Medical Genetics, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, UK
| | - Karola Rehnström
- Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK
| | - Kate Downes
- Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK
| | - Luigi Grassi
- Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK
| | - Myrto Kostadima
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK; Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK
| | - Paula Freire-Pritchett
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| | - Fan Wang
- MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge Biomedical Campus, Cambridge CB2 0SR, UK
| | - Hendrik G Stunnenberg
- Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University Nijmegen, Geert Grooteplein Zuid 30, 6525 GA Nijmegen, the Netherlands
| | - John A Todd
- JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, Department of Medical Genetics, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, UK
| | - Daniel R Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK
| | - Oliver Stegle
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK
| | - Willem H Ouwehand
- Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; British Heart Foundation Centre of Excellence, Division of Cardiovascular Medicine, Addenbrooke's Hospital, Hills Road, Cambridge CB2 0QQ, UK; Department of Human Genetics, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, UK
| | - Mattia Frontini
- Department of Haematology, University of Cambridge, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; National Health Service Blood and Transplant, Cambridge Biomedical Campus, Long Road, Cambridge CB2 0PT, UK; British Heart Foundation Centre of Excellence, Division of Cardiovascular Medicine, Addenbrooke's Hospital, Hills Road, Cambridge CB2 0QQ, UK.
| | - Chris Wallace
- JDRF/Wellcome Trust Diabetes and Inflammation Laboratory, Department of Medical Genetics, NIHR Cambridge Biomedical Research Centre, Cambridge Institute for Medical Research, University of Cambridge, Cambridge CB2 0XY, UK; MRC Biostatistics Unit, Cambridge Institute of Public Health, Cambridge Biomedical Campus, Cambridge CB2 0SR, UK; Department of Medicine, University of Cambridge, Addenbrooke's Hospital, Cambridge CB2 0SP, UK.
| | - Mikhail Spivakov
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK.
| | - Peter Fraser
- Nuclear Dynamics Programme, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK.
| |
Collapse
|
264
|
DNA Methylation Dynamics of Human Hematopoietic Stem Cell Differentiation. Cell Stem Cell 2016; 19:808-822. [PMID: 27867036 PMCID: PMC5145815 DOI: 10.1016/j.stem.2016.10.019] [Citation(s) in RCA: 172] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2016] [Revised: 10/04/2016] [Accepted: 10/24/2016] [Indexed: 01/29/2023]
Abstract
Hematopoietic stem cells give rise to all blood cells in a differentiation process that involves widespread epigenome remodeling. Here we present genome-wide reference maps of the associated DNA methylation dynamics. We used a meta-epigenomic approach that combines DNA methylation profiles across many small pools of cells and performed single-cell methylome sequencing to assess cell-to-cell heterogeneity. The resulting dataset identified characteristic differences between HSCs derived from fetal liver, cord blood, bone marrow, and peripheral blood. We also observed lineage-specific DNA methylation between myeloid and lymphoid progenitors, characterized immature multi-lymphoid progenitors, and detected progressive DNA methylation differences in maturing megakaryocytes. We linked these patterns to gene expression, histone modifications, and chromatin accessibility, and we used machine learning to derive a model of human hematopoietic differentiation directly from DNA methylation data. Our results contribute to a better understanding of human hematopoietic stem cell differentiation and provide a framework for studying blood-linked diseases. Sequencing provides DNA methylation maps of hematopoietic stem and progenitor cells Methylation differs in HSCs from fetal liver, bone marrow, cord, and peripheral blood Myeloid and lymphoid progenitors are distinguished by enhancer-linked DNA methylation Machine learning enables data-driven reconstruction of the hematopoietic lineage
Collapse
|
265
|
Pearson H, Daouda T, Granados DP, Durette C, Bonneil E, Courcelles M, Rodenbrock A, Laverdure JP, Côté C, Mader S, Lemieux S, Thibault P, Perreault C. MHC class I-associated peptides derive from selective regions of the human genome. J Clin Invest 2016; 126:4690-4701. [PMID: 27841757 DOI: 10.1172/jci88590] [Citation(s) in RCA: 136] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2016] [Accepted: 09/30/2016] [Indexed: 12/24/2022] Open
Abstract
MHC class I-associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology.
Collapse
|
266
|
Inoue F, Kircher M, Martin B, Cooper GM, Witten DM, McManus MT, Ahituv N, Shendure J. A systematic comparison reveals substantial differences in chromosomal versus episomal encoding of enhancer activity. Genome Res 2016; 27:38-52. [PMID: 27831498 PMCID: PMC5204343 DOI: 10.1101/gr.212092.116] [Citation(s) in RCA: 182] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Accepted: 11/08/2016] [Indexed: 11/24/2022]
Abstract
Candidate enhancers can be identified on the basis of chromatin modifications, the binding of chromatin modifiers and transcription factors and cofactors, or chromatin accessibility. However, validating such candidates as bona fide enhancers requires functional characterization, typically achieved through reporter assays that test whether a sequence can increase expression of a transcriptional reporter via a minimal promoter. A longstanding concern is that reporter assays are mainly implemented on episomes, which are thought to lack physiological chromatin. However, the magnitude and determinants of differences in cis-regulation for regulatory sequences residing in episomes versus chromosomes remain almost completely unknown. To address this systematically, we developed and applied a novel lentivirus-based massively parallel reporter assay (lentiMPRA) to directly compare the functional activities of 2236 candidate liver enhancers in an episomal versus a chromosomally integrated context. We find that the activities of chromosomally integrated sequences are substantially different from the activities of the identical sequences assayed on episomes, and furthermore are correlated with different subsets of ENCODE annotations. The results of chromosomally based reporter assays are also more reproducible and more strongly predictable by both ENCODE annotations and sequence-based models. With a linear model that combines chromatin annotations and sequence information, we achieve a Pearson's R2 of 0.362 for predicting the results of chromosomally integrated reporter assays. This level of prediction is better than with either chromatin annotations or sequence information alone and also outperforms predictive models of episomal assays. Our results have broad implications for how cis-regulatory elements are identified, prioritized and functionally validated.
Collapse
Affiliation(s)
- Fumitaka Inoue
- Department of Bioengineering and Therapeutic Sciences, Institute for Human Genetics, University of California San Francisco, San Francisco, California 94158, USA
| | - Martin Kircher
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Beth Martin
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Gregory M Cooper
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Daniela M Witten
- Departments of Statistics and Biostatistics, University of Washington, Seattle, Washington 98195, USA
| | - Michael T McManus
- Department of Microbiology and Immunology, UCSF Diabetes Center, Keck Center for Noncoding RNA, University of California, San Francisco, San Francisco, California 94143, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, Institute for Human Genetics, University of California San Francisco, San Francisco, California 94158, USA
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, Seattle, Washington 98195, USA
| |
Collapse
|
267
|
Kado S, Chang WLW, Chi AN, Wolny M, Shepherd DM, Vogel CFA. Aryl hydrocarbon receptor signaling modifies Toll-like receptor-regulated responses in human dendritic cells. Arch Toxicol 2016; 91:2209-2221. [PMID: 27783115 DOI: 10.1007/s00204-016-1880-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2016] [Accepted: 10/20/2016] [Indexed: 01/04/2023]
Abstract
Currently, it is not well understood how ligands of the aryl hydrocarbon receptor (AhR) modify inflammatory responses triggered by Toll-like receptor (TLR) agonists in human dendritic cells (DCs). Here, we show that AhR ligands 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD), the tryptophan derivatives 6-formylindolo[3,2-b] carbazole (FICZ), kynurenine (kyn), and the natural dietary compound indole-3-carbinol (I3C) differentially modify cytokine expression in human monocyte-derived DCs (MoDCs). The results show that TLR-activated MoDCs express higher levels of AhR and are more sensitive toward the effects of AhR ligands. Depending on the cytokine, treatment with AhR ligands led to a synergistic or antagonistic effect of the TLR-triggered response in MoDCs. Thus, activation of AhR increased the expression of interleukin (IL)-1β, but decreased the expression of IL-12A in TLR-activated MoDCs. Furthermore, TCDD and FICZ may have opposite effects on the expression of cytochrome P4501A1 (CYP1A1) in TLR8-activated MoDCs indicating that the effect of the specific AhR ligand may depend on the presence of the specific TLR agonist. Gene silencing showed that synergistic effects of AhR ligands on TLR-induced expression of IL-1β require a functional AhR and the expression of NF-κB RelB. On the other hand, repression of IL-12A by TCDD and FICZ involved the induction of the caudal type homeobox 2 (CDX2) transcription factor. Additionally, the levels of DC surface markers were decreased in MoDCs by TCDD, FICZ and I3C, but not by kyn. Overall, these data demonstrate that AhR modulates TLR-induced expression of cytokines and DC-specific surface markers in MoDCs involving NFκB RelB and the immune regulatory factor CDX2.
Collapse
Affiliation(s)
- Sarah Kado
- Center for Health and the Environment, University of California Davis, One Shields Avenue, Davis, CA 95616, USA
| | - W L William Chang
- Center for Comparative Medicine, University of California, One Shields Avenue, Davis, CA 95616, USA
| | - Aimy Nguyen Chi
- Center for Health and the Environment, University of California Davis, One Shields Avenue, Davis, CA 95616, USA
| | - Monika Wolny
- Center for Health and the Environment, University of California Davis, One Shields Avenue, Davis, CA 95616, USA
| | - David M Shepherd
- Department of Biomedical and Pharmaceutical Sciences, University of Montana, Missoula, MT, 59812, USA
| | - Christoph F A Vogel
- Center for Health and the Environment, University of Montana, Missoula, MT, 59812, USA. .,Department of Environmental Toxicology, University of California, Davis, One Shields Avenue, Davis, CA, 95616, USA.
| |
Collapse
|
268
|
Banday VS, Lejon K. Elevated systemic glutamic acid level in the non-obese diabetic mouse is Idd linked and induces beta cell apoptosis. Immunology 2016; 150:162-171. [PMID: 27649685 DOI: 10.1111/imm.12674] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Revised: 09/09/2016] [Accepted: 09/14/2016] [Indexed: 12/29/2022] Open
Abstract
Although type 1 diabetes (T1D) is a T-cell-mediated disease in the effector stage, the mechanism behind the initial beta cell assault is less understood. Metabolomic differences, including elevated levels of glutamic acid, have been observed in patients with T1D before disease onset, as well as in pre-diabetic non-obese diabetic (NOD) mice. Increased levels of glutamic acid damage both neurons and beta cells, implying that this could contribute to the initial events of T1D pathogenesis. We investigated the underlying genetic factors and consequences of the increased levels of glutamic acid in NOD mice. Serum glutamic acid levels from a (NOD×B6)F2 cohort (n = 182) were measured. By genome-wide and Idd region targeted microsatellite mapping, genetic association was detected for six regions including Idd2, Idd4 and Idd22. In silico analysis of potential enzymes and transporters located in and around the mapped regions that are involved in glutamic acid metabolism consisted of alanine aminotransferase, glutamic-oxaloacetic transaminase, aldehyde dehydrogenase 18 family, alutamyl-prolyl-tRNA synthetase, glutamic acid transporters GLAST and EAAC1. Increased EAAC1 protein expression was observed in lysates from livers of NOD mice compared with B6 mice. Functional consequence of the elevated glutamic acid level in NOD mice was tested by culturing NOD. Rag2-/- Langerhans' islets with glutamic acid. Induction of apoptosis of the islets was detected upon glutamic acid challenge using TUNEL assay. Our results support the notion that a dysregulated metabolome could contribute to the initiation of T1D. We suggest that targeting of the increased glutamic acid in pre-diabetic patients could be used as a potential therapy.
Collapse
Affiliation(s)
- Viqar Showkat Banday
- Division of Immunology, Department of Clinical Microbiology, Umeå University, Umeå, Sweden
| | - Kristina Lejon
- Division of Immunology, Department of Clinical Microbiology, Umeå University, Umeå, Sweden
| |
Collapse
|
269
|
Laverrière JN, L'Hôte D, Tabouy L, Schang AL, Quérat B, Cohen-Tannoudji J. Epigenetic regulation of alternative promoters and enhancers in progenitor, immature, and mature gonadotrope cell lines. Mol Cell Endocrinol 2016; 434:250-65. [PMID: 27402603 DOI: 10.1016/j.mce.2016.07.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Revised: 07/05/2016] [Accepted: 07/05/2016] [Indexed: 11/25/2022]
Abstract
Gonadotrope cell identity genes emerge in a stepwise process during mouse pituitary development. Cga, encoding for the α-subunit of TSH, LH, and FSH, is initially detected at E11.5 followed by Gnrhr and steroidogenic factor Sf1 at E13.5, specifying cells engaged in a gonadotrope cell fate. Lhb and Fshb appear at E16.5 and 17.5, respectively, typifying differentiated gonadotrope cells. Using the αT1-1, αT3-1 and LβT2 cell lines recapitulating these stages of gonadotrope differentiation, DNA methylation at Gnrhr and Sf1 was investigated. Regulatory regions were found hypermethylated in progenitor αT1-1 cells and hypomethylated in differentiated LβT2 cells. Abundance of RNA polymerase II together with active histone modifications including H3K4me1, H3K4me3, and H3K27ac were strictly correlated with DNA hypomethylation. Analyses of epigenomic modifications and chromatin accessibility were further extended to Isl1, Lhx3, Gata2, and Pitx2, highlighting alternative usages of specific regulatory gene domains in progenitor αT1-1, immature αT3-1, and mature LβT2 gonadotrope cells.
Collapse
Affiliation(s)
- Jean-Noël Laverrière
- Univ Paris Diderot, Sorbonne Paris Cité, Biologie Fonctionnelle et Adaptative (BFA), F-75013, Paris, France; CNRS UMR 8251, F-75013, Paris, France; Physiologie de l'axe gonadotrope INSERM U1133, F-75013, Paris, France.
| | - David L'Hôte
- Univ Paris Diderot, Sorbonne Paris Cité, Biologie Fonctionnelle et Adaptative (BFA), F-75013, Paris, France; CNRS UMR 8251, F-75013, Paris, France; Physiologie de l'axe gonadotrope INSERM U1133, F-75013, Paris, France
| | - Laure Tabouy
- Univ Paris Diderot, Sorbonne Paris Cité, Biologie Fonctionnelle et Adaptative (BFA), F-75013, Paris, France; CNRS UMR 8251, F-75013, Paris, France; Physiologie de l'axe gonadotrope INSERM U1133, F-75013, Paris, France
| | - Anne-Laure Schang
- Univ Paris Diderot, Sorbonne Paris Cité, Biologie Fonctionnelle et Adaptative (BFA), F-75013, Paris, France; CNRS UMR 8251, F-75013, Paris, France; Physiologie de l'axe gonadotrope INSERM U1133, F-75013, Paris, France
| | - Bruno Quérat
- Univ Paris Diderot, Sorbonne Paris Cité, Biologie Fonctionnelle et Adaptative (BFA), F-75013, Paris, France; CNRS UMR 8251, F-75013, Paris, France; Physiologie de l'axe gonadotrope INSERM U1133, F-75013, Paris, France
| | - Joëlle Cohen-Tannoudji
- Univ Paris Diderot, Sorbonne Paris Cité, Biologie Fonctionnelle et Adaptative (BFA), F-75013, Paris, France; CNRS UMR 8251, F-75013, Paris, France; Physiologie de l'axe gonadotrope INSERM U1133, F-75013, Paris, France
| |
Collapse
|
270
|
Streeter I, Harrison PW, Faulconbridge A, Flicek P, Parkinson H, Clarke L. The human-induced pluripotent stem cell initiative-data resources for cellular genetics. Nucleic Acids Res 2016; 45:D691-D697. [PMID: 27733501 PMCID: PMC5210631 DOI: 10.1093/nar/gkw928] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2016] [Revised: 09/30/2016] [Accepted: 10/05/2016] [Indexed: 01/09/2023] Open
Abstract
The Human Induced Pluripotent Stem Cell Initiative (HipSci) isf establishing a large catalogue of human iPSC lines, arguably the most well characterized collection to date. The HipSci portal enables researchers to choose the right cell line for their experiment, and makes HipSci's rich catalogue of assay data easy to discover and reuse. Each cell line has genomic, transcriptomic, proteomic and cellular phenotyping data. Data are deposited in the appropriate EMBL-EBI archives, including the European Nucleotide Archive (ENA), European Genome-phenome Archive (EGA), ArrayExpress and PRoteomics IDEntifications (PRIDE) databases. The project will make 500 cell lines from healthy individuals, and from 150 patients with rare genetic diseases; these will be available through the European Collection of Authenticated Cell Cultures (ECACC). As of August 2016, 238 cell lines are available for purchase. Project data is presented through the HipSci data portal (http://www.hipsci.org/lines) and is downloadable from the associated FTP site (ftp://ftp.hipsci.ebi.ac.uk/vol1/ftp). The data portal presents a summary matrix of the HipSci cell lines, showing available data types. Each line has its own page containing descriptive metadata, quality information, and links to archived assay data. Analysis results are also available in a Track Hub, allowing visualization in the context of public genomic annotations (http://www.hipsci.org/data/trackhubs).
Collapse
Affiliation(s)
- Ian Streeter
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Peter W Harrison
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Adam Faulconbridge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Helen Parkinson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Laura Clarke
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
271
|
Forbes SA, Beare D, Bindal N, Bamford S, Ward S, Cole CG, Jia M, Kok C, Boutselakis H, De T, Sondka Z, Ponting L, Stefancsik R, Harsha B, Tate J, Dawson E, Thompson S, Jubb H, Campbell PJ. COSMIC: High-Resolution Cancer Genetics Using the Catalogue of Somatic Mutations in Cancer. ACTA ACUST UNITED AC 2016; 91:10.11.1-10.11.37. [PMID: 27727438 DOI: 10.1002/cphg.21] [Citation(s) in RCA: 94] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
COSMIC (http://cancer.sanger.ac.uk) is an expert-curated database of somatic mutations in human cancer. Broad and comprehensive in scope, recent releases in 2016 describe over 4 million coding mutations across all human cancer disease types. Mutations are annotated across the entire genome, but expert curation is focused on over 400 key cancer genes. Now encompassing the majority of molecular mutation mechanisms in oncogenetics, COSMIC additionally describes 10 million non-coding mutations, 1 million copy-number aberrations, 9 million gene-expression variants, and almost 8 million differentially methylated CpGs. This information combines a consistent interpretation of the data from the major cancer genome consortia and cancer genome literature with exhaustive hand curation of over 22,000 gene-specific literature publications. This unit describes the graphical Web site in detail; alternative protocols overview other ways the entire database can be accessed, analyzed, and downloaded. © 2016 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- S A Forbes
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - D Beare
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - N Bindal
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - S Bamford
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - S Ward
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - C G Cole
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - M Jia
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - C Kok
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - H Boutselakis
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - T De
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Z Sondka
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - L Ponting
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - R Stefancsik
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - B Harsha
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - J Tate
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - E Dawson
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - S Thompson
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - H Jubb
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - P J Campbell
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| |
Collapse
|
272
|
Smedley D, Schubach M, Jacobsen J, Köhler S, Zemojtel T, Spielmann M, Jäger M, Hochheiser H, Washington N, McMurry J, Haendel M, Mungall C, Lewis S, Groza T, Valentini G, Robinson P. A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease. Am J Hum Genet 2016; 99:595-606. [PMID: 27569544 DOI: 10.1016/j.ajhg.2016.07.005] [Citation(s) in RCA: 171] [Impact Index Per Article: 21.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Accepted: 07/01/2016] [Indexed: 12/17/2022] Open
Abstract
The interpretation of non-coding variants still constitutes a major challenge in the application of whole-genome sequencing in Mendelian disease, especially for single-nucleotide and other small non-coding variants. Here we present Genomiser, an analysis framework that is able not only to score the relevance of variation in the non-coding genome, but also to associate regulatory variants to specific Mendelian diseases. Genomiser scores variants through either existing methods such as CADD or a bespoke machine learning method and combines these with allele frequency, regulatory sequences, chromosomal topological domains, and phenotypic relevance to discover variants associated to specific Mendelian disorders. Overall, Genomiser is able to identify causal regulatory variants as the top candidate in 77% of simulated whole genomes, allowing effective detection and discovery of regulatory variants in Mendelian disease.
Collapse
|
273
|
Candidate SNP Markers of Chronopathologies Are Predicted by a Significant Change in the Affinity of TATA-Binding Protein for Human Gene Promoters. BIOMED RESEARCH INTERNATIONAL 2016; 2016:8642703. [PMID: 27635400 PMCID: PMC5011241 DOI: 10.1155/2016/8642703] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Revised: 06/25/2016] [Accepted: 06/28/2016] [Indexed: 01/14/2023]
Abstract
Variations in human genome (e.g., single nucleotide polymorphisms, SNPs) may be associated with hereditary diseases, their complications, comorbidities, and drug responses. Using Web service SNP_TATA_Comparator presented in our previous paper, here we analyzed immediate surroundings of known SNP markers of diseases and identified several candidate SNP markers that can significantly change the affinity of TATA-binding protein for human gene promoters, with circadian consequences. For example, rs572527200 may be related to asthma, where symptoms are circadian (worse at night), and rs367732974 may be associated with heart attacks that are characterized by a circadian preference (early morning). By the same method, we analyzed the 90 bp proximal promoter region of each protein-coding transcript of each human gene of the circadian clock core. This analysis yielded 53 candidate SNP markers, such as rs181985043 (susceptibility to acute Q fever in male patients), rs192518038 (higher risk of a heart attack in patients with diabetes), and rs374778785 (emphysema and lung cancer in smokers). If they are properly validated according to clinical standards, these candidate SNP markers may turn out to be useful for physicians (to select optimal treatment for each patient) and for the general population (to choose a lifestyle preventing possible circadian complications of diseases).
Collapse
|
274
|
Wolthers BO, Frandsen TL, Abrahamsson J, Albertsen BK, Helt LR, Heyman M, Jónsson ÓG, Kõrgvee LT, Lund B, Raja RA, Rasmussen KK, Taskinen M, Tulstrup M, Vaitkevičienė GE, Yadav R, Gupta R, Schmiegelow K. Asparaginase-associated pancreatitis: a study on phenotype and genotype in the NOPHO ALL2008 protocol. Leukemia 2016; 31:325-332. [PMID: 27451978 DOI: 10.1038/leu.2016.203] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2016] [Revised: 06/12/2016] [Accepted: 06/15/2016] [Indexed: 12/17/2022]
Abstract
Asparaginase (ASP)-associated pancreatitis (AAP) occurs during acute lymphoblastic leukemia treatment. Among 1285 children (1.0-17.9 years) diagnosed during July 2008-December 2014 and treated according to the Nordic/Baltic ALL2008 protocol, 86 (cumulative incidence=6.8%) developed AAP. Seventy-three cases were severe (diagnostic AAP criteria persisting >72 h) and 13 mild. Cases were older than controls (median: 6.5 vs 4.5 years; P=0.001). Pseudocysts developed in 28%. Of the 20 re-exposed to ASP, 9 (45%) developed a second AAP. After a median follow-up of 2.3 years, 8% needed permanent insulin therapy, and 7% had recurrent abdominal pain. Germline DNA on 62 cases and 638 controls was genotyped on Omni2.5exome-8-v1.2 BeadChip arrays. Overall, the ULK2 variant rs281366 showed the strongest association with AAP (P=5.8 × 10-7; odds ratio (OR)=6.7). Cases with the rs281366 variant were younger (4.3 vs 8 years; P=0.015) and had lower risk of AAP-related complications (15% vs 43%; P=0.13) compared with cases without this variant. Among 45 cases and 517 controls <10 years, the strongest associations with AAP were found for RGS6 variant rs17179470 (P=9.8 × 10-9; OR=7.3). Rs281366 is located in the ULK2 gene involved in autophagy, and RGS6 regulates G-protein signaling regulating cell dynamics. More than 50% of AAP cases <10 years carried one or both risk alleles.
Collapse
Affiliation(s)
- B O Wolthers
- Department of Pediatrics and Adolescent Medicine, University Hospital Rigshospitalet, Copenhagen, Denmark
| | - T L Frandsen
- Department of Pediatrics and Adolescent Medicine, University Hospital Rigshospitalet, Copenhagen, Denmark
| | - J Abrahamsson
- Department of Clinical Sciences, Queen Silvia's Children's Hospital, Gothenburg, Sweden
| | - B K Albertsen
- Department of Paediatrics, Aarhus University Hospital Skejby, Aarhus, Denmark
| | - L R Helt
- Department of Pediatrics and Adolescent Medicine, University Hospital Rigshospitalet, Copenhagen, Denmark
| | - M Heyman
- Department of Pediatrics, Astrid Lindgrens Hospital, Stockholm, Sweden
| | - Ó G Jónsson
- Children's Hospital, Landspitali University Hospital, Reykjavík, Iceland
| | | | - B Lund
- Department of Paediatrics, St Olavs University Hospital, Trondheim, Norway.,Department of Laboratory Medicine, Children's and Women's Health, Medical Faculty, The Norwegian University of Science and Technology, Trondheim, Norway
| | - R A Raja
- Department of Pediatrics and Adolescent Medicine, University Hospital Rigshospitalet, Copenhagen, Denmark
| | - K K Rasmussen
- Department of Pediatrics and Adolescent Medicine, University Hospital Rigshospitalet, Copenhagen, Denmark
| | - M Taskinen
- Helsinki University Central Hospital, University of Helsinki, Helsinki, Finland
| | - M Tulstrup
- Department of Pediatrics and Adolescent Medicine, University Hospital Rigshospitalet, Copenhagen, Denmark
| | - G E Vaitkevičienė
- Clinic of Children's Diseases, Faculty of Medicine, Vilnius University, Vilnius, Lithuania
| | - R Yadav
- Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark
| | - R Gupta
- Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark
| | - K Schmiegelow
- Department of Pediatrics and Adolescent Medicine, University Hospital Rigshospitalet, Copenhagen, Denmark.,Institute of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
275
|
Vogel CF, Chang WW, Kado S, McCulloh K, Vogel H, Wu D, Haarmann-Stemmann T, Yang G, Leung PS, Matsumura F, Gershwin ME. Transgenic Overexpression of Aryl Hydrocarbon Receptor Repressor (AhRR) and AhR-Mediated Induction of CYP1A1, Cytokines, and Acute Toxicity. ENVIRONMENTAL HEALTH PERSPECTIVES 2016; 124:1071-1083. [PMID: 26862745 PMCID: PMC4937866 DOI: 10.1289/ehp.1510194] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Revised: 08/03/2015] [Accepted: 01/13/2016] [Indexed: 05/30/2023]
Abstract
BACKGROUND The aryl hydrocarbon receptor repressor (AhRR) is known to repress aryl hydrocarbon receptor (AhR) signaling, but very little is known regarding the role of the AhRR in vivo. OBJECTIVE This study tested the role of AhRR in vivo in AhRR overexpressing mice on molecular and toxic end points mediated through a prototypical AhR ligand. METHODS We generated AhRR-transgenic mice (AhRR Tg) based on the genetic background of C57BL/6J wild type (wt) mice. We tested the effect of the prototypical AhR ligand 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) on the expression of cytochrome P450 (CYP)1A1 and cytokines in various tissues of mice. We next analyzed the infiltration of immune cells in adipose tissue of mice after treatment with TCDD using flow cytometry. RESULTS AhRR Tg mice express significantly higher levels of AhRR compared to wt mice. Activation of AhR by TCDD caused a significant increase of the inflammatory cytokines Interleukin (IL)-1β, IL-6 and IL-10, and CXCL chemokines in white epididymal adipose tissue from both wt and AhRR Tg mice. However, the expression of IL-1β, CXCL2 and CXCL3 were significantly lower in AhRR Tg versus wt mice following TCDD treatment. Exposure to TCDD caused a rapid accumulation of neutrophils and macrophages in white adipose tissue of wt and AhRR Tg mice. Furthermore we found that male AhRR Tg mice were protected from high-dose TCDD-induced lethality associated with a reduced inflammatory response and liver damage as indicated by lower levels of TCDD-induced alanine aminotransferase and hepatic triglycerides. Females from both wt and AhRR Tg mice were less sensitive than male mice to acute toxicity induced by TCDD. CONCLUSION In conclusion, the current study identifies AhRR as a previously uncharacterized regulator of specific inflammatory cytokines, which may protect from acute toxicity induced by TCDD. CITATION Vogel CF, Chang WL, Kado S, McCulloh K, Vogel H, Wu D, Haarmann-Stemmann T, Yang GX, Leung PS, Matsumura F, Gershwin ME. 2016. Transgenic overexpression of aryl hydrocarbon receptor repressor (AhRR) and AhR-mediated induction of CYP1A1, cytokines, and acute toxicity. Environ Health Perspect 124:1071-1083; http://dx.doi.org/10.1289/ehp.1510194.
Collapse
Affiliation(s)
| | - W.L. William Chang
- Center for Comparative Medicine, University of California, Davis, Davis, California, USA
| | | | | | | | - Dalei Wu
- Center for Health and the Environment,
| | | | - GuoXiang Yang
- Division of Rheumatology, Allergy and Clinical Immunology, University of California, Davis, Davis, California, USA
| | - Patrick S.C. Leung
- Division of Rheumatology, Allergy and Clinical Immunology, University of California, Davis, Davis, California, USA
| | - Fumio Matsumura
- Department of Environmental Toxicology,
- Center for Health and the Environment,
| | - M. Eric Gershwin
- Division of Rheumatology, Allergy and Clinical Immunology, University of California, Davis, Davis, California, USA
| |
Collapse
|
276
|
Gammie SC, Driessen TM, Zhao C, Saul MC, Eisinger BE. Genetic and neuroendocrine regulation of the postpartum brain. Front Neuroendocrinol 2016; 42:1-17. [PMID: 27184829 PMCID: PMC5030130 DOI: 10.1016/j.yfrne.2016.05.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Revised: 04/11/2016] [Accepted: 05/13/2016] [Indexed: 12/11/2022]
Abstract
Changes in expression of hundreds of genes occur during the production and function of the maternal brain that support a wide range of processes. In this review, we synthesize findings from four microarray studies of different maternal brain regions and identify a core group of 700 maternal genes that show significant expression changes across multiple regions. With those maternal genes, we provide new insights into reward-related pathways (maternal bonding), postpartum depression, social behaviors, mental health disorders, and nervous system plasticity/developmental events. We also integrate the new genes into well-studied maternal signaling pathways, including those for prolactin, oxytocin/vasopressin, endogenous opioids, and steroid receptors (estradiol, progesterone, cortisol). A newer transcriptional regulation model for the maternal brain is provided that incorporates recent work on maternal microRNAs. We also compare the top 700 genes with other maternal gene expression studies. Together, we highlight new genes and new directions for studies on the postpartum brain.
Collapse
Affiliation(s)
- Stephen C Gammie
- Department of Zoology, University of Wisconsin-Madison, Madison, WI, USA; Neuroscience Training Program, University of Wisconsin-Madison, Madison, WI, USA.
| | - Terri M Driessen
- Department of Zoology, University of Wisconsin-Madison, Madison, WI, USA
| | - Changjiu Zhao
- Department of Zoology, University of Wisconsin-Madison, Madison, WI, USA
| | - Michael C Saul
- Department of Zoology, University of Wisconsin-Madison, Madison, WI, USA
| | - Brian E Eisinger
- Department of Zoology, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
277
|
Contino G, Eldridge MD, Secrier M, Bower L, Fels Elliott R, Weaver J, Lynch AG, Edwards PA, Fitzgerald RC. Whole-genome sequencing of nine esophageal adenocarcinoma cell lines. F1000Res 2016; 5:1336. [PMID: 27594985 PMCID: PMC4991527 DOI: 10.12688/f1000research.7033.1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/01/2016] [Indexed: 11/20/2022] Open
Abstract
Esophageal adenocarcinoma (EAC) is highly mutated and molecularly heterogeneous. The number of cell lines available for study is limited and their genome has been only partially characterized. The availability of an accurate annotation of their mutational landscape is crucial for accurate experimental design and correct interpretation of genotype-phenotype findings. We performed high coverage, paired end whole genome sequencing on eight EAC cell lines-ESO26, ESO51, FLO-1, JH-EsoAd1, OACM5.1 C, OACP4 C, OE33, SK-GT-4-all verified against original patient material, and one esophageal high grade dysplasia cell line, CP-D. We have made available the aligned sequence data and report single nucleotide variants (SNVs), small insertions and deletions (indels), and copy number alterations, identified by comparison with the human reference genome and known single nucleotide polymorphisms (SNPs). We compare these putative mutations to mutations found in primary tissue EAC samples, to inform the use of these cell lines as a model of EAC.
Collapse
Affiliation(s)
- Gianmarco Contino
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Matthew D. Eldridge
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
| | - Maria Secrier
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
| | - Lawrence Bower
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
| | - Rachael Fels Elliott
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Jamie Weaver
- Medical Research Council (MRC) Cancer Unit, University of Cambridge, Cambridge, UK
| | - Andy G. Lynch
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
| | | | | |
Collapse
|
278
|
McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, Flicek P, Cunningham F. The Ensembl Variant Effect Predictor. Genome Biol 2016. [PMID: 27268795 DOI: 10.1186/s13059-016–0974-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
The Ensembl Variant Effect Predictor is a powerful toolset for the analysis, annotation, and prioritization of genomic variants in coding and non-coding regions. It provides access to an extensive collection of genomic annotation, with a variety of interfaces to suit different requirements, and simple options for configuring and extending analysis. It is open source, free to use, and supports full reproducibility of results. The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs.
Collapse
Affiliation(s)
- William McLaren
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | - Laurent Gil
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Harpreet Singh Riat
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Graham R S Ritchie
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Anja Thormann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| |
Collapse
|
279
|
McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GRS, Thormann A, Flicek P, Cunningham F. The Ensembl Variant Effect Predictor. Genome Biol 2016; 17:122. [PMID: 27268795 PMCID: PMC4893825 DOI: 10.1186/s13059-016-0974-4] [Citation(s) in RCA: 4329] [Impact Index Per Article: 541.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 05/03/2016] [Indexed: 02/06/2023] Open
Abstract
The Ensembl Variant Effect Predictor is a powerful toolset for the analysis, annotation, and prioritization of genomic variants in coding and non-coding regions. It provides access to an extensive collection of genomic annotation, with a variety of interfaces to suit different requirements, and simple options for configuring and extending analysis. It is open source, free to use, and supports full reproducibility of results. The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs.
Collapse
Affiliation(s)
- William McLaren
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | - Laurent Gil
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Harpreet Singh Riat
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Graham R S Ritchie
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Anja Thormann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| |
Collapse
|
280
|
Lundberg SM, Tu WB, Raught B, Penn LZ, Hoffman MM, Lee SI. ChromNet: Learning the human chromatin network from all ENCODE ChIP-seq data. Genome Biol 2016; 17:82. [PMID: 27139377 PMCID: PMC4852466 DOI: 10.1186/s13059-016-0925-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 03/15/2016] [Indexed: 01/12/2023] Open
Abstract
A cell's epigenome arises from interactions among regulatory factors-transcription factors and histone modifications-co-localized at particular genomic regions. We developed a novel statistical method, ChromNet, to infer a network of these interactions, the chromatin network, by inferring conditional-dependence relationships among a large number of ChIP-seq data sets. We applied ChromNet to all available 1451 ChIP-seq data sets from the ENCODE Project, and showed that ChromNet revealed previously known physical interactions better than alternative approaches. We experimentally validated one of the previously unreported interactions, MYC-HCFC1. An interactive visualization tool is available at http://chromnet.cs.washington.edu.
Collapse
Affiliation(s)
- Scott M Lundberg
- Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - William B Tu
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Brian Raught
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Linda Z Penn
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Michael M Hoffman
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Su-In Lee
- Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA. .,Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| |
Collapse
|
281
|
Ponomarenko MP, Arkova O, Rasskazov D, Ponomarenko P, Savinkova L, Kolchanov N. Candidate SNP Markers of Gender-Biased Autoimmune Complications of Monogenic Diseases Are Predicted by a Significant Change in the Affinity of TATA-Binding Protein for Human Gene Promoters. Front Immunol 2016; 7:130. [PMID: 27092142 PMCID: PMC4819121 DOI: 10.3389/fimmu.2016.00130] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2015] [Accepted: 03/21/2016] [Indexed: 12/17/2022] Open
Abstract
Some variations of human genome [for example, single nucleotide polymorphisms (SNPs)] are markers of hereditary diseases and drug responses. Analysis of them can help to improve treatment. Computer-based analysis of millions of SNPs in the 1000 Genomes project makes a search for SNP markers more targeted. Here, we combined two computer-based approaches: DNA sequence analysis and keyword search in databases. In the binding sites for TATA-binding protein (TBP) in human gene promoters, we found candidate SNP markers of gender-biased autoimmune diseases, including rs1143627 [cachexia in rheumatoid arthritis (double prevalence among women)]; rs11557611 [demyelinating diseases (thrice more prevalent among young white women than among non-white individuals)]; rs17231520 and rs569033466 [both: atherosclerosis comorbid with related diseases (double prevalence among women)]; rs563763767 [Hughes syndrome-related thrombosis (lethal during pregnancy)]; rs2814778 [autoimmune diseases (excluding multiple sclerosis and rheumatoid arthritis) underlying hypergammaglobulinemia in women]; rs72661131 and rs562962093 (both: preterm delivery in pregnant diabetic women); and rs35518301, rs34166473, rs34500389, rs33981098, rs33980857, rs397509430, rs34598529, rs33931746, rs281864525, and rs63750953 (all: autoimmune diseases underlying hypergammaglobulinemia in women). Validation of these predicted candidate SNP markers using the clinical standards may advance personalized medicine.
Collapse
Affiliation(s)
- Mikhail P. Ponomarenko
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Olga Arkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Dmitry Rasskazov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | | | - Ludmila Savinkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
| | - Nikolay Kolchanov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| |
Collapse
|
282
|
Zerbino DR, Johnson N, Juetteman T, Sheppard D, Wilder SP, Lavidas I, Nuhn M, Perry E, Raffaillac-Desfosses Q, Sobral D, Keefe D, Gräf S, Ahmed I, Kinsella R, Pritchard B, Brent S, Amode R, Parker A, Trevanion S, Birney E, Dunham I, Flicek P. Ensembl regulation resources. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:bav119. [PMID: 26888907 PMCID: PMC4756621 DOI: 10.1093/database/bav119] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2015] [Accepted: 11/24/2015] [Indexed: 12/11/2022]
Abstract
New experimental techniques in epigenomics allow researchers to assay a diversity of highly dynamic features such as histone marks, DNA modifications or chromatin structure. The study of their fluctuations should provide insights into gene expression regulation, cell differentiation and disease. The Ensembl project collects and maintains the Ensembl regulation data resources on epigenetic marks, transcription factor binding and DNA methylation for human and mouse, as well as microarray probe mappings and annotations for a variety of chordate genomes. From this data, we produce a functional annotation of the regulatory elements along the human and mouse genomes with plans to expand to other species as data becomes available. Starting from well-studied cell lines, we will progressively expand our library of measurements to a greater variety of samples. Ensembl’s regulation resources provide a central and easy-to-query repository for reference epigenomes. As with all Ensembl data, it is freely available at http://www.ensembl.org, from the Perl and REST APIs and from the public Ensembl MySQL database server at ensembldb.ensembl.org. Database URL: http://www.ensembl.org
Collapse
Affiliation(s)
- Daniel R Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nathan Johnson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thomas Juetteman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Dan Sheppard
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Steven P Wilder
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ilias Lavidas
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Michael Nuhn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Emily Perry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Quentin Raffaillac-Desfosses
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel Sobral
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Damian Keefe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stefan Gräf
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ikhlak Ahmed
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rhoda Kinsella
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Bethan Pritchard
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Simon Brent
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Ridwan Amode
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Anne Parker
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Steven Trevanion
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| |
Collapse
|
283
|
Yates A, Akanni W, Amode MR, Barrell D, Billis K, Carvalho-Silva D, Cummins C, Clapham P, Fitzgerald S, Gil L, Girón CG, Gordon L, Hourlier T, Hunt SE, Janacek SH, Johnson N, Juettemann T, Keenan S, Lavidas I, Martin FJ, Maurel T, McLaren W, Murphy DN, Nag R, Nuhn M, Parker A, Patricio M, Pignatelli M, Rahtz M, Riat HS, Sheppard D, Taylor K, Thormann A, Vullo A, Wilder SP, Zadissa A, Birney E, Harrow J, Muffato M, Perry E, Ruffier M, Spudich G, Trevanion SJ, Cunningham F, Aken BL, Zerbino DR, Flicek P. Ensembl 2016. Nucleic Acids Res 2016; 44:D710-6. [PMID: 26687719 PMCID: PMC4702834 DOI: 10.1093/nar/gkv1157] [Citation(s) in RCA: 1072] [Impact Index Per Article: 134.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2015] [Revised: 10/19/2015] [Accepted: 10/19/2015] [Indexed: 01/17/2023] Open
Abstract
The Ensembl project (http://www.ensembl.org) is a system for genome annotation, analysis, storage and dissemination designed to facilitate the access of genomic annotation from chordates and key model organisms. It provides access to data from 87 species across our main and early access Pre! websites. This year we introduced three newly annotated species and released numerous updates across our supported species with a concentration on data for the latest genome assemblies of human, mouse, zebrafish and rat. We also provided two data updates for the previous human assembly, GRCh37, through a dedicated website (http://grch37.ensembl.org). Our tools, in particular the VEP, have been improved significantly through integration of additional third party data. REST is now capable of larger-scale analysis and our regulatory data BioMart can deliver faster results. The website is now capable of displaying long-range interactions such as those found in cis-regulated datasets. Finally we have launched a website optimized for mobile devices providing views of genes, variants and phenotypes. Our data is made available without restriction and all code is available from our GitHub organization site (http://github.com/Ensembl) under an Apache 2.0 license.
Collapse
Affiliation(s)
- Andrew Yates
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Wasiu Akanni
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - M Ridwan Amode
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel Barrell
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Denise Carvalho-Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Carla Cummins
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Peter Clapham
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Stephen Fitzgerald
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Laurent Gil
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Carlos García Girón
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Leo Gordon
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sarah E Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sophie H Janacek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nathan Johnson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thomas Juettemann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen Keenan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ilias Lavidas
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thomas Maurel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - William McLaren
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel N Murphy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rishi Nag
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Michael Nuhn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Anne Parker
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mateus Patricio
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Miguel Pignatelli
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthew Rahtz
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Harpreet Singh Riat
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel Sheppard
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kieron Taylor
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Anja Thormann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alessandro Vullo
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Steven P Wilder
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Amonida Zadissa
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jennifer Harrow
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Matthieu Muffato
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Emily Perry
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Magali Ruffier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Giulietta Spudich
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen J Trevanion
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Bronwen L Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Daniel R Zerbino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
284
|
Abstract
The linear and three-dimensional arrangement and composition of chromatin in eukaryotic genomes underlies the mechanisms directing gene regulation. Understanding this organization requires the integration of many data types and experimental results. Here we describe the approach of integrating genome-wide protein-DNA binding data to determine chromatin states. To investigate spatial aspects of genome organization, we present a detailed description of how to run stochastic simulations of protein movements within a simulated nucleus in 3D. This systems level approach enables the development of novel questions aimed at understanding the basic mechanisms that regulate genome dynamics.
Collapse
Affiliation(s)
- Sven Sewitz
- Babraham Institute, Nuclear Dynamics Programme, Cambridge, CB22 3AT, UK
| | - Karen Lipkow
- Babraham Institute, Nuclear Dynamics Programme, Cambridge, CB22 3AT, UK.
- Cambridge Systems Biology Centre, University of Cambridge, Cambridge, CB2 1QR, UK.
| |
Collapse
|
285
|
Arkova OV, Ponomarenko MP, Rasskazov DA, Drachkova IA, Arshinova TV, Ponomarenko PM, Savinkova LK, Kolchanov NA. Obesity-related known and candidate SNP markers can significantly change affinity of TATA-binding protein for human gene promoters. BMC Genomics 2015; 16 Suppl 13:S5. [PMID: 26694100 PMCID: PMC4686794 DOI: 10.1186/1471-2164-16-s13-s5] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e., significant differences of personal genomes of patients from the reference human genome) can help physicians to improve treatment. Clinical research can validate SNP markers via genotyping of patients and demonstration that SNP alleles are significantly more frequent in patients than in healthy people. The search for biomedical SNP markers of interest can be accelerated by computer-based analysis of hundreds of millions of SNPs in the 1000 Genomes project because of selection of the most meaningful candidate SNP markers and elimination of neutral SNPs. RESULTS We cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on comorbidities of obesity. Near the sites binding to TATA-binding protein (TBP) in human gene promoters, we found 22 obesity-related candidate SNP markers, including rs10895068 (male breast cancer in obesity); rs35036378 (reduced risk of obesity after ovariectomy); rs201739205 (reduced risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women); rs183433761 (obesity resistance during a high-fat diet); rs367732974 and rs549591993 (both: cardiovascular complications in obese patients with type 2 diabetes mellitus); rs200487063 and rs34104384 (both: obesity-caused hypertension); rs35518301, rs72661131, and rs562962093 (all: obesity); and rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473 (all: chronic inflammation in comorbidities of obesity). Using an electrophoretic mobility shift assay under nonequilibrium conditions, we empirically validated the statistical significance (α < 0.00025) of the differences in TBP affinity values between the minor and ancestral alleles of 4 out of the 22 SNPs: rs200487063, rs201381696, rs34104384, and rs183433761. We also measured half-life (t1/2), Gibbs free energy change (ΔG), and the association and dissociation rate constants, ka and kd, of the TBP-DNA complex for these SNPs. CONCLUSIONS Validation of the 22 candidate SNP markers by proper clinical protocols appears to have a strong rationale and may advance postgenomic predictive preventive personalized medicine.
Collapse
Affiliation(s)
- Olga V Arkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyeva Avenue, Novosibirsk 630090, Russia
| | - Mikhail P Ponomarenko
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyeva Avenue, Novosibirsk 630090, Russia
- Novosibirsk State University, 2 Pirogova Street, Novosibirsk 630090, Russia
- Laboratory of Evolutionary Bioinformatics and Theoretical Genetics, Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyev Avenue, Novosibirsk 630090, Russia
| | - Dmitry A Rasskazov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyeva Avenue, Novosibirsk 630090, Russia
| | - Irina A Drachkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyeva Avenue, Novosibirsk 630090, Russia
| | - Tatjana V Arshinova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyeva Avenue, Novosibirsk 630090, Russia
| | - Petr M Ponomarenko
- Children's Hospital Los Angeles, 4640 Hollywood Boulevard, University of Southern California, Los Angeles, CA 90027, USA
| | - Ludmila K Savinkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyeva Avenue, Novosibirsk 630090, Russia
| | - Nikolay A Kolchanov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, 10 Lavrentyeva Avenue, Novosibirsk 630090, Russia
- Novosibirsk State University, 2 Pirogova Street, Novosibirsk 630090, Russia
| |
Collapse
|
286
|
Parra RG, Rohr CO, Koile D, Perez-Castro C, Yankilevich P. INSECT 2.0: a web-server for genome-wide cis-regulatory modules prediction. Bioinformatics 2015; 32:1229-31. [PMID: 26656931 DOI: 10.1093/bioinformatics/btv726] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 12/04/2015] [Indexed: 11/14/2022] Open
Abstract
UNLABELLED INSECT is a user-friendly web server to predict the occurrence of Cis-Regulatory Modules (CRMs), which control gene expression. Here, we present a new release of INSECT which includes several new features, such as whole genome analysis, nucleosome occupancy predictions, and which provides additional links to third-party functional tools that complement user capabilities, CRM analysis and hypothesis construction. Improvements in the core implementation have led to a faster and more efficient tool. In addition, this new release introduces a new interface designed for a more integrative and dynamic user experience. AVAILABILITY AND IMPLEMENTATION http://bioinformatics.ibioba-mpsp-conicet.gov.ar/INSECT2 CONTACT: pyankilevich@ibioba-mpsp-conicet.gov.ar.
Collapse
Affiliation(s)
- R Gonzalo Parra
- Facultad De Ciencias Exactas Y Naturales, Universidad De Buenos Aires, Buenos Aires, Argentina and
| | - Cristian O Rohr
- Facultad De Ciencias Exactas Y Naturales, Universidad De Buenos Aires, Buenos Aires, Argentina and
| | - Daniel Koile
- Instituto De Investigacion En Biomedicina De Buenos Aires (IBioBA), CONICET, Partner Institute of the Max Planck Society, Godoy Cruz 2390, Buenos Aires C1425FQA, Argentina
| | - Carolina Perez-Castro
- Instituto De Investigacion En Biomedicina De Buenos Aires (IBioBA), CONICET, Partner Institute of the Max Planck Society, Godoy Cruz 2390, Buenos Aires C1425FQA, Argentina
| | - Patricio Yankilevich
- Instituto De Investigacion En Biomedicina De Buenos Aires (IBioBA), CONICET, Partner Institute of the Max Planck Society, Godoy Cruz 2390, Buenos Aires C1425FQA, Argentina
| |
Collapse
|
287
|
PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs. Sci Rep 2015; 5:17302. [PMID: 26612672 PMCID: PMC4661514 DOI: 10.1038/srep17302] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Accepted: 10/28/2015] [Indexed: 12/22/2022] Open
Abstract
Expression quantitative trait loci (eQTLs) mapping and linkage disequilibrium (LD) analysis have been widely employed to interpret findings of genome-wide association studies (GWAS). With the availability of deep sequencing data of 423 lymphoblastoid cell lines (LCLs) from six global populations and the microarray expression data, we performed eQTL analysis, identified more than 228 K SNP cis-eQTLs and 21 K indel cis-eQTLs and generated a LCL cis-eQTL database. We demonstrate that the percentages of population-shared and population-specific cis-eQTLs are comparable; while indel cis-eQTLs in the population-specific subsection make more contribution to gene expression variations than those in the population-shared subsection. We found cis-eQTLs, especially the population-shared cis-eQTLs are significantly enriched toward transcription start site. Moreover, the National Human Genome Research Institute cataloged GWAS SNPs are enriched for LCL cis-eQTLs. Specifically, 32.8% GWAS SNPs are LCL cis-eQTLs, among which 12.5% can be tagged by indel cis-eQTLs, suggesting the fundamental contribution of indel cis-eQTLs to GWAS association signals. To search for functional indels and SNPs tagging GWAS SNPs, a pipeline Post-GWAS Explorer for Functional Indels and SNPs (PExFInS) has been developed, integrating LD analysis, functional annotation from public databases, cis-eQTL mapping with our LCL cis-eQTL database and other published cis-eQTL datasets.
Collapse
|
288
|
Lesurf R, Cotto KC, Wang G, Griffith M, Kasaian K, Jones SJM, Montgomery SB, Griffith OL. ORegAnno 3.0: a community-driven resource for curated regulatory annotation. Nucleic Acids Res 2015; 44:D126-32. [PMID: 26578589 PMCID: PMC4702855 DOI: 10.1093/nar/gkv1203] [Citation(s) in RCA: 108] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 10/26/2015] [Indexed: 12/26/2022] Open
Abstract
The Open Regulatory Annotation database (ORegAnno) is a resource for curated regulatory annotation. It contains information about regulatory regions, transcription factor binding sites, RNA binding sites, regulatory variants, haplotypes, and other regulatory elements. ORegAnno differentiates itself from other regulatory resources by facilitating crowd-sourced interpretation and annotation of regulatory observations from the literature and highly curated resources. It contains a comprehensive annotation scheme that aims to describe both the elements and outcomes of regulatory events. Moreover, ORegAnno assembles these disparate data sources and annotations into a single, high quality catalogue of curated regulatory information. The current release is an update of the database previously featured in the NAR Database Issue, and now contains 1 948 307 records, across 18 species, with a combined coverage of 334 215 080 bp. Complete records, annotation, and other associated data are available for browsing and download at http://www.oreganno.org/.
Collapse
Affiliation(s)
- Robert Lesurf
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Kelsy C Cotto
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Grace Wang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Malachi Griffith
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA Siteman Cancer Center, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Katayoon Kasaian
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Vancouver, BC V5Z 4S6, Canada
| | - Steven J M Jones
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Vancouver, BC V5Z 4S6, Canada Department of Molecular Biology & Biochemistry, Simon Fraser University, Burnaby, BC V5A 1S6, Canada Department of Medical Genetics, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| | - Stephen B Montgomery
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Obi L Griffith
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA Siteman Cancer Center, Washington University School of Medicine, St. Louis, MO 63110, USA Department of Medicine, Division of Oncology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | | |
Collapse
|
289
|
Sarkar MK, Gayen S, Kumar S, Maclary E, Buttigieg E, Hinten M, Kumari A, Harris C, Sado T, Kalantry S. An Xist-activating antisense RNA required for X-chromosome inactivation. Nat Commun 2015; 6:8564. [PMID: 26477563 PMCID: PMC4616153 DOI: 10.1038/ncomms9564] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 09/07/2015] [Indexed: 01/06/2023] Open
Abstract
The transcriptional imbalance due to the difference in the number of X chromosomes between male and female mammals is remedied through X-chromosome inactivation, the epigenetic transcriptional silencing of one of the two X chromosomes in females. The X-linked Xist long non-coding RNA functions as an X inactivation master regulator; Xist is selectively upregulated from the prospective inactive X chromosome and is required in cis for X inactivation. Here we discover an Xist antisense long non-coding RNA, XistAR (XistActivating RNA), which is encoded within exon 1 of the mouse Xist gene and is transcribed only from the inactive X chromosome. Selective truncation of XistAR, while sparing the overlapping Xist RNA, leads to a deficiency in Xist RNA expression in cis during the initiation of X inactivation. Thus, the Xist gene carries within its coding sequence an antisense RNA that drives Xist expression. The X-chromosome linked long non-coding RNA, Xist, is a master regulator of the X inactivation. Here, the authors report that XistAR, an Xist anti-sense long non-coding RNA encoded within the mouse Xist gene and transcribed only from the inactive X chromosome, regulates Xist expression.
Collapse
Affiliation(s)
- Mrinal K Sarkar
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Srimonta Gayen
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Surinder Kumar
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Emily Maclary
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Emily Buttigieg
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Michael Hinten
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Archana Kumari
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Clair Harris
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | - Takashi Sado
- Department of Advanced Bioscience, Graduate School of Agriculture, Kinki University, 3327-204 Nakamachi, Nara 631-8505, Japan
| | - Sundeep Kalantry
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
290
|
Kannan L, Ramos M, Re A, El-Hachem N, Safikhani Z, Gendoo DM, Davis S, Gomez-Cabrero D, Castelo R, Hansen KD, Carey VJ, Morgan M, Culhane AC, Haibe-Kains B, Waldron L. Public data and open source tools for multi-assay genomic investigation of disease. Brief Bioinform 2015; 17:603-15. [PMID: 26463000 PMCID: PMC4945830 DOI: 10.1093/bib/bbv080] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Indexed: 01/07/2023] Open
Abstract
Molecular interrogation of a biological sample through DNA sequencing, RNA and microRNA profiling, proteomics and other assays, has the potential to provide a systems level approach to predicting treatment response and disease progression, and to developing precision therapies. Large publicly funded projects have generated extensive and freely available multi-assay data resources; however, bioinformatic and statistical methods for the analysis of such experiments are still nascent. We review multi-assay genomic data resources in the areas of clinical oncology, pharmacogenomics and other perturbation experiments, population genomics and regulatory genomics and other areas, and tools for data acquisition. Finally, we review bioinformatic tools that are explicitly geared toward integrative genomic data visualization and analysis. This review provides starting points for accessing publicly available data and tools to support development of needed integrative methods.
Collapse
|
291
|
Ponomarenko M, Rasskazov D, Arkova O, Ponomarenko P, Suslov V, Savinkova L, Kolchanov N. How to Use SNP_TATA_Comparator to Find a Significant Change in Gene Expression Caused by the Regulatory SNP of This Gene's Promoter via a Change in Affinity of the TATA-Binding Protein for This Promoter. BIOMED RESEARCH INTERNATIONAL 2015; 2015:359835. [PMID: 26516624 PMCID: PMC4609514 DOI: 10.1155/2015/359835] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2015] [Accepted: 08/24/2015] [Indexed: 01/11/2023]
Abstract
The use of biomedical SNP markers of diseases can improve effectiveness of treatment. Genotyping of patients with subsequent searching for SNPs more frequent than in norm is the only commonly accepted method for identification of SNP markers within the framework of translational research. The bioinformatics applications aimed at millions of unannotated SNPs of the "1000 Genomes" can make this search for SNP markers more focused and less expensive. We used our Web service involving Fisher's Z-score for candidate SNP markers to find a significant change in a gene's expression. Here we analyzed the change caused by SNPs in the gene's promoter via a change in affinity of the TATA-binding protein for this promoter. We provide examples and discuss how to use this bioinformatics application in the course of practical analysis of unannotated SNPs from the "1000 Genomes" project. Using known biomedical SNP markers, we identified 17 novel candidate SNP markers nearby: rs549858786 (rheumatoid arthritis); rs72661131 (cardiovascular events in rheumatoid arthritis); rs562962093 (stroke); rs563558831 (cyclophosphamide bioactivation); rs55878706 (malaria resistance, leukopenia), rs572527200 (asthma, systemic sclerosis, and psoriasis), rs371045754 (hemophilia B), rs587745372 (cardiovascular events); rs372329931, rs200209906, rs367732974, and rs549591993 (all four: cancer); rs17231520 and rs569033466 (both: atherosclerosis); rs63750953, rs281864525, and rs34166473 (all three: malaria resistance, thalassemia).
Collapse
Affiliation(s)
- Mikhail Ponomarenko
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Dmitry Rasskazov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - Olga Arkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - Petr Ponomarenko
- Children's Hospital Los Angeles, University of Southern California, Los Angeles, CA 90027, USA
| | - Valentin Suslov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - Ludmila Savinkova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - Nikolay Kolchanov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk 630090, Russia
| |
Collapse
|