1
|
Vishnevsky OV, Bocharnikov AV, Ignatieva EV. Peak Scores Significantly Depend on the Relationships between Contextual Signals in ChIP-Seq Peaks. Int J Mol Sci 2024; 25:1011. [PMID: 38256085 PMCID: PMC10816497 DOI: 10.3390/ijms25021011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/13/2023] [Accepted: 01/09/2024] [Indexed: 01/24/2024] Open
Abstract
Chromatin immunoprecipitation followed by massively parallel DNA sequencing (ChIP-seq) is a central genome-wide method for in vivo analyses of DNA-protein interactions in various cellular conditions. Numerous studies have demonstrated the complex contextual organization of ChIP-seq peak sequences and the presence of binding sites for transcription factors in them. We assessed the dependence of the ChIP-seq peak score on the presence of different contextual signals in the peak sequences by analyzing these sequences from several ChIP-seq experiments using our fully enumerative GPU-based de novo motif discovery method, Argo_CUDA. Analysis revealed sets of significant IUPAC motifs corresponding to the binding sites of the target and partner transcription factors. For these ChIP-seq experiments, multiple regression models were constructed, demonstrating a significant dependence of the peak scores on the presence in the peak sequences of not only highly significant target motifs but also less significant motifs corresponding to the binding sites of the partner transcription factors. A significant correlation was shown between the presence of the target motifs FOXA2 and the partner motifs HNF4G, which found experimental confirmation in the scientific literature, demonstrating the important contribution of the partner transcription factors to the binding of the target transcription factor to DNA and, consequently, their important contribution to the peak score.
Collapse
Affiliation(s)
- Oleg V. Vishnevsky
- Institute of Cytology and Genetics, 630090 Novosibirsk, Russia;
- Department of Natural Science, Novosibirsk State University, 630090 Novosibirsk, Russia;
| | - Andrey V. Bocharnikov
- Department of Natural Science, Novosibirsk State University, 630090 Novosibirsk, Russia;
| | - Elena V. Ignatieva
- Institute of Cytology and Genetics, 630090 Novosibirsk, Russia;
- Department of Natural Science, Novosibirsk State University, 630090 Novosibirsk, Russia;
| |
Collapse
|
2
|
Antontseva EV, Degtyareva AO, Korbolina EE, Damarov IS, Merkulova TI. Human-genome single nucleotide polymorphisms affecting transcription factor binding and their role in pathogenesis. Vavilovskii Zhurnal Genet Selektsii 2023; 27:662-675. [PMID: 37965371 PMCID: PMC10641029 DOI: 10.18699/vjgb-23-77] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 03/24/2023] [Accepted: 03/30/2023] [Indexed: 11/16/2023] Open
Abstract
Single nucleotide polymorphisms (SNPs) are the most common type of variation in the human genome. The vast majority of SNPs identified in the human genome do not have any effect on the phenotype; however, some can lead to changes in the function of a gene or the level of its expression. Most SNPs associated with certain traits or pathologies are mapped to regulatory regions of the genome and affect gene expression by changing transcription factor binding sites. In recent decades, substantial effort has been invested in searching for such regulatory SNPs (rSNPs) and understanding the mechanisms by which they lead to phenotypic differences, primarily to individual differences in susceptibility to diseases and in sensitivity to drugs. The development of the NGS (next-generation sequencing) technology has contributed not only to the identification of a huge number of SNPs and to the search for their association (genome-wide association studies, GWASs) with certain diseases or phenotypic manifestations, but also to the development of more productive approaches to their functional annotation. It should be noted that the presence of an association does not allow one to identify a functional, truly disease-associated DNA sequence variant among multiple marker SNPs that are detected due to linkage disequilibrium. Moreover, determination of associations of genetic variants with a disease does not provide information about the functionality of these variants, which is necessary to elucidate the molecular mechanisms of the development of pathology and to design effective methods for its treatment and prevention. In this regard, the functional analysis of SNPs annotated in the GWAS catalog, both at the genome-wide level and at the level of individual SNPs, became especially relevant in recent years. A genome-wide search for potential rSNPs is possible without any prior knowledge of their association with a trait. Thus, mapping expression quantitative trait loci (eQTLs) makes it possible to identify an SNP for which - among transcriptomes of homozygotes and heterozygotes for its various alleles - there are differences in the expression level of certain genes, which can be located at various distances from the SNP. To predict rSNPs, approaches based on searches for allele-specific events in RNA-seq, ChIP-seq, DNase-seq, ATAC-seq, MPRA, and other data are also used. Nonetheless, for a more complete functional annotation of such rSNPs, it is necessary to establish their association with a trait, in particular, with a predisposition to a certain pathology or sensitivity to drugs. Thus, approaches to finding SNPs important for the development of a trait can be categorized into two groups: (1) starting from data on an association of SNPs with a certain trait, (2) starting from the determination of allele-specific changes at the molecular level (in a transcriptome or regulome). Only comprehensive use of strategically different approaches can considerably enrich our knowledge about the role of genetic determinants in the molecular mechanisms of trait formation, including predisposition to multifactorial diseases.
Collapse
Affiliation(s)
- E V Antontseva
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - A O Degtyareva
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - E E Korbolina
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - I S Damarov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - T I Merkulova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
3
|
Transcription networks in liver development and acute liver failure. LIVER RESEARCH 2022. [DOI: 10.1016/j.livres.2022.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
|
4
|
Price JD, Lindtner S, Ypsilanti A, Binyameen F, Johnson JR, Newton BW, Krogan NJ, Rubenstein JLR. DLX1 and the NuRD complex cooperate in enhancer decommissioning and transcriptional repression. Development 2022; 149:dev199508. [PMID: 35695185 PMCID: PMC9245191 DOI: 10.1242/dev.199508] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Accepted: 03/17/2022] [Indexed: 09/27/2023]
Abstract
In the developing subpallium, the fate decision between neurons and glia is driven by expression of Dlx1/2 or Olig1/2, respectively, two sets of transcription factors with a mutually repressive relationship. The mechanism by which Dlx1/2 repress progenitor and oligodendrocyte fate, while promoting transcription of genes needed for differentiation, is not fully understood. We identified a motif within DLX1 that binds RBBP4, a NuRD complex subunit. ChIP-seq studies of genomic occupancy of DLX1 and six different members of the NuRD complex show that DLX1 and NuRD colocalize to putative regulatory elements enriched near other transcription factor genes. Loss of Dlx1/2 leads to dysregulation of genome accessibility at putative regulatory elements near genes repressed by Dlx1/2, including Olig2. Consequently, heterozygosity of Dlx1/2 and Rbbp4 leads to an increase in the production of OLIG2+ cells. These findings highlight the importance of the interplay between transcription factors and chromatin remodelers in regulating cell-fate decisions.
Collapse
Affiliation(s)
- James D. Price
- Department of Psychiatry, Langley Porter Psychiatric Institute, UCSF Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
- Developmental and Stem Cell Biology Graduate Program, University of California San Francisco, San Francisco, CA 94143, USA
| | - Susan Lindtner
- Department of Psychiatry, Langley Porter Psychiatric Institute, UCSF Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| | - Athena Ypsilanti
- Department of Psychiatry, Langley Porter Psychiatric Institute, UCSF Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| | - Fadya Binyameen
- Department of Psychiatry, Langley Porter Psychiatric Institute, UCSF Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| | - Jeffrey R. Johnson
- Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA 94158, USA
- Gladstone Institute of Data Science and Biosciences, J. David Gladstone Institutes, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA 94143, USA
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Billy W. Newton
- Gladstone Institute of Data Science and Biosciences, J. David Gladstone Institutes, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA 94143, USA
| | - Nevan J. Krogan
- Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA 94158, USA
- Gladstone Institute of Data Science and Biosciences, J. David Gladstone Institutes, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA 94143, USA
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - John L. R. Rubenstein
- Department of Psychiatry, Langley Porter Psychiatric Institute, UCSF Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
5
|
Paredes O, Morales JA, Mendizabal AP, Romo-Vázquez R. Metacode: One code to rule them all. Biosystems 2021; 208:104486. [PMID: 34274462 DOI: 10.1016/j.biosystems.2021.104486] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 07/07/2021] [Accepted: 07/09/2021] [Indexed: 12/13/2022]
Abstract
The code of codes or metacode is a microcosm where biological layers, as well as their codes, interact together allowing the continuity of information flow in organisms by increasing biological entities' complexity. Through this novel organic code, biological systems scale towards niches with higher informatic freedom building structures that increase the entropy in the universe. Code biology has developed a novel informational framework where biological entities strive themselves through the information flow carried out through organic codes consisting of two molecular or functional landscapes intertwined through arbitrary linkages via an adaptor whose nature is autonomous from molecular determinism. Here we will integrate genomic and epigenomic codes according to the evidence released in ENCODE (phase 3), psychENCODE and GTEx project, outlining the principles of the metacode, to address the continuous nature of biological systems and their inter-layered information flow. This novel complex metacode maps from very constrained sets of elements (i.e., regulation sites modulating gene expression) to new ones with greater freedom of decoding (i.e., a continuous cell phenotypic space). This leads to a new domain in code biology where biological systems are informatic attractors that navigate an energy metaspace through a complexity-noise balance, stalling in emergent niches where organic codes take meaning.
Collapse
Affiliation(s)
- Omar Paredes
- Computer Sciences Department, CUCEI, Universidad de Guadalajara, Mexico
| | | | - Adriana P Mendizabal
- Molecular Biology Laboratory, Farmacobiology Department, CUCEI, Universidad de Guadalajara, Mexico
| | | |
Collapse
|
6
|
Degtyareva AO, Antontseva EV, Merkulova TI. Regulatory SNPs: Altered Transcription Factor Binding Sites Implicated in Complex Traits and Diseases. Int J Mol Sci 2021; 22:6454. [PMID: 34208629 PMCID: PMC8235176 DOI: 10.3390/ijms22126454] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/15/2021] [Accepted: 06/15/2021] [Indexed: 12/19/2022] Open
Abstract
The vast majority of the genetic variants (mainly SNPs) associated with various human traits and diseases map to a noncoding part of the genome and are enriched in its regulatory compartment, suggesting that many causal variants may affect gene expression. The leading mechanism of action of these SNPs consists in the alterations in the transcription factor binding via creation or disruption of transcription factor binding sites (TFBSs) or some change in the affinity of these regulatory proteins to their cognate sites. In this review, we first focus on the history of the discovery of regulatory SNPs (rSNPs) and systematized description of the existing methodical approaches to their study. Then, we brief the recent comprehensive examples of rSNPs studied from the discovery of the changes in the TFBS sequence as a result of a nucleotide substitution to identification of its effect on the target gene expression and, eventually, to phenotype. We also describe state-of-the-art genome-wide approaches to identification of regulatory variants, including both making molecular sense of genome-wide association studies (GWAS) and the alternative approaches the primary goal of which is to determine the functionality of genetic variants. Among these approaches, special attention is paid to expression quantitative trait loci (eQTLs) analysis and the search for allele-specific events in RNA-seq (ASE events) as well as in ChIP-seq, DNase-seq, and ATAC-seq (ASB events) data.
Collapse
Affiliation(s)
- Arina O. Degtyareva
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
| | - Elena V. Antontseva
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
| | - Tatiana I. Merkulova
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
| |
Collapse
|
7
|
Control of Cell Identity by the Nuclear Receptor HNF4 in Organ Pathophysiology. Cells 2020; 9:cells9102185. [PMID: 32998360 PMCID: PMC7600215 DOI: 10.3390/cells9102185] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 09/25/2020] [Accepted: 09/26/2020] [Indexed: 12/14/2022] Open
Abstract
Hepatocyte Nuclear Factor 4 (HNF4) is a transcription factor (TF) belonging to the nuclear receptor family whose expression and activities are restricted to a limited number of organs including the liver and gastrointestinal tract. In this review, we present robust evidence pointing to HNF4 as a master regulator of cellular differentiation during development and a safekeeper of acquired cell identity in adult organs. Importantly, we discuss that transient loss of HNF4 may represent a protective mechanism upon acute organ injury, while prolonged impairment of HNF4 activities could contribute to organ dysfunction. In this context, we describe in detail mechanisms involved in the pathophysiological control of cell identity by HNF4, including how HNF4 works as part of cell-specific TF networks and how its expression/activities are disrupted in injured organs.
Collapse
|
8
|
Mazrooei P, Kron KJ, Zhu Y, Zhou S, Grillo G, Mehdi T, Ahmed M, Severson TM, Guilhamon P, Armstrong NS, Huang V, Yamaguchi TN, Fraser M, van der Kwast T, Boutros PC, He HH, Bergman AM, Bristow RG, Zwart W, Lupien M. Cistrome Partitioning Reveals Convergence of Somatic Mutations and Risk Variants on Master Transcription Regulators in Primary Prostate Tumors. Cancer Cell 2019; 36:674-689.e6. [PMID: 31735626 DOI: 10.1016/j.ccell.2019.10.005] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 08/02/2019] [Accepted: 10/17/2019] [Indexed: 12/26/2022]
Abstract
Thousands of noncoding somatic single-nucleotide variants (SNVs) of unknown function are reported in tumors. Partitioning the genome according to cistromes reveals the enrichment of somatic SNVs in prostate tumors as opposed to adjacent normal tissue cistromes of master transcription regulators, including AR, FOXA1, and HOXB13. This parallels enrichment of prostate cancer genetic predispositions over these transcription regulators' tumor cistromes, exemplified at the 8q24 locus harboring both risk variants and somatic SNVs in cis-regulatory elements upregulating MYC expression. However, Massively Parallel Reporter Assays reveal that few SNVs can alter the transactivation potential of individual cis-regulatory elements. Instead, similar to inherited risk variants, SNVs accumulate in cistromes of master transcription regulators required for prostate cancer development.
Collapse
Affiliation(s)
- Parisa Mazrooei
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7, Canada
| | - Ken J Kron
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada
| | - Yanyun Zhu
- Division of Oncogenomics, Oncode Institute, the Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Stanley Zhou
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7, Canada
| | - Giacomo Grillo
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada
| | - Tahmid Mehdi
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada
| | - Musaddeque Ahmed
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada
| | - Tesa M Severson
- Division of Oncogenomics, Oncode Institute, the Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Paul Guilhamon
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada
| | | | - Vincent Huang
- Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada
| | | | - Michael Fraser
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada; Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada
| | - Theodorus van der Kwast
- Department of Pathology and Laboratory Medicine, Toronto General Hospital, University Health Network, Toronto, ON M5G 2C4, Canada
| | - Paul C Boutros
- Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7, Canada; Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada; Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Housheng Hansen He
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7, Canada
| | - Andries M Bergman
- Division of Oncogenomics, Oncode Institute, the Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Robert G Bristow
- CRUK Manchester Institute and Manchester Cancer Research Centre, University of Manchester, Manchester M20 4GJ, UK
| | - Wilbert Zwart
- Division of Oncogenomics, Oncode Institute, the Netherlands Cancer Institute, Amsterdam, The Netherlands; Laboratory of Chemical Biology and Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, PO Box 513, 5600 MB Eindhoven, The Netherlands.
| | - Mathieu Lupien
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON M5G 1L7, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 1L7, Canada; Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada.
| |
Collapse
|