1
|
Wang J, Nakato R. Churros: a Docker-based pipeline for large-scale epigenomic analysis. DNA Res 2024; 31:dsad026. [PMID: 38102723 PMCID: PMC11389749 DOI: 10.1093/dnares/dsad026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 11/23/2023] [Accepted: 12/13/2023] [Indexed: 12/17/2023] Open
Abstract
The epigenome, which reflects the modifications on chromatin or DNA sequences, provides crucial insight into gene expression regulation and cellular activity. With the continuous accumulation of epigenomic datasets such as chromatin immunoprecipitation followed by sequencing (ChIP-seq) data, there is a great demand for a streamlined pipeline to consistently process them, especially for large-dataset comparisons involving hundreds of samples. Here, we present Churros, an end-to-end epigenomic analysis pipeline that is environmentally independent and optimized for handling large-scale data. We successfully demonstrated the effectiveness of Churros by analyzing large-scale ChIP-seq datasets with the hg38 or Telomere-to-Telomere (T2T) human reference genome. We found that applying T2T to the typical analysis workflow has important impacts on read mapping, quality checks, and peak calling. We also introduced a useful feature to study context-specific epigenomic landscapes. Churros will contribute a comprehensive and unified resource for analyzing large-scale epigenomic data.
Collapse
Affiliation(s)
- Jiankang Wang
- School of Biomedical Sciences, Hunan University, Changsha, Hunan, China
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Ryuichiro Nakato
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| |
Collapse
|
2
|
Krug B, Hu B, Chen H, Ptack A, Chen X, Gretarsson KH, Deshmukh S, Kabir N, Andrade AF, Jabbour E, Harutyunyan AS, Lee JJY, Hulswit M, Faury D, Russo C, Xu X, Johnston MJ, Baguette A, Dahl NA, Weil AG, Ellezam B, Dali R, Blanchette M, Wilson K, Garcia BA, Soni RK, Gallo M, Taylor MD, Kleinman CL, Majewski J, Jabado N, Lu C. H3K27me3 spreading organizes canonical PRC1 chromatin architecture to regulate developmental programs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.28.567931. [PMID: 38116029 PMCID: PMC10729739 DOI: 10.1101/2023.11.28.567931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Polycomb Repressive Complex 2 (PRC2)-mediated histone H3K27 tri-methylation (H3K27me3) recruits canonical PRC1 (cPRC1) to maintain heterochromatin. In early development, polycomb-regulated genes are connected through long-range 3D interactions which resolve upon differentiation. Here, we report that polycomb looping is controlled by H3K27me3 spreading and regulates target gene silencing and cell fate specification. Using glioma-derived H3 Lys-27-Met (H3K27M) mutations as tools to restrict H3K27me3 deposition, we show that H3K27me3 confinement concentrates the chromatin pool of cPRC1, resulting in heightened 3D interactions mirroring chromatin architecture of pluripotency, and stringent gene repression that maintains cells in progenitor states to facilitate tumor development. Conversely, H3K27me3 spread in pluripotent stem cells, following neural differentiation or loss of the H3K36 methyltransferase NSD1, dilutes cPRC1 concentration and dissolves polycomb loops. These results identify the regulatory principles and disease implications of polycomb looping and nominate histone modification-guided distribution of reader complexes as an important mechanism for nuclear compartment organization. Highlights The confinement of H3K27me3 at PRC2 nucleation sites without its spreading correlates with increased 3D chromatin interactions.The H3K27M oncohistone concentrates canonical PRC1 that anchors chromatin loop interactions in gliomas, silencing developmental programs.Stem and progenitor cells require factors promoting H3K27me3 confinement, including H3K36me2, to maintain cPRC1 loop architecture.The cPRC1-H3K27me3 interaction is a targetable driver of aberrant self-renewal in tumor cells.
Collapse
|
3
|
Nakato R, Sakata T, Wang J, Nagai LAE, Nagaoka Y, Oba GM, Bando M, Shirahige K. Context-dependent perturbations in chromatin folding and the transcriptome by cohesin and related factors. Nat Commun 2023; 14:5647. [PMID: 37726281 PMCID: PMC10509244 DOI: 10.1038/s41467-023-41316-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 08/29/2023] [Indexed: 09/21/2023] Open
Abstract
Cohesin regulates gene expression through context-specific chromatin folding mechanisms such as enhancer-promoter looping and topologically associating domain (TAD) formation by cooperating with factors such as cohesin loaders and the insulation factor CTCF. We developed a computational workflow to explore how three-dimensional (3D) structure and gene expression are regulated collectively or individually by cohesin and related factors. The main component is CustardPy, by which multi-omics datasets are compared systematically. To validate our methodology, we generated 3D genome, transcriptome, and epigenome data before and after depletion of cohesin and related factors and compared the effects of depletion. We observed diverse effects on the 3D genome and transcriptome, and gene expression changes were correlated with the splitting of TADs caused by cohesin loss. We also observed variations in long-range interactions across TADs, which correlated with their epigenomic states. These computational tools and datasets will be valuable for 3D genome and epigenome studies.
Collapse
Affiliation(s)
- Ryuichiro Nakato
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japan.
| | - Toyonori Sakata
- Laboratory of Genome Structure and Function, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japan
- Karolinska Institutet, Department of Biosciences and Nutrition, Biomedicum, Quarter A6, 171 77, Stockholm, Sweden
- Karolinska Institutet, Department of Cell and Molecular Biology, Biomedicum, Quarter A6, 171 77, Stockholm, Sweden
| | - Jiankang Wang
- School of Biomedical Sciences, Hunan University, Changsha, China
| | - Luis Augusto Eijy Nagai
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japan
| | - Yuya Nagaoka
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japan
| | - Gina Miku Oba
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japan
| | - Masashige Bando
- Laboratory of Genome Structure and Function, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japan
| | - Katsuhiko Shirahige
- Laboratory of Genome Structure and Function, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-Ku, Tokyo, 113-0032, Japan.
- Karolinska Institutet, Department of Biosciences and Nutrition, Biomedicum, Quarter A6, 171 77, Stockholm, Sweden.
- Karolinska Institutet, Department of Cell and Molecular Biology, Biomedicum, Quarter A6, 171 77, Stockholm, Sweden.
| |
Collapse
|
4
|
Wang J, Nakato R. CohesinDB: a comprehensive database for decoding cohesin-related epigenomes, 3D genomes and transcriptomes in human cells. Nucleic Acids Res 2022; 51:D70-D79. [PMID: 36162821 PMCID: PMC9825609 DOI: 10.1093/nar/gkac795] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 08/29/2022] [Accepted: 09/03/2022] [Indexed: 01/29/2023] Open
Abstract
Cohesin is a multifunctional protein responsible for transcriptional regulation and chromatin organization. Cohesin binds to chromatin at tens of thousands of distinct sites in a conserved or tissue-specific manner, whereas the function of cohesin varies greatly depending on the epigenetic properties of specific chromatin loci. Cohesin also extensively mediates cis-regulatory modules (CRMs) and chromatin loops. Even though next-generation sequencing technologies have provided a wealth of information on different aspects of cohesin, the integration and exploration of the resultant massive cohesin datasets are not straightforward. Here, we present CohesinDB (https://cohesindb.iqb.u-tokyo.ac.jp), a comprehensive multiomics cohesin database in human cells. CohesinDB includes 2043 epigenomics, transcriptomics and 3D genomics datasets from 530 studies involving 176 cell types. By integrating these large-scale data, CohesinDB summarizes three types of 'cohesin objects': 751 590 cohesin binding sites, 957 868 cohesin-related chromatin loops and 2 229 500 cohesin-related CRMs. Each cohesin object is annotated with locus, cell type, classification, function, 3D genomics and cis-regulatory information. CohesinDB features a user-friendly interface for browsing, searching, analyzing, visualizing and downloading the desired information. CohesinDB contributes a valuable resource for all researchers studying cohesin, epigenomics, transcriptional regulation and chromatin organization.
Collapse
Affiliation(s)
- Jiankang Wang
- Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Yayoi 1-1-1, Japan,Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Hongo 7-3-1, Japan
| | - Ryuichiro Nakato
- To whom correspondence should be addressed. Tel: +81 3 5841 1471; Fax: +81 3 5841 7308;
| |
Collapse
|
5
|
Wang J, Bando M, Shirahige K, Nakato R. Large-scale multi-omics analysis suggests specific roles for intragenic cohesin in transcriptional regulation. Nat Commun 2022; 13:3218. [PMID: 35680859 PMCID: PMC9184728 DOI: 10.1038/s41467-022-30792-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 05/14/2022] [Indexed: 12/19/2022] Open
Abstract
Cohesin, an essential protein complex for chromosome segregation, regulates transcription through a variety of mechanisms. It is not a trivial task to assign diverse cohesin functions. Moreover, the context-specific roles of cohesin-mediated interactions, especially on intragenic regions, have not been thoroughly investigated. Here we perform a comprehensive characterization of cohesin binding sites in several human cell types. We integrate epigenomic, transcriptomic and chromatin interaction data to explore the context-specific functions of intragenic cohesin related to gene activation. We identify a specific subset of cohesin binding sites, decreased intragenic cohesin sites (DICs), which are negatively correlated with transcriptional regulation. A subgroup of DICs is enriched with enhancer markers and RNA polymerase II, while the others are more correlated to chromatin architecture. DICs are observed in various cell types, including cells from patients with cohesinopathy. We also implement machine learning to our data and identified genomic features for isolating DICs from all cohesin sites. These results suggest a previously unidentified function of cohesin on intragenic regions for transcriptional regulation.
Collapse
Affiliation(s)
- Jiankang Wang
- Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, Japan
- Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Masashige Bando
- Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, Japan
| | - Katsuhiko Shirahige
- Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, Japan
- Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden
| | - Ryuichiro Nakato
- Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, Japan.
- Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.
| |
Collapse
|
6
|
Comprehensive assessment of differential ChIP-seq tools guides optimal algorithm selection. Genome Biol 2022; 23:119. [PMID: 35606795 PMCID: PMC9128273 DOI: 10.1186/s13059-022-02686-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 05/09/2022] [Indexed: 11/21/2022] Open
Abstract
Background The analysis of chromatin binding patterns of proteins in different biological states is a main application of chromatin immunoprecipitation followed by sequencing (ChIP-seq). A large number of algorithms and computational tools for quantitative comparison of ChIP-seq datasets exist, but their performance is strongly dependent on the parameters of the biological system under investigation. Thus, a systematic assessment of available computational tools for differential ChIP-seq analysis is required to guide the optimal selection of analysis tools based on the present biological scenario. Results We created standardized reference datasets by in silico simulation and sub-sampling of genuine ChIP-seq data to represent different biological scenarios and binding profiles. Using these data, we evaluated the performance of 33 computational tools and approaches for differential ChIP-seq analysis. Tool performance was strongly dependent on peak size and shape as well as on the scenario of biological regulation. Conclusions Our analysis provides unbiased guidelines for the optimized choice of software tools in differential ChIP-seq analysis. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02686-y.
Collapse
|
7
|
Girgis J, Yang D, Chakroun I, Liu Y, Blais A. Six1 promotes skeletal muscle thyroid hormone response through regulation of the MCT10 transporter. Skelet Muscle 2021; 11:26. [PMID: 34809717 PMCID: PMC8607597 DOI: 10.1186/s13395-021-00281-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 10/29/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The Six1 transcription factor is implicated in controlling the development of several tissue types, notably skeletal muscle. Six1 also contributes to muscle metabolism and its activity is associated with the fast-twitch, glycolytic phenotype. Six1 regulates the expression of certain genes of the fast muscle program by directly stimulating their transcription or indirectly acting through a long non-coding RNA. We hypothesized that additional mechanisms of action of Six1 might be at play. METHODS A combined analysis of gene expression profiling and genome-wide location analysis data was performed. Results were validated using in vivo RNA interference loss-of-function assays followed by measurement of gene expression by RT-PCR and transcriptional reporter assays. RESULTS The Slc16a10 gene, encoding the thyroid hormone transmembrane transporter MCT10, was identified as a gene with a transcriptional enhancer directly bound by Six1 and requiring Six1 activity for full expression in adult mouse tibialis anterior, a predominantly fast-twitch muscle. Of the various thyroid hormone transporters, MCT10 mRNA was found to be the most abundant in skeletal muscle, and to have a stronger expression in fast-twitch compared to slow-twitch muscle groups. Loss-of-function of MCT10 in the tibialis anterior recapitulated the effect of Six1 on the expression of fast-twitch muscle genes and led to lower activity of a thyroid hormone receptor-dependent reporter gene. CONCLUSIONS These results shed light on the molecular mechanisms controlling the tissue expression profile of MCT10 and identify modulation of the thyroid hormone signaling pathway as an additional mechanism by which Six1 influences skeletal muscle metabolism.
Collapse
Affiliation(s)
- John Girgis
- Faculty of Medicine, Department of Biochemistry, Microbiology and Immunology, University of Ottawa, 451 Smyth Road, Ottawa, Ontario, K1H 8M5, Canada.,Ottawa Institute of Systems Biology, Ottawa, Ontario, Canada.,Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Dabo Yang
- Faculty of Medicine, Department of Biochemistry, Microbiology and Immunology, University of Ottawa, 451 Smyth Road, Ottawa, Ontario, K1H 8M5, Canada.,Ottawa Institute of Systems Biology, Ottawa, Ontario, Canada
| | - Imane Chakroun
- Faculty of Medicine, Department of Biochemistry, Microbiology and Immunology, University of Ottawa, 451 Smyth Road, Ottawa, Ontario, K1H 8M5, Canada.,Ottawa Institute of Systems Biology, Ottawa, Ontario, Canada
| | - Yubing Liu
- Faculty of Medicine, Department of Biochemistry, Microbiology and Immunology, University of Ottawa, 451 Smyth Road, Ottawa, Ontario, K1H 8M5, Canada.,Ottawa Institute of Systems Biology, Ottawa, Ontario, Canada
| | - Alexandre Blais
- Faculty of Medicine, Department of Biochemistry, Microbiology and Immunology, University of Ottawa, 451 Smyth Road, Ottawa, Ontario, K1H 8M5, Canada. .,Ottawa Institute of Systems Biology, Ottawa, Ontario, Canada. .,University of Ottawa Centre for Inflammation, Immunity and Infection (CI3), Ottawa, Ontario, Canada.
| |
Collapse
|
8
|
Menzel M, Hurka S, Glasenhardt S, Gogol-Döring A. NoPeak: k-mer-based motif discovery in ChIP-Seq data without peak calling. Bioinformatics 2021; 37:596-602. [PMID: 32991679 DOI: 10.1093/bioinformatics/btaa845] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Accepted: 09/14/2020] [Indexed: 01/30/2023] Open
Abstract
MOTIVATION The discovery of sequence motifs mediating DNA-protein binding usually implies the determination of binding sites using high-throughput sequencing and peak calling. The determination of peaks, however, depends strongly on data quality and is susceptible to noise. RESULTS Here, we present a novel approach to reliably identify transcription factor-binding motifs from ChIP-Seq data without peak detection. By evaluating the distributions of sequencing reads around the different k-mers in the genome, we are able to identify binding motifs in ChIP-Seq data that yield no results in traditional pipelines. AVAILABILITY AND IMPLEMENTATION NoPeak is published under the GNU General Public License and available as a standalone console-based Java application at https://github.com/menzel/nopeak. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Michael Menzel
- MNI, Technische Hochschule Mittelhessen, University of Applied Sciences, Giessen 35390, Germany
| | - Sabine Hurka
- Institute for Insect Biotechnology, Justus Liebig University, Giessen 35392, Germany
| | - Stefan Glasenhardt
- MNI, Technische Hochschule Mittelhessen, University of Applied Sciences, Giessen 35390, Germany
| | - Andreas Gogol-Döring
- MNI, Technische Hochschule Mittelhessen, University of Applied Sciences, Giessen 35390, Germany
| |
Collapse
|
9
|
Nakato R, Sakata T. Methods for ChIP-seq analysis: A practical workflow and advanced applications. Methods 2021; 187:44-53. [PMID: 32240773 DOI: 10.1016/j.ymeth.2020.03.005] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 03/17/2020] [Accepted: 03/18/2020] [Indexed: 12/13/2022] Open
Abstract
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a central method in epigenomic research. Genome-wide analysis of histone modifications, such as enhancer analysis and genome-wide chromatin state annotation, enables systematic analysis of how the epigenomic landscape contributes to cell identity, development, lineage specification, and disease. In this review, we first present a typical ChIP-seq analysis workflow, from quality assessment to chromatin-state annotation. We focus on practical, rather than theoretical, approaches for biological studies. Next, we outline various advanced ChIP-seq applications and introduce several state-of-the-art methods, including prediction of gene expression level and chromatin loops from epigenome data and data imputation. Finally, we discuss recently developed single-cell ChIP-seq analysis methodologies that elucidate the cellular diversity within complex tissues and cancers.
Collapse
Affiliation(s)
- Ryuichiro Nakato
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-0032, Japan.
| | - Toyonori Sakata
- Laboratory of Genome Structure and Function, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-0032, Japan.
| |
Collapse
|
10
|
Affiliation(s)
- Hinrich Gronemeyer
- Department of Functional Genomics and Cancer, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Illkirch, France.,Centre National de la Recherche Scientifique, Illkirch, France.,Institut National de la Santé et de la Recherche Médicale, Illkirch, France.,Université de Strasbourg, Illkirch, France.,International Journal of Cancer, Heidelberg, Germany
| | | |
Collapse
|
11
|
Fang Y, Xu X, Ding J, Yang L, Doan MT, Karmaus PWF, Snyder NW, Zhao Y, Li JL, Li X. Histone crotonylation promotes mesoendodermal commitment of human embryonic stem cells. Cell Stem Cell 2021; 28:748-763.e7. [PMID: 33450185 DOI: 10.1016/j.stem.2020.12.009] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Revised: 08/20/2020] [Accepted: 12/15/2020] [Indexed: 12/17/2022]
Abstract
Histone crotonylation is a non-acetyl histone lysine modification that is as widespread as acetylation. However, physiological functions associated with histone crotonylation remain almost completely unknown. Here we report that histone crotonylation is crucial for endoderm differentiation. We demonstrate that key crotonyl-coenzyme A (CoA)-producing enzymes are specifically induced in endodermal cells during differentiation of human embryonic stem cells (hESCs) in vitro and in mouse embryos, where they function to increase histone crotonylation and enhance endodermal gene expression. Chemical enhancement of histone crotonylation promotes endoderm differentiation of hESCs, whereas deletion of crotonyl-CoA-producing enzymes reduces histone crotonylation and impairs meso/endoderm differentiation in vitro and in vivo. Our study uncovers a histone crotonylation-mediated mechanism that promotes endodermal commitment of pluripotent stem cells, which may have important implications for therapeutic strategies against a number of human diseases.
Collapse
Affiliation(s)
- Yi Fang
- Signal Transduction Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA.
| | - Xiaojiang Xu
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| | - Jun Ding
- Ben May Department for Cancer Research, The University of Chicago, Chicago, IL 60637, USA
| | - Lu Yang
- Ben May Department for Cancer Research, The University of Chicago, Chicago, IL 60637, USA
| | - Mary T Doan
- Center for Metabolic Disease Research, Department of Microbiology and Immunology, Lewis Katz School of Medicine at Temple University, Philadelphia, PA 19140, USA
| | - Peer W F Karmaus
- Immunity, Inflammation, and Disease Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| | - Nathaniel W Snyder
- Center for Metabolic Disease Research, Department of Microbiology and Immunology, Lewis Katz School of Medicine at Temple University, Philadelphia, PA 19140, USA
| | - Yingming Zhao
- Ben May Department for Cancer Research, The University of Chicago, Chicago, IL 60637, USA
| | - Jian-Liang Li
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| | - Xiaoling Li
- Signal Transduction Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA.
| |
Collapse
|
12
|
Halstead MM, Kern C, Saelao P, Wang Y, Chanthavixay G, Medrano JF, Van Eenennaam AL, Korf I, Tuggle CK, Ernst CW, Zhou H, Ross PJ. A comparative analysis of chromatin accessibility in cattle, pig, and mouse tissues. BMC Genomics 2020; 21:698. [PMID: 33028202 PMCID: PMC7541309 DOI: 10.1186/s12864-020-07078-9] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 09/17/2020] [Indexed: 12/25/2022] Open
Abstract
Background Although considerable progress has been made towards annotating the noncoding portion of the human and mouse genomes, regulatory elements in other species, such as livestock, remain poorly characterized. This lack of functional annotation poses a substantial roadblock to agricultural research and diminishes the value of these species as model organisms. As active regulatory elements are typically characterized by chromatin accessibility, we implemented the Assay for Transposase Accessible Chromatin (ATAC-seq) to annotate and characterize regulatory elements in pigs and cattle, given a set of eight adult tissues. Results Overall, 306,304 and 273,594 active regulatory elements were identified in pig and cattle, respectively. 71,478 porcine and 47,454 bovine regulatory elements were highly tissue-specific and were correspondingly enriched for binding motifs of known tissue-specific transcription factors. However, in every tissue the most prevalent accessible motif corresponded to the insulator CTCF, suggesting pervasive involvement in 3-D chromatin organization. Taking advantage of a similar dataset in mouse, open chromatin in pig, cattle, and mice were compared, revealing that the conservation of regulatory elements, in terms of sequence identity and accessibility, was consistent with evolutionary distance; whereas pig and cattle shared about 20% of accessible sites, mice and ungulates only had about 10% of accessible sites in common. Furthermore, conservation of accessibility was more prevalent at promoters than at intergenic regions. Conclusions The lack of conserved accessibility at distal elements is consistent with rapid evolution of enhancers, and further emphasizes the need to annotate regulatory elements in individual species, rather than inferring elements based on homology. This atlas of chromatin accessibility in cattle and pig constitutes a substantial step towards annotating livestock genomes and dissecting the regulatory link between genome and phenome.
Collapse
Affiliation(s)
- Michelle M Halstead
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA
| | - Colin Kern
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA
| | - Perot Saelao
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA
| | - Ying Wang
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA
| | - Ganrea Chanthavixay
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA
| | - Juan F Medrano
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA
| | | | - Ian Korf
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA
| | | | - Catherine W Ernst
- Department of Animal Science, Michigan State University, East Lansing, 48824, MI, USA
| | - Huaijun Zhou
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA.
| | - Pablo J Ross
- Department of Animal Science, University of California Davis, Davis, CA, 95616, USA.
| |
Collapse
|
13
|
Anzawa H, Yamagata H, Kinoshita K. Theoretical characterisation of strand cross-correlation in ChIP-seq. BMC Bioinformatics 2020; 21:417. [PMID: 32962634 PMCID: PMC7510163 DOI: 10.1186/s12859-020-03729-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 08/31/2020] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND Strand cross-correlation profiles are used for both peak calling pre-analysis and quality control (QC) in chromatin immunoprecipitation followed by sequencing (ChIP-seq) analysis. Despite its potential for robust and accurate assessments of signal-to-noise ratio (S/N) because of its peak calling independence, it remains unclear what aspects of quality such strand cross-correlation profiles actually measure. RESULTS We introduced a simple model to simulate the mapped read-density of ChIP-seq and then derived the theoretical maximum and minimum of cross-correlation coefficients between strands. The results suggest that the maximum coefficient of typical ChIP-seq samples is directly proportional to the number of total mapped reads and the square of the ratio of signal reads, and inversely proportional to the number of peaks and the length of read-enriched regions. Simulation analysis supported our results and evaluation using 790 ChIP-seq data obtained from the public database demonstrated high consistency between calculated cross-correlation coefficients and estimated coefficients based on the theoretical relations and peak calling results. In addition, we found that the mappability-bias-correction improved sensitivity, enabling differentiation of maximum coefficients from the noise level. Based on these insights, we proposed virtual S/N (VSN), a novel peak call-free metric for S/N assessment. We also developed PyMaSC, a tool to calculate strand cross-correlation and VSN efficiently. VSN achieved most consistent S/N estimation for various ChIP targets and sequencing read depths. Furthermore, we demonstrated that a combination of VSN and pre-existing peak calling results enable the estimation of the numbers of detectable peaks for posterior experiments and assess peak calling results. CONCLUSIONS We present the first theoretical insights into the strand cross-correlation, and the results reveal the potential and the limitations of strand cross-correlation analysis. Our quality assessment framework using VSN provides peak call-independent QC and will help in the evaluation of peak call analysis in ChIP-seq experiments.
Collapse
Affiliation(s)
- Hayato Anzawa
- Graduate School of Information Sciences, Tohoku University, Sendai, Miyagi, Japan
| | - Hitoshi Yamagata
- Advanced Research Laboratory, Canon Medical Systems Corporation, Otawara, Tochigi, Japan
| | - Kengo Kinoshita
- Graduate School of Information Sciences, Tohoku University, Sendai, Miyagi, Japan. .,Advanced Research Laboratory, Canon Medical Systems Corporation, Otawara, Tochigi, Japan. .,Tohoku Medical Megabank Organization, Sendai, Miyagi, Japan. .,Institute of Development, Aging and Cancer, Tohoku University, Sendai, Miyagi, Japan.
| |
Collapse
|
14
|
Nakato R, Wada Y, Nakaki R, Nagae G, Katou Y, Tsutsumi S, Nakajima N, Fukuhara H, Iguchi A, Kohro T, Kanki Y, Saito Y, Kobayashi M, Izumi-Taguchi A, Osato N, Tatsuno K, Kamio A, Hayashi-Takanaka Y, Wada H, Ohta S, Aikawa M, Nakajima H, Nakamura M, McGee RC, Heppner KW, Kawakatsu T, Genno M, Yanase H, Kume H, Senbonmatsu T, Homma Y, Nishimura S, Mitsuyama T, Aburatani H, Kimura H, Shirahige K. Comprehensive epigenome characterization reveals diverse transcriptional regulation across human vascular endothelial cells. Epigenetics Chromatin 2019; 12:77. [PMID: 31856914 PMCID: PMC6921469 DOI: 10.1186/s13072-019-0319-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 12/03/2019] [Indexed: 01/19/2023] Open
Abstract
Background Endothelial cells (ECs) make up the innermost layer throughout the entire vasculature. Their phenotypes and physiological functions are initially regulated by developmental signals and extracellular stimuli. The underlying molecular mechanisms responsible for the diverse phenotypes of ECs from different organs are not well understood. Results To characterize the transcriptomic and epigenomic landscape in the vascular system, we cataloged gene expression and active histone marks in nine types of human ECs (generating 148 genome-wide datasets) and carried out a comprehensive analysis with chromatin interaction data. We developed a robust procedure for comparative epigenome analysis that circumvents variations at the level of the individual and technical noise derived from sample preparation under various conditions. Through this approach, we identified 3765 EC-specific enhancers, some of which were associated with disease-associated genetic variations. We also identified various candidate marker genes for each EC type. We found that the nine EC types can be divided into two subgroups, corresponding to those with upper-body origins and lower-body origins, based on their epigenomic landscape. Epigenomic variations were highly correlated with gene expression patterns, but also provided unique information. Most of the deferentially expressed genes and enhancers were cooperatively enriched in more than one EC type, suggesting that the distinct combinations of multiple genes play key roles in the diverse phenotypes across EC types. Notably, many homeobox genes were differentially expressed across EC types, and their expression was correlated with the relative position of each organ in the body. This reflects the developmental origins of ECs and their roles in angiogenesis, vasculogenesis and wound healing. Conclusions This comprehensive analysis of epigenome characterization of EC types reveals diverse transcriptional regulation across human vascular systems. These datasets provide a valuable resource for understanding the vascular system and associated diseases.
Collapse
Affiliation(s)
- Ryuichiro Nakato
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, 113-0032, Japan.,Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan
| | - Youichiro Wada
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan. .,Isotope Science Center, The University of Tokyo, Tokyo, 113-0032, Japan.
| | - Ryo Nakaki
- Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Genta Nagae
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan.,Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Yuki Katou
- Laboratory of Genome Structure and Function, Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, 113-0032, Japan
| | - Shuichi Tsutsumi
- Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Natsu Nakajima
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, 113-0032, Japan
| | - Hiroshi Fukuhara
- Department of Urology, Graduate School of Medicine, The University of Tokyo, Tokyo, 113-8655, Japan
| | - Atsushi Iguchi
- Department of Cardiovascular Surgery, Saitama Medical University International Medical Center, Saitama, 350-1298, Japan
| | - Takahide Kohro
- Department of Clinical Informatics, Jichi Medical University School of Medicine, Shimotsuke, 329-0498, Japan
| | - Yasuharu Kanki
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan.,Isotope Science Center, The University of Tokyo, Tokyo, 113-0032, Japan
| | - Yutaka Saito
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan.,Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo, 135-0064, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 3-4-1 Okubo, Shinjuku-ku, Tokyo, 169-8555, Japan
| | - Mika Kobayashi
- Isotope Science Center, The University of Tokyo, Tokyo, 113-0032, Japan
| | | | - Naoki Osato
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan.,Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Kenji Tatsuno
- Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Asuka Kamio
- Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Yoko Hayashi-Takanaka
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan.,Cell Biology Center, Institute of Innovative Research, Tokyo Institute of Technology, Yokohama, 226-8503, Japan
| | - Hiromi Wada
- Isotope Science Center, The University of Tokyo, Tokyo, 113-0032, Japan.,Brain Attack Center, Ohta Memorial Hospital, Fukuyama, 720-0825, Japan
| | - Shinzo Ohta
- Brain Attack Center, Ohta Memorial Hospital, Fukuyama, 720-0825, Japan
| | - Masanori Aikawa
- The Center for Excellence in Vascular Biology and the Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division and Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Hiroyuki Nakajima
- Department of Cardiovascular Surgery, Saitama Medical University International Medical Center, Saitama, 350-1298, Japan
| | - Masaki Nakamura
- Department of Urology, Graduate School of Medicine, The University of Tokyo, Tokyo, 113-8655, Japan
| | | | | | - Tatsuo Kawakatsu
- Bio-Medical Department, Kurabo Industries Ltd., Neyagawa, Osaka, 572-0823, Japan
| | - Michiru Genno
- Bio-Medical Department, Kurabo Industries Ltd., Neyagawa, Osaka, 572-0823, Japan
| | - Hiroshi Yanase
- Bio-Medical Department, Kurabo Industries Ltd., Neyagawa, Osaka, 572-0823, Japan
| | - Haruki Kume
- Department of Urology, Graduate School of Medicine, The University of Tokyo, Tokyo, 113-8655, Japan
| | - Takaaki Senbonmatsu
- Department of Cardiology, Saitama Medical University International Medical Center, Saitama, 350-1298, Japan
| | - Yukio Homma
- Department of Urology, Graduate School of Medicine, The University of Tokyo, Tokyo, 113-8655, Japan
| | - Shigeyuki Nishimura
- Department of Cardiology, Saitama Medical University International Medical Center, Saitama, 350-1298, Japan
| | - Toutai Mitsuyama
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan.,Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo, 135-0064, Japan
| | - Hiroyuki Aburatani
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan.,Genome Science Division, Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, 153-8904, Japan
| | - Hiroshi Kimura
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan. .,Cell Biology Center, Institute of Innovative Research, Tokyo Institute of Technology, Yokohama, 226-8503, Japan. .,Laboratory of Functional Nuclear Imaging, Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, 113-0032, Japan.
| | - Katsuhiko Shirahige
- Japan Agency for Medical Research and Development (AMED-CREST), AMED, 1-7-1 Otemachi, Chiyoda-ku, Tokyo, 100-0004, Japan. .,Laboratory of Genome Structure and Function, Institute for Quantitative Biosciences, The University of Tokyo, Tokyo, 113-0032, Japan.
| |
Collapse
|