1
|
Chin IM, Gardell ZA, Corces MR. Decoding polygenic diseases: advances in noncoding variant prioritization and validation. Trends Cell Biol 2024; 34:465-483. [PMID: 38719704 DOI: 10.1016/j.tcb.2024.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 03/12/2024] [Accepted: 03/21/2024] [Indexed: 06/09/2024]
Abstract
Genome-wide association studies (GWASs) provide a key foundation for elucidating the genetic underpinnings of common polygenic diseases. However, these studies have limitations in their ability to assign causality to particular genetic variants, especially those residing in the noncoding genome. Over the past decade, technological and methodological advances in both analytical and empirical prioritization of noncoding variants have enabled the identification of causative variants by leveraging orthogonal functional evidence at increasing scale. In this review, we present an overview of these approaches and describe how this workflow provides the groundwork necessary to move beyond associations toward genetically informed studies on the molecular and cellular mechanisms of polygenic disease.
Collapse
Affiliation(s)
- Iris M Chin
- Gladstone Institute of Neurological Disease, Gladstone Institutes, San Francisco, CA, USA; Gladstone Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA; Department of Neurology, University of California San Francisco, San Francisco, CA, USA
| | - Zachary A Gardell
- Gladstone Institute of Neurological Disease, Gladstone Institutes, San Francisco, CA, USA; Gladstone Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA; Department of Neurology, University of California San Francisco, San Francisco, CA, USA
| | - M Ryan Corces
- Gladstone Institute of Neurological Disease, Gladstone Institutes, San Francisco, CA, USA; Gladstone Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA; Department of Neurology, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
2
|
Ling Z, Li J, Jiang T, Zhang Z, Zhu Y, Zhou Z, Yang J, Tong X, Yang B, Huang L. Omics-based construction of regulatory variants can be applied to help decipher pig liver-related traits. Commun Biol 2024; 7:381. [PMID: 38553586 PMCID: PMC10980749 DOI: 10.1038/s42003-024-06050-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Accepted: 03/14/2024] [Indexed: 04/02/2024] Open
Abstract
Genetic variants can influence complex traits by altering gene expression through changes to regulatory elements. However, the genetic variants that affect the activity of regulatory elements in pigs are largely unknown, and the extent to which these variants influence gene expression and contribute to the understanding of complex phenotypes remains unclear. Here, we annotate 90,991 high-quality regulatory elements using acetylation of histone H3 on lysine 27 (H3K27ac) ChIP-seq of 292 pig livers. Combined with genome resequencing and RNA-seq data, we identify 28,425 H3K27ac quantitative trait loci (acQTLs) and 12,250 expression quantitative trait loci (eQTLs). Through the allelic imbalance analysis, we validate two causative acQTL variants in independent datasets. We observe substantial sharing of genetic controls between gene expression and H3K27ac, particularly within promoters. We infer that 46% of H3K27ac exhibit a concomitant rather than causative relationship with gene expression. By integrating GWAS, eQTLs, acQTLs, and transcription factor binding prediction, we further demonstrate their application, through metabolites dulcitol, phosphatidylcholine (PC) (16:0/16:0) and published phenotypes, in identifying likely causal variants and genes, and discovering sub-threshold GWAS loci. We provide insight into the relationship between regulatory elements and gene expression, and the genetic foundation for dissecting the molecular mechanism of phenotypes.
Collapse
Affiliation(s)
- Ziqi Ling
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China.
| | - Jing Li
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China
| | - Tao Jiang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China
| | - Zhen Zhang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China
| | - Yaling Zhu
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China
| | - Zhimin Zhou
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China
| | - Jiawen Yang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China
| | - Xinkai Tong
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China
| | - Bin Yang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China.
| | - Lusheng Huang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, P.R. China.
| |
Collapse
|
3
|
Wang Q, Zhang J, Wen Y, Qi S, Duan Y, Liu Q, Li C. The pleiotropic enhancer enh9 promotes cell proliferation and migration in non-small cell lung cancer via ERMP1 and PD-L1. Biochim Biophys Acta Mol Basis Dis 2024; 1870:167015. [PMID: 38182069 DOI: 10.1016/j.bbadis.2023.167015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2023] [Revised: 12/27/2023] [Accepted: 12/28/2023] [Indexed: 01/07/2024]
Abstract
Enhancers, cis-acting DNA elements for transcriptional regulation, are important regulators of cell identity and disease. However, of the hundreds of thousands of enhancers annotated in the human genome, only a few have been studied for their regulatory mechanisms and functions in cancer progression and therapeutic resistance. Here, we report the pleiotropy of one enhancer (named enh9) in both cell proliferation and migration in non-small cell lung cancer (NSCLC) cells. By integrating multi-genomic data, ERMP1 and PD-L1 were screened out as potential targets of enh9. CUT&Tag sequencing demonstrated that enh9 was involved in the genomic interactions between the transcription factor RELA and the promoters of ERMP1 and PD-L1. In addition, ERMP1 and PD-L1 were validated to be involved in cell proliferation and migration, respectively. Our study fully elucidated the function and transcriptional regulation mechanisms of enh9 in NSCLC. The exploration on enhancers is promising to provide new insights for cancer diagnosis and therapy.
Collapse
Affiliation(s)
- Qilin Wang
- School of Engineering Medicine, Beihang University, Beijing 100191, China; Key Laboratory of Big Data-Based Precision Medicine (Ministry of Industry and Information Technology), Beihang University, Beijing 100191, China; School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China
| | - Junyou Zhang
- School of Engineering Medicine, Beihang University, Beijing 100191, China; Key Laboratory of Big Data-Based Precision Medicine (Ministry of Industry and Information Technology), Beihang University, Beijing 100191, China; School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China
| | - Yanling Wen
- Institute for Hepatology, National Clinical Research Center for Infectious Disease, Shenzhen Third People's Hospital, Shenzhen, Guangdong 518112, China; The Second Affiliated Hospital, School of Medicine, Southern University of Science and Technology, Shenzhen, Guangdong 518112, China
| | - Sihan Qi
- School of Engineering Medicine, Beihang University, Beijing 100191, China; Key Laboratory of Big Data-Based Precision Medicine (Ministry of Industry and Information Technology), Beihang University, Beijing 100191, China; School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China
| | - Yingying Duan
- School of Engineering Medicine, Beihang University, Beijing 100191, China; Key Laboratory of Big Data-Based Precision Medicine (Ministry of Industry and Information Technology), Beihang University, Beijing 100191, China; School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China
| | - Qian Liu
- School of Engineering Medicine, Beihang University, Beijing 100191, China; Key Laboratory of Big Data-Based Precision Medicine (Ministry of Industry and Information Technology), Beihang University, Beijing 100191, China; School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China
| | - Chunyan Li
- School of Engineering Medicine, Beihang University, Beijing 100191, China; Key Laboratory of Big Data-Based Precision Medicine (Ministry of Industry and Information Technology), Beihang University, Beijing 100191, China; School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China; Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, Beihang University, Beijing 100191, China.
| |
Collapse
|
4
|
Fadri MTM, Lee JB, Keung AJ. Summary of ChIP-Seq Methods and Description of an Optimized ChIP-Seq Protocol. Methods Mol Biol 2024; 2842:419-447. [PMID: 39012609 DOI: 10.1007/978-1-0716-4051-7_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]
Abstract
Chromatin immunoprecipitation (ChIP) is an invaluable method to characterize interactions between proteins and genomic DNA, such as the genomic localization of transcription factors and post-translational modification of histones. DNA and proteins are reversibly and covalently crosslinked using formaldehyde. Then the cells are lysed to release the chromatin. The chromatin is fragmented into smaller sizes either by micrococcal nuclease (MN) or sonication and then purified from other cellular components. The protein-DNA complexes are enriched by immunoprecipitation (IP) with antibodies that target the epitope of interest. The DNA is released from the proteins by heat and protease treatment, followed by degradation of contaminating RNAs with RNase. The resulting DNA is analyzed using various methods, including polymerase chain reaction (PCR), quantitative PCR (qPCR), or sequencing. This protocol outlines each of these steps for both yeast and human cells. This chapter includes a contextual discussion of the combination of ChIP with DNA analysis methods such as ChIP-on-Chip, ChIP-qPCR, and ChIP-Seq, recent updates on ChIP-Seq data analysis pipelines, complementary methods for identification of binding sites of DNA binding proteins, and additional protocol information about ChIP-qPCR and ChIP-Seq.
Collapse
Affiliation(s)
- Maria Theresa M Fadri
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA.
| | - Jessica B Lee
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| | - Albert J Keung
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
5
|
Sigauke RF, Sanford L, Maas ZL, Jones T, Stanley JT, Townsend HA, Allen MA, Dowell RD. Atlas of nascent RNA transcripts reveals enhancer to gene linkages. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570626. [PMID: 38105978 PMCID: PMC10723487 DOI: 10.1101/2023.12.07.570626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Gene transcription is controlled and modulated by regulatory regions, including enhancers and promoters. These regions are abundant in unstable, non-coding bidirectional transcription. Using nascent RNA transcription data across hundreds of human samples, we identified over 800,000 regions containing bidirectional transcription. We then identify highly correlated transcription between bidirectional and gene regions. The identified correlated pairs, a bidirectional region and a gene, are enriched for disease associated SNPs and often supported by independent 3D data. We present these resources as an SQL database which serves as a resource for future studies into gene regulation, enhancer associated RNAs, and transcription factors.
Collapse
Affiliation(s)
- Rutendo F. Sigauke
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
| | - Lynn Sanford
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
| | - Zachary L. Maas
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
- Computer Science, University of Colorado Boulder, 1111 Engineering Drive, UCB 430, Boulder, 80309, CO, USA
| | - Taylor Jones
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
| | - Jacob T. Stanley
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
| | - Hope A. Townsend
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
- Molecular, Cellular and Developmental Biology, University of Colorado Boulder, 1945 Colorado Ave, UCB 347, Boulder, 80309, CO, USA
| | - Mary A. Allen
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
| | - Robin D. Dowell
- BioFrontiers Institute, University of Colorado Boulder, 3415 Colorado Ave., UCB 596, Boulder, 80309, CO, USA
- Computer Science, University of Colorado Boulder, 1111 Engineering Drive, UCB 430, Boulder, 80309, CO, USA
- Molecular, Cellular and Developmental Biology, University of Colorado Boulder, 1945 Colorado Ave, UCB 347, Boulder, 80309, CO, USA
| |
Collapse
|
6
|
Li Y, Ju F, Chen Z, Qu Y, Xia H, He L, Wu L, Zhu J, Shao B, Deng P. CREaTor: zero-shot cis-regulatory pattern modeling with attention mechanisms. Genome Biol 2023; 24:266. [PMID: 37996959 PMCID: PMC10666311 DOI: 10.1186/s13059-023-03103-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 11/03/2023] [Indexed: 11/25/2023] Open
Abstract
Linking cis-regulatory sequences to target genes has been a long-standing challenge. In this study, we introduce CREaTor, an attention-based deep neural network designed to model cis-regulatory patterns for genomic elements up to 2 Mb from target genes. Coupled with a training strategy that predicts gene expression from flanking candidate cis-regulatory elements (cCREs), CREaTor can model cell type-specific cis-regulatory patterns in new cell types without prior knowledge of cCRE-gene interactions or additional training. The zero-shot modeling capability, combined with the use of only RNA-seq and ChIP-seq data, allows for the ready generalization of CREaTor to a broad range of cell types.
Collapse
Affiliation(s)
- Yongge Li
- Microsoft Research AI4Science, Beijing, China
- School of Medicine, Tsinghua University, Beijing, China
| | - Fusong Ju
- Microsoft Research AI4Science, Beijing, China
| | - Zhiyuan Chen
- Microsoft Research AI4Science, Beijing, China
- School of Computing, Australian National University, Canberra, Australia
| | - Yiming Qu
- Microsoft Research AI4Science, Beijing, China
- School of Life Sciences, Tsinghua University, Beijing, China
| | | | - Liang He
- Microsoft Research AI4Science, Beijing, China
| | - Lijun Wu
- Microsoft Research AI4Science, Beijing, China
| | - Jianwei Zhu
- Microsoft Research AI4Science, Beijing, China
| | - Bin Shao
- Microsoft Research AI4Science, Beijing, China
| | - Pan Deng
- Microsoft Research AI4Science, Beijing, China.
| |
Collapse
|
7
|
Umarov R, Hon CC. Enhancer target prediction: state-of-the-art approaches and future prospects. Biochem Soc Trans 2023; 51:1975-1988. [PMID: 37830459 DOI: 10.1042/bst20230917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/02/2023] [Accepted: 10/02/2023] [Indexed: 10/14/2023]
Abstract
Enhancers are genomic regions that regulate gene transcription and are located far away from the transcription start sites of their target genes. Enhancers are highly enriched in disease-associated variants and thus deciphering the interactions between enhancers and genes is crucial to understanding the molecular basis of genetic predispositions to diseases. Experimental validations of enhancer targets can be laborious. Computational methods have thus emerged as a valuable alternative for studying enhancer-gene interactions. A variety of computational methods have been developed to predict enhancer targets by incorporating genomic features (e.g. conservation, distance, and sequence), epigenomic features (e.g. histone marks and chromatin contacts) and activity measurements (e.g. covariations of enhancer activity and gene expression). With the recent advances in genome perturbation and chromatin conformation capture technologies, data on experimentally validated enhancer targets are becoming available for supervised training of these methods and evaluation of their performance. In this review, we categorize enhancer target prediction methods based on their rationales and approaches. Then we discuss their merits and limitations and highlight the future directions for enhancer targets prediction.
Collapse
Affiliation(s)
- Ramzan Umarov
- RIKEN Centre for Integrative Medical Sciences, Yokohama RIKEN Institute, Yokohama, Japan
| | - Chung-Chau Hon
- RIKEN Centre for Integrative Medical Sciences, Yokohama RIKEN Institute, Yokohama, Japan
| |
Collapse
|
8
|
Baur B, Roy S. Predicting patient-specific enhancer-promoter interactions. CELL REPORTS METHODS 2023; 3:100594. [PMID: 37751694 PMCID: PMC10545932 DOI: 10.1016/j.crmeth.2023.100594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 08/30/2023] [Accepted: 08/30/2023] [Indexed: 09/28/2023]
Abstract
Computational methods that can predict hard-to-measure modalities from those that are easier to measure, in a patient-specific manner, play a critical role in personalized medicine. In this issue of Cell Reports Methods, Khurana et al. present differential gene targets of accessible chromatin (DGTAC), an approach which predicts patient-specific enhancer-promoter interactions.
Collapse
Affiliation(s)
- Brittany Baur
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, WI 53715, USA; The Max Harry Weil Institute of Critical Care Research & Innovation, University of Michigan, Ann Arbor, MI, USA; Department of Emergency Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Sushmita Roy
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, WI 53715, USA; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53715, USA.
| |
Collapse
|
9
|
Bevill SM, Casaní-Galdón S, El Farran CA, Cytrynbaum EG, Macias KA, Oldeman SE, Oliveira KJ, Moore MM, Hegazi E, Adriaens C, Najm FJ, Demetri GD, Cohen S, Mullen JT, Riggi N, Johnstone SE, Bernstein BE. Impact of supraphysiologic MDM2 expression on chromatin networks and therapeutic responses in sarcoma. CELL GENOMICS 2023; 3:100321. [PMID: 37492096 PMCID: PMC10363746 DOI: 10.1016/j.xgen.2023.100321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 03/09/2023] [Accepted: 04/14/2023] [Indexed: 07/27/2023]
Abstract
Amplification of MDM2 on supernumerary chromosomes is a common mechanism of P53 inactivation across tumors. Here, we investigated the impact of MDM2 overexpression on chromatin, gene expression, and cellular phenotypes in liposarcoma. Three independent regulatory circuits predominate in aggressive, dedifferentiated tumors. RUNX and AP-1 family transcription factors bind mesenchymal gene enhancers. P53 and MDM2 co-occupy enhancers and promoters associated with P53 signaling. When highly expressed, MDM2 also binds thousands of P53-independent growth and stress response genes, whose promoters engage in multi-way topological interactions. Overexpressed MDM2 concentrates within nuclear foci that co-localize with PML and YY1 and could also contribute to P53-independent phenotypes associated with supraphysiologic MDM2. Importantly, we observe striking cell-to-cell variability in MDM2 copy number and expression in tumors and models. Whereas liposarcoma cells are generally sensitive to MDM2 inhibitors and their combination with pro-apoptotic drugs, MDM2-high cells tolerate them and may underlie the poor clinical efficacy of these agents.
Collapse
Affiliation(s)
- Samantha M. Bevill
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Salvador Casaní-Galdón
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Chadi A. El Farran
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Eli G. Cytrynbaum
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
- Department of Pathology and Center for Cancer Research, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Kevin A. Macias
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Sylvie E. Oldeman
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Kayla J. Oliveira
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Pathology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Molly M. Moore
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Esmat Hegazi
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
- Department of Pathology and Center for Cancer Research, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Carmen Adriaens
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Fadi J. Najm
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - George D. Demetri
- Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA
- Ludwig Center at Harvard, Harvard Medical School, Boston, MA 02115, USA
| | - Sonia Cohen
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
- Department of Surgery, Massachusetts General Hospital, Boston, MA 02114, USA
| | - John T. Mullen
- Department of Surgery, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Nicolò Riggi
- Department of Cell and Tissue Genomics (CTG), Genentech Inc, South San Francisco, CA 94080, USA
| | - Sarah E. Johnstone
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Pathology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Bradley E. Bernstein
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Departments of Cell Biology and Pathology, Harvard Medical School, Boston, MA 02115, USA
- Ludwig Center at Harvard, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
10
|
Heer M, Giudice L, Mengoni C, Giugno R, Rico D. Esearch3D: propagating gene expression in chromatin networks to illuminate active enhancers. Nucleic Acids Res 2023; 51:e55. [PMID: 37021559 PMCID: PMC10250221 DOI: 10.1093/nar/gkad229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 03/06/2023] [Accepted: 04/03/2023] [Indexed: 04/07/2023] Open
Abstract
Most cell type-specific genes are regulated by the interaction of enhancers with their promoters. The identification of enhancers is not trivial as enhancers are diverse in their characteristics and dynamic in their interaction partners. We present Esearch3D, a new method that exploits network theory approaches to identify active enhancers. Our work is based on the fact that enhancers act as a source of regulatory information to increase the rate of transcription of their target genes and that the flow of this information is mediated by the folding of chromatin in the three-dimensional (3D) nuclear space between the enhancer and the target gene promoter. Esearch3D reverse engineers this flow of information to calculate the likelihood of enhancer activity in intergenic regions by propagating the transcription levels of genes across 3D genome networks. Regions predicted to have high enhancer activity are shown to be enriched in annotations indicative of enhancer activity. These include: enhancer-associated histone marks, bidirectional CAGE-seq, STARR-seq, P300, RNA polymerase II and expression quantitative trait loci (eQTLs). Esearch3D leverages the relationship between chromatin architecture and transcription, allowing the prediction of active enhancers and an understanding of the complex underpinnings of regulatory networks. The method is available at: https://github.com/InfOmics/Esearch3D and https://doi.org/10.5281/zenodo.7737123.
Collapse
Affiliation(s)
- Maninder Heer
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - Luca Giudice
- Department of Computer Science, University of Verona, Strada le Grazie 15, 37134, Verona, Italy
- A.I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio, Finland
| | - Claudia Mengoni
- Department of Computer Science, University of Verona, Strada le Grazie 15, 37134, Verona, Italy
| | - Rosalba Giugno
- Department of Computer Science, University of Verona, Strada le Grazie 15, 37134, Verona, Italy
| | - Daniel Rico
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| |
Collapse
|
11
|
Hoellinger T, Mestre C, Aschard H, Le Goff W, Foissac S, Faraut T, Djebali S. Enhancer/gene relationships: Need for more reliable genome-wide reference sets. FRONTIERS IN BIOINFORMATICS 2023; 3:1092853. [PMID: 36909938 PMCID: PMC9999192 DOI: 10.3389/fbinf.2023.1092853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 02/07/2023] [Indexed: 02/26/2023] Open
Abstract
Differences in cells' functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in the 3D space of the nucleus. There is increasing evidence that genetic variants associated with common diseases are enriched in enhancers active in cell types relevant to these diseases. Identifying the enhancers associated with genes and conversely, the sets of genes activated by each enhancer (the so-called enhancer/gene or E/G relationships) across cell types, can help understanding the genetic mechanisms underlying human diseases. There are three broad approaches for the genome-wide identification of E/G relationships in a cell type: 1) genetic link methods or eQTL, 2) functional link methods based on 1D functional data such as open chromatin, histone mark or gene expression and 3) spatial link methods based on 3D data such as HiC. Since 1) and 3) are costly, the current strategy is to develop functional link methods and to use data from 1) and 3) as reference to evaluate them. However, there is still no consensus on the best functional link method to date, and method comparison remain seldom. Here, we compared the relative performances of three recent methods for the identification of enhancer-gene links, TargetFinder, Average-Rank, and the ABC model, using the three latest benchmarks from the field: a reference that combines 3D and eQTL data, called BENGI, and two genetic screening references, called CRiFF and CRiSPRi. Overall, none of the three methods performed best on the three references. CRiFF and CRISPRi reference sets are likely more reliable, but CRiFF is not genome-wide and CRiFF and CRISPRi are mostly available on the K562 cancer cell line. The BENGI reference set is genome-wide but likely contains many false positives. This study therefore calls for new reliable and genome-wide E/G reference data rather than new functional link E/G identification methods.
Collapse
Affiliation(s)
- Tristan Hoellinger
- IRSD, Université de Toulouse, INSERM, INRAE, ENVT, Univ Toulouse III - Paul Sabatier (UPS), Toulouse, France
- INSA Toulouse, INP-ENSEEIHT, Toulouse, France
| | - Camille Mestre
- GenPhySE, Université de Toulouse, INRAE, INPT, ENVT, Toulouse, France
| | - Hugues Aschard
- Institut Pasteur, Université Paris Cité, Department of Computational Biology, Paris, France
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, United States
| | - Wilfried Le Goff
- Sorbonne Université, INSERM, Institute of Cardiometabolism and Nutrition (ICAN), UMR_S1166, Paris, France
| | - Sylvain Foissac
- GenPhySE, Université de Toulouse, INRAE, INPT, ENVT, Toulouse, France
| | - Thomas Faraut
- GenPhySE, Université de Toulouse, INRAE, INPT, ENVT, Toulouse, France
| | - Sarah Djebali
- IRSD, Université de Toulouse, INSERM, INRAE, ENVT, Univ Toulouse III - Paul Sabatier (UPS), Toulouse, France
- GenPhySE, Université de Toulouse, INRAE, INPT, ENVT, Toulouse, France
| |
Collapse
|
12
|
Teng M. Statistical Analysis in ChIP-seq-Related Applications. Methods Mol Biol 2023; 2629:169-181. [PMID: 36929078 DOI: 10.1007/978-1-0716-2986-4_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/27/2023]
Abstract
Chromatin immunoprecipitation sequencing (ChIP-seq) has been widely performed to identify protein binding information along the genome. The sequencing protocol is quite flexible and mature to measure different types of protein binding as long as sequencing parameters are properly tailored to accommodate protein features. Two distinct types of protein binding are point-source-like binding by transcription factors and diffused-distribution binding by histone modifications. Consequently, statistical approaches have been proposed to address ChIP-seq-related questions according to different protein features. In this chapter, we briefly summarize statistical principles, approaches, and tools that are widely implemented in modeling ChIP-seq data, from raw data quality control to final result reporting. We discuss the key solutions in addressing eight routine questions in ChIP-seq applications. We also include discussion on approaches fitting unique data features in different ChIP-seq types. We hope this chapter will serve as a brief guide, especially for ChIP-seq beginners, to provide them with a high-level overview to understand and design processing plans for their ChIP-seq experiments.
Collapse
Affiliation(s)
- Mingxiang Teng
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA.
| |
Collapse
|
13
|
Leung AKY, Yao L, Yu H. Functional genomic assays to annotate enhancer-promoter interactions genome wide. Hum Mol Genet 2022; 31:R97-R104. [PMID: 36018818 PMCID: PMC9585677 DOI: 10.1093/hmg/ddac204] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 08/16/2022] [Accepted: 08/17/2022] [Indexed: 11/14/2022] Open
Abstract
Enhancers are pivotal for regulating gene transcription that occurs at promoters. Identification of the interacting enhancer-promoter pairs and understanding the mechanisms behind how they interact and how enhancers modulate transcription can provide fundamental insight into gene regulatory networks. Recently, advances in high-throughput methods in three major areas-chromosome conformation capture assay, such as Hi-C to study basic chromatin architecture, ectopic reporter experiments such as self-transcribing active regulatory region sequencing (STARR-seq) to quantify promoter and enhancer activity, and endogenous perturbations such as clustered regularly interspaced short palindromic repeat interference (CRISPRi) to identify enhancer-promoter compatibility-have further our knowledge about transcription. In this review, we will discuss the major method developments and key findings from these assays.
Collapse
Affiliation(s)
- Alden King-Yung Leung
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Genomics and Proteomics Technology Development (CGPT), Cornell University, Ithaca NY 14853, USA
| | - Li Yao
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Genomics and Proteomics Technology Development (CGPT), Cornell University, Ithaca NY 14853, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA
- Center for Genomics and Proteomics Technology Development (CGPT), Cornell University, Ithaca NY 14853, USA
| |
Collapse
|
14
|
Mulero Hernández J, Fernández-Breis JT. Analysis of the landscape of human enhancer sequences in biological databases. Comput Struct Biotechnol J 2022; 20:2728-2744. [PMID: 35685360 PMCID: PMC9168495 DOI: 10.1016/j.csbj.2022.05.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 05/20/2022] [Accepted: 05/21/2022] [Indexed: 12/01/2022] Open
Abstract
The process of gene regulation extends as a network in which both genetic sequences and proteins are involved. The levels of regulation and the mechanisms involved are multiple. Transcription is the main control mechanism for most genes, being the downstream steps responsible for refining the transcription patterns. In turn, gene transcription is mainly controlled by regulatory events that occur at promoters and enhancers. Several studies are focused on analyzing the contribution of enhancers in the development of diseases and their possible use as therapeutic targets. The study of regulatory elements has advanced rapidly in recent years with the development and use of next generation sequencing techniques. All this information has generated a large volume of information that has been transferred to a growing number of public repositories that store this information. In this article, we analyze the content of those public repositories that contain information about human enhancers with the aim of detecting whether the knowledge generated by scientific research is contained in those databases in a way that could be computationally exploited. The analysis will be based on three main aspects identified in the literature: types of enhancers, type of evidence about the enhancers, and methods for detecting enhancer-promoter interactions. Our results show that no single database facilitates the optimal exploitation of enhancer data, most types of enhancers are not represented in the databases and there is need for a standardized model for enhancers. We have identified major gaps and challenges for the computational exploitation of enhancer data.
Collapse
Affiliation(s)
- Juan Mulero Hernández
- Dept. Informática y Sistemas, Universidad de Murcia, CEIR Campus Mare Nostrum, IMIB-Arrixaca, Spain
| | | |
Collapse
|
15
|
Genome-wide analysis of cis-regulatory changes underlying metabolic adaptation of cavefish. Nat Genet 2022; 54:684-693. [PMID: 35551306 DOI: 10.1038/s41588-022-01049-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 03/09/2022] [Indexed: 12/13/2022]
Abstract
Cis-regulatory changes are key drivers of adaptative evolution. However, their contribution to the metabolic adaptation of organisms is not well understood. Here, we used a unique vertebrate model, Astyanax mexicanus-different morphotypes of which survive in nutrient-rich surface and nutrient-deprived cave waters-to uncover gene regulatory networks underlying metabolic adaptation. We performed genome-wide epigenetic profiling in the liver tissues of Astyanax and found that many of the identified cis-regulatory elements (CREs) have genetically diverged and have differential chromatin features between surface and cave morphotypes, while retaining remarkably similar regulatory signatures between independently derived cave populations. One such CRE in the hpdb gene harbors a genomic deletion in cavefish that abolishes IRF2 repressor binding and derepresses enhancer activity in reporter assays. Selection of this mutation in multiple independent cave populations supports its importance in cave adaptation, and provides novel molecular insights into the evolutionary trade-off between loss of pigmentation and adaptation to food-deprived caves.
Collapse
|
16
|
Qin T, Lee C, Li S, Cavalcante RG, Orchard P, Yao H, Zhang H, Wang S, Patil S, Boyle AP, Sartor MA. Comprehensive enhancer-target gene assignments improve gene set level interpretation of genome-wide regulatory data. Genome Biol 2022; 23:105. [PMID: 35473573 PMCID: PMC9044877 DOI: 10.1186/s13059-022-02668-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 04/06/2022] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Revealing the gene targets of distal regulatory elements is challenging yet critical for interpreting regulome data. Experiment-derived enhancer-gene links are restricted to a small set of enhancers and/or cell types, while the accuracy of genome-wide approaches remains elusive due to the lack of a systematic evaluation. We combined multiple spatial and in silico approaches for defining enhancer locations and linking them to their target genes aggregated across >500 cell types, generating 1860 human genome-wide distal enhancer-to-target gene definitions (EnTDefs). To evaluate performance, we used gene set enrichment (GSE) testing on 87 independent ENCODE ChIP-seq datasets of 34 transcription factors (TFs) and assessed concordance of results with known TF Gene Ontology annotations, and other benchmarks. RESULTS The top ranked 741 (40%) EnTDefs significantly outperform the common, naïve approach of linking distal regions to the nearest genes, and the top 10 EnTDefs perform well when applied to ChIP-seq data of other cell types. The GSE-based ranking of EnTDefs is highly concordant with ranking based on overlap with curated benchmarks of enhancer-gene interactions. Both our top general EnTDef and cell-type-specific EnTDefs significantly outperform seven independent computational and experiment-based enhancer-gene pair datasets. We show that using our top EnTDefs for GSE with either genome-wide DNA methylation or ATAC-seq data is able to better recapitulate the biological processes changed in gene expression data performed in parallel for the same experiment than our lower-ranked EnTDefs. CONCLUSIONS Our findings illustrate the power of our approach to provide genome-wide interpretation regardless of cell type.
Collapse
Affiliation(s)
- Tingting Qin
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA.
| | - Christopher Lee
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Biostatistics, School of Public Health, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Shiting Li
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Raymond G Cavalcante
- Biomedical Research Core Facilities, Epigenomics Core, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Peter Orchard
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Heming Yao
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Hanrui Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Shuze Wang
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Snehal Patil
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Alan P Boyle
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Maureen A Sartor
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA.
- Department of Biostatistics, School of Public Health, University of Michigan Medical School, Ann Arbor, MI, USA.
| |
Collapse
|
17
|
Chua EHZ, Yasar S, Harmston N. The importance of considering regulatory domains in genome-wide analyses - the nearest gene is often wrong! Biol Open 2022; 11:274931. [PMID: 35377406 PMCID: PMC9002814 DOI: 10.1242/bio.059091] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The expression of a large number of genes is regulated by regulatory elements that are located far away from their promoters. Identifying which gene is the target of a specific regulatory element or is affected by a non-coding mutation is often accomplished by assigning these regions to the nearest gene in the genome. However, this heuristic ignores key features of genome organisation and gene regulation; in that the genome is partitioned into regulatory domains, which at some loci directly coincide with the span of topologically associated domains (TADs), and that genes are regulated by enhancers located throughout these regions, even across intervening genes. In this review, we examine the results from genome-wide studies using chromosome conformation capture technologies and from those dissecting individual gene regulatory domains, to highlight that the phenomenon of enhancer skipping is pervasive and affects multiple types of genes. We discuss how simply assigning a genomic region of interest to its nearest gene is problematic and often leads to incorrect predictions and highlight that where possible information on both the conservation and topological organisation of the genome should be used to generate better hypotheses. The article has an associated Future Leader to Watch interview. Summary: Identifying which gene is the target of an enhancer is often accomplished by assigning it to the nearest gene, here we discuss how this heuristic can lead to incorrect predictions.
Collapse
Affiliation(s)
| | - Samen Yasar
- Science Division, Yale-NUS College, Singapore 138527, Singapore
| | - Nathan Harmston
- Science Division, Yale-NUS College, Singapore 138527, Singapore.,Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, Singapore 169857, Singapore
| |
Collapse
|
18
|
Abstract
The Human Genome Project marked a major milestone in the scientific community as it unravelled the ~3 billion bases that are central to crucial aspects of human life. Despite this achievement, it only scratched the surface of understanding how each nucleotide matters, both individually and as part of a larger unit. Beyond the coding genome, which comprises only ~2% of the whole genome, scientists have realized that large portions of the genome, not known to code for any protein, were crucial for regulating the coding genes. These large portions of the genome comprise the 'non-coding genome'. The history of gene regulation mediated by proteins that bind to the regulatory non-coding genome dates back many decades to the 1960s. However, the original definition of 'enhancers' was first used in the early 1980s. In this Review, we summarize benchmark studies that have mapped the role of cardiac enhancers in disease and development. We highlight instances in which enhancer-localized genetic variants explain the missing link to cardiac pathogenesis. Finally, we inspire readers to consider the next phase of exploring enhancer-based gene therapy for cardiovascular disease.
Collapse
|
19
|
Steinauer N, Zhang K, Guo C, Zhang J. Computational Modeling of Gene-Specific Transcriptional Repression, Activation and Chromatin Interactions in Leukemogenesis by LASSO-Regularized Logistic Regression. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2109-2122. [PMID: 33961561 PMCID: PMC8572318 DOI: 10.1109/tcbb.2021.3078128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Many physiological and pathological pathways are dependent on gene-specific on/off regulation of transcription. Some genes are repressed, while others are activated. Although many previous studies have analyzed the mechanisms of gene-specific repression and activation, these studies are mainly based on the use of candidate genes, which are either repressed or activated, without simultaneously comparing and contrasting both groups of genes. There is also insufficient consideration of gene locations. Here we describe an integrated machine learning approach, using LASSO-regularized logistic regression, to model gene-specific repression and activation and the underlying contribution of chromatin interactions. LASSO-regularized logistic regression accurately predicted gene-specific transcriptional events and robustly detected the rate-limiting factors that underlie the differences of gene activation and repression. An example was provided by the leukemogenic transcription factor AML1-ETO, which is responsible for 10-15 percent of all acute myeloid leukemia cases. The analysis of AML1-ETO has also revealed novel networks of chromatin interactions and uncovered an unexpected role for E-proteins in AML1-ETO-p300 interactions and a role for the pre-existing gene state in governing the transcriptional response. Our results show that logistic regression-based probabilistic modeling is a promising tool to decipher mechanisms that integrate gene regulation and chromatin interactions in regulated transcription.
Collapse
|
20
|
Salviato E, Djordjilović V, Hariprakash JM, Tagliaferri I, Pal K, Ferrari F. Leveraging three-dimensional chromatin architecture for effective reconstruction of enhancer-target gene regulatory interactions. Nucleic Acids Res 2021; 49:e97. [PMID: 34197622 PMCID: PMC8464068 DOI: 10.1093/nar/gkab547] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 06/07/2021] [Accepted: 06/17/2021] [Indexed: 12/23/2022] Open
Abstract
A growing amount of evidence in literature suggests that germline sequence variants and somatic mutations in non-coding distal regulatory elements may be crucial for defining disease risk and prognostic stratification of patients, in genetic disorders as well as in cancer. Their functional interpretation is challenging because genome-wide enhancer-target gene (ETG) pairing is an open problem in genomics. The solutions proposed so far do not account for the hierarchy of structural domains which define chromatin three-dimensional (3D) architecture. Here we introduce a change of perspective based on the definition of multi-scale structural chromatin domains, integrated in a statistical framework to define ETG pairs. In this work (i) we develop a computational and statistical framework to reconstruct a comprehensive map of ETG pairs leveraging functional genomics data; (ii) we demonstrate that the incorporation of chromatin 3D architecture information improves ETG pairing accuracy and (iii) we use multiple experimental datasets to extensively benchmark our method against previous solutions for the genome-wide reconstruction of ETG pairs. This solution will facilitate the annotation and interpretation of sequence variants in distal non-coding regulatory elements. We expect this to be especially helpful in clinically oriented applications of whole genome sequencing in cancer and undiagnosed genetic diseases research.
Collapse
Affiliation(s)
- Elisa Salviato
- IFOM, the FIRC Institute of Molecular Oncology, Milan 20139, Italy
| | - Vera Djordjilović
- Department of Economics, Ca’ Foscari University of Venice, Venice 30100, Italy
| | | | | | - Koustav Pal
- IFOM, the FIRC Institute of Molecular Oncology, Milan 20139, Italy
| | - Francesco Ferrari
- IFOM, the FIRC Institute of Molecular Oncology, Milan 20139, Italy
- Institute of Molecular Genetics “Luigi Luca Cavalli-Sforza”, National Research Council, Pavia 27100, Italy
| |
Collapse
|
21
|
Functional Analysis of Non-Genetic Resistance to Platinum in Epithelial Ovarian Cancer Reveals a Role for the MBD3-NuRD Complex in Resistance Development. Cancers (Basel) 2021; 13:cancers13153801. [PMID: 34359703 PMCID: PMC8345099 DOI: 10.3390/cancers13153801] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 07/15/2021] [Accepted: 07/23/2021] [Indexed: 01/04/2023] Open
Abstract
Simple Summary Most epithelial ovarian cancer (EOC) patients, although initially responsive to standard treatment with platinum-based chemotherapy, develop platinum resistance over the clinical course and succumb due to drug-resistant metastases. It has long been hypothesized that resistance to platinum develops as a result of epigenetic changes within tumor cells evolving over time. In this study, we investigated epigenomic changes in EOC patient samples, as well as in cell lines, and showed that profound changes at enhancers result in a platinum-resistant phenotype. Through correlation of the epigenomic alterations with changes in the transcriptome, we could identify potential novel prognostic biomarkers for early patient stratification. Furthermore, we applied a combinatorial RNAi screening approach to identify suitable targets that prevent the enhancer remodeling process. Our results advance the molecular understanding of epigenetic mechanisms in EOC and therapy resistance, which will be essential for the further exploration of epigenetic drug targets and combinatorial treatment regimes. Abstract Epithelial ovarian cancer (EOC) is the most lethal disease of the female reproductive tract, and although most patients respond to the initial treatment with platinum (cPt)-based compounds, relapse is very common. We investigated the role of epigenetic changes in cPt-sensitive and -resistant EOC cell lines and found distinct differences in their enhancer landscape. Clinical data revealed that two genes (JAK1 and FGF10), which gained large enhancer clusters in resistant EOC cell lines, could provide novel biomarkers for early patient stratification with statistical independence for JAK1. To modulate the enhancer remodeling process and prevent the acquisition of cPt resistance in EOC cells, we performed a chromatin-focused RNAi screen in the presence of cPt. We identified subunits of the Nucleosome Remodeling and Deacetylase (NuRD) complex as critical factors sensitizing the EOC cell line A2780 to platinum treatment. Suppression of the Methyl-CpG Binding Domain Protein 3 (MBD3) sensitized cells and prevented the establishment of resistance under prolonged cPt exposure through alterations of H3K27ac at enhancer regions, which are differentially regulated in cPt-resistant cells, leading to a less aggressive phenotype. Our work establishes JAK1 as an independent prognostic marker and the NuRD complex as a potential target for combinational therapy.
Collapse
|
22
|
Nakato R, Sakata T. Methods for ChIP-seq analysis: A practical workflow and advanced applications. Methods 2021; 187:44-53. [PMID: 32240773 DOI: 10.1016/j.ymeth.2020.03.005] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 03/17/2020] [Accepted: 03/18/2020] [Indexed: 12/13/2022] Open
Abstract
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a central method in epigenomic research. Genome-wide analysis of histone modifications, such as enhancer analysis and genome-wide chromatin state annotation, enables systematic analysis of how the epigenomic landscape contributes to cell identity, development, lineage specification, and disease. In this review, we first present a typical ChIP-seq analysis workflow, from quality assessment to chromatin-state annotation. We focus on practical, rather than theoretical, approaches for biological studies. Next, we outline various advanced ChIP-seq applications and introduce several state-of-the-art methods, including prediction of gene expression level and chromatin loops from epigenome data and data imputation. Finally, we discuss recently developed single-cell ChIP-seq analysis methodologies that elucidate the cellular diversity within complex tissues and cancers.
Collapse
Affiliation(s)
- Ryuichiro Nakato
- Laboratory of Computational Genomics, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-0032, Japan.
| | - Toyonori Sakata
- Laboratory of Genome Structure and Function, Institute for Quantitative Biosciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-0032, Japan.
| |
Collapse
|
23
|
Tao H, Li H, Xu K, Hong H, Jiang S, Du G, Wang J, Sun Y, Huang X, Ding Y, Li F, Zheng X, Chen H, Bo X. Computational methods for the prediction of chromatin interaction and organization using sequence and epigenomic profiles. Brief Bioinform 2021; 22:6102668. [PMID: 33454752 PMCID: PMC8424394 DOI: 10.1093/bib/bbaa405] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 11/26/2020] [Accepted: 12/10/2020] [Indexed: 12/14/2022] Open
Abstract
The exploration of three-dimensional chromatin interaction and organization provides insight into mechanisms underlying gene regulation, cell differentiation and disease development. Advances in chromosome conformation capture technologies, such as high-throughput chromosome conformation capture (Hi-C) and chromatin interaction analysis by paired-end tag (ChIA-PET), have enabled the exploration of chromatin interaction and organization. However, high-resolution Hi-C and ChIA-PET data are only available for a limited number of cell lines, and their acquisition is costly, time consuming, laborious and affected by theoretical limitations. Increasing evidence shows that DNA sequence and epigenomic features are informative predictors of regulatory interaction and chromatin architecture. Based on these features, numerous computational methods have been developed for the prediction of chromatin interaction and organization, whereas they are not extensively applied in biomedical study. A systematical study to summarize and evaluate such methods is still needed to facilitate their application. Here, we summarize 48 computational methods for the prediction of chromatin interaction and organization using sequence and epigenomic profiles, categorize them and compare their performance. Besides, we provide a comprehensive guideline for the selection of suitable methods to predict chromatin interaction and organization based on available data and biological question of interest.
Collapse
Affiliation(s)
- Huan Tao
- Beijing Institute of Radiation Medicine
| | - Hao Li
- Beijing Institute of Radiation Medicine
| | - Kang Xu
- Beijing Institute of Radiation Medicine
| | - Hao Hong
- Beijing Institute of Radiation Medicine, Department of Biotechnology
| | - Shuai Jiang
- Beijing Institute of Radiation Medicine, Department of Biotechnology
| | - Guifang Du
- Beijing Institute of Radiation Medicine, Department of Biotechnology
| | | | - Yu Sun
- Beijing Institute of Radiation Medicine, Department of Biotechnology
| | - Xin Huang
- Beijing Institute of Radiation Medicine, Department of Biotechnology
| | - Yang Ding
- Beijing Institute of Radiation Medicine
| | - Fei Li
- Chinese Academy of Sciences, Department of Computer Network Information Center
| | | | | | | |
Collapse
|
24
|
Lidschreiber K, Jung LA, von der Emde H, Dave K, Taipale J, Cramer P, Lidschreiber M. Transcriptionally active enhancers in human cancer cells. Mol Syst Biol 2021; 17:e9873. [PMID: 33502116 PMCID: PMC7838827 DOI: 10.15252/msb.20209873] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 12/11/2020] [Accepted: 12/16/2020] [Indexed: 12/30/2022] Open
Abstract
The growth of human cancer cells is driven by aberrant enhancer and gene transcription activity. Here, we use transient transcriptome sequencing (TT-seq) to map thousands of transcriptionally active putative enhancers in fourteen human cancer cell lines covering seven types of cancer. These enhancers were associated with cell type-specific gene expression, enriched for genetic variants that predispose to cancer, and included functionally verified enhancers. Enhancer-promoter (E-P) pairing by correlation of transcription activity revealed ~ 40,000 putative E-P pairs, which were depleted for housekeeping genes and enriched for transcription factors, cancer-associated genes, and 3D conformational proximity. The cell type specificity and transcription activity of target genes increased with the number of paired putative enhancers. Our results represent a rich resource for future studies of gene regulation by enhancers and their role in driving cancerous cell growth.
Collapse
Affiliation(s)
- Katja Lidschreiber
- Department of Molecular BiologyMax Planck Institute for Biophysical ChemistryGöttingenGermany
- Department of Biosciences and NutritionKarolinska InstitutetNEOHuddingeSweden
| | - Lisa A Jung
- Department of Biosciences and NutritionKarolinska InstitutetNEOHuddingeSweden
- Department of Cell and Molecular BiologyKarolinska InstitutetBiomedicumSolnaSweden
| | - Henrik von der Emde
- Department of Molecular BiologyMax Planck Institute for Biophysical ChemistryGöttingenGermany
| | - Kashyap Dave
- Department of Medical Biochemistry and BiophysicsKarolinska InstitutetBiomedicumSolnaSweden
| | - Jussi Taipale
- Department of Medical Biochemistry and BiophysicsKarolinska InstitutetBiomedicumSolnaSweden
- Department of BiochemistryUniversity of CambridgeCambridgeUK
- Genome‐Scale Biology ProgramUniversity of HelsinkiHelsinkiFinland
| | - Patrick Cramer
- Department of Molecular BiologyMax Planck Institute for Biophysical ChemistryGöttingenGermany
- Department of Biosciences and NutritionKarolinska InstitutetNEOHuddingeSweden
| | - Michael Lidschreiber
- Department of Molecular BiologyMax Planck Institute for Biophysical ChemistryGöttingenGermany
- Department of Biosciences and NutritionKarolinska InstitutetNEOHuddingeSweden
| |
Collapse
|
25
|
Noncoding RNAs Set the Stage for RNA Polymerase II Transcription. Trends Genet 2020; 37:279-291. [PMID: 33046273 DOI: 10.1016/j.tig.2020.09.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Revised: 09/11/2020] [Accepted: 09/14/2020] [Indexed: 12/24/2022]
Abstract
Effective synthesis of mammalian messenger (m)RNAs depends on many factors that together direct RNA polymerase II (pol II) through the different stages of the transcription cycle and ensure efficient cotranscriptional processing of mRNAs. In addition to the many proteins involved in transcription initiation, elongation, and termination, several noncoding (nc)RNAs also function as global transcriptional regulators. Understanding the mode of action of these non-protein regulators has been an intense area of research in recent years. Here, we describe how these ncRNAs influence key regulatory steps of the transcription process, to affect large numbers of genes. Through direct association with pol II or by modulating the activity of transcription or RNA processing factors, these regulatory RNAs perform critical roles in gene expression.
Collapse
|
26
|
Xu H, Zhang S, Yi X, Plewczynski D, Li MJ. Exploring 3D chromatin contacts in gene regulation: The evolution of approaches for the identification of functional enhancer-promoter interaction. Comput Struct Biotechnol J 2020; 18:558-570. [PMID: 32226593 PMCID: PMC7090358 DOI: 10.1016/j.csbj.2020.02.013] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 02/21/2020] [Accepted: 02/22/2020] [Indexed: 12/12/2022] Open
Abstract
Mechanisms underlying gene regulation are key to understand how multicellular organisms with various cell types develop from the same genetic blueprint. Dynamic interactions between enhancers and genes are revealed to play central roles in controlling gene transcription, but the determinants to link functional enhancer-promoter pairs remain elusive. A major challenge is the lack of reliable approach to detect and verify functional enhancer-promoter interactions (EPIs). In this review, we summarized the current methods for detecting EPIs and described how developing techniques facilitate the identification of EPI through assessing the merits and drawbacks of these methods. We also reviewed recent state-of-art EPI prediction methods in terms of their rationale, data usage and characterization. Furthermore, we briefly discussed the evolved strategies for validating functional EPIs.
Collapse
Affiliation(s)
- Hang Xu
- 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Tianjin Medical University, Tianjin, China
| | - Shijie Zhang
- 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Tianjin Medical University, Tianjin, China
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Xianfu Yi
- School of Biomedical Engineering, Tianjin Medical University, Tianjin, China
| | - Dariusz Plewczynski
- Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland
- Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa 75, 00-662 Warsaw, Poland
| | - Mulin Jun Li
- 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Tianjin Medical University, Tianjin, China
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| |
Collapse
|