1
|
Mao F, Luo L, Ma N, Qu Q, Chen H, Yi C, Cao M, Shao E, Lin H, Lin Z, Zhu F, Lu G, Lin D. A Spatiotemporal Transcriptome Reveals Stalk Development in Pearl Millet. Int J Mol Sci 2024; 25:9798. [PMID: 39337286 PMCID: PMC11432187 DOI: 10.3390/ijms25189798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 09/05/2024] [Accepted: 09/09/2024] [Indexed: 09/30/2024] Open
Abstract
Pearl millet is a major cereal crop that feeds more than 90 million people worldwide in arid and semi-arid regions. The stalk phenotypes of Poaceous grasses are critical for their productivity and stress tolerance; however, the molecular mechanisms governing stalk development in pearl millet remain to be deciphered. In this study, we spatiotemporally measured 19 transcriptomes for stalk internodes of four different early developmental stages. Data analysis of the transcriptomes defined four developmental zones on the stalks and identified 12 specific gene sets with specific expression patterns across the zones. Using weighted gene co-expression network analysis (WGCNA), we found that two co-expression modules together with candidate genes were involved in stalk elongation and the thickening of pearl millet. Among the elongation-related candidate genes, we established by SELEX that a MYB-family transcription factor PMF7G02448 can bind to the promoters of three cell wall synthases genes (CesAs). In summary, these findings provide insights into stalk development and offer potential targets for future genetic improvement in pearl millet.
Collapse
Affiliation(s)
- Fei Mao
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Lin Luo
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Nana Ma
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Qi Qu
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Hao Chen
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Chao Yi
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Mengxue Cao
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Ensi Shao
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Hui Lin
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Zhanxi Lin
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Fangjie Zhu
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Guodong Lu
- Key Laboratory of Bio-Pesticides and Chemical Biology, Ministry of Education, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Dongmei Lin
- National Engineering Research Center of JUNCAO, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Haixia Institute of Science and Technology and College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| |
Collapse
|
2
|
Li RZ, Han CZ, Glass CK. TIANA: transcription factors cooperativity inference analysis with neural attention. BMC Bioinformatics 2024; 25:274. [PMID: 39174927 PMCID: PMC11342676 DOI: 10.1186/s12859-024-05852-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 07/01/2024] [Indexed: 08/24/2024] Open
Abstract
BACKGROUND Growing evidence suggests that distal regulatory elements are essential for cellular function and states. The sequences within these distal elements, especially motifs for transcription factor binding, provide critical information about the underlying regulatory programs. However, cooperativities between transcription factors that recognize these motifs are nonlinear and multiplexed, rendering traditional modeling methods insufficient to capture the underlying mechanisms. Recent development of attention mechanism, which exhibit superior performance in capturing dependencies across input sequences, makes them well-suited to uncover and decipher intricate dependencies between regulatory elements. RESULT We present Transcription factors cooperativity Inference Analysis with Neural Attention (TIANA), a deep learning framework that focuses on interpretability. In this study, we demonstrated that TIANA could discover biologically relevant insights into co-occurring pairs of transcription factor motifs. Compared with existing tools, TIANA showed superior interpretability and robust performance in identifying putative transcription factor cooperativities from co-occurring motifs. CONCLUSION Our results suggest that TIANA can be an effective tool to decipher transcription factor cooperativities from distal sequence data. TIANA can be accessed through: https://github.com/rzzli/TIANA .
Collapse
Affiliation(s)
- Rick Z Li
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Claudia Z Han
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Christopher K Glass
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
3
|
Loupe JM, Anderson AG, Rizzardi LF, Rodriguez-Nunez I, Moyers B, Trausch-Lowther K, Jain R, Bunney WE, Bunney BG, Cartagena P, Sequeira A, Watson SJ, Akil H, Cooper GM, Myers RM. Multiomic profiling of transcription factor binding and function in human brain. Nat Neurosci 2024; 27:1387-1399. [PMID: 38831039 DOI: 10.1038/s41593-024-01658-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 04/19/2024] [Indexed: 06/05/2024]
Abstract
Transcription factors (TFs) orchestrate gene expression programs crucial for brain function, but we lack detailed information about TF binding in human brain tissue. We generated a multiomic resource (ChIP-seq, ATAC-seq, RNA-seq, DNA methylation) on bulk tissues and sorted nuclei from several postmortem brain regions, including binding maps for more than 100 TFs. We demonstrate improved measurements of TF activity, including motif recognition and gene expression modeling, upon identification and removal of high TF occupancy regions. Further, predictive TF binding models demonstrate a bias for these high-occupancy sites. Neuronal TFs SATB2 and TBR1 bind unique regions depleted for such sites and promote neuronal gene expression. Binding sites for TFs, including TBR1 and PKNOX1, are enriched for risk variants associated with neuropsychiatric disorders, predominantly in neurons. This work, titled BrainTF, is a powerful resource for future studies seeking to understand the roles of specific TFs in regulating gene expression in the human brain.
Collapse
Affiliation(s)
- Jacob M Loupe
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | | | - Lindsay F Rizzardi
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
- Department of Biochemistry and Molecular Biology, The University of Alabama in Birmingham, Birmingham, AL, USA
| | | | - Belle Moyers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | | | - Rashmi Jain
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - William E Bunney
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA, USA
| | - Blynn G Bunney
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA, USA
| | - Preston Cartagena
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA, USA
| | - Adolfo Sequeira
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA, USA
| | - Stanley J Watson
- The Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA
| | - Huda Akil
- The Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA
| | | | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.
| |
Collapse
|
4
|
Vaknin I, Willinger O, Mandl J, Heuberger H, Ben-Ami D, Zeng Y, Goldberg S, Orenstein Y, Amit R. A universal system for boosting gene expression in eukaryotic cell-lines. Nat Commun 2024; 15:2394. [PMID: 38493141 PMCID: PMC10944472 DOI: 10.1038/s41467-024-46573-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 03/04/2024] [Indexed: 03/18/2024] Open
Abstract
We demonstrate a transcriptional regulatory design algorithm that can boost expression in yeast and mammalian cell lines. The system consists of a simplified transcriptional architecture composed of a minimal core promoter and a synthetic upstream regulatory region (sURS) composed of up to three motifs selected from a list of 41 motifs conserved in the eukaryotic lineage. The sURS system was first characterized using an oligo-library containing 189,990 variants. We validate the resultant expression model using a set of 43 unseen sURS designs. The validation sURS experiments indicate that a generic set of grammar rules for boosting and attenuation may exist in yeast cells. Finally, we demonstrate that this generic set of grammar rules functions similarly in mammalian CHO-K1 and HeLa cells. Consequently, our work provides a design algorithm for boosting the expression of promoters used for expressing industrially relevant proteins in yeast and mammalian cell lines.
Collapse
Affiliation(s)
- Inbal Vaknin
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Or Willinger
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Jonathan Mandl
- Department of Computer Science, Bar-Ilan University, Ramat Gan, Israel
| | - Hadar Heuberger
- School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Dan Ben-Ami
- School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Yi Zeng
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Sarah Goldberg
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Yaron Orenstein
- Department of Computer Science, Bar-Ilan University, Ramat Gan, Israel
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel
| | - Roee Amit
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel.
- The Russell Berrie Nanotechnology Institute, Technion, Haifa, Israel.
| |
Collapse
|
5
|
de Boer CG, Taipale J. Hold out the genome: a roadmap to solving the cis-regulatory code. Nature 2024; 625:41-50. [PMID: 38093018 DOI: 10.1038/s41586-023-06661-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 09/20/2023] [Indexed: 01/05/2024]
Abstract
Gene expression is regulated by transcription factors that work together to read cis-regulatory DNA sequences. The 'cis-regulatory code' - how cells interpret DNA sequences to determine when, where and how much genes should be expressed - has proven to be exceedingly complex. Recently, advances in the scale and resolution of functional genomics assays and machine learning have enabled substantial progress towards deciphering this code. However, the cis-regulatory code will probably never be solved if models are trained only on genomic sequences; regions of homology can easily lead to overestimation of predictive performance, and our genome is too short and has insufficient sequence diversity to learn all relevant parameters. Fortunately, randomly synthesized DNA sequences enable testing a far larger sequence space than exists in our genomes, and designed DNA sequences enable targeted queries to maximally improve the models. As the same biochemical principles are used to interpret DNA regardless of its source, models trained on these synthetic data can predict genomic activity, often better than genome-trained models. Here we provide an outlook on the field, and propose a roadmap towards solving the cis-regulatory code by a combination of machine learning and massively parallel assays using synthetic DNA.
Collapse
Affiliation(s)
- Carl G de Boer
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada.
| | - Jussi Taipale
- Applied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden.
- Department of Biochemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
6
|
Wen C, Yuan Z, Zhang X, Chen H, Luo L, Li W, Li T, Ma N, Mao F, Lin D, Lin Z, Lin C, Xu T, Lü P, Lin J, Zhu F. Sea-ATI unravels novel vocabularies of plant active cistrome. Nucleic Acids Res 2023; 51:11568-11583. [PMID: 37850650 PMCID: PMC10681729 DOI: 10.1093/nar/gkad853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 08/11/2023] [Accepted: 09/25/2023] [Indexed: 10/19/2023] Open
Abstract
The cistrome consists of all cis-acting regulatory elements recognized by transcription factors (TFs). However, only a portion of the cistrome is active for TF binding in a specific tissue. Resolving the active cistrome in plants remains challenging. In this study, we report the assay sequential extraction assisted-active TF identification (sea-ATI), a low-input method that profiles the DNA sequences recognized by TFs in a target tissue. We applied sea-ATI to seven plant tissues to survey their active cistrome and generated 41 motif models, including 15 new models that represent previously unidentified cis-regulatory vocabularies. ATAC-seq and RNA-seq analyses confirmed the functionality of the cis-elements from the new models, in that they are actively bound in vivo, located near the transcription start site, and influence chromatin accessibility and transcription. Furthermore, comparing dimeric WRKY CREs between sea-ATI and DAP-seq libraries revealed that thermodynamics and genetic drifts cooperatively shaped their evolution. Notably, sea-ATI can identify not only positive but also negative regulatory cis-elements, thereby providing unique insights into the functional non-coding genome of plants.
Collapse
Affiliation(s)
- Chenjin Wen
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Zhen Yuan
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Xiaotian Zhang
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Hao Chen
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Lin Luo
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Wanying Li
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Tian Li
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Nana Ma
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Fei Mao
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Dongmei Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Zhanxi Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Chentao Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Tongda Xu
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Peitao Lü
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Juncheng Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Fangjie Zhu
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| |
Collapse
|
7
|
Novakovsky G, Fornes O, Saraswat M, Mostafavi S, Wasserman WW. ExplaiNN: interpretable and transparent neural networks for genomics. Genome Biol 2023; 24:154. [PMID: 37370113 DOI: 10.1186/s13059-023-02985-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Accepted: 06/12/2023] [Indexed: 06/29/2023] Open
Abstract
Deep learning models such as convolutional neural networks (CNNs) excel in genomic tasks but lack interpretability. We introduce ExplaiNN, which combines the expressiveness of CNNs with the interpretability of linear models. ExplaiNN can predict TF binding, chromatin accessibility, and de novo motifs, achieving performance comparable to state-of-the-art methods. Its predictions are transparent, providing global (cell state level) as well as local (individual sequence level) biological insights into the data. ExplaiNN can serve as a plug-and-play platform for pretrained models and annotated position weight matrices. ExplaiNN aims to accelerate the adoption of deep learning in genomic sequence analysis by domain experts.
Collapse
Affiliation(s)
- Gherman Novakovsky
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada
| | - Oriol Fornes
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada
| | - Manu Saraswat
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Sara Mostafavi
- Paul G. Allen School of Computer Science and Engineering, University of Washington (UW), Seattle, USA
| | - Wyeth W Wasserman
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
8
|
Yuan Y, Wang C, Zhuang X, Lin S, Luo M, Deng W, Zhou J, Liu L, Mao L, Peng W, Chen J, Wang Q, Shu Y, Xue Y, Huang P. PIM1 promotes hepatic conversion by suppressing reprogramming-induced ferroptosis and cell cycle arrest. Nat Commun 2022; 13:5237. [PMID: 36068222 PMCID: PMC9448736 DOI: 10.1038/s41467-022-32976-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 08/25/2022] [Indexed: 11/08/2022] Open
Abstract
Protein kinase-mediated phosphorylation plays a critical role in many biological processes. However, the identification of key regulatory kinases is still a great challenge. Here, we develop a trans-omics-based method, central kinase inference, to predict potentially key kinases by integrating quantitative transcriptomic and phosphoproteomic data. Using known kinases associated with anti-cancer drug resistance, the accuracy of our method denoted by the area under the curve is 5.2% to 29.5% higher than Kinase-Substrate Enrichment Analysis. We further use this method to analyze trans-omic data in hepatocyte maturation and hepatic reprogramming of human dermal fibroblasts, uncovering 5 kinases as regulators in the two processes. Further experiments reveal that a serine/threonine kinase, PIM1, promotes hepatic conversion and protects human dermal fibroblasts from reprogramming-induced ferroptosis and cell cycle arrest. This study not only reveals new regulatory kinases, but also provides a helpful method that might be extended to predict central kinases involved in other biological processes.
Collapse
Affiliation(s)
- Yangyang Yuan
- Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300192, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
- Centre for Translational Stem Cell Biology Limited, Hong Kong, 999077, China
| | - Chenwei Wang
- MOE Key Laboratory of Molecular Biophysics, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Center for Artificial Intelligence Biology, Institute of Artificial Intelligence, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China
| | - Xuran Zhuang
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
| | - Shaofeng Lin
- MOE Key Laboratory of Molecular Biophysics, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Center for Artificial Intelligence Biology, Institute of Artificial Intelligence, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China
| | - Miaomiao Luo
- Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300192, China
| | - Wankun Deng
- MOE Key Laboratory of Molecular Biophysics, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Center for Artificial Intelligence Biology, Institute of Artificial Intelligence, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China
| | - Jiaqi Zhou
- MOE Key Laboratory of Molecular Biophysics, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Center for Artificial Intelligence Biology, Institute of Artificial Intelligence, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China
| | - Lihui Liu
- Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300192, China
| | - Lina Mao
- Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300192, China
| | - Wenbo Peng
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
| | - Jian Chen
- ENT institute and Department of Otorhinolaryngology, Eye & ENT Hospital, State Key Laboratory of Medical Neurobiology and Institutes of Biomedical Sciences, Fudan University, Shanghai, 200031, China
- NHC Key Laboratory of Hearing Medicine, Fudan University, Shanghai, 200031, China
| | - Qiangsong Wang
- Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300192, China
| | - Yilai Shu
- ENT institute and Department of Otorhinolaryngology, Eye & ENT Hospital, State Key Laboratory of Medical Neurobiology and Institutes of Biomedical Sciences, Fudan University, Shanghai, 200031, China.
- NHC Key Laboratory of Hearing Medicine, Fudan University, Shanghai, 200031, China.
| | - Yu Xue
- MOE Key Laboratory of Molecular Biophysics, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Center for Artificial Intelligence Biology, Institute of Artificial Intelligence, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China.
- Nanjing University Institute of Artificial Intelligence Biomedicine, Nanjing, Jiangsu, 210031, China.
| | - Pengyu Huang
- Institute of Biomedical Engineering, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300192, China.
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China.
| |
Collapse
|
9
|
Yang MG, Ling E, Cowley CJ, Greenberg ME, Vierbuchen T. Characterization of sequence determinants of enhancer function using natural genetic variation. eLife 2022; 11:76500. [PMID: 36043696 PMCID: PMC9662815 DOI: 10.7554/elife.76500] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 08/30/2022] [Indexed: 02/04/2023] Open
Abstract
Sequence variation in enhancers that control cell-type-specific gene transcription contributes significantly to phenotypic variation within human populations. However, it remains difficult to predict precisely the effect of any given sequence variant on enhancer function due to the complexity of DNA sequence motifs that determine transcription factor (TF) binding to enhancers in their native genomic context. Using F1-hybrid cells derived from crosses between distantly related inbred strains of mice, we identified thousands of enhancers with allele-specific TF binding and/or activity. We find that genetic variants located within the central region of enhancers are most likely to alter TF binding and enhancer activity. We observe that the AP-1 family of TFs (Fos/Jun) are frequently required for binding of TEAD TFs and for enhancer function. However, many sequence variants outside of core motifs for AP-1 and TEAD also impact enhancer function, including sequences flanking core TF motifs and AP-1 half sites. Taken together, these data represent one of the most comprehensive assessments of allele-specific TF binding and enhancer function to date and reveal how sequence changes at enhancers alter their function across evolutionary timescales.
Collapse
Affiliation(s)
- Marty G Yang
- Department of Neurobiology, Harvard Medical School, Boston, United States.,Program in Neuroscience, Harvard Medical School, Boston, United States
| | - Emi Ling
- Department of Neurobiology, Harvard Medical School, Boston, United States
| | | | | | - Thomas Vierbuchen
- Developmental Biology Program, Sloan Kettering Institute for Cancer Research, New York, United States.,Center for Stem Cell Biology, Sloan Kettering Institute for Cancer Research, New York, United States
| |
Collapse
|
10
|
WhichTF is functionally important in your open chromatin data? PLoS Comput Biol 2022; 18:e1010378. [PMID: 36040971 PMCID: PMC9426921 DOI: 10.1371/journal.pcbi.1010378] [Citation(s) in RCA: 48] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 07/11/2022] [Indexed: 11/19/2022] Open
Abstract
We present WhichTF, a computational method to identify functionally important transcription factors (TFs) from chromatin accessibility measurements. To rank TFs, WhichTF applies an ontology-guided functional approach to compute novel enrichment by integrating accessibility measurements, high-confidence pre-computed conservation-aware TF binding sites, and putative gene-regulatory models. Comparison with prior sheer abundance-based methods reveals the unique ability of WhichTF to identify context-specific TFs with functional relevance, including NF-κB family members in lymphocytes and GATA factors in cardiac cells. To distinguish the transcriptional regulatory landscape in closely related samples, we apply differential analysis and demonstrate its utility in lymphocyte, mesoderm developmental, and disease cells. We find suggestive, under-characterized TFs, such as RUNX3 in mesoderm development and GLI1 in systemic lupus erythematosus. We also find TFs known for stress response, suggesting routine experimental caveats that warrant careful consideration. WhichTF yields biological insight into known and novel molecular mechanisms of TF-mediated transcriptional regulation in diverse contexts, including human and mouse cell types, cell fate trajectories, and disease-associated cells. Transcription factors (TFs), a class of DNA binding proteins, regulate tissue- and cell-type-specific expression of genes. Identifying the critical TFs in a given cellular context leads to investigating molecular regulatory mechanisms in development, differentiation, and disease. Because there are more than 1,500 human TFs, experimental measurements of genome-wide occupancy across all TFs have been challenging. While computational approaches play pivotal roles, most existing methods rely on statistical enrichment, focusing either on sequence motif similarity recognized by TFs or the similarity of the genomic region of interest with the previously characterized TF occupancy profile. Here we propose WhichTF as an alternative, incorporating curated biomedical knowledge from ontology and integrating it with the high-confidence prediction of conserved TF binding sites in user-provided genomic regions of interest. We develop a new WhichTF score to rank TFs and demonstrate its applicability across human and mouse cell types, cellular differentiation trajectories, and disease-associated cells.
Collapse
|
11
|
Abstract
DNA can determine where and when genes are expressed, but the full set of sequence determinants that control gene expression is unknown. Here, we measured the transcriptional activity of DNA sequences that represent an ~100 times larger sequence space than the human genome using massively parallel reporter assays (MPRAs). Machine learning models revealed that transcription factors (TFs) generally act in an additive manner with weak grammar and that most enhancers increase expression from a promoter by a mechanism that does not appear to involve specific TF–TF interactions. The enhancers themselves can be classified into three types: classical, closed chromatin and chromatin dependent. We also show that few TFs are strongly active in a cell, with most activities being similar between cell types. Individual TFs can have multiple gene regulatory activities, including chromatin opening and enhancing, promoting and determining transcription start site (TSS) activity, consistent with the view that the TF binding motif is the key atomic unit of gene expression. Analysis of massively parallel reporter assays measuring the transcriptional activity of DNA sequences indicates that most transcription factor (TF) activity is additive and does not rely on specific TF–TF interactions. Individual TFs can have different gene regulatory activities.
Collapse
|
12
|
Bray D, Hook H, Zhao R, Keenan JL, Penvose A, Osayame Y, Mohaghegh N, Chen X, Parameswaran S, Kottyan LC, Weirauch MT, Siggers T. CASCADE: high-throughput characterization of regulatory complex binding altered by non-coding variants. CELL GENOMICS 2022; 2. [PMID: 35252945 PMCID: PMC8896503 DOI: 10.1016/j.xgen.2022.100098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Non-coding DNA variants (NCVs) impact gene expression by altering binding sites for regulatory complexes. New high-throughput methods are needed to characterize the impact of NCVs on regulatory complexes. We developed CASCADE (Customizable Approach to Survey Complex Assembly at DNA Elements), an array-based high-throughput method to profile cofactor (COF) recruitment. CASCADE identifies DNA-bound transcription factor-cofactor (TF-COF) complexes in nuclear extracts and quantifies the impact of NCVs on their binding. We demonstrate CASCADE sensitivity in characterizing condition-specific recruitment of COFs p300 and RBBP5 (MLL subunit) to the CXCL10 promoter in lipopolysaccharide (LPS)-stimulated human macrophages and quantify the impact of all possible NCVs. To demonstrate applicability to NCV screens, we profile TF-COF binding to ~1,700 single-nucleotide polymorphism quantitative trait loci (SNP-QTLs) in human macrophages and identify perturbed ETS domain-containing complexes. CASCADE will facilitate high-throughput testing of molecular mechanisms of NCVs for diverse biological applications. Bray et al. develop CASCADE, a method to profile transcription factor (TF)-cofactor (COF) complexes binding to DNA. They demonstrate the approach by profiling complex binding across the CXCL10 cytokine promoter and to ~1,700 single-nucleotide polymorphisms (SNPs). They anticipate that CASCADE can be applied to diverse biological systems to examine regulatory complex binding to DNA.
Collapse
Affiliation(s)
- David Bray
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
- Bioinformatics Program, Boston University, Boston, MA, USA
| | - Heather Hook
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
| | - Rose Zhao
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
| | - Jessica L. Keenan
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
- Bioinformatics Program, Boston University, Boston, MA, USA
| | - Ashley Penvose
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
| | - Yemi Osayame
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
| | - Nima Mohaghegh
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
| | - Xiaoting Chen
- Center for Autoimmune Genomics and Etiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Sreeja Parameswaran
- Center for Autoimmune Genomics and Etiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Leah C. Kottyan
- Center for Autoimmune Genomics and Etiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
- Division of Allergy and Immunology, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio 45229, USA
| | - Matthew T. Weirauch
- Center for Autoimmune Genomics and Etiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Division of Developmental Biology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| | - Trevor Siggers
- Department of Biology, Boston University, Boston, MA, USA
- Biological Design Center, Boston University, Boston, MA, USA
- Corresponding author
| |
Collapse
|
13
|
Lehtiö J, Arslan T, Siavelis I, Pan Y, Socciarelli F, Berkovska O, Umer HM, Mermelekas G, Pirmoradian M, Jönsson M, Brunnström H, Brustugun OT, Purohit KP, Cunningham R, Asl HF, Isaksson S, Arbajian E, Aine M, Karlsson A, Kotevska M, Hansen CG, Haakensen VD, Helland Å, Tamborero D, Johansson HJ, Branca RM, Planck M, Staaf J, Orre LM. Proteogenomics of non-small cell lung cancer reveals molecular subtypes associated with specific therapeutic targets and immune evasion mechanisms. NATURE CANCER 2021; 2:1224-1242. [PMID: 34870237 PMCID: PMC7612062 DOI: 10.1038/s43018-021-00259-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Despite major advancements in lung cancer treatment, long-term survival is still rare, and a deeper understanding of molecular phenotypes would allow the identification of specific cancer dependencies and immune evasion mechanisms. Here we performed in-depth mass spectrometry (MS)-based proteogenomic analysis of 141 tumors representing all major histologies of non-small cell lung cancer (NSCLC). We identified six distinct proteome subtypes with striking differences in immune cell composition and subtype-specific expression of immune checkpoints. Unexpectedly, high neoantigen burden was linked to global hypomethylation and complex neoantigens mapped to genomic regions, such as endogenous retroviral elements and introns, in immune-cold subtypes. Further, we linked immune evasion with LAG3 via STK11 mutation-dependent HNF1A activation and FGL1 expression. Finally, we develop a data-independent acquisition MS-based NSCLC subtype classification method, validate it in an independent cohort of 208 NSCLC cases and demonstrate its clinical utility by analyzing an additional cohort of 84 late-stage NSCLC biopsy samples.
Collapse
Affiliation(s)
- Janne Lehtiö
- Department of Oncology and Pathology, Karolinska Institutet, SciLifeLab, Solna, Sweden.
| | - Taner Arslan
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Ioannis Siavelis
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Yanbo Pan
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Fabio Socciarelli
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Olena Berkovska
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Husen M. Umer
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Georgios Mermelekas
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Mohammad Pirmoradian
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Mats Jönsson
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden
| | - Hans Brunnström
- Department of Pathology, Laboratory Medicine Region Skåne, Lund, Sweden,Division of Pathology, Department of Clinical Sciences, Lund, Lund University, Lund, Sweden
| | - Odd Terje Brustugun
- Section of Oncology, Drammen Hospital, Vestre Viken Health Trust, Drammen, Norway,Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
| | - Krishna Pinganksha Purohit
- University of Edinburgh Centre for Inflammation Research, Institute for Regeneration and Repair, Queen’s Medical Research Institute, Edinburgh bioQuarter, 47 Little France Crescent, Edinburgh EH16 4TJ, UK,MRC Centre for Regenerative Medicine, Institute for Regeneration and Repair, University of Edinburgh, Edinburgh bioQuarter, 5 Little France Drive, Edinburgh EH16 4UU, UK
| | - Richard Cunningham
- University of Edinburgh Centre for Inflammation Research, Institute for Regeneration and Repair, Queen’s Medical Research Institute, Edinburgh bioQuarter, 47 Little France Crescent, Edinburgh EH16 4TJ, UK,MRC Centre for Regenerative Medicine, Institute for Regeneration and Repair, University of Edinburgh, Edinburgh bioQuarter, 5 Little France Drive, Edinburgh EH16 4UU, UK
| | - Hassan Foroughi Asl
- Genomic Medicine Center, Karolinska University Hospital, Stockholm, Sweden. Clinical Genomics Facility, Department of Microbiology, Tumour and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | - Sofi Isaksson
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden
| | - Elsa Arbajian
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden
| | - Mattias Aine
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden
| | - Anna Karlsson
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden
| | - Marija Kotevska
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden,Department of Respiratory Medicine and Allergology, Skåne University Hospital, Lund, Sweden
| | - Carsten Gram Hansen
- University of Edinburgh Centre for Inflammation Research, Institute for Regeneration and Repair, Queen’s Medical Research Institute, Edinburgh bioQuarter, 47 Little France Crescent, Edinburgh EH16 4TJ, UK,MRC Centre for Regenerative Medicine, Institute for Regeneration and Repair, University of Edinburgh, Edinburgh bioQuarter, 5 Little France Drive, Edinburgh EH16 4UU, UK
| | - Vilde Drageset Haakensen
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway,Department of Oncology, Oslo University Hospital, Oslo, Norway
| | - Åslaug Helland
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway,Department of Oncology, Oslo University Hospital, Oslo, Norway,Faculty of Medicine, University of Oslo, Norway
| | - David Tamborero
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Henrik J. Johansson
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Rui M. Branca
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| | - Maria Planck
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden,Department of Respiratory Medicine and Allergology, Skåne University Hospital, Lund, Sweden
| | - Johan Staaf
- Division of Oncology, Department of Clinical Sciences, Lund and CREATE Health Strategic Center for Translational Cancer Research, Lund University, Lund, Sweden
| | - Lukas M. Orre
- Department of Oncology and Pathology, Karolinska Institutet, Science for Life Laboratory, Solna, SE-17165, Sweden
| |
Collapse
|
14
|
Morgunova E, Taipale J. Structural insights into the interaction between transcription factors and the nucleosome. Curr Opin Struct Biol 2021; 71:171-179. [PMID: 34364091 DOI: 10.1016/j.sbi.2021.06.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 06/21/2021] [Accepted: 06/28/2021] [Indexed: 01/26/2023]
Abstract
In eukaryotic cells, DNA interacts with two main types of binding proteins: transcription factors and histones. Histones form the core of nucleosomes and display weak sequence preference owing to differences in bendability of different DNA sequences. By contrast, the affinity of transcription factors is highly dependent on DNA sequence - all sequences are bound with moderate affinity, but only few specific sequences are bound more tightly via molecular recognition of the DNA bases. Transcription factors can interact with nucleosomes directly by recognizing nucleosome-associated DNA and also indirectly by recruiting histone-modifying enzymes and nucleosome remodelers. These interactions result in sequence-dependent formation of a pattern of open and closed chromatin, where specific positions are occupied by transcription factors, histone-modifying enzymes, and modified histones. These patterns are then recognized by large DNA-associated macromolecular complexes such as cohesin and RNA polymerase II, which are involved in regulation of higher-order chromatin structure and transcription, respectively. Here, we review recent work that has provided structural and mechanistic insight into the interactions between all these classes of DNA-associated proteins.
Collapse
Affiliation(s)
- Ekaterina Morgunova
- Karolinska Institute, Department of Medical Biochemistry and Biophysics, Stockholm, Sweden
| | | |
Collapse
|
15
|
Koo PK, Ploenzke M. Improving representations of genomic sequence motifs in convolutional networks with exponential activations. NAT MACH INTELL 2021; 3:258-266. [PMID: 34322657 DOI: 10.1038/s42256-020-00291-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to build representations in a distributed manner, making it a challenge to extract learned features that are biologically meaningful, such as sequence motifs. Here we perform a comprehensive analysis on synthetic sequences to investigate the role that CNN activations have on model interpretability. We show that employing an exponential activation to first layer filters consistently leads to interpretable and robust representations of motifs compared to other commonly used activations. Strikingly, we demonstrate that CNNs with better test performance do not necessarily imply more interpretable representations with attribution methods. We find that CNNs with exponential activations significantly improve the efficacy of recovering biologically meaningful representations with attribution methods. We demonstrate these results generalise to real DNA sequences across several in vivo datasets. Together, this work demonstrates how a small modification to existing CNNs, i.e. setting exponential activations in the first layer, can significantly improve the robustness and interpretabilty of learned representations directly in convolutional filters and indirectly with attribution methods.
Collapse
Affiliation(s)
- Peter K Koo
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Matt Ploenzke
- Department of Biostatistics, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| |
Collapse
|
16
|
Partridge EC, Chhetri SB, Prokop JW, Ramaker RC, Jansen CS, Goh ST, Mackiewicz M, Newberry KM, Brandsmeier LA, Meadows SK, Messer CL, Hardigan AA, Coppola CJ, Dean EC, Jiang S, Savic D, Mortazavi A, Wold BJ, Myers RM, Mendenhall EM. Occupancy maps of 208 chromatin-associated proteins in one human cell type. Nature 2020; 583:720-728. [PMID: 32728244 PMCID: PMC7398277 DOI: 10.1038/s41586-020-2023-4] [Citation(s) in RCA: 73] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 01/09/2020] [Indexed: 01/02/2023]
Abstract
Transcription factors are DNA-binding proteins that have key roles in gene regulation1,2. Genome-wide occupancy maps of transcriptional regulators are important for understanding gene regulation and its effects on diverse biological processes3–6. However, only a minority of the more than 1,600 transcription factors encoded in the human genome has been assayed. Here we present, as part of the ENCODE (Encyclopedia of DNA Elements) project, data and analyses from chromatin immunoprecipitation followed by high-throughput sequencing (ChIP–seq) experiments using the human HepG2 cell line for 208 chromatin-associated proteins (CAPs). These comprise 171 transcription factors and 37 transcriptional cofactors and chromatin regulator proteins, and represent nearly one-quarter of CAPs expressed in HepG2 cells. The binding profiles of these CAPs form major groups associated predominantly with promoters or enhancers, or with both. We confirm and expand the current catalogue of DNA sequence motifs for transcription factors, and describe motifs that correspond to other transcription factors that are co-enriched with the primary ChIP target. For example, FOX family motifs are enriched in ChIP–seq peaks of 37 other CAPs. We show that motif content and occupancy patterns can distinguish between promoters and enhancers. This catalogue reveals high-occupancy target regions at which many CAPs associate, although each contains motifs for only a minority of the numerous associated transcription factors. These analyses provide a more complete overview of the gene regulatory networks that define this cell type, and demonstrate the usefulness of the large-scale production efforts of the ENCODE Consortium. ChIP–seq and CETCh–seq data are used to analyse binding maps for 208 transcription factors and other chromatin-associated proteins in a single human cell type, providing a comprehensive catalogue of the transcription factor landscape and gene regulatory networks in these cells.
Collapse
Affiliation(s)
| | - Surya B Chhetri
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Department of Biological Sciences, The University of Alabama in Huntsville, Huntsville, AL, USA.,Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MA, USA
| | - Jeremy W Prokop
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, Grand Rapids, MI, USA
| | - Ryne C Ramaker
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Camden S Jansen
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, CA, USA
| | - Say-Tar Goh
- Division of Biology, California Institute of Technology, Pasadena, CA, USA
| | - Mark Mackiewicz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | | | | | - Sarah K Meadows
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - C Luke Messer
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Andrew A Hardigan
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Department of Genetics, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Candice J Coppola
- Department of Biological Sciences, The University of Alabama in Huntsville, Huntsville, AL, USA
| | - Emma C Dean
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,Department of Pathology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Shan Jiang
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, CA, USA
| | - Daniel Savic
- Pharmaceutical Sciences Department, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, CA, USA
| | - Barbara J Wold
- Division of Biology, California Institute of Technology, Pasadena, CA, USA
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.
| | - Eric M Mendenhall
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA. .,Department of Biological Sciences, The University of Alabama in Huntsville, Huntsville, AL, USA.
| |
Collapse
|
17
|
Kim J, Lee Y, Lu X, Song B, Fong KW, Cao Q, Licht JD, Zhao JC, Yu J. Polycomb- and Methylation-Independent Roles of EZH2 as a Transcription Activator. Cell Rep 2019; 25:2808-2820.e4. [PMID: 30517868 PMCID: PMC6342284 DOI: 10.1016/j.celrep.2018.11.035] [Citation(s) in RCA: 193] [Impact Index Per Article: 38.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Revised: 10/15/2018] [Accepted: 11/07/2018] [Indexed: 12/22/2022] Open
Abstract
Enhancer of Zeste 2 (EZH2) is the enzymatic subunit of Polycomb
Repressive Complex 2 (PRC2), which catalyzes histone H3 lysine 27 trimethylation
(H3K27me3) at target promoters for gene silencing. Here, we report that EZH2
activates androgen receptor (AR) gene transcription through direct occupancy at
its promoter. Importantly, this activating role of EZH2 is independent of PRC2
and its methyltransferase activities. Genome-wide assays revealed extensive EZH2
occupancy at promoters marked by either H3K27ac or H3K27me3, leading to gene
activation or repression, respectively. Last, we demonstrate enhanced efficacy
of enzymatic EZH2 inhibitors when used in combination with AR antagonists in
blocking the dual roles of EZH2 and suppressing prostate cancer progression
in vitro and in vivo. Taken together, our
study reports EZH2 as a transcriptional activator, a key target of which is AR,
and suggests a drug-combinatory approach to treat advanced prostate cancer. Kim et al. report EZH2 as a transcriptional activator that directly
induces AR gene expression in a Polycomb- and methylation-independent manner,
providing a mechanism to escape enzymatic EZH2 inhibitors. Combination of
inhibitors with AR-targeted therapies showed a strong synergy in blocking the
EZH2 downstream pathways and suppressing prostate cancer progression.
Collapse
Affiliation(s)
- Jung Kim
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Yongik Lee
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Xiaodong Lu
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Bing Song
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Ka-Wing Fong
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Qi Cao
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Center for Inflammation and Epigenetics, Houston Methodist Research Institute, Houston, TX, USA
| | - Jonathan D Licht
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Division of Hematology and Oncology, University of Florida Health Cancer Center, Gainesville, FL 2033, USA
| | - Jonathan C Zhao
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
| | - Jindan Yu
- Division of Hematology and Oncology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, USA; Department of Biochemistry and Molecular Genetics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
| |
Collapse
|
18
|
SubCellBarCode: Proteome-wide Mapping of Protein Localization and Relocalization. Mol Cell 2019; 73:166-182.e7. [DOI: 10.1016/j.molcel.2018.11.035] [Citation(s) in RCA: 120] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2018] [Revised: 09/28/2018] [Accepted: 11/27/2018] [Indexed: 11/22/2022]
|
19
|
Yamada T, Akimitsu N. Contributions of regulated transcription and mRNA decay to the dynamics of gene expression. WILEY INTERDISCIPLINARY REVIEWS-RNA 2018; 10:e1508. [PMID: 30276972 DOI: 10.1002/wrna.1508] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2018] [Revised: 08/06/2018] [Accepted: 08/27/2018] [Indexed: 12/21/2022]
Abstract
Organisms have acquired sophisticated regulatory networks that control gene expression in response to cellular perturbations. Understanding of the mechanisms underlying the coordinated changes in gene expression in response to external and internal stimuli is a fundamental issue in biology. Recent advances in high-throughput technologies have enabled the measurement of diverse biological information, including gene expression levels, kinetics of gene expression, and interactions among gene expression regulatory molecules. By coupling these technologies with quantitative modeling, we can now uncover the biological roles and mechanisms of gene regulation at the system level. This review consists of two parts. First, we focus on the methods using uridine analogs that measure synthesis and decay rates of RNAs, which demonstrate how cells dynamically change the regulation of gene expression in response to both internal and external cues. Second, we discuss the underlying mechanisms of these changes in kinetics, including the functions of transcription factors and RNA-binding proteins. Overall, this review will help to clarify a system-level view of gene expression programs in cells. This article is categorized under: Regulatory RNAs/RNAi/Riboswitches > Regulatory RNAs RNA Turnover and Surveillance > Regulation of RNA Stability RNA Methods > RNA Analyses in vitro and In Silico.
Collapse
Affiliation(s)
- Toshimichi Yamada
- Department of Molecular and Cellular Biochemistry, Meiji Pharmaceutical University, Tokyo, Japan
| | | |
Collapse
|
20
|
|