1
|
Murthy S, Dey U, Olymon K, Abbas E, Yella VR, Kumar A. Discerning the Role of DNA Sequence, Shape, and Flexibility in Recognition by Drosophila Transcription Factors. ACS Chem Biol 2024; 19:1533-1543. [PMID: 38902964 DOI: 10.1021/acschembio.4c00202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2024]
Abstract
The precise spatial and temporal orchestration of gene expression is crucial for the ontogeny of an organism and is mainly governed by transcription factors (TFs). The mechanism of recognition of cognate sites amid millions of base pairs in the genome by TFs is still incompletely understood. In this study, we focus on DNA sequence composition, shape, and flexibility preferences of 28 quintessential TFs from Drosophila melanogaster that are critical to development and body patterning mechanisms. Our study finds that TFs exhibit distinct predilections for DNA shape, flexibility, and sequence compositions in the proximity of transcription factor binding sites (TFBSs). Notably, certain zinc finger proteins prefer GC-rich areas with less negative propeller twist, while homeodomains mainly seek AT-rich regions with a more negative propeller twist at their sites. Intriguingly, while numerous cofactors share similar binding site preferences and bind closer to each other in the genome, some cofactors that have different preferences bind farther apart. These findings shed light on TF DNA recognition and provide novel insights into possible cofactor binding and transcriptional regulation mechanisms.
Collapse
Affiliation(s)
- Smrithi Murthy
- Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India
| | - Upalabdha Dey
- Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India
| | - Kaushika Olymon
- Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India
| | - Eshan Abbas
- Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India
| | - Venkata Rajesh Yella
- Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Guntur 520002, India
| | - Aditya Kumar
- Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam 784028, India
| |
Collapse
|
2
|
Liang Y, Luan YX. The functional evolution of collembolan Ubx on the regulation of abdominal appendage formation. Dev Genes Evol 2024:10.1007/s00427-024-00718-0. [PMID: 38980376 DOI: 10.1007/s00427-024-00718-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Accepted: 06/18/2024] [Indexed: 07/10/2024]
Abstract
Folsomia candida is a tiny soil-living arthropod belonging to the Collembola, which is an outgroup to Insecta. It resembles insects as having a pair of antennae and three pairs of thorax legs, while it also possesses three abdominal appendages: a ventral tube located in the first abdominal segment (A1), a retinaculum in A3, and a furca in A4. Collembolan Ubx and AbdA specify abdominal appendages, but they are unable to repress appendage marker gene Dll. The genetic basis of collembolan appendage formation and the mechanisms by which Ubx and AbdA regulate Dll transcription and appendage development remains unknown. In this study, we analysed the developmental transcriptomes of F. candida and identified candidate appendage formation genes, including Ubx (FcUbx). The expression data revealed the dominance of Dll over Ubx during the embryonic 3.5 and 4.5 days, suggesting that Ubx is deficient in suppressing Dll at early appendage formation stages. Furthermore, via electrophoretic mobility shift assays and dual luciferase assays, we found that the binding and repression capacity of FcUbx on Drosophila Dll resembles those of the longest isoform of Drosophila Ubx (DmUbx_Ib), while the regulatory mechanism of the C-terminus of FcUbx on Dll repression is similar to that of the crustacean Artemia franciscana Ubx (AfUbx), demonstrating that the function of collembolan Ubx is intermediate between that of Insecta and Crustacea. In summary, our study provides novel insights into collembolan appendage formation and sheds light on the functional evolution of Ubx. Additionally, we propose a model that collembolan Ubx regulates abdominal segments in a context-specific manner.
Collapse
Affiliation(s)
- Yan Liang
- Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Yun-Xia Luan
- Key Laboratory of Insect Developmental and Evolutionary Biology, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China.
- Guangdong Provincial Key Laboratory of Insect Developmental Biology and Applied Technology, Guangzhou Key Laboratory of Insect Development Regulation and Application Research, Institute of Insect Science and Technology, School of Life Sciences, South China Normal University, Guangzhou, 510631, China.
| |
Collapse
|
3
|
Bonnell VA, Zhang Y, Brown AS, Horton J, Josling GA, Chiu TP, Rohs R, Mahony S, Gordân R, Llinás M. DNA sequence and chromatin differentiate sequence-specific transcription factor binding in the human malaria parasite Plasmodium falciparum. Nucleic Acids Res 2024:gkae585. [PMID: 38966997 DOI: 10.1093/nar/gkae585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 05/30/2024] [Accepted: 06/27/2024] [Indexed: 07/06/2024] Open
Abstract
Development of the malaria parasite, Plasmodium falciparum, is regulated by a limited number of sequence-specific transcription factors (TFs). However, the mechanisms by which these TFs recognize genome-wide binding sites is largely unknown. To address TF specificity, we investigated the binding of two TF subsets that either bind CACACA or GTGCAC DNA sequence motifs and further characterized two additional ApiAP2 TFs, PfAP2-G and PfAP2-EXP, which bind unique DNA motifs (GTAC and TGCATGCA). We also interrogated the impact of DNA sequence and chromatin context on P. falciparum TF binding by integrating high-throughput in vitro and in vivo binding assays, DNA shape predictions, epigenetic post-translational modifications, and chromatin accessibility. We found that DNA sequence context minimally impacts binding site selection for paralogous CACACA-binding TFs, while chromatin accessibility, epigenetic patterns, co-factor recruitment, and dimerization correlate with differential binding. In contrast, GTGCAC-binding TFs prefer different DNA sequence context in addition to chromatin dynamics. Finally, we determined that TFs that preferentially bind divergent DNA motifs may bind overlapping genomic regions due to low-affinity binding to other sequence motifs. Our results demonstrate that TF binding site selection relies on a combination of DNA sequence and chromatin features, thereby contributing to the complexity of P. falciparum gene regulatory mechanisms.
Collapse
Affiliation(s)
- Victoria A Bonnell
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Eukaryotic Gene Regulation, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Malaria Research, University of Southern California, Los Angeles, CA 90089, USA
| | - Yuning Zhang
- Center for Genomic and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Department of Biostatistics and Bioinformatics, University of Southern California, Los Angeles, CA 90089, USA
- Program in Computational Biology and Bioinformatics, University of Southern California, Los Angeles, CA 90089, USA
| | - Alan S Brown
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Eukaryotic Gene Regulation, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Malaria Research, University of Southern California, Los Angeles, CA 90089, USA
| | - John Horton
- Center for Genomic and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Department of Biostatistics and Bioinformatics, University of Southern California, Los Angeles, CA 90089, USA
| | - Gabrielle A Josling
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Eukaryotic Gene Regulation, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Malaria Research, University of Southern California, Los Angeles, CA 90089, USA
| | - Tsu-Pei Chiu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Remo Rohs
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Department of Chemistry, University of Southern California, Los Angeles, CA 90089, USA
- Department of Physics and Astronomy, University of Southern California, Los Angeles, CA 90089, USA
- Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| | - Shaun Mahony
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Eukaryotic Gene Regulation, University of Southern California, Los Angeles, CA 90089, USA
| | - Raluca Gordân
- Center for Genomic and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Department of Biostatistics and Bioinformatics, University of Southern California, Los Angeles, CA 90089, USA
- Department of Computer Science, Durham, NC 27708, USA
- Department of Molecular Genetics and Microbiology, Duke University, Durham, NC 27708, USA
| | - Manuel Llinás
- Department of Biochemistry and Molecular Biology, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Eukaryotic Gene Regulation, University of Southern California, Los Angeles, CA 90089, USA
- Huck Institutes Center for Malaria Research, University of Southern California, Los Angeles, CA 90089, USA
- Department of Chemistry, The Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
4
|
Saibo NV, Maiti S, Boral S, Banerjee P, Kushwaha T, Inampudi KK, Goswami R, De S. The intrinsically disordered transactivation region of HOXA9 regulates its function by auto-inhibition of its DNA-binding activity. Int J Biol Macromol 2024; 273:132704. [PMID: 38825283 DOI: 10.1016/j.ijbiomac.2024.132704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 02/19/2024] [Accepted: 05/26/2024] [Indexed: 06/04/2024]
Abstract
HOXA9 transcription factor is expressed in hematopoietic stem cells and is involved in the regulation of their differentiation and maturation to various blood cells. HOXA9 is linked to various leukemia and is a marker for poor prognosis of acute myeloid leukemia (AML). This protein has a conserved DNA-binding homeodomain and a transactivation domain. We show that this N-terminal transactivation domain is intrinsically disordered and inhibits DNA-binding by the homeodomain. Using NMR spectroscopy and molecular dynamics simulation, we show that the hexapeptide 197AANWLH202 in the disordered region transiently occludes the DNA-binding interface. The hexapeptide also forms a rigid segment, as determined by NMR dynamics, in an otherwise flexible disordered region. Interestingly, this hexapeptide is known to mediate the interaction of HOXA9 and its TALE partner proteins, such as PBX1, and help in cooperative DNA binding. Mutation of tryptophan to alanine in the hexapeptide abrogates the DNA-binding auto-inhibition. We propose that the disordered transactivation region plays a dual role in the regulation of HOXA9 function. In the absence of TALE partners, it inhibits DNA binding, and in the presence of TALE partners it interacts with the TALE protein and facilitates the cooperative DNA binding by the HOX-TALE complex.
Collapse
Affiliation(s)
- Nikita V Saibo
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
| | - Snigdha Maiti
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
| | - Soumendu Boral
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
| | - Puja Banerjee
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
| | - Tushar Kushwaha
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
| | - Krishna K Inampudi
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
| | - Ritobrata Goswami
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India
| | - Soumya De
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB 721302, India.
| |
Collapse
|
5
|
Maio KA, Moubayidin L. 'Organ'ising Floral Organ Development. PLANTS (BASEL, SWITZERLAND) 2024; 13:1595. [PMID: 38931027 PMCID: PMC11207604 DOI: 10.3390/plants13121595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 06/05/2024] [Accepted: 06/06/2024] [Indexed: 06/28/2024]
Abstract
Flowers are plant structures characteristic of the phylum Angiosperms composed of organs thought to have emerged from homologous structures to leaves in order to specialize in a distinctive function: reproduction. Symmetric shapes, colours, and scents all play important functional roles in flower biology. The evolution of flower symmetry and the morphology of individual flower parts (sepals, petals, stamens, and carpels) has significantly contributed to the diversity of reproductive strategies across flowering plant species. This diversity facilitates attractiveness for pollination, protection of gametes, efficient fertilization, and seed production. Symmetry, the establishment of body axes, and fate determination are tightly linked. The complex genetic networks underlying the establishment of organ, tissue, and cellular identity, as well as the growth regulators acting across the body axes, are steadily being elucidated in the field. In this review, we summarise the wealth of research already at our fingertips to begin weaving together how separate processes involved in specifying organ identity within the flower may interact, providing a functional perspective on how identity determination and axial regulation may be coordinated to inform symmetrical floral organ structures.
Collapse
Affiliation(s)
| | - Laila Moubayidin
- Department of Cell and Developmental Biology, John Innes Centre, Colney Lane, Norwich NR4 7UH, UK;
| |
Collapse
|
6
|
Zhuang J, Huang X, Liu S, Gao W, Su R, Feng K. MulTFBS: A Spatial-Temporal Network with Multichannels for Predicting Transcription Factor Binding Sites. J Chem Inf Model 2024; 64:4322-4333. [PMID: 38733561 DOI: 10.1021/acs.jcim.3c02088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2024]
Abstract
Revealing the mechanisms that influence transcription factor binding specificity is the key to understanding gene regulation. In previous studies, DNA double helix structure and one-hot embedding have been used successfully to design computational methods for predicting transcription factor binding sites (TFBSs). However, DNA sequence as a kind of biological language, the method of word embedding representation in natural language processing, has not been considered properly in TFBS prediction models. In our work, we integrate different types of features of DNA sequence to design a multichanneled deep learning framework, namely MulTFBS, in which independent one-hot encoding, word embedding encoding, which can incorporate contextual information and extract the global features of the sequences, and double helix three-dimensional structural features have been trained in different channels. To extract sequence high-level information effectively, in our deep learning framework, we select the spatial-temporal network by combining convolutional neural networks and bidirectional long short-term memory networks with attention mechanism. Compared with six state-of-the-art methods on 66 universal protein-binding microarray data sets of different transcription factors, MulTFBS performs best on all data sets in the regression tasks, with the average R2 of 0.698 and the average PCC of 0.833, which are 5.4% and 3.2% higher, respectively, than the suboptimal method CRPTS. In addition, we evaluate the classification performance of MulTFBS for distinguishing bound or unbound regions on TF ChIP-seq data. The results show that our framework also performs well in the TFBS classification tasks.
Collapse
Affiliation(s)
- Jujuan Zhuang
- The School of Science, Dalian Maritime University, Dalian 116026, China
| | - Xinru Huang
- The School of Science, Dalian Maritime University, Dalian 116026, China
| | - Shuhan Liu
- The School of Science, Dalian Maritime University, Dalian 116026, China
| | - Wanquan Gao
- The School of Science, Dalian Maritime University, Dalian 116026, China
| | - Rui Su
- The School of Science, Dalian Maritime University, Dalian 116026, China
| | - Kexin Feng
- The School of Science, Dalian Maritime University, Dalian 116026, China
| |
Collapse
|
7
|
Singh A, Acharya B, Mukherjee B, Boorla VS, Boral S, Maiti S, De S. Stability and dynamics of extradenticle modulates its function. Curr Res Struct Biol 2024; 7:100150. [PMID: 38784963 PMCID: PMC11112286 DOI: 10.1016/j.crstbi.2024.100150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 04/03/2024] [Accepted: 05/06/2024] [Indexed: 05/25/2024] Open
Abstract
Extradenticle (EXD) is a partner protein of the HOX transcription factors and plays an important role in the development of Drosophila. It confers increased affinity and specificity of DNA-binding to the HOX proteins. However, the DNA-binding homeodomain of EXD has a significantly weaker affinity to DNA compared to the HOX homeodomains. Here, we show that a glycine residue (G290) in the middle of the EXD DNA-binding helix primarily results in this weaker binding. Glycine destabilizes helices. To probe its role in the stability and function of the protein, G290 was mutated to alanine. The intrinsic stability of the DNA-binding helix increased in the G290A mutant as observed by NMR studies and molecular dynamics (MD) simulation. Also, NMR dynamics and MD simulation show that dynamic motions present in the wild-type protein are quenched in the mutant. This in turn resulted in increased stability of the entire homeodomain (ΔΔGG→A of -2.6 kcal/mol). Increased protein stability resulted in three-fold better DNA-binding affinity of the mutant as compared to the wild-type protein. Molecular mechanics with generalized Born and surface area solvation (MMGBSA) analysis of our MD simulation on DNA-bound models of both wild-type and mutant proteins shows that the contribution to binding is enhanced for most of the interface residues in the mutant compared to the wild-type. Interestingly, the flexible N-terminal arm makes more stable contact with the DNA minor groove in the mutant. We found that the two interaction sites i.e. the DNA-binding helix and the unstructured N-terminal arm influence each other via the bound DNA. These results provide an interesting conundrum: alanine at position 290 enhances both the stability and the DNA-binding affinity of the protein, however, evolution prefers glycine at this position. We have provided several plausible explanations for this apparent conundrum. The function of the EXD as a HOX co-factor requires its ability to discriminate similar DNA sequences, which is most likely comprom.
Collapse
Affiliation(s)
- Aakanksha Singh
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Bidisha Acharya
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Beas Mukherjee
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | | | | | | | - Soumya De
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| |
Collapse
|
8
|
Adhikari PB, Liu X, Huang C, Mitsuda N, Notaguchi M, Kasahara RD. Transcription Factors behind MYB98 Regulation: What Does the Discovery of SaeM Suggest? PLANTS (BASEL, SWITZERLAND) 2024; 13:1007. [PMID: 38611536 PMCID: PMC11013860 DOI: 10.3390/plants13071007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 03/15/2024] [Accepted: 03/27/2024] [Indexed: 04/14/2024]
Abstract
MYB98 is master regulator of the molecular network involved in pollen tube attraction. Until recently, it was unclear how this gene exhibits exclusively synergid cell-specific expression in ovule. Our recent study has established that a 16-bp-long SaeM element is crucial for its synergid cell-specific expression in ovule, and an 84-bp-long fragment harboring SaeM is sufficient to drive the process. In this study, we have developed a workflow to predict functional roles of potential transcription factors (TFs) putatively binding to the promoter region, taking MYB98 promoter as a test subject. After sequential assessment of co-expression pattern, network analysis, and potential master regulator identification, we have proposed a multi-TF model for MYB98 regulation. Our study suggests that ANL2, GT-1, and their respective homologs could be direct regulators of MYB98 and indicates that TCP15, TCP16, FRS9, and HB34 are likely master regulators of the majority of the TFs involved in its regulation. Comprehensive studies in the future are expected to offer more insights into such propositions. Developed workflow can be used while designing similar regulome-related studies for any other species and genes.
Collapse
Affiliation(s)
- Prakash B. Adhikari
- Biotechnology and Bioscience Research Center, Nagoya University, Nagoya 464-8601, Japan;
| | - Xiaoyan Liu
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (X.L.); (C.H.)
| | - Chen Huang
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (X.L.); (C.H.)
| | - Nobutaka Mitsuda
- Bioproduction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba 305-8560, Japan;
| | - Michitaka Notaguchi
- Biotechnology and Bioscience Research Center, Nagoya University, Nagoya 464-8601, Japan;
| | - Ryushiro Dora Kasahara
- Biotechnology and Bioscience Research Center, Nagoya University, Nagoya 464-8601, Japan;
| |
Collapse
|
9
|
Khetan S, Bulyk ML. Overlapping binding sites underlie TF genomic occupancy. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.05.583629. [PMID: 38496549 PMCID: PMC10942454 DOI: 10.1101/2024.03.05.583629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Sequence-specific DNA binding by transcription factors (TFs) is a crucial step in gene regulation. However, current high-throughput in vitro approaches cannot reliably detect lower affinity TF-DNA interactions, which play key roles in gene regulation. Here, we developed PADIT-seq ( p rotein a ffinity to D NA by in vitro transcription and RNA seq uencing) to assay TF binding preferences to all 10-bp DNA sequences at far greater sensitivity than prior approaches. The expanded catalogs of low affinity DNA binding sites for the human TFs HOXD13 and EGR1 revealed that nucleotides flanking high affinity DNA binding sites create overlapping lower affinity sites that together modulate TF genomic occupancy in vivo . Formation of such extended recognition sequences stems from an inherent property of TF binding sites to interweave each other and expands the genomic sequence space for identifying noncoding variants that directly alter TF binding. One-Sentence Summary Overlapping DNA binding sites underlie TF genomic occupancy through their inherent propensity to interweave each other.
Collapse
|
10
|
Li J, Chiu TP, Rohs R. Predicting DNA structure using a deep learning method. Nat Commun 2024; 15:1243. [PMID: 38336958 PMCID: PMC10858265 DOI: 10.1038/s41467-024-45191-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 01/17/2024] [Indexed: 02/12/2024] Open
Abstract
Understanding the mechanisms of protein-DNA binding is critical in comprehending gene regulation. Three-dimensional DNA structure, also described as DNA shape, plays a key role in these mechanisms. In this study, we present a deep learning-based method, Deep DNAshape, that fundamentally changes the current k-mer based high-throughput prediction of DNA shape features by accurately accounting for the influence of extended flanking regions, without the need for extensive molecular simulations or structural biology experiments. By using the Deep DNAshape method, DNA structural features can be predicted for any length and number of DNA sequences in a high-throughput manner, providing an understanding of the effects of flanking regions on DNA structure in a target region of a sequence. The Deep DNAshape method provides access to the influence of distant flanking regions on a region of interest. Our findings reveal that DNA shape readout mechanisms of a core target are quantitatively affected by flanking regions, including extended flanking regions, providing valuable insights into the detailed structural readout mechanisms of protein-DNA binding. Furthermore, when incorporated in machine learning models, the features generated by Deep DNAshape improve the model prediction accuracy. Collectively, Deep DNAshape can serve as versatile and powerful tool for diverse DNA structure-related studies.
Collapse
Affiliation(s)
- Jinsen Li
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, 90089, USA
| | - Tsu-Pei Chiu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, 90089, USA
| | - Remo Rohs
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, 90089, USA.
- Department of Chemistry, University of Southern California, Los Angeles, CA, 90089, USA.
- Department of Physics and Astronomy, University of Southern California, Los Angeles, CA, 90089, USA.
- Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA, 90089, USA.
| |
Collapse
|
11
|
Ritesh KC, de Boer RL, Lin M, Jeannotte L, Philippidou P. Multimodal Hox5 activity generates motor neuron diversity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.08.579338. [PMID: 38370781 PMCID: PMC10871347 DOI: 10.1101/2024.02.08.579338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Motor neurons (MNs) are the final output of circuits driving fundamental behaviors, such as respiration and locomotion. Hox proteins are essential in generating the MN diversity required for accomplishing these functions, but the transcriptional mechanisms that enable Hox paralogs to assign distinct MN subtype identities despite their promiscuous DNA binding motif are not well understood. Here we show that Hoxa5 controls chromatin accessibility in all mouse spinal cervical MN subtypes and engages TALE co-factors to directly bind and regulate subtype-specific genes. We identify a paralog-specific interaction of Hoxa5 with the phrenic MN-specific transcription factor Scip and show that heterologous expression of Hoxa5 and Scip is sufficient to suppress limb-innervating MN identity. We also demonstrate that phrenic MN identity is stable after Hoxa5 downregulation and identify Klf proteins as potential regulators of phrenic MN maintenance. Our data identify multiple modes of Hoxa5 action that converge to induce and maintain MN identity.
Collapse
Affiliation(s)
- K C Ritesh
- Department of Neurosciences, Case Western Reserve University, Cleveland, OH, USA
| | - Raquel López de Boer
- Department of Neurosciences, Case Western Reserve University, Cleveland, OH, USA
| | - Minshan Lin
- Department of Neurosciences, Case Western Reserve University, Cleveland, OH, USA
| | - Lucie Jeannotte
- Department of Molecular Biology, Medical Biochemistry & Pathology, Université Laval, Centre Recherche sur le Cancer de l'Université Laval, Centre de recherche du CHU de Québec-Université Laval (Oncology), Québec, Canada
| | - Polyxeni Philippidou
- Department of Neurosciences, Case Western Reserve University, Cleveland, OH, USA
| |
Collapse
|
12
|
Rosales-Vega M, Reséndez-Pérez D, Vázquez M. Antennapedia: The complexity of a master developmental transcription factor. Genesis 2024; 62:e23561. [PMID: 37830148 DOI: 10.1002/dvg.23561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 09/25/2023] [Accepted: 09/27/2023] [Indexed: 10/14/2023]
Abstract
Hox genes encode transcription factors that play an important role in establishing the basic body plan of animals. In Drosophila, Antennapedia is one of the five genes that make up the Antennapedia complex (ANT-C). Antennapedia determines the identity of the second thoracic segment, known as the mesothorax. Misexpression of Antennapedia at different developmental stages changes the identity of the mesothorax, including the muscles, nervous system, and cuticle. In Drosophila, Antennapedia has two distinct promoters highly regulated throughout development by several transcription factors. Antennapedia proteins are found with other transcription factors in different ANTENNAPEDIA transcriptional complexes to regulate multiple subsets of target genes. In this review, we describe the different mechanisms that regulate the expression and function of Antennapedia and the role of this Hox gene in the development of Drosophila.
Collapse
Affiliation(s)
- Marco Rosales-Vega
- Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Diana Reséndez-Pérez
- Facultad de Ciencias Biológicas, Departamento de Inmunología y Virología, Universidad Autónoma de Nuevo León, San Nicolás de los Garza, Nuevo León, Mexico
| | - Martha Vázquez
- Departamento de Genética del Desarrollo y Fisiología Molecular, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| |
Collapse
|
13
|
Kim S, Morgunova E, Naqvi S, Goovaerts S, Bader M, Koska M, Popov A, Luong C, Pogson A, Swigut T, Claes P, Taipale J, Wysocka J. DNA-guided transcription factor cooperativity shapes face and limb mesenchyme. Cell 2024; 187:692-711.e26. [PMID: 38262408 PMCID: PMC10872279 DOI: 10.1016/j.cell.2023.12.032] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 10/23/2023] [Accepted: 12/27/2023] [Indexed: 01/25/2024]
Abstract
Transcription factors (TFs) can define distinct cellular identities despite nearly identical DNA-binding specificities. One mechanism for achieving regulatory specificity is DNA-guided TF cooperativity. Although in vitro studies suggest that it may be common, examples of such cooperativity remain scarce in cellular contexts. Here, we demonstrate how "Coordinator," a long DNA motif composed of common motifs bound by many basic helix-loop-helix (bHLH) and homeodomain (HD) TFs, uniquely defines the regulatory regions of embryonic face and limb mesenchyme. Coordinator guides cooperative and selective binding between the bHLH family mesenchymal regulator TWIST1 and a collective of HD factors associated with regional identities in the face and limb. TWIST1 is required for HD binding and open chromatin at Coordinator sites, whereas HD factors stabilize TWIST1 occupancy at Coordinator and titrate it away from HD-independent sites. This cooperativity results in the shared regulation of genes involved in cell-type and positional identities and ultimately shapes facial morphology and evolution.
Collapse
Affiliation(s)
- Seungsoo Kim
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA; Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford, CA 94305, USA
| | - Ekaterina Morgunova
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Solna, Sweden
| | - Sahin Naqvi
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA; Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305, USA; Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Seppe Goovaerts
- Medical Imaging Research Center, UZ Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Maram Bader
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA; Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305, USA
| | - Mervenaz Koska
- Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA
| | | | - Christy Luong
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA
| | - Angela Pogson
- Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA
| | - Tomek Swigut
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA; Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford, CA 94305, USA
| | - Peter Claes
- Medical Imaging Research Center, UZ Leuven, Leuven, Belgium; Department of Human Genetics, KU Leuven, Leuven, Belgium; Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium
| | - Jussi Taipale
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Solna, Sweden; Department of Biochemistry, University of Cambridge, Cambridge, UK; Applied Tumor Genomics Program, University of Helsinki, Helsinki, Finland
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA; Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford, CA 94305, USA.
| |
Collapse
|
14
|
Jiang Y, Chiu TP, Mitra R, Rohs R. Probing the role of the protonation state of a minor groove-linker histidine in Exd-Hox-DNA binding. Biophys J 2024; 123:248-259. [PMID: 38130056 PMCID: PMC10808038 DOI: 10.1016/j.bpj.2023.12.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 09/22/2023] [Accepted: 12/13/2023] [Indexed: 12/23/2023] Open
Abstract
DNA recognition and targeting by transcription factors (TFs) through specific binding are fundamental in biological processes. Furthermore, the histidine protonation state at the TF-DNA binding interface can significantly influence the binding mechanism of TF-DNA complexes. Nevertheless, the role of histidine in TF-DNA complexes remains underexplored. Here, we employed all-atom molecular dynamics simulations using AlphaFold2-modeled complexes based on previously solved co-crystal structures to probe the role of the His-12 residue in the Extradenticle (Exd)-Sex combs reduced (Scr)-DNA complex when binding to Scr and Ultrabithorax (Ubx) target sites. Our results demonstrate that the protonation state of histidine notably affected the DNA minor-groove width profile and binding free energy. Examining flanking sequences of various binding affinities derived from SELEX-seq experiments, we analyzed the relationship between binding affinity and specificity. We uncovered how histidine protonation leads to increased binding affinity but can lower specificity. Our findings provide new mechanistic insights into the role of histidine in modulating TF-DNA binding.
Collapse
Affiliation(s)
- Yibei Jiang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California
| | - Tsu-Pei Chiu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California
| | - Raktim Mitra
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California
| | - Remo Rohs
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California; Department of Chemistry, University of Southern California, Los Angeles, California; Department of Physics and Astronomy, University of Southern California, Los Angeles, California; Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, California.
| |
Collapse
|
15
|
Bobola N, Sagerström CG. TALE transcription factors: Cofactors no more. Semin Cell Dev Biol 2024; 152-153:76-84. [PMID: 36509674 DOI: 10.1016/j.semcdb.2022.11.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 10/27/2022] [Accepted: 11/30/2022] [Indexed: 12/14/2022]
Abstract
Exd/PBX, Hth/MEIS and PREP proteins belong to the TALE (three-amino-acid loop extension) superclass of transcription factors (TFs) with an atypical homedomain (HD). Originally discovered as "cofactors" to HOX proteins, revisiting their traditional role in light of genome-wide experiments reveals a strong and reproducible pattern of HOX and TALE co-occupancy across diverse embryonic tissues. While confirming that TALE increases HOX specificity and selectivity in vivo, this wider outlook also reveals novel aspects of HOX:TALE collaboration, namely that HOX TFs generally require pre-bound TALE factors to access their functional binding sites in vivo. In contrast to the restricted expression domains of HOX TFs, TALE factors are largely ubiquitous, and PBX and PREP are expressed at the earliest developmental stages. PBX and MEIS control development of many organs and tissues and their dysregulation is associated with congenital disease and cancer. Accordingly, many instances of TALE cooperation with non HOX TFs have been documented in various systems. The model that emerges from these studies is that TALE TFs create a permissive chromatin platform that is selected by tissue-restricted TFs for binding. In turn, HOX and other tissue-restricted TFs selectively convert a ubiquitous pool of low affinity TALE binding events into high confidence, tissue-restricted binding events associated with transcriptional activation. As a result, TALE:TF complexes are associated with active chromatin and domain/lineage-specific gene activity. TALE ubiquitous expression and broad genomic occupancy, as well as the increasing examples of TALE tissue-specific partners, reveal a universal and obligatory role for TALE in the control of tissue and lineage-specific transcriptional programs, beyond their initial discovery as HOX co-factors.
Collapse
Affiliation(s)
- Nicoletta Bobola
- School of Medical Sciences, University of Manchester, Manchester, UK.
| | - Charles G Sagerström
- Section of Developmental Biology, Department of Pediatrics, University of Colorado Medical School, Aurora, CO, USA.
| |
Collapse
|
16
|
Salomone J, Farrow E, Gebelein B. Homeodomain complex formation and biomolecular condensates in Hox gene regulation. Semin Cell Dev Biol 2024; 152-153:93-100. [PMID: 36517343 PMCID: PMC10258226 DOI: 10.1016/j.semcdb.2022.11.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 10/21/2022] [Accepted: 11/30/2022] [Indexed: 12/15/2022]
Abstract
Hox genes are a family of homeodomain transcription factors that regulate specialized morphological structures along the anterior-posterior axis of metazoans. Over the past few decades, researchers have focused on defining how Hox factors with similar in vitro DNA binding activities achieve sufficient target specificity to regulate distinct cell fates in vivo. In this review, we highlight how protein interactions with other transcription factors, many of which are also homeodomain proteins, result in the formation of transcription factor complexes with enhanced DNA binding specificity. These findings suggest that Hox-regulated enhancers utilize distinct combinations of homeodomain binding sites, many of which are low-affinity, to recruit specific Hox complexes. However, low-affinity sites can only yield reproducible responses with high transcription factor concentrations. To overcome this limitation, recent studies revealed how transcription factors, including Hox factors, use intrinsically disordered domains (IDRs) to form biomolecular condensates that increase protein concentrations. Moreover, Hox factors with altered IDRs have been associated with altered transcriptional activity and human disease states, demonstrating the importance of IDRs in mediating essential Hox output. Collectively, these studies highlight how Hox factors use their DNA binding domains, protein-protein interaction domains, and IDRs to form specific transcription factor complexes that yield accurate gene expression.
Collapse
Affiliation(s)
- Joseph Salomone
- Graduate Program in Molecular and Developmental Biology, Cincinnati Children's Hospital Research Foundation, Cincinnati, OH 45229, USA; Medical-Scientist Training Program, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| | - Edward Farrow
- Graduate Program in Molecular and Developmental Biology, Cincinnati Children's Hospital Research Foundation, Cincinnati, OH 45229, USA; Medical-Scientist Training Program, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| | - Brian Gebelein
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, 3333 Burnet Ave, MLC 7007, Cincinnati, OH 45229, USA; Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA.
| |
Collapse
|
17
|
Mitra R, Li J, Sagendorf JM, Jiang Y, Chiu TP, Rohs R. DeepPBS: Geometric deep learning for interpretable prediction of protein-DNA binding specificity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.15.571942. [PMID: 38293168 PMCID: PMC10827229 DOI: 10.1101/2023.12.15.571942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
Predicting specificity in protein-DNA interactions is a challenging yet essential task for understanding gene regulation. Here, we present Deep Predictor of Binding Specificity (DeepPBS), a geometric deep-learning model designed to predict binding specificity across protein families based on protein-DNA structures. The DeepPBS architecture allows investigation of different family-specific recognition patterns. DeepPBS can be applied to predicted structures, and can aid in the modeling of protein-DNA complexes. DeepPBS is interpretable and can be used to calculate protein heavy atom-level importance scores, demonstrated as a case-study on p53-DNA interface. When aggregated at the protein residue level, these scores conform well with alanine scanning mutagenesis experimental data. The inference time for DeepPBS is sufficiently fast for analyzing simulation trajectories, as demonstrated on a molecular-dynamics simulation of a Drosophila Hox-DNA tertiary complex with its cofactor. DeepPBS and its corresponding data resources offer a foundation for machine-aided protein-DNA interaction studies, guiding experimental choices and complex design, as well as advancing our understanding of molecular interactions.
Collapse
|
18
|
Chen J, Lu J, Liu J, Fang J, Zhong X, Song J. DNA conformational dynamics in the context-dependent non-CG CHH methylation by plant methyltransferase DRM2. J Biol Chem 2023; 299:105433. [PMID: 37926286 PMCID: PMC10711165 DOI: 10.1016/j.jbc.2023.105433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 10/05/2023] [Accepted: 10/29/2023] [Indexed: 11/07/2023] Open
Abstract
DNA methylation provides an important epigenetic mechanism that critically regulates gene expression, genome imprinting, and retrotransposon silencing. In plants, DNA methylation is prevalent not only in a CG dinucleotide context but also in non-CG contexts, namely CHG and CHH (H = C, T, or A) methylation. It has been established that plant non-CG DNA methylation is highly context dependent, with the +1- and +2-flanking sequences enriched with A/T nucleotides. How DNA sequence, conformation, and dynamics influence non-CG methylation remains elusive. Here, we report structural and biochemical characterizations of the intrinsic substrate preference of DOMAINS REARRANGED METHYLTRANSFERASE 2 (DRM2), a plant DNA methyltransferase responsible for establishing all cytosine methylation and maintaining CHH methylation. Among nine CHH motifs, the DRM2 methyltransferase (MTase) domain shows marked substrate preference toward CWW (W = A or T) motifs, correlating well with their relative abundance in planta. Furthermore, we report the crystal structure of DRM2 MTase in complex with a DNA duplex containing a flexible TpA base step at the +1/+2-flanking sites of the target nucleotide. Comparative structural analysis of the DRM2-DNA complexes provides a mechanism by which flanking nucleotide composition impacts DRM2-mediated DNA methylation. Furthermore, the flexibility of the TpA step gives rise to two alternative DNA conformations, resulting in different interactions with DRM2 and consequently temperature-dependent shift of the substrate preference of DRM2. Together, this study provides insights into how the interplay between the conformational dynamics of DNA and temperature as an environmental factor contributes to the context-dependent CHH methylation by DRM2.
Collapse
Affiliation(s)
- Jianbin Chen
- Department of Biochemistry, University of California, Riverside, California, USA
| | - Jiuwei Lu
- Department of Biochemistry, University of California, Riverside, California, USA
| | - Jie Liu
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Jian Fang
- Department of Biochemistry, University of California, Riverside, California, USA
| | - Xuehua Zhong
- Department of Biology, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Jikui Song
- Department of Biochemistry, University of California, Riverside, California, USA.
| |
Collapse
|
19
|
Wen C, Yuan Z, Zhang X, Chen H, Luo L, Li W, Li T, Ma N, Mao F, Lin D, Lin Z, Lin C, Xu T, Lü P, Lin J, Zhu F. Sea-ATI unravels novel vocabularies of plant active cistrome. Nucleic Acids Res 2023; 51:11568-11583. [PMID: 37850650 PMCID: PMC10681729 DOI: 10.1093/nar/gkad853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 08/11/2023] [Accepted: 09/25/2023] [Indexed: 10/19/2023] Open
Abstract
The cistrome consists of all cis-acting regulatory elements recognized by transcription factors (TFs). However, only a portion of the cistrome is active for TF binding in a specific tissue. Resolving the active cistrome in plants remains challenging. In this study, we report the assay sequential extraction assisted-active TF identification (sea-ATI), a low-input method that profiles the DNA sequences recognized by TFs in a target tissue. We applied sea-ATI to seven plant tissues to survey their active cistrome and generated 41 motif models, including 15 new models that represent previously unidentified cis-regulatory vocabularies. ATAC-seq and RNA-seq analyses confirmed the functionality of the cis-elements from the new models, in that they are actively bound in vivo, located near the transcription start site, and influence chromatin accessibility and transcription. Furthermore, comparing dimeric WRKY CREs between sea-ATI and DAP-seq libraries revealed that thermodynamics and genetic drifts cooperatively shaped their evolution. Notably, sea-ATI can identify not only positive but also negative regulatory cis-elements, thereby providing unique insights into the functional non-coding genome of plants.
Collapse
Affiliation(s)
- Chenjin Wen
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Zhen Yuan
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Xiaotian Zhang
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Hao Chen
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Lin Luo
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Wanying Li
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Tian Li
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Nana Ma
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Fei Mao
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Dongmei Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Zhanxi Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Chentao Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Tongda Xu
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Peitao Lü
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Juncheng Lin
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| | - Fangjie Zhu
- College of Life Science, Haixia Institute of Science and Technology, National Engineering Research Center of JUNCAO, Fujian Agriculture and Forestry University, Fuzhou 350002, Fujian, China
| |
Collapse
|
20
|
Martin V, Zhuang F, Zhang Y, Pinheiro K, Gordân R. High-throughput data and modeling reveal insights into the mechanisms of cooperative DNA-binding by transcription factor proteins. Nucleic Acids Res 2023; 51:11600-11612. [PMID: 37889068 PMCID: PMC10681739 DOI: 10.1093/nar/gkad872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 09/21/2023] [Accepted: 10/05/2023] [Indexed: 10/28/2023] Open
Abstract
Cooperative DNA-binding by transcription factor (TF) proteins is critical for eukaryotic gene regulation. In the human genome, many regulatory regions contain TF-binding sites in close proximity to each other, which can facilitate cooperative interactions. However, binding site proximity does not necessarily imply cooperative binding, as TFs can also bind independently to each of their neighboring target sites. Currently, the rules that drive cooperative TF binding are not well understood. In addition, it is oftentimes difficult to infer direct TF-TF cooperativity from existing DNA-binding data. Here, we show that in vitro binding assays using DNA libraries of a few thousand genomic sequences with putative cooperative TF-binding events can be used to develop accurate models of cooperativity and to gain insights into cooperative binding mechanisms. Using factors ETS1 and RUNX1 as our case study, we show that the distance and orientation between ETS1 sites are critical determinants of cooperative ETS1-ETS1 binding, while cooperative ETS1-RUNX1 interactions show more flexibility in distance and orientation and can be accurately predicted based on the affinity and sequence/shape features of the binding sites. The approach described here, combining custom experimental design with machine-learning modeling, can be easily applied to study the cooperative DNA-binding patterns of any TFs.
Collapse
Affiliation(s)
- Vincentius Martin
- Department of Computer Science, Durham, NC 27708, USA
- Center for Genomic & Computational Biology, Durham, NC 27708, USA
| | - Farica Zhuang
- Department of Computer Science, Durham, NC 27708, USA
- Center for Genomic & Computational Biology, Durham, NC 27708, USA
| | - Yuning Zhang
- Center for Genomic & Computational Biology, Durham, NC 27708, USA
- Program in Computational Biology & Bioinformatics, Durham, NC 27708, USA
| | - Kyle Pinheiro
- Department of Computer Science, Durham, NC 27708, USA
- Center for Genomic & Computational Biology, Durham, NC 27708, USA
| | - Raluca Gordân
- Department of Computer Science, Durham, NC 27708, USA
- Center for Genomic & Computational Biology, Durham, NC 27708, USA
- Department of Biostatistics & Bioinformatics, Department of Molecular Genetics and Microbiology, Department of Cell Biology, Duke University, Durham, NC 27708, USA
| |
Collapse
|
21
|
Stevenson MJ, Phanor SK, Patel U, Gisselbrecht SS, Bulyk ML, O'Brien LL. Altered binding affinity of SIX1-Q177R correlates with enhanced WNT5A and WNT pathway effector expression in Wilms tumor. Dis Model Mech 2023; 16:dmm050208. [PMID: 37815464 PMCID: PMC10668032 DOI: 10.1242/dmm.050208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 09/27/2023] [Indexed: 10/11/2023] Open
Abstract
Wilms tumors present as an amalgam of varying proportions of tissues located within the developing kidney, one being the nephrogenic blastema comprising multipotent nephron progenitor cells (NPCs). The recurring missense mutation Q177R in NPC transcription factors SIX1 and SIX2 is most correlated with tumors of blastemal histology and is significantly associated with relapse. Yet, the transcriptional regulatory consequences of SIX1/2-Q177R that might promote tumor progression and recurrence have not been investigated extensively. Utilizing multiple Wilms tumor transcriptomic datasets, we identified upregulation of the gene encoding non-canonical WNT ligand WNT5A in addition to other WNT pathway effectors in SIX1/2-Q177R mutant tumors. SIX1 ChIP-seq datasets from Wilms tumors revealed shared binding sites for SIX1/SIX1-Q177R within a promoter of WNT5A and at putative distal cis-regulatory elements (CREs). We demonstrate colocalization of SIX1 and WNT5A in Wilms tumor tissue and utilize in vitro assays that support SIX1 and SIX1-Q177R activation of expression from the WNT5A CREs, as well as enhanced binding affinity within the WNT5A promoter that may promote the differential expression of WNT5A and other WNT pathway effectors associated with SIX1-Q177R tumors.
Collapse
Affiliation(s)
- Matthew J. Stevenson
- Department of Cell Biology and Physiology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Sabrina K. Phanor
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Urvi Patel
- Department of Cell Biology and Physiology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Stephen S. Gisselbrecht
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Martha L. Bulyk
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
- Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Lori L. O'Brien
- Department of Cell Biology and Physiology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
22
|
Li J, Chiu TP, Rohs R. Deep DNAshape: Predicting DNA shape considering extended flanking regions using a deep learning method. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.22.563383. [PMID: 37961633 PMCID: PMC10634709 DOI: 10.1101/2023.10.22.563383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Understanding the mechanisms of protein-DNA binding is critical in comprehending gene regulation. Three-dimensional DNA shape plays a key role in these mechanisms. In this study, we present a deep learning-based method, Deep DNAshape, that fundamentally changes the current k -mer based high-throughput prediction of DNA shape features by accurately accounting for the influence of extended flanking regions, without the need for extensive molecular simulations or structural biology experiments. By using the Deep DNAshape method, refined DNA shape features can be predicted for any length and number of DNA sequences in a high-throughput manner, providing a deeper understanding of the effects of flanking regions on DNA shape in a target region of a sequence. Deep DNAshape method provides access to the influence of distant flanking regions on a region of interest. Our findings reveal that DNA shape readout mechanisms of a core target are quantitatively affected by flanking regions, including extended flanking regions, providing valuable insights into the detailed structural readout mechanisms of protein-DNA binding. Furthermore, when incorporated in machine learning models, the features generated by Deep DNAshape improve the model prediction accuracy. Collectively, Deep DNAshape can serve as a versatile and powerful tool for diverse DNA structure-related studies.
Collapse
|
23
|
Khan S, Pradhan SJ, Giraud G, Bleicher F, Paul R, Merabet S, Shashidhara LS. A Micro-evolutionary Change in Target Binding Sites as a Key Determinant of Ultrabithorax Function in Drosophila. J Mol Evol 2023; 91:616-627. [PMID: 37341745 DOI: 10.1007/s00239-023-10123-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Accepted: 06/01/2023] [Indexed: 06/22/2023]
Abstract
Hox genes encode Homeodomain-containing transcription factors, which specify segmental identities along the anterior-posterior axis. Functional changes in Hox genes have been directly implicated in the evolution of body plans across the metazoan lineage. The Hox protein Ultrabithorax (Ubx) is expressed and required in developing third thoracic (T3) segments in holometabolous insects studied so far, particularly, of the order Coleoptera, Lepidoptera and Diptera. Ubx function is key to specify differential development of the second (T2) and T3 thoracic segments in these insects. While Ubx is expressed in the third thoracic segment in developing larvae of Hymenopteran Apis mellifera, the morphological differences between T2 and T3 are subtle. To identify evolutionary changes that are behind the differential function of Ubx in Drosophila and Apis, which are diverged for more than 350 million years, we performed comparative analyses of genome wide Ubx-binding sites between these two insects. Our studies reveal that a motif with a TAAAT core is a preferred binding site for Ubx in Drosophila, but not in Apis. Biochemical and transgenic assays suggest that in Drosophila, the TAAAT core sequence in the Ubx binding sites is required for Ubx-mediated regulation of two of its target genes studied here; CG13222, a gene that is normally upregulated by Ubx and vestigial (vg), whose expression is repressed by Ubx in T3. Interestingly, changing the TAAT site to a TAAAT site was sufficient to bring an otherwise unresponsive enhancer of the vg gene from Apis under the control of Ubx in a Drosophila transgenic assay. Taken together, our results suggest an evolutionary mechanism by which critical wing patterning genes might have come under the regulation of Ubx in the Dipteran lineage.
Collapse
Affiliation(s)
- Soumen Khan
- Indian Institute of Science Education and Research (IISER), Pune, 411008, India.
| | - Saurabh J Pradhan
- Indian Institute of Science Education and Research (IISER), Pune, 411008, India
| | - Guillaume Giraud
- IGFL, ENS Lyon, UMR5242, 32 Av. Tony Garnier, 69007, Lyon, France
| | | | - Rachel Paul
- IGFL, ENS Lyon, UMR5242, 32 Av. Tony Garnier, 69007, Lyon, France
| | - Samir Merabet
- IGFL, ENS Lyon, UMR5242, 32 Av. Tony Garnier, 69007, Lyon, France
| | - L S Shashidhara
- Indian Institute of Science Education and Research (IISER), Pune, 411008, India.
- Ashoka University, Sonipat, Haryana, 131029, India.
| |
Collapse
|
24
|
Mahendrawada L, Warfield L, Donczew R, Hahn S. Surprising connections between DNA binding and function for the near-complete set of yeast transcription factors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.25.550593. [PMID: 37546716 PMCID: PMC10402042 DOI: 10.1101/2023.07.25.550593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
DNA sequence-specific transcription factors (TFs) modulate transcription and chromatin architecture, acting from regulatory sites in enhancers and promoters of eukaryotic genes. How TFs locate their DNA targets and how multiple TFs cooperate to regulate individual genes is still unclear. Most yeast TFs are thought to regulate transcription via binding to upstream activating sequences, situated within a few hundred base pairs upstream of the regulated gene. While this model has been validated for individual TFs and specific genes, it has not been tested in a systematic way with the large set of yeast TFs. Here, we have integrated information on the binding and expression targets for the near-complete set of yeast TFs. While we found many instances of functional TF binding sites in upstream regulatory regions, we found many more instances that do not fit this model. In many cases, rapid TF depletion affects gene expression where there is no detectable binding of that TF to the upstream region of the affected gene. In addition, for most TFs, only a small fraction of bound TFs regulates the nearby gene, showing that TF binding does not automatically correspond to regulation of the linked gene. Finally, we found that only a small percentage of TFs are exclusively strong activators or repressors with most TFs having dual function. Overall, our comprehensive mapping of TF binding and regulatory targets have both confirmed known TF relationships and revealed surprising properties of TF function.
Collapse
|
25
|
Lacher SE, Skon-Hegg C, Ruis BL, Krznarich J, Slattery M. An antioxidant response element regulates the HIF1α axis in breast cancer cells. Free Radic Biol Med 2023; 204:243-251. [PMID: 37179033 PMCID: PMC10321210 DOI: 10.1016/j.freeradbiomed.2023.05.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Revised: 04/28/2023] [Accepted: 05/02/2023] [Indexed: 05/15/2023]
Abstract
The redox sensitive transcription factor NRF2 is a central regulator of the transcriptional response to reactive oxygen species (ROS). NRF2 is widely recognized for its ROS-responsive upregulation of antioxidant genes that are essential for mitigating the damaging effects of oxidative stress. However, multiple genome-wide approaches have suggested that NRF2's regulatory reach extends well beyond the canonical antioxidant genes, with the potential to regulate many noncanonical target genes. Recent work from our lab and others suggests HIF1A, which encodes the hypoxia-responsive transcription factor HIF1α, is one such noncanonical NRF2 target. These studies found that NRF2 activity is associated with high HIF1A expression in multiple cellular contexts, HIF1A expression is partially dependent on NRF2, and there is a putative NRF2 binding site (antioxidant response element, or ARE) approximately 30 kilobases upstream of HIF1A. These findings all support a model in which HIF1A is a direct target of NRF2, but did not confirm the functional importance of the upstream ARE in HIF1A expression. Here we use CRISPR/Cas9 genome editing to mutate this ARE in its genomic context and test the impact on HIF1A expression. We find that mutation of this ARE in a breast cancer cell line (MDA-MB-231) eliminates NRF2 binding and decreases HIF1A expression at the transcript and protein levels, and disrupts HIF1α target genes as well as phenotypes driven by these HIF1α targets. Taken together, these results indicate that this NRF2 targeted ARE plays an important role in the expression of HIF1A and activity of the HIF1α axis in MDA-MB-231 cells.
Collapse
Affiliation(s)
- Sarah E Lacher
- Department of Biomedical Sciences, University of Minnesota Medical School, Duluth, MN, 55812, USA.
| | - Cara Skon-Hegg
- Whiteside Institute for Clinical Research, St. Luke's Hospital, University of Minnesota Medical School, Duluth, MN, 55812, USA
| | - Brian L Ruis
- Masonic Cancer Center, University of Minnesota Medical School, Minneapolis, MN, 55455, USA
| | - Jennifer Krznarich
- Department of Biomedical Sciences, University of Minnesota Medical School, Duluth, MN, 55812, USA
| | - Matthew Slattery
- Department of Biomedical Sciences, University of Minnesota Medical School, Duluth, MN, 55812, USA.
| |
Collapse
|
26
|
Bhimsaria D, Rodríguez-Martínez JA, Mendez-Johnson JL, Ghoshdastidar D, Varadarajan A, Bansal M, Daniels DL, Ramanathan P, Ansari AZ. Hidden modes of DNA binding by human nuclear receptors. Nat Commun 2023; 14:4179. [PMID: 37443151 PMCID: PMC10345098 DOI: 10.1038/s41467-023-39577-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 06/19/2023] [Indexed: 07/15/2023] Open
Abstract
Human nuclear receptors (NRs) are a superfamily of ligand-responsive transcription factors that have central roles in cellular function. Their malfunction is linked to numerous diseases, and the ability to modulate their activity with synthetic ligands has yielded 16% of all FDA-approved drugs. NRs regulate distinct gene networks, however they often function from genomic sites that lack known binding motifs. Here, to annotate genomic binding sites of known and unexamined NRs more accurately, we use high-throughput SELEX to comprehensively map DNA binding site preferences of all full-length human NRs, in complex with their ligands. Furthermore, to identify non-obvious binding sites buried in DNA-protein interactomes, we develop MinSeq Find, a search algorithm based on the MinTerm concept from electrical engineering and digital systems design. The resulting MinTerm sequence set (MinSeqs) reveal a constellation of binding sites that more effectively annotate NR-binding profiles in cells. MinSeqs also unmask binding sites created or disrupted by 52,106 single-nucleotide polymorphisms associated with human diseases. By implicating druggable NRs as hidden drivers of multiple human diseases, our results not only reveal new biological roles of NRs, but they also provide a resource for drug-repurposing and precision medicine.
Collapse
Affiliation(s)
- Devesh Bhimsaria
- Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee, 247667, India.
| | | | | | | | - Ashwin Varadarajan
- Department of Electrical and Computer Engineering, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Manju Bansal
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
| | - Danette L Daniels
- Promega Corporation, Madison, WI, 53711, USA
- Foghorn Therapeutics, Cambridge, MA, 02139, USA
| | - Parameswaran Ramanathan
- Department of Electrical and Computer Engineering, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| | - Aseem Z Ansari
- Department of Chemical Biology and Therapeutics, St. Jude Children's Research Hospital, Memphis, TN, 38105, USA.
| |
Collapse
|
27
|
Cain B, Webb J, Yuan Z, Cheung D, Lim HW, Kovall R, Weirauch MT, Gebelein B. Prediction of cooperative homeodomain DNA binding sites from high-throughput-SELEX data. Nucleic Acids Res 2023; 51:6055-6072. [PMID: 37114997 PMCID: PMC10325903 DOI: 10.1093/nar/gkad318] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 04/12/2023] [Accepted: 04/25/2023] [Indexed: 04/29/2023] Open
Abstract
Homeodomain proteins constitute one of the largest families of metazoan transcription factors. Genetic studies have demonstrated that homeodomain proteins regulate many developmental processes. Yet, biochemical data reveal that most bind highly similar DNA sequences. Defining how homeodomain proteins achieve DNA binding specificity has therefore been a long-standing goal. Here, we developed a novel computational approach to predict cooperative dimeric binding of homeodomain proteins using High-Throughput (HT) SELEX data. Importantly, we found that 15 of 88 homeodomain factors form cooperative homodimer complexes on DNA sites with precise spacing requirements. Approximately one third of the paired-like homeodomain proteins cooperatively bind palindromic sequences spaced 3 bp apart, whereas other homeodomain proteins cooperatively bind sites with distinct orientation and spacing requirements. Combining structural models of a paired-like factor with our cooperativity predictions identified key amino acid differences that help differentiate between cooperative and non-cooperative factors. Finally, we confirmed predicted cooperative dimer sites in vivo using available genomic data for a subset of factors. These findings demonstrate how HT-SELEX data can be computationally mined to predict cooperativity. In addition, the binding site spacing requirements of select homeodomain proteins provide a mechanism by which seemingly similar AT-rich DNA sequences can preferentially recruit specific homeodomain factors.
Collapse
Affiliation(s)
- Brittany Cain
- Department of Biomedical Engineering, University of Cincinnati, Cincinnati, OH 45221, USA
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, 3333 Burnet Ave, MLC 7007, Cincinnati, OH 45229, USA
| | - Jordan Webb
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Zhenyu Yuan
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - David Cheung
- Graduate Program in Molecular and Developmental Biology, Cincinnati Children's Hospital Research Foundation, Cincinnati, OH 45229, USA
| | - Hee-Woong Lim
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| | - Rhett A Kovall
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Matthew T Weirauch
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
- Divisions of Human Genetics, Biomedical Informatics and Developmental Biology, Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
| | - Brian Gebelein
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, 3333 Burnet Ave, MLC 7007, Cincinnati, OH 45229, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA
| |
Collapse
|
28
|
Cooper BH, Dantas Machado AC, Gan Y, Aparicio O, Rohs R. DNA binding specificity of all four Saccharomyces cerevisiae forkhead transcription factors. Nucleic Acids Res 2023; 51:5621-5633. [PMID: 37177995 PMCID: PMC10287902 DOI: 10.1093/nar/gkad372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 04/19/2023] [Accepted: 04/27/2023] [Indexed: 05/15/2023] Open
Abstract
Quantifying the nucleotide preferences of DNA binding proteins is essential to understanding how transcription factors (TFs) interact with their targets in the genome. High-throughput in vitro binding assays have been used to identify the inherent DNA binding preferences of TFs in a controlled environment isolated from confounding factors such as genome accessibility, DNA methylation, and TF binding cooperativity. Unfortunately, many of the most common approaches for measuring binding preferences are not sensitive enough for the study of moderate-to-low affinity binding sites, and are unable to detect small-scale differences between closely related homologs. The Forkhead box (FOX) family of TFs is known to play a crucial role in regulating a variety of key processes from proliferation and development to tumor suppression and aging. By using the high-sequencing depth SELEX-seq approach to study all four FOX homologs in Saccharomyces cerevisiae, we have been able to precisely quantify the contribution and importance of nucleotide positions all along an extended binding site. Essential to this process was the alignment of our SELEX-seq reads to a set of candidate core sequences determined using a recently developed tool for the alignment of enriched k-mers and a newly developed approach for the reprioritization of candidate cores.
Collapse
Affiliation(s)
- Brendon H Cooper
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Ana Carolina Dantas Machado
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Yan Gan
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Molecular and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Oscar M Aparicio
- Molecular and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
| | - Remo Rohs
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA 90033, USA
- Departments of Chemistry, Physics & Astronomy, and Computer Science, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
29
|
Kim S, Morgunova E, Naqvi S, Bader M, Koska M, Popov A, Luong C, Pogson A, Claes P, Taipale J, Wysocka J. DNA-guided transcription factor cooperativity shapes face and limb mesenchyme. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.29.541540. [PMID: 37398193 PMCID: PMC10312427 DOI: 10.1101/2023.05.29.541540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Transcription factors (TFs) can define distinct cellular identities despite nearly identical DNA-binding specificities. One mechanism for achieving regulatory specificity is DNA-guided TF cooperativity. Although in vitro studies suggest it may be common, examples of such cooperativity remain scarce in cellular contexts. Here, we demonstrate how 'Coordinator', a long DNA motif comprised of common motifs bound by many basic helix-loop-helix (bHLH) and homeodomain (HD) TFs, uniquely defines regulatory regions of embryonic face and limb mesenchyme. Coordinator guides cooperative and selective binding between the bHLH family mesenchymal regulator TWIST1 and a collective of HD factors associated with regional identities in the face and limb. TWIST1 is required for HD binding and open chromatin at Coordinator sites, while HD factors stabilize TWIST1 occupancy at Coordinator and titrate it away from HD-independent sites. This cooperativity results in shared regulation of genes involved in cell-type and positional identities, and ultimately shapes facial morphology and evolution.
Collapse
Affiliation(s)
- Seungsoo Kim
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305
- Department of Developmental Biology, Stanford University, Stanford, CA 94305
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305
- Howard Hughes Medical Institute, Stanford, CA 94305
| | - Ekaterina Morgunova
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Solna, Sweden
| | - Sahin Naqvi
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305
- Department of Developmental Biology, Stanford University, Stanford, CA 94305
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305
- Department of Genetics, Stanford University, Stanford, CA 94305
| | - Maram Bader
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305
- Department of Developmental Biology, Stanford University, Stanford, CA 94305
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305
| | - Mervenaz Koska
- Department of Developmental Biology, Stanford University, Stanford, CA 94305
| | | | - Christy Luong
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305
| | - Angela Pogson
- Department of Developmental Biology, Stanford University, Stanford, CA 94305
| | - Peter Claes
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium
- Medical Imaging Research Center, UZ Leuven, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Jussi Taipale
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Solna, Sweden
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- Applied Tumor Genomics Program, University of Helsinki, Helsinki, Finland
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305
- Department of Developmental Biology, Stanford University, Stanford, CA 94305
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA 94305
- Howard Hughes Medical Institute, Stanford, CA 94305
| |
Collapse
|
30
|
Kalakoti Y, Clarancia Peter S, Gawande S, Sundar D. Modulation of DNA-protein interactions by proximal genetic elements as uncovered by interpretable deep learning. J Mol Biol 2023; 435:168121. [PMID: 37100167 DOI: 10.1016/j.jmb.2023.168121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 04/14/2023] [Accepted: 04/19/2023] [Indexed: 04/28/2023]
Abstract
Transcription factors (TF) recognize specific motifs in the genome that are typically 6-12 bp long to regulate various aspects of the cellular machinery. Presence of binding motifs and favorable genome accessibility are key drivers for a consistent TF-DNA interaction. Although these pre-requisites may occur thousands of times in the genome, there seems to be a high degree of selectivity for the sites that are actually bound. Here, we present a deep-learning framework that identifies and characterizes the upstream and downstream genetic elements to the binding motif, for their role in enforcing the mentioned selectivity. The proposed framework is based on an interpretable recurrent neural network architecture that enables for the relative analysis of sequence context features. We apply the framework to model twenty-six transcription factors and score the TF-DNA binding at a base-pair resolution. We find significant differences in activations of DNA context features for bound and unbound sequences. In addition to standardized evaluation protocols, we offer outstanding interpretability that enables us to identify and annotate DNA sequence with possible elements that modulate TF-DNA binding. Also, differences in data processing have a huge influence on the overall model performance. Overall, the proposed framework allows for novel insights on the non-coding genetic elements and their role in facilitating a stable TF-DNA interaction.
Collapse
Affiliation(s)
- Yogesh Kalakoti
- Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology (IIT) Delhi, New Delhi - 110016, India.
| | | | - Swaraj Gawande
- Department of Computer Science and Engineering, Indian Institute of Technology (IIT) Delhi - 110016, India.
| | - Durai Sundar
- Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology (IIT) Delhi, New Delhi - 110016, India; Yardi School of Artificial Intelligence, Indian Institute of Technology (IIT) Delhi, New Delhi - 110016, India.
| |
Collapse
|
31
|
Luan Y, Tang Z, He Y, Xie Z. Intra-Domain Residue Coevolution in Transcription Factors Contributes to DNA Binding Specificity. Microbiol Spectr 2023; 11:e0365122. [PMID: 36943132 PMCID: PMC10100741 DOI: 10.1128/spectrum.03651-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 02/22/2023] [Indexed: 03/23/2023] Open
Abstract
Understanding the basis of the DNA-binding specificity of transcription factors (TFs) has been of long-standing interest. Despite extensive efforts to map millions of putative TF binding sequences, identifying the critical determinants for DNA binding specificity remains a major challenge. The coevolution of residues in proteins occurs due to a shared evolutionary history. However, it is unclear how coevolving residues in TFs contribute to DNA binding specificity. Here, we systematically collected publicly available data sets from multiple large-scale high-throughput TF-DNA interaction screening experiments for the major TF families with large numbers of TF members. These families included the Homeobox, HLH, bZIP_1, Ets, HMG_box, ZF-C4, and Zn_clus TFs. We detected TF subclass-determining sites (TSDSs) and showed that the TSDSs were more likely to coevolve with other TSDSs than with non-TSDSs, particularly for the Homeobox, HLH, Ets, bZIP_1, and HMG_box TF families. By in silico modeling, we showed that mutation of the highly coevolving residues could significantly reduce the stability of the TF-DNA complex. The distant residues from the DNA interface also contributed to TF-DNA binding activity. Overall, our study gave evidence that coevolved residues relate to transcriptional regulation and provided insights into the potential application of engineered DNA-binding domains and proteins. IMPORTANCE While unraveling DNA-binding specificity of TFs is the key to understanding the basis and molecular mechanism of gene expression regulation, identifying the critical determinants that contribute to DNA binding specificity remains a major challenge. In this study, we provided evidence showing that coevolving residues in TF domains contributed to DNA binding specificity. We demonstrated that the TSDSs were more likely to coevolve with other TSDSs than with non-TSDSs. Mutation of the coevolving residue pairs (CRPs) could significantly reduce the stability of THE TF-DNA complex, and even the distant residues from the DNA interface contribute to TF-DNA binding activity. Collectively, our study expands our knowledge of the interactions among coevolved residues in TFs, tertiary contacting, and functional importance in refined transcriptional regulation. Understanding the impact of coevolving residues in TFs will help understand the details of transcription of gene regulation and advance the application of engineered DNA-binding domains and protein.
Collapse
Affiliation(s)
- Yizhao Luan
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Zehua Tang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Yao He
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
32
|
Goslin K, Finocchio A, Wellmer F. Floral Homeotic Factors: A Question of Specificity. PLANTS (BASEL, SWITZERLAND) 2023; 12:plants12051128. [PMID: 36903987 PMCID: PMC10004826 DOI: 10.3390/plants12051128] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 02/23/2023] [Accepted: 02/25/2023] [Indexed: 05/27/2023]
Abstract
MADS-domain transcription factors are involved in the control of a multitude of processes in eukaryotes, and in plants, they play particularly important roles during reproductive development. Among the members of this large family of regulatory proteins are the floral organ identity factors, which specify the identities of the different types of floral organs in a combinatorial manner. Much has been learned over the past three decades about the function of these master regulators. For example, it has been shown that they have similar DNA-binding activities and that their genome-wide binding patterns exhibit large overlaps. At the same time, it appears that only a minority of binding events lead to changes in gene expression and that the different floral organ identity factors have distinct sets of target genes. Thus, binding of these transcription factors to the promoters of target genes alone may not be sufficient for their regulation. How these master regulators achieve specificity in a developmental context is currently not well understood. Here, we review what is known about their activities and highlight open questions that need to be addressed to gain more detailed insights into the molecular mechanisms underlying their functions. We discuss evidence for the involvement of cofactors as well as the results from studies on transcription factors in animals that may be instructive for a better understanding of how the floral organ identity factors achieve regulatory specificity.
Collapse
|
33
|
van Mourik H, Chen P, Smaczniak C, Boeren S, Kaufmann K, Bemer M, Angenent GC, Muino JM. Dual specificity and target gene selection by the MADS-domain protein FRUITFULL. NATURE PLANTS 2023; 9:473-485. [PMID: 36797351 DOI: 10.1038/s41477-023-01351-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 01/11/2023] [Indexed: 05/18/2023]
Abstract
How transcription factors attain their target gene specificity and how this specificity may be modulated, acquiring different regulatory functions through the development of plant tissues, is an open question. Here we characterized different regulatory roles of the MADS-domain transcription factor FRUITFULL (FUL) in flower development and mechanisms modulating its activity. We found that the dual role of FUL in regulating floral transition and pistil development is associated with its different in vivo patterns of DNA binding in both tissues. Characterization of FUL protein complexes by liquid chromatography-tandem mass spectrometry and SELEX-seq experiments shows that aspects of tissue-specific target site selection can be predicted by tissue-specific variation in the composition of FUL protein complexes with different DNA binding specificities, without considering the chromatin status of the target region. This suggests a role for dynamic changes in FUL TF complex composition in reshaping the regulatory functions of FUL during flower development.
Collapse
Affiliation(s)
- Hilda van Mourik
- Laboratory of Molecular Biology, Wageningen University & Research, Wageningen, the Netherlands
| | - Peilin Chen
- Plant Cell and Molecular Biology, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Cezary Smaczniak
- Plant Cell and Molecular Biology, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Sjef Boeren
- Laboratory of Biochemistry, Wageningen University & Research, Wageningen, the Netherlands
| | - Kerstin Kaufmann
- Plant Cell and Molecular Biology, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Marian Bemer
- Laboratory of Molecular Biology, Wageningen University & Research, Wageningen, the Netherlands
- Department of Bioscience, Wageningen University & Research, Wageningen, the Netherlands
| | - Gerco C Angenent
- Laboratory of Molecular Biology, Wageningen University & Research, Wageningen, the Netherlands
- Department of Bioscience, Wageningen University & Research, Wageningen, the Netherlands
| | - Jose M Muino
- Plant Cell and Molecular Biology, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany.
| |
Collapse
|
34
|
Physicochemical models of protein-DNA binding with standard and modified base pairs. Proc Natl Acad Sci U S A 2023; 120:e2205796120. [PMID: 36656856 PMCID: PMC9942898 DOI: 10.1073/pnas.2205796120] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
DNA-binding proteins play important roles in various cellular processes, but the mechanisms by which proteins recognize genomic target sites remain incompletely understood. Functional groups at the edges of the base pairs (bp) exposed in the DNA grooves represent physicochemical signatures. As these signatures enable proteins to form specific contacts between protein residues and bp, their study can provide mechanistic insights into protein-DNA binding. Existing experimental methods, such as X-ray crystallography, can reveal such mechanisms based on physicochemical interactions between proteins and their DNA target sites. However, the low throughput of structural biology methods limits mechanistic insights for selection of many genomic sites. High-throughput binding assays enable prediction of potential target sites by determining relative binding affinities of a protein to massive numbers of DNA sequences. Many currently available computational methods are based on the sequence of standard Watson-Crick bp. They assume that the contribution of overall binding affinity is independent for each base pair, or alternatively include dinucleotides or short k-mers. These methods cannot directly expand to physicochemical contacts, and they are not suitable to apply to DNA modifications or non-Watson-Crick bp. These variations include DNA methylation, and synthetic or mismatched bp. The proposed method, DeepRec, can predict relative binding affinities as function of physicochemical signatures and the effect of DNA methylation or other chemical modifications on binding. Sequence-based modeling methods are in comparison a coarse-grain description and cannot achieve such insights. Our chemistry-based modeling framework provides a path towards understanding genome function at a mechanistic level.
Collapse
|
35
|
Galupa R, Alvarez-Canales G, Borst NO, Fuqua T, Gandara L, Misunou N, Richter K, Alves MRP, Karumbi E, Perkins ML, Kocijan T, Rushlow CA, Crocker J. Enhancer architecture and chromatin accessibility constrain phenotypic space during Drosophila development. Dev Cell 2023; 58:51-62.e4. [PMID: 36626871 PMCID: PMC9860173 DOI: 10.1016/j.devcel.2022.12.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 10/18/2022] [Accepted: 12/07/2022] [Indexed: 01/11/2023]
Abstract
Developmental enhancers bind transcription factors and dictate patterns of gene expression during development. Their molecular evolution can underlie phenotypical evolution, but the contributions of the evolutionary pathways involved remain little understood. Here, using mutation libraries in Drosophila melanogaster embryos, we observed that most point mutations in developmental enhancers led to changes in gene expression levels but rarely resulted in novel expression outside of the native pattern. In contrast, random sequences, often acting as developmental enhancers, drove expression across a range of cell types; random sequences including motifs for transcription factors with pioneer activity acted as enhancers even more frequently. Our findings suggest that the phenotypic landscapes of developmental enhancers are constrained by enhancer architecture and chromatin accessibility. We propose that the evolution of existing enhancers is limited in its capacity to generate novel phenotypes, whereas the activity of de novo elements is a primary source of phenotypic novelty.
Collapse
Affiliation(s)
- Rafael Galupa
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany.
| | | | | | - Timothy Fuqua
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Lautaro Gandara
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Natalia Misunou
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Kerstin Richter
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | | | - Esther Karumbi
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | | | - Tin Kocijan
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | | | - Justin Crocker
- European Molecular Biology Laboratory, 69117 Heidelberg, Germany.
| |
Collapse
|
36
|
Vandepoele K, Kaufmann K. Characterization of Gene Regulatory Networks in Plants Using New Methods and Data Types. Methods Mol Biol 2023; 2698:1-11. [PMID: 37682465 DOI: 10.1007/978-1-0716-3354-0_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
A major question in plant biology is to understand how plant growth, development, and environmental responses are controlled and coordinated by the activities of regulatory factors. Gene regulatory network (GRN) analyses require integrated approaches that combine experimental approaches with computational analyses. A wide range of experimental approaches and tools are now available, such as targeted perturbation of gene activities, quantitative and cell-type specific measurements of dynamic gene activities, and systematic analysis of the molecular 'hard-wiring' of the systems. At the computational level, different tools and databases are available to study regulatory sequences, including intuitive visualizations to explore data-driven gene regulatory networks in different plant species. Furthermore, advanced data integration approaches have recently been developed to efficiently leverage complementary regulatory data types and learn context-specific networks.
Collapse
Affiliation(s)
- Klaas Vandepoele
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Ghent, Belgium.
- VIB-UGent Center for Plant Systems Biology, Ghent, Belgium.
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium.
| | - Kerstin Kaufmann
- Institute of Biology, Humboldt-Universitaet zu Berlin, Berlin, Germany
| |
Collapse
|
37
|
Chen P, Smaczniak C, Haffner J, Muino JM, Kaufmann K. Estimating DNA-Binding Specificities of Transcription Factors Using SELEX-Seq. Methods Mol Biol 2023; 2698:147-161. [PMID: 37682474 DOI: 10.1007/978-1-0716-3354-0_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
Here we provide an updated protocol for the Systematic Evolution of Ligands followed by massively parallel sequencing (SELEX-seq) method to study protein-DNA interaction specificities. This in vitro method is used to characterize DNA-binding specificities of transcription factors (TFs). The procedure is based on cycles of immunoprecipitation of protein-DNA complexes, starting with a randomized DNA library of defined fragment length, followed by massively parallel sequencing. The updated protocol includes aspects of experimental design and procedure as well as basic instructions on data analysis.
Collapse
Affiliation(s)
- Peilin Chen
- Institute for Biology, Plant Cell and Molecular Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Cezary Smaczniak
- Institute for Biology, Plant Cell and Molecular Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Johanna Haffner
- Institute for Biology, Plant Cell and Molecular Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Jose M Muino
- Institute for Biology, Plant Cell and Molecular Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Kerstin Kaufmann
- Institute for Biology, Plant Cell and Molecular Biology, Humboldt-Universität zu Berlin, Berlin, Germany.
| |
Collapse
|
38
|
Hutin S, Blanc-Mathieu R, Rieu P, Parcy F, Lai X, Zubieta C. Identification of Plant Transcription Factor DNA-Binding Sites Using seq-DAP-seq. Methods Mol Biol 2023; 2698:119-145. [PMID: 37682473 DOI: 10.1007/978-1-0716-3354-0_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
The identification of genome-wide transcription factor binding sites (TFBS) is a critical step in deciphering gene and transcriptional regulatory networks. However, determining the genome-wide binding of specific TFs or TF complexes remains a technical challenge. DNA affinity purification sequencing (DAP-seq) and modifications such as sequential DAP-seq (seq-DAP-seq) are robust in vitro methods for mapping individual TF or TF complex binding sites in a genome-wide manner. DAP-seq protocols use a genomic DNA (gDNA) library from any target organism with or without amplification, allowing the determination of TF binding on naked or endogenously modified DNA, respectively. As a first step, the gDNA is fragmented to ~200 bp, end-repaired, and sequencing adaptors are added. This gDNA library can be used directly or an amplification step may be performed to remove DNA modifications such as cytosine methylation. DNA libraries are then incubated with an affinity-tagged TF or TF- complex immobilized on magnetic beads. The TF or TF complex of interest is generally produced using recombinant protein expression and purified prior to DNA affinity purification. After incubation of the DNA library with the immobilized TF of interest, multiple wash steps are performed to reduce non-specific DNA binding and the TF-DNA complexes eluted. The eluted DNA is PCR-amplified and sequenced using next-generation sequencing. The resulting sequence reads are mapped to the corresponding reference genome, identifying direct potential bound regions and binding sites of the TF or TF complex of interest. Predictive TFBS models are generated from the bound regions using downstream bioinformatics analysis pipelines. Here, we present a detailed protocol outlining the steps required for seq-DAP-seq of a heterooligomeric TF complex (Fig. 1) and briefly describe the downstream bioinformatics pipeline used to develop a robust TFBS model from sequencing data generated from a DAP-seq experiment.
Collapse
Affiliation(s)
- Stephanie Hutin
- Laboratoire de Physiologie Cellulaire et Végétale, CNRS, CEA, Université Grenoble Alpes, INRAE, IRIG, CEA Grenoble, Grenoble, France
| | - Romain Blanc-Mathieu
- Laboratoire de Physiologie Cellulaire et Végétale, CNRS, CEA, Université Grenoble Alpes, INRAE, IRIG, CEA Grenoble, Grenoble, France
| | - Philippe Rieu
- Laboratoire de Physiologie Cellulaire et Végétale, CNRS, CEA, Université Grenoble Alpes, INRAE, IRIG, CEA Grenoble, Grenoble, France
| | - François Parcy
- Laboratoire de Physiologie Cellulaire et Végétale, CNRS, CEA, Université Grenoble Alpes, INRAE, IRIG, CEA Grenoble, Grenoble, France
| | - Xuelei Lai
- Laboratoire de Physiologie Cellulaire et Végétale, CNRS, CEA, Université Grenoble Alpes, INRAE, IRIG, CEA Grenoble, Grenoble, France
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, China
| | - Chloe Zubieta
- Laboratoire de Physiologie Cellulaire et Végétale, CNRS, CEA, Université Grenoble Alpes, INRAE, IRIG, CEA Grenoble, Grenoble, France.
| |
Collapse
|
39
|
Abstract
Hox genes encode evolutionarily conserved transcription factors that are essential for the proper development of bilaterian organisms. Hox genes are unique because they are spatially and temporally regulated during development in a manner that is dictated by their tightly linked genomic organization. Although their genetic function during embryonic development has been interrogated, less is known about how these transcription factors regulate downstream genes to direct morphogenetic events. Moreover, the continued expression and function of Hox genes at postnatal and adult stages highlights crucial roles for these genes throughout the life of an organism. Here, we provide an overview of Hox genes, highlighting their evolutionary history, their unique genomic organization and how this impacts the regulation of their expression, what is known about their protein structure, and their deployment in development and beyond.
Collapse
Affiliation(s)
- Katharine A. Hubert
- Program in Genetics, University of Wisconsin School of Medicine and Public Health, Madison, WI 53705, USA
- Department of Cell and Regenerative Biology, University of Wisconsin School of Medicine and Public Health, Madison, WI 53705, USA
| | - Deneen M. Wellik
- Department of Cell and Regenerative Biology, University of Wisconsin School of Medicine and Public Health, Madison, WI 53705, USA
| |
Collapse
|
40
|
Mondal A, Mishra SK, Bhattacherjee A. Nucleosome breathing facilitates cooperative binding of pluripotency factors Sox2 and Oct4 to DNA. Biophys J 2022; 121:4526-4542. [PMID: 36321206 PMCID: PMC9748375 DOI: 10.1016/j.bpj.2022.10.039] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 09/08/2022] [Accepted: 10/26/2022] [Indexed: 12/15/2022] Open
Abstract
Critical lineage commitment events are staged by multiple transcription factors (TFs) binding to their cognate motifs, often positioned at nucleosome-enriched regions of chromatin. The underlying mechanism remains elusive due to difficulty in disentangling the heterogeneity in chromatin states. Using a novel coarse-grained model and molecular dynamics simulations, here we probe the association of Sox2 and Oct4 proteins that show clustered binding at the entry-exit region of a nucleosome. The model captures the conformational heterogeneity of nucleosome breathing dynamics that features repeated wrap-unwrap transitions of a DNA segment from one end of the nucleosome. During the dynamics, DNA forms bulges that diffuse stochastically and may regulate the target search dynamics of a protein by nonspecifically interacting with it. The overall search kinetics of the TF pair follows a "dissociation-compensated-association" mechanism, where Oct4 binding is facilitated by the association of Sox2. The cooperativity stems from a change in entropy caused by an alteration in the nucleosome dynamics upon TF binding. The binding pattern is consistent with a live-cell single-particle tracking experiment, suggesting the mechanism observed for clustered binding of a TF pair, which is a hallmark of cis-regulatory elements, has broader implications in understanding gene regulation in a complex chromatin environment.
Collapse
Affiliation(s)
- Anupam Mondal
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Sujeet Kumar Mishra
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Arnab Bhattacherjee
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India.
| |
Collapse
|
41
|
The pioneering function of the hox transcription factors. Semin Cell Dev Biol 2022:S1084-9521(22)00354-8. [PMID: 36517345 DOI: 10.1016/j.semcdb.2022.11.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 11/13/2022] [Accepted: 11/30/2022] [Indexed: 12/14/2022]
Abstract
Ever since the discovery that the Hox family of transcription factors establish morphological diversity in the developing embryo, major efforts have been directed towards understanding Hox-dependent patterning. This has led to important discoveries, notably on the mechanisms underlying the collinear expression of Hox genes and Hox binding specificity. More recently, several studies have provided evidence that Hox factors have the capacity to bind their targets in an inaccessible chromatin context and trigger the switch to an accessible, transcriptional permissive, chromatin state. In this review, we provide an overview of the evidences supporting that Hox factors behave as pioneer factors and discuss the potential mechanisms implicated in Hox pioneer activity as well as the significance of this functional property in Hox-dependent patterning.
Collapse
|
42
|
Matsuoka Y, Monteiro A. Ultrabithorax modifies a regulatory network of genes essential for butterfly eyespot development in a wing sector-specific manner. Development 2022; 149:285574. [PMID: 36341494 DOI: 10.1242/dev.200781] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Accepted: 10/31/2022] [Indexed: 11/09/2022]
Abstract
Nymphalid butterfly species often have a different number of eyespots in forewings and hindwings, but how the hindwing identity gene Ultrabithorax (Ubx) drives this asymmetry is not fully understood. We examined a three-gene regulatory network for eyespot development in the hindwings of Bicyclus anynana butterflies and compared it with the same network previously described for forewings. We also examined how Ubx interacts with each of these three eyespot-essential genes. We found similar genetic interactions between the three genes in fore- and hindwings, but we discovered three regulatory differences: Antennapedia (Antp) merely enhances spalt (sal) expression in the eyespot foci in hindwings, but is not essential for sal activation, as in forewings; Ubx upregulates Antp in all hindwing eyespot foci but represses Antp outside these wing regions; and Ubx regulates sal in a wing sector-specific manner, i.e. it activates sal expression only in the sectors that have hindwing-specific eyespots. We propose a model for how the regulatory connections between these four genes evolved to produce wing- and sector-specific variation in eyespot number.
Collapse
Affiliation(s)
- Yuji Matsuoka
- Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Block S2, Level 1, 117543Singapore
| | - Antónia Monteiro
- Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Block S2, Level 1, 117543Singapore
| |
Collapse
|
43
|
Merabet S, Carnesecchi J. Hox dosage and morphological diversification during development and evolution. Semin Cell Dev Biol 2022:S1084-9521(22)00360-3. [PMID: 36481343 DOI: 10.1016/j.semcdb.2022.11.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 10/15/2022] [Accepted: 11/30/2022] [Indexed: 12/12/2022]
Abstract
Hox genes encode for evolutionary conserved transcription factors that have long fascinated biologists since the observation of the first homeotic transformations in flies. Hox genes are developmental architects that instruct the formation of various and precise morphologies along the body axes in cnidarian and bilaterian species. In contrast to these highly specific developmental functions, Hox genes encode for proteins that display poorly selective DNA-binding properties in vitro. This "Hox paradox" has been partially solved with the discovery of the TALE-class cofactors, which interact with all Hox members and form versatile Hox/TALE protein complexes on DNA. Here, we describe the role of the Hox dosage as an additional molecular strategy contributing to further resolve the Hox paradox. We present several cases where the Hox dosage is involved in the formation of different morphologies in invertebrates and vertebrates, with a particular emphasis on flight appendages in insects. We also discuss how the Hox dosage could be interpreted in different types of target enhancers within the nuclear environment in vivo. Altogether our survey underlines the Hox dosage as a key mechanism for shaping Hox molecular function during development and evolution.
Collapse
|
44
|
Pinto PB, Domsch K, Lohmann I. Hox function and specificity – A tissue centric view. Semin Cell Dev Biol 2022:S1084-9521(22)00353-6. [PMID: 36517344 DOI: 10.1016/j.semcdb.2022.11.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 11/11/2022] [Accepted: 11/30/2022] [Indexed: 12/15/2022]
Abstract
Since their discovery, the Hox genes, with their incredible power to reprogram the identity of complete body regions, a phenomenon called homeosis, have captured the fascination of many biologists. Recent research has provided new insights into the function of Hox proteins in different germ layers and the mechanisms they employ to control tissue morphogenesis. We focus in this review on the ectoderm and mesoderm to highlight new findings and discuss them with regards to established concepts of Hox target gene regulation. Furthermore, we highlight the molecular mechanisms involved the transcriptional repression of specific groups of Hox target genes, and summarize the role of Hox mediated gene silencing in tissue development. Finally, we reflect on recent findings identifying a large number of tissue-specific Hox interactor partners, which open up new avenues and directions towards a better understanding of Hox function and specificity in different tissues.
Collapse
|
45
|
Ghoshdastidar D, Bansal M. Flexibility of flanking DNA is a key determinant of transcription factor affinity for the core motif. Biophys J 2022; 121:3987-4000. [PMID: 35978548 PMCID: PMC9674967 DOI: 10.1016/j.bpj.2022.08.015] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 07/28/2022] [Accepted: 08/15/2022] [Indexed: 11/02/2022] Open
Abstract
Selective gene regulation is mediated by recognition of specific DNA sequences by transcription factors (TFs). The extremely challenging task of searching out specific cognate DNA binding sites among several million putative sites within the eukaryotic genome is achieved by complex molecular recognition mechanisms. Elements of this recognition code include the core binding sequence, the flanking sequence context, and the shape and conformational flexibility of the composite binding site. To unravel the extent to which DNA flexibility modulates TF binding, in this study, we employed experimentally guided molecular dynamics simulations of ternary complex of closely related Hox heterodimers Exd-Ubx and Exd-Scr with DNA. Results demonstrate that flexibility signatures embedded in the flanking sequences impact TF binding at the cognate binding site. A DNA sequence has intrinsic shape and flexibility features. While shape features are localized, our analyses reveal that flexibility features of the flanking sequences percolate several basepairs and allosterically modulate TF binding at the core. We also show that lack of flexibility in the motif context can render the cognate site resistant to protein-induced shape changes and subsequently lower TF binding affinity. Overall, this study suggests that flexibility-guided DNA shape, and not merely the static shape, is a key unexplored component of the complex DNA-TF recognition code.
Collapse
Affiliation(s)
| | - Manju Bansal
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, Karnataka, India.
| |
Collapse
|
46
|
García-Ferrés M, Sánchez-Higueras C, Espinosa-Vázquez JM, C-G Hombría J. Specification of the endocrine primordia controlling insect moulting and metamorphosis by the JAK/STAT signalling pathway. PLoS Genet 2022; 18:e1010427. [PMID: 36191039 PMCID: PMC9560620 DOI: 10.1371/journal.pgen.1010427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 10/13/2022] [Accepted: 09/12/2022] [Indexed: 11/18/2022] Open
Abstract
The corpora allata and the prothoracic glands control moulting and metamorphosis in insects. These endocrine glands are specified in the maxillary and labial segments at positions homologous to those forming the trachea in more posterior segments. Glands and trachea can be homeotically transformed into each other suggesting that all three evolved from a metamerically repeated organ that diverged to form glands in the head and respiratory organs in the trunk. While much is known about tracheal specification, there is limited information about corpora allata and prothorathic gland specification. Here we show that the expression of a key regulator of early gland development, the snail gene, is controlled by the Dfd and Scr Hox genes and by the Hedgehog and Wnt signalling pathways that induce localised transcription of upd, the ligand of the JAK/STAT signalling pathway, which lies at the heart of gland specification. Our results show that the same upstream regulators are required for the early gland and tracheal primordia specification, reinforcing the hypothesis that they originated from a segmentally repeated organ present in an ancient arthropod.
Collapse
Affiliation(s)
- Mar García-Ferrés
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-JA-UPO, Seville, Spain
| | | | | | - James C-G Hombría
- Centro Andaluz de Biología del Desarrollo (CABD), CSIC-JA-UPO, Seville, Spain,* E-mail:
| |
Collapse
|
47
|
Rube HT, Rastogi C, Feng S, Kribelbauer JF, Li A, Becerra B, Melo LAN, Do BV, Li X, Adam HH, Shah NH, Mann RS, Bussemaker HJ. Prediction of protein-ligand binding affinity from sequencing data with interpretable machine learning. Nat Biotechnol 2022; 40:1520-1527. [PMID: 35606422 PMCID: PMC9546773 DOI: 10.1038/s41587-022-01307-0] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Accepted: 04/04/2022] [Indexed: 01/02/2023]
Abstract
Protein-ligand interactions are increasingly profiled at high throughput using affinity selection and massively parallel sequencing. However, these assays do not provide the biophysical parameters that most rigorously quantify molecular interactions. Here we describe a flexible machine learning method, called ProBound, that accurately defines sequence recognition in terms of equilibrium binding constants or kinetic rates. This is achieved using a multi-layered maximum-likelihood framework that models both the molecular interactions and the data generation process. We show that ProBound quantifies transcription factor (TF) behavior with models that predict binding affinity over a range exceeding that of previous resources; captures the impact of DNA modifications and conformational flexibility of multi-TF complexes; and infers specificity directly from in vivo data such as ChIP-seq without peak calling. When coupled with an assay called KD-seq, it determines the absolute affinity of protein-ligand interactions. We also apply ProBound to profile the kinetics of kinase-substrate interactions. ProBound opens new avenues for decoding biological networks and rationally engineering protein-ligand interactions.
Collapse
Affiliation(s)
- H Tomas Rube
- Department of Bioengineering, University of California, Merced, Merced, CA, USA
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Chaitanya Rastogi
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Siqian Feng
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
| | | | - Allyson Li
- Department of Chemistry, Columbia University, New York, NY, USA
| | - Basheer Becerra
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Lucas A N Melo
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Bach Viet Do
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Xiaoting Li
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Hammaad H Adam
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Neel H Shah
- Department of Chemistry, Columbia University, New York, NY, USA
| | - Richard S Mann
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Harmen J Bussemaker
- Department of Biological Sciences, Columbia University, New York, NY, USA.
- Department of Systems Biology, Columbia University, New York, NY, USA.
| |
Collapse
|
48
|
Cooper BH, Chiu TP, Rohs R. Top-Down Crawl: a method for the ultra-rapid and motif-free alignment of sequences with associated binding metrics. Bioinformatics 2022; 38:5121-5123. [PMID: 36179084 PMCID: PMC9665867 DOI: 10.1093/bioinformatics/btac653] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Revised: 09/21/2022] [Accepted: 09/29/2022] [Indexed: 12/24/2022] Open
Abstract
SUMMARY Several high-throughput protein-DNA binding methods currently available produce highly reproducible measurements of binding affinity at the level of the k-mer. However, understanding where a k-mer is positioned along a binding site sequence depends on alignment. Here, we present Top-Down Crawl (TDC), an ultra-rapid tool designed for the alignment of k-mer level data in a rank-dependent and position weight matrix (PWM)-independent manner. As the framework only depends on the rank of the input, the method can accept input from many types of experiments (protein binding microarray, SELEX-seq, SMiLE-seq, etc.) without the need for specialized parameterization. Measuring the performance of the alignment using multiple linear regression with 5-fold cross-validation, we find TDC to perform as well as or better than computationally expensive PWM-based methods. AVAILABILITY AND IMPLEMENTATION TDC can be run online at https://topdowncrawl.usc.edu or locally as a python package available through pip at https://pypi.org/project/TopDownCrawl. CONTACT SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Brendon H Cooper
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Tsu-Pei Chiu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Remo Rohs
- To whom correspondence should be addressed.
| |
Collapse
|
49
|
Pantier R, Chhatbar K, Alston G, Lee HY, Bird A. High-throughput sequencing SELEX for the determination of DNA-binding protein specificities in vitro. STAR Protoc 2022; 3:101490. [PMID: 35776646 PMCID: PMC9243297 DOI: 10.1016/j.xpro.2022.101490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 05/11/2022] [Accepted: 06/02/2022] [Indexed: 11/22/2022] Open
Abstract
High-throughput sequencing SELEX (HT-SELEX) is a powerful technique for unbiased determination of preferred target motifs of DNA-binding proteins in vitro. The procedure depends upon selection of DNA binding sites from a random library of oligonucleotides by purifying protein-DNA complexes and amplifying bound DNA using the polymerase chain reaction. Here, we describe an optimized step-by-step protocol for HT-SELEX compatible with Illumina sequencing. We also introduce a bioinformatic pipeline (eme_selex) facilitating the detection of promiscuous DNA binding by analyzing the enrichment of all possible k-mers. For complete details on the use and execution of this protocol, please refer to Pantier et al. (2021).
Collapse
Affiliation(s)
- Raphaël Pantier
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King's Buildings, Edinburgh EH9 3BF, UK.
| | - Kashyap Chhatbar
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King's Buildings, Edinburgh EH9 3BF, UK; Informatics Forum, School of Informatics, University of Edinburgh, 10 Crichton Street, Edinburgh EH8 9AB, UK
| | - Grace Alston
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King's Buildings, Edinburgh EH9 3BF, UK
| | - Heng Yang Lee
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King's Buildings, Edinburgh EH9 3BF, UK
| | - Adrian Bird
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King's Buildings, Edinburgh EH9 3BF, UK.
| |
Collapse
|
50
|
Steens J, Klein D. HOX genes in stem cells: Maintaining cellular identity and regulation of differentiation. Front Cell Dev Biol 2022; 10:1002909. [PMID: 36176275 PMCID: PMC9514042 DOI: 10.3389/fcell.2022.1002909] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 08/25/2022] [Indexed: 11/13/2022] Open
Abstract
Stem cells display a unique cell type within the body that has the capacity to self-renew and differentiate into specialized cell types. Compared to pluripotent stem cells, adult stem cells (ASC) such as mesenchymal stem cells (MSCs) and hematopoietic stem cells (HSCs) exhibit restricted differentiation capabilities that are limited to cell types typically found in the tissue of origin, which implicates that there must be a certain code or priming determined by the tissue of origin. HOX genes, a subset of homeobox genes encoding transcription factors that are generally repressed in undifferentiated pluripotent stem cells, emerged here as master regulators of cell identity and cell fate during embryogenesis, and in maintaining this positional identity throughout life as well as specifying various regional properties of respective tissues. Concurrently, intricate molecular circuits regulated by diverse stem cell-typical signaling pathways, balance stem cell maintenance, proliferation and differentiation. However, it still needs to be unraveled how stem cell-related signaling pathways establish and regulate ASC-specific HOX expression pattern with different temporal-spatial topography, known as the HOX code. This comprehensive review therefore summarizes the current knowledge of specific ASC-related HOX expression patterns and how these were integrated into stem cell-related signaling pathways. Understanding the mechanism of HOX gene regulation in stem cells may provide new ways to manipulate stem cell fate and function leading to improved and new approaches in the field of regenerative medicine.
Collapse
|