1
Xia Y, Du X, Liu B, Guo S, Huo YX. Species-specific design of artificial promoters by transfer-learning based generative deep-learning model. Nucleic Acids Res 2024; 52:6145-6157. [PMID: 38783063 PMCID: PMC11194083 DOI: 10.1093/nar/gkae429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2024] [Revised: 04/04/2024] [Accepted: 05/08/2024] [Indexed: 05/25/2024]
Abstract
Native prokaryotic promoters share common sequence patterns but are species dependent. For understudied species with limited data, it is challenging to predict the strength of existing promoters and to generate novel promoters. Here, we developed PromoGen, a collection of nucleotide language models that generate species-specific functional promoters across dozens of species in a data- and parameter-efficient way. Twenty-seven species-specific models in this collection were fine-tuned from a pretrained model that was trained on multi-species promoters. When systematically compared with native promoters, the Escherichia coli- and Bacillus subtilis-specific artificial PromoGen-generated promoters (PGPs) were shown to retain all distribution patterns of native promoters. A regression model was developed to score promoters generated either by PromoGen or by another competitive neural network, and the overall score of the PGPs was higher. Encouraged by the in silico analysis, we experimentally characterized twenty-two B. subtilis PGPs; all were active, and four reached the strong-promoter level. Furthermore, we developed a user-friendly website to generate species-specific promoters for 27 different species with PromoGen. This work presents an efficient deep-learning strategy for de novo species-specific promoter generation even with limited datasets, providing valuable promoter toolboxes, especially for the metabolic engineering of understudied microorganisms.
Affiliation(s)
- Yan Xia
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing 100081, China
- Xiaowen Du
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing 100081, China
- Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
- Shuyuan Guo
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing 100081, China
- Yi-Xin Huo
- Key Laboratory of Molecular Medicine and Biotherapy, School of Life Science, Beijing Institute of Technology, Beijing 100081, China
- Tangshan Research Institute, Beijing Institute of Technology, Hebei 063611, China
2
Romani F, Sauret-Güeto S, Rebmann M, Annese D, Bonter I, Tomaselli M, Dierschke T, Delmans M, Frangedakis E, Silvestri L, Rever J, Bowman JL, Romani I, Haseloff J. The landscape of transcription factor promoter activity during vegetative development in Marchantia. THE PLANT CELL 2024; 36:2140-2159. [PMID: 38391349 PMCID: PMC11132968 DOI: 10.1093/plcell/koae053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/13/2023] [Revised: 12/08/2023] [Accepted: 12/22/2023] [Indexed: 02/24/2024]
Abstract
Transcription factors (TFs) are essential for the regulation of gene expression and cell fate determination. Characterizing the transcriptional activity of TF genes in space and time is a critical step toward understanding complex biological systems. The vegetative gametophyte meristems of bryophytes share some characteristics with the shoot apical meristems of flowering plants. However, the identity and expression profiles of TFs associated with gametophyte organization are largely unknown. With only ∼450 putative TF genes, Marchantia (Marchantia polymorpha) is an outstanding model system for plant systems biology. We have generated a near-complete collection of promoter elements derived from Marchantia TF genes. We experimentally tested reporter fusions for all the TF promoters in the collection and systematically analyzed expression patterns in Marchantia gemmae. This allowed us to build a map of expression domains in early vegetative development and identify a set of TF-derived promoters that are active in the stem-cell zone. The cell markers provide additional tools and insight into the dynamic regulation of the gametophytic meristem and its evolution. In addition, we provide an online database of expression patterns for all promoters in the collection. We expect that these promoter elements will be useful for cell-type-specific expression, synthetic biology applications, and functional genomics.
Affiliation(s)
- Facundo Romani
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- Marius Rebmann
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- Davide Annese
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- Ignacy Bonter
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- Marta Tomaselli
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- Tom Dierschke
- School of Biological Sciences, Monash University, Clayton, Melbourne, VIC 3800, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, Monash University, Clayton, Melbourne, VIC 3800, Australia
- Mihails Delmans
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- Linda Silvestri
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- Jenna Rever
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
- John L Bowman
- School of Biological Sciences, Monash University, Clayton, Melbourne, VIC 3800, Australia
- ARC Centre of Excellence for Plant Success in Nature and Agriculture, Monash University, Clayton, Melbourne, VIC 3800, Australia
- Ignacio Romani
- Departamento de Ciencias Sociales, Universidad Nacional de Quilmes, Bernal, Buenos Aires 1876, Argentina
- Jim Haseloff
- Department of Plant Sciences, University of Cambridge, Cambridge CB3 EA, UK
3
Yasmeen E, Wang J, Riaz M, Zhang L, Zuo K. Designing artificial synthetic promoters for accurate, smart, and versatile gene expression in plants. PLANT COMMUNICATIONS 2023:100558. [PMID: 36760129 PMCID: PMC10363483 DOI: 10.1016/j.xplc.2023.100558] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Received: 09/20/2022] [Revised: 01/30/2023] [Accepted: 02/06/2023] [Indexed: 06/18/2023]
Abstract
With the development of high-throughput biology techniques and artificial intelligence, it has become increasingly feasible to design and construct artificial biological parts, modules, circuits, and even whole systems. To overcome the limitations of native promoters in controlling gene expression, artificial promoter design aims to synthesize short, inducible, and conditionally controlled promoters to coordinate the expression of multiple genes in diverse plant metabolic and signaling pathways. Synthetic promoters are versatile and can drive gene expression accurately with smart responses; they show potential for enhancing desirable traits in crops, thereby improving crop yield, nutritional quality, and food security. This review first illustrates the importance of synthetic promoters, then introduces promoter architecture and thoroughly summarizes advances in synthetic promoter construction. Restrictions to the development of synthetic promoters and future applications of such promoters in synthetic plant biology and crop improvement are also discussed.
Affiliation(s)
- Erum Yasmeen
- Single Cell Research Center, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Jin Wang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, China
- Muhammad Riaz
- Single Cell Research Center, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Lida Zhang
- Single Cell Research Center, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
- Kaijing Zuo
- Single Cell Research Center, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
4
Khan A, Nasim N, Pudhuvai B, Koul B, Upadhyay SK, Sethi L, Dey N. Plant Synthetic Promoters: Advancement and Prospective. AGRICULTURE 2023; 13:298. [DOI: 10.3390/agriculture13020298] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Indexed: 10/09/2024]
Abstract
Native/endogenous promoters have several fundamental limitations in terms of their size, cis-element distribution/patterning, and mode of induction, which are ultimately reflected in their insufficient transcriptional activity. Several customized synthetic promoters were designed and tested in plants during the past decade to circumvent such constraints. Such synthetic promoters have a built-in capacity to drive the expression of foreign genes at their maximum amplitude in plant orthologous systems. This review discusses the basic structure and function of the promoter, with emphasis on the role of cis-elements in regulating gene expression, and highlights the necessity of synthetic promoters in plant biology. It also provides explicit information on the two major approaches for developing plant-based synthetic promoters: the conventional approach (utilizing basic knowledge of promoter structure and cis-trans interaction) and advances in gene editing technology. The success of plant genetic manipulation relies on promoter efficiency and the expression level of the transgene. Therefore, advancements in the field of synthetic promoters have enormous potential in genetic engineering-mediated crop improvement.
Affiliation(s)
- Ahamed Khan
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 370 05 České Budějovice, Czech Republic
- Noohi Nasim
- Division of Microbial and Plant Biotechnology, Institute of Life Sciences, Department of Biotechnology, Government of India, Bhubaneswar 751023, Odisha, India
- Baveesh Pudhuvai
- Department of Genetics and Biotechnology, Faculty of Agriculture and Technology, University of South Bohemia in České Budějovice, 370 05 České Budějovice, Czech Republic
- Bhupendra Koul
- Department of Biotechnology, Lovely Professional University, Phagwara 144411, Punjab, India
- Lini Sethi
- Division of Microbial and Plant Biotechnology, Institute of Life Sciences, Department of Biotechnology, Government of India, Bhubaneswar 751023, Odisha, India
- Nrisingha Dey
- Division of Microbial and Plant Biotechnology, Institute of Life Sciences, Department of Biotechnology, Government of India, Bhubaneswar 751023, Odisha, India
5
Diego-Martin B, Pérez-Alemany J, Candela-Ferre J, Corbalán-Acedo A, Pereyra J, Alabadí D, Jami-Alahmadi Y, Wohlschlegel J, Gallego-Bartolomé J. The TRIPLE PHD FINGERS proteins are required for SWI/SNF complex-mediated +1 nucleosome positioning and transcription start site determination in Arabidopsis. Nucleic Acids Res 2022; 50:10399-10417. [PMID: 36189880 PMCID: PMC9561266 DOI: 10.1093/nar/gkac826] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Received: 06/05/2022] [Revised: 09/08/2022] [Accepted: 09/16/2022] [Indexed: 11/14/2022]
Abstract
Eukaryotes have evolved multiple ATP-dependent chromatin remodelers to shape the nucleosome landscape. We recently uncovered an evolutionarily conserved SWItch/Sucrose Non-Fermentable (SWI/SNF) chromatin remodeler complex in plants reminiscent of the mammalian BAF subclass, which specifically incorporates the MINUSCULE (MINU) catalytic subunits and the TRIPLE PHD FINGERS (TPF) signature subunits. Here we report experimental evidence that establishes the functional relevance of TPF proteins for the complex activity. Our results show that depletion of TPF triggers similar pleiotropic phenotypes and molecular defects to those found in minu mutants. Moreover, we report the genomic location of MINU2 and TPF proteins as representative members of this SWI/SNF complex and their impact on nucleosome positioning and transcription. These analyses unravel the binding of the complex to thousands of genes where it modulates the position of the +1 nucleosome. These targets tend to produce 5′-shifted transcripts in the tpf and minu mutants pointing to the participation of the complex in alternative transcription start site usage. Interestingly, there is a remarkable correlation between +1 nucleosome shift and 5′ transcript length change suggesting their functional connection. In summary, this study unravels the function of a plant SWI/SNF complex involved in +1 nucleosome positioning and transcription start site determination.
Affiliation(s)
- Borja Diego-Martin
- Instituto de Biología Molecular y Celular de Plantas (IBMCP), CSIC-Universitat Politècnica de València, Valencia, 46022, Spain
- Jaime Pérez-Alemany
- Instituto de Biología Molecular y Celular de Plantas (IBMCP), CSIC-Universitat Politècnica de València, Valencia, 46022, Spain
- Joan Candela-Ferre
- Instituto de Biología Molecular y Celular de Plantas (IBMCP), CSIC-Universitat Politècnica de València, Valencia, 46022, Spain
- Antonio Corbalán-Acedo
- Instituto de Biología Molecular y Celular de Plantas (IBMCP), CSIC-Universitat Politècnica de València, Valencia, 46022, Spain
- Juan Pereyra
- Instituto de Biología Molecular y Celular de Plantas (IBMCP), CSIC-Universitat Politècnica de València, Valencia, 46022, Spain
- David Alabadí
- Instituto de Biología Molecular y Celular de Plantas (IBMCP), CSIC-Universitat Politècnica de València, Valencia, 46022, Spain
- Yasaman Jami-Alahmadi
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- James Wohlschlegel
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Javier Gallego-Bartolomé
- Instituto de Biología Molecular y Celular de Plantas (IBMCP), CSIC-Universitat Politècnica de València, Valencia, 46022, Spain