1
|
Watson IJ, Maranas C, Nemhauser JL, Leydon AR. A Hot-Swappable Genetic Switch: Building an inducible and trackable functional assay for the essential gene MEDIATOR 21. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.16.628800. [PMID: 39763940 PMCID: PMC11702731 DOI: 10.1101/2024.12.16.628800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/19/2025]
Abstract
Essential genes, estimated at approximately 20% of the Arabidopsis genome, are broadly expressed and required for reproductive success. They are difficult to study, as interfering with their function leads to premature death. Transcription is one of the essential functions of life, and the multi-protein Mediator complex coordinates the regulation of gene expression at nearly every eukaryotic promoter. In this study, we focused on a core Mediator component called MEDIATOR21 (MED21), which is required for activation of transcription. Our previous work has also shown a role for MED21 in repression of gene expression through its interaction with a corepressor protein. Here, we sought to differentiate the role MED21 plays in activation versus repression using the model plant Arabidopsis. As mutations in MED21 lead to embryo lethal phenotypes, we constructed a set of synthetic switches using PhiC31 serine integrases to create an "on-to-off" inducible loss of function MED21 in a non-essential tissue. Our technology, which we call Integrase Erasers, made it possible for med21 mutant plants to survive into adulthood by ablating protein expression selectively in lateral root primordia, allowing quantification and characterization of med21 mutant phenotypes in a post-embryonic context. In addition, we engineered chemical induction of the Integrase Eraser to ablate MED21 expression in whole seedlings at a user-specified timepoint. Finally, we extended this technology to build a hot swappable Integrase Isoform Switch where expression of the integrase toggled cells from expressing wildtype MED21 to expressing MED21 sequence variants. Our analysis of the entire set of new integrase-based tools demonstrates that this is a highly efficient and robust approach to the study of essential genes.
Collapse
Affiliation(s)
- Isabella J Watson
- Department of Biology, University of Washington, Seattle, WA 98195-1800 USA
| | - Cassandra Maranas
- Department of Biology, University of Washington, Seattle, WA 98195-1800 USA
| | | | - Alexander R Leydon
- Department of Biology, University of Washington, Seattle, WA 98195-1800 USA
| |
Collapse
|
2
|
Shi T, Gao Z, Chen J, Van de Peer Y. Dosage sensitivity shapes balanced expression and gene longevity of homoeologs after whole-genome duplications in angiosperms. THE PLANT CELL 2024; 36:4323-4337. [PMID: 39121058 PMCID: PMC7616505 DOI: 10.1093/plcell/koae227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 07/23/2024] [Accepted: 08/01/2024] [Indexed: 08/11/2024]
Abstract
Following whole-genome duplication (WGD), duplicate gene pairs (homoeologs) can evolve varying degrees of expression divergence. However, the determinants influencing these relative expression level differences (RFPKM) between homoeologs remain elusive. In this study, we analyzed the RFPKM between homoeologs in 3 angiosperms, Nymphaea colorata, Nelumbo nucifera, and Acorus tatarinowii, all having undergone a single WGD since the origin of angiosperms. Our results show significant positive correlations in RFPKM of homoeologs among tissues within the same species, and among orthologs across these 3 species, indicating convergent expression balance/bias between homoeologous gene copies following independent WGDs. We linked RFPKM between homoeologs to gene attributes associated with dosage-balance constraints, such as protein-protein interactions, lethal-phenotype scores in Arabidopsis (Arabidopsis thaliana) orthologs, domain numbers, and expression breadth. Notably, homoeologs with lower RFPKM often had more interactions and higher lethal-phenotype scores, indicating selective pressures favoring balanced expression. Also, homoeologs with lower RFPKM were more likely to be retained after WGDs in angiosperms. Within Nelumbo, greater RFPKM between homoeologs correlated with increased cis- and trans-regulatory differentiation between species, highlighting the ongoing escalation of gene expression divergence. We further found that expression degeneration in 1 copy of homoeologs is inclined toward nonfunctionalization. Our research highlights the importance of balanced expression, shaped by dosage-balance constraints, in the evolutionary retention of homoeologs in plants.
Collapse
Affiliation(s)
- Tao Shi
- Aquatic Plant Research Center, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
| | - Zhiyan Gao
- Aquatic Plant Research Center, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
| | - Jinming Chen
- Aquatic Plant Research Center, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent 9052, Belgium
- Centre for Plant Systems Biology, VIB, Ghent 9052, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria 0028, South Africa
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| |
Collapse
|
3
|
Wang L, Cui J, Zhang N, Wang X, Su J, Vallés MP, Wu S, Yao W, Chen X, Chen D. OsIPK1 frameshift mutations disturb phosphorus homeostasis and impair starch synthesis during grain filling in rice. PLANT MOLECULAR BIOLOGY 2024; 114:91. [PMID: 39172289 DOI: 10.1007/s11103-024-01488-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Accepted: 07/10/2024] [Indexed: 08/23/2024]
Abstract
Inositol 1,3,4,5,6-pentakisphosphate 2-kinase (IPK1) catalyzes the final step in phytic acid (InsP6) synthesis. In this study, the effects of OsIPK1 mutations on InsP6 synthesis, grain filling and their underlying mechanisms were investigated. Seven gRNAs were designed to disrupt the OsIPK1 gene via CRISPR/CAS9 system. Only 4 of them generated 29 individual insertion or deletion T0 plants, in which nine biallelic or heterozygous genotypes were identified. Segregation analysis revealed that OsIPK1 frameshift mutants are homozygous lethality. The biallelic and heterozygous frameshift mutants exhibited significant reduction in yield-related traits, particularly in the seed-setting rate and yield per plant. Despite a notable decline in pollen viability, the male and female gametes had comparable transmission rates to their progenies in the mutants. A significant number of the filling-aborted (FA) grains was observed in mature grains of these heterozygous frameshift mutants. These grains exhibited a nearly complete blockage of InsP6 synthesis, resulting in a pronounced increase in Pi content. In contrast, a slight decline in InsP6 content was observed in the plump grains. During the filling stage, owing to the excessive accumulation of Pi, starch synthesis was significantly impaired, and the endosperm development-specific gene expression was nearly abolished. Consistently, the activity of whereas AGPase, a key enzyme in starch synthesis, was significantly decreased and Pi transporter gene expression was upregulated in the FA grains. Taken together, these results demonstrate that OsIPK1 frameshift mutations result in excessive Pi accumulation, decreased starch synthesis, and ultimately leading to lower yields in rice.
Collapse
Affiliation(s)
- Lina Wang
- Department of Genetics and Cell Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Jing Cui
- Department of Genetics and Cell Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Ning Zhang
- Department of Genetics and Cell Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Xueqin Wang
- Department of Genetics and Cell Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Jingping Su
- Tianjin Key Laboratory of Crop Genetics and Breeding, Crop Research Institute, Tianjin Academy of Agricultural Sciences, Tianjin, 300384, China
| | - María Pilar Vallés
- Department of Genetics and Plant Breeding, Aula Dei Experimental Station, Spanish National Research Council (EEAD-CSIC), Zaragoza, 50059, Spain
| | - Shian Wu
- Department of Genetics and Cell Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Wei Yao
- Department of Genetics and Cell Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China
| | - Xiwen Chen
- Department of Biochemistry and Molecular Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China.
| | - Defu Chen
- Department of Genetics and Cell Biology, College of Life Sciences, Nankai University, Tianjin, 300071, China.
- Southwest United Graduate School, Kunming, 650092, China.
| |
Collapse
|
4
|
Quiroz D, Oya S, Lopez-Mateos D, Zhao K, Pierce A, Ortega L, Ali A, Carbonell-Bejerano P, Yarov-Yarovoy V, Suzuki S, Hayashi G, Osakabe A, Monroe G. H3K4me1 recruits DNA repair proteins in plants. THE PLANT CELL 2024; 36:2410-2426. [PMID: 38531669 PMCID: PMC11132887 DOI: 10.1093/plcell/koae089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 02/12/2024] [Accepted: 02/13/2024] [Indexed: 03/28/2024]
Abstract
DNA repair proteins can be recruited by their histone reader domains to specific epigenomic features, with consequences on intragenomic mutation rate variation. Here, we investigated H3K4me1-associated hypomutation in plants. We first examined 2 proteins which, in plants, contain Tudor histone reader domains: PRECOCIOUS DISSOCIATION OF SISTERS 5 (PDS5C), involved in homology-directed repair, and MUTS HOMOLOG 6 (MSH6), a mismatch repair protein. The MSH6 Tudor domain of Arabidopsis (Arabidopsis thaliana) binds to H3K4me1 as previously demonstrated for PDS5C, which localizes to H3K4me1-rich gene bodies and essential genes. Mutations revealed by ultradeep sequencing of wild-type and msh6 knockout lines in Arabidopsis show that functional MSH6 is critical for the reduced rate of single-base substitution (SBS) mutations in gene bodies and H3K4me1-rich regions. We explored the breadth of these mechanisms among plants by examining a large rice (Oryza sativa) mutation data set. H3K4me1-associated hypomutation is conserved in rice as are the H3K4me1-binding residues of MSH6 and PDS5C Tudor domains. Recruitment of DNA repair proteins by H3K4me1 in plants reveals convergent, but distinct, epigenome-recruited DNA repair mechanisms from those well described in humans. The emergent model of H3K4me1-recruited repair in plants is consistent with evolutionary theory regarding mutation modifier systems and offers mechanistic insight into intragenomic mutation rate variation in plants.
Collapse
Affiliation(s)
- Daniela Quiroz
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
- Integrative Genetics and Genomics, University of California Davis, Davis, CA 95616, USA
| | - Satoyo Oya
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
- Laboratory of Genetics, Department of Biological Sciences, The University of Tokyo, Tokyo 113-0033, Japan
| | - Diego Lopez-Mateos
- Department of Physiology and Membrane Biology, University of California Davis, Davis, CA 95616, USA
- Biophysics Graduate Group, University of California Davis, Davis, CA 95616, USA
| | - Kehan Zhao
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
- Plant Biology Graduate Group, University of California Davis, Davis, CA 95616, USA
| | - Alice Pierce
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
- Plant Biology Graduate Group, University of California Davis, Davis, CA 95616, USA
| | - Lissandro Ortega
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
| | - Alissza Ali
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
| | | | - Vladimir Yarov-Yarovoy
- Department of Physiology and Membrane Biology, University of California Davis, Davis, CA 95616, USA
- Biophysics Graduate Group, University of California Davis, Davis, CA 95616, USA
| | - Sae Suzuki
- Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Nagoya 464-0814, Japan
| | - Gosuke Hayashi
- Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Nagoya 464-0814, Japan
| | - Akihisa Osakabe
- Laboratory of Genetics, Department of Biological Sciences, The University of Tokyo, Tokyo 113-0033, Japan
- PRESTO, Japan Science and Technology Agency, Kawaguchi 332-0012, Japan
| | - Grey Monroe
- Department of Plant Sciences, University of California Davis, Davis, CA 95616, USA
- Integrative Genetics and Genomics, University of California Davis, Davis, CA 95616, USA
- Plant Biology Graduate Group, University of California Davis, Davis, CA 95616, USA
| |
Collapse
|
5
|
Bai W, Li C, Li W, Wang H, Han X, Wang P, Wang L. Machine learning assists prediction of genes responsible for plant specialized metabolite biosynthesis by integrating multi-omics data. BMC Genomics 2024; 25:418. [PMID: 38679745 PMCID: PMC11057162 DOI: 10.1186/s12864-024-10258-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Accepted: 03/26/2024] [Indexed: 05/01/2024] Open
Abstract
BACKGROUND Plant specialized (or secondary) metabolites (PSM), also known as phytochemicals, natural products, or plant constituents, play essential roles in interactions between plants and environment. Although many research efforts have focused on discovering novel metabolites and their biosynthetic genes, the resolution of metabolic pathways and identified biosynthetic genes was limited by rudimentary analysis approaches and enormous number of candidate genes. RESULTS Here we integrated state-of-the-art automated machine learning (ML) frame AutoGluon-Tabular and multi-omics data from Arabidopsis to predict genes encoding enzymes involved in biosynthesis of plant specialized metabolite (PSM), focusing on the three main PSM categories: terpenoids, alkaloids, and phenolics. We found that the related features of genomics and proteomics were the top two crucial categories of features contributing to the model performance. Using only these key features, we built a new model in Arabidopsis, which performed better than models built with more features including those related with transcriptomics and epigenomics. Finally, the built models were validated in maize and tomato, and models tested for maize and trained with data from two other species exhibited either equivalent or superior performance to intraspecies predictions. CONCLUSIONS Our external validation results in grape and poppy on the one hand implied the applicability of our model to the other species, and on the other hand showed enormous potential to improve the prediction of enzymes synthesizing PSM with the inclusion of valid data from a wider range of species.
Collapse
Affiliation(s)
- Wenhui Bai
- College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, 518000, Shenzhen
| | - Cheng Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, 518000, Shenzhen
| | - Wei Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, 518000, Shenzhen
| | - Hai Wang
- National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization, Joint Laboratory for International Cooperation in Crop Molecular Breeding, China Agricultural University, Beijing, 100193, China
| | - Xiaohong Han
- College of Computer Science and Technology (College of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China.
| | - Peipei Wang
- Kunpeng Institute of Modern Agriculture at Foshan, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China.
| | - Li Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, China, 518000, Shenzhen.
| |
Collapse
|
6
|
Ishii K, Kazama Y, Hirano T, Fawcett JA, Sato M, Hirai MY, Sakai F, Shirakawa Y, Ohbu S, Abe T. Genomic view of heavy-ion-induced deletions associated with distribution of essential genes in Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2024; 15:1352564. [PMID: 38693931 PMCID: PMC11061394 DOI: 10.3389/fpls.2024.1352564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 03/11/2024] [Indexed: 05/03/2024]
Abstract
Heavy-ion beam, a type of ionizing radiation, has been applied to plant breeding as a powerful mutagen and is a promising tool to induce large deletions and chromosomal rearrangements. The effectiveness of heavy-ion irradiation can be explained by linear energy transfer (LET; keV µm-1). Heavy-ion beams with different LET values induce different types and sizes of mutations. It has been suggested that deletion size increases with increasing LET value, and complex chromosomal rearrangements are induced in higher LET radiations. In this study, we mapped heavy-ion beam-induced deletions detected in Arabidopsis mutants to its genome. We revealed that deletion sizes were similar between different LETs (100 to 290 keV μm-1), that their upper limit was affected by the distribution of essential genes, and that the detected chromosomal rearrangements avoid disrupting the essential genes. We also focused on tandemly arrayed genes (TAGs), where two or more homologous genes are adjacent to one another in the genome. Our results suggested that 100 keV µm-1 of LET is enough to disrupt TAGs and that the distribution of essential genes strongly affects the heritability of mutations overlapping them. Our results provide a genomic view of large deletion inductions in the Arabidopsis genome.
Collapse
Affiliation(s)
- Kotaro Ishii
- RIKEN Nishina Center for Accelerator-Based Science, Wako, Japan
- Department of Radiation Measurement and Dose Assessment, Institute for Radiological Science, Quantum Life and Medical Science Directorate, National Institutes for Quantum Science and Technology, Chiba, Japan
| | - Yusuke Kazama
- RIKEN Nishina Center for Accelerator-Based Science, Wako, Japan
- Department of Bioscience and Biotechnology, Fukui Prefectural University, Eiheiji-cho, Japan
| | - Tomonari Hirano
- RIKEN Nishina Center for Accelerator-Based Science, Wako, Japan
- Faculty of Agriculture, University of Miyazaki, Miyazaki, Japan
| | - Jeffrey A. Fawcett
- RIKEN Interdisciplinary Theoretical and Mathematical Sciences (iTHEMS), Wako, Japan
| | - Muneo Sato
- RIKEN Center for Sustainable Resource Science, Yokohama, Japan
| | - Masami Yokota Hirai
- RIKEN Center for Sustainable Resource Science, Yokohama, Japan
- Graduate School of Bioagricultural Science, Nagoya University, Nagoya, Japan
| | | | - Yuki Shirakawa
- RIKEN Nishina Center for Accelerator-Based Science, Wako, Japan
| | - Sumie Ohbu
- RIKEN Nishina Center for Accelerator-Based Science, Wako, Japan
| | - Tomoko Abe
- RIKEN Nishina Center for Accelerator-Based Science, Wako, Japan
| |
Collapse
|
7
|
Lim I, Park YJ, Ha J. Evolutionary and synteny analysis of HIS1, BADH2, GBSS1, and GBSS2 in rice: insights for effective introgression breeding strategies. Sci Rep 2024; 14:5226. [PMID: 38433262 PMCID: PMC10909864 DOI: 10.1038/s41598-024-55581-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 02/26/2024] [Indexed: 03/05/2024] Open
Abstract
The key genes BADH2, GBSS1, GBSS2, and HIS1 regulate the fragrance, starch synthesis, and herbicide resistance in rice. Although the molecular functions of four genes have been investigated in the Oryza sativa species, little is known regarding their evolutionary history in the Oryza genus. Here, we studied the evolution of four focal genes in 10 Oryza species using phylogenetic and syntenic approaches. The HIS1 family underwent several times of tandem duplication events in the Oryza species, resulting in copy number variation ranging from 2 to 7. At most one copy of BADH2, GBSS1, and GBSS2 orthologs were identified in each Oryza species, and gene loss events of BADH2 and GBSS2 were identified in three Oryza species. Gene transfer analysis proposed that the functional roles of GBSS1 and GBSS2 were developed in the Asian and African regions, respectively, and most allelic variations of BADH2 in japonica rice emerged after the divergence between the Asian and African rice groups. These results provide clues to determine the origin and evolution of the key genes in rice breeding as well as valuable information for molecular breeders and scientists to develop efficient strategies to simultaneously improve grain quality and yield potential in rice.
Collapse
Affiliation(s)
- Insu Lim
- Department of Plant Science, Gangneung-Wonju National University, Gangneung, South Korea
| | - Yong-Jin Park
- Department of Plant Sciences, Kongju National University, Yesan, 340-702, Korea
| | - Jungmin Ha
- Department of Plant Science, Gangneung-Wonju National University, Gangneung, South Korea.
| |
Collapse
|
8
|
Liang Y, Luo H, Lin Y, Gao F. Recent advances in the characterization of essential genes and development of a database of essential genes. IMETA 2024; 3:e157. [PMID: 38868518 PMCID: PMC10989110 DOI: 10.1002/imt2.157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 10/09/2023] [Indexed: 06/14/2024]
Abstract
Over the past few decades, there has been a significant interest in the study of essential genes, which are crucial for the survival of an organism under specific environmental conditions and thus have practical applications in the fields of synthetic biology and medicine. An increasing amount of experimental data on essential genes has been obtained with the continuous development of technological methods. Meanwhile, various computational prediction methods, related databases and web servers have emerged accordingly. To facilitate the study of essential genes, we have established a database of essential genes (DEG), which has become popular with continuous updates to facilitate essential gene feature analysis and prediction, drug and vaccine development, as well as artificial genome design and construction. In this article, we summarized the studies of essential genes, overviewed the relevant databases, and discussed their practical applications. Furthermore, we provided an overview of the main applications of DEG and conducted comprehensive analyses based on its latest version. However, it should be noted that the essential gene is a dynamic concept instead of a binary one, which presents both opportunities and challenges for their future development.
Collapse
Affiliation(s)
| | - Hao Luo
- Department of PhysicsTianjin UniversityTianjinChina
| | - Yan Lin
- Department of PhysicsTianjin UniversityTianjinChina
| | - Feng Gao
- Department of PhysicsTianjin UniversityTianjinChina
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education)Tianjin UniversityTianjinChina
- SynBio Research PlatformCollaborative Innovation Center of Chemical Science and Engineering (Tianjin)TianjinChina
| |
Collapse
|
9
|
Kang L, Li C, Qin A, Liu Z, Li X, Zeng L, Yu H, Wang Y, Song J, Chen R. Identification and Expression Analysis of the Nucleotidyl Transferase Protein (NTP) Family in Soybean ( Glycine max) under Various Abiotic Stresses. Int J Mol Sci 2024; 25:1115. [PMID: 38256188 PMCID: PMC10816777 DOI: 10.3390/ijms25021115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Revised: 12/16/2023] [Accepted: 12/22/2023] [Indexed: 01/24/2024] Open
Abstract
Nucleotidyl transferases (NTPs) are common transferases in eukaryotes and play a crucial role in nucleotide modifications at the 3' end of RNA. In plants, NTPs can regulate RNA stability by influencing 3' end modifications, which in turn affect plant growth, development, stress responses, and disease resistance. Although the functions of NTP family members have been extensively studied in Arabidopsis, rice, and maize, there is limited knowledge about NTP genes in soybeans. In this study, we identified 16 members of the NTP family in soybeans, including two subfamilies (G1 and G2) with distinct secondary structures, conserved motifs, and domain distributions at the protein level. Evolutionary analysis of genes in the NTP family across multiple species and gene collinearity analysis revealed a relatively conserved evolutionary pattern. Analysis of the tertiary structure of the proteins showed that NTPs have three conserved aspartic acids that bind together to form a possible active site. Tissue-specific expression analysis indicated that some NTP genes exhibit tissue-specific expression, likely due to their specific functions. Stress expression analysis showed significant differences in the expression levels of NTP genes under high salt, drought, and cold stress. Additionally, RNA-seq analysis of soybean plants subjected to salt and drought stress further confirmed the association of soybean NTP genes with abiotic stress responses. Subcellular localization experiments revealed that GmNTP2 and GmNTP14, which likely have similar functions to HESO1 and URT1, are located in the nucleus. These research findings provide a foundation for further investigations into the functions of NTP family genes in soybeans.
Collapse
Affiliation(s)
- Liqing Kang
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Changgen Li
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Aokang Qin
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Zehui Liu
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Xuanyue Li
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Liming Zeng
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Hongyang Yu
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Yihua Wang
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Jianbo Song
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| | - Rongrong Chen
- College of Bioscience and Bioengineering, Jiangxi Agricultural University, Nanchang 330045, China; (L.K.); (C.L.); (A.Q.); (Z.L.); (X.L.); (L.Z.); (H.Y.); (Y.W.)
| |
Collapse
|
10
|
Hu W, Li M, Xiao H, Guan L. Essential genes identification model based on sequence feature map and graph convolutional neural network. BMC Genomics 2024; 25:47. [PMID: 38200437 PMCID: PMC10777564 DOI: 10.1186/s12864-024-09958-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Accepted: 01/01/2024] [Indexed: 01/12/2024] Open
Abstract
BACKGROUND Essential genes encode functions that play a vital role in the life activities of organisms, encompassing growth, development, immune system functioning, and cell structure maintenance. Conventional experimental techniques for identifying essential genes are resource-intensive and time-consuming, and the accuracy of current machine learning models needs further enhancement. Therefore, it is crucial to develop a robust computational model to accurately predict essential genes. RESULTS In this study, we introduce GCNN-SFM, a computational model for identifying essential genes in organisms, based on graph convolutional neural networks (GCNN). GCNN-SFM integrates a graph convolutional layer, a convolutional layer, and a fully connected layer to model and extract features from gene sequences of essential genes. Initially, the gene sequence is transformed into a feature map using coding techniques. Subsequently, a multi-layer GCN is employed to perform graph convolution operations, effectively capturing both local and global features of the gene sequence. Further feature extraction is performed, followed by integrating convolution and fully-connected layers to generate prediction results for essential genes. The gradient descent algorithm is utilized to iteratively update the cross-entropy loss function, thereby enhancing the accuracy of the prediction results. Meanwhile, model parameters are tuned to determine the optimal parameter combination that yields the best prediction performance during training. CONCLUSIONS Experimental evaluation demonstrates that GCNN-SFM surpasses various advanced essential gene prediction models and achieves an average accuracy of 94.53%. This study presents a novel and effective approach for identifying essential genes, which has significant implications for biology and genomics research.
Collapse
Affiliation(s)
- Wenxing Hu
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| | - Mengshan Li
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China.
| | - Haiyang Xiao
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| | - Lixin Guan
- College of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, 341000, China
| |
Collapse
|
11
|
Pons C, van Leeuwen J. Meta-analysis of dispensable essential genes and their interactions with bypass suppressors. Life Sci Alliance 2024; 7:e202302192. [PMID: 37918966 PMCID: PMC10622647 DOI: 10.26508/lsa.202302192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 10/24/2023] [Accepted: 10/25/2023] [Indexed: 11/04/2023] Open
Abstract
Genes have been historically classified as essential or non-essential based on their requirement for viability. However, genomic mutations can sometimes bypass the requirement for an essential gene, challenging the binary classification of gene essentiality. Such dispensable essential genes represent a valuable model for understanding the incomplete penetrance of loss-of-function mutations often observed in natural populations. Here, we compiled data from multiple studies on essential gene dispensability in Saccharomyces cerevisiae to comprehensively characterize these genes. In analyses spanning different evolutionary timescales, dispensable essential genes exhibited distinct phylogenetic properties compared with other essential and non-essential genes. Integration of interactions with suppressor genes that can bypass the gene essentiality revealed the high functional modularity of the bypass suppression network. Furthermore, dispensable essential and bypass suppressor gene pairs reflected simultaneous changes in the mutational landscape of S. cerevisiae strains. Importantly, species in which dispensable essential genes were non-essential tended to carry bypass suppressor mutations in their genomes. Overall, our study offers a comprehensive view of dispensable essential genes and illustrates how their interactions with bypass suppressors reflect evolutionary outcomes.
Collapse
Affiliation(s)
- Carles Pons
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute for Science and Technology, Barcelona, Spain
| | - Jolanda van Leeuwen
- Center for Integrative Genomics, Bâtiment Génopode, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
12
|
Sen S, Woodhouse MR, Portwood JL, Andorf CM. Maize Feature Store: A centralized resource to manage and analyze curated maize multi-omics features for machine learning applications. Database (Oxford) 2023; 2023:baad078. [PMID: 37935586 PMCID: PMC10634621 DOI: 10.1093/database/baad078] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 09/16/2023] [Accepted: 10/19/2023] [Indexed: 11/09/2023]
Abstract
The big-data analysis of complex data associated with maize genomes accelerates genetic research and improves agronomic traits. As a result, efforts have increased to integrate diverse datasets and extract meaning from these measurements. Machine learning models are a powerful tool for gaining knowledge from large and complex datasets. However, these models must be trained on high-quality features to succeed. Currently, there are no solutions to host maize multi-omics datasets with end-to-end solutions for evaluating and linking features to target gene annotations. Our work presents the Maize Feature Store (MFS), a versatile application that combines features built on complex data to facilitate exploration, modeling and analysis. Feature stores allow researchers to rapidly deploy machine learning applications by managing and providing access to frequently used features. We populated the MFS for the maize reference genome with over 14 000 gene-based features based on published genomic, transcriptomic, epigenomic, variomic and proteomics datasets. Using the MFS, we created an accurate pan-genome classification model with an AUC-ROC score of 0.87. The MFS is publicly available through the maize genetics and genomics database. Database URL https://mfs.maizegdb.org/.
Collapse
Affiliation(s)
- Shatabdi Sen
- Department of Plant Pathology & Microbiology, Iowa State University, 1344 Advanced Teaching & Research Bldg, 2213 Pammel Dr, Ames, IA 50011, USA
| | - Margaret R Woodhouse
- USDA-ARS, Corn Insects and Crop Genetics Research Unit, 819 Wallace Road, Ames, IA 50011, USA
| | - John L Portwood
- USDA-ARS, Corn Insects and Crop Genetics Research Unit, 819 Wallace Road, Ames, IA 50011, USA
| | - Carson M Andorf
- USDA-ARS, Corn Insects and Crop Genetics Research Unit, 819 Wallace Road, Ames, IA 50011, USA
- Department of Computer Science, Iowa State University, Atanasoff Hall, 2434 Osborn Dr, Ames, IA 50011, USA
| |
Collapse
|
13
|
Ivanov PA, Gasanova TV, Repina MN, Zamyatnin AA. Signaling and Resistosome Formation in Plant Innate Immunity to Viruses: Is There a Common Mechanism of Antiviral Resistance Conserved across Kingdoms? Int J Mol Sci 2023; 24:13625. [PMID: 37686431 PMCID: PMC10487714 DOI: 10.3390/ijms241713625] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/16/2023] [Accepted: 08/31/2023] [Indexed: 09/10/2023] Open
Abstract
Virus-specific proteins, including coat proteins, movement proteins, replication proteins, and suppressors of RNA interference are capable of triggering the hypersensitive response (HR), which is a type of cell death in plants. The main cell death signaling pathway involves direct interaction of HR-inducing proteins with nucleotide-binding leucine-rich repeats (NLR) proteins encoded by plant resistance genes. Singleton NLR proteins act as both sensor and helper. In other cases, NLR proteins form an activation network leading to their oligomerization and formation of membrane-associated resistosomes, similar to metazoan inflammasomes and apoptosomes. In resistosomes, coiled-coil domains of NLR proteins form Ca2+ channels, while toll-like/interleukin-1 receptor-type (TIR) domains form oligomers that display NAD+ glycohydrolase (NADase) activity. This review is intended to highlight the current knowledge on plant innate antiviral defense signaling pathways in an attempt to define common features of antiviral resistance across the kingdoms of life.
Collapse
Affiliation(s)
- Peter A. Ivanov
- Faculty of Biology, Lomonosov Moscow State University, Moscow 119234, Russia; (P.A.I.); (T.V.G.); (M.N.R.)
| | - Tatiana V. Gasanova
- Faculty of Biology, Lomonosov Moscow State University, Moscow 119234, Russia; (P.A.I.); (T.V.G.); (M.N.R.)
| | - Maria N. Repina
- Faculty of Biology, Lomonosov Moscow State University, Moscow 119234, Russia; (P.A.I.); (T.V.G.); (M.N.R.)
| | - Andrey A. Zamyatnin
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow 119234, Russia
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow 119992, Russia
- Research Center for Translational Medicine, Sirius University of Science and Technology, Sirius 354340, Krasnodar Region, Russia
- Institute of Translational Medicine and Biotechnology, Sechenov First Moscow State Medical University, Moscow 119991, Russia
| |
Collapse
|
14
|
Ušák D, Haluška S, Pleskot R. Callose synthesis at the center point of plant development-An evolutionary insight. PLANT PHYSIOLOGY 2023; 193:54-69. [PMID: 37165709 DOI: 10.1093/plphys/kiad274] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 04/21/2023] [Accepted: 04/21/2023] [Indexed: 05/12/2023]
Abstract
Polar callose deposition into the extracellular matrix is tightly controlled in time and space. Its presence in the cell wall modifies the properties of the surrounding area, which is fundamental for the correct execution of numerous processes such as cell division, male gametophyte development, intercellular transport, or responses to biotic and abiotic stresses. Previous studies have been invaluable in characterizing specific callose synthases (CalSs) during individual cellular processes. However, the complex view of the relationships between a particular CalS and a specific process is still lacking. Here we review the recent proceedings on the role of callose and individual CalSs in cell wall remodelling from an evolutionary perspective and with a particular focus on cytokinesis. We provide a robust phylogenetic analysis of CalS across the plant kingdom, which implies a 3-subfamily distribution of CalS. We also discuss the possible linkage between the evolution of CalSs and their function in specific cell types and processes.
Collapse
Affiliation(s)
- David Ušák
- Czech Academy of Sciences, Institute of Experimental Botany, 165 02 Prague, Czech Republic
- Department of Experimental Plant Biology, Faculty of Science, Charles University in Prague, 128 44 Prague, Czech Republic
| | - Samuel Haluška
- Czech Academy of Sciences, Institute of Experimental Botany, 165 02 Prague, Czech Republic
- Department of Experimental Plant Biology, Faculty of Science, Charles University in Prague, 128 44 Prague, Czech Republic
| | - Roman Pleskot
- Czech Academy of Sciences, Institute of Experimental Botany, 165 02 Prague, Czech Republic
| |
Collapse
|
15
|
Adachi H, Sakai T, Harant A, Pai H, Honda K, Toghani A, Claeys J, Duggan C, Bozkurt TO, Wu CH, Kamoun S. An atypical NLR protein modulates the NRC immune receptor network in Nicotiana benthamiana. PLoS Genet 2023; 19:e1010500. [PMID: 36656829 PMCID: PMC9851556 DOI: 10.1371/journal.pgen.1010500] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 10/27/2022] [Indexed: 01/20/2023] Open
Abstract
The NRC immune receptor network has evolved in asterid plants from a pair of linked genes into a genetically dispersed and phylogenetically structured network of sensor and helper NLR (nucleotide-binding domain and leucine-rich repeat-containing) proteins. In some species, such as the model plant Nicotiana benthamiana and other Solanaceae, the NRC (NLR-REQUIRED FOR CELL DEATH) network forms up to half of the NLRome, and NRCs are scattered throughout the genome in gene clusters of varying complexities. Here, we describe NRCX, an atypical member of the NRC family that lacks canonical features of these NLR helper proteins, such as a functional N-terminal MADA motif and the capacity to trigger autoimmunity. In contrast to other NRCs, systemic gene silencing of NRCX in N. benthamiana markedly impairs plant growth resulting in a dwarf phenotype. Remarkably, dwarfism of NRCX silenced plants is partially dependent on NRCX paralogs NRC2 and NRC3, but not NRC4. Despite its negative impact on plant growth when silenced systemically, spot gene silencing of NRCX in mature N. benthamiana leaves doesn't result in visible cell death phenotypes. However, alteration of NRCX expression modulates the hypersensitive response mediated by NRC2 and NRC3 in a manner consistent with a negative role for NRCX in the NRC network. We conclude that NRCX is an atypical member of the NRC network that has evolved to contribute to the homeostasis of this genetically unlinked NLR network.
Collapse
Affiliation(s)
- Hiroaki Adachi
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
- Laboratory of Crop Evolution, Graduate School of Agriculture, Kyoto University, Kyoto, Japan
- JST-PRESTO, Saitama, Japan
| | - Toshiyuki Sakai
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
- Laboratory of Crop Evolution, Graduate School of Agriculture, Kyoto University, Kyoto, Japan
| | - Adeline Harant
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Hsuan Pai
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Kodai Honda
- Laboratory of Crop Evolution, Graduate School of Agriculture, Kyoto University, Kyoto, Japan
| | - AmirAli Toghani
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Jules Claeys
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Cian Duggan
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Tolga O. Bozkurt
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Chih-hang Wu
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
| | - Sophien Kamoun
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| |
Collapse
|
16
|
Ng JWX, Chua SK, Mutwil M. Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2022; 13:944992. [PMID: 36212273 PMCID: PMC9539877 DOI: 10.3389/fpls.2022.944992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 08/24/2022] [Indexed: 06/16/2023]
Abstract
Understanding how the different cellular components are working together to form a living cell requires multidisciplinary approaches combining molecular and computational biology. Machine learning shows great potential in life sciences, as it can find novel relationships between biological features. Here, we constructed a dataset of 11,801 gene features for 31,522 Arabidopsis thaliana genes and developed a machine learning workflow to identify linked features. The detected linked features are visualised as a Feature Important Network (FIN), which can be mined to reveal a variety of novel biological insights pertaining to gene function. We demonstrate how FIN can be used to generate novel insights into gene function. To make this network easily accessible to the scientific community, we present the FINder database, available at finder.plant.tools.
Collapse
|
17
|
LeBlanc N, Charles TC. Bacterial genome reductions: Tools, applications, and challenges. Front Genome Ed 2022; 4:957289. [PMID: 36120530 PMCID: PMC9473318 DOI: 10.3389/fgeed.2022.957289] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 07/29/2022] [Indexed: 11/16/2022] Open
Abstract
Bacterial cells are widely used to produce value-added products due to their versatility, ease of manipulation, and the abundance of genome engineering tools. However, the efficiency of producing these desired biomolecules is often hindered by the cells’ own metabolism, genetic instability, and the toxicity of the product. To overcome these challenges, genome reductions have been performed, making strains with the potential of serving as chassis for downstream applications. Here we review the current technologies that enable the design and construction of such reduced-genome bacteria as well as the challenges that limit their assembly and applicability. While genomic reductions have shown improvement of many cellular characteristics, a major challenge still exists in constructing these cells efficiently and rapidly. Computational tools have been created in attempts at minimizing the time needed to design these organisms, but gaps still exist in modelling these reductions in silico. Genomic reductions are a promising avenue for improving the production of value-added products, constructing chassis cells, and for uncovering cellular function but are currently limited by their time-consuming construction methods. With improvements to and the creation of novel genome editing tools and in silico models, these approaches could be combined to expedite this process and create more streamlined and efficient cell factories.
Collapse
Affiliation(s)
- Nicole LeBlanc
- Department of Biology, University of Waterloo, Waterloo, ON, Canada
- *Correspondence: Nicole LeBlanc,
| | - Trevor C. Charles
- Department of Biology, University of Waterloo, Waterloo, ON, Canada
- Metagenom Bio Life Science Inc., Waterloo, ON, Canada
| |
Collapse
|
18
|
Pathak B, Maurya C, Faria MC, Alizada Z, Nandy S, Zhao S, Jamsheer K M, Srivastava V. Targeting TOR and SnRK1 Genes in Rice with CRISPR/Cas9. PLANTS (BASEL, SWITZERLAND) 2022; 11:plants11111453. [PMID: 35684226 PMCID: PMC9183148 DOI: 10.3390/plants11111453] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 05/19/2022] [Accepted: 05/23/2022] [Indexed: 05/29/2023]
Abstract
Genome targeting with CRISPR/Cas9 is a popular method for introducing mutations and creating knock-out effects. However, limited information is currently available on the mutagenesis of essential genes. This study investigated the efficiency of CRISPR/Cas9 in targeting rice essential genes: the singleton TARGET OF RAPAMYCIN (OsTOR) and the three paralogs of the Sucrose non-fermenting-1 (SNF1)-related kinase 1 (OsSnRK1α), OsSnRK1αA, OsSnRK1αB and OsSnRK1αC. Strong activity of constitutively expressed CRISPR/Cas9 was effective in creating mutations in OsTOR and OsSnRK1α genes, but inducible CRISPR/Cas9 failed to generate detectable mutations. The rate of OsTOR mutagenesis was relatively lower and only the kinase domain of OsTOR could be targeted, while mutations in the HEAT region were unrecoverable. OsSnRK1α paralogs could be targeted at higher rates; however, sterility or early senescence was observed in >50% of the primary mutants. Additionally, OsSnRK1αB and OsSnRK1αC, which bear high sequence homologies, could be targeted simultaneously to generate double-mutants. Further, although limited types of mutations were found in the surviving mutants, the recovered lines displayed loss-of-function or knockdown tor or snrk1 phenotypes. Overall, our data show that mutations in these essential genes can be created by CRISPR/Cas9 to facilitate investigations on their roles in plant development and environmental response in rice.
Collapse
Affiliation(s)
- Bhuvan Pathak
- Department of Crop, Soil & Environmental Sciences, University of Arkansas System Division of Agriculture, Fayetteville, AR 72701, USA; (B.P.); (C.M.); (M.C.F.); (S.N.); (S.Z.)
| | - Chandan Maurya
- Department of Crop, Soil & Environmental Sciences, University of Arkansas System Division of Agriculture, Fayetteville, AR 72701, USA; (B.P.); (C.M.); (M.C.F.); (S.N.); (S.Z.)
| | - Maria C. Faria
- Department of Crop, Soil & Environmental Sciences, University of Arkansas System Division of Agriculture, Fayetteville, AR 72701, USA; (B.P.); (C.M.); (M.C.F.); (S.N.); (S.Z.)
| | - Zahra Alizada
- Cell and Molecular Biology Program, University of Arkansas, Fayetteville, AR 72701, USA;
| | - Soumen Nandy
- Department of Crop, Soil & Environmental Sciences, University of Arkansas System Division of Agriculture, Fayetteville, AR 72701, USA; (B.P.); (C.M.); (M.C.F.); (S.N.); (S.Z.)
| | - Shan Zhao
- Department of Crop, Soil & Environmental Sciences, University of Arkansas System Division of Agriculture, Fayetteville, AR 72701, USA; (B.P.); (C.M.); (M.C.F.); (S.N.); (S.Z.)
| | - Muhammed Jamsheer K
- Amity Institute of Genome Engineering, Amity University Uttar Pradesh, Noida 201313, India;
| | - Vibha Srivastava
- Department of Crop, Soil & Environmental Sciences, University of Arkansas System Division of Agriculture, Fayetteville, AR 72701, USA; (B.P.); (C.M.); (M.C.F.); (S.N.); (S.Z.)
- Cell and Molecular Biology Program, University of Arkansas, Fayetteville, AR 72701, USA;
| |
Collapse
|
19
|
Mukherjee D, Saha D, Acharya D, Mukherjee A, Ghosh TC. Interplay between gene expression and gene architecture as a consequence of gene and genome duplications: evidence from metabolic genes of Arabidopsis thaliana. PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2022; 28:1091-1108. [PMID: 35722515 PMCID: PMC9203644 DOI: 10.1007/s12298-022-01188-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 05/16/2022] [Accepted: 05/18/2022] [Indexed: 05/03/2023]
Abstract
Gene and genome duplications have been widespread during the evolution of flowering plant which resulted in the increment of biological complexity as well as creation of plasticity of a genome helping the species to adapt to changing environments. Duplicated genes with higher evolutionary rates can act as a mechanism of generating novel functions in secondary metabolism. In this study, we explored duplication as a potential factor governing the expression heterogeneity and gene architecture of Primary Metabolic Genes (PMGs) and Secondary Metabolic Genes (SMGs) of Arabidopsis thaliana. It is remarkable that different types of duplication processes controlled gene expression and tissue specificity differently in PMGs and SMGs. A complex relationship exists between gene architecture and expression patterns of primary and secondary metabolic genes. Our study reflects, expression heterogeneity and gene structure variation of primary and secondary metabolism in Arabidopsis thaliana are partly results of duplication events of different origins. Our study suggests that duplication has differential effect on PMGs and SMGs regarding expression pattern by controlling gene structure, epigenetic modifications, multifunctionality and subcellular compartmentalization. This study provides an insight into the evolution of metabolism in plants in the light of gene and genome scale duplication. Supplementary Information The online version contains supplementary material available at 10.1007/s12298-022-01188-2.
Collapse
Affiliation(s)
- Dola Mukherjee
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata, 700 054 India
| | - Deeya Saha
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata, 700 054 India
| | - Debarun Acharya
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata, 700 054 India
| | - Ashutosh Mukherjee
- Department of Botany, Vivekananda College, 269, Diamond Harbour Road, Thakurpukur, Kolkata, West Bengal 700063 India
| | - Tapash Chandra Ghosh
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata, 700 054 India
| |
Collapse
|
20
|
Das D, Singha DL, Paswan RR, Chowdhury N, Sharma M, Reddy PS, Chikkaputtaiah C. Recent advancements in CRISPR/Cas technology for accelerated crop improvement. PLANTA 2022; 255:109. [PMID: 35460444 DOI: 10.1007/s00425-022-03894-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 03/29/2022] [Indexed: 06/14/2023]
Abstract
Precise genome engineering approaches could be perceived as a second paradigm for targeted trait improvement in crop plants, with the potential to overcome the constraints imposed by conventional CRISPR/Cas technology. The likelihood of reduced agricultural production due to highly turbulent climatic conditions increases as the global population expands. The second paradigm of stress-resilient crops with enhanced tolerance and increased productivity against various stresses is paramount to support global production and consumption equilibrium. Although traditional breeding approaches have substantially increased crop production and yield, effective strategies are anticipated to restore crop productivity even further in meeting the world's increasing food demands. CRISPR/Cas, which originated in prokaryotes, has surfaced as a coveted genome editing tool in recent decades, reshaping plant molecular biology in unprecedented ways and paving the way for engineering stress-tolerant crops. CRISPR/Cas is distinguished by its efficiency, high target specificity, and modularity, enables precise genetic modification of crop plants, allowing for the creation of allelic variations in the germplasm and the development of novel and more productive agricultural practices. Additionally, a slew of advanced biotechnologies premised on the CRISPR/Cas methodologies have augmented fundamental research and plant synthetic biology toolkits. Here, we describe gene editing tools, including CRISPR/Cas and its imitative tools, such as base and prime editing, multiplex genome editing, chromosome engineering followed by their implications in crop genetic improvement. Further, we comprehensively discuss the latest developments of CRISPR/Cas technology including CRISPR-mediated gene drive, tissue-specific genome editing, dCas9 mediated epigenetic modification and programmed self-elimination of transgenes in plants. Finally, we highlight the applicability and scope of advanced CRISPR-based techniques in crop genetic improvement.
Collapse
Affiliation(s)
- Debajit Das
- Biological Sciences and Technology Division, CSIR-North East Institute of Science and Technology (CSIR-NEIST), Jorhat, Assam, 785006, India
| | - Dhanawantari L Singha
- Biological Sciences and Technology Division, CSIR-North East Institute of Science and Technology (CSIR-NEIST), Jorhat, Assam, 785006, India
| | - Ricky Raj Paswan
- Department of Agricultural Biotechnology, Assam Agricultural University, Jorhat, Assam, 785013, India
| | - Naimisha Chowdhury
- Biological Sciences and Technology Division, CSIR-North East Institute of Science and Technology (CSIR-NEIST), Jorhat, Assam, 785006, India
| | - Monica Sharma
- Biological Sciences and Technology Division, CSIR-North East Institute of Science and Technology (CSIR-NEIST), Jorhat, Assam, 785006, India
| | - Palakolanu Sudhakar Reddy
- International Crop Research Institute for the Semi Arid Tropics (ICRISAT), Patancheru, Hyderabad, 502 324, India
| | - Channakeshavaiah Chikkaputtaiah
- Biological Sciences and Technology Division, CSIR-North East Institute of Science and Technology (CSIR-NEIST), Jorhat, Assam, 785006, India.
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201 002, India.
| |
Collapse
|
21
|
Chennakesavulu K, Singh H, Trivedi PK, Jain M, Yadav SR. State-of-the-Art in CRISPR Technology and Engineering Drought, Salinity, and Thermo-tolerant crop plants. PLANT CELL REPORTS 2022; 41:815-831. [PMID: 33742256 DOI: 10.1007/s00299-021-02681-w] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 03/04/2021] [Indexed: 05/28/2023]
Abstract
Our review has described principles and functional importance of CRISPR-Cas9 with emphasis on the recent advancements, such as CRISPR-Cpf1, base editing (BE), prime editing (PE), epigenome editing, tissue-specific (CRISPR-TSKO), and inducible genome editing and their potential applications in generating stress-tolerant plants. Improved agricultural practices and enhanced food crop production using innovative crop breeding technology is essential for increasing access to nutritious foods across the planet. The crop plants play a pivotal role in energy and nutrient supply to humans. The abiotic stress factors, such as drought, heat, and salinity cause a substantial yield loss in crop plants and threaten food security. The most sustainable and eco-friendly way to overcome these challenges are the breeding of crop cultivars with improved tolerance against abiotic stress factors. The conventional plant breeding methods have been highly successful in developing abiotic stress-tolerant crop varieties, but usually cumbersome and time-consuming. Alternatively, the CRISPR/Cas genome editing has emerged as a revolutionary tool for making efficient and precise genetic manipulations in plant genomes. Here, we provide a comprehensive review of the CRISPR/Cas genome editing (GE) technology with an emphasis on recent advances in the plant genome editing, including base editing (BE), prime editing (PE), epigenome editing, tissue-specific (CRISPR-TSKO), and inducible genome editing (CRISPR-IGE), which can be used for obtaining cultivars with enhanced tolerance to various abiotic stress factors. We also describe tissue culture-free, DNA-free GE technology, and some of the CRISPR-based tools that can be modified for their use in crop plants.
Collapse
Affiliation(s)
- Kunchapu Chennakesavulu
- Department of Biotechnology, Indian Institute of Technology, Roorkee, Uttarakhand, 247667, India
| | - Harshita Singh
- Department of Biotechnology, Indian Institute of Technology, Roorkee, Uttarakhand, 247667, India
| | - Prabodh Kumar Trivedi
- CSIR-Central Institute of Medicinal and Aromatic Plants (CSIR-CIMAP), Near Kukrail Picnic Spot, Lucknow, 226015, India
| | - Mukesh Jain
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Shri Ram Yadav
- Department of Biotechnology, Indian Institute of Technology, Roorkee, Uttarakhand, 247667, India.
| |
Collapse
|
22
|
Pfeiffer ML, Winkler J, Van Damme D, Jacobs TB, Nowack MK. Conditional and tissue-specific approaches to dissect essential mechanisms in plant development. CURRENT OPINION IN PLANT BIOLOGY 2022; 65:102119. [PMID: 34653951 PMCID: PMC7612331 DOI: 10.1016/j.pbi.2021.102119] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 09/03/2021] [Accepted: 09/09/2021] [Indexed: 05/19/2023]
Abstract
Reverse genetics approaches are routinely used to investigate gene function. However, mutations, especially in critical genes, can lead to pleiotropic effects as severe as lethality, thus limiting functional studies in specific contexts. Approaches that allow for modifications of genes or gene products in a specific spatial or temporal setting can overcome these limitations. The advent of CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) technologies has not only revolutionized targeted genome modification in plants but also enabled new possibilities for inducible and tissue-specific manipulation of gene functions at the DNA and RNA levels. In addition, novel approaches for the direct manipulation of target proteins have been introduced in plant systems. Here, we review the current development in tissue-specific and conditional manipulation approaches at the DNA, RNA, and protein levels.
Collapse
Affiliation(s)
- Marie L Pfeiffer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052, Ghent, Belgium; VIB Center for Plant Systems Biology, 9052, Ghent, Belgium
| | - Joanna Winkler
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052, Ghent, Belgium; VIB Center for Plant Systems Biology, 9052, Ghent, Belgium
| | - Daniël Van Damme
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052, Ghent, Belgium; VIB Center for Plant Systems Biology, 9052, Ghent, Belgium
| | - Thomas B Jacobs
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052, Ghent, Belgium; VIB Center for Plant Systems Biology, 9052, Ghent, Belgium.
| | - Moritz K Nowack
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052, Ghent, Belgium; VIB Center for Plant Systems Biology, 9052, Ghent, Belgium.
| |
Collapse
|
23
|
Monroe JG, Srikant T, Carbonell-Bejerano P, Becker C, Lensink M, Exposito-Alonso M, Klein M, Hildebrandt J, Neumann M, Kliebenstein D, Weng ML, Imbert E, Ågren J, Rutter MT, Fenster CB, Weigel D. Mutation bias reflects natural selection in Arabidopsis thaliana. Nature 2022; 602:101-105. [PMID: 35022609 PMCID: PMC8810380 DOI: 10.1038/s41586-021-04269-6] [Citation(s) in RCA: 168] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 11/17/2021] [Indexed: 12/24/2022]
Abstract
Since the first half of the twentieth century, evolutionary theory has been dominated by the idea that mutations occur randomly with respect to their consequences1. Here we test this assumption with large surveys of de novo mutations in the plant Arabidopsis thaliana. In contrast to expectations, we find that mutations occur less often in functionally constrained regions of the genome-mutation frequency is reduced by half inside gene bodies and by two-thirds in essential genes. With independent genomic mutation datasets, including from the largest Arabidopsis mutation accumulation experiment conducted to date, we demonstrate that epigenomic and physical features explain over 90% of variance in the genome-wide pattern of mutation bias surrounding genes. Observed mutation frequencies around genes in turn accurately predict patterns of genetic polymorphisms in natural Arabidopsis accessions (r = 0.96). That mutation bias is the primary force behind patterns of sequence evolution around genes in natural accessions is supported by analyses of allele frequencies. Finally, we find that genes subject to stronger purifying selection have a lower mutation rate. We conclude that epigenome-associated mutation bias2 reduces the occurrence of deleterious mutations in Arabidopsis, challenging the prevailing paradigm that mutation is a directionless force in evolution.
Collapse
Affiliation(s)
- J Grey Monroe
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany.
- Department of Plant Sciences, University of California Davis, Davis, CA, USA.
| | - Thanvi Srikant
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | | | - Claude Becker
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
- Faculty of Biology, Ludwig Maximilian University, Martinsried, Germany
| | - Mariele Lensink
- Department of Plant Sciences, University of California Davis, Davis, CA, USA
| | - Moises Exposito-Alonso
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, USA
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Marie Klein
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
- Department of Plant Sciences, University of California Davis, Davis, CA, USA
| | - Julia Hildebrandt
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Manuela Neumann
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Daniel Kliebenstein
- Department of Plant Sciences, University of California Davis, Davis, CA, USA
| | - Mao-Lun Weng
- Department of Biology, Westfield State University, Westfield, MA, USA
| | - Eric Imbert
- ISEM, University of Montpellier, Montpellier, France
| | - Jon Ågren
- Department of Ecology and Genetics, EBC, Uppsala University, Uppsala, Sweden
| | - Matthew T Rutter
- Department of Biology, College of Charleston, Charleston, SC, USA
| | - Charles B Fenster
- Oak Lake Field Station, South Dakota State University, Brookings, SD, USA
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, Tübingen, Germany.
| |
Collapse
|
24
|
Bush SJ, Murren CJ, Urrutia AO, Kover PX. Contrasting gene-level signatures of selection with reproductive fitness. Mol Ecol 2021; 31:1515-1526. [PMID: 34918851 PMCID: PMC9304172 DOI: 10.1111/mec.16329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 11/29/2021] [Accepted: 12/09/2021] [Indexed: 11/30/2022]
Abstract
Selection leaves signatures in the DNA sequence of genes, with many test statistics devised to detect its action. While these statistics are frequently used to support hypotheses about the adaptive significance of particular genes, the effect these genes have on reproductive fitness is rarely quantified experimentally. Consequently, it is unclear how gene-level signatures of selection are associated with empirical estimates of gene effect on fitness. Eukaryotic datasets that permit this comparison are very limited. Using the model plant Arabidopsis thaliana, for which these resources are available, we calculated seven gene-level substitution and polymorphism-based statistics commonly used to infer selection (dN/dS, NI, DOS, Tajima's D, Fu and Li's D*, Fay and Wu's H, and Zeng's E) and, using knockout lines, compared these to gene-level estimates of effect on fitness. We found that consistent with expectations, essential genes were more likely to be classified as negatively selected. By contrast, using 379 Arabidopsis genes for which data was available, we found no evidence that genes predicted to be positively selected had a significantly different effect on fitness than genes evolving more neutrally. We discuss these results in the context of the analytic challenges posed by Arabidopsis, one of the only systems in which this study could be conducted, and advocate for examination in additional systems. These results are relevant to the evaluation of genome-wide studies across species where experimental fitness data is unavailable, as well as highlighting an increasing need for the latter.
Collapse
Affiliation(s)
- Stephen J Bush
- Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Courtney J Murren
- Department of Biology, College of Charleston, Charleston, SC, USA, 29424
| | - Araxi O Urrutia
- Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK.,Instituto de Ecologia, UNAM, Ciudad de Mexico, 04510, Mexico
| | - Paula X Kover
- Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| |
Collapse
|
25
|
Ahmad M, Waraich EA, Skalicky M, Hussain S, Zulfiqar U, Anjum MZ, Habib ur Rahman M, Brestic M, Ratnasekera D, Lamilla-Tamayo L, Al-Ashkar I, EL Sabagh A. Adaptation Strategies to Improve the Resistance of Oilseed Crops to Heat Stress Under a Changing Climate: An Overview. FRONTIERS IN PLANT SCIENCE 2021; 12:767150. [PMID: 34975951 PMCID: PMC8714756 DOI: 10.3389/fpls.2021.767150] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 11/11/2021] [Indexed: 05/16/2023]
Abstract
Temperature is one of the decisive environmental factors that is projected to increase by 1. 5°C over the next two decades due to climate change that may affect various agronomic characteristics, such as biomass production, phenology and physiology, and yield-contributing traits in oilseed crops. Oilseed crops such as soybean, sunflower, canola, peanut, cottonseed, coconut, palm oil, sesame, safflower, olive etc., are widely grown. Specific importance is the vulnerability of oil synthesis in these crops against the rise in climatic temperature, threatening the stability of yield and quality. The natural defense system in these crops cannot withstand the harmful impacts of heat stress, thus causing a considerable loss in seed and oil yield. Therefore, a proper understanding of underlying mechanisms of genotype-environment interactions that could affect oil synthesis pathways is a prime requirement in developing stable cultivars. Heat stress tolerance is a complex quantitative trait controlled by many genes and is challenging to study and characterize. However, heat tolerance studies to date have pointed to several sophisticated mechanisms to deal with the stress of high temperatures, including hormonal signaling pathways for sensing heat stimuli and acquiring tolerance to heat stress, maintaining membrane integrity, production of heat shock proteins (HSPs), removal of reactive oxygen species (ROS), assembly of antioxidants, accumulation of compatible solutes, modified gene expression to enable changes, intelligent agricultural technologies, and several other agronomic techniques for thriving and surviving. Manipulation of multiple genes responsible for thermo-tolerance and exploring their high expressions greatly impacts their potential application using CRISPR/Cas genome editing and OMICS technology. This review highlights the latest outcomes on the response and tolerance to heat stress at the cellular, organelle, and whole plant levels describing numerous approaches applied to enhance thermos-tolerance in oilseed crops. We are attempting to critically analyze the scattered existing approaches to temperature tolerance used in oilseeds as a whole, work toward extending studies into the field, and provide researchers and related parties with useful information to streamline their breeding programs so that they can seek new avenues and develop guidelines that will greatly enhance ongoing efforts to establish heat stress tolerance in oilseeds.
Collapse
Affiliation(s)
- Muhammad Ahmad
- Department of Agronomy, University of Agriculture, Faisalabad, Pakistan
- Horticultural Sciences Department, Tropical Research and Education Center, Institute of Food and Agricultural Sciences, University of Florida, Homestead, FL, United States
| | | | - Milan Skalicky
- Department of Botany and Plant Physiology, Faculty of Agrobiology, Food and Natural Resources, Czech University of Life Sciences Prague, Prague, Czechia
| | - Saddam Hussain
- Department of Agronomy, University of Agriculture, Faisalabad, Pakistan
| | - Usman Zulfiqar
- Department of Agronomy, University of Agriculture, Faisalabad, Pakistan
| | - Muhammad Zohaib Anjum
- Department of Forestry and Range Management, University of Agriculture, Faisalabad, Pakistan
| | - Muhammad Habib ur Rahman
- Department of Agronomy, Muhammad Nawaz Shareef University of Agriculture, Multan, Pakistan
- Crop Science Group, Institute of Crop Science and Resource Conservation (INRES), University Bonn, Bonn, Germany
| | - Marian Brestic
- Department of Botany and Plant Physiology, Faculty of Agrobiology, Food and Natural Resources, Czech University of Life Sciences Prague, Prague, Czechia
- Department of Plant Physiology, Slovak University of Agriculture, Nitra, Slovakia
| | - Disna Ratnasekera
- Department of Agricultural Biology, Faculty of Agriculture, University of Ruhuna, Kamburupitiya, Sri Lanka
| | - Laura Lamilla-Tamayo
- Department of Botany and Plant Physiology, Faculty of Agrobiology, Food and Natural Resources, Czech University of Life Sciences Prague, Prague, Czechia
| | - Ibrahim Al-Ashkar
- Department of Plant Production, College of Food and Agriculture, King Saud University, Riyadh, Saudi Arabia
- Agronomy Department, Faculty of Agriculture, Al-Azhar University, Cairo, Egypt
| | - Ayman EL Sabagh
- Department of Field Crops, Faculty of Agriculture, Siirt University, Siirt, Turkey
- Department of Agronomy, Faculty of Agriculture, Kafrelsheikh University, Kafr El-Shaikh, Egypt
| |
Collapse
|
26
|
Beder T, Aromolaran O, Dönitz J, Tapanelli S, Adedeji E, Adebiyi E, Bucher G, Koenig R. Identifying essential genes across eukaryotes by machine learning. NAR Genom Bioinform 2021; 3:lqab110. [PMID: 34859210 PMCID: PMC8634067 DOI: 10.1093/nargab/lqab110] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 10/09/2021] [Accepted: 11/29/2021] [Indexed: 02/07/2023] Open
Abstract
Identifying essential genes on a genome scale is resource intensive and has been performed for only a few eukaryotes. For less studied organisms essentiality might be predicted by gene homology. However, this approach cannot be applied to non-conserved genes. Additionally, divergent essentiality information is obtained from studying single cells or whole, multi-cellular organisms, and particularly when derived from human cell line screens and human population studies. We employed machine learning across six model eukaryotes and 60 381 genes, using 41 635 features derived from the sequence, gene function information and network topology. Within a leave-one-organism-out cross-validation, the classifiers showed high generalizability with an average accuracy close to 80% in the left-out species. As a case study, we applied the method to Tribolium castaneum and Bombyx mori and validated predictions experimentally yielding similar performances. Finally, using the classifier based on the studied model organisms enabled linking the essentiality information of human cell line screens and population studies.
Collapse
Affiliation(s)
- Thomas Beder
- Integrated Research and Treatment Center, Center for Sepsis Control and Care (CSCC), Jena University Hospital, Am Klinikum 1, 07747 Jena, Germany
- Institute of Infectious Diseases and Infection Control, Jena University Hospital, Am Klinikum 1, 07747 Jena, Germany
- Department of Internal Medicine II, University Medical Center Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany
| | - Olufemi Aromolaran
- Department of Computer & Information Sciences, Covenant University, Ota, Ogun State, Nigeria
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
| | - Jürgen Dönitz
- Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany
- Department of Medical Bioinformatics, University Medical Center Göttingen (UMG), 37099 Göttingen, Germany
| | - Sofia Tapanelli
- Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Eunice O Adedeji
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
- Department of Biochemistry, Covenant University, Ota, Ogun State, Nigeria
| | - Ezekiel Adebiyi
- Department of Computer & Information Sciences, Covenant University, Ota, Ogun State, Nigeria
- Covenant University Bioinformatics Research (CUBRe), Covenant University, Ota, Ogun State, Nigeria
| | - Gregor Bucher
- Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany
| | - Rainer Koenig
- Integrated Research and Treatment Center, Center for Sepsis Control and Care (CSCC), Jena University Hospital, Am Klinikum 1, 07747 Jena, Germany
- Institute of Infectious Diseases and Infection Control, Jena University Hospital, Am Klinikum 1, 07747 Jena, Germany
| |
Collapse
|
27
|
Campos TL, Korhonen PK, Hofmann A, Gasser RB, Young ND. Harnessing model organism genomics to underpin the machine learning-based prediction of essential genes in eukaryotes - Biotechnological implications. Biotechnol Adv 2021; 54:107822. [PMID: 34461202 DOI: 10.1016/j.biotechadv.2021.107822] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 08/17/2021] [Accepted: 08/24/2021] [Indexed: 12/17/2022]
Abstract
The availability of high-quality genomes and advances in functional genomics have enabled large-scale studies of essential genes in model eukaryotes, including the 'elegant worm' (Caenorhabditis elegans; Nematoda) and the 'vinegar fly' (Drosophila melanogaster; Arthropoda). However, this is not the case for other, much less-studied organisms, such as socioeconomically important parasites, for which functional genomic platforms usually do not exist. Thus, there is a need to develop innovative techniques or approaches for the prediction, identification and investigation of essential genes. A key approach that could enable the prediction of such genes is machine learning (ML). Here, we undertake an historical review of experimental and computational approaches employed for the characterisation of essential genes in eukaryotes, with a particular focus on model ecdysozoans (C. elegans and D. melanogaster), and discuss the possible applicability of ML-approaches to organisms such as socioeconomically important parasites. We highlight some recent results showing that high-performance ML, combined with feature engineering, allows a reliable prediction of essential genes from extensive, publicly available 'omic data sets, with major potential to prioritise such genes (with statistical confidence) for subsequent functional genomic validation. These findings could 'open the door' to fundamental and applied research areas. Evidence of some commonality in the essential gene-complement between these two organisms indicates that an ML-engineering approach could find broader applicability to ecdysozoans such as parasitic nematodes or arthropods, provided that suitably large and informative data sets become/are available for proper feature engineering, and for the robust training and validation of algorithms. This area warrants detailed exploration to, for example, facilitate the identification and characterisation of essential molecules as novel targets for drugs and vaccines against parasitic diseases. This focus is particularly important, given the substantial impact that such diseases have worldwide, and the current challenges associated with their prevention and control and with drug resistance in parasite populations.
Collapse
Affiliation(s)
- Tulio L Campos
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia; Bioinformatics Core Facility, Instituto Aggeu Magalhães, Fundação Oswaldo Cruz (IAM-Fiocruz), Recife, Pernambuco, Brazil
| | - Pasi K Korhonen
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Andreas Hofmann
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia.
| | - Neil D Young
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia.
| |
Collapse
|
28
|
Wang P, Moore BM, Uygun S, Lehti-Shiu MD, Barry CS, Shiu SH. Optimising the use of gene expression data to predict plant metabolic pathway memberships. THE NEW PHYTOLOGIST 2021; 231:475-489. [PMID: 33749860 DOI: 10.1111/nph.17355] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 03/13/2021] [Indexed: 06/12/2023]
Abstract
Plant metabolites from diverse pathways are important for plant survival, human nutrition and medicine. The pathway memberships of most plant enzyme genes are unknown. While co-expression is useful for assigning genes to pathways, expression correlation may exist only under specific spatiotemporal and conditional contexts. Utilising > 600 tomato (Solanum lycopersicum) expression data combinations, three strategies for predicting memberships in 85 pathways were explored. Optimal predictions for different pathways require distinct data combinations indicative of pathway functions. Naive prediction (i.e. identifying pathways with the most similarly expressed genes) is error prone. In 52 pathways, unsupervised learning performed better than supervised approaches, possibly due to limited training data availability. Using gene-to-pathway expression similarities led to prediction models that outperformed those based simply on expression levels. Using 36 experimental validated genes, the pathway-best model prediction accuracy is 58.3%, significantly better compared with that for predicting annotated genes without experimental evidence (37.0%) or random guess (1.2%), demonstrating the importance of data quality. Our study highlights the need to extensively explore expression-based features and prediction strategies to maximise the accuracy of metabolic pathway membership assignment. The prediction framework outlined here can be applied to other species and serves as a baseline model for future comparisons.
Collapse
Affiliation(s)
- Peipei Wang
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Bethany M Moore
- Department of Botany, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | | | - Melissa D Lehti-Shiu
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Cornelius S Barry
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Shin-Han Shiu
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
- Department of Computational Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI, 48824, USA
| |
Collapse
|
29
|
Bowers JE, Paterson AH. Chromosome number is key to longevity of polyploid lineages. THE NEW PHYTOLOGIST 2021; 231:19-28. [PMID: 33772797 DOI: 10.1111/nph.17361] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 03/17/2021] [Indexed: 06/12/2023]
Abstract
Polyploidy is ubiquitous and often recursive in plant lineages, most frequently resulting in extinction but occasionally associated with great evolutionary success. However, instead of chromosome numbers exponentially increasing due to recurrent polyploidy, most angiosperm species have fewer than 14 chromosome pairs. Following genome duplication, diploidisation can render one copy of essential genes nonfunctional without fitness cost. In isolated subpopulations, alternate (homoeologous) gene copies can be lost, creating incompatibilities that reduce fitness of hybrids between subpopulations, constraining exchange of favourable genetic changes and reducing species fitness. When multiple sets of incompatible genes are genetically linked, their deleterious effects are not independent. The effective number of independently acting sets of incompatible loci in hybrids is limited by chromosome number and recombination. Therefore, species with many chromosomes are subject to a higher fitness penalty during diploidisation. Karyotypic changes, especially fusions, that reduce gene flow are normally fitness disadvantages, but during the diploidisation process, can increase fitness by reducing mixing of differentially diploidised alleles. Fitness penalties caused by diploidisation favour accelerated karyotypic change, with each change increasing barriers to gene flow, contributing to speciation. Lower chromosome numbers and increased chromosome fusions confer advantages to surviving the diploidisation process following polyploid formation, by independent mechanisms.
Collapse
Affiliation(s)
- John E Bowers
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30602, USA
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
30
|
Gupta C, Ramegowda V, Basu S, Pereira A. Using Network-Based Machine Learning to Predict Transcription Factors Involved in Drought Resistance. Front Genet 2021; 12:652189. [PMID: 34249082 PMCID: PMC8264776 DOI: 10.3389/fgene.2021.652189] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 05/13/2021] [Indexed: 12/13/2022] Open
Abstract
Gene regulatory networks underpin stress response pathways in plants. However, parsing these networks to prioritize key genes underlying a particular trait is challenging. Here, we have built the Gene Regulation and Association Network (GRAiN) of rice (Oryza sativa). GRAiN is an interactive query-based web-platform that allows users to study functional relationships between transcription factors (TFs) and genetic modules underlying abiotic-stress responses. We built GRAiN by applying a combination of different network inference algorithms to publicly available gene expression data. We propose a supervised machine learning framework that complements GRAiN in prioritizing genes that regulate stress signal transduction and modulate gene expression under drought conditions. Our framework converts intricate network connectivity patterns of 2160 TFs into a single drought score. We observed that TFs with the highest drought scores define the functional, structural, and evolutionary characteristics of drought resistance in rice. Our approach accurately predicted the function of OsbHLH148 TF, which we validated using in vitro protein-DNA binding assays and mRNA sequencing loss-of-function mutants grown under control and drought stress conditions. Our network and the complementary machine learning strategy lends itself to predicting key regulatory genes underlying other agricultural traits and will assist in the genetic engineering of desirable rice varieties.
Collapse
Affiliation(s)
- Chirag Gupta
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Venkategowda Ramegowda
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Supratim Basu
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Andy Pereira
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| |
Collapse
|
31
|
Vangheluwe N, Beeckman T. Lateral Root Initiation and the Analysis of Gene Function Using Genome Editing with CRISPR in Arabidopsis. Genes (Basel) 2021; 12:genes12060884. [PMID: 34201141 PMCID: PMC8227676 DOI: 10.3390/genes12060884] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 06/04/2021] [Accepted: 06/07/2021] [Indexed: 11/24/2022] Open
Abstract
Lateral root initiation is a post-embryonic process that requires the specification of a subset of pericycle cells adjacent to the xylem pole in the primary root into lateral root founder cells. The first visible event of lateral root initiation in Arabidopsis is the simultaneous migration of nuclei in neighbouring founder cells. Coinciding cell cycle activation is essential for founder cells in the pericycle to undergo formative divisions, resulting in the development of a lateral root primordium (LRP). The plant signalling molecule, auxin, is a major regulator of lateral root development; the understanding of the molecular mechanisms controlling lateral root initiation has progressed tremendously by the use of the Arabidopsis model and a continual improvement of molecular methodologies. Here, we provide an overview of the visible events, cell cycle regulators, and auxin signalling cascades related to the initiation of a new LRP. Furthermore, we highlight the potential of genome editing technology to analyse gene function in lateral root initiation, which provides an excellent model to answer fundamental developmental questions such as coordinated cell division, growth axis establishment as well as the specification of cell fate and cell polarity.
Collapse
Affiliation(s)
- Nick Vangheluwe
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium;
- VIB Center for Plant Systems Biology, 9052 Ghent, Belgium
| | - Tom Beeckman
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium;
- VIB Center for Plant Systems Biology, 9052 Ghent, Belgium
- Correspondence:
| |
Collapse
|
32
|
Winkler J, Mylle E, De Meyer A, Pavie B, Merchie J, Grones P, Van Damme D. Visualizing protein-protein interactions in plants by rapamycin-dependent delocalization. THE PLANT CELL 2021; 33:1101-1117. [PMID: 33793859 PMCID: PMC7612334 DOI: 10.1093/plcell/koab004] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 12/15/2020] [Indexed: 05/19/2023]
Abstract
Identifying protein-protein interactions (PPIs) is crucial for understanding biological processes. Many PPI tools are available, yet only some function within the context of a plant cell. Narrowing down even further, only a few tools allow complex multi-protein interactions to be visualized. Here, we present a conditional in vivo PPI tool for plant research that meets these criteria. Knocksideways in plants (KSP) is based on the ability of rapamycin to alter the localization of a bait protein and its interactors via the heterodimerization of FKBP and FRB domains. KSP is inherently free from many limitations of other PPI systems. This in vivo tool does not require spatial proximity of the bait and prey fluorophores and it is compatible with a broad range of fluorophores. KSP is also a conditional tool and therefore the visualization of the proteins in the absence of rapamycin acts as an internal control. We used KSP to confirm previously identified interactions in Nicotiana benthamiana leaf epidermal cells. Furthermore, the scripts that we generated allow the interactions to be quantified at high throughput. Finally, we demonstrate that KSP can easily be used to visualize complex multi-protein interactions. KSP is therefore a versatile tool with unique characteristics and applications that complements other plant PPI methods.
Collapse
Affiliation(s)
- Joanna Winkler
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Evelien Mylle
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Andreas De Meyer
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | | | - Julie Merchie
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Peter Grones
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Daniёl Van Damme
- Ghent University, Department of Plant Biotechnology and Bioinformatics, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| |
Collapse
|
33
|
Jia H, Xue S, Lei L, Fan M, Peng S, Li T, Nagarajan R, Carver B, Ma Z, Deng J, Yan L. A semi-dominant NLR allele causes whole-seedling necrosis in wheat. PLANT PHYSIOLOGY 2021; 186:483-496. [PMID: 33576803 PMCID: PMC8154059 DOI: 10.1093/plphys/kiab058] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 01/26/2021] [Indexed: 05/26/2023]
Abstract
Programmed cell death (PCD) and apoptosis have key functions in development and disease resistance in diverse organisms; however, the induction of necrosis remains poorly understood. Here, we identified a semi-dominant mutant allele that causes the necrotic death of the entire seedling (DES) of wheat (Triticum aestivum L.) in the absence of any pathogen or external stimulus. Positional cloning of the lethal allele mDES1 revealed that this premature death via necrosis was caused by a point mutation from Asp to Asn at amino acid 441 in a nucleotide-binding leucine-rich repeat protein containing nucleotide-binding domain and leucine-rich repeats. The overexpression of mDES1 triggered necrosis and PCD in transgenic plants. However, transgenic wheat harboring truncated wild-type DES1 proteins produced through gene editing that exhibited no significant developmental defects. The point mutation in mDES1 did not cause changes in this protein in the oligomeric state, but mDES1 failed to interact with replication protein A leading to abnormal mitotic cell division. DES1 is an ortholog of Sr35, which recognizes a Puccinia graminis f. sp. tritici stem rust disease effector in wheat, but mDES1 gained function as a direct inducer of plant death. These findings shed light on the intersection of necrosis, apoptosis, and autoimmunity in plants.
Collapse
Affiliation(s)
- Haiyan Jia
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
- Crop Genomics and Bioinformatics Center and National Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Jiangsu, Nanjing 210095, China
| | - Shulin Xue
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
- Crop Genomics and Bioinformatics Center and National Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Jiangsu, Nanjing 210095, China
| | - Lei Lei
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
| | - Min Fan
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
- Crop Genomics and Bioinformatics Center and National Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Jiangsu, Nanjing 210095, China
| | - Shuxia Peng
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Tian Li
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
| | - Ragupathi Nagarajan
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
| | - Brett Carver
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
| | - Zhengqiang Ma
- Crop Genomics and Bioinformatics Center and National Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Jiangsu, Nanjing 210095, China
| | - Junpeng Deng
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078, USA
| | - Liuling Yan
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK 74078, USA
| |
Collapse
|
34
|
Cusack SA, Wang P, Lotreck SG, Moore BM, Meng F, Conner JK, Krysan PJ, Lehti-Shiu MD, Shiu SH. Predictive Models of Genetic Redundancy in Arabidopsis thaliana. Mol Biol Evol 2021; 38:3397-3414. [PMID: 33871641 PMCID: PMC8321531 DOI: 10.1093/molbev/msab111] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Genetic redundancy refers to a situation where an individual with a loss-of-function mutation in one gene (single mutant) does not show an apparent phenotype until one or more paralogs are also knocked out (double/higher-order mutant). Previous studies have identified some characteristics common among redundant gene pairs, but a predictive model of genetic redundancy incorporating a wide variety of features derived from accumulating omics and mutant phenotype data is yet to be established. In addition, the relative importance of these features for genetic redundancy remains largely unclear. Here, we establish machine learning models for predicting whether a gene pair is likely redundant or not in the model plant Arabidopsis thaliana based on six feature categories: functional annotations, evolutionary conservation including duplication patterns and mechanisms, epigenetic marks, protein properties including posttranslational modifications, gene expression, and gene network properties. The definition of redundancy, data transformations, feature subsets, and machine learning algorithms used significantly affected model performance based on holdout, testing phenotype data. Among the most important features in predicting gene pairs as redundant were having a paralog(s) from recent duplication events, annotation as a transcription factor, downregulation during stress conditions, and having similar expression patterns under stress conditions. We also explored the potential reasons underlying mispredictions and limitations of our studies. This genetic redundancy model sheds light on characteristics that may contribute to long-term maintenance of paralogs, and will ultimately allow for more targeted generation of functionally informative double mutants, advancing functional genomic studies.
Collapse
Affiliation(s)
- Siobhan A Cusack
- Cell and Molecular Biology Program, Michigan State University, East Lansing, MI, USA
| | - Peipei Wang
- Department of Plant Biology, Michigan State University, East Lansing, MI, USA
| | - Serena G Lotreck
- Department of Plant Biology, Michigan State University, East Lansing, MI, USA.,Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Bethany M Moore
- Department of Botany, University of Wisconsin-Madison, Madison, WI, USA
| | - Fanrui Meng
- Department of Plant Biology, Michigan State University, East Lansing, MI, USA
| | - Jeffrey K Conner
- Department of Plant Biology, Michigan State University, East Lansing, MI, USA.,Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI, USA.,Kellogg Biological Station, Michigan State University, East Lansing, MI, USA
| | - Patrick J Krysan
- Department of Horticulture, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Shin-Han Shiu
- Cell and Molecular Biology Program, Michigan State University, East Lansing, MI, USA.,Department of Plant Biology, Michigan State University, East Lansing, MI, USA.,Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI, USA.,Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
35
|
Gupta C, Ramegowda V, Basu S, Pereira A. Using Network-Based Machine Learning to Predict Transcription Factors Involved in Drought Resistance. Front Genet 2021. [PMID: 34249082 DOI: 10.1101/2020.04.29.068379] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023] Open
Abstract
Gene regulatory networks underpin stress response pathways in plants. However, parsing these networks to prioritize key genes underlying a particular trait is challenging. Here, we have built the Gene Regulation and Association Network (GRAiN) of rice (Oryza sativa). GRAiN is an interactive query-based web-platform that allows users to study functional relationships between transcription factors (TFs) and genetic modules underlying abiotic-stress responses. We built GRAiN by applying a combination of different network inference algorithms to publicly available gene expression data. We propose a supervised machine learning framework that complements GRAiN in prioritizing genes that regulate stress signal transduction and modulate gene expression under drought conditions. Our framework converts intricate network connectivity patterns of 2160 TFs into a single drought score. We observed that TFs with the highest drought scores define the functional, structural, and evolutionary characteristics of drought resistance in rice. Our approach accurately predicted the function of OsbHLH148 TF, which we validated using in vitro protein-DNA binding assays and mRNA sequencing loss-of-function mutants grown under control and drought stress conditions. Our network and the complementary machine learning strategy lends itself to predicting key regulatory genes underlying other agricultural traits and will assist in the genetic engineering of desirable rice varieties.
Collapse
Affiliation(s)
- Chirag Gupta
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Venkategowda Ramegowda
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Supratim Basu
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Andy Pereira
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| |
Collapse
|
36
|
Tiwari M, Trivedi P, Pandey A. Emerging tools and paradigm shift of gene editing in cereals, fruits, and horticultural crops for enhancing nutritional value and food security. Food Energy Secur 2020. [DOI: 10.1002/fes3.258] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Affiliation(s)
- Manish Tiwari
- National Institute of Plant Genome Research New Delhi India
| | - Prabodh Trivedi
- CSIR‐Central Institute of Medicinal and Aromatic Plants Lucknow India
| | | |
Collapse
|
37
|
Le NQK, Do DT, Hung TNK, Lam LHT, Huynh TT, Nguyen NTK. A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification. Int J Mol Sci 2020; 21:E9070. [PMID: 33260643 PMCID: PMC7730808 DOI: 10.3390/ijms21239070] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Revised: 11/25/2020] [Accepted: 11/26/2020] [Indexed: 01/13/2023] Open
Abstract
Essential genes contain key information of genomes that could be the key to a comprehensive understanding of life and evolution. Because of their importance, studies of essential genes have been considered a crucial problem in computational biology. Computational methods for identifying essential genes have become increasingly popular to reduce the cost and time-consumption of traditional experiments. A few models have addressed this problem, but performance is still not satisfactory because of high dimensional features and the use of traditional machine learning algorithms. Thus, there is a need to create a novel model to improve the predictive performance of this problem from DNA sequence features. This study took advantage of a natural language processing (NLP) model in learning biological sequences by treating them as natural language words. To learn the NLP features, a supervised learning model was consequentially employed by an ensemble deep neural network. Our proposed method could identify essential genes with sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC), and area under the receiver operating characteristic curve (AUC) values of 60.2%, 84.6%, 76.3%, 0.449, and 0.814, respectively. The overall performance outperformed the single models without ensemble, as well as the state-of-the-art predictors on the same benchmark dataset. This indicated the effectiveness of the proposed method in determining essential genes, in particular, and other sequencing problems, in general.
Collapse
Affiliation(s)
- Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 106, Taiwan
- Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 106, Taiwan
- Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan
| | - Duyen Thi Do
- Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei 106, Taiwan;
| | - Truong Nguyen Khanh Hung
- International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan; (T.N.K.H.); (L.H.T.L.)
- Department of Orthopedic and Trauma, Cho Ray Hospital, Ho Chi Minh 70000, Vietnam
| | - Luu Ho Thanh Lam
- International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan; (T.N.K.H.); (L.H.T.L.)
- Intensive Care Unit, Children’s Hospital 2, Ho Chi Minh 70000, Vietnam
| | - Tuan-Tu Huynh
- Department of Electrical Engineering, Yuan Ze University, Taoyuan 320, Taiwan;
- Department of Electrical Electronic and Mechanical Engineering, Lac Hong University, Dong Nai 76120, Vietnam
| | - Ngan Thi Kim Nguyen
- School of Nutrition and Health Sciences, Taipei Medical University, Taipei 110, Taiwan;
| |
Collapse
|
38
|
Xu YC, Guo YL. Less Is More, Natural Loss-of-Function Mutation Is a Strategy for Adaptation. PLANT COMMUNICATIONS 2020; 1:100103. [PMID: 33367264 PMCID: PMC7743898 DOI: 10.1016/j.xplc.2020.100103] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 07/08/2020] [Accepted: 08/12/2020] [Indexed: 05/12/2023]
Abstract
Gene gain and loss are crucial factors that shape the evolutionary success of diverse organisms. In the past two decades, more attention has been paid to the significance of gene gain through gene duplication or de novo genes. However, gene loss through natural loss-of-function (LoF) mutations, which is prevalent in the genomes of diverse organisms, has been largely ignored. With the development of sequencing techniques, many genomes have been sequenced across diverse species and can be used to study the evolutionary patterns of gene loss. In this review, we summarize recent advances in research on various aspects of LoF mutations, including their identification, evolutionary dynamics in natural populations, and functional effects. In particular, we discuss how LoF mutations can provide insights into the minimum gene set (or the essential gene set) of an organism. Furthermore, we emphasize their potential impact on adaptation. At the genome level, although most LoF mutations are neutral or deleterious, at least some of them are under positive selection and may contribute to biodiversity and adaptation. Overall, we highlight the importance of natural LoF mutations as a robust framework for understanding biological questions in general.
Collapse
Affiliation(s)
- Yong-Chao Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ya-Long Guo
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
39
|
Schnable JC. Genes and gene models, an important distinction. THE NEW PHYTOLOGIST 2020; 228:50-55. [PMID: 31241760 DOI: 10.1111/nph.16011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Accepted: 06/07/2019] [Indexed: 05/22/2023]
Abstract
Genome sequencing has fundamentally changed how plant biologists think about genes. All or nearly all genes can ultimately be associated with a gene model. However, many gene models appear to play little or no role in the traits of an organism. A range of structural, molecular, population and evolutionary features all show a separation between genes with known phenotypes and the overall set of annotated gene models. These different features could be combined to develop models to distinguish the genes that determine the traits of plants from the subset gene other annotated gene models which are unlikely to play a role in doing so. Efforts to identify the subset of annotated gene models likely involved in specifying the characteristics of plants would help aid a wide range of researchers.
Collapse
Affiliation(s)
- James C Schnable
- Department of Agronomy and Horticulture and Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE, 68588, USA
| |
Collapse
|
40
|
Applications of CRISPR-Cas in agriculture and plant biotechnology. Nat Rev Mol Cell Biol 2020; 21:661-677. [PMID: 32973356 DOI: 10.1038/s41580-020-00288-9] [Citation(s) in RCA: 347] [Impact Index Per Article: 69.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/11/2020] [Indexed: 12/26/2022]
Abstract
The prokaryote-derived CRISPR-Cas genome editing technology has altered plant molecular biology beyond all expectations. Characterized by robustness and high target specificity and programmability, CRISPR-Cas allows precise genetic manipulation of crop species, which provides the opportunity to create germplasms with beneficial traits and to develop novel, more sustainable agricultural systems. Furthermore, the numerous emerging biotechnologies based on CRISPR-Cas platforms have expanded the toolbox of fundamental research and plant synthetic biology. In this Review, we first briefly describe gene editing by CRISPR-Cas, focusing on the newest, precise gene editing technologies such as base editing and prime editing. We then discuss the most important applications of CRISPR-Cas in increasing plant yield, quality, disease resistance and herbicide resistance, breeding and accelerated domestication. We also highlight the most recent breakthroughs in CRISPR-Cas-related plant biotechnologies, including CRISPR-Cas reagent delivery, gene regulation, multiplexed gene editing and mutagenesis and directed evolution technologies. Finally, we discuss prospective applications of this game-changing technology.
Collapse
|
41
|
Feder A, Jensen S, Wang A, Courtney L, Middleton L, Van Eck J, Liu Y, Giovannoni JJ. Tomato fruit as a model for tissue-specific gene silencing in crop plants. HORTICULTURE RESEARCH 2020; 7:142. [PMID: 32922814 PMCID: PMC7459100 DOI: 10.1038/s41438-020-00363-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 05/24/2020] [Accepted: 07/07/2020] [Indexed: 05/04/2023]
Abstract
Use of CRISPR-Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-CRISPR-associated 9)-mediated genome editing has proliferated for use in numerous plant species to modify gene function and expression, usually in the context of either transient or stably inherited genetic alternations. While extremely useful in many applications, modification of some loci yields outcomes detrimental to further experimental evaluation or viability of the target organism. Expression of Cas9 under a promoter conferring gene knockouts in a tissue-specific subset of genomes has been demonstrated in insect and animal models, and recently in Arabidopsis. We developed an in planta GFP (green fluorescent protein) assay system to demonstrate fruit-specific gene editing in tomato using a phosphoenolpyruvate carboxylase 2 gene promoter. We then targeted a SET-domain containing polycomb protein, SlEZ2, previously shown to yield pleiotropic phenotypes when targeted via 35S-driven RNA interference and we were able to characterize fruit phenotypes absent additional developmental perturbations. Tissue-specific gene editing will have applications in assessing function of essential genes otherwise difficult to study via germline modifications and will provide routes to edited genomes in tissues that could not otherwise be recovered when their germline modification perturbs their normal development.
Collapse
Affiliation(s)
- Ari Feder
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY USA
| | - Sarah Jensen
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY USA
- Section of Plant Breeding and Genetics, Cornell University, Ithaca, NY USA
| | - Anquan Wang
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY USA
- School of Biotechnology and Food Engineering, Hefei University of Technology, 230009 Hefei, China
| | - Lance Courtney
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY USA
- Section of Plant Biology, Cornell University, Ithaca, NY USA
| | - Lesley Middleton
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY USA
| | - Joyce Van Eck
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY USA
| | - Yongsheng Liu
- School of Biotechnology and Food Engineering, Hefei University of Technology, 230009 Hefei, China
| | - James J. Giovannoni
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY USA
- Section of Plant Biology, Cornell University, Ithaca, NY USA
- US Department of Agriculture–Agricultural Research Service, Robert W. Holley Center for Agriculture and Health, Ithaca, NY USA
| |
Collapse
|
42
|
Shi T, Rahmani RS, Gugger PF, Wang M, Li H, Zhang Y, Li Z, Wang Q, Van de Peer Y, Marchal K, Chen J. Distinct Expression and Methylation Patterns for Genes with Different Fates following a Single Whole-Genome Duplication in Flowering Plants. Mol Biol Evol 2020; 37:2394-2413. [PMID: 32343808 PMCID: PMC7403625 DOI: 10.1093/molbev/msaa105] [Citation(s) in RCA: 65] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
For most sequenced flowering plants, multiple whole-genome duplications (WGDs) are found. Duplicated genes following WGD often have different fates that can quickly disappear again, be retained for long(er) periods, or subsequently undergo small-scale duplications. However, how different expression, epigenetic regulation, and functional constraints are associated with these different gene fates following a WGD still requires further investigation due to successive WGDs in angiosperms complicating the gene trajectories. In this study, we investigate lotus (Nelumbo nucifera), an angiosperm with a single WGD during the K-pg boundary. Based on improved intraspecific-synteny identification by a chromosome-level assembly, transcriptome, and bisulfite sequencing, we explore not only the fundamental distinctions in genomic features, expression, and methylation patterns of genes with different fates after a WGD but also the factors that shape post-WGD expression divergence and expression bias between duplicates. We found that after a WGD genes that returned to single copies show the highest levels and breadth of expression, gene body methylation, and intron numbers, whereas the long-retained duplicates exhibit the highest degrees of protein-protein interactions and protein lengths and the lowest methylation in gene flanking regions. For those long-retained duplicate pairs, the degree of expression divergence correlates with their sequence divergence, degree in protein-protein interactions, and expression level, whereas their biases in expression level reflecting subgenome dominance are associated with the bias of subgenome fractionation. Overall, our study on the paleopolyploid nature of lotus highlights the impact of different functional constraints on gene fate and duplicate divergence following a single WGD in plant.
Collapse
Affiliation(s)
- Tao Shi
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Razgar Seyed Rahmani
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - Paul F Gugger
- Appalachian Laboratory, University of Maryland Center for Environmental Science, Frostburg, MD
| | - Muhua Wang
- School of Marine Sciences, Sun Yat-sen University, Guangzhou, China
| | - Hui Li
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yue Zhang
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zhizhong Li
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Qingfeng Wang
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- Sino-African Joint Research Center, Chinese Academy of Sciences, Wuhan, China
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Centre for Plant Systems Biology, VIB, Ghent, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Kathleen Marchal
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Department of Information Technology, IDLab, IMEC, Ghent University, Ghent, Belgium
| | - Jinming Chen
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| |
Collapse
|
43
|
Mahood EH, Kruse LH, Moghe GD. Machine learning: A powerful tool for gene function prediction in plants. APPLICATIONS IN PLANT SCIENCES 2020; 8:e11376. [PMID: 32765975 PMCID: PMC7394712 DOI: 10.1002/aps3.11376] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 03/19/2020] [Indexed: 05/06/2023]
Abstract
Recent advances in sequencing and informatic technologies have led to a deluge of publicly available genomic data. While it is now relatively easy to sequence, assemble, and identify genic regions in diploid plant genomes, functional annotation of these genes is still a challenge. Over the past decade, there has been a steady increase in studies utilizing machine learning algorithms for various aspects of functional prediction, because these algorithms are able to integrate large amounts of heterogeneous data and detect patterns inconspicuous through rule-based approaches. The goal of this review is to introduce experimental plant biologists to machine learning, by describing how it is currently being used in gene function prediction to gain novel biological insights. In this review, we discuss specific applications of machine learning in identifying structural features in sequenced genomes, predicting interactions between different cellular components, and predicting gene function and organismal phenotypes. Finally, we also propose strategies for stimulating functional discovery using machine learning-based approaches in plants.
Collapse
Affiliation(s)
- Elizabeth H. Mahood
- Plant Biology SectionSchool of Integrative Plant SciencesCornell UniversityIthacaNew York14853USA
| | - Lars H. Kruse
- Plant Biology SectionSchool of Integrative Plant SciencesCornell UniversityIthacaNew York14853USA
| | - Gaurav D. Moghe
- Plant Biology SectionSchool of Integrative Plant SciencesCornell UniversityIthacaNew York14853USA
| |
Collapse
|
44
|
Meinke DW. Genome-wide identification of EMBRYO-DEFECTIVE (EMB) genes required for growth and development in Arabidopsis. THE NEW PHYTOLOGIST 2020; 226:306-325. [PMID: 31334862 DOI: 10.1111/nph.16071] [Citation(s) in RCA: 104] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Accepted: 07/10/2019] [Indexed: 05/20/2023]
Abstract
With the emergence of high-throughput methods in plant biology, the importance of long-term projects characterized by incremental advances involving multiple laboratories can sometimes be overlooked. Here, I highlight my 40-year effort to isolate and characterize the most common class of mutants encountered in Arabidopsis (Arabidopsis thaliana): those defective in embryo development. I present an updated dataset of 510 EMBRYO-DEFECTIVE (EMB) genes identified throughout the Arabidopsis community; include important details on 2200 emb mutants and 241 pigment-defective embryo (pde) mutants analyzed in my laboratory; provide curated datasets with key features and publication links for each EMB gene identified; revisit past estimates of 500-1000 total EMB genes in Arabidopsis; document 83 double mutant combinations reported to disrupt embryo development; emphasize the importance of following established nomenclature guidelines and acknowledging allele history in research publications; and consider how best to extend community-based curation and screening efforts to approach saturation for this diverse class of mutants in the future. Continued advances in identifying EMB genes and characterizing their loss-of-function mutant alleles are needed to understand genotype-to-phenotype relationships in Arabidopsis on a broad scale, and to document the contributions of large numbers of essential genes to plant growth and development.
Collapse
Affiliation(s)
- David W Meinke
- Department of Plant Biology, Ecology, and Evolution, Oklahoma State University, Stillwater, OK, 74078, USA
| |
Collapse
|
45
|
Network Embedding the Protein-Protein Interaction Network for Human Essential Genes Identification. Genes (Basel) 2020; 11:genes11020153. [PMID: 32023848 PMCID: PMC7074227 DOI: 10.3390/genes11020153] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 01/27/2020] [Accepted: 01/29/2020] [Indexed: 11/18/2022] Open
Abstract
Essential genes are a group of genes that are indispensable for cell survival and cell fertility. Studying human essential genes helps scientists reveal the underlying biological mechanisms of a human cell but also guides disease treatment. Recently, the publication of human essential gene data makes it possible for researchers to train a machine-learning classifier by using some features of the known human essential genes and to use the classifier to predict new human essential genes. Previous studies have found that the essentiality of genes closely relates to their properties in the protein–protein interaction (PPI) network. In this work, we propose a novel supervised method to predict human essential genes by network embedding the PPI network. Our approach implements a bias random walk on the network to get the node network context. Then, the node pairs are input into an artificial neural network to learn their representation vectors that maximally preserves network structure and the properties of the nodes in the network. Finally, the features are put into an SVM classifier to predict human essential genes. The prediction results on two human PPI networks show that our method achieves better performance than those that refer to either genes’ sequence information or genes’ centrality properties in the network as input features. Moreover, it also outperforms the methods that represent the PPI network by other previous approaches.
Collapse
|
46
|
Golicz AA, Bayer PE, Bhalla PL, Batley J, Edwards D. Pangenomics Comes of Age: From Bacteria to Plant and Animal Applications. Trends Genet 2019; 36:132-145. [PMID: 31882191 DOI: 10.1016/j.tig.2019.11.006] [Citation(s) in RCA: 111] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 11/09/2019] [Accepted: 11/12/2019] [Indexed: 02/01/2023]
Abstract
The pangenome refers to a collection of genomic sequence found in the entire species or population rather than in a single individual; the sequence can be core, present in all individuals, or accessory (variable or dispensable), found in a subset of individuals only. While pangenomic studies were first undertaken in bacterial species, developments in genome sequencing and assembly approaches have allowed construction of pangenomes for eukaryotic organisms, fungi, plants, and animals, including two large-scale human pangenome projects. Analysis of the these pangenomes revealed key differences, most likely stemming from divergent evolutionary histories, but also surprising similarities.
Collapse
Affiliation(s)
- Agnieszka A Golicz
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, Australia.
| | - Philipp E Bayer
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia
| | - Prem L Bhalla
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, Australia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia.
| |
Collapse
|
47
|
Decaestecker W, Buono RA, Pfeiffer ML, Vangheluwe N, Jourquin J, Karimi M, Van Isterdael G, Beeckman T, Nowack MK, Jacobs TB. CRISPR-TSKO: A Technique for Efficient Mutagenesis in Specific Cell Types, Tissues, or Organs in Arabidopsis. THE PLANT CELL 2019; 31:2868-2887. [PMID: 31562216 DOI: 10.1101/474981] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 09/25/2019] [Indexed: 05/26/2023]
Abstract
Detailed functional analyses of many fundamentally important plant genes via conventional loss-of-function approaches are impeded by the severe pleiotropic phenotypes resulting from these losses. In particular, mutations in genes that are required for basic cellular functions and/or reproduction often interfere with the generation of homozygous mutant plants, precluding further functional studies. To overcome this limitation, we devised a clustered regularly interspaced short palindromic repeats (CRISPR)-based tissue-specific knockout system, CRISPR-TSKO, enabling the generation of somatic mutations in particular plant cell types, tissues, and organs. In Arabidopsis (Arabidopsis thaliana), CRISPR-TSKO mutations in essential genes caused well-defined, localized phenotypes in the root cap, stomatal lineage, or entire lateral roots. The modular cloning system developed in this study allows for the efficient selection, identification, and functional analysis of mutant lines directly in the first transgenic generation. The efficacy of CRISPR-TSKO opens avenues for discovering and analyzing gene functions in the spatial and temporal contexts of plant life while avoiding the pleiotropic effects of system-wide losses of gene function.
Collapse
Affiliation(s)
- Ward Decaestecker
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Rafael Andrade Buono
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Marie L Pfeiffer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Nick Vangheluwe
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Joris Jourquin
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Mansour Karimi
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Gert Van Isterdael
- VIB Flow Core, VIB Center for Inflammation Research, Technologiepark 71, B-9052 Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
| | - Tom Beeckman
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Moritz K Nowack
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Thomas B Jacobs
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| |
Collapse
|
48
|
Decaestecker W, Buono RA, Pfeiffer ML, Vangheluwe N, Jourquin J, Karimi M, Van Isterdael G, Beeckman T, Nowack MK, Jacobs TB. CRISPR-TSKO: A Technique for Efficient Mutagenesis in Specific Cell Types, Tissues, or Organs in Arabidopsis. THE PLANT CELL 2019; 31:2868-2887. [PMID: 31562216 PMCID: PMC6925012 DOI: 10.1105/tpc.19.00454] [Citation(s) in RCA: 143] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Revised: 08/23/2019] [Accepted: 09/25/2019] [Indexed: 05/18/2023]
Abstract
Detailed functional analyses of many fundamentally important plant genes via conventional loss-of-function approaches are impeded by the severe pleiotropic phenotypes resulting from these losses. In particular, mutations in genes that are required for basic cellular functions and/or reproduction often interfere with the generation of homozygous mutant plants, precluding further functional studies. To overcome this limitation, we devised a clustered regularly interspaced short palindromic repeats (CRISPR)-based tissue-specific knockout system, CRISPR-TSKO, enabling the generation of somatic mutations in particular plant cell types, tissues, and organs. In Arabidopsis (Arabidopsis thaliana), CRISPR-TSKO mutations in essential genes caused well-defined, localized phenotypes in the root cap, stomatal lineage, or entire lateral roots. The modular cloning system developed in this study allows for the efficient selection, identification, and functional analysis of mutant lines directly in the first transgenic generation. The efficacy of CRISPR-TSKO opens avenues for discovering and analyzing gene functions in the spatial and temporal contexts of plant life while avoiding the pleiotropic effects of system-wide losses of gene function.
Collapse
Affiliation(s)
- Ward Decaestecker
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Rafael Andrade Buono
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Marie L Pfeiffer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Nick Vangheluwe
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Joris Jourquin
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Mansour Karimi
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Gert Van Isterdael
- VIB Flow Core, VIB Center for Inflammation Research, Technologiepark 71, B-9052 Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
| | - Tom Beeckman
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Moritz K Nowack
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Thomas B Jacobs
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| |
Collapse
|
49
|
QTG-Finder: A Machine-Learning Based Algorithm To Prioritize Causal Genes of Quantitative Trait Loci in Arabidopsis and Rice. G3-GENES GENOMES GENETICS 2019; 9:3129-3138. [PMID: 31358562 PMCID: PMC6778793 DOI: 10.1534/g3.119.400319] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Linkage mapping is one of the most commonly used methods to identify genetic loci that determine a trait. However, the loci identified by linkage mapping may contain hundreds of candidate genes and require a time-consuming and labor-intensive fine mapping process to find the causal gene controlling the trait. With the availability of a rich assortment of genomic and functional genomic data, it is possible to develop a computational method to facilitate faster identification of causal genes. We developed QTG-Finder, a machine learning based algorithm to prioritize causal genes by ranking genes within a quantitative trait locus (QTL). Two predictive models were trained separately based on known causal genes in Arabidopsis and rice. An independent validation analysis showed that the models could recall about 64% of Arabidopsis and 79% of rice causal genes when the top 20% ranked genes were considered. The top 20% ranked genes can range from 10 to 100 genes, depending on the size of a QTL. The models can prioritize different types of traits though at different efficiency. We also identified several important features of causal genes including paralog copy number, being a transporter, being a transcription factor, and containing SNPs that cause premature stop codon. This work lays the foundation for systematically understanding characteristics of causal genes and establishes a pipeline to predict causal genes based on public data.
Collapse
|
50
|
Evolutionary characteristics of intergenic transcribed regions indicate rare novel genes and widespread noisy transcription in the Poaceae. Sci Rep 2019; 9:12122. [PMID: 31431676 PMCID: PMC6702216 DOI: 10.1038/s41598-019-47797-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 07/19/2019] [Indexed: 01/19/2023] Open
Abstract
Extensive transcriptional activity occurring in intergenic regions of genomes has raised the question whether intergenic transcription represents the activity of novel genes or noisy expression. To address this, we evaluated cross-species and post-duplication sequence and expression conservation of intergenic transcribed regions (ITRs) in four Poaceae species. Among 43,301 ITRs across the four species, 34,460 (80%) are species-specific. ITRs found across species tend to be more divergent in expression and have more recent duplicates compared to annotated genes. To assess if ITRs are functional (under selection), machine learning models were established in Oryza sativa (rice) that could accurately distinguish between phenotype genes and pseudogenes (area under curve-receiver operating characteristic = 0.94). Based on the models, 584 (8%) and 4391 (61%) rice ITRs are classified as likely functional and nonfunctional with high confidence, respectively. ITRs with conserved expression and ancient retained duplicates, features that were not part of the model, are frequently classified as likely-functional, suggesting these characteristics could serve as pragmatic rules of thumb for identifying candidate sequences likely to be under selection. This study also provides a framework to identify novel genes using comparative transcriptomic data to improve genome annotation that is fundamental for connecting genotype to phenotype in crop and model systems.
Collapse
|