1
|
Zhang J, Basu S, Kurgan L. HybridDBRpred: improved sequence-based prediction of DNA-binding amino acids using annotations from structured complexes and disordered proteins. Nucleic Acids Res 2024; 52:e10. [PMID: 38048333 PMCID: PMC10810184 DOI: 10.1093/nar/gkad1131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 11/10/2023] [Indexed: 12/06/2023] Open
Abstract
Current predictors of DNA-binding residues (DBRs) from protein sequences belong to two distinct groups, those trained on binding annotations extracted from structured protein-DNA complexes (structure-trained) vs. intrinsically disordered proteins (disorder-trained). We complete the first empirical analysis of predictive performance across the structure- and disorder-annotated proteins for a representative collection of ten predictors. Majority of the structure-trained tools perform well on the structure-annotated proteins while doing relatively poorly on the disorder-annotated proteins, and vice versa. Several methods make accurate predictions for the structure-annotated proteins or the disorder-annotated proteins, but none performs highly accurately for both annotation types. Moreover, most predictors make excessive cross-predictions for the disorder-annotated proteins, where residues that interact with non-DNA ligand types are predicted as DBRs. Motivated by these results, we design, validate and deploy an innovative meta-model, hybridDBRpred, that uses deep transformer network to combine predictions generated by three best current predictors. HybridDBRpred provides accurate predictions and low levels of cross-predictions across the two annotation types, and is statistically more accurate than each of the ten tools and baseline meta-predictors that rely on averaging and logistic regression. We deploy hybridDBRpred as a convenient web server at http://biomine.cs.vcu.edu/servers/hybridDBRpred/ and provide the corresponding source code at https://github.com/jianzhang-xynu/hybridDBRpred.
Collapse
Affiliation(s)
- Jian Zhang
- School of Computer and Information Technology, Xinyang Normal University, Xinyang 464000, PR China
| | - Sushmita Basu
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
2
|
Grin IR, Petrova DV, Endutkin AV, Ma C, Yu B, Li H, Zharkov DO. Base Excision DNA Repair in Plants: Arabidopsis and Beyond. Int J Mol Sci 2023; 24:14746. [PMID: 37834194 PMCID: PMC10573277 DOI: 10.3390/ijms241914746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 09/27/2023] [Accepted: 09/27/2023] [Indexed: 10/15/2023] Open
Abstract
Base excision DNA repair (BER) is a key pathway safeguarding the genome of all living organisms from damage caused by both intrinsic and environmental factors. Most present knowledge about BER comes from studies of human cells, E. coli, and yeast. Plants may be under an even heavier DNA damage threat from abiotic stress, reactive oxygen species leaking from the photosynthetic system, and reactive secondary metabolites. In general, BER in plant species is similar to that in humans and model organisms, but several important details are specific to plants. Here, we review the current state of knowledge about BER in plants, with special attention paid to its unique features, such as the existence of active epigenetic demethylation based on the BER machinery, the unexplained diversity of alkylation damage repair enzymes, and the differences in the processing of abasic sites that appear either spontaneously or are generated as BER intermediates. Understanding the biochemistry of plant DNA repair, especially in species other than the Arabidopsis model, is important for future efforts to develop new crop varieties.
Collapse
Affiliation(s)
- Inga R. Grin
- Siberian Branch of the Russian Academy of Sciences Institute of Chemical Biology and Fundamental Medicine, 8 Lavrentieva Ave., Novosibirsk 630090, Russia; (D.V.P.); (A.V.E.)
- Department of Natural Sciences, Novosibirsk State University, 2 Pirogova St., Novosibirsk 630090, Russia
| | - Daria V. Petrova
- Siberian Branch of the Russian Academy of Sciences Institute of Chemical Biology and Fundamental Medicine, 8 Lavrentieva Ave., Novosibirsk 630090, Russia; (D.V.P.); (A.V.E.)
| | - Anton V. Endutkin
- Siberian Branch of the Russian Academy of Sciences Institute of Chemical Biology and Fundamental Medicine, 8 Lavrentieva Ave., Novosibirsk 630090, Russia; (D.V.P.); (A.V.E.)
| | - Chunquan Ma
- Engineering Research Center of Agricultural Microbiology Technology, Ministry of Education, Harbin 150080, China; (C.M.); (B.Y.); (H.L.)
- Heilongjiang Provincial Key Laboratory of Plant Genetic Engineering and Biological Fermentation Engineering for Cold Region, Harbin 150080, China
- School of Life Sciences, Heilongjiang University, Harbin 150080, China
| | - Bing Yu
- Engineering Research Center of Agricultural Microbiology Technology, Ministry of Education, Harbin 150080, China; (C.M.); (B.Y.); (H.L.)
- Heilongjiang Provincial Key Laboratory of Plant Genetic Engineering and Biological Fermentation Engineering for Cold Region, Harbin 150080, China
- School of Life Sciences, Heilongjiang University, Harbin 150080, China
| | - Haiying Li
- Engineering Research Center of Agricultural Microbiology Technology, Ministry of Education, Harbin 150080, China; (C.M.); (B.Y.); (H.L.)
- Heilongjiang Provincial Key Laboratory of Plant Genetic Engineering and Biological Fermentation Engineering for Cold Region, Harbin 150080, China
- School of Life Sciences, Heilongjiang University, Harbin 150080, China
| | - Dmitry O. Zharkov
- Siberian Branch of the Russian Academy of Sciences Institute of Chemical Biology and Fundamental Medicine, 8 Lavrentieva Ave., Novosibirsk 630090, Russia; (D.V.P.); (A.V.E.)
- Department of Natural Sciences, Novosibirsk State University, 2 Pirogova St., Novosibirsk 630090, Russia
| |
Collapse
|
3
|
Wu Z, Basu S, Wu X, Kurgan L. qNABpredict: Quick, accurate, and taxonomy-aware sequence-based prediction of content of nucleic acid binding amino acids. Protein Sci 2023; 32:e4544. [PMID: 36519304 PMCID: PMC9798252 DOI: 10.1002/pro.4544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 12/07/2022] [Accepted: 12/08/2022] [Indexed: 12/23/2022]
Abstract
Protein sequence-based predictors of nucleic acid (NA)-binding include methods that predict NA-binding proteins and NA-binding residues. The residue-level tools produce more details but suffer high computational cost since they must predict every amino acid in the input sequence and rely on multiple sequence alignments. We propose an alternative approach that predicts content (fraction) of the NA-binding residues, offering more information than the protein-level prediction and much shorter runtime than the residue-level tools. Our first-of-its-kind content predictor, qNABpredict, relies on a small, rationally designed and fast-to-compute feature set that represents relevant characteristics extracted from the input sequence and a well-parametrized support vector regression model. We provide two versions of qNABpredict, a taxonomy-agnostic model that can be used for proteins of unknown taxonomic origin and more accurate taxonomy-aware models that are tailored to specific taxonomic kingdoms: archaea, bacteria, eukaryota, and viruses. Empirical tests on a low-similarity test dataset show that qNABpredict is 100 times faster and generates statistically more accurate content predictions when compared to the content extracted from results produced by the residue-level predictors. We also show that qNABpredict's content predictions can be used to improve results generated by the residue-level predictors. We release qNABpredict as a convenient webserver and source code at http://biomine.cs.vcu.edu/servers/qNABpredict/. This new tool should be particularly useful to predict details of protein-NA interactions for large protein families and proteomes.
Collapse
Affiliation(s)
- Zhonghua Wu
- School of Mathematical Sciences and LPMCNankai UniversityTianjinChina
| | - Sushmita Basu
- Department of Computer ScienceVirginia Commonwealth UniversityRichmondVirginiaUSA
| | - Xuantai Wu
- School of Mathematical Sciences and LPMCNankai UniversityTianjinChina
| | - Lukasz Kurgan
- Department of Computer ScienceVirginia Commonwealth UniversityRichmondVirginiaUSA
| |
Collapse
|
4
|
Kumar A, Sharma S, Chunduri V, Kaur A, Kaur S, Malhotra N, Kumar A, Kapoor P, Kumari A, Kaur J, Sonah H, Garg M. Genome-wide Identification and Characterization of Heat Shock Protein Family Reveals Role in Development and Stress Conditions in Triticum aestivum L. Sci Rep 2020; 10:7858. [PMID: 32398647 PMCID: PMC7217896 DOI: 10.1038/s41598-020-64746-2] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 04/01/2020] [Indexed: 12/02/2022] Open
Abstract
Heat shock proteins (HSPs) have a significant role in protein folding and are considered as prominent candidates for development of heat-tolerant crops. Understanding of wheat HSPs has great importance since wheat is severely affected by heat stress, particularly during the grain filling stage. In the present study, efforts were made to identify HSPs in wheat and to understand their role during plant development and under different stress conditions. HSPs in wheat genome were first identified by using Position-Specific Scoring Matrix (PSSMs) of known HSP domains and then also confirmed by sequence homology with already known HSPs. Collectively, 753 TaHSPs including 169 TaSHSP, 273 TaHSP40, 95 TaHSP60, 114 TaHSP70, 18 TaHSP90 and 84 TaHSP100 were identified in the wheat genome. Compared with other grass species, number of HSPs in wheat was relatively high probably due to the higher ploidy level. Large number of tandem duplication was identified in TaHSPs, especially TaSHSPs. The TaHSP genes showed random distribution on chromosomes, however, there were more TaHSPs in B and D sub-genomes as compared to the A sub-genome. Extensive computational analysis was performed using the available genomic resources to understand gene structure, gene expression and phylogentic relationship of TaHSPs. Interestingly, apart from high expression under heat stress, high expression of TaSHSP was also observed during seed development. The study provided a list of candidate HSP genes for improving thermo tolerance during developmental stages and also for understanding the seed development process in bread wheat.
Collapse
Affiliation(s)
- Ashish Kumar
- South Asian University, Chankyapuri, New Delhi, 110021, India
| | - Saloni Sharma
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India
| | - Venkatesh Chunduri
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India
| | - Amandeep Kaur
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India
| | - Satinder Kaur
- Punjab Agricultural University, Ludhiana, 141004, India
| | - Nikhil Malhotra
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India
| | - Aman Kumar
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India
| | - Payal Kapoor
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India
| | - Anita Kumari
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India
| | | | - Humira Sonah
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India.
| | - Monika Garg
- Agri-Biotechnology Division, National Agri-Food Biotechnology Institute (NABI), S.A.S. Nagar (Mohali), Punjab, India.
| |
Collapse
|
5
|
Roldán-Arjona T, Ariza RR, Córdoba-Cañero D. DNA Base Excision Repair in Plants: An Unfolding Story With Familiar and Novel Characters. FRONTIERS IN PLANT SCIENCE 2019; 10:1055. [PMID: 31543887 PMCID: PMC6728418 DOI: 10.3389/fpls.2019.01055] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Accepted: 07/30/2019] [Indexed: 05/05/2023]
Abstract
Base excision repair (BER) is a critical genome defense pathway that deals with a broad range of non-voluminous DNA lesions induced by endogenous or exogenous genotoxic agents. BER is a complex process initiated by the excision of the damaged base, proceeds through a sequence of reactions that generate various DNA intermediates, and culminates with restoration of the original DNA structure. BER has been extensively studied in microbial and animal systems, but knowledge in plants has lagged behind until recently. Results obtained so far indicate that plants share many BER factors with other organisms, but also possess some unique features and combinations. Plant BER plays an important role in preserving genome integrity through removal of damaged bases. However, it performs additional important functions, such as the replacement of the naturally modified base 5-methylcytosine with cytosine in a plant-specific pathway for active DNA demethylation.
Collapse
Affiliation(s)
- Teresa Roldán-Arjona
- Maimónides Biomedical Research Institute of Córdoba (IMIBIC), Córdoba, Spain
- Department of Genetics, University of Córdoba, Córdoba, Spain
- Reina Sofia University Hospital, Córdoba, Spain
| | - Rafael R. Ariza
- Maimónides Biomedical Research Institute of Córdoba (IMIBIC), Córdoba, Spain
- Department of Genetics, University of Córdoba, Córdoba, Spain
- Reina Sofia University Hospital, Córdoba, Spain
| | - Dolores Córdoba-Cañero
- Maimónides Biomedical Research Institute of Córdoba (IMIBIC), Córdoba, Spain
- Department of Genetics, University of Córdoba, Córdoba, Spain
- Reina Sofia University Hospital, Córdoba, Spain
| |
Collapse
|
6
|
Baizan-Edge A, Cock P, MacFarlane S, McGavin W, Torrance L, Jones S. Kodoja: A workflow for virus detection in plants using k-mer analysis of RNA-sequencing data. J Gen Virol 2019; 100:533-542. [PMID: 30676315 DOI: 10.1099/jgv.0.001210] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
RNA-sequencing of plant material allows for hypothesis-free detection of multiple viruses simultaneously. This methodology relies on bioinformatics workflows for virus identification. Most workflows are designed for human clinical data, and few go beyond sequence mapping for virus identification. We present a new workflow (Kodoja) for the detection of plant virus sequences in RNA-sequence data. Kodoja uses k-mer profiling at the nucleotide level and sequence mapping at the protein level by integrating two existing tools Kraken and Kaiju. Kodoja was tested on three existing RNA-seq datasets from grapevine, and two new RNA-seq datasets from raspberry. For grapevine, Kodoja was shown to be more sensitive than a method based on contig building and blast alignments (27 viruses detected compared to 19). The application of Kodoja to raspberry, showed that field-grown raspberries were infected by multiple viruses, and that RNA-seq can identify lower amounts of virus material than reverse transcriptase PCR. This work enabled the design of new PCR-primers for detection of Raspberry yellow net virus and Beet ringspot virus. Kodoja is a sensitive method for plant virus discovery in field samples and enables the design of more accurate primers for detection. Kodoja is available to install through Bioconda and as a tool within Galaxy.
Collapse
Affiliation(s)
- Amanda Baizan-Edge
- 1The School of Biology, University of St Andrews, Biomedical Sciences Research Complex, St Andrews, KY16 9ST, UK
| | - Peter Cock
- 2Information and Computational Sciences Group, The James Hutton Institute, Dundee, DD2 5DA, UK
| | - Stuart MacFarlane
- 3Cell and Molecular Sciences Group, The James Hutton Institute, Dundee, DD2 5DA, UK
| | - Wendy McGavin
- 3Cell and Molecular Sciences Group, The James Hutton Institute, Dundee, DD2 5DA, UK
| | - Lesley Torrance
- 1The School of Biology, University of St Andrews, Biomedical Sciences Research Complex, St Andrews, KY16 9ST, UK.,3Cell and Molecular Sciences Group, The James Hutton Institute, Dundee, DD2 5DA, UK
| | - Susan Jones
- 2Information and Computational Sciences Group, The James Hutton Institute, Dundee, DD2 5DA, UK
| |
Collapse
|
7
|
Jones S, Baizan-Edge A, MacFarlane S, Torrance L. Viral Diagnostics in Plants Using Next Generation Sequencing: Computational Analysis in Practice. FRONTIERS IN PLANT SCIENCE 2017; 8:1770. [PMID: 29123534 PMCID: PMC5662881 DOI: 10.3389/fpls.2017.01770] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Accepted: 09/28/2017] [Indexed: 05/04/2023]
Abstract
Viruses cause significant yield and quality losses in a wide variety of cultivated crops. Hence, the detection and identification of viruses is a crucial facet of successful crop production and of great significance in terms of world food security. Whilst the adoption of molecular techniques such as RT-PCR has increased the speed and accuracy of viral diagnostics, such techniques only allow the detection of known viruses, i.e., each test is specific to one or a small number of related viruses. Therefore, unknown viruses can be missed and testing can be slow and expensive if molecular tests are unavailable. Methods for simultaneous detection of multiple viruses have been developed, and (NGS) is now a principal focus of this area, as it enables unbiased and hypothesis-free testing of plant samples. The development of NGS protocols capable of detecting multiple known and emergent viruses present in infected material is proving to be a major advance for crops, nuclear stocks or imported plants and germplasm, in which disease symptoms are absent, unspecific or only triggered by multiple viruses. Researchers want to answer the question "how many different viruses are present in this crop plant?" without knowing what they are looking for: RNA-sequencing (RNA-seq) of plant material allows this question to be addressed. As well as needing efficient nucleic acid extraction and enrichment protocols, virus detection using RNA-seq requires fast and robust bioinformatics methods to enable host sequence removal and virus classification. In this review recent studies that use RNA-seq for virus detection in a variety of crop plants are discussed with specific emphasis on the computational methods implemented. The main features of a number of specific bioinformatics workflows developed for virus detection from NGS data are also outlined and possible reasons why these have not yet been widely adopted are discussed. The review concludes by discussing the future directions of this field, including the use of bioinformatics tools for virus detection deployed in analytical environments using cloud computing.
Collapse
Affiliation(s)
- Susan Jones
- Information and Computational Science Group, The James Hutton Institute, Dundee, United Kingdom
| | - Amanda Baizan-Edge
- School of Biology, The University of St Andrews, St Andrews, United Kingdom
| | - Stuart MacFarlane
- Cell and Molecular Science Group, The James Hutton Institute, Dundee, United Kingdom
| | - Lesley Torrance
- School of Biology, The University of St Andrews, St Andrews, United Kingdom
- Cell and Molecular Science Group, The James Hutton Institute, Dundee, United Kingdom
| |
Collapse
|
8
|
Ancestor of land plants acquired the DNA-3-methyladenine glycosylase (MAG) gene from bacteria through horizontal gene transfer. Sci Rep 2017; 7:9324. [PMID: 28839126 PMCID: PMC5570899 DOI: 10.1038/s41598-017-05066-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Accepted: 05/23/2017] [Indexed: 02/07/2023] Open
Abstract
The origin and evolution of land plants was an important event in the history of life and initiated the establishment of modern terrestrial ecosystems. From water to terrestrial environments, plants needed to overcome the enhanced ultraviolet (UV) radiation and many other DNA-damaging agents. Evolving new genes with the function of DNA repair is critical for the origin and radiation of land plants. In bacteria, the DNA-3-methyladenine glycosylase (MAG) recognizes of a variety of base lesions and initiates the process of the base excision repair for damaged DNA. The homologs of MAG gene are present in all major lineages of streptophytes, and both the phylogenic and sequence similarity analyses revealed that green plant MAG gene originated through an ancient horizontal gene transfer (HGT) event from bacteria. Experimental evidence demonstrated that the expression of the maize ZmMAG gene was induced by UV and zeocin, both of which are known as DNA-damaging agents. Further investigation revealed that Streptophyta MAG genes had undergone positive selection during the initial evolutionary period in the ancestor of land plants. Our findings demonstrated that the ancient HGT of MAG to the ancestor of land plants probably played an important role in preadaptation to DNA-damaging agents in terrestrial environments.
Collapse
|
9
|
Ghosh P, Sowdhamini R. Genome-wide survey of putative RNA-binding proteins encoded in the human proteome. MOLECULAR BIOSYSTEMS 2016; 12:532-40. [PMID: 26675803 DOI: 10.1039/c5mb00638d] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
RNA-binding proteins (RBPs) are involved in various post-transcriptional gene regulatory processes and are also functionally important members of the ribosome and the spliceosome. However, RBPs and their interactions with RNA are less well-studied in comparison to DNA-binding proteins. We have classified the existing RBP structures, available in complexes with RNA and RNA/DNA hybrids, into different structural families and created Hidden Markov Models (HMMs). These structure-centric family HMMs, along with the sequence-centric family HMMs, were used as a primary database to systematically search the human proteome for the presence of putative RBPs. We have found more than 2600 gene products with RBP signatures in humans, of which around 28% are likely to bind to RNA but not DNA, whereas 9% might bind to both RNA and DNA. 11% of them do not contain an explicit functional annotation yet. Nearly 30% of the putative RBPs are exclusively nuclear, 15% have known disease associations and around 30% are enzymes. Around 40% of the proteins identified in this study are novel and have not been reported by recent large-scale studies on human RBPs.
Collapse
Affiliation(s)
- Pritha Ghosh
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bellary Road, Bangalore, Karnataka 560 065, India.
| | - R Sowdhamini
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bellary Road, Bangalore, Karnataka 560 065, India.
| |
Collapse
|
10
|
Jiang SY, Ramachandran S. Expansion Mechanisms and Evolutionary History on Genes Encoding DNA Glycosylases and Their Involvement in Stress and Hormone Signaling. Genome Biol Evol 2016; 8:1165-84. [PMID: 27026054 PMCID: PMC4860697 DOI: 10.1093/gbe/evw067] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
DNA glycosylases catalyze the release of methylated bases. They play vital roles in the base excision repair pathway and might also function in DNA demethylation. At least three families of DNA glycosylases have been identified, which included 3′-methyladenine DNA glycosylase (MDG) I, MDG II, and HhH-GPD (Helix–hairpin–Helix and Glycine/Proline/aspartate (D)). However, little is known on their genome-wide identification, expansion, and evolutionary history as well as their expression profiling and biological functions. In this study, we have genome-widely identified and evolutionarily characterized these family members. Generally, a genome encodes only one MDG II gene in most of organisms. No MDG I or MDG II gene was detected in green algae. However, HhH-GPD genes were detectable in all available organisms. The ancestor species contain small size of MDG I and HhH-GPD families. These two families were mainly expanded through the whole-genome duplication and segmental duplication. They were evolutionarily conserved and were generally under purifying selection. However, we have detected recent positive selection among the Oryza genus, which might play roles in species divergence. Further investigation showed that expression divergence played important roles in gene survival after expansion. All of these family genes were expressed in most of developmental stages and tissues in rice plants. High ratios of family genes were downregulated by drought and fungus pathogen as well as abscisic acid (ABA) and jasmonic acid (JA) treatments, suggesting a negative regulation in response to drought stress and pathogen infection through ABA- and/or JA-dependent hormone signaling pathway.
Collapse
Affiliation(s)
- Shu-Ye Jiang
- Genome Structural Biology Group, Temasek Life Science Laboratory, The National University of Singapore, Singapore
| | - Srinivasan Ramachandran
- Genome Structural Biology Group, Temasek Life Science Laboratory, The National University of Singapore, Singapore
| |
Collapse
|
11
|
Matthijs M, Fabris M, Broos S, Vyverman W, Goossens A. Profiling of the Early Nitrogen Stress Response in the Diatom Phaeodactylum tricornutum Reveals a Novel Family of RING-Domain Transcription Factors. PLANT PHYSIOLOGY 2016; 170:489-98. [PMID: 26582725 PMCID: PMC4704581 DOI: 10.1104/pp.15.01300] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 11/13/2015] [Indexed: 05/24/2023]
Abstract
Diatoms often inhabit highly variable habitats where they are confronted with a wide variety of stresses, frequently including starvation of nutrients such as nitrogen. In this study, the transcriptome of the model diatom Phaeodactylum tricornutum was profiled during the onset of nitrogen starvation by RNA sequencing, and overrepresented motifs were determined in promoters of genes that were early and strongly up-regulated during the nitrogen stress response. One of these motifs could be bound by a nitrogen starvation-inducible RING-domain protein termed RING-GAF-Gln-containing protein (RGQ1), which was shown to act as a transcription factor and belongs to a previously uncharacterized family that is conserved in heterokont algae.
Collapse
Affiliation(s)
- Michiel Matthijs
- Department of Plant Systems Biology (M.M., M.F., A.G.) and Inflammation Research Center (S.B.), VIB, B-9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics (M.M., M.F., A.G.) and Department of Biomedical Molecular Biology (S.B.), Ghent University, B-9052 Gent, Belgium; and Department of Biology, Laboratory of Protistology and Aquatic Ecology, Ghent University, B-9000 Gent, Belgium (M.M., M.F., W.V.)
| | - Michele Fabris
- Department of Plant Systems Biology (M.M., M.F., A.G.) and Inflammation Research Center (S.B.), VIB, B-9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics (M.M., M.F., A.G.) and Department of Biomedical Molecular Biology (S.B.), Ghent University, B-9052 Gent, Belgium; and Department of Biology, Laboratory of Protistology and Aquatic Ecology, Ghent University, B-9000 Gent, Belgium (M.M., M.F., W.V.)
| | - Stefan Broos
- Department of Plant Systems Biology (M.M., M.F., A.G.) and Inflammation Research Center (S.B.), VIB, B-9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics (M.M., M.F., A.G.) and Department of Biomedical Molecular Biology (S.B.), Ghent University, B-9052 Gent, Belgium; and Department of Biology, Laboratory of Protistology and Aquatic Ecology, Ghent University, B-9000 Gent, Belgium (M.M., M.F., W.V.)
| | - Wim Vyverman
- Department of Plant Systems Biology (M.M., M.F., A.G.) and Inflammation Research Center (S.B.), VIB, B-9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics (M.M., M.F., A.G.) and Department of Biomedical Molecular Biology (S.B.), Ghent University, B-9052 Gent, Belgium; and Department of Biology, Laboratory of Protistology and Aquatic Ecology, Ghent University, B-9000 Gent, Belgium (M.M., M.F., W.V.)
| | - Alain Goossens
- Department of Plant Systems Biology (M.M., M.F., A.G.) and Inflammation Research Center (S.B.), VIB, B-9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics (M.M., M.F., A.G.) and Department of Biomedical Molecular Biology (S.B.), Ghent University, B-9052 Gent, Belgium; and Department of Biology, Laboratory of Protistology and Aquatic Ecology, Ghent University, B-9000 Gent, Belgium (M.M., M.F., W.V.)
| |
Collapse
|
12
|
Malhotra S, Sowdhamini R. Collation and analyses of DNA-binding protein domain families from sequence and structural databanks. MOLECULAR BIOSYSTEMS 2015; 11:1110-8. [PMID: 25656606 DOI: 10.1039/c4mb00629a] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
DNA-protein interactions govern several high fidelity cellular processes like DNA-replication, transcription, DNA repair, etc. Proteins that have the ability to recognise and bind DNA sequences can be classified either according to their DNA-binding motif or based on the sequence of the target nucleotides. We have collated the DNA-binding families by integrating information from both protein sequence family and structural databases. This resulted in a dataset of 1057 DNA-binding protein domain families. Their family properties (the number of members, percent identity distribution and length of members) and domain architectures were examined. Further, sequence domain families were mapped to structures in the protein databank (PDB) and the protein domain structure classification database (SCOP). The DNA-binding families, with no structural information, were clustered together into potential superfamilies based on sequence associations. On the basis of functions attributed to DNA-binding protein folds, we observe that a majority of the DNA-binding proteins follow divergent evolution. This study can serve as a basis for annotation and distribution of DNA-binding proteins in genome(s) of interest. The entire collated set of DNA-binding protein domains is available for download as Hidden Markov Models.
Collapse
Affiliation(s)
- Sony Malhotra
- National Centre for Biological Sciences, Bellary Road, GKVK Campus, Bangalore, India.
| | | |
Collapse
|
13
|
Bigeard J, Rayapuram N, Pflieger D, Hirt H. Phosphorylation-dependent regulation of plant chromatin and chromatin-associated proteins. Proteomics 2014; 14:2127-40. [PMID: 24889195 DOI: 10.1002/pmic.201400073] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Revised: 04/28/2014] [Accepted: 05/26/2014] [Indexed: 12/25/2022]
Abstract
In eukaryotes, most of the DNA is located in the nucleus where it is organized with histone proteins in a higher order structure as chromatin. Chromatin and chromatin-associated proteins contribute to DNA-related processes such as replication and transcription as well as epigenetic regulation. Protein functions are often regulated by PTMs among which phosphorylation is one of the most abundant PTM. Phosphorylation of proteins affects important properties, such as enzyme activity, protein stability, or subcellular localization. We here describe the main specificities of protein phosphorylation in plants and review the current knowledge on phosphorylation-dependent regulation of plant chromatin and chromatin-associated proteins. We also outline some future challenges to further elucidate protein phosphorylation and chromatin regulation.
Collapse
Affiliation(s)
- Jean Bigeard
- Unité de Recherche en Génomique Végétale (URGV), UMR INRA/CNRS/Université d'Evry Val d'Essonne/Saclay Plant Sciences, Evry, France
| | | | | | | |
Collapse
|