1
|
Mikhailova AA, Dohmen E, Harrison MC. Major changes in domain arrangements are associated with the evolution of termites. J Evol Biol 2024; 37:758-769. [PMID: 38630634 DOI: 10.1093/jeb/voae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 12/18/2023] [Accepted: 04/12/2024] [Indexed: 04/19/2024]
Abstract
Domains as functional protein units and their rearrangements along the phylogeny can shed light on the functional changes of proteomes associated with the evolution of complex traits like eusociality. This complex trait is associated with sterile soldiers and workers, and long-lived, highly fecund reproductives. Unlike in Hymenoptera (ants, bees, and wasps), the evolution of eusociality within Blattodea, where termites evolved from within cockroaches, was accompanied by a reduction in proteome size, raising the question of whether functional novelty was achieved with existing rather than novel proteins. To address this, we investigated the role of domain rearrangements during the evolution of termite eusociality. Analysing domain rearrangements in the proteomes of three solitary cockroaches and five eusocial termites, we inferred more than 5,000 rearrangements over the phylogeny of Blattodea. The 90 novel domain arrangements that emerged at the origin of termites were enriched for several functions related to longevity, such as protein homeostasis, DNA repair, mitochondrial activity, and nutrient sensing. Many domain rearrangements were related to changes in developmental pathways, important for the emergence of novel castes. Along with the elaboration of social complexity, including permanently sterile workers and larger, foraging colonies, we found 110 further domain arrangements with functions related to protein glycosylation and ion transport. We found an enrichment of caste-biased expression and splicing within rearranged genes, highlighting their importance for the evolution of castes. Furthermore, we found increased levels of DNA methylation among rearranged compared to non-rearranged genes suggesting fundamental differences in their regulation. Our findings indicate the importance of domain rearrangements in the generation of functional novelty necessary for termite eusociality to evolve.
Collapse
Affiliation(s)
- Alina A Mikhailova
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Elias Dohmen
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Mark C Harrison
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| |
Collapse
|
2
|
Gollapalli P, Rudrappa S, Kumar V, Santosh Kumar HS. Domain Architecture Based Methods for Comparative Functional Genomics Toward Therapeutic Drug Target Discovery. J Mol Evol 2023; 91:598-615. [PMID: 37626222 DOI: 10.1007/s00239-023-10129-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Accepted: 08/06/2023] [Indexed: 08/27/2023]
Abstract
Genes duplicate, mutate, recombine, fuse or fission to produce new genes, or when genes are formed from de novo, novel functions arise during evolution. Researchers have tried to quantify the causes of these molecular diversification processes to know how these genes increase molecular complexity over a period of time, for instance protein domain organization. In contrast to global sequence similarity, protein domain architectures can capture key structural and functional characteristics, making them better proxies for describing functional equivalence. In Prokaryotes and eukaryotes it has proven that, domain designs are retained over significant evolutionary distances. Protein domain architectures are now being utilized to categorize and distinguish evolutionarily related proteins and find homologs among species that are evolutionarily distant from one another. Additionally, structural information stored in domain structures has accelerated homology identification and sequence search methods. Tools for functional protein annotation have been developed to discover, protein domain content, domain order, domain recurrence, and domain position as all these contribute to the prediction of protein functional accuracy. In this review, an attempt is made to summarise facts and speculations regarding the use of protein domain architecture and modularity to identify possible therapeutic targets among cellular activities based on the understanding their linked biological processes.
Collapse
Affiliation(s)
- Pavan Gollapalli
- Center for Bioinformatics and Biostatistics, Nitte (Deemed to be University), Mangalore, Karnataka, 575018, India
| | - Sushmitha Rudrappa
- Department of Biotechnology and Bioinformatics, Jnana Sahyadri Campus, Kuvempu University, Shankaraghatta, Shivamogga, Karnataka, 577451, India
| | - Vadlapudi Kumar
- Department of Biochemistry, Davangere University, Shivagangothri, Davangere, Karnataka, 577007, India
| | - Hulikal Shivashankara Santosh Kumar
- Department of Biotechnology and Bioinformatics, Jnana Sahyadri Campus, Kuvempu University, Shankaraghatta, Shivamogga, Karnataka, 577451, India.
| |
Collapse
|
3
|
Mota MBS, Woods NT, Carvalho MA, Monteiro ANA, Mesquita RD. Evolution of the triplet BRCT domain. DNA Repair (Amst) 2023; 129:103532. [PMID: 37453244 DOI: 10.1016/j.dnarep.2023.103532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 06/28/2023] [Accepted: 07/01/2023] [Indexed: 07/18/2023]
Abstract
Organisms have evolved a complex system, called the DNA damage response (DDR), which maintains genome integrity. The DDR is responsible for identifying and repairing a variety of lesions and alterations in DNA. DDR proteins coordinate DNA damage detection, cell cycle arrest, and repair, with many of these events regulated by protein phosphorylation. In the human proteome, 23 proteins contain the BRCT (BRCA1 C-Terminus domain) domain, a modular signaling domain that can bind phosphopeptides and mediate protein-protein interactions. BRCTs can be found as functional single units, tandem (tBRCT), triplet (tpBRCT), and quartet. Here we examine the evolution of the tpBRCT architecture present in TOPBP1 (DNA topoisomerase II binding protein 1) and ECT2 (epithelial cell transforming 2), and their respective interaction partners RAD9 (Cell cycle checkpoint control protein RAD9) and CYK-4 (Rac GTPase-activating protein 1), with a focus on the conservation of the phosphopeptide-binding residues. The pair TOPBP1-RAD9 arose with the Eukaryotes and ECT2-CYK-4 with the Eumetazoans. Triplet structural and functional characteristics were conserved in almost all organisms. The first unit of the triplet (BRCT0) is different from the other two BRCTs but conserved between orthologs for both TOPBP1 and ECT2. BRCT domain evolution simulations suggest a trend to retain the singlet or towards two or three BRCT copies per protein consistent with functional tBRCT and tpBRCT architectures. Our results shed light on the emergence of the function and architecture of multiple BRCT domain organizations and provide information about the evolution of the BRCT triplet. Knowledge of BRCT domain evolution can improve the understanding of DNA damage response mechanisms and signal transduction in DDR.
Collapse
Affiliation(s)
- M B S Mota
- Departamento de Bioquímica, Instituto de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - N T Woods
- Eppley Institute, Fred & Pamela Buffett Cancer Center, University of Nebraska Medical Center, Omaha, NE 68198, USA
| | - M A Carvalho
- Instituto Federal de Educação, Ciência e Tecnologia do Rio de Janeiro, RJ, Brazil; Instituto Nacional de Câncer, Rio de Janeiro, RJ, Brazil
| | - A N A Monteiro
- Cancer Epidemiology Program, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, USA
| | - R D Mesquita
- Departamento de Bioquímica, Instituto de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil.
| |
Collapse
|
4
|
Kress A, Poch O, Lecompte O, Thompson JD. Real or fake? Measuring the impact of protein annotation errors on estimates of domain gain and loss events. FRONTIERS IN BIOINFORMATICS 2023; 3:1178926. [PMID: 37151482 PMCID: PMC10158824 DOI: 10.3389/fbinf.2023.1178926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 04/05/2023] [Indexed: 05/09/2023] Open
Abstract
Protein annotation errors can have significant consequences in a wide range of fields, ranging from protein structure and function prediction to biomedical research, drug discovery, and biotechnology. By comparing the domains of different proteins, scientists can identify common domains, classify proteins based on their domain architecture, and highlight proteins that have evolved differently in one or more species or clades. However, genome-wide identification of different protein domain architectures involves a complex error-prone pipeline that includes genome sequencing, prediction of gene exon/intron structures, and inference of protein sequences and domain annotations. Here we developed an automated fact-checking approach to distinguish true domain loss/gain events from false events caused by errors that occur during the annotation process. Using genome-wide ortholog sets and taking advantage of the high-quality human and Saccharomyces cerevisiae genome annotations, we analyzed the domain gain and loss events in the predicted proteomes of 9 non-human primates (NHP) and 20 non-S. cerevisiae fungi (NSF) as annotated in the Uniprot and Interpro databases. Our approach allowed us to quantify the impact of errors on estimates of protein domain gains and losses, and we show that domain losses are over-estimated ten-fold and three-fold in the NHP and NSF proteins respectively. This is in line with previous studies of gene-level losses, where issues with genome sequencing or gene annotation led to genes being falsely inferred as absent. In addition, we show that insistent protein domain annotations are a major factor contributing to the false events. For the first time, to our knowledge, we show that domain gains are also over-estimated by three-fold and two-fold respectively in NHP and NSF proteins. Based on our more accurate estimates, we infer that true domain losses and gains in NHP with respect to humans are observed at similar rates, while domain gains in the more divergent NSF are observed twice as frequently as domain losses with respect to S. cerevisiae. This study highlights the need to critically examine the scientific validity of protein annotations, and represents a significant step toward scalable computational fact-checking methods that may 1 day mitigate the propagation of wrong information in protein databases.
Collapse
|
5
|
Borah P, Ni F, Ying W, Zhuang H, Chong SL, Hu XG, Yang J, Lin EP, Huang H. Genome-wide identification and characterization of OVATE family proteins in Betula luminifera reveals involvement of BlOFP3 and BlOFP5 genes in leaf development. FRONTIERS IN PLANT SCIENCE 2022; 13:950936. [PMID: 36311104 PMCID: PMC9613114 DOI: 10.3389/fpls.2022.950936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 09/22/2022] [Indexed: 06/16/2023]
Abstract
Ovate family proteins (OFP) are plant-specific transcription factors involved in regulating morphologies of the lateral organs, plant growth and development. However, the functional roles of OFP genes in Betula luminifera, an important timber tree species, are not well studied. In this study, we identified 20 BlOFP genes and analyzed their phylogenetic relationship, gene structure, conserved motifs, and cis-elements. Further, expression analysis indicates that BlOFP genes were up-regulated in leaves on the one-year-old branch compared to leaves on the current-year branch and bract, except BlOFP7, BlOFP11, BlOFP14 and BlOFP12. The overexpression of BlOFP3 and BlOFP5 in Arabidopsis thaliana not only resulted in a slower growth rate but also produced sawtooth shape, flatter and darker green rosette leaves. Further investigation showed that the leaf thickness of the transgenic plants was more than double that of the wild type, which was caused by the increasement in the number and size of palisade tissue cells. Furthermore, the expression analysis also indicated that the expressions of several genes related to leaf development were significantly changed in the transgene plants. These results suggested the significant roles of BlOFP3 and BlOFP5 in leaf development. Moreover, protein-protein interaction studies showed that BlOFP3 interacts with BlKNAT5, and BlOFP5 interacts with BlKNAT5, BlBLH6 and BlBLH7. In conclusion, our study demonstrates that BlOFP3 and BlOFP5 were involved in leaf shape and thickness regulation by forming a complex with BlKNAT5, BlBLH6 and BlBLH7. In addition, our study serves as a guide for future functional genomic studies of OFP genes of the B. luminifera.
Collapse
Affiliation(s)
- Priyanka Borah
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Fei Ni
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Weiyang Ying
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Hebi Zhuang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Sun-Li Chong
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Xian-Ge Hu
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Jun Yang
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai, China
| | - Er-pei Lin
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| | - Huahong Huang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, China
| |
Collapse
|
6
|
Mechanism of cooperative N-glycan processing by the multi-modular endoglycosidase EndoE. Nat Commun 2022; 13:1137. [PMID: 35241669 PMCID: PMC8894350 DOI: 10.1038/s41467-022-28722-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Accepted: 02/04/2022] [Indexed: 11/29/2022] Open
Abstract
Bacteria produce a remarkably diverse range of glycoside hydrolases to metabolize glycans from the environment as a primary source of nutrients, and to promote the colonization and infection of a host. Here we focus on EndoE, a multi-modular glycoside hydrolase secreted by Enterococcus faecalis, one of the leading causes of healthcare-associated infections. We provide X-ray crystal structures of EndoE, which show an architecture composed of four domains, including GH18 and GH20 glycoside hydrolases connected by two consecutive three α-helical bundles. We determine that the GH20 domain is an exo-β-1,2-N-acetylglucosaminidase, whereas the GH18 domain is an endo-β-1,4-N-acetylglucosaminidase that exclusively processes the central core of complex-type or high-mannose-type N-glycans. Both glycoside hydrolase domains act in a concerted manner to process diverse N-glycans on glycoproteins, including therapeutic IgG antibodies. EndoE combines two enzyme domains with distinct functions and glycan specificities to play a dual role in glycan metabolism and immune evasion. EndoE is a multi-domain glycoside hydrolase of the human pathogen Enterococcus faecalis. Here, the authors present crystal structures of EndoE and provide biochemical insights into the molecular basis of EndoE’s substrate specificity and catalytic mechanism.
Collapse
|
7
|
Matveevsky S, Grishaeva T. Heterogeneity in conservation of multifunctional partner enzymes with meiotic importance, CDK2 kinase and BRCA1 ubiquitin ligase. PeerJ 2021; 9:e12231. [PMID: 34692254 PMCID: PMC8483008 DOI: 10.7717/peerj.12231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 09/09/2021] [Indexed: 12/04/2022] Open
Abstract
The evolution of proteins can be accompanied by changes not only to their amino acid sequences, but also their structural and spatial molecular organization. Comparison of the protein conservation within different taxonomic groups (multifunctional, or highly specific) allows to clarify their specificity and the direction of evolution. Two multifunctional enzymes, cyclin-dependent kinase 2 (CDK2) and BRCA1 ubiquitin ligase, that are partners in some mitotic and meiotic processes were investigated in the present work. Two research methods, bioinformatics and immunocytochemical, were combined to examine the conservation levels of the two enzymes. It has been established that CDK2 is a highly conserved protein in different taxonomic lineages of the eukaryotic tree. Immunocytochemically, a conserved CDK2 pattern was revealed in the meiotic autosomes of five rodent species and partially in domestic turkey and clawed frog. Nevertheless, variable CDK2 distribution was detected at the unsynapsed segments of the rodent X chromosomes. BRCA1 was shown to be highly conserved only within certain mammalian taxa. It was also noted that in those rodent nuclei, where BRCA1 specifically binds to antigens, asynaptic regions of sex chromosomes were positive. BRCA1 staining was not always accompanied by specific binding, and a high nonspecificity in the nucleoplasm was observed. Thus, the studies revealed different conservation of the two enzymes at the level of protein structure as well as at the level of chromosome behavior. This suggests variable rates of evolution due to both size and configuration of the protein molecules and their multifunctionality.
Collapse
Affiliation(s)
- Sergey Matveevsky
- Laboratory of Cytogenetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Tatiana Grishaeva
- Laboratory of Cytogenetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
8
|
Klymiuk V, Coaker G, Fahima T, Pozniak CJ. Tandem Protein Kinases Emerge as New Regulators of Plant Immunity. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2021; 34:1094-1102. [PMID: 34096764 PMCID: PMC8761531 DOI: 10.1094/mpmi-03-21-0073-cr] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Plant-pathogen interactions result in disease development in a susceptible host. Plants actively resist pathogens via a complex immune system comprising both surface-localized receptors that sense the extracellular space as well as intracellular receptors recognizing pathogen effectors. To date, the majority of cloned resistance genes encode intracellular nucleotide-binding leucine-rich repeat receptor proteins. Recent discoveries have revealed tandem kinase proteins (TKPs) as another important family of intracellular proteins involved in plant immune responses. Five TKP genes-barley Rpg1 and wheat WTK1 (Yr15), WTK2 (Sr60), WTK3 (Pm24), and WTK4-protect against devastating fungal diseases. Moreover, a large diversity and numerous putative TKPs exist across the plant kingdom. This review explores our current knowledge of TKPs and serves as a basis for future studies that aim to develop and exploit a deeper understanding of innate plant immunity receptor proteins.[Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Collapse
Affiliation(s)
- Valentyna Klymiuk
- Crop Development Centre and Department of Plant Sciences,
University of Saskatchewan, Saskatoon, SK S7N 5A8, Canada
| | - Gitta Coaker
- Department of Plant Pathology, University of California,
Davis, CA, U.S.A
| | - Tzion Fahima
- Institute of Evolution, University of Haifa, 199 Abba-Hushi
Avenue, Mt. Carmel, 3498838 Haifa, Israel
- Department of Evolutionary and Environmental Biology,
University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838 Haifa, Israel
| | - Curtis J. Pozniak
- Crop Development Centre and Department of Plant Sciences,
University of Saskatchewan, Saskatoon, SK S7N 5A8, Canada
| |
Collapse
|
9
|
Abstract
We describe the de novo design of an allosterically regulated protein, which comprises two tightly coupled domains. One domain is based on the DF (Due Ferri in Italian or two-iron in English) family of de novo proteins, which have a diiron cofactor that catalyzes a phenol oxidase reaction, while the second domain is based on PS1 (Porphyrin-binding Sequence), which binds a synthetic Zn-porphyrin (ZnP). The binding of ZnP to the original PS1 protein induces changes in structure and dynamics, which we expected to influence the catalytic rate of a fused DF domain when appropriately coupled. Both DF and PS1 are four-helix bundles, but they have distinct bundle architectures. To achieve tight coupling between the domains, they were connected by four helical linkers using a computational method to discover the most designable connections capable of spanning the two architectures. The resulting protein, DFP1 (Due Ferri Porphyrin), bound the two cofactors in the expected manner. The crystal structure of fully reconstituted DFP1 was also in excellent agreement with the design, and it showed the ZnP cofactor bound over 12 Å from the dimetal center. Next, a substrate-binding cleft leading to the diiron center was introduced into DFP1. The resulting protein acts as an allosterically modulated phenol oxidase. Its Michaelis-Menten parameters were strongly affected by the binding of ZnP, resulting in a fourfold tighter K m and a 7-fold decrease in k cat These studies establish the feasibility of designing allosterically regulated catalytic proteins, entirely from scratch.
Collapse
|
10
|
Yadav A, Fernández-Baca D, Cannon SB. Family-Specific Gains and Losses of Protein Domains in the Legume and Grass Plant Families. Evol Bioinform Online 2020; 16:1176934320939943. [PMID: 32694909 PMCID: PMC7350399 DOI: 10.1177/1176934320939943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 06/15/2020] [Indexed: 11/27/2022] Open
Abstract
Protein domains can be regarded as sections of protein sequences capable of folding independently and performing specific functions. In addition to amino-acid level changes, protein sequences can also evolve through domain shuffling events such as domain insertion, deletion, or duplication. The evolution of protein domains can be studied by tracking domain changes in a selected set of species with known phylogenetic relationships. Here, we conduct such an analysis by defining domains as “features” or “descriptors,” and considering the species (target + outgroup) as instances or data-points in a data matrix. We then look for features (domains) that are significantly different between the target species and the outgroup species. We study the domain changes in 2 large, distinct groups of plant species: legumes (Fabaceae) and grasses (Poaceae), with respect to selected outgroup species. We evaluate 4 types of domain feature matrices: domain content, domain duplication, domain abundance, and domain versatility. The 4 types of domain feature matrices attempt to capture different aspects of domain changes through which the protein sequences may evolve—that is, via gain or loss of domains, increase or decrease in the copy number of domains along the sequences, expansion or contraction of domains, or through changes in the number of adjacent domain partners. All the feature matrices were analyzed using feature selection techniques and statistical tests to select protein domains that have significant different feature values in legumes and grasses. We report the biological functions of the top selected domains from the analysis of all the feature matrices. In addition, we also perform domain-centric gene ontology (dcGO) enrichment analysis on all selected domains from all 4 feature matrices to study the gene ontology terms associated with the significantly evolving domains in legumes and grasses. Domain content analysis revealed a striking loss of protein domains from the Fanconi anemia (FA) pathway, the pathway responsible for the repair of interstrand DNA crosslinks. The abundance analysis of domains found in legumes revealed an increase in glutathione synthase enzyme, an antioxidant required from nitrogen fixation, and a decrease in xanthine oxidizing enzymes, a phenomenon confirmed by previous studies. In grasses, the abundance analysis showed increases in domains related to gene silencing which could be due to polyploidy or due to enhanced response to viral infection. We provide a docker container that can be used to perform this analysis workflow on any user-defined sets of species, available at https://cloud.docker.com/u/akshayayadav/repository/docker/akshayayadav/protein-domain-evolution-project.
Collapse
Affiliation(s)
- Akshay Yadav
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, IA, USA
| | | | - Steven B Cannon
- Corn Insects and Crop Genetics Research Unit, USDA-Agricultural Research Service, Ames, IA, USA
| |
Collapse
|
11
|
Glutaredoxin-like protein (GLP)-a novel bacteria sulfurtransferase that protects cells against cyanide and oxidative stresses. Appl Microbiol Biotechnol 2020; 104:5477-5492. [PMID: 32307572 DOI: 10.1007/s00253-020-10491-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Revised: 02/14/2020] [Accepted: 02/20/2020] [Indexed: 02/01/2023]
Abstract
The pathogen Xylella fastidiosa belongs to the Xanthomonadaceae family, a large group of Gram-negative bacteria that cause diseases in many economically important crops. A predicted gene, annotated as glutaredoxin-like protein (glp), was found to be highly conserved among the genomes of different genera within this family and highly expressed in X. fastidiosa. Analysis of the GLP protein sequences revealed three protein domains: one similar to monothiol glutaredoxins (Grx), an Fe-S cluster and a thiosulfate sulfurtransferase/rhodanese domain (Tst/Rho), which is generally involved in sulfur metabolism and cyanide detoxification. To characterize the biochemical properties of GLP, we expressed and purified the X. fastidiosa recombinant GLP enzyme. Grx activity and Fe-S cluster formation were not observed, while an evaluation of Tst/Rho enzymatic activity revealed that GLP can detoxify cyanide and transfer inorganic sulfur to acceptor molecules in vitro. The biological activity of GLP relies on the cysteine residues in the Grx and Tst/Rho domains (Cys33 and Cys266, respectively), and structural analysis showed that GLP and GLPC266S were able to form high molecular weight oligomers (> 600 kDa), while replacement of Cys33 with Ser destabilized the quaternary structure. In vivo heterologous enzyme expression experiments in Escherichia coli revealed that GLP can protect bacteria against high concentrations of cyanide and hydrogen peroxide. Finally, phylogenetic analysis showed that homologous glp genes are distributed across Gram-negative bacterial families with conservation of the N- to C-domain order. However, no eukaryotic organism contains this enzyme. Altogether, these results suggest that GLP is an important enzyme with cyanide-decomposing and sulfurtransferase functions in bacteria, whose presence in eukaryotes we could not observe, representing a promising biological target for new pharmaceuticals.
Collapse
|
12
|
Dohmen E, Klasberg S, Bornberg-Bauer E, Perrey S, Kemena C. The modular nature of protein evolution: domain rearrangement rates across eukaryotic life. BMC Evol Biol 2020; 20:30. [PMID: 32059645 PMCID: PMC7023805 DOI: 10.1186/s12862-020-1591-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 01/31/2020] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND Modularity is important for evolutionary innovation. The recombination of existing units to form larger complexes with new functionalities spares the need to create novel elements from scratch. In proteins, this principle can be observed at the level of protein domains, functional subunits which are regularly rearranged to acquire new functions. RESULTS In this study we analyse the mechanisms leading to new domain arrangements in five major eukaryotic clades (vertebrates, insects, fungi, monocots and eudicots) at unprecedented depth and breadth. This allows, for the first time, to directly compare rates of rearrangements between different clades and identify both lineage specific and general patterns of evolution in the context of domain rearrangements. We analyse arrangement changes along phylogenetic trees by reconstructing ancestral domain content in combination with feasible single step events, such as fusion or fission. Using this approach we explain up to 70% of all rearrangements by tracing them back to their precursors. We find that rates in general and the ratio between these rates for a given clade in particular, are highly consistent across all clades. In agreement with previous studies, fusions are the most frequent event leading to new domain arrangements. A lineage specific pattern in fungi reveals exceptionally high loss rates compared to other clades, supporting recent studies highlighting the importance of loss for evolutionary innovation. Furthermore, our methodology allows us to link domain emergences at specific nodes in the phylogenetic tree to important functional developments, such as the origin of hair in mammals. CONCLUSIONS Our results demonstrate that domain rearrangements are based on a canonical set of mutational events with rates which lie within a relatively narrow and consistent range. In addition, gained knowledge about these rates provides a basis for advanced domain-based methodologies for phylogenetics and homology analysis which complement current sequence-based methods.
Collapse
Affiliation(s)
- Elias Dohmen
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, Münster, 48149, Germany.,Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, Recklinghausen, 45665, Germany
| | - Steffen Klasberg
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, Münster, 48149, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, Münster, 48149, Germany
| | - Sören Perrey
- Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, Recklinghausen, 45665, Germany
| | - Carsten Kemena
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, Münster, 48149, Germany.
| |
Collapse
|
13
|
Filatov G, Bauwens B, Kertész-Farkas A. LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification. Bioinformatics 2019; 34:3281-3288. [PMID: 29741583 DOI: 10.1093/bioinformatics/bty349] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Accepted: 05/03/2018] [Indexed: 11/13/2022] Open
Abstract
Motivation Bioinformatics studies often rely on similarity measures between sequence pairs, which often pose a bottleneck in large-scale sequence analysis. Results Here, we present a new convolutional kernel function for protein sequences called the Lempel-Ziv-Welch (LZW)-Kernel. It is based on code words identified with the LZW universal text compressor. The LZW-Kernel is an alignment-free method, it is always symmetric, is positive, always provides 1.0 for self-similarity and it can directly be used with Support Vector Machines (SVMs) in classification problems, contrary to normalized compression distance, which often violates the distance metric properties in practice and requires further techniques to be used with SVMs. The LZW-Kernel is a one-pass algorithm, which makes it particularly plausible for big data applications. Our experimental studies on remote protein homology detection and protein classification tasks reveal that the LZW-Kernel closely approaches the performance of the Local Alignment Kernel (LAK) and the SVM-pairwise method combined with Smith-Waterman (SW) scoring at a fraction of the time. Moreover, the LZW-Kernel outperforms the SVM-pairwise method when combined with Basic Local Alignment Search Tool (BLAST) scores, which indicates that the LZW code words might be a better basis for similarity measures than local alignment approximations found with BLAST. In addition, the LZW-Kernel outperforms n-gram based mismatch kernels, hidden Markov model based SAM and Fisher kernel and protein family based PSI-BLAST, among others. Further advantages include the LZW-Kernel's reliance on a simple idea, its ease of implementation, and its high speed, three times faster than BLAST and several magnitudes faster than SW or LAK in our tests. Availability and implementation LZW-Kernel is implemented as a standalone C code and is a free open-source program distributed under GPLv3 license and can be downloaded from https://github.com/kfattila/LZW-Kernel. Supplementary information Supplementary data are available at Bioinformatics Online.
Collapse
Affiliation(s)
- Gleb Filatov
- Faculty of Computer Science, Department of Data Analysis and Artificial Intelligence, Moscow, Russia
| | - Bruno Bauwens
- Faculty of Computer Science, Department of Big Data and Information Retrieval, Moscow, Russia
| | - Attila Kertész-Farkas
- Faculty of Computer Science, Department of Data Analysis and Artificial Intelligence, Moscow, Russia
| |
Collapse
|
14
|
Nguyen SN, Flores A, Talamantes D, Dar F, Valdez A, Schwans J, Berlemont R. GeneHunt for rapid domain-specific annotation of glycoside hydrolases. Sci Rep 2019; 9:10137. [PMID: 31300677 PMCID: PMC6626019 DOI: 10.1038/s41598-019-46290-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 06/26/2019] [Indexed: 12/20/2022] Open
Abstract
The identification of glycoside hydrolases (GHs) for efficient polysaccharide deconstruction is essential for the development of biofuels. Here, we investigate the potential of sequential HMM-profile identification for the rapid and precise identification of the multi-domain architecture of GHs from various datasets. First, as a validation, we successfully reannotated >98% of the biochemically characterized enzymes listed on the CAZy database. Next, we analyzed the 43 million non-redundant sequences from the M5nr data and identified 322,068 unique GHs. Finally, we searched 129 assembled metagenomes retrieved from MG-RAST for environmental GHs and identified 160,790 additional enzymes. Although most identified sequences corresponded to single domain enzymes, many contained several domains, including known accessory domains and some domains never identified in association with GH. Several sequences displayed multiple catalytic domains and few of these potential multi-activity proteins combined potentially synergistic domains. Finally, we produced and confirmed the biochemical activities of a GH5-GH10 cellulase-xylanase and a GH11-CE4 xylanase-esterase. Globally, this "gene to enzyme pipeline" provides a rationale for mining large datasets in order to identify new catalysts combining unique properties for the efficient deconstruction of polysaccharides.
Collapse
Affiliation(s)
- S N Nguyen
- Department of Biological Sciences, California State University Long Beach, Long Beach, California, USA
| | - A Flores
- Department of Biological Sciences, California State University Long Beach, Long Beach, California, USA
| | - D Talamantes
- Department of Biological Sciences, California State University Long Beach, Long Beach, California, USA
| | - F Dar
- Department of Biological Sciences, California State University Long Beach, Long Beach, California, USA
| | - A Valdez
- Department of Biological Sciences, California State University Long Beach, Long Beach, California, USA
| | - J Schwans
- Department of Chemistry and Biochemistry, California State University Long Beach, Long Beach, California, USA
| | - R Berlemont
- Department of Biological Sciences, California State University Long Beach, Long Beach, California, USA.
| |
Collapse
|
15
|
Klymiuk V, Yaniv E, Huang L, Raats D, Fatiukha A, Chen S, Feng L, Frenkel Z, Krugman T, Lidzbarsky G, Chang W, Jääskeläinen MJ, Schudoma C, Paulin L, Laine P, Bariana H, Sela H, Saleem K, Sørensen CK, Hovmøller MS, Distelfeld A, Chalhoub B, Dubcovsky J, Korol AB, Schulman AH, Fahima T. Cloning of the wheat Yr15 resistance gene sheds light on the plant tandem kinase-pseudokinase family. Nat Commun 2018; 9:3735. [PMID: 30282993 PMCID: PMC6170490 DOI: 10.1038/s41467-018-06138-9] [Citation(s) in RCA: 140] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Accepted: 07/26/2018] [Indexed: 01/11/2023] Open
Abstract
Yellow rust, caused by Puccinia striiformis f. sp. tritici (Pst), is a devastating fungal disease threatening much of global wheat production. Race-specific resistance (R)-genes are used to control rust diseases, but the rapid emergence of virulent Pst races has prompted the search for a more durable resistance. Here, we report the cloning of Yr15, a broad-spectrum R-gene derived from wild emmer wheat, which encodes a putative kinase-pseudokinase protein, designated as wheat tandem kinase 1, comprising a unique R-gene structure in wheat. The existence of a similar gene architecture in 92 putative proteins across the plant kingdom, including the barley RPG1 and a candidate for Ug8, suggests that they are members of a distinct family of plant proteins, termed here tandem kinase-pseudokinases (TKPs). The presence of kinase-pseudokinase structure in both plant TKPs and the animal Janus kinases sheds light on the molecular evolution of immune responses across these two kingdoms.
Collapse
Affiliation(s)
- Valentina Klymiuk
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
| | - Elitsur Yaniv
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Institute of Biotechnology, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
| | - Lin Huang
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Triticeae Research Institute, Sichuan Agricultural University, 611130, Chengdu, China
| | - Dina Raats
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Earlham Institute, Norwich Research Park, Colney Lane, Norwich, NR4 7UZ, UK
| | - Andrii Fatiukha
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
| | - Shisheng Chen
- Department of Plant Sciences, University of California, One Shields Avenue, Davis, CA, 95616, USA
| | - Lihua Feng
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
| | - Zeev Frenkel
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
| | - Tamar Krugman
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
| | - Gabriel Lidzbarsky
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
| | - Wei Chang
- Institute of Biotechnology, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
| | - Marko J Jääskeläinen
- Institute of Biotechnology, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
| | - Christian Schudoma
- Earlham Institute, Norwich Research Park, Colney Lane, Norwich, NR4 7UZ, UK
| | - Lars Paulin
- Institute of Biotechnology, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
| | - Pia Laine
- Institute of Biotechnology, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
| | - Harbans Bariana
- The University of Sydney Plant Breeding Institute, 107 Cobbitty Road, Cobbitty, NSW, 2570, Australia
| | - Hanan Sela
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- The Institute for Cereal Crops Improvement, Tel Aviv University, P.O. Box 39040, 6139001, Tel Aviv, Israel
| | - Kamran Saleem
- Department of Agroecology, Aarhus University, Forsøgsvej 1, 4200, Slagelse, Denmark
| | | | - Mogens S Hovmøller
- Department of Agroecology, Aarhus University, Forsøgsvej 1, 4200, Slagelse, Denmark
| | - Assaf Distelfeld
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- School of Plant Sciences and Food Security, Tel Aviv University, P.O. Box 39040, 6139001, Tel Aviv, Israel
| | - Boulos Chalhoub
- Institute of System and Synthetic Biology-Organization and Evolution of Complex Genomes, 2 rue Gaston Crémieux CP 5708, 91057, Evry Cedex, France
| | - Jorge Dubcovsky
- Department of Plant Sciences, University of California, One Shields Avenue, Davis, CA, 95616, USA
- Howard Hughes Medical Institute, 4000 Jones Bridge Road, Chevy Chase, MD, 20815, USA
| | - Abraham B Korol
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel
| | - Alan H Schulman
- Institute of Biotechnology, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Viikinkaari 1, P.O. Box 65, FI-00014, Helsinki, Finland
- Natural Resources Institute Finland (Luke), Latokartanonkaari 9, FI-00790, Helsinki, Finland
| | - Tzion Fahima
- Institute of Evolution, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel.
- Department of Evolutionary and Environmental Biology, University of Haifa, 199 Abba-Hushi Avenue, Mt. Carmel, 3498838, Haifa, Israel.
| |
Collapse
|
16
|
Abstract
Repeats are ubiquitous elements of proteins and they play important roles for cellular function and during evolution. Repeats are, however, also notoriously difficult to capture computationally and large scale studies so far had difficulties in linking genetic causes, structural properties and evolutionary trajectories of protein repeats. Here we apply recently developed methods for repeat detection and analysis to a large dataset comprising over hundred metazoan genomes. We find that repeats in larger protein families experience generally very few insertions or deletions (indels) of repeat units but there is also a significant fraction of noteworthy volatile outliers with very high indel rates. Analysis of structural data indicates that repeats with an open structure and independently folding units are more volatile and more likely to be intrinsically disordered. Such disordered repeats are also significantly enriched in sites with a high functional potential such as linear motifs. Furthermore, the most volatile repeats have a high sequence similarity between their units. Since many volatile repeats also show signs of recombination, we conclude they are often shaped by concerted evolution. Intriguingly, many of these conserved yet volatile repeats are involved in host-pathogen interactions where they might foster fast but subtle adaptation in biological arms races. KEY WORDS: protein evolution, domain rearrangements, protein repeats, concerted evolution.
Collapse
Affiliation(s)
- Andreas Schüler
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University, Huefferstrasse 1, Muenster, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University, Huefferstrasse 1, Muenster, Germany
| |
Collapse
|
17
|
Dohmen E, Kremer LPM, Bornberg-Bauer E, Kemena C. DOGMA: domain-based transcriptome and proteome quality assessment. Bioinformatics 2016; 32:2577-81. [PMID: 27153665 DOI: 10.1093/bioinformatics/btw231] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 04/21/2016] [Indexed: 12/29/2022] Open
Abstract
MOTIVATION Genome studies have become cheaper and easier than ever before, due to the decreased costs of high-throughput sequencing and the free availability of analysis software. However, the quality of genome or transcriptome assemblies can vary a lot. Therefore, quality assessment of assemblies and annotations are crucial aspects of genome analysis pipelines. RESULTS We developed DOGMA, a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. DOGMA measures the completeness of a given transcriptome or proteome and provides information about domain content for further analysis. DOGMA provides a very fast way to do quality assessment within seconds. AVAILABILITY AND IMPLEMENTATION DOGMA is implemented in Python and published under GNU GPL v.3 license. The source code is available on https://ebbgit.uni-muenster.de/domainWorld/DOGMA/ CONTACTS: e.dohmen@wwu.de or c.kemena@wwu.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Elias Dohmen
- Institute for Evolution and Biodiversity, University of Münster, Münster 48149, Germany Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, Recklinghausen 45665, Germany
| | - Lukas P M Kremer
- Institute for Evolution and Biodiversity, University of Münster, Münster 48149, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster 48149, Germany
| | - Carsten Kemena
- Institute for Evolution and Biodiversity, University of Münster, Münster 48149, Germany
| |
Collapse
|
18
|
Hanukoglu I, Hanukoglu A. Epithelial sodium channel (ENaC) family: Phylogeny, structure-function, tissue distribution, and associated inherited diseases. Gene 2016; 579:95-132. [PMID: 26772908 PMCID: PMC4756657 DOI: 10.1016/j.gene.2015.12.061] [Citation(s) in RCA: 249] [Impact Index Per Article: 31.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2015] [Revised: 12/20/2015] [Accepted: 12/22/2015] [Indexed: 01/24/2023]
Abstract
The epithelial sodium channel (ENaC) is composed of three homologous subunits and allows the flow of Na(+) ions across high resistance epithelia, maintaining body salt and water homeostasis. ENaC dependent reabsorption of Na(+) in the kidney tubules regulates extracellular fluid (ECF) volume and blood pressure by modulating osmolarity. In multi-ciliated cells, ENaC is located in cilia and plays an essential role in the regulation of epithelial surface liquid volume necessary for cilial transport of mucus and gametes in the respiratory and reproductive tracts respectively. The subunits that form ENaC (named as alpha, beta, gamma and delta, encoded by genes SCNN1A, SCNN1B, SCNN1G, and SCNN1D) are members of the ENaC/Degenerin superfamily. The earliest appearance of ENaC orthologs is in the genomes of the most ancient vertebrate taxon, Cyclostomata (jawless vertebrates) including lampreys, followed by earliest representatives of Gnathostomata (jawed vertebrates) including cartilaginous sharks. Among Euteleostomi (bony vertebrates), Actinopterygii (ray finned-fishes) branch has lost ENaC genes. Yet, most animals in the Sarcopterygii (lobe-finned fish) branch including Tetrapoda, amphibians and amniotes (lizards, crocodiles, birds, and mammals), have four ENaC paralogs. We compared the sequences of ENaC orthologs from 20 species and established criteria for the identification of ENaC orthologs and paralogs, and their distinction from other members of the ENaC/Degenerin superfamily, especially ASIC family. Differences between ENaCs and ASICs are summarized in view of their physiological functions and tissue distributions. Structural motifs that are conserved throughout vertebrate ENaCs are highlighted. We also present a comparative overview of the genotype-phenotype relationships in inherited diseases associated with ENaC mutations, including multisystem pseudohypoaldosteronism (PHA1B), Liddle syndrome, cystic fibrosis-like disease and essential hypertension.
Collapse
Affiliation(s)
- Israel Hanukoglu
- Laboratory of Cell Biology, Faculty of Natural Sciences, Ariel University, Ariel, Israel.
| | - Aaron Hanukoglu
- Division of Pediatric Endocrinology, E. Wolfson Medical Center, Holon, Israel; Sackler School of Medicine, Tel-Aviv University, Tel Aviv, Israel
| |
Collapse
|
19
|
Cromar GL, Zhao A, Xiong X, Swapna LS, Loughran N, Song H, Parkinson J. PhyloPro2.0: a database for the dynamic exploration of phylogenetically conserved proteins and their domain architectures across the Eukarya. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw013. [PMID: 26980519 PMCID: PMC4792532 DOI: 10.1093/database/baw013] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 01/29/2016] [Indexed: 11/13/2022]
Abstract
PhyloPro is a database and accompanying web-based application for the construction and exploration of phylogenetic profiles across the Eukarya. In this update article, we present six major new developments in PhyloPro: (i) integration of Pfam-A domain predictions for all proteins; (ii) new summary heatmaps and detailed level views of domain conservation; (iii) an interactive, network-based visualization tool for exploration of domain architectures and their conservation; (iv) ability to browse based on protein functional categories (GOSlim); (v) improvements to the web interface to enhance drill down capability from the heatmap view; and (vi) improved coverage including 164 eukaryotes and 12 reference species. In addition, we provide improved support for downloading data and images in a variety of formats. Among the existing tools available for phylogenetic profiles, PhyloPro provides several innovative domain-based features including a novel domain adjacency visualization tool. These are designed to allow the user to identify and compare proteins with similar domain architectures across species and thus develop hypotheses about the evolution of lineage-specific trajectories. Database URL: http://www.compsysbio.org/phylopro/.
Collapse
Affiliation(s)
- Graham L Cromar
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Anthony Zhao
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Xuejian Xiong
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Lakshmipuram S Swapna
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Noeleen Loughran
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Hongyan Song
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - John Parkinson
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and Departments of Biochemistry, Computer Science and Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| |
Collapse
|
20
|
Haider C, Kavic M, Sonnhammer EL. TreeDom: a graphical web tool for analysing domain architecture evolution. Bioinformatics 2016; 32:2384-5. [DOI: 10.1093/bioinformatics/btw140] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 03/03/2016] [Indexed: 11/13/2022] Open
Abstract
Abstract
Summary: We present TreeDom, a web tool for graphically analysing the evolutionary history of domains in multi-domain proteins. Individual domains on the same protein chain may have distinct evolutionary histories, which is important to grasp in order to understand protein function. For instance, it may be important to know whether a domain was duplicated recently or long ago, to know the origin of inserted domains, or to know the pattern of domain loss within a protein family. TreeDom uses the Pfam database as the source of domain annotations, and displays these on a sequence tree. An advantage of TreeDom is that the user can limit the analysis to N sequences that are most similar to a query, or provide a list of sequence IDs to include. Using the Pfam alignment of the selected sequences, a tree is built and displayed together with the domain architecture of each sequence.
Availablility and implementation: http://TreeDom.sbc.su.se
Contact: Erik.Sonnhammer@scilifelab.se
Collapse
Affiliation(s)
- Christian Haider
- Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Solna SE-17121, Sweden
- FH OÖ – University of Applied Sciences Upper Austria, Hagenberg 4232, Austria
| | - Marina Kavic
- Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Solna SE-17121, Sweden
- FH OÖ – University of Applied Sciences Upper Austria, Hagenberg 4232, Austria
| | - Erik L.L. Sonnhammer
- Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Solna SE-17121, Sweden
| |
Collapse
|
21
|
Tekaia F. Inferring Orthologs: Open Questions and Perspectives. GENOMICS INSIGHTS 2016; 9:17-28. [PMID: 26966373 PMCID: PMC4778853 DOI: 10.4137/gei.s37925] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 12/30/2015] [Accepted: 01/02/2016] [Indexed: 01/25/2023]
Abstract
With the increasing number of sequenced genomes and their comparisons, the detection of orthologs is crucial for reliable functional annotation and evolutionary analyses of genes and species. Yet, the dynamic remodeling of genome content through gain, loss, transfer of genes, and segmental and whole-genome duplication hinders reliable orthology detection. Moreover, the lack of direct functional evidence and the questionable quality of some available genome sequences and annotations present additional difficulties to assess orthology. This article reviews the existing computational methods and their potential accuracy in the high-throughput era of genome sequencing and anticipates open questions in terms of methodology, reliability, and computation. Appropriate taxon sampling together with combination of methods based on similarity, phylogeny, synteny, and evolutionary knowledge that may help detecting speciation events appears to be the most accurate strategy. This review also raises perspectives on the potential determination of orthology throughout the whole species phylogeny.
Collapse
Affiliation(s)
- Fredj Tekaia
- Institut Pasteur, Unit of Structural Microbiology, CNRS URA 3528 and University Paris Diderot, Sorbonne Paris Cité, Paris, France
| |
Collapse
|
22
|
Hsu CH, Chiang AWT, Hwang MJ, Liao BY. Proteins with Highly Evolvable Domain Architectures Are Nonessential but Highly Retained. Mol Biol Evol 2016; 33:1219-30. [DOI: 10.1093/molbev/msw006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
|
23
|
Talamantes D, Biabini N, Dang H, Abdoun K, Berlemont R. Natural diversity of cellulases, xylanases, and chitinases in bacteria. BIOTECHNOLOGY FOR BIOFUELS 2016; 9:133. [PMID: 27366206 PMCID: PMC4928363 DOI: 10.1186/s13068-016-0538-6] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2016] [Accepted: 05/31/2016] [Indexed: 05/16/2023]
Abstract
BACKGROUND Glycoside hydrolases (GH) targeting cellulose, xylan, and chitin are common in the bacterial genomes that have been sequenced. Little is known, however, about the architecture of multi-domain and multi-activity glycoside hydrolases. In these enzymes, combined catalytic domains act synergistically and thus display overall improved catalytic efficiency, making these proteins of high interest for the biofuel technology industry. RESULTS Here, we identify the domain organization in 40,946 proteins targeting cellulose, xylan, and chitin derived from 11,953 sequenced bacterial genomes. These bacteria are known to be capable, or to have the potential, to degrade polysaccharides, or are newly identified potential degraders (e.g., Actinospica, Hamadaea, Cystobacter, and Microbispora). Most of the proteins we identified contain a single catalytic domain that is frequently associated with an accessory non-catalytic domain. Regarding multi-domain proteins, we found that many bacterial strains have unique GH protein architectures and that the overall protein organization is not conserved across most genera. We identified 217 multi-activity proteins with at least two GH domains for cellulose, xylan, and chitin. Of these proteins, 211 have GH domains targeting similar or associated substrates (i.e., cellulose and xylan), whereas only six proteins target both cellulose and chitin. Fifty-two percent of multi-activity GHs are hetero-GHs. Finally, GH6, -10, -44 and -48 domains were mostly C-terminal; GH9, -11, -12, and -18 were mostly N-terminal; and GH5 domains were either N- or C-terminal. CONCLUSION We identified 40,946 multi-domain/multi-activity proteins targeting cellulase, chitinase, and xylanase in bacterial genomes and proposed new candidate lineages and protein architectures for carbohydrate processing that may play a role in biofuel production.
Collapse
Affiliation(s)
- Darrian Talamantes
- Department of Biological Sciences, California State University, Long Beach, 1250 Bellflower Blvd., Long Beach, 90840-9502 USA
| | - Nazmehr Biabini
- Department of Biological Sciences, California State University, Long Beach, 1250 Bellflower Blvd., Long Beach, 90840-9502 USA
| | - Hoang Dang
- Department of Biological Sciences, California State University, Long Beach, 1250 Bellflower Blvd., Long Beach, 90840-9502 USA
| | - Kenza Abdoun
- Department of Biological Sciences, California State University, Long Beach, 1250 Bellflower Blvd., Long Beach, 90840-9502 USA
| | - Renaud Berlemont
- Department of Biological Sciences, California State University, Long Beach, 1250 Bellflower Blvd., Long Beach, 90840-9502 USA
| |
Collapse
|
24
|
Scaiewicz A, Levitt M. The language of the protein universe. Curr Opin Genet Dev 2015; 35:50-6. [PMID: 26451980 DOI: 10.1016/j.gde.2015.08.010] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 08/20/2015] [Accepted: 08/25/2015] [Indexed: 11/17/2022]
Abstract
Proteins, the main cell machinery which play a major role in nearly every cellular process, have always been a central focus in biology. We live in the post-genomic era, and inferring information from massive data sets is a steadily growing universal challenge. The increasing availability of fully sequenced genomes can be regarded as the 'Rosetta Stone' of the protein universe, allowing the understanding of genomes and their evolution, just as the original Rosetta Stone allowed Champollion to decipher the ancient Egyptian hieroglyphics. In this review, we consider aspects of the protein domain architectures repertoire that are closely related to those of human languages and aim to provide some insights about the language of proteins.
Collapse
Affiliation(s)
- Andrea Scaiewicz
- Department of Structural Biology, Stanford University, Stanford, CA 94305-5126, United States
| | - Michael Levitt
- Department of Structural Biology, Stanford University, Stanford, CA 94305-5126, United States.
| |
Collapse
|
25
|
Schüler A, Schmitz G, Reft A, Özbek S, Thurm U, Bornberg-Bauer E. The Rise and Fall of TRP-N, an Ancient Family of Mechanogated Ion Channels, in Metazoa. Genome Biol Evol 2015; 7:1713-27. [PMID: 26100409 PMCID: PMC4494053 DOI: 10.1093/gbe/evv091] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Mechanoreception, the sensing of mechanical forces, is an ancient means of orientation and communication and tightly linked to the evolution of motile animals. In flies, the transient-receptor-potential N protein (TRP-N) was found to be a cilia-associated mechanoreceptor. TRP-N belongs to a large and diverse family of ion channels. Its unusually long N-terminal repeat of 28 ankyrin domains presumably acts as the gating spring by which mechanical energy induces channel gating. We analyzed the evolutionary origins and possible diversification of TRP-N. Using a custom-made set of highly discriminative sequence profiles we scanned a representative set of metazoan genomes and subsequently corrected several gene models. We find that, contrary to other ion channel families, TRP-N is remarkably conserved in its domain arrangements and copy number (1) in all Bilateria except for amniotes, even in the wake of several whole-genome duplications. TRP-N is absent in Porifera but present in Ctenophora and Placozoa. Exceptional multiplications of TRP-N occurred in Cnidaria, independently along the Hydra and the Nematostella lineage. Molecular signals of subfunctionalization can be attributed to different mechanisms of activation of the gating spring. In Hydra this is further supported by in situ hybridization and immune staining, suggesting that at least three paralogs adapted to nematocyte discharge, which is key for predation and defense. We propose that these new candidate proteins help explain the sensory complexity of Cnidaria which has been previously observed but so far has lacked a molecular underpinning. Also, the ancient appearance of TRP-N supports a common origin of important components of the nervous systems in Ctenophores, Cnidaria, and Bilateria.
Collapse
Affiliation(s)
- Andreas Schüler
- Institute for Evolution and Biodiversity, University of Muenster, Germany
| | - Gregor Schmitz
- Institute for Evolution and Biodiversity, University of Muenster, Germany
| | - Abigail Reft
- Centre for Organismal Studies, University of Heidelberg, Germany
| | - Suat Özbek
- Centre for Organismal Studies, University of Heidelberg, Germany HEIKA-Heidelberg Karlsruhe Research Partnership, Heidelberg University, Karlsruhe Institute of Technology (KIT), Heidelberg and Karlsruhe, Germany
| | - Ulrich Thurm
- Institute for Neurobiology and Behavioural Biology, University of Muenster, Germany
| | | |
Collapse
|
26
|
Linkeviciute V, Rackham OJL, Gough J, Oates ME, Fang H. Function-selective domain architecture plasticity potentials in eukaryotic genome evolution. Biochimie 2015; 119:269-77. [PMID: 25980317 PMCID: PMC4679076 DOI: 10.1016/j.biochi.2015.05.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 05/06/2015] [Indexed: 12/20/2022]
Abstract
To help evaluate how protein function impacts on genome evolution, we introduce a new concept of ‘architecture plasticity potential’ – the capacity to form distinct domain architectures – both for an individual domain, or more generally for a set of domains grouped by shared function. We devise a scoring metric to measure the plasticity potential for these domain sets, and evaluate how function has changed over time for different species. Applying this metric to a phylogenetic tree of eukaryotic genomes, we find that the involvement of each function is not random but highly selective. For certain lineages there is strong bias for evolution to involve domains related to certain functions. In general eukaryotic genomes, particularly animals, expand complex functional activities such as signalling and regulation, but at the cost of reducing metabolic processes. We also observe differential evolution of transcriptional regulation and a unique evolutionary role of channel regulators; crucially this is only observable in terms of the architecture plasticity potential. Our findings provide a new layer of information to understand the significance of function in eukaryotic genome evolution. A web search tool, available at http://supfam.org/Pevo, offers a wide spectrum of options for exploring functional importance in eukaryotic genome evolution. A new concept to measure domain architecture plasticity potential in a genome. We reveal the function-selective role in eukaryotic genome evolution. Eukaryotic genomes expand signalling and regulations but reduce metabolism. We observe differential evolution between trans- and cis-acting regulations. We observe a unique role of channel regulators in separating eukaryotic kingdoms.
Collapse
Affiliation(s)
- Viktorija Linkeviciute
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK; School of Biological Sciences, University of Edinburgh, Darwin Building, The King's Buildings, Edinburgh EH9 3BF, UK
| | - Owen J L Rackham
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK; Centre for Computational Biology, Duke-NUS Graduate Medical School, Singapore 169857, Singapore
| | - Julian Gough
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK
| | - Matt E Oates
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK
| | - Hai Fang
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol BS8 1UB, UK; Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK.
| |
Collapse
|
27
|
microProtein Prediction Program (miP3): A Software for Predicting microProteins and Their Target Transcription Factors. Int J Genomics 2015; 2015:734147. [PMID: 26060811 PMCID: PMC4427850 DOI: 10.1155/2015/734147] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Revised: 03/21/2015] [Accepted: 03/22/2015] [Indexed: 01/01/2023] Open
Abstract
An emerging concept in transcriptional regulation is that a class of truncated transcription factors (TFs), called microProteins (miPs), engages in protein-protein interactions with TF complexes and provides feedback controls. A handful of miP examples have been described in the literature but the extent of their prevalence is unclear. Here we present an algorithm that predicts miPs and their target TFs from a sequenced genome. The algorithm is called miP prediction program (miP3), which is implemented in Python. The software will help shed light on the prevalence, biological roles, and evolution of miPs. Moreover, miP3 can be used to predict other types of miP-like proteins that may have evolved from other functional classes such as kinases and receptors. The program is freely available and can be applied to any sequenced genome.
Collapse
|
28
|
Kemena C, Bitard-Feildel T, Bornberg-Bauer E. MDAT- Aligning multiple domain arrangements. BMC Bioinformatics 2015; 16:19. [PMID: 25626688 PMCID: PMC4384290 DOI: 10.1186/s12859-014-0442-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Accepted: 12/03/2014] [Indexed: 11/26/2022] Open
Abstract
Background Proteins are composed of domains, protein segments that fold independently from the rest of the protein and have a specific function. During evolution the arrangement of domains can change: domains are gained, lost or their order is rearranged. To facilitate the analysis of these changes we propose the use of multiple domain alignments. Results We developed an alignment program, called MDAT, which aligns multiple domain arrangements. MDAT extends earlier programs which perform pairwise alignments of domain arrangements. MDAT uses a domain similarity matrix to score domain pairs and aligns the domain arrangements using a consistency supported progressive alignment method. Conclusion MDAT will be useful for analysing changes in domain arrangements within and between protein families and will thus provide valuable insights into the evolution of proteins and their domains. MDAT is coded in C++, and the source code is freely available for download at http://www.bornberglab.org/pages/mdat. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0442-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Carsten Kemena
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstr. 1, Münster, Germany.
| | - Tristan Bitard-Feildel
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstr. 1, Münster, Germany.
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstr. 1, Münster, Germany.
| |
Collapse
|
29
|
Teixeira MM, de Almeida LGP, Kubitschek-Barreira P, Alves FL, Kioshima ÉS, Abadio AKR, Fernandes L, Derengowski LS, Ferreira KS, Souza RC, Ruiz JC, de Andrade NC, Paes HC, Nicola AM, Albuquerque P, Gerber AL, Martins VP, Peconick LDF, Neto AV, Chaucanez CB, Silva PA, Cunha OL, de Oliveira FFM, dos Santos TC, Barros ALN, Soares MA, de Oliveira LM, Marini MM, Villalobos-Duno H, Cunha MML, de Hoog S, da Silveira JF, Henrissat B, Niño-Vega GA, Cisalpino PS, Mora-Montes HM, Almeida SR, Stajich JE, Lopes-Bezerra LM, Vasconcelos ATR, Felipe MSS. Comparative genomics of the major fungal agents of human and animal Sporotrichosis: Sporothrix schenckii and Sporothrix brasiliensis. BMC Genomics 2014; 15:943. [PMID: 25351875 PMCID: PMC4226871 DOI: 10.1186/1471-2164-15-943] [Citation(s) in RCA: 87] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2014] [Accepted: 09/25/2014] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND The fungal genus Sporothrix includes at least four human pathogenic species. One of these species, S. brasiliensis, is the causal agent of a major ongoing zoonotic outbreak of sporotrichosis in Brazil. Elsewhere, sapronoses are caused by S. schenckii and S. globosa. The major aims on this comparative genomic study are: 1) to explore the presence of virulence factors in S. schenckii and S. brasiliensis; 2) to compare S. brasiliensis, which is cat-transmitted and infects both humans and cats with S. schenckii, mainly a human pathogen; 3) to compare these two species to other human pathogens (Onygenales) with similar thermo-dimorphic behavior and to other plant-associated Sordariomycetes. RESULTS The genomes of S. schenckii and S. brasiliensis were pyrosequenced to 17x and 20x coverage comprising a total of 32.3 Mb and 33.2 Mb, respectively. Pair-wise genome alignments revealed that the two species are highly syntenic showing 97.5% average sequence identity. Phylogenomic analysis reveals that both species diverged about 3.8-4.9 MYA suggesting a recent event of speciation. Transposable elements comprise respectively 0.34% and 0.62% of the S. schenckii and S. brasiliensis genomes and expansions of Gypsy-like elements was observed reflecting the accumulation of repetitive elements in the S. brasiliensis genome. Mitochondrial genomic comparisons showed the presence of group-I intron encoding homing endonucleases (HE's) exclusively in S. brasiliensis. Analysis of protein family expansions and contractions in the Sporothrix lineage revealed expansion of LysM domain-containing proteins, small GTPases, PKS type1 and leucin-rich proteins. In contrast, a lack of polysaccharide lyase genes that are associated with decay of plants was observed when compared to other Sordariomycetes and dimorphic fungal pathogens, suggesting evolutionary adaptations from a plant pathogenic or saprobic to an animal pathogenic life style. CONCLUSIONS Comparative genomic data suggest a unique ecological shift in the Sporothrix lineage from plant-association to mammalian parasitism, which contributes to the understanding of how environmental interactions may shape fungal virulence. . Moreover, the striking differences found in comparison with other dimorphic fungi revealed that dimorphism in these close relatives of plant-associated Sordariomycetes is a case of convergent evolution, stressing the importance of this morphogenetic change in fungal pathogenesis.
Collapse
Affiliation(s)
- Marcus M Teixeira
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | | | - Paula Kubitschek-Barreira
- />Departamento de Biologia Celular, Instituto de Biologia Roberto Alcântara Gomes, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, RJ Brazil
| | - Fernanda L Alves
- />Departamento de Microbiologia, Universidade Federal de Minas Gerais, Belo Horizonte, MG Brazil
- />Grupo Informática de Biossistemas, Centro de Pesquisas René Rachou, FIOCRUZ, Minas, Belo Horizonte, MG Brazil
| | - Érika S Kioshima
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
- />Departamento de Análises Clínicas, Universidade Estadual de Maringá, Maringá, PR Brazil
| | - Ana KR Abadio
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Larissa Fernandes
- />Programa de Pós-Graduação em Ciências e Tecnologias em Saúde, Universidade de Brasília, Ceilândia, Brasília, DF Brazil
| | - Lorena S Derengowski
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Karen S Ferreira
- />Instituto de Ciências Ambientais, Químicas e Farmacêuticas, Universidade Federal de São Paulo, Campus Diadema, São Paulo, SP Brazil
| | - Rangel C Souza
- />Laboratório Nacional de Computação Científica, Petrópolis, RJ Brazil
| | - Jeronimo C Ruiz
- />Grupo Informática de Biossistemas, Centro de Pesquisas René Rachou, FIOCRUZ, Minas, Belo Horizonte, MG Brazil
| | - Nathalia C de Andrade
- />Departamento de Biologia Celular, Instituto de Biologia Roberto Alcântara Gomes, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, RJ Brazil
| | - Hugo C Paes
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - André M Nicola
- />Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, DF Brazil
- />Programa de pós-graduação em Medicina Tropical, Universidade de Brasília, Brasília, DF Brazil
| | - Patrícia Albuquerque
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
- />Programa de pós-graduação em Medicina Tropical, Universidade de Brasília, Brasília, DF Brazil
| | | | - Vicente P Martins
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Luisa DF Peconick
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Alan Viggiano Neto
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Claudia B Chaucanez
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Patrícia A Silva
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Oberdan L Cunha
- />Laboratório Nacional de Computação Científica, Petrópolis, RJ Brazil
| | | | - Tayná C dos Santos
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Amanda LN Barros
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
| | - Marco A Soares
- />Departamento de Microbiologia, Universidade Federal de Minas Gerais, Belo Horizonte, MG Brazil
| | - Luciana M de Oliveira
- />Departamento de Microbiologia, Universidade Federal de Minas Gerais, Belo Horizonte, MG Brazil
- />Programa de pós-graduação em Bioinformática, Universidade Federal de Minas Gerais, Minas Gerais, Brazil
| | - Marjorie M Marini
- />Departamento de Microbiologia Imunobiologia e Parasitologia, Universidade Federal de São Paulo, São Paulo, SP Brazil
| | - Héctor Villalobos-Duno
- />Centro de Microbiología y Biología Celular, Instituto Venezolano de Investigaciones Cientificas, Caracas, Venezuela
| | - Marcel ML Cunha
- />Departamento de Biologia Celular, Instituto de Biologia Roberto Alcântara Gomes, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, RJ Brazil
| | - Sybren de Hoog
- />CBS-KNAW Fungal Biodiversity Centre, Utrecht, The Netherlands
| | - José F da Silveira
- />Departamento de Microbiologia Imunobiologia e Parasitologia, Universidade Federal de São Paulo, São Paulo, SP Brazil
| | - Bernard Henrissat
- />Centre National de la Recherche Scientifique, Aix-Marseille, Université, CNRS, Marseille, France
| | - Gustavo A Niño-Vega
- />Centro de Microbiología y Biología Celular, Instituto Venezolano de Investigaciones Cientificas, Caracas, Venezuela
| | - Patrícia S Cisalpino
- />Grupo Informática de Biossistemas, Centro de Pesquisas René Rachou, FIOCRUZ, Minas, Belo Horizonte, MG Brazil
| | | | - Sandro R Almeida
- />Departamento de Análises Clínicas e Toxicológicas, Universidade de São Paulo, São Paulo, SP Brazil
| | - Jason E Stajich
- />Department of Plant Pathology & Microbiology, University of California, Riverside, CA USA
| | - Leila M Lopes-Bezerra
- />Departamento de Biologia Celular, Instituto de Biologia Roberto Alcântara Gomes, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, RJ Brazil
| | | | - Maria SS Felipe
- />Departamento de Biologia Celular, Universidade de Brasília, Brasília, DF Brazil
- />Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, DF Brazil
| |
Collapse
|
30
|
Abstract
To explore protein space from a global perspective, we consider 9,710 SCOP (Structural Classification of Proteins) domains with up to 70% sequence identity and present all similarities among them as networks: In the "domain network," nodes represent domains, and edges connect domains that share "motifs," i.e., significantly sized segments of similar sequence and structure. We explore the dependence of the network on the thresholds that define the evolutionary relatedness of the domains. At excessively strict thresholds the network falls apart completely; for very lax thresholds, there are network paths between virtually all domains. Interestingly, at intermediate thresholds the network constitutes two regions that can be described as "continuous" versus "discrete." The continuous region comprises a large connected component, dominated by domains with alternating alpha and beta elements, and the discrete region includes the rest of the domains in isolated islands, each generally corresponding to a fold. We also construct the "motif network," in which nodes represent recurring motifs, and edges connect motifs that appear in the same domain. This network also features a large and highly connected component of motifs that originate from domains with alternating alpha/beta elements (and some all-alpha domains), and smaller isolated islands. Indeed, the motif network suggests that nature reuses such motifs extensively. The networks suggest evolutionary paths between domains and give hints about protein evolution and the underlying biophysics. They provide natural means of organizing protein space, and could be useful for the development of strategies for protein search and design.
Collapse
|
31
|
Tóth-Petróczy A, Tawfik DS. The robustness and innovability of protein folds. Curr Opin Struct Biol 2014; 26:131-8. [PMID: 25038399 DOI: 10.1016/j.sbi.2014.06.007] [Citation(s) in RCA: 93] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2013] [Revised: 06/26/2014] [Accepted: 06/26/2014] [Indexed: 11/30/2022]
Abstract
Assignment of protein folds to functions indicates that >60% of folds carry out one or two enzymatic functions, while few folds, for example, the TIM-barrel and Rossmann folds, exhibit hundreds. Are there structural features that make a fold amenable to functional innovation (innovability)? Do these features relate to robustness--the ability to readily accumulate sequence changes? We discuss several hypotheses regarding the relationship between the architecture of a protein and its evolutionary potential. We describe how, in a seemingly paradoxical manner, opposite properties, such as high stability and rigidity versus conformational plasticity and structural order versus disorder, promote robustness and/or innovability. We hypothesize that polarity--differentiation and low connectivity between a protein's scaffold and its active-site--is a key prerequisite for innovability.
Collapse
Affiliation(s)
- Agnes Tóth-Petróczy
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Dan S Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel.
| |
Collapse
|
32
|
Analysis of the protein domain and domain architecture content in fungi and its application in the search of new antifungal targets. PLoS Comput Biol 2014; 10:e1003733. [PMID: 25033262 PMCID: PMC4102429 DOI: 10.1371/journal.pcbi.1003733] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Accepted: 06/04/2014] [Indexed: 01/25/2023] Open
Abstract
Over the past several years fungal infections have shown an increasing incidence in the susceptible population, and caused high mortality rates. In parallel, multi-resistant fungi are emerging in human infections. Therefore, the identification of new potential antifungal targets is a priority. The first task of this study was to analyse the protein domain and domain architecture content of the 137 fungal proteomes (corresponding to 111 species) available in UniProtKB (UniProt KnowledgeBase) by January 2013. The resulting list of core and exclusive domain and domain architectures is provided in this paper. It delineates the different levels of fungal taxonomic classification: phylum, subphylum, order, genus and species. The analysis highlighted Aspergillus as the most diverse genus in terms of exclusive domain content. In addition, we also investigated which domains could be considered promiscuous in the different organisms. As an application of this analysis, we explored three different ways to detect potential targets for antifungal drugs. First, we compared the domain and domain architecture content of the human and fungal proteomes, and identified those domains and domain architectures only present in fungi. Secondly, we looked for information regarding fungal pathways in public repositories, where proteins containing promiscuous domains could be involved. Three pathways were identified as a result: lovastatin biosynthesis, xylan degradation and biosynthesis of siroheme. Finally, we classified a subset of the studied fungi in five groups depending on their occurrence in clinical samples. We then looked for exclusive domains in the groups that were more relevant clinically and determined which of them had the potential to bind small molecules. Overall, this study provides a comprehensive analysis of the available fungal proteomes and shows three approaches that can be used as a first step in the detection of new antifungal targets. Some fungi have become pathogenic to plants and in a lesser extent to animals. Under certain conditions their presence in the human body can prove a threat for human health, especially for immunocompromised patients. Yet, some fungi can also infect healthy individuals. The low sensitivity of the antifungal drugs available together with the clinically observed resistance of some fungi raises the demand for new alternative treatments. Proteins are biological molecules which perform essential functions within the living organisms. Many of those functions are attributed to the varying folded structure of each protein. These configurations are composed of functional units -also called domains- each one independently responsible for a fraction of the overall biological function. Understanding how the different block combinations are distributed across members of the same or similar families of organisms is important. For instance, exclusive domain combinations can hold particular acquired functions. Blocks displaying a high mobility can play major roles for the organism's survival. The biological goal of this study was to analyse the functional implications of protein domains and domain combinations in the available fungal proteomes. This information can be used to highlight proteins and pathways that could be potentially used as drug targets.
Collapse
|
33
|
Chaudhry F, Jansen S, Little K, Suarez C, Boujemaa-Paterski R, Blanchoin L, Goode BL. Autonomous and in trans functions for the two halves of Srv2/CAP in promoting actin turnover. Cytoskeleton (Hoboken) 2014; 71:351-360. [PMID: 24616256 DOI: 10.1002/cm.21170] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2014] [Revised: 02/24/2014] [Accepted: 02/25/2014] [Indexed: 01/06/2023]
Abstract
Recent evidence has suggested that Srv2/CAP (cyclase-associated protein) has two distinct functional roles in regulating actin turnover, with its N-terminus enhancing cofilin-mediated severing of actin filaments and its C-terminus catalyzing actin monomer recycling. However, it has remained unclear to what degree these two activities are coordinated by being linked in one molecule, or whether they can function autonomously. To address this, we physically divided the protein into two separate halves, N-Srv2 and C-Srv2, and asked whether they are able to function in trans both in living cells and in reconstituted assays for F-actin turnover and actin-based motility. Remarkably, in F-actin turnover assays the stimulatory effects of N-Srv2 and C-Srv2 functioning in trans were quantitatively similar to those of intact full-length Srv2. Further, in bead motility assays and in vivo, the fragments again functioned in trans, although not with the full effectiveness of intact Srv2. From these data, we conclude that the functions of the two halves of Srv2/CAP are largely autonomous, although their linkage improves coordination of the two functions in specific settings, possibly explaining why the linkage is conserved across distant plant, animal, and fungal species.
Collapse
Affiliation(s)
- Faisal Chaudhry
- Department of Biology, Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA, 02454, U.S.A
| | - Silvia Jansen
- Department of Biology, Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA, 02454, U.S.A
| | - Kristin Little
- Department of Biology, Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA, 02454, U.S.A
| | - Cristian Suarez
- Laboratoire de Physiologie Cellulaire & Végétale, Institut de Recherches en Technologies et Sciences pour le Vivant, Centre National de la Recherche Scientifique/Commissariat à l'énergie atomique et aux énergies alternatives/Institut National de la Recherche Agronomique/Université Joseph Fourier, Grenoble 38054, FRANCE
| | - Rajaa Boujemaa-Paterski
- Laboratoire de Physiologie Cellulaire & Végétale, Institut de Recherches en Technologies et Sciences pour le Vivant, Centre National de la Recherche Scientifique/Commissariat à l'énergie atomique et aux énergies alternatives/Institut National de la Recherche Agronomique/Université Joseph Fourier, Grenoble 38054, FRANCE
| | - Laurent Blanchoin
- Laboratoire de Physiologie Cellulaire & Végétale, Institut de Recherches en Technologies et Sciences pour le Vivant, Centre National de la Recherche Scientifique/Commissariat à l'énergie atomique et aux énergies alternatives/Institut National de la Recherche Agronomique/Université Joseph Fourier, Grenoble 38054, FRANCE
| | - Bruce L Goode
- Department of Biology, Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA, 02454, U.S.A
| |
Collapse
|
34
|
Bhaskara RM, Mehrotra P, Rakshambikai R, Gnanavel M, Martin J, Srinivasan N. The relationship between classification of multi-domain proteins using an alignment-free approach and their functions: a case study with immunoglobulins. MOLECULAR BIOSYSTEMS 2014; 10:1082-93. [DOI: 10.1039/c3mb70443b] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
35
|
Abstract
Orthologues and paralogues are types of homologous genes that are related by speciation or duplication, respectively. Orthologous genes are generally assumed to retain equivalent functions in different organisms and to share other key properties. Several recent comparative genomic studies have focused on testing these expectations. Here we discuss the complexity of the evolution of gene-phenotype relationships and assess the validity of the key implications of orthology and paralogy relationships as general statistical trends and guiding principles.
Collapse
|
36
|
Bornberg-Bauer E, Albà MM. Dynamics and adaptive benefits of modular protein evolution. Curr Opin Struct Biol 2013; 23:459-66. [PMID: 23562500 DOI: 10.1016/j.sbi.2013.02.012] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 02/15/2013] [Accepted: 02/15/2013] [Indexed: 11/29/2022]
Abstract
During protein evolution, novel domain arrangements are continuously formed. Rearrangements are important for the creation of molecular biodiversity and for functional molecular changes which underlie developmental shifts in the bauplan of organisms. Here we review the mechanisms by which new arrangements arise and the potential benefits of rearrangements. We concentrate on how new domains emerge and why they rapidly spread across genomes, gaining higher copy numbers than older, more established domains. This spread is most likely a consequence of their high adaptive potential but is unlikely to make up on its own for the drastic loss of domains, which is observed across different taxa. We show that a significant portion of the recently emerged domains, especially those in multidomain families, are highly disordered and speculate about the significance of these findings for the evolvability of novel genetic material.
Collapse
Affiliation(s)
- Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, School of Biological Sciences, University of Münster, Hüfferstrasse 1, D48149 Münster, Germany.
| | | |
Collapse
|