1
|
Cromar GL, Zhao A, Xiong X, Swapna LS, Loughran N, Song H, Parkinson J. PhyloPro2.0: a database for the dynamic exploration of phylogenetically conserved proteins and their domain architectures across the Eukarya. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw013. [PMID: 26980519 PMCID: PMC4792532 DOI: 10.1093/database/baw013] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 01/29/2016] [Indexed: 11/13/2022]
Abstract
PhyloPro is a database and accompanying web-based application for the construction and exploration of phylogenetic profiles across the Eukarya. In this update article, we present six major new developments in PhyloPro: (i) integration of Pfam-A domain predictions for all proteins; (ii) new summary heatmaps and detailed level views of domain conservation; (iii) an interactive, network-based visualization tool for exploration of domain architectures and their conservation; (iv) ability to browse based on protein functional categories (GOSlim); (v) improvements to the web interface to enhance drill down capability from the heatmap view; and (vi) improved coverage including 164 eukaryotes and 12 reference species. In addition, we provide improved support for downloading data and images in a variety of formats. Among the existing tools available for phylogenetic profiles, PhyloPro provides several innovative domain-based features including a novel domain adjacency visualization tool. These are designed to allow the user to identify and compare proteins with similar domain architectures across species and thus develop hypotheses about the evolution of lineage-specific trajectories. Database URL: http://www.compsysbio.org/phylopro/.
Collapse
Affiliation(s)
- Graham L Cromar
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Anthony Zhao
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Xuejian Xiong
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Lakshmipuram S Swapna
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Noeleen Loughran
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - Hongyan Song
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and
| | - John Parkinson
- Program in Molecular Structure and Function, Hospital for Sick Children, 21-9830 PGCRL, 686 Bay Street, Toronto, ON M5G 0A4, Canada and Departments of Biochemistry, Computer Science and Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| |
Collapse
|
2
|
Cromar G, Wong KC, Loughran N, On T, Song H, Xiong X, Zhang Z, Parkinson J. New tricks for "old" domains: how novel architectures and promiscuous hubs contributed to the organization and evolution of the ECM. Genome Biol Evol 2014; 6:2897-917. [PMID: 25323955 PMCID: PMC4224354 DOI: 10.1093/gbe/evu228] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/07/2014] [Indexed: 12/15/2022] Open
Abstract
The extracellular matrix (ECM) is a defining characteristic of metazoans and consists of a meshwork of self-assembling, fibrous proteins, and their functionally related neighbours. Previous studies, focusing on a limited number of gene families, suggest that vertebrate complexity predominantly arose through the duplication and subsequent modification of retained, preexisting ECM genes. These genes provided the structural underpinnings to support a variety of specialized tissues, as well as a platform for the organization of spatio-temporal signaling and cell migration. However, the relative contributions of ancient versus novel domains to ECM evolution have not been quantified across the full range of ECM proteins. Here, utilizing a high quality list comprising 324 ECM genes, we reveal general and clade-specific domain combinations, identifying domains of eukaryotic and metazoan origin recruited into new roles in approximately two-third of the ECM proteins in humans representing novel vertebrate proteins. We show that, rather than acquiring new domains, sampling of new domain combinations has been key to the innovation of paralogous ECM genes during vertebrate evolution. Applying a novel framework for identifying potentially important, noncontiguous, conserved arrangements of domains, we find that the distinct biological characteristics of the ECM have arisen through unique evolutionary processes. These include the preferential recruitment of novel domains to existing architectures and the utilization of high promiscuity domains in organizing the ECM network around a connected array of structural hubs. Our focus on ECM proteins reveals that distinct types of proteins and/or the biological systems in which they operate have influenced the types of evolutionary forces that drive protein innovation. This emphasizes the need for rigorously defined systems to address questions of evolution that focus on specific systems of interacting proteins.
Collapse
Affiliation(s)
- Graham Cromar
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Ontario, Canada
| | - Ka-Chun Wong
- Department of Computer Science, University of Toronto, Ontario, Canada Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Ontario, Canada
| | - Noeleen Loughran
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - Tuan On
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Ontario, Canada
| | - Hongyan Song
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - Xuejian Xiong
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - Zhaolei Zhang
- Department of Molecular Genetics, University of Toronto, Ontario, Canada Department of Computer Science, University of Toronto, Ontario, Canada Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Ontario, Canada Banting and Best Department of Medical Research, University of Toronto, Ontario, Canada
| | - John Parkinson
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada Department of Molecular Genetics, University of Toronto, Ontario, Canada Department of Biochemistry, University of Toronto, Ontario, Canada
| |
Collapse
|
3
|
Abstract
Background Amino acid point mutations (nsSNPs) may change protein structure and function. However, no method directly predicts the impact of mutations on structure. Here, we compare pairs of pentamers (five consecutive residues) that locally change protein three-dimensional structure (3D, RMSD>0.4Å) to those that do not alter structure (RMSD<0.2Å). Mutations that alter structure locally can be distinguished from those that do not through a machine-learning (logistic regression) method. Results The method achieved a rather high overall performance (AUC>0.79, two-state accuracy >72%). This discriminative power was particularly unexpected given the enormous structural variability of pentamers. Mutants for which our method predicted a change of structure were also enriched in terms of disrupting stability and function. Although distinguishing change and no change in structure, the new method overall failed to distinguish between mutants with and without effect on stability or function. Conclusions Local structural change can be predicted. Future work will have to establish how useful this new perspective on predicting the effect of nsSNPs will be in combination with other methods.
Collapse
Affiliation(s)
- Christian Schaefer
- TUM, Bioinformatics-I12, Informatik, Boltzmannstrasse 3, Garching, Germany.
| | | |
Collapse
|
4
|
Gao YG, Suzuki H, Itou H, Zhou Y, Tanaka Y, Wachi M, Watanabe N, Tanaka I, Yao M. Structural and functional characterization of the LldR from Corynebacterium glutamicum: a transcriptional repressor involved in L-lactate and sugar utilization. Nucleic Acids Res 2008; 36:7110-23. [PMID: 18988622 PMCID: PMC2602784 DOI: 10.1093/nar/gkn827] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2008] [Revised: 10/09/2008] [Accepted: 10/14/2008] [Indexed: 12/01/2022] Open
Abstract
LldR (CGL2915) from Corynebacterium glutamicum is a transcription factor belonging to the GntR family, which is typically involved in the regulation of oxidized substrates associated with amino acid metabolism. In the present study, the crystal structure of LldR was determined at 2.05-A resolution. The structure consists of N- and C-domains similar to those of FadR, but with distinct domain orientations. LldR and FadR dimers achieve similar structures by domain swapping, which was first observed in dimeric assembly of transcription factors. A structural feature of Zn(2+) binding in the regulatory domain was also observed, as a difference from the FadR subfamily. DNA microarray and DNase I footprint analyses suggested that LldR acts as a repressor regulating cgl2917-lldD and cgl1934-fruK-ptsF operons, which are indispensable for l-lactate and fructose/sucrose utilization, respectively. Furthermore, the stoichiometries and affinities of LldR and DNAs were determined by isothermal titration calorimetry measurements. The transcriptional start site and repression of LldR on the cgl2917-lldD operon were analysed by primer extension assay. Mutation experiments showed that residues Lys4, Arg32, Arg42 and Gly63 are crucial for DNA binding. The location of the putative ligand binding cavity and the regulatory mechanism of LldR on its affinity for DNA were proposed.
Collapse
Affiliation(s)
- Yong-Gui Gao
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Hiroaki Suzuki
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Hiroshi Itou
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Yong Zhou
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Yoshikazu Tanaka
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Masaaki Wachi
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Nobuhisa Watanabe
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Isao Tanaka
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| | - Min Yao
- Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810, Department of Bioengineering, Tokyo Institute of Technology, Yokohama 226-8503 and National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| |
Collapse
|
5
|
Elias M, Novotny M. cpRAS: a novel circularly permuted RAS-like GTPase domain with a highly scattered phylogenetic distribution. Biol Direct 2008; 3:21. [PMID: 18510733 PMCID: PMC2430557 DOI: 10.1186/1745-6150-3-21] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2008] [Accepted: 05/29/2008] [Indexed: 11/15/2022] Open
Abstract
A recent systematic survey suggested that the YRG (or YawG/YlqF) family with the G4-G5-G1-G2-G3 order of the conserved GTPase motifs represents the only possible circularly permuted variation of the canonical GTPase structure. Here we show that a different circularly permuted GTPase domain actually does exist, conforming to the pattern G3-G4-G5-G1-G2. The domain, dubbed cpRAS, is a variant of RAS family GTPases and occurs in two types of larger proteins, either inserted into a region homologous to a bacterial group of proteins classified as COG2373 and potentially related to the alpha-2-macroglobulin family (so far a single protein in Dictyostelium) or in combination with a von Willebrand factor type A (VWA) domain. For the latter protein type, which was found in a few metazoans and several distantly related protists, existence in the common ancestor of opisthokonts, Amoebozoa and excavates followed by at least eight independent losses may be inferred. Our findings thus bring further evidence for the importance of parallel reduction of ancestral complexity in the eukaryotic evolution. This article was reviewed by Lakshminarayan Iyer and Fyodor Kondrashov. For the full reviews, please go to the Reviewers' comments section.
Collapse
Affiliation(s)
- Marek Elias
- Charles University in Prague, Faculty of Science, Department of Botany, Benatska 2, 128 01 Prague 2, Czech Republic.
| | | |
Collapse
|
6
|
Abyzov A, Ilyin VA. A comprehensive analysis of non-sequential alignments between all protein structures. BMC STRUCTURAL BIOLOGY 2007; 7:78. [PMID: 18005453 PMCID: PMC2213659 DOI: 10.1186/1472-6807-7-78] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2007] [Accepted: 11/16/2007] [Indexed: 05/02/2023]
Abstract
Background The majority of relations between proteins can be represented as a conventional sequential alignment. Nevertheless, unusual non-sequential alignments with different connectivity of the aligned fragments in compared proteins have been reported by many researchers. It is interesting to understand those non-sequential alignments; are they unique, sporadic cases or they occur frequently; do they belong to a few specific folds or spread among many different folds, as a common feature of protein structure. We present here a comprehensive large-scale study of non-sequential alignments between available protein structures in Protein Data Bank. Results The study has been conducted on a non-redundant set of 8,865 protein structures aligned with the aid of the TOPOFIT method. It has been estimated that between 17.4% and 35.2% of all alignments are non-sequential depending on variations in the parameters. Analysis of the data revealed that non-sequential relations between proteins do occur systematically and in large quantities. Various sizes and numbers of non-sequential fragments have been observed with all possible complexities of fragment rearrangements found for alignments consisting of up to 12 fragments. It has been found that non-sequential alignments are not limited to proteins of any particular fold and are present in more than two hundred of them. Moreover, many of them are found between proteins with different fold assignments. It has been shown that protein structure symmetry does not explain non-sequential alignments. Therefore, compelling evidences have been provided that non-sequential alignments between proteins are systematic and widespread across the protein universe. Conclusion The phenomenon of the widespread occurrence of non-sequential alignments between proteins might represent a missing rule of protein structure organization. More detailed study of this phenomenon will enhance our understanding of protein stability, folding, and evolution.
Collapse
Affiliation(s)
- Alexej Abyzov
- Department of Biology, Northeastern University 360 Huntington Avenue, Boston, MA 02115, USA.
| | | |
Collapse
|
7
|
Didier G, Guziolowski C. Mapping sequences by parts. Algorithms Mol Biol 2007; 2:11. [PMID: 17880695 PMCID: PMC2148040 DOI: 10.1186/1748-7188-2-11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2007] [Accepted: 09/19/2007] [Indexed: 12/04/2022] Open
Abstract
Background: We present the N-map method, a pairwise and asymmetrical approach which allows us to compare sequences by taking into account evolutionary events that produce shuffled, reversed or repeated elements. Basically, the optimal N-map of a sequence s over a sequence t is the best way of partitioning the first sequence into N parts and placing them, possibly complementary reversed, over the second sequence in order to maximize the sum of their gapless alignment scores. Results: We introduce an algorithm computing an optimal N-map with time complexity O (|s| × |t| × N) using O (|s| × |t| × N) memory space. Among all the numbers of parts taken in a reasonable range, we select the value N for which the optimal N-map has the most significant score. To evaluate this significance, we study the empirical distributions of the scores of optimal N-maps and show that they can be approximated by normal distributions with a reasonable accuracy. We test the functionality of the approach over random sequences on which we apply artificial evolutionary events. Practical Application: The method is illustrated with four case studies of pairs of sequences involving non-standard evolutionary events.
Collapse
Affiliation(s)
- Gilles Didier
- Institut de Mathématiques de Luminy, 163 avenue de Luminy, Case 907, 13288 Marseille Cedex 9, France
| | | |
Collapse
|
8
|
Ekman D, Björklund AK, Elofsson A. Quantification of the elevated rate of domain rearrangements in metazoa. J Mol Biol 2007; 372:1337-48. [PMID: 17689563 DOI: 10.1016/j.jmb.2007.06.022] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2007] [Revised: 06/07/2007] [Accepted: 06/08/2007] [Indexed: 11/24/2022]
Abstract
Most eukaryotic proteins consist of multiple domains created through gene fusions or internal duplications. The most frequent change of a domain architecture (DA) is insertion or deletion of a domain at the N or C terminus. Still, the mechanisms underlying the evolution of multidomain proteins are not very well studied. Here, we have studied the evolution of multidomain architectures (MDA), guided by evolutionary information in the form of a phylogenetic tree. Our results show that Pfam domain families and MDAs have been created with comparable rates (0.1-1 per million years (My)). The major changes in DA evolution have occurred in the process of multicellularization and within the metazoan lineage. In contrast, creation of domains seems to have been frequent already in the early evolution. Furthermore, most of the architectures have been created from older domains or architectures, whereas novel domains are mainly found in single-domain proteins. However, a particular group of exon-bordering domains may have contributed to the rapid evolution of novel multidomain proteins in metazoan organisms. Finally, MDAs have evolved predominantly through insertions of domains, whereas domain deletions are less common. In conclusion, the rate of creation of multidomain proteins has accelerated in the metazoan lineage, which may partly be explained by the frequent insertion of exon-bordering domains into new architectures. However, our results indicate that other factors have contributed as well.
Collapse
Affiliation(s)
- Diana Ekman
- Stockholm Bioinformatics Center, SCFAB, Stockholm University, SE-10691 Stockholm, Sweden
| | | | | |
Collapse
|
9
|
Abstract
Horizontal gene transfer (HGT) is thought to play an important role in the evolution of species and innovation of genomes. There have been many convincing evidences for HGT for specific genes or gene families, but there has been no estimate of the global extent of HGT. Here, we present a method of identifying HGT events within a given protein family and estimate the global extent of HGT in all curated protein domain families ( approximately 8,000) listed in the Pfam database. The results suggest four conclusions: (i) for all protein domain families in Pfam, the fixation of genes horizontally transferred is not a rampant phenomenon between organisms with substantial phylogenetic separations (1.1-9.7% of Pfam families surveyed at three taxonomic ranges studied show indication of HGT); (ii) however, at the level of domains, >50% of Archaea have one or more protein domains acquired by HGT, and nearly 30-50% of Bacteria did the same when examined at three taxonomic ranges. But, the equivalent value for Eukarya is <10%; (iii) HGT will have very little impact in the construction of organism phylogeny, when the construction methods use whole genomes, large numbers of common genes, or SSU rRNAs; and (iv) there appears to be no strong preference of HGT for protein families of particular cellular or molecular functions.
Collapse
Affiliation(s)
- In-Geol Choi
- Physical Biosciences Division, Lawrence Berkeley National Laboratory and Department of Chemistry, University of California, Berkeley, CA 94720
| | - Sung-Hou Kim
- Physical Biosciences Division, Lawrence Berkeley National Laboratory and Department of Chemistry, University of California, Berkeley, CA 94720
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
10
|
Weiner J, Beaussart F, Bornberg-Bauer E. Domain deletions and substitutions in the modular protein evolution. FEBS J 2006; 273:2037-47. [PMID: 16640566 DOI: 10.1111/j.1742-4658.2006.05220.x] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The main mechanisms shaping the modular evolution of proteins are gene duplication, fusion and fission, recombination and loss of fragments. While a large body of research has focused on duplications and fusions, we concentrated, in this study, on how domains are lost. We investigated motif databases and introduced a measure of protein similarity that is based on domain arrangements. Proteins are represented as strings of domains and comparison was based on the classic dynamic alignment scheme. We found that domain losses and duplications were more frequent at the ends of proteins. We showed that losses can be explained by the introduction of start and stop codons which render the terminal domains nonfunctional, such that further shortening, until the whole domain is lost, is not evolutionarily selected against. We demonstrated that domains which also occur as single-domain proteins are less likely to be lost at the N terminus and in the middle, than at the C terminus. We conclude that fission/fusion events with single-domain proteins occur mostly at the C terminus. We found that domain substitutions are rare, in particular in the middle of proteins. We also showed that many cases of substitutions or losses result from erroneous annotations, but we were also able to find courses of evolutionary events where domains vanish over time. This is explained by a case study on the bacterial formate dehydrogenases.
Collapse
Affiliation(s)
- January Weiner
- Division of Bioinformatics, School of Biological Sciences, The Westfalian Wilhelms University of Münster, Germany
| | | | | |
Collapse
|
11
|
Schilders G, van Dijk E, Raijmakers R, Pruijn GJM. Cell and Molecular Biology of the Exosome: How to Make or Break an RNA. INTERNATIONAL REVIEW OF CYTOLOGY 2006; 251:159-208. [PMID: 16939780 DOI: 10.1016/s0074-7696(06)51005-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The identification and characterization of the exosome complex has shown that the exosome is a complex of 3' --> 5' exoribonucleases that plays a key role in the processing and degradation of a wide variety of RNA substrates. Advances in the understanding of exosome function have led to the identification of numerous cofactors that are required for a selective recruitment of the exosome to substrate RNAs, for their structural alterations to facilitate degradation, and to aid in their complete degradation/processing. Structural data obtained by two-hybrid interaction analyses and X-ray crystallography show that the core of the exosome adopts a doughnut-like structure and demonstrates that probably not all exosome subunits are active exoribonucleases. Despite all data obtained on the structure and function of the exosome during the last decade, there are still a lot of unanswered questions. What is the molecular mechanism by which cofactors select and target substrate RNAs to the exosome and modulate its function for correct processing or degradation? How can the exosome discriminate between processing or degradation of a specific substrate RNA? What is the precise structure of exosome subunits and how do they contribute to its function? Here we discuss studies that provide some insight to these questions and speculate on the mechanisms that control the exosome.
Collapse
Affiliation(s)
- Geurt Schilders
- Department of Biochemistry, Nijmegen Center for Molecular Life Sciences, Radboud University Nijmegen, Nijmegen, The Netherlands
| | | | | | | |
Collapse
|
12
|
Mulvenna JP, Foley FM, Craik DJ. Discovery, structural determination, and putative processing of the precursor protein that produces the cyclic trypsin inhibitor sunflower trypsin inhibitor 1. J Biol Chem 2005; 280:32245-53. [PMID: 16036912 DOI: 10.1074/jbc.m506060200] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Backbone-cyclized proteins are becoming increasingly well known, although the mechanism by which they are processed from linear precursors is poorly understood. In this report the sequence and structure of the linear precursor of a cyclic trypsin inhibitor, sunflower trypsin inhibitor 1 (SFTI-1) from sunflower seeds, is described. The structure indicates that the major elements of the reactive site loop of SFTI-1 are present before processing. This may have importance for a protease-mediated cyclizing reaction as the rigidity of SFTI-1 may drive the equilibrium of the reaction catalyzed by proteolytic enzymes toward the formation of a peptide bond rather than the normal cleavage reaction. The occurrence of residues in the SFTI-1 precursor susceptible to cleavage by asparaginyl proteases strengthens theories that involve this enzyme in the processing of SFTI-1 and further implicates it in the processing of another family of plant cyclic proteins, the cyclotides. The precursor reported here also indicates that despite strong active site sequence homology, SFTI-1 has no other similarities with the Bowman-Birk trypsin inhibitors, presenting interesting evolutionary questions.
Collapse
Affiliation(s)
- Jason P Mulvenna
- Institute for Molecular Bioscience, Australian Research Council Center for Functional and Applied Genomics, University of Queensland, Brisbane
| | | | | |
Collapse
|
13
|
Moore BD. Bifunctional and moonlighting enzymes: lighting the way to regulatory control. TRENDS IN PLANT SCIENCE 2004; 9:221-8. [PMID: 15130547 DOI: 10.1016/j.tplants.2004.03.005] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Affiliation(s)
- Brandon d Moore
- Department of Genetics, Biochemistry, and Life Science Studies, Clemson University, Clemson, SC 29634, USA.
| |
Collapse
|
14
|
Abstract
Protein-related information is more accumulated rather than reduced to a synthetic view. Itemising properties of protein sequences is informative, so is the list of ingredients to do some cooking, but without a recipe, that is, quantification and chronology, understanding is incomplete. If the goal of accumulating information is to discover or reveal the function and related biochemical mechanisms, information has to be weighed and ordered. As a guideline, the weight of a piece of information should reflect how often it consistently occurs in various contexts. We propose a common sense approach to quantify and put data and information into perspective. Complete bacterial proteomes are individually mapped with the Pfam-A database of domains and protein family signatures in an attempt to assess the modularity of proteins at the level of a single proteome and the implications of a modular description of proteins for a functional interpretation. Poorly annotated proteins in the most documented bacteria (E. coli and B. subtilis) were considered in an attempt to formulate hypothesis on the basis of domain/module content.
Collapse
|