26
|
Ponting CP. Proteins of the endoplasmic-reticulum-associated degradation pathway: domain detection and function prediction. Biochem J 2000; 351 Pt 2:527-35. [PMID: 11023840 PMCID: PMC1221390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Sequence database searches, using iterative-profile and Hidden-Markov-model approaches, were used to detect hitherto-undetected homologues of proteins that regulate the endoplasmic reticulum (ER)-associated degradation pathway. The translocon-associated subunit Sec63p (Sec=secretory) was shown to contain a domain of unknown function found twice in several Brr2p-like RNA helicases (Brr2=bad response to refrigeration 2). Additionally, Cue1p (Cue=coupling of ubiquitin conjugation to ER degradation), a yeast protein that recruits the ubiquitin-conjugating (UBC) enzyme Ubc7p to an ER-associated complex, was found to be one of a large family of putative scaffolding-domain-containing proteins that include the autocrine motility factor receptor and fungal Vps9p (Vps=vacuolar protein sorting). Two other yeast translocon-associated molecules, Sec72p and Hrd3p (Hrd=3-hydroxy-3-methylglutaryl-CoA reductase degradation), were shown to contain multiple tetratricopeptide-repeat-like sequences. From this observation it is suggested that Sec72p associates with a heat-shock protein, Hsp70, in a manner analogous to that known for Hop (Hsp70/Hsp90 organizing protein). Finally, the luminal portion of Ire1p (Ire=high inositol-requiring), thought to convey the sensing function of this transmembrane kinase and endoribonuclease, was shown to contain repeats similar to those in beta-propeller proteins. This finding hints at the mechanism by which Ire1p may sense extended unfolded proteins at the expense of compact folded molecules.
Collapse
|
27
|
Ponting CP, Russell RB. Identification of distant homologues of fibroblast growth factors suggests a common ancestor for all beta-trefoil proteins. J Mol Biol 2000; 302:1041-7. [PMID: 11183773 DOI: 10.1006/jmbi.2000.4087] [Citation(s) in RCA: 97] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Determination of the structures of fibroblast growth factors and interleukin-1s has previously revealed that they both adopt a beta-trefoil fold, similar to those found in Kunitz soybean trypsin inhibitors, ricin-like toxins, plant agglutinins and hisactophilin. These families possess distinct functions and occur in different subcellular localisations, and they appear to lack significant similarities in their sequences, ligands and modes of ligand binding. We have analysed the significance of sequence identities observed after structure alignment and provide statistical evidence that these beta-trefoil proteins are all homologues, having arisen from a common ancestor. In addition, we have explored the sequence space of all beta-trefoil proteins and have determined that the actin-binding proteins fascins, and other proteins of unknown function, are beta-trefoil family homologues. Unlike other beta-trefoil proteins, the triplicated repeats in each of the four beta-trefoil domains of fascins are significantly similar in sequence. This hints at how the beta-trefoil fold arose from the duplication of an ancestral gene encoding a homotrimeric single-repeat protein. The combined analysis of structure and sequence databases for detecting significant similarities is suggested as a highly sensitive approach to determining the common ancestry of extremely divergent homologues.
Collapse
|
28
|
Ponting CP. Novel eIF4G domain homologues linking mRNA translation with nonsense-mediated mRNA decay. Trends Biochem Sci 2000; 25:423-6. [PMID: 10973054 DOI: 10.1016/s0968-0004(00)01628-5] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
29
|
Rea S, Eisenhaber F, O'Carroll D, Strahl BD, Sun ZW, Schmid M, Opravil S, Mechtler K, Ponting CP, Allis CD, Jenuwein T. Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 2000; 406:593-9. [PMID: 10949293 DOI: 10.1038/35020506] [Citation(s) in RCA: 1992] [Impact Index Per Article: 83.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The organization of chromatin into higher-order structures influences chromosome function and epigenetic gene regulation. Higher-order chromatin has been proposed to be nucleated by the covalent modification of histone tails and the subsequent establishment of chromosomal subdomains by non-histone modifier factors. Here we show that human SUV39H1 and murine Suv39h1--mammalian homologues of Drosophila Su(var)3-9 and of Schizosaccharomyces pombe clr4--encode histone H3-specific methyltransferases that selectively methylate lysine 9 of the amino terminus of histone H3 in vitro. We mapped the catalytic motif to the evolutionarily conserved SET domain, which requires adjacent cysteine-rich regions to confer histone methyltransferase activity. Methylation of lysine 9 interferes with phosphorylation of serine 10, but is also influenced by pre-existing modifications in the amino terminus of H3. In vivo, deregulated SUV39H1 or disrupted Suv39h activity modulate H3 serine 10 phosphorylation in native chromatin and induce aberrant mitotic divisions. Our data reveal a functional interdependence of site-specific H3 tail modifications and suggest a dynamic mechanism for the regulation of higher-order chromatin.
Collapse
|
30
|
Ponting CP, Schultz J, Copley RR, Andrade MA, Bork P. Evolution of domain families. ADVANCES IN PROTEIN CHEMISTRY 2000; 54:185-244. [PMID: 10829229 DOI: 10.1016/s0065-3233(00)54007-8] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
31
|
Schultz J, Doerks T, Ponting CP, Copley RR, Bork P. More than 1,000 putative new human signalling proteins revealed by EST data mining. Nat Genet 2000; 25:201-4. [PMID: 10835637 DOI: 10.1038/76069] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Cloning procedures aided by homology searches of EST databases have accelerated the pace of discovery of new genes, but EST database searching remains an involved and onerous task. More than 1.6 million human EST sequences have been deposited in public databases, making it difficult to identify ESTs that represent new genes. Compounding the problems of scale are difficulties in detection associated with a high sequencing error rate and low sequence similarity between distant homologues. We have developed a new method, coupling BLAST-based searches with a domain identification protocol, that filters candidate homologues. Application of this method in a large-scale analysis of 100 signalling domain families has led to the identification of ESTs representing more than 1,000 novel human signalling genes. The 4,206 publicly available ESTs representing these genes are a valuable resource for rapid cloning of novel human signalling proteins. For example, we were able to identify ESTs of at least 106 new small GTPases, of which 6 are likely to belong to new subfamilies. In some cases, further analyses of genomic DNA led to the discovery of previously unidentified full-length protein sequences. This is exemplified by the in silico cloning (prediction of a gene product sequence using only genomic and EST sequence data) of a new type of GTPase with two catalytic domains.
Collapse
|
32
|
Andrade MA, Ponting CP, Gibson TJ, Bork P. Homology-based method for identification of protein repeats using statistical significance estimates. J Mol Biol 2000; 298:521-37. [PMID: 10772867 DOI: 10.1006/jmbi.2000.3684] [Citation(s) in RCA: 156] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Short protein repeats, frequently with a length between 20 and 40 residues, represent a significant fraction of known proteins. Many repeats appear to possess high amino acid substitution rates and thus recognition of repeat homologues is highly problematic. Even if the presence of a certain repeat family is known, the exact locations and the number of repetitive units often cannot be determined using current methods. We have devised an iterative algorithm based on optimal and sub-optimal score distributions from profile analysis that estimates the significance of all repeats that are detected in a single sequence. This procedure allows the identification of homologues at alignment scores lower than the highest optimal alignment score for non-homologous sequences. The method has been used to investigate the occurrence of eleven families of repeats in Saccharomyces cerevisiae, Caenorhabditis elegans and Homo sapiens accounting for 1055, 2205 and 2320 repeats, respectively. For these examples, the method is both more sensitive and more selective than conventional homology search procedures. The method allowed the detection in the SwissProt database of more than 2000 previously unrecognised repeats belonging to the 11 families. In addition, the method was used to merge several repeat families that previously were supposed to be distinct, indicating common phylogenetic origins for these families.
Collapse
|
33
|
Ponting CP. Novel repeats in ryanodine and IP3 receptors and protein O-mannosyltransferases. Trends Biochem Sci 2000; 25:48-50. [PMID: 10664581 DOI: 10.1016/s0968-0004(99)01513-3] [Citation(s) in RCA: 63] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
34
|
Hodgkin MN, Masson MR, Powner D, Saqib KM, Ponting CP, Wakelam MJ. Phospholipase D regulation and localisation is dependent upon a phosphatidylinositol 4,5-biphosphate-specific PH domain. Curr Biol 2000; 10:43-6. [PMID: 10660303 DOI: 10.1016/s0960-9822(99)00264-x] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The signalling pathway leading, for example, to actin cytoskeletal reorganisation, secretion or superoxide generation involves phospholipase D (PLD)-catalysed hydrolysis of phosphatidylcholine to generate phosphatidic acid, which appears to mediate the messenger functions of this pathway. Two PLD genes (PLD1 and PLD2) with similar domain structures have been doned and progress has been made in identifying the protein regulators of PLD1 activation, for example Arf and Rho family members. The activities of both PLD isoforms are dependent on phosphatidylinositol 4,5-bisphosphate (PI(4,5)P2) and our sequence analysis suggested the presence of a pleckstrin homology (PH) domain in PLD1, although its absence has also been daimed. Investigation of the inositide dependence showed that a bis-phosphorylated lipid with a vicinal pair of phosphates was required for PLD1 activity. Furthermore, PLD1 bound specifically and with high affinity to lipid surfaces containing PI(4,5)P2 independently of the substrate phosphatidylcholine, suggesting a key role for the PH domain in PLD function. Importantly, a glutathione-S-transferase (GST) fusion protein comprising GST and the PH domain of PLD1 (GST-PLD1-PH) also bound specifically to supported lipid monolayers containing PI(4,5)P2. Point mutations within the PLD1 PH domain inhibited enzyme activity, whereas deletion of the domain both inhibited enzyme activity and disrupted normal PLD1 localisation. Thus, the functional PH domain regulates PLD by mediating its interaction with polyphosphoinositide-containing membranes; this might also induce a conformational change, thereby regulating catalytic activity.
Collapse
|
35
|
Schultz J, Copley RR, Doerks T, Ponting CP, Bork P. SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res 2000; 28:231-4. [PMID: 10592234 PMCID: PMC102444 DOI: 10.1093/nar/28.1.231] [Citation(s) in RCA: 959] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
SMART (a Simple Modular Architecture Research Tool) allows the identification and annotation of genetically mobile domains and the analysis of domain architectures (http://SMART.embl-heidelberg.de ). More than 400 domain families found in signalling, extra-cellular and chromatin-associated proteins are detectable. These domains are extensively annotated with respect to phyletic distributions, functional class, tertiary structures and functionally important residues. Each domain found in a non-redundant protein database as well as search parameters and taxonomic information are stored in a relational database system. User interfaces to this database allow searches for proteins containing specific combinations of domains in defined taxa.
Collapse
|
36
|
|
37
|
Schäffer AA, Wolf YI, Ponting CP, Koonin EV, Aravind L, Altschul SF. IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices. Bioinformatics 1999; 15:1000-11. [PMID: 10745990 DOI: 10.1093/bioinformatics/15.12.1000] [Citation(s) in RCA: 231] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Many studies have shown that database searches using position-specific score matrices (PSSMs) or profiles as queries are more effective at identifying distant protein relationships than are searches that use simple sequences as queries. One popular program for constructing a PSSM and comparing it with a database of sequences is Position-Specific Iterated BLAST (PSI-BLAST). RESULTS This paper describes a new software package, IMPALA, designed for the complementary procedure of comparing a single query sequence with a database of PSI-BLAST-generated PSSMs. We illustrate the use of IMPALA to search a database of PSSMs for protein folds, and one for protein domains involved in signal transduction. IMPALA's sensitivity to distant biological relationships is very similar to that of PSI-BLAST. However, IMPALA employs a more refined analysis of statistical significance and, unlike PSI-BLAST, guarantees the output of the optimal local alignment by using the rigorous Smith-Waterman algorithm. Also, it is considerably faster when run with a large database of PSSMs than is BLAST or PSI-BLAST when run against the complete non-redundant protein database.
Collapse
|
38
|
Ponting CP, Pallen MJ. beta-propeller repeats and a PDZ domain in the tricorn protease: predicted self-compartmentalisation and C-terminal polypeptide-binding strategies of substrate selection. FEMS Microbiol Lett 1999; 179:447-51. [PMID: 10518749 DOI: 10.1111/j.1574-6968.1999.tb08761.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Prokaryotic proteases demonstrate a variety of substrate-selection strategies that prevent uncontrolled protein degradation. Proteasomes and ClpXP-like proteases form oligomeric structures that exclude large substrates from central solvated chambers containing their active sites. Monomeric prolyl oligopeptidases have been shown to contain beta-propeller structures that similarly reduce access to their catalytic residues. By contrast, Tsp-like enzymes contain PDZ domains that are thought to specifically target C-terminal polypeptides. We have investigated the sequence of Thermoplasma acidophilum tricorn protease using recently-developed database search methods. The tricorn protease is known to associate into a 20 hexamer capsid enclosing an extremely large cavity that is 37 nm in diameter. It is unknown, however, how this enzyme selects its small oligopeptide substrates. Our results demonstrate the presence in tricorn protease of a PDZ domain and two predicted six-bladed beta-propeller domains. We suggest that the PDZ domain is involved in targeting non-polar C-terminal peptides, similar to those generated by the T. acidophilum proteasome, whereas the beta-propeller domains serve to exclude large substrates from the tricorn protease active site in a similar manner to that previously indicated for prolyl oligopeptidase.
Collapse
|
39
|
|
40
|
Barnes MR, Russell RB, Copley RR, Ponting CP, Bork P, Cumberledge S, Reichsman F, Moore HM. A lipid-binding domain in Wnt: a case of mistaken identity? Curr Biol 1999; 9:R717-9. [PMID: 10531014 DOI: 10.1016/s0960-9822(99)80465-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
41
|
Ponting CP. Raf-like Ras/Rap-binding domains in RGS12- and still-life-like signalling proteins. J Mol Med (Berl) 1999; 77:695-8. [PMID: 10606204 DOI: 10.1007/s001099900054] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Ras proteins play critical roles in regulating cell growth and differentiation, and mutated Ras genes are expressed in a variety of human cancers. Consequently, much interest has centered on the binding partners of Ras, including the Ras-binding domain (RBD) of Raf kinase. Here evidence is presented that domains homologous to the Raf RBD are present in tandem in RGS12, RGS14 and LOCO, and singly in molecules similar to mouse Tiam-1. In addition, RGS12, RGS14 and LOCO are shown to contain single "LGN motifs" that are guanine nucleotide exchange factors specific for the alpha-subunit of G proteins. These findings indicate "cross-talk" interactions between signalling pathways involving Ras and Rap and pathways involving Rho, Rac and G alpha GTPases.
Collapse
|
42
|
|
43
|
Mohaghegh P, Rodrigues NR, Owen N, Ponting CP, Le TT, Burghes AH, Davies KE. Analysis of mutations in the tudor domain of the survival motor neuron protein SMN. Eur J Hum Genet 1999; 7:519-25. [PMID: 10439956 DOI: 10.1038/sj.ejhg.5200346] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Autosomal recessive childhood onset spinal muscular atrophy (SMA) is a leading cause of infant mortality caused by mutations in the survival motor neuron (SMN) gene. The SMN protein is involved in RNA processing and is localised in structures called GEMs in the nucleus. Nothing is yet understood about why mutations in SMN gene result in the selective motor neuron loss observed in patients. The SMN protein domains conserved across several species may indicate functionally significant regions. Exon 3 of SMN contains homology to a tudor domain, where a Type I SMA patient has been reported to harbour a missense mutation. We have generated missense mutants in this region of SMN and have tested their ability to form GEMs when transfected into HeLa cells. Our results show such mutant SMN proteins still localise to GEMs. Furthermore, exon 7 deleted SMN protein appears to exert a dominant negative effect on localisation of endogenous SMN protein. However, exon 3 mutant protein and exon 5 deleted protein exert no such effect.
Collapse
|
44
|
Aravind L, Ponting CP. The cytoplasmic helical linker domain of receptor histidine kinase and methyl-accepting proteins is common to many prokaryotic signalling proteins. FEMS Microbiol Lett 1999; 176:111-6. [PMID: 10418137 DOI: 10.1111/j.1574-6968.1999.tb13650.x] [Citation(s) in RCA: 295] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Mutations in the cytoplasmic linker regions of receptor histidine kinase and chemoreceptor proteins have been shown previously to significantly impair receptor functions. Here we demonstrate significant sequence similarities between these regions in numerous histidine kinases, methyl-accepting proteins, adenylyl cyclases and other prokaryotic signalling proteins. It is suggested that these 'HAMP domains' possess roles of regulating the phosphorylation or methylation of homodimeric receptors by transmitting the conformational changes in periplasmic ligand-binding domains to cytoplasmic signalling kinase and methyl-acceptor domains.
Collapse
|
45
|
Ponting CP, Aravind L, Schultz J, Bork P, Koonin EV. Eukaryotic signalling domain homologues in archaea and bacteria. Ancient ancestry and horizontal gene transfer. J Mol Biol 1999; 289:729-45. [PMID: 10369758 DOI: 10.1006/jmbi.1999.2827] [Citation(s) in RCA: 260] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Phyletic distributions of eukaryotic signalling domains were studied using recently developed sensitive methods for protein sequence analysis, with an emphasis on the detection and accurate enumeration of homologues in bacteria and archaea. A major difference was found between the distributions of enzyme families that are typically found in all three divisions of cellular life and non-enzymatic domain families that are usually eukaryote-specific. Previously undetected bacterial homologues were identified for# plant pathogenesis-related proteins, Pad1, von Willebrand factor type A, src homology 3 and YWTD repeat-containing domains. Comparisons of the domain distributions in eukaryotes and prokaryotes enabled distinctions to be made between the domains originating prior to the last common ancestor of all known life forms and those apparently originating as consequences of horizontal gene transfer events. A number of transfers of signalling domains from eukaryotes to bacteria were confidently identified, in contrast to only a single case of apparent transfer from eukaryotes to archaea.
Collapse
|
46
|
Abstract
The complete sequence of the nematode worm Caenorhabditis elegans contains the genetic machinery that is required to undertake the core biological processes of single cells. However, the genome also encodes proteins that are associated with multicellularity, as well as others that are lineage-specific expansions of phylogenetically widespread families and yet more that are absent in non-nematodes. Ongoing analysis is beginning to illuminate the similarities and differences among human proteins and proteins that are encoded by the genomes of the multicellular worm and the unicellular yeast, and will be essential in determining the reliability of transferring experimental data among phylogenetically distant species.
Collapse
|
47
|
Ponting CP, Bork P, Schultz J, Aravind L. No Sec7-homology domain in guanine-nucleotide-exchange factors that act on Ras and Rho. Trends Biochem Sci 1999; 24:177-8. [PMID: 10322430 DOI: 10.1016/s0968-0004(99)01379-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
48
|
|
49
|
Aravind L, Neuwald AF, Ponting CP. Sec14p-like domains in NF1 and Dbl-like proteins indicate lipid regulation of Ras and Rho signaling. Curr Biol 1999; 9:R195-7. [PMID: 10209105 DOI: 10.1016/s0960-9822(99)80127-4] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
50
|
|