1
|
Ramachandran P, Dobie R, Wilson-Kanamori JR, Dora EF, Henderson BEP, Luu NT, Portman JR, Matchett KP, Brice M, Marwick JA, Taylor RS, Efremova M, Vento-Tormo R, Carragher NO, Kendall TJ, Fallowfield JA, Harrison EM, Mole DJ, Wigmore SJ, Newsome PN, Weston CJ, Iredale JP, Tacke F, Pollard JW, Ponting CP, Marioni JC, Teichmann SA, Henderson NC. Resolving the fibrotic niche of human liver cirrhosis at single-cell level. Nature 2019; 575:512-518. [PMID: 31597160 PMCID: PMC6876711 DOI: 10.1038/s41586-019-1631-3] [Citation(s) in RCA: 786] [Impact Index Per Article: 157.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Accepted: 09/04/2019] [Indexed: 12/13/2022]
Abstract
Liver cirrhosis is a major cause of death worldwide and is characterized by extensive fibrosis. There are currently no effective antifibrotic therapies available. To obtain a better understanding of the cellular and molecular mechanisms involved in disease pathogenesis and enable the discovery of therapeutic targets, here we profile the transcriptomes of more than 100,000 single human cells, yielding molecular definitions for non-parenchymal cell types that are found in healthy and cirrhotic human liver. We identify a scar-associated TREM2+CD9+ subpopulation of macrophages, which expands in liver fibrosis, differentiates from circulating monocytes and is pro-fibrogenic. We also define ACKR1+ and PLVAP+ endothelial cells that expand in cirrhosis, are topographically restricted to the fibrotic niche and enhance the transmigration of leucocytes. Multi-lineage modelling of ligand and receptor interactions between the scar-associated macrophages, endothelial cells and PDGFRα+ collagen-producing mesenchymal cells reveals intra-scar activity of several pro-fibrogenic pathways including TNFRSF12A, PDGFR and NOTCH signalling. Our work dissects unanticipated aspects of the cellular and molecular basis of human organ fibrosis at a single-cell level, and provides a conceptual framework for the discovery of rational therapeutic targets in liver cirrhosis.
Collapse
Affiliation(s)
- P Ramachandran
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK.
| | - R Dobie
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - J R Wilson-Kanamori
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - E F Dora
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - B E P Henderson
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - N T Luu
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust and University of Birmingham, Birmingham, UK
- Institute of Immunology and Immunotherapy, University of Birmingham, Birmingham, UK
| | - J R Portman
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - K P Matchett
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - M Brice
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - J A Marwick
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
- Cancer Research UK Edinburgh Centre, MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, Edinburgh, UK
| | - R S Taylor
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - M Efremova
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - R Vento-Tormo
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - N O Carragher
- Cancer Research UK Edinburgh Centre, MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, Edinburgh, UK
| | - T J Kendall
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
- Division of Pathology, University of Edinburgh, Edinburgh, UK
| | - J A Fallowfield
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
| | - E M Harrison
- Clinical Surgery, University of Edinburgh, Royal Infirmary of Edinburgh, Edinburgh, UK
| | - D J Mole
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
- Clinical Surgery, University of Edinburgh, Royal Infirmary of Edinburgh, Edinburgh, UK
| | - S J Wigmore
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK
- Clinical Surgery, University of Edinburgh, Royal Infirmary of Edinburgh, Edinburgh, UK
| | - P N Newsome
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust and University of Birmingham, Birmingham, UK
- Institute of Immunology and Immunotherapy, University of Birmingham, Birmingham, UK
| | - C J Weston
- NIHR Birmingham Biomedical Research Centre, University Hospitals Birmingham NHS Foundation Trust and University of Birmingham, Birmingham, UK
- Institute of Immunology and Immunotherapy, University of Birmingham, Birmingham, UK
| | - J P Iredale
- Office of the Vice Chancellor, Beacon House and National Institute for Health Research, Biomedical Research Centre, Bristol, UK
| | - F Tacke
- Department of Hepatology and Gastroenterology, Charité University Medical Center, Berlin, Germany
| | - J W Pollard
- MRC Centre for Reproductive Health, The Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK
- Department of Developmental and Molecular Biology, Albert Einstein College of Medicine, New York, NY, USA
| | - C P Ponting
- MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, Edinburgh, UK
| | - J C Marioni
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK
- Cancer Research UK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge, UK
| | - S A Teichmann
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK
- Theory of Condensed Matter Group, The Cavendish Laboratory, University of Cambridge, Cambridge, UK
| | - N C Henderson
- University of Edinburgh Centre for Inflammation Research, The Queen's Medical Research Institute, Edinburgh BioQuarter, Edinburgh, UK.
| |
Collapse
|
2
|
Larcombe L, Hendricusdottir R, Attwood TK, Bacall F, Beard N, Bellis LJ, Dunn WB, Hancock JM, Nenadic A, Orengo C, Overduin B, Sansone SA, Thurston M, Viant MR, Winder CL, Goble CA, Ponting CP, Rustici G. ELIXIR-UK role in bioinformatics training at the national level and across ELIXIR. F1000Res 2017; 6. [PMID: 28781748 PMCID: PMC5521157 DOI: 10.12688/f1000research.11837.1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/19/2017] [Indexed: 11/20/2022] Open
Abstract
ELIXIR-UK is the UK node of ELIXIR, the European infrastructure for life science data. Since its foundation in 2014, ELIXIR-UK has played a leading role in training both within the UK and in the ELIXIR Training Platform, which coordinates and delivers training across all ELIXIR members. ELIXIR-UK contributes to the Training Platform’s coordination and supports the development of training to address key skill gaps amongst UK scientists. As part of this work it acts as a conduit for nationally-important bioinformatics training resources to promote their activities to the ELIXIR community. ELIXIR-UK also leads ELIXIR’s flagship Training Portal, TeSS, which collects information about a diverse range of training and makes it easily accessible to the community. ELIXIR-UK also works with others to provide key digital skills training, partnering with the Software Sustainability Institute to provide Software Carpentry training to the ELIXIR community and to establish the Data Carpentry initiative, and taking a lead role amongst national stakeholders to deliver the StaTS project – a coordinated effort to drive engagement with training in statistics.
Collapse
Affiliation(s)
- L Larcombe
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - R Hendricusdottir
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - T K Attwood
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - F Bacall
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - N Beard
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - L J Bellis
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - W B Dunn
- Birmingham Metabolomics Training Centre, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
| | | | - A Nenadic
- The Software Sustainability Institute, University of Manchester, Manchester, M13 9PL, UK
| | - C Orengo
- University College London, London, WC1E 6BT, UK
| | - B Overduin
- Edinburgh Genomics, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - S-A Sansone
- Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, UK
| | - M Thurston
- Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, UK
| | - M R Viant
- Birmingham Metabolomics Training Centre, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
| | - C L Winder
- Birmingham Metabolomics Training Centre, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
| | - C A Goble
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - C P Ponting
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - G Rustici
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| |
Collapse
|
3
|
Rodgers W, Ancliff P, Ponting CP, Sanchez-Pulido L, Burns S, Hayman M, Kimonis V, Sebire N, Bulstrode N, Harper JI. Squamous cell carcinoma in a child with Clericuzio-type poikiloderma with neutropenia. Br J Dermatol 2012; 168:665-7. [PMID: 22924337 DOI: 10.1111/bjd.12016] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
4
|
Coward K, Ponting CP, Chang HY, Hibbitt O, Savolainen P, Jones KT, Parrington J. Phospholipase Cζ, the trigger of egg activation in mammals, is present in a non-mammalian species. Reproduction 2005; 130:157-63. [PMID: 16049153 DOI: 10.1530/rep.1.00707] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The activation of the egg to begin development into an embryo is triggered by a sperm-induced increase in intracellular egg Ca2+. There has been much controversy about how the sperm induces this fundamental developmental event, but recent studies suggest that, in mammals, egg activation is triggered by a testis-specific phospholipase C: PLCζ. Since the discovery of PLCζ, it has been unclear whether its role in triggering egg activation is common to all vertebrates, or is confined to mammals. Here, we demonstrate for the first time that PLCζ is present in a non-mammalian vertebrate. Using genomic and cDNA databases, we have identified the cDNA encoding a PLCζ orthologue in the domestic chicken that, like the mammalian isoforms, is a testis-specific gene. The chicken PLCζ cDNA is 2152 bp in size and encodes an open reading frame of 639 amino acids. When injected into mouse oocytes, chicken PLCζ cRNA triggers Ca2+oscillations, indicating that it has functional properties similar to those of mammalian PLCζ. Our findings suggest that PLCζ may have a universal role in triggering egg activation in vertebrates.
Collapse
Affiliation(s)
- K Coward
- Department of Pharmacology, University of Oxford, Mansfield Road, Oxford OX1 3QT, UK.
| | | | | | | | | | | | | |
Collapse
|
5
|
|
6
|
Vargas JD, Herpers B, McKie AT, Gledhill S, McDonnell J, van den Heuvel M, Davies KE, Ponting CP. Stromal cell-derived receptor 2 and cytochrome b561 are functional ferric reductases. Biochimica et Biophysica Acta (BBA) - Proteins and Proteomics 2003; 1651:116-23. [PMID: 14499595 DOI: 10.1016/s1570-9639(03)00242-5] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Iron has a variety of functions in cellular organisms ranging from electron transport and DNA synthesis to adenosine triphosphate (ATP) and neurotransmitter synthesis. Failure to regulate the homeostasis of iron can lead to cognition and demyelination disorders when iron levels are deficient, and to neurodegenerative disorders when iron is in excess. In this study we show that three members of the b561 family of predicted ferric reductases, namely mouse cytochrome b561 and mouse and fly stromal cell-derived receptor 2 (SDR2), have ferric reductase activity. Given that a fourth member, duodenal cytochrome b (Dcytb), has previously been shown to be a ferric reductase, it is likely that all remaining members of this family also exhibit this activity. Furthermore, we show that the rat sdr2 message is predominantly expressed in the liver and kidney, with low expression in the duodenum. In hypotransferrinaemic (hpx) mice, sdr2 expression in the liver and kidney is reduced, suggesting that it may be regulated by iron. Moreover, we demonstrate the presence of mouse sdr2 in the choroid plexus and in the ependymal cells lining the four ventricles, through in situ hybridization analysis.
Collapse
Affiliation(s)
- J D Vargas
- Department of Human Anatomy and Genetics, University of Oxford, OX1 3QX Oxford, UK
| | | | | | | | | | | | | | | |
Collapse
|
7
|
Vargas JD, Culetto E, Ponting CP, Miguel-Aliaga I, Davies KE, Sattelle DB. Cloning and developmental expression analysis of ltd-1, the Caenorhabditis elegans homologue of the mouse kyphoscoliosis (ky) gene. Mech Dev 2002; 117:289-92. [PMID: 12204272 DOI: 10.1016/s0925-4773(02)00182-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We have characterized the developmental expression pattern of the Caenorhabditis elegans homologue of the mouse ky gene. The Ky protein has a putative key function in muscle development and has homologues in invertebrates, fungi and a cyanobacterium. The C. elegans Ky homologue gene has been named ltd-1 for LIM and transglutaminase domains gene. The LTD-1::GFP construct is expressed in developing hypodermal cells from the twofold stage embryo through adulthood. These data define the ltd-1 gene as a novel marker for C. elegans epithelial cell development.
Collapse
Affiliation(s)
- J D Vargas
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK
| | | | | | | | | | | |
Collapse
|
8
|
Abstract
The evolutionary history of eukaryotic proteins involves rapid sequence divergence, addition and deletion of domains, and fusion and fission of genes. Although the protein repertoires of distantly related species differ greatly, their domain repertoires do not. To account for the great diversity of domain contexts and an unexpected paucity of ortholog conservation, we must categorize the coding regions of completely sequenced genomes into domain families, as well as protein families.
Collapse
Affiliation(s)
- C P Ponting
- MRC Functional Genetics Unit, University of Oxford, Department of Human Anatomy and Genetics, South Parks Road, Oxford OX13QX, UK.
| | | |
Collapse
|
9
|
Ponting CP, Mott R, Bork P, Copley RR. Novel protein domains and repeats in Drosophila melanogaster: insights into structure, function, and evolution. Genome Res 2001; 11:1996-2008. [PMID: 11731489 DOI: 10.1101/gr.198701] [Citation(s) in RCA: 97] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Sequence database searching methods such as BLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and 'nested' homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.
Collapse
MESH Headings
- Amino Acid Sequence/genetics
- Amino Acid Transport Systems, Neutral
- Animals
- Antigens, Differentiation, B-Lymphocyte/chemistry
- Antigens, Differentiation, B-Lymphocyte/genetics
- Antigens, Differentiation, B-Lymphocyte/physiology
- Aspartate-tRNA Ligase/chemistry
- Aspartate-tRNA Ligase/genetics
- Aspartate-tRNA Ligase/physiology
- Cystinosis/genetics
- Drosophila Proteins/chemistry
- Drosophila Proteins/genetics
- Drosophila Proteins/physiology
- Drosophila melanogaster/chemistry
- Drosophila melanogaster/enzymology
- Drosophila melanogaster/genetics
- Evolution, Molecular
- Exons/genetics
- Eye Proteins
- GTP-Binding Proteins
- Gene Duplication
- Glycoproteins
- Glycoside Hydrolases/chemistry
- Glycoside Hydrolases/genetics
- Glycoside Hydrolases/physiology
- Histocompatibility Antigens Class II/chemistry
- Histocompatibility Antigens Class II/genetics
- Histocompatibility Antigens Class II/physiology
- Humans
- Insect Proteins/chemistry
- Insect Proteins/genetics
- Insect Proteins/physiology
- Intracellular Signaling Peptides and Proteins
- Membrane Proteins/chemistry
- Membrane Proteins/genetics
- Membrane Proteins/physiology
- Membrane Transport Proteins
- Molecular Sequence Data
- Muscular Dystrophies/genetics
- Protein Structure, Secondary
- Protein Structure, Tertiary
- Proteins/chemistry
- Proteins/genetics
- Proteins/physiology
- Repetitive Sequences, Amino Acid
- Retinitis Pigmentosa/genetics
- Signal Transduction/genetics
- Species Specificity
- Tandem Repeat Sequences
Collapse
Affiliation(s)
- C P Ponting
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, UK.
| | | | | | | |
Collapse
|
10
|
Emes RD, Ponting CP. A new sequence motif linking lissencephaly, Treacher Collins and oral-facial-digital type 1 syndromes, microtubule dynamics and cell migration. Hum Mol Genet 2001; 10:2813-20. [PMID: 11734546 DOI: 10.1093/hmg/10.24.2813] [Citation(s) in RCA: 147] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A previously unidentified sequence motif has been identified in the products of genes mutated in Miller-Dieker lissencephaly, Treacher Collins, oral-facial-digital type 1 and contiguous syndrome ocular albinism with late onset sensorineural deafness syndromes. An additional homologous motif was detected in a gene product fused to the fibroblast growth factor receptor type 1 in patients with an atypical stem cell myeloproliferative disorder. In total, over 100 eukaryotic intracellular proteins are shown to possess a LIS1 homology (LisH) motif, including several katanin p60 subunits, muskelin, tonneau, LEUNIG, Nopp140, aimless and numerous WD repeat-containing beta-propeller proteins. It is suggested that LisH motifs contribute to the regulation of microtubule dynamics, either by mediating dimerization, or else by binding cytoplasmic dynein heavy chain or microtubules directly. The predicted secondary structure of LisH motifs, and their occurrence in homologues of Gbeta beta-propeller subunits, suggests that they are analogues of Ggamma subunits, and might associate with the periphery of beta-propeller domains. The finding of LisH motifs in both treacle and Nopp140 reinforces previous observations of functional similarities between these nucleolar proteins. Uncharacterized LisH motif-containing proteins represent candidates for other diseases associated with aberrant microtubule dynamics and defects of cell migration, nucleokinesis or chromosome segregation.
Collapse
Affiliation(s)
- R D Emes
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK
| | | |
Collapse
|
11
|
Abstract
The sequencing phase of the human genome project will soon be over. In its wake, repertoires of sequence polymorphisms among the human population are being sampled and a battery of functional genomics projects, from gene and protein expression studies to whole proteome interaction experiments, are generating vast quantities of data. Now that the data, or the means to generate data, are available it is the application of this information in enhancing our understanding of biology that represents the next formidable challenge. Two prominent issues should be considered. First, existing data must be analysed using the best methods available. The prediction of enzymatic activity for bestrophin, whose gene is mutated in Best macular dystrophy, is described in this review. This is an example of the experimentally testable hypotheses that can result from such detailed and exhaustive analyses. Secondly, the torrents of data from high-throughput studies will need to be made more accessible to all using web-based resources that integrate and digest complementary data types. The internet sites that showcase the human genome sequence are blazing a new trail. Ultimately, the success of genome sequencing and functional genomics will be measured not by the quantity and accuracy of raw data generated, but how rapidly they can be harnessed to span the divide between genotype and phenotype.
Collapse
Affiliation(s)
- L Goodstadt
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK
| | | |
Collapse
|
12
|
Abstract
CHROMA annotates multiple protein sequence alignments by consensus to produce formatted and coloured text suitable for incorporation into other documents for publication. The package is designed to be flexible and reliable, and has a simple-to-use graphical user interface running under Microsoft Windows. Both the executables and source code for CHROMA running under Windows and Linux (portable command-line only) are freely available at http://www.lg.ndirect.co.uk/chroma. Software enquiries should be directed to CHROMA@lg.ndirect.co.uk.
Collapse
Affiliation(s)
- L Goodstadt
- MRC Functional Genetics Unit, University of Oxford, Department of Human Anatomy and Genetics, South Parks Road, Oxford OX1 3QX, UK.
| | | |
Collapse
|
13
|
Abstract
One of the defining characteristics of neurodegenerative diseases, including Parkinson's, Alzheimer's and Huntington's diseases, is abnormal accumulations of iron, specifically in affected areas. Following injection of iron in rat brains, a relatively selective lesion of dopamine neurons, similar to parkinsonism, occurs. These observations indicate that Fe(II)-mediated generation of free radical species, by the Fenton reaction, might contribute to the pathoetiology of these diseases. Iron is known to possess multiple roles in the biosynthesis of catecholamines in dopaminergic neurons. These include, as Fe(II), facilitating the production of dopamine from phenylalanine by tyrosine hydroxylase, and as heme, assisting the recycling of ascorbate by cytochrome b-561 required for the generation of norepinephrine from dopamine by dopamine beta-hydroxylase. In this study, it is demonstrated that a human and mouse gene product, stromal cell-derived receptor 2, is a homologue of cytochrome b-561 and duodenal cytochrome b, and is thus predicted to be active as a ferric reductase. Moreover, this protein also contains a domain homologous to the N-terminal regulatory region of dopamine beta-hydroxylase. These findings from sequence analysis lead to a prediction that stromal cell-derived receptor 2 is a catecholamine-regulated ferric reductase active in the brain. Dysfunction of cytochrome b-561 or stromal cell-derived receptor 2, therefore, might predispose individuals to abnormal accumulation of Fe(III) and/or generation of cytotoxic free radicals as a consequence of a rapid cycling between Fe(III) and Fe(II). The hypothesis that aberrant ferric reductase activities are involved in the progression of neurodegenerative diseases should open up new avenues of research, and possibly therapy, for these devastating diseases.
Collapse
Affiliation(s)
- C P Ponting
- MRC Functional Genetics Unit, University of Oxford, Department of Human Anatomy and Genetics, South Parks Road, Oxford OX1 3QX, UK.
| |
Collapse
|
14
|
Lupas AN, Ponting CP, Russell RB. On the evolution of protein folds: are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world? J Struct Biol 2001; 134:191-203. [PMID: 11551179 DOI: 10.1006/jsbi.2001.4393] [Citation(s) in RCA: 203] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
This paper presents and discusses evidence suggesting how the diversity of domain folds in existence today might have evolved from peptide ancestors. We apply a structure similarity detection method to detect instances where localized regions of different protein folds contain highly similar sequences and structures. Results of performing an all-on-all comparison of known structures are described and compared with other recently published findings. The numerous instances of local sequence and structure similarities within different protein folds, together with evidence from proteins containing sequence and structure repeats, argues in favor of the evolution of modern single polypeptide domains from ancient short peptide ancestors (antecedent domain segments (ADSs)). In this model, ancient protein structures were formed by self-assembling aggregates of short polypeptides. Subsequently, and perhaps concomitantly with the evolution of higher fidelity DNA replication and repair systems, single polypeptide domains arose from the fusion of ADSs genes. Thus modern protein domains may have a polyphyletic origin.
Collapse
Affiliation(s)
- A N Lupas
- Bioinformatics, GlaxoSmithKline, UP1345, 1250 South Collegeville Road, Collegeville, Pennsylvania 19426-0989, USA
| | | | | |
Collapse
|
15
|
Abstract
Internal repetition within proteins has been a successful strategem on multiple separate occasions throughout evolution. Such protein repeats possess regular secondary structures and form multirepeat assemblies in three dimensions of diverse sizes and functions. In general, however, internal repetition affords a protein enhanced evolutionary prospects due to an enlargement of its available binding surface area. Constraints on sequence conservation appear to be relatively lax, due to binding functions ensuing from multiple, rather than, single repeats. Considerable sequence divergence as well as the short lengths of sequence repeats mean that repeat detection can be a particularly arduous task. We also consider the conundrum of how multiple repeats, which show strong structural and functional interdependencies, ever evolved from a single repeat ancestor. In this review, we illustrate each of these points by referring to six prolific repeat types (repeats in beta-propellers and beta-trefoils and tetratricopeptide, ankyrin, armadillo/HEAT, and leucine-rich repeats) and in other less-prolific but nonetheless interesting repeats.
Collapse
Affiliation(s)
- M A Andrade
- European Molecular Biology Laboratory, Meyerhofstr. 1, Heidelberg, 69012, Germany
| | | | | |
Collapse
|
16
|
Abstract
The initial analysis of the human genome draft sequence reveals that our 'book of life' is multi-authored. A small but significant proportion of our genes owes their heritage not to antecedent eukaryotes but instead to bacteria. The publicly funded Human Genome Project study indicates that about 0.5% of all human genes were copied into the genome from bacterial sources. Detailed sequence analyses point to these 'horizontal gene transfer' events having occurred relatively recently. So how did the human 'book of life' evolve to be a chimaera, part animal and part bacterium? And what was the probable evolutionary impact of such gene plagiarism?
Collapse
Affiliation(s)
- C P Ponting
- MRC Functional Genetics Unit, Dept of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK OX1 3QX.
| |
Collapse
|
17
|
Affiliation(s)
- C P Ponting
- Department of Human Anatomy and Genetics, University of Oxford, UK
| | | |
Collapse
|
18
|
Newey SE, Howman EV, Ponting CP, Benson MA, Nawrotzki R, Loh NY, Davies KE, Blake DJ. Syncoilin, a novel member of the intermediate filament superfamily that interacts with alpha-dystrobrevin in skeletal muscle. J Biol Chem 2001; 276:6645-55. [PMID: 11053421 DOI: 10.1074/jbc.m008305200] [Citation(s) in RCA: 98] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Dystrophin coordinates the assembly of a complex of structural and signaling proteins that are required for normal muscle function. A key component of the dystrophin protein complex is alpha-dystrobrevin, a dystrophin-associated protein whose absence results in neuromuscular junction defects and muscular dystrophy. To gain further insights into the role of alpha-dystrobrevin in skeletal muscle, we used the yeast two-hybrid system to identify a novel alpha-dystrobrevin-binding partner called syncoilin. Syncoilin is a new member of the intermediate filament superfamily and is highly expressed in skeletal and cardiac muscle. In normal skeletal muscle, syncoilin is concentrated at the neuromuscular junction, where it colocalizes and coimmunoprecipitates with alpha-dystrobrevin-1. Expression studies in mammalian cells demonstrate that, while alpha-dystrobrevin and syncoilin associate directly, overexpression of syncoilin does not result in the self-assembly of intermediate filaments. Finally, unlike many components of the dystrophin protein complex, we show that syncoilin expression is up-regulated in dystrophin-deficient muscle. These data suggest that alpha-dystrobrevin provides a link between the dystrophin protein complex and the intermediate filament network at the neuromuscular junction, which may be important for the maintenance and maturation of the synapse.
Collapse
Affiliation(s)
- S E Newey
- Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, United Kingdom
| | | | | | | | | | | | | | | |
Collapse
|
19
|
Abstract
Identifying homologues, defined as genes that arose from a common evolutionary ancestor, is often a relatively straightforward task, thanks to recent advances made in estimating the statistical significance of sequence similarities found from database searches. The extent by which homologues possess similarities in function, however, is less amenable to statistical analysis. Consequently, predicting function by homology is a qualitative, rather than quantitative, process and requires particular care to be taken. This review focuses on the various approaches that have been developed to predict function from the scale of the atom to that of the organism. Similarities in homologues' functions differ considerably at each of these different scales and also vary for different domain families. It is argued that due attention should be paid to all available clues to function, including orthologue identification, conservation of particular residue types, and the co-occurrence of domains in proteins. Pitfalls in database searching methods arising from amino acid compositional bias and database size effects are also discussed.
Collapse
Affiliation(s)
- C P Ponting
- Department of Human Anatomy and Genetics, University of Oxford, UK.
| |
Collapse
|
20
|
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J. Initial sequencing and analysis of the human genome. Nature 2001; 409:860-921. [PMID: 11237011 DOI: 10.1038/35057062] [Citation(s) in RCA: 14509] [Impact Index Per Article: 630.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Collapse
Affiliation(s)
- E S Lander
- Whitehead Institute for Biomedical Research, Center for Genome Research, Cambridge, MA 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Abstract
Sequence similarity is the most common measure currently used to infer homology between proteins. Typically, homologous protein domains show sequence similarity over their entire lengths. Here we identify Asp box motifs, initially found as repeats in sialidases and neuraminidases, in new structural and sequence contexts. These motifs represent significantly similar sequences, localized to beta hairpins within proteins that are otherwise different in sequence and three-dimensional structure. By performing a combined sequence- and structure-based analysis we detect Asp boxes in more than nine protein families, including bacterial ribonucleases, sulfite oxidases, reelin, netrins, some lipoprotein receptors, and a variety of glycosyl hydrolases. Although the function common to each of these proteins, if any, remains unclear, we discuss possible functions of Asp boxes on the basis of previously determined experimental results and discuss different evolutionary scenarios for the origin of Asp-box containing proteins.
Collapse
|
22
|
Abstract
On the basis of significant sequence similarity, we have identified JmjC domains in more than 100 eukaryotic and bacterial sequences. These include human hairless, mutated in individuals with alopecia universalis, retinoblastoma-binding protein 2 and several putative chromatin-associated proteins. JmjC domains are predicted to be metalloenzymes that adopt the cupin fold, and are candidates for enzymes that regulate chromatin remodelling.
Collapse
Affiliation(s)
- P M Clissold
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, OX1 3QX, Oxford, UK
| | | |
Collapse
|
23
|
Blanco G, Coulton GR, Biggin A, Grainge C, Moss J, Barrett M, Berquin A, Maréchal G, Skynner M, van Mier P, Nikitopoulou A, Kraus M, Ponting CP, Mason RM, Brown SD. The kyphoscoliosis (ky) mouse is deficient in hypertrophic responses and is caused by a mutation in a novel muscle-specific protein. Hum Mol Genet 2001; 10:9-16. [PMID: 11136708 DOI: 10.1093/hmg/10.1.9] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The ky mouse mutant exhibits a primary degenerative myopathy preceding chronic thoraco-lumbar kyphoscoliosis. The histopathology of the ky mutant suggests that Ky protein activity is crucial for normal muscle growth and function as well as the maturation and stabilization of the neuromuscular junction. Muscle hypertrophy in response to increasing demand is deficient in the ky mutant, whereas adaptive fibre type shifts take place. The ky locus has previously been localized to a small region of mouse chromosome 9 and we have now identified the gene and the mutation underlying the kyphoscoliotic mouse. The ky transcript encodes a novel protein that is detected only in skeletal muscle and heart. The identification of the ky gene will allow detailed analysis of the impact of primary myopathy on idiopathic scoliosis in mice and man.
Collapse
Affiliation(s)
- G Blanco
- MRC Mammalian Genetics Unit and UK Mouse Genome Centre, Harwell, Oxon OX11 ORD, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Affiliation(s)
- P M Clissold
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK
| | | |
Collapse
|
25
|
Newey SE, Benson MA, Ponting CP, Davies KE, Blake DJ. Alternative splicing of dystrobrevin regulates the stoichiometry of syntrophin binding to the dystrophin protein complex. Curr Biol 2000; 10:1295-8. [PMID: 11069112 DOI: 10.1016/s0960-9822(00)00760-0] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Dystrophin coordinates the assembly of a complex of structural and signalling proteins that is required for normal muscle function. A key component of the dystrophin-associated protein complex (DPC) is alpha-dystrobrevin, a dystrophin-related and -associated protein whose absence results in muscular dystrophy and neuromuscular junction defects [1,2]. The current model of the DPC predicts that dystrophin and dystrobrevin each bind a single syntrophin molecule [3]. The syntrophins are PDZ-domain-containing proteins that facilitate the recruitment of signalling proteins such as nNOS (neuronal nitric oxide synthase) to the DPC [4]. Here we show, using yeast two-hybrid analysis and biochemical binding studies, that alpha-dystrobrevin in fact contains two independent syntrophin-binding sites in tandem. The previously undescribed binding site is situated within an alternatively spliced exon of alpha-dystrobrevin, termed the variable region-3 (vr3) sequence, which is specifically expressed in skeletal and cardiac muscle [5,6]. Analysis of the syntrophin-binding region of dystrobrevin reveals a tandem pair of predicted alpha helices with significant sequence similarity. These alpha helices, each termed a syntrophin-binding motif, are also highly conserved in dystrophin and utrophin. Together these data show that there are four potential syntrophin-binding sites per dystrophin complex in skeletal muscle: two on dystrobrevin and two on dystrophin or utrophin. Furthermore, alternative splicing of dystrobrevin provides a mechanism for regulating the stoichiometry of syntrophin association with the DPC. This is likely to have important consequences for the recruitment of specific signalling molecules to the DPC and ultimately for its function.
Collapse
Affiliation(s)
- S E Newey
- Department of Human Anatomy and Genetics, University of Oxford, UK
| | | | | | | | | |
Collapse
|
26
|
Ponting CP. Proteins of the endoplasmic-reticulum-associated degradation pathway: domain detection and function prediction. Biochem J 2000; 351 Pt 2:527-35. [PMID: 11023840 PMCID: PMC1221390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Sequence database searches, using iterative-profile and Hidden-Markov-model approaches, were used to detect hitherto-undetected homologues of proteins that regulate the endoplasmic reticulum (ER)-associated degradation pathway. The translocon-associated subunit Sec63p (Sec=secretory) was shown to contain a domain of unknown function found twice in several Brr2p-like RNA helicases (Brr2=bad response to refrigeration 2). Additionally, Cue1p (Cue=coupling of ubiquitin conjugation to ER degradation), a yeast protein that recruits the ubiquitin-conjugating (UBC) enzyme Ubc7p to an ER-associated complex, was found to be one of a large family of putative scaffolding-domain-containing proteins that include the autocrine motility factor receptor and fungal Vps9p (Vps=vacuolar protein sorting). Two other yeast translocon-associated molecules, Sec72p and Hrd3p (Hrd=3-hydroxy-3-methylglutaryl-CoA reductase degradation), were shown to contain multiple tetratricopeptide-repeat-like sequences. From this observation it is suggested that Sec72p associates with a heat-shock protein, Hsp70, in a manner analogous to that known for Hop (Hsp70/Hsp90 organizing protein). Finally, the luminal portion of Ire1p (Ire=high inositol-requiring), thought to convey the sensing function of this transmembrane kinase and endoribonuclease, was shown to contain repeats similar to those in beta-propeller proteins. This finding hints at the mechanism by which Ire1p may sense extended unfolded proteins at the expense of compact folded molecules.
Collapse
Affiliation(s)
- C P Ponting
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford OX1 3QX, UK.
| |
Collapse
|
27
|
Ponting CP, Russell RB. Identification of distant homologues of fibroblast growth factors suggests a common ancestor for all beta-trefoil proteins. J Mol Biol 2000; 302:1041-7. [PMID: 11183773 DOI: 10.1006/jmbi.2000.4087] [Citation(s) in RCA: 97] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Determination of the structures of fibroblast growth factors and interleukin-1s has previously revealed that they both adopt a beta-trefoil fold, similar to those found in Kunitz soybean trypsin inhibitors, ricin-like toxins, plant agglutinins and hisactophilin. These families possess distinct functions and occur in different subcellular localisations, and they appear to lack significant similarities in their sequences, ligands and modes of ligand binding. We have analysed the significance of sequence identities observed after structure alignment and provide statistical evidence that these beta-trefoil proteins are all homologues, having arisen from a common ancestor. In addition, we have explored the sequence space of all beta-trefoil proteins and have determined that the actin-binding proteins fascins, and other proteins of unknown function, are beta-trefoil family homologues. Unlike other beta-trefoil proteins, the triplicated repeats in each of the four beta-trefoil domains of fascins are significantly similar in sequence. This hints at how the beta-trefoil fold arose from the duplication of an ancestral gene encoding a homotrimeric single-repeat protein. The combined analysis of structure and sequence databases for detecting significant similarities is suggested as a highly sensitive approach to determining the common ancestry of extremely divergent homologues.
Collapse
Affiliation(s)
- C P Ponting
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics,University of Oxford, Oxford, UK.
| | | |
Collapse
|
28
|
Affiliation(s)
- C P Ponting
- MRC Functional Genetics Unit, Dept of Human Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK OX1 3QX.
| |
Collapse
|
29
|
Rea S, Eisenhaber F, O'Carroll D, Strahl BD, Sun ZW, Schmid M, Opravil S, Mechtler K, Ponting CP, Allis CD, Jenuwein T. Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 2000; 406:593-9. [PMID: 10949293 DOI: 10.1038/35020506] [Citation(s) in RCA: 1986] [Impact Index Per Article: 82.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The organization of chromatin into higher-order structures influences chromosome function and epigenetic gene regulation. Higher-order chromatin has been proposed to be nucleated by the covalent modification of histone tails and the subsequent establishment of chromosomal subdomains by non-histone modifier factors. Here we show that human SUV39H1 and murine Suv39h1--mammalian homologues of Drosophila Su(var)3-9 and of Schizosaccharomyces pombe clr4--encode histone H3-specific methyltransferases that selectively methylate lysine 9 of the amino terminus of histone H3 in vitro. We mapped the catalytic motif to the evolutionarily conserved SET domain, which requires adjacent cysteine-rich regions to confer histone methyltransferase activity. Methylation of lysine 9 interferes with phosphorylation of serine 10, but is also influenced by pre-existing modifications in the amino terminus of H3. In vivo, deregulated SUV39H1 or disrupted Suv39h activity modulate H3 serine 10 phosphorylation in native chromatin and induce aberrant mitotic divisions. Our data reveal a functional interdependence of site-specific H3 tail modifications and suggest a dynamic mechanism for the regulation of higher-order chromatin.
Collapse
Affiliation(s)
- S Rea
- Research Institute of Molecular Pathology, The Vienna Biocenter, Austria
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Affiliation(s)
- C P Ponting
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | | | | | | | | |
Collapse
|
31
|
Abstract
Cloning procedures aided by homology searches of EST databases have accelerated the pace of discovery of new genes, but EST database searching remains an involved and onerous task. More than 1.6 million human EST sequences have been deposited in public databases, making it difficult to identify ESTs that represent new genes. Compounding the problems of scale are difficulties in detection associated with a high sequencing error rate and low sequence similarity between distant homologues. We have developed a new method, coupling BLAST-based searches with a domain identification protocol, that filters candidate homologues. Application of this method in a large-scale analysis of 100 signalling domain families has led to the identification of ESTs representing more than 1,000 novel human signalling genes. The 4,206 publicly available ESTs representing these genes are a valuable resource for rapid cloning of novel human signalling proteins. For example, we were able to identify ESTs of at least 106 new small GTPases, of which 6 are likely to belong to new subfamilies. In some cases, further analyses of genomic DNA led to the discovery of previously unidentified full-length protein sequences. This is exemplified by the in silico cloning (prediction of a gene product sequence using only genomic and EST sequence data) of a new type of GTPase with two catalytic domains.
Collapse
Affiliation(s)
- J Schultz
- [1] EMBL, Heidelberg, Germany. [2] Max-Delbrück-Center, Berlin-Buch, Germany
| | | | | | | | | |
Collapse
|
32
|
Abstract
Short protein repeats, frequently with a length between 20 and 40 residues, represent a significant fraction of known proteins. Many repeats appear to possess high amino acid substitution rates and thus recognition of repeat homologues is highly problematic. Even if the presence of a certain repeat family is known, the exact locations and the number of repetitive units often cannot be determined using current methods. We have devised an iterative algorithm based on optimal and sub-optimal score distributions from profile analysis that estimates the significance of all repeats that are detected in a single sequence. This procedure allows the identification of homologues at alignment scores lower than the highest optimal alignment score for non-homologous sequences. The method has been used to investigate the occurrence of eleven families of repeats in Saccharomyces cerevisiae, Caenorhabditis elegans and Homo sapiens accounting for 1055, 2205 and 2320 repeats, respectively. For these examples, the method is both more sensitive and more selective than conventional homology search procedures. The method allowed the detection in the SwissProt database of more than 2000 previously unrecognised repeats belonging to the 11 families. In addition, the method was used to merge several repeat families that previously were supposed to be distinct, indicating common phylogenetic origins for these families.
Collapse
Affiliation(s)
- M A Andrade
- European Molecular Biology Laboratory, Meyerhofstr. 1, Heidelberg, 69012, Germany
| | | | | | | |
Collapse
|
33
|
Affiliation(s)
- C P Ponting
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20814, USA.
| |
Collapse
|
34
|
Hodgkin MN, Masson MR, Powner D, Saqib KM, Ponting CP, Wakelam MJ. Phospholipase D regulation and localisation is dependent upon a phosphatidylinositol 4,5-biphosphate-specific PH domain. Curr Biol 2000; 10:43-6. [PMID: 10660303 DOI: 10.1016/s0960-9822(99)00264-x] [Citation(s) in RCA: 110] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The signalling pathway leading, for example, to actin cytoskeletal reorganisation, secretion or superoxide generation involves phospholipase D (PLD)-catalysed hydrolysis of phosphatidylcholine to generate phosphatidic acid, which appears to mediate the messenger functions of this pathway. Two PLD genes (PLD1 and PLD2) with similar domain structures have been doned and progress has been made in identifying the protein regulators of PLD1 activation, for example Arf and Rho family members. The activities of both PLD isoforms are dependent on phosphatidylinositol 4,5-bisphosphate (PI(4,5)P2) and our sequence analysis suggested the presence of a pleckstrin homology (PH) domain in PLD1, although its absence has also been daimed. Investigation of the inositide dependence showed that a bis-phosphorylated lipid with a vicinal pair of phosphates was required for PLD1 activity. Furthermore, PLD1 bound specifically and with high affinity to lipid surfaces containing PI(4,5)P2 independently of the substrate phosphatidylcholine, suggesting a key role for the PH domain in PLD function. Importantly, a glutathione-S-transferase (GST) fusion protein comprising GST and the PH domain of PLD1 (GST-PLD1-PH) also bound specifically to supported lipid monolayers containing PI(4,5)P2. Point mutations within the PLD1 PH domain inhibited enzyme activity, whereas deletion of the domain both inhibited enzyme activity and disrupted normal PLD1 localisation. Thus, the functional PH domain regulates PLD by mediating its interaction with polyphosphoinositide-containing membranes; this might also induce a conformational change, thereby regulating catalytic activity.
Collapse
Affiliation(s)
- M N Hodgkin
- Institute for Cancer Studies, University of Birmingham, UK.
| | | | | | | | | | | |
Collapse
|
35
|
Abstract
SMART (a Simple Modular Architecture Research Tool) allows the identification and annotation of genetically mobile domains and the analysis of domain architectures (http://SMART.embl-heidelberg.de ). More than 400 domain families found in signalling, extra-cellular and chromatin-associated proteins are detectable. These domains are extensively annotated with respect to phyletic distributions, functional class, tertiary structures and functionally important residues. Each domain found in a non-redundant protein database as well as search parameters and taxonomic information are stored in a relational database system. User interfaces to this database allow searches for proteins containing specific combinations of domains in defined taxa.
Collapse
Affiliation(s)
- J Schultz
- EMBL, Meyerhofstrasse1, 69012 Heidelberg, Germany
| | | | | | | | | |
Collapse
|
36
|
|
37
|
Schäffer AA, Wolf YI, Ponting CP, Koonin EV, Aravind L, Altschul SF. IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices. Bioinformatics 1999; 15:1000-11. [PMID: 10745990 DOI: 10.1093/bioinformatics/15.12.1000] [Citation(s) in RCA: 231] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Many studies have shown that database searches using position-specific score matrices (PSSMs) or profiles as queries are more effective at identifying distant protein relationships than are searches that use simple sequences as queries. One popular program for constructing a PSSM and comparing it with a database of sequences is Position-Specific Iterated BLAST (PSI-BLAST). RESULTS This paper describes a new software package, IMPALA, designed for the complementary procedure of comparing a single query sequence with a database of PSI-BLAST-generated PSSMs. We illustrate the use of IMPALA to search a database of PSSMs for protein folds, and one for protein domains involved in signal transduction. IMPALA's sensitivity to distant biological relationships is very similar to that of PSI-BLAST. However, IMPALA employs a more refined analysis of statistical significance and, unlike PSI-BLAST, guarantees the output of the optimal local alignment by using the rigorous Smith-Waterman algorithm. Also, it is considerably faster when run with a large database of PSSMs than is BLAST or PSI-BLAST when run against the complete non-redundant protein database.
Collapse
Affiliation(s)
- A A Schäffer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | | | | | | | | | |
Collapse
|
38
|
Ponting CP, Pallen MJ. beta-propeller repeats and a PDZ domain in the tricorn protease: predicted self-compartmentalisation and C-terminal polypeptide-binding strategies of substrate selection. FEMS Microbiol Lett 1999; 179:447-51. [PMID: 10518749 DOI: 10.1111/j.1574-6968.1999.tb08761.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Prokaryotic proteases demonstrate a variety of substrate-selection strategies that prevent uncontrolled protein degradation. Proteasomes and ClpXP-like proteases form oligomeric structures that exclude large substrates from central solvated chambers containing their active sites. Monomeric prolyl oligopeptidases have been shown to contain beta-propeller structures that similarly reduce access to their catalytic residues. By contrast, Tsp-like enzymes contain PDZ domains that are thought to specifically target C-terminal polypeptides. We have investigated the sequence of Thermoplasma acidophilum tricorn protease using recently-developed database search methods. The tricorn protease is known to associate into a 20 hexamer capsid enclosing an extremely large cavity that is 37 nm in diameter. It is unknown, however, how this enzyme selects its small oligopeptide substrates. Our results demonstrate the presence in tricorn protease of a PDZ domain and two predicted six-bladed beta-propeller domains. We suggest that the PDZ domain is involved in targeting non-polar C-terminal peptides, similar to those generated by the T. acidophilum proteasome, whereas the beta-propeller domains serve to exclude large substrates from the tricorn protease active site in a similar manner to that previously indicated for prolyl oligopeptidase.
Collapse
Affiliation(s)
- C P Ponting
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|
39
|
|
40
|
Barnes MR, Russell RB, Copley RR, Ponting CP, Bork P, Cumberledge S, Reichsman F, Moore HM. A lipid-binding domain in Wnt: a case of mistaken identity? Curr Biol 1999; 9:R717-9. [PMID: 10531014 DOI: 10.1016/s0960-9822(99)80465-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
41
|
Abstract
Ras proteins play critical roles in regulating cell growth and differentiation, and mutated Ras genes are expressed in a variety of human cancers. Consequently, much interest has centered on the binding partners of Ras, including the Ras-binding domain (RBD) of Raf kinase. Here evidence is presented that domains homologous to the Raf RBD are present in tandem in RGS12, RGS14 and LOCO, and singly in molecules similar to mouse Tiam-1. In addition, RGS12, RGS14 and LOCO are shown to contain single "LGN motifs" that are guanine nucleotide exchange factors specific for the alpha-subunit of G proteins. These findings indicate "cross-talk" interactions between signalling pathways involving Ras and Rap and pathways involving Rho, Rac and G alpha GTPases.
Collapse
Affiliation(s)
- C P Ponting
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20814, USA.
| |
Collapse
|
42
|
|
43
|
Mohaghegh P, Rodrigues NR, Owen N, Ponting CP, Le TT, Burghes AH, Davies KE. Analysis of mutations in the tudor domain of the survival motor neuron protein SMN. Eur J Hum Genet 1999; 7:519-25. [PMID: 10439956 DOI: 10.1038/sj.ejhg.5200346] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Autosomal recessive childhood onset spinal muscular atrophy (SMA) is a leading cause of infant mortality caused by mutations in the survival motor neuron (SMN) gene. The SMN protein is involved in RNA processing and is localised in structures called GEMs in the nucleus. Nothing is yet understood about why mutations in SMN gene result in the selective motor neuron loss observed in patients. The SMN protein domains conserved across several species may indicate functionally significant regions. Exon 3 of SMN contains homology to a tudor domain, where a Type I SMA patient has been reported to harbour a missense mutation. We have generated missense mutants in this region of SMN and have tested their ability to form GEMs when transfected into HeLa cells. Our results show such mutant SMN proteins still localise to GEMs. Furthermore, exon 7 deleted SMN protein appears to exert a dominant negative effect on localisation of endogenous SMN protein. However, exon 3 mutant protein and exon 5 deleted protein exert no such effect.
Collapse
Affiliation(s)
- P Mohaghegh
- Department of Human Anatomy and Genetics, University of Oxford, UK
| | | | | | | | | | | | | |
Collapse
|
44
|
Aravind L, Ponting CP. The cytoplasmic helical linker domain of receptor histidine kinase and methyl-accepting proteins is common to many prokaryotic signalling proteins. FEMS Microbiol Lett 1999; 176:111-6. [PMID: 10418137 DOI: 10.1111/j.1574-6968.1999.tb13650.x] [Citation(s) in RCA: 295] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Mutations in the cytoplasmic linker regions of receptor histidine kinase and chemoreceptor proteins have been shown previously to significantly impair receptor functions. Here we demonstrate significant sequence similarities between these regions in numerous histidine kinases, methyl-accepting proteins, adenylyl cyclases and other prokaryotic signalling proteins. It is suggested that these 'HAMP domains' possess roles of regulating the phosphorylation or methylation of homodimeric receptors by transmitting the conformational changes in periplasmic ligand-binding domains to cytoplasmic signalling kinase and methyl-acceptor domains.
Collapse
Affiliation(s)
- L Aravind
- Department of Biology, Texas A&M University, College Station 70843, USA
| | | |
Collapse
|
45
|
Ponting CP, Aravind L, Schultz J, Bork P, Koonin EV. Eukaryotic signalling domain homologues in archaea and bacteria. Ancient ancestry and horizontal gene transfer. J Mol Biol 1999; 289:729-45. [PMID: 10369758 DOI: 10.1006/jmbi.1999.2827] [Citation(s) in RCA: 260] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Phyletic distributions of eukaryotic signalling domains were studied using recently developed sensitive methods for protein sequence analysis, with an emphasis on the detection and accurate enumeration of homologues in bacteria and archaea. A major difference was found between the distributions of enzyme families that are typically found in all three divisions of cellular life and non-enzymatic domain families that are usually eukaryote-specific. Previously undetected bacterial homologues were identified for# plant pathogenesis-related proteins, Pad1, von Willebrand factor type A, src homology 3 and YWTD repeat-containing domains. Comparisons of the domain distributions in eukaryotes and prokaryotes enabled distinctions to be made between the domains originating prior to the last common ancestor of all known life forms and those apparently originating as consequences of horizontal gene transfer events. A number of transfers of signalling domains from eukaryotes to bacteria were confidently identified, in contrast to only a single case of apparent transfer from eukaryotes to archaea.
Collapse
Affiliation(s)
- C P Ponting
- National Center for Biotechnology Information National Library of Medicine, National Institutes of Health, Bldg. 38A, Bethesda, MD, 20894, USA.
| | | | | | | | | |
Collapse
|
46
|
Abstract
The complete sequence of the nematode worm Caenorhabditis elegans contains the genetic machinery that is required to undertake the core biological processes of single cells. However, the genome also encodes proteins that are associated with multicellularity, as well as others that are lineage-specific expansions of phylogenetically widespread families and yet more that are absent in non-nematodes. Ongoing analysis is beginning to illuminate the similarities and differences among human proteins and proteins that are encoded by the genomes of the multicellular worm and the unicellular yeast, and will be essential in determining the reliability of transferring experimental data among phylogenetically distant species.
Collapse
Affiliation(s)
- R R Copley
- Biocomputing European Molecular Biology Laboratory, Heidelberg, Germany
| | | | | | | |
Collapse
|
47
|
Ponting CP, Bork P, Schultz J, Aravind L. No Sec7-homology domain in guanine-nucleotide-exchange factors that act on Ras and Rho. Trends Biochem Sci 1999; 24:177-8. [PMID: 10322430 DOI: 10.1016/s0968-0004(99)01379-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
48
|
Affiliation(s)
- C P Ponting
- NCBI, National Library of Medicine, NIH, Bethesda, MD 20894, USA.
| | | |
Collapse
|
49
|
|
50
|
|