1
|
Yang LK, Zhang J, Liu D, Han TY, Qin QS, Wang AQ, Dong B. Ancestral glycoprotein hormone and its cognate receptor present in primitive chordate ascidian: Molecular identification and functional characterization. Int J Biol Macromol 2023; 229:401-412. [PMID: 36592853 DOI: 10.1016/j.ijbiomac.2022.12.297] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 12/26/2022] [Indexed: 12/31/2022]
Abstract
The glycoprotein hormone (GPH) system is fundamentally significant in regulating the physiology of chordates, such as thyroid activity and gonadal function. However, the knowledge of the GPH system in the primitive chordate ascidian species is largely lacking. Here, we reported an ancestral GPH system in the ascidian (Styela clava), which consists of GPH α subunit (Sc-GPA2), GPH β subunit (Sc-GPB5), and the cognate leucine-rich repeat-containing G protein-coupled receptor (Sc-GPHR). Comparative structure analysis revealed that distinct from vertebrate GPH β subunits, Sc-GPB5 was less conserved, showing an atypical N-terminal sequence with a type II transmembrane domain instead of a typical signal peptide. By investigating the presence of recombinant Sc-GPA2 and Sc-GPB5 in cell lysates and culture media of HEK293T cells, we confirmed that these two subunits could be secreted out of the cells via distinct secretory pathways. The deglycosylation experiments demonstrated that N-linked glycosylation only occurred on the conserved cysteine residue (N78) of Sc-GPA2, whereas Sc-GPB5 was non-glycosylated. Although Sc-GPB5 exhibited distinct topology and biochemical properties in contrast to its chordate counterparts, it could still interact with Sc-GPA2 to form a heterodimer. The Sc-GPHR was then confirmed to be activated by tethered Sc-GPA2/GPB5 heterodimer on the Gs-cAMP pathway, suggesting that Sc-GPA2/GPB5 heterodimer-initiated Gs-cAMP signaling pathway is evolutionarily conserved in chordates. Furthermore, in situ hybridization and RT-PCR results revealed the co-expression patterns of Sc-GPA2 and Sc-GPB5 with Sc-GPHR transcripts, respectively in ascidian larvae and adults, highlighting the potential functions of Sc-GPA2/GPB5 heterodimer as an autocrine/paracrine neurohormone in regulating metamorphosis of larvae and physiological functions of adults. Our study systematically investigated the GPA2/GPB5-GPHR system in ascidian for the first time, which offers insights into understanding the function and evolution of the GPH system within the chordate lineage.
Collapse
Affiliation(s)
- Li-Kun Yang
- Fang Zongxi Centre, MoE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - Jin Zhang
- Fang Zongxi Centre, MoE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - Di Liu
- Fang Zongxi Centre, MoE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - Tong-Ye Han
- Fang Zongxi Centre, MoE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - Qi-Shu Qin
- Fang Zongxi Centre, MoE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - An-Qi Wang
- Haide College, Ocean University of China, Qingdao 266100, China
| | - Bo Dong
- Fang Zongxi Centre, MoE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China; Laoshan Laboratory, Qingdao 266237, China; Institute of Evolution & Marine Biodiversity, Ocean University of China, Qingdao 266003, China.
| |
Collapse
|
2
|
Li SH, Guan ZX, Zhang D, Zhang ZM, Huang J, Yang W, Lin H. Recent Advancement in Predicting Subcellular Localization of Mycobacterial Protein with Machine Learning Methods. Med Chem 2019; 16:605-619. [PMID: 31584379 DOI: 10.2174/1573406415666191004101913] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2019] [Revised: 06/25/2019] [Accepted: 08/23/2019] [Indexed: 01/28/2023]
Abstract
Mycobacterium tuberculosis (MTB) can cause the terrible tuberculosis (TB), which is reported as one of the most dreadful epidemics. Although many biochemical molecular drugs have been developed to cope with this disease, the drug resistance-especially the multidrug-resistant (MDR) and extensively drug-resistance (XDR)-poses a huge threat to the treatment. However, traditional biochemical experimental method to tackle TB is time-consuming and costly. Benefited by the appearance of the enormous genomic and proteomic sequence data, TB can be treated via sequence-based biological computational approach-bioinformatics. Studies on predicting subcellular localization of mycobacterial protein (MBP) with high precision and efficiency may help figure out the biological function of these proteins and then provide useful insights for protein function annotation as well as drug design. In this review, we reported the progress that has been made in computational prediction of subcellular localization of MBP including the following aspects: 1) Construction of benchmark datasets. 2) Methods of feature extraction. 3) Techniques of feature selection. 4) Application of several published prediction algorithms. 5) The published results. 6) The further study on prediction of subcellular localization of MBP.
Collapse
Affiliation(s)
- Shi-Hao Li
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zheng-Xing Guan
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Dan Zhang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zi-Mei Zhang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Jian Huang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Wuritu Yang
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.,Development and Planning Department, Inner Mongolia University, Hohhot, P.R. China
| | - Hao Lin
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
3
|
Coelho LP, Kangas JD, Naik AW, Osuna-Highley E, Glory-Afshar E, Fuhrman M, Simha R, Berget PB, Jarvik JW, Murphy RF. Determining the subcellular location of new proteins from microscope images using local features. ACTA ACUST UNITED AC 2013; 29:2343-9. [PMID: 23836142 DOI: 10.1093/bioinformatics/btt392] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Evaluation of previous systems for automated determination of subcellular location from microscope images has been done using datasets in which each location class consisted of multiple images of the same representative protein. Here, we frame a more challenging and useful problem where previously unseen proteins are to be classified. RESULTS Using CD-tagging, we generated two new image datasets for evaluation of this problem, which contain several different proteins for each location class. Evaluation of previous methods on these new datasets showed that it is much harder to train a classifier that generalizes across different proteins than one that simply recognizes a protein it was trained on. We therefore developed and evaluated additional approaches, incorporating novel modifications of local features techniques. These extended the notion of local features to exploit both the protein image and any reference markers that were imaged in parallel. With these, we obtained a large accuracy improvement in our new datasets over existing methods. Additionally, these features help achieve classification improvements for other previously studied datasets. AVAILABILITY The datasets are available for download at http://murphylab.web.cmu.edu/data/. The software was written in Python and C++ and is available under an open-source license at http://murphylab.web.cmu.edu/software/. The code is split into a library, which can be easily reused for other data and a small driver script for reproducing all results presented here. A step-by-step tutorial on applying the methods to new datasets is also available at that address. CONTACT murphy@cmu.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Luis Pedro Coelho
- Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Bronckers ALJJ, Gueneli N, Lüllmann-Rauch R, Schneppenheim J, Moraru AP, Himmerkus N, Bervoets TJ, Fluhrer R, Everts V, Saftig P, Schröder B. The intramembrane protease SPPL2A is critical for tooth enamel formation. J Bone Miner Res 2013; 28:1622-30. [PMID: 23426979 DOI: 10.1002/jbmr.1895] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2012] [Revised: 02/06/2013] [Accepted: 02/11/2013] [Indexed: 12/15/2022]
Abstract
Intramembrane proteases are critically involved in signal transduction and membrane protein turnover. Signal-peptide-peptidase-like 2a (SPPL2A), a presenilin-homologue residing in lysosomes/late endosomes, cleaves type II-oriented transmembrane proteins. We recently identified SPPL2A as the enzyme controlling turnover and functions of the invariant chain (CD74) of the major histocompatibility complex II (MHCII) and demonstrated critical importance of this process for B cell development. Surprisingly, we found that SPPL2A is critical for formation of dental enamel. In Sppl2a knockout mice, enamel of the erupted incisors was chalky white and rapidly eroded after eruption. SPPL2A was found to be expressed in enamel epithelium during secretory and maturation stage amelogenesis. Mineral content of enamel in Sppl2a⁻/⁻ incisors was inhomogeneous and reduced by ∼20% compared to wild-type mice with the most pronounced reduction at the mesial side. Frequently, disruption of the enamel layer and localized detachment of the most superficial enamel layer was observed in the knockout incisors leading to an uneven enamel surface. In Sppl2a null mice, morphology and function of secretory stage ameloblasts were not noticeably different from that of wild-type mice. However, maturation stage ameloblasts showed reduced height and a characteristic undulation of the ameloblast layer with localized adherence of the cells to the outer enamel. This was reflected in a delayed and incomplete resorption of the proteinaceous enamel matrix. Thus, we conclude that intramembrane proteolysis by SPPL2A is essential for maintaining cellular homeostasis of ameloblasts. Because modulation of SPPL2A activity appears to be an attractive therapeutic target to deplete B cells and treat autoimmunity, interference with tooth enamel formation should be investigated as a possible adverse effect of pharmacological SPPL2A inhibitors in humans.
Collapse
|
5
|
A Novel Type III Endosome Transmembrane Protein, TEMP. Cells 2012; 1:1029-44. [PMID: 24710541 PMCID: PMC3901140 DOI: 10.3390/cells1041029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2012] [Revised: 10/26/2012] [Accepted: 10/30/2012] [Indexed: 12/18/2022] Open
Abstract
As part of a high-throughput subcellular localisation project, the protein encoded by the RIKEN mouse cDNA 2610528J11 was expressed and identified to be associated with both endosomes and the plasma membrane. Based on this, we have assigned the name TEMP for Type III Endosome Membrane Protein. TEMP encodes a short protein of 111 amino acids with a single, alpha-helical transmembrane domain. Experimental analysis of its membrane topology demonstrated it is a Type III membrane protein with the amino-terminus in the lumenal, or extracellular region, and the carboxy-terminus in the cytoplasm. In addition to the plasma membrane TEMP was localized to Rab5 positive early endosomes, Rab5/Rab11 positive recycling endosomes but not Rab7 positive late endosomes. Video microscopy in living cells confirmed TEMP’s plasma membrane localization and identified the intracellular endosome compartments to be tubulovesicular. Overexpression of TEMP resulted in the early/recycling endosomes clustering at the cell periphery that was dependent on the presence of intact microtubules. The cellular function of TEMP cannot be inferred based on bioinformatics comparison, but its cellular distribution between early/recycling endosomes and the plasma membrane suggests a role in membrane transport.
Collapse
|
6
|
Tang SN, Sun JM, Xiong WW, Cong PS, Li TH. Identification of the subcellular localization of mycobacterial proteins using localization motifs. Biochimie 2012; 94:847-53. [DOI: 10.1016/j.biochi.2011.12.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2011] [Accepted: 12/02/2011] [Indexed: 01/28/2023]
|
7
|
Lin TH, Murphy RF, Bar-Joseph Z. Discriminative motif finding for predicting protein subcellular localization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 8:441-51. [PMID: 21233524 PMCID: PMC3050600 DOI: 10.1109/tcbb.2009.82] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Many methods have been described to predict the subcellular location of proteins from sequence information. However, most of these methods either rely on global sequence properties or use a set of known protein targeting motifs to predict protein localization. Here, we develop and test a novel method that identifies potential targeting motifs using a discriminative approach based on hidden Markov models (discriminative HMMs). These models search for motifs that are present in a compartment but absent in other, nearby, compartments by utilizing an hierarchical structure that mimics the protein sorting mechanism. We show that both discriminative motif finding and the hierarchical structure improve localization prediction on a benchmark data set of yeast proteins. The motifs identified can be mapped to known targeting motifs and they are more conserved than the average protein sequence. Using our motif-based predictions, we can identify potential annotation errors in public databases for the location of some of the proteins. A software implementation and the data set described in this paper are available from http://murphylab.web.cmu.edu/software/2009_TCBB_motif/.
Collapse
Affiliation(s)
- Tien-ho Lin
- Carnegie Mellon University, Pittsburgh, Pittsburgh, PA 15213, USA.
| | | | | |
Collapse
|
8
|
Huh S, Lee D, Murphy RF. Efficient framework for automated classification of subcellular patterns in budding yeast. Cytometry A 2010; 75:934-40. [PMID: 19753630 DOI: 10.1002/cyto.a.20793] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Fluorescent-tagging and digital imaging are widely used to determine the subcellular location of proteins. An extensive publicly available collection of images for most proteins expressed in the yeast S. cerevisae has provided both an important source of information on protein location but also a testbed for methods designed to automate the assignment of locations to unknown proteins. The first system for automated classification of subcellular patterns in these yeast images utilized a computationally expensive method for segmentation of images into individual cells and achieved an overall accuracy of 81%. The goal of the present study was to improve on both the computational efficiency and accuracy of this task. Numerical features derived from applying Gabor filters to small image patches were implemented so that patterns could be classified without segmentation into single cells. When tested on 20 classes of images visually classified as showing a single subcellular pattern, an overall accuracy of 87.8% was achieved, with 2330 images out of 2655 images in the UCSF dataset being correctly classified. On the 4 largest classes of these images, 95.3% accuracy was achieved. The improvement over the previous approach is not only in classification accuracy but also in computational efficiency, with the new approach taking about 1 h on a desktop computer to complete all steps required to perform a 6-fold cross validation on all images.
Collapse
Affiliation(s)
- Seungil Huh
- Robotics Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | | | | |
Collapse
|
9
|
Simpson JC. Screening the secretion machinery: High throughput imaging approaches to elucidate the secretory pathway. Semin Cell Dev Biol 2009; 20:903-9. [DOI: 10.1016/j.semcdb.2009.07.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2009] [Revised: 07/08/2009] [Accepted: 07/28/2009] [Indexed: 10/20/2022]
|
10
|
Malavasi F, Deaglio S, Funaro A, Ferrero E, Horenstein AL, Ortolan E, Vaisitti T, Aydin S. Evolution and function of the ADP ribosyl cyclase/CD38 gene family in physiology and pathology. Physiol Rev 2008; 88:841-86. [PMID: 18626062 DOI: 10.1152/physrev.00035.2007] [Citation(s) in RCA: 619] [Impact Index Per Article: 38.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The membrane proteins CD38 and CD157 belong to an evolutionarily conserved family of enzymes that play crucial roles in human physiology. Expressed in distinct patterns in most tissues, CD38 (and CD157) cleaves NAD(+) and NADP(+), generating cyclic ADP ribose (cADPR), NAADP, and ADPR. These reaction products are essential for the regulation of intracellular Ca(2+), the most ancient and universal cell signaling system. The entire family of enzymes controls complex processes, including egg fertilization, cell activation and proliferation, muscle contraction, hormone secretion, and immune responses. Over the course of evolution, the molecules have developed the ability to interact laterally and frontally with other surface proteins and have acquired receptor-like features. As detailed in this review, the loss of CD38 function is associated with impaired immune responses, metabolic disturbances, and behavioral modifications in mice. CD38 is a powerful disease marker for human leukemias and myelomas, is directly involved in the pathogenesis and outcome of human immunodeficiency virus infection and chronic lymphocytic leukemia, and controls insulin release and the development of diabetes. Here, the data concerning diseases are examined in view of potential clinical applications in diagnosis, prognosis, and therapy. The concluding remarks try to frame all of the currently available information within a unified working model that takes into account both the enzymatic and receptorial functions of the molecules.
Collapse
Affiliation(s)
- Fabio Malavasi
- Laboratory of Immunogenetics, Department of Genetics, Biology, and Biochemistry and Centro di Ricerca in Medicina Sperimentale, University of Torino Medical School, Torino, Italy.
| | | | | | | | | | | | | | | |
Collapse
|
11
|
van Dijk ADJ, Bosch D, ter Braak CJF, van der Krol AR, van Ham RCHJ. Predicting sub-Golgi localization of type II membrane proteins. ACTA ACUST UNITED AC 2008; 24:1779-86. [PMID: 18562268 PMCID: PMC7110242 DOI: 10.1093/bioinformatics/btn309] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Motivation: Recent research underlines the importance of finegrained knowledge on protein localization. In particular, subcompartmental localization in the Golgi apparatus is important, for example, for the order of reactions performed in glycosylation pathways or the sorting functions of SNAREs, but is currently poorly understood. Results: We assemble a dataset of type II transmembrane proteins with experimentally determined sub-Golgi localizations and use this information to develop a predictor based on the transmembrane domain of these proteins, making use of a dedicated proteinstructure based kernel in an SVM. Various applications demonstrate the power of our approach. In particular, comparison with a large set of glycan structures illustrates the applicability of our predictions on a ‘glycomic’ scale and demonstrates a significant correlation between sub-Golgi localization and the ordering of different steps in glycan biosynthesis. Contact:roeland.vanham@wur.nl Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- A D J van Dijk
- Applied Bioinformatics, PRI, Wageningen UR, Wageningen, The Netherlands
| | | | | | | | | |
Collapse
|
12
|
Gosney R, Liau WS, Lamunyon CW. A novel function for the presenilin family member spe-4: inhibition of spermatid activation in Caenorhabditis elegans. BMC DEVELOPMENTAL BIOLOGY 2008; 8:44. [PMID: 18430247 PMCID: PMC2383881 DOI: 10.1186/1471-213x-8-44] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/10/2007] [Accepted: 04/22/2008] [Indexed: 11/23/2022]
Abstract
Background Sperm cells must regulate the timing and location of activation to maximize the likelihood of fertilization. Sperm from most species, including the nematode Caenorhabditis elegans, activate upon encountering an external signal. Activation for C. elegans sperm occurs as spermatids undergo spermiogenesis, a profound cellular reorganization that produces a pseudopod. Spermiogenesis is initiated by an activation signal that is transduced through a series of gene products. It is now clear that an inhibitory pathway also operates in spermatids, preventing their premature progression to spermatozoa and resulting in fine-scale control over the timing of activation. Here, we describe the involvement of a newly assigned member of the inhibitory pathway: spe-4, a homolog of the human presenilin gene PS1. The spe-4(hc196) allele investigated here was isolated as a suppressor of sterility of mutations in the spermiogenesis signal transduction gene spe-27. Results Through mapping, complementation tests, DNA sequencing, and transformation rescue, we determined that allele hc196 is a mutation in the spe-4 gene. Our data show that spe-4(hc196) is a bypass suppressor that eliminates the need for the spermiogenesis signal transduction. On its own, spe-4(hc196) has a recessive, temperature sensitive spermatogenesis-defective phenotype, with mutants exhibiting (i) defective spermatocytes, (ii) defective spermatids, (iii) premature spermatid activation, and (iv) spermatozoa defective in fertilization, in addition to a small number of functional sperm which appear normal microscopically. Conclusion A fraction of the sperm from spe-4(hc196) mutant males progress directly to functional spermatozoa without the need for an activation signal, suggesting that spe-4 plays a role in preventing spermatid activation. Another fraction of spermatozoa from spe-4(hc196) mutants are defective in fertilization. Therefore, prematurely activated spermatozoa may have several defects: we show that they may be defective in fertilization, and earlier work showed that they obstruct sperm transfer from males at mating. hc196 is a hypomorphic allele of spe-4, and its newly-discovered role inhibiting spermiogenesis may involve known proteolytic and/or calcium regulatory aspects of presenilin function, or it may involve yet-to-be discovered functions.
Collapse
Affiliation(s)
- Ryoko Gosney
- Department of Biological Science, California State Polytechnic University, Pomona, CA, USA.
| | | | | |
Collapse
|
13
|
Towards defining the nuclear proteome. Genome Biol 2008; 9:R15. [PMID: 18211718 PMCID: PMC2395251 DOI: 10.1186/gb-2008-9-1-r15] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2007] [Revised: 12/19/2007] [Accepted: 01/23/2008] [Indexed: 11/17/2022] Open
Abstract
Direct evidence is reported for 2,568 mammalian proteins within the nuclear proteome, consisting of at least 14% of the entire proteome. Background The nucleus is a complex cellular organelle and accurately defining its protein content is essential before any systematic characterization can be considered. Results We report direct evidence for 2,568 mammalian proteins within the nuclear proteome: the nuclear subcellular localization of 1,529 proteins based on a high-throughput subcellular localization protocol of full-length proteins and an additional 1,039 proteins for which clear experimental evidence is documented in published literature. This is direct evidence that the nuclear proteome consists of at least 14% of the entire proteome. This dataset was used to evaluate computational approaches designed to identify additional nuclear proteins. Conclusion This represents direct experimental evidence that the nuclear proteome consists of at least 14% of the entire proteome. This high-quality nuclear proteome dataset was used to evaluate computational approaches designed to identify additional nuclear proteins. Based on this analysis, researchers can determine the stringency and types of lines of evidence they consider to infer the size and complement of the nuclear proteome.
Collapse
|
14
|
Hawkins J, Mahony D, Maetschke S, Wakabayashi M, Teasdale RD, Bodén M. Identifying novel peroxisomal proteins. Proteins 2007; 69:606-16. [PMID: 17636571 DOI: 10.1002/prot.21420] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Peroxisomes are small subcellular compartments responsible for a range of essential metabolic processes. Efforts in predicting peroxisomal protein import are challenged by species variation and sparse sequence data sets with experimentally confirmed localization. We present a predictor of peroxisomal import based on the presence of the dominant peroxisomal targeting signal one (PTS1), a seemingly wellconserved but highly unspecific motif. The signal appears to rely on subtle dependencies with the preceding residues. We evaluate prediction accuracies against two alternative predictor services, PEROXIP and the PTS1 PREDICTOR. We test the integrity of prediction on a range of prokaryotic and eukaryotic proteomes lacking peroxisomes. Similarly we test the accuracy on peroxisomal proteins known to not overlap with training data. The model identified a number of proteins within the RIKEN IPS7 mouse protein dataset as potentially novel peroxisomal proteins. Three were confirmed in vitro using immunofluorescent detection of myc-epitope-tagged proteins in transiently transfected BHK-21 cells (Dhrs2, Serhl, and Ehhadh). The final model has a superior specificity to both alternatives, and an accuracy better than PEROXIP and on par with PTS1 PREDICTOR. Thus, the model we present should prove invaluable for labeling PTS1 targeted proteins with high confidence. We use the predictor to screen several additional eukaryotic genomes to revise previously estimated numbers of peroxisomal proteins. Available at http://pprowler.itee.uq.edu.au.
Collapse
Affiliation(s)
- John Hawkins
- ARC Centre for Complex Systems, The University of Queensland, St. Lucia, Queensland 4072, Australia
| | | | | | | | | | | |
Collapse
|
15
|
Sprenger J, Lynn Fink J, Karunaratne S, Hanson K, Hamilton NA, Teasdale RD. LOCATE: a mammalian protein subcellular localization database. Nucleic Acids Res 2007; 36:D230-3. [PMID: 17986452 PMCID: PMC2238969 DOI: 10.1093/nar/gkm950] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
LOCATE is a curated, web-accessible database that houses data describing the membrane organization and subcellular localization of mouse and human proteins. Over the past 2 years, the data in LOCATE have grown substantially. The database now contains high-quality localization data for 20% of the mouse proteome and general localization annotation for nearly 36% of the mouse proteome. The proteome annotated in LOCATE is from the RIKEN FANTOM Consortium Isoform Protein Sequence sets which contains 58 128 mouse and 64 637 human protein isoforms. Other additions include computational subcellular localization predictions, automated computational classification of experimental localization image data, prediction of protein sorting signals and third party submission of literature data. Collectively, this database provides localization proteome for individual subcellular compartments that will underpin future systematic investigations of these regions. It is available at http://locate.imb.uq.edu.au/
Collapse
Affiliation(s)
- Josefine Sprenger
- ARC Centre of Excellence in Bioinformatics, Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland 4072, Australia
| | | | | | | | | | | |
Collapse
|
16
|
Malavasi F, Deaglio S, Ferrero E, Funaro A, Sancho J, Ausiello CM, Ortolan E, Vaisitti T, Zubiaur M, Fedele G, Aydin S, Tibaldi EV, Durelli I, Lusso R, Cozno F, Horenstein AL. CD38 and CD157 as receptors of the immune system: a bridge between innate and adaptive immunity. Mol Med 2007. [PMID: 17380201 DOI: 10.2119/2006-00094.malavasi] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
This paper reviews some of the results and the speculations presented at the Torino CD38 Meeting in June, 2006 and focused on CD38 and CD157 seen as a family of molecules acting as surface receptors of immune cells. This partisan view was adopted in the attempt to combine the enzymatic functions with what the immunologists consider key functions in different cell models. At the moment, it is unclear whether the two functions are correlated, indifferent, or independent. Here we present conclusions inferred exclusively on human cell models, namely T and B lymphocytes, dendritic cells, and granulocytes. As an extra analytical tool, we try to follow in the history of life when the enzymatic and receptorial functions were generated, mixing ontogeny, membrane localization, and cell anchorage.
Collapse
Affiliation(s)
- Fabio Malavasi
- Laboratory of Immunogenetics, Department of Genetics, Biology and Biochemistry, University of Torino Medical School, Torino, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Malavasi F, Deaglio S, Ferrero E, Funaro A, Sancho J, Ausiello CM, Ortolan E, Vaisitti T, Zubiaur M, Fedele G, Aydin S, Tibaldi EV, Durelli I, Lusso R, Cozno F, Horenstein AL. CD38 and CD157 as receptors of the immune system: a bridge between innate and adaptive immunity. MOLECULAR MEDICINE (CAMBRIDGE, MASS.) 2007; 12:334-41. [PMID: 17380201 PMCID: PMC1829205 DOI: 10.2119/2006–00094.malavasi] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Received: 11/02/2006] [Accepted: 12/07/2006] [Indexed: 11/06/2022]
Abstract
This paper reviews some of the results and the speculations presented at the Torino CD38 Meeting in June, 2006 and focused on CD38 and CD157 seen as a family of molecules acting as surface receptors of immune cells. This partisan view was adopted in the attempt to combine the enzymatic functions with what the immunologists consider key functions in different cell models. At the moment, it is unclear whether the two functions are correlated, indifferent, or independent. Here we present conclusions inferred exclusively on human cell models, namely T and B lymphocytes, dendritic cells, and granulocytes. As an extra analytical tool, we try to follow in the history of life when the enzymatic and receptorial functions were generated, mixing ontogeny, membrane localization, and cell anchorage.
Collapse
Affiliation(s)
- Fabio Malavasi
- Laboratory of Immunogenetics, Department of Genetics, Biology and Biochemistry, University of Torino Medical School, Torino, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Davis MJ, Hanson KA, Clark F, Fink JL, Zhang F, Kasukawa T, Kai C, Kawai J, Carninci P, Hayashizaki Y, Teasdale RD. Differential use of signal peptides and membrane domains is a common occurrence in the protein output of transcriptional units. PLoS Genet 2006; 2:e46. [PMID: 16683029 PMCID: PMC1449889 DOI: 10.1371/journal.pgen.0020046] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2005] [Accepted: 02/10/2006] [Indexed: 11/18/2022] Open
Abstract
Membrane organization describes the orientation of a protein with respect to the membrane and can be determined by the presence, or absence, and organization within the protein sequence of two features: endoplasmic reticulum signal peptides and alpha-helical transmembrane domains. These features allow protein sequences to be classified into one of five membrane organization categories: soluble intracellular proteins, soluble secreted proteins, type I membrane proteins, type II membrane proteins, and multi-spanning membrane proteins. Generation of protein isoforms with variable membrane organizations can change a protein's subcellular localization or association with the membrane. Application of MemO, a membrane organization annotation pipeline, to the FANTOM3 Isoform Protein Sequence mouse protein set revealed that within the 8,032 transcriptional units (TUs) with multiple protein isoforms, 573 had variation in their use of signal peptides, 1,527 had variation in their use of transmembrane domains, and 615 generated protein isoforms from distinct membrane organization classes. The mechanisms underlying these transcript variations were analyzed. While TUs were identified encoding all pairwise combinations of membrane organization categories, the most common was conversion of membrane proteins to soluble proteins. Observed within our high-confidence set were 156 TUs predicted to generate both extracellular soluble and membrane proteins, and 217 TUs generating both intracellular soluble and membrane proteins. The differential use of endoplasmic reticulum signal peptides and transmembrane domains is a common occurrence within the variable protein output of TUs. The generation of protein isoforms that are targeted to multiple subcellular locations represents a major functional consequence of transcript variation within the mouse transcriptome.
Collapse
Affiliation(s)
- Melissa J Davis
- Institute for Molecular Bioscience and ARC Centre in Bioinformatics, University of Queensland, St. Lucia, Queensland, Australia
| | - Kelly A Hanson
- Institute for Molecular Bioscience and ARC Centre in Bioinformatics, University of Queensland, St. Lucia, Queensland, Australia
| | - Francis Clark
- Institute for Molecular Bioscience and ARC Centre in Bioinformatics, University of Queensland, St. Lucia, Queensland, Australia
- Advanced Computational Modeling Centre, University of Queensland, St. Lucia, Queensland, Australia
| | - J. Lynn Fink
- Institute for Molecular Bioscience and ARC Centre in Bioinformatics, University of Queensland, St. Lucia, Queensland, Australia
| | - Fasheng Zhang
- Institute for Molecular Bioscience and ARC Centre in Bioinformatics, University of Queensland, St. Lucia, Queensland, Australia
| | - Takeya Kasukawa
- Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
| | - Chikatoshi Kai
- Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
| | - Jun Kawai
- Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
- Genome Science Laboratory, Discovery Research Institute, RIKEN Wako Institute, Wako, Japan
| | - Piero Carninci
- Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
- Genome Science Laboratory, Discovery Research Institute, RIKEN Wako Institute, Wako, Japan
| | - Yoshihide Hayashizaki
- Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
- Genome Science Laboratory, Discovery Research Institute, RIKEN Wako Institute, Wako, Japan
| | - Rohan D Teasdale
- Institute for Molecular Bioscience and ARC Centre in Bioinformatics, University of Queensland, St. Lucia, Queensland, Australia
| |
Collapse
|