1
|
Alghushairy O, Ali F, Alghamdi W, Khalid M, Alsini R, Asiry O. Machine learning-based model for accurate identification of druggable proteins using light extreme gradient boosting. J Biomol Struct Dyn 2023:1-12. [PMID: 37850427 DOI: 10.1080/07391102.2023.2269280] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/04/2023] [Indexed: 10/19/2023]
Abstract
The identification of druggable proteins (DPs) is significant for the development of new drugs, personalized medicine, understanding of disease mechanisms, drug repurposing, and economic benefits. By identifying new druggable targets, researchers can develop new therapies for a range of diseases, leading to better patient outcomes. Identification of DPs by machine learning strategies is more efficient and cost-effective than conventional methods. In this study, a computational predictor, namely Drug-LXGB, is introduced to enhance the identification of DPs. Features are discovered by composition, transition, and distribution (CTD), composition of K-spaced amino acid pair (CKSAAP), pseudo-position-specific scoring matrix (PsePSSM), and a novel descriptor, called multi-block pseudo amino acid composition (MB-PseAAC). The dimensions of CTD, CKSAAP, PsePSSM, and MB-PseAAC are integrated and utilized the sequential forward selection as feature selection algorithm. The best characteristics are provided by random forest, extreme gradient boosting, and light eXtreme gradient boosting (LXGB). The predictive analysis of these learning methods is measured via 10-fold cross-validation. The LXGB-based model secures the highest results than other existing predictors. Our novel protocol will perform an active role in designing novel drugs and would be fruitful to explore the potential target. This study will help better to capture a more universal view of a potential target.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Omar Alghushairy
- Department of Information Systems and Technology, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
| | - Farman Ali
- Department of Software Engineering, Sarhad University of Science and Information Technology Peshawar Mardan Campus, Peshawar, Pakistan
| | - Wajdi Alghamdi
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Majdi Khalid
- Department of Computer Science, College of Computers and Information Systems, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Raed Alsini
- Department of Information Systems, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Othman Asiry
- Department of Information Technology, College of Computing and Information Technology at Khulais, University of Jeddah, Jeddah, Saudi Arabia
| |
Collapse
|
2
|
Přerovská T, Henke S, Bleha R, Spiwok V, Gillarová S, Yvin JC, Ferrières V, Nguema-Ona E, Lipovová P. Arabinogalactan-like Glycoproteins from Ulva lactuca (Chlorophyta) Show Unique Features Compared to Land Plants AGPs. JOURNAL OF PHYCOLOGY 2021; 57:619-635. [PMID: 33338254 DOI: 10.1111/jpy.13121] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 10/27/2020] [Accepted: 10/30/2020] [Indexed: 05/21/2023]
Abstract
Arabinogalactan proteins (AGPs) encompass a diverse group of plant cell wall proteoglycans, which play an essential role in plant development, signaling, plant-microbe interactions, and many others. Although they are widely distributed throughout the plant kingdom and extensively studied, they remain largely unexplored in the lower plants, especially in seaweeds. Ulva species have high economic potential since various applications were previously described including bioremediation, biofuel production, and as a source of bioactive compounds. This article presents the first experimental confirmation of AGP-like glycoproteins in Ulva species and provides a simple extraction protocol of Ulva lactuca AGP-like glycoproteins, their partial characterization and unique comparison to scarcely described Solanum lycopersicum AGPs. The reactivity with primary anti-AGP antibodies as well as Yariv reagent showed a great variety between Ulva lactuca and Solanum lycopersicum AGP-like glycoproteins. While the amino acid analysis of the AGP-like glycoproteins purified by the β-d-glucosyl Yariv reagent showed a similarity between algal and land plant AGP-like glycoproteins, neutral saccharide analysis revealed unique glycosylation of the Ulva lactuca AGP-like glycoproteins. Surprisingly, arabinose and galactose were not the most prevalent monosaccharides and the most outstanding was the presence of 3-O-methyl-hexose, which has never been described in the AGPs. The exceptional structure of the Ulva lactuca AGP-like glycoproteins implies a specialized adaptation to the marine environment and might bring new insight into the evolution of the plant cell wall.
Collapse
Affiliation(s)
- Tereza Přerovská
- Department of Biochemistry and Microbiology, University of Chemistry and Technology Prague, Technická 3, Prague, 16625, Czech Republic
- Ecole Nationale Supérieure de Chimie de Rennes, CNRS, ISCR-UMR 6226, Univ Rennes, 35000, Rennes, France
| | - Svatopluk Henke
- Department of Carbohydrates and Cerials, University of Chemistry and Technology Prague, Technická 3, Prague, 16625, Czech Republic
| | - Roman Bleha
- Department of Carbohydrates and Cerials, University of Chemistry and Technology Prague, Technická 3, Prague, 16625, Czech Republic
| | - Vojtěch Spiwok
- Department of Biochemistry and Microbiology, University of Chemistry and Technology Prague, Technická 3, Prague, 16625, Czech Republic
| | - Simona Gillarová
- Department of Carbohydrates and Cerials, University of Chemistry and Technology Prague, Technická 3, Prague, 16625, Czech Republic
| | - Jean-Claude Yvin
- Centre Mondial de l'Innovation Roullier, Laboratoire de Nutrition Végétal, 18 Avenue Franklin Roosevelt, Saint-Malo, 35400, France
| | - Vincent Ferrières
- Ecole Nationale Supérieure de Chimie de Rennes, CNRS, ISCR-UMR 6226, Univ Rennes, 35000, Rennes, France
| | - Eric Nguema-Ona
- Centre Mondial de l'Innovation Roullier, Laboratoire de Nutrition Végétal, 18 Avenue Franklin Roosevelt, Saint-Malo, 35400, France
| | - Petra Lipovová
- Department of Biochemistry and Microbiology, University of Chemistry and Technology Prague, Technická 3, Prague, 16625, Czech Republic
| |
Collapse
|
3
|
Ali Shah SM, Taju SW, Ho QT, Nguyen TTD, Ou YY. GT-Finder: Classify the family of glucose transporters with pre-trained BERT language models. Comput Biol Med 2021; 131:104259. [PMID: 33581474 DOI: 10.1016/j.compbiomed.2021.104259] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 02/04/2021] [Accepted: 02/04/2021] [Indexed: 12/14/2022]
Abstract
Recently, language representation models have drawn a lot of attention in the field of natural language processing (NLP) due to their remarkable results. Among them, BERT (Bidirectional Encoder Representations from Transformers) has proven to be a simple, yet powerful language model that has achieved novel state-of-the-art performance. BERT adopted the concept of contextualized word embeddings to capture the semantics and context in which words appear. We utilized pre-trained BERT models to extract features from protein sequences for discriminating three families of glucose transporters: the major facilitator superfamily of glucose transporters (GLUTs), the sodium-glucose linked transporters (SGLTs), and the sugars will eventually be exported transporters (SWEETs). We treated protein sequences as sentences and transformed them into fixed-length meaningful vectors where a 768- or 1024-dimensional vector represents each amino acid. We observed that BERT-Base and BERT-Large models improved the performance by more than 4% in terms of average sensitivity and Matthews correlation coefficient (MCC), indicating the efficiency of this approach. We also developed a bidirectional transformer-based protein model (TransportersBERT) for comparison with existing pre-trained BERT models.
Collapse
Affiliation(s)
- Syed Muazzam Ali Shah
- Department of Computer Science & Engineering, Yuan Ze University, Chungli, 32003, Taiwan
| | - Semmy Wellem Taju
- Department of Computer Science & Engineering, Yuan Ze University, Chungli, 32003, Taiwan
| | - Quang-Thai Ho
- Department of Computer Science & Engineering, Yuan Ze University, Chungli, 32003, Taiwan
| | | | - Yu-Yen Ou
- Department of Computer Science & Engineering, Yuan Ze University, Chungli, 32003, Taiwan.
| |
Collapse
|
4
|
Gogoi D, Ramani S, Bhartari S, Chattopadhyay P, Mukherjee AK. Characterization of active anticoagulant fraction and a fibrin(ogen)olytic serine protease from leaves of Clerodendrum colebrookianum, a traditional ethno-medicinal plant used to reduce hypertension. JOURNAL OF ETHNOPHARMACOLOGY 2019; 243:112099. [PMID: 31326559 DOI: 10.1016/j.jep.2019.112099] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 05/21/2019] [Accepted: 07/16/2019] [Indexed: 06/10/2023]
Abstract
ETHNOPHARMACOLOGICAL RELEVANCE Cardiovascular diseases are the major cause of mortality and morbidity, causing over 17.9 million deaths a year worldwide. Currently used therapy is often having side effects and expensive, dietary interventions and alternative medicines are required. Clerodendrum colebrookianum has been used to treat cardiac hypertension but anticoagulant potency was not evaluated. AIM OF THE STUDY To characterize an active anticoagulant fraction (AAFCC) and a 30 kDa fibrin(ogen)olytic serine protease (clerofibrase) isolated from aqueous leave extract of C. colebrookianum. MATERIALS AND METHODS AAFCC/clerofibrase was subjected to extensive biochemical and pharmacological characterization including LC-MS/MS, amino acid compositional and GC-MS analyses. Interaction between clerofibrase with fibrinogen was studied by spectrofluorometric analysis. In vitro thrombolytic, antiplatelet and cytotoxicity assay were performed. In vivo toxicity, anticoagulant, defibrinogen and antithrombotic activities were determined on Swiss albino mice. RESULTS The in vitro anticoagulant activity of AAFCC was found to be superior to heparin and clerofibrase and comparable to Nattokinase and warfarin. The proteomics and amino acid composition analyses suggest that clerofibrase is a previously uncharacterized novel plant protease capable of degrading the -αβ chains of fibrinogen/fibrin. AAFCC/clerofibrase exerts their anticoagulant action via fibrinogenolytic activity and partially by antiplatelet activity albeit they have no effect on thrombin and FXa inhibition. The spectrofluorometric analysis revealed the binding of clerofibrase to fibrinogen but not to thrombin and FXa. The phytochemical constituents and bioactive components of AAFCC were characterized by biochemical, and GC-MS analyses. The AAFCC and clerofibrase inhibited collagen/ADP-induced mammalian platelet aggregation, showed in vitro thrombolytic activity, and non-cytotoxic to mammalian cells. The AAFCC showed and dose-dependent in vivo plasma defibrinogenating and anticoagulant activities and inhibited k-carrageen-induced thrombus formation in the tails of mice. CONCLUSION The potent in vivo anticoagulant and antithrombotic effects of AAFCC suggests its pharmacological significance as herbal anticoagulant drug for the prevention and/or treatment of hyperfibrinogenemia- and thrombosis associated cardiovascular disorders.
Collapse
Affiliation(s)
- Debananda Gogoi
- Microbial Biotechnology and Protein Research Laboratory, Department of Molecular Biology and Biotechnology, School of Sciences, Tezpur University, Tezpur, 784028, Assam, India
| | - Sheetal Ramani
- Microbial Biotechnology and Protein Research Laboratory, Department of Molecular Biology and Biotechnology, School of Sciences, Tezpur University, Tezpur, 784028, Assam, India
| | - Surbhi Bhartari
- Microbial Biotechnology and Protein Research Laboratory, Department of Molecular Biology and Biotechnology, School of Sciences, Tezpur University, Tezpur, 784028, Assam, India
| | - Pronobesh Chattopadhyay
- Division of Pharmaceutical Technology, Defense Research Laboratory, Tezpur, 784501, Assam, India
| | - Ashis K Mukherjee
- Microbial Biotechnology and Protein Research Laboratory, Department of Molecular Biology and Biotechnology, School of Sciences, Tezpur University, Tezpur, 784028, Assam, India.
| |
Collapse
|
5
|
Majumdar S, Dutta S, Das T, Chattopadhyay P, Mukherjee AK. Antiplatelet and antithrombotic activity of a fibrin(ogen)olytic protease from Bacillus cereus strain FF01. Int J Biol Macromol 2015; 79:477-89. [PMID: 25964180 DOI: 10.1016/j.ijbiomac.2015.04.075] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2015] [Revised: 04/26/2015] [Accepted: 04/28/2015] [Indexed: 11/25/2022]
Abstract
Fibrin(ogen)olytic enzymes offer great promise for the treatment of thrombosis associated disorders. The present study describes the characterization of an extracellular fibrin(ogen)olytic serine protease (named Bacethrombase) purified from the Bacillus cereus strain FF01. The molecular mass of the Bacethrombase was determined by sodium dodecyl sulfate polyacrylamide gel electrophoresis and matrix assisted laser desorption/ionization-time-of-flight-mass spectroscopy analyses at 39.5 kDa and 38,450.51 Da, respectively. The peptide mass fingerprinting and analyses of the composition of the amino acids revealed the similarity of the Bacethrombase to the bacterial serine proteases. The secondary structure of the Bacethrombase was composed of 14% helix, 6.6% beta-sheet, and 79.4% random coil. Bacethrombase was found to contain 48% sialic acid and it preferentially degraded the Aα-chain of fibrinogen, as well as fibrin. The anticoagulant potency of the Bacethrombase was comparable with that of warfarin and heparin, and was corroborated by its fibrinogenolytic activity rather than the inhibition of thrombin, prothrombin or FXa. Bacethrombase demonstrated antiplatelet activity, and dose-dependently inhibited the ADP-induced platelet aggregation. Bacethrombase (10 mg/kg) did not show toxicity after i.v. administration in Wistar rats; however, it revealed an in vivo anticoagulant effect and significantly inhibited the carrageenan-induced in vivo thrombus formation in rats.
Collapse
Affiliation(s)
- Sourav Majumdar
- Microbial Biotechnology and Protein Research Laboratory, Department of Molecular Biology and Biotechnology, School of Sciences, Tezpur University, Tezpur 784028, Assam, India
| | - Sumita Dutta
- Microbial Biotechnology and Protein Research Laboratory, Department of Molecular Biology and Biotechnology, School of Sciences, Tezpur University, Tezpur 784028, Assam, India
| | - Tanusree Das
- Indian Institute of Chemical Biology, 4, Raja S.C. Mullick Road, Kolkata 700032, West Bengal, India
| | - Pronobesh Chattopadhyay
- Division of Pharmaceutical Technology, Defense Research Laboratory, Tezpur 784001, Assam, India
| | - Ashis K Mukherjee
- Microbial Biotechnology and Protein Research Laboratory, Department of Molecular Biology and Biotechnology, School of Sciences, Tezpur University, Tezpur 784028, Assam, India.
| |
Collapse
|
6
|
Characterization, mechanism of anticoagulant action, and assessment of therapeutic potential of a fibrinolytic serine protease (Brevithrombolase) purified from Brevibacillus brevis strain FF02B. Biochimie 2014; 103:50-60. [PMID: 24735708 DOI: 10.1016/j.biochi.2014.04.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2014] [Accepted: 04/01/2014] [Indexed: 12/27/2022]
Abstract
In this study, biochemical and pharmacological characterization of Brevithrombolase, a fibrinolytic serine protease purified from Brevibacillus brevis strain FF02B has been reported. An assessment of its thrombolytic potency has also been made. The molecular mass of this monomeric protease was determined as 55 kDa, and 56043 Da, respectively, by SDS-PAGE and MALDI-TOF-MS. In the analytical studies, the N-terminal sequence of Brevithrombolase was found to be blocked; however, the peptide mass fingerprinting and amino acid composition analyses demonstrated the similarity of Brevithrombolase with endopeptidases in possessing serine in their catalytic triad. This finding was confirmed by the observation that the serine protease inhibitors decrease the catalytic (fibrinolytic) activity of Brevithrombolase. The secondary structure of Brevithrombolase was composed of 30.6% alpha helix and 69.4% random coil. Brevithrombolase showed the Km and Vmax values towards the chromogenic substrate for plasmin at 0.39 mM and 14.3 μmol/min, respectively. Brevithrombolase demonstrated optimum fibrinolytic activity at pH 7.4 and 37 °C, and showed marginal hydrolytic activity towards globulin, casein and fibrinogen. The anticoagulant potency of Brevithrombolase was comparable to the low molecular mass heparin/antithrombin-III and warfarin. Among the three enzymes-Brevithrombolase, plasmin and streptokinase-the fibrinolytic activity and in vitro thrombolytic potency of Brevithrombolase was found to be superior. The RP-HPLC and SDS-PAGE analyses suggested a similar pattern of fibrin degradation by Brevithrombolase and plasmin, indicating that former enzyme is a plasmin-like fibrinolytic serine protease. Brevithrombolase did not show in vitro cytotoxicity on HT29 and HeLa cells or hemolytic activity. At a dose of 10 mg/kg, Brevithrombolase did not exhibit lethality or toxicity on Wistar strain albino rats. Brevithrombolase did not inhibit factor Xa, and its mechanism of anticoagulant action was associated with the enzymatic cleavage of thrombin. The combined properties of Brevithrombolase indicate its therapeutic potential in peptide-based cardiovascular drug development.
Collapse
|
7
|
Rodrigues JR, Couto A, Cabezas A, Pinto RM, Ribeiro JM, Canales J, Costas MJ, Cameselle JC. Bifunctional homodimeric triokinase/FMN cyclase: contribution of protein domains to the activities of the human enzyme and molecular dynamics simulation of domain movements. J Biol Chem 2014; 289:10620-10636. [PMID: 24569995 DOI: 10.1074/jbc.m113.525626] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Mammalian triokinase, which phosphorylates exogenous dihydroxyacetone and fructose-derived glyceraldehyde, is neither molecularly identified nor firmly associated to an encoding gene. Human FMN cyclase, which splits FAD and other ribonucleoside diphosphate-X compounds to ribonucleoside monophosphate and cyclic X-phosphodiester, is identical to a DAK-encoded dihydroxyacetone kinase. This bifunctional protein was identified as triokinase. It was modeled as a homodimer of two-domain (K and L) subunits. Active centers lie between K1 and L2 or K2 and L1: dihydroxyacetone binds K and ATP binds L in different subunits too distant (≈ 14 Å) for phosphoryl transfer. FAD docked to the ATP site with ribityl 4'-OH in a possible near-attack conformation for cyclase activity. Reciprocal inhibition between kinase and cyclase reactants confirmed substrate site locations. The differential roles of protein domains were supported by their individual expression: K was inactive, and L displayed cyclase but not kinase activity. The importance of domain mobility for the kinase activity of dimeric triokinase was highlighted by molecular dynamics simulations: ATP approached dihydroxyacetone at distances below 5 Å in near-attack conformation. Based upon structure, docking, and molecular dynamics simulations, relevant residues were mutated to alanine, and kcat and Km were assayed whenever kinase and/or cyclase activity was conserved. The results supported the roles of Thr(112) (hydrogen bonding of ATP adenine to K in the closed active center), His(221) (covalent anchoring of dihydroxyacetone to K), Asp(401) and Asp(403) (metal coordination to L), and Asp(556) (hydrogen bonding of ATP or FAD ribose to L domain). Interestingly, the His(221) point mutant acted specifically as a cyclase without kinase activity.
Collapse
Affiliation(s)
- Joaquim Rui Rodrigues
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain; Escola Superior de Tecnologia e Gestão, Instituto Politécnico de Leiria, P-2411-901 Leiria, Portugal
| | - Ana Couto
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain
| | - Alicia Cabezas
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain
| | - Rosa María Pinto
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain
| | - João Meireles Ribeiro
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain
| | - José Canales
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain
| | - María Jesús Costas
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain
| | - José Carlos Cameselle
- Grupo de Enzimología, Departamento de Bioquímica y Biología Molecular y Genética, Facultad de Medicina, Universidad de Extremadura, E-06006 Badajoz, Spain.
| |
Collapse
|
8
|
Abstract
Amino acid analysis is used to determine the amino acid content of amino acid-, peptide- and protein-containing samples. With minor exceptions, proteins are long linear polymers of amino acids connected to each other via peptide bonds. The first step of amino acid analysis involves hydrolyzing these peptide bonds. The liberated amino acids are then separated, detected, and quantified. The method was first developed by Moore, Stein and coworkers in the 1950s using HCl acid hydrolysis, and, despite considerable effort by many workers, the basic methodology remains relatively unchanged. This unit provides an overview and strategic planning for amino acid analysis, discussing a range of methodologies and issues. In addition, several common methods used for analysis of L-amino acids are described in detail, including: HCl acid hydrolysis, performic acid oxidation for methionine and cysteine analysis, base hydrolysis for tryptophan analysis, analysis of free amino acids, and analysis of reactive lysine.
Collapse
|
9
|
Abstract
OBJECTIVE Estrogen is an important female hormone. The estrogen receptor (ER) plays a critical role in the development of breast cancer. Mutations within the ER can have clinically significant consequences. MATERIALS AND METHODS Identification of sites vulnerable to mutation is a new trend aimed at extending our knowledge of diseases at the genomic and proteomic levels. In this study, bioinformatics analysis was performed to determine the positions corresponding to specific peptide motifs in the amino acid sequence of the ER. A new computational tool called GlobPlot was used to identify the weak linkage positions within the ER. RESULTS The results allowed the identification of mutation-resistant positions. CONCLUSION This study showed that the weak linkages within the ER could be identified, and these could provide the basis for further studies aimed at predicting possible new ER mutations.
Collapse
|
10
|
Wiwanitkit V. FHM3 in familial hemiplegic migraine is more resistant to mutation than FHM1 and FHM2. J Neurol Sci 2009; 277:76-9. [DOI: 10.1016/j.jns.2008.10.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2008] [Accepted: 10/14/2008] [Indexed: 10/21/2022]
|
11
|
Wiwanitkit V. Weak linkage in insulin receptor: Identification of mutation prone point. Diabetes Metab Syndr 2008. [DOI: 10.1016/j.dsx.2008.02.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
12
|
Wiwanitkit V. Mutation-prone points in thrombin receptor. J Thromb Thrombolysis 2007; 25:190-2. [PMID: 18064407 DOI: 10.1007/s11239-007-0167-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/24/2007] [Accepted: 10/30/2007] [Indexed: 11/24/2022]
Abstract
BACKGROUND Thrombin receptor (TR) is a G-protein-coupled receptor that transmits cellular responses to coagulant proteases in a variety of cell types in the vasculature and other tissues. Mutation within TR can be seen. Presently, the prediction of protein nanostructure and function is a great challenge in the proteomics and structural genomics era. METHODS To identify the points vulnerable to mutation is a new trend directed at expanding the knowledge on disorders in genomic and proteomic levels of diseases. In this paper, the author performed a bioinformatics analysis to find the mutation-prone positions in the amino acid sequence of TR. To identify those points in TR, a new bioinformatics tool, namely, GlobPlot was used. RESULTS According to this work, no position was identified to be resistant to mutation. CONCLUSION This means that TR is a very highly genetically unstable molecule. Thousands of types of mutation can be expected. Of interest, only a few sense mutations are mentioned in clinical settings. Therefore, many occulted sense mutations might still be detected.
Collapse
Affiliation(s)
- Viroj Wiwanitkit
- Department of Laboratory Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok 10330, Thailand.
| |
Collapse
|
13
|
Wiwanitkit V. Weak linkage in androgen receptor: identification of mutation-prone points. Fertil Steril 2007; 91:e1-3. [PMID: 17936280 DOI: 10.1016/j.fertnstert.2007.06.089] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2007] [Revised: 06/26/2007] [Accepted: 06/26/2007] [Indexed: 11/30/2022]
Abstract
OBJECTIVE Mutation within androgen receptor (AR) can be seen. Presently, prediction of protein nanostructure and function is a great challenge in the proteomics and structural genomics era. To identify the points vulnerable to mutation is a new trend to expand the knowledge on disorders in genomic and proteomic level of diseases. DESIGN The author performed a bioinformatics analysis to study the determined positions that correspond with peptide motifs in the amino acid sequence of AR. INTERVENTION(S) To identify the weak linkages in AR, a new bioinformatics tool, GlobPlot, was used. MAIN OUTCOME MEASURE(S) Mutation-sensitive points. RESULT(S) According to this work, the positions resistant to mutation are identified. CONCLUSION(S) Based on this study, the weak linkages in AR can be identified and can be good information for the prediction of possible new mutations that can result in phenotype abnormality. In addition, the results from this study can be good information for further research on the diagnosis for AR abnormalities and new treatment.
Collapse
Affiliation(s)
- Viroj Wiwanitkit
- Department of Laboratory Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
| |
Collapse
|
14
|
Salchert K, Pompe T, Sperling C, Werner C. Quantitative analysis of immobilized proteins and protein mixtures by amino acid analysis. J Chromatogr A 2003; 1005:113-22. [PMID: 12924786 DOI: 10.1016/s0021-9673(03)00932-4] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Biomolecular surface engineering of materials often requires precise, versatile and efficient quantification of immobilized proteins at solid surfaces. Acidic hydrolysis of surface-bound proteins and subsequent HPLC analysis of fluorescence-derivatized amino acids were adapted and critically evaluated for that purpose. Contaminations and concentration-dependent amino acid retrieval during HPLC were found to influence the accuracy of the method. In addition to the choice of adequate conditions for hydrolysis, derivatization and chromatographic separation extensions of the data evaluation were suggested to improve the accuracy of the approach when applied to single protein systems: comparing the experimentally obtained amino acid ratio to the protein constitution enabled to identify the properly separated and detected amino acids. Those amino acids were selected for a more precise calculation of the amount of immobilized protein. To further increase the accuracy of the method, the retrieval of amino acids corresponding to protein amounts in the range between 0.5 and 4.0 microg was analyzed for a variety of proteins of interest to derive protein-specific correction factors. The evaluation of amino acid data was furthermore applied to quantify binary protein mixtures at similar settings. This method was proven useful to detect the composition of protein mixtures throughout a wide range of absolute and relative concentrations.
Collapse
Affiliation(s)
- Katrin Salchert
- Institut für Polymerforschung Dresden e.V. Abteilung Biokompatible Materialien, Hohe Str. 6, 01069 Dresden, Germany
| | | | | | | |
Collapse
|
15
|
Abstract
Proteomic tools measure gene expression, protein activity and interactions of biological events at the protein level. Proteins are the major catalysts of biological functions and contain several dimensions of information that collectively indicate the actual rather than the potential functional state as indicated by mRNA analysis. Measurements can be made in terms of protein quantity, location, and time-point. For the future we see a further integration of existing and new technologies for proteomics from a wide range of areas of biochemistry, chemistry, physics, computing science and molecular biology. This will further advance our knowledge of how biological systems are built up and what mechanisms control these systems. However, the potential of proteomics to comprehensively answer all biological questions is limited as only protein activity is measured. A unification of genomics, proteomics, and other technologies is needed if we are to start to understand the complexity of biological function in the context of disease and health.
Collapse
Affiliation(s)
- Valerie C Wasinger
- Mass Spectrometry and Protein Analysis Laboratory, Garvan Institute of Medical Research, 384 Victoria Street, Sydney, NSW 2010, Australia.
| | | |
Collapse
|
16
|
Affiliation(s)
- K Paul-Pletzer
- Department of Anesthesia, UMDNJ-Robert Wood Johnson Medical School, Piscataway, New Jersey 08854, USA.
| | | |
Collapse
|
17
|
Dürauer A, Csaszar E, Mechtler K, Jungbauer A, Schmid E. Characterisation of the rubber elongation factor from ammoniated latex by electrophoresis and mass spectrometry. J Chromatogr A 2000; 890:145-58. [PMID: 10976802 DOI: 10.1016/s0021-9673(00)00241-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Rubber elongation factor (REF) is considered as one of the major allergens present in latex. An extraction and purification protocol for preparation of REF standards has been modified. A protein fraction was extracted from ammoniated latex sap and purified by gel filtration chromatography. The purified and concentrated proteins were separated by sodium dodecylsulfate-polyacrylamide gel electrophoresis into two major bands. These bands were further characterised by matrix-assisted laser desorption/ionisation time-of-flight and nano-electrospray ionization mass spectrometry. REF and a truncated form could be ascertained by the mass and fragmentation pattern of the tryptic peptides. In the faster migrating band an additional peptide could be identified. This peptide is also present in Hevb3 and a Mr 27000 latex allergen. Our findings indicate that conventional REF preparations as standards may contain additional allergenic proteins.
Collapse
Affiliation(s)
- A Dürauer
- Institute of Applied Microbiology, University of Agricultural Sciences, Vienna, Austria
| | | | | | | | | |
Collapse
|
18
|
Murayama K, Shindo N, Suzuki R, Kawakami M, Mineki R, Taka H, Kazuno S, Nagata K, Maruyama T, Tanokura M. Characterization of native and recombinant peptidyl prolyl cis-trans isomerases derived from Methanococcus thermolithotrophicus based on cDNA sequence. Electrophoresis 2000; 21:1733-9. [PMID: 10870959 DOI: 10.1002/(sici)1522-2683(20000501)21:9<1733::aid-elps1733>3.0.co;2-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
It is important to establish whether a recombinant protein is an authentic copy of the predicted cDNA sequence. In this study, recombinant protein for native peptidyl prolyl cis-trans isomerase (N-PPIase) and double-labeled (13C- and 15N-) protein (DL-PPIase) appeared on the sodium dodecyl sulfate (SDS) electropherograms as two bands for N-PPIase and four bands for DL-PPIase. Since the N-terminal amino acid residues of all bands were the same, we characterized these bands using the peptide mapping method and amino acid composition analysis. Peptide mapping of the proteins seemed to be almost identical but they could not reflect the whole amino acid sequences of the protein. The bands on the polyvinylidene difluoride (PVDF) membrane, electroblotted after SDS-polyacrylamide gel electrophoresis (SDS-PAGE), were hydrolyzed and their amino acid composition was analyzed using a highly sensitive 6-aminoquinolyl-N-hydroxysuccinimidyl carbamate (AQC) amino acid analysis and compared with the cDNA sequences for proteins. The matching score (sigma(T%-E%)2) for similarity of proteins was calculated by summation of the square difference between the theoretical (T%) and the experimental (E%) amino acid composition of the recombinant protein. The amino acid composition of all bands of both proteins showed more than 93% of the theoretical values. The major molecular weights of both proteins were 16812 and 17694 by electrospray ionization (ESI)-mass spectrometry. However, the purified proteins also contained minor compounds with Mr of 3721 for N-PPIase and 5285 for DL-PPIase. These compounds were considered to be nonpeptidyl products that comigrated with the protein. Similarities of the amino acid composition of the four bands were more than 98%. Our results indicate that AQC amino acid analysis is the most suitable method for characterization of a recombinant protein.
Collapse
Affiliation(s)
- K Murayama
- Division of Biochemical Analysis, Central Laboratory of Medical Sciences, Juntendo University School of Medicine, Tokyo, Japan.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
|
20
|
Abstract
Amino acid composition analysis is a classical protein analysis method, which finds a wide application in medical and food science research and is indispensable for protein quantification. It is a complex technique, comprising two steps, hydrolysis of the substrate and chromatographic separation and detection of the residues. A properly performed hydrolysis is a prerequisite of a successful analysis. The most significant developments of the technology in the last decade consist in the (i) reduction of the hydrolysis time by the use of microwave radiation energy; (ii) improvement in the sensitivity of the residue detection, the quantification of the sensitive residues and separation of the enantiomeric forms of the amino acids; (iii) application of amino acid analysis in the large-scale protein identification by database search; and (iv) gradual replacement of the original ion exchange residue separation by reversed-phase high-performance liquid chromatography. Amino acid analysis is currently facing an enormous competition in the determination of the identity of proteins and amino acid homologs by the essentially faster mass spectrometry techniques. The amino acid analysis technology needs further simplification and automation of the hydrolysis, chromatography and detection steps to withstand the pressure exerted by the other technologies.
Collapse
Affiliation(s)
- M Fountoulakis
- F. Hoffman-La Roche Ltd., Pharma Division, Preclinical Central System--Gene Technology, Basel, Switzerland.
| | | |
Collapse
|
21
|
Shindo N, Fujimura T, Nojima-Kazuno S, Mineki R, Furusawa S, Sasaki K, Murayama K. Identification of multidrug resistant protein 1 of mouse leukemia P388 cells on a PVDF membrane using 6-aminoquinolyl-carbamyl (AQC)-amino acid analysis and World Wide Web (WWW)-accessible tools. Anal Biochem 1998; 264:251-8. [PMID: 9866690 DOI: 10.1006/abio.1998.2853] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Multidrug resistant protein 1 (MDR1) in a doxorubicin-resistant mouse leukemia cell line (P388/DOX) was identified using its amino acid composition combined with protein database searching (ExPASy and EMBL PROPSEARCH) via the World Wide Web. The proteins were separated by one-dimensional SDS-polyacrylamide gel electrophoresis, blotted onto a polyvinylidene fluoride membrane, and stained with Coomassie brilliant blue. A 160-kDa protein band was acid-hydrolyzed in the vapor phase (6 N HC1) and converted to 6-aminoquinolyl-carbamyl (AQC)-amino acids without extraction of the amino acids from the membrane. The amino acid composition of the protein was determined using the sensitive AQC-amino acid analysis method, improving our previously described method. The improved method involved using a Cosmosil 5C8-MS column instead of a Pegasil C8; replacement of the mobile phase A, constituent, 75 mM ammonium phosphate (pH 7.5), with 30 mM sodium phosphate buffer (pH 7.2); and slight modification of the separation program (9). All manipulations for protein hydrolysis and AQC derivatization were carried out in a hood using clean tools. This minimized contamination of amino acids at the low femtomolar level. A database search was carried out with bovine serum albumin as a calibration protein. MDR1 in P388/DOX was ranked first by both databases with high reliability (score 14 for ExPASy, distance 1.34 for EMBL).
Collapse
MESH Headings
- ATP Binding Cassette Transporter, Subfamily B, Member 1/chemistry
- ATP Binding Cassette Transporter, Subfamily B, Member 1/isolation & purification
- Amino Acids/analysis
- Aminoquinolines
- Animals
- Calibration
- Carbamates
- Chromatography, High Pressure Liquid
- Databases, Factual
- Electrophoresis, Polyacrylamide Gel
- Hydrolysis
- Internet
- Leukemia P388
- Membranes, Artificial
- Mice
- Molecular Weight
- Polyvinyls
- Reproducibility of Results
- Sensitivity and Specificity
- Sequence Homology, Amino Acid
- Serum Albumin, Bovine
- Specimen Handling
- Tumor Cells, Cultured
Collapse
Affiliation(s)
- N Shindo
- Division of Biochemical Analysis, Juntendo University School of Medicine, Tokyo, Japan.
| | | | | | | | | | | | | |
Collapse
|
22
|
|
23
|
Fountoulakis M, Takács B. Design of protein purification pathways: application to the proteome of Haemophilus influenzae using heparin chromatography. Protein Expr Purif 1998; 14:113-9. [PMID: 9758758 DOI: 10.1006/prep.1998.0943] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The design of efficient protein purification protocols is a scientific challenge and can be facilitated by the use of master protein enrichment or purification steps. A master purification step is in principle a list including the major proteins expected to be present in the various fractions collected from a particular enrichment process. Consideration of a master step in the design of a purification pathway can reduce the time and effort usually invested in "trial and error" approaches. Moreover, master purification steps will certainly become valuable tools in the isolation of proteins today designated as hypothetical or predicted coding regions, resulting from the sequencing of the various genomes, and for which no information on their purification exists. The construction of such a master step consists of performance of the protein separation by the particular chromatographic method, analysis by two-dimensional polyacrylamide gel electrophoresis, and identification by mass spectrometry of the proteins present in each fraction. This can be efficiently accomplished now thanks to developments in two-dimensional gel technology and large-scale sample throughput mass spectrometry. Here we present an example of construction of a master protein enrichment step using the soluble protein fraction of the bacterium Haemophilus influenzae and applying Heparin chromatography.
Collapse
Affiliation(s)
- M Fountoulakis
- Pharmaceutical Division, F. Hoffmann-La Roche Ltd., Basel, Switzerland
| | | |
Collapse
|
24
|
Arnott D, O'Connell KL, King KL, Stults JT. An integrated approach to proteome analysis: identification of proteins associated with cardiac hypertrophy. Anal Biochem 1998; 258:1-18. [PMID: 9527842 DOI: 10.1006/abio.1998.2566] [Citation(s) in RCA: 111] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Hypertrophy of cardiac myocytes is a primary response of the heart to overload, and is an independent predictor of heart failure and death. Distinct cellular phenotypes are associated with hypertrophy resulting from different causes. These phenotypes have been described by others at the molecular level by analysis of gene transcription patterns. An alternative approach is the analysis of large-scale protein expression patterns (the proteome) by two-dimensional polyacrylamide gel electrophoresis. Realization of this goal requires the ability to rigorously analyze complex 2D gel images, efficiently digest individual gel isolated proteins (especially those expressed at low levels), and analyze the resulting peptides with high sensitivity for rapid database searches. We have undertaken to improve the technology and experimental approaches to these challenges in order to effectively study a cell culture model for cardiac hypertrophy. The 2D gel patterns for cell lysates from multiple samples of cardiac myocytes with or without phenylephrine-induced hypertrophy were analyzed and spots which changed in abundance with statistical significance were located. Eleven such spots were identified using improved procedures for in-gel digestion of silver-stained proteins and high-sensitivity mass spectrometry. The incorporation of low levels of sodium dodecyl sulfate into the digestion buffer improved peptide recovery. The combination of matrix-assisted laser desorption mass spectrometry for initial measurements and capillary liquid chromatography-ion trap mass spectrometry for peptide sequence determination yielded efficient protein identification. The integration of 2D gel image analysis and routine identification of proteins present in gels at the subpicomole level represents a general model for proteome studies relating genomic sequence with protein expression patterns.
Collapse
Affiliation(s)
- D Arnott
- Protein Chemistry Department, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, USA
| | | | | | | |
Collapse
|
25
|
Weiss M, Manneberg M, Juranville JF, Lahm HW, Fountoulakis M. Effect of the hydrolysis method on the determination of the amino acid composition of proteins. J Chromatogr A 1998; 795:263-75. [PMID: 9528102 DOI: 10.1016/s0021-9673(97)00983-7] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Fast and reproducible separation and determination of amino acids serves the economical and reliable characterization and quantification of peptides and proteins as well as the identification of proteins by amino acid composition analysis on a large-scale. A prerequisite of a successful compositional analysis is a complete hydrolysis of the peptides and proteins and a quantitative recovery of the residues in the hydrolyzate. We investigated the effect of different acid-hydrolysis methods on the compositional analysis of known proteins in solution and after blotting onto polyvinylidene difluoride membranes and worked out the conditions for the processing of large numbers of samples. The reliability of each method was studied by introducing the analysis data into the AACompIdent software and deducing the protein identification scores. All acid-hydrolysis methods delivered reliable analysis data. The most accurate data were provided by conventional, thermal hydrolysis of proteins in solution in the presence of methanesulfonic acid, closely followed by hydrolysis with hydrochloric acid and microwave radiation-dependent hydrolysis with hydrochloric or methanesulfonic acid, respectively. For blotted proteins, conventional hydrolysis delivered more accurate analysis data in comparison with the microwave radiation-induced hydrolysis. The extraction of the residues from the membrane hydrolyzate was a critical step for unambiguous protein identification. Microwave radiation-induced hydrolysis was responsible for a higher degree of racemization of the residues.
Collapse
Affiliation(s)
- M Weiss
- F. Hoffmann-La Roche Ltd., Pharmaceutical Research-Gene Technologies, Basel, Switzerland
| | | | | | | | | |
Collapse
|
26
|
Fountoulakis M, Juranville JF, Berndt P. Large-scale identification of proteins of Haemophilus influenzae by amino acid composition analysis. Electrophoresis 1997; 18:2968-77. [PMID: 9504837 DOI: 10.1002/elps.1150181539] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Two-dimensional protein maps of microorganisms are useful tools for elucidation and detection of target proteins, a process essential in the development of new pharmaceutical products. We applied amino acid composition analysis, following separation by two-dimensional gel electrophoresis, for large-scale identification of proteins of Haemophilus influenzae. H. influenzae is a bacterium of pharmaceutical interest of which the entire genome, comprising approximately 1700 open reading frames, has been sequenced. For amino acid analysis, we used both precolumn derivatization of amino acids followed by reversed-phase chromatography of the derivatized residues and post-column derivatization of the residues previously separated on an ion exchanger. The composition analyses derived from both methods allowed the identification of 110 protein spots. The proteins were identified using the AACompldent software on the ExPASy server accessible via the World Wide Web with a success rate of 52%. In some cases, introduction of the analysis data of 12 residues was sufficient for a correct identification. Proteins which contained an unusually high percentage of one residue could be unambiguously identified. Amino acid composition analysis proved to be an error-robust, efficient method for protein identification. The method can be practically established in every biochemical laboratory and, complementary to mass spectrometry, represents an important analytical tool for the mapping of the proteomes of organisms of interest.
Collapse
Affiliation(s)
- M Fountoulakis
- F. Hoffmann-La Roche Ltd., Pharmaceutical Research Gene Technologies, Basel, Switzerland.
| | | | | |
Collapse
|
27
|
Humphery-Smith I, Cordwell SJ, Blackstock WP. Proteome research: complementarity and limitations with respect to the RNA and DNA worlds. Electrophoresis 1997; 18:1217-42. [PMID: 9298643 DOI: 10.1002/elps.1150180804] [Citation(s) in RCA: 164] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
A methodological overview of proteome analysis is provided along with details of efforts to achieve high-throughput screening (HTS) of protein samples derived from two-dimensional electrophoresis gels. For both previously sequenced organisms and those lacking significant DNA sequence information, mass spectrometry has a key role to play in achieving HTS. Prototype robotics designed to conduct appropriate chemistries and deliver 700-1000 protein (genes) per day to batteries of mass spectrometers or liquid chromatography (LC)-based analyses are well advanced, as are efforts to produce high density gridded arrays containing > 1000 proteins on a single matrix assisted laser desorption ionisation/time-of-flight (MALDI-TOF) sample stage. High sensitivity HTS of proteins is proposed by employing principally mass spectrometry in an hierarchical manner: (i) MALDI-TOF-mass spectrometry (MS) on at least 1000 proteins per day; (ii) electrospray ionisation (ESI)/MS/MS for analysis of peptides with respect to predicted fragmentation patterns or by sequence tagging; and (iii) ESI/MS/MS for peptide sequencing. Genomic sequences when complemented with information derived from hybridisation assays and proteome analysis may herald in a new era of holistic cellular biology. The current preoccupation with the absolute quantity of gene-product (RNA and/or protein) should move backstage with respect to more molecularly relevant parameters, such as: molecular half-life; synthesis rate; functional competence (presence or absence of mutations); reaction kinetics; the influence of individual gene-products on biochemical flux; the influence of the environment, cell-cycle, stress and disease on gene-products; and the collective roles of multigenic and epigenetic phenomena governing cellular processes. Proteome analysis is demonstrated as being capable of proceeding independently of DNA sequence information and aiding in genomic annotation. Its ability to confirm the existence of gene-products predicted from DNA sequence is a major contribution to genomic science. The workings of software engines necessary to achieve large-scale proteome analysis are outlined, along with trends towards miniaturisation, analyte concentration and protein detection independent of staining technologies. A challenge for proteome analysis into the future will be to reduce its dependence on two-dimensional (2-D) gel electrophoresis as the preferred method of separating complex mixtures of cellular proteins. Nonetheless, proteome analysis already represents a means of efficiently complementing differential display, high density expression arrays, expressed sequence tags, direct or subtractive hybridisation, chromosomal linkage studies and nucleic acid sequencing as a problem solving tool in molecular biology.
Collapse
Affiliation(s)
- I Humphery-Smith
- University of Sydney, Centre for Proteome Research and Gene-Product Mapping, National Innovation Centre, Eveleigh, Australia.
| | | | | |
Collapse
|
28
|
Garrels JI, McLaughlin CS, Warner JR, Futcher B, Latter GI, Kobayashi R, Schwender B, Volpe T, Anderson DS, Mesquita-Fuentes R, Payne WE. Proteome studies of Saccharomyces cerevisiae: identification and characterization of abundant proteins. Electrophoresis 1997; 18:1347-60. [PMID: 9298649 DOI: 10.1002/elps.1150180810] [Citation(s) in RCA: 117] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Two-dimensional (2-D) gel electrophoresis can now be coupled with protein identification techniques and genome sequence information for direct detection, identification, and characterization of large numbers of proteins from microbial organisms. 2-D electrophoresis, and new protein identification techniques such as amino acid composition, are proteome research techniques in that they allow direct characterization of many proteins at the same time. Another new tool important for yeast proteome research is the Yeast Protein Database (YPD), which provides the sequence-derived protein properties needed for spot identification and tabulations of the currently known properties of the yeast proteins. Studies presented here extend the yeast 2-D protein map to 169 identified spots based upon the recent completion of the yeast genome sequence, and they show that methods of spot identification based on predicted isoelectric point, predicted molecular mass, and determination of partial amino acid composition from radiolabeled gels are powerful enough for the identification of at least 80% of the spots representing abundant proteins. Comparison of proteins predicted by YPD to be detectable on 2-D gels based on calculated molecular mass, isoelectric point and codon bias (a predictor of abundance) with proteins identified in this study suggests that many glycoproteins and integral membrane proteins are missing from the 2-D gel patterns. Using the 2-D gel map and the information available in YDP, 2-D gel experiments were analyzed to characterize the yeast proteins associated with: (i) an environmental change (heat shock), (ii) a temperature-sensitive mutation (the prp2 mRNA splicing mutant), (iii) a mutation affecting post-translational modification (N-terminal acetylation), and (iv) a purified subcellular fraction (the ribosomal proteins). The methods used here should allow future extension of these studies to many more proteins of the yeast proteome.
Collapse
|
29
|
Cordwell SJ, Basseal DJ, Humphery-Smith I. Proteome analysis of Spiroplasma melliferum (A56) and protein characterisation across species boundaries. Electrophoresis 1997; 18:1335-46. [PMID: 9298648 DOI: 10.1002/elps.1150180809] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Spiroplasma melliferum (Class: Mollicutes) is a wall-less, helical bacterium with a genome of approximately 1460 kbp encoding 800-1000 gene-products. A two-dimensional electrophoresis gel reference map of S. melliferum was produced by Phoretix 2-D gel software analysis of eight high quality gels. The reference map showed 456 silver-stained and replicated protein spots. 156 proteins (34% of visible protein spots) from S. melliferum were further characterised by one, or a combination, of the following: amino acid analysis, peptide-mass fingerprinting via matrix assisted laser desorption ionisation-time of flight (MALDI-TOF) mass spectrometry, and N-terminal protein microsequencing. Proteins with close relationship to those previously determined from other species were identified across species barriers. Thus, this study represents the first larger-scale analysis of a proteome based upon the attribution of predominantly 'unique numerical parameters' for protein characterisation across species boundaries, as opposed to a sequence-based approach. This approach allowed all database entries to be screened for homology, as is currently the case for studies based on nucleic acid or protein sequence information. Several proteins studied from this organism were identified as hypothetical, or having no close homolog already present in the databases. Gene-products from major families such as glycolysis, translation, transcription, cellular processes, energy metabolism and protein synthesis were identified. Several gene-products characterised in S. melliferum were not previously found in studies of the entire Mycoplasma genitalium and Mycoplasma pneumoniae (both closely related Mollicutes) genomes. The presence of such gene-products in S. melliferum is discussed in terms of genome size as compared with the smallest known free-living organisms. Finally, the levels of expression of S. melliferum gene-products were determined with respect to total optical intensity associated with all visible proteins expressed in exponentially grown cells.
Collapse
Affiliation(s)
- S J Cordwell
- Centre of Proteome Research and Gene-Product Mapping, National Innovation Centre, Eveleigh, Australia
| | | | | |
Collapse
|
30
|
Cordwell SJ, Basseal DJ, Bjellqvist B, Shaw DC, Humphery-Smith I. Characterisation of basic proteins from Spiroplasma melliferum using novel immobilised pH gradients. Electrophoresis 1997; 18:1393-8. [PMID: 9298653 DOI: 10.1002/elps.1150180814] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) has become the method of choice for efficient separation of complex protein mixtures. Previously, analysis of the Spiroplasma melliferum proteome (protein complement of a genome) has been performed with pH 3-10 and narrow range pH 4-7 IPG gel strips. We report here on the use of novel 18 cm basic (pH 6-11) immobilised pH gradients (IPG) to increase the resolution of protein spots visible within 2-D gels. These gradients were synthesised to emulate the gradient of commercially available IPG gel strips in a 5 cm region of overlap so as to attempt construction of a more complete map of cellular protein expression. Approximately 50 additional gene products were detected from S. melliferum that were not previously well-resolved or visible using wide-range pH 3-10 IPG gel strips. Twenty-seven of these were electrotransferred to polyvinylidene difluoride (PVDF) membrane and analysed by N-terminal protein microsequencing. Protein spots with an initial peak yield of as little as 100 femtomoles (fm) were sequenced to 5-10 amino acid residues, demonstrating the importance of improved sample handling procedures and analytical technologies. Many essential metabolic enzymes were shown to have basic pI, including: glyceraldehyde-3-phosphate dehydrogenase, pyruvate kinase, carbamate kinase and lactate dehydrogenase. A very basic protein (pI approximately 11.0) was identified as uridylate kinase, an enzyme indirectly associated with pyrimidine biosynthesis and thought be absent in some members of the bacterial class Mollicutes. The advent of novel basic (pH 6-11) IPGs has allowed the visualisation of a significantly greater percentage of the 'functional proteome', that portion of the total protein complement of a genome actively translated within a specific time frame, on 2-D electrophoresis gels. This will aid in the characterisation of translated gene products in conjunction with genome sequencing initiatives.
Collapse
Affiliation(s)
- S J Cordwell
- Centre for Proteome Research and Gene-Product Mapping, National Innovation Centre, Eveleigh, Australia
| | | | | | | | | |
Collapse
|
31
|
Cordwell SJ, Humphery-Smith I. Evaluation of algorithms used for cross-species proteome characterisation. Electrophoresis 1997; 18:1410-7. [PMID: 9298655 DOI: 10.1002/elps.1150180816] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The ability to effectively search databases for the identification of protein spots from two-dimensional electrophoresis gels has become an essential step in the study of microbial proteomes. A variety of analytical techniques are currently being employed during protein characterisation. A number of algorithms used to search databases, accessible via the World Wide Web, depend upon information concerning N- and C-terminal microsequence, amino acid composition, and peptide-mass fingerprinting. The effectiveness of nine such algorithms, as well as COMBINED (software developed in this laboratory for identifying proteins across species boundaries) was examined. Fifty-four ribosomal proteins from the Mycoplasma genitalium genome, and 72 amino acyl tRNA synthetases from the Haemophilus influenzae, M. genitalium and Methanococcus jannaschii genomes were chosen for study. These proteins were selected because they represent a wide range of sequence identities across species boundaries (22.7-100% identity), as detected by standard sequence alignment tools. Such sequence variation allowed for a statistical comparison of algorithm success measured against published sequence identity. The ability of analytical techniques used in protein characterisation and associated database query programs to detect identity at the functional group level was examined for proteins with low levels of homology at the gene/protein sequence level. The significance of these theoretical data manipulations provided the means to predict the utility of data acquired experimentally for non-sequence-dependent software in proteome analysis. The data obtained also predicted that 'sequence tagging' of peptide fingerprints would need to be accompanied by at least 11-20 residues of amino acid sequence for it to be widely used for protein characterisation across species boundaries.
Collapse
Affiliation(s)
- S J Cordwell
- Centre for Proteome Research and Gene-Product Mapping, National Innovation Centre, Eveleigh, Australia
| | | |
Collapse
|
32
|
Fountoulakis M, Langen H, Evers S, Gray C, Takács B. Two-dimensional map of Haemophilus influenzae following protein enrichment by heparin chromatography. Electrophoresis 1997; 18:1193-202. [PMID: 9237578 DOI: 10.1002/elps.1150180728] [Citation(s) in RCA: 63] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Two-dimensional gel electrophoresis separates several hundred protein molecules in one single experiment and is efficiently used to study the products expressed by different genomes. Low-copy-number gene products are invisible on a stained two-dimensional map and must be enriched such that sufficient amounts are present for visualization and identification. We investigated the enrichment of proteins of the bacterium Haemophilus influenzae by chromatography on immobilized heparin which has affinity for growth and protein biosynthesis factors. Total soluble proteins of the microorganism were fractionated on Heparin-Actigel which resulted in enrichment of approximately 160 proteins. The eluates, representing about 40% of the applied proteins, were analyzed by two-dimensional gel electrophoresis and the protein spots were characterized by amino acid composition analysis and matrix-assisted laser desorption ionization mass spectrometry. The proteins enriched by chromatography on the heparin gel were not exclusively low-copy-number gene products and they did not exclusively belong to one single class of proteins. The proteins that bound to the heparin gel are indicated in a two-dimensional protein map which includes more than 110 newly identified proteins.
Collapse
Affiliation(s)
- M Fountoulakis
- Gene Technologies, F. Hoffmann-La Roche Ltd., Pharmaceutical Research, Basel, Switzerland.
| | | | | | | | | |
Collapse
|
33
|
Boucherie H, Sagliocco F, Joubert R, Maillet I, Labarre J, Perrot M. Two-dimensional gel protein database of Saccharomyces cerevisiae. Electrophoresis 1996; 17:1683-99. [PMID: 8982601 DOI: 10.1002/elps.1150171106] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
With the systematic sequencing of the yeast genome, yeast biology has entered a new era where novel challenges have to be faced. One challenge is the identification of the function of the several hundred novel genes discovered by genome sequencing. Another is to understand how all yeast genes act in concert to ensure and maintain cell organization. Two-dimensional (2-D) gel electrophoresis is the technique of choice to take up these challenges because it provides the opportunity of obtaining an overall view of genome expression. In prospect of these studies we have undertaken the construction of a yeast 2-D gel protein database that contains information on polypeptides of the yeast protein map. In this paper we report the information presently contained in this database. The reported information includes the identification of 250 protein spots and the characterization of polypeptides corresponding to N-terminal acetylated proteins, mitochondrial proteins, glucose-repressed proteins, heat shock induced proteins and proteins encoded by intron-containing genes. In all, 600 spots are annotated. These data can be accessed on the Yeast Protein Map server through the World Wide Web network.
Collapse
Affiliation(s)
- H Boucherie
- Institut de Biochimie et Génétique Cellularies, UPR CNRS 9026, Bordeaux, France.
| | | | | | | | | | | |
Collapse
|