1
|
Eriksen R, White MC, Dawed AY, Perez IG, Posma JM, Haid M, Sharma S, Prehn C, Thomas LE, Koivula RW, Bizzotto R, Mari A, Giordano GN, Pavo I, Schwenk JM, De Masi F, Tsirigos KD, Brunak S, Viñuela A, Mahajan A, McDonald TJ, Kokkola T, Rutters F, Beulens J, Muilwijk M, Blom M, Elders P, Hansen TH, Fernandez-Tajes J, Jones A, Jennison C, Walker M, McCarthy MI, Pedersen O, Ruetten H, Forgie I, Holst JJ, Thomsen HS, Ridderstråle M, Bell JD, Adamski J, Franks PW, Hansen T, Holmes E, Frost G, Pearson ER. The association of cardiometabolic, diet and lifestyle parameters with plasma glucagon-like peptide-1: An IMI DIRECT study. J Clin Endocrinol Metab 2024:dgae119. [PMID: 38686701 DOI: 10.1210/clinem/dgae119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 11/20/2023] [Accepted: 02/27/2024] [Indexed: 05/02/2024]
Abstract
CONTEXT The role of glucagon-like peptide-1(GLP-1) in Type 2 diabetes (T2D) and obesity is not fully understood. OBJECTIVE We investigate the association of cardiometabolic, diet and lifestyle parameters on fasting and postprandial GLP-1 in people at risk of, or living with, T2D. METHOD We analysed cross-sectional data from the two Innovative Medicines Initiative (IMI) Diabetes Research on Patient Stratification (DIRECT) cohorts, cohort 1(n=2127) individuals at risk of diabetes; cohort 2 (n=789) individuals with new-onset of T2D. RESULTS Our multiple regression analysis reveals that fasting total GLP-1 is associated with an insulin resistant phenotype and observe a strong independent relationship with male sex, increased adiposity and liver fat particularly in the prediabetes population. In contrast, we showed that incremental GLP-1 decreases with worsening glycaemia, higher adiposity, liver fat, male sex and reduced insulin sensitivity in the prediabetes cohort. Higher fasting total GLP-1 was associated with a low intake of wholegrain, fruit and vegetables inpeople with prediabetes, and with a high intake of red meat and alcohol in people with diabetes. CONCLUSION These studies provide novel insights into the association between fasting and incremental GLP-1, metabolic traits of diabetes and obesity, and dietary intake and raise intriguing questions regarding the relevance of fasting GLP-1 in the pathophysiology T2D.
Collapse
Affiliation(s)
- Rebeca Eriksen
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, UK
| | - Margaret C White
- Population Health & Genomics, School of Medicine, University of Dundee, Dundee, UK
| | - Adem Y Dawed
- Population Health & Genomics, School of Medicine, University of Dundee, Dundee, UK
| | - Isabel Garcia Perez
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, UK
| | - Joram M Posma
- Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Imperial College, London, UK
- Health Data Research UK, London, UK
| | - Mark Haid
- Research Unit Molecular Endocrinology And Metabolism, Helmholtz Zentrum Muenchen, German Research Center for Environemental Health (GmbH), Neuherberg, Germany
| | - Sapna Sharma
- German Center for Diabetes Research (DZD), Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
- Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Bavaria, Germany
| | - Cornelia Prehn
- Research Unit Molecular Endocrinology And Metabolism, Helmholtz Zentrum Muenchen, German Research Center for Environemental Health (GmbH), Neuherberg, Germany
| | - Louise E Thomas
- Research Centre for Optimal Health, School of Life Sciences, University of Westminster, London, UK
| | - Robert W Koivula
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Radcliffe Department of Medicine, Oxford, UK
| | - Roberto Bizzotto
- Institute of Neuroscience - National Research Council, Padova, Italy
| | - Andrea Mari
- Institute of Neuroscience - National Research Council, Padova, Italy
| | - Giuseppe N Giordano
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Imre Pavo
- Eli Lilly Regional Operations GmbH, Vienna, Austria
| | - Jochen M Schwenk
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Federico De Masi
- Department of Health Technology, Technical University of Denmark, Kgs Lyngby and The Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Konstantinos D Tsirigos
- Department of Health Technology, Technical University of Denmark, Kgs Lyngby and The Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Søren Brunak
- Department of Health Technology, Technical University of Denmark, Kgs Lyngby and The Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, DK-2200 Copenhagen, Denmark
| | - Ana Viñuela
- Biosciences Institute, Newcastle University. Newcastle upon Tyne. United Kingdom
| | - Anubha Mahajan
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Timothy J McDonald
- Medical School, Exeter, UK NIHR Exeter Clinical Research Facility, University of Exeter
| | - Tarja Kokkola
- Department of Medicine, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
| | - Femke Rutters
- Department of Epidemiology and data Science, Amsterdam Public Health Institute, Amsterdam UMC, location VUMC, Amsterdam, The Netherlands
| | - Joline Beulens
- Department of Epidemiology and data Science, Amsterdam Public Health Institute, Amsterdam UMC, location VUMC, Amsterdam, The Netherlands
| | - Mirthe Muilwijk
- Department of Epidemiology and data Science, Amsterdam Public Health Institute, Amsterdam UMC, location VUMC, Amsterdam, The Netherlands
| | - Marieke Blom
- Department of Epidemiology and data Science, Amsterdam Public Health Institute, Amsterdam UMC, location VUMC, Amsterdam, The Netherlands
| | - Petra Elders
- Department of Epidemiology and data Science, Amsterdam Public Health Institute, Amsterdam UMC, location VUMC, Amsterdam, The Netherlands
| | - Tue H Hansen
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | | | - Angus Jones
- Medical School, Exeter, UK NIHR Exeter Clinical Research Facility, University of Exeter
| | - Chris Jennison
- Department of Mathematical Sciences, University of Bath, Bath, UK
| | - Mark Walker
- Institute of Cellular Medicine (Diabetes), Newcastle University, Newcastle upon Tyne, UK
| | - Mark I McCarthy
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Radcliffe Department of Medicine, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- NIHR Oxford Biomedical Research Centre, Churchill Hospital, Oxford, UK
| | - Oluf Pedersen
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Hartmut Ruetten
- Sanofi-Aventis Deutschland GmbH, R&D, Frankfurt am Main, Germany
| | - Ian Forgie
- Population Health & Genomics, School of Medicine, University of Dundee, Dundee, UK
| | - Jens J Holst
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
- Department of Biomedical Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Henrik S Thomsen
- Faculty of Medical and Health Sciences, University of Copenhagen, Denmark
| | | | - Jimmy D Bell
- Research Centre for Optimal Health, School of Life Sciences, University of Westminster, London, UK
| | - Jerzy Adamski
- Research Unit Molecular Endocrinology And Metabolism, Helmholtz Zentrum Muenchen, German Research Center for Environemental Health (GmbH), Neuherberg, Germany
- Lehrstuhl für Experimentelle Genetik, Technische Universität München, 85350 Freising-Weihenstephan, Germany
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117597, Singapore
| | - Paul W Franks
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
- Department of Nutrition, Harvard School of Public Health, Boston, Massachusetts, United States of America
| | - Torben Hansen
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Elaine Holmes
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, UK
| | - Gary Frost
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, UK
| | - Ewan R Pearson
- Population Health & Genomics, School of Medicine, University of Dundee, Dundee, UK
| |
Collapse
|
2
|
Brown AA, Fernandez-Tajes JJ, Hong MG, Brorsson CA, Koivula RW, Davtian D, Dupuis T, Sartori A, Michalettou TD, Forgie IM, Adam J, Allin KH, Caiazzo R, Cederberg H, De Masi F, Elders PJM, Giordano GN, Haid M, Hansen T, Hansen TH, Hattersley AT, Heggie AJ, Howald C, Jones AG, Kokkola T, Laakso M, Mahajan A, Mari A, McDonald TJ, McEvoy D, Mourby M, Musholt PB, Nilsson B, Pattou F, Penet D, Raverdy V, Ridderstråle M, Romano L, Rutters F, Sharma S, Teare H, 't Hart L, Tsirigos KD, Vangipurapu J, Vestergaard H, Brunak S, Franks PW, Frost G, Grallert H, Jablonka B, McCarthy MI, Pavo I, Pedersen O, Ruetten H, Walker M, Adamski J, Schwenk JM, Pearson ER, Dermitzakis ET, Viñuela A. Genetic analysis of blood molecular phenotypes reveals common properties in the regulatory networks affecting complex traits. Nat Commun 2023; 14:5062. [PMID: 37604891 PMCID: PMC10442420 DOI: 10.1038/s41467-023-40569-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 08/02/2023] [Indexed: 08/23/2023] Open
Abstract
We evaluate the shared genetic regulation of mRNA molecules, proteins and metabolites derived from whole blood from 3029 human donors. We find abundant allelic heterogeneity, where multiple variants regulate a particular molecular phenotype, and pleiotropy, where a single variant associates with multiple molecular phenotypes over multiple genomic regions. The highest proportion of share genetic regulation is detected between gene expression and proteins (66.6%), with a further median shared genetic associations across 49 different tissues of 78.3% and 62.4% between plasma proteins and gene expression. We represent the genetic and molecular associations in networks including 2828 known GWAS variants, showing that GWAS variants are more often connected to gene expression in trans than other molecular phenotypes in the network. Our work provides a roadmap to understanding molecular networks and deriving the underlying mechanism of action of GWAS variants using different molecular phenotypes in an accessible tissue.
Collapse
Affiliation(s)
- Andrew A Brown
- Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, DD1 9SY, United Kingdom
| | - Juan J Fernandez-Tajes
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Mun-Gwan Hong
- Science for Life Laboratory, School of Biotechnology, KTH - Royal Institute of Technology, Solna, SE-171 21, Sweden
| | - Caroline A Brorsson
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DK-2100, Denmark
| | - Robert W Koivula
- Oxford Centre for Diabetes Endocrinology and Metabolism, University of Oxford, Oxford, OX3 7LJ, United Kingdom
| | - David Davtian
- Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, DD1 9SY, United Kingdom
| | - Théo Dupuis
- Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, DD1 9SY, United Kingdom
| | - Ambra Sartori
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, 1211, Switzerland
- Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, 1211, Switzerland
- Swiss Institute of Bioinformatics, Geneva, 1211, Switzerland
| | - Theodora-Dafni Michalettou
- Biosciences Institute, Faculty of Medical Sciences, University of Newcastle, Newcastle upon Tyne, NE1 4EP, United Kingdom
| | - Ian M Forgie
- Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, DD1 9SY, United Kingdom
| | - Jonathan Adam
- German Center for Diabetes Research (DZD), Neuherberg, 85764, Germany
- Research Unit of Molecular Epidemiology, Institute of Epidemiology, German Research Center for Environmental Health, Helmholtz Zentrum München, Neuherberg, 85764, Germany
| | - Kristine H Allin
- The Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, DK-2100, Denmark
| | - Robert Caiazzo
- University of Lille, Inserm, Lille Pasteur Institute, Lille, France
| | - Henna Cederberg
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Federico De Masi
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Petra J M Elders
- Department of General Practice, Amsterdam UMC- location Vumc, Amsterdam Public Health research institute, Amsterdam, The Netherlands
| | - Giuseppe N Giordano
- Department of Clinical Science, Genetic and Molecular Epidemiology, Lund University Diabetes Centre, Malmö, Sweden
| | - Mark Haid
- Metabolomics and Proteomics Core, German Research Center for Environmental Health, Helmholtz Zentrum München, Neuherberg, 85764, Germany
| | - Torben Hansen
- The Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, DK-2100, Denmark
| | - Tue H Hansen
- The Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, DK-2100, Denmark
| | - Andrew T Hattersley
- Department of Clinical and Biomedical Sciences, University of Exeter College of Medicine & Health, Exeter, EX25DW, United Kingdom
| | - Alison J Heggie
- Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Cédric Howald
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, 1211, Switzerland
- Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, 1211, Switzerland
- Swiss Institute of Bioinformatics, Geneva, 1211, Switzerland
| | - Angus G Jones
- Department of Clinical and Biomedical Sciences, University of Exeter College of Medicine & Health, Exeter, EX25DW, United Kingdom
| | - Tarja Kokkola
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Markku Laakso
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Anubha Mahajan
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Andrea Mari
- Institute of Neuroscience, National Research Council, Padova, 35127, Italy
| | - Timothy J McDonald
- Blood Sciences, Royal Devon and Exeter NHS Foundation Trust, Exeter, EX2 5DW, United Kingdom
| | - Donna McEvoy
- Diabetes Research Network, Royal Victoria Infirmary, Newcastle upon Tyne, United Kingdom
| | - Miranda Mourby
- Nuffield Department of Population Health, Centre for Health, Law and Emerging Technologies (HeLEX), University of Oxford, Oxford, OX2 7DD, United Kingdom
| | - Petra B Musholt
- Global Development, Sanofi-Aventis Deutschland GmbH, Hoechst Industrial Park, Frankfurt am Main, 65926, Germany
| | - Birgitte Nilsson
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Francois Pattou
- University of Lille, Inserm, Lille Pasteur Institute, Lille, France
| | - Deborah Penet
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, 1211, Switzerland
- Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, 1211, Switzerland
- Swiss Institute of Bioinformatics, Geneva, 1211, Switzerland
| | - Violeta Raverdy
- University of Lille, Inserm, Lille Pasteur Institute, Lille, France
| | | | - Luciana Romano
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, 1211, Switzerland
- Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, 1211, Switzerland
- Swiss Institute of Bioinformatics, Geneva, 1211, Switzerland
| | - Femke Rutters
- Epidemiology and Data Science, VUMC, Amsterdam, The Netherlands
| | - Sapna Sharma
- Research Unit of Molecular Epidemiology, Institute of Epidemiology, German Research Center for Environmental Health, Helmholtz Zentrum München, Neuherberg, 85764, Germany
- Food Chemistry and Molecular and Sensory Science, Technical University of Munich, München, Germany
| | - Harriet Teare
- Centre for Health Law and Emerging Technologies, Department of Population Health, University of Oxford, Old Road Campus, Oxford, OX3 7DQ, United Kingdom
| | - Leen 't Hart
- Epidemiology and Data Science, VUMC, Amsterdam, The Netherlands
- Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, The Netherlands
- Department of Biomedical Data Sciences, Molecular Epidemiology section, Leiden University Medical Center, Leiden, The Netherlands
| | | | - Jagadish Vangipurapu
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Henrik Vestergaard
- The Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, DK-2100, Denmark
- Steno Diabetes Center Copenhagen, Copenhagen, Denmark
| | - Søren Brunak
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DK-2100, Denmark
| | - Paul W Franks
- Department of Clinical Science, Genetic and Molecular Epidemiology, Lund University Diabetes Centre, Malmö, Sweden
| | - Gary Frost
- Nutrition and Dietetics Research Group, Imperial College London, London, SW7 2AZ, United Kingdom
| | - Harald Grallert
- German Center for Diabetes Research (DZD), Neuherberg, 85764, Germany
- Research Unit of Molecular Epidemiology, Institute of Epidemiology, German Research Center for Environmental Health, Helmholtz Zentrum München, Neuherberg, 85764, Germany
| | - Bernd Jablonka
- Sanofi Partnering, Sanofi-Aventis Deutschland GmbH, Frankfurt am Main, 65926, Germany
| | - Mark I McCarthy
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
- GENENTECH, 1 DNA Way, San Francisco, CA, 94080, USA
| | - Imre Pavo
- Eli Lilly Regional Operations Ges.m.b.H, Vienna, 1030, Austria
| | - Oluf Pedersen
- Center for Clinical Metabolic Research, Herlev and Gentofte University Hospital, Copenhagen, Denmark
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DK-2100, Denmark
| | - Hartmut Ruetten
- Sanofi Partnering, Sanofi-Aventis Deutschland GmbH, Frankfurt am Main, 65926, Germany
| | - Mark Walker
- Translational and Clinical Research Institute, Faculty of Medical Sciences, University of Newcastle, Newcastle upon Tyne, United Kingdom
| | - Jerzy Adamski
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, 117597, Singapore
- Institute of Experimental Genetics, German Research Center for Environmental Health, Helmholtz Zentrum München, Neuherberg, 85764, Germany
- Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Jochen M Schwenk
- Science for Life Laboratory, School of Biotechnology, KTH - Royal Institute of Technology, Solna, SE-171 21, Sweden
| | - Ewan R Pearson
- Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, DD1 9SY, United Kingdom
| | - Emmanouil T Dermitzakis
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, 1211, Switzerland.
- Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, Geneva, 1211, Switzerland.
- Swiss Institute of Bioinformatics, Geneva, 1211, Switzerland.
| | - Ana Viñuela
- Biosciences Institute, Faculty of Medical Sciences, University of Newcastle, Newcastle upon Tyne, NE1 4EP, United Kingdom.
| |
Collapse
|
3
|
Teufel F, Almagro Armenteros JJ, Johansen AR, Gíslason MH, Pihl SI, Tsirigos KD, Winther O, Brunak S, von Heijne G, Nielsen H. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat Biotechnol 2022. [PMID: 34980915 DOI: 10.1038/s41587-41021-01156-41583] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Signal peptides (SPs) are short amino acid sequences that control protein secretion and translocation in all living organisms. SPs can be predicted from sequence data, but existing algorithms are unable to detect all known types of SPs. We introduce SignalP 6.0, a machine learning model that detects all five SP types and is applicable to metagenomic data.
Collapse
Affiliation(s)
- Felix Teufel
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.,Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
| | - José Juan Almagro Armenteros
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | | | - Magnús Halldór Gíslason
- Center for Genomic Medicine, Rigshospitalet (Copenhagen University Hospital), Copenhagen, Denmark
| | - Silas Irby Pihl
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | | | - Ole Winther
- Center for Genomic Medicine, Rigshospitalet (Copenhagen University Hospital), Copenhagen, Denmark.,Department of Biology, Bioinformatics Centre, University of Copenhagen, Copenhagen, Denmark.,Section for Cognitive Systems, Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Søren Brunak
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Gunnar von Heijne
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden.,Science for Life Laboratory, Stockholm University, Solna, Sweden
| | - Henrik Nielsen
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.
| |
Collapse
|
4
|
Teufel F, Almagro Armenteros JJ, Johansen AR, Gíslason MH, Pihl SI, Tsirigos KD, Winther O, Brunak S, von Heijne G, Nielsen H. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat Biotechnol 2022; 40:1023-1025. [PMID: 34980915 PMCID: PMC9287161 DOI: 10.1038/s41587-021-01156-3] [Citation(s) in RCA: 663] [Impact Index Per Article: 331.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 11/08/2021] [Indexed: 11/09/2022]
Abstract
Signal peptides (SPs) are short amino acid sequences that control protein secretion and translocation in all living organisms. SPs can be predicted from sequence data, but existing algorithms are unable to detect all known types of SPs. We introduce SignalP 6.0, a machine learning model that detects all five SP types and is applicable to metagenomic data. A new version of SignalP predicts all types of signal peptides.
Collapse
Affiliation(s)
- Felix Teufel
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.,Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
| | - José Juan Almagro Armenteros
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | | | - Magnús Halldór Gíslason
- Center for Genomic Medicine, Rigshospitalet (Copenhagen University Hospital), Copenhagen, Denmark
| | - Silas Irby Pihl
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | | | - Ole Winther
- Center for Genomic Medicine, Rigshospitalet (Copenhagen University Hospital), Copenhagen, Denmark.,Department of Biology, Bioinformatics Centre, University of Copenhagen, Copenhagen, Denmark.,Section for Cognitive Systems, Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Søren Brunak
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Gunnar von Heijne
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden.,Science for Life Laboratory, Stockholm University, Solna, Sweden
| | - Henrik Nielsen
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark.
| |
Collapse
|
5
|
Tamposis IA, Sarantopoulou D, Theodoropoulou MC, Stasi EA, Kontou PI, Tsirigos KD, Bagos PG. Hidden neural networks for transmembrane protein topology prediction. Comput Struct Biotechnol J 2021; 19:6090-6097. [PMID: 34849210 PMCID: PMC8606341 DOI: 10.1016/j.csbj.2021.11.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 11/05/2021] [Accepted: 11/06/2021] [Indexed: 11/21/2022] Open
Abstract
Hidden Markov Models (HMMs) are amongst the most successful methods for predicting protein features in biological sequence analysis. However, there are biological problems where the Markovian assumption is not sufficient since the sequence context can provide useful information for prediction purposes. Several extensions of HMMs have appeared in the literature in order to overcome their limitations. We apply here a hybrid method that combines HMMs and Neural Networks (NNs), termed Hidden Neural Networks (HNNs), for biological sequence analysis in a straightforward manner. In this framework, the traditional HMM probability parameters are replaced by NN outputs. As a case study, we focus on the topology prediction of for alpha-helical and beta-barrel membrane proteins. The HNNs show performance gains compared to standard HMMs and the respective predictors outperform the top-scoring methods in the field. The implementation of HNNs can be found in the package JUCHMME, downloadable from http://www.compgen.org/tools/juchmme, https://github.com/pbagos/juchmme. The updated PRED-TMBB2 and HMM-TM prediction servers can be accessed at www.compgen.org.
Collapse
Key Words
- CHMM, Class Hidden Markov Models
- CML, Conditional Maximum Likelihood
- EM, Expectation-Maximization
- HMM, Hidden Markov Models
- HNN, Hidden Neural Networks
- Hidden Markov Models
- Hidden Neural Networks
- JUCHMME, Java Utility for Class Hidden Markov Models and Extensions
- MCC, Matthews Correlation Coefficient
- ML, Maximum Likelihood
- MSA, Multiple Sequence Alignment
- Membrane proteins
- NN, Neural Networks
- Neural Networks
- Protein structure prediction
- SOV, segment overlap
- Sequence analysis
Collapse
Affiliation(s)
- Ioannis A. Tamposis
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| | - Dimitra Sarantopoulou
- Institute for Translational Medicine and Therapeutics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Present address: National Institute on Aging, National Institutes of Health, Baltimore, Maryland, USA
| | | | - Evangelia A. Stasi
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| | - Panagiota I. Kontou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| | | | - Pantelis G. Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| |
Collapse
|
6
|
Tura A, Grespan E, Göbl CS, Koivula RW, Franks PW, Pearson ER, Walker M, Forgie IM, Giordano GN, Pavo I, Ruetten H, Dermitzakis ET, McCarthy MI, Pedersen O, Schwenk JM, Adamski J, De Masi F, Tsirigos KD, Brunak S, Viñuela A, Mahajan A, McDonald TJ, Kokkola T, Vangipurapu J, Cederberg H, Laakso M, Rutters F, Elders PJM, Koopman ADM, Beulens JW, Ridderstråle M, Hansen TH, Allin KH, Hansen T, Vestergaard H, Mari A. Profiles of Glucose Metabolism in Different Prediabetes Phenotypes, Classified by Fasting Glycemia, 2-Hour OGTT, Glycated Hemoglobin, and 1-Hour OGTT: An IMI DIRECT Study. Diabetes 2021; 70:2092-2106. [PMID: 34233929 DOI: 10.2337/db21-0227] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 06/24/2021] [Indexed: 11/13/2022]
Abstract
Differences in glucose metabolism among categories of prediabetes have not been systematically investigated. In this longitudinal study, participants (N = 2,111) underwent a 2-h 75-g oral glucose tolerance test (OGTT) at baseline and 48 months. HbA1c was also measured. We classified participants as having isolated prediabetes defect (impaired fasting glucose [IFG], impaired glucose tolerance [IGT], or HbA1c indicative of prediabetes [IA1c]), two defects (IFG+IGT, IFG+IA1c, or IGT+IA1c), or all defects (IFG+IGT+IA1c). β-Cell function (BCF) and insulin sensitivity were assessed from OGTT. At baseline, in pooling of participants with isolated defects, they showed impairment in both BCF and insulin sensitivity compared with healthy control subjects. Pooled groups with two or three defects showed progressive further deterioration. Among groups with isolated defect, those with IGT showed lower insulin sensitivity, insulin secretion at reference glucose (ISRr), and insulin secretion potentiation (P < 0.002). Conversely, those with IA1c showed higher insulin sensitivity and ISRr (P < 0.0001). Among groups with two defects, we similarly found differences in both BCF and insulin sensitivity. At 48 months, we found higher type 2 diabetes incidence for progressively increasing number of prediabetes defects (odds ratio >2, P < 0.008). In conclusion, the prediabetes groups showed differences in type/degree of glucometabolic impairment. Compared with the pooled group with isolated defects, those with double or triple defect showed progressive differences in diabetes incidence.
Collapse
Affiliation(s)
- Andrea Tura
- CNR Institute of Neuroscience, Padova, Italy
| | | | - Christian S Göbl
- Division of Obstetrics and Feto-Maternal Medicine, Department of Obstetrics and Gynecology, Medical University of Vienna, Vienna, Austria
| | - Robert W Koivula
- Oxford Centre for Diabetes Endocrinology and Metabolism, University of Oxford, Oxford, U.K
- Genetic and Molecular Epidemiology, Department of Clinical Science, Lund University, Skåne University Hospital Malmö, Malmö, Sweden
| | - Paul W Franks
- Genetic and Molecular Epidemiology, Department of Clinical Science, Lund University, Skåne University Hospital Malmö, Malmö, Sweden
| | - Ewan R Pearson
- Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, Scotland, U.K
| | - Mark Walker
- Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, U.K
| | - Ian M Forgie
- Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, Scotland, U.K
| | - Giuseppe N Giordano
- Genetic and Molecular Epidemiology, Department of Clinical Science, Lund University, Skåne University Hospital Malmö, Malmö, Sweden
| | - Imre Pavo
- Eli Lilly Regional Operations Ges.m.b.H., Vienna, Austria
| | - Hartmut Ruetten
- CardioMetabolism & Respiratory Medicine, Boehringer Ingelheim International GmbH, Ingelheim/Rhein, Germany
| | - Emmanouil T Dermitzakis
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Mark I McCarthy
- Oxford Centre for Diabetes Endocrinology and Metabolism, University of Oxford, Oxford, U.K
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, U.K
| | - Oluf Pedersen
- Section of Metabolic Genetics, Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Jochen M Schwenk
- Affinity Proteomics, Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, Solna, Sweden
| | - Jerzy Adamski
- Institute of Experimental Genetics, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Federico De Masi
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Konstantinos D Tsirigos
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Søren Brunak
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Ana Viñuela
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, U.K
| | - Anubha Mahajan
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, U.K
| | - Timothy J McDonald
- Blood Sciences, Royal Devon and Exeter NHS Foundation Trust, Exeter, U.K
| | - Tarja Kokkola
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Jagadish Vangipurapu
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Henna Cederberg
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Markku Laakso
- Internal Medicine, Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Femke Rutters
- Department of Epidemiology and Data Science, Amsterdam Medical Centre, location VUMC, Amsterdam, the Netherlands
| | - Petra J M Elders
- Department of Epidemiology and Data Science, Amsterdam Medical Centre, location VUMC, Amsterdam, the Netherlands
| | - Anitra D M Koopman
- Department of Epidemiology and Data Science, Amsterdam Medical Centre, location VUMC, Amsterdam, the Netherlands
| | - Joline W Beulens
- Department of Epidemiology and Data Science, Amsterdam Medical Centre, location VUMC, Amsterdam, the Netherlands
| | - Martin Ridderstråle
- Department of Clinical Sciences, Diabetes & Endocrinology Unit, Lund University, Skåne University Hospital Malmö, Malmö, Sweden
| | - Tue H Hansen
- Section of Metabolic Genetics, Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Kristine H Allin
- Section of Metabolic Genetics, Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Torben Hansen
- Section of Metabolic Genetics, Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Henrik Vestergaard
- Section of Metabolic Genetics, Novo Nordisk Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
- Department of Medicine, Bornholms Hospital, Rønne, Denmark
| | | | | |
Collapse
|
7
|
Roumia AF, Tsirigos KD, Theodoropoulou MC, Tamposis IA, Hamodrakas SJ, Bagos PG. OMPdb: A Global Hub of Beta-Barrel Outer Membrane Proteins. Front Bioinform 2021; 1:646581. [PMID: 36303794 PMCID: PMC9581022 DOI: 10.3389/fbinf.2021.646581] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2020] [Accepted: 03/18/2021] [Indexed: 11/14/2022] Open
Abstract
OMPdb (www.ompdb.org) was introduced as a database for β-barrel outer membrane proteins from Gram-negative bacteria in 2011 and then included 69,354 entries classified into 85 families. The database has been updated continuously using a collection of characteristic profile Hidden Markov Models able to discriminate between the different families of prokaryotic transmembrane β-barrels. The number of families has increased ultimately to a total of 129 families in the current, second major version of OMPdb. New additions have been made in parallel with efforts to update existing families and add novel families. Here, we present the upgrade of OMPdb, which from now on aims to become a global repository for all transmembrane β-barrel proteins, both eukaryotic and bacterial.
Collapse
Affiliation(s)
- Ahmed F. Roumia
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | | | | | - Ioannis A. Tamposis
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | - Stavros J. Hamodrakas
- Section of Cell Biology and Biophysics, Department of Biology, School of Sciences, National and Kapodistrian University of Athens, Athens, Greece
| | - Pantelis G. Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
- *Correspondence: Pantelis G. Bagos
| |
Collapse
|
8
|
Bizzotto R, Jennison C, Jones AG, Kurbasic A, Tura A, Kennedy G, Bell JD, Thomas EL, Frost G, Eriksen R, Koivula RW, Brage S, Kaye J, Hattersley AT, Heggie A, McEvoy D, 't Hart LM, Beulens JW, Elders P, Musholt PB, Ridderstråle M, Hansen TH, Allin KH, Hansen T, Vestergaard H, Lundgaard AT, Thomsen HS, De Masi F, Tsirigos KD, Brunak S, Viñuela A, Mahajan A, McDonald TJ, Kokkola T, Forgie IM, Giordano GN, Pavo I, Ruetten H, Dermitzakis E, McCarthy MI, Pedersen O, Schwenk JM, Adamski J, Franks PW, Walker M, Pearson ER, Mari A. Processes Underlying Glycemic Deterioration in Type 2 Diabetes: An IMI DIRECT Study. Diabetes Care 2021; 44:511-518. [PMID: 33323478 DOI: 10.2337/dc20-1567] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 10/31/2020] [Indexed: 02/03/2023]
Abstract
OBJECTIVE We investigated the processes underlying glycemic deterioration in type 2 diabetes (T2D). RESEARCH DESIGN AND METHODS A total of 732 recently diagnosed patients with T2D from the Innovative Medicines Initiative Diabetes Research on Patient Stratification (IMI DIRECT) study were extensively phenotyped over 3 years, including measures of insulin sensitivity (OGIS), β-cell glucose sensitivity (GS), and insulin clearance (CLIm) from mixed meal tests, liver enzymes, lipid profiles, and baseline regional fat from MRI. The associations between the longitudinal metabolic patterns and HbA1c deterioration, adjusted for changes in BMI and in diabetes medications, were assessed via stepwise multivariable linear and logistic regression. RESULTS Faster HbA1c progression was independently associated with faster deterioration of OGIS and GS and increasing CLIm; visceral or liver fat, HDL-cholesterol, and triglycerides had further independent, though weaker, roles (R 2 = 0.38). A subgroup of patients with a markedly higher progression rate (fast progressors) was clearly distinguishable considering these variables only (discrimination capacity from area under the receiver operating characteristic = 0.94). The proportion of fast progressors was reduced from 56% to 8-10% in subgroups in which only one trait among OGIS, GS, and CLIm was relatively stable (odds ratios 0.07-0.09). T2D polygenic risk score and baseline pancreatic fat, glucagon-like peptide 1, glucagon, diet, and physical activity did not show an independent role. CONCLUSIONS Deteriorating insulin sensitivity and β-cell function, increasing insulin clearance, high visceral or liver fat, and worsening of the lipid profile are the crucial factors mediating glycemic deterioration of patients with T2D in the initial phase of the disease. Stabilization of a single trait among insulin sensitivity, β-cell function, and insulin clearance may be relevant to prevent progression.
Collapse
Affiliation(s)
| | | | - Angus G Jones
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, Exeter, U.K.,Diabetes and Endocrinology, Royal Devon and Exeter NHS Foundation Trust, Exeter, U.K
| | - Azra Kurbasic
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre, Lund University, Skåne University Hospital, Malmö, Malmö, Sweden
| | - Andrea Tura
- CNR Institute of Neuroscience, Padova, Italy
| | - Gwen Kennedy
- Immunoassay Biomarker Core Laboratory, School of Medicine, Ninewells Hospital, Dundee, U.K
| | - Jimmy D Bell
- School of Life Sciences, Research Centre for Optimal Health, University of Westminster, London, U.K
| | - E Louise Thomas
- School of Life Sciences, Research Centre for Optimal Health, University of Westminster, London, U.K
| | - Gary Frost
- Section for Nutrition Research, Faculty of Medicine, Hammersmith Campus, Imperial College London, London, U.K
| | - Rebeca Eriksen
- Section for Nutrition Research, Faculty of Medicine, Hammersmith Campus, Imperial College London, London, U.K
| | - Robert W Koivula
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre, Lund University, Skåne University Hospital, Malmö, Malmö, Sweden.,Radcliffe Department of Medicine, Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, U.K
| | - Soren Brage
- MRC Epidemiology Unit, University of Cambridge, Cambridge, U.K
| | - Jane Kaye
- Faculty of Law, Centre for Health, Law and Emerging Technologies, University of Oxford, Oxford, U.K.,Melbourne Law School, Centre for Health, Law and Emerging Technologies, University of Melbourne, Carlton, Victoria, Australia
| | - Andrew T Hattersley
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, Exeter, U.K.,Diabetes and Endocrinology, Royal Devon and Exeter NHS Foundation Trust, Exeter, U.K
| | - Alison Heggie
- Institute of Cellular Medicine, Newcastle University, Newcastle upon Tyne, U.K
| | - Donna McEvoy
- Diabetes Research Network, Royal Victoria Infirmary, Newcastle upon Tyne, U.K
| | - Leen M 't Hart
- Department of Epidemiology and Data Science, Amsterdam UMC-Location VUmc, Amsterdam Public Health Research Institute, Amsterdam, the Netherlands.,Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands.,Department of Biomedical Data Sciences, Molecular Epidemiology Section, Leiden University Medical Center, Leiden, the Netherlands
| | - Joline W Beulens
- Department of Epidemiology and Data Science, Amsterdam UMC-Location VUmc, Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Petra Elders
- Department of General Practice, Amsterdam UMC-Location VUmc, Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Petra B Musholt
- R&D Global Development, Translational Medicine & Clinical Pharmacology, Sanofi Deutschland GmbH, Frankfurt, Germany
| | - Martin Ridderstråle
- Clinical Pharmacology and Translational Medicine, Novo Nordisk A/S, Søborg, Denmark
| | - Tue H Hansen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Kristine H Allin
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Torben Hansen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Henrik Vestergaard
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.,Bornholms Hospital, Rønne, Denmark
| | - Agnete T Lundgaard
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark.,Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Henrik S Thomsen
- Faculty of Medical and Health Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Federico De Masi
- Department of Health Technology, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Konstantinos D Tsirigos
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark.,Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Søren Brunak
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark.,Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Ana Viñuela
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland.,Faculty of Medical Sciences, Biosciences Institute, Newcastle University, Newcastle, U.K
| | - Anubha Mahajan
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, U.K
| | - Timothy J McDonald
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, Exeter, U.K.,Blood Sciences, Royal Devon and Exeter NHS Foundation Trust, Exeter, U.K
| | - Tarja Kokkola
- Department of Medicine, University of Eastern Finland, Kuopio, Finland
| | - Ian M Forgie
- Population Health and Genomics, School of Medicine, University of Dundee, Dundee, U.K
| | - Giuseppe N Giordano
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre, Lund University, Skåne University Hospital, Malmö, Malmö, Sweden
| | - Imre Pavo
- Eli Lilly Regional Operations GmbH, Vienna, Austria
| | - Hartmut Ruetten
- R&D Global Development, Translational Medicine & Clinical Pharmacology, Sanofi Deutschland GmbH, Frankfurt, Germany
| | - Emmanouil Dermitzakis
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Mark I McCarthy
- Radcliffe Department of Medicine, Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, U.K.,Wellcome Centre for Human Genetics, University of Oxford, Oxford, U.K.,Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, U.K
| | - Oluf Pedersen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Jochen M Schwenk
- Affinity Proteomics, Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, Solna, Sweden
| | - Jerzy Adamski
- Research Unit of Molecular Endocrinology and Metabolism, Helmholtz Zentrum München, Neuherberg, Germany.,Lehrstuhl für Experimentelle Genetik, Technische Universität München, Freising-Weihenstephan, Germany.,Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | - Paul W Franks
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre, Lund University, Skåne University Hospital, Malmö, Malmö, Sweden
| | - Mark Walker
- Faculty of Medical Sciences, Translational and Clinical Research Institute, Newcastle University, Newcastle, U.K
| | - Ewan R Pearson
- Population Health and Genomics, School of Medicine, University of Dundee, Dundee, U.K
| | | | | |
Collapse
|
9
|
Eriksen R, Perez IG, Posma JM, Haid M, Sharma S, Prehn C, Thomas LE, Koivula RW, Bizzotto R, Prehn C, Mari A, Giordano GN, Pavo I, Schwenk JM, De Masi F, Tsirigos KD, Brunak S, Viñuela A, Mahajan A, McDonald TJ, Kokkola T, Rutter F, Teare H, Hansen TH, Fernandez J, Jones A, Jennison C, Walker M, McCarthy MI, Pedersen O, Ruetten H, Forgie I, Bell JD, Pearson ER, Franks PW, Adamski J, Holmes E, Frost G. Dietary metabolite profiling brings new insight into the relationship between nutrition and metabolic risk: An IMI DIRECT study. EBioMedicine 2020; 58:102932. [PMID: 32763829 PMCID: PMC7406914 DOI: 10.1016/j.ebiom.2020.102932] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Revised: 06/18/2020] [Accepted: 07/15/2020] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Dietary advice remains the cornerstone of prevention and management of type 2 diabetes (T2D). However, understanding the efficacy of dietary interventions is confounded by the challenges inherent in assessing free living diet. Here we profiled dietary metabolites to investigate glycaemic deterioration and cardiometabolic risk in people at risk of or living with T2D. METHODS We analysed data from plasma collected at baseline and 18-month follow-up in individuals from the Innovative Medicines Initiative (IMI) Diabetes Research on Patient Stratification (DIRECT) cohort 1 n = 403 individuals with normal or impaired glucose regulation (prediabetic) and cohort 2 n = 458 individuals with new onset of T2D. A dietary metabolite profile model (Tpred) was constructed using multivariable regression of 113 plasma metabolites obtained from targeted metabolomics assays. The continuous Tpred score was used to explore the relationships between diet, glycaemic deterioration and cardio-metabolic risk via multiple linear regression models. FINDINGS A higher Tpred score was associated with healthier diets high in wholegrain (β=3.36 g, 95% CI 0.31, 6.40 and β=2.82 g, 95% CI 0.06, 5.57) and lower energy intake (β=-75.53 kcal, 95% CI -144.71, -2.35 and β=-122.51 kcal, 95% CI -186.56, -38.46), and saturated fat (β=-0.92 g, 95% CI -1.56, -0.28 and β=-0.98 g, 95% CI -1.53, -0.42 g), respectively for cohort 1 and 2. In both cohorts a higher Tpred score was also associated with lower total body adiposity and favourable lipid profiles HDL-cholesterol (β=0.07 mmol/L, 95% CI 0.03, 0.1), (β=0.08 mmol/L, 95% CI 0.04, 0.1), and triglycerides (β=-0.1 mmol/L, 95% CI -0.2, -0.03), (β=-0.2 mmol/L, 95% CI -0.3, -0.09), respectively for cohort 1 and 2. In cohort 2, the Tpred score was negatively associated with liver fat (β=-0.74%, 95% CI -0.67, -0.81), and lower fasting concentrations of HbA1c (β=-0.9 mmol/mol, 95% CI -1.5, -0.1), glucose (β=-0.2 mmol/L, 95% CI -0.4, -0.05) and insulin (β=-11.0 pmol/mol, 95% CI -19.5, -2.6). Longitudinal analysis showed at 18-month follow up a higher Tpred score was also associated lower total body adiposity in both cohorts and lower fasting glucose (β=-0.2 mmol/L, 95% CI -0.3, -0.01) and insulin (β=-9.2 pmol/mol, 95% CI -17.9, -0.4) concentrations in cohort 2. INTERPRETATION Plasma dietary metabolite profiling provides objective measures of diet intake, showing a relationship to glycaemic deterioration and cardiometabolic health. FUNDING This work was supported by the Innovative Medicines Initiative Joint Undertaking under grant agreement no. 115,317 (DIRECT), resources of which are composed of financial contribution from the European Union's Seventh Framework Programme (FP7/2007-2013) and EFPIA companies.
Collapse
Affiliation(s)
- Rebeca Eriksen
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, United Kingdom.
| | - Isabel Garcia Perez
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, United Kingdom
| | - Joram M Posma
- Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Imperial College, London, United Kingdom; Health Data Research UK, London, United Kingdom
| | - Mark Haid
- Research Unit Molecular Endocrinology And Metabolism, Helmholtz Zentrum Muenchen, German Research Center for Environemental Health (GmbH), Neuherberg, Germany
| | - Sapna Sharma
- German Center for Diabetes Research (DZD), Ingolstädter Landstr. 1, 85764 Neuherberg, Germany; Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Bavaria, Germany
| | - Cornelia Prehn
- Research Unit Molecular Endocrinology And Metabolism, Helmholtz Zentrum Muenchen, German Research Center for Environemental Health (GmbH), Neuherberg, Germany
| | - Louise E Thomas
- Research Centre for Optimal Health, School of Life Sciences, University of Westminster, London, United Kingdom
| | - Robert W Koivula
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden; Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Radcliffe Department of Medicine, Oxford, United Kingdom
| | - Roberto Bizzotto
- Institute of Neuroscience - National Research Council, Padova, Italy
| | - Cornelia Prehn
- Research Unit Molecular Endocrinology And Metabolism, Helmholtz Zentrum Muenchen, German Research Center for Environemental Health (GmbH), Neuherberg, Germany
| | - Andrea Mari
- Institute of Neuroscience - National Research Council, Padova, Italy
| | - Giuseppe N Giordano
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Imre Pavo
- Eli Lilly Regional Operations GmbH, Vienna, Austria
| | - Jochen M Schwenk
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden
| | - Federico De Masi
- Department of Health Technology, Technical University of Denmark, Kgs Lyngby and The Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Konstantinos D Tsirigos
- Department of Health Technology, Technical University of Denmark, Kgs Lyngby and The Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Søren Brunak
- Department of Health Technology, Technical University of Denmark, Kgs Lyngby and The Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark; Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, DK-2200 Copenhagen, Denmark
| | - Ana Viñuela
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Anubha Mahajan
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Timothy J McDonald
- Medical School, Exeter, UK NIHR Exeter Clinical Research Facility, University of Exeter
| | - Tarja Kokkola
- Department of Medicine, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
| | - Femke Rutter
- Department of Epidemiology and Biostatistics, Amsterdam Public Health Institute, Amsterdam UMC, locationVUMC, Amsterdam, Netherlands
| | - Harriet Teare
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Tue H Hansen
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Juan Fernandez
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Angus Jones
- Medical School, Exeter, UK NIHR Exeter Clinical Research Facility, University of Exeter
| | - Chris Jennison
- Department of Mathematical Sciences, University of Bath, Bath, United Kingdom
| | - Mark Walker
- Institute of Cellular Medicine (Diabetes), Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Mark I McCarthy
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Radcliffe Department of Medicine, Oxford, United Kingdom; Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Oluf Pedersen
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Science, University of Copenhagen, Copenhagen, Denmark
| | - Hartmut Ruetten
- Sanofi-Aventis Deutschland GmbH, R&D, Frankfurt am Main, Germany
| | - Ian Forgie
- Population Health & Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Jimmy D Bell
- Research Centre for Optimal Health, School of Life Sciences, University of Westminster, London, United Kingdom
| | - Ewan R Pearson
- Population Health & Genomics, School of Medicine, University of Dundee, Dundee, United Kingdom
| | - Paul W Franks
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Lund University, Skåne University Hospital, Malmö, Sweden
| | - Jerzy Adamski
- Research Unit Molecular Endocrinology And Metabolism, Helmholtz Zentrum Muenchen, German Research Center for Environemental Health (GmbH), Neuherberg, Germany; Lehrstuhl für Experimentelle Genetik, Technische Universität München, 85350 Freising-Weihenstephan, Germany; Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117597, Singapore
| | - Elaine Holmes
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, United Kingdom
| | - Gary Frost
- Section for Nutrition Research, Division of Digestive Diseases, Department of Metabolism, Digestion and Reproduction, Faculty of Medicine, Imperial College London, United Kingdom; NIHR Oxford Biomedical Research Centre, Churchill Hospital, Oxford, United Kingdom.
| |
Collapse
|
10
|
Tamposis IA, Tsirigos KD, Theodoropoulou MC, Kontou PI, Tsaousis GN, Sarantopoulou D, Litou ZI, Bagos PG. JUCHMME: a Java Utility for Class Hidden Markov Models and Extensions for biological sequence analysis. Bioinformatics 2020; 35:5309-5312. [PMID: 31250907 DOI: 10.1093/bioinformatics/btz533] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 05/04/2019] [Accepted: 06/25/2019] [Indexed: 11/14/2022] Open
Abstract
SUMMARY JUCHMME is an open-source software package designed to fit arbitrary custom Hidden Markov Models (HMMs) with a discrete alphabet of symbols. We incorporate a large collection of standard algorithms for HMMs as well as a number of extensions and evaluate the software on various biological problems. Importantly, the JUCHMME toolkit includes several additional features that allow for easy building and evaluation of custom HMMs, which could be a useful resource for the research community. AVAILABILITY AND IMPLEMENTATION http://www.compgen.org/tools/juchmme, https://github.com/pbagos/juchmme. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ioannis A Tamposis
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | - Konstantinos D Tsirigos
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs Lyngby, Denmark
| | | | - Panagiota I Kontou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | | | - Dimitra Sarantopoulou
- Institute of Translational Medicine and Therapeutics, University of Pennsylvania, Philadelphia, PA, USA
| | - Zoi I Litou
- Section of Cell Biology and Biophysics, Department of Biology, School of Science, National and Kapodistrian University of Athens, Athens, Greece
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| |
Collapse
|
11
|
Tamposis IA, Tsirigos KD, Theodoropoulou MC, Kontou PI, Bagos PG. Semi-supervised learning of Hidden Markov Models for biological sequence analysis. Bioinformatics 2020; 35:2208-2215. [PMID: 30445435 DOI: 10.1093/bioinformatics/bty910] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 10/29/2018] [Accepted: 11/09/2018] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Hidden Markov Models (HMMs) are probabilistic models widely used in applications in computational sequence analysis. HMMs are basically unsupervised models. However, in the most important applications, they are trained in a supervised manner. Training examples accompanied by labels corresponding to different classes are given as input and the set of parameters that maximize the joint probability of sequences and labels is estimated. A main problem with this approach is that, in the majority of the cases, labels are hard to find and thus the amount of training data is limited. On the other hand, there are plenty of unclassified (unlabeled) sequences deposited in the public databases that could potentially contribute to the training procedure. This approach is called semi-supervised learning and could be very helpful in many applications. RESULTS We propose here, a method for semi-supervised learning of HMMs that can incorporate labeled, unlabeled and partially labeled data in a straightforward manner. The algorithm is based on a variant of the Expectation-Maximization (EM) algorithm, where the missing labels of the unlabeled or partially labeled data are considered as the missing data. We apply the algorithm to several biological problems, namely, for the prediction of transmembrane protein topology for alpha-helical and beta-barrel membrane proteins and for the prediction of archaeal signal peptides. The results are very promising, since the algorithms presented here can significantly improve the prediction performance of even the top-scoring classifiers. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ioannis A Tamposis
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | - Konstantinos D Tsirigos
- Department of Bio and Health Informatics, Technical University of Denmark, Kgs Lyngby, Denmark
| | | | - Panagiota I Kontou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| |
Collapse
|
12
|
Abstract
Even though in the last few years several families of eukaryotic β-barrel outer membrane proteins have been discovered, their computational characterization and their annotation in public databases are far from complete. The PFAM database includes only very few characteristic profiles for these families, and in most cases, the profile hidden Markov models (pHMMs) have been trained using prokaryotic and eukaryotic proteins together. Here, we present for the first time a comprehensive computational analysis of eukaryotic transmembrane β-barrels. Twelve characteristic pHMMs were built, based on an extensive literature search, which can discriminate eukaryotic β-barrels from other classes of proteins (globular and bacterial β-barrel ones), as well as between mitochondrial and chloroplastic ones. We built eight novel profiles for the chloroplastic β-barrel families that are not present in the PFAM database and also updated the profile for the MDM10 family (PF12519) in the PFAM database and divide the porin family (PF01459) into two separate families, namely, VDAC and TOM40.
Collapse
Affiliation(s)
- Ahmed F Roumia
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| | | | - Konstantinos D Tsirigos
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200 Copenhagen, Denmark.,Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DK-2800 Kgs Lyngby, Denmark
| | - Henrik Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DK-2800 Kgs Lyngby, Denmark
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| |
Collapse
|
13
|
Abstract
Ever since the signal hypothesis was proposed in 1971, the exact nature of signal peptides has been a focus point of research. The prediction of signal peptides and protein subcellular location from amino acid sequences has been an important problem in bioinformatics since the dawn of this research field, involving many statistical and machine learning technologies. In this review, we provide a historical account of how position-weight matrices, artificial neural networks, hidden Markov models, support vector machines and, lately, deep learning techniques have been used in the attempts to predict where proteins go. Because the secretory pathway was the first one to be studied both experimentally and through bioinformatics, our main focus is on the historical development of prediction methods for signal peptides that target proteins for secretion; prediction methods to identify targeting signals for other cellular compartments are treated in less detail.
Collapse
Affiliation(s)
- Henrik Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark.
| | - Konstantinos D Tsirigos
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Søren Brunak
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, Kgs. Lyngby, Denmark
- Faculty of Health and Medical Sciences, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Gunnar von Heijne
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
- Science for Life Laboratory, Stockholm University, Solna, Sweden
| |
Collapse
|
14
|
Tamposis IA, Theodoropoulou MC, Tsirigos KD, Bagos PG. Extending hidden Markov models to allow conditioning on previous observations. J Bioinform Comput Biol 2018; 16:1850019. [DOI: 10.1142/s0219720018500191] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Hidden Markov Models (HMMs) are probabilistic models widely used in computational molecular biology. However, the Markovian assumption regarding transition probabilities which dictates that the observed symbol depends only on the current state may not be sufficient for some biological problems. In order to overcome the limitations of the first order HMM, a number of extensions have been proposed in the literature to incorporate past information in HMMs conditioning either on the hidden states, or on the observations, or both. Here, we implement a simple extension of the standard HMM in which the current observed symbol (amino acid residue) depends both on the current state and on a series of observed previous symbols. The major advantage of the method is the simplicity in the implementation, which is achieved by properly transforming the observation sequence, using an extended alphabet. Thus, it can utilize all the available algorithms for the training and decoding of HMMs. We investigated the use of several encoding schemes and performed tests in a number of important biological problems previously studied by our team (prediction of transmembrane proteins and prediction of signal peptides). The evaluation shows that, when enough data are available, the performance increased by 1.8%–8.2% and the existing prediction methods may improve using this approach. The methods, for which the improvement was significant (PRED-TMBB2, PRED-TAT and HMM-TM), are available as web-servers freely accessible to academic users at www.compgen.org/tools/ .
Collapse
Affiliation(s)
- Ioannis A. Tamposis
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou 2-4, 35100 Lamia, Greece
| | - Margarita C. Theodoropoulou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou 2-4, 35100 Lamia, Greece
| | - Konstantinos D. Tsirigos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou 2-4, 35100 Lamia, Greece
| | - Pantelis G. Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou 2-4, 35100 Lamia, Greece
| |
Collapse
|
15
|
Tsirigos KD, Govindarajan S, Bassot C, Västermark Å, Lamb J, Shu N, Elofsson A. Topology of membrane proteins-predictions, limitations and variations. Curr Opin Struct Biol 2017; 50:9-17. [PMID: 29100082 DOI: 10.1016/j.sbi.2017.10.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Revised: 09/29/2017] [Accepted: 10/03/2017] [Indexed: 10/18/2022]
Abstract
Transmembrane proteins perform a variety of important biological functions necessary for the survival and growth of the cells. Membrane proteins are built up by transmembrane segments that span the lipid bilayer. The segments can either be in the form of hydrophobic alpha-helices or beta-sheets which create a barrel. A fundamental aspect of the structure of transmembrane proteins is the membrane topology, that is, the number of transmembrane segments, their position in the protein sequence and their orientation in the membrane. Along these lines, many predictive algorithms for the prediction of the topology of alpha-helical and beta-barrel transmembrane proteins exist. The newest algorithms obtain an accuracy close to 80% both for alpha-helical and beta-barrel transmembrane proteins. However, lately it has been shown that the simplified picture presented when describing a protein family by its topology is limited. To demonstrate this, we highlight examples where the topology is either not conserved in a protein superfamily or where the structure cannot be described solely by the topology of a protein. The prediction of these non-standard features from sequence alone was not successful until the recent revolutionary progress in 3D-structure prediction of proteins.
Collapse
Affiliation(s)
| | - Sudha Govindarajan
- Science for Life Laboratory, Stockholm University, SE-171 21 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden
| | - Claudio Bassot
- Science for Life Laboratory, Stockholm University, SE-171 21 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden
| | - Åke Västermark
- Science for Life Laboratory, Stockholm University, SE-171 21 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden; NITECH, Showa-Ku, Nagoya 466-8555 Japan
| | - John Lamb
- Science for Life Laboratory, Stockholm University, SE-171 21 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden
| | - Nanjiang Shu
- Science for Life Laboratory, Stockholm University, SE-171 21 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden; National Bioinformatics Infrastructure, Sweden; Nordic e-Infrastructure Collaboration, Sweden
| | - Arne Elofsson
- Science for Life Laboratory, Stockholm University, SE-171 21 Solna, Sweden; Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden; Swedish e-Science Research Center (SeRC), Sweden.
| |
Collapse
|
16
|
Dimou NL, Tsirigos KD, Elofsson A, Bagos PG. GWAR: Robust Analysis and Meta-Analysis of Genome-Wide Association Studies. Bioinformatics 2017; 33:1521-1527. [DOI: 10.1093/bioinformatics/btx008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2016] [Accepted: 01/10/2017] [Indexed: 11/12/2022] Open
|
17
|
Piovesan D, Tabaro F, Mičetić I, Necci M, Quaglia F, Oldfield CJ, Aspromonte MC, Davey NE, Davidović R, Dosztányi Z, Elofsson A, Gasparini A, Hatos A, Kajava AV, Kalmar L, Leonardi E, Lazar T, Macedo-Ribeiro S, Macossay-Castillo M, Meszaros A, Minervini G, Murvai N, Pujols J, Roche DB, Salladini E, Schad E, Schramm A, Szabo B, Tantos A, Tonello F, Tsirigos KD, Veljković N, Ventura S, Vranken W, Warholm P, Uversky VN, Dunker AK, Longhi S, Tompa P, Tosatto SCE. Corrigendum: DisProt 7.0: a major update of the database of disordered proteins. Nucleic Acids Res 2016; 45:D1123-D1124. [PMID: 27965415 PMCID: PMC5210598 DOI: 10.1093/nar/gkw1279] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Francesco Tabaro
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy.,Institute of Biosciences and Medical Technology, University of Tampere, Finland
| | - Ivan Mičetić
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Marco Necci
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Federica Quaglia
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Christopher J Oldfield
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, 46202 Indianapolis, IN, USA
| | | | - Norman E Davey
- Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland.,Ireland UCD School of Medicine & Medical Science, University College Dublin, Belfield, Dublin 4, Ireland
| | - Radoslav Davidović
- Centre for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, 11001 Belgrade, Serbia
| | - Zsuzsanna Dosztányi
- MTA-ELTE Lendület Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, 1/c Pázmány Péter sétány, 1117 Budapest, Hungary.,Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Arne Elofsson
- Department of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Alessandra Gasparini
- Department of Woman and Child Health, University of Padova, I-35128 Padova, Italy
| | - András Hatos
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy.,Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier 1919 Route de Mende, Cedex 5, Montpellier 34293, France.,Institut de Biologie Computationnelle (IBC), Montpellier 34095, France.,University ITMO, Institute of Bioengineering, St. Petersburg 197101, Russia
| | - Lajos Kalmar
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary.,Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Emanuela Leonardi
- Department of Woman and Child Health, University of Padova, I-35128 Padova, Italy
| | - Tamas Lazar
- Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium
| | - Sandra Macedo-Ribeiro
- Biomolecular Structure and Function Group, Instituto de Biologia Molecular e Celular (IBMC) and Instituto de Investigação e Inovação em Saúde (i3S), Universidade do Porto, 4200-135 Porto, Portugal
| | - Mauricio Macossay-Castillo
- Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium
| | - Attila Meszaros
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Giovanni Minervini
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Nikoletta Murvai
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Jordi Pujols
- Departament de Bioquimica i Biologia Molecular and Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Bellaterra 08193, Spain
| | - Daniel B Roche
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier 1919 Route de Mende, Cedex 5, Montpellier 34293, France.,Institut de Biologie Computationnelle (IBC), Montpellier 34095, France
| | | | - Eva Schad
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | | | - Beata Szabo
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Agnes Tantos
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Fiorella Tonello
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy.,CNR Institute of Neurosceince, I-35121 Padova, Italy
| | - Konstantinos D Tsirigos
- Department of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Nevena Veljković
- Centre for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, 11001 Belgrade, Serbia
| | - Salvador Ventura
- Departament de Bioquimica i Biologia Molecular and Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Bellaterra 08193, Spain
| | - Wim Vranken
- Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium.,Interuniversity Institute of Bioinformatics in Brussels (IB2), ULB-VUB, Brussels 1050, Belgium
| | - Per Warholm
- Department of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Vladimir N Uversky
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, 194064 St. Petersburg, Russia.,Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - A Keith Dunker
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, 46202 Indianapolis, IN, USA
| | - Sonia Longhi
- Aix-Marseille Univ, CNRS, AFMB, UMR 7257, Marseille, France
| | - Peter Tompa
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary .,Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy .,CNR Institute of Neurosceince, I-35121 Padova, Italy
| |
Collapse
|
18
|
Piovesan D, Tabaro F, Mičetić I, Necci M, Quaglia F, Oldfield CJ, Aspromonte MC, Davey NE, Davidović R, Dosztányi Z, Elofsson A, Gasparini A, Hatos A, Kajava AV, Kalmar L, Leonardi E, Lazar T, Macedo-Ribeiro S, Macossay-Castillo M, Meszaros A, Minervini G, Murvai N, Pujols J, Roche DB, Salladini E, Schad E, Schramm A, Szabo B, Tantos A, Tonello F, Tsirigos KD, Veljković N, Ventura S, Vranken W, Warholm P, Uversky VN, Dunker AK, Longhi S, Tompa P, Tosatto SCE. DisProt 7.0: a major update of the database of disordered proteins. Nucleic Acids Res 2016; 45:D219-D227. [PMID: 27899601 PMCID: PMC5210544 DOI: 10.1093/nar/gkw1056] [Citation(s) in RCA: 193] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Revised: 10/19/2016] [Accepted: 10/21/2016] [Indexed: 01/16/2023] Open
Abstract
The Database of Protein Disorder (DisProt, URL: www.disprot.org) has been significantly updated and upgraded since its last major renewal in 2007. The current release holds information on more than 800 entries of IDPs/IDRs, i.e. intrinsically disordered proteins or regions that exist and function without a well-defined three-dimensional structure. We have re-curated previous entries to purge DisProt from conflicting cases, and also upgraded the functional classification scheme to reflect continuous advance in the field in the past 10 years or so. We define IDPs as proteins that are disordered along their entire sequence, i.e. entirely lack structural elements, and IDRs as regions that are at least five consecutive residues without well-defined structure. We base our assessment of disorder strictly on experimental evidence, such as X-ray crystallography and nuclear magnetic resonance (primary techniques) and a broad range of other experimental approaches (secondary techniques). Confident and ambiguous annotations are highlighted separately. DisProt 7.0 presents classified knowledge regarding the experimental characterization and functional annotations of IDPs/IDRs, and is intended to provide an invaluable resource for the research community for a better understanding structural disorder and for developing better computational tools for studying disordered proteins.
Collapse
Affiliation(s)
- Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Francesco Tabaro
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy.,Institute of Biosciences and Medical Technology, University of Tampere, Finland
| | - Ivan Mičetić
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Marco Necci
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Federica Quaglia
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Christopher J Oldfield
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, 46202 Indianapolis, IN, USA
| | | | - Norman E Davey
- Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland.,Ireland UCD School of Medicine & Medical Science, University College Dublin, Belfield, Dublin 4, Ireland
| | - Radoslav Davidović
- Centre for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, 11001 Belgrade, Serbia
| | - Zsuzsanna Dosztányi
- MTA-ELTE Lendület Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, 1/c Pázmány Péter sétány, 1117 Budapest, Hungary.,Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary
| | - Arne Elofsson
- Department of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Alessandra Gasparini
- Department of Woman and Child Health, University of Padova, I-35128 Padova, Italy
| | - András Hatos
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy.,Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier 1919 Route de Mende, Cedex 5, Montpellier 34293, France.,Institut de Biologie Computationnelle (IBC), Montpellier 34095, France.,University ITMO, Institute of Bioengineering, St. Petersburg 197101, Russia
| | - Lajos Kalmar
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary.,Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Emanuela Leonardi
- Department of Woman and Child Health, University of Padova, I-35128 Padova, Italy
| | - Tamas Lazar
- Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium
| | - Sandra Macedo-Ribeiro
- Biomolecular Structure and Function Group, Instituto de Biologia Molecular e Celular (IBMC) and Instituto de Investigação e Inovação em Saúde (i3S), Universidade do Porto, 4200-135 Porto, Portugal
| | - Mauricio Macossay-Castillo
- Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium
| | - Attila Meszaros
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary
| | - Giovanni Minervini
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy
| | - Nikoletta Murvai
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary
| | - Jordi Pujols
- Departament de Bioquimica i Biologia Molecular and Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Bellaterra 08193, Spain
| | - Daniel B Roche
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier 1919 Route de Mende, Cedex 5, Montpellier 34293, France.,Institut de Biologie Computationnelle (IBC), Montpellier 34095, France
| | | | - Eva Schad
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary
| | | | - Beata Szabo
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary
| | - Agnes Tantos
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary
| | - Fiorella Tonello
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy.,CNR Institute of Neurosceince, I-35121 Padova, Italy
| | - Konstantinos D Tsirigos
- Department of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Nevena Veljković
- Centre for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, 11001 Belgrade, Serbia
| | - Salvador Ventura
- Departament de Bioquimica i Biologia Molecular and Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, Bellaterra 08193, Spain
| | - Wim Vranken
- Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium.,Interuniversity Institute of Bioinformatics in Brussels (IB2), ULB-VUB, Brussels 1050, Belgium
| | - Per Warholm
- Department of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Vladimir N Uversky
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, 194064 St. Petersburg, Russia.,Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - A Keith Dunker
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, 46202 Indianapolis, IN, USA
| | - Sonia Longhi
- Aix-Marseille Univ, CNRS, AFMB, UMR 7257, Marseille, France
| | - Peter Tompa
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7,H-1518 Budapest, Hungary .,Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,Structural Biology Research Center (SBRC), Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, I-35121 Padova, Italy .,CNR Institute of Neurosceince, I-35121 Padova, Italy
| |
Collapse
|
19
|
Tsirigos KD, Elofsson A, Bagos PG. PRED-TMBB2: improved topology prediction and detection of beta-barrel outer membrane proteins. Bioinformatics 2016; 32:i665-i671. [DOI: 10.1093/bioinformatics/btw444] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
|
20
|
Hayat S, Peters C, Shu N, Tsirigos KD, Elofsson A. Inclusion of dyad-repeat pattern improves topology prediction of transmembrane β-barrel proteins. Bioinformatics 2016; 32:1571-3. [PMID: 26794316 DOI: 10.1093/bioinformatics/btw025] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Accepted: 01/14/2016] [Indexed: 11/14/2022] Open
Abstract
UNLABELLED : Accurate topology prediction of transmembrane β-barrels is still an open question. Here, we present BOCTOPUS2, an improved topology prediction method for transmembrane β-barrels that can also identify the barrel domain, predict the topology and identify the orientation of residues in transmembrane β-strands. The major novelty of BOCTOPUS2 is the use of the dyad-repeat pattern of lipid and pore facing residues observed in transmembrane β-barrels. In a cross-validation test on a benchmark set of 42 proteins, BOCTOPUS2 predicts the correct topology in 69% of the proteins, an improvement of more than 10% over the best earlier method (BOCTOPUS) and in addition, it produces significantly fewer erroneous predictions on non-transmembrane β-barrel proteins. AVAILABILITY AND IMPLEMENTATION BOCTOPUS2 webserver along with full dataset and source code is available at http://boctopus.bioinfo.se/ CONTACT : arne@bioinfo.se SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sikander Hayat
- Memorial Sloan Kettering Cancer Center, New York City, NY, USA
| | - Christoph Peters
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and
| | - Nanjiang Shu
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and Sweden Bioinformatics Infrastructure for Life Sciences (BILS), Stockholm University, Sweden
| | - Konstantinos D Tsirigos
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and
| | - Arne Elofsson
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and
| |
Collapse
|
21
|
Peters C, Tsirigos KD, Shu N, Elofsson A. Improved topology prediction using the terminal hydrophobic helices rule. Bioinformatics 2015; 32:1158-62. [PMID: 26644416 DOI: 10.1093/bioinformatics/btv709] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Accepted: 11/29/2015] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The translocon recognizes sufficiently hydrophobic regions of a protein and inserts them into the membrane. Computational methods try to determine what hydrophobic regions are recognized by the translocon. Although these predictions are quite accurate, many methods still fail to distinguish marginally hydrophobic transmembrane (TM) helices and equally hydrophobic regions in soluble protein domains. In vivo, this problem is most likely avoided by targeting of the TM-proteins, so that non-TM proteins never see the translocon. Proteins are targeted to the translocon by an N-terminal signal peptide. The targeting is also aided by the fact that the N-terminal helix is more hydrophobic than other TM-helices. In addition, we also recently found that the C-terminal helix is more hydrophobic than central helices. This information has not been used in earlier topology predictors. RESULTS Here, we use the fact that the N- and C-terminal helices are more hydrophobic to develop a new version of the first-principle-based topology predictor, SCAMPI. The new predictor has two main advantages; first, it can be used to efficiently separate membrane and non-membrane proteins directly without the use of an extra prefilter, and second it shows improved performance for predicting the topology of membrane proteins that contain large non-membrane domains. AVAILABILITY AND IMPLEMENTATION The predictor, a web server and all datasets are available at http://scampi.bioinfo.se/ CONTACT arne@bioinfo.se SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Christoph Peters
- Department of Biochemistry and Biophysics, Science for Life Laboratory and
| | | | - Nanjiang Shu
- Department of Biochemistry and Biophysics, Science for Life Laboratory and Sweden Bioinformatics Infrastructure for Life Sciences (BILS), Stockholm University, Solna 17121, Sweden
| | - Arne Elofsson
- Department of Biochemistry and Biophysics, Science for Life Laboratory and
| |
Collapse
|
22
|
Babbitt PC, Bagos PG, Bairoch A, Bateman A, Chatonnet A, Chen MJ, Craik DJ, Finn RD, Gloriam D, Haft DH, Henrissat B, Holliday GL, Isberg V, Kaas Q, Landsman D, Lenfant N, Manning G, Nagano N, Srinivasan N, O'Donovan C, Pruitt KD, Sowdhamini R, Rawlings ND, Saier MH, Sharman JL, Spedding M, Tsirigos KD, Vastermark A, Vriend G. Creating a specialist protein resource network: a meeting report for the protein bioinformatics and community resources retreat. Database (Oxford) 2015; 2015:bav063. [PMID: 26284514 PMCID: PMC4499208 DOI: 10.1093/database/bav063] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Revised: 05/14/2015] [Accepted: 05/18/2015] [Indexed: 11/14/2022]
Abstract
During 11–12 August 2014, a Protein Bioinformatics and Community Resources Retreat was held at the Wellcome Trust Genome Campus in Hinxton, UK. This meeting brought together the principal investigators of several specialized protein resources (such as CAZy, TCDB and MEROPS) as well as those from protein databases from the large Bioinformatics centres (including UniProt and RefSeq). The retreat was divided into five sessions: (1) key challenges, (2) the databases represented, (3) best practices for maintenance and curation, (4) information flow to and from large data centers and (5) communication and funding. An important outcome of this meeting was the creation of a Specialist Protein Resource Network that we believe will improve coordination of the activities of its member resources. We invite further protein database resources to join the network and continue the dialogue.
Collapse
Affiliation(s)
- Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences and California Institute for Quantitative Biosciences, University of California San Francisco, 1700 4th Street, San Francisco, CA 94158, USA
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou 2-4, Lamia, 35100, Greece
| | - Amos Bairoch
- SIB-Swiss Institute of Bioinformatics, CMU, 1 rue Michel Servet, 1211 Geneva 4, Switzerland
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Arnaud Chatonnet
- INRA, UMR866 Dynamique Musculaire et Métabolisme, F-34000 Montpellier, France
| | - Mark Jinan Chen
- Razavi Newman Center for Bioinformatics, Salk Institute, 10010 North Torrey Pines Rd., La Jolla, CA 92037, USA, ; Bioinformatics & Computational Biology, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA
| | - David J Craik
- Queensland Bioscience Precinct, 306 Carmody Rd, Building 80, The University of Queensland, Australia
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - David Gloriam
- Department of Drug Design and Pharmacology, University of Copenhagen, Jagtvej 162, 2100 København Ø, Denmark
| | - Daniel H Haft
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38 A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Bernard Henrissat
- Architecture et Fonction des Macromolécules Biologiques, CNRS, Aix-Marseille Université, 13288 Marseille, France, ; Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Gemma L Holliday
- Department of Bioengineering and Therapeutic Sciences and California Institute for Quantitative Biosciences, University of California San Francisco, 1700 4th Street, San Francisco, CA 94158, USA
| | - Vignir Isberg
- Department of Drug Design and Pharmacology, University of Copenhagen, Jagtvej 162, 2100 København Ø, Denmark, ; CMBI, Raboudumc, Geert Grootplein Zuid 26-28, 6525 GA Nijmegen, The Netherlands
| | - Quentin Kaas
- Queensland Bioscience Precinct, 306 Carmody Rd, Building 80, The University of Queensland, Australia
| | - David Landsman
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38 A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Nicolas Lenfant
- INRA, UMR866 Dynamique Musculaire et Métabolisme, F-34000 Montpellier, France
| | - Gerard Manning
- Razavi Newman Center for Bioinformatics, Salk Institute, 10010 North Torrey Pines Rd., La Jolla, CA 92037, USA, ; Bioinformatics & Computational Biology, Genentech, 1 DNA Way, South San Francisco, CA 94080, USA
| | - Nozomi Nagano
- Biotechnology Research Institute for Drug Discovery, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Japan
| | | | - Claire O'Donovan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kim D Pruitt
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38 A, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Ramanathan Sowdhamini
- National Centre for Biological Sciences, TIFR, GKVK Campus, Bangalore 560 065, India
| | - Neil D Rawlings
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Milton H Saier
- Department of Molecular Biology, University of California at San Diego, La Jolla, CA 92093-0116, USA
| | - Joanna L Sharman
- Centre for Integrative Physiology, University of Edinburgh, Hugh Robson Building, George Square, Edinburgh EH8 9XD, UK
| | - Michael Spedding
- Spedding Research Solutions, 6 Rue Ampere, 78110 Le Vesinet, France and
| | - Konstantinos D Tsirigos
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Swedish E-Science Research Center, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Ake Vastermark
- Department of Molecular Biology, University of California at San Diego, La Jolla, CA 92093-0116, USA
| | - Gerrit Vriend
- CMBI, Raboudumc, Geert Grootplein Zuid 26-28, 6525 GA Nijmegen, The Netherlands
| |
Collapse
|
23
|
Tsirigos KD, Peters C, Shu N, Käll L, Elofsson A. The TOPCONS web server for consensus prediction of membrane protein topology and signal peptides. Nucleic Acids Res 2015; 43:W401-7. [PMID: 25969446 PMCID: PMC4489233 DOI: 10.1093/nar/gkv485] [Citation(s) in RCA: 601] [Impact Index Per Article: 66.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2015] [Accepted: 05/02/2015] [Indexed: 11/21/2022] Open
Abstract
TOPCONS (http://topcons.net/) is a widely used web server for consensus prediction of membrane protein topology. We hereby present a major update to the server, with some substantial improvements, including the following: (i) TOPCONS can now efficiently separate signal peptides from transmembrane regions. (ii) The server can now differentiate more successfully between globular and membrane proteins. (iii) The server now is even slightly faster, although a much larger database is used to generate the multiple sequence alignments. For most proteins, the final prediction is produced in a matter of seconds. (iv) The user-friendly interface is retained, with the additional feature of submitting batch files and accessing the server programmatically using standard interfaces, making it thus ideal for proteome-wide analyses. Indicatively, the user can now scan the entire human proteome in a few days. (v) For proteins with homology to a known 3D structure, the homology-inferred topology is also displayed. (vi) Finally, the combination of methods currently implemented achieves an overall increase in performance by 4% as compared to the currently available best-scoring methods and TOPCONS is the only method that can identify signal peptides and still maintain a state-of-the-art performance in topology predictions.
Collapse
Affiliation(s)
- Konstantinos D Tsirigos
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Christoph Peters
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Nanjiang Shu
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden Bioinformatics Infrastructure for Life Sciences (BILS), Stockholm University, Sweden
| | - Lukas Käll
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Arne Elofsson
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| |
Collapse
|
24
|
Virkki M, Boekel C, Illergård K, Peters C, Shu N, Tsirigos KD, Elofsson A, von Heijne G, Nilsson I. Large tilts in transmembrane helices can be induced during tertiary structure formation. J Mol Biol 2014; 426:2529-38. [PMID: 24793448 DOI: 10.1016/j.jmb.2014.04.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2014] [Revised: 04/19/2014] [Accepted: 04/22/2014] [Indexed: 10/25/2022]
Abstract
While early structural models of helix-bundle integral membrane proteins posited that the transmembrane α-helices [transmembrane helices (TMHs)] were orientated more or less perpendicular to the membrane plane, there is now ample evidence from high-resolution structures that many TMHs have significant tilt angles relative to the membrane. Here, we address the question whether the tilt is an intrinsic property of the TMH in question or if it is imparted on the TMH during folding of the protein. Using a glycosylation mapping technique, we show that four highly tilted helices found in multi-spanning membrane proteins all have much shorter membrane-embedded segments when inserted by themselves into the membrane than seen in the high-resolution structures. This suggests that tilting can be induced by tertiary packing interactions within the protein, subsequent to the initial membrane-insertion step.
Collapse
Affiliation(s)
- Minttu Virkki
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Science for Life Laboratory, Stockholm University, SE-17177 Solna, Sweden
| | - Carolina Boekel
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden
| | - Kristoffer Illergård
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Science for Life Laboratory, Stockholm University, SE-17177 Solna, Sweden
| | - Christoph Peters
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Science for Life Laboratory, Stockholm University, SE-17177 Solna, Sweden
| | - Nanjiang Shu
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Science for Life Laboratory, Stockholm University, SE-17177 Solna, Sweden
| | - Konstantinos D Tsirigos
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Science for Life Laboratory, Stockholm University, SE-17177 Solna, Sweden
| | - Arne Elofsson
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Science for Life Laboratory, Stockholm University, SE-17177 Solna, Sweden
| | - Gunnar von Heijne
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden; Science for Life Laboratory, Stockholm University, SE-17177 Solna, Sweden.
| | - IngMarie Nilsson
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm University, SE-10691 Stockholm, Sweden.
| |
Collapse
|
25
|
Tsirigos KD, Hennerdal A, Käll L, Elofsson A. A guideline to proteome-wide α-helical membrane protein topology predictions. Proteomics 2012; 12:2282-94. [DOI: 10.1002/pmic.201100495] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
26
|
Fimereli DK, Tsirigos KD, Litou ZI, Liakopoulos TD, Bagos PG, Hamodrakas SJ. CW-PRED: A HMM-Based Method for the Classification of Cell Wall-Anchored Proteins of Gram-Positive Bacteria. Lecture Notes in Computer Science 2012. [DOI: 10.1007/978-3-642-30448-4_36] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
27
|
Abstract
We describe here OMPdb, which is currently the most complete and comprehensive collection of integral β-barrel outer membrane proteins from Gram-negative bacteria. The database currently contains 69,354 proteins, which are classified into 85 families, based mainly on structural and functional criteria. Although OMPdb follows the annotation scheme of Pfam, many of the families included in the database were not previously described or annotated in other publicly available databases. There are also cross-references to other databases, references to the literature and annotation for sequence features, like transmembrane segments and signal peptides. Furthermore, via the web interface, the user can not only browse the available data, but submit advanced text searches and run BLAST queries against the database protein sequences or domain searches against the collection of profile Hidden Markov Models that represent each family's domain organization as well. The database is freely accessible for academic users at http://bioinformatics.biol.uoa.gr/OMPdb and we expect it to be useful for genome-wide analyses, comparative genomics as well as for providing training and test sets for predictive algorithms regarding transmembrane β-barrels.
Collapse
Affiliation(s)
- Konstantinos D Tsirigos
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 15701, Greece
| | | | | |
Collapse
|
28
|
Abstract
MOTIVATION Computational prediction of signal peptides is of great importance in computational biology. In addition to the general secretory pathway (Sec), Bacteria, Archaea and chloroplasts possess another major pathway that utilizes the Twin-Arginine translocase (Tat), which recognizes longer and less hydrophobic signal peptides carrying a distinctive pattern of two consecutive Arginines (RR) in the n-region. A major functional differentiation between the Sec and Tat export pathways lies in the fact that the former translocates secreted proteins unfolded through a protein-conducting channel, whereas the latter translocates completely folded proteins using an unknown mechanism. The purpose of this work is to develop a novel method for predicting and discriminating Sec from Tat signal peptides at better accuracy. RESULTS We report the development of a novel method, PRED-TAT, which is capable of discriminating Sec from Tat signal peptides and predicting their cleavage sites. The method is based on Hidden Markov Models and possesses a modular architecture suitable for both Sec and Tat signal peptides. On an independent test set of experimentally verified Tat signal peptides, PRED-TAT clearly outperforms the previously proposed methods TatP and TATFIND, whereas, when evaluated as a Sec signal peptide predictor compares favorably to top-scoring predictors such as SignalP and Phobius. The method is freely available for academic users at http://www.compgen.org/tools/PRED-TAT/.
Collapse
Affiliation(s)
- Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Central Greece, Papasiopoulou 2-4, Lamia 35100, Greece.
| | | | | | | |
Collapse
|
29
|
Tsaousis GN, Tsirigos KD, Andrianou XD, Liakopoulos TD, Bagos PG, Hamodrakas SJ. ExTopoDB: a database of experimentally derived topological models of transmembrane proteins. ACTA ACUST UNITED AC 2010; 26:2490-2. [PMID: 20601677 DOI: 10.1093/bioinformatics/btq362] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
UNLABELLED ExTopoDB is a publicly accessible database of experimentally derived topological models of transmembrane proteins. It contains information collected from studies in the literature that report the use of biochemical methods for the determination of the topology of α-helical transmembrane proteins. Transmembrane protein topology is highly important in order to understand their function and ExTopoDB provides an up to date, complete and comprehensive dataset of experimentally determined topologies of α-helical transmembrane proteins. Topological information is combined with transmembrane topology prediction resulting in more reliable topological models. AVAILABILITY http://bioinformatics.biol.uoa.gr/ExTopoDB.
Collapse
Affiliation(s)
- Georgios N Tsaousis
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Panepistimiopolis Athens, Greece
| | | | | | | | | | | |
Collapse
|
30
|
Bagos PG, Tsirigos KD, Liakopoulos TD, Hamodrakas SJ. Prediction of lipoprotein signal peptides in Gram-positive bacteria with a Hidden Markov Model. J Proteome Res 2009; 7:5082-93. [PMID: 19367716 DOI: 10.1021/pr800162c] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
We present a Hidden Markov Model method for the prediction of lipoprotein signal peptides of Gram-positive bacteria, trained on a set of 67 experimentally verified lipoproteins. The method outperforms LipoP and the methods based on regular expression patterns, in various data sets containing experimentally characterized lipoproteins, secretory proteins, proteins with an N-terminal TM segment and cytoplasmic proteins. The method is also very sensitive and specific in the detection of secretory signal peptides and in terms of overall accuracy outperforms even SignalP, which is the top-scoring method for the prediction of signal peptides. PRED-LIPO is freely available at http://bioinformatics.biol.uoa.gr/PRED-LIPO/, and we anticipate that it will be a valuable tool for the experimentalists studying secreted proteins and lipoproteins from Gram-positive bacteria.
Collapse
Affiliation(s)
- Pantelis G Bagos
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 15701, Greece.
| | | | | | | |
Collapse
|
31
|
Abstract
Computational prediction of signal peptides (SPs) and their cleavage sites is of great importance in computational biology; however, currently there is no available method capable of predicting reliably the SPs of archaea, due to the limited amount of experimentally verified proteins with SPs. We performed an extensive literature search in order to identify archaeal proteins having experimentally verified SP and managed to find 69 such proteins, the largest number ever reported. A detailed analysis of these sequences revealed some unique features of the SPs of archaea, such as the unique amino acid composition of the hydrophobic region with a higher than expected occurrence of isoleucine, and a cleavage site resembling more the sequences of gram-positives with almost equal amounts of alanine and valine at the position-3 before the cleavage site and a dominant alanine at position-1, followed in abundance by serine and glycine. Using these proteins as a training set, we trained a hidden Markov model method that predicts the presence of the SPs and their cleavage sites and also discriminates such proteins from cytoplasmic and transmembrane ones. The method performs satisfactorily, yielding a 35-fold cross-validation procedure, a sensitivity of 100% and specificity 98.41% with the Matthews' correlation coefficient being equal to 0.964. This particular method is currently the only available method for the prediction of secretory SPs in archaea, and performs consistently and significantly better compared with other available predictors that were trained on sequences of eukaryotic or bacterial origin. Searching 48 completely sequenced archaeal genomes we identified 9437 putative SPs. The method, PRED-SIGNAL, and the results are freely available for academic users at http://bioinformatics.biol.uoa.gr/PRED-SIGNAL/ and we anticipate that it will be a valuable tool for the computational analysis of archaeal genomes.
Collapse
Affiliation(s)
- P G Bagos
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 15701, Greece.
| | | | | | | | | |
Collapse
|
32
|
Litou ZI, Bagos PG, Tsirigos KD, Liakopoulos TD, Hamodrakas SJ. Prediction of cell wall sorting signals in gram-positive bacteria with a hidden markov model: application to complete genomes. J Bioinform Comput Biol 2008; 6:387-401. [PMID: 18464329 DOI: 10.1142/s0219720008003382] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2007] [Revised: 09/25/2007] [Accepted: 12/18/2007] [Indexed: 11/18/2022]
Abstract
Surface proteins in Gram-positive bacteria are frequently implicated in virulence. We have focused on a group of extracellular cell wall-attached proteins (CWPs), containing an LPXTG motif for cleavage and covalent coupling to peptidoglycan by sortase enzymes. A hidden Markov model (HMM) approach for predicting the LPXTG-anchored cell wall proteins of Gram-positive bacteria was developed and compared against existing methods. The HMM model is parsimonious in terms of the number of freely estimated parameters, and it has proved to be very sensitive and specific in a training set of 55 experimentally verified LPXTG-anchored cell wall proteins as well as in reliable data sets of globular and transmembrane proteins. In order to identify such proteins in Gram-positive bacteria, a comprehensive analysis of 94 completely sequenced genomes has been performed. We identified, in total, 860 LPXTG-anchored cell wall proteins, a number that is significantly higher compared to those obtained by other available methods. Of these proteins, 237 are hypothetical proteins according to the annotation of SwissProt, and 88 had no homologs in the SwissProt database--this might be evidence that they are members of newly identified families of CWPs. The prediction tool, the database with the proteins identified in the genomes, and supplementary material are available online at http://bioinformatics.biol.uoa.gr/CW-PRED/.
Collapse
Affiliation(s)
- Zoi I Litou
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 15701, Greece.
| | | | | | | | | |
Collapse
|