1
|
Feng SH, Zhang WX, Yang J, Yang Y, Shen HB. Topology Prediction Improvement of α-helical Transmembrane Proteins Through Helix-tail Modeling and Multiscale Deep Learning Fusion. J Mol Biol 2020; 432:1279-1296. [DOI: 10.1016/j.jmb.2019.12.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 12/02/2019] [Accepted: 12/04/2019] [Indexed: 12/18/2022]
|
2
|
Predicting Alpha Helical Transmembrane Proteins Using HMMs. Methods Mol Biol 2018; 1552:63-82. [PMID: 28224491 DOI: 10.1007/978-1-4939-6753-7_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
Alpha helical transmembrane (TM) proteins constitute an important structural class of membrane proteins involved in a wide variety of cellular functions. The prediction of their transmembrane topology, as well as their discrimination in newly sequenced genomes, is of great importance for the elucidation of their structure and function. Several methods have been applied for the prediction of the transmembrane segments and the topology of alpha helical transmembrane proteins utilizing different algorithmic techniques. Hidden Markov Models (HMMs) have been efficiently used in the development of several computational methods used for this task. In this chapter we give a brief review of different available prediction methods for alpha helical transmembrane proteins pointing out sequence and structural features that should be incorporated in a prediction method. We then describe the procedure of the design and development of a Hidden Markov Model capable of predicting the transmembrane alpha helices in proteins and discriminating them from globular proteins.
Collapse
|
3
|
Highly diversified expansions shaped the evolution of membrane bound proteins in metazoans. Sci Rep 2017; 7:12387. [PMID: 28959054 PMCID: PMC5620054 DOI: 10.1038/s41598-017-11543-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Accepted: 08/15/2017] [Indexed: 11/20/2022] Open
Abstract
The dramatic increase in membrane proteome complexity is arguably one of the most pivotal evolutionary events that underpins the origin of multicellular animals. However, the origin of a significant number of membrane families involved in metazoan development has not been clarified. In this study, we have manually curated the membrane proteomes of 22 metazoan and 2 unicellular holozoan species. We identify 123,014 membrane proteins in these 24 eukaryotic species and classify 86% of the dataset. We determine 604 functional clusters that are present from the last holozoan common ancestor (LHCA) through many metazoan species. Intriguingly, we show that more than 70% of the metazoan membrane protein families have a premetazoan origin. The data show that enzymes are more highly represented in the LHCA and expand less than threefold throughout metazoan species; in contrast to receptors that are relatively few in the LHCA but expand nearly eight fold within metazoans. Expansions related to cell adhesion, communication, immune defence, and developmental processes are shown in conjunction with emerging biological systems, such as neuronal development, cytoskeleton organization, and the adaptive immune response. This study defines the possible LHCA membrane proteome and describes the fundamental functional clusters that underlie metazoan diversity and innovation.
Collapse
|
4
|
Nodzyński T, Vanneste S, Zwiewka M, Pernisová M, Hejátko J, Friml J. Enquiry into the Topology of Plasma Membrane-Localized PIN Auxin Transport Components. MOLECULAR PLANT 2016; 9:1504-1519. [PMID: 27622590 PMCID: PMC5106287 DOI: 10.1016/j.molp.2016.08.010] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Revised: 08/15/2016] [Accepted: 08/26/2016] [Indexed: 05/25/2023]
Abstract
Auxin directs plant ontogenesis via differential accumulation within tissues depending largely on the activity of PIN proteins that mediate auxin efflux from cells and its directional cell-to-cell transport. Regardless of the developmental importance of PINs, the structure of these transporters is poorly characterized. Here, we present experimental data concerning protein topology of plasma membrane-localized PINs. Utilizing approaches based on pH-dependent quenching of fluorescent reporters combined with immunolocalization techniques, we mapped the membrane topology of PINs and further cross-validated our results using available topology modeling software. We delineated the topology of PIN1 with two transmembrane (TM) bundles of five α-helices linked by a large intracellular loop and a C-terminus positioned outside the cytoplasm. Using constraints derived from our experimental data, we also provide an updated position of helical regions generating a verisimilitude model of PIN1. Since the canonical long PINs show a high degree of conservation in TM domains and auxin transport capacity has been demonstrated for Arabidopsis representatives of this group, this empirically enhanced topological model of PIN1 will be an important starting point for further studies on PIN structure-function relationships. In addition, we have established protocols that can be used to probe the topology of other plasma membrane proteins in plants.
Collapse
Affiliation(s)
- Tomasz Nodzyński
- CEITEC - Central European Institute of Technology, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic.
| | - Steffen Vanneste
- Department of Plant Systems Biology, VIB, 9052 Gent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Gent, Belgium
| | - Marta Zwiewka
- CEITEC - Central European Institute of Technology, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic
| | - Markéta Pernisová
- CEITEC - Central European Institute of Technology, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic; National Centre for Biomolecular Research, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic
| | - Jan Hejátko
- CEITEC - Central European Institute of Technology, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic; National Centre for Biomolecular Research, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic
| | - Jiří Friml
- Institute of Science and Technology Austria (IST Austria), Am Campus 1, 3400 Klosterneuburg, Austria.
| |
Collapse
|
5
|
HMMpTM: improving transmembrane protein topology prediction using phosphorylation and glycosylation site prediction. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1844:316-22. [PMID: 24225132 DOI: 10.1016/j.bbapap.2013.11.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2013] [Revised: 11/02/2013] [Accepted: 11/04/2013] [Indexed: 11/22/2022]
Abstract
During the last two decades a large number of computational methods have been developed for predicting transmembrane protein topology. Current predictors rely on topogenic signals in the protein sequence, such as the distribution of positively charged residues in extra-membrane loops and the existence of N-terminal signals. However, phosphorylation and glycosylation are post-translational modifications (PTMs) that occur in a compartment-specific manner and therefore the presence of a phosphorylation or glycosylation site in a transmembrane protein provides topological information. We examine the combination of phosphorylation and glycosylation site prediction with transmembrane protein topology prediction. We report the development of a Hidden Markov Model based method, capable of predicting the topology of transmembrane proteins and the existence of kinase specific phosphorylation and N/O-linked glycosylation sites along the protein sequence. Our method integrates a novel feature in transmembrane protein topology prediction, which results in improved performance for topology prediction and reliable prediction of phosphorylation and glycosylation sites. The method is freely available at http://bioinformatics.biol.uoa.gr/HMMpTM.
Collapse
|
6
|
Islam ST, Lam JS. Topological mapping methods for α-helical bacterial membrane proteins--an update and a guide. Microbiologyopen 2013; 2:350-64. [PMID: 23408725 PMCID: PMC3633358 DOI: 10.1002/mbo3.72] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2012] [Revised: 01/04/2013] [Accepted: 01/10/2013] [Indexed: 12/19/2022] Open
Abstract
Integral membrane proteins with α-helical transmembrane segments (TMS) are known to play important and diverse roles in prokaryotic cell physiology. The net hydrophobicity of TMS directly corresponds to the observed difficulties in expressing and purifying these proteins, let alone producing sufficient yields for structural studies using two-/three-dimensional (2D/3D) crystallographic or nuclear magnetic resonance methods. To gain insight into the function of these integral membrane proteins, topological mapping has become an important tool to identify exposed and membrane-embedded protein domains. This approach has led to the discovery of protein tracts of functional importance and to the proposition of novel mechanistic hypotheses. In this review, we synthesize the various methods available for topological mapping of α-helical integral membrane proteins to provide investigators with a comprehensive reference for choosing techniques suited to their particular topological queries and available resources.
Collapse
Affiliation(s)
- Salim T Islam
- Department of Molecular and Cellular Biology, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | | |
Collapse
|
7
|
Islam ST, Lam JS. Wzx flippase-mediated membrane translocation of sugar polymer precursors in bacteria. Environ Microbiol 2012; 15:1001-15. [PMID: 23016929 DOI: 10.1111/j.1462-2920.2012.02890.x] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Revised: 08/13/2012] [Accepted: 08/18/2012] [Indexed: 11/29/2022]
Abstract
Bacterial cell surface polysaccharides confer resistance to external stress and promote survival in biotic and abiotic environments. Glycan assembly often occurs at the periplasmic leaflet of the inner membrane (IM) from undecaprenyl pyrophosphate (UndPP)-linked polysaccharide units via the Wzx/Wzy-dependent pathway. Wzx is an integral IM protein found in Gram-negative and Gram-positive bacteria that mediates IM translocation of UndPP-linked sugar repeats from the cytoplasmic to the periplasmic leaflet; interaction of Wzx with other assembly proteins is indirectly supported by genetic evidence. Topological mapping has indicated 12 α-helical transmembrane segments (TMS), with the number of charged TMS residues fluctuating based on the mapping method used. A novel Wzx tertiary structure model has been built, allowing for substrate-binding or energy-coupling roles to be proposed for functionally important charged and aromatic TMS residues. It has also led to a proposed antiport-like mechanism of Wzx function. Exquisite substrate specificity of Wzx proteins was recently revealed in distinguishing between UndPP-linked substrates with identical main-chain sugar repeats, but differing in the chemical composition of a terminal sugar side-branch cap. The objective of this review is to synthesize the most up-to-date knowledge concerning Wzx flippases and to provide perspective for future investigations in this burgeoning field.
Collapse
Affiliation(s)
- Salim T Islam
- Department of Molecular and Cellular Biology, University of Guelph, Guelph, ON, Canada, N1G 2W1
| | | |
Collapse
|
8
|
Meijles DN, Howlin BJ, Li JM. Consensus in silico computational modelling of the p22phox subunit of the NADPH oxidase. Comput Biol Chem 2012; 39:6-13. [DOI: 10.1016/j.compbiolchem.2012.05.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2012] [Revised: 05/02/2012] [Accepted: 05/02/2012] [Indexed: 11/24/2022]
|
9
|
Virzintiene E, Trane M, Hägerhäll C. Revised transmembrane orientation of the NADH:quinone oxidoreductase subunit NuoA. FEBS Lett 2011; 585:3277-83. [PMID: 21925501 DOI: 10.1016/j.febslet.2011.09.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2011] [Revised: 09/04/2011] [Accepted: 09/05/2011] [Indexed: 01/14/2023]
Abstract
NuoA is a small membrane spanning subunit of respiratory chain NADH:quinone oxidoreductase (complex I). Unlike the other complex I core protein subunits, the NuoA protein has no known homologue in other enzyme systems. The transmembrane orientation of NuoA cannot be unambiguously predicted, due to the small size of the polypeptide and the varying distribution of charged amino acid residues in NuoA from different organisms. Novel analyses of NuoA from Escherichia coli complex I expressed as fusion proteins to cytochrome c and to alkaline phosphatase demonstrated that the c-terminal end of the polypeptide is localized in the bacterial cytoplasm, in contrast to what was previously reported for the homologous NQO7 subunit from Paracoccus denitrificans complex I.
Collapse
Affiliation(s)
- Egle Virzintiene
- Department of Biochemistry and Structural Biology, Center for Molecular Protein Science, Lund University, Lund, Sweden
| | | | | |
Collapse
|
10
|
Feng WK, Wang L, Lu Y, Wang XY. A protein oxidase catalysing disulfide bond formation is localized to the chloroplast thylakoids. FEBS J 2011; 278:3419-30. [PMID: 21781282 DOI: 10.1111/j.1742-4658.2011.08265.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In chloroplasts, thiol/disulfide-redox-regulated proteins have been linked to numerous metabolic pathways. However, the biochemical system for disulfide bond formation in chloroplasts remains undetermined. In the present study, we characterized an oxidoreductase, AtVKOR-DsbA, encoded by the gene At4g35760 as a potential disulfide bond oxidant in Arabidopsis. The gene product contains two distinct domains: an integral membrane domain homologous to the catalytic subunit of mammalian vitamin K epoxide reductase (VKOR) and a soluble DsbA-like domain. Transient expression of green fluorescent protein fusion in Arabidopsis protoplasts indicated that AtVKOR-DsbA is located in the chloroplast. The first 45 amino acids from the N-terminus were found to act as a transit peptide targeting the protein to the chloroplast. An immunoblot assay of chloroplast fractions revealed that AtVKOR-DsbA was localized in the thylakoid. A motility complementation assay showed that the full-length of AtVKOR-DsbA, if lacking its transit peptide, could catalyze the formation of disulfide bonds. Among the 10 cysteine residues present in the mature protein, eight cysteines (four in the AtVKOR domain and four in the AtDsbA domain) were found to be essential for promoting disulfide bond formation. The topological arrangement of AtVKOR-DsbA was assayed using alkaline phosphatase sandwich fusions. From these results, we developed a possible topology model of AtVKOR-DsbA in chloroplasts. We propose that the integral membrane domain of AtVKOR-DsbA contains four transmembrane helices, and that both termini and the cysteines involved in catalyzing the formation of disulfide bonds face the oxidative thylakoid lumen. These studies may help to resolve some of the issues surrounding the structure and function of AtVKOR-DsbA in Arabidopsis chloroplasts.
Collapse
Affiliation(s)
- Wei-Ke Feng
- College of Life Science, Shandong Agricultural University, Shandong Taian, China
| | | | | | | |
Collapse
|
11
|
Tsaousis GN, Tsirigos KD, Andrianou XD, Liakopoulos TD, Bagos PG, Hamodrakas SJ. ExTopoDB: a database of experimentally derived topological models of transmembrane proteins. ACTA ACUST UNITED AC 2010; 26:2490-2. [PMID: 20601677 DOI: 10.1093/bioinformatics/btq362] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
UNLABELLED ExTopoDB is a publicly accessible database of experimentally derived topological models of transmembrane proteins. It contains information collected from studies in the literature that report the use of biochemical methods for the determination of the topology of α-helical transmembrane proteins. Transmembrane protein topology is highly important in order to understand their function and ExTopoDB provides an up to date, complete and comprehensive dataset of experimentally determined topologies of α-helical transmembrane proteins. Topological information is combined with transmembrane topology prediction resulting in more reliable topological models. AVAILABILITY http://bioinformatics.biol.uoa.gr/ExTopoDB.
Collapse
Affiliation(s)
- Georgios N Tsaousis
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Panepistimiopolis Athens, Greece
| | | | | | | | | | | |
Collapse
|
12
|
Influence of assignment on the prediction of transmembrane helices in protein structures. Amino Acids 2010; 39:1241-54. [DOI: 10.1007/s00726-010-0559-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2009] [Accepted: 03/08/2010] [Indexed: 02/01/2023]
|
13
|
Nam HJ, Jeon J, Kim S. Bioinformatic approaches for the structure and function of membrane proteins. BMB Rep 2009; 42:697-704. [PMID: 19944009 DOI: 10.5483/bmbrep.2009.42.11.697] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Membrane proteins play important roles in the biology of the cell, including intercellular communication and molecular transport. Their well-established importance notwithstanding, the high-resolution structures of membrane proteins remain elusive due to difficulties in protein expression, purification and crystallization. Thus, accurate prediction of membrane protein topology can increase the understanding of membrane protein function. Here, we provide a brief review of the diverse computational methods for predicting membrane protein structure and function, including recent progress and essential bioinformatics tools. Our hope is that this review will be instructive to users studying membrane protein biology in their choice of appropriate bioinformatics methods.
Collapse
Affiliation(s)
- Hyun-Jun Nam
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, Pohang, Korea
| | | | | |
Collapse
|
14
|
Marmion M, Parviainen M, Luoto M, Heikkinen RK, Thuiller W. Evaluation of consensus methods in predictive species distribution modelling. DIVERS DISTRIB 2009. [DOI: 10.1111/j.1472-4642.2008.00491.x] [Citation(s) in RCA: 837] [Impact Index Per Article: 55.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
|
15
|
Eriksson H, Lengqvist J, Hedlund J, Uhlén K, Orre LM, Bjellqvist B, Persson B, Lehtiö J, Jakobsson PJ. Quantitative membrane proteomics applying narrow range peptide isoelectric focusing for studies of small cell lung cancer resistance mechanisms. Proteomics 2008; 8:3008-18. [PMID: 18654985 DOI: 10.1002/pmic.200800174] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Drug resistance is often associated with upregulation of membrane-associated drug-efflux systems, and thus global membrane proteomics methods are valuable tools in the search for novel components of drug resistance phenotypes. Herein we have compared the microsomal proteome from the lung cancer cell line H69 and its isogenic Doxorubicin-resistant subcell line H69AR. The method used includes microsome preparation, iTRAQ labeling followed by narrow range peptide IEF in an immobilized pH-gradient (IPG-IEF) and LC-MS/MS analysis. We demonstrate that the microsomal preparation and iTRAQ labeling is reproducible regarding protein content and composition. The rationale using narrow range peptide IPG-IEF separation is demonstrated by its ability to: (i) lowering the complexity of the sample by two-thirds while keeping high proteome coverage (96%), (ii) providing high separation efficiency, and (iii) allowing for peptide validation and possibly identifications of post-transcriptional modifications. After analyzing one-fifth of the IEF fractions (effective pH range of 4.0-4.5), a total of 3704 proteins were identified, among which 527 were predicted to be membrane proteins. One of the proteins found to be differentially expressed was Serca 2, a calcium pump located in the ER membrane that potentially could result in changes of apoptotic response toward Doxorubicin.
Collapse
Affiliation(s)
- Hanna Eriksson
- Karolinska Biomics Center, Karolinska University Hospital, Stockholm, Sweden
| | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Transmembrane topology of the Acr3 family arsenite transporter from Bacillus subtilis. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2007; 1778:963-73. [PMID: 18088595 DOI: 10.1016/j.bbamem.2007.11.011] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2007] [Revised: 11/14/2007] [Accepted: 11/16/2007] [Indexed: 11/21/2022]
Abstract
The transmembrane topology of the Acr3 family arsenite transporter Acr3 from Bacillus subtilis was analysed experimentally using translational fusions with alkaline phosphatase and green fluorescent protein and in silico by topology modelling. Initial topology prediction resulted in two models with 9 and 10 TM helices respectively. 32 fusion constructs were made between truncated forms of acr3 and the reporter genes at 17 different sites throughout the acr3 sequence to discriminate between these models. Nine strong reporter protein signals provided information about the majority of the locations of the cytoplasmic and extracellular loops of Acr3 and showed that both the N- and the C-termini are located in the cytoplasm. Two ambiguous data points indicated the possibility of an alternative 8 helix topology. This possibility was investigated using another 10 fusion variants, but no experimental support for the 8 TM topology was obtained. We therefore conclude that Acr3 has 10 transmembrane helices. Overall, the loops which connect the membrane spanning segments are short, with cytoplasmic loops being somewhat longer than the extracellular loops. The study provides the first ever experimentally derived structural information on a protein of the Acr3 family which constitutes one of the largest classes of arsenite transporters.
Collapse
|
17
|
Wu Z, Zhang X, He B, Diao L, Sheng S, Wang J, Guo X, Su N, Wang L, Jiang L, Wang C, Zhai H, Wan J. A chlorophyll-deficient rice mutant with impaired chlorophyllide esterification in chlorophyll biosynthesis. PLANT PHYSIOLOGY 2007; 145:29-40. [PMID: 17535821 PMCID: PMC1976586 DOI: 10.1104/pp.107.100321] [Citation(s) in RCA: 232] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Chlorophyll (Chl) synthase catalyzes esterification of chlorophyllide to complete the last step of Chl biosynthesis. Although the Chl synthases and the corresponding genes from various organisms have been well characterized, Chl synthase mutants have not yet been reported in higher plants. In this study, a rice (Oryza Sativa) Chl-deficient mutant, yellow-green leaf1 (ygl1), was isolated, which showed yellow-green leaves in young plants with decreased Chl synthesis, increased level of tetrapyrrole intermediates, and delayed chloroplast development. Genetic analysis demonstrated that the phenotype of ygl1 was caused by a recessive mutation in a nuclear gene. The ygl1 locus was mapped to chromosome 5 and isolated by map-based cloning. Sequence analysis revealed that it encodes the Chl synthase and its identity was verified by transgenic complementation. A missense mutation was found in a highly conserved residue of YGL1 in the ygl1 mutant, resulting in reduction of the enzymatic activity. YGL1 is constitutively expressed in all tissues, and its expression is not significantly affected in the ygl1 mutant. Interestingly, the mRNA expression of the cab1R gene encoding the Chl a/b-binding protein was severely suppressed in the ygl1 mutant. Moreover, the expression of some nuclear genes associated with Chl biosynthesis or chloroplast development was also affected in ygl1 seedlings. These results indicate that the expression of nuclear genes encoding various chloroplast proteins might be feedback regulated by the level of Chl or Chl precursors.
Collapse
Affiliation(s)
- Ziming Wu
- National Key Laboratory for Crop Genetics and Germplasm Enhancement, Jiangsu Plant Gene Engineering Research Center, Nanjing Agricultural University, Nanjing 210095, China
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Affiliation(s)
- Anna E Speers
- Department of Pharmacology, University of Colorado School of Medicine, P.O. Box 6511, MS 8303, Aurora, Colorado 80045, USA
| | | |
Collapse
|
19
|
Jones DT. Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics 2007; 23:538-44. [PMID: 17237066 DOI: 10.1093/bioinformatics/btl677] [Citation(s) in RCA: 325] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Many important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion are mediated by membrane proteins. Unfortunately, as these proteins are not water soluble, it is extremely hard to experimentally determine their structure. Therefore, improved methods for predicting the structure of these proteins are vital in biological research. In order to improve transmembrane topology prediction, we evaluate the combined use of both integrated signal peptide prediction and evolutionary information in a single algorithm. RESULTS A new method (MEMSAT3) for predicting transmembrane protein topology from sequence profiles is described and benchmarked with full cross-validation on a standard data set of 184 transmembrane proteins. The method is found to predict both the correct topology and the locations of transmembrane segments for 80% of the test set. This compares with accuracies of 62-72% for other popular methods on the same benchmark. By using a second neural network specifically to discriminate transmembrane from globular proteins, a very low overall false positive rate (0.5%) can also be achieved in detecting transmembrane proteins. AVAILABILITY An implementation of the described method is available both as a web server (http://www.psipred.net) and as downloadable source code from http://bioinf.cs.ucl.ac.uk/memsat. Both the server and source code files are free to non-commercial users. Benchmark and training data are also available from http://bioinf.cs.ucl.ac.uk/memsat.
Collapse
Affiliation(s)
- David T Jones
- Department of Computer Science, University College London, Gower Street, London WC1E 6BT, United Kingdom.
| |
Collapse
|
20
|
Hu J, Yang YD, Kihara D. EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences. BMC Bioinformatics 2006; 7:342. [PMID: 16839417 PMCID: PMC1539026 DOI: 10.1186/1471-2105-7-342] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2005] [Accepted: 07/13/2006] [Indexed: 11/13/2022] Open
Abstract
Background Understanding gene regulatory networks has become one of the central research problems in bioinformatics. More than thirty algorithms have been proposed to identify DNA regulatory sites during the past thirty years. However, the prediction accuracy of these algorithms is still quite low. Ensemble algorithms have emerged as an effective strategy in bioinformatics for improving the prediction accuracy by exploiting the synergetic prediction capability of multiple algorithms. Results We proposed a novel clustering-based ensemble algorithm named EMD for de novo motif discovery by combining multiple predictions from multiple runs of one or more base component algorithms. The ensemble approach is applied to the motif discovery problem for the first time. The algorithm is tested on a benchmark dataset generated from E. coli RegulonDB. The EMD algorithm has achieved 22.4% improvement in terms of the nucleotide level prediction accuracy over the best stand-alone component algorithm. The advantage of the EMD algorithm is more significant for shorter input sequences, but most importantly, it always outperforms or at least stays at the same performance level of the stand-alone component algorithms even for longer sequences. Conclusion We proposed an ensemble approach for the motif discovery problem by taking advantage of the availability of a large number of motif discovery programs. We have shown that the ensemble approach is an effective strategy for improving both sensitivity and specificity, thus the accuracy of the prediction. The advantage of the EMD algorithm is its flexibility in the sense that a new powerful algorithm can be easily added to the system.
Collapse
Affiliation(s)
- Jianjun Hu
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Yifeng D Yang
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
- Markey Center for Structural Biology, Purdue University, West Lafayette, IN, 47907, USA
- The Bindley Bioscience Center, Discovery Park, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
21
|
Hurwitz N, Pellegrini-Calace M, Jones DT. Towards genome-scale structure prediction for transmembrane proteins. Philos Trans R Soc Lond B Biol Sci 2006; 361:465-75. [PMID: 16524835 PMCID: PMC1609336 DOI: 10.1098/rstb.2005.1804] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In this paper we briefly review some of the recent progress made by ourselves and others in developing methods for predicting the structures of transmembrane proteins from amino acid sequence. Transmembrane proteins are an important class of proteins involved in many diverse biological functions, many of which have great impact in terms of disease mechanism and drug discovery. Despite their biological importance, it has proven very difficult to solve the structures of these proteins by experimental techniques, and so there is a great deal of pressure to develop effective methods for predicting their structure. The methods we discuss range from methods for transmembrane topology prediction to new methods for low resolution folding simulations in a knowledge-based force field. This potential is designed to reproduce the properties of the lipid bilayer. Our eventual aim is to apply these methods in tandem so that useful three-dimensional models can be built for a large fraction of the transmembrane protein domains in whole proteomes.
Collapse
Affiliation(s)
- Naama Hurwitz
- Bioinformatics Unit, Department of Computer Science & Department of Biochemistry and Molecular BiologyDarwin Building, University College London, Gower Street, London WC1E 6BT, UK
| | | | - David T Jones
- Bioinformatics Unit, Department of Computer Science & Department of Biochemistry and Molecular BiologyDarwin Building, University College London, Gower Street, London WC1E 6BT, UK
- Author for correspondence ()
| |
Collapse
|
22
|
Wiles AM, Naider F, Becker JM. Transmembrane domain prediction and consensus sequence identification of the oligopeptide transport family. Res Microbiol 2006; 157:395-406. [PMID: 16364604 DOI: 10.1016/j.resmic.2005.10.004] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2005] [Revised: 10/06/2005] [Accepted: 10/07/2005] [Indexed: 11/25/2022]
Abstract
Few polytopic membrane proteins have had their topology determined experimentally. Often, researchers turn to an algorithm to predict where the transmembrane domains might lie. Here we use a consensus method, using six different transmembrane domain prediction algorithms on six members of the oligopeptide transport family, all of which have been experimentally characterized. PSI-BLAST results indicate that the six chosen oligopeptide transport family members are distributed throughout most branches of the phylogram, suggesting that these members represent a broad view of the oligopeptide transport family. We combined the prediction algorithms with a multiple sequence alignment, and consensus transmembrane domains were assigned not only based on algorithmic output, but also based on conserved familial motifs found by analysis of the PSI-BLAST results. The consensus method combined with the "charge-difference rule" yields a model topology for the family containing 12 transmembrane domains with the N- and C-termini facing extracellular.
Collapse
Affiliation(s)
- Amy M Wiles
- Department of Microbiology, University of Tennessee, Knoxville, TN 37996, USA
| | | | | |
Collapse
|
23
|
Spisni E, Tomasi V, Cestaro A, Tosatto SCE. Structural insights into the function of human caveolin 1. Biochem Biophys Res Commun 2005; 338:1383-90. [PMID: 16263077 DOI: 10.1016/j.bbrc.2005.10.099] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2005] [Accepted: 10/18/2005] [Indexed: 12/19/2022]
Abstract
Caveolin-1 (Cav-1) is emerging as the central protein controlling caveolae formation, caveolae trafficking, and cellular signalling. In particular, it is known that Cav-1 interacts and modulates the activity of several signalling proteins through the so-called caveolin scaffolding domain. In this paper, we used a bioinformatics approach to assess the validity of some long-standing structural features of Cav-1. We could confirm the existence of a membrane spanning region of Cav-1 and highlight an interesting pattern of palmitoylated cysteine residues explaining the structural features of the Cav-1 C-terminal region. Moreover, the scaffolding domain is predicted to have a different structure than previously reported.
Collapse
Affiliation(s)
- Enzo Spisni
- Department of Experimental Biology, University of Bologna, Via Selmi 3, 40126 Bologna, Italy.
| | | | | | | |
Collapse
|
24
|
Abstract
Topology predictions for integral membrane proteins can be substantially improved if parts of the protein can be constrained to a given in/out location relative to the membrane using experimental data or other information. Here, we have identified a set of 367 domains in the SMART database that, when found in soluble proteins, have compartment-specific localization of a kind relevant for membrane protein topology prediction. Using these domains as prediction constraints, we are able to provide high-quality topology models for 11% of the membrane proteins extracted from 38 eukaryotic genomes. Two-thirds of these proteins are single spanning, a group of proteins for which current topology prediction methods perform particularly poorly.
Collapse
|
25
|
Yagur-Kroll S, Amster-Choder O. Dynamic Membrane Topology of the Escherichia coli β-Glucoside Transporter BglF. J Biol Chem 2005; 280:19306-18. [PMID: 15755739 DOI: 10.1074/jbc.m410896200] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The Escherichia coli BglF protein, a permease of the phosphoenolpyruvate-dependent phosphotransferase system, catalyzes transport and phosphorylation of beta-glucosides. In addition, BglF regulates bgl operon expression by controlling the activity of the transcriptional regulator BglG via reversible phosphorylation. BglF is composed of three domains; one is hydrophobic, which presumably forms the sugar translocation channel. We studied the topology of this domain by Cys-replacement mutagenesis and chemical modification by thiol reagents. Most Cys substitutions were well tolerated, as demonstrated by the ability of the mutant proteins to catalyze BglF activities. Our results suggest that the membrane domain contains eight transmembrane helices and an alleged cytoplasmic loop that contains two additional helices. The latter region forms a dynamic structure, as evidenced by the alternation of residues near its ends between faced-in and faced-out states. We suggest that this region, together with the two transmembrane helices encompassing it, forms the sugar translocation channel. BglF periplasmic loops are close to the membrane, the first being a reentrant loop. This is the first systematic topological study carried out with an intact phosphotransferase system permease and the first demonstration of a reentrant loop in this group of proteins.
Collapse
Affiliation(s)
- Sharon Yagur-Kroll
- Department of Molecular Biology, The Hebrew University-Hadassah Medical School, P. O. Box 12272, Jerusalem 91120, Israel
| | | |
Collapse
|
26
|
Xia JX, Ikeda M, Shimizu T. ConPred_elite: a highly reliable approach to transmembrane topology predication. Comput Biol Chem 2004; 28:51-60. [PMID: 15022642 DOI: 10.1016/j.compbiolchem.2003.11.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The function of transmembrane (TM) proteins is closely correlated to their TM topology; large quantities of highly reliable TM topology data are becoming increasingly required. We present a new consensus approach for TM topology prediction (ConPred_elite) that can predict the whole topology with accuracies of 0.98 for prokaryotic and 0.95 for eukaryotic proteins on a dataset of experimentally-characterized TM topologies. The predicted yield on the dataset is 30.4% for prokaryotic and 21.5% for eukaryotic proteins. Applying ConPred_elite to predicted TM proteins extracted from 29 prokaryotic and 10 eukaryotic proteomes, we obtained 3871 and 7271 highly reliable TM topologies (yields, 19.8 and 13.3%), respectively. The predicted TM topology data may contribute to further research into a comprehensive functional classification and identification of TM proteins based on information of the topology.
Collapse
Affiliation(s)
- Jun-Xiong Xia
- Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Hirosaki 036-8561,Japan
| | | | | |
Collapse
|
27
|
Melén K, Krogh A, von Heijne G. Reliability measures for membrane protein topology prediction algorithms. J Mol Biol 2003; 327:735-44. [PMID: 12634065 DOI: 10.1016/s0022-2836(03)00182-7] [Citation(s) in RCA: 167] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We have developed reliability scores for five widely used membrane protein topology prediction methods, and have applied them both on a test set of 92 bacterial plasma membrane proteins with experimentally determined topologies and on all predicted helix bundle membrane proteins in three fully sequenced genomes: Escherichia coli, Saccharomyces cerevisiae and Caenorhabditis elegans. We show that the reliability scores work well for the TMHMM and MEMSAT methods, and that they allow the probability that the predicted topology is correct to be estimated for any protein. We further show that the available test set is biased towards high-scoring proteins when compared to the genome-wide data sets, and provide estimates for the expected prediction accuracy of TMHMM across the three genomes. Finally, we show that the performance of TMHMM is considerably better when limited experimental information (such as the in/out location of a protein's C terminus) is available, and estimate that at least ten percentage points in overall accuracy in whole-genome predictions can be gained in this way.
Collapse
Affiliation(s)
- Karin Melén
- Department of Biochemistry and Biophysics, Stockholm Bioinformatics Center, Stockholm University, SE-106 91 Stockholm, Sweden
| | | | | |
Collapse
|