1
|
Faiz M, Khan SJ, Azim F, Ejaz N. Disclosing the locale of transmembrane proteins within cellular alcove by machine learning approach: systematic review and meta analysis. J Biomol Struct Dyn 2023:1-16. [PMID: 37768108 DOI: 10.1080/07391102.2023.2260490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 09/13/2023] [Indexed: 09/29/2023]
Abstract
Protein subcellular localization is a promising research question in Proteomics and associated fields, including Biological Sciences, Biomedical Engineering, Computational Biology, Bioinformatics, Proteomics, Artificial Intelligence, and Biophysics. However, computational techniques are preferred to explore this attribute for a massive number of proteins. The byproduct of this conjunction yields diversified location identifiers of proteins. These protein subcellular localization identifiers are unique regarding the database used, organisms, Machine Learning Technique, and accuracy. Despite the availability of these identifiers, the majority of the work has been done on the subcellular localization of proteins and, less work has been done specifically on locations of transmembrane proteins. This systematic review accounts for computational techniques implemented on transmembrane protein localization. Moreover, a literature search on PubMed, Science Direct, and IEEE Databases disclosed no systematic review or meta-analysis on the cell's transmembrane protein locale. A Systematic review was formed under the guidelines of PRISMA by using Science Direct, PubMed, and IEEE Databases. Journal publications from 2000 to 2023 were taken into consideration and screened. This review has focused only on computational studies rather than experimental techniques. 1004 studies were reviewed and were categorized as relevant and non-relevant according to inclusion and exclusion criteria. All the screening was done through Endnote after importing citations. This systematic review characterizes the gap in targeting the locale of the transmembrane protein and will aid researchers in exploring its new horizons.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Mehwish Faiz
- Department of Biomedical Engineering, Ziauddin University (FESTM), Karachi, Pakistan
- Department of Electrical Engineering, Ziauddin University, (FESTM), Karachi, Pakistan
| | - Saad Jawaid Khan
- Department of Biomedical Engineering, Ziauddin University (FESTM), Karachi, Pakistan
| | - Fahad Azim
- Department of Electrical Engineering, Ziauddin University, (FESTM), Karachi, Pakistan
| | - Nazia Ejaz
- Balochistan University of Engineering and Technology, Khuzdar, Pakistan
| |
Collapse
|
2
|
Onufriev AV, Izadi S. Water models for biomolecular simulations. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2017. [DOI: 10.1002/wcms.1347] [Citation(s) in RCA: 94] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Alexey V. Onufriev
- Department of Physics; Virginia Tech; Blacksburg VA USA
- Department of Computer Science; Virginia Tech; Blacksburg VA USA
- Center for Soft Matter and Biological Physics; Virginia Tech; Blacksburg VA USA
| | - Saeed Izadi
- Early Stage Pharmaceutical Development; Genentech Inc.; South San Francisco, CA USA
| |
Collapse
|
3
|
Dutagaci B, Wittayanarakul K, Mori T, Feig M. Discrimination of Native-like States of Membrane Proteins with Implicit Membrane-based Scoring Functions. J Chem Theory Comput 2017; 13:3049-3059. [PMID: 28475346 DOI: 10.1021/acs.jctc.7b00254] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
A scoring protocol based on implicit membrane-based scoring functions and a new protocol for optimizing the positioning of proteins inside the membrane was evaluated for its capacity to discriminate native-like states from misfolded decoys. A decoy set previously established by the Baker lab (Proteins: Struct., Funct., Genet. 2006, 62, 1010-1025) was used along with a second set that was generated to cover higher resolution models. The Implicit Membrane Model 1 (IMM1), IMM1 model with CHARMM 36 parameters (IMM1-p36), generalized Born with simple switching (GBSW), and heterogeneous dielectric generalized Born versions 2 (HDGBv2) and 3 (HDGBv3) were tested along with the new HDGB van der Waals (HDGBvdW) model that adds implicit van der Waals contributions to the solvation free energy. For comparison, scores were also calculated with the distance-scaled finite ideal-gas reference (DFIRE) scoring function. Z-scores for native state discrimination, energy vs root-mean-square deviation (RMSD) correlations, and the ability to select the most native-like structures as top-scoring decoys were evaluated to assess the performance of the scoring functions. Ranking of the decoys in the Baker set that were relatively far from the native state was challenging and dominated largely by packing interactions that were captured best by DFIRE with less benefit of the implicit membrane-based models. Accounting for the membrane environment was much more important in the second decoy set where especially the HDGB-based scoring functions performed very well in ranking decoys and providing significant correlations between scores and RMSD, which shows promise for improving membrane protein structure prediction and refinement applications. The new membrane structure scoring protocol was implemented in the MEMScore web server ( http://feiglab.org/memscore ).
Collapse
Affiliation(s)
- Bercem Dutagaci
- Department of Biochemistry and Molecular Biology, Michigan State University , East Lansing, Michigan, United States
| | - Kitiyaporn Wittayanarakul
- Department of Natural Resource and Environmental Management, Faculty of Applied Science and Engineering, Khon Kaen University , Nong Khai Campus, Nong Khai 43000, Thailand
| | - Takaharu Mori
- Theoretical Molecular Science Laboratory, RIKEN , Wako-shi, Japan
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University , East Lansing, Michigan, United States
| |
Collapse
|
4
|
Mukhopadhyay A, Aguilar BH, Tolokh IS, Onufriev AV. Introducing Charge Hydration Asymmetry into the Generalized Born Model. J Chem Theory Comput 2014; 10:1788-1794. [PMID: 24803871 PMCID: PMC3985468 DOI: 10.1021/ct4010917] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Indexed: 12/15/2022]
Abstract
The effect of charge hydration asymmetry (CHA)-non-invariance of solvation free energy upon solute charge inversion-is missing from the standard linear response continuum electrostatics. The proposed charge hydration asymmetric-generalized Born (CHA-GB) approximation introduces this effect into the popular generalized Born (GB) model. The CHA is added to the GB equation via an analytical correction that quantifies the specific propensity of CHA of a given water model; the latter is determined by the charge distribution within the water model. Significant variations in CHA seen in explicit water (TIP3P, TIP4P-Ew, and TIP5P-E) free energy calculations on charge-inverted "molecular bracelets" are closely reproduced by CHA-GB, with the accuracy similar to models such as SEA and 3D-RISM that go beyond the linear response. Compared against reference explicit (TIP3P) electrostatic solvation free energies, CHA-GB shows about a 40% improvement in accuracy over the canonical GB, tested on a diverse set of 248 rigid small neutral molecules (root mean square error, rmse = 0.88 kcal/mol for CHA-GB vs 1.24 kcal/mol for GB) and 48 conformations of amino acid analogs (rmse = 0.81 kcal/mol vs 1.26 kcal/mol). CHA-GB employs a novel definition of the dielectric boundary that does not subsume the CHA effects into the intrinsic atomic radii. The strategy leads to finding a new set of intrinsic atomic radii optimized for CHA-GB; these radii show physically meaningful variation with the atom type, in contrast to the radii set optimized for GB. Compared to several popular radii sets used with the original GB model, the new radii set shows better transferability between different classes of molecules.
Collapse
Affiliation(s)
| | - Boris H. Aguilar
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Igor S. Tolokh
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V. Onufriev
- Department
of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
5
|
Yuzlenko O, Lazaridis T. Membrane protein native state discrimination by implicit membrane models. J Comput Chem 2013; 34:731-8. [PMID: 23224861 PMCID: PMC3584241 DOI: 10.1002/jcc.23189] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Revised: 10/16/2012] [Accepted: 10/28/2012] [Indexed: 02/01/2023]
Abstract
Four implicit membrane models [IMM1, generalized Born (GB)-surface area-implicit membrane (GBSAIM), GB with a simple switching (GBSW), and heterogeneous dielectric GB (HDGB)] were tested for their ability to discriminate the native conformation of five membrane proteins from 450 decoys generated by the Rosetta-Membrane program. The energy ranking of the native state and Z-scores were used to assess the performance of the models. The effect of membrane thickness was examined and was found to be substantial. Quite satisfactory discrimination was achieved with the all-atom IMM1 and GBSW models at 25.4 Å thickness and with the HDGB model at 28.5 Å thickness. The energy components by themselves were not discriminative. Both van der Waals and electrostatic interactions contributed to native state discrimination, to a different extent in each model. Computational efficiency of the models decreased in the order: extended-atom IMM1 > all-atom IMM1 > GBSAIM > GBSW > HDGB. These results encourage the further development and use of implicit membrane models for membrane protein structure prediction.
Collapse
Affiliation(s)
- Olga Yuzlenko
- Department of Chemistry, City College of the City University of New York, 160 Convent Avenue, New York, New York 10031, USA
| | | |
Collapse
|
6
|
Shi Q, Padmanabhan R, Villegas CJ, Gu S, Jiang JX. Membrane topological structure of neutral system N/A amino acid transporter 4 (SNAT4) protein. J Biol Chem 2011; 286:38086-38094. [PMID: 21917917 DOI: 10.1074/jbc.m111.220277] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Members of system N/A amino acid transporter (SNAT) family mediate transport of neutral amino acids, including l-alanine, l-glutamine, and l-histidine, across the plasma membrane and are involved in a variety of cellular functions. By using chemical labeling, glycosylation, immunofluorescence combined with molecular modeling approaches, we resolved the membrane topological structure of SNAT4, a transporter expressed predominantly in liver. To analyze the orientation using the chemical labeling and biotinylation approach, the "Cys-null" mutant of SNAT4 was first generated by mutating all five endogenous cysteine residues. Based on predicted topological structures, a single cysteine residue was introduced individually into all possible nontransmembrane domains of the Cys-null mutant. The cells expressing these mutants were labeled with N-biotinylaminoethyl methanethiosulfonate, a membrane-impermeable cysteine-directed reagent. We mapped the orientations of N- and C-terminal domains. There are three extracellular loop domains, and among them, the second loop domain is the largest that spans from amino acid residue ∼242 to ∼335. The orientation of this domain was further confirmed by the identification of two N-glycosylated residues, Asn-260 and Asn-264. Together, we showed that SNAT4 contains 10 transmembrane domains with extracellular N and C termini and a large N-glycosylated, extracellular loop domain. This is the first report concerning membrane topological structure of mammalian SNAT transporters, which will provide important implications for our understanding of structure-function of the members in this amino acid transporter family.
Collapse
Affiliation(s)
- Qian Shi
- Department of Biochemistry, University of Texas Health Science Center, San Antonio, Texas 78229-3900
| | - Rugmani Padmanabhan
- Department of Biochemistry, University of Texas Health Science Center, San Antonio, Texas 78229-3900
| | - Carla J Villegas
- Department of Biochemistry, University of Texas Health Science Center, San Antonio, Texas 78229-3900
| | - Sumin Gu
- Department of Biochemistry, University of Texas Health Science Center, San Antonio, Texas 78229-3900
| | - Jean X Jiang
- Department of Biochemistry, University of Texas Health Science Center, San Antonio, Texas 78229-3900.
| |
Collapse
|
7
|
Alpha-helical topology prediction and generation of distance restraints in membrane proteins. Biophys J 2008; 95:5281-95. [PMID: 18775963 DOI: 10.1529/biophysj.108.132241] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The field of protein structure prediction has seen significant advances in recent years. Researchers have followed a multitude of approaches, including methods based on comparative modeling, fold recognition and threading, and first-principles techniques. It is noteworthy that the structure prediction of membrane proteins is comparatively less studied by researchers in the field. A membrane protein is characterized by a protein structure that extends into or through the lipid-lipid bilayer of a cell. The structure is influenced by the combination of the hydrophobic bilayer region, the direct interaction with the bilayer, and the aqueous external environment. Due to the difficulty in obtaining reliable experimental structures, accurate computational prediction of membrane proteins is of paramount importance. An optimization model has been developed to predict the interhelical interactions in alpha-helical membrane proteins. A database of alpha-helical membrane proteins of known structure and limited sequence identity can be constructed to develop interaction probabilities. By then maximizing the occurrence of highly probable pairwise or three-residue interactions, realistic contacts can be predicted by imposing a number of geometrical constraints. The development of these low distance contacts can provide additional distance restraints for first principles-based approaches to the tertiary structure prediction problem. The proposed approach is shown to successfully predict interhelical contacts in several membrane protein systems, including bovine rhodopsin and the recently released human beta2 adrenergic receptor protein structure.
Collapse
|
8
|
Bagos PG, Liakopoulos TD, Hamodrakas SJ. Algorithms for incorporating prior topological information in HMMs: application to transmembrane proteins. BMC Bioinformatics 2006; 7:189. [PMID: 16597327 PMCID: PMC1523218 DOI: 10.1186/1471-2105-7-189] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2006] [Accepted: 04/05/2006] [Indexed: 11/18/2022] Open
Abstract
Background Hidden Markov Models (HMMs) have been extensively used in computational molecular biology, for modelling protein and nucleic acid sequences. In many applications, such as transmembrane protein topology prediction, the incorporation of limited amount of information regarding the topology, arising from biochemical experiments, has been proved a very useful strategy that increased remarkably the performance of even the top-scoring methods. However, no clear and formal explanation of the algorithms that retains the probabilistic interpretation of the models has been presented so far in the literature. Results We present here, a simple method that allows incorporation of prior topological information concerning the sequences at hand, while at the same time the HMMs retain their full probabilistic interpretation in terms of conditional probabilities. We present modifications to the standard Forward and Backward algorithms of HMMs and we also show explicitly, how reliable predictions may arise by these modifications, using all the algorithms currently available for decoding HMMs. A similar procedure may be used in the training procedure, aiming at optimizing the labels of the HMM's classes, especially in cases such as transmembrane proteins where the labels of the membrane-spanning segments are inherently misplaced. We present an application of this approach developing a method to predict the transmembrane regions of alpha-helical membrane proteins, trained on crystallographically solved data. We show that this method compares well against already established algorithms presented in the literature, and it is extremely useful in practical applications. Conclusion The algorithms presented here, are easily implemented in any kind of a Hidden Markov Model, whereas the prediction method (HMM-TM) is freely available for academic users at , offering the most advanced decoding options currently available.
Collapse
Affiliation(s)
- Pantelis G Bagos
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 157 01, Greece
| | - Theodore D Liakopoulos
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 157 01, Greece
| | - Stavros J Hamodrakas
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 157 01, Greece
| |
Collapse
|
9
|
Mottamal M, Zhang J, Lazaridis T. Energetics of the native and non-native states of the glycophorin transmembrane helix dimer. Proteins 2006; 62:996-1009. [PMID: 16395713 DOI: 10.1002/prot.20844] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Using an implicit membrane model (IMM1), we examine whether the structure of the transmembrane domain of Glycophorin A (GpA) could be predicted based on energetic considerations alone. The energetics of native GpA shows that van der Waals interactions make the largest contribution to stability. Although specific electrostatic interactions are stabilizing, the overall electrostatic contribution is close to zero. The GXXXG motif contributes significantly to stability, but residues outside this motif contribute almost twice as much. To generate non-native states a global conformational search was done on two segments of GpA: an 18-residue peptide (GpA74-91) that is embedded in the membrane and a 29-residue peptide (GpA70-98) that has additional polar residues flanking the transmembrane region. Simulated annealing was done on a large number of conformations generated from parallel, antiparallel, left- and right-handed starting structures by rotating each helix at 20 degrees intervals around its helical axis. Several crossing points along the helix dimer were considered. For 18-residue parallel topology, an ensemble of native-like structures was found at the lowest effective energy region; the effective energy is lowest for a right-handed structure with an RMSD of 1.0 A from the solid-state NMR structure with correct orientation of the helices. For the 29-residue peptide, the effective energies of several left-handed structures were lower than that of the native, right-handed structure. This could be due to deficiencies in modeling the interactions between charged sidechains and/or omission of the sidechain entropy contribution to the free energy. For 18-residue antiparallel topology, both IMM1 and a Generalized Born model give effective energies that are lower than that of the native structure. In contrast, the Poisson-Boltzmann solvation model gives lower effective energy for the parallel topology, largely because the electrostatic solvation energy is more favorable for the parallel structure. IMM1 seems to underestimate the solvation free energy advantage when the CO and NH dipoles just outside the membrane are parallel. This highlights the importance of electrostatic interactions even when these are not obvious by looking at the structures.
Collapse
|