1
|
Kushwah AS, Dixit H, Upadhyay V, Verma SK, Prasad R. The study of iron- and copper-binding proteome of Fusarium oxysporum and its effector candidates. Proteins 2024; 92:1097-1112. [PMID: 38666709 DOI: 10.1002/prot.26696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 03/26/2024] [Accepted: 04/08/2024] [Indexed: 08/07/2024]
Abstract
Fusarium oxysporum f.sp. lycopersici is a phytopathogen which causes vascular wilt disease in tomato plants. The survival tactics of both pathogens and hosts depend on intricate interactions between host plants and pathogenic microbes. Iron-binding proteins (IBPs) and copper-binding proteins (CBPs) play a crucial role in these interactions by participating in enzyme reactions, virulence, metabolism, and transport processes. We employed high-throughput computational tools at the sequence and structural levels to investigate the IBPs and CBPs of F. oxysporum. A total of 124 IBPs and 37 CBPs were identified in the proteome of Fusarium. The ranking of amino acids based on their affinity for binding with iron is Glu > His> Asp > Asn > Cys, and for copper is His > Asp > Cys respectively. The functional annotation, determination of subcellular localization, and Gene Ontology analysis of these putative IBPs and CBPs have unveiled their potential involvement in a diverse array of cellular and biological processes. Three iron-binding glycosyl hydrolase family proteins, along with four CBPs with carbohydrate-binding domains, have been identified as potential effector candidates. These proteins are distinct from the host Solanum lycopersicum proteome. Moreover, they are known to be located extracellularly and function as enzymes that degrade the host cell wall during pathogen-host interactions. The insights gained from this report on the role of metal ions in plant-pathogen interactions can help develop a better understanding of their fundamental biology and control vascular wilt disease in tomato plants.
Collapse
Affiliation(s)
- Ankita Singh Kushwah
- Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
| | - Himisha Dixit
- Centre for Computational Biology & Bioinformatics, Central University of Himachal Pradesh, Kangra, Himachal Pradesh, India
| | - Vipin Upadhyay
- Centre for Computational Biology & Bioinformatics, Central University of Himachal Pradesh, Kangra, Himachal Pradesh, India
| | - Shailender Kumar Verma
- Centre for Computational Biology & Bioinformatics, Central University of Himachal Pradesh, Kangra, Himachal Pradesh, India
- Department of Environmental Studies, University of Delhi, North Campus, Delhi, India
| | - Ramasare Prasad
- Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
| |
Collapse
|
2
|
Jia P, Zhang F, Wu C, Li M. A comprehensive review of protein-centric predictors for biomolecular interactions: from proteins to nucleic acids and beyond. Brief Bioinform 2024; 25:bbae162. [PMID: 38739759 PMCID: PMC11089422 DOI: 10.1093/bib/bbae162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 02/17/2024] [Accepted: 03/31/2024] [Indexed: 05/16/2024] Open
Abstract
Proteins interact with diverse ligands to perform a large number of biological functions, such as gene expression and signal transduction. Accurate identification of these protein-ligand interactions is crucial to the understanding of molecular mechanisms and the development of new drugs. However, traditional biological experiments are time-consuming and expensive. With the development of high-throughput technologies, an increasing amount of protein data is available. In the past decades, many computational methods have been developed to predict protein-ligand interactions. Here, we review a comprehensive set of over 160 protein-ligand interaction predictors, which cover protein-protein, protein-nucleic acid, protein-peptide and protein-other ligands (nucleotide, heme, ion) interactions. We have carried out a comprehensive analysis of the above four types of predictors from several significant perspectives, including their inputs, feature profiles, models, availability, etc. The current methods primarily rely on protein sequences, especially utilizing evolutionary information. The significant improvement in predictions is attributed to deep learning methods. Additionally, sequence-based pretrained models and structure-based approaches are emerging as new trends.
Collapse
Affiliation(s)
- Pengzhen Jia
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
| | - Fuhao Zhang
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
- College of Information Engineering, Northwest A&F University, No. 3 Taicheng Road, Yangling, Shaanxi 712100, China
| | - Chaojin Wu
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, 932 Lushan Road(S), Changsha 410083, China
| |
Collapse
|
3
|
Zheng H, Zhang H, Zhong J, Gucwa M, Zhang Y, Ma H, Deng L, Mao L, Minor W, Wang N. PinMyMetal: A hybrid learning system to accurately model metal binding sites in macromolecules. RESEARCH SQUARE 2024:rs.3.rs-3908734. [PMID: 38463967 PMCID: PMC10925427 DOI: 10.21203/rs.3.rs-3908734/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Metal ions are vital components in many proteins for the inference and engineering of protein function, with coordination complexity linked to structural (4-residue predominate), catalytic (3-residue predominate), or regulatory (2-residue predominate) roles. Computational tools for modeling metal ions in protein structures, especially for transient, reversible, and concentration-dependent regulatory sites, remain immature. We present PinMyMetal (PMM), a sophisticated hybrid machine learning system for predicting zinc ion localization and environment in macromolecular structures. Compared to other predictors, PMM excels in predicting regulatory sites (median deviation of 0.34 Å), demonstrating superior accuracy in locating catalytic sites (median deviation of 0.27 Å) and structural sites (median deviation of 0.14 Å). PMM assigns a certainty score to each predicted site based on local structural and physicochemical features independent of homolog presence. Interactive validation through our server, CheckMyMetal, expands PMM's scope, enabling it to pinpoint and validates diverse functional zinc sites from different structure sources (predicted structures, cryo-EM and crystallography). This facilitates residue-wise assessment and robust metal binding site design. The lightweight PMM system demands minimal computing resources and is available at https://PMM.biocloud.top. While currently trained on zinc, the PMM workflow can easily adapt to other metals through expanded training data.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Lei Deng
- Hunan University College of Biology
| | | | | | - Nasui Wang
- Division of Endocrinology and Metabolism, The First Affiliated Hospital of Shantou University Medical College
| |
Collapse
|
4
|
Laveglia V, Bazayeva M, Andreini C, Rosato A. Hunting down zinc(II)-binding sites in proteins with distance matrices. Bioinformatics 2023; 39:btad653. [PMID: 37878807 PMCID: PMC10630175 DOI: 10.1093/bioinformatics/btad653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/17/2023] [Accepted: 10/23/2023] [Indexed: 10/27/2023] Open
Abstract
MOTIVATION In recent years, high-throughput sequencing technologies have made available the genome sequences of a huge variety of organisms. However, the functional annotation of the encoded proteins often still relies on low-throughput and costly experimental studies. Bioinformatics approaches offer a promising alternative to accelerate this process. In this work, we focus on the binding of zinc(II) ions, which is needed for 5%-10% of any organism's proteins to achieve their physiologically relevant form. RESULTS To implement a predictor of zinc(II)-binding sites in the 3D structures of proteins, we used a neural network, followed by a filter of the network output against the local structure of all known sites. The latter was implemented as a function comparing the distance matrices of the Cα and Cβ atoms of the sites. We called the resulting tool Master of Metals (MOM). The structural models for the entire proteome of an organism generated by AlphaFold can be used as input to our tool in order to achieve annotation at the whole organism level within a few hours. To demonstrate this, we applied MOM to the yeast proteome, obtaining a precision of about 76%, based on data for homologous proteins. AVAILABILITY AND IMPLEMENTATION Master of Metals has been implemented in Python and is available at https://github.com/cerm-cirmmp/Master-of-metals.
Collapse
Affiliation(s)
- Vincenzo Laveglia
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
| | - Milana Bazayeva
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
- Magnetic Resonance Center (CERM), University of Florence, Sesto Fiorentino 50019, Italy
| | - Claudia Andreini
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
- Magnetic Resonance Center (CERM), University of Florence, Sesto Fiorentino 50019, Italy
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Sesto Fiorentino 50019, Italy
| | - Antonio Rosato
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
- Magnetic Resonance Center (CERM), University of Florence, Sesto Fiorentino 50019, Italy
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Sesto Fiorentino 50019, Italy
| |
Collapse
|
5
|
Ravnik V, Jukič M, Bren U. Identifying Metal Binding Sites in Proteins Using Homologous Structures, the MADE Approach. J Chem Inf Model 2023; 63:5204-5219. [PMID: 37557084 PMCID: PMC10466382 DOI: 10.1021/acs.jcim.3c00558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Indexed: 08/11/2023]
Abstract
In order to identify the locations of metal ions in the binding sites of proteins, we have developed a method named the MADE (MAcromolecular DEnsity and Structure Analysis) approach. The MADE approach represents an evolution of our previous toolset, the ProBiS H2O (MD) methodology, for the identification of conserved water molecules. Our method uses experimental structures of proteins homologous to a query, which are subsequently superimposed upon it. Areas with a particular species present in a similar location among many homologous protein structures are identified using a clustering algorithm. Dense clusters likely represent positions containing species important to the query protein structure or function. We analyze well-characterized apo protein structures and show that the MADE approach can identify clusters corresponding to the expected positions of metal ions in their binding sites. The greatest advantage of our method lies in its generality. It can in principle be applied to any species found in protein records; it is not only limited to metal ions. We additionally demonstrate that the MADE approach can be successfully applied to predict the location of cofactors in computer-modeled structures, e.g., via AlphaFold. We also conduct a careful protein superposition method comparison and find our methodology robust and the results largely independent of the selected protein superposition algorithm. We postulate that with increasing structural data availability, additional applications of the MADE approach will be possible such as non-protein systems, water network identification, protein binding site elaboration, and analysis of binding events, all in a dynamic manner. We have implemented the MADE approach as a plugin for the PyMOL molecular visualization tool. The MADE plugin is available free of charge at https://gitlab.com/Jukic/made_software.
Collapse
Affiliation(s)
- Vid Ravnik
- Faculty
of Chemistry and Chemical Engineering, University
of Maribor, Smetanova
ulica 17, Maribor SI-2000, Slovenia
| | - Marko Jukič
- Faculty
of Chemistry and Chemical Engineering, University
of Maribor, Smetanova
ulica 17, Maribor SI-2000, Slovenia
- The
Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, Koper SI-6000, Slovenia
- Institute
for Environmental Protection and Sensors, Beloruska ulica 7, Maribor SI-2000, Slovenia
| | - Urban Bren
- Faculty
of Chemistry and Chemical Engineering, University
of Maribor, Smetanova
ulica 17, Maribor SI-2000, Slovenia
- The
Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, Koper SI-6000, Slovenia
- Institute
for Environmental Protection and Sensors, Beloruska ulica 7, Maribor SI-2000, Slovenia
| |
Collapse
|
6
|
Cheng Y, Wang H, Xu H, Liu Y, Ma B, Chen X, Zeng X, Wang X, Wang B, Shiau C, Ovchinnikov S, Su XD, Wang C. Co-evolution-based prediction of metal-binding sites in proteomes by machine learning. Nat Chem Biol 2023; 19:548-555. [PMID: 36593274 DOI: 10.1038/s41589-022-01223-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 11/08/2022] [Indexed: 01/03/2023]
Abstract
Metal ions have various important biological roles in proteins, including structural maintenance, molecular recognition and catalysis. Previous methods of predicting metal-binding sites in proteomes were based on either sequence or structural motifs. Here we developed a co-evolution-based pipeline named 'MetalNet' to systematically predict metal-binding sites in proteomes. We applied MetalNet to proteomes of four representative prokaryotic species and predicted 4,849 potential metalloproteins, which substantially expands the currently annotated metalloproteomes. We biochemically and structurally validated previously unannotated metal-binding sites in several proteins, including apo-citrate lyase phosphoribosyl-dephospho-CoA transferase citX, an Escherichia coli enzyme lacking structural or sequence homology to any known metalloprotein (Protein Data Bank (PDB) codes: 7DCM and 7DCN ). MetalNet also successfully recapitulated all known zinc-binding sites from the human spliceosome complex. The pipeline of MetalNet provides a unique and enabling tool for interrogating the hidden metalloproteome and studying metal biology.
Collapse
Affiliation(s)
- Yao Cheng
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China
- Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Haobo Wang
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China
- Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Hua Xu
- State Key Laboratory of Protein and Plant Gene Research, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing, China
| | - Yuan Liu
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China.
- Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China.
| | - Bin Ma
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China
- Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Xuemin Chen
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China
- Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Xin Zeng
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Xianghe Wang
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China
- Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Bo Wang
- State Key Laboratory of Protein and Plant Gene Research, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing, China
| | | | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellow, Harvard University, Cambridge, MA, USA
| | - Xiao-Dong Su
- State Key Laboratory of Protein and Plant Gene Research, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing, China.
| | - Chu Wang
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing, China.
- Department of Chemical Biology, College of Chemistry and Molecular Engineering, Peking University, Beijing, China.
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China.
| |
Collapse
|
7
|
Zhang J, Zhou F, Liang X, Yang G. SCAMPER: Accurate Type-Specific Prediction of Calcium-Binding Residues Using Sequence-Derived Features. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1406-1416. [PMID: 35536812 DOI: 10.1109/tcbb.2022.3173437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Understanding molecular mechanisms involved in calcium-protein interactions and modeling corresponding docking rely on the accurate identification of calcium-binding residues (CaBRs). The defects of experimentally annotating protein functions enhances the development of computational approaches that correctly identify calcium-binding interactions. Studies have reported that current methods severely cross-predict residues that interact with other types of molecules (e.g., nucleic acids, proteins, and small ligands) as CaBRs. In this study, a novel predictor named SCAMPER (Selective CAlciuM-binding PrEdictoR) is proposed for the accurate and specific prediction of CaBRs. SCAMPER is designed using newly compiled dataset with complete UniProt sequences and annotations, which include calcium-binding, nucleic acid-binding, protein-binding, and small ligand-binding residues. We use a novel designed two-layer scheme to perform predictions as well as penalize cross-predictions. Empirical tests on an independent test dataset reveals that the proposed method significantly outperforms state-of-the-art predictors. SCAMPER is proved to be capable of distinguishing CaBRs from different types of metal-ion binding residues. We further perform CaBRs predictions on the whole human proteome, and use the results to hypothesize calcium-binding proteins (CaBPs). The latest experimental verified CaBPs and GO analysis prove the accuracy of our predictions. We implement the proposed method and share the data at http://www.inforstation.com/webservers/SCAMPER/.
Collapse
|
8
|
Sun S, Xu H, Xie Y, Sanchez JE, Guo W, Liu D, Li L. HIT-2: Implementing machine learning algorithms to treat bound ions in biomolecules. Comput Struct Biotechnol J 2023; 21:1383-1389. [PMID: 36817955 PMCID: PMC9929202 DOI: 10.1016/j.csbj.2023.02.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 02/06/2023] [Accepted: 02/06/2023] [Indexed: 02/10/2023] Open
Abstract
Electrostatic features are fundamental to protein functions and protein-protein interactions. Studying highly charged biomolecules is challenging given the heterogeneous distribution of the ionic cloud around such biomolecules. Here we report a new computational method, Hybridizing Ions Treatment-2 (HIT-2), which is used to model biomolecule-bound ions using the implicit solvation model. By modeling ions, HIT-2 allows the user to calculate important electrostatic features of the biomolecules. HIT-2 applies an efficient algorithm to calculate the position of bound ions from molecular dynamics simulations. Modeling parameters were optimized by machine learning methods from thousands of datasets. The optimized parameters produced results with errors lower than 0.2 Å. The testing results on bound Ca2+ and Zn2+ in NAMD simulations also proved that HIT-2 can effectively identify bound ion types, numbers, and positions. Also, multiple tests performed on HIT-2 suggest the method can handle biomolecules that undergo remarkable conformational changes. HIT-2 can significantly improve electrostatic calculations for many problems in computational biophysics.
Collapse
Affiliation(s)
- Shengjie Sun
- Computational Science Program, The University of Texas at El Paso, 500 W University Ave, TX 79968, USA
| | - Honglun Xu
- Computational Science Program, The University of Texas at El Paso, 500 W University Ave, TX 79968, USA
| | - Yixin Xie
- Department of Information Technology, College of Computing and Software Engineering, Kennesaw State University, 1000 Chastain Rd NW, Kennesaw, GA 30144, USA
| | - Jason E. Sanchez
- Computational Science Program, The University of Texas at El Paso, 500 W University Ave, TX 79968, USA
| | - Wenhan Guo
- Computational Science Program, The University of Texas at El Paso, 500 W University Ave, TX 79968, USA
| | - Dongfang Liu
- Department of Computer Engineering, Rochester Institute of Technology, Rochester, NY 14623, USA
| | - Lin Li
- Computational Science Program, The University of Texas at El Paso, 500 W University Ave, TX 79968, USA
- Department of Physics, the University of Texas at El Paso, 500 W University Ave, TX 79968, USA
| |
Collapse
|
9
|
Liao J, Wang Q, Wu F, Huang Z. In Silico Methods for Identification of Potential Active Sites of Therapeutic Targets. Molecules 2022; 27:7103. [PMID: 36296697 PMCID: PMC9609013 DOI: 10.3390/molecules27207103] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/12/2022] [Accepted: 08/25/2022] [Indexed: 07/30/2023] Open
Abstract
Target identification is an important step in drug discovery, and computer-aided drug target identification methods are attracting more attention compared with traditional drug target identification methods, which are time-consuming and costly. Computer-aided drug target identification methods can greatly reduce the searching scope of experimental targets and associated costs by identifying the diseases-related targets and their binding sites and evaluating the druggability of the predicted active sites for clinical trials. In this review, we introduce the principles of computer-based active site identification methods, including the identification of binding sites and assessment of druggability. We provide some guidelines for selecting methods for the identification of binding sites and assessment of druggability. In addition, we list the databases and tools commonly used with these methods, present examples of individual and combined applications, and compare the methods and tools. Finally, we discuss the challenges and limitations of binding site identification and druggability assessment at the current stage and provide some recommendations and future perspectives.
Collapse
Affiliation(s)
- Jianbo Liao
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Key Laboratory of Computer-Aided Drug Design of Dongguan City, Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan 523808, China
- The Second School of Clinical Medicine, Guangdong Medical University, Dongguan 523808, China
| | - Qinyu Wang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Key Laboratory of Computer-Aided Drug Design of Dongguan City, Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan 523808, China
| | - Fengxu Wu
- Hubei Key Laboratory of Wudang Local Chinese Medicine Research, School of Pharmaceutical Sciences, Hubei University of Medicine, Shiyan 442000, China
| | - Zunnan Huang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Key Laboratory of Computer-Aided Drug Design of Dongguan City, Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan 523808, China
- Marine Biomedical Research Institute of Guangdong Zhanjiang, Zhanjiang 524023, China
| |
Collapse
|
10
|
Andreini C, Rosato A. Structural Bioinformatics and Deep Learning of Metalloproteins: Recent Advances and Applications. Int J Mol Sci 2022; 23:7684. [PMID: 35887033 PMCID: PMC9323969 DOI: 10.3390/ijms23147684] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 07/04/2022] [Accepted: 07/06/2022] [Indexed: 02/04/2023] Open
Abstract
All living organisms require metal ions for their energy production and metabolic and biosynthetic processes. Within cells, the metal ions involved in the formation of adducts interact with metabolites and macromolecules (proteins and nucleic acids). The proteins that require binding to one or more metal ions in order to be able to carry out their physiological function are called metalloproteins. About one third of all protein structures in the Protein Data Bank involve metalloproteins. Over the past few years there has been tremendous progress in the number of computational tools and techniques making use of 3D structural information to support the investigation of metalloproteins. This trend has been boosted by the successful applications of neural networks and machine/deep learning approaches in molecular and structural biology at large. In this review, we discuss recent advances in the development and availability of resources dealing with metalloproteins from a structure-based perspective. We start by addressing tools for the prediction of metal-binding sites (MBSs) using structural information on apo-proteins. Then, we provide an overview of the methods for and lessons learned from the structural comparison of MBSs in a fold-independent manner. We then move to describing databases of metalloprotein/MBS structures. Finally, we summarizing recent ML/DL applications enhancing the functional interpretation of metalloprotein structures.
Collapse
Affiliation(s)
- Claudia Andreini
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
- Magnetic Resonance Center (CERM), Department of Chemistry, University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
| | - Antonio Rosato
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
- Magnetic Resonance Center (CERM), Department of Chemistry, University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
| |
Collapse
|
11
|
Aptekmann AA, Buongiorno J, Giovannelli D, Glamoclija M, Ferreiro DU, Bromberg Y. mebipred: identifying metal binding potential in protein sequence. Bioinformatics 2022; 38:3532-3540. [PMID: 35639953 PMCID: PMC9272798 DOI: 10.1093/bioinformatics/btac358] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 03/27/2022] [Accepted: 05/22/2022] [Indexed: 11/23/2022] Open
Abstract
Motivation metal-binding proteins have a central role in maintaining life processes. Nearly one-third of known protein structures contain metal ions that are used for a variety of needs, such as catalysis, DNA/RNA binding, protein structure stability, etc. Identifying metal-binding proteins is thus crucial for understanding the mechanisms of cellular activity. However, experimental annotation of protein metal-binding potential is severely lacking, while computational techniques are often imprecise and of limited applicability. Results we developed a novel machine learning-based method, mebipred, for identifying metal-binding proteins from sequence-derived features. This method is over 80% accurate in recognizing proteins that bind metal ion-containing ligands; the specific identity of 11 ubiquitously present metal ions can also be annotated. mebipred is reference-free, i.e. no sequence alignments are involved, and is thus faster than alignment-based methods; it is also more accurate than other sequence-based prediction methods. Additionally, mebipred can identify protein metal-binding capabilities from short sequence stretches, e.g. translated sequencing reads, and, thus, may be useful for the annotation of metal requirements of metagenomic samples. We performed an analysis of available microbiome data and found that ocean, hot spring sediments and soil microbiomes use a more diverse set of metals than human host-related ones. For human microbiomes, physiological conditions explain the observed metal preferences. Similarly, subtle changes in ocean sample ion concentration affect the abundance of relevant metal-binding proteins. These results highlight mebipred’s utility in analyzing microbiome metal requirements. Availability and implementation mebipred is available as a web server at services.bromberglab.org/mebipred and as a standalone package at https://pypi.org/project/mymetal/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- A A Aptekmann
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ, 08873, USA.,Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ, 08901, USA
| | | | - D Giovannelli
- Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ, 08901, USA.,Department of Biology, University of Naples Federico II, Naples, Italy.,Institute for Marine Biological Resources and Biotechnology-IRBIM, National Research Council of Italy, CNR, Ancona, Italy
| | - M Glamoclija
- Department of Earth and Environmental Sciences, Rutgers University, New Brunswick, NJ, 07102, USA
| | - D U Ferreiro
- Protein Physiology Lab, Departamento de Quimica Biologica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires-CONICET-IQUIBICEN, Buenos Aires, 1428, Argentina
| | - Y Bromberg
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ, 08873, USA
| |
Collapse
|
12
|
You X, Hu X, Feng Z, Wang Z, Hao S, Yang C. Recognizing Protein-metal Ion Ligands Binding Residues by Random Forest Algorithm with Adding Orthogonal Properties. Comput Biol Chem 2022; 98:107693. [DOI: 10.1016/j.compbiolchem.2022.107693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 05/02/2022] [Accepted: 05/03/2022] [Indexed: 11/16/2022]
|
13
|
Abstract
Metalloproteins play diverse and critical functions in all living systems, and their dysfunctional forms are closely related to many human diseases. The development of methods that enable comprehensive mapping of metalloproteome is of great interest to help elucidate crucial roles of metalloproteins in both physiology and pathology, as well as to discover new metalloproteins. We herein briefly review recent progress in the field of metalloproteomics and provide future outlooks.
Collapse
Affiliation(s)
- Xin Zeng
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing 100871, China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Yao Cheng
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing 100871, China.,College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Chu Wang
- Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, Peking University, Beijing 100871, China.,College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| |
Collapse
|
14
|
Nandy A, Duan C, Taylor MG, Liu F, Steeves AH, Kulik HJ. Computational Discovery of Transition-metal Complexes: From High-throughput Screening to Machine Learning. Chem Rev 2021; 121:9927-10000. [PMID: 34260198 DOI: 10.1021/acs.chemrev.1c00347] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Transition-metal complexes are attractive targets for the design of catalysts and functional materials. The behavior of the metal-organic bond, while very tunable for achieving target properties, is challenging to predict and necessitates searching a wide and complex space to identify needles in haystacks for target applications. This review will focus on the techniques that make high-throughput search of transition-metal chemical space feasible for the discovery of complexes with desirable properties. The review will cover the development, promise, and limitations of "traditional" computational chemistry (i.e., force field, semiempirical, and density functional theory methods) as it pertains to data generation for inorganic molecular discovery. The review will also discuss the opportunities and limitations in leveraging experimental data sources. We will focus on how advances in statistical modeling, artificial intelligence, multiobjective optimization, and automation accelerate discovery of lead compounds and design rules. The overall objective of this review is to showcase how bringing together advances from diverse areas of computational chemistry and computer science have enabled the rapid uncovering of structure-property relationships in transition-metal chemistry. We aim to highlight how unique considerations in motifs of metal-organic bonding (e.g., variable spin and oxidation state, and bonding strength/nature) set them and their discovery apart from more commonly considered organic molecules. We will also highlight how uncertainty and relative data scarcity in transition-metal chemistry motivate specific developments in machine learning representations, model training, and in computational chemistry. Finally, we will conclude with an outlook of areas of opportunity for the accelerated discovery of transition-metal complexes.
Collapse
Affiliation(s)
- Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Adam H Steeves
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
15
|
Fleminger G, Dayan A. The moonlighting activities of dihydrolipoamide dehydrogenase: Biotechnological and biomedical applications. J Mol Recognit 2021; 34:e2924. [PMID: 34164859 DOI: 10.1002/jmr.2924] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 06/13/2021] [Indexed: 12/13/2022]
Abstract
Dihydrolipoamide dehydrogenase (DLDH) is a homodimeric flavin-dependent enzyme that catalyzes the NAD+ -dependent oxidation of dihydrolipoamide. The enzyme is part of several multi-enzyme complexes such as the Pyruvate Dehydrogenase system that transforms pyruvate into acetyl-co-A. Concomitantly with its redox activity, DLDH produces Reactive Oxygen Species (ROS), which are involved in cellular apoptotic processes. DLDH possesses several moonlighting functions. One of these is the capacity to adhere to metal-oxides surfaces. This was first exemplified by the presence of an exocellular form of the enzyme on the cell-wall surface of Rhodococcus ruber. This capability was evolutionarily conserved and identified in the human, mitochondrial, DLDH. The enzyme was modified with Arg-Gly-Asp (RGD) groups, which enabled its interaction with integrin-rich cancer cells followed by "integrin-assisted-endocytosis." This allowed harnessing the enzyme for cancer therapy. Combining the TiO2 -binding property with DLDH's ROS-production, enabled us to develop several medical applications including improving oesseointegration of TiO2 -based implants and photodynamic treatment for melanoma. The TiO2 -binding sites of both the bacterial and human DLDH's were identified on the proteins' molecules at regions that overlap with the binding site of E3-binding protein (E3BP). This protein is essential in forming the multiunit structure of PDC. Another moonlighting activity of DLDH, which is described in this Review, is its DNA-binding capacity that may affect DNA chelation and shredding leading to apoptotic processes in living cells. The typical ROS-generation by DLDH, which occurs in association with its enzymatic activity and its implications in cancer and apoptotic cell death are also discussed.
Collapse
Affiliation(s)
- Gideon Fleminger
- The Shmunis School of Biomedicine and Cancer Research, The George Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| | - Avraham Dayan
- The Shmunis School of Biomedicine and Cancer Research, The George Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv, Israel
| |
Collapse
|
16
|
Serafini A. Interplay between central carbon metabolism and metal homeostasis in mycobacteria and other human pathogens. MICROBIOLOGY (READING, ENGLAND) 2021; 167. [PMID: 34080971 DOI: 10.1099/mic.0.001060] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Bacterial nutrition is a fundamental aspect of pathogenesis. While the host environment is in principle nutrient-rich, hosts have evolved strategies to interfere with nutrient acquisition by pathogens. In turn, pathogens have developed mechanisms to circumvent these restrictions. Changing the availability of bioavailable metal ions is a common strategy used by hosts to limit bacterial replication. Macrophages and neutrophils withhold iron, manganese, and zinc ions to starve bacteria. Alternatively, they can release manganese, zinc, and copper ions to intoxicate microorganisms. Metals are essential micronutrients and participate in catalysis, macromolecular structure, and signalling. This review summarises our current understanding of how central carbon metabolism in pathogens adapts to local fluctuations in free metal ion concentrations. We focus on the transcriptomics and proteomics data produced in studies of the iron-sparing response in Mycobacterium tuberculosis, the etiological agent of tuberculosis, and consequently generate a hypothetical model linking trehalose accumulation, succinate secretion and substrate-level phosphorylation in iron-starved M. tuberculosis. This review also aims to highlight a large gap in our knowledge of pathogen physiology: the interplay between metal homeostasis and central carbon metabolism, two cellular processes which are usually studied separately. Integrating metabolism and metal biology would allow the discovery of new weaknesses in bacterial physiology, leading to the development of novel and improved antibacterial therapies.
Collapse
Affiliation(s)
- Agnese Serafini
- Independent researcher 00012 Guidonia Montecelio, Rome, Italy
| |
Collapse
|
17
|
Dutta Gupta D, Mandal I, Nayak C, Jha SN, Bhattacharyya D, Venkatramani R, Mazumdar S. Identification of a copper ion recognition peptide sequence in the subunit II of cytochrome c oxidase: a combined theoretical and experimental study. J Biol Inorg Chem 2021; 26:411-425. [PMID: 33928437 DOI: 10.1007/s00775-021-01867-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 03/23/2021] [Indexed: 11/28/2022]
Abstract
The role of the pentapeptide, NHSFM, derived from the surface exposed part of the metal ion binding loop of the subunit II of cytochrome c oxidase on the maturation of the binuclear purple CuA center of the enzyme has been investigated using several experimental and computational methods. The copper ion was found to form 1:1 complex of the pentapeptide with a binding constant ~ 104 M-1 to 105 M-1, where a 4 ligand coordination from the peptide in a type 2 copper center was revealed. The pH dependence of the metal-peptide was associated with a [Formula: see text] of ~ 10 suggesting deprotonation of the N-terminal amine. EXAFS studies as well as DFT calculations of the metal-peptide complexes revealed pH dependent changes in the metal-ligand bond distances. Spectroscopic properties of the metal peptides calculated from TDDFT studies agreed with the experimental results. Restrained molecular dynamics (RMD) simulations indicated coordination of a carbonyl oxygen from the asparagine (N) side chain and of water molecules apart from histidine (H), methionine (M) and terminal amine of asparagine (N) in a distorted square planar geometry of Cu-NHSFM. Analyses of the backbone distances as well as B-factors for the metal peptide suggested that the peptide backbone becomes more compact and rigid on binding of the metal ion. This indicated that binding of copper ion to this pentapeptide in the protein possibly cause movement of the protein backbone bringing other coordinating residues closer to the copper ion, and thus helping in sequential uptake of copper ions to the protein.
Collapse
Affiliation(s)
- Dwaipayan Dutta Gupta
- Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai, 400 005, India
| | - Imon Mandal
- Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai, 400 005, India
| | - Chandrani Nayak
- Atomic and Molecular Physics Division, Bhabha Atomic Research Centre, Mumbai, 400 085, India
| | - Shambhu Nath Jha
- Atomic and Molecular Physics Division, Bhabha Atomic Research Centre, Mumbai, 400 085, India
| | - Dibyendu Bhattacharyya
- Atomic and Molecular Physics Division, Bhabha Atomic Research Centre, Mumbai, 400 085, India
| | - Ravindra Venkatramani
- Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai, 400 005, India.
| | - Shyamalava Mazumdar
- Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai, 400 005, India.
| |
Collapse
|
18
|
Garg A, Pal D. Inferring metal binding sites in flexible regions of proteins. Proteins 2021; 89:1125-1133. [PMID: 33864411 DOI: 10.1002/prot.26085] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 03/15/2021] [Accepted: 04/12/2021] [Indexed: 12/20/2022]
Abstract
Metal ions are central to the molecular function of many proteins. Thus their knowledge in experimentally determined structure is important; however, such structures often lose bound metal ions during sample preparation. Identification of these metal-binding site(s) becomes difficult when the receptor is novel and/or their conformations differ in the bound/unbound states. Locating such sites in theoretical models also poses a challenge due to the uncertainties with side-chain modeling. We address the problem by employing the Geometric Hashing algorithm to create a template library of functionally important binding sites and match query structures with the available templates. The matching is done on the structure ensemble obtained from coarse-grained molecular dynamics simulation, where metal-specific amino acids are screened to infer the true site. Test on 1347 non-redundant monomer protein structures show that Ca2+ , Zn2+ , Mg2+ , Cu2+ , and Fe3+ binding site residues can be classified at 0.92, 0.95, 0.80, 0.90, and 0.92 aggregate performance (out of 1) across all possible thresholds. The performance for Ca2+ and Zn2+ is notably superior in comparison to state-of-the-art methods like IonCom and MIB. Specific case studies show that additionally predicted metal-binding site residues in proteins have features necessary for ion binding. These include new sites not predicted by other methods. The use of coarse-grained dynamics thus provides a generalized approach to improve metal-binding site prediction. The work is expected to contribute to improving our ability to correctly predict protein molecular function where knowledge of metal binding is a key requirement.
Collapse
Affiliation(s)
- Aditi Garg
- Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru, India
| | - Debnath Pal
- Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru, India
| |
Collapse
|
19
|
Silman I, Shnyrov VL, Ashani Y, Roth E, Nicolas A, Sussman JL, Weiner L. Torpedo californica acetylcholinesterase is stabilized by binding of a divalent metal ion to a novel and versatile 4D motif. Protein Sci 2021; 30:966-981. [PMID: 33686648 PMCID: PMC8040873 DOI: 10.1002/pro.4061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 03/05/2021] [Accepted: 03/05/2021] [Indexed: 11/07/2022]
Abstract
Stabilization of Torpedo californica acetylcholinesterase by the divalent cations Ca+2, Mg+2, and Mn+2 was investigated. All three substantially protect the enzyme from thermal inactivation. Electron paramagnetic resonance revealed one high‐affinity binding site for Mn+2 and several much weaker sites. Differential scanning calorimetry showed a single irreversible thermal transition. All three cations raise both the temperature of the transition and the activation energy, with the transition becoming more cooperative. The crystal structures of the Ca+2 and Mg+2 complexes with Torpedo acetylcholinesterase were solved. A principal binding site was identified. In both cases, it consists of four aspartates (a 4D motif), within which the divalent ion is embedded, together with several water molecules. It makes direct contact with two of the aspartates, and indirect contact, via waters, with the other two. The 4D motif has been identified in 31 acetylcholinesterase sequences and 28 butyrylcholinesterase sequences. Zebrafish acetylcholinesterase also contains the 4D motif; it, too, is stabilized by divalent metal ions. The ASSAM server retrieved 200 other proteins that display the 4D motif, in many of which it is occupied by a divalent cation. It is a very versatile motif, since, even though tightly conserved in terms of RMSD values, it can contain from one to as many as three divalent metal ions, together with a variable number of waters. This novel motif, which binds primarily divalent metal ions, is shared by a broad repertoire of proteins. An animated Interactive 3D Complement (I3DC) is available in Proteopedia at http://proteopedia.org/w/Journal:Protein_Science:3. PDB‐ID(s): 7B38, 7B8E and 7B2W;
Collapse
Affiliation(s)
- Israel Silman
- Department of NeurobiologyWeizmann Institute of ScienceRehovotIsrael
| | - Valery L. Shnyrov
- Department of Biochemistry and Molecular BiologyUniversidad de SalamancaSalamancaSpain
| | - Yacov Ashani
- Department of Biomolecular SciencesWeizmann Institute of ScienceRehovotIsrael
| | - Esther Roth
- Department of NeurobiologyWeizmann Institute of ScienceRehovotIsrael
| | - Anne Nicolas
- Department of NeurobiologyWeizmann Institute of ScienceRehovotIsrael
- Department of Chemical and Structural BiologyWeizmann Institute of ScienceRehovotIsrael
| | - Joel L. Sussman
- Department of Chemical and Structural BiologyWeizmann Institute of ScienceRehovotIsrael
- Structural Proteomics UnitWeizmann Institute of ScienceRehovotIsrael
| | - Lev Weiner
- Department of NeurobiologyWeizmann Institute of ScienceRehovotIsrael
- Department of Chemical Research SupportWeizmann Institute of ScienceRehovotIsrael
| |
Collapse
|
20
|
Jephthah S, Månsson LK, Belić D, Morth JP, Skepö M. Physicochemical Characterisation of KEIF-The Intrinsically Disordered N-Terminal Region of Magnesium Transporter A. Biomolecules 2020; 10:biom10040623. [PMID: 32316569 PMCID: PMC7226168 DOI: 10.3390/biom10040623] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Revised: 04/10/2020] [Accepted: 04/13/2020] [Indexed: 11/23/2022] Open
Abstract
Magnesium transporter A (MgtA) is an active transporter responsible for importing magnesium ions into the cytoplasm of prokaryotic cells. This study focuses on the peptide corresponding to the intrinsically disordered N-terminal region of MgtA, referred to as KEIF. Primary-structure and bioinformatic analyses were performed, followed by studies of the undisturbed single chain using a combination of techniques including small-angle X-ray scattering, circular dichroism spectroscopy, and atomistic molecular-dynamics simulations. Moreover, interactions with large unilamellar vesicles were investigated by using dynamic light scattering, laser Doppler velocimetry, cryogenic transmission electron microscopy, and circular dichroism spectroscopy. KEIF was confirmed to be intrinsically disordered in aqueous solution, although extended and containing little β-structure and possibly PPII structure. An increase of helical content was observed in organic solvent, and a similar effect was also seen in aqueous solution containing anionic vesicles. Interactions of cationic KEIF with anionic vesicles led to the hypothesis that KEIF adsorbs to the vesicle surface through electrostatic and entropic driving forces. Considering this, there is a possibility that the biological role of KEIF is to anchor MgtA in the cell membrane, although further investigation is needed to confirm this hypothesis.
Collapse
Affiliation(s)
- Stéphanie Jephthah
- Division of Theoretical Chemistry, Department of Chemistry, Lund University, Naturvetarvägen 14, 221 00 Lund, Sweden; (S.J.); (L.K.M.)
| | - Linda K. Månsson
- Division of Theoretical Chemistry, Department of Chemistry, Lund University, Naturvetarvägen 14, 221 00 Lund, Sweden; (S.J.); (L.K.M.)
| | - Domagoj Belić
- Division of Physical Chemistry, Department of Chemistry, Lund University, Naturvetarvägen 14, 221 00 Lund, Sweden;
| | - Jens Preben Morth
- Enzyme and Protein Chemistry, Section for Protein Chemistry and Enzyme Technology, Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads, 2800 Kgs. Lyngby, Denmark;
| | - Marie Skepö
- Division of Theoretical Chemistry, Department of Chemistry, Lund University, Naturvetarvägen 14, 221 00 Lund, Sweden; (S.J.); (L.K.M.)
- Lund Institute of Advanced Neutron and X-ray Science (LINXS), Scheelevägen 19, 233 70 Lund, Sweden
- Correspondence: ; Tel.: +46-46-222-33-66
| |
Collapse
|
21
|
Hu X, Feng Z, Zhang X, Liu L, Wang S. The Identification of Metal Ion Ligand-Binding Residues by Adding the Reclassified Relative Solvent Accessibility. Front Genet 2020; 11:214. [PMID: 32265982 PMCID: PMC7096583 DOI: 10.3389/fgene.2020.00214] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 02/24/2020] [Indexed: 11/13/2022] Open
Abstract
Many proteins realize their special functions by binding with specific metal ion ligands during a cell's life cycle. The ability to correctly identify metal ion ligand-binding residues is valuable for the human health and the design of molecular drug. Precisely identifying these residues, however, remains challenging work. We have presented an improved computational approach for predicting the binding residues of 10 metal ion ligands (Zn2+, Cu2+, Fe2+, Fe3+, Co2+, Ca2+, Mg2+, Mn2+, Na+, and K+) by adding reclassified relative solvent accessibility (RSA). The best accuracy of fivefold cross-validation was higher than 77.9%, which was about 16% higher than the previous result on the same dataset. It was found that different reclassification of the RSA information can make different contributions to the identification of specific ligand binding residues. Our study has provided an additional understanding of the effect of the RSA on the identification of metal ion ligand binding residues.
Collapse
Affiliation(s)
| | - Zhenxing Feng
- College of Sciences, Inner Mongolla University of Technology, Hohhot, China
| | - Xiaojin Zhang
- College of Sciences, Inner Mongolla University of Technology, Hohhot, China
| | | | | |
Collapse
|
22
|
Zhao Z, Xu Y, Zhao Y. SXGBsite: Prediction of Protein-Ligand Binding Sites Using Sequence Information and Extreme Gradient Boosting. Genes (Basel) 2019; 10:E965. [PMID: 31771119 PMCID: PMC6947422 DOI: 10.3390/genes10120965] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Revised: 10/19/2019] [Accepted: 11/19/2019] [Indexed: 12/13/2022] Open
Abstract
The prediction of protein-ligand binding sites is important in drug discovery and drug design. Protein-ligand binding site prediction computational methods are inexpensive and fast compared with experimental methods. This paper proposes a new computational method, SXGBsite, which includes the synthetic minority over-sampling technique (SMOTE) and the Extreme Gradient Boosting (XGBoost). SXGBsite uses the position-specific scoring matrix discrete cosine transform (PSSM-DCT) and predicted solvent accessibility (PSA) to extract features containing sequence information. A new balanced dataset was generated by SMOTE to improve classifier performance, and a prediction model was constructed using XGBoost. The parallel computing and regularization techniques enabled high-quality and fast predictions and mitigated overfitting caused by SMOTE. An evaluation using 12 different types of ligand binding site independent test sets showed that SXGBsite performs similarly to the existing methods on eight of the independent test sets with a faster computation time. SXGBsite may be applied as a complement to biological experiments.
Collapse
Affiliation(s)
| | - Yonghong Xu
- School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China
| | | |
Collapse
|
23
|
Haberal İ, Oğul H. Prediction of Protein Metal Binding Sites Using Deep Neural Networks. Mol Inform 2019; 38:e1800169. [PMID: 30977960 DOI: 10.1002/minf.201800169] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 03/29/2019] [Indexed: 11/06/2022]
Abstract
Metals have crucial roles for many physiological, pathological and diagnostic processes. Metal binding proteins or metalloproteins are important for metabolism functions. The proteins that reach the three-dimensional structure by folding show which vital function is fulfilled. The prediction of metal-binding in proteins will be considered as a step-in function assignment for new proteins, which helps to obtain functional proteins in genomic studies, is critical to protein function annotation and drug discovery. Computational predictions made by using machine learning methods from the data obtained from amino acid sequences are widely used in the protein metal-binding and various bioinformatics fields. In this work, we present three different deep learning architectures for prediction of metal-binding of Histidines (HIS) and Cysteines (CYS) amino acids. These architectures are as follows: 2D Convolutional Neural Network, Long-Short Term Memory and Recurrent Neural Network. Their comparison is carried out on the three different sets of attributes derived from a public dataset of protein sequences. These three sets of features extracted from the protein sequence were obtained using the PAM scoring matrix, protein composition server, and binary representation methods. The results show that a better performance for prediction of protein metal- binding sites is obtained through Convolutional Neural Network architecture.
Collapse
Affiliation(s)
- İsmail Haberal
- Department of Computer Engineering, Başkent University, Fatih Sultan Mahallesi Eskişehir Yolu 18. km, 06790, Etimesgut, Ankara, Turkey
| | - Hasan Oğul
- Department of Computer Engineering, Başkent University, Fatih Sultan Mahallesi Eskişehir Yolu 18. km, 06790, Etimesgut, Ankara, Turkey.,Faculty of Computer Sciences, Østfold University College, Halden, Norway
| |
Collapse
|
24
|
Qiao L, Xie D. MIonSite: Ligand-specific prediction of metal ion-binding sites via enhanced AdaBoost algorithm with protein sequence information. Anal Biochem 2019; 566:75-88. [DOI: 10.1016/j.ab.2018.11.009] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2018] [Revised: 10/15/2018] [Accepted: 11/07/2018] [Indexed: 11/24/2022]
|
25
|
Dayan A, Lamed R, Benayahu D, Fleminger G. RGD-modified dihydrolipoamide dehydrogenase as a molecular bridge for enhancing the adhesion of bone forming cells to titanium dioxide implant surfaces. J Biomed Mater Res A 2018; 107:545-551. [DOI: 10.1002/jbm.a.36570] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 10/23/2018] [Accepted: 10/27/2018] [Indexed: 11/10/2022]
Affiliation(s)
- Avraham Dayan
- George S. Wise Faculty of Life Sciences; The School of Molecular Cell Biology and Biotechnology; Tel Aviv Israel
| | - Raphael Lamed
- George S. Wise Faculty of Life Sciences; The School of Molecular Cell Biology and Biotechnology; Tel Aviv Israel
| | - Dafna Benayahu
- The Department of Cell and Developmental Biology; Sackler School of Medicine, Tel Aviv University; Ramat Aviv, Tel Aviv 69978 Israel
| | - Gideon Fleminger
- George S. Wise Faculty of Life Sciences; The School of Molecular Cell Biology and Biotechnology; Tel Aviv Israel
| |
Collapse
|
26
|
Sharma A, Sharma D, Verma SK. In silico Study of Iron, Zinc and Copper Binding Proteins of Pseudomonas syringae pv. lapsa: Emphasis on Secreted Metalloproteins. Front Microbiol 2018; 9:1838. [PMID: 30186242 PMCID: PMC6110883 DOI: 10.3389/fmicb.2018.01838] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Accepted: 07/23/2018] [Indexed: 11/17/2022] Open
Abstract
The phytopathogenic bacteria, Pseudomonas syringae pv. lapsa (P. syringae pv. lapsa) infects the staple food crop wheat. Metalloproteins play important roles in plant-pathogen interactions. Hence, the present work is aimed to predict and analyze the iron (Fe), zinc (Zn), and copper (Cu) binding proteins of P. syringae pv. lapsa which help in its growth, adaptation, survival and pathogenicity. A total of 232 Fe, 307 Zn, and 38 Cu-binding proteins have been identified. The functional annotation, subcellular localization and gene ontology enriched network analysis revealed their role in wide range of biological activities of the phytopathogen. Among the identified metalloproteins, a total of 29 Fe-binding, 31 Zn-binding, and 5 Cu-binding proteins were found to be secreted in nature. These putative secreted metalloproteins may perform diverse cellular and biological functions ranging from transport, response to oxidative stress, proteolysis, antimicrobial resistance, metabolic processes, protein folding and DNA repair. The observations obtained here may provide initial information required to draft new schemes to control microbial infections of staple food crops and will further help in developing sustainable agriculture.
Collapse
Affiliation(s)
- Ankita Sharma
- Centre for Computational Biology and Bioinformatics, School of Life Sciences, Central University of Himachal Pradesh, Kangra, India
| | - Dixit Sharma
- Centre for Computational Biology and Bioinformatics, School of Life Sciences, Central University of Himachal Pradesh, Kangra, India
| | - Shailender K Verma
- Centre for Computational Biology and Bioinformatics, School of Life Sciences, Central University of Himachal Pradesh, Kangra, India
| |
Collapse
|
27
|
Singh G, Tripathi S, Shanker K, Sharma A. Cadmium-induced conformational changes in type 2 metallothionein of medicinal plant Coptis japonica: insights from molecular dynamics studies of apo, partially and fully metalated forms. J Biomol Struct Dyn 2018; 37:1520-1533. [PMID: 29624115 DOI: 10.1080/07391102.2018.1461688] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Plants play an important role in the removal of excess heavy metals from soil and water. Medicinal plants can also have non-traditional use in phytoremediation technologies. Among the heavy metals, Cadmium (Cd) is the most abundant and readily taken up by the crop plants. Plant metallothioneins (MTs) are small proteins having cysteine-rich residues and appear to play key roles in metal homoeostasis. Plant metallothionein 2 (MT 2) from Coptis japonica (Gold-thread; CjMT 2) is a typical member of this subfamily and features two cysteine-rich regions containing eight and six cysteine residues, respectively, separated by 42 amino acids long linker region. In-silico analysis of MT 2 protein sequences of C. japonica was performed. In this study, ab initio methods were utilised for the prediction of three-dimensional structure of CjMT 2. After structure validation, heavy metal-binding sites were predicted for the selected modelled structures of CjMT 2. To obtain Cdi-CjMT 2 (i = 1-7), metalated complex individual docking experiments were performed. The stability of the metalated docked structures was assessed by molecular dynamics (MD) simulation studies. Our study showed that CjMT 2 binds up to 4 Cd2+ ions in two distinct domains: a N-terminal β-domain that binds to 2 Cd2+ ions and a C-terminal α-domain that binds with 2 Cd2+ ions. Our analysis revealed that Cys residues of alpha and beta domain and some residues of spacer region of CjMT 2 protein might be important for the cadmium interaction. MD simulation studies provided insight into metal-induced conformational changes and mechanism of metalation of CjMT 2, an intrinsically disordered protein. This study provides useful insights into mechanism of cadmium-type 2 metallothionein interaction.
Collapse
Affiliation(s)
- Garima Singh
- a Biotechnology Division, CSIR-Central Institute of Medicinal and Aromatic Plants , Post Office CIMAP , Lucknow 226015 , India.,c Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad 201002 , India
| | - Shubhandra Tripathi
- a Biotechnology Division, CSIR-Central Institute of Medicinal and Aromatic Plants , Post Office CIMAP , Lucknow 226015 , India
| | - Karuna Shanker
- b Chemical Science Division, CSIR-Central Institute of Medicinal and Aromatic Plants , Post Office CIMAP , Lucknow 226015 , India.,c Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad 201002 , India
| | - Ashok Sharma
- a Biotechnology Division, CSIR-Central Institute of Medicinal and Aromatic Plants , Post Office CIMAP , Lucknow 226015 , India.,c Academy of Scientific and Innovative Research (AcSIR) , Ghaziabad 201002 , India
| |
Collapse
|
28
|
Hu J, Li Y, Zhang Y, Yu DJ. ATPbind: Accurate Protein-ATP Binding Site Prediction by Combining Sequence-Profiling and Structure-Based Comparisons. J Chem Inf Model 2018; 58:501-510. [PMID: 29361215 DOI: 10.1021/acs.jcim.7b00397] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Protein-ATP interactions are ubiquitous in a wide variety of biological processes. Correctly locating ATP binding sites from protein information is an important but challenging task for protein function annotation and drug discovery. However, there is no method that can optimally identify ATP binding sites for different proteins. In this study, we report a new composite predictor, ATPbind, for ATP binding sites by integrating the outputs of two template-based predictors (i.e., S-SITE and TM-SITE) and three discriminative sequence-driven features of proteins: position specific scoring matrix, predicted secondary structure, and predicted solvent accessibility. In ATPbind, we assembled multiple support vector machines (SVMs) based on a random undersampling technique to cope with the serious imbalance phenomenon between the numbers of ATP binding sites and of non-ATP binding sites. We also constructed a new gold-standard benchmark data set consisting of 429 ATP binding proteins from the PDB database to evaluate and compare the proposed ATPbind with other existing predictors. Starting from a query sequence and predicted I-TASSER models, ATPbind can achieve an average accuracy of 72%, covering 62% of all ATP binding sites while achieving a Matthews correlation coefficient value that is significantly higher than that of other state-of-the-art predictors.
Collapse
Affiliation(s)
- Jun Hu
- School of Computer Science and Engineering, Nanjing University of Science and Technology , Xiaolingwei 200, Nanjing, 210094, P. R. China.,Department of Computational Medicine and Bioinformatics, University of Michigan , 100 Washtenaw, Ann Arbor, Michigan 48109-2218, United States
| | - Yang Li
- School of Computer Science and Engineering, Nanjing University of Science and Technology , Xiaolingwei 200, Nanjing, 210094, P. R. China.,Department of Computational Medicine and Bioinformatics, University of Michigan , 100 Washtenaw, Ann Arbor, Michigan 48109-2218, United States
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan , 100 Washtenaw, Ann Arbor, Michigan 48109-2218, United States
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology , Xiaolingwei 200, Nanjing, 210094, P. R. China
| |
Collapse
|
29
|
Srivastava A, Kumar M. Prediction of zinc binding sites in proteins using sequence derived information. J Biomol Struct Dyn 2018; 36:4413-4423. [PMID: 29241411 DOI: 10.1080/07391102.2017.1417910] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Zinc is one the most abundant catalytic cofactor and also an important structural component of a large number of metallo-proteins. Hence prediction of zinc metal binding sites in proteins can be a significant step in annotation of molecular function of a large number of proteins. Majority of existing methods for zinc-binding site predictions are based on a data-set of proteins, which has been compiled nearly a decade ago. Hence there is a need to develop zinc-binding site prediction system using the current updated data to include recently added proteins. Herein, we propose a support vector machine-based method, named as ZincBinder, for prediction of zinc metal-binding site in a protein using sequence profile information. The predictor was trained using fivefold cross validation approach and achieved 85.37% sensitivity with 86.20% specificity during training. Benchmarking on an independent non-redundant data-set, which was not used during training, showed better performance of ZincBinder vis-à-vis existing methods. Executable versions, source code, sample datasets, and usage instructions are available at http://proteininformatics.org/mkumar/znbinder/.
Collapse
Affiliation(s)
- Abhishikha Srivastava
- a Department of Biophysics , University of Delhi South Campus , Benito Juarez Road, New Delhi 110021 , India
| | - Manish Kumar
- a Department of Biophysics , University of Delhi South Campus , Benito Juarez Road, New Delhi 110021 , India
| |
Collapse
|
30
|
Ding Y, Tang J, Guo F. Identification of Protein-Ligand Binding Sites by Sequence Information and Ensemble Classifier. J Chem Inf Model 2017; 57:3149-3161. [PMID: 29125297 DOI: 10.1021/acs.jcim.7b00307] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Identifying protein-ligand binding sites is an important process in drug discovery and structure-based drug design. Detecting protein-ligand binding sites is expensive and time-consuming by traditional experimental methods. Hence, computational approaches provide many effective strategies to deal with this issue. Recently, lots of computational methods are based on structure information on proteins. However, these methods are limited in the common scenario, where both the sequence of protein target is known and sufficient 3D structure information is available. Studies indicate that sequence-based computational approaches for predicting protein-ligand binding sites are more practical. In this paper, we employ a novel computational model of protein-ligand binding sites prediction, using protein sequence. We apply the Discrete Cosine Transform (DCT) to extract feature from Position-Specific Score Matrix (PSSM). In order to improve the accuracy, Predicted Relative Solvent Accessibility (PRSA) information is also utilized. The predictor of protein-ligand binding sites is built by employing the ensemble weighted sparse representation model with random under-sampling. To evaluate our method, we conduct several comprehensive tests (12 types of ligands testing sets) for predicting protein-ligand binding sites. Results show that our method achieves better Matthew's correlation coefficient (MCC) than other outstanding methods on independent test sets of ATP (0.506), ADP (0.511), AMP (0.393), GDP (0.579), GTP (0.641), Mg2+ (0.317), Fe3+ (0.490) and HEME (0.640). Our proposed method outperforms earlier predictors (the performance of MCC) in 8 of the 12 ligands types.
Collapse
Affiliation(s)
- Yijie Ding
- School of Computer Science and Technology, Tianjin University , No. 135, Yaguan Road, Tianjin Haihe Education Park, Tianjin 300350, China
| | - Jijun Tang
- School of Computer Science and Technology, Tianjin University , No. 135, Yaguan Road, Tianjin Haihe Education Park, Tianjin 300350, China.,Department of Computer Science and Engineering, University of South Carolina , Columbia, South Carolina 29208, United States
| | - Fei Guo
- School of Computer Science and Technology, Tianjin University , No. 135, Yaguan Road, Tianjin Haihe Education Park, Tianjin 300350, China
| |
Collapse
|
31
|
Cao X, Hu X, Zhang X, Gao S, Ding C, Feng Y, Bao W. Identification of metal ion binding sites based on amino acid sequences. PLoS One 2017; 12:e0183756. [PMID: 28854211 PMCID: PMC5576659 DOI: 10.1371/journal.pone.0183756] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2017] [Accepted: 08/10/2017] [Indexed: 11/26/2022] Open
Abstract
The identification of metal ion binding sites is important for protein function annotation and the design of new drug molecules. This study presents an effective method of analyzing and identifying the binding residues of metal ions based solely on sequence information. Ten metal ions were extracted from the BioLip database: Zn2+, Cu2+, Fe2+, Fe3+, Ca2+, Mg2+, Mn2+, Na+, K+ and Co2+. The analysis showed that Zn2+, Cu2+, Fe2+, Fe3+, and Co2+ were sensitive to the conservation of amino acids at binding sites, and promising results can be achieved using the Position Weight Scoring Matrix algorithm, with an accuracy of over 79.9% and a Matthews correlation coefficient of over 0.6. The binding sites of other metals can also be accurately identified using the Support Vector Machine algorithm with multifeature parameters as input. In addition, we found that Ca2+ was insensitive to hydrophobicity and hydrophilicity information and Mn2+ was insensitive to polarization charge information. An online server was constructed based on the framework of the proposed method and is freely available at http://60.31.198.140:8081/metal/HomePage/HomePage.html.
Collapse
Affiliation(s)
- Xiaoyong Cao
- College of Sciences, Inner Mongolia University of Technology, Hohhot, 010051, China
| | - Xiuzhen Hu
- College of Sciences, Inner Mongolia University of Technology, Hohhot, 010051, China
| | - Xiaojin Zhang
- College of Sciences, Inner Mongolia University of Technology, Hohhot, 010051, China
| | - Sujuan Gao
- College of Sciences, Inner Mongolia University of Technology, Hohhot, 010051, China
- College of Sciences, Inner Mongolia Agricultural University, Hohhot, 010021, China
| | - Changjiang Ding
- College of Sciences, Inner Mongolia University of Technology, Hohhot, 010051, China
| | - Yonge Feng
- College of Sciences, Inner Mongolia Agricultural University, Hohhot, 010021, China
| | - Weihua Bao
- College of Sciences, Inner Mongolia University of Technology, Hohhot, 010051, China
| |
Collapse
|
32
|
Proteome scale identification, classification and structural analysis of iron-binding proteins in bread wheat. J Inorg Biochem 2017; 170:63-74. [DOI: 10.1016/j.jinorgbio.2017.02.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 01/23/2017] [Accepted: 02/10/2017] [Indexed: 12/26/2022]
|
33
|
Computational approaches for de novo design and redesign of metal-binding sites on proteins. Biosci Rep 2017; 37:BSR20160179. [PMID: 28167677 PMCID: PMC5482196 DOI: 10.1042/bsr20160179] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Revised: 02/06/2017] [Accepted: 02/06/2017] [Indexed: 12/25/2022] Open
Abstract
Metal ions play pivotal roles in protein structure, function and stability. The functional and structural diversity of proteins in nature expanded with the incorporation of metal ions or clusters in proteins. Approximately one-third of these proteins in the databases contain metal ions. Many biological and chemical processes in nature involve metal ion-binding proteins, aka metalloproteins. Many cellular reactions that underpin life require metalloproteins. Most of the remarkable, complex chemical transformations are catalysed by metalloenzymes. Realization of the importance of metal-binding sites in a variety of cellular events led to the advancement of various computational methods for their prediction and characterization. Furthermore, as structural and functional knowledgebase about metalloproteins is expanding with advances in computational and experimental fields, the focus of the research is now shifting towards de novo design and redesign of metalloproteins to extend nature’s own diversity beyond its limits. In this review, we will focus on the computational toolbox for prediction of metal ion-binding sites, de novo metalloprotein design and redesign. We will also give examples of tailor-made artificial metalloproteins designed with the computational toolbox.
Collapse
|
34
|
Complex Nature of Protein Carbonylation Specificity After Metal-Catalyzed Oxidation. Pharm Res 2017; 34:765-779. [DOI: 10.1007/s11095-017-2103-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 01/12/2017] [Indexed: 12/27/2022]
|
35
|
Uroshlev L, Kulakovskiy I, Esipova N, Tumanyan V, Rahmanov S, Makeev V. Role of structural water for prediction of cation binding sites in apoproteins. J Biomol Struct Dyn 2017; 36:221-232. [DOI: 10.1080/07391102.2016.1273136] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- L.A. Uroshlev
- Department of Computational Systems Biology, Vavilov Institute of General Genetics, 3 Gubkina st., Moscow 119991, Russian Federation
| | - I.V. Kulakovskiy
- Department of Computational Systems Biology, Vavilov Institute of General Genetics, 3 Gubkina st., Moscow 119991, Russian Federation
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilova st., Moscow 119991, Russian Federation
| | - N.G. Esipova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilova st., Moscow 119991, Russian Federation
| | - V.G. Tumanyan
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilova st., Moscow 119991, Russian Federation
| | - S.V. Rahmanov
- Department of Computational Systems Biology, Vavilov Institute of General Genetics, 3 Gubkina st., Moscow 119991, Russian Federation
| | - V.J. Makeev
- Department of Computational Systems Biology, Vavilov Institute of General Genetics, 3 Gubkina st., Moscow 119991, Russian Federation
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilova st., Moscow 119991, Russian Federation
- State Research Institute of Genetics and Selection of Industrial Microorganisms, GosNIIGenetika, 1st Dorozhniy proezd 1, Moscow 117545, Russian Federation
- Department of Medical and Biological Physics, Moscow Institute of Physics and Technology, 9 Institutskiy per., Moscow 141700, Russian Federation
| |
Collapse
|
36
|
Sharma A, Sharma D, Verma SK. Proteome wide identification of iron binding proteins of Xanthomonas translucens pv. undulosa: focus on secretory virulent proteins. Biometals 2017; 30:127-141. [DOI: 10.1007/s10534-017-9991-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Accepted: 01/08/2017] [Indexed: 12/19/2022]
|
37
|
Gill RA, Ali B, Yang S, Tong C, Islam F, Gill MB, Mwamba TM, Ali S, Mao B, Liu S, Zhou W. Reduced Glutathione Mediates Pheno-Ultrastructure, Kinome and Transportome in Chromium-Induced Brassica napus L. FRONTIERS IN PLANT SCIENCE 2017; 8:2037. [PMID: 29312362 PMCID: PMC5732361 DOI: 10.3389/fpls.2017.02037] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2017] [Accepted: 11/14/2017] [Indexed: 05/19/2023]
Abstract
Chromium (Cr) as a toxic metal is widely used for commercial purposes and its residues have become a potential environmental threat to both human and plant health. Oilseed rape (Brassica napus L.) is one of the candidate plants that can absorb the considerable quantity of toxic metals from the soil. Here, we used two cultivars of B. napus cvs. ZS 758 (metal-tolerant) and Zheda 622 (metal-susceptible) to investigate the phenological attributes, cell ultrastructure, protein kinases (PKs) and molecular transporters (MTs) under the combined treatments of Cr stress and reduced glutathione (GSH). Seeds of these cultivars were grown in vitro at different treatments i.e., 0, 400 μM Cr, and 400 μM Cr + 1 mM GSH in control growth chamber for 6 days. Results had confirmed that Cr significantly reduced the plant length, stem and root, and fresh biomass such as leaf, stem and root. Cr noticeably caused the damages in leaf mesophyll cells. Exogenous application of GSH significantly recovered both phenological and cell structural damages in two cultivars under Cr stress. For the PKs, transcriptomic data advocated that Cr stress alone significantly increased the gene expressions of BnaA08g16610D, BnaCnng19320D, and BnaA08g00390D over that seen in controls (Ck). These genes encoded both nucleic acid and transition metal ion binding proteins, and protein kinase activity (PKA) and phosphotransferase activities in both cultivars. Similarly, the presence of Cr revealed elite MT genes [BnaA04g26560D, BnaA02g28130D, and BnaA02g01980D (novel)] that were responsible for water transmembrane transporter activity. However, GSH in combination with Cr stress significantly up-regulated the genes for PKs [such as BnaCnng69940D (novel) and BnaC08g49360D] that were related to PKA, signal transduction, and oxidoreductase activities. For MTs, BnaC01g29930D and BnaA07g14320D were responsible for secondary active transmembrane transporter and protein transporter activities that were expressed more in GSH treatment than either Ck or Cr-treated cells. In general, it can be concluded that cultivar ZS 758 is more tolerant toward Cr-induced stress than Zheda 622.
Collapse
Affiliation(s)
- Rafaqat A. Gill
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
- Institute of Biotechnology, Zhejiang University, Hangzhou, China
| | - Basharat Ali
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
- Institute of Crop Science and Resource Conservation, University of Bonn, Bonn, Germany
| | - Su Yang
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
| | - Chaobo Tong
- Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Faisal Islam
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
| | - Muhammad Bilal Gill
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
| | - Theodore M. Mwamba
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
| | - Skhawat Ali
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
| | - Bizeng Mao
- Institute of Biotechnology, Zhejiang University, Hangzhou, China
| | - Shengyi Liu
- Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Wuhan, China
| | - Weijun Zhou
- Institute of Crop Science and Zhejiang Key Laboratory of Crop Germplasm, Zhejiang University, Hangzhou, China
- *Correspondence: Weijun Zhou
| |
Collapse
|
38
|
Lin YF, Cheng CW, Shih CS, Hwang JK, Yu CS, Lu CH. MIB: Metal Ion-Binding Site Prediction and Docking Server. J Chem Inf Model 2016; 56:2287-2291. [PMID: 27976886 DOI: 10.1021/acs.jcim.6b00407] [Citation(s) in RCA: 168] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The structure of a protein determines its biological function(s) and its interactions with other factors; the binding regions tend to be conserved in sequence and structure, and the interacting residues involved are usually in close 3D space. The Protein Data Bank currently contains more than 110 000 protein structures, approximately one-third of which contain metal ions. Identifying and characterizing metal ion-binding sites is thus essential for investigating a protein's function(s) and interactions. However, experimental approaches are time-consuming and costly. The web server reported here was built to predict metal ion-binding residues and to generate the predicted metal ion-bound 3D structure. Binding templates have been constructed for regions that bind 12 types of metal ion-binding residues have been used to construct binding templates. The templates include residues within 3.5 Å of the metal ion, and the fragment transformation method was used for structural comparison between query proteins and templates without any data training. Through the adjustment of scoring functions, which are based on the similarity of structure and binding residues. Twelve kinds of metal ions (Ca2+, Cu2+, Fe3+, Mg2+, Mn2+, Zn2+, Cd2+, Fe2+, Ni2+, Hg2+, Co2+, and Cu+) binding residues prediction are supported. MIB also provides the metal ions docking after prediction. The MIB server is available at http://bioinfo.cmu.edu.tw/MIB/ .
Collapse
Affiliation(s)
- Yu-Feng Lin
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University , Hsinchu, 30050 Taiwan
| | - Chih-Wen Cheng
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University , Hsinchu, 30050 Taiwan
| | - Chung-Shiuan Shih
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University , Hsinchu, 30050 Taiwan
| | - Jenn-Kang Hwang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University , Hsinchu, 30050 Taiwan
| | | | - Chih-Hao Lu
- Graduate Institute of Basic Medical Science, China Medical University , Taichung 40402, Taiwan
| |
Collapse
|
39
|
Wheeler LC, Donor MT, Prell JS, Harms MJ. Multiple Evolutionary Origins of Ubiquitous Cu2+ and Zn2+ Binding in the S100 Protein Family. PLoS One 2016; 11:e0164740. [PMID: 27764152 PMCID: PMC5072561 DOI: 10.1371/journal.pone.0164740] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Accepted: 09/29/2016] [Indexed: 12/24/2022] Open
Abstract
The S100 proteins are a large family of signaling proteins that play critical roles in biology and disease. Many S100 proteins bind Zn2+, Cu2+, and/or Mn2+ as part of their biological functions; however, the evolutionary origins of binding remain obscure. One key question is whether divalent transition metal binding is ancestral, or instead arose independently on multiple lineages. To tackle this question, we combined phylogenetics with biophysical characterization of modern S100 proteins. We demonstrate an earlier origin for established S100 subfamilies than previously believed, and reveal that transition metal binding is widely distributed across the tree. Using isothermal titration calorimetry, we found that Cu2+ and Zn2+ binding are common features of the family: the full breadth of human S100 paralogs-as well as two early-branching S100 proteins found in the tunicate Oikopleura dioica-bind these metals with μM affinity and stoichiometries ranging from 1:1 to 3:1 (metal:protein). While binding is consistent across the tree, structural responses to binding are quite variable. Further, mutational analysis and structural modeling revealed that transition metal binding occurs at different sites in different S100 proteins. This is consistent with multiple origins of transition metal binding over the evolution of this protein family. Our work reveals an evolutionary pattern in which the overall phenotype of binding is a constant feature of S100 proteins, even while the site and mechanism of binding is evolutionarily labile.
Collapse
Affiliation(s)
- Lucas C. Wheeler
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403, United States of America
| | - Micah T. Donor
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
| | - James S. Prell
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
| | - Michael J. Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403, United States of America
| |
Collapse
|
40
|
Minimal Functional Sites in Metalloproteins and Their Usage in Structural Bioinformatics. Int J Mol Sci 2016; 17:ijms17050671. [PMID: 27153067 PMCID: PMC4881497 DOI: 10.3390/ijms17050671] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2016] [Revised: 04/18/2016] [Accepted: 04/28/2016] [Indexed: 12/12/2022] Open
Abstract
Metal ions play a functional role in numerous biochemical processes and cellular pathways. Indeed, about 40% of all enzymes of known 3D structure require a metal ion to be able to perform catalysis. The interactions of the metals with the macromolecular framework determine their chemical properties and reactivity. The relevant interactions involve both the coordination sphere of the metal ion and the more distant interactions of the so-called second sphere, i.e., the non-bonded interactions between the macromolecule and the residues coordinating the metal (metal ligands). The metal ligands and the residues in their close spatial proximity define what we call a minimal functional site (MFS). MFSs can be automatically extracted from the 3D structures of metal-binding biological macromolecules deposited in the Protein Data Bank (PDB). They are 3D templates that describe the local environment around a metal ion or metal cofactor and do not depend on the overall macromolecular structure. MFSs provide a different view on metal-binding proteins and nucleic acids, completely focused on the metal. Here we present different protocols and tools based upon the concept of MFS to obtain deeper insight into the structural and functional properties of metal-binding macromolecules. We also show that structure conservation of MFSs in metalloproteins relates to local sequence similarity more strongly than to overall protein similarity.
Collapse
|
41
|
Yu DJ, Hu J, Li QM, Tang ZM, Yang JY, Shen HB. Constructing query-driven dynamic machine learning model with application to protein-ligand binding sites prediction. IEEE Trans Nanobioscience 2015; 14:45-58. [PMID: 25730499 DOI: 10.1109/tnb.2015.2394328] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
We are facing an era with annotated biological data rapidly and continuously generated. How to effectively incorporate new annotated data into the learning step is crucial for enhancing the performance of a bioinformatics prediction model. Although machine-learning-based methods have been extensively used for dealing with various biological problems, existing approaches usually train static prediction models based on fixed training datasets. The static approaches are found having several disadvantages such as low scalability and impractical when training dataset is huge. In view of this, we propose a dynamic learning framework for constructing query-driven prediction models. The key difference between the proposed framework and the existing approaches is that the training set for the machine learning algorithm of the proposed framework is dynamically generated according to the query input, as opposed to training a general model regardless of queries in traditional static methods. Accordingly, a query-driven predictor based on the smaller set of data specifically selected from the entire annotated base dataset will be applied on the query. The new way for constructing the dynamic model enables us capable of updating the annotated base dataset flexibly and using the most relevant core subset as the training set makes the constructed model having better generalization ability on the query, showing "part could be better than all" phenomenon. According to the new framework, we have implemented a dynamic protein-ligand binding sites predictor called OSML (On-site model for ligand binding sites prediction). Computer experiments on 10 different ligand types of three hierarchically organized levels show that OSML outperforms most existing predictors. The results indicate that the current dynamic framework is a promising future direction for bridging the gap between the rapidly accumulated annotated biological data and the effective machine-learning-based predictors. OSML web server and datasets are freely available at: http://www.csbio.sjtu.edu.cn/bioinf/OSML/ for academic use.
Collapse
|
42
|
Abstract
This chapter focuses on transition metals. All transition metal cations are toxic-those that are essential for Escherichia coli and belong to the first transition period of the periodic system of the element and also the "toxic-only" metals with higher atomic numbers. Common themes are visible in the metabolism of these ions. First, there is transport. High-rate but low-affinity uptake systems provide a variety of cations and anions to the cells. Control of the respective systems seems to be mainly through regulation of transport activity (flux control), with control of gene expression playing only a minor role. If these systems do not provide sufficient amounts of a needed ion to the cell, genes for ATP-hydrolyzing high-affinity but low-rate uptake systems are induced, e.g., ABC transport systems or P-type ATPases. On the other hand, if the amount of an ion is in surplus, genes for efflux systems are induced. By combining different kinds of uptake and efflux systems with regulation at the levels of gene expression and transport activity, the concentration of a single ion in the cytoplasm and the composition of the cellular ion "bouquet" can be rapidly adjusted and carefully controlled. The toxicity threshold of an ion is defined by its ability to produce radicals (copper, iron, chromate), to bind to sulfide and thiol groups (copper, zinc, all cations of the second and third transition period), or to interfere with the metabolism of other ions. Iron poses an exceptional metabolic problem due its metabolic importance and the low solubility of Fe(III) compounds, combined with the ability to cause dangerous Fenton reactions. This dilemma for the cells led to the evolution of sophisticated multi-channel iron uptake and storage pathways to prevent the occurrence of unbound iron in the cytoplasm. Toxic metals like Cd2+ bind to thiols and sulfide, preventing assembly of iron complexes and releasing the metal from iron-sulfur clusters. In the unique case of mercury, the cation can be reduced to the volatile metallic form. Interference of nickel and cobalt with iron is prevented by the low abundance of these metals in the cytoplasm and their sequestration by metal chaperones, in the case of nickel, or by B12 and its derivatives, in the case of cobalt. The most dangerous metal, copper, catalyzes Fenton-like reactions, binds to thiol groups, and interferes with iron metabolism. E. coli solves this problem probably by preventing copper uptake, combined with rapid efflux if the metal happens to enter the cytoplasm.
Collapse
|
43
|
Zaidi A, Singh KP, Anwar S, Suman SS, Equbal A, Singh K, Dikhit MR, Bimal S, Pandey K, Das P, Ali V. Interaction of frataxin, an iron binding protein, with IscU of Fe-S clusters biogenesis pathway and its upregulation in AmpB resistant Leishmania donovani. Biochimie 2015; 115:120-35. [PMID: 26032732 DOI: 10.1016/j.biochi.2015.05.016] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 05/19/2015] [Indexed: 01/10/2023]
Abstract
Leishmania donovani is a unicellular protozoon parasite that causes visceral leishmaniasis (VL), which is a fatal disease if left untreated. Certain Fe-S proteins of the TCA cycle and respiratory chain have been found in the Leishmania parasite but the precise mechanisms for their biogenesis and the maturation of Fe-S clusters remains unknown. Fe-S clusters are ubiquitous cofactors of proteins that perform critical cellular functions. The clusters are biosynthesized by the mitochondrial Iron-Sulphur Cluster (ISC) machinery with core protein components that include the catalytic cysteine desulphurase IscS, the scaffold proteins IscU and IscA, and frataxin as an iron carrier/donor. However, no information regarding frataxin, its regulation, or its role in drug resistance is available for the Leishmania parasite. In this study, we characterized Ld-frataxin to investigate its role in the ISC machinery of L. donovani. We expressed and purified the recombinant Ld-frataxin protein and observed its interaction with Ld-IscU by co-purification and pull-down assay. Furthermore, we observed that the cysteine desulphurase activity of the purified Ld-IscS protein was stimulated in the presence of Ld-frataxin and Ld-IscU, particularly in the presence of iron; neither Ld-frataxin nor Ld-IscU alone had significant effects on Ld-IscS activity. Interestingly, RT-PCR and western blotting showed that Ld-frataxin is upregulated in AmpB-resistant isolates compared to sensitive strains, which may support higher Fe-S protein activity in AmpB-resistant L. donovani. Additionally, Ld-frataxin was localized in the mitochondria, as revealed by digitonin fractionation and indirect immunofluorescence. Thus, our results suggest the role of Ld-frataxin as an iron binding/carrier protein for Fe-S cluster biogenesis that physically interacts with other core components of the ISC machinery within the mitochondria.
Collapse
Affiliation(s)
- Amir Zaidi
- Laboratory of Molecular Biochemistry and Cell Biology, Department of Biochemistry, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna 800007, India
| | - Krishn Pratap Singh
- Laboratory of Molecular Biochemistry and Cell Biology, Department of Biochemistry, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna 800007, India
| | - Shadab Anwar
- Laboratory of Molecular Biochemistry and Cell Biology, Department of Biochemistry, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna 800007, India
| | - Shashi S Suman
- Laboratory of Molecular Biochemistry and Cell Biology, Department of Biochemistry, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna 800007, India
| | - Asif Equbal
- Laboratory of Molecular Biochemistry and Cell Biology, Department of Biochemistry, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna 800007, India
| | - Kuljit Singh
- Laboratory of Molecular Biochemistry and Cell Biology, Department of Biochemistry, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna 800007, India
| | - Manas R Dikhit
- Biomedical Informatic Centre, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna, India
| | - Sanjeeva Bimal
- Department of Immunology, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna, India
| | - Krishna Pandey
- Department of Clinical Medicine, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna, India
| | - Pradeep Das
- Department of Molecular Biology, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna, India
| | - Vahab Ali
- Laboratory of Molecular Biochemistry and Cell Biology, Department of Biochemistry, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Agamkuan, Patna 800007, India.
| |
Collapse
|
44
|
Azia A, Levy R, Unger R, Edelman M, Sobolev V. Genome-wide computational determination of the human metalloproteome. Proteins 2015; 83:931-9. [PMID: 25739467 DOI: 10.1002/prot.24790] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Revised: 02/06/2015] [Accepted: 02/25/2015] [Indexed: 11/08/2022]
Abstract
Accurate prediction of protein function in humans is important for understanding biological processes at the molecular level in biomedicine and drug design. Over a third of proteins are commonly held to bind metal, and ∼10% of human proteins, to bind zinc. Therefore, an initial step in protein function prediction frequently involves predicting metal ion binding. In recent years, methods have been developed to predict a set of residues in 3D space forming the metal-ion binding site, often with a high degree of accuracy. Here, using extensions of these methods, we provide an extensive list of human proteins and their putative metal ion binding site residues, using translated gene sequences derived from the complete, resolved human genome. Under conditions of ∼90% selectivity, over 900 new human putative metal ion binding proteins are identified. A statistical analysis of resolved metal ion binding sites in the human metalloproteome is furnished and the importance of remote homology analysis is demonstrated. As an example, a novel metal-ion binding site involving a complex of a botulinum substrate with its inhibitor is presented. On the basis of the location of the predicted site and the interactions of the contacting residues at the complex interface, we postulate that metal ion binding in this region could influence complex formation and, consequently, the functioning of the protein. Thus, this work provides testable hypotheses about novel functions of known proteins.
Collapse
Affiliation(s)
- Ariel Azia
- The Mina and Everard Goodman Faculty of Life Sciences, Bar Ilan University, Ramat Gan, Israel
| | | | | | | | | |
Collapse
|
45
|
He W, Liang Z, Teng M, Niu L. mFASD: a structure-based algorithm for discriminating different types of metal-binding sites. Bioinformatics 2015; 31:1938-44. [DOI: 10.1093/bioinformatics/btv044] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 01/17/2015] [Indexed: 11/13/2022] Open
|
46
|
Yu DJ, Hu J, Yan H, Yang XB, Yang JY, Shen HB. Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble. BMC Bioinformatics 2014; 15:297. [PMID: 25189131 PMCID: PMC4261549 DOI: 10.1186/1471-2105-15-297] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2014] [Accepted: 08/18/2014] [Indexed: 11/10/2022] Open
Abstract
Background Vitamins are typical ligands that play critical roles in various metabolic processes. The accurate identification of the vitamin-binding residues solely based on a protein sequence is of significant importance for the functional annotation of proteins, especially in the post-genomic era, when large volumes of protein sequences are accumulating quickly without being functionally annotated. Results In this paper, a new predictor called TargetVita is designed and implemented for predicting protein-vitamin binding residues using protein sequences. In TargetVita, features derived from the position-specific scoring matrix (PSSM), predicted protein secondary structure, and vitamin binding propensity are combined to form the original feature space; then, several feature subspaces are selected by performing different feature selection methods. Finally, based on the selected feature subspaces, heterogeneous SVMs are trained and then ensembled for performing prediction. Conclusions The experimental results obtained with four separate vitamin-binding benchmark datasets demonstrate that the proposed TargetVita is superior to the state-of-the-art vitamin-specific predictor, and an average improvement of 10% in terms of the Matthews correlation coefficient (MCC) was achieved over independent validation tests. The TargetVita web server and the datasets used are freely available for academic use at http://csbio.njust.edu.cn/bioinf/TargetVita or http://www.csbio.sjtu.edu.cn/bioinf/TargetVita. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-297) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Xiaolingwei 200, Nanjing 210094, China.
| | | | | | | | | | | |
Collapse
|
47
|
Estellon J, Ollagnier de Choudens S, Smadja M, Fontecave M, Vandenbrouck Y. An integrative computational model for large-scale identification of metalloproteins in microbial genomes: a focus on iron-sulfur cluster proteins. Metallomics 2014; 6:1913-30. [PMID: 25117543 DOI: 10.1039/c4mt00156g] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Metalloproteins represent a ubiquitous group of molecules which are crucial to the survival of all living organisms. While several metal-binding motifs have been defined, it remains challenging to confidently identify metalloproteins from primary protein sequences using computational approaches alone. Here, we describe a comprehensive strategy based on a machine learning approach to design and assess a penalized generalized linear model. We used this strategy to detect members of the iron-sulfur cluster protein family. A new category of descriptors, whose profile is based on profile hidden Markov models, encoding structural information was combined with public descriptors into a linear model. The model was trained and tested on distinct datasets composed of well-characterized iron-sulfur protein sequences, and the resulting model provided higher sensitivity compared to a motif-based approach, while maintaining a good level of specificity. Analysis of this linear model allows us to detect and quantify the contribution of each descriptor, providing us with a better understanding of this complex protein family along with valuable indications for further experimental characterization. Two newly-identified proteins, YhcC and YdiJ, were functionally validated as genuine iron-sulfur proteins, confirming the prediction. The computational model was then applied to over 550 prokaryotic genomes to screen for iron-sulfur proteomes; the results are publicly available at: . This study represents a proof-of-concept for the application of a penalized linear model to identify metalloprotein superfamilies on a large-scale. The application employed here, screening for iron-sulfur proteomes, provides new candidates for further biochemical and structural analysis as well as new resources for an extensive exploration of iron-sulfuromes in the microbial world.
Collapse
Affiliation(s)
- Johan Estellon
- Univ. Grenoble Alpes, iRTSV-BGE, F-38000 Grenoble, France.
| | | | | | | | | |
Collapse
|
48
|
Roston RL, Wang K, Kuhn LA, Benning C. Structural determinants allowing transferase activity in SENSITIVE TO FREEZING 2, classified as a family I glycosyl hydrolase. J Biol Chem 2014; 289:26089-26106. [PMID: 25100720 DOI: 10.1074/jbc.m114.576694] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
SENSITIVE TO FREEZING 2 (SFR2) is classified as a family I glycosyl hydrolase but has recently been shown to have galactosyltransferase activity in Arabidopsis thaliana. Natural occurrences of apparent glycosyl hydrolases acting as transferases are interesting from a biocatalysis standpoint, and knowledge about the interconversion can assist in engineering SFR2 in crop plants to resist freezing. To understand how SFR2 evolved into a transferase, the relationship between its structure and function are investigated by activity assay, molecular modeling, and site-directed mutagenesis. SFR2 has no detectable hydrolase activity, although its catalytic site is highly conserved with that of family 1 glycosyl hydrolases. Three regions disparate from glycosyl hydrolases are identified as required for transferase activity as follows: a loop insertion, the C-terminal peptide, and a hydrophobic patch adjacent to the catalytic site. Rationales for the effects of these regions on the SFR2 mechanism are discussed.
Collapse
Affiliation(s)
- Rebecca L Roston
- Departments of Biochemistry and Molecular Biology and Michigan State University, East Lansing, Michigan 48824.
| | - Kun Wang
- Departments of Biochemistry and Molecular Biology and Michigan State University, East Lansing, Michigan 48824
| | - Leslie A Kuhn
- Departments of Biochemistry and Molecular Biology and Michigan State University, East Lansing, Michigan 48824; Departments of Computer Science and Engineering, Michigan State University, East Lansing, Michigan 48824
| | - Christoph Benning
- Departments of Biochemistry and Molecular Biology and Michigan State University, East Lansing, Michigan 48824
| |
Collapse
|
49
|
Borek D, Kozak M, Pei J, Jaskolski M. Crystal structure of active site mutant of antileukemicl-asparaginase reveals conserved zinc-binding site. FEBS J 2014; 281:4097-111. [DOI: 10.1111/febs.12906] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2014] [Revised: 06/16/2014] [Accepted: 07/01/2014] [Indexed: 02/03/2023]
Affiliation(s)
- Dominika Borek
- Department of Crystallography; Faculty of Chemistry; A. Mickiewicz University; Poznan Poland
| | - Maciej Kozak
- Department of Crystallography; Faculty of Chemistry; A. Mickiewicz University; Poznan Poland
- Department of Macromolecular Physics; Faculty of Physics; A. Mickiewicz University; Poznan Poland
| | - Jimin Pei
- Howard Hughes Medical Institute; University of Texas Southwestern Medical Center; Dallas TX USA
| | - Mariusz Jaskolski
- Department of Crystallography; Faculty of Chemistry; A. Mickiewicz University; Poznan Poland
- Center for Biocrystallographic Research; Institute of Bioorganic Chemistry; Polish Academy of Sciences; Poznan Poland
| |
Collapse
|
50
|
Metal-mediated modulation of streptococcal cysteine protease activity and its biological implications. Infect Immun 2014; 82:2992-3001. [PMID: 24799625 DOI: 10.1128/iai.01770-14] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Streptococcal cysteine protease (SpeB), the major secreted protease produced by group A streptococcus (GAS), cleaves both host and bacterial proteins and contributes importantly to the pathogenesis of invasive GAS infections. Modulation of SpeB expression and/or its activity during invasive GAS infections has been shown to affect bacterial virulence and infection severity. Expression of SpeB is regulated by the GAS CovR-CovS two-component regulatory system, and we demonstrated that bacteria with mutations in the CovR-CovS two-component regulatory system are selected for during localized GAS infections and that these bacteria lack SpeB expression and exhibit a hypervirulent phenotype. Additionally, in a separate study, we showed that expression of SpeB can also be modulated by human transferrin- and/or lactoferrin-mediated iron chelation. Accordingly, the goal of this study was to investigate the possible roles of iron and other metals in modulating SpeB expression and/or activity in a manner that would potentiate bacterial virulence. Here, we report that the divalent metals zinc and copper inhibit SpeB activity at the posttranslational level. Utilizing online metal-binding site prediction servers, we identified two putative metal-binding sites in SpeB, one of which involves the catalytic-dyad residues (47)Cys and (195)His. Based on our findings, we propose that zinc and/or copper availability in the bacterial microenvironment can modulate the proteolytic activity of SpeB in a manner that preserves the integrity of several other virulence factors essential for bacterial survival and dissemination within the host and thereby may exacerbate the severity of invasive GAS infections.
Collapse
|