1
|
Lin M, Li K, Zhang Y, Pan F, Wu W, Zhang J. DisDock: A Deep Learning Method for Metal Ion-Protein Redocking. Proteins 2025. [PMID: 39838957 DOI: 10.1002/prot.26791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 09/10/2024] [Accepted: 12/18/2024] [Indexed: 01/23/2025]
Abstract
The structures of metalloproteins are essential for comprehending their functions and interactions. The breakthrough of AlphaFold has made it possible to predict protein structures with experimental accuracy. However, the type of metal ion that a metalloprotein binds and the binding structure are still not readily available, even with the predicted protein structure. In this study, we present DisDock, a deep learning method for predicting protein-metal docking. DisDock takes distogram of randomly initialized protein-ligand configuration as input and outputs the distogram of the predicted binding complex. It combines the U-net architecture with self-attention modules to enhance model performance. Taking inspiration from the physical principle that atoms in closer proximity display a stronger mutual attraction, this predictor capitalizes on geometric information to uncover latent characteristics indicative of atom interactions. To train our model, we employ a high-quality metalloprotein dataset sourced from the Mother of All Databases (MOAD). Experimental results demonstrate that our approach outperforms other existing methods in prediction accuracy for various types of metal ions.
Collapse
Affiliation(s)
- Menghan Lin
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| | - Keqiao Li
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| | - Yuan Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| | - Feng Pan
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| | - Wei Wu
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, Florida, USA
| |
Collapse
|
2
|
Ibrahim IH. Metalloproteins and metalloproteomics in health and disease. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2024; 141:123-176. [PMID: 38960472 DOI: 10.1016/bs.apcsb.2023.12.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
Metalloproteins represents more than one third of human proteome, with huge variation in physiological functions and pathological implications, depending on the metal/metals involved and tissue context. Their functions range from catalysis, bioenergetics, redox, to DNA repair, cell proliferation, signaling, transport of vital elements, and immunity. The human metalloproteomic studies revealed that many families of metalloproteins along with individual metalloproteins are dysregulated under several clinical conditions. Also, several sorts of interaction between redox- active or redox- inert metalloproteins are observed in health and disease. Metalloproteins profiling shows distinct alterations in neurodegenerative diseases, cancer, inflammation, infection, diabetes mellitus, among other diseases. This makes metalloproteins -either individually or as families- a promising target for several therapeutic approaches. Inhibitors and activators of metalloenzymes, metal chelators, along with artificial metalloproteins could be versatile in diagnosis and treatment of several diseases, in addition to other biomedical and industrial applications.
Collapse
Affiliation(s)
- Iman Hassan Ibrahim
- Department of Biochemistry and Molecular Biology, Faculty of Pharmacy (Girls), Al-Azhar University, Cairo, Egypt.
| |
Collapse
|
3
|
Laveglia V, Bazayeva M, Andreini C, Rosato A. Hunting down zinc(II)-binding sites in proteins with distance matrices. Bioinformatics 2023; 39:btad653. [PMID: 37878807 PMCID: PMC10630175 DOI: 10.1093/bioinformatics/btad653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/17/2023] [Accepted: 10/23/2023] [Indexed: 10/27/2023] Open
Abstract
MOTIVATION In recent years, high-throughput sequencing technologies have made available the genome sequences of a huge variety of organisms. However, the functional annotation of the encoded proteins often still relies on low-throughput and costly experimental studies. Bioinformatics approaches offer a promising alternative to accelerate this process. In this work, we focus on the binding of zinc(II) ions, which is needed for 5%-10% of any organism's proteins to achieve their physiologically relevant form. RESULTS To implement a predictor of zinc(II)-binding sites in the 3D structures of proteins, we used a neural network, followed by a filter of the network output against the local structure of all known sites. The latter was implemented as a function comparing the distance matrices of the Cα and Cβ atoms of the sites. We called the resulting tool Master of Metals (MOM). The structural models for the entire proteome of an organism generated by AlphaFold can be used as input to our tool in order to achieve annotation at the whole organism level within a few hours. To demonstrate this, we applied MOM to the yeast proteome, obtaining a precision of about 76%, based on data for homologous proteins. AVAILABILITY AND IMPLEMENTATION Master of Metals has been implemented in Python and is available at https://github.com/cerm-cirmmp/Master-of-metals.
Collapse
Affiliation(s)
- Vincenzo Laveglia
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
| | - Milana Bazayeva
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
- Magnetic Resonance Center (CERM), University of Florence, Sesto Fiorentino 50019, Italy
| | - Claudia Andreini
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
- Magnetic Resonance Center (CERM), University of Florence, Sesto Fiorentino 50019, Italy
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Sesto Fiorentino 50019, Italy
| | - Antonio Rosato
- Department of Chemistry, University of Florence, Sesto Fiorentino 50019, Italy
- Magnetic Resonance Center (CERM), University of Florence, Sesto Fiorentino 50019, Italy
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Sesto Fiorentino 50019, Italy
| |
Collapse
|
4
|
Wang Z, Wang M, Zhao Z, Zheng P. Quantification of carboxylate-bridged di-zinc site stability in protein due ferri by single-molecule force spectroscopy. Protein Sci 2023; 32:e4583. [PMID: 36718829 PMCID: PMC9926469 DOI: 10.1002/pro.4583] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Revised: 01/16/2023] [Accepted: 01/27/2023] [Indexed: 02/01/2023]
Abstract
Carboxylate-bridged diiron proteins belong to a protein family involved in different physiological processes. These proteins share the conservative EXXH motif, which provides the carboxylate bridge and is critical for metal binding. Here, we choose de novo-designed single-chain due ferri protein (DFsc), a four-helical protein with two EXXH motifs as a model protein, to study the stability of the carboxylate-bridged di-metal binding site. The mechanical and kinetic properties of the di-Zn site in DFsc were obtained by atomic force microscopy-based single-molecule force spectroscopy. Zn-DFsc showed a considerable rupture force of ~200 pN, while the apo-protein is mechanically labile. In addition, multiple rupture pathways were observed with different probabilities, indicating the importance of the EXXH-based carboxylate-bridged metal site. These results demonstrate carboxylate-bridged di-metal site is mechanically stable and improve our understanding of this important type of metalloprotein.
Collapse
Affiliation(s)
- Zhiyi Wang
- State Key Laboratory of Coordination Chemistry, Chemistry and Biomedicine Innovation Center (ChemBIC), School of Chemistry and Chemical EngineeringNanjing UniversityNanjingPeople's Republic of China
| | - Mengdie Wang
- State Key Laboratory of Coordination Chemistry, Chemistry and Biomedicine Innovation Center (ChemBIC), School of Chemistry and Chemical EngineeringNanjing UniversityNanjingPeople's Republic of China
| | - Zhongxin Zhao
- State Key Laboratory of Coordination Chemistry, Chemistry and Biomedicine Innovation Center (ChemBIC), School of Chemistry and Chemical EngineeringNanjing UniversityNanjingPeople's Republic of China
| | - Peng Zheng
- State Key Laboratory of Coordination Chemistry, Chemistry and Biomedicine Innovation Center (ChemBIC), School of Chemistry and Chemical EngineeringNanjing UniversityNanjingPeople's Republic of China
| |
Collapse
|
5
|
Andreini C, Rosato A. Structural Bioinformatics and Deep Learning of Metalloproteins: Recent Advances and Applications. Int J Mol Sci 2022; 23:7684. [PMID: 35887033 PMCID: PMC9323969 DOI: 10.3390/ijms23147684] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 07/04/2022] [Accepted: 07/06/2022] [Indexed: 02/04/2023] Open
Abstract
All living organisms require metal ions for their energy production and metabolic and biosynthetic processes. Within cells, the metal ions involved in the formation of adducts interact with metabolites and macromolecules (proteins and nucleic acids). The proteins that require binding to one or more metal ions in order to be able to carry out their physiological function are called metalloproteins. About one third of all protein structures in the Protein Data Bank involve metalloproteins. Over the past few years there has been tremendous progress in the number of computational tools and techniques making use of 3D structural information to support the investigation of metalloproteins. This trend has been boosted by the successful applications of neural networks and machine/deep learning approaches in molecular and structural biology at large. In this review, we discuss recent advances in the development and availability of resources dealing with metalloproteins from a structure-based perspective. We start by addressing tools for the prediction of metal-binding sites (MBSs) using structural information on apo-proteins. Then, we provide an overview of the methods for and lessons learned from the structural comparison of MBSs in a fold-independent manner. We then move to describing databases of metalloprotein/MBS structures. Finally, we summarizing recent ML/DL applications enhancing the functional interpretation of metalloprotein structures.
Collapse
Affiliation(s)
- Claudia Andreini
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
- Magnetic Resonance Center (CERM), Department of Chemistry, University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
| | - Antonio Rosato
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
- Magnetic Resonance Center (CERM), Department of Chemistry, University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
| |
Collapse
|
6
|
A Comprehensive Review of Computation-Based Metal-Binding Prediction Approaches at the Residue Level. BIOMED RESEARCH INTERNATIONAL 2022; 2022:8965712. [PMID: 35402609 PMCID: PMC8989566 DOI: 10.1155/2022/8965712] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 03/04/2022] [Indexed: 12/29/2022]
Abstract
Clear evidence has shown that metal ions strongly connect and delicately tune the dynamic homeostasis in living bodies. They have been proved to be associated with protein structure, stability, regulation, and function. Even small changes in the concentration of metal ions can shift their effects from natural beneficial functions to harmful. This leads to degenerative diseases, malignant tumors, and cancers. Accurate characterizations and predictions of metalloproteins at the residue level promise informative clues to the investigation of intrinsic mechanisms of protein-metal ion interactions. Compared to biophysical or biochemical wet-lab technologies, computational methods provide open web interfaces of high-resolution databases and high-throughput predictors for efficient investigation of metal-binding residues. This review surveys and details 18 public databases of metal-protein binding. We collect a comprehensive set of 44 computation-based methods and classify them into four categories, namely, learning-, docking-, template-, and meta-based methods. We analyze the benchmark datasets, assessment criteria, feature construction, and algorithms. We also compare several methods on two benchmark testing datasets and include a discussion about currently publicly available predictive tools. Finally, we summarize the challenges and underlying limitations of the current studies and propose several prospective directions concerning the future development of the related databases and methods.
Collapse
|
7
|
Yu Y, Wang R, Teo RD. Machine Learning Approaches for Metalloproteins. Molecules 2022; 27:1277. [PMID: 35209064 PMCID: PMC8878495 DOI: 10.3390/molecules27041277] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 02/10/2022] [Accepted: 02/11/2022] [Indexed: 01/10/2023] Open
Abstract
Metalloproteins are a family of proteins characterized by metal ion binding, whereby the presence of these ions confers key catalytic and ligand-binding properties. Due to their ubiquity among biological systems, researchers have made immense efforts to predict the structural and functional roles of metalloproteins. Ultimately, having a comprehensive understanding of metalloproteins will lead to tangible applications, such as designing potent inhibitors in drug discovery. Recently, there has been an acceleration in the number of studies applying machine learning to predict metalloprotein properties, primarily driven by the advent of more sophisticated machine learning algorithms. This review covers how machine learning tools have consolidated and expanded our comprehension of various aspects of metalloproteins (structure, function, stability, ligand-binding interactions, and inhibitors). Future avenues of exploration are also discussed.
Collapse
Affiliation(s)
- Yue Yu
- Division of Natural and Applied Sciences, Duke Kunshan University, Kunshan, Jiangsu 215316, China;
- Department of Physics, Duke University, Durham, NC 27708, USA
| | - Ruobing Wang
- Department of Chemistry, Duke University, Durham, NC 27708, USA;
| | - Ruijie D. Teo
- Department of Chemistry, Duke University, Durham, NC 27708, USA;
- UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
8
|
Nguyen H, Kleingardner J. Identifying metal binding amino acids based on backbone geometries as a tool for metalloprotein engineering. Protein Sci 2021; 30:1247-1257. [PMID: 33829594 DOI: 10.1002/pro.4074] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 04/01/2021] [Accepted: 04/02/2021] [Indexed: 01/03/2023]
Abstract
Metal cofactors within proteins perform a versatile set of essential cellular functions. In order to take advantage of the diverse functionality of metalloproteins, researchers have been working to design or modify metal binding sites in proteins to rationally tune the function or activity of the metal cofactor. This study has performed an analysis on the backbone atom geometries of metal-binding amino acids among 10 different metal binding sites within the entire protein data bank. A set of 13 geometric parameters (features) was identified that is capable of predicting the presence of a metal cofactor in the protein structure with overall accuracies of up to 97% given only the relative positions of their backbone atoms. The decision tree machine-learning algorithm used can quickly analyze an entire protein structure for the presence of sets of primary metal coordination spheres upon mutagenesis, independent of their original amino acid identities. The methodology was designed for application in the field of metalloprotein engineering. A cluster analysis using the data set was also performed and demonstrated that the features chosen are useful for identifying clusters of structurally similar metal-binding sites.
Collapse
Affiliation(s)
- Hoang Nguyen
- Department of Computer Science, University of Illinois at Chicago, Chicago, Illinois, USA
| | - Jesse Kleingardner
- Department of Chemistry and Biochemistry, Messiah University, Mechanicsburg, Pennsylvania, USA
| |
Collapse
|