1
|
Li L, Zhou L, Jiang C, Liu Z, Meng D, Luo F, He Q, Yin H. AI-driven pan-proteome analyses reveal insights into the biohydrometallurgical properties of Acidithiobacillia. Front Microbiol 2023; 14:1243987. [PMID: 37744906 PMCID: PMC10512742 DOI: 10.3389/fmicb.2023.1243987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 08/21/2023] [Indexed: 09/26/2023] Open
Abstract
Microorganism-mediated biohydrometallurgy, a sustainable approach for metal recovery from ores, relies on the metabolic activity of acidophilic bacteria. Acidithiobacillia with sulfur/iron-oxidizing capacities are extensively studied and applied in biohydrometallurgy-related processes. However, only 14 distinct proteins from Acidithiobacillia have experimentally determined structures currently available. This significantly hampers in-depth investigations of Acidithiobacillia's structure-based biological mechanisms pertaining to its relevant biohydrometallurgical processes. To address this issue, we employed a state-of-the-art artificial intelligence (AI)-driven approach, with a median model confidence of 0.80, to perform high-quality full-chain structure predictions on the pan-proteome (10,458 proteins) of the type strain Acidithiobacillia. Additionally, we conducted various case studies on de novo protein structural prediction, including sulfate transporter and iron oxidase, to demonstrate how accurate structure predictions and gene co-occurrence networks can contribute to the development of mechanistic insights and hypotheses regarding sulfur and iron utilization proteins. Furthermore, for the unannotated proteins that constitute 35.8% of the Acidithiobacillia proteome, we employed the deep-learning algorithm DeepFRI to make structure-based functional predictions. As a result, we successfully obtained gene ontology (GO) terms for 93.6% of these previously unknown proteins. This study has a significant impact on improving protein structure and function predictions, as well as developing state-of-the-art techniques for high-throughput analysis of large proteomic data.
Collapse
Affiliation(s)
- Liangzhi Li
- School of Minerals Processing and Bioengineering, Central South University, Changsha, China
- Key Laboratory of Biometallurgy of Ministry of Education, Central South University, Changsha, China
| | - Lei Zhou
- Beijing Research Institute of Chemical Engineering and Metallurgy, Beijing, China
| | - Chengying Jiang
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zhenghua Liu
- School of Minerals Processing and Bioengineering, Central South University, Changsha, China
- Key Laboratory of Biometallurgy of Ministry of Education, Central South University, Changsha, China
| | - Delong Meng
- School of Minerals Processing and Bioengineering, Central South University, Changsha, China
- Key Laboratory of Biometallurgy of Ministry of Education, Central South University, Changsha, China
| | - Feng Luo
- School of Computing, Clemson University, Clemson, SC, United States
| | - Qiang He
- Department of Civil and Environmental Engineering, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Huaqun Yin
- School of Minerals Processing and Bioengineering, Central South University, Changsha, China
- Key Laboratory of Biometallurgy of Ministry of Education, Central South University, Changsha, China
| |
Collapse
|
2
|
Guterres H, Im W. CHARMM-GUI-Based Induced Fit Docking Workflow to Generate Reliable Protein-Ligand Binding Modes. J Chem Inf Model 2023; 63:4772-4779. [PMID: 37462607 PMCID: PMC10428204 DOI: 10.1021/acs.jcim.3c00416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Indexed: 08/15/2023]
Abstract
Molecular docking is a preferred method to predict ligand binding modes and their binding energy to target protein receptors, which is critical in early phase structure-based drug discovery. However, there is a persistent challenge in docking that can be attributed to the induced fit effect, as receptor binding sites undergo induced fit conformational changes upon ligand binding to achieve better binding modes. In this work, based on CHARMM-GUI LBS Finder& Refiner and High-Throughput Simulator, we present a straightforward CHARMM-GUI induced fit docking (CGUI-IFD) workflow to generate reliable protein-ligand binding modes. The CGUI-IFD workflow generates an ensemble of receptor binding site conformations through ligand-binding site (LBS) refinement, runs rigid receptor docking, and performs high-throughput molecular dynamics (MD) simulations of protein-ligand complex structures in explicit solvents. The results are evaluated based on the ligand root-mean-square deviation (RMSD)-based binding stability and the molecular mechanics generalized Born surface area binding energy. For a benchmark test, we used 258 cross-docking protein-ligand pairs across 41 target proteins from the Schrodinger IFD-MD data set. The application of CGUI-IFD on this data set shows 80% success rate (within 2.5 Å RMSD from the experimental structures). We expect that the CGUI-IFD workflow can be useful to generate reliable ligand binding modes for cross-docking cases.
Collapse
Affiliation(s)
- Hugo Guterres
- Departments of Biological
Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Wonpil Im
- Departments of Biological
Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| |
Collapse
|
3
|
Jiang W. Enhanced Configurational Sampling Approaches to Alchemical Ligand Binding Free Energy Simulations: Current Status and Challenges. J Phys Chem B 2023; 127:6835-6841. [PMID: 37499215 DOI: 10.1021/acs.jpcb.3c02020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Ligand binding free energy simulations (LB-FES) have been routine tasks in modern drug discovery campaign. A long-standing challenge for LB-FES is the difficulty in adequately sampling nontrivial environmental reorganizations in response to ligand binding. Therefore, various enhanced configurational sampling (ECS) approaches were devised to speed up fluctuations of relevant slow degrees of freedom (SDOF) and ensure simulation convergence. However, in contrast to the achievements in parametrization, software performance, and workflow automation, efficient ECS methodology suitable for high throughput screening remains in an early stage of development. Here, a review of ECS developments with LB-FES is presented, revisiting current approaches and underlining the major technical pitfalls and challenges. This Perspective focuses on alchemical LB-FES on account of their predominant role in high throughput drug screening as well as the established partnership with ECS. The critical aspects of designing ECS approaches, from both theoretical and applied perspectives, are described. This work is intended to provide a contemporary review of the scientific, technical, and practical issues associated with the accelerating convergence of alchemical LB-FES.
Collapse
Affiliation(s)
- Wei Jiang
- Computational Science Division, Argonne National Laboratory, 9700 South Cass Avenue, Building 240, Argonne, Illinois 60439, United States
| |
Collapse
|
4
|
Buckner J, Liu X, Chakravorty A, Wu Y, Cervantes LF, Lai TT, Brooks CL. pyCHARMM: Embedding CHARMM Functionality in a Python Framework. J Chem Theory Comput 2023; 19:3752-3762. [PMID: 37267404 PMCID: PMC10504603 DOI: 10.1021/acs.jctc.3c00364] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
CHARMM is rich in methodology and functionality as one of the first programs addressing problems of molecular dynamics and modeling of biological macromolecules and their partners, e.g., small molecule ligands. When combined with the highly developed CHARMM parameters for proteins, nucleic acids, small molecules, lipids, sugars, and other biologically relevant building blocks, and the versatile CHARMM scripting language, CHARMM has been a trendsetting platform for modeling studies of biological macromolecules. To further enhance the utility of accessing and using CHARMM functionality in increasingly complex workflows associated with modeling biological systems, we introduce pyCHARMM, Python bindings, functions, and modules to complement and extend the extensive set of modeling tools and methods already available in CHARMM. These include access to CHARMM function-generated variables associated with the system (psf), coordinates, velocities and forces, atom selection variables, and force field related parameters. The ability to augment CHARMM forces and energies with energy terms or methods derived from machine learning or other sources, written in Python, CUDA, or OpenCL and expressed as Python callable routines is introduced together with analogous functions callable during dynamics calculations. Integration of Python-based graphical engines for visualization of simulation models and results is also accessible. Loosely coupled parallelism is available for workflows such as free energy calculations, using MBAR/TI approaches or high-throughput multisite λ-dynamics (MSλD) free energy methods, string path optimization calculations, replica exchange, and molecular docking with a new Python-based CDOCKER module. CHARMM accelerated platform kernels through the CHARMM/OpenMM API, CHARMM/DOMDEC, and CHARMM/BLaDE API are also readily integrated into this Python framework. We anticipate that pyCHARMM will be a robust platform for the development of comprehensive and complex workflows utilizing Python and its extensive functionality as well as an optimal platform for users to learn molecular modeling methods and practices within a Python-friendly environment such as Jupyter Notebooks.
Collapse
Affiliation(s)
- Joshua Buckner
- Department of Chemistry, University of Michigan, Ann Arbor, MI
| | - Xiaorong Liu
- Department of Chemistry, University of Michigan, Ann Arbor, MI
| | | | - Yujin Wu
- Department of Chemistry, University of Michigan, Ann Arbor, MI
| | - Luis F. Cervantes
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI
| | - Thanh T. Lai
- Biophysics Program, University of Michigan, Ann Arbor, MI
| | - Charles L. Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, MI
- Biophysics Program, University of Michigan, Ann Arbor, MI
| |
Collapse
|
5
|
Exploring different computational approaches for effective diagnosis of breast cancer. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2023; 177:141-150. [PMID: 36509230 DOI: 10.1016/j.pbiomolbio.2022.11.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 10/28/2022] [Accepted: 11/10/2022] [Indexed: 12/13/2022]
Abstract
Breast cancer has been identified as one among the top causes of female death worldwide. According to recent research, earlier detection plays an important role toward fortunate medicaments and thus, decreasing the mortality rate due to breast cancer among females. This review provides a fleeting summary involving traditional diagnostic procedures from the past and today, and also modern computational tools that have greatly aided in the identification of breast cancer. Computational techniques involving different algorithms such as Support vector machines, deep learning techniques and robotics are popular among the academicians for detection of breast cancer. They discovered that Convolutional neural network was a common option for categorization among such approaches. Deep learning techniques are evaluated using performance indicators such as accuracy, sensitivity, specificity, or measure. Furthermore, molecular docking, homology modeling and Molecular dynamics Simulation gives a road map for future discussions about developing improved early detection approaches that holds greater potential in increasing the survival rate of cancer patients. The different computational techniques can be a new dominion among researchers and combating the challenges associated with breast cancer.
Collapse
|
6
|
Zhang J, Li H, Zhao X, Wu Q, Huang SY. Holo Protein Conformation Generation from Apo Structures by Ligand Binding Site Refinement. J Chem Inf Model 2022; 62:5806-5820. [PMID: 36342197 DOI: 10.1021/acs.jcim.2c00895] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
An important part in structure-based drug design is the selection of an appropriate protein structure. It has been revealed that a holo protein structure that contains a well-defined binding site is a much better choice than an apo structure in structure-based drug discovery. Therefore, it is valuable to obtain a holo-like protein conformation from apo structures in the case where no holo structure is available. Meeting the need, we present a robust approach to generate reliable holo-like structures from apo structures by ligand binding site refinement with restraints derived from holo templates with low homology. Our method was tested on a test set of 32 proteins from the DUD-E data set and compared with other approaches. It was shown that our method successfully refined the apo structures toward the corresponding holo conformations for 23 of 32 proteins, reducing the average all-heavy-atom RMSD of binding site residues by 0.48 Å. In addition, when evaluated against all the holo structures in the protein data bank, our method can improve the binding site RMSD for 14 of 19 cases that experience significant conformational changes. Furthermore, our refined structures also demonstrate their advantages over the apo structures in ligand binding mode predictions by both rigid docking and flexible docking and in virtual screening on the database of active and decoy ligands from the DUD-E. These results indicate that our method is effective in recovering holo-like conformations and will be valuable in structure-based drug discovery.
Collapse
Affiliation(s)
- Jinze Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan430074, Hubei, P. R. China
| | - Hao Li
- School of Physics, Huazhong University of Science and Technology, Wuhan430074, Hubei, P. R. China
| | - Xuejun Zhao
- School of Physics, Huazhong University of Science and Technology, Wuhan430074, Hubei, P. R. China
| | - Qilong Wu
- School of Physics, Huazhong University of Science and Technology, Wuhan430074, Hubei, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan430074, Hubei, P. R. China
| |
Collapse
|
7
|
Guterres H, Park S, Zhang H, Perone T, Kim J, Im W. CHARMM‐GUI
high‐throughput simulator
for efficient evaluation of protein–ligand interactions with different force fields. Protein Sci 2022. [DOI: 10.1002/pro.4413] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Hugo Guterres
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering Lehigh University Bethlehem Pennsylvania USA
| | - Sang‐Jun Park
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering Lehigh University Bethlehem Pennsylvania USA
| | - Han Zhang
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering Lehigh University Bethlehem Pennsylvania USA
| | - Thomas Perone
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering Lehigh University Bethlehem Pennsylvania USA
| | - Jongtaek Kim
- Department of Physics and Chemistry Korea Air Force Academy Cheongju South Korea
| | - Wonpil Im
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering Lehigh University Bethlehem Pennsylvania USA
| |
Collapse
|
8
|
Barnsley KK, Ondrechen MJ. Enzyme active sites: Identification and prediction of function using computational chemistry. Curr Opin Struct Biol 2022; 74:102384. [DOI: 10.1016/j.sbi.2022.102384] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 03/20/2022] [Accepted: 03/28/2022] [Indexed: 11/03/2022]
|
9
|
Guterres H, Park SJ, Cao Y, Im W. CHARMM-GUI Ligand Designer for Template-Based Virtual Ligand Design in a Binding Site. J Chem Inf Model 2021; 61:5336-5342. [PMID: 34757752 DOI: 10.1021/acs.jcim.1c01156] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Rational drug design involves a task of finding ligands that would bind to a specific target protein. This work presents CHARMM-GUI Ligand Designer that is an intuitive and interactive web-based tool to design virtual ligands that match the shape and chemical features of a given protein binding site. Ligand Designer provides ligand modification capabilities with 3D visualization that allow researchers to modify and redesign virtual ligands while viewing how the protein-ligand interactions are affected. Virtual ligands can also be parameterized for further molecular dynamics (MD) simulations and free energy calculations. Using 8 targets from 8 different protein classes in the directory of useful decoys, enhanced (DUD-E) data set, we show that Ligand Designer can produce similar ligands to the known active ligands in the crystal structures. Ligand Designer also produces stable protein-ligand complex structures when tested using short MD simulations. We expect that Ligand Designer can be a useful and user-friendly tool to design small molecules in any given potential ligand binding site on a protein of interest.
Collapse
Affiliation(s)
- Hugo Guterres
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Sang-Jun Park
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Yiwei Cao
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Wonpil Im
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| |
Collapse
|