1
|
Satalkar V, Degaga GD, Li W, Pang YT, McShan AC, Gumbart JC, Mitchell JC, Torres MP. Generative β-hairpin design using a residue-based physicochemical property landscape. Biophys J 2024; 123:2790-2806. [PMID: 38297834 PMCID: PMC11393682 DOI: 10.1016/j.bpj.2024.01.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 12/20/2023] [Accepted: 01/25/2024] [Indexed: 02/02/2024] Open
Abstract
De novo peptide design is a new frontier that has broad application potential in the biological and biomedical fields. Most existing models for de novo peptide design are largely based on sequence homology that can be restricted based on evolutionarily derived protein sequences and lack the physicochemical context essential in protein folding. Generative machine learning for de novo peptide design is a promising way to synthesize theoretical data that are based on, but unique from, the observable universe. In this study, we created and tested a custom peptide generative adversarial network intended to design peptide sequences that can fold into the β-hairpin secondary structure. This deep neural network model is designed to establish a preliminary foundation of the generative approach based on physicochemical and conformational properties of 20 canonical amino acids, for example, hydrophobicity and residue volume, using extant structure-specific sequence data from the PDB. The beta generative adversarial network model robustly distinguishes secondary structures of β hairpin from α helix and intrinsically disordered peptides with an accuracy of up to 96% and generates artificial β-hairpin peptide sequences with minimum sequence identities around 31% and 50% when compared against the current NCBI PDB and nonredundant databases, respectively. These results highlight the potential of generative models specifically anchored by physicochemical and conformational property features of amino acids to expand the sequence-to-structure landscape of proteins beyond evolutionary limits.
Collapse
Affiliation(s)
- Vardhan Satalkar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | - Gemechis D Degaga
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee
| | - Wei Li
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | - Yui Tik Pang
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia
| | - Andrew C McShan
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia
| | - James C Gumbart
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia; School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia
| | - Julie C Mitchell
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee.
| | - Matthew P Torres
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia; School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia.
| |
Collapse
|
2
|
Senthil R, Anand T, Somala CS, Saravanan KM. Bibliometric analysis of artificial intelligence in healthcare research: Trends and future directions. Future Healthc J 2024; 11:100182. [PMID: 39310219 PMCID: PMC11414662 DOI: 10.1016/j.fhj.2024.100182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 08/06/2024] [Accepted: 08/30/2024] [Indexed: 09/25/2024]
Abstract
Objective The presence of artificial intelligence (AI) in healthcare is a powerful and game-changing force that is completely transforming the industry as a whole. Using sophisticated algorithms and data analytics, AI has unparalleled prospects for improving patient care, streamlining operational efficiency, and fostering innovation across the healthcare ecosystem. This study conducts a comprehensive bibliometric analysis of research on AI in healthcare, utilising the SCOPUS database as the primary data source. Methods Preliminary findings from 2013 identified 153 publications on AI and healthcare. Between 2019 and 2023, the number of publications increased exponentially, indicating significant growth and development in the field. The analysis employs various bibliometric indicators to assess research production performance, science mapping techniques, and thematic mapping analysis. Results The study reveals insights into research hotspots, thematic focus, and emerging trends in AI and healthcare research. Based on an extensive examination of the Scopus database provides a brief overview and suggests potential avenues for further investigation. Conclusion This article provides valuable contributions to understanding the current landscape of AI in healthcare, offering insights for future research directions and informing strategic decision making in the field.
Collapse
Affiliation(s)
- Renganathan Senthil
- Department of Bioinformatics, School of Lifesciences, Vels Institute of Science Technology and Advanced Studies (VISTAS), Pallavaram, Chennai 600117, Tamil Nadu, India
| | - Thirunavukarasou Anand
- SRIIC Lab, Faculty of Clinical Research, Sri Ramachandra Institute of Higher Education and Research, Chennai 600116, Tamil Nadu, India
- B Aatral Biosciences Private Limited, Bangalore 560091, Karnataka, India
| | | | - Konda Mani Saravanan
- B Aatral Biosciences Private Limited, Bangalore 560091, Karnataka, India
- Department of Biotechnology, Bharath Institute of Higher Education and Research, Chennai 600073, Tamil Nadu, India
| |
Collapse
|
3
|
Wang Q, Hu X, Wei Z, Lu H, Liu H. Reinforcement learning-driven exploration of peptide space: accelerating generation of drug-like peptides. Brief Bioinform 2024; 25:bbae444. [PMID: 39256196 PMCID: PMC11387070 DOI: 10.1093/bib/bbae444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 08/05/2024] [Accepted: 08/27/2024] [Indexed: 09/12/2024] Open
Abstract
Using amino acid residues in peptide generation has solved several key problems, including precise control of amino acid sequence order, customized peptides for property modification, and large-scale peptide synthesis. Proteins contain unknown amino acid residues. Extracting them for the synthesis of drug-like peptides can create novel structures with unique properties, driving drug development. Computer-aided design of novel peptide drug molecules can solve the high-cost and low-efficiency problems in the traditional drug discovery process. Previous studies faced limitations in enhancing the bioactivity and drug-likeness of polypeptide drugs due to less emphasis on the connection relationships in amino acid structures. Thus, we proposed a reinforcement learning-driven generation model based on graph attention mechanisms for peptide generation. By harnessing the advantages of graph attention mechanisms, this model effectively captured the connectivity structures between amino acid residues in peptides. Simultaneously, leveraging reinforcement learning's strength in guiding optimal sequence searches provided a novel approach to peptide design and optimization. This model introduces an actor-critic framework with real-time feedback loops to achieve dynamic balance between attributes, which can customize the generation of multiple peptides for specific targets and enhance the affinity between peptides and targets. Experimental results demonstrate that the generated drug-like peptides meet specified absorption, distribution, metabolism, excretion, and toxicity properties and bioactivity with a success rate of over 90$\%$, thereby significantly accelerating the process of drug-like peptide generation.
Collapse
Affiliation(s)
- Qian Wang
- College of Computer Science and Technology, Ocean University of China, 238 Songling Rd, 266100 Shandong, China
| | - Xiaotong Hu
- College of Computer Science and Technology, Ocean University of China, 238 Songling Rd, 266100 Shandong, China
| | - Zhiqiang Wei
- College of Computer Science and Technology, Ocean University of China, 238 Songling Rd, 266100 Shandong, China
| | - Hao Lu
- College of Computer Science and Technology, Ocean University of China, 238 Songling Rd, 266100 Shandong, China
| | - Hao Liu
- College of Computer Science and Technology, Ocean University of China, 238 Songling Rd, 266100 Shandong, China
| |
Collapse
|
4
|
Bhattarai S, Tayara H, Chong KT. Advancing Peptide-Based Cancer Therapy with AI: In-Depth Analysis of State-of-the-Art AI Models. J Chem Inf Model 2024; 64:4941-4957. [PMID: 38874445 DOI: 10.1021/acs.jcim.4c00295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
Anticancer peptides (ACPs) play a vital role in selectively targeting and eliminating cancer cells. Evaluating and comparing predictions from various machine learning (ML) and deep learning (DL) techniques is challenging but crucial for anticancer drug research. We conducted a comprehensive analysis of 15 ML and 10 DL models, including the models released after 2022, and found that support vector machines (SVMs) with feature combination and selection significantly enhance overall performance. DL models, especially convolutional neural networks (CNNs) with light gradient boosting machine (LGBM) based feature selection approaches, demonstrate improved characterization. Assessment using a new test data set (ACP10) identifies ACPred, MLACP 2.0, AI4ACP, mACPred, and AntiCP2.0_AAC as successive optimal predictors, showcasing robust performance. Our review underscores current prediction tool limitations and advocates for an omnidirectional ACP prediction framework to propel ongoing research.
Collapse
Affiliation(s)
- Sadik Bhattarai
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju-si, 54896 Jeollabuk-do, South Korea
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju-si, 54896 Jeollabuk-do, South Korea
| | - Kil To Chong
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju-si, 54896 Jeollabuk-do, South Korea
- Advanced Electronics and Information Research Center, Jeonbuk National University, Jeonju-si, 54896 Jeollabuk-do, South Korea
| |
Collapse
|
5
|
Yin K, Xu W, Ren S, Xu Q, Zhang S, Zhang R, Jiang M, Zhang Y, Xu D, Li R. Machine Learning Accelerates De Novo Design of Antimicrobial Peptides. Interdiscip Sci 2024; 16:392-403. [PMID: 38416364 DOI: 10.1007/s12539-024-00612-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 01/17/2024] [Accepted: 01/23/2024] [Indexed: 02/29/2024]
Abstract
Efficient and precise design of antimicrobial peptides (AMPs) is of great importance in the field of AMP development. Computing provides opportunities for peptide de novo design. In the present investigation, a new machine learning-based AMP prediction model, AP_Sin, was trained using 1160 AMP sequences and 1160 non-AMP sequences. The results showed that AP_Sin correctly classified 94.61% of AMPs on a comprehensive dataset, outperforming the mainstream and open-source models (Antimicrobial Peptide Scanner vr.2, iAMPpred and AMPlify) and being effective in identifying AMPs. In addition, a peptide sequence generator, AP_Gen, was devised based on the concept of recombining dominant amino acids and dipeptide compositions. After inputting the parameters of the 71 tridecapeptides from antimicrobial peptides database (APD3) into AP_Gen, a tridecapeptide bank consisting of de novo designed 17,496 tridecapeptide sequences were randomly generated, from which 2675 candidate AMP sequences were identified by AP_Sin. Chemical synthesis was performed on 180 randomly selected candidate AMP sequences, of which 18 showed high antimicrobial activities against a wide range of the tested pathogenic microorganisms, and 16 of which had a minimal inhibitory concentration of less than 10 μg/mL against at least one of the tested pathogenic microorganisms. The method established in this research accelerates the discovery of valuable candidate AMPs and provides a novel approach for de novo design of antimicrobial peptides.
Collapse
Affiliation(s)
- Kedong Yin
- Key Laboratory of Functional Molecules for Biomedical Research, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
- College of Information Science and Engineering, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
| | - Wen Xu
- Key Laboratory of Functional Molecules for Biomedical Research, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China.
- Law College, Henan University of Technology, Zhengzhou, 450001, Henan, People's Republic of China.
| | - Shiming Ren
- Key Laboratory of Functional Molecules for Biomedical Research, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
- College of Biological Engineering, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
| | - Qingpeng Xu
- Key Laboratory of Functional Molecules for Biomedical Research, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
- School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou, 450001, Henan, People's Republic of China
| | - Shaojie Zhang
- Key Laboratory of Functional Molecules for Biomedical Research, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
- College of Biological Engineering, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
| | - Ruiling Zhang
- Key Laboratory of Functional Molecules for Biomedical Research, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China
- School of Economics and Trade, Henan University of Technology, Zhengzhou, 450001, Henan, People's Republic of China
| | - Mengwan Jiang
- School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou, 450001, Henan, People's Republic of China
| | - Yuhong Zhang
- School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou, 450001, Henan, People's Republic of China
| | - Degang Xu
- College of Information Science and Engineering, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China.
| | - Ruifang Li
- Key Laboratory of Functional Molecules for Biomedical Research, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China.
- College of Biological Engineering, Henan University of Technology, 100 Lianhua Street, Zhengzhou, 450001, Henan, People's Republic of China.
| |
Collapse
|
6
|
Saravanan KM, Wan JF, Dai L, Zhang J, Zhang JZH, Zhang H. A deep learning based multi-model approach for predicting drug-like chemical compound's toxicity. Methods 2024; 226:164-175. [PMID: 38702021 DOI: 10.1016/j.ymeth.2024.04.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 04/01/2024] [Accepted: 04/28/2024] [Indexed: 05/06/2024] Open
Abstract
Ensuring the safety and efficacy of chemical compounds is crucial in small-molecule drug development. In the later stages of drug development, toxic compounds pose a significant challenge, losing valuable resources and time. Early and accurate prediction of compound toxicity using deep learning models offers a promising solution to mitigate these risks during drug discovery. In this study, we present the development of several deep-learning models aimed at evaluating different types of compound toxicity, including acute toxicity, carcinogenicity, hERG_cardiotoxicity (the human ether-a-go-go related gene caused cardiotoxicity), hepatotoxicity, and mutagenicity. To address the inherent variations in data size, label type, and distribution across different types of toxicity, we employed diverse training strategies. Our first approach involved utilizing a graph convolutional network (GCN) regression model to predict acute toxicity, which achieved notable performance with Pearson R 0.76, 0.74, and 0.65 for intraperitoneal, intravenous, and oral administration routes, respectively. Furthermore, we trained multiple GCN binary classification models, each tailored to a specific type of toxicity. These models exhibited high area under the curve (AUC) scores, with an impressive AUC of 0.69, 0.77, 0.88, and 0.79 for predicting carcinogenicity, hERG_cardiotoxicity, mutagenicity, and hepatotoxicity, respectively. Additionally, we have used the approved drug dataset to determine the appropriate threshold value for the prediction score in model usage. We integrated these models into a virtual screening pipeline to assess their effectiveness in identifying potential low-toxicity drug candidates. Our findings indicate that this deep learning approach has the potential to significantly reduce the cost and risk associated with drug development by expediting the selection of compounds with low toxicity profiles. Therefore, the models developed in this study hold promise as critical tools for early drug candidate screening and selection.
Collapse
Affiliation(s)
- Konda Mani Saravanan
- Department of Biotechnology, Bharath Institute of Higher Education and Research, Chennai 600073, Tamil Nadu, India
| | - Jiang-Fan Wan
- Guangdong-Hong Kong-Macao Greater Bay Area Center for Drug Evaluation and Inspection of NMPA, Shenzhen 518000, China
| | - Liujiang Dai
- Guangdong Immune Cell Therapy Engineering and Technology Research Center, Center for Protein and Cell-Based Drugs, Institute of Biomedicine and Biotechnology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Jiajun Zhang
- Faculty of Synthetic Biology and Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; College of Science, Hunan University of Technology and Business, Changsha 410205, China
| | - John Z H Zhang
- Faculty of Synthetic Biology and Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Haiping Zhang
- Faculty of Synthetic Biology and Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
| |
Collapse
|
7
|
Scalzitti N, Miralavy I, Korenchan DE, Farrar CT, Gilad AA, Banzhaf W. Computational peptide discovery with a genetic programming approach. J Comput Aided Mol Des 2024; 38:17. [PMID: 38570405 PMCID: PMC11416381 DOI: 10.1007/s10822-024-00558-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 03/07/2024] [Indexed: 04/05/2024]
Abstract
The development of peptides for therapeutic targets or biomarkers for disease diagnosis is a challenging task in protein engineering. Current approaches are tedious, often time-consuming and require complex laboratory data due to the vast search spaces that need to be considered. In silico methods can accelerate research and substantially reduce costs. Evolutionary algorithms are a promising approach for exploring large search spaces and can facilitate the discovery of new peptides. This study presents the development and use of a new variant of the genetic-programming-based POET algorithm, called POETRegex , where individuals are represented by a list of regular expressions. This algorithm was trained on a small curated dataset and employed to generate new peptides improving the sensitivity of peptides in magnetic resonance imaging with chemical exchange saturation transfer (CEST). The resulting model achieves a performance gain of 20% over the initial POET models and is able to predict a candidate peptide with a 58% performance increase compared to the gold-standard peptide. By combining the power of genetic programming with the flexibility of regular expressions, new peptide targets were identified that improve the sensitivity of detection by CEST. This approach provides a promising research direction for the efficient identification of peptides with therapeutic or diagnostic potential.
Collapse
Affiliation(s)
- Nicolas Scalzitti
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, USA
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Iliya Miralavy
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, USA
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - David E Korenchan
- Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Christian T Farrar
- Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Assaf A Gilad
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, USA.
- Department of Chemical Engineering, Michigan State University, East Lansing, MI, USA.
- Department of Radiology, Michigan State University, East Lansing, MI, USA.
| | - Wolfgang Banzhaf
- BEACON Center of Evolution in Action, Michigan State University, East Lansing, MI, USA.
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA.
| |
Collapse
|
8
|
Wu X, Lin H, Bai R, Duan H. Deep learning for advancing peptide drug development: Tools and methods in structure prediction and design. Eur J Med Chem 2024; 268:116262. [PMID: 38387334 DOI: 10.1016/j.ejmech.2024.116262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 02/06/2024] [Accepted: 02/17/2024] [Indexed: 02/24/2024]
Abstract
Peptides can bind challenging disease targets with high affinity and specificity, offering enormous opportunities for addressing unmet medical needs. However, peptides' unique features, including smaller size, increased structural flexibility, and limited data availability, pose additional challenges to the design process compared to proteins. This review explores the dynamic field of peptide therapeutics, leveraging deep learning to enhance structure prediction and design. Our exploration encompasses various facets of peptide research, ranging from dataset curation handling to model development. As deep learning technologies become more refined, we channel our efforts into peptide structure prediction and design, aligning with the fundamental principles of structure-activity relationships in drug development. To guide researchers in harnessing the potential of deep learning to advance peptide drug development, our insights comprehensively explore current challenges and future directions of peptide therapeutics.
Collapse
Affiliation(s)
- Xinyi Wu
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, 310014, PR China
| | - Huitian Lin
- College of Pharmaceutical Sciences, Zhejiang University of Technology, Hangzhou, 310014, PR China
| | - Renren Bai
- School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, PR China.
| | - Hongliang Duan
- Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, PR China.
| |
Collapse
|
9
|
Purohit K, Reddy N, Sunna A. Exploring the Potential of Bioactive Peptides: From Natural Sources to Therapeutics. Int J Mol Sci 2024; 25:1391. [PMID: 38338676 PMCID: PMC10855437 DOI: 10.3390/ijms25031391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 01/18/2024] [Accepted: 01/21/2024] [Indexed: 02/12/2024] Open
Abstract
Bioactive peptides, specific protein fragments with positive health effects, are gaining traction in drug development for advantages like enhanced penetration, low toxicity, and rapid clearance. This comprehensive review navigates the intricate landscape of peptide science, covering discovery to functional characterization. Beginning with a peptidomic exploration of natural sources, the review emphasizes the search for novel peptides. Extraction approaches, including enzymatic hydrolysis, microbial fermentation, and specialized methods for disulfide-linked peptides, are extensively covered. Mass spectrometric analysis techniques for data acquisition and identification, such as liquid chromatography, capillary electrophoresis, untargeted peptide analysis, and bioinformatics, are thoroughly outlined. The exploration of peptide bioactivity incorporates various methodologies, from in vitro assays to in silico techniques, including advanced approaches like phage display and cell-based assays. The review also discusses the structure-activity relationship in the context of antimicrobial peptides (AMPs), ACE-inhibitory peptides (ACEs), and antioxidative peptides (AOPs). Concluding with key findings and future research directions, this interdisciplinary review serves as a comprehensive reference, offering a holistic understanding of peptides and their potential therapeutic applications.
Collapse
Affiliation(s)
- Kruttika Purohit
- School of Natural Sciences, Macquarie University, Sydney, NSW 2109, Australia;
- Australian Research Council Industrial Transformation Training Centre for Facilitated Advancement of Australia’s Bioactives (FAAB), Sydney, NSW 2109, Australia;
| | - Narsimha Reddy
- Australian Research Council Industrial Transformation Training Centre for Facilitated Advancement of Australia’s Bioactives (FAAB), Sydney, NSW 2109, Australia;
- School of Science, Parramatta Campus, Western Sydney University, Penrith, NSW 2751, Australia
| | - Anwar Sunna
- School of Natural Sciences, Macquarie University, Sydney, NSW 2109, Australia;
- Australian Research Council Industrial Transformation Training Centre for Facilitated Advancement of Australia’s Bioactives (FAAB), Sydney, NSW 2109, Australia;
- Biomolecular Discovery Research Centre, Macquarie University, Sydney, NSW 2109, Australia
| |
Collapse
|
10
|
Fernandes FC, Cardoso MH, Gil-Ley A, Luchi LV, da Silva MGL, Macedo MLR, de la Fuente-Nunez C, Franco OL. Geometric deep learning as a potential tool for antimicrobial peptide prediction. FRONTIERS IN BIOINFORMATICS 2023; 3:1216362. [PMID: 37521317 PMCID: PMC10374423 DOI: 10.3389/fbinf.2023.1216362] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 06/13/2023] [Indexed: 08/01/2023] Open
Abstract
Antimicrobial peptides (AMPs) are components of natural immunity against invading pathogens. They are polymers that fold into a variety of three-dimensional structures, enabling their function, with an underlying sequence that is best represented in a non-flat space. The structural data of AMPs exhibits non-Euclidean characteristics, which means that certain properties, e.g., differential manifolds, common system of coordinates, vector space structure, or translation-equivariance, along with basic operations like convolution, in non-Euclidean space are not distinctly established. Geometric deep learning (GDL) refers to a category of machine learning methods that utilize deep neural models to process and analyze data in non-Euclidean settings, such as graphs and manifolds. This emerging field seeks to expand the use of structured models to these domains. This review provides a detailed summary of the latest developments in designing and predicting AMPs utilizing GDL techniques and also discusses both current research gaps and future directions in the field.
Collapse
Affiliation(s)
- Fabiano C. Fernandes
- Centro de Análises Proteômicas e Bioquímicas, Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, Brazil
- Departamento de Ciência da Computação, Instituto Federal de Brasília, Brasília, Brazil
| | - Marlon H. Cardoso
- Centro de Análises Proteômicas e Bioquímicas, Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, Brazil
- S-Inova Biotech, Programa de Pós-Graduação em Biotecnologia, Universidade Católica Dom Bosco, Campo Grande, Brazil
- Laboratório de Purificação de Proteínas e suas Funções Biológicas, Universidade Federal de Mato Grosso do Sul, Cidade Universitária, Campo Grande, Mato Grosso do Sul, Brazil
| | - Abel Gil-Ley
- S-Inova Biotech, Programa de Pós-Graduação em Biotecnologia, Universidade Católica Dom Bosco, Campo Grande, Brazil
| | - Lívia V. Luchi
- S-Inova Biotech, Programa de Pós-Graduação em Biotecnologia, Universidade Católica Dom Bosco, Campo Grande, Brazil
| | - Maria G. L. da Silva
- Centro de Análises Proteômicas e Bioquímicas, Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, Brazil
| | - Maria L. R. Macedo
- Laboratório de Purificação de Proteínas e suas Funções Biológicas, Universidade Federal de Mato Grosso do Sul, Cidade Universitária, Campo Grande, Mato Grosso do Sul, Brazil
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Perelman School of Medicine, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, University of Pennsylvania, Philadelphia, PA, United States
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, United States
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, United States
| | - Octavio L. Franco
- Centro de Análises Proteômicas e Bioquímicas, Pós-Graduação em Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, Brasília, Brazil
- S-Inova Biotech, Programa de Pós-Graduação em Biotecnologia, Universidade Católica Dom Bosco, Campo Grande, Brazil
| |
Collapse
|
11
|
Zhang H, Saravanan KM, Zhang JZH. DeepBindGCN: Integrating Molecular Vector Representation with Graph Convolutional Neural Networks for Protein-Ligand Interaction Prediction. Molecules 2023; 28:4691. [PMID: 37375246 PMCID: PMC10301867 DOI: 10.3390/molecules28124691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 06/08/2023] [Accepted: 06/09/2023] [Indexed: 06/29/2023] Open
Abstract
The core of large-scale drug virtual screening is to select the binders accurately and efficiently with high affinity from large libraries of small molecules in which non-binders are usually dominant. The binding affinity is significantly influenced by the protein pocket, ligand spatial information, and residue types/atom types. Here, we used the pocket residues or ligand atoms as the nodes and constructed edges with the neighboring information to comprehensively represent the protein pocket or ligand information. Moreover, the model with pre-trained molecular vectors performed better than the one-hot representation. The main advantage of DeepBindGCN is that it is independent of docking conformation, and concisely keeps the spatial information and physical-chemical features. Using TIPE3 and PD-L1 dimer as proof-of-concept examples, we proposed a screening pipeline integrating DeepBindGCN and other methods to identify strong-binding-affinity compounds. It is the first time a non-complex-dependent model has achieved a root mean square error (RMSE) value of 1.4190 and Pearson r value of 0.7584 in the PDBbind v.2016 core set, respectively, thereby showing a comparable prediction power with the state-of-the-art affinity prediction models that rely upon the 3D complex. DeepBindGCN provides a powerful tool to predict the protein-ligand interaction and can be used in many important large-scale virtual screening application scenarios.
Collapse
Affiliation(s)
- Haiping Zhang
- Shenzhen Institute of Synthetic Biology, Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Konda Mani Saravanan
- Department of Biotechnology, Bharath Institute of Higher Education and Research, Chennai 600073, Tamil Nadu, India;
| | - John Z. H. Zhang
- Shenzhen Institute of Synthetic Biology, Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|