1
|
Gu X, Zhang YA, Zhang S, Wang L, Ye X, Occhialini G, Barbour J, Pentelute BL, Wendlandt AE. Synthesis of non-canonical amino acids through dehydrogenative tailoring. Nature 2024; 634:352-358. [PMID: 39208846 DOI: 10.1038/s41586-024-07988-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 08/22/2024] [Indexed: 09/04/2024]
Abstract
Amino acids are essential building blocks in biology and chemistry. Whereas nature relies on a small number of amino acid structures, chemists desire access to a vast range of structurally diverse analogues1-3. The selective modification of amino acid side-chain residues represents an efficient strategy to access non-canonical derivatives of value in chemistry and biology. While semisynthetic methods leveraging the functional groups found in polar and aromatic amino acids have been extensively explored, highly selective and general approaches to transform unactivated C-H bonds in aliphatic amino acids remain less developed4,5. Here we disclose a stepwise dehydrogenative method to convert aliphatic amino acids into structurally diverse analogues. The key to the success of this approach lies in the development of a selective catalytic acceptorless dehydrogenation method driven by photochemical irradiation, which provides access to terminal alkene intermediates for downstream functionalization. Overall, this strategy enables the rapid synthesis of new amino acid building blocks and suggests possibilities for the late-stage modification of more complex oligopeptides.
Collapse
Affiliation(s)
- Xin Gu
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Yu-An Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Shuo Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Leon Wang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Xiyun Ye
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Gino Occhialini
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jonah Barbour
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Bradley L Pentelute
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Alison E Wendlandt
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
2
|
Das BK, Chowdhury A, Chatterjee S, Tripathi NM, Pati B, Dutta S, Bandyopadhyay A. Harnessing a bis-electrophilic boronic acid lynchpin for azaborolo thiazolidine (ABT) grafting in cyclic peptides. Chem Sci 2024:d4sc04348k. [PMID: 39144456 PMCID: PMC11320178 DOI: 10.1039/d4sc04348k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 07/24/2024] [Indexed: 08/16/2024] Open
Abstract
Chemical modifications of native peptides have significantly advanced modern drug discovery in recent decades. On this front, the installation of multitasking molecular grafts onto macrocyclic peptides offers numerous opportunities in biomedical applications. Here, we showcase a new class of borono-cyclic peptides featuring an azaborolo thiazolidine (ABT) graft, which can be readily assembled utilizing a bis-electrophilic boronic acid lynchpin while harnessing the inherent reactivity difference (>103 M-1 s-1) between the N-terminal cysteine and backbone cysteine for rapid and highly regioselective macrocyclization (∼1 h) under physiological conditions. The ABT-crosslinked peptides are fairly stable in endogenous environments, but can provide the linear diazaborine peptides via treatment with α-nucleophiles. This efficient peptide crosslinking protocol was further extended for regioselective bicyclizations and engineering of α-helical structures. Finally, ABT-grafted peptides were exploited in biorthogonal conjugation, leading to highly effective intracellular delivery of an apoptotic peptide (KLA) in cancer cells. The mechanism of action by which ABT-grafted KLA peptide induces apoptosis was also explored.
Collapse
Affiliation(s)
- Basab Kanti Das
- Biomimetic Peptide Engineering Laboratory, Department of Chemistry, Indian Institute of Technology Ropar Rupnagar Punjab 140001 India
| | - Arnab Chowdhury
- Biomimetic Peptide Engineering Laboratory, Department of Chemistry, Indian Institute of Technology Ropar Rupnagar Punjab 140001 India
| | - Saurav Chatterjee
- Biomimetic Peptide Engineering Laboratory, Department of Chemistry, Indian Institute of Technology Ropar Rupnagar Punjab 140001 India
| | - Nitesh Mani Tripathi
- Biomimetic Peptide Engineering Laboratory, Department of Chemistry, Indian Institute of Technology Ropar Rupnagar Punjab 140001 India
| | - Bibekananda Pati
- Biomimetic Peptide Engineering Laboratory, Department of Chemistry, Indian Institute of Technology Ropar Rupnagar Punjab 140001 India
| | - Soumit Dutta
- Biomimetic Peptide Engineering Laboratory, Department of Chemistry, Indian Institute of Technology Ropar Rupnagar Punjab 140001 India
| | - Anupam Bandyopadhyay
- Biomimetic Peptide Engineering Laboratory, Department of Chemistry, Indian Institute of Technology Ropar Rupnagar Punjab 140001 India
| |
Collapse
|
3
|
Orsi M, Reymond JL. Can large language models predict antimicrobial peptide activity and toxicity? RSC Med Chem 2024; 15:2030-2036. [PMID: 38911166 PMCID: PMC11187562 DOI: 10.1039/d4md00159a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 04/19/2024] [Indexed: 06/25/2024] Open
Abstract
Antimicrobial peptides (AMPs) are naturally occurring or designed peptides up to a few tens of amino acids which may help address the antimicrobial resistance crisis. However, their clinical development is limited by toxicity to human cells, a parameter which is very difficult to control. Given the similarity between peptide sequences and words, large language models (LLMs) might be able to predict AMP activity and toxicity. To test this hypothesis, we fine-tuned LLMs using data from the Database of Antimicrobial Activity and Structure of Peptides (DBAASP). GPT-3 performed well but not reproducibly for activity prediction and hemolysis, taken as a proxy for toxicity. The later GPT-3.5 performed more poorly and was surpassed by recurrent neural networks (RNN) trained on sequence-activity data or support vector machines (SVM) trained on MAP4C molecular fingerprint-activity data. These simpler models are therefore recommended, although the rapid evolution of LLMs warrants future re-evaluation of their prediction abilities.
Collapse
Affiliation(s)
- Markus Orsi
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| |
Collapse
|
4
|
Iwaniak A, Minkiewicz P, Darewicz M. Bioinformatics and bioactive peptides from foods: Do they work together? ADVANCES IN FOOD AND NUTRITION RESEARCH 2024; 108:35-111. [PMID: 38461003 DOI: 10.1016/bs.afnr.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2024]
Abstract
We live in the Big Data Era which affects many aspects of science, including research on bioactive peptides derived from foods, which during the last few decades have been a focus of interest for scientists. These two issues, i.e., the development of computer technologies and progress in the discovery of novel peptides with health-beneficial properties, are closely interrelated. This Chapter presents the example applications of bioinformatics for studying biopeptides, focusing on main aspects of peptide analysis as the starting point, including: (i) the role of peptide databases; (ii) aspects of bioactivity prediction; (iii) simulation of peptide release from proteins. Bioinformatics can also be used for predicting other features of peptides, including ADMET, QSAR, structure, and taste. To answer the question asked "bioinformatics and bioactive peptides from foods: do they work together?", currently it is almost impossible to find examples of peptide research with no bioinformatics involved. However, theoretical predictions are not equivalent to experimental work and always require critical scrutiny. The aspects of compatibility of in silico and in vitro results are also summarized herein.
Collapse
Affiliation(s)
- Anna Iwaniak
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland.
| | - Piotr Minkiewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| | - Małgorzata Darewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| |
Collapse
|
5
|
Olmedo DA, Durant-Archibold AA, López-Pérez JL, Medina-Franco JL. Design and Diversity Analysis of Chemical Libraries in Drug Discovery. Comb Chem High Throughput Screen 2024; 27:502-515. [PMID: 37409545 DOI: 10.2174/1386207326666230705150110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 05/30/2023] [Accepted: 05/30/2023] [Indexed: 07/07/2023]
Abstract
Chemical libraries and compound data sets are among the main inputs to start the drug discovery process at universities, research institutes, and the pharmaceutical industry. The approach used in the design of compound libraries, the chemical information they possess, and the representation of structures, play a fundamental role in the development of studies: chemoinformatics, food informatics, in silico pharmacokinetics, computational toxicology, bioinformatics, and molecular modeling to generate computational hits that will continue the optimization process of drug candidates. The prospects for growth in drug discovery and development processes in chemical, biotechnological, and pharmaceutical companies began a few years ago by integrating computational tools with artificial intelligence methodologies. It is anticipated that it will increase the number of drugs approved by regulatory agencies shortly.
Collapse
Affiliation(s)
- Dionisio A Olmedo
- Centro de Investigaciones Farmacognósticas de la Flora Panameña (CIFLORPAN), Facultad de Farmacia, Universidad de Panamá, Ciudad de Panamá, Apartado, 0824-00178, Panamá
- Sistema Nacional de Investigación (SNI), Secretaria Nacional de Ciencia, Tecnología e Innovación (SENACYT), Ciudad del Saber, Clayton, Panamá
| | - Armando A Durant-Archibold
- Centro de Biodiversidad y Descubrimiento de Drogas, Instituto de Investigaciones Científicas y Servicios de Alta Tecnología (INDICASAT AIP), Apartado, 0843-01103, Panamá
- Departamento de Bioquímica, Facultad de Ciencias Naturales, Exactas y Tecnología, Universidad de Panamá, Ciudad de Panamá, Panamá
| | - José Luis López-Pérez
- CESIFAR, Departamento de Farmacología, Facultad de Medicina, Universidad de Panamá, Ciudad de Panamá, Panamá
- Departamento de Ciencias Farmacéuticas, Facultad de Farmacia, Universidad de Salamanca, Avda. Campo Charro s/n, 37071 Salamanca, España
| | - José Luis Medina-Franco
- DIFACQUIM Grupo de Investigación, Departamento de Farmacia, Escuela de Química, Universidad Nacional Autónoma de México, Ciudad de México, Apartado, 04510, México
| |
Collapse
|
6
|
Aguilera-Puga MDC, Cancelarich NL, Marani MM, de la Fuente-Nunez C, Plisson F. Accelerating the Discovery and Design of Antimicrobial Peptides with Artificial Intelligence. Methods Mol Biol 2024; 2714:329-352. [PMID: 37676607 DOI: 10.1007/978-1-0716-3441-7_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Peptides modulate many processes of human physiology targeting ion channels, protein receptors, or enzymes. They represent valuable starting points for the development of new biologics against communicable and non-communicable disorders. However, turning native peptide ligands into druggable materials requires high selectivity and efficacy, predictable metabolism, and good safety profiles. Machine learning models have gradually emerged as cost-effective and time-saving solutions to predict and generate new proteins with optimal properties. In this chapter, we will discuss the evolution and applications of predictive modeling and generative modeling to discover and design safe and effective antimicrobial peptides. We will also present their current limitations and suggest future research directions, applicable to peptide drug design campaigns.
Collapse
Affiliation(s)
- Mariana D C Aguilera-Puga
- Centro de Investigación y de Estudios Avanzados del IPN (CINVESTAV-IPN), Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Irapuato, Guanajuato, Mexico
- CINVESTAV-IPN, Unidad Irapuato, Departamento de Biotecnología y Bioquímica, Irapuato, Guanajuato, Mexico
| | - Natalia L Cancelarich
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Puerto Madryn, Argentina
| | - Mariela M Marani
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Puerto Madryn, Argentina
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA.
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA.
| | - Fabien Plisson
- Centro de Investigación y de Estudios Avanzados del IPN (CINVESTAV-IPN), Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Irapuato, Guanajuato, Mexico.
- CINVESTAV-IPN, Unidad Irapuato, Departamento de Biotecnología y Bioquímica, Irapuato, Guanajuato, Mexico.
| |
Collapse
|
7
|
Yang P, Širvinskas MJ, Li B, Heller NW, Rong H, He G, Yudin AK, Chen G. Teraryl Braces in Macrocycles: Synthesis and Conformational Landscape Remodeling of Peptides. J Am Chem Soc 2023. [PMID: 37326500 DOI: 10.1021/jacs.3c03512] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
The three-dimensional structure of medium-sized cyclic peptides accounts for their biological activity and other important physiochemical properties. Despite significant advances in the past few decades, chemists' ability to fine-tune the structure, in particular, the backbone conformation, of short peptides made of canonical amino acids is still quite limited. Nature has shown that cross-linking the aromatic side chains of linear peptide precursors via enzyme catalysis can generate cyclophane-braced products with unusual structures and diverse activities. However, the biosynthetic path to these natural products is challenging to replicate in the synthetic laboratory using practical chemical modifications of peptides. Herein, we report a broadly applicable strategy to remodel the structure of homodetic peptides by cross-linking the aromatic side chains of Trp, His, and Tyr residues with various aryl linkers. The aryl linkers can be easily installed via copper-catalyzed double heteroatom-arylation reactions of peptides with aryl diiodides. These aromatic side chains and aryl linkers can be combined to form a large variety of assemblies of heteroatom-linked multi-aryl units. The assemblies can serve as tension-bearable multijoint braces to modulate the backbone conformation of peptides as an entry to previously inaccessible conformational space.
Collapse
Affiliation(s)
- Peng Yang
- State Key Laboratory and Institute of Elemento-Organic Chemistry, College of Chemistry, Nankai University, Tianjin 300071, China
| | | | - Bo Li
- State Key Laboratory and Institute of Elemento-Organic Chemistry, College of Chemistry, Nankai University, Tianjin 300071, China
| | - Nicholas W Heller
- Department of Chemistry, University of Toronto, Toronto M5S 3H4, Canada
| | - Hua Rong
- State Key Laboratory and Institute of Elemento-Organic Chemistry, College of Chemistry, Nankai University, Tianjin 300071, China
| | - Gang He
- State Key Laboratory and Institute of Elemento-Organic Chemistry, College of Chemistry, Nankai University, Tianjin 300071, China
| | - Andrei K Yudin
- Department of Chemistry, University of Toronto, Toronto M5S 3H4, Canada
| | - Gong Chen
- State Key Laboratory and Institute of Elemento-Organic Chemistry, College of Chemistry, Nankai University, Tianjin 300071, China
- Frontiers Science Center for New Organic Matter, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| |
Collapse
|
8
|
Ayala-Ruano S, Marrero-Ponce Y, Aguilera-Mendoza L, Pérez N, Agüero-Chapin G, Antunes A, Aguilar AC. Network Science and Group Fusion Similarity-Based Searching to Explore the Chemical Space of Antiparasitic Peptides. ACS OMEGA 2022; 7:46012-46036. [PMID: 36570318 PMCID: PMC9773354 DOI: 10.1021/acsomega.2c03398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 11/21/2022] [Indexed: 05/13/2023]
Abstract
Antimicrobial peptides (AMPs) have appeared as promising compounds to treat a wide range of diseases. Their clinical potentialities reside in the wide range of mechanisms they can use for both killing microbes and modulating immune responses. However, the hugeness of the AMPs' chemical space (AMPCS), represented by more than 1065 unique sequences, has represented a big challenge for the discovery of new promising therapeutic peptides and for the identification of common structural motifs. Here, we introduce network science and a similarity searching approach to discover new promising AMPs, specifically antiparasitic peptides (APPs). We exploited the network-based representation of APPs' chemical space (APPCS) to retrieve valuable information by using three network types: chemical space (CSN), half-space proximal (HSPN), and metadata (METN). Some centrality measures were applied to identify in each network the most important and nonredundant peptides. Then, these central peptides were considered as queries (Qs) in group fusion similarity-based searches against a comprehensive collection of known AMPs, stored in the graph database StarPepDB, to propose new potential APPs. The performance of the resulting multiquery similarity-based search models (mQSSMs) was evaluated in five benchmarking data sets of APP/non-APPs. The predictions performed by the best mQSSM showed a strong-to-very-strong performance since their external Matthews correlation coefficient (MCC) values ranged from 0.834 to 0.965. Outstanding MCC values (>0.85) were attained by the mQSSM with 219 Qs from both networks CSN and HSPN with 0.5 as similarity threshold in external data sets. Then, the performance of our best mQSSM was compared with the APPs prediction servers AMPDiscover and AMPFun. The proposed model showed its relevance by outperforming state-of-the-art machine learning models to predict APPs. After applying the best mQSSM and additional filters on the non-APP space from StarPepDB, 95 AMPs were repurposed as potential APP hits. Due to the high sequence diversity of these peptides, different computational approaches were applied to identify relevant motifs for searching and designing new APPs. Lastly, we identified 11 promising APP lead candidates by using our best mQSSMs together with diversity-based network analyses, and 24 web servers for activity/toxicity and drug-like properties. These results support that network-based similarity searches can be an effective and reliable strategy to identify APPs. The proposed models and pipeline are freely available through the StarPep toolbox software at http://mobiosd-hub.com/starpep.
Collapse
Affiliation(s)
- Sebastián Ayala-Ruano
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Yovani Marrero-Ponce
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Computer-Aided
Molecular “Biosilico” Discovery and Bioinformatics Research
International Network (CAMD-BIR IN), Cumbayá, Quito 170901, Ecuador
- Universidad
San Francisco de Quito (USFQ), Instituto
de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
| | - Longendri Aguilera-Mendoza
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
| | - Noel Pérez
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Guillermin Agüero-Chapin
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Agostinho Antunes
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Ana Cristina Aguilar
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
| |
Collapse
|
9
|
Medina‐Franco JL, Chávez‐Hernández AL, López‐López E, Saldívar‐González FI. Chemical Multiverse: An Expanded View of Chemical Space. Mol Inform 2022; 41:e2200116. [PMID: 35916110 PMCID: PMC9787733 DOI: 10.1002/minf.202200116] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 08/01/2022] [Indexed: 12/30/2022]
Abstract
Technological advances and practical applications of the chemical space concept in drug discovery, natural product research, and other research areas have attracted the scientific community's attention. The large- and ultra-large chemical spaces are associated with the significant increase in the number of compounds that can potentially be made and exist and the increasing number of experimental and calculated descriptors, that are emerging that encode the molecular structure and/or property aspects of the molecules. Due to the importance and continued evolution of compound libraries, herein, we discuss definitions proposed in the literature for chemical space and emphasize the convenience, discussed in the literature to use complementary descriptors to obtain a comprehensive view of the chemical space of compound data sets. In this regard, we introduce the term chemical multiverse to refer to the comprehensive analysis of compound data sets through several chemical spaces, each defined by a different set of chemical representations. The chemical multiverse is contrasted with a related idea: consensus chemical space.
Collapse
Affiliation(s)
- José L. Medina‐Franco
- DIFACQUIM research group, Department of Pharmacy, School of ChemistryNational Autonomous University of MexicoMexico City04510Mexico
| | - Ana L. Chávez‐Hernández
- DIFACQUIM research group, Department of Pharmacy, School of ChemistryNational Autonomous University of MexicoMexico City04510Mexico
| | - Edgar López‐López
- Department of PharmacologyCenter for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV)Mexico City07360Mexico
| | - Fernanda I. Saldívar‐González
- DIFACQUIM research group, Department of Pharmacy, School of ChemistryNational Autonomous University of MexicoMexico City04510Mexico
| |
Collapse
|
10
|
Zakharova E, Orsi M, Capecchi A, Reymond J. Machine Learning Guided Discovery of Non-Hemolytic Membrane Disruptive Anticancer Peptides. ChemMedChem 2022; 17:e202200291. [PMID: 35880810 PMCID: PMC9541320 DOI: 10.1002/cmdc.202200291] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/29/2022] [Indexed: 12/05/2022]
Abstract
Most antimicrobial peptides (AMPs) and anticancer peptides (ACPs) fold into membrane disruptive cationic amphiphilic α-helices, many of which are however also unpredictably hemolytic and toxic. Here we exploited the ability of recurrent neural networks (RNN) to distinguish active from inactive and non-hemolytic from hemolytic AMPs and ACPs to discover new non-hemolytic ACPs. Our discovery pipeline involved: 1) sequence generation using either a generative RNN or a genetic algorithm, 2) RNN classification for activity and hemolysis, 3) selection for sequence novelty, helicity and amphiphilicity, and 4) synthesis and testing. Experimental evaluation of thirty-three peptides resulted in eleven active ACPs, four of which were non-hemolytic, with properties resembling those of the natural ACP lasioglossin III. These experiments show the first example of direct machine learning guided discovery of non-hemolytic ACPs.
Collapse
Affiliation(s)
- Elena Zakharova
- Department of ChemistryBiochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Markus Orsi
- Department of ChemistryBiochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Alice Capecchi
- Department of ChemistryBiochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Jean‐Louis Reymond
- Department of ChemistryBiochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| |
Collapse
|
11
|
Saldívar-González FI, Medina-Franco JL. Approaches for enhancing the analysis of chemical space for drug discovery. Expert Opin Drug Discov 2022; 17:789-798. [PMID: 35640229 DOI: 10.1080/17460441.2022.2084608] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
INTRODUCTION Chemical space is a powerful, general, and practical conceptual framework in drug discovery and other areas in chemistry that addresses the diversity of molecules and it has various applications. Moreover, chemical space is a cornerstone of chemoinformatics as a scientific discipline. In response to the increase in the set of chemical compounds in databases, generators of chemical structures, and tools to calculate molecular descriptors, novel approaches to generate visual representations of chemical space in low dimensions are emerging and evolving. Such approaches include a wide range of commercial and free applications, software, and open-source methods. AREAS COVERED The current state of chemical space in drug design and discovery is reviewed. The topics discussed herein include advances for efficient navigation in chemical space, the use of this concept in assessing the diversity of different data sets, exploring structure-property/activity relationships for one or multiple endpoints, and compound library design. Recent advances in methodologies for generating visual representations of chemical space have been highlighted, thereby emphasizing open-source methods. EXPERT OPINION Quantitative and qualitative generation and analysis of chemical space require novel approaches for handling the increasing number of molecules and their information available in chemical databases (including emerging ultra-large libraries). In addition, it is of utmost importance to note that chemical space is a conceptual framework that goes beyond visual representation in low dimensions. However, the graphical representation of chemical space has several practical applications in drug discovery and beyond.
Collapse
Affiliation(s)
- Fernanda I Saldívar-González
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico
| | - José L Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico
| |
Collapse
|
12
|
de Oliveira ECL, da Costa KS, Taube PS, Lima AH, Junior CDSDS. Biological Membrane-Penetrating Peptides: Computational Prediction and Applications. Front Cell Infect Microbiol 2022; 12:838259. [PMID: 35402305 PMCID: PMC8992797 DOI: 10.3389/fcimb.2022.838259] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 02/21/2022] [Indexed: 12/14/2022] Open
Abstract
Peptides comprise a versatile class of biomolecules that present a unique chemical space with diverse physicochemical and structural properties. Some classes of peptides are able to naturally cross the biological membranes, such as cell membrane and blood-brain barrier (BBB). Cell-penetrating peptides (CPPs) and blood-brain barrier-penetrating peptides (B3PPs) have been explored by the biotechnological and pharmaceutical industries to develop new therapeutic molecules and carrier systems. The computational prediction of peptides’ penetration into biological membranes has been emerged as an interesting strategy due to their high throughput and low-cost screening of large chemical libraries. Structure- and sequence-based information of peptides, as well as atomistic biophysical models, have been explored in computer-assisted discovery strategies to classify and identify new structures with pharmacokinetic properties related to the translocation through biomembranes. Computational strategies to predict the permeability into biomembranes include cheminformatic filters, molecular dynamics simulations, artificial intelligence algorithms, and statistical models, and the choice of the most adequate method depends on the purposes of the computational investigation. Here, we exhibit and discuss some principles and applications of these computational methods widely used to predict the permeability of peptides into biomembranes, exhibiting some of their pharmaceutical and biotechnological applications.
Collapse
Affiliation(s)
- Ewerton Cristhian Lima de Oliveira
- Institute of Technology, Federal University of Pará, Belém, Brazil
- *Correspondence: Kauê Santana da Costa, ; Ewerton Cristhian Lima de Oliveira,
| | - Kauê Santana da Costa
- Laboratory of Computational Simulation, Institute of Biodiversity, Federal University of Western Pará, Santarém, Brazil
- *Correspondence: Kauê Santana da Costa, ; Ewerton Cristhian Lima de Oliveira,
| | - Paulo Sérgio Taube
- Laboratory of Computational Simulation, Institute of Biodiversity, Federal University of Western Pará, Santarém, Brazil
| | - Anderson H. Lima
- Laboratório de Planejamento e Desenvolvimento de Fármacos, Instituto de Ciências Exatas e Naturais, Universidade Federal do Pará, Belém, Brazil
| | | |
Collapse
|
13
|
Capecchi A, Cai X, Personne H, Köhler T, van Delden C, Reymond JL. Machine learning designs non-hemolytic antimicrobial peptides. Chem Sci 2021; 12:9221-9232. [PMID: 34349895 PMCID: PMC8285431 DOI: 10.1039/d1sc01713f] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 06/05/2021] [Indexed: 12/28/2022] Open
Abstract
Machine learning (ML) consists of the recognition of patterns from training data and offers the opportunity to exploit large structure-activity databases for drug design. In the area of peptide drugs, ML is mostly being tested to design antimicrobial peptides (AMPs), a class of biomolecules potentially useful to fight multidrug-resistant bacteria. ML models have successfully identified membrane disruptive amphiphilic AMPs, however mostly without addressing the associated toxicity to human red blood cells. Here we trained recurrent neural networks (RNN) with data from DBAASP (Database of Antimicrobial Activity and Structure of Peptides) to design short non-hemolytic AMPs. Synthesis and testing of 28 generated peptides, each at least 5 mutations away from training data, allowed us to identify eight new non-hemolytic AMPs against Pseudomonas aeruginosa, Acinetobacter baumannii, and methicillin-resistant Staphylococcus aureus (MRSA). These results show that machine learning (ML) can be used to design new non-hemolytic AMPs.
Collapse
Affiliation(s)
- Alice Capecchi
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Xingguang Cai
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Hippolyte Personne
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Thilo Köhler
- Department of Microbiology and Molecular Medicine, University of Geneva Switzerland
- Service of Infectious Diseases, University Hospital of Geneva Geneva Switzerland
| | - Christian van Delden
- Department of Microbiology and Molecular Medicine, University of Geneva Switzerland
- Service of Infectious Diseases, University Hospital of Geneva Geneva Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern Freiestrasse 3 3012 Bern Switzerland
| |
Collapse
|
14
|
Medina-Franco JL, Sánchez-Cruz N, López-López E, Díaz-Eufracio BI. Progress on open chemoinformatic tools for expanding and exploring the chemical space. J Comput Aided Mol Des 2021; 36:341-354. [PMID: 34143323 PMCID: PMC8211976 DOI: 10.1007/s10822-021-00399-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 06/14/2021] [Indexed: 01/10/2023]
Abstract
The concept of chemical space is a cornerstone in chemoinformatics, and it has broad conceptual and practical applicability in many areas of chemistry, including drug design and discovery. One of the most considerable impacts is in the study of structure-property relationships where the property can be a biological activity or any other characteristic of interest to a particular chemistry discipline. The chemical space is highly dependent on the molecular representation that is also a cornerstone concept in computational chemistry. Herein, we discuss the recent progress on chemoinformatic tools developed to expand and characterize the chemical space of compound data sets using different types of molecular representations, generate visual representations of such spaces, and explore structure-property relationships in the context of chemical spaces. We emphasize the development of methods and freely available tools focusing on drug discovery applications. We also comment on the general advantages and shortcomings of using freely available and easy-to-use tools and discuss the value of using such open resources for research, education, and scientific dissemination.
Collapse
Affiliation(s)
- José L Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico.
| | - Norberto Sánchez-Cruz
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| | - Edgar López-López
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico.,Departamento de Química y Programa de Posgrado en Farmacología, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, Apartado 14-740, 07000, Mexico City, Mexico
| | - Bárbara I Díaz-Eufracio
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| |
Collapse
|
15
|
Van Oort CM, Ferrell JB, Remington JM, Wshah S, Li J. AMPGAN v2: Machine Learning-Guided Design of Antimicrobial Peptides. J Chem Inf Model 2021; 61:2198-2207. [PMID: 33787250 DOI: 10.1021/acs.jcim.0c01441] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Antibiotic resistance is a critical public health problem. Each year ∼2.8 million resistant infections lead to more than 35 000 deaths in the U.S. alone. Antimicrobial peptides (AMPs) show promise in treating resistant infections. However, applications of known AMPs have encountered issues in development, production, and shelf-life. To drive the development of AMP-based treatments, it is necessary to create design approaches with higher precision and selectivity toward resistant targets. Previously, we developed AMPGAN and obtained proof-of-concept evidence for the generative approach to design AMPs with experimental validation. Building on the success of AMPGAN, we present AMPGAN v2, a bidirectional conditional generative adversarial network (BiCGAN)-based approach for rational AMP design. AMPGAN v2 uses generator-discriminator dynamics to learn data-driven priors and controls generation using conditioning variables. The bidirectional component, implemented using a learned encoder to map data samples into the latent space of the generator, aids iterative manipulation of candidate peptides. These elements allow AMPGAN v2 to generate candidates that are novel, diverse, and tailored for specific applications, making it an efficient AMP design tool.
Collapse
Affiliation(s)
- Colin M Van Oort
- Department of Computer Science, University of Vermont, Burlington, Vermont 05405, United States
| | - Jonathon B Ferrell
- Department of Chemistry, University of Vermont, Burlington, Vermont 05405, United States
| | - Jacob M Remington
- Department of Chemistry, University of Vermont, Burlington, Vermont 05405, United States
| | - Safwan Wshah
- Department of Computer Science, University of Vermont, Burlington, Vermont 05405, United States
| | - Jianing Li
- Department of Chemistry, University of Vermont, Burlington, Vermont 05405, United States
| |
Collapse
|