1
|
Pérez KL, Jung V, Chen L, Huddleston K, Miranda-Quintana RA. Efficient clustering of large molecular libraries. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.10.607459. [PMID: 39149242 PMCID: PMC11326248 DOI: 10.1101/2024.08.10.607459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
The widespread use of Machine Learning (ML) techniques in chemical applications has come with the pressing need to analyze extremely large molecular libraries. In particular, clustering remains one of the most common tools to dissect the chemical space. Unfortunately, most current approaches present unfavorable time and memory scaling, which makes them unsuitable to handle million- and billion-sized sets. Here, we propose to bypass these problems with a time- and memory-efficient clustering algorithm, BitBIRCH. This method uses a tree structure similar to the one found in the Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH) algorithm to ensure O N time scaling. BitBIRCH leverages the instant similarity (iSIM) formalism to process binary fingerprints, allowing the use of Tanimoto similarity, and reducing memory requirements. Our tests show that BitBIRCH is already > 1,000 times faster than standard implementations of the Taylor-Butina clustering for libraries with 1,500,000 molecules. BitBIRCH increases efficiency without compromising the quality of the resulting clusters. We explore strategies to handle large sets, which we applied in the clustering of one billion molecules under 5 hours using a parallel/iterative BitBIRCH approximation.
Collapse
Affiliation(s)
| | | | - Lexin Chen
- Department of Chemistry & Quantum Theory Project, University of Florida, Gainesville, Florida 32611
| | - Kate Huddleston
- Department of Chemistry & Quantum Theory Project, University of Florida, Gainesville, Florida 32611
| | | |
Collapse
|
2
|
Chen L, Mondal A, Perez A, Miranda-Quintana RA. Protein Retrieval via Integrative Molecular Ensembles (PRIME) through Extended Similarity Indices. J Chem Theory Comput 2024; 20:6303-6315. [PMID: 38978294 DOI: 10.1021/acs.jctc.4c00362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Molecular dynamics (MD) simulations are ideally suited to describe conformational ensembles of biomolecules such as proteins and nucleic acids. Microsecond-long simulations are now routine, facilitated by the emergence of graphical processing units. Clustering, which groups objects based on structural similarity, is typically used to process ensembles, leading to different states, their populations, and the identification of representative structures. A popular pipeline combines hierarchical clustering for clustering and selecting the cluster centroid as representative of the cluster. Here, we propose to improve on this approach, by developing a module-Protein Retrieval via Integrative Molecular Ensembles (PRIME), that consists of tools to improve the prediction of the representative in the most populated cluster using extended continuous similarity. PRIME is integrated with our Molecular Dynamics Analysis with N-ary Clustering Ensembles (MDANCE) package and can be used as a postprocessing tool for arbitrary clustering algorithms, compatible with several MD suites. PRIME predictions produced structures that when aligned to the experimental structure were better superposed (lower RMSD). A further benefit of PRIME is its linear scaling─rather than the traditional O(N2) traditionally associated with comparisons of elements in a set.
Collapse
Affiliation(s)
- Lexin Chen
- Department of Chemistry, University of Florida, Gainesville, Florida 32611, United States
- Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| | - Arup Mondal
- Department of Chemistry, University of Florida, Gainesville, Florida 32611, United States
- Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| | - Alberto Perez
- Department of Chemistry, University of Florida, Gainesville, Florida 32611, United States
- Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| | - Ramón Alain Miranda-Quintana
- Department of Chemistry, University of Florida, Gainesville, Florida 32611, United States
- Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| |
Collapse
|
3
|
Prado-Romero DL, Saldívar-González FI, López-Mata I, Laurel-García PA, Durán-Vargas A, García-Hernández E, Sánchez-Cruz N, Medina-Franco JL. De Novo Design of Inhibitors of DNA Methyltransferase 1: A Critical Comparison of Ligand- and Structure-Based Approaches. Biomolecules 2024; 14:775. [PMID: 39062489 PMCID: PMC11274800 DOI: 10.3390/biom14070775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 06/14/2024] [Accepted: 06/26/2024] [Indexed: 07/28/2024] Open
Abstract
Designing and developing inhibitors against the epigenetic target DNA methyltransferase (DNMT) is an attractive strategy in epigenetic drug discovery. DNMT1 is one of the epigenetic enzymes with significant clinical relevance. Structure-based de novo design is a drug discovery strategy that was used in combination with similarity searching to identify a novel DNMT inhibitor with a novel chemical scaffold and warrants further exploration. This study aimed to continue exploring the potential of de novo design to build epigenetic-focused libraries targeted toward DNMT1. Herein, we report the results of an in-depth and critical comparison of ligand- and structure-based de novo design of screening libraries focused on DNMT1. The newly designed chemical libraries focused on DNMT1 are freely available on GitHub.
Collapse
Affiliation(s)
- Diana L. Prado-Romero
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico; (D.L.P.-R.); (F.I.S.-G.); (P.A.L.-G.)
| | - Fernanda I. Saldívar-González
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico; (D.L.P.-R.); (F.I.S.-G.); (P.A.L.-G.)
| | - Iván López-Mata
- División Académica de Ciencias Básicas, Universidad Juárez Autónoma de Tabasco, Carretera Cunduacán-Jalpa de Méndez, Km 1, Cunduacán 86690, Tabasco, Mexico;
- Instituto de Química, Unidad Mérida, Universidad Nacional Autónoma de México, Carretera Mérida-Tetiz Km. 4.5, Ucú 97357, Yucatán, Mexico;
| | - Pedro A. Laurel-García
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico; (D.L.P.-R.); (F.I.S.-G.); (P.A.L.-G.)
| | - Adrián Durán-Vargas
- Instituto de Química, Universidad Nacional Autónoma de México, Ciudad Universitaria, Mexico City 04510, Mexico; (A.D.-V.); (E.G.-H.)
| | - Enrique García-Hernández
- Instituto de Química, Universidad Nacional Autónoma de México, Ciudad Universitaria, Mexico City 04510, Mexico; (A.D.-V.); (E.G.-H.)
| | - Norberto Sánchez-Cruz
- Instituto de Química, Unidad Mérida, Universidad Nacional Autónoma de México, Carretera Mérida-Tetiz Km. 4.5, Ucú 97357, Yucatán, Mexico;
- Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Unidad Mérida, Universidad Nacional Autónoma de México, Sierra Papacál 97302, Yucatán, Mexico
| | - José L. Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico; (D.L.P.-R.); (F.I.S.-G.); (P.A.L.-G.)
| |
Collapse
|
4
|
López-Pérez K, Kim TD, Miranda-Quintana RA. iSIM: instant similarity. DIGITAL DISCOVERY 2024; 3:1160-1171. [PMID: 38873032 PMCID: PMC11167700 DOI: 10.1039/d4dd00041b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Accepted: 05/06/2024] [Indexed: 06/15/2024]
Abstract
The quantification of molecular similarity has been present since the beginning of cheminformatics. Although several similarity indices and molecular representations have been reported, all of them ultimately reduce to the calculation of molecular similarities of only two objects at a time. Hence, to obtain the average similarity of a set of molecules, all the pairwise comparisons need to be computed, which demands a quadratic scaling in the number of computational resources. Here we propose an exact alternative to this problem: iSIM (instant similarity). iSIM performs comparisons of multiple molecules at the same time and yields the same value as the average pairwise comparisons of molecules represented by binary fingerprints and real-value descriptors. In this work, we introduce the mathematical framework and several applications of iSIM in chemical sampling, visualization, diversity selection, and clustering.
Collapse
Affiliation(s)
- Kenneth López-Pérez
- Department of Chemistry and Quantum Theory Project, University of Florida Gainesville Florida 32611 USA
| | - Taewon D Kim
- Department of Chemistry and Quantum Theory Project, University of Florida Gainesville Florida 32611 USA
| | | |
Collapse
|
5
|
Ellin NR, Guo Y, Miranda-Quintana RA, Prentice BM. Extended similarity methods for efficient data mining in imaging mass spectrometry. DIGITAL DISCOVERY 2024; 3:805-817. [PMID: 38638647 PMCID: PMC11022984 DOI: 10.1039/d3dd00165b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 03/19/2024] [Indexed: 04/20/2024]
Abstract
Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled x, y position, resulting in thousands of individual mass spectra, each comprising a pixel in the resulting ion images. However, efficient analysis of imaging mass spectrometry datasets can be challenging due to the hyperspectral characteristics of the data. Each spectrum contains several thousand unique compounds at discrete m/z values that result in unique ion images, which demands robust and efficient algorithms for searching, statistical analysis, and visualization. Some traditional post-processing techniques are fundamentally ill-equipped to dissect these types of data. For example, while principal component analysis (PCA) has long served as a useful tool for mining imaging mass spectrometry datasets to identify correlated analytes and biological regions of interest, the interpretation of the PCA scores and loadings can be non-trivial. The loadings often contain negative peaks in the PCA-derived pseudo-spectra, which are difficult to ascribe to underlying tissue biology. Herein, we have utilized extended similarity indices to streamline the interpretation of imaging mass spectrometry data. This novel workflow uses PCA as a pixel-selection method to parse out the most and least correlated pixels, which are then compared using the extended similarity indices. The extended similarity indices complement PCA by removing all non-physical artifacts and streamlining the interpretation of large volumes of imaging mass spectrometry spectra simultaneously. The linear complexity, O(N), of these indices suggests that large imaging mass spectrometry datasets can be analyzed in a 1 : 1 scale of time and space with respect to the size of the input data. The extended similarity indices algorithmic workflow is exemplified here by identifying discrete biological regions of mouse brain tissue.
Collapse
Affiliation(s)
- Nicholas R Ellin
- Department of Chemistry, University of Florida Gainesville FL 32611-7200 USA
| | - Yingchan Guo
- Department of Chemistry, University of Florida Gainesville FL 32611-7200 USA
| | - Ramón Alain Miranda-Quintana
- Department of Chemistry, University of Florida Gainesville FL 32611-7200 USA
- Quantum Theory Project, University of Florida Gainesville FL 32611-7200 USA
| | - Boone M Prentice
- Department of Chemistry, University of Florida Gainesville FL 32611-7200 USA
| |
Collapse
|
6
|
Delgado-Maldonado T, Gonzalez-Morales LD, Juarez-Saldivar A, Lara-Ramírez EE, Rojas-Verde G, Moreno-Rodriguez A, Bandyopadhyay D, Rivera G. Structure-based Virtual Screening from Natural Products as Inhibitors of SARS-CoV-2 Spike Protein and ACE2 Receptor Binding and their Biological Evaluation In vitro. Med Chem 2024; 20:546-553. [PMID: 38204279 DOI: 10.2174/0115734064279323231206091314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Revised: 10/11/2023] [Accepted: 10/23/2023] [Indexed: 01/12/2024]
Abstract
BACKGROUND In the last years, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) caused more than 760 million infections and 6.9 million deaths. Currently, remains a public health problem with limited pharmacological treatments. Among the virus drug targets, the SARS-CoV-2 spike protein attracts the development of new anti-SARS-CoV-2 agents. OBJECTIVE The aim of this work was to identify new compounds derived from natural products (BIOFACQUIM and Selleckchem databases) as potential inhibitors of the spike receptor binding domain (RBD)-ACE2 binding complex. METHODS Molecular docking, molecular dynamics simulations, and ADME-Tox analysis were performed to screen and select the potential inhibitors. ELISA-based enzyme assay was done to confirm our predictive model. RESULTS Twenty compounds were identified as potential binders of RBD of the spike protein. In vitro assay showed compound B-8 caused 48% inhibition at 50 μM, and their binding pattern exhibited interactions via hydrogen bonds with the key amino acid residues present on the RBD. CONCLUSION Compound B-8 can be used as a scaffold to develop new and more efficient antiviral drugs.
Collapse
Affiliation(s)
- Timoteo Delgado-Maldonado
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, 88710 Reynosa, México
| | - Luis Donaldo Gonzalez-Morales
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, 88710 Reynosa, México
| | - Alfredo Juarez-Saldivar
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, 88710 Reynosa, México
| | - Edgar E Lara-Ramírez
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, 88710 Reynosa, México
| | - Guadalupe Rojas-Verde
- Instituto de Biotecnología, Facultad de Ciencias Biológicas, Universidad Autónoma de Nuevo León, Nuevo León CP. 66451, México
| | - Adriana Moreno-Rodriguez
- Laboratorio de Estudios Epidemiológicos, Clínicos, Diseños Experimentales e Investigación, Facultad de Ciencias Químicas, Universidad Autónoma "Benito Juárez" de Oaxaca, Avenida Universidad S/N, Ex Hacienda Cinco Señores, Oaxaca 68120, México
| | - Debasish Bandyopadhyay
- School of Integrative Biological and Chemical Sciences (SIBCS) and School of Earth, Environmental, and Marine Sciences (SEEMS), University of Texas Rio Grande Valley, Edinburg, Texas 78539, United States of America
| | - Gildardo Rivera
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, 88710 Reynosa, México
| |
Collapse
|
7
|
Maujean T, Kannaboina P, Green AI, Burslem GM. Lead-oriented synthesis of epigenetic relevant scaffolds. Chem Commun (Camb) 2023; 59:14555-14558. [PMID: 37991354 PMCID: PMC10842704 DOI: 10.1039/d3cc04317g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
A simple and rational method to rank lead-likeness of molecules using continuous evaluation functions was hereby developed. This strategy proved to be competitive against known methods and finally helped in driving synthetic efforts towards candidates of interest for epigenetic applications against HDAC6, BRD4 and EZH2.
Collapse
Affiliation(s)
- Timothé Maujean
- Department of Biochemistry and Biophysics, Department of Cancer Biology and Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, PA 19104, USA.
| | - Prakash Kannaboina
- Department of Biochemistry and Biophysics, Department of Cancer Biology and Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, PA 19104, USA.
| | - Adam I Green
- Department of Biochemistry and Biophysics, Department of Cancer Biology and Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, PA 19104, USA.
| | - George M Burslem
- Department of Biochemistry and Biophysics, Department of Cancer Biology and Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, PA 19104, USA.
| |
Collapse
|
8
|
López-Pérez K, López-López E, Medina-Franco JL, Miranda-Quintana RA. Sampling and Mapping Chemical Space with Extended Similarity Indices. Molecules 2023; 28:6333. [PMID: 37687162 PMCID: PMC10489020 DOI: 10.3390/molecules28176333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 08/24/2023] [Accepted: 08/26/2023] [Indexed: 09/10/2023] Open
Abstract
Visualization of the chemical space is useful in many aspects of chemistry, including compound library design, diversity analysis, and exploring structure-property relationships, to name a few. Examples of notable research areas where the visualization of chemical space has strong applications are drug discovery and natural product research. However, the sheer volume of even comparatively small sub-sections of chemical space implies that we need to use approximations at the time of navigating through chemical space. ChemMaps is a visualization methodology that approximates the distribution of compounds in large datasets based on the selection of satellite compounds that yield a similar mapping of the whole dataset when principal component analysis on a similarity matrix is performed. Here, we show how the recently proposed extended similarity indices can help find regions that are relevant to sample satellites and reduce the amount of high-dimensional data needed to describe a library's chemical space.
Collapse
Affiliation(s)
- Kenneth López-Pérez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL 32611, USA;
| | - Edgar López-López
- DIFACQUIM Research Group, Department of Pharmacy, National Autonomous University of Mexico, Mexico City 04510, Mexico;
- Department of Chemistry and Graduate Program in Pharmacology, Center for Research and Advanced Studies of the National Polytechnic Institute, Mexico City 07000, Mexico
| | - José L. Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, National Autonomous University of Mexico, Mexico City 04510, Mexico;
| | | |
Collapse
|
9
|
Cedillo-González R, Medina-Franco JL. Diversity and Chemical Space Characterization of Inhibitors of the Epigenetic Target G9a: A Chemoinformatics Approach. ACS OMEGA 2023; 8:30694-30704. [PMID: 37636945 PMCID: PMC10448660 DOI: 10.1021/acsomega.3c04566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 08/02/2023] [Indexed: 08/29/2023]
Abstract
G9a is a histone-lysine methyltransferase that performs the mono- and dimethylation of lysine 9 at histone 3 of the nucleosome. It belongs to the SET PKMT family, and its methylations are related to promoter repression and activation. G9a is a promising epigenetic target. Despite the fact that there are several G9a inhibitors under development, there are no compounds in clinical use due to adverse in vivo ADMET (absorption, distribution, metabolism, excretion, and toxicity) issues. The goal of this study is to discuss the exploration, characterization, and analysis of the chemical space of 409 G9a inhibitors reported in a large public database. Exploring the chemical space of the inhibitors led to the quantification of their structural diversity based on molecular scaffolds and structural fingerprints of different designs. As part of the analysis, the G9a inhibitors were compared with commercial libraries focused on epigenetic targets. The findings of this work will help in the development of, in a follow-up study, predictive models to identify G9a inhibitors. This study also points out the relevance of screening commercial libraries to expand the epigenetic relevant chemical space, in particular, G9a inhibitors.
Collapse
Affiliation(s)
- Raziel Cedillo-González
- DIFACQUIM Research Group Department
of Pharmacy School of Chemistry, Universidad
Nacional Autónoma de México, Mexico City 04510, Mexico
| | - José L. Medina-Franco
- DIFACQUIM Research Group Department
of Pharmacy School of Chemistry, Universidad
Nacional Autónoma de México, Mexico City 04510, Mexico
| |
Collapse
|
10
|
Ellin NR, Miranda-Quintana RA, Prentice BM. Extended Similarity Methods for Efficient Data Mining in Imaging Mass Spectrometry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.27.550838. [PMID: 37546817 PMCID: PMC10402165 DOI: 10.1101/2023.07.27.550838] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
Imaging mass spectrometry is a label-free imaging modality that allows for the spatial mapping of many compounds directly in tissues. In an imaging mass spectrometry experiment, a raster of the tissue surface produces a mass spectrum at each sampled x , y position, resulting in thousands of individual mass spectra, each comprising a pixel in the resulting ion images. However, efficient analysis of imaging mass spectrometry datasets can be challenging due to the hyperspectral characteristics of the data. Each spectrum contains several thousand unique compounds at discrete m/z values that result in unique ion images, which demands robust and efficient algorithms for searching, statistical analysis, and visualization. Some traditional post-processing techniques are fundamentally ill-equipped to dissect these types of data. For example, while principal component analysis (PCA) has long served as a useful tool for mining imaging mass spectrometry datasets to identify correlated analytes and biological regions of interest, the interpretation of the PCA scores and loadings can be non-trivial. The loadings often containing negative peaks in the PCA-derived pseudo-spectra, which are difficult to ascribe to underlying tissue biology. Herein, we have utilized extended similarity indices to streamline the interpretation of imaging mass spectrometry data. This novel workflow uses PCA as a pixel-selection method to parse out the most and least correlated pixels, which are then compared using the extended similarity indices. The extended similarity indices complement PCA by removing all non-physical artifacts and streamlining the interpretation of large volumes of IMS spectra simultaneously. The linear complexity, O ( N ) , of these indices suggests that large imaging mass spectrometry datasets can be analyzed in a 1:1 scale of time and space with respect to the size of the input data. The extended similarity indices algorithmic workflow is exemplified here by identifying discrete biological regions of mouse brain tissue.
Collapse
Affiliation(s)
- Nicholas R Ellin
- Department of Chemistry, University of Florida, Gainesville, FL, 32611-7200; USA
| | - Ramón Alain Miranda-Quintana
- Department of Chemistry, University of Florida, Gainesville, FL, 32611-7200; USA
- Quantum Theory Project, University of Florida, Gainesville, FL, 32611-7200; USA
| | - Boone M Prentice
- Department of Chemistry, University of Florida, Gainesville, FL, 32611-7200; USA
| |
Collapse
|
11
|
González-Morales LD, Moreno-Rodríguez A, Vázquez-Jiménez LK, Delgado-Maldonado T, Juárez-Saldivar A, Ortiz-Pérez E, Paz-Gonzalez AD, Lara-Ramírez EE, Yépez-Mulia L, Meza P, Rivera G. Triose Phosphate Isomerase Structure-Based Virtual Screening and In Vitro Biological Activity of Natural Products as Leishmania mexicana Inhibitors. Pharmaceutics 2023; 15:2046. [PMID: 37631260 PMCID: PMC10458937 DOI: 10.3390/pharmaceutics15082046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 07/25/2023] [Accepted: 07/27/2023] [Indexed: 08/27/2023] Open
Abstract
Cutaneous leishmaniasis (CL) is a public health problem affecting more than 98 countries worldwide. No vaccine is available to prevent the disease, and available medical treatments cause serious side effects. Additionally, treatment failure and parasite resistance have made the development of new drugs against CL necessary. In this work, a virtual screening of natural products from the BIOFACQUIM and Selleckchem databases was performed using the method of molecular docking at the triosephosphate isomerase (TIM) enzyme interface of Leishmania mexicana (L. mexicana). Finally, the in vitro leishmanicidal activity of selected compounds against two strains of L. mexicana, their cytotoxicity, and selectivity index were determined. The top ten compounds were obtained based on the docking results. Four were selected for further in silico analysis. The ADME-Tox analysis of the selected compounds predicted favorable physicochemical and toxicological properties. Among these four compounds, S-8 (IC50 = 55 µM) demonstrated a two-fold higher activity against the promastigote of both L. mexicana strains than the reference drug glucantime (IC50 = 133 µM). This finding encourages the screening of natural products as new anti-leishmania agents.
Collapse
Affiliation(s)
- Luis D. González-Morales
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| | - Adriana Moreno-Rodríguez
- Laboratorio de Estudios Epidemiológicos, Clínicos, Diseños Experimentales e Investigación, Facultad de Ciencias Químicas, Universidad Autónoma “Benito Juárez” de Oaxaca, Avenida Universidad S/N, Ex Hacienda Cinco Señores, Oaxaca 68120, Mexico;
| | - Lenci K. Vázquez-Jiménez
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| | - Timoteo Delgado-Maldonado
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| | - Alfredo Juárez-Saldivar
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| | - Eyra Ortiz-Pérez
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| | - Alma D. Paz-Gonzalez
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| | - Edgar E. Lara-Ramírez
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| | - Lilian Yépez-Mulia
- Unidad de Investigación Médica en Enfermedades Infecciosas y Parasitarias-Pediatría, Instituto Mexicano del Seguro Social, Mexico City 06720, Mexico
| | - Patricia Meza
- Unidad de Investigación Médica en Enfermedades Infecciosas y Parasitarias-Pediatría, Instituto Mexicano del Seguro Social, Mexico City 06720, Mexico
| | - Gildardo Rivera
- Laboratorio de Biotecnología Farmacéutica, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico; (L.D.G.-M.); (A.J.-S.); (E.O.-P.); (E.E.L.-R.)
| |
Collapse
|
12
|
Medina‐Franco JL, Chávez‐Hernández AL, López‐López E, Saldívar‐González FI. Chemical Multiverse: An Expanded View of Chemical Space. Mol Inform 2022; 41:e2200116. [PMID: 35916110 PMCID: PMC9787733 DOI: 10.1002/minf.202200116] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 08/01/2022] [Indexed: 12/30/2022]
Abstract
Technological advances and practical applications of the chemical space concept in drug discovery, natural product research, and other research areas have attracted the scientific community's attention. The large- and ultra-large chemical spaces are associated with the significant increase in the number of compounds that can potentially be made and exist and the increasing number of experimental and calculated descriptors, that are emerging that encode the molecular structure and/or property aspects of the molecules. Due to the importance and continued evolution of compound libraries, herein, we discuss definitions proposed in the literature for chemical space and emphasize the convenience, discussed in the literature to use complementary descriptors to obtain a comprehensive view of the chemical space of compound data sets. In this regard, we introduce the term chemical multiverse to refer to the comprehensive analysis of compound data sets through several chemical spaces, each defined by a different set of chemical representations. The chemical multiverse is contrasted with a related idea: consensus chemical space.
Collapse
Affiliation(s)
- José L. Medina‐Franco
- DIFACQUIM research group, Department of Pharmacy, School of ChemistryNational Autonomous University of MexicoMexico City04510Mexico
| | - Ana L. Chávez‐Hernández
- DIFACQUIM research group, Department of Pharmacy, School of ChemistryNational Autonomous University of MexicoMexico City04510Mexico
| | - Edgar López‐López
- Department of PharmacologyCenter for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV)Mexico City07360Mexico
| | - Fernanda I. Saldívar‐González
- DIFACQUIM research group, Department of Pharmacy, School of ChemistryNational Autonomous University of MexicoMexico City04510Mexico
| |
Collapse
|
13
|
Rácz A, Mihalovits LM, Bajusz D, Héberger K, Miranda-Quintana RA. Molecular Dynamics Simulations and Diversity Selection by Extended Continuous Similarity Indices. J Chem Inf Model 2022; 62:3415-3425. [PMID: 35834424 PMCID: PMC9326969 DOI: 10.1021/acs.jcim.2c00433] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
Molecular dynamics (MD) is a core methodology of molecular
modeling
and computational design for the study of the dynamics and temporal
evolution of molecular systems. MD simulations have particularly benefited
from the rapid increase of computational power that has characterized
the past decades of computational chemical research, being the first
method to be successfully migrated to the GPU infrastructure. While
new-generation MD software is capable of delivering simulations on
an ever-increasing scale, relatively less effort is invested in developing
postprocessing methods that can keep up with the quickly expanding
volumes of data that are being generated. Here, we introduce a new
idea for sampling frames from large MD trajectories, based on the
recently introduced framework of extended similarity indices. Our
approach presents a new, linearly scaling alternative to the traditional
approach of applying a clustering algorithm that usually scales as
a quadratic function of the number of frames. When showcasing its
usage on case studies with different system sizes and simulation lengths,
we have registered speedups of up to 2 orders of magnitude, as compared
to traditional clustering algorithms. The conformational diversity
of the selected frames is also noticeably higher, which is a further
advantage for certain applications, such as the selection of structural
ensembles for ligand docking. The method is available open-source
at https://github.com/ramirandaq/MultipleComparisons.
Collapse
Affiliation(s)
- Anita Rácz
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
| | - Levente M Mihalovits
- Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
| | - Dávid Bajusz
- Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
| | - Károly Héberger
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117 Budapest, Hungary
| | - Ramón Alain Miranda-Quintana
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| |
Collapse
|
14
|
Saldívar-González FI, Medina-Franco JL. Approaches for enhancing the analysis of chemical space for drug discovery. Expert Opin Drug Discov 2022; 17:789-798. [PMID: 35640229 DOI: 10.1080/17460441.2022.2084608] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
INTRODUCTION Chemical space is a powerful, general, and practical conceptual framework in drug discovery and other areas in chemistry that addresses the diversity of molecules and it has various applications. Moreover, chemical space is a cornerstone of chemoinformatics as a scientific discipline. In response to the increase in the set of chemical compounds in databases, generators of chemical structures, and tools to calculate molecular descriptors, novel approaches to generate visual representations of chemical space in low dimensions are emerging and evolving. Such approaches include a wide range of commercial and free applications, software, and open-source methods. AREAS COVERED The current state of chemical space in drug design and discovery is reviewed. The topics discussed herein include advances for efficient navigation in chemical space, the use of this concept in assessing the diversity of different data sets, exploring structure-property/activity relationships for one or multiple endpoints, and compound library design. Recent advances in methodologies for generating visual representations of chemical space have been highlighted, thereby emphasizing open-source methods. EXPERT OPINION Quantitative and qualitative generation and analysis of chemical space require novel approaches for handling the increasing number of molecules and their information available in chemical databases (including emerging ultra-large libraries). In addition, it is of utmost importance to note that chemical space is a conceptual framework that goes beyond visual representation in low dimensions. However, the graphical representation of chemical space has several practical applications in drug discovery and beyond.
Collapse
Affiliation(s)
- Fernanda I Saldívar-González
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico
| | - José L Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico
| |
Collapse
|
15
|
7-Aminoalkoxy-Quinazolines from Epigenetic Focused Libraries Are Potent and Selective Inhibitors of DNA Methyltransferase 1. Molecules 2022; 27:molecules27092892. [PMID: 35566242 PMCID: PMC9102847 DOI: 10.3390/molecules27092892] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Revised: 04/24/2022] [Accepted: 04/30/2022] [Indexed: 11/17/2022] Open
Abstract
Inhibitors of epigenetic writers such as DNA methyltransferases (DNMTs) are attractive compounds for epigenetic drug and probe discovery. To advance epigenetic probes and drug discovery, chemical companies are developing focused libraries for epigenetic targets. Based on a knowledge-based approach, herein we report the identification of two quinazoline-based derivatives identified in focused libraries with sub-micromolar inhibition of DNMT1 (30 and 81 nM), more potent than S-adenosylhomocysteine. Also, both compounds had a low micromolar affinity of DNMT3A and did not inhibit DNMT3B. The enzymatic inhibitory activity of DNMT1 and DNMT3A was rationalized with molecular modeling. The quinazolines reported in this work are known to have low cell toxicity and be potent inhibitors of the epigenetic target G9a. Therefore, the quinazoline-based compounds presented are attractive not only as novel potent inhibitors of DNMTs but also as dual and selective epigenetic agents targeting two families of epigenetic writers.
Collapse
|