1
|
Aziz A, Carrasco J. Towards Predictive Synthesis of Inorganic Materials Using Network Science. Front Chem 2022; 9:798838. [PMID: 34993176 PMCID: PMC8724131 DOI: 10.3389/fchem.2021.798838] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Accepted: 12/03/2021] [Indexed: 11/13/2022] Open
Abstract
Accelerating materials discovery is the cornerstone of modern technological competitiveness. Yet, the inorganic synthesis of new compounds is often an important bottleneck in this quest. Well-established quantum chemistry and experimental synthesis methods combined with consolidated network science approaches might provide revolutionary knowledge to tackle this challenge. Recent pioneering studies in this direction have shown that the topological analysis of material networks hold great potential to effectively explore the synthesizability of inorganic compounds. In this Perspective we discuss the most exciting work in this area, in particular emerging new physicochemical insights and general concepts on how network science can significantly help reduce the timescales required to discover new materials and find synthetic routes for their fabrication. We also provide a perspective on outstanding problems, challenges and open questions.
Collapse
Affiliation(s)
- Alex Aziz
- Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Vitoria-Gasteiz, Spain
| | - Javier Carrasco
- Centre for Cooperative Research on Alternative Energies (CIC energiGUNE), Basque Research and Technology Alliance (BRTA), Vitoria-Gasteiz, Spain
| |
Collapse
|
2
|
Boezio B, Audouze K, Ducrot P, Taboureau O. Network-based Approaches in Pharmacology. Mol Inform 2017; 36. [PMID: 28692140 DOI: 10.1002/minf.201700048] [Citation(s) in RCA: 189] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Accepted: 06/21/2017] [Indexed: 12/23/2022]
Abstract
In drug discovery, network-based approaches are expected to spotlight our understanding of drug action across multiple layers of information. On one hand, network pharmacology considers the drug response in the context of a cellular or phenotypic network. On the other hand, a chemical-based network is a promising alternative for characterizing the chemical space. Both can provide complementary support for the development of rational drug design and better knowledge of the mechanisms underlying the multiple actions of drugs. Recent progress in both concepts is discussed here. In addition, a network-based approach using drug-target-therapy data is introduced as an example.
Collapse
Affiliation(s)
- Baptiste Boezio
- Université Paris Diderot - Inserm UMR-S973, MTi, 75205, Paris Cedex 13, 75013, Paris, France
| | - Karine Audouze
- Université Paris Diderot - Inserm UMR-S973, MTi, 75205, Paris Cedex 13, 75013, Paris, France
| | - Pierre Ducrot
- Institut de Recherche Servier, 125 Chemin de Ronde, 78290, Croissy-sur-Seine, France
| | - Olivier Taboureau
- Université Paris Diderot - Inserm UMR-S973, MTi, 75205, Paris Cedex 13, 75013, Paris, France
| |
Collapse
|
3
|
Chemical space visualization: transforming multidimensional chemical spaces into similarity-based molecular networks. Future Med Chem 2016; 8:1769-78. [PMID: 27572425 DOI: 10.4155/fmc-2016-0023] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND The concept of chemical space is of fundamental relevance for medicinal chemistry and chemical informatics. Multidimensional chemical space representations are coordinate-based. Chemical space networks (CSNs) have been introduced as a coordinate-free representation. RESULTS A computational approach is presented for the transformation of multidimensional chemical space into CSNs. The design of transformation CSNs (TRANS-CSNs) is based upon a similarity function that directly reflects distance relationships in original multidimensional space. CONCLUSION TRANS-CSNs provide an immediate visualization of coordinate-based chemical space and do not require the use of dimensionality reduction techniques. At low network density, TRANS-CSNs are readily interpretable and make it possible to evaluate structure-activity relationship information originating from multidimensional chemical space.
Collapse
|
4
|
Zahoránszky-Kőhalmi G, Bologa CG, Oprea TI. Impact of similarity threshold on the topology of molecular similarity networks and clustering outcomes. J Cheminform 2016; 8:16. [PMID: 27030802 PMCID: PMC4812625 DOI: 10.1186/s13321-016-0127-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 03/08/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Complex network theory based methods and the emergence of "Big Data" have reshaped the terrain of investigating structure-activity relationships of molecules. This change gave rise to new methods which need to face an important challenge, namely: how to restructure a large molecular dataset into a network that best serves the purpose of the subsequent analyses. With special focus on network clustering, our study addresses this open question by proposing a data transformation method and a clustering framework. RESULTS Using the WOMBAT and PubChem MLSMR datasets we investigated the relation between varying the similarity threshold applied on the similarity matrix and the average clustering coefficient of the emerging similarity-based networks. These similarity networks were then clustered with the InfoMap algorithm. We devised a systematic method to generate so-called "pseudo-reference" clustering datasets which compensate for the lack of large-scale reference datasets. With help from the clustering framework we were able to observe the effects of varying the similarity threshold and its consequence on the average clustering coefficient and the clustering performance. CONCLUSIONS We observed that the average clustering coefficient versus similarity threshold function can be characterized by the presence of a peak that covers a range of similarity threshold values. This peak is preceded by a steep decline in the number of edges of the similarity network. The maximum of this peak is well aligned with the best clustering outcome. Thus, if no reference set is available, choosing the similarity threshold associated with this peak would be a near-ideal setting for the subsequent network cluster analysis. The proposed method can be used as a general approach to determine the appropriate similarity threshold to generate the similarity network of large-scale molecular datasets.
Collapse
Affiliation(s)
- Gergely Zahoránszky-Kőhalmi
- Translational Informatics Division, University of New Mexico School of Medicine, MSC09 5025, Albuquerque, NM 87131 USA
| | - Cristian G Bologa
- Translational Informatics Division, University of New Mexico School of Medicine, MSC09 5025, Albuquerque, NM 87131 USA
| | - Tudor I Oprea
- Translational Informatics Division, University of New Mexico School of Medicine, MSC09 5025, Albuquerque, NM 87131 USA
| |
Collapse
|
5
|
Lessons learned from the design of chemical space networks and opportunities for new applications. J Comput Aided Mol Des 2016; 30:191-208. [DOI: 10.1007/s10822-016-9906-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Accepted: 03/01/2016] [Indexed: 12/13/2022]
|
6
|
Design of chemical space networks on the basis of Tversky similarity. J Comput Aided Mol Des 2015; 30:1-12. [DOI: 10.1007/s10822-015-9891-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 12/18/2015] [Indexed: 12/28/2022]
|
7
|
Zhang B, Vogt M, Maggiora GM, Bajorath J. Design of chemical space networks using a Tanimoto similarity variant based upon maximum common substructures. J Comput Aided Mol Des 2015; 29:937-50. [DOI: 10.1007/s10822-015-9872-1] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Accepted: 09/24/2015] [Indexed: 12/14/2022]
|
8
|
Zhang B, Vogt M, Maggiora GM, Bajorath J. Comparison of bioactive chemical space networks generated using substructure- and fingerprint-based measures of molecular similarity. J Comput Aided Mol Des 2015; 29:595-608. [DOI: 10.1007/s10822-015-9852-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 06/03/2015] [Indexed: 12/20/2022]
|
9
|
Sukumar N, Krein MP, Prabhu G, Bhattacharya S, Sen S. Network measures for chemical library design. Drug Dev Res 2015; 75:402-11. [PMID: 25195584 DOI: 10.1002/ddr.21218] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
In this overview, we examine recent developments in network approaches to drug design. A brief overview of networks is followed by a discussion of how chemical similarity networks and their properties address challenges in drug design. Multiple methods used to assess or enhance chemical diversity for early-stage drug discovery are discussed, as well as methods that can be used for drug repositioning and ligand polypharmacology.
Collapse
Affiliation(s)
- Nagamani Sukumar
- Department of Chemistry, Shiv Nadar University, Dadri, Gautam Budh Nagar, U.P., 201314, India; Center for Informatics, Shiv Nadar University, Dadri, Gautam Budh Nagar, U.P., 201314, India
| | | | | | | | | |
Collapse
|
10
|
Design and characterization of chemical space networks for different compound data sets. J Comput Aided Mol Des 2014; 29:113-25. [DOI: 10.1007/s10822-014-9821-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 11/27/2014] [Indexed: 01/23/2023]
|
11
|
Affiliation(s)
- Dagmar Stumpfe
- Department of Life Science Informatics, B-IT; LIMES Program Unit Chemical Biology and Medicinal Chemistry; Rheinische Friedrich-Wilhelms-Universität Bonn; Bonn D-53113 Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT; LIMES Program Unit Chemical Biology and Medicinal Chemistry; Rheinische Friedrich-Wilhelms-Universität Bonn; Bonn D-53113 Germany
| |
Collapse
|
12
|
Chemical space networks: a powerful new paradigm for the description of chemical space. J Comput Aided Mol Des 2014; 28:795-802. [PMID: 24925682 DOI: 10.1007/s10822-014-9760-0] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2014] [Accepted: 06/04/2014] [Indexed: 01/26/2023]
Abstract
The concept of chemical space is playing an increasingly important role in many areas of chemical research, especially medicinal chemistry and chemical biology. It is generally conceived as consisting of numerous compound clusters of varying sizes scattered throughout the space in much the same way as galaxies of stars inhabit our universe. A number of issues associated with this coordinate-based representation are discussed. Not the least of which is the continuous nature of the space, a feature not entirely compatible with the inherently discrete nature of chemical space. Cell-based representations, which are derived from coordinate-based spaces, have also been developed that facilitate a number of chemical informatic activities (e.g., diverse subset selection, filling 'diversity voids', and comparing compound collections).These representations generally suffer the 'curse of dimensionality'. In this work, networks are proposed as an attractive paradigm for representing chemical space since they circumvent many of the issues associated with coordinate- and cell-based representations, including the curse of dimensionality. In addition, their relational structure is entirely compatible with the intrinsic nature of chemical space. A description of the features of these chemical space networks is presented that emphasizes their statistical characteristics and indicates how they are related to various types of network topologies that exhibit random, scale-free, and/or 'small world' properties.
Collapse
|
13
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 512] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
14
|
Graphs and networks in chemical and biological informatics: past, present and future. Future Med Chem 2012; 4:2039-47. [DOI: 10.4155/fmc.12.128] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Chemical and biological network analysis has recently garnered intense interest from the perspective of drug design and discovery. While graph theoretic concepts have a long history in chemistry – predating quantum mechanics – and graphical measures of chemical structures date back to the 1970s, it is only recently with the advent of public repositories of information and availability of high-throughput assays and computational resources that network analysis of large-scale chemical networks, such as protein–protein interaction networks, has become possible. Drug design and discovery are undergoing a paradigm shift, from the notion of ‘one target, one drug’ to a much more nuanced view that relies on multiple sources of information: genomic, proteomic, metabolomic and so on. This holistic view of drug design is an incredibly daunting undertaking still very much in its infancy. Here, we focus on current developments in graph- and network-centric approaches in chemical and biological informatics, with particular reference to applications in the fields of SAR modeling and drug design. Key insights from the past suggest a path forward via visualization and fusion of multiple sources of chemical network data.
Collapse
|
15
|
Sheng C, Zhang W. Fragment Informatics and Computational Fragment-Based Drug Design: An Overview and Update. Med Res Rev 2012; 33:554-98. [DOI: 10.1002/med.21255] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Chunquan Sheng
- Department of Medicinal Chemistry; School of Pharmacy; Second Military Medical University; 325 Guohe Road Shanghai 200433 People's Republic of China
| | - Wannian Zhang
- Department of Medicinal Chemistry; School of Pharmacy; Second Military Medical University; 325 Guohe Road Shanghai 200433 People's Republic of China
| |
Collapse
|
16
|
Krein MP, Sukumar N. Exploration of the Topology of Chemical Spaces with Network Measures. J Phys Chem A 2011; 115:12905-18. [DOI: 10.1021/jp204022u] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- Michael P. Krein
- Rensselaer Exploratory Center for Cheminformatics Research, and Department of Chemistry & Chemical Biology, Rensselaer Polytechnic Institute, 110 Eighth Street, Troy, New York 12180, United States
| | - N. Sukumar
- Rensselaer Exploratory Center for Cheminformatics Research, and Department of Chemistry & Chemical Biology, Rensselaer Polytechnic Institute, 110 Eighth Street, Troy, New York 12180, United States
| |
Collapse
|
17
|
Cerruela García G, Luque Ruiz I, Gómez-Nieto MÁ. Analysis and Study of Molecule Data Sets Using Snowflake Diagrams of Weighted Maximum Common Subgraph Trees. J Chem Inf Model 2011; 51:1216-32. [DOI: 10.1021/ci100484z] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Gonzalo Cerruela García
- Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, Albert Einstein Building, E-14071 Córdoba, Spain
| | - Irene Luque Ruiz
- Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, Albert Einstein Building, E-14071 Córdoba, Spain
| | - Miguel Ángel Gómez-Nieto
- Department of Computing and Numerical Analysis, University of Córdoba, Campus de Rabanales, Albert Einstein Building, E-14071 Córdoba, Spain
| |
Collapse
|
18
|
Abstract
IMPORTANCE OF THE FIELD Structure-based in silico drug screening is now widely used in drug development projects. Structure-based in silico drug screening is generally performed using a protein-compound docking program and docking scoring function. Many docking programs have been developed over the last 2 decades, but their prediction accuracy remains insufficient. AREAS COVERED IN THIS REVIEW This review highlights the recent progress of the post-processing of protein-compound complexes after docking. WHAT THE READER WILL GAIN These methods utilize ensembles of docking poses of compounds to improve the prediction accuracy for the ligand-docking pose and screening results. While the individual docking poses are not reliable, the free energy surface or the most probable docking pose can be estimated from the ensemble of docking poses. TAKE HOME MESSAGE The protein-compound docking program provides an arbitral rather than a canonical ensemble of docking poses. When the ensemble of docking poses satisfies the canonical ensemble, we can discuss how these post-docking analysis methods work and fail. Thus, improvements to the docking software will be needed in order to generate well-defined ensembles of docking poses.
Collapse
Affiliation(s)
- Yoshifumi Fukunishi
- Biomedicinal Information Research Center (BIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26, Aomi, Koto-ku, Tokyo 135 0064, Japan.
| |
Collapse
|
19
|
Haque IS, Pande VS. SCISSORS: a linear-algebraical technique to rapidly approximate chemical similarities. J Chem Inf Model 2010; 50:1075-88. [PMID: 20509629 DOI: 10.1021/ci1000136] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Algorithms for several emerging large-scale problems in cheminformatics have as their rate-limiting step the evaluation of relatively slow chemical similarity measures, such as structural similarity or three-dimensional (3-D) shape comparison. In this article we present SCISSORS, a linear-algebraical technique (related to multidimensional scaling and kernel principal components analysis) to rapidly estimate chemical similarities for several popular measures. We demonstrate that SCISSORS faithfully reflects its source similarity measures for both Tanimoto calculation and rank ordering. After an efficient precalculation step on a database, SCISSORS affords several orders of magnitude of speedup in database screening. SCISSORS furthermore provides an asymptotic speedup for large similarity matrix construction problems, reducing the number of conventional slow similarity evaluations required from quadratic to linear scaling.
Collapse
Affiliation(s)
- Imran S Haque
- Department of Computer Science and Department of Chemistry, Stanford University, Stanford, CA, USA
| | | |
Collapse
|