1
|
On the topological descriptors and structural analysis of cerium oxide nanostructures. CHEMICAL PAPERS 2023. [DOI: 10.1007/s11696-023-02675-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
2
|
García-García Á, de Julián-Ortiz JV, Gálvez J, Font D, Ayats C, Guna Serrano MDR, Muñoz-Collado C, Borrás R, Villalgordo JM. Similarity-Based Virtual Screening to Find Antituberculosis Agents Based on Novel Scaffolds: Design, Syntheses and Pharmacological Assays. Int J Mol Sci 2022; 23:ijms232315057. [PMID: 36499384 PMCID: PMC9737236 DOI: 10.3390/ijms232315057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 11/02/2022] [Accepted: 11/22/2022] [Indexed: 12/04/2022] Open
Abstract
A method to identify molecular scaffolds potentially active against the Mycobacterium tuberculosis complex (MTBC) is developed. A set of structurally heterogeneous agents against MTBC was used to obtain a mathematical model based on topological descriptors. This model was statistically validated through a Leave-n-Out test. It successfully discriminated between active or inactive compounds over 86% in database sets. It was also useful to select new potential antituberculosis compounds in external databases. The selection of new substituted pyrimidines, pyrimidones and triazolo[1,5-a]pyrimidines was particularly interesting because these structures could provide new scaffolds in this field. The seven selected candidates were synthesized and six of them showed activity in vitro.
Collapse
Affiliation(s)
- Ángela García-García
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, 46100 Burjassot, Spain
| | - Jesus Vicente de Julián-Ortiz
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, 46100 Burjassot, Spain
- Correspondence:
| | - Jorge Gálvez
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, 46100 Burjassot, Spain
| | - David Font
- Departamento de Química, Universitat de Girona, 17071 Girona, Spain
| | - Carles Ayats
- Departamento de Química, Universitat de Girona, 17071 Girona, Spain
| | - María del Remedio Guna Serrano
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, 46100 Burjassot, Spain
- Departamento de Microbiología, Facultad de Medicina y Odontología, Universitat de València, 46010 València, Spain
| | - Carlos Muñoz-Collado
- Departamento de Microbiología, Facultad de Medicina y Odontología, Universitat de València, 46010 València, Spain
| | - Rafael Borrás
- Departamento de Microbiología, Facultad de Medicina y Odontología, Universitat de València, 46010 València, Spain
| | | |
Collapse
|
3
|
Dixit VA. A simple model to solve a complex drug toxicity problem. Toxicol Res (Camb) 2018; 8:157-171. [PMID: 30997019 DOI: 10.1039/c8tx00261d] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2018] [Accepted: 11/28/2018] [Indexed: 02/06/2023] Open
Abstract
Linear drug toxicity models like therapeutic index (TI), physicochemical rules (rule of five, 3/75), ligand efficiency indices (LEI), ideal pharmacokinetic (PK) and pharmacodynamic (PD) profiles are widely used in drug discovery and development. In spite of this, predicting drug toxicity at various stages remains challenging and the overall productivity (<20%) and ultimate benefit to the patients remain low. A simple drug toxicity model, "Drug Toxicity Index" (DTI), is developed here using 711 oral drugs. DTI redefines drug toxicity as scaled biphasic and exponential functions of PD, PK and physicochemical parameters. PD, PK and physicochemical toxicity contributions were estimated from the on and off target IC50, maximum unbound plasma drug concentration (free C max), and log D values, respectively. These contributions are then scaled by molar dose and oral bioavailability and the logarithm of the sum of scaled contributions is DTI. Drugs with DTI above the WHO ATC drug category specific average values consistently have toxic profiles, while drugs with DTI below this average are relatively safe. DTI performs better than standard rules for lead optimization, LEI and exposure based TIs in identifying safe and toxic drugs. DTI classifies 392 drugs reported in the US-FDA's Liver Toxicity Knowledge Base (LTKB) with an AUC for ROC curves of 0.91-0.64 for different WHO ATC categories. DTI has been used to predict network meta-analysis results on relative toxicity within/across eight different therapeutic areas. It is useful in understanding PD, PK and physicochemical toxicity contributions and identifying potentially toxic drugs and the toxicity of recently approved drugs. Decision trees are proposed for applying the DTI concept in preclinical drug discovery and clinical trial settings. DTI can potentially reduce failure in drug discovery and might be useful in therapeutic drug monitoring and in xenobiotic and environmental toxicity studies.
Collapse
Affiliation(s)
- Vaibhav A Dixit
- Department of Pharmacy , Birla Institute of Technology and Sciences Pilani (BITS Pilani) , Vidya Vihar Campus , Street number 41 , Pilani , 333031 , Rajasthan , India . ; ; Tel: +91 1596 255652 ; Tel: +91-7709129400
| |
Collapse
|
4
|
García-García A, Gálvez J, de Julián-Ortiz JV, García-Domenech R, Muñoz C, Guna R, Borrás R. Search of Chemical Scaffolds for Novel Antituberculosis Agents. ACTA ACUST UNITED AC 2016; 10:206-14. [PMID: 15809316 DOI: 10.1177/1087057104273486] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A method to identify chemical scaffolds potentially active against Mycobacterium tuberculosis is presented. The molecular features of a set of structurally heterogeneous antituberculosis drugs were coded by means of structural invariants. Three techniques were used to obtain equations able to model the antituberculosis activity: linear discriminant analysis, multilinear regression, and shrinkage estimation–ridge regression. The model obtained was statistically validated through leave- n-out test, and an external set and was applied to a database for the search of new active agents. The selected compounds were assayed in vitro, and among those identified as active stand reserpine, N,N,N′,N′-tetrakis-(2-pyridylmethyl)-ethylenediamine (TPEN), trifluoperazine, pentamidine, and 2-methyl-4,6-dinitro-phenol (DNOC). They show activity comparable to or superior to ethambutol, used in combination with other drugs for the prevention and treatment of Mycobacterium avium complex and drug-resistant tuberculosis.
Collapse
Affiliation(s)
- Angeles García-García
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Av. Vicent Andrés Estellés s/n, 46100 Burjassot, Spain
| | | | | | | | | | | | | |
Collapse
|
5
|
Atabati M, Emamalizadeh R. The Hydrogen Perturbation in Molecular Connectivity Indices and their Application to a QSPR Study. J SOLUTION CHEM 2012. [DOI: 10.1007/s10953-012-9919-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
6
|
Agüero-Chapín G, Antunes A, Ubeira FM, Chou KC, González-Díaz H. Comparative study of topological indices of macro/supramolecular RNA complex networks. J Chem Inf Model 2009; 48:2265-77. [PMID: 18937437 DOI: 10.1021/ci8001809] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
RNA function annotation is often based on alignment to a previously studied template. In contrast to the study of proteins, there are not many alignment-free methods to predict RNA functions if alignment fails. The use of topological indices (TIs) of RNA complex networks (CNs) to find quantitative structure-activity relationships (QSAR) may be an alternative to incorporate secondary structure or sequence-to-sequence similarity. Here, we introduce new QSAR-like techniques using RNA macromolecular CNs (mmCNs), where nodes are nucleotides, or RNA supramolecular CNs (smCNs), where nodes are RNA sequences. We studied a data set of 198 sequences including 18S-rRNAs (important phylogenetic molecular biomarkers). We constructed three types of RNA mmCNs: sequence-linear (SL), Cartesian-lattice (CL), and sequence-folding CNs (SF-CNs) and two smCNs: sequence-sequence disagreement CN (SSD) and sequence-sequence similarity (SSS-smCN). We reported the first comparative QSAR study with all these CIs and CNs, which includes: (i) spectral moments ( ( i )micro d ( w)) of SL-mmCNs (accuracy = 75.3%), (ii) electrostatic CIs (xi d ) of CL-mmCNs (>90%), (iii) thermodynamic parameters (Delta G, Delta H, Delta S, and T m) of SF-mmCNs (64.7%), (iv) disagreement-distribution moments ( M k ) of the SSD-smCN (79.3%), and (v) node centralities of the SSD-smCN (78.0%). Furthermore, we reported the experimental isolation of a new RNA sequence from Psidum guajava leaf tissue and its QSAR and BLAST prediction to illustrate the practical use of these methods. We also investigated the use of these CNs to explore rRNA diversity on bacteria, plants, and parasites from the Dactylogyrus genus. The HPL-mmCNs model was the best of all found. All the CNs and TIs, except SF-mmCNs, were introduced here by the first time for the QSAR study of RNA, which allowed a comparative study for RNA classification.
Collapse
Affiliation(s)
- Guillermín Agüero-Chapín
- Department of Microbiology & Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela 15782, Spain.
| | | | | | | | | |
Collapse
|
7
|
Quantitative Proteome–Property Relationships (QPPRs). Part 1: Finding biomarkers of organic drugs with mean Markov connectivity indices of spiral networks of blood mass spectra. Bioorg Med Chem 2008; 16:9684-93. [DOI: 10.1016/j.bmc.2008.10.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2008] [Revised: 09/29/2008] [Accepted: 10/02/2008] [Indexed: 11/22/2022]
|
8
|
Rücker C, Meringer M, Kerber A. QSPR Using MOLGEN-QSPR: The Challenge of Fluoroalkane Boiling Points. J Chem Inf Model 2004; 45:74-80. [PMID: 15667131 DOI: 10.1021/ci0497298] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
By means of the new software MOLGEN-QSPR, a multilinear regression model for the boiling points of lower fluoroalkanes is established. The model is based exclusively on simple descriptors derived directly from molecular structure and nevertheless describes a broader set of data more precisely than previous attempts that used either more demanding (quantum chemical) descriptors or more demanding (nonlinear) statistical methods such as neural networks. The model's internal consistency was confirmed by leave-one-out cross-validation. The model was used to predict all unknown boiling points of fluorobutanes, and the quality of predictions was estimated by means of comparison with boiling point predictions for fluoropentanes.
Collapse
Affiliation(s)
- Christoph Rücker
- Department of Mathematics, Universität Bayreuth, D-95440 Bayreuth, Germany.
| | | | | |
Collapse
|
9
|
Rücker C, Meringer M, Kerber A. QSPR Using MOLGEN-QSPR: The Example of Haloalkane Boiling Points. ACTA ACUST UNITED AC 2004; 44:2070-6. [PMID: 15554677 DOI: 10.1021/ci049802u] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
MOLGEN-QSPR is a software newly developed for use in quantitative structure property relationships (QSPR) work. It allows to import, to manually edit, or to generate chemical structures, to detect duplicate structures, to import or to manually input property values, to calculate the values of a broad pool of molecular descriptors, to establish QSPR equations (models), and using such models to predict unknown property values. In connection with the molecule generator MOLGEN, MOLGEN-QSPR is able to predict property values for all compounds in a predetermined structure space (inverse QSPR). Some of the features of MOLGEN-QSPR are demonstrated on the example of haloalkane boiling points. The data basis used here is broader than in previous studies, and the models established are both more precise and simpler than those previously reported.
Collapse
Affiliation(s)
- Christoph Rücker
- Department of Mathematics, Universität Bayreuth, D-95440 Bayreuth, Germany.
| | | | | |
Collapse
|
10
|
Kovatcheva A, Golbraikh A, Oloff S, Xiao YD, Zheng W, Wolschann P, Buchbauer G, Tropsha A. Combinatorial QSAR of Ambergris Fragrance Compounds. ACTA ACUST UNITED AC 2004; 44:582-95. [PMID: 15032539 DOI: 10.1021/ci034203t] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A combinatorial quantitative structure-activity relationships (Combi-QSAR) approach has been developed and applied to a data set of 98 ambergris fragrance compounds with complex stereochemistry. The Combi-QSAR approach explores all possible combinations of different independent descriptor collections and various individual correlation methods to obtain statistically significant models with high internal (for the training set) and external (for the test set) accuracy. Seven different descriptor collections were generated with commercially available MOE, CoMFA, CoMMA, Dragon, VolSurf, and MolconnZ programs; we also included chirality topological descriptors recently developed in our laboratory (Golbraikh, A.; Bonchev, D.; Tropsha, A. J. Chem. Inf. Comput. Sci. 2001, 41, 147-158). CoMMA descriptors were used in combination with MOE descriptors. MolconnZ descriptors were used in combination with chirality descriptors. Each descriptor collection was combined individually with four correlation methods, including k-nearest neighbors (kNN) classification, Support Vector Machines (SVM), decision trees, and binary QSAR, giving rise to 28 different types of QSAR models. Multiple diverse and representative training and test sets were generated by the divisions of the original data set in two. Each model with high values of leave-one-out cross-validated correct classification rate for the training set was subjected to extensive internal and external validation to avoid overfitting and achieve reliable predictive power. Two validation techniques were employed, i.e., the randomization of the target property (in this case, odor intensity) also known as the Y-randomization test and the assessment of external prediction accuracy using test sets. We demonstrate that not every combination of the data modeling technique and the descriptor collection yields a validated and predictive QSAR model. kNN classification in combination with CoMFA descriptors was found to be the best QSAR approach overall since predictive models with correct classification rates for both training and test sets of 0.7 and higher were obtained for all divisions of the ambergris data set into the training and test sets. Many predictive QSAR models were also found using a combination of kNN classification method with other collections of descriptors. The combinatorial QSAR affords automation, computational efficiency, and higher probability of identifying significant QSAR models for experimental data sets than the traditional approaches that rely on a single QSAR method.
Collapse
Affiliation(s)
- Assia Kovatcheva
- Department of Pharmaceutical Chemistry, University of Vienna, Althanstrasse 14, A-1090 Vienna, Austria
| | | | | | | | | | | | | | | |
Collapse
|
11
|
Yang C, Zhong C. Modified Connectivity Indices and Their Application to QSPR Study. ACTA ACUST UNITED AC 2003; 43:1998-2004. [PMID: 14632450 DOI: 10.1021/ci034093q] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A modified adjacency matrix was developed to delineate the chemical graph of a compound, in which the element a(ii) along the diagonal of the matrix reflects the numbers of the lone-pair electrons and pi bonds of the ith atom, and the off-diagonal element a(ij) of the matrix characterizes whether the jth non-hydrogen atom is bonded to the ith non-hydrogen atom as well as the number of hydrogen atoms bonded to the jth non-hydrogen atom. The corresponding vertex-degree matrix can distinguish the non-hydrogen atoms in the compound better than that from the original adjacency matrix. Based on the newly proposed adjacency matrix, modified molecular connectivity indices (mMCIs) were proposed as structural descriptors for organic compounds, which were applied to the QSPR studies on the boiling point temperature, molar volume and molar refraction of alkanes, alkenes and alcohols. The results show that, in most cases, the mMCIs give improved correlations than the original molecular connectivity indices (MCIs), which are particularly suitable to distinguish isomers.
Collapse
Affiliation(s)
- Chunsheng Yang
- Department of Chemical Engineering, PO Box 100, Beijing University of Chemical Technology, Beijing 100029, China
| | | |
Collapse
|
12
|
Bonchev D. On the complexity of directed biological networks. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2003; 14:199-214. [PMID: 12854653 DOI: 10.1080/1062936031000101764] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Recently there was an increased interest towards network approach to biology and environmental sciences. Networks are believed to be the key to the understanding of the work of biological machine in cells, organs, organisms, and ecosystems. While complexity of undirected networks has been recently analyzed, the assessment of complexity in directed networks has specificity that has not been explored so far. The present paper aims to address the existing gap by discussing the applicability of the available complexity descriptors. New specific measures (vertex accessibility, accessible connectedness, and adjusted average distance) are introduced based on assessment of the reduced accessibility of nodes in directed networks.
Collapse
Affiliation(s)
- D Bonchev
- Program for Theory of Complex Natural Systems, Ford Crocken Campus, 5007 Avenue U, Texas A&M University, Galveston, TX 77551, USA.
| |
Collapse
|
13
|
Golbraikh A, Tropsha A. QSAR modeling using chirality descriptors derived from molecular topology. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2003; 43:144-54. [PMID: 12546547 DOI: 10.1021/ci025516b] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Topological descriptors of chemical structures (such as molecular connectivity indices) are widely used in Quantitative Structure-Activity Relationships (QSAR) studies. Unfortunately, these descriptors lack the ability to discriminate between stereoisomers, which limits their application in QSAR. To circumvent this problem, we recently introduced chirality descriptors derived from molecular graphs and applied them in QSAR studies of ecdysteroids (Golbraikh A.; Bonchev, D.; Tropsha, A. J. Chem. Inf. Comput. Sci. 2001,41, 147-158). In this paper, we extend our earlier work by applying chirality descriptors to four data sets containing chiral compounds. All models were derived with the k-nearest neighbors (kNN) QSAR method developed in our laboratory (Zheng, W.; Tropsha, A. J. Chem. Inf. Comput. Sci. 2000, 40, 185-194). They were validated using the same training and test sets that were employed in various, mostly 3D-QSAR, investigations published by other authors. We show that for all data sets 2D-QSAR models that use a combination of chirality descriptors with conventional (chirality insensitive) topological descriptors afford better or similar predictive ability as compared to models generated with 3D-QSAR approaches. The results presented in this paper reassure that 2D-QSAR modeling provides a powerful alternative to 3D-QSAR.
Collapse
Affiliation(s)
- Alexander Golbraikh
- Laboratory for Molecular Modeling, Division of Medicinal Chemistry, School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7360, USA.
| | | |
Collapse
|
14
|
Gerstl Z. Quantitative structure-activity relationships (QSARs) as a tool for predicting the sorption of organic chemicals in soils. Isr J Chem 2002. [DOI: 10.1560/0p99-xt6b-9amm-x3cm] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
15
|
Golbraikh A, Bonchev D, Tropsha A. Novel ZE-isomerism descriptors derived from molecular topology and their application to QSAR analysis. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2002; 42:769-87. [PMID: 12132878 DOI: 10.1021/ci0103469] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We introduce several series of novel ZE-isomerism descriptors derived directly from two-dimensional molecular topology. These descriptors make use of a quantity named ZE-isomerism correction, which is added to the vertex degrees of atoms connected by double bonds in Z and E configurations. This approach is similar to the one described previously for topological chirality descriptors (Golbraikh, A., et al. J. Chem. Inf. Comput. Sci. 2001, 41, 147-158). The ZE-isomerism descriptors include modified molecular connectivity indices, overall Zagreb indices, extended connectivity, overall connectivity, and topological charge indices. They can be either real or complex numbers. Mathematical properties of different subgroups of ZE-isomerism descriptors are discussed. These descriptors circumvent the inability of conventional topological indices to distinguish between Z and E isomers. The applicability of ZE-isomerism descriptors to QSAR analysis is demonstrated in the studies of a series of 131 anticancer agents inhibiting tubulin polymerization.
Collapse
Affiliation(s)
- Alexander Golbraikh
- Laboratory for Molecular Modeling, Division of Medicinal Chemistry and Natural Products, School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | | | | |
Collapse
|
16
|
Hall LH, Kier LB. Issues in representation of molecular structure the development of molecular connectivity. J Mol Graph Model 2002; 20:4-18. [PMID: 11760002 DOI: 10.1016/s1093-3263(01)00097-3] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Significant issues in the representation of molecular structure and the development of the molecular connectivity paradigm are presented. In the molecular connectivity paradigm, molecular structure is represented directly. Kier and Hall developed the method by creating ways to encode electronic information based on the paradigm developed from the Randić branching index. The simple and valence delta values were created to encode atomic and valence-state electronic information through counts of sigma, pi, and lone pair electrons. A family of indices was created to provide a wide range of structure information. The key aspects of the development are presented and discussed in such a way as to reveal, at least in part, the imaginative thinking involved in the process. Possible future roles for molecular connectivity chi indices are discussed.
Collapse
Affiliation(s)
- L H Hall
- Department of Chemistry, Eastern Nazarene College, Quincy, MA 02170, USA.
| | | |
Collapse
|
17
|
Rücker G, Rücker C. Substructure, subgraph, and walk counts as measures of the complexity of graphs and molecules. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2001; 41:1457-62. [PMID: 11749569 DOI: 10.1021/ci0100548] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In discussions of unsaturated compounds represented by multigraphs it is necessary to distinguish between the notions of substructure and subgraph. Here the difference is explained and exemplified, and a computer program is introduced which for the first time is able to construct and count all substructures and subgraphs for a colored multigraph (a molecular compound which may contain unsaturation and heteroatoms). Construction of all substructures and subgraphs is computationally demanding; therefore, two alternatives are pointed out for the treatment of large sets of compounds: (i) Often it will suffice to consider counts of substructures/subgraphs up to a certain number of edges only, information which is provided by the program much more rapidly. (ii) It is shown that information equivalent to that gained from substructure or subgraph counts is often far more easily available using walk counts. Some problems and their consequences for substructure/subgraph/walk counts are discussed that arise from the models used in organic chemistry for certain compounds such as aromatics and from the necessity to express qualitative features of molecular structures numerically.
Collapse
Affiliation(s)
- G Rücker
- Department of Mathematics, Universität Bayreuth, D-95440 Bayreuth, Germany
| | | |
Collapse
|