1
|
Kumar R. Materiomically Designed Polymeric Vehicles for Nucleic Acids: Quo Vadis? ACS APPLIED BIO MATERIALS 2022; 5:2507-2535. [PMID: 35642794 DOI: 10.1021/acsabm.2c00346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Despite rapid advances in molecular biology, particularly in site-specific genome editing technologies, such as CRISPR/Cas9 and base editing, financial and logistical challenges hinder a broad population from accessing and benefiting from gene therapy. To improve the affordability and scalability of gene therapy, we need to deploy chemically defined, economical, and scalable materials, such as synthetic polymers. For polymers to deliver nucleic acids efficaciously to targeted cells, they must optimally combine design attributes, such as architecture, length, composition, spatial distribution of monomers, basicity, hydrophilic-hydrophobic phase balance, or protonation degree. Designing polymeric vectors for specific nucleic acid payloads is a multivariate optimization problem wherein even minuscule deviations from the optimum are poorly tolerated. To explore the multivariate polymer design space rapidly, efficiently, and fruitfully, we must integrate parallelized polymer synthesis, high-throughput biological screening, and statistical modeling. Although materiomics approaches promise to streamline polymeric vector development, several methodological ambiguities must be resolved. For instance, establishing a flexible polymer ontology that accommodates recent synthetic advances, enforcing uniform polymer characterization and data reporting standards, and implementing multiplexed in vitro and in vivo screening studies require considerable planning, coordination, and effort. This contribution will acquaint readers with the challenges associated with materiomics approaches to polymeric gene delivery and offers guidelines for overcoming these challenges. Here, we summarize recent developments in combinatorial polymer synthesis, high-throughput screening of polymeric vectors, omics-based approaches to polymer design, barcoding schemes for pooled in vitro and in vivo screening, and identify materiomics-inspired research directions that will realize the long-unfulfilled clinical potential of polymeric carriers in gene therapy.
Collapse
Affiliation(s)
- Ramya Kumar
- Department of Chemical & Biological Engineering, Colorado School of Mines, 1613 Illinois St, Golden, Colorado 80401, United States
| |
Collapse
|
2
|
Wang J, Li Z, Basharat M, Wu S, Zhang S, Zhang X, Ma H, Liu W, Wu D, Wu Z. Effect of side groups on glass transition temperatures of Poly(ethoxy/phenoxy)phosphazenes: Prediction and synthesis. POLYMER 2021. [DOI: 10.1016/j.polymer.2021.124068] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
3
|
Chong JW, Thangalazhy-Gopakumar S, Muthoosamy K, Chemmangattuvalappil NG. Design of bio-oil additives via molecular signature descriptors using a multi-stage computer-aided molecular design framework. Front Chem Sci Eng 2021. [DOI: 10.1007/s11705-021-2056-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
4
|
Alshehri AS, Gani R, You F. Deep learning and knowledge-based methods for computer-aided molecular design—toward a unified approach: State-of-the-art and future directions. Comput Chem Eng 2020. [DOI: 10.1016/j.compchemeng.2020.107005] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
5
|
Gantzer P, Creton B, Nieto-Draghi C. Inverse-QSPR for de novo Design: A Review. Mol Inform 2019; 39:e1900087. [PMID: 31682079 DOI: 10.1002/minf.201900087] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Accepted: 11/04/2019] [Indexed: 11/09/2022]
Abstract
The use of computer tools to solve chemistry-related problems has given rise to a large and increasing number of publications these last decades. This new field of science is now well recognized and labelled Chemoinformatics. Among all chemoinformatics techniques, the use of statistical based approaches for property predictions has been the subject of numerous research reflecting both new developments and many cases of applications. The so obtained predictive models relating a property to molecular features - descriptors - are gathered under the acronym QSPR, for Quantitative Structure Property Relationships. Apart from the obvious use of such models to predict property values for new compounds, their use to virtually synthesize new molecules - de novo design - is currently a high-interest subject. Inverse-QSPR (i-QSPR) methods have hence been developed to accelerate the discovery of new materials that meet a set of specifications. In the proposed manuscript, we review existing i-QSPR methodologies published in the open literature in a way to highlight developments, applications, improvements and limitations of each.
Collapse
Affiliation(s)
- Philippe Gantzer
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852, Rueil-Malmaison, France
| | - Benoit Creton
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852, Rueil-Malmaison, France
| | - Carlos Nieto-Draghi
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852, Rueil-Malmaison, France
| |
Collapse
|
6
|
Kurgan G, Kurgan L, Schneider A, Onyeabor M, Rodriguez-Sanchez Y, Taylor E, Martinez R, Carbonell P, Shi X, Gu H, Wang X. Identification of major malate export systems in an engineered malate-producing Escherichia coli aided by substrate similarity search. Appl Microbiol Biotechnol 2019; 103:9001-9011. [DOI: 10.1007/s00253-019-10164-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 08/27/2019] [Accepted: 09/28/2019] [Indexed: 01/29/2023]
|
7
|
St John PC, Phillips C, Kemper TW, Wilson AN, Guan Y, Crowley MF, Nimlos MR, Larsen RE. Message-passing neural networks for high-throughput polymer screening. J Chem Phys 2019; 150:234111. [PMID: 31228909 DOI: 10.1063/1.5099132] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Machine learning methods have shown promise in predicting molecular properties, and given sufficient training data, machine learning approaches can enable rapid high-throughput virtual screening of large libraries of compounds. Graph-based neural network architectures have emerged in recent years as the most successful approach for predictions based on molecular structure and have consistently achieved the best performance on benchmark quantum chemical datasets. However, these models have typically required optimized 3D structural information for the molecule to achieve the highest accuracy. These 3D geometries are costly to compute for high levels of theory, limiting the applicability and practicality of machine learning methods in high-throughput screening applications. In this study, we present a new database of candidate molecules for organic photovoltaic applications, comprising approximately 91 000 unique chemical structures. Compared to existing datasets, this dataset contains substantially larger molecules (up to 200 atoms) as well as extrapolated properties for long polymer chains. We show that message-passing neural networks trained with and without 3D structural information for these molecules achieve similar accuracy, comparable to state-of-the-art methods on existing benchmark datasets. These results therefore emphasize that for larger molecules with practical applications, near-optimal prediction results can be obtained without using optimized 3D geometry as an input. We further show that learned molecular representations can be leveraged to reduce the training data required to transfer predictions to a new density functional theory functional.
Collapse
Affiliation(s)
- Peter C St John
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Caleb Phillips
- Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Travis W Kemper
- Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - A Nolan Wilson
- National Biaoenergy Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Yanfei Guan
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523-1872, USA
| | - Michael F Crowley
- Biosciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Mark R Nimlos
- National Biaoenergy Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| | - Ross E Larsen
- Computational Science Center, National Renewable Energy Laboratory, Golden, Colorado 80401-3393, USA
| |
Collapse
|
8
|
Koch M, Duigou T, Carbonell P, Faulon JL. Molecular structures enumeration and virtual screening in the chemical space with RetroPath2.0. J Cheminform 2017; 9:64. [PMID: 29260340 PMCID: PMC5736515 DOI: 10.1186/s13321-017-0252-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2017] [Accepted: 12/08/2017] [Indexed: 08/26/2023] Open
Abstract
BACKGROUND Network generation tools coupled with chemical reaction rules have been mainly developed for synthesis planning and more recently for metabolic engineering. Using the same core algorithm, these tools apply a set of rules to a source set of compounds, stopping when a sink set of compounds has been produced. When using the appropriate sink, source and rules, this core algorithm can be used for a variety of applications beyond those it has been developed for. RESULTS Here, we showcase the use of the open source workflow RetroPath2.0. First, we mathematically prove that we can generate all structural isomers of a molecule using a reduced set of reaction rules. We then use this enumeration strategy to screen the chemical space around a set of monomers and predict their glass transition temperatures, as well as around aminoglycosides to search structures maximizing antibacterial activity. We also perform a screening around aminoglycosides with enzymatic reaction rules to ensure biosynthetic accessibility. We finally use our workflow on an E. coli model to complete E. coli metabolome, with novel molecules generated using promiscuous enzymatic reaction rules. These novel molecules are searched on the MS spectra of an E. coli cell lysate interfacing our workflow with OpenMS through the KNIME Analytics Platform. CONCLUSION We provide an easy to use and modify, modular, and open-source workflow. We demonstrate its versatility through a variety of use cases including molecular structure enumeration, virtual screening in the chemical space, and metabolome completion. Because it is open source and freely available on MyExperiment.org, workflow community contributions should likely expand further the features of the tool, even beyond the use cases presented in the paper.
Collapse
Affiliation(s)
- Mathilde Koch
- Micalis Institute, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Thomas Duigou
- Micalis Institute, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Pablo Carbonell
- SYNBIOCHEM Centre, Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK
| | - Jean-Loup Faulon
- Micalis Institute, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France. .,SYNBIOCHEM Centre, Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK. .,CNRS-UMR8030/Laboratoire iSSB, Université Paris-Saclay, 91000, Évry, France.
| |
Collapse
|
9
|
Austin ND, Sahinidis NV, Trahan DW. Computer-aided molecular design: An introduction and review of tools, applications, and solution techniques. Chem Eng Res Des 2016. [DOI: 10.1016/j.cherd.2016.10.014] [Citation(s) in RCA: 81] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
10
|
Daoutidis P, Marvin WA, Rangarajan S, Torres AI. Engineering Biomass Conversion Processes: A Systems Perspective. AIChE J 2012. [DOI: 10.1002/aic.13978] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Prodromos Daoutidis
- Dept. of Chemical Engineering and Materials Science; University of Minnesota; Minneapolis; MN; 55455
| | - W. Alex Marvin
- Dept. of Chemical Engineering and Materials Science; University of Minnesota; Minneapolis; MN; 55455
| | - Srinivas Rangarajan
- Dept. of Chemical Engineering and Materials Science; University of Minnesota; Minneapolis; MN; 55455
| | - Ana I. Torres
- Dept. of Chemical Engineering and Materials Science; University of Minnesota; Minneapolis; MN; 55455
| |
Collapse
|
11
|
Martin S. Lattice enumeration for inverse molecular design using the signature descriptor. J Chem Inf Model 2012; 52:1787-97. [PMID: 22657105 DOI: 10.1021/ci3001748] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
We describe an inverse quantitative structure-activity relationship (QSAR) framework developed for the design of molecular structures with desired properties. This framework uses chemical fragments encoded with a molecular descriptor known as a signature. It solves a system of linear constrained Diophantine equations to reorganize the fragments into novel molecular structures. The method has been previously applied to problems in drug and materials design but has inherent computational limitations due to the necessity of solving the Diophantine constraints. We propose a new approach to overcome these limitations using the Fincke-Pohst algorithm for lattice enumeration. We benchmark the new approach against previous results on LFA-1/ICAM-1 inhibitory peptides, linear homopolymers, and hydrofluoroether foam blowing agents. Software implementing the new approach is available at www.cs.otago.ac.nz/homepages/smartin.
Collapse
Affiliation(s)
- Shawn Martin
- Department of Computer Science, University of Otago, P.O. Box 56, Dunedin 9054, New Zealand.
| |
Collapse
|
12
|
Abstract
Computer-aided molecular design (CAMD) is a technique that helps select potential target molecules that will have desired properties before synthesis and testing in the laboratory, and provides an excellent complement to the chemical intuition possessed by experimentalists. Property predictions are obtained from a quantitative structure–property relationship (QSPR) that links changes at the molecular structure level to differences in the macroscopic properties. Ionic liquids (ILs) offer an excellent opportunity for the application of CAMD because of the numerous possible combinations of cations and anions available to fine-tune physical properties. In addition, there are many innovative applications of ILs where CAMD could make an impact. In this overview, we present the general methodology for CAMD with QSPR, and describe recent progress in this area related to ILs.
Collapse
|
13
|
Katritzky AR, Kuanar M, Slavov S, Hall CD, Karelson M, Kahn I, Dobchev DA. Quantitative Correlation of Physical and Chemical Properties with Chemical Structure: Utility for Prediction. Chem Rev 2010; 110:5714-89. [DOI: 10.1021/cr900238d] [Citation(s) in RCA: 386] [Impact Index Per Article: 27.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Alan R. Katritzky
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - Minati Kuanar
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - Svetoslav Slavov
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - C. Dennis Hall
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - Mati Karelson
- Institute of Chemistry, Tallinn University of Technology, Akadeemia tee 15, Tallinn 19086, Estonia, and MolCode, Ltd., Soola 8, Tartu 51013, Estonia
| | - Iiris Kahn
- Institute of Chemistry, Tallinn University of Technology, Akadeemia tee 15, Tallinn 19086, Estonia, and MolCode, Ltd., Soola 8, Tartu 51013, Estonia
| | - Dimitar A. Dobchev
- Institute of Chemistry, Tallinn University of Technology, Akadeemia tee 15, Tallinn 19086, Estonia, and MolCode, Ltd., Soola 8, Tartu 51013, Estonia
| |
Collapse
|
14
|
Weis DC, Visco DP. Computer-aided molecular design using the Signature molecular descriptor: Application to solvent selection. Comput Chem Eng 2010. [DOI: 10.1016/j.compchemeng.2009.10.017] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
15
|
Ishida Y, Kato Y, Zhao L, Nagamochi H, Akutsu T. Branch-and-Bound Algorithms for Enumerating Treelike Chemical Graphs with Given Path Frequency Using Detachment-Cut. J Chem Inf Model 2010; 50:934-46. [DOI: 10.1021/ci900447z] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Yusuke Ishida
- Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida, Kyoto 606-8501, Japan, and Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan
| | - Yuki Kato
- Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida, Kyoto 606-8501, Japan, and Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan
| | - Liang Zhao
- Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida, Kyoto 606-8501, Japan, and Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan
| | - Hiroshi Nagamochi
- Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida, Kyoto 606-8501, Japan, and Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan
| | - Tatsuya Akutsu
- Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida, Kyoto 606-8501, Japan, and Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto 611-0011, Japan
| |
Collapse
|
16
|
Tolle I, Huang X, Akpalu YA, Martin LL. A Modified Network Component Analysis (NCA) Methodology for the Decomposition of X-ray Scattering Signatures. Ind Eng Chem Res 2009. [DOI: 10.1021/ie8012715] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Affiliation(s)
- Ian Tolle
- Department of Chemical and Biological Engineering, Department of Chemistry and Chemical Biology, and Department of Decision Sciences and Engineering Systems, Rensselaer Polytechnic Institute, Troy, New York 12180
| | - Xinqun Huang
- Department of Chemical and Biological Engineering, Department of Chemistry and Chemical Biology, and Department of Decision Sciences and Engineering Systems, Rensselaer Polytechnic Institute, Troy, New York 12180
| | - Yvonne A. Akpalu
- Department of Chemical and Biological Engineering, Department of Chemistry and Chemical Biology, and Department of Decision Sciences and Engineering Systems, Rensselaer Polytechnic Institute, Troy, New York 12180
| | - Lealon L. Martin
- Department of Chemical and Biological Engineering, Department of Chemistry and Chemical Biology, and Department of Decision Sciences and Engineering Systems, Rensselaer Polytechnic Institute, Troy, New York 12180
| |
Collapse
|
17
|
|
18
|
Schut J, Bolikal D, Khan I, Pesnell A, Rege A, Rojas R, Sheihet L, Murthy N, Kohn J. Glass transition temperature prediction of polymers through the mass-per-flexible-bond principle. POLYMER 2007; 48:6115-6124. [PMID: 18813337 DOI: 10.1016/j.polymer.2007.07.048] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
A semi-empirical method based on the mass-per-flexible-bond (M/f) principle was used to quantitatively explain the large range of glass transition temperatures (T(g)) observed in a library of 132 L-tyrosine derived homo, co- and terpolymers containing different functional groups. Polymer class specific behavior was observed in T(g) vs. M/f plots, and explained in terms of different densities, steric hindrances and intermolecular interactions of chemically distinct polymers. The method was found to be useful in the prediction of polymer T(g). The predictive accuracy was found to range from 6.4 to 3.7 K, depending on polymer class. This level of accuracy compares favorably with (more complicated) methods used in the literature. The proposed method can also be used for structure prediction of polymers to match a target T(g) value, by keeping the thermal behavior of a terpolymer constant while independently choosing its chemistry. Both applications of the method are likely to have broad applications in polymer and (bio)material science.
Collapse
Affiliation(s)
- J Schut
- New Jersey Center for Biomaterials, Rutgers, The State University of New Jersey, 145 Bevier Road, Piscataway, NJ 08854
| | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Abstract
Accurate methods of computing the affinity of a small molecule with a protein are needed to speed the discovery of new medications and biological probes. This paper reviews physics-based models of binding, beginning with a summary of the changes in potential energy, solvation energy, and configurational entropy that influence affinity, and a theoretical overview to frame the discussion of specific computational approaches. Important advances are reported in modeling protein-ligand energetics, such as the incorporation of electronic polarization and the use of quantum mechanical methods. Recent calculations suggest that changes in configurational entropy strongly oppose binding and must be included if accurate affinities are to be obtained. The linear interaction energy (LIE) and molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) methods are analyzed, as are free energy pathway methods, which show promise and may be ready for more extensive testing. Ultimately, major improvements in modeling accuracy will likely require advances on multiple fronts, as well as continued validation against experiment.
Collapse
Affiliation(s)
- Michael K Gilson
- Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, Maryland 20850, USA.
| | | |
Collapse
|