1
|
Intrinsic Dimension Estimation-Based Feature Selection and Multinomial Logistic Regression for Classification of Bearing Faults Using Compressively Sampled Vibration Signals. ENTROPY 2022; 24:e24040511. [PMID: 35455173 PMCID: PMC9027945 DOI: 10.3390/e24040511] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Revised: 04/01/2022] [Accepted: 04/03/2022] [Indexed: 02/01/2023]
Abstract
As failures of rolling bearings lead to major failures in rotating machines, recent vibration-based rolling bearing fault diagnosis techniques are focused on obtaining useful fault features from the huge collection of raw data. However, too many features reduce the classification accuracy and increase the computation time. This paper proposes an effective feature selection technique based on intrinsic dimension estimation of compressively sampled vibration signals. First, compressive sampling (CS) is used to get compressed measurements from the collected raw vibration signals. Then, a global dimension estimator, the geodesic minimal spanning tree (GMST), is employed to compute the minimal number of features needed to represent efficiently the compressively sampled signals. Finally, a feature selection process, combining the stochastic proximity embedding (SPE) and the neighbourhood component analysis (NCA), is used to select fewer features for bearing fault diagnosis. With regression analysis-based predictive modelling technique and the multinomial logistic regression (MLR) classifier, the selected features are assessed in two case studies of rolling bearings vibration signals under different working loads. The experimental results demonstrate that the proposed method can successfully select fewer features, with which the MLR-based trained model achieves high classification accuracy and significantly reduced computation times compared to published research.
Collapse
|
2
|
Harvy J, Sigalas E, Thakor N, Bezerianos A, Li J. Performance Improvement of Driving Fatigue Identification Based on Power Spectra and Connectivity Using Feature Level and Decision Level Fusions. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2019; 2018:102-105. [PMID: 30440351 DOI: 10.1109/embc.2018.8512259] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Power and connectivity features extracted from EEG signals have been previously utilized to detect mental fatigue during driving. Although each of the feature categories has the discriminative power to differentiate alert and fatigue states, they might represent different aspects of information relevant to fatigue identification. Two fusion methods, feature level and decision level fusions, were proposed in this study to combine individual channel information (i.e., power features) and between-channel information (i.e., connectivity features). According to the results of the study, the average accuracies of the fusion methods were higher than the accuracies of the individual feature categories (feature level fusion: 84.70%, decision level fusion: 87.13%, power features: 80.82%, connectivity features: 79.36%). The statistical analysis demonstrated that the two fusion methods significantly improved the classification performance of driving fatigue. The fusion methods proposed in this study can be embedded into a driving fatigue detection system for a practical use in a vehicle.
Collapse
|
3
|
Yoshikawa N, Hutchison GR. Fast, efficient fragment-based coordinate generation for Open Babel. J Cheminform 2019; 11:49. [PMID: 31372768 PMCID: PMC6676618 DOI: 10.1186/s13321-019-0372-5] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Accepted: 07/23/2019] [Indexed: 12/19/2022] Open
Abstract
Rapidly predicting an accurate three dimensional geometry of a molecule is a crucial task for cheminformatics and across a wide range of molecular modeling. Consequently, developing a fast, accurate, and open implementation of structure prediction is necessary for reproducible cheminformatics research. We introduce a fragment-based coordinate generation implementation for Open Babel, a widely-used open source toolkit for cheminformatics. The new implementation improves speed and stereochemical accuracy, while retaining or improving accuracy of bond lengths, bond angles, and dihedral torsions. Input molecules are broken into fragments by cutting at rotatable bonds. The coordinates of fragments are set according to a fragment library, prepared from open crystallographic databases. Since the coordinates of multiple atoms are decided at once, coordinate prediction is accelerated over the previous rules-based implementation in Open Babel, as well as the widely-used distance geometry methods in RDKit. This new implementation will be beneficial for a wide range of applications, including computational property prediction in polymers, molecular materials and drug design.
Collapse
Affiliation(s)
- Naruki Yoshikawa
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
| | - Geoffrey R Hutchison
- Department of Chemistry and Chemical Engineering, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, PA, 15260, USA.
| |
Collapse
|
4
|
Abstract
The generation of conformations for small molecules is a problem of continuing interest in cheminformatics and computational drug discovery. This review will present an overview of methods used to sample conformational space, focusing on those methods designed for organic molecules commonly of interest in drug discovery. Different approaches to both the sampling of conformational space and the scoring of conformational stability will be compared and contrasted, with an emphasis on those methods suitable for conformer sampling of large numbers of drug-like molecules. Particular attention will be devoted to the appropriate utilization of information from experimental solid-state structures in validating and evaluating the performance of these tools. The review will conclude with some areas worthy of further investigation.
Collapse
Affiliation(s)
- Paul C D Hawkins
- OpenEye Scientific , 9 Bisbee Court, Suite D, Santa Fe, New Mexico 87508, United States
| |
Collapse
|
5
|
Najim SA, Najim AA, Lim IS, Saeed M. Parallel faithful dimensionality reduction to enhance the visualization of remote sensing imagery. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.05.071] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
6
|
Osolodkin DI, Radchenko EV, Orlov AA, Voronkov AE, Palyulin VA, Zefirov NS. Progress in visual representations of chemical space. Expert Opin Drug Discov 2015; 10:959-73. [DOI: 10.1517/17460441.2015.1060216] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
7
|
|
8
|
Reutlinger M, Schneider G. Nonlinear dimensionality reduction and mapping of compound libraries for drug discovery. J Mol Graph Model 2012; 34:108-17. [PMID: 22326864 DOI: 10.1016/j.jmgm.2011.12.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2011] [Revised: 12/13/2011] [Accepted: 12/14/2011] [Indexed: 01/29/2023]
Abstract
Visualization of 'chemical space' and compound distributions has received much attraction by medicinal chemists as it may help to intuitively comprehend pharmaceutically relevant molecular features. It has been realized that for meaningful feature extraction from complex multivariate chemical data, such as compound libraries represented by many molecular descriptors, nonlinear projection techniques are required. Recent advances in machine-learning and artificial intelligence have resulted in a transfer of such methods to chemistry. We provide an overview of prominent visualization methods based on nonlinear dimensionality reduction, and highlight applications in drug discovery. Emphasis is on neural network techniques, kernel methods and stochastic embedding approaches, which have been successfully used for ligand-based virtual screening, SAR landscape analysis, combinatorial library design, and screening compound selection.
Collapse
Affiliation(s)
- Michael Reutlinger
- Swiss Federal Institute of Technology (ETH), Department of Chemistry and Applied Biosciences, Zurich, Switzerland
| | | |
Collapse
|
9
|
Fjell CD, Hiss JA, Hancock REW, Schneider G. Designing antimicrobial peptides: form follows function. Nat Rev Drug Discov 2011; 11:37-51. [PMID: 22173434 DOI: 10.1038/nrd3591] [Citation(s) in RCA: 1405] [Impact Index Per Article: 100.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Multidrug-resistant bacteria are a severe threat to public health. Conventional antibiotics are becoming increasingly ineffective as a result of resistance, and it is imperative to find new antibacterial strategies. Natural antimicrobials, known as host defence peptides or antimicrobial peptides, defend host organisms against microbes but most have modest direct antibiotic activity. Enhanced variants have been developed using straightforward design and optimization strategies and are being tested clinically. Here, we describe advanced computer-assisted design strategies that address the difficult problem of relating primary sequence to peptide structure, and are delivering more potent, cost-effective, broad-spectrum peptides as potential next-generation antibiotics.
Collapse
Affiliation(s)
- Christopher D Fjell
- Centre for Microbial Diseases and Immunity Research, University of British Columbia, 2259 Lower Mall, Vancouver, British Columbia V6T 1Z4, Canada
| | | | | | | |
Collapse
|
10
|
Klenner A, Hähnke V, Geppert T, Schneider P, Zettl H, Haller S, Rodrigues T, Reisen F, Hoy B, Schaible AM, Werz O, Wessler S, Schneider G. From Virtual Screening to Bioactive Compounds by Visualizing and Clustering of Chemical Space. Mol Inform 2011; 31:21-6. [PMID: 27478174 DOI: 10.1002/minf.201100147] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2011] [Accepted: 11/04/2011] [Indexed: 01/31/2023]
Affiliation(s)
- Alexander Klenner
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Volker Hähnke
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Tim Geppert
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Petra Schneider
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Heiko Zettl
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Sarah Haller
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Tiago Rodrigues
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Felix Reisen
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland
| | - Benjamin Hoy
- Paris-Lodron University, Department of Molecular Biology, Division of Microbiology, Billroth Str. 11, A-5020 Salzburg, Austria
| | - Anja Maria Schaible
- University of Jena, Institute of Pharmacy, Philosophenweg 14, D-07743 Jena, Germany
| | - Oliver Werz
- University of Jena, Institute of Pharmacy, Philosophenweg 14, D-07743 Jena, Germany
| | - Silja Wessler
- Paris-Lodron University, Department of Molecular Biology, Division of Microbiology, Billroth Str. 11, A-5020 Salzburg, Austria
| | - Gisbert Schneider
- ETH, Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Str. 10, CH-8093 Zurich, Switzerland.
| |
Collapse
|
11
|
Yang E, Liu P, Rassokhin DN, Agrafiotis DK. Stochastic proximity embedding on graphics processing units: taking multidimensional scaling to a new scale. J Chem Inf Model 2011; 51:2852-9. [PMID: 21961974 DOI: 10.1021/ci200420c] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Stochastic proximity embedding (SPE) was developed as a method for efficiently calculating lower dimensional embeddings of high-dimensional data sets. Rather than using a global minimization scheme, SPE relies upon updating the distances of randomly selected points in an iterative fashion. This was found to generate embeddings of comparable quality to those obtained using classical multidimensional scaling algorithms. However, SPE is able to obtain these results in O(n) rather than O(n²) time and thus is much better suited to large data sets. In an effort both to speed up SPE and utilize it for even larger problems, we have created a multithreaded implementation which takes advantage of the growing general computing power of graphics processing units (GPUs). The use of GPUs allows the embedding of data sets containing millions of data points in interactive time scales.
Collapse
Affiliation(s)
- Eric Yang
- Johnson & Johnson Pharmaceutical Research and Development, L.L.C. , Welsh and McKean Roads, Spring House, Pennsylvania 19477, USA.
| | | | | | | |
Collapse
|
12
|
Reutlinger M, Guba W, Martin RE, Alanine AI, Hoffmann T, Klenner A, Hiss JA, Schneider P, Schneider G. Neighborhood-Preserving Visualization of Adaptive Structure-Activity Landscapes: Application to Drug Discovery. Angew Chem Int Ed Engl 2011. [DOI: 10.1002/ange.201105156] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
13
|
Reutlinger M, Guba W, Martin RE, Alanine AI, Hoffmann T, Klenner A, Hiss JA, Schneider P, Schneider G. Neighborhood-preserving visualization of adaptive structure-activity landscapes: application to drug discovery. Angew Chem Int Ed Engl 2011; 50:11633-6. [PMID: 21984024 DOI: 10.1002/anie.201105156] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2011] [Indexed: 01/13/2023]
Affiliation(s)
- Michael Reutlinger
- Department of Chemistry and Applied Biosciences, Wolfgang-Pauli-Strasse 10, 8093 Zurich, Switzerland
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Schneider G. From Hits to Leads: Challenges for the Next Phase of Machine Learning in Medicinal Chemistry. Mol Inform 2011; 30:759-63. [DOI: 10.1002/minf.201100070] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Accepted: 05/23/2011] [Indexed: 12/12/2022]
|
15
|
Schneider G, Geppert T, Hartenfeller M, Reisen F, Klenner A, Reutlinger M, Hähnke V, Hiss JA, Zettl H, Keppner S, Spänkuch B, Schneider P. Reaction-driven de novo design, synthesis and testing of potential type II kinase inhibitors. Future Med Chem 2011; 3:415-24. [PMID: 21452978 DOI: 10.4155/fmc.11.8] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2023] Open
Abstract
BACKGROUND De novo design of drug-like compounds with a desired pharmacological activity profile has become feasible through innovative computer algorithms. Fragment-based design and simulated chemical reactions allow for the rapid generation of candidate compounds as blueprints for organic synthesis. METHODS We used a combination of complementary virtual-screening tools for the analysis of de novo designed compounds that were generated with the aim to inhibit inactive polo-like kinase 1 (Plk1), a target for the development of cancer therapeutics. A homology model of the inactive state of Plk1 was constructed and the nucleotide binding pocket conformations in the DFG-in and DFG-out state were compared. The de novo-designed compounds were analyzed using pharmacophore matching, structure-activity landscape analysis, and automated ligand docking. One compound was synthesized and tested in vitro. RESULTS The majority of the designed compounds possess a generic architecture present in known kinase inhibitors. Predictions favor kinases as targets of these compounds but also suggest potential off-target effects. Several bioisosteric replacements were suggested, and de novo designed compounds were assessed by automated docking for potential binding preference toward the inactive (type II inhibitors) over the active conformation (type I inhibitors) of the kinase ATP binding site. One selected compound was successfully synthesized as suggested by the software. The de novo-designed compound exhibited inhibitory activity against inactive Plk1 in vitro, but did not show significant inhibition of active Plk1 and 38 other kinases tested. CONCLUSIONS Computer-based de novo design of screening candidates in combination with ligand- and receptor-based virtual screening generates motivated suggestions for focused library design in hit and lead discovery. Attractive, synthetically accessible compounds can be obtained together with predicted on- and off-target profiles and desired activities.
Collapse
Affiliation(s)
- Gisbert Schneider
- Swiss Federal Institute of Technology, Department of Chemistry & Applied Biosciences, 8093 Zürich, Switzerland.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|