1
|
Yang Z, Wang Y, Du G, Zhan Y, Zhan W. Prediction method of pharmacokinetic parameters of small molecule drugs based on GCN network model. J Mol Model 2024; 30:264. [PMID: 38995407 DOI: 10.1007/s00894-024-06051-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 06/26/2024] [Indexed: 07/13/2024]
Abstract
CONTEXT Accurately predicting plasma protein binding rate (PPBR) and oral bioavailability (OBA) helps to better reveal the absorption and distribution of drugs in the human body and subsequent drug design. Although machine learning models have achieved good results in prediction accuracy, they often suffer from insufficient accuracy when dealing with data with irregular topological structures. METHODS In view of this, this study proposes a pharmacokinetic parameter prediction framework based on graph convolutional networks (GCN), which predicts the PPBR and OBA of small molecule drugs. In the framework, GCN is first used to extract spatial feature information on the topological structure of drug molecules, in order to better learn node features and association information between nodes. Then, based on the principle of drug similarity, this study calculates the similarity between small molecule drugs, selects different thresholds to construct datasets, and establishes a prediction model centered on the GCN algorithm. The experimental results show that compared with traditional machine learning prediction models, the prediction model constructed based on the GCN method performs best on PPBR and OBA datasets with an inter-molecular similarity threshold of 0.25, with MAE of 0.155 and 0.167, respectively. In addition, in order to further improve the accuracy of the prediction model, GCN is combined with other algorithms. Compared to using a single GCN method, the distribution of the predicted values obtained by the combined model is highly consistent with the true values. In summary, this work provides a new method for improving the rate of early drug screening in the future.
Collapse
Affiliation(s)
- Zhihua Yang
- Department of Radiation Oncology, General Hospital of Ningxia Medical University, Yinchuan, 750004, China
| | - Ying Wang
- Engineering Research Center of Molecular and Neuro Imaging of the Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, 710071, Shaanxi, China
| | - Getao Du
- Engineering Research Center of Molecular and Neuro Imaging of the Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, 710071, Shaanxi, China
| | - Yonghua Zhan
- Engineering Research Center of Molecular and Neuro Imaging of the Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, 710071, Shaanxi, China.
| | - Wenhua Zhan
- Department of Radiation Oncology, General Hospital of Ningxia Medical University, Yinchuan, 750004, China.
| |
Collapse
|
2
|
Richardson AD, Antal T, Blythe RA, Schumacher LJ. Learning spatio-temporal patterns with Neural Cellular Automata. PLoS Comput Biol 2024; 20:e1011589. [PMID: 38669297 PMCID: PMC11078362 DOI: 10.1371/journal.pcbi.1011589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 05/08/2024] [Accepted: 04/02/2024] [Indexed: 04/28/2024] Open
Abstract
Neural Cellular Automata (NCA) are a powerful combination of machine learning and mechanistic modelling. We train NCA to learn complex dynamics from time series of images and Partial Differential Equation (PDE) trajectories. Our method is designed to identify underlying local rules that govern large scale dynamic emergent behaviours. Previous work on NCA focuses on learning rules that give stationary emergent structures. We extend NCA to capture both transient and stable structures within the same system, as well as learning rules that capture the dynamics of Turing pattern formation in nonlinear PDEs. We demonstrate that NCA can generalise very well beyond their PDE training data, we show how to constrain NCA to respect given symmetries, and we explore the effects of associated hyperparameters on model performance and stability. Being able to learn arbitrary dynamics gives NCA great potential as a data driven modelling framework, especially for modelling biological pattern formation.
Collapse
Affiliation(s)
- Alex D. Richardson
- School of Physics and Astronomy, University of Edinburgh, Edinburgh, United Kingdom
- School of Mathematics and Maxwell Institute for Mathematical Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Tibor Antal
- School of Mathematics and Maxwell Institute for Mathematical Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Richard A. Blythe
- School of Physics and Astronomy, University of Edinburgh, Edinburgh, United Kingdom
| | - Linus J. Schumacher
- School of Mathematics and Maxwell Institute for Mathematical Sciences, University of Edinburgh, Edinburgh, United Kingdom
- Institute of regeneration and repair, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
3
|
Van-Horenbeke FA, Peer A. NILRNN: A Neocortex-Inspired Locally Recurrent Neural Network for Unsupervised Feature Learning in Sequential Data. Cognit Comput 2023. [DOI: 10.1007/s12559-023-10122-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
AbstractUnsupervised feature learning refers to the problem of learning useful feature extraction functions from unlabeled data. Despite the great success of deep learning networks in this task in recent years, both for static and for sequential data, these systems can in general still not compete with the high performance of our brain at learning to extract useful representations from its sensory input. We propose the Neocortex-Inspired Locally Recurrent Neural Network: a new neural network for unsupervised feature learning in sequential data that brings ideas from the structure and function of the neocortex to the well-established fields of machine learning and neural networks. By mimicking connection patterns in the feedforward circuits of the neocortex, our system tries to generalize some of the ideas behind the success of convolutional neural networks to types of data other than images. To evaluate the performance of our system at extracting useful features, we have trained different classifiers using those and other learnt features as input and we have compared the obtained accuracies. Our system has shown to outperform other shallow feature learning systems in this task, both in terms of the accuracies achieved and in terms of how fast the classification task is learnt. The results obtained confirm our system as a state-of-the-art shallow feature learning system for sequential data, and suggest that extending it to or integrating it into deep architectures may lead to new successful networks that are competent at dealing with complex sequential tasks.
Collapse
|
4
|
Smerdov A, Somov A, Burnaev E, Stepanov A. AI-enabled prediction of video game player performance using the data from heterogeneous sensors. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:11021-11046. [PMID: 36035326 PMCID: PMC9395877 DOI: 10.1007/s11042-022-13464-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 06/09/2022] [Accepted: 07/06/2022] [Indexed: 06/15/2023]
Abstract
The emerging progress of video gaming and eSports lacks the tools for ensuring high-quality analytics and training in professional and amateur eSports teams. We report on an Artificial Intelligence (AI) enabled solution for predicting the eSports player in-game performance using exclusively the data from sensors. For this reason, we collected the physiological, environmental, and the smart chair data from professional and amateur players. The player performance is assessed from the game logs in a multiplayer game for each moment of time using a recurrent neural network. We have investigated an attention mechanism improves the generalization of the network and provides a straightforward feature importance as well. The best model achieves Area Under the Receiver Operating Characteristic Curve (ROC AUC) score 0.73 in predicting whether a player will perform better or worse in the next 240 seconds based on in-game metrics. The prediction of the performance of a particular player is realized although their data are not utilized in the training set. The proposed solution has a number of promising applications for professional eSports teams and amateur players, such as a learning tool or performance monitoring system.
Collapse
Affiliation(s)
- Anton Smerdov
- CDE, Skolkovo Institute of Science and Technology (Skoltech), Moscow, Russia
| | - Andrey Somov
- CDE, Skolkovo Institute of Science and Technology (Skoltech), Moscow, Russia
| | - Evgeny Burnaev
- CDE, Skolkovo Institute of Science and Technology (Skoltech), Moscow, Russia
| | - Anton Stepanov
- CDE, Skolkovo Institute of Science and Technology (Skoltech), Moscow, Russia
| |
Collapse
|
5
|
Xu Z, Chen X, Cao L. Fast Task Adaptation Based on the Combination of Model-Based and Gradient-Based Meta Learning. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:5209-5218. [PMID: 33095739 DOI: 10.1109/tcyb.2020.3028378] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Deep reinforcement learning (DRL) recently has attained remarkable results in various domains, including games, robotics, and recommender system. Nevertheless, an urgent problem in the practical application of DRL is fast adaptation. To this end, this article proposes a new and versatile metalearning approach called fast task adaptation via metalearning (FTAML), which leverages the strengths of the model-based methods and gradient-based metalearning methods for training the initial parameters of the model, such that the model is able to efficiently master unseen tasks with a little amount of data from the tasks. The proposed algorithm makes it possible to separate task optimization and task identification, specifically, the model-based learner helps to identify the pattern of a task, while the gradient-based metalearner is capable of consistently improving the performance with only a few gradient update steps through making use of the task embedding produced by the model-based learner. In addition, the choice of network for the model-based learner in the proposed method is also discussed, and the performance of networks with different depths is explored. Finally, the simulation results on reinforcement learning problems demonstrate that the proposed approach outperforms compared metalearning algorithms and delivers a new state-of-the-art performance on a variety of challenging control tasks.
Collapse
|
6
|
Graph neural network approaches for drug-target interactions. Curr Opin Struct Biol 2022; 73:102327. [DOI: 10.1016/j.sbi.2021.102327] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 11/22/2021] [Accepted: 12/13/2021] [Indexed: 01/06/2023]
|
7
|
Smiatek J, Clemens C, Herrera LM, Arnold S, Knapp B, Presser B, Jung A, Wucherpfennig T, Bluhmki E. Generic and specific recurrent neural network models: Applications for large and small scale biopharmaceutical upstream processes. BIOTECHNOLOGY REPORTS (AMSTERDAM, NETHERLANDS) 2021; 31:e00640. [PMID: 34159058 PMCID: PMC8193373 DOI: 10.1016/j.btre.2021.e00640] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 04/24/2021] [Accepted: 05/27/2021] [Indexed: 01/02/2023]
Abstract
The calculation of temporally varying upstream process outcomes is a challenging task. Over the last years, several parametric, semi-parametric as well as non-parametric approaches were developed to provide reliable estimates for key process parameters. We present generic and product-specific recurrent neural network (RNN) models for the computation and study of growth and metabolite-related upstream process parameters as well as their temporal evolution. Our approach can be used for the control and study of single product-specific large-scale manufacturing runs as well as generic small-scale evaluations for combined processes and products at development stage. The computational results for the product titer as well as various major upstream outcomes in addition to relevant process parameters show a high degree of accuracy when compared to experimental data and, accordingly, a reasonable predictive capability of the RNN models. The calculated values for the root-mean squared errors of prediction are significantly smaller than the experimental standard deviation for the considered process run ensembles, which highlights the broad applicability of our approach. As a specific benefit for platform processes, the generic RNN model is also used to simulate process outcomes for different temperatures in good agreement with experimental results. The high level of accuracy and the straightforward usage of the approach without sophisticated parameterization and recalibration procedures highlight the benefits of the RNN models, which can be regarded as promising alternatives to existing parametric and semi-parametric methods.
Collapse
Affiliation(s)
- Jens Smiatek
- Institute for Computational Physics, University of Stuttgart, D-70569 Stuttgart, Germany
- Boehringer Ingelheim Pharma GmbH & Co. KG, Digitalization Development Biologicals CMC, D-88397 Biberach (Riss), Germany
| | - Christoph Clemens
- Boehringer Ingelheim Pharma GmbH & Co. KG, Focused Factory Drug Substance, D-88397 Biberach (Riss), Germany
| | - Liliana Montano Herrera
- Boehringer Ingelheim Pharma GmbH & Co. KG, Bioprocess Development Biologicals, D-88397 Biberach (Riss), Germany
| | - Sabine Arnold
- Boehringer Ingelheim Pharma GmbH & Co. KG, Bioprocess Development Biologicals, D-88397 Biberach (Riss), Germany
| | - Bettina Knapp
- Boehringer Ingelheim Pharma GmbH & Co. KG, Analytical Development Biologicals, D-88397 Biberach (Riss), Germany
| | - Beate Presser
- Boehringer Ingelheim Pharma GmbH & Co. KG, Analytical Development Biologicals, D-88397 Biberach (Riss), Germany
| | - Alexander Jung
- Boehringer Ingelheim Pharma GmbH & Co. KG, Digitalization Development Biologicals CMC, D-88397 Biberach (Riss), Germany
| | - Thomas Wucherpfennig
- Boehringer Ingelheim Pharma GmbH & Co. KG, Bioprocess Development Biologicals, D-88397 Biberach (Riss), Germany
| | - Erich Bluhmki
- Boehringer Ingelheim Pharma GmbH & Co. KG, Analytical Development Biologicals, D-88397 Biberach (Riss), Germany
- University of Applied Sciences Biberach, D-88397 Biberach (Riss), Germany
| |
Collapse
|
8
|
An EEMD-BiLSTM Algorithm Integrated with Boruta Random Forest Optimiser for Significant Wave Height Forecasting along Coastal Areas of Queensland, Australia. REMOTE SENSING 2021. [DOI: 10.3390/rs13081456] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Using advanced deep learning (DL) algorithms for forecasting significant wave height of coastal sea waves over a relatively short period can generate important information on its impact and behaviour. This is vital for prior planning and decision making for events such as search and rescue and wave surges along the coastal environment. Short-term 24 h forecasting could provide adequate time for relevant groups to take precautionary action. This study uses features of ocean waves such as zero up crossing wave period (Tz), peak energy wave period (Tp), sea surface temperature (SST) and significant lags for significant wave height (Hs) forecasting. The dataset was collected from 2014 to 2019 at 30 min intervals along the coastal regions of major cities in Queensland, Australia. The novelty of this study is the development and application of a highly accurate hybrid Boruta random forest (BRF)–ensemble empirical mode decomposition (EEMD)–bidirectional long short-term memory (BiLSTM) algorithm to predict significant wave height (Hs). The EEMD–BiLSTM model outperforms all other models with a higher Pearson’s correlation (R) value of 0.9961 (BiLSTM—0.991, EEMD-support vector regression (SVR)—0.9852, SVR—0.9801) and comparatively lower relative mean square error (RMSE) of 0.0214 (BiLSTM—0.0248, EEMD-SVR—0.043, SVR—0.0507) for Cairns and similarly a higher Pearson’s correlation (R) value of 0.9965 (BiLSTM—0.9903, EEMD–SVR—0.9953, SVR—0.9935) and comparatively lower RMSE of 0.0413 (BiLSTM—0.075, EEMD-SVR—0.0481, SVR—0.057) for Gold Coast.
Collapse
|
9
|
Multi-Horizon Air Pollution Forecasting with Deep Neural Networks. SENSORS 2021; 21:s21041235. [PMID: 33578633 PMCID: PMC7916344 DOI: 10.3390/s21041235] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Revised: 01/26/2021] [Accepted: 01/27/2021] [Indexed: 11/18/2022]
Abstract
Air pollution is a global problem, especially in urban areas where the population density is very high due to the diverse pollutant sources such as vehicles, industrial plants, buildings, and waste. North Macedonia, as a developing country, has a serious problem with air pollution. The problem is highly present in its capital city, Skopje, where air pollution places it consistently within the top 10 cities in the world during the winter months. In this work, we propose using Recurrent Neural Network (RNN) models with long short-term memory units to predict the level of PM10 particles at 6, 12, and 24 h in the future. We employ historical air quality measurement data from sensors placed at multiple locations in Skopje and meteorological conditions such as temperature and humidity. We compare different deep learning models’ performance to an Auto-regressive Integrated Moving Average (ARIMA) model. The obtained results show that the proposed models consistently outperform the baseline model and can be successfully employed for air pollution prediction. Ultimately, we demonstrate that these models can help decision-makers and local authorities better manage the air pollution consequences by taking proactive measures.
Collapse
|
10
|
Krauss P, Maier A. Will We Ever Have Conscious Machines? Front Comput Neurosci 2020; 14:556544. [PMID: 33414712 PMCID: PMC7782472 DOI: 10.3389/fncom.2020.556544] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 11/26/2020] [Indexed: 01/09/2023] Open
Abstract
The question of whether artificial beings or machines could become self-aware or conscious has been a philosophical question for centuries. The main problem is that self-awareness cannot be observed from an outside perspective and the distinction of being really self-aware or merely a clever imitation cannot be answered without access to knowledge about the mechanism's inner workings. We investigate common machine learning approaches with respect to their potential ability to become self-aware. We realize that many important algorithmic steps toward machines with a core consciousness have already been taken.
Collapse
Affiliation(s)
- Patrick Krauss
- Neuroscience Lab, University Hospital Erlangen, Erlangen, Germany.,Cognitive Computational Neuroscience Group, Chair of Linguistics, Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Andreas Maier
- Chair of Machine Intelligence, Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany
| |
Collapse
|
11
|
Bus Dynamic Travel Time Prediction: Using a Deep Feature Extraction Framework Based on RNN and DNN. ELECTRONICS 2020. [DOI: 10.3390/electronics9111876] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Travel time data is an important factor for evaluating the performance of a public transport system. In terms of time and space within the nature of uncertainty, bus travel time is dynamic and flexible. Since the change of traffic status is periodic, contagious or even sudden, the changing mechanism of that is a hidden mode. Therefore, bus travel time prediction is a challenging problem in intelligent transportation system (ITS). Allowing for a large amount of traffic data can be collected at present but lack of precisely-conducting, it is still worth exploring how to extract feature sets that can accurately predict bus travel time from these data. Hence, a feature extraction framework based on the deep learning models were developed to reflect the state of bus travel time. First, the study introduced different historical stages of bus signaling time, taxi speed, the stop identity (ID) of spatial characteristics, and real-time possible arrival time, signified by fourteen spatiotemporal characteristic values. Then, an embedding network is proposed to leverage a wide and deep structure to mate the spatial and temporal data. In order to meet the temporal dependence requirements, an attention mechanism for a Recurrent Neural Network (RNN) was designed in this research in order to capture the temporal information. Finally, a Deep Neural Networks (DNN) was implemented in this research in order to achieve the dynamic bus travel time prediction. Two case studies of Guangzhou and Shenzhen were tested. The results showed that the performance of the algorithm was more efficient than that of the traditional machine-learning model and promoted by 4.82% compared to the deep neural network applied to the initial feature space. Moreover, the study visualized the weighted cost of attention on the bus’s travel time features during a certain running state. Therefore, the study demonstrated the proposed model enabled to understand the characteristic data of transit travel time with visualization.
Collapse
|
12
|
Machine Learning-Based Approach to Predict Energy Consumption of Renewable and Nonrenewable Power Sources. ENERGIES 2020. [DOI: 10.3390/en13184870] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In today’s world, renewable energy sources are increasingly integrated with nonrenewable energy sources into electric grids and pose new challenges because of their intermittent and variable nature. Energy prediction using soft-computing techniques plays a vital role in addressing these challenges. As electricity consumption is closely linked to other energy sources such as natural gas and oil, forecasting electricity consumption is essential for making national energy policies. In this paper, we utilize various data mining techniques, including preprocessing historical load data and the load time series’s characteristics. We analyzed the power consumption trends from renewable energy sources and nonrenewable energy sources and combined them. A novel machine learning-based hybrid approach, combining multilayer perceptron (MLP), support vector regression (SVR), and CatBoost, is proposed in this paper for power forecasting. A thorough comparison is made, taking into account the results obtained using other prediction methods.
Collapse
|
13
|
Mastrogiuseppe F, Ostojic S. A Geometrical Analysis of Global Stability in Trained Feedback Networks. Neural Comput 2019; 31:1139-1182. [DOI: 10.1162/neco_a_01187] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Recurrent neural networks have been extensively studied in the context of neuroscience and machine learning due to their ability to implement complex computations. While substantial progress in designing effective learning algorithms has been achieved, a full understanding of trained recurrent networks is still lacking. Specifically, the mechanisms that allow computations to emerge from the underlying recurrent dynamics are largely unknown. Here we focus on a simple yet underexplored computational setup: a feedback architecture trained to associate a stationary output to a stationary input. As a starting point, we derive an approximate analytical description of global dynamics in trained networks, which assumes uncorrelated connectivity weights in the feedback and in the random bulk. The resulting mean-field theory suggests that the task admits several classes of solutions, which imply different stability properties. Different classes are characterized in terms of the geometrical arrangement of the readout with respect to the input vectors, defined in the high-dimensional space spanned by the network population. We find that such an approximate theoretical approach can be used to understand how standard training techniques implement the input-output task in finite-size feedback networks. In particular, our simplified description captures the local and the global stability properties of the target solution, and thus predicts training performance.
Collapse
Affiliation(s)
- Francesca Mastrogiuseppe
- Laboratoire de Neurosciences Cognitives et Computationelles, INSERM U960, and Laboratoire de Physique Statistique, CNRS UMR 8550, Ecole Normale Supérieure–PSL Research University, Paris 75005, France
| | - Srdjan Ostojic
- Laboratoire de Neurosciences Cognitives et Computationelles, INSERM U960, Ecole Normale Supérieure–PSL Research University, Paris 75005, France
| |
Collapse
|
14
|
Lee JH, Shin J, Realff MJ. Machine learning: Overview of the recent progresses and implications for the process systems engineering field. Comput Chem Eng 2018. [DOI: 10.1016/j.compchemeng.2017.10.008] [Citation(s) in RCA: 177] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
15
|
Fasoli D, Cattani A, Panzeri S. Pattern Storage, Bifurcations, and Groupwise Correlation Structure of an Exactly Solvable Asymmetric Neural Network Model. Neural Comput 2018; 30:1258-1295. [PMID: 29566351 DOI: 10.1162/neco_a_01069] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Despite their biological plausibility, neural network models with asymmetric weights are rarely solved analytically, and closed-form solutions are available only in some limiting cases or in some mean-field approximations. We found exact analytical solutions of an asymmetric spin model of neural networks with arbitrary size without resorting to any approximation, and we comprehensively studied its dynamical and statistical properties. The network had discrete time evolution equations and binary firing rates, and it could be driven by noise with any distribution. We found analytical expressions of the conditional and stationary joint probability distributions of the membrane potentials and the firing rates. By manipulating the conditional probability distribution of the firing rates, we extend to stochastic networks the associating learning rule previously introduced by Personnaz and coworkers. The new learning rule allowed the safe storage, under the presence of noise, of point and cyclic attractors, with useful implications for content-addressable memories. Furthermore, we studied the bifurcation structure of the network dynamics in the zero-noise limit. We analytically derived examples of the codimension 1 and codimension 2 bifurcation diagrams of the network, which describe how the neuronal dynamics changes with the external stimuli. This showed that the network may undergo transitions among multistable regimes, oscillatory behavior elicited by asymmetric synaptic connections, and various forms of spontaneous symmetry breaking. We also calculated analytically groupwise correlations of neural activity in the network in the stationary regime. This revealed neuronal regimes where, statistically, the membrane potentials and the firing rates are either synchronous or asynchronous. Our results are valid for networks with any number of neurons, although our equations can be realistically solved only for small networks. For completeness, we also derived the network equations in the thermodynamic limit of infinite network size and we analytically studied their local bifurcations. All the analytical results were extensively validated by numerical simulations.
Collapse
Affiliation(s)
- Diego Fasoli
- Laboratory of Neural Computation, Center for Neuroscience and Cognitive Systems @UniTn, Istituto Italiano di Tecnologia, 38068 Rovereto, Italy, and Center for Brain and Cognition, Computational Neuroscience Group, Universitat Pompeu Fabra, 08002 Barcelona, Spain
| | - Anna Cattani
- Laboratory of Neural Computation, Center for Neuroscience and Cognitive Systems @UniTn, Istituto Italiano di Tecnologia, 38068 Rovereto, Italy, and Department of Biomedical and Clinical Sciences "L. Sacco," University of Milan, 20157 Milan, Italy
| | - Stefano Panzeri
- Laboratory of Neural Computation, Center for Neuroscience and Cognitive Systems @UniTn, Istituto Italiano di Tecnologia, 38068 Rovereto, Italy
| |
Collapse
|
16
|
Machine Learning Improves Risk Stratification After Acute Coronary Syndrome. Sci Rep 2017; 7:12692. [PMID: 28978948 PMCID: PMC5627253 DOI: 10.1038/s41598-017-12951-x] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Accepted: 09/12/2017] [Indexed: 02/07/2023] Open
Abstract
The accurate assessment of a patient’s risk of adverse events remains a mainstay of clinical care. Commonly used risk metrics have been based on logistic regression models that incorporate aspects of the medical history, presenting signs and symptoms, and lab values. More sophisticated methods, such as Artificial Neural Networks (ANN), form an attractive platform to build risk metrics because they can easily incorporate disparate pieces of data, yielding classifiers with improved performance. Using two cohorts consisting of patients admitted with a non-ST-segment elevation acute coronary syndrome, we constructed an ANN that identifies patients at high risk of cardiovascular death (CVD). The ANN was trained and tested using patient subsets derived from a cohort containing 4395 patients (Area Under the Curve (AUC) 0.743) and validated on an independent holdout set containing 861 patients (AUC 0.767). The ANN 1-year Hazard Ratio for CVD was 3.72 (95% confidence interval 1.04–14.3) after adjusting for the TIMI Risk Score, left ventricular ejection fraction, and B-type natriuretic peptide. A unique feature of our approach is that it captures small changes in the ST segment over time that cannot be detected by visual inspection. These findings highlight the important role that ANNs can play in risk stratification.
Collapse
|
17
|
Kumar R, Srivastava S, Gupta JRP. Diagonal recurrent neural network based adaptive control of nonlinear dynamical systems using lyapunov stability criterion. ISA TRANSACTIONS 2017; 67:407-427. [PMID: 28139208 DOI: 10.1016/j.isatra.2017.01.022] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Revised: 10/14/2016] [Accepted: 01/05/2017] [Indexed: 06/06/2023]
Abstract
In this paper adaptive control of nonlinear dynamical systems using diagonal recurrent neural network (DRNN) is proposed. The structure of DRNN is a modification of fully connected recurrent neural network (FCRNN). Presence of self-recurrent neurons in the hidden layer of DRNN gives it an ability to capture the dynamic behaviour of the nonlinear plant under consideration (to be controlled). To ensure stability, update rules are developed using lyapunov stability criterion. These rules are then used for adjusting the various parameters of DRNN. The responses of plants obtained with DRNN are compared with those obtained when multi-layer feed forward neural network (MLFFNN) is used as a controller. Also, in example 4, FCRNN is also investigated and compared with DRNN and MLFFNN. Robustness of the proposed control scheme is also tested against parameter variations and disturbance signals. Four simulation examples including one-link robotic manipulator and inverted pendulum are considered on which the proposed controller is applied. The results so obtained show the superiority of DRNN over MLFFNN as a controller.
Collapse
Affiliation(s)
- Rajesh Kumar
- Division of Instrumentation and Control Engineering, Netaji Subhas Institute of Technology, Sector-3, Dwarka, New Delhi 110078, India.
| | - Smriti Srivastava
- Division of Instrumentation and Control Engineering, Netaji Subhas Institute of Technology, Sector-3, Dwarka, New Delhi 110078, India.
| | - J R P Gupta
- Division of Instrumentation and Control Engineering, Netaji Subhas Institute of Technology, Sector-3, Dwarka, New Delhi 110078, India.
| |
Collapse
|
18
|
Subrahmanya N, Shin YC. Constructive training of recurrent neural networks using hybrid optimization. Neurocomputing 2010. [DOI: 10.1016/j.neucom.2010.05.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
19
|
Caudill MS, Brandt SF, Nussinov Z, Wessel R. Intricate phase diagram of a prevalent visual circuit reveals universal dynamics, phase transitions, and resonances. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:051923. [PMID: 20365022 PMCID: PMC2865257 DOI: 10.1103/physreve.80.051923] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2009] [Revised: 09/02/2009] [Indexed: 05/29/2023]
Abstract
Neural feedback-triads consisting of two feedback loops with a nonreciprocal lateral connection from one loop to the other are ubiquitous in the brain. We show analytically that the dynamics of this network topology are determined by algebraic combinations of its five synaptic weights. Exploration of network activity over the parameter space demonstrates the importance of the nonreciprocal lateral connection and reveals intricate behavior involving continuous transitions between qualitatively different activity states. In addition, we show that the response to periodic inputs is narrowly tuned around a center frequency determined by the effective synaptic parameters.
Collapse
Affiliation(s)
- Matthew S Caudill
- Department of Physics, Washington University, Campus Box 1105, St. Louis, Missouri 63130-4899, USA.
| | | | | | | |
Collapse
|
20
|
Yiu KF, Wang S, Teo KL, Tsoi AC. Nonlinear system modeling via knot-optimizing B-spline networks. IEEE TRANSACTIONS ON NEURAL NETWORKS 2008; 12:1013-22. [PMID: 18249929 DOI: 10.1109/72.950131] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
In using the B-spline network for nonlinear system modeling, owing to a lack of suitable theoretical results, it is quite difficult to choose an appropriate set of knot points to achieve a good network structure for minimizing, say, a minimum error criterion. In this paper, a novel knot-optimizing B-spline network is proposed to approximate the general nonlinear system behavior. The knot points are considered to be independent variables in the B-spline network and are optimized together with the B-spline expansion coefficients. The simulated annealing algorithm with an appropriate search strategy is used as an optimization algorithm for the training process in order to avoid any possible local minima. Examples involving dynamic systems up to six dimensions in the input space to the network are solved by the proposed method to illustrate the effectiveness of this approach.
Collapse
Affiliation(s)
- K F Yiu
- Department of Applied Mathematics, Hong Kong Polytechnic University, Kowloon, Hong Kong
| | | | | | | |
Collapse
|
21
|
Bucci BA, Vipperman JS. Performance of artificial neural network-based classifiers to identify military impulse noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 122:1602. [PMID: 17927419 DOI: 10.1121/1.2756969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Noise monitoring stations are in place around some military installations to provide records that assist in processing noise complaints and damage claims. However, they are known to produce false positives (by incorrectly attributing naturally occurring noise to military operations) and also fail to detect many impulse events. In this project, classifiers based on artificial neural networks were developed to improve the accuracy of military impulse noise identification. Two time-domain metrics--kurtosis and crest factor--and two custom frequency-domain metrics--spectral slope and weighted square error-were inputs to the artificial neural networks. The classification algorithm was able to achieve up to 100% accuracy on the training data and the validation data, while improving detection threshold by at least 40 dB.
Collapse
Affiliation(s)
- Brian A Bucci
- University of Pittsburgh, Department of Mechanical Engineering, 648 Benedum Hall, Pittsburgh, Pennsylvania 15261, USA
| | | |
Collapse
|
22
|
Zhai JY, Fei SM, Mo XH. Multiple models switching control based on recurrent neural networks. Neural Comput Appl 2007. [DOI: 10.1007/s00521-007-0123-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
23
|
Predicting Human Immunodeficiency Virus (HIV) Drug Resistance Using Recurrent Neural Networks. BIO-INSPIRED MODELING OF COGNITIVE TASKS 2007. [DOI: 10.1007/978-3-540-73053-8_23] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
24
|
Recurrent neural network architectures: An overview. ACTA ACUST UNITED AC 2006. [DOI: 10.1007/bfb0053993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
|
25
|
Leung CS, Tsoi AC. Combined learning and pruning for recurrent radial basis function networks based on recursive least square algorithms. Neural Comput Appl 2005. [DOI: 10.1007/s00521-005-0009-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
26
|
Abstract
This paper presents a novel approach in designing neural network based adaptive controllers for a class of nonlinear discrete-time systems. This type of controllers has its simplicity in parallelism to linear generalized minimum variance (GMV) controller design and efficiency to deal with complex nonlinear dynamics. A recurrent neural network is introduced as a bridge to compensation simplify controller design procedure and efficiently to deal with nonlinearity. The network weight adaptation law is derived from Lyapunov stability analysis and the connection between convergence of the network weight and the reconstruction error of the network is established. A theorem is presented for the conditions of the stability of the closed-loop systems. Two simulation examples are provided to demonstrate the efficiency of the approach.
Collapse
Affiliation(s)
- Quanmin Zhu
- Faculty of Computing, Engineering and Mathematical Science, University of the West of England, Bristol, BS16 1QY, UK.
| | | |
Collapse
|
27
|
Abstract
The aim of this letter is to introduce a new blind-deconvolution algorithm based on fixed-point optimization of a "Bussgang"-type cost function. The cost function relies on approximate Bayesian estimation achieved by an adaptive neuron. The main feature of the presented algorithm is fast convergence that guarantees good deconvolution performances with limited computational demand as compared with algorithms of the same class.
Collapse
Affiliation(s)
- Simone Fiori
- Faculty of Engineering, Perugia University, Terni 1-05100, Italy.
| |
Collapse
|
28
|
Aguirre LA, Lopes RAM, Amaral GFV, Letellier C. Constraining the topology of neural networks to ensure dynamics with symmetry properties. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2004; 69:026701. [PMID: 14995583 DOI: 10.1103/physreve.69.026701] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2003] [Indexed: 05/24/2023]
Abstract
This paper addresses the training of network models from data produced by systems with symmetry properties. It is argued that although general networks are global approximators, in practice some properties such as symmetry are very hard to learn from data. In order to guarantee that the final network will be symmetrical, constraints are developed for two types of models, namely, the multilayer perceptron (MLP) network and the radial basis function (RBF) network. In global modeling problems it becomes crucial to impose conditions for symmetry in order to stand a chance of reproducing symmetry-related phenomena. Sufficient conditions are given for MLP and RBF networks to have a set of fixed points that are symmetrical with respect to the origin of the phase space. In the case of MLP networks such conditions reduce to the absence of bias parameters and the requirement of odd activation functions. This turns out to be important from a dynamical point of view since some phenomena are only observed in the context of symmetry, which is not a structurally stable property. The results are illustrated using bench systems that display symmetry, such as the Duffing-Ueda oscillator and the Lorenz system.
Collapse
Affiliation(s)
- Luis Antonio Aguirre
- Programa de Pós Graduçao em Engenharia Elétrica, Universidade Federal de Minas Gerais, Avenida Antônio Carlos 6627, 31270-901 Belo Horizonte, Minas Gerais, Brazil
| | | | | | | |
Collapse
|
29
|
Chalup SK, Blair AD. Incremental training of first order recurrent neural networks to predict a context-sensitive language. Neural Netw 2003; 16:955-72. [PMID: 14692631 DOI: 10.1016/s0893-6080(03)00054-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
In recent years it has been shown that first order recurrent neural networks trained by gradient-descent can learn not only regular but also simple context-free and context-sensitive languages. However, the success rate was generally low and severe instability issues were encountered. The present study examines the hypothesis that a combination of evolutionary hill climbing with incremental learning and a well-balanced training set enables first order recurrent networks to reliably learn context-free and mildly context-sensitive languages. In particular, we trained the networks to predict symbols in string sequences of the context-sensitive language [a(n)b(n)c(n); n > or = 1. Comparative experiments with and without incremental learning indicated that incremental learning can accelerate and facilitate training. Furthermore, incrementally trained networks generally resulted in monotonic trajectories in hidden unit activation space, while the trajectories of non-incrementally trained networks were oscillating. The non-incrementally trained networks were more likely to generalise.
Collapse
Affiliation(s)
- Stephan K Chalup
- School of Electrical Engineering and Computer Science, The University of Newcastle, Callaghan, NSW 2308, Australia.
| | | |
Collapse
|
30
|
|
31
|
|
32
|
Carrasco RC, Forcada ML, Valdés-Muñoz MA, Neco RP. Stable encoding of finite-state machines in discrete-time recurrent neural nets with sigmoid units. Neural Comput 2000; 12:2129-74. [PMID: 10976142 DOI: 10.1162/089976600300015097] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
There has been a lot of interest in the use of discrete-time recurrent neural nets (DTRNN) to learn finite-state tasks, with interesting results regarding the induction of simple finite-state machines from input-output strings. Parallel work has studied the computational power of DTRNN in connection with finite-state computation. This article describes a simple strategy to devise stable encodings of finite-state machines in computationally capable discrete-time recurrent neural architectures with sigmoid units and gives a detailed presentation on how this strategy may be applied to encode a general class of finite-state machines in a variety of commonly used first- and second-order recurrent neural networks. Unlike previous work that either imposed some restrictions to state values or used a detailed analysis based on fixed-point attractors, our approach applies to any positive, bounded, strictly growing, continuous activation function and uses simple bounding criteria based on a study of the conditions under which a proposed encoding scheme guarantees that the DTRNN is actually behaving as a finite-state machine.
Collapse
Affiliation(s)
- R C Carrasco
- Departament de Llenguatges i Sistemes Informàtics, Universitat d'Alacant, E-03071 Alacant, Spain
| | | | | | | |
Collapse
|
33
|
Mandic D, Chambers J. On the choice of parameters of the cost function in nested modular RNN's. ACTA ACUST UNITED AC 2000; 11:315-22. [DOI: 10.1109/72.839003] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
34
|
|