201
|
Systematic Boolean Satisfiability Programming in Radial Basis Function Neural Network. Processes (Basel) 2020. [DOI: 10.3390/pr8020214] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Radial Basis Function Neural Network (RBFNN) is a class of Artificial Neural Network (ANN) that contains hidden layer processing units (neurons) with nonlinear, radially symmetric activation functions. Consequently, RBFNN has extensively suffered from significant computational error and difficulties in approximating the optimal hidden neuron, especially when dealing with Boolean Satisfiability logical rule. In this paper, we present a comprehensive investigation of the potential effect of systematic Satisfiability programming as a logical rule, namely 2 Satisfiability (2SAT) to optimize the output weights and parameters in RBFNN. The 2SAT logical rule has extensively applied in various disciplines, ranging from industrial automation to the complex management system. The core impetus of this study is to investigate the effectiveness of 2SAT logical rule in reducing the computational burden for RBFNN by obtaining the parameters in RBFNN. The comparison is made between RBFNN and the existing method, based on the Hopfield Neural Network (HNN) in searching for the optimal neuron state by utilizing different numbers of neurons. The comparison was made with the HNN as a benchmark to validate the final output of our proposed RBFNN with 2SAT logical rule. Note that the final output in HNN is represented in terms of the quality of the final states produced at the end of the simulation. The simulation dynamic was carried out by using the simulated data, randomly generated by the program. In terms of 2SAT logical rule, simulation revealed that RBFNN has two advantages over HNN model: RBFNN can obtain the correct final neuron state with the lowest error and does not require any approximation for the number of hidden layers. Furthermore, this study provides a new paradigm in the field feed-forward neural network by implementing a more systematic propositional logic rule.
Collapse
|
202
|
Serb A, Kobyzev I, Wang J, Prodromakis T. A semi-holographic hyperdimensional representation system for hardware-friendly cognitive computing. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2020; 378:20190162. [PMID: 31865886 PMCID: PMC6939245 DOI: 10.1098/rsta.2019.0162] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 08/05/2019] [Indexed: 06/10/2023]
Abstract
One of the main, long-term objectives of artificial intelligence is the creation of thinking machines. To that end, substantial effort has been placed into designing cognitive systems; i.e. systems that can manipulate semantic-level information. A substantial part of that effort is oriented towards designing the mathematical machinery underlying cognition in a way that is very efficiently implementable in hardware. In this work, we propose a 'semi-holographic' representation system that can be implemented in hardware using only multiplexing and addition operations, thus avoiding the need for expensive multiplication. The resulting architecture can be readily constructed by recycling standard microprocessor elements and is capable of performing two key mathematical operations frequently used in cognition, superposition and binding, within a budget of below 6 pJ for 64-bit operands. Our proposed 'cognitive processing unit' is intended as just one (albeit crucial) part of much larger cognitive systems where artificial neural networks of all kinds and associative memories work in concord to give rise to intelligence. This article is part of the theme issue 'Harmonizing energy-autonomous computing and intelligence'.
Collapse
Affiliation(s)
- A. Serb
- Zepler Institute, University of Southampton, Southampton SO17 1BJ, UK
| | - I. Kobyzev
- David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, CanadaN2L 3G1
| | - J. Wang
- Zepler Institute, University of Southampton, Southampton SO17 1BJ, UK
| | - T. Prodromakis
- Zepler Institute, University of Southampton, Southampton SO17 1BJ, UK
| |
Collapse
|
203
|
Data-driven simulation of pedestrian collision avoidance with a nonparametric neural network. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2019.10.062] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
204
|
Song S, Jiang W, Hou L, Zhao H. Leveraging effect size distributions to improve polygenic risk scores derived from summary statistics of genome-wide association studies. PLoS Comput Biol 2020; 16:e1007565. [PMID: 32045423 PMCID: PMC7039528 DOI: 10.1371/journal.pcbi.1007565] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 02/24/2020] [Accepted: 11/25/2019] [Indexed: 12/29/2022] Open
Abstract
Genetic risk prediction is an important problem in human genetics, and accurate prediction can facilitate disease prevention and treatment. Calculating polygenic risk score (PRS) has become widely used due to its simplicity and effectiveness, where only summary statistics from genome-wide association studies are needed in the standard method. Recently, several methods have been proposed to improve standard PRS by utilizing external information, such as linkage disequilibrium and functional annotations. In this paper, we introduce EB-PRS, a novel method that leverages information for effect sizes across all the markers to improve prediction accuracy. Compared to most existing genetic risk prediction methods, our method does not need to tune parameters nor external information. Real data applications on six diseases, including asthma, breast cancer, celiac disease, Crohn's disease, Parkinson's disease and type 2 diabetes show that EB-PRS achieved 307.1%, 42.8%, 25.5%, 3.1%, 74.3% and 49.6% relative improvements in terms of predictive r2 over standard PRS method with optimally tuned parameters. Besides, compared to LDpred that makes use of LD information, EB-PRS also achieved 37.9%, 33.6%, 8.6%, 36.2%, 40.6% and 10.8% relative improvements. We note that our method is not the first method leveraging effect size distributions. Here we first justify our method by presenting theoretical optimal property over existing methods in this class of methods, and substantiate our theoretical result with extensive simulation results. The R-package EBPRS that implements our method is available on CRAN.
Collapse
Affiliation(s)
- Shuang Song
- Center for Statistical Science, Tsinghua University, Beijing, China
- Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Wei Jiang
- Department of Biostatistics, School of Public Health, Yale University, New Haven, Connecticut, United States of America
| | - Lin Hou
- Center for Statistical Science, Tsinghua University, Beijing, China
- Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Hongyu Zhao
- Department of Biostatistics, School of Public Health, Yale University, New Haven, Connecticut, United States of America
| |
Collapse
|
205
|
Ayala HVH, Habineza D, Rakotondrabe M, dos Santos Coelho L. Nonlinear black-box system identification through coevolutionary algorithms and radial basis function artificial neural networks. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2019.105990] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
206
|
Gao F, Chen W, Li Z, Li J, Xu B. Neural Network-Based Distributed Cooperative Learning Control for Multiagent Systems via Event-Triggered Communication. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:407-419. [PMID: 30969933 DOI: 10.1109/tnnls.2019.2904253] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this paper, an event-based distributed cooperative learning (DCL) law is proposed for a group of adaptive neural control systems. The plants to be controlled have identical structures, but reference signals for each plant are different. During control process, each agent intermittently broadcasts its neural network (NN) weight estimation to its neighboring agents under an event-triggered condition that is only based on its own estimated NN weights. If communication topology is connected and undirected, the NN weights of all neural control systems can converge to a small neighborhood of their optimal values. The generalization ability of NNs is guaranteed in the event-triggered context, that is, the approximation domain of each NN is the union of all system trajectories. Furthermore, a strictly positive lower bound on the interevent intervals is also guaranteed to avoid the Zeno behavior. Finally, a numerical example is given to illustrate the effectiveness of the proposed learning law.
Collapse
|
207
|
Predicting ICD-9 code groups with fuzzy similarity based supervised multi-label classification of unstructured clinical nursing notes. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2019.105321] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
208
|
Ye Z, Yang J, Zhong N, Tu X, Jia J, Wang J. Tackling environmental challenges in pollution controls using artificial intelligence: A review. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 699:134279. [PMID: 33736193 DOI: 10.1016/j.scitotenv.2019.134279] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2019] [Revised: 09/02/2019] [Accepted: 09/03/2019] [Indexed: 06/12/2023]
Abstract
This review presents the developments in artificial intelligence technologies for environmental pollution controls. A number of AI approaches, which start with the reliable mapping of nonlinear behavior between inputs and outputs in chemical and biological processes in terms of prediction models to the emerging optimization and control algorithms that study the pollutants removal processes and intelligent control systems, have been developed for environmental clean-ups. The characteristics, advantages and limitations of AI methods, including single and hybrid AI methods, were overviewed. Hybrid AI methods exhibited synergistic effects, but with computational heaviness. The up-to-date review summarizes i) Various artificial neural networks employed in wastewater degradation process for the prediction of removal efficiency of pollutants and the search of optimizing experimental conditions; ii) Evaluation of fuzzy logic used for intelligent control of aerobic stage of wastewater treatment process; iii) AI-aided soft-sensors for precisely on-line/off-line estimation of hard-to-measure parameters in wastewater treatment plants; iv) Single and hybrid AI methods applied to estimate pollutants concentrations and design monitoring and early-warning systems for both aquatic and atmospheric environments; v) AI modelings of short-term, mid-term and long-term solid waste generations, and various ANNs for solid waste recycling and reduction. Finally, the future challenges of AI-based models employed in the environmental fields are discussed and proposed.
Collapse
Affiliation(s)
- Zhiping Ye
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Jiaqian Yang
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Na Zhong
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Xin Tu
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, United Kingdom
| | - Jining Jia
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310014, PR China
| | - Jiade Wang
- College of Environment, Zhejiang University of Technology, Hangzhou 310014, PR China.
| |
Collapse
|
209
|
A Novel Model Predictive Runge–Kutta Neural Network Controller for Nonlinear MIMO Systems. Neural Process Lett 2020. [DOI: 10.1007/s11063-019-10167-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
210
|
Wang Z, Chen L, Zhang J, Yin Y, Li D. Multi-view ensemble learning with empirical kernel for heart failure mortality prediction. INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING 2020; 36:e3273. [PMID: 31680466 DOI: 10.1002/cnm.3273] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 09/30/2019] [Accepted: 09/30/2019] [Indexed: 06/10/2023]
Abstract
Heart failure (HF) refers to the heart's inability to pump sufficient blood to maintain the body's needs, which has a very serious impact on human health. In recent years, the prevalence of HF has remained high. This paper proposes a multi-view ensemble learning algorithm based on empirical kernel mapping called MVE-EK, which predicts the mortality of patient through hospital records. Multi-view ensemble learning can take advantage of the consistency and complementarity of different views. The MVE-EK first divides the patient's features into multiple views and then divides the samples of each view to multiple subsets through under sampling, which can reduce the imbalance rate of the original dataset and obtain some relatively balanced subsets. Each subset is mapped into kernel space by empirical kernel mapping, which can map samples from linearly inseparable spaces to linearly separable spaces. Finally, the multi-view ensemble learning is performed by the designed loss of acquaintance between views. The effectiveness of the algorithm is verified on the three datasets of HF patient in the real world. The performance of the algorithm is better than other comparison algorithms. The datasets are collected from Shanghai Shuguang Hospital and involve 10 203 hospitalization records for 4682 HF patients between March 2009 and April 2016. The prediction information provided by the algorithm can assist the clinician in providing a more personalized treatment plan for patients with HF.
Collapse
Affiliation(s)
- Zhe Wang
- Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of Education, East China University of Science and Technology, Shanghai, People's Republic of China
- Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| | - Lilong Chen
- Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of Education, East China University of Science and Technology, Shanghai, People's Republic of China
- Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| | - Jing Zhang
- Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| | - Yichao Yin
- Information Center, Shanghai Shuguang Hospital, Shanghai, People's Republic of China
| | - Dongdong Li
- Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| |
Collapse
|
211
|
Alauddin M, Khan F, Imtiaz S, Ahmed S. A variable mosquito flying optimization‐based hybrid artificial neural network model for the alarm tuning of process fault detection systems. PROCESS SAFETY PROGRESS 2019. [DOI: 10.1002/prs.12122] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Md Alauddin
- Centre for Risk Integrity and Safety Engineering (C‐RISE), Faculty of Engineering and Applied Science Memorial University of Newfoundland St. John's Newfoundland and Labrador Canada
| | - Faisal Khan
- Centre for Risk Integrity and Safety Engineering (C‐RISE), Faculty of Engineering and Applied Science Memorial University of Newfoundland St. John's Newfoundland and Labrador Canada
| | - Syed Imtiaz
- Centre for Risk Integrity and Safety Engineering (C‐RISE), Faculty of Engineering and Applied Science Memorial University of Newfoundland St. John's Newfoundland and Labrador Canada
| | - Salim Ahmed
- Centre for Risk Integrity and Safety Engineering (C‐RISE), Faculty of Engineering and Applied Science Memorial University of Newfoundland St. John's Newfoundland and Labrador Canada
| |
Collapse
|
212
|
Xu Z, Sun C, Ji T, Manton JH, Shieh W. Computational complexity comparison of feedforward/radial basis function/recurrent neural network-based equalizer for a 50-Gb/s PAM4 direct-detection optical link. OPTICS EXPRESS 2019; 27:36953-36964. [PMID: 31873466 DOI: 10.1364/oe.27.036953] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Accepted: 11/27/2019] [Indexed: 06/10/2023]
Abstract
The computational complexity and system bit-error-rate (BER) performance of four types of neural-network-based nonlinear equalizers are analyzed for a 50-Gb/s pulse amplitude modulation (PAM)-4 direct-detection (DD) optical link. The four types are feedforward neural networks (F-NN), radial basis function neural networks (RBF-NN), auto-regressive recurrent neural networks (AR-RNN) and layer-recurrent neural networks (L-RNN). Numerical results show that, for a fixed BER threshold, the AR-RNN-based equalizers have the lowest computational complexity. Amongst all the nonlinear NN-based equalizers with the same number of inputs and hidden neurons, F-NN-based equalizers have the lowest computational complexity while the AR-RNN-based equalizers exhibit the best BER performance. Compared with F-NN or RNN, RBF-NN tends to require more hidden neurons with the increase of the number of inputs, making it not suitable for long fiber transmission distance. We also demonstrate that only a few tens of multiplications per symbol are needed for NN-based equalizers to guarantee a good BER performance. This relatively low computational complexity signifies that various NN-based equalizers can be potentially implemented in real time. More broadly, this paper provides guidelines for selecting a suitable NN-based equalizer based on BER and computational complexity requirements.
Collapse
|
213
|
Qin J, Zhang G, Zheng WX, Kang Y. Neural Network-Based Adaptive Consensus Control for a Class of Nonaffine Nonlinear Multiagent Systems With Actuator Faults. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:3633-3644. [PMID: 30946680 DOI: 10.1109/tnnls.2019.2901563] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this paper, the consensus problem is investigated for a class of nonaffine nonlinear multiagent systems (MASs) with actuator faults of partial loss of effectiveness fault and biased fault. To deal with the control difficulty caused by the nonaffine dynamics, a neural network (NN)-based adaptive consensus protocol is developed based on the Lyapunov analysis. The neuron input of the NN uses both the state information and the consensus error information. In addition, the negative feedback term of the NN weight update law is multiplied by an absolute value of the consensus error, which is helpful in improving the consensus accuracy. With the developed adaptive NN consensus protocol, semiglobal consensus with a bounded residual consensus error of the MAS is achieved, and the bounded NN weight matrix is guaranteed. Finally, simulation results show that the developed adaptive NN consensus protocol has advantages of fast convergence rate and good consensus accuracy and has the capability of rapid response with respect to the actuator faults.
Collapse
|
214
|
Robust adaptive neural network prescribed performance control for uncertain CSTR system with input nonlinearities and external disturbance. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04591-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
215
|
Cao Y, Zhang Z. Robust Tracking of a Cost-Effective Micro-Stereolithography System Based on a Compliant Nanomanipulator. MICROMACHINES 2019; 10:mi10110785. [PMID: 31744040 PMCID: PMC6915434 DOI: 10.3390/mi10110785] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 11/11/2019] [Accepted: 11/13/2019] [Indexed: 12/29/2022]
Abstract
Micro-stereolithography (MSL) has emerged as a promising and challenging technique in micro-/nano-scale additive manufacturing. Besides the requirement of the light source, the motion system requires ultra-high-precision tracking capability to reach the right location for every solidification event. To achieve single-digit micron feature size of the fabrication, we propose a robust control strategy to support a self-developed cost-effective MSL prototype based on a compliant nanomanipulator and a blue light-emitting diode (LED) module. In particular, the nonlinearity and parameter-variation of the compliant manipulator are dealt with by a robust radial basis function (RBF)-based neural network, and the repetitive control (RC) is innovatively integrated with RBF to improve the tracking performance of a closed pattern. Various simulations and real-time experiments are conducted to validate the proposed control strategy. The fabrication of a closed pattern will not begin by turning on the laser source until the tracking error reaches submicrons, and the fabrication results demonstrate that the cost-effective MSL system is capable of fabricating 2.5 µm feature size in a 0.5 mm working range.
Collapse
Affiliation(s)
- Yue Cao
- State Key Laboratory of Tribology & Institute of Manufacturing Engineering, Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China;
| | - Zhen Zhang
- State Key Laboratory of Tribology & Institute of Manufacturing Engineering, Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China;
- Beijing Key Laboratory of Precision/Ultra-Precision Manufacturing Equipment and Control, Tsinghua University, Beijing 100084, China
- Correspondence: ; Tel.: +86-010-62771390
| |
Collapse
|
216
|
Mendez KM, Reinke SN, Broadhurst DI. A comparative evaluation of the generalised predictive ability of eight machine learning algorithms across ten clinical metabolomics data sets for binary classification. Metabolomics 2019; 15:150. [PMID: 31728648 PMCID: PMC6856029 DOI: 10.1007/s11306-019-1612-4] [Citation(s) in RCA: 104] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/22/2019] [Accepted: 11/05/2019] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Metabolomics is increasingly being used in the clinical setting for disease diagnosis, prognosis and risk prediction. Machine learning algorithms are particularly important in the construction of multivariate metabolite prediction. Historically, partial least squares (PLS) regression has been the gold standard for binary classification. Nonlinear machine learning methods such as random forests (RF), kernel support vector machines (SVM) and artificial neural networks (ANN) may be more suited to modelling possible nonlinear metabolite covariance, and thus provide better predictive models. OBJECTIVES We hypothesise that for binary classification using metabolomics data, non-linear machine learning methods will provide superior generalised predictive ability when compared to linear alternatives, in particular when compared with the current gold standard PLS discriminant analysis. METHODS We compared the general predictive performance of eight archetypal machine learning algorithms across ten publicly available clinical metabolomics data sets. The algorithms were implemented in the Python programming language. All code and results have been made publicly available as Jupyter notebooks. RESULTS There was only marginal improvement in predictive ability for SVM and ANN over PLS across all data sets. RF performance was comparatively poor. The use of out-of-bag bootstrap confidence intervals provided a measure of uncertainty of model prediction such that the quality of metabolomics data was observed to be a bigger influence on generalised performance than model choice. CONCLUSION The size of the data set, and choice of performance metric, had a greater influence on generalised predictive performance than the choice of machine learning algorithm.
Collapse
Affiliation(s)
- Kevin M Mendez
- Centre for Metabolomics & Computational Biology, School of Science, Edith Cowan University, Joondalup, 6027, Australia
| | - Stacey N Reinke
- Centre for Metabolomics & Computational Biology, School of Science, Edith Cowan University, Joondalup, 6027, Australia
| | - David I Broadhurst
- Centre for Metabolomics & Computational Biology, School of Science, Edith Cowan University, Joondalup, 6027, Australia.
| |
Collapse
|
217
|
Duan S, Yu S, Chen Y, Principe JC. On Kernel Method-Based Connectionist Models and Supervised Deep Learning Without Backpropagation. Neural Comput 2019; 32:97-135. [PMID: 31703172 DOI: 10.1162/neco_a_01250] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
We propose a novel family of connectionist models based on kernel machines and consider the problem of learning layer by layer a compositional hypothesis class (i.e., a feedforward, multilayer architecture) in a supervised setting. In terms of the models, we present a principled method to "kernelize" (partly or completely) any neural network (NN). With this method, we obtain a counterpart of any given NN that is powered by kernel machines instead of neurons. In terms of learning, when learning a feedforward deep architecture in a supervised setting, one needs to train all the components simultaneously using backpropagation (BP) since there are no explicit targets for the hidden layers (Rumelhart, Hinton, & Williams, 1986). We consider without loss of generality the two-layer case and present a general framework that explicitly characterizes a target for the hidden layer that is optimal for minimizing the objective function of the network. This characterization then makes possible a purely greedy training scheme that learns one layer at a time, starting from the input layer. We provide instantiations of the abstract framework under certain architectures and objective functions. Based on these instantiations, we present a layer-wise training algorithm for an l-layer feedforward network for classification, where l≥2 can be arbitrary. This algorithm can be given an intuitive geometric interpretation that makes the learning dynamics transparent. Empirical results are provided to complement our theory. We show that the kernelized networks, trained layer-wise, compare favorably with classical kernel machines as well as other connectionist models trained by BP. We also visualize the inner workings of the greedy kernelized models to validate our claim on the transparency of the layer-wise algorithm.
Collapse
Affiliation(s)
- Shiyu Duan
- Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL 32611, U.S.A.
| | - Shujian Yu
- Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL 32611, U.S.A.
| | - Yunmei Chen
- Department of Mathematics, University of Florida, Gainesville, FL 32611, U.S.A.
| | - Jose C Principe
- Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL 32611, U.S.A.
| |
Collapse
|
218
|
Zhang J, Yuan C, Stegagno P, Zeng W, Wang C. Small fault detection from discrete-time closed-loop control using fault dynamics residuals. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.07.037] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
219
|
Non-singular Terminal Sliding Mode Control of Robot Manipulators with $$H_\infty $$ Trajectory Tracking Performance. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2019. [DOI: 10.1007/s13369-019-04049-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
220
|
Pundir AS, Singh K. Chattering Free Sliding Mode Control with Observer Based Adaptive Radial Basis Function Neural Network for Temperature Tracking in a Fixed Bed Reactor. INTERNATIONAL JOURNAL OF CHEMICAL REACTOR ENGINEERING 2019. [DOI: 10.1515/ijcre-2018-0256] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
In this paper, a Chattering Free Sliding Mode Control (CFSMC) with observer based adaptive Radial Basis Function Neural Network (RBFNN) has been designed for first-order transfer function model of temperature trajectory in a fixed bed reactor. The steady-state behavior and effect of different operating parameters such as feed velocity and temperature on the operation of the fixed bed reactor have been discussed. Due to RBFNN’s capability to map the nonlinear dynamics online through self-learning ability, it is combined with CFSMC to reduce the chattering behavior. The adaptive RBFNN has been used to approximate the nonlinear dynamic behavior of the fixed bed reactor. To predict the states of the system, high gain observer based on adaptive RBFNN has been used. Design parameter of the observer has been estimated using Hurwitz polynomial. The effect of neuron number on the mapping error and the effect of space discretization step on modeling error have also been discussed. To decrease the chattering generated by the Sliding Mode Controller (SMC) in the temperature trajectory tracking, an equivalent control term is neglected from the final controller. It has two main advantages: one is the reduction in chattering behavior which is the main drawback of SMC and the second is the reduction of the high gain requirement. The SMC is used to overcome against external disturbance, load variation, variation in key parameters and model mismatch. To make the simulation realistic, constraints have been applied to control input and input rate. For guaranteeing the system stability, Lyapunov theorem has been applied. To show the suitability of the hybrid controller, a comparison has been carried out between the hybrid and PID controller. To quantify the performance, Integral Time Weighted Absolute Error (ITAE) has been estimated. Under the condition of existing model errors and external disturbances, simulation study of the control of the fixed bed reactor shows that the hybrid control algorithm consisting of sliding mode control and observer-based adaptive RBFNN performs well both for tracking the temperature trajectory and reducing the chattering.
Collapse
|
221
|
Pérez-Escamirosa F, Alarcón-Paredes A, Alonso-Silverio GA, Oropesa I, Camacho-Nieto O, Lorias-Espinoza D, Minor-Martínez A. Objective classification of psychomotor laparoscopic skills of surgeons based on three different approaches. Int J Comput Assist Radiol Surg 2019; 15:27-40. [PMID: 31605351 DOI: 10.1007/s11548-019-02073-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Accepted: 09/30/2019] [Indexed: 12/17/2022]
Abstract
BACKGROUND The determination of surgeons' psychomotor skills in minimally invasive surgery techniques is one of the major concerns of the programs of surgical training in several hospitals. Therefore, it is important to assess and classify objectively the level of experience of surgeons and residents during their training process. The aim of this study was to investigate three classification methods for establishing automatically the level of surgical competence of the surgeons based on their psychomotor laparoscopic skills. METHODS A total of 43 participants, divided into an experienced surgeons group with ten experts (> 100 laparoscopic procedures performed) and non-experienced surgeons group with 24 residents and nine medical students (< 10 laparoscopic procedures performed), performed three tasks in the EndoViS training system. Motion data of the instruments were captured with a video-tracking system built into the EndoViS simulator and analyzed using 13 motion analysis parameters (MAPs). Radial basis function networks (RBFNets), K-star (K*), and random forest (RF) were used for classifying surgeons based on the MAPs' scores of all participants. The performance of the three classifiers was examined using hold-out and leave-one-out validation techniques. RESULTS For all three tasks, the K-star method was superior in terms of accuracy and AUC in both validation techniques. The mean accuracy of the classifiers was 93.33% for K-star, 87.58% for RBFNets, and 84.85% for RF in hold-out validation, and 91.47% for K-star, 89.92% for RBFNets, and 83.72% for RF in leave-one-out cross-validation. CONCLUSIONS The three proposed methods demonstrated high performance in the classification of laparoscopic surgeons, according to their level of psychomotor skills. Together with motion analysis and three laparoscopic tasks of the Fundamental Laparoscopic Surgery Program, these classifiers provide a means for objectively classifying surgical competence of the surgeons for existing laparoscopic box trainers.
Collapse
Affiliation(s)
- Fernando Pérez-Escamirosa
- Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Circuito Exterior S/N, Ciudad Universitaria, Coyoacán, 04510, Ciudad de México, Mexico
- Department of Biomedical Informatics, Faculty of Medicine, Universidad Nacional Autónoma de México (UNAM), Circuito Interior, Av. Universidad 3000, Ciudad Universitaria, Coyoacán, 04510, Ciudad de México, Mexico
| | - Antonio Alarcón-Paredes
- Laboratory of Computing Technologies and Electronics, Faculty of Engineering, Universidad Autónoma de Guerrero, Av. Lázaro Cárdenas S/N, Ciudad Universitaria, 39070, Chilpancingo, Guerrero, Mexico.
| | - Gustavo Adolfo Alonso-Silverio
- Laboratory of Computing Technologies and Electronics, Faculty of Engineering, Universidad Autónoma de Guerrero, Av. Lázaro Cárdenas S/N, Ciudad Universitaria, 39070, Chilpancingo, Guerrero, Mexico
| | - Ignacio Oropesa
- Biomedical Engineering and Telemedicine Centre (GBT), ETSI Telecomunicación, Center for Biomedical Technology, Universidad Politécnica de Madrid (UPM), Avda Complutense, 30, 28040, Madrid, Spain
| | - Oscar Camacho-Nieto
- Intelligent Computing Laboratory, Centro de Innovación y Desarrollo Tecnológico en Computación (CIDETEC-IPN), Av. Juan de Dios Bátiz, Col. Nueva Industrial Vallejo, 07700, Ciudad de México, Mexico
| | - Daniel Lorias-Espinoza
- Department of Electrical Engineering, Bioelectronics Section, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Av. IPN 2508, Col. San Pedro Zacatenco, 07360, Ciudad de México, Mexico
| | - Arturo Minor-Martínez
- Department of Electrical Engineering, Bioelectronics Section, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Av. IPN 2508, Col. San Pedro Zacatenco, 07360, Ciudad de México, Mexico
| |
Collapse
|
222
|
|
223
|
Wang H, Li C, Zhen X, Yang W, Zhang B. Gaussian Transfer Convolutional Neural Networks. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2019. [DOI: 10.1109/tetci.2018.2881225] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
224
|
Fang Q, Li Z, Wang Y, Song M, Wang J. A neural-network enhanced modeling method for real-time evaluation of the temperature distribution in a data center. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04508-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
225
|
Tomczyk K, Piekarczyk M, Sokal G. Radial Basis Functions Intended to Determine the Upper Bound of Absolute Dynamic Error at the Output of Voltage-Mode Accelerometers. SENSORS 2019; 19:s19194154. [PMID: 31557918 PMCID: PMC6806288 DOI: 10.3390/s19194154] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Revised: 09/19/2019] [Accepted: 09/23/2019] [Indexed: 11/19/2022]
Abstract
In this paper, we propose using the radial basis functions (RBF) to determine the upper bound of absolute dynamic error (UAE) at the output of a voltage-mode accelerometer. Such functions can be obtained as a result of approximating the error values determined for the assumed-in-advance parameter variability associated with the mathematical model of an accelerometer. This approximation was carried out using the radial basis function neural network (RBF-NN) procedure for a given number of the radial neurons. The Monte Carlo (MC) method was also applied to determine the related error when considering the uncertainties associated with the parameters of an accelerometer mathematical model. The upper bound of absolute dynamic error can be a quality ratio for comparing the errors produced by different types of voltage-mode accelerometers that have the same operational frequency bandwidth. Determination of the RBFs was performed by applying the Python-related scientific packages, while the calculations related both to the UAE and the MC method were carried out using the MathCad program. Application of the RBFs represent a new approach for determining the UAE. These functions allow for the easy and quick determination of the value of such errors.
Collapse
Affiliation(s)
- Krzysztof Tomczyk
- Faculty of Electrical and Computer Engineering, Cracow University of Technology, Warszawska 24, 31-155 Krakow, Poland.
| | - Marcin Piekarczyk
- Faculty of Mathematics, Physics and Technical Science, Pedagogical University of Cracow, 2 Podchorazych Ave, 30-084 Krakow, Poland.
| | - Grzegorz Sokal
- Faculty of Mathematics, Physics and Technical Science, Pedagogical University of Cracow, 2 Podchorazych Ave, 30-084 Krakow, Poland.
| |
Collapse
|
226
|
Galán-Prado F, Morán A, Font J, Roca M, Rosselló JL. Compact Hardware Synthesis of Stochastic Spiking Neural Networks. Int J Neural Syst 2019; 29:1950004. [DOI: 10.1142/s0129065719500047] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Spiking neural networks (SNN) are able to emulate real neural behavior with high confidence due to their bio-inspired nature. Many designs have been proposed for the implementation of SNN in hardware, although the realization of high-density and biologically-inspired SNN is currently a complex challenge of high scientific and technical interest. In this work, we propose a compact digital design for the implementation of high-volume SNN that considers the intrinsic stochastic processes present in biological neurons and enables high-density hardware implementation. The proposed stochastic SNN model (SSNN) is compared with previous SSNN models, achieving a higher processing speed. We also show how the proposed model can be scaled to high-volume neural networks trained by using back propagation and applied to a pattern classification task. The proposed model achieves better results compared with other recently-published SNN models configured with unsupervised STDP learning.
Collapse
Affiliation(s)
- Fabio Galán-Prado
- Electronics Engineering Group, Physics Department, Universitat de les Illes Balears, Mateu Orfila Building, Ctra. Valldemossa km. 7.5, Palma de Mallorca, Balears 07122, Spain
| | - Alejandro Morán
- Electronics Engineering Group, Physics Department, Universitat de les Illes Balears, Mateu Orfila Building, Ctra. Valldemossa km. 7.5, Palma de Mallorca, Balears 07122, Spain
| | - Joan Font
- Electronics Engineering Group, Physics Department, Universitat de les Illes Balears, Mateu Orfila Building, Ctra. Valldemossa km. 7.5, Palma de Mallorca, Balears 07122, Spain
| | - Miquel Roca
- Electronics Engineering Group, Physics Department, Universitat de les Illes Balears, Mateu Orfila Building, Ctra. Valldemossa km. 7.5, Palma de Mallorca, Balears 07122, Spain
| | - Josep L. Rosselló
- Electronics Engineering Group, Physics Department, Universitat de les Illes Balears, Mateu Orfila Building, Ctra. Valldemossa km. 7.5, Palma de Mallorca, Balears 07122, Spain
| |
Collapse
|
227
|
Naik SM, Jagannath RPK, Kuppili V. Bat algorithm-based weighted Laplacian probabilistic neural network. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04475-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
228
|
Detecting the presence of anterior cruciate ligament injury based on gait dynamics disparity and neural networks. Artif Intell Rev 2019. [DOI: 10.1007/s10462-019-09758-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
229
|
Miao T, Lu W, Guo J, Lin J, Fan Y. Modeling and uncertainty analysis of seawater intrusion based on surrogate models. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2019; 26:26015-26025. [PMID: 31273667 DOI: 10.1007/s11356-019-05799-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 06/19/2019] [Indexed: 06/09/2023]
Abstract
When using a simulation model to study seawater intrusion (SI), uncertainty in the parameters directly affects the results. The impact of the rise in sea levels due to global warming on SI cannot be ignored. In this paper, the Monte Carlo method is used to analyze the uncertainty in modeling SI. To reduce the computational cost of the repeated invocation of the simulation model as well as time, a surrogate model is established using a radial basis function (RBF)-based neural network method. To enhance the accuracy of the substitution model, input samples are sampled using the Latin hypercube sampling (LHS) method. The results of uncertainty analysis had a high reference value and show the following: (1) The surrogate model created using the RBF method can significantly reduce computational cost and save at least 95% of the time needed for the repeated invocation of the simulation model while maintaining high accuracy. (2) Uncertainty in the parameters and the magnitude of the rise in sea levels have a significant impact on SI. The results of prediction were thus highly uncertain. In practice, it is necessary to quantify uncertainty to provide more intuitive predictions.
Collapse
Affiliation(s)
- Tiansheng Miao
- College of New Energy and Environment, Jilin University, Changchun, 130021, China
- Key Laboratory of Groundwater Resources and Environment (Jilin University), Ministry of Education, Changchun, 130021, China
- Jilin Provincial Key Laboratory of Water Resources and Environment, Jilin University, Changchun, 130021, China
| | - Wenxi Lu
- College of New Energy and Environment, Jilin University, Changchun, 130021, China.
- Key Laboratory of Groundwater Resources and Environment (Jilin University), Ministry of Education, Changchun, 130021, China.
- Jilin Provincial Key Laboratory of Water Resources and Environment, Jilin University, Changchun, 130021, China.
| | - Jiayuan Guo
- College of New Energy and Environment, Jilin University, Changchun, 130021, China
- Key Laboratory of Groundwater Resources and Environment (Jilin University), Ministry of Education, Changchun, 130021, China
- Jilin Provincial Key Laboratory of Water Resources and Environment, Jilin University, Changchun, 130021, China
| | - Jin Lin
- Nanjing Hydraulic Research Institute, Nanjing, 210029, China
| | - Yue Fan
- College of New Energy and Environment, Jilin University, Changchun, 130021, China
- Key Laboratory of Groundwater Resources and Environment (Jilin University), Ministry of Education, Changchun, 130021, China
- Jilin Provincial Key Laboratory of Water Resources and Environment, Jilin University, Changchun, 130021, China
| |
Collapse
|
230
|
Li M, Wibowo S, Guo W. Nonlinear Curve Fitting Using Extreme Learning Machines and Radial Basis Function Networks. Comput Sci Eng 2019. [DOI: 10.1109/mcse.2018.2875323] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
231
|
Park JG, Jo S. Bayesian Weight Decay on Bounded Approximation for Deep Convolutional Neural Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:2866-2875. [PMID: 30668505 DOI: 10.1109/tnnls.2018.2886995] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
This paper determines the weight decay parameter value of a deep convolutional neural network (CNN) that yields a good generalization. To obtain such a CNN in practice, numerical trials with different weight decay values are needed. However, the larger the CNN architecture is, the higher is the computational cost of the trials. To address this problem, this paper formulates an analytical solution for the decay parameter through a proposed objective function in conjunction with Bayesian probability distributions. For computational efficiency, a novel method to approximate this solution is suggested. This method uses a small amount of information in the Hessian matrix. Theoretically, the approximate solution is guaranteed by a provable bound and is obtained by a proposed algorithm, where its time complexity is linear in terms of both the depth and width of the CNN. The bound provides a consistent result for the proposed learning scheme. By reducing the computational cost of determining the decay value, the approximation allows for the fast investigation of a deep CNN (DCNN) which yields a small generalization error. Experimental results show that our assumption verified with different DCNNs is suitable for real-world image data sets. In addition, the proposed method significantly reduces the time cost of learning with setting the weight decay parameter while achieving good classification performances.
Collapse
|
232
|
Yang X, Wang Y, Byrne R, Schneider G, Yang S. Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. Chem Rev 2019; 119:10520-10594. [PMID: 31294972 DOI: 10.1021/acs.chemrev.8b00728] [Citation(s) in RCA: 422] [Impact Index Per Article: 70.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Artificial intelligence (AI), and, in particular, deep learning as a subcategory of AI, provides opportunities for the discovery and development of innovative drugs. Various machine learning approaches have recently (re)emerged, some of which may be considered instances of domain-specific AI which have been successfully employed for drug discovery and design. This review provides a comprehensive portrayal of these machine learning techniques and of their applications in medicinal chemistry. After introducing the basic principles, alongside some application notes, of the various machine learning algorithms, the current state-of-the art of AI-assisted pharmaceutical discovery is discussed, including applications in structure- and ligand-based virtual screening, de novo drug design, physicochemical and pharmacokinetic property prediction, drug repurposing, and related aspects. Finally, several challenges and limitations of the current methods are summarized, with a view to potential future directions for AI-assisted drug discovery and design.
Collapse
Affiliation(s)
- Xin Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Yifei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Ryan Byrne
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Gisbert Schneider
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Shengyong Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| |
Collapse
|
233
|
Sramka M, Slovak M, Tuckova J, Stodulka P. Improving clinical refractive results of cataract surgery by machine learning. PeerJ 2019; 7:e7202. [PMID: 31304064 PMCID: PMC6611496 DOI: 10.7717/peerj.7202] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 05/27/2019] [Indexed: 11/20/2022] Open
Abstract
AIM To evaluate the potential of the Support Vector Machine Regression model (SVM-RM) and Multilayer Neural Network Ensemble model (MLNN-EM) to improve the intraocular lens (IOL) power calculation for clinical workflow. BACKGROUND Current IOL power calculation methods are limited in their accuracy with the possibility of decreased accuracy especially in eyes with an unusual ocular dimension. In case of an improperly calculated power of the IOL in cataract or refractive lens replacement surgery there is a risk of re-operation or further refractive correction. This may create potential complications and discomfort for the patient. METHODS A dataset containing information about 2,194 eyes was obtained using data mining process from the Electronic Health Record (EHR) system database of the Gemini Eye Clinic. The dataset was optimized and split into the selection set (used in the design for models and training), and the verification set (used in the evaluation). The set of mean prediction errors (PEs) and the distribution of predicted refractive errors were evaluated for both models and clinical results (CR). RESULTS Both models performed significantly better for the majority of the evaluated parameters compared with the CR. There was no significant difference between both evaluated models. In the ±0.50 D PE category both SVM-RM and MLNN-EM were slightly better than the Barrett Universal II formula, which is often presented as the most accurate calculation formula. CONCLUSION In comparison to the current clinical method, both SVM-RM and MLNN-EM have achieved significantly better results in IOL calculations and therefore have a strong potential to improve clinical cataract refractive outcomes.
Collapse
Affiliation(s)
- Martin Sramka
- Department of Circuit Theory/Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
- Research and Development Department, Gemini Eye Clinic, Zlin, Czech Republic
| | - Martin Slovak
- Research and Development Department, Gemini Eye Clinic, Zlin, Czech Republic
| | - Jana Tuckova
- Department of Circuit Theory/Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
| | - Pavel Stodulka
- Research and Development Department, Gemini Eye Clinic, Zlin, Czech Republic
- Department of Ophthalmology/Third Faculty of Medicine, Charles University, Prague, Czech Republic
| |
Collapse
|
234
|
Fast Multi-Objective Antenna Optimization Based on RBF Neural Network Surrogate Model Optimized by Improved PSO Algorithm. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9132589] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In this paper, a radial basis function neural network (RBFNN) surrogate model optimized by an improved particle swarm optimization (PSO) algorithm is developed to reduce the computation cost of traditional antenna design methods which rely on high-fidelity electromagnetic (EM) simulations. Considering parameters adjustment and update mechanism simultaneously, two modifications are proposed in this improved PSO. First, time-varying learning factors are designed to balance exploration and exploitation ability of particles in the search space. Second, the local best information is added to the updating process of particles except for personal and global best information for better population diversity. The improved PSO is applied to train RBFNN for determining optimal network parameters. As a result, the constructed improved PSO-RBFNN model can be used as a surrogate model for antenna performance prediction with better network generalization capability. By integrating the improved PSO-RBFNN surrogate model with multi-objective evolutionary algorithms (MOEAs), a fast multi-objective antenna optimization framework for multi-parameter antenna structures is then established. Finally, a Pareto-optimal planar miniaturized multiband antenna design is presented, demonstrating that the proposed model provides better prediction performance and considerable computational savings compared to those previously published approaches.
Collapse
|
235
|
Cholesky Factorization Based Online Sequential Extreme Learning Machines with Persistent Regularization and Forgetting Factor. Symmetry (Basel) 2019. [DOI: 10.3390/sym11060801] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The online sequential extreme learning machine with persistent regularization and forgetting factor (OSELM-PRFF) can avoid potential singularities or ill-posed problems of online sequential regularized extreme learning machines with forgetting factors (FR-OSELM), and is particularly suitable for modelling in non-stationary environments. However, existing algorithms for OSELM-PRFF are time-consuming or unstable in certain paradigms or parameters setups. This paper presents a novel algorithm for OSELM-PRFF, named “Cholesky factorization based” OSELM-PRFF (CF-OSELM-PRFF), which recurrently constructs an equation for extreme learning machine and efficiently solves the equation via Cholesky factorization during every cycle. CF-OSELM-PRFF deals with timeliness of samples by forgetting factor, and the regularization term in its cost function works persistently. CF-OSELM-PRFF can learn data one-by-one or chunk-by-chunk with a fixed or varying chunk size. Detailed performance comparisons between CF-OSELM-PRFF and relevant approaches are carried out on several regression problems. The numerical simulation results show that CF-OSELM-PRFF demonstrates higher computational efficiency than its counterparts, and can yield stable predictions.
Collapse
|
236
|
A data assimilation framework for data-driven flow models enabled by motion tomography. INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS 2019. [DOI: 10.1007/s41315-019-00092-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
237
|
Wang H, Li G. Extreme learning machine Cox model for high-dimensional survival analysis. Stat Med 2019; 38:2139-2156. [PMID: 30632193 PMCID: PMC6498851 DOI: 10.1002/sim.8090] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 10/11/2018] [Accepted: 12/12/2018] [Indexed: 11/07/2022]
Abstract
Some interesting recent studies have shown that neural network models are useful alternatives in modeling survival data when the assumptions of a classical parametric or semiparametric survival model such as the Cox (1972) model are seriously violated. However, to the best of our knowledge, the plausibility of adapting the emerging extreme learning machine (ELM) algorithm for single-hidden-layer feedforward neural networks to survival analysis has not been explored. In this paper, we present a kernel ELM Cox model regularized by an L0 -based broken adaptive ridge (BAR) penalization method. Then, we demonstrate that the resulting method, referred to as ELMCoxBAR, can outperform some other state-of-art survival prediction methods such as L1 - or L2 -regularized Cox regression, random survival forest with various splitting rules, and boosted Cox model, in terms of its predictive performance using both simulated and real world datasets. In addition to its good predictive performance, we illustrate that the proposed method has a key computational advantage over the above competing methods in terms of computation time efficiency using an a real-world ultra-high-dimensional survival data.
Collapse
Affiliation(s)
- Hong Wang
- School of Mathematics and Statistics, Central South University, Changsha, China
| | - Gang Li
- Department of Biostatistics, UCLA Fielding School of Public Health, University of California, Los Angeles, California
| |
Collapse
|
238
|
Serrano W. Genetic and deep learning clusters based on neural networks for management decision structures. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04231-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
239
|
|
240
|
Classification of gait patterns in patients with unilateral anterior cruciate ligament deficiency based on phase space reconstruction, Euclidean distance and neural networks. Soft comput 2019. [DOI: 10.1007/s00500-019-04017-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
241
|
Xie Y, Yu J, Xie S, Huang T, Gui W. On-line prediction of ferrous ion concentration in goethite process based on self-adjusting structure RBF neural network. Neural Netw 2019; 116:1-10. [PMID: 30986722 DOI: 10.1016/j.neunet.2019.03.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2018] [Revised: 11/16/2018] [Accepted: 03/13/2019] [Indexed: 11/30/2022]
Abstract
Outlet ferrous ion concentration is an essential indicator to manipulate the goethite process in the zinc hydrometallurgy plant. However, it cannot be measured on-line, which leads to the delay of this feedback information. In this study, a self-adjusting structure radial basis function neural network (SAS-RBFNN) is developed to predict the outlet ferrous ion concentration on-line. First, a supervised cluster algorithm is proposed to initialize the RBFNN. Then, the network structure is adjusted by the developed self-adjusting structure mechanism. This mechanism can merge or divide the hidden neurons according to the distance of the clusters to achieve the adaptability of the RBFNN. Finally, the connection weights are determined by the gradient-based algorithm. The convergence of the SAS-RBFNN is analyzed by the Lyapunov criterion. A simulation for a benchmark problem shows the effectiveness of the proposed network. The SAS-RBFNN is then applied to predict the outlet ferrous ion concentration in the goethite process. The results demonstrate that this network can provide a more accurate prediction than the mathematical model, even under the fluctuating production condition.
Collapse
Affiliation(s)
- Yongfang Xie
- School of Automation, Central South University, Changsha City, 410083, China
| | - Jinjing Yu
- School of Automation, Central South University, Changsha City, 410083, China
| | - Shiwen Xie
- School of Automation, Central South University, Changsha City, 410083, China; Department of Electrical and Computer Engineering, College of Engineering, Wayne State University, Detroit, 48202, United States.
| | - Tingwen Huang
- Department of Mathematics, Texas A&M University at Qatar, Doha, Qatar
| | - Weihua Gui
- School of Automation, Central South University, Changsha City, 410083, China
| |
Collapse
|
242
|
Classification of Special Days in Short-Term Load Forecasting: The Spanish Case Study. ENERGIES 2019. [DOI: 10.3390/en12071253] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Short-Term Load Forecasting is a very relevant aspect in managing, operating or participating an electric system. From system operators to energy producers and retailers knowing the electric demand in advance with high accuracy is a key feature for their business. The load series of a given system presents highly repetitive daily, weekly and yearly patterns. However, other factors like temperature or social events cause abnormalities in this otherwise periodic behavior. In order to develop an effective load forecasting system, it is necessary to understand and model these abnormalities because, in many cases, the higher forecasting error typical of these special days is linked to the larger part of the losses related to load forecasting. This paper focuses on the effect that several types of special days have on the load curve and how important it is to model these behaviors in detail. The paper analyzes the Spanish national system and it uses linear regression to model the effect that social events like holidays or festive periods have on the load curve. The results presented in this paper show that a large classification of events is needed in order to accurately model all the events that may occur in a 7-year period.
Collapse
|
243
|
Luan F, Na J, Huang Y, Gao G. Adaptive neural network control for robotic manipulators with guaranteed finite-time convergence. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2019.01.063] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
244
|
The present and future of deep learning in radiology. Eur J Radiol 2019; 114:14-24. [PMID: 31005165 DOI: 10.1016/j.ejrad.2019.02.038] [Citation(s) in RCA: 184] [Impact Index Per Article: 30.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2018] [Revised: 02/17/2019] [Accepted: 02/26/2019] [Indexed: 12/18/2022]
Abstract
The advent of Deep Learning (DL) is poised to dramatically change the delivery of healthcare in the near future. Not only has DL profoundly affected the healthcare industry it has also influenced global businesses. Within a span of very few years, advances such as self-driving cars, robots performing jobs that are hazardous to human, and chat bots talking with human operators have proved that DL has already made large impact on our lives. The open source nature of DL and decreasing prices of computer hardware will further propel such changes. In healthcare, the potential is immense due to the need to automate the processes and evolve error free paradigms. The sheer quantum of DL publications in healthcare has surpassed other domains growing at a very fast pace, particular in radiology. It is therefore imperative for the radiologists to learn about DL and how it differs from other approaches of Artificial Intelligence (AI). The next generation of radiology will see a significant role of DL and will likely serve as the base for augmented radiology (AR). Better clinical judgement by AR will help in improving the quality of life and help in life saving decisions, while lowering healthcare costs. A comprehensive review of DL as well as its implications upon the healthcare is presented in this review. We had analysed 150 articles of DL in healthcare domain from PubMed, Google Scholar, and IEEE EXPLORE focused in medical imagery only. We have further examined the ethic, moral and legal issues surrounding the use of DL in medical imaging.
Collapse
|
245
|
Alessandri A, Bagnerini P, Gaggero M. Optimal Control of Propagating Fronts by Using Level Set Methods and Neural Approximations. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:902-912. [PMID: 30072343 DOI: 10.1109/tnnls.2018.2853195] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
We address the optimal control of level sets associated with the solution of the normal flow equation. The problem consists in finding the normal velocity to the front described by a certain level set in such a way to minimize a given cost functional. First, the considered problem is shown to admit a solution on a suitable space of functions. Then, since in general it is difficult to solve it analytically, an approximation scheme that relies on the extended Ritz method is proposed to find suboptimal solutions. Specifically, the control law is forced to take on a neural structure depending nonlinearly on a finite number of parameters to be tuned, i.e., the neural weights. The selection of the optimal weights is performed with two different approaches. The first one employs classical line-search descent methods, while the second one is based on a quasi-Newton optimization that can be regarded as neural learning based on the extended Kalman filter. Compared with line-search methods, such an approach reveals to be successful with a reduced computational effort and an increased robustness with respect to the trapping into local minima, as confirmed by simulations in both two and three dimensions.
Collapse
|
246
|
Real-time detection of driver distraction: random projections for pseudo-inversion-based neural training. Knowl Inf Syst 2019. [DOI: 10.1007/s10115-019-01339-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
247
|
Geng Z, Shang D, Han Y, Zhong Y. Early warning modeling and analysis based on a deep radial basis function neural network integrating an analytic hierarchy process: A case study for food safety. Food Control 2019. [DOI: 10.1016/j.foodcont.2018.09.027] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
248
|
Scardapane S, Van Vaerenbergh S, Totaro S, Uncini A. Kafnets: Kernel-based non-parametric activation functions for neural networks. Neural Netw 2019; 110:19-32. [DOI: 10.1016/j.neunet.2018.11.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Revised: 10/30/2018] [Accepted: 11/04/2018] [Indexed: 11/16/2022]
|
249
|
Koo B, Jo T, Lee D. Modified inferential POD/ML for data-driven inverse procedure of steam reformer for 5-kW HT-PEMFC. Comput Chem Eng 2019. [DOI: 10.1016/j.compchemeng.2018.11.012] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
250
|
Ong F, Milanfar P, Getreuer P. Local Kernels that Approximate Bayesian Regularization and Proximal Operators. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 28:3007-3019. [PMID: 30640613 DOI: 10.1109/tip.2019.2893071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this work, we broadly connect kernel-based filtering (e.g. approaches such as the bilateral filter and nonlocal means, but also many more) with general variational formulations of Bayesian regularized least squares, and the related concept of proximal operators. Variational/Bayesian/proximal formulations often result in optimization problems that do not have closed-form solutions, and therefore typically require global iterative solutions. Our main contribution here is to establish how one can approximate the solution of the resulting global optimization problems using locally adaptive filters with specific kernels. Our results are valid for small regularization strength (i.e. weak noise) but the approach is powerful enough to be useful for a wide range of applications because we expose how to derive a "kernelized" solution to these problems that approximates the global solution in one shot, using only local operations. As another side benefit in the reverse direction, given a local data-adaptive filter constructed with a particular choice of kernel, we enable the interpretation of such filters in the variational/Bayesian/proximal framework.
Collapse
|