1
|
Liu Z, Luo H, Chen P, Xia Q, Gan Z, Shan W. An efficient isomorphic CNN-based prediction and decision framework for financial time series. INTELL DATA ANAL 2022. [DOI: 10.3233/ida-216142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Financial time series prediction and trading decision-making are priorities of computational intelligence for researchers in academia and the finance industry due to their broad application areas and substantial impact. However, these methods remain challenging because they retain various complex statistical properties, and the mechanism behind the processes is unknown to a large extent. A significant number of machine learning-based methods are proposed and demonstrate impressive results, especially deep learning-based models. Nevertheless, due to the high complexity of massive, nonlinear, and nonindependent data and the difficulties and time consumption of complicated training models of deep learning, the performance of online trading decisions is still inadequate for practical application. This paper proposes the Integrated Framework of Forecasting Based Online Trading Strategy (IFF-BOTS) to satisfy better prediction performance and dynamic decisions for real-world online trading systems. Our method adopts a novel isomorphic convolutional neural network (CNN)-based forecaster-classifier-executor architecture to exploit CNN-based price and trend integrated prediction and direct-reinforcement-learning-based trading decision-making. IFF-BOTS can also achieve better real-time performance for online trading. We empirically compare the proposed approach with state-of-the-art prediction and trading methods on real-world S&P and DJI datasets. The results show that the IFF-BOTS outperforms its competitors in predicting metrics, trading profits, and real-time performance.
Collapse
Affiliation(s)
- Zhongming Liu
- School of Computer and Software Engineering, Xihua University, Chengdu, Sichuan, China
| | - Hang Luo
- School of Economics, Xihua University, Chengdu, Sichuan, China
| | - Peng Chen
- School of Computer and Software Engineering, Xihua University, Chengdu, Sichuan, China
| | - Qibin Xia
- School of Computer and Software Engineering, Xihua University, Chengdu, Sichuan, China
| | - Zhihao Gan
- School of Computer and Software Engineering, Xihua University, Chengdu, Sichuan, China
| | - Wenyu Shan
- School of Computer and Software Engineering, Xihua University, Chengdu, Sichuan, China
| |
Collapse
|
2
|
Feature selection and deep neural networks for stock price direction forecasting using technical analysis indicators. MACHINE LEARNING WITH APPLICATIONS 2021. [DOI: 10.1016/j.mlwa.2021.100060] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
3
|
Feng S, Chen CLP. Fuzzy Broad Learning System: A Novel Neuro-Fuzzy Model for Regression and Classification. IEEE TRANSACTIONS ON CYBERNETICS 2020; 50:414-424. [PMID: 30106747 DOI: 10.1109/tcyb.2018.2857815] [Citation(s) in RCA: 76] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
A novel neuro-fuzzy model named fuzzy broad learning system (BLS) is proposed by merging the Takagi-Sugeno (TS) fuzzy system into BLS. The fuzzy BLS replaces the feature nodes of BLS with a group of TS fuzzy subsystems, and the input data are processed by each of them. Instead of aggregating the outputs of fuzzy rules produced by every fuzzy subsystem into one value immediately, all of them are sent to the enhancement layer for further nonlinear transformation to preserve the characteristic of inputs. The defuzzification outputs of all fuzzy subsystem and the outputs of enhancement layer are combined together to obtain the model output. The k -means method is employed to determine the centers of Gaussian membership functions in antecedent part and the number of fuzzy rules. The parameters that need to be calculated in a fuzzy BLS are the weights connecting the outputs of enhancement layer to model output and the randomly initialized coefficients of polynomials in consequent part in fuzzy subsystems, which can be calculated analytically. Therefore, fuzzy BLS retains the fast computational nature of BLS. The proposed fuzzy BLS is evaluated by some popular benchmarks for regression and classification, and compared with some state-of-the-art nonfuzzy and neuro-fuzzy approaches. The results indicate that fuzzy BLS outperforms other models involved. Moreover, fuzzy BLS shows advantages over neuro-fuzzy models regarding to the number of fuzzy rules and training time, which can ease the problem of rule explosion to some extent.
Collapse
|
4
|
Ognjanovic S, Thüring M, Murphy RO, Hölscher C. Display clutter and its effects on visual attention distribution and financial risk judgment. APPLIED ERGONOMICS 2019; 80:168-174. [PMID: 31280801 DOI: 10.1016/j.apergo.2019.05.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Revised: 12/06/2018] [Accepted: 05/20/2019] [Indexed: 06/09/2023]
Abstract
Display clutter is a widely studied phenomenon in ergonomics, where information density and other properties of task-relevant visualizations are related to effective user performance and visual attention. This paper examines the impact of clutter in the context of financial stock visualizations. Depending on their expertise, traders can use a variety of different cues to judge the current and future value of a stock and to assess its riskiness. In our study, two groups of participants (novices and experts) judge the riskiness of 28 pairs of stocks under two clutter conditions (low and high). Consistency of judgments and group concordance serve as measures for judgment performance, while mean fixation duration, fixation frequency, and transition matrix density are employed to capture visual attention. Our results reveal significant effects of display clutter and expertise on both the performance measures as well as the visual attention measures.
Collapse
Affiliation(s)
- Svetlana Ognjanovic
- ETH Zurich, Chair of Cognitive Science, Clausiusstrasse 59, 8092, Zürich, Switzerland.
| | - Manfred Thüring
- Technische Universität Berlin, Faculty V of Mechanical Engineering and Transport Systems, Department of Psychology and Ergonomics, Chair of Cognitive Psychology and Cognitive Ergonomics, Marchstraße 23, D-10587, Berlin, Germany
| | - Ryan O Murphy
- Morningstar Investment Management, Head of Decision Sciences, 22 W. Washington St, Chicago, IL, 60602, USA
| | - Christoph Hölscher
- ETH Zurich, Chair of Cognitive Science, Clausiusstrasse 59, 8092, Zürich, Switzerland
| |
Collapse
|
5
|
|
6
|
Deng Y, Bao F, Kong Y, Ren Z, Dai Q. Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:653-664. [PMID: 26890927 DOI: 10.1109/tnnls.2016.2522401] [Citation(s) in RCA: 110] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Can we train the computer to beat experienced traders for financial assert trading? In this paper, we try to address this challenge by introducing a recurrent deep neural network (NN) for real-time financial signal representation and trading. Our model is inspired by two biological-related learning concepts of deep learning (DL) and reinforcement learning (RL). In the framework, the DL part automatically senses the dynamic market condition for informative feature learning. Then, the RL module interacts with deep representations and makes trading decisions to accumulate the ultimate rewards in an unknown environment. The learning system is implemented in a complex NN that exhibits both the deep and recurrent structures. Hence, we propose a task-aware backpropagation through time method to cope with the gradient vanishing issue in deep training. The robustness of the neural system is verified on both the stock and the commodity future markets under broad testing conditions.
Collapse
|
7
|
|
8
|
A feature fusion based forecasting model for financial time series. PLoS One 2014; 9:e101113. [PMID: 24971455 PMCID: PMC4074191 DOI: 10.1371/journal.pone.0101113] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Accepted: 06/03/2014] [Indexed: 11/28/2022] Open
Abstract
Predicting the stock market has become an increasingly interesting research area for both researchers and investors, and many prediction models have been proposed. In these models, feature selection techniques are used to pre-process the raw data and remove noise. In this paper, a prediction model is constructed to forecast stock market behavior with the aid of independent component analysis, canonical correlation analysis, and a support vector machine. First, two types of features are extracted from the historical closing prices and 39 technical variables obtained by independent component analysis. Second, a canonical correlation analysis method is utilized to combine the two types of features and extract intrinsic features to improve the performance of the prediction model. Finally, a support vector machine is applied to forecast the next day's closing price. The proposed model is applied to the Shanghai stock market index and the Dow Jones index, and experimental results show that the proposed model performs better in the area of prediction than other two similar models.
Collapse
|
9
|
Kar S, Das S, Ghosh PK. Applications of neuro fuzzy systems: A brief review and future outline. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2013.10.014] [Citation(s) in RCA: 213] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
10
|
Fernandez JM, Augusto JC, Seepold R, Madrid NM. A Sensor Technology Survey for a Stress-Aware Trading Process. ACTA ACUST UNITED AC 2012. [DOI: 10.1109/tsmcc.2011.2179028] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
11
|
Tung WL, Quek C. Financial volatility trading using a self-organising neural-fuzzy semantic network and option straddle-based approach. EXPERT SYSTEMS WITH APPLICATIONS 2011; 38:4668-4688. [PMID: 32288336 PMCID: PMC7126939 DOI: 10.1016/j.eswa.2010.07.116] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Financial volatility refers to the intensity of the fluctuations in the expected return on an investment or the pricing of a financial asset due to market uncertainties. Hence, volatility modeling and forecasting is imperative to financial market investors, as such projections allow the investors to adjust their trading strategies in anticipation of the impending financial market movements. Following this, financial volatility trading is the capitalization of the uncertainties of the financial markets to realize investment profits in times of rising, falling and side-way market conditions. In this paper, an intelligent straddle trading system (framework) that consists of a volatility projection module (VPM) and a trade decision module (TDM) is proposed for financial volatility trading via the buying and selling of option straddles to help a human trader capitalizes on the underlying uncertainties of the Hong Kong stock market. Three different measures, namely: (1) the historical volatility (HV), (2) implied volatility (IV) and (3) model-based volatility (MV) of the Hang Seng Index (HSI) are employed to quantify the implicit volatility of the Hong Kong stock market. The TDM of the proposed straddle trading system combines the respective volatility measures with the well-established moving-averages convergence/divergence (MACD) principle to recommend trading actions to a human trader dealing in HSI straddles. However, the inherent limitation of the MACD trading rule is that it generates time-delayed trading signals due to the use of moving averages, which are essentially lagging trend indicators. This drawback is intuitively addressed in the proposed straddle trading system by applying the VPM to compute future projections of the volatility measures of the HSI prior to the activation of the TDM. The VPM is realized by a self-organising neural-fuzzy semantic network named the evolving fuzzy semantic memory (eFSM) model. As compared to existing statistical and computational intelligence based modeling techniques currently employed for financial volatility modeling and forecasting, eFSM possesses several desirable attributes such as: (1) an evolvable knowledge base to continuously address the non-stationary characteristics of the Hong Kong stock market; (2) highly formalized human-like information computations; and (3) a transparent structure that can be interpreted via a set of linguistic IF-THEN semantic fuzzy rules. These qualities provide added credence to the computed HSI volatility projections. The volatility modeling and forecasting performances of the eFSM, when benchmarked to several established modeling techniques, as well as the observed trading returns of the proposed straddle trading system, are encouraging.
Collapse
Affiliation(s)
- W L Tung
- Centre for Computational Intelligence, Block N4 #2A-32, School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore
| | - C Quek
- Centre for Computational Intelligence, Block N4 #2A-32, School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore
| |
Collapse
|
12
|
Huang SH, Lai SH, Tai SH. A learning-based contrarian trading strategy via a dual-classifier model. ACM T INTEL SYST TEC 2011. [DOI: 10.1145/1961189.1961192] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Behavioral finance is a relatively new and developing research field which adopts cognitive psychology and emotional bias to explain the inefficient market phenomenon and some irrational trading decisions. Unlike the experts in this field who tried to reason the price anomaly and applied empirical evidence in many different financial markets, we employ the advanced binary classification algorithms, such as AdaBoost and support vector machines, to precisely model the overreaction and strengthen the portfolio compositions of the contrarian trading strategies. The novelty of this article is to discover the financial time-series patterns through a high-dimensional and nonlinear model which is constructed by integrated knowledge of finance and machine learning techniques. We propose a dual-classifier learning framework to select candidate stocks from the past results of original contrarian trading strategies based on the defined learning targets. Three different feature extraction methods, including wavelet transformation, historical return distribution, and various technical indicators, are employed to represent these learning samples in a 381-dimensional financial time-series feature space. Finally, we construct the classifier models with four different learning kernels and prove that the proposed methods could improve the returns dramatically, such as the 3-year return that improved from 26.79% to 53.75%. The experiments also demonstrate significantly higher portfolio selection accuracy, improved from 57.47% to 66.41%, than the original contrarian trading strategy. To sum up, all these experiments show that the proposed method could be extended to an effective trading system in the historical stock prices of the leading U.S. companies of S&P 100 index.
Collapse
|
13
|
Tan J, Chai Quek. A BCM Theory of Meta-Plasticity for Online Self-Reorganizing Fuzzy-Associative Learning. ACTA ACUST UNITED AC 2010; 21:985-1003. [DOI: 10.1109/tnn.2010.2046747] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
14
|
Goh H, Lim JH, Quek C. Fuzzy associative conjuncted maps network. IEEE TRANSACTIONS ON NEURAL NETWORKS 2009; 20:1302-1319. [PMID: 19635694 DOI: 10.1109/tnn.2009.2023213] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
The fuzzy associative conjuncted maps (FASCOM) is a fuzzy neural network that associates data of nonlinearly related inputs and outputs. In the network, each input or output dimension is represented by a feature map that is partitioned into fuzzy or crisp sets. These fuzzy sets are then conjuncted to form antecedents and consequences, which are subsequently associated to form if-then rules. The associative memory is encoded through an offline batch mode learning process consisting of three consecutive phases. The initial unsupervised membership function initialization phase takes inspiration from the organization of sensory maps in our brains by allocating membership functions based on uniform information density. Next, supervised Hebbian learning encodes synaptic weights between input and output nodes. Finally, a supervised error reduction phase fine-tunes the network, which allows for the discovery of the varying levels of influence of each input dimension across an output feature space in the encoded memory. In the series of experiments, we show that each phase in the learning process contributes significantly to the final accuracy of prediction. Further experiments using both toy problems and real-world data demonstrate significant superiority in terms of accuracy of nonlinear estimation when benchmarked against other prominent architectures and exhibit the network's suitability to perform analysis and prediction on real-world applications, such as traffic density prediction as shown in this paper.
Collapse
Affiliation(s)
- Hanlin Goh
- Centre for Computational Intelligence, School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore.
| | | | | |
Collapse
|
15
|
Singh A, Quek C, Cho SY. DCT-Yager FNN: a novel Yager-based fuzzy neural network with the discrete clustering technique. IEEE TRANSACTIONS ON NEURAL NETWORKS 2008; 19:625-44. [PMID: 18390309 DOI: 10.1109/tnn.2007.911709] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Earlier clustering techniques such as the modified learning vector quantization (MLVQ) and the fuzzy Kohonen partitioning (FKP) techniques have focused on the derivation of a certain set of parameters so as to define the fuzzy sets in terms of an algebraic function. The fuzzy membership functions thus generated are uniform, normal, and convex. Since any irregular training data is clustered into uniform fuzzy sets (Gaussian, triangular, or trapezoidal), the clustering may not be exact and some amount of information may be lost. In this paper, two clustering techniques using a Kohonen-like self-organizing neural network architecture, namely, the unsupervised discrete clustering technique (UDCT) and the supervised discrete clustering technique (SDCT), are proposed. The UDCT and SDCT algorithms reduce this data loss by introducing nonuniform, normal fuzzy sets that are not necessarily convex. The training data range is divided into discrete points at equal intervals, and the membership value corresponding to each discrete point is generated. Hence, the fuzzy sets obtained contain pairs of values, each pair corresponding to a discrete point and its membership grade. Thus, it can be argued that fuzzy membership functions generated using this kind of a discrete methodology provide a more accurate representation of the actual input data. This fact has been demonstrated by comparing the membership functions generated by the UDCT and SDCT algorithms against those generated by the MLVQ, FKP, and pseudofuzzy Kohonen partitioning (PFKP) algorithms. In addition to these clustering techniques, a novel pattern classifying network called the Yager fuzzy neural network (FNN) is proposed in this paper. This network corresponds completely to the Yager inference rule and exhibits remarkable generalization abilities. A modified version of the pseudo-outer product (POP)-Yager FNN called the modified Yager FNN is introduced that eliminates the drawbacks of the earlier network and yi- elds superior performance. Extensive experiments have been conducted to test the effectiveness of these two networks, using various clustering algorithms. It follows that the SDCT and UDCT clustering algorithms are particularly suited to networks based on the Yager inference rule.
Collapse
Affiliation(s)
- A Singh
- Centre for Computational Intelligence, Nanyang Technological University, Singapore
| | | | | |
Collapse
|
16
|
Teddy SD, Quek C, Lai EK. PSECMAC: a novel self-organizing multiresolution associative memory architecture. IEEE TRANSACTIONS ON NEURAL NETWORKS 2008; 19:689-712. [PMID: 18390313 DOI: 10.1109/tnn.2007.912300] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The cerebellum constitutes a vital part of the human brain system that possesses the capability to model highly nonlinear physical dynamics. The cerebellar model articulation controller (CMAC) associative memory network is a computational model inspired by the neurophysiological properties of the cerebellum, and it has been widely used for control, optimization, and various pattern recognition tasks. However, the CMAC network's highly regularized computing structure often leads to the following: 1) a suboptimal modeling accuracy, 2) poor memory utilization, and 3) the generalization-accuracy dilemma. Previous attempts to address these shortcomings have limited success and the proposed solutions often introduce a high operational complexity to the CMAC network. This paper presents a novel neurophysiologically inspired associative memory architecture named pseudo-self-evolving CMAC (PSECMAC) that nonuniformly allocates its computing cells to overcome the architectural deficiencies encountered by the CMAC network. The nonuniform memory allocation scheme employed by the proposed PSECMAC network is inspired by the cerebellar experience-driven synaptic plasticity phenomenon observed in the cerebellum, where significantly higher densities of synaptic connections are located in the frequently accessed regions. In the PSECMAC network, this biological synaptic plasticity phenomenon is emulated by employing a data-driven adaptive memory quantization scheme that defines its computing structure. A neighborhood-based activation process is subsequently implemented to facilitate the learning and computation of the PSECMAC structure. The training stability of the PSECMAC network is theoretically assured by the proof of its learning convergence, which will be presented in this paper. The performance of the proposed network is subsequently benchmarked against the CMAC network and several representative CMAC variants on three real-life applications, namely, pricing of currency futures option, banking failure classification, and modeling of the glucose-insulin dynamics of the human glucose metabolic process. The experimental results have strongly demonstrated the effectiveness of the PSECMAC network in addressing the architectural deficiencies of the CMAC network by achieving significant improvements in the memory utilization, output accuracy as well as the generalization capability of the network.
Collapse
Affiliation(s)
- S D Teddy
- Centre for Computational Intelligence, Nanyang Technological University, Singapore 639798, Singapore
| | | | | |
Collapse
|
17
|
Teddy SD, Lai EMK, Quek C. Hierarchically clustered adaptive quantization CMAC and its learning convergence. ACTA ACUST UNITED AC 2008; 18:1658-82. [PMID: 18051184 DOI: 10.1109/tnn.2007.900810] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The cerebellar model articulation controller (CMAC) neural network (NN) is a well-established computational model of the human cerebellum. Nevertheless, there are two major drawbacks associated with the uniform quantization scheme of the CMAC network. They are the following: (1) a constant output resolution associated with the entire input space and (2) the generalization-accuracy dilemma. Moreover, the size of the CMAC network is an exponential function of the number of inputs. Depending on the characteristics of the training data, only a small percentage of the entire set of CMAC memory cells is utilized. Therefore, the efficient utilization of the CMAC memory is a crucial issue. One approach is to quantize the input space nonuniformly. For existing nonuniformly quantized CMAC systems, there is a tradeoff between memory efficiency and computational complexity. Inspired by the underlying organizational mechanism of the human brain, this paper presents a novel CMAC architecture named hierarchically clustered adaptive quantization CMAC (HCAQ-CMAC). HCAQ-CMAC employs hierarchical clustering for the nonuniform quantization of the input space to identify significant input segments and subsequently allocating more memory cells to these regions. The stability of the HCAQ-CMAC network is theoretically guaranteed by the proof of its learning convergence. The performance of the proposed network is subsequently benchmarked against the original CMAC network, as well as two other existing CMAC variants on two real-life applications, namely, automated control of car maneuver and modeling of the human blood glucose dynamics. The experimental results have demonstrated that the HCAQ-CMAC network offers an efficient memory allocation scheme and improves the generalization and accuracy of the network output to achieve better or comparable performances with smaller memory usages. Index Terms-Cerebellar model articulation controller (CMAC), hierarchical clustering, hierarchically clustered adaptive quantization CMAC (HCAQ-CMAC), learning convergence, nonuniform quantization.
Collapse
Affiliation(s)
- S D Teddy
- School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore
| | | | | |
Collapse
|
18
|
Tung WL, Quek C. A brain-inspired fuzzy semantic memory model for learning and reasoning with uncertainty. Neural Comput Appl 2007. [DOI: 10.1007/s00521-007-0101-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
19
|
|
20
|
Tan TZ, Quek C, Ng GS. BIOLOGICAL BRAIN-INSPIRED GENETIC COMPLEMENTARY LEARNING FOR STOCK MARKET AND BANK FAILURE PREDICTION. Comput Intell 2007. [DOI: 10.1111/j.1467-8640.2007.00303.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
21
|
Quah KH, Quek C. MCES: A Novel Monte Carlo Evaluative Selection Approach for Objective Feature Selections. ACTA ACUST UNITED AC 2007; 18:431-48. [PMID: 17385630 DOI: 10.1109/tnn.2006.887555] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Most recent research efforts on feature selection have focused mainly on classification task due to its popularity in the data-mining community. However, feature selection research in nonlinear system estimations has been very limited. Hence, it is reasonable to devise a feature selection approach that is computationally efficient on nonlinear system estimations context. A novel feature selection approach, the Monte Carlo evaluative selection (MCES), is proposed in this paper. MCES is an objective sampling method that derives a better estimation of the relevancy measure. The algorithm is objectively designed to be applicable to both classification and nonlinear regressive tasks. The MCES method has been demonstrated to perform well with four sets of experiments, consisting of two classification and two regressive tasks. The results demonstrate that the MCES method has following strong advantages: 1) ability to identify correlated and irrelevant features based on weight ranking, 2) application to both nonlinear system estimation and classification tasks, and 3) independence of the underlying induction algorithms used to derive the performance measures.
Collapse
Affiliation(s)
- Kian Hong Quah
- Centre for Computational Intelligence, Nanyang Technological University, School of Computer Engineering, Singapore 639798, Singapore
| | | |
Collapse
|
22
|
Abstract
The cerebellum is a brain region important for a number of motor and cognitive functions. It is able to generate error correction signals to drive learning and for the acquisition of memory skills. The cerebellar model articulation controller (CMAC) is a neural network inspired by the neurophysiologic theory of the cerebellum and is recognized for its localized generalization and rapid algorithmic computation capabilities. The main deficiencies in the basic CMAC structure are: (1) it is difficult to interpret the internal operations of the CMAC network and (2) the resolution (quantization) problem arising from the partitioning of the input training space. These limitations lead to the synthesis of a fuzzy quantization technique and the mapping of a fuzzy inference scheme onto the CMAC structure. The discrete incremental clustering (DIC) technique is employed to alleviate the quantization problem in the CMAC structure, resulting in the fuzzy CMAC (FCMAC) network. The Yager inference scheme (Yager), which possesses firm fuzzy logic foundation and maps closely to the logical implication operations in the classical (binary) logic framework, is subsequently mapped onto the FCMAC structure. This results in a novel fuzzy neural architecture known as thefuzzy cerebellar model articulation controller-Yager (FCMAC-Yager) system. The proposed FCMAC-Yager network exhibits learning and memory capabilities of the cerebellum through the CMAC structure while emulating the human way of reasoning through the Yager. The new FCMAC-Yager network employs a two-phase training algorithm consisting of structural learning based on the DIC technique and parameter learning using hebbian learning (associative long-term potentiation). The proposed FCMAC-Yager architecture is evaluated using an extensive suite of real-life applications such as highway traffic-trend modeling and prediction and performing as an early warning system for bank failure classification and medical diagnosis of breast cancer. The experimental results are encouraging.
Collapse
Affiliation(s)
- J Sim
- Centre for Computational Intelligence, School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore
| | | | | |
Collapse
|