1
|
Survival Prediction of Patients after Heart Attack and Breast Cancer Surgery with a Hybrid Model Built with Particle Swarm Optimization, Stacked AutoEncoders, and the Softmax Classifier. Biomimetics (Basel) 2024; 9:304. [PMID: 38786514 PMCID: PMC11117878 DOI: 10.3390/biomimetics9050304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 05/15/2024] [Accepted: 05/17/2024] [Indexed: 05/25/2024] Open
Abstract
The prediction of patient survival is crucial for guiding the treatment process in healthcare. Healthcare professionals rely on analyzing patients' clinical characteristics and findings to determine treatment plans, making accurate predictions essential for efficient resource utilization and optimal patient support during recovery. In this study, a hybrid architecture combining Stacked AutoEncoders, Particle Swarm Optimization, and the Softmax Classifier was developed for predicting patient survival. The architecture was evaluated using the Haberman's Survival dataset and the Echocardiogram dataset from UCI. The results were compared with several Machine Learning methods, including Decision Trees, K-Nearest Neighbors, Support Vector Machines, Neural Networks, Gradient Boosting, and Gradient Bagging applied to the same datasets. The findings indicate that the proposed architecture outperforms other Machine Learning methods in predicting patient survival for both datasets and surpasses the results reported in the literature for the Haberman's Survival dataset. In the light of the findings obtained, the models obtained with the proposed architecture can be used as a decision support system in determining patient care and applied methods.
Collapse
|
2
|
Lightweight 3D Dense Autoencoder Network for Hyperspectral Remote Sensing Image Classification. SENSORS (BASEL, SWITZERLAND) 2023; 23:8635. [PMID: 37896728 PMCID: PMC10610785 DOI: 10.3390/s23208635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 10/16/2023] [Accepted: 10/20/2023] [Indexed: 10/29/2023]
Abstract
The lack of labeled training samples restricts the improvement of Hyperspectral Remote Sensing Image (HRSI) classification accuracy based on deep learning methods. In order to improve the HRSI classification accuracy when there are few training samples, a Lightweight 3D Dense Autoencoder Network (L3DDAN) is proposed. Structurally, the L3DDAN is designed as a stacked autoencoder which consists of an encoder and a decoder. The encoder is a hybrid combination of 3D convolutional operations and 3D dense block for extracting deep features from raw data. The decoder composed of 3D deconvolution operations is designed to reconstruct data. The L3DDAN is trained by unsupervised learning without labeled samples and supervised learning with a small number of labeled samples, successively. The network composed of the fine-tuned encoder and trained classifier is used for classification tasks. The extensive comparative experiments on three benchmark HRSI datasets demonstrate that the proposed framework with fewer trainable parameters can maintain superior performance to the other eight state-of-the-art algorithms when there are only a few training samples. The proposed L3DDAN can be applied to HRSI classification tasks, such as vegetation classification. Future work mainly focuses on training time reduction and applications on more real-world datasets.
Collapse
|
3
|
A Semi-Supervised Stacked Autoencoder Using the Pseudo Label for Classification Tasks. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1274. [PMID: 37761573 PMCID: PMC10528325 DOI: 10.3390/e25091274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 08/20/2023] [Accepted: 08/28/2023] [Indexed: 09/29/2023]
Abstract
The efficiency and cognitive limitations of manual sample labeling result in a large number of unlabeled training samples in practical applications. Making full use of both labeled and unlabeled samples is the key to solving the semi-supervised problem. However, as a supervised algorithm, the stacked autoencoder (SAE) only considers labeled samples and is difficult to apply to semi-supervised problems. Thus, by introducing the pseudo-labeling method into the SAE, a novel pseudo label-based semi-supervised stacked autoencoder (PL-SSAE) is proposed to address the semi-supervised classification tasks. The PL-SSAE first utilizes the unsupervised pre-training on all samples by the autoencoder (AE) to initialize the network parameters. Then, by the iterative fine-tuning of the network parameters based on the labeled samples, the unlabeled samples are identified, and their pseudo labels are generated. Finally, the pseudo-labeled samples are used to construct the regularization term and fine-tune the network parameters to complete the training of the PL-SSAE. Different from the traditional SAE, the PL-SSAE requires all samples in pre-training and the unlabeled samples with pseudo labels in fine-tuning to fully exploit the feature and category information of the unlabeled samples. Empirical evaluations on various benchmark datasets show that the semi-supervised performance of the PL-SSAE is more competitive than that of the SAE, sparse stacked autoencoder (SSAE), semi-supervised stacked autoencoder (Semi-SAE) and semi-supervised stacked autoencoder (Semi-SSAE).
Collapse
|
4
|
Prediction of miRNA-Disease Associations by Cascade Forest Model Based on Stacked Autoencoder. Molecules 2023; 28:5013. [PMID: 37446675 DOI: 10.3390/molecules28135013] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 06/23/2023] [Accepted: 06/24/2023] [Indexed: 07/15/2023] Open
Abstract
Numerous pieces of evidence have indicated that microRNA (miRNA) plays a crucial role in a series of significant biological processes and is closely related to complex disease. However, the traditional biological experimental methods used to verify disease-related miRNAs are inefficient and expensive. Thus, it is necessary to design some excellent approaches to improve efficiency. In this work, a novel method (CFSAEMDA) is proposed for the prediction of unknown miRNA-disease associations (MDAs). Specifically, we first capture the interactive features of miRNA and disease by integrating multi-source information. Then, the stacked autoencoder is applied for obtaining the underlying feature representation. Finally, the modified cascade forest model is employed to complete the final prediction. The experimental results present that the AUC value obtained by our method is 97.67%. The performance of CFSAEMDA is superior to several of the latest methods. In addition, case studies conducted on lung neoplasms, breast neoplasms and hepatocellular carcinoma further show that the CFSAEMDA method may be regarded as a utility approach to infer unknown disease-miRNA relationships.
Collapse
|
5
|
CapsNet-LDA: predicting lncRNA-disease associations using attention mechanism and capsule network based on multi-view data. Brief Bioinform 2023; 24:6889447. [PMID: 36511221 DOI: 10.1093/bib/bbac531] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Revised: 10/25/2022] [Accepted: 11/07/2022] [Indexed: 12/15/2022] Open
Abstract
Cumulative studies have shown that many long non-coding RNAs (lncRNAs) are crucial in a number of diseases. Predicting potential lncRNA-disease associations (LDAs) can facilitate disease prevention, diagnosis and treatment. Therefore, it is vital to develop practical computational methods for LDA prediction. In this study, we propose a novel predictor named capsule network (CapsNet)-LDA for LDA prediction. CapsNet-LDA first uses a stacked autoencoder for acquiring the informative low-dimensional representations of the lncRNA-disease pairs under multiple views, then the attention mechanism is leveraged to implement an adaptive allocation of importance weights to them, and they are subsequently processed using a CapsNet-based architecture for predicting LDAs. Different from the conventional convolutional neural networks (CNNs) that have some restrictions with the usage of scalar neurons and pooling operations. the CapsNets use vector neurons instead of scalar neurons that have better robustness for the complex combination of features and they use dynamic routing processes for updating parameters. CapsNet-LDA is superior to other five state-of-the-art models on four benchmark datasets, four perturbed datasets and an independent test set in the comparison experiments, demonstrating that CapsNet-LDA has excellent performance and robustness against perturbation, as well as good generalization ability. The ablation studies verify the effectiveness of some modules of CapsNet-LDA. Moreover, the ability of multi-view data to improve performance is proven. Case studies further indicate that CapsNet-LDA can accurately predict novel LDAs for specific diseases.
Collapse
|
6
|
An Ensemble Learning Method for Emotion Charting Using Multimodal Physiological Signals. SENSORS (BASEL, SWITZERLAND) 2022; 22:9480. [PMID: 36502183 PMCID: PMC9739519 DOI: 10.3390/s22239480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 11/24/2022] [Accepted: 11/28/2022] [Indexed: 06/17/2023]
Abstract
Emotion charting using multimodal signals has gained great demand for stroke-affected patients, for psychiatrists while examining patients, and for neuromarketing applications. Multimodal signals for emotion charting include electrocardiogram (ECG) signals, electroencephalogram (EEG) signals, and galvanic skin response (GSR) signals. EEG, ECG, and GSR are also known as physiological signals, which can be used for identification of human emotions. Due to the unbiased nature of physiological signals, this field has become a great motivation in recent research as physiological signals are generated autonomously from human central nervous system. Researchers have developed multiple methods for the classification of these signals for emotion detection. However, due to the non-linear nature of these signals and the inclusion of noise, while recording, accurate classification of physiological signals is a challenge for emotion charting. Valence and arousal are two important states for emotion detection; therefore, this paper presents a novel ensemble learning method based on deep learning for the classification of four different emotional states including high valence and high arousal (HVHA), low valence and low arousal (LVLA), high valence and low arousal (HVLA) and low valence high arousal (LVHA). In the proposed method, multimodal signals (EEG, ECG, and GSR) are preprocessed using bandpass filtering and independent components analysis (ICA) for noise removal in EEG signals followed by discrete wavelet transform for time domain to frequency domain conversion. Discrete wavelet transform results in spectrograms of the physiological signal and then features are extracted using stacked autoencoders from those spectrograms. A feature vector is obtained from the bottleneck layer of the autoencoder and is fed to three classifiers SVM (support vector machine), RF (random forest), and LSTM (long short-term memory) followed by majority voting as ensemble classification. The proposed system is trained and tested on the AMIGOS dataset with k-fold cross-validation. The proposed system obtained the highest accuracy of 94.5% and shows improved results of the proposed method compared with other state-of-the-art methods.
Collapse
|
7
|
A Deep-Learning-Based Health Indicator Constructor Using Kullback-Leibler Divergence for Predicting the Remaining Useful Life of Concrete Structures. SENSORS (BASEL, SWITZERLAND) 2022; 22:3687. [PMID: 35632097 PMCID: PMC9146863 DOI: 10.3390/s22103687] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 05/08/2022] [Accepted: 05/10/2022] [Indexed: 06/15/2023]
Abstract
This paper proposes a new technique for the construction of a concrete-beam health indicator based on the Kullback-Leibler divergence (KLD) and deep learning. Health indicator (HI) construction is a vital part of remaining useful lifetime (RUL) approaches for monitoring the health of concrete structures. Through the construction of a HI, the deterioration process can be processed and portrayed so that it can be forwarded to a prediction module for RUL prognosis. The degradation progression and failure can be identified by predicting the RUL based on the situation of the current specimen; as a result, maintenance can be planned to reduce safety risks, reduce financial costs, and prolong the specimen's useful lifetime. The portrayal of deterioration through HI construction from raw acoustic emission (AE) data is performed using a deep neural network (DNN), whose parameters are obtained by pretraining and fine tuning using a stack autoencoder (SAE). Kullback-Leibler divergence, which is calculated between a reference normal-conditioned signal and a current unknown signal, was used to represent the deterioration process of concrete structures, which has not been investigated for the concrete beams so far. The DNN-based constructor then learns to generate HI from raw data with KLD values as the training label. The HI construction result was evaluated with run-to-fail test data of concrete specimens with two measurements: fitness analysis of the construction result and RUL prognosis. The results confirm the reliability of KLD in portraying the deterioration process, showing a large improvement in comparison to other methods. In addition, this method requires no adept knowledge of the nature of the AE or the system fault, which is more favorable than model-based approaches where this level of expertise is compulsory. Furthermore, AE offers in-service monitoring, allowing the RUL prognosis task to be performed without disrupting the specimen's work.
Collapse
|
8
|
A Scheme with Acoustic Emission Hit Removal for the Remaining Useful Life Prediction of Concrete Structures. SENSORS 2021; 21:s21227761. [PMID: 34833836 PMCID: PMC8619075 DOI: 10.3390/s21227761] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 11/17/2021] [Accepted: 11/19/2021] [Indexed: 11/16/2022]
Abstract
In this study, a scheme of remaining useful lifetime (RUL) prognosis from raw acoustic emission (AE) data is presented to predict the concrete structure's failure before its occurrence, thus possibly prolong its service life and minimizing the risk of accidental damage. The deterioration process is portrayed by the health indicator (HI), which is automatically constructed from raw AE data with a deep neural network pretrained and fine-tuned by a stacked autoencoder deep neural network (SAE-DNN). For the deep neural network structure to perform a more accurate construction of health indicator lines, a hit removal process with a one-class support vector machine (OC-SVM), which has not been investigated in previous studies, is proposed to extract only the hits which matter the most to the portrait of deterioration. The new set of hits is then harnessed as the training labels for the deep neural network. After the completion of the health indicator line construction, health indicators are forwarded to a long short-term memory recurrent neural network (LSTM-RNN) for the training and validation of the remaining useful life prediction, as this structure is capable of capturing the long-term dependencies, even with a limited set of data. Our prediction result shows a significant improvement in comparison with a similar scheme but without the hit removal process and other methods, such as the gated recurrent unit recurrent neural network (GRU-RNN) and the simple recurrent neural network.
Collapse
|
9
|
Electroencephalogram Emotion Recognition Based on 3D Feature Fusion and Convolutional Autoencoder. Front Comput Neurosci 2021; 15:743426. [PMID: 34733148 PMCID: PMC8558247 DOI: 10.3389/fncom.2021.743426] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Accepted: 08/23/2021] [Indexed: 11/19/2022] Open
Abstract
As one of the key technologies of emotion computing, emotion recognition has received great attention. Electroencephalogram (EEG) signals are spontaneous and difficult to camouflage, so they are used for emotion recognition in academic and industrial circles. In order to overcome the disadvantage that traditional machine learning based emotion recognition technology relies too much on a manual feature extraction, we propose an EEG emotion recognition algorithm based on 3D feature fusion and convolutional autoencoder (CAE). First, the differential entropy (DE) features of different frequency bands of EEG signals are fused to construct the 3D features of EEG signals, which retain the spatial information between channels. Then, the constructed 3D features are input into the CAE constructed in this paper for emotion recognition. In this paper, many experiments are carried out on the open DEAP dataset, and the recognition accuracy of valence and arousal dimensions are 89.49 and 90.76%, respectively. Therefore, the proposed method is suitable for emotion recognition tasks.
Collapse
|
10
|
[Study on the inverse problem of diffuse optical tomography based on improved stacked auto-encoder]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2021; 38:774-782. [PMID: 34459178 DOI: 10.7507/1001-5515.202010041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
The inverse problem of diffuse optical tomography (DOT) is ill-posed. Traditional method cannot achieve high imaging accuracy and the calculation process is time-consuming, which restricts the clinical application of DOT. Therefore, a method based on stacked auto-encoder (SAE) was proposed and used for the DOT inverse problem. Firstly, a traditional SAE method is used to solved the inverse problem. Then, the output structure of SAE neural network is improved to a single output SAE, which reduce the burden on the neural network. Finally, the improved SAE method is used to compare with traditional SAE method and traditional levenberg-marquardt (LM) iterative method. The result shows that the average time to solve the inverse problem of the method proposed in this paper is only 1.67% of the LM method. The mean square error (MSE) value is 46.21% lower than the traditional iterative method, 61.53% lower than the traditional SAE method, and the image correlation coefficient(ICC) value is 4.03% higher than the traditional iterative method, 18.7% higher than the traditional SAE method and has good noise immunity under 3% noise conditions. The research results in this article prove that the improved SAE method has higher image quality and noise resistance than the traditional SAE method, and at the same time has a faster calculation speed than the traditional iterative method, which is conducive to the application of neural networks in DOT inverse problem calculation.
Collapse
|
11
|
Radar HRRP Target Recognition Based on Stacked Autoencoder and Extreme Learning Machine. SENSORS 2018; 18:s18010173. [PMID: 29320453 PMCID: PMC5796358 DOI: 10.3390/s18010173] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2017] [Revised: 12/30/2017] [Accepted: 01/08/2018] [Indexed: 01/25/2023]
Abstract
A novel radar high-resolution range profile (HRRP) target recognition method based on a stacked autoencoder (SAE) and extreme learning machine (ELM) is presented in this paper. As a key component of deep structure, the SAE does not only learn features by making use of data, it also obtains feature expressions at different levels of data. However, with the deep structure, it is hard to achieve good generalization performance with a fast learning speed. ELM, as a new learning algorithm for single hidden layer feedforward neural networks (SLFNs), has attracted great interest from various fields for its fast learning speed and good generalization performance. However, ELM needs more hidden nodes than conventional tuning-based learning algorithms due to the random set of input weights and hidden biases. In addition, the existing ELM methods cannot utilize the class information of targets well. To solve this problem, a regularized ELM method based on the class information of the target is proposed. In this paper, SAE and the regularized ELM are combined to make full use of their advantages and make up for each of their shortcomings. The effectiveness of the proposed method is demonstrated by experiments with measured radar HRRP data. The experimental results show that the proposed method can achieve good performance in the two aspects of real-time and accuracy, especially when only a few training samples are available.
Collapse
|
12
|
Synthetic Aperture Radar Target Recognition with Feature Fusion Based on a Stacked Autoencoder. SENSORS 2017; 17:s17010192. [PMID: 28117689 PMCID: PMC5298765 DOI: 10.3390/s17010192] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Revised: 12/20/2016] [Accepted: 01/13/2017] [Indexed: 11/17/2022]
Abstract
Feature extraction is a crucial step for any automatic target recognition process, especially in the interpretation of synthetic aperture radar (SAR) imagery. In order to obtain distinctive features, this paper proposes a feature fusion algorithm for SAR target recognition based on a stacked autoencoder (SAE). The detailed procedure presented in this paper can be summarized as follows: firstly, 23 baseline features and Three-Patch Local Binary Pattern (TPLBP) features are extracted. These features can describe the global and local aspects of the image with less redundancy and more complementarity, providing richer information for feature fusion. Secondly, an effective feature fusion network is designed. Baseline and TPLBP features are cascaded and fed into a SAE. Then, with an unsupervised learning algorithm, the SAE is pre-trained by greedy layer-wise training method. Capable of feature expression, SAE makes the fused features more distinguishable. Finally, the model is fine-tuned by a softmax classifier and applied to the classification of targets. 10-class SAR targets based on Moving and Stationary Target Acquisition and Recognition (MSTAR) dataset got a classification accuracy up to 95.43%, which verifies the effectiveness of the presented algorithm.
Collapse
|
13
|
DL-PRO: A Novel Deep Learning Method for Protein Model Quality Assessment. PROCEEDINGS OF ... INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS. INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2014; 2014:2071-2078. [PMID: 25392745 PMCID: PMC4226404 DOI: 10.1109/ijcnn.2014.6889891] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Computational protein structure prediction is very important for many applications in bioinformatics. In the process of predicting protein structures, it is essential to accurately assess the quality of generated models. Although many single-model quality assessment (QA) methods have been developed, their accuracy is not high enough for most real applications. In this paper, a new approach based on C-α atoms distance matrix and machine learning methods is proposed for single-model QA and the identification of native-like models. Different from existing energy/scoring functions and consensus approaches, this new approach is purely geometry based. Furthermore, a novel algorithm based on deep learning techniques, called DL-Pro, is proposed. For a protein model, DL-Pro uses its distance matrix that contains pairwise distances between two residues' C-α atoms in the model, which sometimes is also called contact map, as an orientation-independent representation. From training examples of distance matrices corresponding to good and bad models, DL-Pro learns a stacked autoencoder network as a classifier. In experiments on selected targets from the Critical Assessment of Structure Prediction (CASP) competition, DL-Pro obtained promising results, outperforming state-of-the-art energy/scoring functions, including OPUS-CA, DOPE, DFIRE, and RW.
Collapse
|