1
|
Li Y, Dong X. A RUSBoosted tree method for k-complex detection using tunable Q-factor wavelet transform and multi-domain feature extraction. Front Neurosci 2023; 17:1108059. [PMID: 36998730 PMCID: PMC10043251 DOI: 10.3389/fnins.2023.1108059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Accepted: 02/10/2023] [Indexed: 03/18/2023] Open
Abstract
BackgroundK-complex detection traditionally relied on expert clinicians, which is time-consuming and onerous. Various automatic k-complex detection-based machine learning methods are presented. However, these methods always suffered from imbalanced datasets, which impede the subsequent processing steps.New methodIn this study, an efficient method for k-complex detection using electroencephalogram (EEG)-based multi-domain features extraction and selection method coupled with a RUSBoosted tree model is presented. EEG signals are first decomposed using a tunable Q-factor wavelet transform (TQWT). Then, multi-domain features based on TQWT are pulled out from TQWT sub-bands, and a self-adaptive feature set is obtained from a feature selection based on the consistency-based filter for the detection of k-complexes. Finally, the RUSBoosted tree model is used to perform k-complex detection.ResultsExperimental outcomes manifest the efficacy of our proposed scheme in terms of the average performance of recall measure, AUC, and F10-score. The proposed method yields 92.41 ± 7.47%, 95.4 ± 4.32%, and 83.13 ± 8.59% for k-complex detection in Scenario 1 and also achieves similar results in Scenario 2.Comparison to state-of-the-art methodsThe RUSBoosted tree model was compared with three other machine learning classifiers [i.e., linear discriminant analysis (LDA), logistic regression, and linear support vector machine (SVM)]. The performance based on the kappa coefficient, recall measure, and F10-score provided evidence that the proposed model surpassed other algorithms in the detection of the k-complexes, especially for the recall measure.ConclusionIn summary, the RUSBoosted tree model presents a promising performance in dealing with highly imbalanced data. It can be an effective tool for doctors and neurologists to diagnose and treat sleep disorders.
Collapse
Affiliation(s)
- Yabing Li
- School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China
- Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China
- Xi'an Key Laboratory of Big Data and Intelligent Computing, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China
- *Correspondence: Yabing Li
| | - Xinglong Dong
- School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an, Shaanxi, China
| |
Collapse
|
2
|
Al-Salman W, Li Y, Oudah AY, Almaged S. Sleep stage classification in EEG signals using the clustering approach based probability distribution features coupled with classification algorithms. Neurosci Res 2023; 188:51-67. [PMID: 36152918 DOI: 10.1016/j.neures.2022.09.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 08/20/2022] [Accepted: 09/20/2022] [Indexed: 10/14/2022]
Abstract
Sleep scoring is one of the primary tasks for the classification of sleep stages in Electroencephalogram (EEG) signals. Manual visual scoring of sleep stages is time-consuming as well as being dependent on the experience of a highly qualified sleep expert. This paper aims to address these issues by developing a new method to automatically classify sleep stages in EEG signals. In this research, a robust method has been presented based on the clustering approach, coupled with probability distribution features, to identify six sleep stages with the use of EEG signals. Using this method, each 30-second EEG signal is firstly segmented into small epochs and then each epoch is divided into 60 sub-segments. Each sub-segment is decomposed into five levels by using a discrete wavelet transform (DWT) to obtain the approximation and detailed coefficient. The wavelet coefficient of each level is clustered using the k-means algorithm. Subsequently, features are extracted based on the probability distribution for each wavelet coefficient. The extracted features then are forwarded to the least squares support vector machine classifier (LS-SVM) to identify sleep stages. Comparisons with several existing methods are also made in this study. The proposed method for the classification of the sleep stages achieves an average accuracy rate of 97.4%. It can be an effective tool for sleep stages classification and can be useful for doctors and neurologists for diagnosing sleep disorders.
Collapse
Affiliation(s)
- Wessam Al-Salman
- School of Mathematics, Physics and Computing, University of Southern Queensland, Australia; University of Thi-Qar, College of Education for Pure Science, Iraq.
| | - Yan Li
- School of Mathematics, Physics and Computing, University of Southern Queensland, Australia; School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan, China
| | - Atheer Y Oudah
- University of Thi-Qar, College of Education for Pure Science, Iraq; Information and Communication Technology Research Group, Scientific Research Centre, Al-Ayen University, Thi-Qar, Iraq
| | | |
Collapse
|
3
|
Multi-objective flower pollination algorithm: a new technique for EEG signal denoising. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06757-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
4
|
Extracting Epileptic Features in EEGs Using a Dual-Tree Complex Wavelet Transform Coupled with a Classification Algorithm. Brain Res 2022; 1779:147777. [PMID: 34999060 DOI: 10.1016/j.brainres.2022.147777] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 12/07/2021] [Accepted: 01/02/2022] [Indexed: 11/24/2022]
Abstract
The detection of epileptic seizures from electroencephalogram (EEG) signals is traditionally performed by clinical experts through visual inspection. It is a long process, is error prone, and requires a highly trained expert. In this research, a new method is presented for seizure classification for EEG signals using a dual-tree complex wavelet transform (DT-CWT) and fast Fourier transform (FFT) coupled with a least square support vector machine (LS-SVM) classifier. In this method, each EEG signal is divided into four segments. Each segment is further split into smaller sub-segments. The DT-CWT is applied to decompose each sub-segment into detailed and approximation coefficients (real and imaginary parts). The obtained coefficients by the DT-CWT at each decomposition level are passed through an FFT to identify the relevant frequency bands. Finally, a set of effective features are extracted from the sub-segments, and are then forwarded to the LS-SVM classifier to classify epileptic EEGs. In this paper, two epileptic EEG databases from Bonn and Bern Universities are used to evaluate the extracted features using the proposed method. The experimental results demonstrate that the method obtained an average accuracy of 97.7% and 96.8% for the Bonn and Bern databases, respectively. The results prove that the proposed DT-CWT and FFT based features extraction is an effective way to extract discriminative information from brain signals. The obtained results are also compared to those by k-means and Naïve Bayes classifiers as well as with the results from the previous methods reported for classifying epileptic seizures and identifying the focal and non-focal EEG signals. The obtained results show that the proposed method outperforms the others and it is effective in detecting epileptic seziures in EEG signals. The technique can be adopted to aid neurologists to better diagnose neurological disorders and for an early seizure warning system.
Collapse
|
5
|
Al-Salman W, Li Y, Wen P. Detection of k-complexes in EEG signals using a multi-domain feature extraction coupled with a least square support vector machine classifier. Neurosci Res 2021; 172:26-40. [PMID: 33965451 DOI: 10.1016/j.neures.2021.03.012] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2020] [Revised: 03/22/2021] [Accepted: 03/31/2021] [Indexed: 01/28/2023]
Abstract
Sleep scoring is one of the primary tasks for the classification of sleep stages using electroencephalogram (EEG) signals. It is one of the most important diagnostic methods in sleep research and must be carried out with a high degree of accuracy because any errors in the scoring in the patient's sleep EEG recordings can cause serious problems. The aim of this research is to develop a new automatic method for detecting the most important characteristics in sleep stage 2 such as k-complexes based on multi-domain features. In this study, each EEG signal is divided into a set of segments using a sliding window technique. Based on extensive experiments during the training phase, the size of the sliding window is set to 0.5 s (s). Then a set of statistical, fractal, frequency and non-linear features are extracted from each epoch based on the time domain, Katz's algorithm, power spectrum density (PSD) and tunable Q-factor wavelet transform (TQWT). As a result, a vector of twenty-two features is obtained to represent each EEG segment. In order to detect k-complexes, the extracted features were analysed for their ability to detect the k-complex waveforms. Based on the analysis of the features, twelve out of twenty-two features are selected and forwarded to a least square support vector machine (LS-SVM) classifier to identify k-complexes in EEG signals. A set of various classification techniques of K-means and extreme learning machine classifiers are used to compare the obtained results and to evaluate the performance of the proposed method.The experimental results showed that the proposed method, based on multi-domain features, achieved better recognition results than other methods and classifiers. An average accuracy, sensitivity and specificity of 97.7 %, 97 %, and 94.2 % were obtained, respectively, with the CZ-A1 channel according to the R&K standard. The experimental results with high classification performance demonstrated that the technique can help doctors optimize the diagnosis and treatment of sleep disorders.
Collapse
Affiliation(s)
- Wessam Al-Salman
- School of Sciences, University of Southern Queensland, Australia; Thi-Qar University, College of Education for Pure Science, Iraq.
| | - Yan Li
- School of Sciences, University of Southern Queensland, Australia
| | - Peng Wen
- School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan, China
| |
Collapse
|
6
|
Singh D, Madhawan A, Roy J. Identification of multiple RNAs using feature fusion. Brief Bioinform 2021; 22:6272794. [PMID: 33971667 DOI: 10.1093/bib/bbab178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 04/08/2021] [Indexed: 11/13/2022] Open
Abstract
Detection of novel transcripts with deep sequencing has increased the demand for computational algorithms as their identification and validation using in vivo techniques is time-consuming, costly and unreliable. Most of these discovered transcripts belong to non-coding RNAs, a large group known for their diverse functional roles but lacks the common taxonomy. Thus, upon the identification of the absence of coding potential in them, it is crucial to recognize their prime functional category. To address this heterogeneity issue, we divide the ncRNAs into three classes and present RNA classifier (RNAC) that categorizes the RNAs into coding, housekeeping, small non-coding and long non-coding classes. RNAC utilizes the alignment-based genomic descriptors to extract statistical, local binary patterns and histogram features and fuse them to construct the classification models with extreme gradient boosting. The experiments are performed on four species, and the performance is assessed on multiclass and conventional binary classification (coding versus no-coding) problems. The proposed approach achieved >93% accuracy on both classification problems and also outperformed other well-known existing methods in coding potential prediction. This validates the usefulness of feature fusion for improved performance on both types of classification problems. Hence, RNAC is a valuable tool for the accurate identification of multiple RNAs .
Collapse
Affiliation(s)
- Dalwinder Singh
- National Agri-Food Biotechnology Institute, Sector 81, SAS Nagar, 140306, Punjab, India
| | - Akansha Madhawan
- National Agri-Food Biotechnology Institute, Sector 81, SAS Nagar, 140306, Punjab, India
| | - Joy Roy
- National Agri-Food Biotechnology Institute, Sector 81, SAS Nagar, 140306, Punjab, India
| |
Collapse
|
7
|
Zhang Y, Ma Z, Song J, Kong X, Guo Z, Jiang B. Comparing performance of iterative and non-iterative algorithms on various feature schemes for arrhythmia analysis. Methods 2021; 202:144-151. [PMID: 33839287 DOI: 10.1016/j.ymeth.2021.04.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 03/29/2021] [Accepted: 04/05/2021] [Indexed: 10/21/2022] Open
Abstract
To evaluate the performance of the classic machine learning algorithms and the effectiveness of various features, the iterative algorithms (i.e., support vector machine (SVM), and least-squares SVM (LS-SVM)) and non-iterative algorithms (i.e., random forest (RF) and naive bayes (NB)) for six feature schemes were performed to classify the ECG recordings. The ECG recordings were initially filtered with a 0.1 Hz - 12 Hz band pass filter. Then 80 features, including 48 time domain, 18 frequency domain, 12 time-frequency and two principle component analysis (PCA) features, were extracted to construct six feature schemes. The RF, SVM, LS-SVM and NB were employed to discern a binary-classification task (i.e., normal and AF ECG recordings) and a tri- classification task (i.e., the normal, AF and ST change ECG recordings) for the six feature schemes. The results revealed that time domain, frequency domain features and PCA features can provide relatively reliable feature combinations to the RF and SVM. In addition, the RF yielded the highest F1-scores (0.8908 and 0.7535) for the binary-classification task and the tri-classification task than the SVM, LS-SVM and NB.
Collapse
Affiliation(s)
- Yatao Zhang
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China.
| | - Zhenguo Ma
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China; School of Computer Science & Technology, University of Science & Technology of China, Hefei, China
| | - Jiarui Song
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China
| | - Xiaoming Kong
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China
| | - Ziyu Guo
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China
| | - Bin Jiang
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China.
| |
Collapse
|
8
|
Gopan K. G, Prabhu SS, Sinha N. Sleep EEG analysis utilizing inter-channel covariance matrices. Biocybern Biomed Eng 2020. [DOI: 10.1016/j.bbe.2020.01.013] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
9
|
Al-Salman W, Li Y, Wen P. K-complexes Detection in EEG Signals using Fractal and Frequency Features Coupled with an Ensemble Classification Model. Neuroscience 2019; 422:119-133. [PMID: 31682947 DOI: 10.1016/j.neuroscience.2019.10.034] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Revised: 10/21/2019] [Accepted: 10/21/2019] [Indexed: 11/29/2022]
Abstract
K-complexes are important transient bio-signal waveforms in sleep stage 2. Detecting k-complexes visually requires a highly qualified expert. In this study, an efficient method for detecting k-complexes from electroencephalogram (EEG) signals based on fractal and frequency features coupled with an ensemble model of three classifiers is presented. EEG signals are first partitioned into segments, using a sliding window technique. Then, each EEG segment is decomposed using a dual-tree complex wavelet transform (DT-CWT) to a set of real and imaginary parts. A total of 10 sub-bands are used based on four levels of decomposition, and the high sub-bands are considered in this research for feature extraction. Fractal and frequency features based on DT-CWT and Higuchi's algorithm are pulled out from each sub-band and then forwarded to an ensemble classifier to detect k-complexes. A twelve-feature set is finally used to detect the sleep EEG characteristics using the ensemble model. The ensemble model is designed using a combination of three classification techniques including a least square support vector machine (LS-SVM), k-means and Naïve Bayes. The proposed method for the detection of the k-complexes achieves an average accuracy rate of 97.3 %. The results from the ensemble classifier were compared with those by individual classifiers. Comparisons were also made with existing k-complexes detection approaches for which the same datasets were used. The results demonstrate that the proposed approach is efficient in identifying the k-complexes in EEG signals; it yields optimal results with a window size 0.5 s. It can be an effective tool for sleep stages classification and can be useful for doctors and neurologists for diagnosing sleep disorders.
Collapse
Affiliation(s)
- Wessam Al-Salman
- School of Agricultural, Computational and Environmental Sciences, University of Southern Queensland, Australia; College of Education for Pure Science, University of Thi-Qar, Iraq.
| | - Yan Li
- School of Agricultural, Computational and Environmental Sciences, University of Southern Queensland, Australia; School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan, China
| | - Peng Wen
- School of Mechanical and Electrical Engineering, University of Southern Queensland, Toowoomba, QLD, Australia
| |
Collapse
|
10
|
Al-Salman W, Li Y, Wen P. Detection of EEG K-Complexes Using Fractal Dimension of Time Frequency Images Technique Coupled With Undirected Graph Features. Front Neuroinform 2019; 13:45. [PMID: 31316365 PMCID: PMC6609999 DOI: 10.3389/fninf.2019.00045] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2018] [Accepted: 05/29/2019] [Indexed: 11/17/2022] Open
Abstract
K-complexes identification is a challenging task in sleep research. The detection of k-complexes in electroencephalogram (EEG) signals based on visual inspection is time consuming, prone to errors, and requires well-trained knowledge. Many existing methods for k-complexes detection rely mainly on analyzing EEG signals in time and frequency domains. In this study, an efficient method is proposed to detect k-complexes from EEG signals based on fractal dimension (FD) of time frequency (T-F) images coupled with undirected graph features. Firstly, an EEG signal is partitioned into smaller segments using a sliding window technique. Each EEG segment is passed through a spectrogram of short time Fourier transform (STFT) to obtain the T-F images. Secondly, the box counting method is applied to each T-F image to discover the FDs in EEG signals. A vector of FD features are extracted from each T-F image and then mapped into an undirected graph. The structural properties of the graphs are used as the representative features of the original EEG signals for the input of a least square support vector machine (LS-SVM) classifier. Key graphic features are extracted from the undirected graphs. The extracted graph features are forwarded to the LS-SVM for classification. To investigate the classification ability of the proposed feature extraction combined with the LS-SVM classifier, the extracted features are also forwarded to a k-means classifier for comparison. The proposed method is compared with several existing k-complexes detection methods in which the same datasets were used. The findings of this study shows that the proposed method yields better classification results than other existing methods in the literature. An average accuracy of 97% for the detection of the k-complexes is obtained using the proposed method. The proposed method could lead to an efficient tool for the scoring of automatic sleep stages which could be useful for doctors and neurologists in the diagnosis and treatment of sleep disorders and for sleep research.
Collapse
Affiliation(s)
- Wessam Al-Salman
- School of Agricultural, Computational and Environmental Sciences, University of Southern Queensland, Toowoomba, QLD, Australia.,College of Education for Pure Science, Thi-Qar University, Nasiriyah, Iraq
| | - Yan Li
- School of Agricultural, Computational and Environmental Sciences, University of Southern Queensland, Toowoomba, QLD, Australia.,School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan, China
| | - Peng Wen
- School of Agricultural, Computational and Environmental Sciences, University of Southern Queensland, Toowoomba, QLD, Australia
| |
Collapse
|