1
Muñoz-Mata BG, Dorantes-Méndez G, Piña-Ramírez O. Classification of Parkinson's disease severity using gait stance signals in a spatiotemporal deep learning classifier. Med Biol Eng Comput 2024. [PMID: 38884852] [DOI: 10.1007/s11517-024-03148-2]
Abstract
Parkinson's disease (PD) is a degenerative nervous system disorder involving motor disturbances. Motor alterations affect the gait according to the progression of PD and can be used by experts in movement disorders to rate the severity of the disease. However, this rating depends on the expertise of the clinical specialist, so the diagnosis may be inaccurate, particularly in the early stages of PD, where abnormal gait patterns can also result from normal aging or other medical conditions. Consequently, several classification systems have been developed to enhance PD diagnosis. In this paper, a PD gait severity classification algorithm was developed using vertical ground reaction force (VGRF) signals. The VGRF records used are from a public database that includes 93 PD patients and 72 healthy control adults. The work presented here focuses on modeling each foot's gait stance-phase signals using a modified convolutional long short-term memory deep neural network (CLDNN) architecture. Subsequently, the results of each model are combined to predict PD severity. The classifier performance was evaluated using ten-fold cross-validation. The best weighted accuracies obtained were 99.296 (0.128)% and 99.343 (0.182)% with the Hoehn and Yahr and UPDRS scales, respectively, outperforming previous results in the literature. The proposed classifier can effectively differentiate gait patterns of different PD severity levels based on stance-phase gait signals.
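As an illustration of the stance-phase extraction this entry implies, the sketch below segments a VGRF trace by thresholding foot-ground contact. The 20 N threshold and the toy force values are illustrative assumptions, not the authors' parameters.

```python
import numpy as np

def stance_segments(vgrf, threshold=20.0):
    """Return (start, end) index pairs where VGRF exceeds `threshold`,
    i.e. the foot is in contact with the ground (stance phase)."""
    contact = vgrf > threshold
    # Rising/falling edges of the boolean contact mask mark stance starts/ends.
    edges = np.diff(contact.astype(int))
    starts = np.where(edges == 1)[0] + 1
    ends = np.where(edges == -1)[0] + 1
    if contact[0]:
        starts = np.insert(starts, 0, 0)
    if contact[-1]:
        ends = np.append(ends, len(vgrf))
    return list(zip(starts.tolist(), ends.tolist()))

# Toy force trace in newtons: swing (~0 N), stance (~700 N), swing, stance.
force = np.array([0, 0, 650, 700, 690, 0, 0, 660, 705, 0], dtype=float)
print(stance_segments(force))  # [(2, 5), (7, 9)]
```

Each returned segment can then be resampled to a fixed length and fed to the per-foot model.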
Affiliation(s)
- Brenda G Muñoz-Mata
- Facultad de Ciencias, Universidad Autónoma de San Luis Potosí, Av. Parque Chapultepec 1570, San Luis Potosí, 78295, San Luis Potosí, México
- Guadalupe Dorantes-Méndez
- Facultad de Ciencias, Universidad Autónoma de San Luis Potosí, Av. Parque Chapultepec 1570, San Luis Potosí, 78295, San Luis Potosí, México
- Omar Piña-Ramírez
- Departamento de Bioinformática y Análisis Estadísticos, Instituto Nacional de Perinatología "Isidro Espinosa de los Reyes", Montes Urales 800, Ciudad de México, 11000, Ciudad de México, México
2
Akinpelu S, Viriri S, Adegun A. An enhanced speech emotion recognition using vision transformer. Sci Rep 2024; 14:13126. [PMID: 38849422] [PMCID: PMC11161461] [DOI: 10.1038/s41598-024-63776-4]
Abstract
In human-computer interaction systems, speech emotion recognition (SER) plays a crucial role because it enables computers to understand and react to users' emotions. In the past, SER has relied heavily on acoustic properties extracted from speech signals. Recent developments in deep learning and computer vision, however, have made it possible to enhance SER performance with visual representations. This work proposes a novel method for improving speech emotion recognition using a lightweight Vision Transformer (ViT) model. We leverage the ViT model's capability to capture spatial dependencies and high-level features in images, which are adequate indicators of emotional states, from mel-spectrogram inputs fed into the model. To determine the efficiency of our proposed approach, we conduct comprehensive experiments on two benchmark speech emotion datasets, the Toronto English Speech Set (TESS) and the Berlin Emotional Database (EMODB). The results demonstrate a considerable improvement in speech emotion recognition accuracy, attesting to the approach's generalizability: it achieved 98%, 91%, and 93% accuracy on TESS, EMODB, and the combined TESS-EMODB set, respectively. The comparative experiments show that the non-overlapping patch-based feature extraction method substantially improves on other state-of-the-art techniques. Our research indicates the potential of integrating vision transformer models into SER systems, opening up fresh opportunities for real-world applications requiring accurate emotion recognition from speech.
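The non-overlapping patch tokenisation credited above can be sketched as a plain reshape; the 16x16 patch size (a common ViT default) and the random spectrogram are assumptions for illustration, not the paper's configuration.

```python
import numpy as np

def to_patches(spec, patch=16):
    """Split a (n_mels, n_frames) spectrogram into flattened,
    non-overlapping patch x patch tiles (ViT-style tokenisation)."""
    h, w = spec.shape
    h, w = h - h % patch, w - w % patch   # crop to a multiple of the patch size
    tiles = (spec[:h, :w]
             .reshape(h // patch, patch, w // patch, patch)
             .transpose(0, 2, 1, 3)       # group the two patch-grid axes first
             .reshape(-1, patch * patch))
    return tiles

mel = np.random.rand(128, 130)            # e.g. 128 mel bands x 130 time frames
tokens = to_patches(mel, patch=16)
print(tokens.shape)  # (64, 256): an 8 x 8 grid of patches, each flattened to 256 values
```

Each row would then be linearly projected and position-encoded before entering the transformer encoder.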
Affiliation(s)
- Samson Akinpelu
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban, 4001, South Africa
- Serestina Viriri
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban, 4001, South Africa
- Adekanmi Adegun
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Durban, 4001, South Africa
3
Zhang C, Su L, Li S, Fu Y. Differential Brain Activation for Four Emotions in VR-2D and VR-3D Modes. Brain Sci 2024; 14:326. [PMID: 38671977] [PMCID: PMC11048237] [DOI: 10.3390/brainsci14040326]
Abstract
Similar to traditional imaging, virtual reality (VR) imagery encompasses nonstereoscopic (VR-2D) and stereoscopic (VR-3D) modes. Russell's emotional model has been extensively studied in traditional 2D and VR-3D modes, but there is limited comparative research between VR-2D and VR-3D. In this study, we investigate whether Russell's emotional model exhibits stronger brain activation in VR-3D mode than in VR-2D mode. In an experiment covering four emotional categories (high arousal-high pleasure (HAHV), high arousal-low pleasure (HALV), low arousal-low pleasure (LALV), and low arousal-high pleasure (LAHV)), EEG signals were collected from 30 healthy undergraduate and graduate students while they watched videos in both VR modes. Power spectral density (PSD) computations revealed distinct brain activation patterns across emotional states in the two modes, with VR-3D videos inducing significantly higher brainwave energy, primarily in the frontal, temporal, and occipital regions. Differential entropy (DE) feature sets, selected via a dual ten-fold cross-validated Support Vector Machine (SVM) classifier, demonstrated satisfactory classification accuracy, which was superior in the VR-3D mode. The paper then presents a deep learning-based EEG emotion recognition framework that exploits the frequency, spatial, and temporal information of EEG data to improve recognition accuracy. The contribution of each feature to the prediction probabilities is discussed through Shapley-value-based machine-learning interpretability. The study reveals notable differences in brain activation for identical emotions between the two modes, with VR-3D showing more pronounced activation.
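The differential entropy (DE) features mentioned here are conventionally computed per frequency band under a Gaussian assumption, DE = 0.5 * ln(2 * pi * e * sigma^2). A minimal sketch, not the authors' exact pipeline:

```python
import numpy as np

def differential_entropy(x):
    """DE of a band-limited EEG segment under a Gaussian assumption:
    0.5 * ln(2 * pi * e * sigma^2), with sigma^2 the sample variance."""
    return 0.5 * np.log(2 * np.pi * np.e * np.var(x))

rng = np.random.default_rng(0)
seg = rng.normal(0.0, 1.0, 50_000)   # unit-variance noise stands in for one filtered EEG band
print(round(differential_entropy(seg), 2))  # ~1.42, i.e. 0.5 * ln(2 * pi * e)
```

In practice one band-pass filters each channel (e.g. theta, alpha, beta, gamma) and computes DE per channel-band pair to build the feature vector.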
Affiliation(s)
- Lei Su
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China; (C.Z.); (S.L.); (Y.F.)
4
Pentari A, Kafentzis G, Tsiknakis M. Speech emotion recognition via graph-based representations. Sci Rep 2024; 14:4484. [PMID: 38396002] [PMCID: PMC10891082] [DOI: 10.1038/s41598-024-52989-2]
Abstract
Speech emotion recognition (SER) has gained increased interest during the last decades as part of enriched affective computing. As a consequence, a variety of engineering approaches have been developed to address the SER problem, exploiting different features, learning algorithms, and datasets. In this paper, we propose applying graph theory to classify emotionally colored speech signals. Graph theory provides tools for extracting statistical as well as structural information from any time series; we propose using this information as a novel feature set. Furthermore, we suggest setting a unique feature-based identity for each emotion belonging to each speaker. Emotion classification is performed by a Random Forest classifier in a Leave-One-Speaker-Out Cross-Validation (LOSO-CV) scheme. The proposed method is compared with two state-of-the-art approaches involving well-known hand-crafted features as well as deep learning architectures operating on mel spectrograms. Experimental results on three datasets, EMODB (German, acted), AESDD (Greek, acted), and DEMoS (Italian, in-the-wild), reveal that our proposed method outperforms the comparative methods. Specifically, we observe an average UAR increase of almost [Formula: see text], [Formula: see text], and [Formula: see text], respectively.
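The abstract does not name the time-series-to-graph mapping it uses; one common choice in this line of work is the natural visibility graph, sketched below purely as an illustrative assumption. Statistics of the resulting graph (e.g. node degrees) then serve as features.

```python
def visibility_edges(series):
    """Natural visibility graph: samples become nodes, and i, j are linked
    when every intermediate sample lies strictly below the straight line
    joining (i, series[i]) and (j, series[j])."""
    n = len(series)
    edges = set()
    for i in range(n):
        for j in range(i + 1, n):
            line = lambda k: series[i] + (series[j] - series[i]) * (k - i) / (j - i)
            if all(series[k] < line(k) for k in range(i + 1, j)):
                edges.add((i, j))
    return edges

e = visibility_edges([1.0, 4.0, 2.0, 3.0])   # the peak at index 1 blocks (0,2) and (0,3)
print(sorted(e))  # [(0, 1), (1, 2), (1, 3), (2, 3)]
degrees = [sum(i in edge for edge in e) for i in range(4)]  # simple structural features
```

The O(n^3) loop is for clarity; real pipelines use faster divide-and-conquer constructions.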
Affiliation(s)
- Anastasia Pentari
- Institute of Computer Science, Foundation for Research and Technology-Hellas, Heraklion, GR-700 13, Greece
- George Kafentzis
- Computer Science Department, University of Crete, Heraklion, GR-700 13, Greece
- Manolis Tsiknakis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, Heraklion, GR-700 13, Greece
- Department of Electrical and Computer Engineering, Hellenic Mediterranean University, Heraklion, Greece
5
Sang B, Wen H, Junek G, Neveu W, Di Francesco L, Ayazi F. An Accelerometer-Based Wearable Patch for Robust Respiratory Rate and Wheeze Detection Using Deep Learning. Biosensors 2024; 14:118. [PMID: 38534225] [DOI: 10.3390/bios14030118]
Abstract
Wheezing is a critical indicator of various respiratory conditions, including asthma and chronic obstructive pulmonary disease (COPD). Current diagnosis relies on subjective lung auscultation by physicians. Enabling this capability via a low-profile, objective wearable device for remote patient monitoring (RPM) could offer pre-emptive, accurate respiratory data to patients. To this end, we used a low-profile accelerometer-based wearable system that applies deep learning to objectively detect wheezing along with respiration rate using a single sensor. The miniature patch consists of a sensitive wideband MEMS accelerometer and low-noise CMOS interface electronics on a small board, which was placed on nine conventional lung auscultation sites on the patient's chest wall to capture pulmonary-induced vibrations (PIVs). A deep learning model was developed and compared with a deterministic time-frequency method for objectively detecting wheezing in the PIV signals, using data captured from 52 diverse patients with respiratory diseases. The wearable accelerometer patch, paired with the deep learning model, demonstrated high fidelity in capturing and detecting respiratory wheezes and patterns across diverse and pertinent settings. It achieved accuracy, sensitivity, and specificity of 95%, 96%, and 93%, respectively, with an AUC of 0.99 on the test set, outperforming the deterministic time-frequency approach. Furthermore, the accelerometer patch outperforms digital stethoscopes in sound analysis while offering immunity to ambient sounds, which not only improves data quality and computational wheeze detection performance by a significant margin but also provides a robust sensor solution that can quantify respiration patterns simultaneously.
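The reported accuracy, sensitivity, and specificity follow directly from the confusion matrix; a small self-contained sketch with made-up labels (1 = wheeze, 0 = no wheeze):

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity (recall on the wheeze class) and specificity."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return {"accuracy": (tp + tn) / len(y_true),
            "sensitivity": tp / (tp + fn),
            "specificity": tn / (tn + fp)}

truth = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]   # invented ground-truth labels
preds = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]   # invented model predictions
print({k: round(v, 3) for k, v in binary_metrics(truth, preds).items()})
# {'accuracy': 0.8, 'sensitivity': 0.75, 'specificity': 0.833}
```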
Affiliation(s)
- Brian Sang
- School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Haoran Wen
- StethX Microsystems Inc., Atlanta, GA 30308, USA
- Wendy Neveu
- Department of Medicine, Emory University School of Medicine, Atlanta, GA 30322, USA
- Lorenzo Di Francesco
- Department of Medicine, Emory University School of Medicine, Atlanta, GA 30322, USA
- Farrokh Ayazi
- School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
- StethX Microsystems Inc., Atlanta, GA 30308, USA
6
Xu C, Liu Y, Song W, Liang Z, Chen X. A New Network Structure for Speech Emotion Recognition Research. Sensors (Basel) 2024; 24:1429. [PMID: 38474965] [DOI: 10.3390/s24051429]
Abstract
Deep learning has driven breakthroughs in emotion recognition in many fields, especially speech emotion recognition (SER). As an important part of SER, extraction of the most relevant acoustic features has long attracted researchers' attention. To address the problem that emotional information in speech signals is dispersed, and that existing models cannot comprehensively integrate local and global information, this paper presents a network model based on a gated recurrent unit (GRU) and multi-head attention. We evaluate the proposed model on the IEMOCAP and Emo-DB corpora. The experimental results show that the network model based on Bi-GRU and multi-head attention is significantly better than traditional network models on multiple evaluation metrics. We also apply the model to a speech sentiment analysis task; on the CH-SIMS and MOSI datasets, it shows excellent generalization performance.
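A minimal numpy sketch of the multi-head (self-)attention building block named above; the identity Q/K/V projections and the shapes are simplifying assumptions, not the paper's implementation:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def multi_head_attention(x, n_heads):
    """Self-attention over a (seq_len, d_model) sequence; each head
    attends over its own slice of the feature dimension."""
    seq, d = x.shape
    dh = d // n_heads                                # per-head dimension
    heads = []
    for h in range(n_heads):
        q = k = v = x[:, h * dh:(h + 1) * dh]        # identity projections for brevity
        scores = softmax(q @ k.T / np.sqrt(dh))      # (seq, seq) attention weights
        heads.append(scores @ v)                     # weighted sum of values
    return np.concatenate(heads, axis=1)             # back to (seq, d_model)

x = np.random.rand(5, 8)          # e.g. 5 Bi-GRU output steps, d_model = 8
out = multi_head_attention(x, n_heads=2)
print(out.shape)  # (5, 8)
```

In the described architecture this block would sit on top of the Bi-GRU outputs so that each time step can integrate global context.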
Affiliation(s)
- Chunsheng Xu
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China
- Yunqing Liu
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China
- Wenjun Song
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China
- Zonglin Liang
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China
- Xing Chen
- School of Electronic Information Engineering, Changchun University of Science and Technology, Changchun 130022, China
7
Eguchi K, Yaguchi H, Kudo I, Kimura I, Nabekura T, Kumagai R, Fujita K, Nakashiro Y, Iida Y, Hamada S, Honma S, Takei A, Moriwaka F, Yabe I. Differentiation of speech in Parkinson's disease and spinocerebellar degeneration using deep neural networks. J Neurol 2024; 271:1004-1012. [PMID: 37989963] [DOI: 10.1007/s00415-023-12091-5]
Abstract
INTRODUCTION: Assessing dysarthria features in patients with neurodegenerative diseases helps diagnose the underlying pathologies. Although deep neural network (DNN) techniques have been widely adopted in various audio processing tasks, few studies have tested whether DNNs can help differentiate neurodegenerative diseases using patients' speech. This study evaluated whether a DNN model with a transformer architecture could differentiate patients with Parkinson's disease (PD) from patients with spinocerebellar degeneration (SCD) using speech data. METHODS: Speech data were obtained from 251 patients with PD and 101 patients with SCD while they read a passage. We fine-tuned a pre-trained DNN model using log-mel spectrograms generated from the speech data. The model was trained to predict whether an input spectrogram came from a patient with PD or SCD. We used fivefold cross-validation to evaluate predictive performance in terms of the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, and specificity. RESULTS: The average ± standard deviation of the AUC, accuracy, sensitivity, and specificity over the fivefold cross-validation were 0.93 ± 0.04, 0.87 ± 0.03, 0.83 ± 0.05, and 0.89 ± 0.05, respectively. CONCLUSION: The DNN model can differentiate the speech of patients with PD from that of patients with SCD with relatively high accuracy and AUC. The proposed method can serve as a non-invasive, easy-to-perform screening tool to differentiate PD from SCD from patient speech and is expected to be applicable to telemedicine.
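The AUC reported here can be computed from raw model scores via its Mann-Whitney interpretation: the probability that a randomly chosen positive scores above a randomly chosen negative. The scores below are invented for illustration:

```python
def auc(scores_pos, scores_neg):
    """ROC AUC as the fraction of (positive, negative) pairs ranked
    correctly by the model's scores; ties count as half a win."""
    wins = sum((p > n) + 0.5 * (p == n)
               for p in scores_pos for n in scores_neg)
    return wins / (len(scores_pos) * len(scores_neg))

pd_scores = [0.9, 0.8, 0.6, 0.4]   # invented model outputs for PD recordings
scd_scores = [0.7, 0.3, 0.2]       # invented model outputs for SCD recordings
print(round(auc(pd_scores, scd_scores), 3))  # 0.833
```

This O(n*m) pairwise form is exact but slow; rank-statistic implementations give the same value in O(n log n).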
Affiliation(s)
- Katsuki Eguchi
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Department of Neurology, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, Kita 15, Nishi 7, Kita-ku, Sapporo, 060-8638, Japan
- Hiroaki Yaguchi
- Department of Neurology, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, Kita 15, Nishi 7, Kita-ku, Sapporo, 060-8638, Japan
- Ikue Kudo
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Ibuki Kimura
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Tomoko Nabekura
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Ryuto Kumagai
- Sapporo Parkinson MS Neurological Clinic, Sapporo Kita Sky Building F12, 7-6, Kita 7-Nishi 5, Kita-ku, Sapporo, Hokkaido, 060-0807, Japan
- Kenichi Fujita
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Yuichi Nakashiro
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Yuki Iida
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Shinsuke Hamada
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Sanae Honma
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Asako Takei
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Fumio Moriwaka
- Hokuyukai Neurological Hospital, 4-30, 2jo, 2cho-me, Nijuyonken, Nishi-ku, Sapporo, 063-0802, Japan
- Ichiro Yabe
- Department of Neurology, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, Kita 15, Nishi 7, Kita-ku, Sapporo, 060-8638, Japan
8
Başaran OT, Can YS, André E, Ersoy C. Relieving the burden of intensive labeling for stress monitoring in the wild by using semi-supervised learning. Front Psychol 2024; 14:1293513. [PMID: 38250116] [PMCID: PMC10797089] [DOI: 10.3389/fpsyg.2023.1293513]
Abstract
Stress, a natural process affecting individuals' wellbeing, has a profound impact on overall quality of life. Researchers from diverse fields employ various technologies and methodologies to investigate it and alleviate the negative effects of this phenomenon. Wearable devices, such as smart bands, capture physiological data, including heart rate variability, motions, and electrodermal activity, enabling stress level monitoring through machine learning models. However, labeling data for model accuracy assessment poses a significant challenge in stress-related research due to incomplete or inaccurate labels provided by individuals in their daily lives. To address this labeling predicament, our study proposes implementing Semi-Supervised Learning (SSL) models. Through comparisons with deep learning-based supervised models and clustering-based unsupervised models, we evaluate the performance of our SSL models. Our experiments show that our SSL models achieve 77% accuracy with a classifier trained on an augmented dataset prepared using the label propagation (LP) algorithm. Additionally, our deep autoencoder network achieves 76% accuracy. These results highlight the superiority of SSL models over unsupervised learning techniques and their comparable performance to supervised learning models, even with limited labeled data. By relieving the burden of labeling in daily life stress recognition, our study advances stress-related research, recognizing stress as a natural process rather than a disease. This facilitates the development of more efficient and accurate stress monitoring methods in the wild.
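A minimal sketch of label propagation (LP) in the spirit of the augmentation step described above, diffusing a few known labels over an RBF affinity graph; the data, gamma, and clamping scheme are illustrative assumptions rather than the study's configuration:

```python
import numpy as np

def propagate_labels(X, y, iters=50, gamma=1.0):
    """y holds 0/1 for labelled points and -1 for unlabelled ones.
    Label scores diffuse over an RBF affinity matrix; labelled rows
    are clamped back to their known values each iteration."""
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-gamma * d2)
    np.fill_diagonal(W, 0.0)
    P = W / W.sum(axis=1, keepdims=True)          # row-normalised transition matrix
    f = np.where(y == 1, 1.0, 0.0)                # initial score: 1 for known positives
    labelled = y != -1
    for _ in range(iters):
        f = P @ f
        f[labelled] = (y[labelled] == 1).astype(float)  # clamp the known labels
    return (f >= 0.5).astype(int)

# Two 1-D clusters with one labelled point each; the rest start unlabelled (-1).
X = np.array([[0.0], [0.1], [0.2], [5.0], [5.1], [5.2]])
y = np.array([0, -1, -1, 1, -1, -1])
print(propagate_labels(X, y))  # [0 0 0 1 1 1]
```

The propagated labels can then augment the training set for the downstream stress classifier, as in the LP-based pipeline described above.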
Affiliation(s)
- Osman Tugay Başaran
- Computer and Communication Systems (CCS) Labs, Telecommunication Networks Group (TKN), Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany
- Yekta Said Can
- Faculty of Applied Computer Science, Institute of Computer Science, Universität Augsburg, Augsburg, Germany
- Elisabeth André
- Faculty of Applied Computer Science, Institute of Computer Science, Universität Augsburg, Augsburg, Germany
- Cem Ersoy
- NETLAB Research Laboratory, Department of Computer Engineering, Bogazici University, Istanbul, Turkey
9
Chugh N, Aggarwal S, Balyan A. The Hybrid Deep Learning Model for Identification of Attention-Deficit/Hyperactivity Disorder Using EEG. Clin EEG Neurosci 2024; 55:22-33. [PMID: 37682533] [DOI: 10.1177/15500594231193511]
Abstract
Attention-deficit/hyperactivity disorder (ADHD) is a common behavioral disorder that prevents children from paying attention to tasks and interacting appropriately with their surroundings. Early and timely diagnosis of this disorder remains a significant challenge in studies of children's behavior. To diagnose it, doctors often rely on the patient's description, questionnaires, psychological tests, and observed behavior, whose reliability is questionable. The convolutional neural network (CNN) is one deep learning technique that has been used for ADHD diagnosis. CNNs, however, do not account for how signals change over time, which leads to low classification performance and ambiguous findings. In this study, the authors designed a hybrid deep learning model that combines long short-term memory (LSTM) and CNN to simultaneously extract and learn the spatial features and long-term dependencies of electroencephalography (EEG) data. The effectiveness of the proposed hybrid model was assessed using two publicly available EEG datasets. The suggested model achieves classification accuracies of 98.86% on the ADHD dataset and 98.28% on the FOCUS dataset. The experimental findings show that the proposed hybrid CNN-LSTM model outperforms state-of-the-art methods for diagnosing ADHD using EEG and could therefore help with the clinical diagnosis of ADHD patients.
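The LSTM half of such a hybrid captures the temporal dependencies a plain CNN misses. One gated LSTM step, written out in numpy as a sketch over CNN-style feature vectors (random weights, not a trained model):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step. W: (4H, D) input weights, U: (4H, H) recurrent
    weights, b: (4H,) bias; gates stacked in the order i, f, o, g."""
    H = h.size
    z = W @ x + U @ h + b
    i, f, o = (sigmoid(z[k * H:(k + 1) * H]) for k in range(3))
    g = np.tanh(z[3 * H:])
    c_new = f * c + i * g          # forget part of the old memory, write new content
    h_new = o * np.tanh(c_new)     # expose the gated memory as the output
    return h_new, c_new

rng = np.random.default_rng(1)
D, H = 4, 3                        # feature size (e.g. from the CNN) and hidden size
W, U, b = rng.normal(size=(4 * H, D)), rng.normal(size=(4 * H, H)), np.zeros(4 * H)
h = c = np.zeros(H)
for x in rng.normal(size=(10, D)): # run the cell over 10 feature vectors
    h, c = lstm_step(x, h, c, W, U, b)
print(h.shape)  # (3,)
```

In the hybrid model, the final hidden state (or the sequence of them) would feed a dense softmax layer for the ADHD/control decision.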
Affiliation(s)
- Nupur Chugh
- Netaji Subhas Institute of Technology, New Delhi, India
- Swati Aggarwal
- Netaji Subhas University of Technology, New Delhi, India
- Arnav Balyan
- Netaji Subhas Institute of Technology, New Delhi, India
10
Pyo J, Pachepsky Y, Kim S, Abbas A, Kim M, Kwon YS, Ligaray M, Cho KH. Long short-term memory models of water quality in inland water environments. Water Res X 2023; 21:100207. [PMID: 38098887] [PMCID: PMC10719578] [DOI: 10.1016/j.wroa.2023.100207]
Abstract
Water quality is substantially influenced by a multitude of dynamic and interrelated variables, including climate conditions, land use, and seasonal changes. Deep learning models have demonstrated predictive power for water quality owing to their ability to automatically learn complex patterns and relationships among variables. Long short-term memory (LSTM), a type of recurrent neural network that can capture longer-term traits of time-dependent data, is the network most widely applied to predicting time series of water quality variables. We first review applications of standalone LSTM and discuss its calculation time, prediction accuracy, and robustness relative to process-driven numerical models and other machine learning methods. The review then expands to LSTM models combined with data pre-processing techniques, including the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise method and the Synchrosqueezed Wavelet Transform, and focuses on the coupling of LSTM with convolutional neural networks, attention networks, and transfer learning; these coupled networks outperformed the standalone LSTM model. We also highlight the influence of static variables in the models and of transformation methods applied to the datasets. The review concludes with the outlook and remaining challenges for research on and application of LSTM in hydrology.
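Before any of the LSTM variants reviewed here can be trained, a water quality series is typically framed into lookback windows with a prediction target. A minimal sketch; the lookback of 3 and the toy series are assumptions for illustration:

```python
import numpy as np

def make_windows(series, lookback, horizon=1):
    """Frame a 1-D series into supervised (X, y) pairs: each X row holds
    `lookback` past values, y the value `horizon` steps ahead."""
    X, y = [], []
    for t in range(lookback, len(series) - horizon + 1):
        X.append(series[t - lookback:t])
        y.append(series[t + horizon - 1])
    return np.array(X), np.array(y)

chl_a = np.arange(10.0)            # stand-in for e.g. a chlorophyll-a record
X, y = make_windows(chl_a, lookback=3)
print(X.shape, y.shape)  # (7, 3) (7,)
print(X[0], y[0])        # [0. 1. 2.] 3.0
```

For multivariate inputs (flow, temperature, nutrients, plus static catchment variables), each window row becomes a (lookback, n_features) block instead of a vector.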
Affiliation(s)
- JongCheol Pyo
- Department for Environmental Engineering, Pusan National University, Busan 46241, Republic of Korea
- Yakov Pachepsky
- Environmental Microbial and Food Safety Laboratory, USDA-ARS, Beltsville, MD, USA
- Soobin Kim
- School of Civil, Urban, Earth, and Environmental Engineering, Ulsan National Institute of Science and Technology, 50 UNIST-gil, Ulju-gun, Ulsan 44919, Republic of Korea
- Disposal Safety Evaluation R&D Division, Korea Atomic Energy Research Institute (KAERI), 111, Daedeok-daero 989 beon-gil, Yuseong-gu, Daejeon 34057, Republic of Korea
- Ather Abbas
- Physical Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia
- Minjeong Kim
- Disposal Safety Evaluation R&D Division, Korea Atomic Energy Research Institute (KAERI), 111, Daedeok-daero 989 beon-gil, Yuseong-gu, Daejeon 34057, Republic of Korea
- Yong Sung Kwon
- Environmental Impact Assessment Team, Division of Ecological Assessment Research, National Institute of Ecology, Seocheon, Republic of Korea
- Mayzonee Ligaray
- Institute of Environmental Science and Meteorology, College of Science, University of the Philippines Diliman, Quezon City 1101, Philippines
- Kyung Hwa Cho
- School of Civil, Environmental and Architectural Engineering, Korea University, Seoul 02841, Republic of Korea
11
Wedasingha N, Samarasinghe P, Senevirathna L, Papandrea M, Puiatti A, Rankin D. Automated anomalous child repetitive head movement identification through transformer networks. Phys Eng Sci Med 2023; 46:1427-1445. [PMID: 37814077] [DOI: 10.1007/s13246-023-01309-5]
Abstract
The increasing prevalence of behavioral disorders in children is of growing concern within the medical community. There is consensus on the pivotal role of early identification of, and intervention for, atypical behaviors in improving outcomes. Due to inadequate facilities and a shortage of medical professionals with specialized expertise, traditional diagnostic methods have been unable to effectively address the rising incidence of behavioral disorders; hence, automated approaches to diagnosis are needed. The purpose of this study is to develop an automated model capable of analyzing videos to differentiate between typical and atypical repetitive head movements in children. To mitigate problems resulting from the limited availability of child datasets, various learning methods are employed. We present a fusion of transformer networks and Non-deterministic Finite Automata (NFA) techniques that classifies a child's repetitive head movements as typical or atypical based on an analysis of gender, age, and type of repetitive head movement, along with the count, duration, and frequency of each movement. Experimentation was carried out with different transfer learning methods to enhance model performance. Experimental results on five datasets (the NIR face dataset, the Bosphorus 3D face dataset, the ASD dataset, the SSBD dataset, and the Head Movements in the Wild dataset) indicate that our proposed model outperforms many state-of-the-art frameworks at distinguishing typical and atypical repetitive head movements in children.
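The count/duration/frequency attributes fed to the automaton stage can be derived by collapsing per-frame movement predictions into runs. The state-machine-like scan below is an illustrative sketch, not the authors' NFA; the labels, frame rate, and minimum run length are assumptions:

```python
def movement_runs(frame_labels, fps=30, min_frames=3):
    """Collapse per-frame movement labels (e.g. from a frame-level
    classifier) into runs, returning (movement type, duration in s)
    for each run of at least `min_frames` frames."""
    runs, start = [], 0
    for i in range(1, len(frame_labels) + 1):
        # A run ends at the sequence end or when the label changes.
        if i == len(frame_labels) or frame_labels[i] != frame_labels[start]:
            if frame_labels[start] != "none" and i - start >= min_frames:
                runs.append((frame_labels[start], (i - start) / fps))
            start = i
    return runs

frames = ["none"] * 4 + ["nod"] * 9 + ["none"] * 2 + ["shake"] * 6
print(movement_runs(frames))  # [('nod', 0.3), ('shake', 0.2)]
```

Counts per movement type and run frequency over the clip length follow directly from the returned list.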
Affiliation(s)
- Nushara Wedasingha
- Faculty of Computing, Sri Lanka Institute of Information Technology, New Kandy Rd, Malabe, 10115, Colombo, Sri Lanka
- Pradeepa Samarasinghe
- Faculty of Computing, Sri Lanka Institute of Information Technology, New Kandy Rd, Malabe, 10115, Colombo, Sri Lanka
- Lasantha Senevirathna
- Faculty of Computing, Sri Lanka Institute of Information Technology, New Kandy Rd, Malabe, 10115, Colombo, Sri Lanka
- Michela Papandrea
- Information Systems and Networking Institute (ISIN), University of Applied Sciences and Arts of Southern Switzerland, Via Pobiette, Manno, 6928, Switzerland
- Alessandro Puiatti
- Institute of Digital Technologies for Personalized Healthcare (MeDiTech), University of Applied Sciences and Arts of Southern Switzerland, Via Pobiette, Manno, 6928, Switzerland
- Debbie Rankin
- School of Computing, Engineering and Intelligent Systems, Ulster University, Northland Road, Derry-Londonderry, BT48 7JL, Northern Ireland, UK
12
Xia L, Feng Y, Guo Z, Ding J, Li Y, Li Y, Ma M, Gan G, Xu Y, Luo J, Shi Z, Guan Y. MuLHiTA: A Novel Multiclass Classification Framework With Multibranch LSTM and Hierarchical Temporal Attention for Early Detection of Mental Stress. IEEE Trans Neural Netw Learn Syst 2023; 34:9657-9670. [PMID: 35385389] [DOI: 10.1109/tnnls.2022.3159573]
Abstract
Mental stress is an increasingly common psychological issue leading to diseases such as depression, addiction, and heart attack. In this study, an early detection framework based on electroencephalogram (EEG) data is developed for reducing the risk of these diseases. In existing frameworks, signals are often segmented into smaller sections prior to being input to a deep neural network. However, this approach ignores the fundamental nature of EEG signals as a carrier of valuable information (e.g., the integrity of frequency and phase, and temporal fluctuations of EEG components). As such, this type of segmenting may lead to information loss and a failure to effectively identify mental stress levels. Thus, we propose a novel multiclass classification framework termed multibranch LSTM and hierarchical temporal attention (MuLHiTA) for the early identification of mental stress levels. It specifically focuses on not only intraslice (within each slice) but also interslice (between different slices) samples in parallel. This was achieved by including two complementary branches, each of which integrated a specifically designed attention module into a bidirectional long short-term memory (BLSTM) network, enabling extraction of the most discriminative features from interslice and intraslice EEG signals simultaneously. The outputs of attention modules were then summed to obtain a feature representation that contributes to reduce overfitting and more effective multiclass classification. In addition, electrode positions were optimized using neural activity areas under high-stress conditions, thereby reducing computational costs by minimizing the number of critical electrodes. MuLHiTA was evaluated across one private [Montreal imaging stress task (MIST)] and two publicly available EEG datasets [EEG during mental arithmetic tasks (DMAT) and Simultaneous task EEG workload (STEW)]. 
These were divided into training and test sets using an 8:2 ratio, and the training data were further divided into training and validation sets using a fivefold cross-validation (CV) method, in which the model with the highest accuracy among the five was selected. The model was trained once more with the full training set, and the test data were then used to evaluate its performance. This approach achieved average classification accuracies of 93.58%, 91.80%, and 99.71% for the MIST, STEW, and DMAT datasets, respectively. Experimental results showed MuLHiTA was superior to state-of-the-art algorithms, including EEGNet, BLSTM, EEGLearn, convolutional neural network (CNN)-long short-term memory (LSTM), and convolutional recurrent attention model (CRAM), for multiclass classification. This demonstrates the viability of MuLHiTA for the early detection of mental stress.
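The evaluation protocol described above (an 8:2 train/test split, then fivefold cross-validation on the training portion) can be sketched in a few lines. The sample count and function names are illustrative, not from the paper:

```python
import random

def split_train_test(n_samples, test_ratio=0.2, seed=0):
    """Shuffle sample indices and split them with the paper's 8:2 ratio."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    n_test = int(n_samples * test_ratio)
    return idx[n_test:], idx[:n_test]

def five_fold(train_idx, k=5):
    """Yield (train, validation) index lists for k-fold cross-validation."""
    folds = [train_idx[i::k] for i in range(k)]
    for i in range(k):
        yield [j for f in range(k) if f != i for j in folds[f]], folds[i]

train, test = split_train_test(100)   # 80 training / 20 test indices
splits = list(five_fold(train))       # 5 (train, validation) pairs
```

After the best of the five fold models is selected, the paper retrains on the full training set before the held-out test data are touched.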
13. Chung Y, Lee H. Joint triplet loss with semi-hard constraint for data augmentation and disease prediction using gene expression data. Sci Rep 2023; 13:18178. [PMID: 37875602 PMCID: PMC10598120 DOI: 10.1038/s41598-023-45467-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 10/19/2023] [Indexed: 10/26/2023] Open
Abstract
The accurate prediction of patients with complex diseases, such as Alzheimer's disease (AD), as well as disease stages, including early- and late-stage cancer, is challenging owing to substantial variability among patients and limited availability of clinical data. Deep metric learning has emerged as a promising approach for addressing these challenges by improving data representation. In this study, we propose a joint triplet loss model with a semi-hard constraint (JTSC) to represent data in a small number of samples. JTSC strictly selects semi-hard samples by switching anchors and positive samples during the learning process in triplet embedding and combines a triplet loss function with an angular loss function. Our results indicate that JTSC significantly improves the number of appropriately represented samples during training when applied to the gene expression data of AD and to cancer stage prediction tasks. Furthermore, we demonstrate that using an embedding vector from JTSC as an input to the classifiers for AD and cancer stage prediction significantly improves classification performance by extracting more accurate features. In conclusion, we show that feature embedding through JTSC can aid in classification when there are a small number of samples compared to a larger number of features.
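A generic semi-hard triplet selection step can be sketched in plain Python. This is not the full JTSC (which also switches anchor and positive roles and adds an angular loss term); the vectors and margin value are illustrative:

```python
def sq_dist(u, v):
    """Squared Euclidean distance between two embedding vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def semi_hard_triplet_loss(anchor, positive, negatives, margin=0.2):
    """Triplet loss averaged over semi-hard negatives: negatives that are
    farther from the anchor than the positive, but still within the margin."""
    d_ap = sq_dist(anchor, positive)
    semi_hard = [n for n in negatives
                 if d_ap < sq_dist(anchor, n) < d_ap + margin]
    if not semi_hard:  # fall back to the hardest (closest) negative
        semi_hard = [min(negatives, key=lambda n: sq_dist(anchor, n))]
    return sum(max(0.0, d_ap - sq_dist(anchor, n) + margin)
               for n in semi_hard) / len(semi_hard)

# one anchor/positive pair, one semi-hard and one easy negative
loss = semi_hard_triplet_loss([0.0, 0.0], [0.1, 0.0],
                              [[0.15, 0.0], [2.0, 2.0]])
```

Only the first negative falls inside the semi-hard band (farther than the positive, closer than positive distance plus margin), so only it contributes to the loss.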
Affiliation(s)
- Yeonwoo Chung
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, 61005, Republic of Korea
- Hyunju Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, 61005, Republic of Korea
- Artificial Intelligence Graduate School, Gwangju Institute of Science and Technology, Gwangju, 61005, Republic of Korea
14. Choi SB, Shin HS, Kim JW. Convolution Neural Networks for Motion Detection with Electrospun Reversibly-Cross-linkable Polymers and Encapsulated Ag Nanowires. ACS APPLIED MATERIALS & INTERFACES 2023; 15:47591-47603. [PMID: 37782487 DOI: 10.1021/acsami.3c11918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]
Abstract
This paper presents the design, fabrication, and implementation of a novel composite film, a polybutadiene-based urethane (PBU)/AgNW/PBU sensor (PAPS), demonstrating remarkable mechanical stability and precision in motion detection. The sensor capitalizes on the integration of Ag nanowire (AgNW) electrodes into a neutral plane, embedded within a reversibly cross-linkable PBU polymer. This arrangement confers pore-free and interfaceless sensor formation, resulting in enhanced mechanical robustness, reproducibility, and long-term reliability. The PBU polymer is subjected to an electrospinning process, followed by sequential Diels-Alder (DA) and retro-DA reactions to produce a planarized encapsulation layer. This electrospinning-based technology allows for more precise engineering of the neutral plane than conventional film lamination or layer-by-layer spin-coating processes. The encapsulation, matching the thickness of the preformed PBU film, effectively houses the AgNW electrodes. The PAPS outperforms conventional AgNW/PBU sensors (APS) in terms of mechanical stability and bending insensitivity. When affixed to various body parts, the PAPS generates distinctive signal curves reflecting the specific body part and the degree of motion involved. The utility of the PAPS is further extended by the application of machine learning and deep learning algorithms for signal interpretation. A K-means clustering analysis confirmed the superior reproducibility and consistency of the signals derived from the PAPS over the APS. Deep learning algorithms, including a singular 1D convolutional neural network (1D CNN), a long short-term memory (LSTM) network, and dual-layered combinations of 1D CNN + LSTM and LSTM + 1D CNN, were deployed for signal classification. The singular 1D CNN model displayed a classification accuracy exceeding 98%. The PAPS signifies a pivotal development in the field of intelligent motion sensors.
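The K-means consistency check mentioned above can be illustrated with a minimal Lloyd's-algorithm sketch on toy 2-D feature points (deterministic initialization for reproducibility; the data are invented, not the sensor signals):

```python
def kmeans(points, k, iters=20):
    """Plain Lloyd's algorithm with deterministic initialization:
    assign each point to its nearest centroid, then re-average."""
    centroids = [list(p) for p in points[:k]]   # first k points as seeds
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k),
                    key=lambda c: sum((a - b) ** 2
                                      for a, b in zip(p, centroids[c])))
            clusters[j].append(p)
        centroids = [[sum(dim) / len(c) for dim in zip(*c)] if c else centroids[j]
                     for j, c in enumerate(clusters)]
    return centroids, clusters

# two well-separated groups of 2-D "signal feature" points
pts = [[0.0, 0.1], [0.1, 0.0], [0.05, 0.05],
       [5.0, 5.1], [5.1, 5.0], [4.9, 5.05]]
centroids, clusters = kmeans(pts, k=2)
```

Signals from a reproducible sensor form tight, well-separated clusters like these; noisy or inconsistent signals blur the cluster boundaries.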
Affiliation(s)
- Su Bin Choi
- Department of Smart Fab Technology, Sungkyunkwan University, Suwon 16419, Republic of Korea
- Hyun Sik Shin
- Department of Smart Fab Technology, Sungkyunkwan University, Suwon 16419, Republic of Korea
- Jong-Woong Kim
- Department of Smart Fab Technology, Sungkyunkwan University, Suwon 16419, Republic of Korea
- School of Mechanical Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea
15. Elmezughi MK, Salih O, Afullo TJ, Duffy KJ. Path loss modeling based on neural networks and ensemble method for future wireless networks. Heliyon 2023; 9:e19685. [PMID: 37809436 PMCID: PMC10558953 DOI: 10.1016/j.heliyon.2023.e19685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 08/26/2023] [Accepted: 08/30/2023] [Indexed: 10/10/2023] Open
Abstract
In light of technological advancements that require faster data speeds, there has been an increasing demand for higher frequency bands. Consequently, numerous path loss prediction models have been developed for 5G and beyond communication networks, particularly in the millimeter-wave and subterahertz frequency ranges. Despite these efforts, there is a pressing need for more sophisticated models that offer greater flexibility and accuracy, particularly in challenging environments. Such advanced models support the deployment of wireless networks that cover communication environments with optimal quality of service. This paper presents path loss prediction models based on machine learning algorithms, namely an artificial neural network (ANN), a recurrent neural network based on long short-term memory (RNN-LSTM), and a convolutional neural network (CNN). Moreover, an ensemble-method-based neural network path loss model is proposed. Finally, an extensive performance analysis of the four models is provided regarding prediction accuracy, stability, the contribution of input features, and the time needed to run each model. The data used for training and testing were obtained from measurement campaigns conducted in an indoor corridor setting, covering both line-of-sight and non-line-of-sight communication scenarios. The main result of this study is that the ensemble-method-based model outperforms the other models (ANN, RNN-LSTM, and CNN) in terms of efficiency and prediction accuracy, and is a promising model for path loss in complex environments at high-frequency bands.
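A simple average ensemble of base regressors captures the idea of the combined model. The toy predictors below are stand-ins for the trained networks, and the paper's actual combination rule may weight the models differently:

```python
def ensemble_path_loss(models, features):
    """Average the path-loss predictions (in dB) of several base models."""
    predictions = [m(features) for m in models]
    return sum(predictions) / len(predictions)

# toy stand-ins for the trained ANN / RNN-LSTM / CNN regressors
ann  = lambda f: 60.0 + 2.0 * f["log_distance"]
lstm = lambda f: 62.0 + 1.8 * f["log_distance"]
cnn  = lambda f: 58.0 + 2.2 * f["log_distance"]

pl_db = ensemble_path_loss([ann, lstm, cnn], {"log_distance": 1.0})
```

Averaging reduces the variance of the individual predictors, which is one reason ensembles tend to be more stable than any single network.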
Affiliation(s)
- Mohamed K. Elmezughi
- The Discipline of Electrical, Electronic and Computer Engineering, University of KwaZulu-Natal, Durban, 4041, South Africa
- Omran Salih
- Institute of Systems Science, Durban University of Technology, Durban, 4000, South Africa
- Thomas J. Afullo
- The Discipline of Electrical, Electronic and Computer Engineering, University of KwaZulu-Natal, Durban, 4041, South Africa
- Kevin J. Duffy
- Institute of Systems Science, Durban University of Technology, Durban, 4000, South Africa
16. Obeso I, Yoon B, Ledbetter D, Aczon M, Laksana E, Zhou A, Eckberg RA, Mertan K, Khemani RG, Wetzel R. A Novel Application of Spectrograms with Machine Learning Can Detect Patient Ventilator Dyssynchrony. Biomed Signal Process Control 2023; 86:105251. [PMID: 37587924 PMCID: PMC10426752 DOI: 10.1016/j.bspc.2023.105251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Patients in intensive care units are frequently supported by mechanical ventilation. There is increasing awareness of patient-ventilator dyssynchrony (PVD), a mismatch between patient respiratory effort and the assistance provided by the ventilator, as a risk factor for infection, narcotic exposure, lung injury, and adverse neurocognitive effects. One of the most injurious consequences of PVD is double-cycled (DC) breaths, in which two breaths are delivered by the ventilator instead of one. Prior efforts to identify PVD have had limited efficacy. An automated method to identify PVD, independent of clinician expertise, acumen, or time, would permit early, targeted treatment to avoid further harm. We performed secondary analyses of data from a clinical trial of children with acute respiratory distress syndrome. Waveforms of ventilator flow, airway pressure, and esophageal manometry were annotated to identify DC breaths and underlying PVD subtypes. Spectrograms were generated from those waveforms to train convolutional neural network (CNN) models to detect DC breaths and the underlying PVD subtypes: reverse trigger (RT) and inadequate support (IS). The DC breath detection model yielded an AUROC of 0.980, while the multi-target detection model for underlying dyssynchrony yielded AUROCs of 0.980 (RT) and 0.976 (IS). When operating at 75% sensitivity, DC breath detection had a number needed to alert (NNA) of 1.3 (99% specificity), while underlying PVD detection had an NNA of 1.6 (98.5% specificity) for RT and an NNA of 4.0 (98.2% specificity) for IS. CNNs using spectrograms of ventilator waveforms can identify DC breaths and detect the underlying PVD for targeted clinical interventions.
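Turning a ventilator waveform into a spectrogram image for a CNN amounts to computing windowed DFT magnitudes over sliding frames. A minimal sketch, with an illustrative window length, hop, and test tone (not the study's actual preprocessing parameters):

```python
import cmath
import math

def spectrogram(signal, win=64, hop=32):
    """Magnitude spectrogram: Hann-windowed frames, one DFT column per frame
    (only the win//2 + 1 non-redundant bins of a real signal are kept)."""
    cols = []
    for start in range(0, len(signal) - win + 1, hop):
        frame = signal[start:start + win]
        windowed = [x * 0.5 * (1 - math.cos(2 * math.pi * n / (win - 1)))
                    for n, x in enumerate(frame)]
        cols.append([abs(sum(x * cmath.exp(-2j * math.pi * k * n / win)
                             for n, x in enumerate(windowed)))
                     for k in range(win // 2 + 1)])
    return cols  # time x frequency

# 256-sample toy "ventilator flow" trace: a pure tone centred on DFT bin 8
sig = [math.sin(2 * math.pi * 8 * t / 64) for t in range(256)]
spec = spectrogram(sig)
```

Each column becomes one time step of the image the CNN sees; the tone shows up as a bright horizontal band at its frequency bin.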
Affiliation(s)
- Ishmael Obeso, Benjamin Yoon, David Ledbetter, Melissa Aczon, Eugene Laksana, Alice Zhou, Andrew Eckberg, Keith Mertan, Robinder G. Khemani, and Randall Wetzel are with the Children’s Hospital Los Angeles, California
17. Liu K, Xie X, Yan J, Zhang S, Zhang H. An adsorption isotherm identification method based on CNN-LSTM neural network. J Mol Model 2023; 29:301. [PMID: 37651008 DOI: 10.1007/s00894-023-05704-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Accepted: 08/21/2023] [Indexed: 09/01/2023]
Abstract
CONTEXT: The morphology of adsorption isotherms embodies a wealth of information about the underlying adsorption mechanisms, making classification and identification methodologies based on isotherm shape crucial. While classification techniques have been extensively developed, traditional methods of adsorption isotherm identification suffer from inefficiency and a high margin of error. Neural-network-based methodologies for adsorption isotherm identification counter these shortcomings, as they enable fast online identification while delivering precise results. In this paper, we deploy a hybrid of convolutional neural networks (CNN) and long short-term memory (LSTM) networks for the identification of adsorption isotherms. Extensive theoretical adsorption isotherms are generated via adsorption equations, forming a comprehensive training database and circumventing the need for time-consuming and costly repeated experiments. The F1-score, receiver operating characteristic (ROC) curves, and area under the ROC curve (AUC) are introduced as criteria to evaluate the identification performance and generalization ability of the model during the testing phase. The results highlight the model's strong performance on the adsorption isotherm identification task, with accuracy rates of 100% on both the training and validation sets. The mean F1-score on the testing set reached 0.8885, with both macro-average and micro-average AUC exceeding 0.95. METHOD: PyCharm was employed as the experimental and testing platform, with Python 3.9 serving as the programming language. TensorFlow 2.11.0 and Keras 2.10.0 were used for training and testing the CNN-LSTM, while numpy 1.21.5 and scipy 1.8.1 were utilized for creating the training and validation datasets.
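The macro- and micro-averaged F1-scores used as evaluation criteria can be computed directly from a confusion matrix; the toy matrix below is illustrative:

```python
def f1_scores(conf):
    """conf[i][j] = number of class-i samples predicted as class j.
    Returns (macro_f1, micro_f1)."""
    k = len(conf)
    per_class = []
    tp_all = fp_all = fn_all = 0
    for c in range(k):
        tp = conf[c][c]
        fp = sum(conf[r][c] for r in range(k)) - tp
        fn = sum(conf[c]) - tp
        tp_all, fp_all, fn_all = tp_all + tp, fp_all + fp, fn_all + fn
        per_class.append(2 * tp / (2 * tp + fp + fn) if tp else 0.0)
    macro = sum(per_class) / k                            # mean over classes
    micro = 2 * tp_all / (2 * tp_all + fp_all + fn_all)   # pooled counts
    return macro, micro

# toy 3-class confusion matrix
macro, micro = f1_scores([[8, 1, 1],
                          [0, 9, 1],
                          [2, 0, 8]])
```

Macro-averaging treats every class equally, while micro-averaging pools counts and therefore weights classes by their frequency, which is why the two can diverge on imbalanced data.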
Affiliation(s)
- Kaidi Liu
- School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Xiaohan Xie
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710119, China
- Juanting Yan
- School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Sizong Zhang
- School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing, 100083, China
- Hui Zhang
- School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing, 100083, China
18. Li H, Lin X, Lu Y, Wang M, Cheng H. Pilot study of contactless sleep apnea detection based on snore signals with hardware implementation. Physiol Meas 2023; 44:085003. [PMID: 37506712 DOI: 10.1088/1361-6579/acebb5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 07/28/2023] [Indexed: 07/30/2023]
Abstract
Objective. Sleep apnea has a high incidence and is a potentially dangerous disease, and its early detection and diagnosis are challenging. Polysomnography (PSG) is considered the best approach for sleep apnea detection, but it requires cumbersome and complicated operations and thus cannot satisfy home healthcare needs. Approach. To facilitate the initial detection of sleep apnea in the home environment, we developed a sleep apnea classification model based on snore signals and a hybrid neural network, and implemented the trained model on an embedded hardware platform. We used snore signals from 32 patients at Shenzhen People's Hospital. Mel-Fbank features were extracted from the snore signals to build a sleep apnea classification model based on a Bi-LSTM with an attention mechanism. Main results. The proposed model classified snore signals into four types: hypopnea, normal condition, obstructive sleep apnea, and central sleep apnea, with 83.52% and 62.31% accuracies under subject-dependence and subject-independence validation, respectively. After pruning and model quantization, at the cost of 0.81% and 0.95% accuracy loss for the subject-dependence and subject-independence classifications, respectively, the number of model parameters and the model storage space were reduced by 32.12% and 60.37%, respectively. The compressed model exhibited accuracies of 82.71% and 61.36% under subject-dependence and subject-independence validation, respectively. When the trained model was ported to and run on an STM32 ARM embedded platform, the model accuracy was 58.85% for the four classes under leave-one-subject-out validation. Significance. The proposed sleep apnea detection model can be used in home healthcare for the initial detection of sleep apnea.
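Generic one-shot magnitude pruning and uniform quantization steps, in the spirit of the model-compression stage described above (the paper's exact pruning schedule and quantization scheme are not specified here; the weight vector is illustrative):

```python
def prune_by_magnitude(weights, fraction=0.3):
    """One-shot pruning: zero the smallest-magnitude `fraction` of weights."""
    n_prune = int(len(weights) * fraction)
    threshold = sorted(abs(w) for w in weights)[n_prune]
    return [0.0 if abs(w) < threshold else w for w in weights]

def quantize(weights, bits=8):
    """Uniform affine quantization to a bits-wide integer grid and back."""
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / (2 ** bits - 1) or 1.0
    return [round((w - lo) / scale) * scale + lo for w in weights]

w = [0.01, -0.5, 0.02, 0.8, -0.03, 1.2, 0.002, -0.9, 0.04, 0.6]
pruned = prune_by_magnitude(w)      # 3 of 10 weights become exactly 0.0
quantized = quantize(w)             # values snapped to a 256-level grid
```

Pruning yields sparsity (fewer effective parameters) while quantization shrinks storage per weight; together they explain the parameter-count and storage-space reductions reported above.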
Affiliation(s)
- Heng Li
- Shenzhen Key Laboratory of IoT Key Technology, Harbin Institute of Technology, Shenzhen 518055, People's Republic of China
- Xu Lin
- Shenzhen Key Laboratory of IoT Key Technology, Harbin Institute of Technology, Shenzhen 518055, People's Republic of China
- Yun Lu
- Shenzhen Key Laboratory of IoT Key Technology, Harbin Institute of Technology, Shenzhen 518055, People's Republic of China
- School of Computer Science and Engineering, Huizhou University, Huizhou, Guangdong 516007, People's Republic of China
- Mingjiang Wang
- Shenzhen Key Laboratory of IoT Key Technology, Harbin Institute of Technology, Shenzhen 518055, People's Republic of China
- Hanrong Cheng
- Department of Sleep Medicine, Shenzhen People's Hospital, The Second Clinical Medical College of Jinan University, The First Affiliated Hospital of Southern University of Science and Technology, Shenzhen, Guangdong, People's Republic of China
19. Ullah R, Asif M, Shah WA, Anjam F, Ullah I, Khurshaid T, Wuttisittikulkij L, Shah S, Ali SM, Alibakhshikenari M. Speech Emotion Recognition Using Convolution Neural Networks and Multi-Head Convolutional Transformer. SENSORS (BASEL, SWITZERLAND) 2023; 23:6212. [PMID: 37448062 DOI: 10.3390/s23136212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 05/26/2023] [Accepted: 06/04/2023] [Indexed: 07/15/2023]
Abstract
Speech emotion recognition (SER) is a challenging task in human-computer interaction (HCI) systems. One of the key challenges in speech emotion recognition is to extract the emotional features effectively from a speech utterance. Despite the promising results of recent studies, they generally do not leverage advanced fusion algorithms for the generation of effective representations of emotional features in speech utterances. To address this problem, we describe the fusion of spatial and temporal feature representations of speech emotion by parallelizing convolutional neural networks (CNNs) and a Transformer encoder for SER. We stack two parallel CNNs for spatial feature representation in parallel to a Transformer encoder for temporal feature representation, thereby simultaneously expanding the filter depth and reducing the feature map with an expressive hierarchical feature representation at a lower computational cost. We use the RAVDESS dataset to recognize eight different speech emotions. We augment and intensify the variations in the dataset to minimize model overfitting. Additive White Gaussian Noise (AWGN) is used to augment the RAVDESS dataset. With the spatial and sequential feature representations of CNNs and the Transformer, the SER model achieves 82.31% accuracy for eight emotions on a hold-out dataset. In addition, the SER system is evaluated with the IEMOCAP dataset and achieves 79.42% recognition accuracy for five emotions. Experimental results on the RAVDESS and IEMOCAP datasets show the success of the presented SER system and demonstrate an absolute performance improvement over the state-of-the-art (SOTA) models.
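AWGN augmentation at a target SNR, as used to expand the RAVDESS data, can be sketched as follows (the SNR value and test signal are illustrative, not the paper's settings):

```python
import math
import random

def add_awgn(signal, snr_db, seed=0):
    """Add white Gaussian noise so the result has the requested
    signal-to-noise ratio (in dB) relative to the clean signal power."""
    rng = random.Random(seed)
    power = sum(x * x for x in signal) / len(signal)
    sigma = math.sqrt(power / (10 ** (snr_db / 10)))
    return [x + rng.gauss(0.0, sigma) for x in signal]

clean = [math.sin(0.05 * n) for n in range(2000)]   # toy utterance
noisy = add_awgn(clean, snr_db=10)                  # augmented copy
```

Each noisy copy presents the same emotional content under different perturbations, which is what discourages the model from memorizing individual utterances.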
Affiliation(s)
- Rizwan Ullah
- Wireless Communication Ecosystem Research Unit, Department of Electrical Engineering, Chulalongkorn University, Bangkok 10330, Thailand
- Muhammad Asif
- Department of Electrical Engineering, Main Campus, University of Science & Technology, Bannu 28100, Pakistan
- Wahab Ali Shah
- Department of Electrical Engineering, Namal University, Mianwali 42250, Pakistan
- Fakhar Anjam
- Department of Electrical Engineering, Main Campus, University of Science & Technology, Bannu 28100, Pakistan
- Ibrar Ullah
- Department of Electrical Engineering, Kohat Campus, University of Engineering and Technology Peshawar, Kohat 25000, Pakistan
- Tahir Khurshaid
- Department of Electrical Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea
- Lunchakorn Wuttisittikulkij
- Wireless Communication Ecosystem Research Unit, Department of Electrical Engineering, Chulalongkorn University, Bangkok 10330, Thailand
- Shashi Shah
- Wireless Communication Ecosystem Research Unit, Department of Electrical Engineering, Chulalongkorn University, Bangkok 10330, Thailand
- Syed Mansoor Ali
- Department of Physics and Astronomy, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
- Mohammad Alibakhshikenari
- Department of Signal Theory and Communications, Universidad Carlos III de Madrid, Leganés, 28911 Madrid, Spain
20. Qayyum A, Razzak I, Tanveer M, Mazher M, Alhaqbani B. High-Density Electroencephalography and Speech Signal Based Deep Framework for Clinical Depression Diagnosis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:2587-2597. [PMID: 37028339 DOI: 10.1109/tcbb.2023.3257175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Depression is a mental disorder characterized by a persistent depressed mood or loss of interest in performing activities, causing significant impairment in daily routine. Possible causes include psychological, biological, and social sources of distress. Clinical depression is the more severe form of depression, also known as major depression or major depressive disorder. Recently, electroencephalography and speech signals have been used for the early diagnosis of depression; however, such work has focused on moderate or severe depression. We combined audio spectrograms and multiple frequency bands of EEG signals to improve diagnostic performance. To do so, we fused different levels of speech and EEG features to generate descriptive features and applied vision transformers and various pre-trained networks to the speech and EEG spectra. We conducted extensive experiments on the Multimodal Open Dataset for Mental-disorder Analysis (MODMA), which showed a significant improvement in depression diagnosis performance (precision 0.972, recall 0.973, and F1 score 0.973) for patients at the mild stage. In addition, we provide a web-based framework using Flask and have released the source code publicly.
21. Goumiri S, Benboudjema D, Pieczynski W. A new hybrid model of convolutional neural networks and hidden Markov chains for image classification. Neural Comput Appl 2023; 35:1-16. [PMID: 37362578 PMCID: PMC10230497 DOI: 10.1007/s00521-023-08644-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 05/02/2023] [Indexed: 06/28/2023]
Abstract
Convolutional neural networks (CNNs) have lately proven to be extremely effective in image recognition. Besides CNNs, hidden Markov chains (HMCs) are probabilistic models widely used in image processing. This paper presents a new hybrid model composed of both CNNs and HMCs. The CNN is used for feature extraction and dimensionality reduction, and the HMC for classification. In the new model, named CNN-HMC, the convolutional and pooling layers of the CNN are applied to extract feature maps. A Peano scan is then applied to obtain several HMCs. The Expectation-Maximization (EM) algorithm is used to estimate the HMC parameters and to make the Bayesian Maximum Posterior Mode (MPM) classification method unsupervised. The objective is to enhance the performance of CNN models on the image classification task. To evaluate our proposal, it is compared to six models in two series of experiments. In the first series, we consider two CNN-HMC models and compare them to two CNNs, 4Conv and Mini AlexNet, respectively. The results show that the CNN-HMC model outperforms the classical CNN model and significantly improves the accuracy of Mini AlexNet. In the second series, CNN-HMC is compared to four models, CNN-SVM, CNN-LSTM, CNN-RF, and CNN-gcForest, which differ from CNN-HMC only in the second classification step. Based on five datasets and four metrics (recall, precision, F1-score, and accuracy), the results of these comparisons again demonstrate the value of the proposed CNN-HMC. In particular, starting from a CNN model with 71% accuracy, CNN-HMC achieves an accuracy ranging between 81.63% and 92.5%.
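The scan step that feeds the CNN feature maps to the HMC stage can be illustrated with a boustrophedon ("snake") scan, a simplified stand-in for the Peano scan used in the paper; like the Peano scan, it keeps consecutive sequence elements spatially adjacent:

```python
def snake_scan(feature_map):
    """Linearize a 2-D feature map row by row, reversing every other row,
    so consecutive sequence elements are always spatial neighbours."""
    sequence = []
    for r, row in enumerate(feature_map):
        sequence.extend(row if r % 2 == 0 else row[::-1])
    return sequence

fmap = [[1, 2, 3],
        [4, 5, 6],
        [7, 8, 9]]
chain = snake_scan(fmap)   # 1-D observation sequence for the HMC stage
```

Preserving spatial adjacency in the 1-D chain is what lets the Markov-chain transition model capture the 2-D local structure of the feature map.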
Affiliation(s)
- Soumia Goumiri
- Laboratoire des Méthodes de Conception de Systèmes (LMCS), Ecole nationale Supérieure d’Informatique (ESI), BP, 68M Oued-Smar, 16270 Alger, Algeria
- CERIST, Centre de Recherche sur l’Information Scientifique et Technique, Ben Aknoun, 16030 Algeria
- Dalila Benboudjema
- Laboratoire des Méthodes de Conception de Systèmes (LMCS), Ecole nationale Supérieure d’Informatique (ESI), BP, 68M Oued-Smar, 16270 Alger, Algeria
- Wojciech Pieczynski
- SAMOVAR, Telecom SudParis, Institut Polytechnique de Paris, 91120 Palaiseau, France
22. Lukac M, Zhambulova G, Abdiyeva K, Lewis M. Study on emotion recognition bias in different regional groups. Sci Rep 2023; 13:8414. [PMID: 37225756 PMCID: PMC10209154 DOI: 10.1038/s41598-023-34932-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 05/10/2023] [Indexed: 05/26/2023] Open
Abstract
Human-machine communication can be substantially enhanced by the inclusion of high-quality, real-time recognition of spontaneous human emotional expressions. However, successful recognition of such expressions can be negatively impacted by factors such as sudden variations in lighting or intentional obfuscation. Reliable recognition can be more substantively impeded by the fact that the presentation and meaning of emotional expressions can vary significantly based on the culture of the expressor and the environment within which the emotions are expressed. As an example, an emotion recognition model trained on a regionally specific database collected from North America might fail to recognize standard emotional expressions from another region, such as East Asia. To address the problem of regional and cultural bias in emotion recognition from facial expressions, we propose a meta-model that fuses multiple emotional cues and features. The proposed approach integrates image features, action level units, micro-expressions, and macro-expressions into a multi-cues emotion model (MCAM). Each of the facial attributes incorporated into the model represents a specific category: fine-grained content-independent features, facial muscle movements, short-term facial expressions, and high-level facial expressions. The results of the proposed meta-classifier (MCAM) approach show that a) the successful classification of regional facial expressions is based on non-sympathetic features, b) learning the emotional facial expressions of some regional groups can confound the recognition of emotional expressions of other regional groups unless it is done from scratch, and c) certain facial cues and features of the datasets preclude the design of a perfectly unbiased classifier. As a result of these observations, we posit that to learn certain regional emotional expressions, other regional expressions first have to be "forgotten".
Affiliation(s)
- Martin Lukac
- Department of Computer Science, Nazarbayev University, Kabanbay Batyr 53, Astana, 010000, Kazakhstan
- Gulnaz Zhambulova
- Department of Computer Science, Nazarbayev University, Kabanbay Batyr 53, Astana, 010000, Kazakhstan
- Kamila Abdiyeva
- Department of Computer Science, Nazarbayev University, Kabanbay Batyr 53, Astana, 010000, Kazakhstan
- Michael Lewis
- Department of Computer Science, Nazarbayev University, Kabanbay Batyr 53, Astana, 010000, Kazakhstan
23. Sun P, Wang J, Dong Z. CNN-LSTM Neural Network for Identification of Pre-Cooked Pasta Products in Different Physical States Using Infrared Spectroscopy. SENSORS (BASEL, SWITZERLAND) 2023; 23:4815. [PMID: 37430729 DOI: 10.3390/s23104815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 05/02/2023] [Accepted: 05/13/2023] [Indexed: 07/12/2023]
Abstract
Infrared (IR) spectroscopy is nondestructive, fast, and straightforward. Recently, a growing number of pasta companies have been using IR spectroscopy combined with chemometrics to quickly determine sample parameters. However, few studies have used deep learning models to classify cooked wheat food products, and even fewer have applied them to Italian pasta. To address this gap, an improved CNN-LSTM neural network is proposed to identify pasta in different physical states (frozen vs. thawed) using IR spectroscopy. A one-dimensional convolutional neural network (1D-CNN) and long short-term memory (LSTM) were constructed to extract the local abstraction and sequence position information from the spectra, respectively. The results showed that the accuracy of the CNN-LSTM model reached 100% after applying principal component analysis (PCA) to the spectral data of thawed Italian pasta and 99.44% after applying PCA to the spectral data of frozen pasta, verifying that the method has high analytical accuracy and generalizes well. Therefore, the CNN-LSTM neural network combined with IR spectroscopy helps to identify different pasta products.
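The PCA step applied before the CNN-LSTM projects each spectrum onto its principal directions; the first such direction can be found by power iteration on the covariance matrix. A generic sketch on toy data, not the authors' pipeline:

```python
import math

def first_principal_component(data, iters=200):
    """Direction of the first principal component: the dominant eigenvector
    of the sample covariance matrix, found by power iteration."""
    n, d = len(data), len(data[0])
    means = [sum(row[j] for row in data) / n for j in range(d)]
    centered = [[row[j] - means[j] for j in range(d)] for row in data]
    cov = [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    v = [1.0] * d
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

# toy "spectra" that vary mostly along the (1, 1) direction
data = [[t + e, t - e] for t, e in
        [(0, 0.1), (1, -0.1), (2, 0.05), (3, -0.05), (4, 0.0)]]
pc1 = first_principal_component(data)
```

Projecting onto the leading components keeps the directions of greatest spectral variation while shrinking the input dimension the network must handle.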
Affiliation(s)
- Penghui Sun
- School of Information Science and Engineering, Xinjiang University, Urumqi 830017, China
- Jiajia Wang
- School of Information Science and Engineering, Xinjiang University, Urumqi 830017, China
- The Key Laboratory of Signal Detection and Processing, Xinjiang Uygur Autonomous Region, Xinjiang University, Urumqi 830017, China
- Post-Doctoral Workstation of Xinjiang Uygur Autonomous Region Product Quality Supervision and Inspection Institute, Urumqi 830011, China
- Zhilin Dong
- School of Information Science and Engineering, Xinjiang University, Urumqi 830017, China

24
Pandey SK, Shekhawat HS, Prasanna S. Multi-cultural speech emotion recognition using language and speaker cues. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104679]
25
Tanko D, Demir FB, Dogan S, Sahin SE, Tuncer T. Automated speech emotion polarization for a distance education system based on orbital local binary pattern and an appropriate sub-band selection technique. Multimed Tools Appl 2023:1-18. [PMID: 37362680] [PMCID: PMC10068203] [DOI: 10.1007/s11042-023-14648-y]
Abstract
The distance education system was widely adopted during the Covid-19 pandemic by many institutions of learning. To measure the effectiveness of this system, it is essential to evaluate the performance of the lecturers, and an automated speech emotion recognition model is one solution. This research aims to develop an accurate speech emotion recognition model that monitors lecturers'/instructors' emotional states during lecture presentations. To achieve this aim, a new speech emotion dataset was collected and an automated speech emotion recognition (SER) model is proposed. The presented SER model contains three main phases: (i) feature extraction using multi-level discrete wavelet transform (DWT) and a one-dimensional orbital local binary pattern (1D-OLBP), (ii) feature selection using neighborhood component analysis (NCA), and (iii) classification using a support vector machine (SVM) with ten-fold cross-validation. The proposed 1D-OLBP and NCA-based model was tested on the collected dataset, containing three emotional states with 7101 sound segments, and achieved a 93.40% classification accuracy. Moreover, the proposed architecture was tested on three publicly available speech emotion recognition datasets to highlight the general classification ability of this self-organized model. It reached over 70% classification accuracy on all three public datasets, demonstrating the success of this model.
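The paper's 1D-OLBP is a custom descriptor; as a hedged illustration of the underlying idea only, the sketch below implements the classic one-dimensional local binary pattern, which encodes each sample by thresholding its neighbours against it and histogramming the resulting codes. The orbital neighbour pattern of 1D-OLBP differs from this, and the radius and signal here are illustrative assumptions.

```python
import numpy as np

def lbp_1d(signal, radius=4):
    """Classic 1D local binary pattern: each sample is encoded by
    comparing its `radius` left and `radius` right neighbours to it.
    (The paper's orbital variant, 1D-OLBP, uses a different neighbour
    pattern; this is the standard 1D-LBP for illustration.)"""
    codes = []
    for i in range(radius, len(signal) - radius):
        neighbours = np.concatenate(
            [signal[i - radius:i], signal[i + 1:i + 1 + radius]])
        bits = (neighbours >= signal[i]).astype(int)
        codes.append(int("".join(map(str, bits)), 2))
    # The histogram of codes is the texture feature vector
    n_bins = 2 ** (2 * radius)
    hist, _ = np.histogram(codes, bins=n_bins, range=(0, n_bins))
    return hist

x = np.sin(np.linspace(0, 8 * np.pi, 400))
feat = lbp_1d(x)
print(feat.shape, feat.sum())  # 256-bin histogram over 392 codes
```

Feature vectors like `feat` would then go through selection (e.g., NCA) before classification.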
Affiliation(s)
- Dahiru Tanko
- Department of Digital Forensics Engineering, College of Technology, Firat University, Elazig, Turkey
- Fahrettin Burak Demir
- Department of Software Engineering, Faculty of Engineering and Natural Sciences, Bandirma Onyedi Eylul University, Bandirma, Turkey
- Sengul Dogan
- Department of Digital Forensics Engineering, College of Technology, Firat University, Elazig, Turkey
- Sakir Engin Sahin
- Department of Computer Technologies, Arapgir Vocational School, Malatya Turgut Ozal University, Malatya, Turkey
- Turker Tuncer
- Department of Digital Forensics Engineering, College of Technology, Firat University, Elazig, Turkey

26
Kshirsagar S, Pendyala A, Falk TH. Task-specific speech enhancement and data augmentation for improved multimodal emotion recognition under noisy conditions. Front Comput Sci 2023. [DOI: 10.3389/fcomp.2023.1039261]
Abstract
Automatic emotion recognition (AER) systems are burgeoning, and systems based on audio, video, text, or physiological signals have emerged. Multimodal systems, in turn, have been shown to improve overall AER accuracy and to provide some robustness against artifacts and missing data. Collecting multiple signal modalities, however, can be very intrusive, time consuming, and expensive. Recent advances in deep-learning-based speech-to-text and natural language processing systems have enabled the development of reliable multimodal systems based on speech and text while only requiring the collection of audio data. Audio data, however, is extremely sensitive to environmental disturbances, such as additive noise, and thus faces challenges when deployed “in the wild.” To overcome this issue, speech enhancement algorithms have been deployed at the input signal level to improve testing accuracy in noisy conditions. Speech enhancement algorithms come in different flavors and can be optimized for different tasks (e.g., for human perception vs. machine performance). Data augmentation, in turn, has been deployed at the model level during training time to improve accuracy in noisy testing conditions. In this paper, we explore the combination of task-specific speech enhancement and data augmentation as a strategy to improve overall multimodal emotion recognition in noisy conditions. We show that AER accuracy under noisy conditions can be improved to levels close to those seen in clean conditions; compared against a system without speech enhancement or data augmentation, an increase in AER accuracy of 40% was seen in a cross-corpus test, showing promising results for “in the wild” AER.
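One common form of the training-time data augmentation mentioned above is mixing clean speech with noise at a controlled signal-to-noise ratio. The sketch below shows this step in NumPy; the synthetic signals and the 5 dB target are illustrative, not the paper's setup.

```python
import numpy as np

def add_noise_at_snr(clean, noise, snr_db):
    """Scale `noise` so the mixture has the requested signal-to-noise
    ratio in dB, a common training-time augmentation for robustness
    to noisy test conditions."""
    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    target_noise_power = clean_power / (10 ** (snr_db / 10))
    scaled = noise * np.sqrt(target_noise_power / noise_power)
    return clean + scaled

rng = np.random.default_rng(1)
speech = np.sin(2 * np.pi * 220 * np.linspace(0, 1, 16000))  # stand-in utterance
noise = rng.normal(size=16000)
noisy = add_noise_at_snr(speech, noise, snr_db=5)
```

Sweeping `snr_db` over a range during training exposes the model to the distortion levels expected at test time.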
27
Long X, Ding X, Li J, Dong R, Su Y, Chang C. Indentation Reverse Algorithm of Mechanical Response for Elastoplastic Coatings Based on LSTM Deep Learning. Materials (Basel) 2023; 16:2617. [PMID: 37048911] [PMCID: PMC10096397] [DOI: 10.3390/ma16072617]
Abstract
The load-penetration depth (P-h) curves of different metallic coating materials can be determined by nanoindentation experiments, but it is a challenge to obtain the stress-strain response and elastoplastic properties directly from P-h curves. These problems can be solved by finite element (FE) simulation combined with reverse analysis, which, however, is typically time-consuming, in addition to the low generality of FE methodologies across different metallic materials. To eliminate these challenges, a long short-term memory (LSTM) neural network is proposed in this study and trained on the time series of P-h curves, mapping them to the corresponding stress-strain responses of elastoplastic materials. Prior to training, 1000 sets of indentation data for metallic coating materials were generated using the FE method as training and validation sets. Each dataset contains a set of P-h curves as well as the corresponding stress-strain curves, used as network inputs and training targets, respectively. LSTM networks with various numbers of hidden layers and hidden units are evaluated to determine the optimal hyperparameters by comparing their loss curves. The prediction results show that the relationship between the P-h curves of metallic coating materials and their stress-strain responses is well predicted and essentially follows the power-law equation. Furthermore, the LSTM-based deep learning method is advantageous for interpreting the elastoplastic behavior of coating materials from indentation measurements, making the prediction of stress-strain responses much more efficient than FE analysis. The established LSTM network achieves a prediction accuracy of up to 97%, reliably satisfying engineering requirements in practice.
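The abstract notes that the learned mapping essentially follows the power-law equation. As a minimal worked example, the sketch below fits the Hollomon relation sigma = K * eps**n to synthetic flow data by least squares in log-log space; the K and n values are invented for illustration.

```python
import numpy as np

def fit_power_law(strain, stress):
    """Fit the Hollomon power-law hardening relation sigma = K * eps**n
    by linear least squares in log-log space."""
    log_eps, log_sig = np.log(strain), np.log(stress)
    n, log_K = np.polyfit(log_eps, log_sig, 1)  # slope = n, intercept = ln K
    return np.exp(log_K), n

# Synthetic plastic-flow data with K = 500 MPa, n = 0.2 (illustrative)
eps = np.linspace(0.01, 0.2, 50)
sigma = 500.0 * eps ** 0.2
K, n = fit_power_law(eps, sigma)
print(round(K, 3), round(n, 3))  # recovers 500.0 and 0.2
```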
Affiliation(s)
- Xu Long
- Research & Development Institute, Northwestern Polytechnical University in Shenzhen, Shenzhen 518063, China
- Xiaoyue Ding
- School of Mechanics, Civil Engineering and Architecture, Northwestern Polytechnical University, Xi’an 710072, China
- Jiao Li
- School of Mechanics, Civil Engineering and Architecture, Northwestern Polytechnical University, Xi’an 710072, China
- Ruipeng Dong
- School of Mechanics, Civil Engineering and Architecture, Northwestern Polytechnical University, Xi’an 710072, China
- Yutai Su
- School of Mechanics, Civil Engineering and Architecture, Northwestern Polytechnical University, Xi’an 710072, China
- Chao Chang
- School of Applied Science, Taiyuan University of Science and Technology, Taiyuan 030024, China

28
Mohammed Alsumaidaee YA, Yaw CT, Koh SP, Tiong SK, Chen CP, Yusaf T, Abdalla AN, Ali K, Raj AA. Detection of Corona Faults in Switchgear by Using 1D-CNN, LSTM, and 1D-CNN-LSTM Methods. Sensors (Basel) 2023; 23:3108. [PMID: 36991819] [PMCID: PMC10059847] [DOI: 10.3390/s23063108]
Abstract
The damaging effects of corona faults have made them a major concern in metal-clad switchgear, requiring extreme caution during operation. Corona faults are also the primary cause of flashovers in medium-voltage metal-clad electrical equipment. The root cause of this issue is an electrical breakdown of the air due to electrical stress and poor air quality within the switchgear; without proper preventative measures, a flashover can occur, resulting in serious harm to workers and equipment. As a result, detecting corona faults in switchgear and preventing electrical stress buildup is critical. Recent years have seen the successful use of deep learning (DL) for corona and non-corona detection, owing to its autonomous feature-learning capability. This paper systematically analyzes three deep learning techniques, namely 1D-CNN, LSTM, and hybrid 1D-CNN-LSTM models, to identify the most effective model for detecting corona faults from the sound waves generated in switchgear, examining performance in both the time and frequency domains. In the time domain analysis (TDA), 1D-CNN achieved success rates of 98%, 98.4%, and 93.9%, while LSTM obtained success rates of 97.3%, 98.4%, and 92.4%. The best-performing model, the 1D-CNN-LSTM, achieved success rates of 99.3%, 98.4%, and 98.4% in differentiating corona and non-corona cases during training, validation, and testing. In the frequency domain analysis (FDA), 1D-CNN achieved success rates of 100%, 95.8%, and 95.8%, LSTM obtained 100%, 100%, and 100%, and the 1D-CNN-LSTM model likewise achieved 100%, 100%, and 100% during training, validation, and testing. Hence, the developed algorithms achieved high performance in identifying corona faults in switchgear, particularly the hybrid 1D-CNN-LSTM model, owing to its accuracy in both the time and frequency domains.
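The time-domain and frequency-domain analyses contrasted above start from two input representations of the same recording. A minimal sketch, assuming arbitrary frame sizes and a synthetic signal in place of a switchgear sound recording:

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D sound signal into overlapping time-domain frames."""
    n = 1 + (len(x) - frame_len) // hop
    return np.stack([x[i * hop:i * hop + frame_len] for i in range(n)])

def fft_magnitudes(frames):
    """Frequency-domain representation: one-sided FFT magnitude per frame."""
    return np.abs(np.fft.rfft(frames, axis=1))

rng = np.random.default_rng(2)
sound = rng.normal(size=8000)          # stand-in for a switchgear recording
tda_input = frame_signal(sound, frame_len=256, hop=128)  # time-domain input
fda_input = fft_magnitudes(tda_input)                    # frequency-domain input
print(tda_input.shape, fda_input.shape)
```

Either matrix can be fed to a 1D-CNN, LSTM, or hybrid model, which is how the two analysis domains are compared.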
Affiliation(s)
- Yaseen Ahmed Mohammed Alsumaidaee
- College of Graduate Studies (COGS), Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang 43000, Selangor, Malaysia
- Chong Tak Yaw
- Institute of Sustainable Energy, Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang 43000, Selangor, Malaysia
- Siaw Paw Koh
- Institute of Sustainable Energy, Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang 43000, Selangor, Malaysia
- Department of Electrical and Electronics Engineering, Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang 43000, Selangor, Malaysia
- Sieh Kiong Tiong
- Institute of Sustainable Energy, Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang 43000, Selangor, Malaysia
- Department of Electrical and Electronics Engineering, Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang 43000, Selangor, Malaysia
- Chai Phing Chen
- Department of Electrical and Electronics Engineering, Universiti Tenaga Nasional (The Energy University), Jalan Ikram-Uniten, Kajang 43000, Selangor, Malaysia
- Talal Yusaf
- School of Engineering and Technology, Central Queensland University, Brisbane, QLD 4009, Australia
- Ahmed N Abdalla
- Faculty of Electronic Information Engineering, Huaiyin Institute of Technology, Huai’an 223003, China
- Kharudin Ali
- Faculty of Electrical and Automation Engineering Technology, UC TATI, Teluk Kalong, Kemaman 24000, Terengganu, Malaysia
- Avinash Ashwin Raj
- Tenaga Nasional Berhad Research Sdn. Bhd., No. 1, Kawasan Institusi Penyelidikan, Jln Ayer Hitam, Kajang 43000, Selangor, Malaysia

29
Singh J, Saheer LB, Faust O. Speech Emotion Recognition Using Attention Model. Int J Environ Res Public Health 2023; 20:5140. [PMID: 36982048] [PMCID: PMC10049636] [DOI: 10.3390/ijerph20065140]
Abstract
Speech emotion recognition is an important research topic that can help to maintain and improve public health and contribute towards the ongoing progress of healthcare technology. There have been several advancements in the field of speech emotion recognition systems including the use of deep learning models and new acoustic and temporal features. This paper proposes a self-attention-based deep learning model that was created by combining a two-dimensional Convolutional Neural Network (CNN) and a long short-term memory (LSTM) network. This research builds on the existing literature to identify the best-performing features for this task with extensive experiments on different combinations of spectral and rhythmic information. Mel Frequency Cepstral Coefficients (MFCCs) emerged as the best performing features for this task. The experiments were performed on a customised dataset that was developed as a combination of RAVDESS, SAVEE, and TESS datasets. Eight states of emotions (happy, sad, angry, surprise, disgust, calm, fearful, and neutral) were detected. The proposed attention-based deep learning model achieved an average test accuracy rate of 90%, which is a substantial improvement over established models. Hence, this emotion detection model has the potential to improve automated mental health monitoring.
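The self-attention component combined with the CNN and LSTM above can be illustrated by single-head scaled dot-product attention over a sequence of feature frames. The sketch below is a generic NumPy version with random projection matrices, not the paper's trained model; the sequence length and feature dimension are arbitrary.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention over a sequence of
    feature frames X (time x features), as used on top of CNN/LSTM outputs."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[1])
    # Row-wise softmax (numerically stabilised)
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ V, weights

rng = np.random.default_rng(3)
T, d = 20, 16                       # 20 time frames of 16-dim features
X = rng.normal(size=(T, d))
W = [rng.normal(size=(d, d)) * 0.1 for _ in range(3)]
out, attn = self_attention(X, *W)
print(out.shape)                    # (20, 16)
```

Each output frame is a weighted mixture of all frames, letting the classifier emphasise emotionally salient segments.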
30
Olatinwo DD, Abu-Mahfouz A, Hancke G, Myburgh H. IoT-Enabled WBAN and Machine Learning for Speech Emotion Recognition in Patients. Sensors (Basel) 2023; 23:2948. [PMID: 36991659] [PMCID: PMC10056097] [DOI: 10.3390/s23062948]
Abstract
Internet of things (IoT)-enabled wireless body area networks (WBANs) are an emerging technology that combines medical devices, wireless devices, and non-medical devices for healthcare management applications. Speech emotion recognition (SER) is an active research field in the healthcare domain and machine learning. It is a technique that can be used to automatically identify speakers' emotions from their speech. However, SER systems, especially in the healthcare domain, face several challenges, such as low prediction accuracy, high computational complexity, delays in real-time prediction, and the difficulty of identifying appropriate features from speech. Motivated by these research gaps, we propose an emotion-aware IoT-enabled WBAN system within the healthcare framework, where data processing and long-range data transmission are performed by an edge AI system for real-time prediction of patients' speech emotions and for capturing changes in emotion before and after treatment. Additionally, we investigated the effectiveness of different machine learning and deep learning algorithms in terms of classification performance, feature extraction methods, and normalization methods. We developed a hybrid deep learning model, i.e., a convolutional neural network (CNN) combined with bidirectional long short-term memory (BiLSTM), and a regularized CNN model. We combined the models with different optimization strategies and regularization techniques to improve prediction accuracy, reduce generalization error, and reduce the computational complexity of the neural networks in terms of time, power, and space. Different experiments were performed to check the efficiency and effectiveness of the proposed algorithms. The proposed models were compared with a related existing model using standard performance metrics such as prediction accuracy, precision, recall, F1 score, the confusion matrix, and the differences between actual and predicted values. The experimental results show that one of the proposed models outperformed the existing model with an accuracy of about 98%.
Affiliation(s)
- Damilola D. Olatinwo
- Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Pretoria 0001, South Africa
- Adnan Abu-Mahfouz
- Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Pretoria 0001, South Africa
- Council for Scientific and Industrial Research (CSIR), Pretoria 0184, South Africa
- Gerhard Hancke
- Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Pretoria 0001, South Africa
- Department of Computer Science, City University of Hong Kong, Hong Kong, China
- Hermanus Myburgh
- Department of Electrical, Electronic and Computer Engineering, University of Pretoria, Pretoria 0001, South Africa

31
Aspect-Based Sentiment Analysis of Customer Speech Data Using Deep Convolutional Neural Network and BiLSTM. Cognit Comput 2023. [DOI: 10.1007/s12559-023-10127-6]
32
SMDetector: Small mitotic detector in histopathology images using faster R-CNN with dilated convolutions in backbone model. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104414]
33
Jiao L, Sun C, Yan N, Yan C, Qu L, Wang Q, Zhang S, Ma L. Discrimination of Salvia miltiorrhiza from Different Geographical Origins by Laser-Induced Breakdown Spectroscopy (LIBS) with Convolutional Neural Network (CNN). Anal Lett 2023. [DOI: 10.1080/00032719.2023.2180515]
Affiliation(s)
- Long Jiao
- College of Chemistry and Chemical Engineering, Xi’an Shiyou University, Xi’an, Shaanxi, China
- Chengyu Sun
- College of Chemistry and Chemical Engineering, Xi’an Shiyou University, Xi’an, Shaanxi, China
- Naying Yan
- College of Chemistry and Chemical Engineering, Xi’an Shiyou University, Xi’an, Shaanxi, China
- Chunhua Yan
- College of Chemistry and Chemical Engineering, Xi’an Shiyou University, Xi’an, Shaanxi, China
- Le Qu
- Cooperative Innovation Center of Unconventional Oil and Gas Exploration and Development in Shaanxi Province, Xi’an, Shaanxi, China
- Qin Wang
- School of Chemistry and Environment Science, Shaanxi University of Technology, Hanzhong, Shaanxi, China
- Shengrui Zhang
- School of Chemistry and Environment Science, Shaanxi University of Technology, Hanzhong, Shaanxi, China
- Ling Ma
- College of Chemistry and Chemical Engineering, Xi’an Shiyou University, Xi’an, Shaanxi, China

34
Alsabhan W. Human-Computer Interaction with a Real-Time Speech Emotion Recognition with Ensembling Techniques 1D Convolution Neural Network and Attention. Sensors (Basel) 2023; 23:1386. [PMID: 36772427] [PMCID: PMC9921095] [DOI: 10.3390/s23031386]
Abstract
Emotions have a crucial function in the mental existence of humans and are vital for identifying a person's behaviour and mental condition. Speech Emotion Recognition (SER) is the task of extracting a speaker's emotional state from their speech signal. SER is a growing discipline in human-computer interaction and has recently attracted significant interest: because the set of universal emotions is small, any intelligent system with sufficient computational capacity can learn to recognise them. However, human speech is immensely diverse, making it difficult to create a single, standardised recipe for detecting hidden emotions. This work addresses that difficulty by combining multilingual emotional datasets to build a more generalised and effective model for recognising human emotions. A two-step process was used to develop the model: feature extraction followed by classification. The zero-crossing rate (ZCR), root-mean-square energy (RMSE), and the renowned Mel-frequency cepstral coefficients were extracted as features. Two proposed models, a 1D CNN combined with LSTM and attention and a proprietary 2D CNN architecture, were used for classification. The outcomes demonstrated that the proposed 1D CNN with LSTM and attention performed better than the 2D CNN. For the EMO-DB, SAVEE, ANAD, and BAVED datasets, the model's accuracy was 96.72%, 97.13%, 96.72%, and 88.39%, respectively. The model beat several earlier efforts on the same datasets, demonstrating the generality and efficacy of recognising multiple emotions from various languages.
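Two of the features named above, ZCR and RMS energy, are simple per-frame computations. A minimal NumPy sketch on a pure tone; the sampling rate and tone frequency are illustrative assumptions.

```python
import numpy as np

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    return np.mean(np.abs(np.diff(np.sign(frame))) > 0)

def rmse(frame):
    """Root-mean-square energy of a frame."""
    return np.sqrt(np.mean(frame ** 2))

t = np.linspace(0, 1, 8000, endpoint=False)
tone = np.sin(2 * np.pi * 100 * t + 0.5)   # 100 Hz tone at 8 kHz sampling
print(zero_crossing_rate(tone))            # 200/7999, about 0.025
print(round(rmse(tone), 3))                # 0.707
```

In a full pipeline these scalars are computed per frame and stacked with the cepstral coefficients into the feature vector.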
Affiliation(s)
- Waleed Alsabhan
- College of Engineering, Al Faisal University, P.O. Box 50927, Riyadh 11533, Saudi Arabia
35
Zhong MY, Yang QY, Liu Y, Zhen BY, Zhao FD, Xie BB. EEG emotion recognition based on TQWT-features and hybrid convolutional recurrent neural network. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104211]
36
Wang W, Tang Q. Combined model of air quality index forecasting based on the combination of complementary empirical mode decomposition and sequence reconstruction. Environ Pollut 2023; 316:120628. [PMID: 36370980] [DOI: 10.1016/j.envpol.2022.120628]
Abstract
One of the most important issues that cities face is air pollution. In this study, a novel integrated forecasting model of the air quality index (AQI) is proposed to produce reliable predictions, providing useful references for urban air pollution control, public health planning, and residents' travel planning. First, the original data is decomposed by complementary ensemble empirical mode decomposition (CEEMD), forming subsequences of different frequencies. Second, the fuzzy entropy (FE) algorithm is used to reconstruct the subsequences. Then, the combined forecasting model is established, with different prediction methods selected for different frequency subsequences: the new high-frequency sequences, low-frequency sequences, and trend sequences are predicted by the whale-optimization-algorithm-optimized long short-term memory network (WOA-LSTM) and the extreme learning machine (ELM), respectively. Empirical analyses are carried out on the examples of Beijing and Chengdu. The results indicate that: (1) the proposed CEEMD-FE-WOA-LSTM-ELM model effectively integrates the characteristics of the original sequence and has the highest prediction accuracy among all comparison models; (2) preprocessing the data is necessary and effectively extracts data features: taking Beijing as an example, compared with the non-decomposition model, adding the decomposition algorithm increases the prediction accuracy rate (PA) by 8.55% on average, decreases the RMSE by 10.36 on average, and decreases the MAPE by 6.11% on average; (3) the overall prediction level and accuracy can be effectively increased by applying different prediction methods to the recombined sequences of various frequencies. The research results can provide references for urban air quality prediction.
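The fuzzy entropy used above to group the decomposed subsequences can be sketched as follows. This is a generic FuzzyEn implementation with common default parameters (m = 2, r = 0.2 times the standard deviation), which may differ from the paper's choices.

```python
import numpy as np

def fuzzy_entropy(x, m=2, r=None, power=2):
    """Fuzzy entropy of a 1-D series: compares mean-removed embedding
    vectors with the fuzzy membership exp(-(d**power)/r) instead of a
    hard threshold. Defaults are common choices, not the paper's."""
    if r is None:
        r = 0.2 * np.std(x)

    def phi(dim):
        # Embedding vectors with their own mean removed
        emb = np.array([x[i:i + dim] for i in range(len(x) - m)])
        emb = emb - emb.mean(axis=1, keepdims=True)
        # Chebyshev distances between all pairs of vectors
        d = np.max(np.abs(emb[:, None, :] - emb[None, :, :]), axis=2)
        sim = np.exp(-(d ** power) / r)
        n = len(emb)
        return (sim.sum() - n) / (n * (n - 1))   # exclude self-matches

    return np.log(phi(m)) - np.log(phi(m + 1))

rng = np.random.default_rng(4)
fe_noise = fuzzy_entropy(rng.normal(size=300))
fe_sine = fuzzy_entropy(np.sin(np.linspace(0, 20 * np.pi, 300)))
print(round(fe_sine, 3), round(fe_noise, 3))  # irregular noise scores higher
```

Subsequences with similar entropy values are merged before choosing a predictor for each group.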
Affiliation(s)
- Weijun Wang
- Department of Economics and Management, North China Electric Power University, 689 Huadian Road, Baoding 071000, China.
- Qing Tang
- Department of Economics and Management, North China Electric Power University, 689 Huadian Road, Baoding 071000, China.

37
de Lope J, Graña M. An ongoing review of speech emotion recognition. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2023.01.002]
38
Shanthi N, Stonier AA, Sherine A, Devaraju T, Abinash S, Ajay R, Arul Prasath V, Ganji V. An integrated approach for mental health assessment using emotion analysis and scales. Healthc Technol Lett 2022. [DOI: 10.1049/htl2.12040]
Affiliation(s)
- N. Shanthi
- Department of Computer Science & Engineering, Kongu Engineering College, Perundurai, Tamil Nadu, India
- Anli Sherine
- School of Computing and Creative Media, University of Technology Sarawak, Sarawak, Malaysia
- T. Devaraju
- Department of Electrical and Electronics Engineering, Sree Vidyanikethan Engineering College, Tirupati, Andhra Pradesh, India
- S. Abinash
- Department of Computer Science & Engineering, Kongu Engineering College, Perundurai, Tamil Nadu, India
- R. Ajay
- Department of Computer Science & Engineering, Kongu Engineering College, Perundurai, Tamil Nadu, India
- V. Arul Prasath
- Department of Computer Science & Engineering, Kongu Engineering College, Perundurai, Tamil Nadu, India
- Vivekananda Ganji
- Department of Electrical and Computer Engineering, Debre Tabor University, Debre Tabor, Ethiopia

39
Wang Y, Li S, Zhang H, Liu T. A lightweight CNN-based model for early warning in sow oestrus sound monitoring. Ecol Inform 2022. [DOI: 10.1016/j.ecoinf.2022.101863]
40
Uppada SK, Patel P, Sivaselvan B. An image and text-based multimodal model for detecting fake news in OSN's. J Intell Inf Syst 2022; 61:1-27. [PMID: 36465146] [PMCID: PMC9708513] [DOI: 10.1007/s10844-022-00764-y]
Abstract
Digital mass media has become the new paradigm of communication, revolving around online social networks (OSNs). The growing use of OSNs as a primary source of information, together with the growth of platforms providing such news, has increased the scope for spreading fake news. People spread fake news in multimedia formats such as images, audio, and video. Visual news is prone to have a psychological impact on users and is often misleading, so multimodal frameworks for detecting fake posts have gained demand in recent times. This paper proposes a framework that flags fake posts containing visual data embedded with text. The proposed framework works on data derived from the Fakeddit dataset, with over 1 million samples containing text, image, metadata, and comment data gathered from a wide range of sources, and tries to exploit the unique features of fake and legitimate images. The framework uses separate architectures to learn visual and linguistic models from each post. Image polarity datasets derived from Flickr are also considered for analysis, and the features extracted from these visual and text-based data help in flagging news. The proposed fusion model achieved an overall accuracy of 91.94%, precision of 93.43%, recall of 93.07%, and F1-score of 93%. The experimental results show that the proposed multimodal image-and-text model achieves better results than other state-of-the-art models working on a similar dataset.
Affiliation(s)
- Santosh Kumar Uppada
- Department of Computer Science and Engineering, IIITDM Kancheepuram, Melakottiyur, Chennai, 600127 Tamil Nadu India
- Parth Patel
- Department of Computer Science and Engineering, IIITDM Kancheepuram, Melakottiyur, Chennai, 600127 Tamil Nadu India
- Sivaselvan B.
- Department of Computer Science and Engineering, IIITDM Kancheepuram, Melakottiyur, Chennai, 600127 Tamil Nadu India

41
Akalya devi C, Karthika Renuka D, Pooventhiran G, Harish D, Yadav S, Thirunarayan K. Towards enhancing emotion recognition via multimodal framework. J Intell Fuzzy Syst 2022. [DOI: 10.3233/jifs-220280]
Abstract
Emotional AI is the next era of AI, set to play a major role in fields such as entertainment, health care, and self-paced online education by considering clues from multiple sources. In this work, we propose a multimodal emotion recognition system that extracts information from speech, motion capture, and text data. The main aim of this research is to improve the unimodal architectures to outperform the state of the art and to combine them into a robust multimodal fusion architecture. We developed 1D and 2D CNN-LSTM time-distributed models for speech, a hybrid CNN-LSTM model for motion capture data, and a BERT-based model for text data to achieve state-of-the-art results, and attempted both concatenation-based decision-level fusion and Deep CCA-based feature-level fusion schemes. The proposed speech and mocap models achieve emotion recognition accuracies of 65.08% and 67.51%, respectively, and the BERT-based text model achieves an accuracy of 72.60%. The decision-level fusion approach significantly improves the accuracy of detecting emotions on the IEMOCAP and MELD datasets: it achieves 80.20% accuracy on IEMOCAP, which is 8.61% higher than the state-of-the-art methods, and 63.52% and 61.65% in 5-class and 7-class classification on the MELD dataset, which are also higher than the state of the art.
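Decision-level fusion of the kind evaluated above can be illustrated by averaging per-modality class probabilities and taking the arg max. This is a minimal sketch with invented softmax outputs, a simple averaging variant rather than the paper's exact concatenation-based scheme.

```python
import numpy as np

def decision_level_fusion(prob_list, weights=None):
    """Combine per-modality class-probability vectors by (weighted)
    averaging and pick the arg-max class, one simple form of
    decision-level fusion."""
    probs = np.stack(prob_list)                  # (modalities, classes)
    if weights is None:
        weights = np.ones(len(prob_list)) / len(prob_list)
    fused = weights @ probs
    return fused, int(np.argmax(fused))

# Hypothetical softmax outputs from three unimodal classifiers
speech = np.array([0.2, 0.5, 0.3])
mocap  = np.array([0.1, 0.3, 0.6])
text   = np.array([0.2, 0.6, 0.2])
fused, label = decision_level_fusion([speech, mocap, text])
print(label)   # class 1 wins after averaging
```

Weighting the modalities by their validation accuracy is a common refinement of this scheme.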
Affiliation(s)
- C. Akalya devi
- Department of Information Technology, PSG College of Technology, Coimbatore, India
- D. Karthika Renuka
- Department of Information Technology, PSG College of Technology, Coimbatore, India
- Shweta Yadav
- Department of Computer Science and Engineering, Wright State University, Dayton, OH, USA
42
Sun C, Zhang Y, Huang G, Liu L, Hao X. A soft sensor model based on long&short-term memory dual pathways convolutional gated recurrent unit network for predicting cement specific surface area. ISA Transactions 2022;130:293-305. [PMID: 35367055] [DOI: 10.1016/j.isatra.2022.03.013]
Abstract
The specific surface area of cement is an important index of cement product quality, but the time-varying delays, non-linearity, and data redundancy in process-industry data make it difficult to establish an accurate online monitoring model. To solve these problems, a soft sensor model based on a long&short-term memory dual-pathway convolutional gated recurrent unit network (L/S-ConvGRU) is proposed for predicting the cement specific surface area. First, because the linear coupling constraint inside the gated recurrent unit network (GRU) hinders the flow of information, parameters L and S are introduced into the convolutional gated recurrent unit network (ConvGRU). L and S are decimals in the range (0, 1) that change the internal linear constraint relationship and enhance the feature extraction capability of the model. Then, two spatio-temporal feature extraction pathways are designed, a long-term memory enhancement pathway and a short-term dependence pathway, which capture long-term and short-term time-varying delay information from the sample data. Finally, the two pathways are applied to the L/S-ConvGRU model and the extracted spatio-temporal features are fused to achieve accurate prediction of the specific surface area of cement. The model was trained on raw data from a cement plant, and the experimental results show that L/S-ConvGRU has higher precision and better generalization capability.
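The abstract does not give the exact L/S gating equations, but one plausible reading is that the scalars relax the standard GRU convex combination h = (1-z)*h_prev + z*h_tilde. The sketch below implements that reading on a plain (non-convolutional) GRU cell with random weights; the cell structure, L/S placement, and all parameter values are assumptions, not the paper's formulation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def ls_gru_step(x, h_prev, params, L=0.9, S=0.8):
    """One GRU step whose update is relaxed by scalars L, S in (0, 1),
    loosening the usual convex-combination constraint
    h = (1-z)*h_prev + z*h_tilde (an illustrative guess at the L/S idea)."""
    Wz, Uz, Wr, Ur, Wh, Uh = params
    z = sigmoid(Wz @ x + Uz @ h_prev)              # update gate
    r = sigmoid(Wr @ x + Ur @ h_prev)              # reset gate
    h_tilde = np.tanh(Wh @ x + Uh @ (r * h_prev))  # candidate state
    return L * (1 - z) * h_prev + S * z * h_tilde

rng = np.random.default_rng(0)
n_in, n_hid = 4, 3
# Alternate input-to-hidden and hidden-to-hidden weight matrices.
params = [rng.standard_normal((n_hid, n_in)) if i % 2 == 0
          else rng.standard_normal((n_hid, n_hid)) for i in range(6)]
h = np.zeros(n_hid)
for t in range(5):                                  # run a short sequence
    h = ls_gru_step(rng.standard_normal(n_in), h, params)
```

With L, S < 1 the state magnitude stays bounded below max(L, S), since each term is a contraction of a quantity with absolute value below 1.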
Affiliation(s)
- Chao Sun
- School of Electrical Engineering, Yanshan University, 438 Hebei Avenue, Qinhuangdao 066004, China
- Yuxuan Zhang
- School of Electrical Engineering, Yanshan University, 438 Hebei Avenue, Qinhuangdao 066004, China
- Gaolu Huang
- School of Electrical Engineering, Yanshan University, 438 Hebei Avenue, Qinhuangdao 066004, China
- Lin Liu
- School of Electrical Engineering, Yanshan University, 438 Hebei Avenue, Qinhuangdao 066004, China
- Xiaochen Hao
- School of Electrical Engineering, Yanshan University, 438 Hebei Avenue, Qinhuangdao 066004, China
43
Deep learning for Covid-19 forecasting: State-of-the-art review. Neurocomputing 2022;511:142-154. [PMID: 36097509] [PMCID: PMC9454152] [DOI: 10.1016/j.neucom.2022.09.005]
Abstract
The Covid-19 pandemic has galvanized scientists to apply machine learning methods to help combat the crisis. Despite the significant amount of research, there exists no comprehensive survey devoted specifically to examining deep learning methods for Covid-19 forecasting. In this paper, we fill this gap in the literature by reviewing and analyzing the current studies that use deep learning for Covid-19 forecasting. Our review considered all published papers and preprints discoverable through Google Scholar, for the period from Apr 1, 2020 to Feb 20, 2022, that describe deep learning approaches to forecasting Covid-19. The search identified 152 studies, of which 53 passed the initial quality screening and were included in our survey. We propose a model-based taxonomy to categorize the literature, describe each model, and highlight its performance. Finally, the deficiencies of the existing approaches are identified and the necessary improvements for future research are elucidated. The study provides a gateway for researchers interested in forecasting Covid-19 using deep learning.
44
Xefteris VR, Tsanousa A, Georgakopoulou N, Diplaris S, Vrochidis S, Kompatsiaris I. Graph Theoretical Analysis of EEG Functional Connectivity Patterns and Fusion with Physiological Signals for Emotion Recognition. Sensors (Basel, Switzerland) 2022;22:8198. [PMID: 36365896] [PMCID: PMC9656224] [DOI: 10.3390/s22218198]
Abstract
Emotion recognition is a key attribute for realizing advances in human-computer interaction, especially when using non-intrusive physiological sensors, such as electroencephalograph (EEG) and electrocardiograph. Although functional connectivity of EEG has been utilized for emotion recognition, the graph theory analysis of EEG connectivity patterns has not been adequately explored. The exploitation of brain network characteristics could provide valuable information regarding emotions, while the combination of EEG and peripheral physiological signals can reveal correlation patterns of human internal state. In this work, a graph theoretical analysis of EEG functional connectivity patterns along with fusion between EEG and peripheral physiological signals for emotion recognition has been proposed. After extracting functional connectivity from EEG signals, both global and local graph theory features are extracted. Those features are concatenated with statistical features from peripheral physiological signals and fed to different classifiers and a Convolutional Neural Network (CNN) for emotion recognition. The average accuracy on the DEAP dataset using CNN was 55.62% and 57.38% for subject-independent valence and arousal classification, respectively, and 83.94% and 83.87% for subject-dependent classification. Those scores went up to 75.44% and 78.77% for subject-independent classification and 88.27% and 90.84% for subject-dependent classification using a feature selection algorithm, exceeding the current state-of-the-art results.
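The local graph-theory feature extraction step described above can be sketched as follows: binarize a functional-connectivity matrix with a threshold and compute per-node features. Degree and clustering coefficient are two local features commonly used in such analyses; the threshold, the feature subset, and the toy connectivity matrix are illustrative assumptions, not the paper's pipeline.

```python
import numpy as np

def graph_features(conn, threshold=0.5):
    """Binarize a functional-connectivity matrix into an undirected
    adjacency matrix and extract simple local graph-theory features:
    node degree and clustering coefficient."""
    A = (np.abs(conn) >= threshold).astype(int)
    np.fill_diagonal(A, 0)                       # no self-loops
    degree = A.sum(axis=1)
    # Triangles through node i = (A^3)_ii / 2 for an undirected graph.
    triangles = np.diag(A @ A @ A) / 2.0
    possible = degree * (degree - 1) / 2.0
    clustering = np.divide(triangles, possible,
                           out=np.zeros_like(triangles),
                           where=possible > 0)
    return degree, clustering

# Toy 4-channel connectivity: channels 0, 1, 2 form a triangle,
# channel 3 is weakly connected to everything.
conn = np.array([[1.0, 0.8, 0.7, 0.1],
                 [0.8, 1.0, 0.9, 0.2],
                 [0.7, 0.9, 1.0, 0.1],
                 [0.1, 0.2, 0.1, 1.0]])
deg, clust = graph_features(conn)
```

The resulting per-node features would then be concatenated with statistics from the peripheral signals before classification, as the abstract describes.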
45
CNN-LSTM Facial Expression Recognition Method Fused with Two-Layer Attention Mechanism. Computational Intelligence and Neuroscience 2022;2022:7450637. [DOI: 10.1155/2022/7450637]
Abstract
When exploring facial expression recognition methods, we found that existing algorithms make insufficient use of information from the key facial regions that express emotion. To address this problem, building on a convolutional neural network and long short-term memory (CNN-LSTM), we propose a facial expression recognition method that incorporates an attention mechanism (CNN-ALSTM). Compared with the general CNN-LSTM algorithm, it can mine the information of important regions more effectively. Furthermore, a CNN-LSTM facial expression recognition method incorporating a two-layer attention mechanism (ACNN-ALSTM) is proposed. We conducted comparative experiments on the Fer2013 and processed CK+ datasets with CNN-ALSTM, ACNN-ALSTM, patch-based ACNN (pACNN), facial expression recognition with attention net (FERAtt), and other networks. The results show that the proposed ACNN-ALSTM hybrid neural network model is superior to related work in expression recognition.
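The attention-over-LSTM idea underlying such models can be sketched generically: score each timestep's LSTM output, normalize the scores with a softmax, and take the weighted sum as the context vector. This is a minimal single-layer temporal attention; the paper's two-layer variant and its learned parameters are not reproduced, and the scoring function here is an assumed tanh projection.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def temporal_attention(H, w):
    """Soft attention over per-timestep LSTM outputs H of shape (T, D):
    score each timestep, softmax the scores into weights, and return
    the attention-weighted context vector."""
    scores = np.tanh(H @ w)        # (T,) one scalar score per timestep
    alpha = softmax(scores)        # attention weights, sum to 1
    context = alpha @ H            # (D,) weighted sum over timesteps
    return context, alpha

rng = np.random.default_rng(1)
H = rng.standard_normal((6, 8))    # 6 timesteps, 8 hidden features
w = rng.standard_normal(8)         # assumed learnable scoring vector
context, alpha = temporal_attention(H, w)
```

Because the weights sum to 1, the context vector stays in the convex hull of the timestep outputs while emphasizing the informative frames.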
46
Zhang X, Li H, Dong R, Lu Z, Li C. Electroencephalogram and surface electromyogram fusion-based precise detection of lower limb voluntary movement using convolution neural network-long short-term memory model. Front Neurosci 2022;16:954387. [PMID: 36213740] [PMCID: PMC9538146] [DOI: 10.3389/fnins.2022.954387]
Abstract
Fusion of the electroencephalogram (EEG) and surface electromyogram (sEMG) has been widely used to detect human movement intention for human–robot interaction, but the internal relationship between EEG and sEMG signals is not clear, so their fusion still has some shortcomings. In this study, a precise fusion method of EEG and sEMG using a CNN-LSTM model was investigated to detect lower limb voluntary movement. First, the signal processing of EEG and sEMG at each stage was analyzed so that the response time difference between EEG and sEMG could be estimated, and this difference was calculated by symbolic transfer entropy: the estimated value was about 24–26 ms and the calculated value was between 25 and 45 ms. Second, both data fusion and feature fusion of EEG and sEMG were used to obtain a data matrix for the model, and a hybrid CNN-LSTM model was established as the EEG- and sEMG-based decoding model of lower limb voluntary movement. Finally, the offline experimental results showed that the accuracy of data fusion was significantly higher than the feature fusion-based accuracy in 5-fold cross-validation; the average accuracy of EEG and sEMG data fusion was more than 95%, and eliminating the response time difference between EEG and sEMG improved the average data fusion accuracy by about 0.7 ± 0.26%. Meanwhile, the online average accuracy of data fusion-based CNN-LSTM was more than 87% in all subjects. These results demonstrate that the time difference influences EEG and sEMG fusion for detecting lower limb voluntary movement, and that the proposed CNN-LSTM model can achieve high performance. This work provides a stable and reliable basis for human–robot interaction of the lower limb exoskeleton.
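The alignment step implied by the abstract, shifting one modality by the estimated response delay before stacking channels into a data matrix, can be sketched as follows. The sampling rate, channel counts, and constant-valued toy signals are assumptions for illustration; the paper estimates the delay (roughly 25-45 ms) via symbolic transfer entropy rather than fixing it.

```python
import numpy as np

def build_fused_matrix(eeg, semg, delay_samples):
    """Shift sEMG by the estimated EEG-to-sEMG response delay and
    stack the aligned channels into one data matrix (the kind of
    input a CNN-LSTM decoder could consume)."""
    if delay_samples > 0:
        eeg = eeg[:, :-delay_samples]   # drop trailing EEG samples
        semg = semg[:, delay_samples:]  # drop leading sEMG samples
    return np.vstack([eeg, semg])       # (eeg_ch + semg_ch, T')

fs = 1000                               # assumed 1 kHz sampling rate
delay = int(0.025 * fs)                 # 25 ms -> 25 samples
eeg = np.zeros((4, 200))                # 4 toy EEG channels
semg = np.ones((2, 200))                # 2 toy sEMG channels
X = build_fused_matrix(eeg, semg, delay)
```

Trimming both ends keeps the two modalities the same length so each column of the matrix corresponds to the same underlying movement instant.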
Affiliation(s)
- Xiaodong Zhang
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Shaanxi Key Laboratory of Intelligent Robots, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Wearable Human Enhancement Technology Innovation Center, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Hanzhe Li
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Wearable Human Enhancement Technology Innovation Center, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- *Correspondence: Hanzhe Li,
- Runlin Dong
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Zhufeng Lu
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Cunxin Li
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi, China
47
A hybrid data-driven online solar energy disaggregation system from the grid supply point. Complex Intell Syst 2022. [DOI: 10.1007/s40747-022-00842-2]
Abstract
The integration of small-scale Photovoltaics (PV) systems (such as rooftop PVs) decreases the visibility of power systems, since the real demand load is masked. Most rooftop systems are behind the metre and cannot be measured by household smart meters. To overcome the challenges mentioned above, this paper proposes an online solar energy disaggregation system to decouple the solar energy generated by rooftop PV systems and the ground truth demand load from net measurements. A 1D Convolutional Neural Network (CNN) Bidirectional Long Short-Term Memory (BiLSTM) deep learning method is used as the core algorithm of the proposed system. The system takes a wide range of online information (Advanced Metering Infrastructure (AMI) data, meteorological data, satellite-driven irradiance, and temporal information) as inputs to evaluate PV generation, and the system also enables online and offline modes. The effectiveness of the proposed algorithm is evaluated by comparing it to baselines. The results show that the proposed method achieves good performance under different penetration rates and different feeder levels. Finally, a transfer learning process is introduced to verify that the proposed system has good robustness and can be applied to other feeders.
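The disaggregation relationship at the heart of the system is simple to state: the meter sees net load = demand minus behind-the-meter PV generation, so an estimate of PV generation recovers the masked demand. The sketch below stubs out the model with a perfect PV estimate on toy data; the actual system would obtain that estimate from the 1D CNN-BiLSTM fed with AMI, weather, and irradiance inputs, and the toy demand/PV profiles are assumptions.

```python
import numpy as np

def recover_demand(net_load, pv_estimate):
    """Recover the underlying demand from the masked net measurement:
    net = demand - PV generation, hence demand = net + PV estimate."""
    return net_load + pv_estimate

# Toy day: true demand is a flat 3 kW; PV follows a half-sine that
# peaks at 2 kW around midday and is zero at night.
hours = np.arange(24)
pv_true = np.clip(2.0 * np.sin((hours - 6) * np.pi / 12), 0, None)
demand_true = np.full(24, 3.0)
net = demand_true - pv_true            # what the grid supply point sees
pv_hat = pv_true                       # stand-in for a perfect PV model
demand_hat = recover_demand(net, pv_hat)
```

In practice the quality of the recovered demand is bounded by the PV model's error, which is why the paper evaluates across penetration rates and feeder levels.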
48
Saurav S, Saini R, Singh S. Fast facial expression recognition using Boosted Histogram of Oriented Gradient (BHOG) features. Pattern Anal Appl 2022. [DOI: 10.1007/s10044-022-01112-0]
49
Bagadi KR, Sivappagari CMR. A robust feature selection method based on meta-heuristic optimization for speech emotion recognition. Evolutionary Intelligence 2022. [DOI: 10.1007/s12065-022-00772-5]
50
Xu X, Li D, Zhou Y, Wang Z. Multi-type features separating fusion learning for Speech Emotion Recognition. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109648]