1
|
Xing D, Wang Y, Sun P, Huang H, Lin E. A CNN-LSTM-att hybrid model for classification and evaluation of growth status under drought and heat stress in chinese fir (Cunninghamia lanceolata). PLANT METHODS 2023; 19:66. [PMID: 37400865 DOI: 10.1186/s13007-023-01044-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 06/22/2023] [Indexed: 07/05/2023]
Abstract
BACKGROUND Cunninghamia lanceolata (Chinese fir), is one of the most important timber trees in China. With the global warming, to develop new resistant varieties to drought or heat stress has become an essential task for breeders of Chinese fir. However, classification and evaluation of growth status of Chinese fir under drought or heat stress are still labor-intensive and time-consuming. RESULTS In this study, we proposed a CNN-LSTM-att hybrid model for classification of growth status of Chinese fir seedlings under drought and heat stress, respectively. Two RGB image datasets of Chinese fir seedling under drought and heat stress were generated for the first time, and utilized in this study. By comparing four base CNN models with LSTM, the Resnet50-LSTM was identified as the best model in classification of growth status, and LSTM would dramatically improve the classification performance. Moreover, attention mechanism further enhanced performance of Resnet50-LSTM, which was verified by Grad-CAM. By applying the established Resnet50-LSTM-att model, the accuracy rate and recall rate of classification was up to 96.91% and 96.79% for dataset of heat stress, and 96.05% and 95.88% for dataset of drought, respectively. Accordingly, the R2 value and RMSE value for evaluation on growth status under heat stress were 0.957 and 0.067, respectively. And, the R2 value and RMSE value for evaluation on growth status under drought were 0.944 and 0.076, respectively. CONCLUSION In summary, our proposed model provides an important tool for stress phenotyping in Chinese fir, which will be a great help for selection and breeding new resistant varieties in future.
Collapse
Affiliation(s)
- Dong Xing
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, 311300, Zhejiang, China
| | - Yulin Wang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, 311300, Zhejiang, China
| | - Penghui Sun
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, 311300, Zhejiang, China
| | - Huahong Huang
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, 311300, Zhejiang, China
| | - Erpei Lin
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou, 311300, Zhejiang, China.
| |
Collapse
|
2
|
Jiang Y, Song L, Zhang J, Song Y, Yan M. Multi-Category Gesture Recognition Modeling Based on sEMG and IMU Signals. SENSORS (BASEL, SWITZERLAND) 2022; 22:s22155855. [PMID: 35957417 PMCID: PMC9371015 DOI: 10.3390/s22155855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Revised: 08/01/2022] [Accepted: 08/02/2022] [Indexed: 05/14/2023]
Abstract
Gesture recognition based on wearable devices is one of the vital components of human-computer interaction systems. Compared with skeleton-based recognition in computer vision, gesture recognition using wearable sensors has attracted wide attention for its robustness and convenience. Recently, many studies have proposed deep learning methods based on surface electromyography (sEMG) signals for gesture classification; however, most of the existing datasets are built for surface EMG signals, and there is a lack of datasets for multi-category gestures. Due to model limitations and inadequate classification data, the recognition accuracy of these methods cannot satisfy multi-gesture interaction scenarios. In this paper, a multi-category dataset containing 20 gestures is recorded with the help of a wearable device that can acquire surface electromyographic and inertial (IMU) signals. Various two-stream deep learning models are established and improved further. The basic convolutional neural network (CNN), recurrent neural network (RNN), and Transformer models are experimented on with our dataset as the classifier. The CNN and the RNN models' test accuracy is over 95%; however, the Transformer model has a lower test accuracy of 71.68%. After further improvements, the CNN model is introduced into the residual network and augmented to the CNN-Res model, achieving 98.24% accuracy; moreover, it has the shortest training and testing time. Then, after combining the RNN model and the CNN-Res model, the long short term memory (LSTM)-Res model and gate recurrent unit (GRU)-Res model achieve the highest classification accuracy of 99.67% and 99.49%, respectively. Finally, the fusion of the Transformer model and the CNN model enables the Transformer-CNN model to be constructed. Such improvement dramatically boosts the performance of the Transformer module, increasing the recognition accuracy from 71.86% to 98.96%.
Collapse
Affiliation(s)
- Yujian Jiang
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China
- Key Laboratory of Acoustic Visual Technology and Intelligent Control System, Ministry of Culture and Tourism, Communication University of China, Beijing 100024, China
- Beijing Key Laboratory of Modern Entertainment Technology, Communication University of China, Beijing 100024, China
- School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
- Correspondence:
| | - Lin Song
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China
- Key Laboratory of Acoustic Visual Technology and Intelligent Control System, Ministry of Culture and Tourism, Communication University of China, Beijing 100024, China
- Beijing Key Laboratory of Modern Entertainment Technology, Communication University of China, Beijing 100024, China
- School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
| | - Junming Zhang
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China
- Key Laboratory of Acoustic Visual Technology and Intelligent Control System, Ministry of Culture and Tourism, Communication University of China, Beijing 100024, China
- Beijing Key Laboratory of Modern Entertainment Technology, Communication University of China, Beijing 100024, China
- School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
| | - Yang Song
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China
- Key Laboratory of Acoustic Visual Technology and Intelligent Control System, Ministry of Culture and Tourism, Communication University of China, Beijing 100024, China
- Beijing Key Laboratory of Modern Entertainment Technology, Communication University of China, Beijing 100024, China
- School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
| | - Ming Yan
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China
- Key Laboratory of Acoustic Visual Technology and Intelligent Control System, Ministry of Culture and Tourism, Communication University of China, Beijing 100024, China
- Beijing Key Laboratory of Modern Entertainment Technology, Communication University of China, Beijing 100024, China
- School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
| |
Collapse
|
3
|
Jin Y, Suzuki G, Shioya H. Detecting and Visualizing Stops in Dance Training by Neural Network Based on Velocity and Acceleration. SENSORS (BASEL, SWITZERLAND) 2022; 22:5402. [PMID: 35891082 PMCID: PMC9321875 DOI: 10.3390/s22145402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 07/13/2022] [Accepted: 07/15/2022] [Indexed: 06/15/2023]
Abstract
Various genres of dance, such as Yosakoi Soran, have contributed to the health of many people and contributed to their sense of belonging to a community. However, due to the effects of COVID-19, various face-to-face activities have been restricted and group dance practice has become difficult. Hence, there is a need to facilitate remote dance practice. In this paper, we propose a system for detecting and visualizing the very important dance motions known as stops. We measure dance movements by motion capture and calculate the features of each movement based on velocity and acceleration. Using a neural network to learn motion features, the system detects stops and visualizes them using a human-like 3D model. In an experiment using dance data, the proposed method obtained highly accurate stop detection results and demonstrated its effectiveness as an information and communication technology support for remote group dance practice.
Collapse
|
4
|
Hand Gesture Recognition on a Resource-Limited Interactive Wristband. SENSORS 2021; 21:s21175713. [PMID: 34502604 PMCID: PMC8434577 DOI: 10.3390/s21175713] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 07/26/2021] [Accepted: 08/01/2021] [Indexed: 11/17/2022]
Abstract
Most of the reported hand gesture recognition algorithms require high computational resources, i.e., fast MCU frequency and significant memory, which are highly inapplicable to the cost-effectiveness of consumer electronics products. This paper proposes a hand gesture recognition algorithm running on an interactive wristband, with computational resource requirements as low as Flash < 5 KB, RAM < 1 KB. Firstly, we calculated the three-axis linear acceleration by fusing accelerometer and gyroscope data with a complementary filter. Then, by recording the order of acceleration vectors crossing axes in the world coordinate frame, we defined a new feature code named axis-crossing code. Finally, we set templates for eight hand gestures to recognize new samples. We compared this algorithm's performance with the widely used dynamic time warping (DTW) algorithm and recurrent neural network (BiLSTM and GRU). The results show that the accuracies of the proposed algorithm and RNNs are higher than DTW and that the time cost of the proposed algorithm is much less than those of DTW and RNNs. The average recognition accuracy is 99.8% on the collected dataset and 97.1% in the actual user-independent case. In general, the proposed algorithm is suitable and competitive in consumer electronics. This work has been volume-produced and patent-granted.
Collapse
|
5
|
Narkhede P, Walambe R, Poddar S, Kotecha K. Incremental learning of LSTM framework for sensor fusion in attitude estimation. PeerJ Comput Sci 2021; 7:e662. [PMID: 34435103 PMCID: PMC8356651 DOI: 10.7717/peerj-cs.662] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 07/16/2021] [Indexed: 06/13/2023]
Abstract
This paper presents a novel method for attitude estimation of an object in 3D space by incremental learning of the Long-Short Term Memory (LSTM) network. Gyroscope, accelerometer, and magnetometer are few widely used sensors in attitude estimation applications. Traditionally, multi-sensor fusion methods such as the Extended Kalman Filter and Complementary Filter are employed to fuse the measurements from these sensors. However, these methods exhibit limitations in accounting for the uncertainty, unpredictability, and dynamic nature of the motion in real-world situations. In this paper, the inertial sensors data are fed to the LSTM network which are then updated incrementally to incorporate the dynamic changes in motion occurring in the run time. The robustness and efficiency of the proposed framework is demonstrated on the dataset collected from a commercially available inertial measurement unit. The proposed framework offers a significant improvement in the results compared to the traditional method, even in the case of a highly dynamic environment. The LSTM framework-based attitude estimation approach can be deployed on a standard AI-supported processing module for real-time applications.
Collapse
Affiliation(s)
- Parag Narkhede
- Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, Maharashtra, India
| | - Rahee Walambe
- Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International (Deemed University), Pune, Maharashtra, India
| | - Shashi Poddar
- Central Scientific Instruments Organisation, Council of Scientific and Industrial Research, Chandigarh, Chandigarh, India
| | - Ketan Kotecha
- Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International (Deemed University), Pune, Maharashtra, India
| |
Collapse
|
6
|
Moreira de Lima JM, Ugulino de Araújo FM. Industrial Semi-Supervised Dynamic Soft-Sensor Modeling Approach Based on Deep Relevant Representation Learning. SENSORS 2021; 21:s21103430. [PMID: 34069123 PMCID: PMC8156853 DOI: 10.3390/s21103430] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 03/30/2021] [Accepted: 04/02/2021] [Indexed: 11/16/2022]
Abstract
Soft sensors based on deep learning have been growing in industrial process applications, inferring hard-to-measure but crucial quality-related variables. However, applications may present strong non-linearity, dynamicity, and a lack of labeled data. To deal with the above-cited problems, the extraction of relevant features is becoming a field of interest in soft-sensing. A novel deep representative learning soft-sensor modeling approach is proposed based on stacked autoencoder (SAE), mutual information (MI), and long-short term memory (LSTM). SAE is trained layer by layer with MI evaluation performed between extracted features and targeted output to evaluate the relevance of learned representation in each layer. This approach highlights relevant information and eliminates irrelevant information from the current layer. Thus, deep output-related representative features are retrieved. In the supervised fine-tuning stage, an LSTM is coupled to the tail of the SAE to address system inherent dynamic behavior. Also, a k-fold cross-validation ensemble strategy is applied to enhance the soft-sensor reliability. Two real-world industrial non-linear processes are employed to evaluate the proposed method performance. The obtained results show improved prediction performance in comparison to other traditional and state-of-art methods. Compared to the other methods, the proposed model can generate more than 38.6% and 39.4% improvement of RMSE for the two analyzed industrial cases.
Collapse
|
7
|
Curreri F, Patanè L, Xibilia MG. RNN- and LSTM-Based Soft Sensors Transferability for an Industrial Process. SENSORS 2021; 21:s21030823. [PMID: 33530476 PMCID: PMC7865368 DOI: 10.3390/s21030823] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/21/2021] [Accepted: 01/22/2021] [Indexed: 01/20/2023]
Abstract
The design and application of Soft Sensors (SSs) in the process industry is a growing research field, which needs to mediate problems of model accuracy with data availability and computational complexity. Black-box machine learning (ML) methods are often used as an efficient tool to implement SSs. Many efforts are, however, required to properly select input variables, model class, model order and the needed hyperparameters. The aim of this work was to investigate the possibility to transfer the knowledge acquired in the design of a SS for a given process to a similar one. This has been approached as a transfer learning problem from a source to a target domain. The implementation of a transfer learning procedure allows to considerably reduce the computational time dedicated to the SS design procedure, leaving out many of the required phases. Two transfer learning methods have been proposed, evaluating their suitability to design SSs based on nonlinear dynamical models. Recurrent neural structures have been used to implement the SSs. In detail, recurrent neural networks and long short-term memory architectures have been compared in regard to their transferability. An industrial case of study has been considered, to evaluate the performance of the proposed procedures and the best compromise between SS performance and computational effort in transferring the model. The problem of labeled data scarcity in the target domain has been also discussed. The obtained results demonstrate the suitability of the proposed transfer learning methods in the design of nonlinear dynamical models for industrial systems.
Collapse
Affiliation(s)
- Francesco Curreri
- Department of Mathematics and Computer Science, University of Palermo, 90123 Palermo, Italy;
| | - Luca Patanè
- Department of Engineering, University of Messina, 98166 Messina, Italy;
| | | |
Collapse
|
8
|
Dalmazzo D, Waddell G, Ramírez R. Applying Deep Learning Techniques to Estimate Patterns of Musical Gesture. Front Psychol 2021; 11:575971. [PMID: 33469435 PMCID: PMC7813937 DOI: 10.3389/fpsyg.2020.575971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 11/23/2020] [Indexed: 11/30/2022] Open
Abstract
Repetitive practice is one of the most important factors in improving the performance of motor skills. This paper focuses on the analysis and classification of forearm gestures in the context of violin playing. We recorded five experts and three students performing eight traditional classical violin bow-strokes: martelé, staccato, detaché, ricochet, legato, trémolo, collé, and col legno. To record inertial motion information, we utilized the Myo sensor, which reports a multidimensional time-series signal. We synchronized inertial motion recordings with audio data to extract the spatiotemporal dynamics of each gesture. Applying state-of-the-art deep neural networks, we implemented and compared different architectures where convolutional neural networks (CNN) models demonstrated recognition rates of 97.147%, 3DMultiHeaded_CNN models showed rates of 98.553%, and rates of 99.234% were demonstrated by CNN_LSTM models. The collected data (quaternion of the bowing arm of a violinist) contained sufficient information to distinguish the bowing techniques studied, and deep learning methods were capable of learning the movement patterns that distinguish these techniques. Each of the learning algorithms investigated (CNN, 3DMultiHeaded_CNN, and CNN_LSTM) produced high classification accuracies which supported the feasibility of training classifiers. The resulting classifiers may provide the foundation of a digital assistant to enhance musicians' time spent practicing alone, providing real-time feedback on the accuracy and consistency of their musical gestures in performance.
Collapse
Affiliation(s)
- David Dalmazzo
- Music Technology Group, Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain
| | - George Waddell
- Centre for Performance Science, Royal College of Music, London, United Kingdom.,Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Rafael Ramírez
- Music Technology Group, Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
9
|
Mendes Junior JJA, Freitas MLB, Campos DP, Farinelli FA, Stevan SL, Pichorim SF. Analysis of Influence of Segmentation, Features, and Classification in sEMG Processing: A Case Study of Recognition of Brazilian Sign Language Alphabet. SENSORS (BASEL, SWITZERLAND) 2020; 20:E4359. [PMID: 32764286 PMCID: PMC7471999 DOI: 10.3390/s20164359] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 07/26/2020] [Accepted: 08/01/2020] [Indexed: 11/17/2022]
Abstract
Sign Language recognition systems aid communication among deaf people, hearing impaired people, and speakers. One of the types of signals that has seen increased studies and that can be used as input for these systems is surface electromyography (sEMG). This work presents the recognition of a set of alphabet gestures from Brazilian Sign Language (Libras) using sEMG acquired from an armband. Only sEMG signals were used as input. Signals from 12 subjects were acquired using a MyoTM armband for the 26 signs of the Libras alphabet. Additionally, as the sEMG has several signal processing parameters, the influence of segmentation, feature extraction, and classification was considered at each step of the pattern recognition. In segmentation, window length and the presence of four levels of overlap rates were analyzed, as well as the contribution of each feature, the literature feature sets, and new feature sets proposed for different classifiers. We found that the overlap rate had a high influence on this task. Accuracies in the order of 99% were achieved for the following factors: segments of 1.75 s with a 12.5% overlap rate; the proposed set of four features; and random forest (RF) classifiers.
Collapse
Affiliation(s)
- José Jair Alves Mendes Junior
- Graduate Program in Electrical Engineering and Industrial Informatics (CPGEI), Federal University of Technology–Paraná (UTFPR), Curitiba (PR) 80230-901, Brazil; (J.J.A.M.J.); (F.A.F.); (S.F.P.)
| | - Melissa La Banca Freitas
- Graduate Program in Electrical Engineering (PPGEE), Federal University of Technology–Paraná (UTFPR), Ponta Grossa (PR) 84017-220, Brazil;
| | - Daniel Prado Campos
- Graduate Program in Biomedical Engineering (PPGEB), Federal University of Technology–Paraná (UTFPR), Curitiba (PR) 80230-901, Brazil;
| | - Felipe Adalberto Farinelli
- Graduate Program in Electrical Engineering and Industrial Informatics (CPGEI), Federal University of Technology–Paraná (UTFPR), Curitiba (PR) 80230-901, Brazil; (J.J.A.M.J.); (F.A.F.); (S.F.P.)
| | - Sergio Luiz Stevan
- Graduate Program in Electrical Engineering (PPGEE), Federal University of Technology–Paraná (UTFPR), Ponta Grossa (PR) 84017-220, Brazil;
| | - Sérgio Francisco Pichorim
- Graduate Program in Electrical Engineering and Industrial Informatics (CPGEI), Federal University of Technology–Paraná (UTFPR), Curitiba (PR) 80230-901, Brazil; (J.J.A.M.J.); (F.A.F.); (S.F.P.)
| |
Collapse
|
10
|
Yang L, Pan Y, Lin J, Liu Y, Shang Y, Yang S, Cao H. Automatic Guidance Method for Laser Tracker Based on Rotary-Laser Scanning Angle Measurement. SENSORS (BASEL, SWITZERLAND) 2020; 20:s20154168. [PMID: 32727122 PMCID: PMC7436201 DOI: 10.3390/s20154168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Revised: 07/17/2020] [Accepted: 07/21/2020] [Indexed: 06/11/2023]
Abstract
Laser-tracking measurement systems (laser tracker) have been playing a critical role in large-scale 3D high-precision coordinate measurement. However, the existing visual guidance of laser trackers is still limited by the disadvantages of operator-dependence, small-angle view field, time-consuming laser-guided process. This paper presents an automatic guidance method for laser trackers based on the rotary-laser scanning angle measurement technology. In this method, a special target consisting of six photoelectric receivers and a retroreflector is integrated into the rotary-laser scanning transmitter' coordinate systems. Real-time constraints calculated by the proposed method would provide the coordinates of the target in a laser tracker coordinates system for guidance. Finally, the experimental results verified the automatic re-establish of sightline can be realized in horizontal 360° angle field within tens of arc-seconds, and this method is robust against the fast movement of the target.
Collapse
Affiliation(s)
| | | | - Jiarui Lin
- Correspondence: ; Tel.: +86-022-2740-6643
| | | | | | | | | |
Collapse
|