1
|
Fonseca J, Liu X, Oliveira HP, Pereira T. Mortality prediction using medical time series on TBI patients. Comput Methods Programs Biomed 2023; 242:107806. [PMID: 37832428 DOI: 10.1016/j.cmpb.2023.107806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 07/29/2023] [Accepted: 09/08/2023] [Indexed: 10/15/2023]
Abstract
BACKGROUND AND OBJECTIVE Traumatic Brain Injury (TBI) is one of the leading causes of injury-related mortality in the world, with severe cases reaching mortality rates of 30-40%. It is highly heterogeneous both in causes and consequences making more complex the medical interpretation and prognosis. Gathering clinical, demographic, and laboratory data to perform a prognosis requires time and skill in several clinical specialties. Artificial intelligence (AI) methods can take advantage of existing data by performing helpful predictions and guiding physicians toward a better prognosis and, consequently, better healthcare. The objective of this work was to develop learning models and evaluate their capability of predicting the mortality of TBI. The predictive model would allow the early assessment of the more serious cases and scarce medical resources can be pointed toward the patients who need them most. METHODS Long Short Term Memory (LSTM) and Transformer architectures were tested and compared in performance, coupled with data imbalance, missing data, and feature selection strategies. From the Medical Information Mart for Intensive Care III (MIMIC-III) dataset, a cohort of TBI patients was selected and an analysis of the first 48 hours of multiple time series sequential variables was done to predict hospital mortality. RESULTS The best performance was obtained with the Transformer architecture, achieving an AUC of 0.907 with the larger group of features and trained with class proportion class weights and binary cross entropy loss. CONCLUSIONS Using the time series sequential data, LSTM and Transformers proved to be both viable options for predicting TBI hospital mortality in 48 hours after admission. Overall, using sequential deep learning models with time series data to predict TBI mortality is viable and can be used as a helpful indicator of the well-being of patients.
Collapse
Affiliation(s)
- João Fonseca
- INESC TEC - Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal; FEUP - Faculty of Engineering, University of Porto, Porto, Portugal
| | - Xiuyun Liu
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China; Department of Biomedical Engineering, College of Precision Instruments and Optoelectronics Engineering, Tianjin, China
| | - Hélder P Oliveira
- INESC TEC - Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal; FCUP - Faculty of Science, University of Porto, Porto, Portugal
| | - Tania Pereira
- INESC TEC - Institute for Systems and Computer Engineering, Technology and Science, Porto, Portugal; FEUP - Faculty of Engineering, University of Porto, Porto, Portugal.
| |
Collapse
|
2
|
Liao H, Yuan L, Wu M, Chen H. Air quality prediction by integrating mechanism model and machine learning model. Sci Total Environ 2023; 899:165646. [PMID: 37474048 DOI: 10.1016/j.scitotenv.2023.165646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 06/26/2023] [Accepted: 07/17/2023] [Indexed: 07/22/2023]
Abstract
AQP (Air Quality Prediction) is a very challenging project, and its core issue is how to solve the interaction and influence among meteorological, spatial and temporal factors. To address this central conundrum, we make full use of the characteristics of mechanism model and machine learning and propose a new AQP method based on DM_STGNN (Dynamic Multi-granularity Spatio-temporal Graph Neural Network). This method is the first time to use the air quality model HYSPLIT (Hybrid Single-Particle Lagrangian Integrated Trajectory Model) to assist in building a dynamic spatio-temporal graph structure to learn the spatiotemporal relationship of pollutants. DM_STGNN is based on an elaborate encoder-decoder architecture. At the encoder, in order to better mine the spatial dependency, we built a multi-granularity graph structure, used meteorological, time and geographical features to establish node attributes, used well-known HYSPLIT model to dynamically establish the edges among nodes, and used LSTM (Long Short Term Memory) to learn the time-series relationship of pollutant concentrations. At the decoder, in order to better mine the temporal dependency, we built an attention mechanism based LSTM for decoding and AQP. Additionally, in order to efficiently learn the temporal patterns from very long-term historical time series and generate rich contextual information, an unsupervised pre-training model is used to enhance DM_STGNN. The proposed model makes full use of and fully considers the influence of meteorological, spatial and temporal factors, and integrates the advantages of mechanism model and machine learning. On a project-based dataset, we validate the effectiveness of the proposed model and examine its abilities of capturing both fine-grained and long-term influences in AQP. We also compare the proposed model with the state-of-the-art AQP methods on the dataset of Yangtze River Delta city group, the experimental results show the appealing performance of our model over competitive baselines.
Collapse
Affiliation(s)
- Haibin Liao
- School of Electronic and Electrical Engineering, Wuhan Textile University, Wuhan 430200, PR China
| | - Li Yuan
- School of Electronic and Electrical Engineering, Wuhan Textile University, Wuhan 430200, PR China
| | - Mou Wu
- School of Computer Science and Technology, Hubei University of Science and Technology, Xianning 437100, PR China; Laboratory of Optoelectronic Information and Intelligent Control, Hubei University of Science and Technology, Xianning 437100, PR China.
| | - Hongsheng Chen
- School of Computer Science and Technology, Hubei University of Science and Technology, Xianning 437100, PR China
| |
Collapse
|
3
|
K IA, V R, N V, Rajasri P. Solar forecasting for a PV-battery powered DC system. Heliyon 2023; 9:e20667. [PMID: 37860506 PMCID: PMC10582304 DOI: 10.1016/j.heliyon.2023.e20667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 09/22/2023] [Accepted: 10/03/2023] [Indexed: 10/21/2023] Open
Abstract
The photovoltaic (PV) power generation sector has been growing rapidly as a result of the rising need for solar power and the advancement of PV technology. PV Power generation is affected by weather factors such as cloud cover, solar irradiation, temperature, breeze direction and speed, and the amount of rain or snow. As a result, a highly precise PV power predictor is essential to improve security and reliability in the face of financial penalties and ambiguity. Hence, this paper suggests a novel approach to improve the efficiency of PV-battery-powered DC systems by combining solar irradiance prediction using the Long Short-Term Memory (LSTM) algorithm with a power electronic converter design that incorporates a bidirectional port. The LSTM algorithm was employed to predict one week of solar data with a remarkable R2 score of 0.96. A steady-state analysis of the proposed Three-Port Converter (TPC) is performed for five different operating modes to guarantee optimal performance. The suggested system's prediction performance was tested using several error metrics such as Mean Absolute Error (MAE), Mean Square Error (MSE), and Root Mean Squared Error (RMSE), which were computed as 0.0318, 0.0027, and 0.0526, respectively. Results from the above error measures show that the suggested approach performs more effectively in estimating solar irradiance. The Adaptive Neural-Fuzzy Interface System (ANFIS) and Incremental Conductance (IC) algorithms are employed for Maximum Power Point Tracking (MPPT) and assessed against various atmospheric conditions. From the MATLAB simulation results, the tracking efficiency of the ANFIS-based MPPT technique is 99.97 %, which is superior to the IC-based MPPT technique. Furthermore, it proved that the suggested approach improves the efficiency of PV-battery-powered DC systems, which is more appropriate for real-world applications such as DC microgrids and Electric Vehicles.
Collapse
Affiliation(s)
- Iyswarya Annapoorani K
- Centre for Automation and School of Electrical Engineering, Vellore Institute of Technology, Chennai, India
| | - Rajaguru V
- School of Electrical Engineering, Vellore Institute of Technology, Chennai, India
| | - Vedanjali N
- School of Electrical Engineering, Vellore Institute of Technology, Chennai, India
| | - Pappula Rajasri
- School of Electrical Engineering, Vellore Institute of Technology, Chennai, India
| |
Collapse
|
4
|
Tan S, Liu G. A low carbon management model for regional energy economies based on blockchain technology. Heliyon 2023; 9:e19966. [PMID: 37809418 PMCID: PMC10559551 DOI: 10.1016/j.heliyon.2023.e19966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 08/23/2023] [Accepted: 09/07/2023] [Indexed: 10/10/2023] Open
Abstract
As the issue of sustainable energy development becomes more and more important in national economic construction, the potential dangers of climate change are gradually attracting widespread attention from countries around the world. In order to better carry out the low-carbon management of the regional energy economy, based on the analysis of the characteristics of blockchain technology, the present study utilized this technology to achieve intelligent and digital management of carbon emissions, and established a carbon emission prediction system. The cuckoo algorithm is used to improve the long-term memory network, and the improved algorithm is used in carbon emission prediction and management. The experimental results show that the improved Long Short Term Memory networks are close to the target precision in 240 iterations, and the convergence speed is fast. In the short-term regional carbon emission prediction, the average absolute error of the method is only 2%, which is highly consistent with the actual carbon emission. In the long-term carbon emission prediction, the average prediction accuracy of the upgraded long-term short-term memory networks can reach 97.26%, and the running time is only 19.46s. With high precision and running efficiency, the upgraded Long Short Term Memory networks can efficiently monitor regional carbon emission and provide a technical reference for the low-carbon management of the regional power industry.
Collapse
Affiliation(s)
- Siyue Tan
- Adam Smith School of Business, University of Glasgow, Glasgow, G12 8QQ, United Kingdom
| | - Guangmin Liu
- Department of Accounting, School of Economics and Management, Harbin University, Harbin, 150086, China
| |
Collapse
|
5
|
Dey RK, Das AK. Modified term frequency-inverse document frequency based deep hybrid framework for sentiment analysis. Multimed Tools Appl 2023; 82:1-24. [PMID: 37362742 PMCID: PMC9985492 DOI: 10.1007/s11042-023-14653-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 04/12/2022] [Accepted: 02/03/2023] [Indexed: 06/28/2023]
Abstract
Sentiment Analysis is a highly crucial subfield in Natural Language Processing that attempts to extract the public sentiment from the accessible user opinions. This paper proposes a hybridized neural network based sentiment analysis framework using a modified term frequency-inverse document frequency approach. After preprocessing of data, the basic term frequency-inverse document frequency scheme is improved by introducing a non-linear global weighting factor. This improved scheme is combined with the k-best selection method to vectorize textual features. Next, the pre-trained embedding technique is employed for the mathematical representation of the textual features to process them efficiently by the Deep Learning methodologies. The embedded features are then passed to the deep neural network, consisting of Convolutional Neural Network and Long Short Term Memory. Convolutional Neural Networks can build hierarchical representations for capturing locally embedded features within the feature space, and Long Short Term Memory tries to recall useful historical information for sentiment polarization. This deep neural network finally provides the sentiment label. The proposed model is compared with different state-of-the-art baseline models in terms of various performance metrics using several datasets to demonstrate its efficacy.
Collapse
Affiliation(s)
- Ranit Kumar Dey
- Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, 711103 West Bengal India
| | - Asit Kumar Das
- Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, 711103 West Bengal India
| |
Collapse
|
6
|
Sarkar N, Gupta R, Keserwani PK, Govil MC. Air Quality Index prediction using an effective hybrid deep learning model. Environ Pollut 2022; 315:120404. [PMID: 36240962 DOI: 10.1016/j.envpol.2022.120404] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 09/27/2022] [Accepted: 10/06/2022] [Indexed: 06/16/2023]
Abstract
Environmentalism has become an intrinsic part of everyday life. One of the greatest challenge to the environment's long-term existence is the air pollution. Delhi, the capital of India, has experienced decreasing of air quality for several years. The poor air quality has a significant impact on the lives of individuals. Air Quality Index (AQI) prediction can help to its beneficiaries in taking safeguards about their health before moving to any polluted area. In this study, a variety of data forecasting approaches is evaluated to predict the AQI value for Particulate Matter (PM2.5) μm at a particular area of Delhi and several error-prone strategies such as R-Squared (R2), Mean Absolute Error (MAE), and Root Mean Square Error (RMSE) methods are catalogued. In the proposed approach two deep learning models like Long-Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) are combined to predict the AQI of the environment. Several stand alone machine learning (ML) and deep learning (DL) models such as LSTM, Linear-Regression (LR), GRU, K-Nearest Neighbor (KNN) and Support Vector Machine (SVM) are also trained on the same dataset to compare their performances with the proposed hybrid (LSTM-GRU) model and it is found that the proposed hybrid model shows supremacy in the performance with the MAE value 36.11 and R2 value 0.84.
Collapse
Affiliation(s)
- Nairita Sarkar
- Computer Science and Engineering Department, National Institute of Technology Sikkim, South Sikkim, Ravangla, Sikkim, India.
| | - Rajan Gupta
- Computer Science and Engineering Department, National Institute of Technology Sikkim, South Sikkim, Ravangla, Sikkim, India.
| | - Pankaj Kumar Keserwani
- Computer Science and Engineering Department, National Institute of Technology Sikkim, South Sikkim, Ravangla, Sikkim, India.
| | - Mahesh Chandra Govil
- Computer Science and Engineering Department, National Institute of Technology Sikkim, South Sikkim, Ravangla, Sikkim, India.
| |
Collapse
|
7
|
Shambour MK. Analyzing perceptions of a global event using CNN-LSTM deep learning approach: the case of Hajj 1442 (2021). PeerJ Comput Sci 2022; 8:e1087. [PMID: 36262123 PMCID: PMC9575857 DOI: 10.7717/peerj-cs.1087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Accepted: 08/16/2022] [Indexed: 06/16/2023]
Abstract
Hajj (pilgrimage) is a unique social and religious event in which many Muslims worldwide come to perform Hajj. More than two million people travel to Makkah, Saudi Arabia annually to perform various Hajj rituals for four to five days. However, given the recent outbreak of the coronavirus (COVID-19) and its variants, Hajj in the last 2 years 2020-2021 has been different because pilgrims were limited down to a few thousand to control and prevent the spread of COVID-19. This study employs a deep learning approach to investigate the impressions of pilgrims and others from within and outside the Makkah community during the 1442 AH Hajj season. Approximately 4,300 Hajj-related posts and interactions were collected from social media channels, such as Twitter and YouTube, during the Hajj season Dhul-Hijjah 1-13, 1442 (July 11-23, 2021). Convolutional neural networks (CNNs) and long short-term memory (LSTM) deep learning methods were utilized to investigate people's impressions from the collected data. The CNN-LSTM approach showed superior performance results compared with other widely used classification models in terms of F-score and accuracy. Findings revealed significantly positive sentiment rates for tweets collected from Mina and Arafa holy sites, with ratios exceeding 4 out of 5. Furthermore, the sentiment analysis (SA) rates for tweets about Hajj and pilgrims varied during the days of Hajj. Some were classified as positive tweets, such as describing joy at receiving the days of Hajj, and some were negative tweets, such as expressing the impression about the hot weather and the level of satisfaction for some services. Moreover, the SA of comments on several YouTube videos revealed positive classified comments, including praise and supplications, and negative classified comments, such as expressing regret that the Hajj was limited to a small number of pilgrims.
Collapse
|
8
|
Liu M, Feng J, Wang Y, Li Z. Classification of overlapping spikes using convolutional neural networks and long short term memory. Comput Biol Med 2022; 148:105888. [PMID: 35872414 DOI: 10.1016/j.compbiomed.2022.105888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 06/29/2022] [Accepted: 07/16/2022] [Indexed: 11/21/2022]
Abstract
Spike sorting is one of the key techniques to understand brain activity. In this paper, we propose a novel deep learning approach based on convolutional neural networks (CNN) and long short term memory (LSTM) to implement overlapping spike sorting. The results of the simulated data demonstrated that the clustering accuracy was greater than 99.9% and 99.0% for non-overlapping spikes and overlapping spikes, respectively. Moreover, the proposed method performed better than our previous deep learning approach named "1D-CNN". In addition, the experimental data recorded from the primary visual cortex of a macaque monkey were used to evaluate the proposed method in a practical application. It was shown that the method could successfully isolate most overlapping spikes of different neurons (ranging from two to five). In summary, the CNN + LSTM method proposed in this paper is of great advantage for overlapping spike sorting with high accuracy. It lays the foundation for application in more challenging works, such as distinguishing the simultaneous recordings of multichannel neuronal activities.
Collapse
|
9
|
Shah A, Gor M, Sagar M, Shah M. A stock market trading framework based on deep learning architectures. Multimed Tools Appl 2022; 81:14153-14171. [PMID: 35233176 PMCID: PMC8874743 DOI: 10.1007/s11042-022-12328-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Revised: 01/14/2022] [Accepted: 01/18/2022] [Indexed: 06/14/2023]
Abstract
Market prediction has been a key interest for professionals around the world. Numerous modern technologies have been applied in addition to statistical models over the years. Among the modern technologies, machine learning and in general artificial intelligence have been at the core of numerous market prediction models. Deep learning techniques in particular have been successful in modeling the market movements. It is seen that automatic feature extraction models and time series forecasting techniques have been investigated separately however a stacked framework with a variety of inputs is not explored in detail. In the present article, we suggest a framework based on a convolutional neural network (CNN) paired with long-short term memory (LSTM) to predict the closing price of the Nifty 50 stock market index. A CNN-LSTM framework extracts features from a rich feature set and applies time series modeling with a look-up period of 20 trading days to predict the movement of the next day. Feature sets include raw price data of target index as well as foreign indices, technical indicators, currency exchange rates, commodities price data which are all chosen by similarities and well-known trade setups across the industry. The model is able to capture the information based on these features to predict the target variable i.e. closing price with a mean absolute percentage error of 2.54% across 10 years of data. The suggested framework shows a huge improvement on return than the traditional buy and hold method.
Collapse
Affiliation(s)
- Atharva Shah
- Department of Mechanical Engineering, Nirma University, Ahmedabad, India
| | - Maharshi Gor
- Software Engineer at Quinbay Technology, Bangalore, India
| | - Meet Sagar
- Data Science associate at ZS, Pune, India
| | - Manan Shah
- Department of Chemical Engineering, School of Technology, Pandit Deendayal Petroleum University, Gandhinagar, India
| |
Collapse
|
10
|
Singla P, Duhan M, Saroha S. An ensemble method to forecast 24-h ahead solar irradiance using wavelet decomposition and BiLSTM deep learning network. Earth Sci Inform 2022; 15:291-306. [PMID: 34804244 PMCID: PMC8596364 DOI: 10.1007/s12145-021-00723-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Accepted: 10/28/2021] [Indexed: 05/17/2023]
Abstract
In recent years, the penetration of solar power at residential and utility levels has progressed exponentially. However, due to its stochastic nature, the prediction of solar global horizontal irradiance (GHI) with higher accuracy is a challenging task; but, vital for grid management: planning, scheduling & balancing. Therefore, this paper proposes an ensemble model using the extended scope of wavelet transform (WT) and bidirectional long short term memory (BiLSTM) deep learning network to forecast 24-h ahead solar GHI. The WT decomposes the input time series data into different finite intrinsic model functions (IMF) to extract the statistical features of input time series. Further, the study reduces the number of IMF series by combining the wavelet decomposed components (D1-D6) series on the basis of comprehensive experimental analysis with an aim to improve the forecasting accuracy. Next, the trained standalone BiLSTM networks are allocated to each IMF sub-series to execute the forecasting. Finally, the forecasted values of each sub-series from BiLSTM networks are reconstructed to deliver the final solar GHI forecast. The study performed monthly solar GHI forecasting for one year dataset using one month moving window mechanism for the location of Ahmedabad, Gujarat, India. For the performance comparison, the naïve predictor as a benchmark model, standalone long short term memory (LSTM), gated recurrent unit (GRU), BiLSTM and two other wavelet-based BiLSTM models are also simulated. From the results, it is observed that the proposed model outperforms other models in terms of root mean square error (RMSE) & mean absolute percentage error (MAPE), coefficient of determination (R2) and forecast skill (FS). The proposed model reduces the monthly average RMSE by range from 26.04-58.89%, 5.17-31.35%, 23.26-56.06% & 21.08-57% in comparison with benchmark, standalone BiLSTM, GRU & LSTM networks respectively. On the other hand, the monthly average MAPE is reduced by range from 9 to 51.18%, 12.59-28.14%, 30.43-59.19% & 26.54-58.92% in comparison to benchmark, standalone BiLSTM, GRU & LSTM respectively. Further, the proposed model obtained the value of R2 equal to 0.94 and forecast skill (%) of 47% with reference to the benchmark model.
Collapse
Affiliation(s)
- Pardeep Singla
- Deenbandhu Chhotu Ram University of Science & Technology, Sonepat, India
| | - Manoj Duhan
- Deenbandhu Chhotu Ram University of Science & Technology, Sonepat, India
| | - Sumit Saroha
- Guru Jambheshwar University of Science and Technology, Hisar, India
| |
Collapse
|
11
|
Miao Z, Wang Q, Xiao X, Kamal GM, Song L, Zhang X, Li C, Zhou X, Jiang B, Liu M. CSI-LSTM: a web server to predict protein secondary structure using bidirectional long short term memory and NMR chemical shifts. J Biomol NMR 2021; 75:393-400. [PMID: 34510297 DOI: 10.1007/s10858-021-00383-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 09/06/2021] [Indexed: 06/13/2023]
Abstract
Protein secondary structure provides rich structural information, hence the description and understanding of protein structure relies heavily on it. Identification or prediction of secondary structures therefore plays an important role in protein research. In protein NMR studies, it is more convenient to predict secondary structures from chemical shifts as compared to the traditional determination methods based on inter-nuclear distances provided by NOESY experiment. In recent years, there was a significant improvement observed in deep neural networks, which had been applied in many research fields. Here we proposed a deep neural network based on bidirectional long short term memory (biLSTM) to predict protein 3-state secondary structure using NMR chemical shifts of backbone nuclei. While comparing with the existing methods the proposed method showed better prediction accuracy. Based on the proposed method, a web server has been built to provide protein secondary structure prediction service.
Collapse
Affiliation(s)
- Zhiwei Miao
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China
| | - Qianqian Wang
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China
| | - Xiongjie Xiao
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China
| | - Ghulam Mustafa Kamal
- Department of Chemistry, Khwaja Fareed University of Engineering & Information Technology, Rahim Yar Khan, Punjab, 64200, Pakistan
| | - Linhong Song
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, 10049, China
| | - Xu Zhang
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, 10049, China
| | - Conggang Li
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, 10049, China
| | - Xin Zhou
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, 10049, China
| | - Bin Jiang
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China.
- University of Chinese Academy of Sciences, Beijing, 10049, China.
| | - Maili Liu
- Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Center for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, 430071, Wuhan, China.
- University of Chinese Academy of Sciences, Beijing, 10049, China.
| |
Collapse
|
12
|
Lin X, Zhu X, Feng M, Han Y, Geng Z. Economy and carbon emissions optimization of different countries or areas in the world using an improved Attention mechanism based long short term memory neural network. Sci Total Environ 2021; 792:148444. [PMID: 34153753 DOI: 10.1016/j.scitotenv.2021.148444] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 06/03/2021] [Accepted: 06/09/2021] [Indexed: 06/13/2023]
Abstract
The combustion of fossil fuels produces a large amount of carbon dioxide (CO2), which leads to global warming in the world. How to rationally consume fossil energy and control CO2 emissions has become an unavoidable problem for human beings while vigorously developing economy. This paper proposes a novel economy and CO2 emissions prediction model using an improved Attention mechanism based long short term memory (LSTM) neural network (Attention-LSTM) to analyze and optimize the energy consumption structures in different countries or areas. The Attention mechanism can add the weight of different inputs in the previous information or related factors to realize the indirect correlation between output and related inputs of the LSTM. Therefore, the Attention-LSTM can allocate more computing resources to the parts with a higher relevance of correlation in the case of limited computing power. Through inputs with the consumption of oil, natural gas, coal, hydroelectricity and renewable energy, the desirable output with the per capita gross domestic product (GDP) and the undesirable output with CO2 emissions prediction model of different countries and areas is established based on the Attention-LSTM. The experimental results show that compared with the normal LSTM, the back propagation (BP), the radial basis function (RBF) and the extreme learning machine (ELM) neural networks, the Attention-LSTM is more accurate and practical. Meanwhile, the proposed model provides guidance for optimizing energy structures to develop economy and reasonably control CO2 emissions.
Collapse
Affiliation(s)
- Xiaoyong Lin
- College of Information Science & Technology, Beijing University of Chemical Technology, Beijing, China; Engineering Research Center of Intelligent PSE, Ministry of Education in China, Beijing, China
| | - Xiaopeng Zhu
- College of Information Science & Technology, Beijing University of Chemical Technology, Beijing, China; Engineering Research Center of Intelligent PSE, Ministry of Education in China, Beijing, China
| | - Mingfei Feng
- University of International Business and Economics, Beijing, China
| | - Yongming Han
- College of Information Science & Technology, Beijing University of Chemical Technology, Beijing, China; Engineering Research Center of Intelligent PSE, Ministry of Education in China, Beijing, China.
| | - Zhiqiang Geng
- College of Information Science & Technology, Beijing University of Chemical Technology, Beijing, China; Engineering Research Center of Intelligent PSE, Ministry of Education in China, Beijing, China.
| |
Collapse
|
13
|
Zulqarnain M, Khalaf Zager Alsaedi A, Ghazali R, Ghouse MG, Sharif W, Aida Husaini N. A comparative analysis on question classification task based on deep learning approaches. PeerJ Comput Sci 2021; 7:e570. [PMID: 34435091 PMCID: PMC8356656 DOI: 10.7717/peerj-cs.570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 05/10/2021] [Indexed: 06/13/2023]
Abstract
Question classification is one of the essential tasks for automatic question answering implementation in natural language processing (NLP). Recently, there have been several text-mining issues such as text classification, document categorization, web mining, sentiment analysis, and spam filtering that have been successfully achieved by deep learning approaches. In this study, we illustrated and investigated our work on certain deep learning approaches for question classification tasks in an extremely inflected Turkish language. In this study, we trained and tested the deep learning architectures on the questions dataset in Turkish. In addition to this, we used three main deep learning approaches (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN)) and we also applied two different deep learning combinations of CNN-GRU and CNN-LSTM architectures. Furthermore, we applied the Word2vec technique with both skip-gram and CBOW methods for word embedding with various vector sizes on a large corpus composed of user questions. By comparing analysis, we conducted an experiment on deep learning architectures based on test and 10-cross fold validation accuracy. Experiment results were obtained to illustrate the effectiveness of various Word2vec techniques that have a considerable impact on the accuracy rate using different deep learning approaches. We attained an accuracy of 93.7% by using these techniques on the question dataset.
Collapse
Affiliation(s)
- Muhammad Zulqarnain
- Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia (UTHM), Batu Pahat, Johor, Malaysia
| | | | - Rozaida Ghazali
- Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia (UTHM), Batu Pahat, Johor, Malaysia
| | - Muhammad Ghulam Ghouse
- Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia (UTHM), Batu Pahat, Johor, Malaysia
| | - Wareesa Sharif
- Faculty of Computing, The Islamia University Bahawalpur, Bahawalpur, Punjab, Pakistan
| | - Noor Aida Husaini
- Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia (UTHM), Batu Pahat, Johor, Malaysia
| |
Collapse
|
14
|
Khan P, Khan Y, Kumar S, Khan MS, Gandomi AH. HVD-LSTM based recognition of epileptic seizures and normal human activity. Comput Biol Med 2021; 136:104684. [PMID: 34332352 DOI: 10.1016/j.compbiomed.2021.104684] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 07/21/2021] [Accepted: 07/22/2021] [Indexed: 11/26/2022]
Abstract
In this paper, we detect the occurrence of epileptic seizures in patients as well as activities namely stand, walk, and exercise in healthy persons, leveraging EEG (electroencephalogram) signals. Using Hilbert vibration decomposition (HVD) on non-linear and non-stationary EEG signal, we obtain multiple monocomponents varying in terms of amplitude and frequency. After decomposition, we extract features from the monocomponent matrix of the EEG signals. The instantaneous amplitude of the HVD monocomponents varies because of the motion artifacts present in EEG signals. Hence, the acquired statistical features from the instantaneous amplitude help in identifying the epileptic seizures and the normal human activities. The features selected by correlation-based Q-score are classified using an LSTM (Long Short Term Memory) based deep learning model in which the feature-based weight update maximizes the classification accuracy. For epilepsy diagnosis using the Bonn dataset and activity recognition leveraging our Sensor Networks Research Lab (SNRL) data, we achieve testing classification accuracies of 96.00% and 83.30% respectively through our proposed method.
Collapse
Affiliation(s)
- Pritam Khan
- Department of Electrical Engineering, Indian Institute of Technology Patna, Bihar, 801106, India.
| | - Yasin Khan
- Department of Electrical Engineering, Indian Institute of Technology Patna, Bihar, 801106, India.
| | - Sudhir Kumar
- Department of Electrical Engineering, Indian Institute of Technology Patna, Bihar, 801106, India.
| | - Mohammad S Khan
- Department of Computer & Information Sciences, East Tennessee State University, Johnson City, TN, 37614-1266, USA.
| | - Amir H Gandomi
- Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW, 2007, Australia.
| |
Collapse
|
15
|
Quddus A, Shahidi Zandi A, Prest L, Comeau FJE. Using long short term memory and convolutional neural networks for driver drowsiness detection. Accid Anal Prev 2021; 156:106107. [PMID: 33848710 DOI: 10.1016/j.aap.2021.106107] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 07/19/2020] [Accepted: 03/27/2021] [Indexed: 06/12/2023]
Abstract
Fatigue negatively affects the safety and performance of drivers on the road. In fact, drowsiness and fatigue are the cause of a substantial number of motor vehicle accidents. Drowsiness among the drivers can be detected using variety of modalities, including electroencephalogram (EEG), eye movement, and vehicle driving dynamics. Among these EEG is highly accurate but very intrusive and cumbersome. On the other hand, vehicle driving dynamics are very easy to acquire but accuracy is not very high. Eye movement based approach is very attractive in terms of balance between these two extremes. However, eye movement based techniques normally require an eye tracking device which consists of high speed camera with sophisticated algorithm to extract eye movement related parameters such as blinking, eye closure, saccades, fixation etc. This makes eye tracking based drowsiness detection difficult to implement as a practical system, especially on an embedded platform. In this paper, authors propose to use eye images from camera directly without the need for expensive eye-tracking system. Here, eye related movements are captured by Recurrent Neural Network (RNN) to detect the drowsiness. Long Short Term Memory (LSTM) is a class of RNN which has several advantages over vanilla RNNs. In this work an array of LSTM cells are utilized to model the eye movements. Two types of LSTMs were employed: 1-D LSTM (R-LSTM) which is used as baseline and the convolutional LSTM (C-LSTM) which facilitates using 2-D images directly. Patches of size 48 × 48 around each eye were extracted from 38 subjects, participating in a simulated driving experiment. The state of vigilance among the subjects were independently assessed by power spectral analysis of multichannel electroencephalogram (EEG) signals, recorded simultaneously, and binary labels of alert and drowsy (baseline) were generated. Results show high efficacy of the proposed system. R-LSTM based approach resulted in accuracy around 82 % and C-LSTM based approach resulted in accuracy in the range of 95%-97%. Comparison is also provided with a recently published eye-tracking based approach, showing the proposed LSTM technique outperform with a wide margin.
Collapse
Affiliation(s)
| | - Ali Shahidi Zandi
- Alcohol Countermeasure Systems Corp. (ACS), 60 International Boulevard, Toronto, ON, Canada.
| | - Laura Prest
- Alcohol Countermeasure Systems Corp. (ACS), 60 International Boulevard, Toronto, ON, Canada.
| | - Felix J E Comeau
- Alcohol Countermeasure Systems Corp. (ACS), 60 International Boulevard, Toronto, ON, Canada.
| |
Collapse
|
16
|
Zhang B, Zou G, Qin D, Lu Y, Jin Y, Wang H. A novel Encoder-Decoder model based on read-first LSTM for air pollutant prediction. Sci Total Environ 2021; 765:144507. [PMID: 33418334 DOI: 10.1016/j.scitotenv.2020.144507] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 11/26/2020] [Accepted: 12/11/2020] [Indexed: 06/12/2023]
Abstract
Accurate air pollutant prediction allows effective environment management to reduce the impact of pollution and prevent pollution incidents. Existing studies of air pollutant prediction are mostly interdisciplinary involving environmental science and computer science where the problem is formulated as time series prediction. A prevalent recent approach to time series prediction is the Encoder-Decoder model, which is based on recurrent neural networks (RNN) such as long short-term memory (LSTM), and great potential has been demonstrated. An LSTM network relies on various gate units, but in most existing studies the correlation between gate units is ignored. This correlation is important for establishing the relationship of the random variables in a time series as the stronger is this correlation, the stronger is the relationship between the random variables. In this paper we propose an improved LSTM, named Read-first LSTM or RLSTM for short, which is a more powerful temporal feature extractor than RNN, LSTM and Gated Recurrent Unit (GRU). RLSTM has some useful properties: (1) enables better store and remember capabilities in longer time series and (2) overcomes the problem of dependency between gate units. Since RLSTM is good at long term feature extraction, it is expected to perform well in time series prediction. Therefore, we use RLSTM as the Encoder and LSTM as the Decoder to build an Encoder-Decoder model (EDSModel) for pollutant prediction in this paper. Our experimental results show, for 1 to 24 h prediction, the proposed prediction model performed well with a root mean square error of 30.218. The effectiveness and superiority of RLSTM and the prediction model have been demonstrated.
Collapse
Affiliation(s)
- Bo Zhang
- College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, PR China.
| | - Guojian Zou
- The Key Laboratory of Road and Traffic Engineering, Ministry of Education, Tongji University, Shanghai 201804, PR China.
| | - Dongming Qin
- College of Electronic and Information Engineering, Tongji University, Shanghai, 201804 and now is with the 3Clear, Beijing 100029, PR China.
| | - Yunjie Lu
- College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, PR China.
| | - Yupeng Jin
- College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, PR China
| | - Hui Wang
- School of Computing, University of Ulster, UK.
| |
Collapse
|
17
|
Wang H, Peng MJ, Miao Z, Liu YK, Ayodeji A, Hao C. Remaining useful life prediction techniques for electric valves based on convolution auto encoder and long short term memory. ISA Trans 2021; 108:333-342. [PMID: 32891421 DOI: 10.1016/j.isatra.2020.08.031] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2020] [Revised: 08/20/2020] [Accepted: 08/20/2020] [Indexed: 06/11/2023]
Abstract
To optimize the operation and maintenance of nuclear power systems, this study presents a remaining useful life (RUL) prediction method for electric valves by combining convolutional auto-encoder (CAE) and long short term memory (LSTM). CAE can extract deeper features and LSTM is efficient in dealing with time-series data. Moreover, by designing a parallel structure between the outputs of CAE and the original data, features fed into the LSTM are enriched. Also, network structure and corresponding hyper-parameters are compared to obtain a more suitable model. Moreover, the accuracy of the proposed method is tested and compared with other machine learning algorithms. This work also serves as a critical innovation to enhance the safety and economic operation of nuclear plants and other complex systems.
Collapse
Affiliation(s)
- Hang Wang
- Key Subject Laboratory of Nuclear Safety and Simulation Technology, Harbin Engineering University, Harbin, 150001, China.
| | - Min-Jun Peng
- Key Subject Laboratory of Nuclear Safety and Simulation Technology, Harbin Engineering University, Harbin, 150001, China
| | - Zhuang Miao
- China Nuclear Power Engineering Co. LTD, Beijing, 100840, China
| | - Yong-Kuo Liu
- Key Subject Laboratory of Nuclear Safety and Simulation Technology, Harbin Engineering University, Harbin, 150001, China
| | - Abiodun Ayodeji
- Key Subject Laboratory of Nuclear Safety and Simulation Technology, Harbin Engineering University, Harbin, 150001, China
| | - Chengming Hao
- Nuclear Power Institute of China, Chengdu, 610213, China
| |
Collapse
|
18
|
Xiao Q, Li W, Kai Y, Chen P, Zhang J, Wang B. Occurrence prediction of pests and diseases in cotton on the basis of weather factors by long short term memory network. BMC Bioinformatics 2019; 20:688. [PMID: 31874611 PMCID: PMC6929544 DOI: 10.1186/s12859-019-3262-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Background The occurrence of cotton pests and diseases has always been an important factor affecting the total cotton production. Cotton has a great dependence on environmental factors during its growth, especially climate change. In recent years, machine learning and especially deep learning methods have been widely used in many fields and have achieved good results. Methods First, this papaer used the common Aprioro algorithm to find the association rules between weather factors and the occurrence of cotton pests. Then, in this paper, the problem of predicting the occurrence of pests and diseases is formulated as time series prediction, and an LSTM-based method was developed to solve the problem. Results The association analysis reveals that moderate temperature, humid air, low wind spreed and rain fall in autumn and winter are more likely to occur cotton pests and diseases. The discovery was then used to predict the occurrence of pests and diseases. Experimental results showed that LSTM performs well on the prediction of occurrence of pests and diseases in cotton fields, and yields the Area Under the Curve (AUC) of 0.97. Conclusion Suitable temperature, humidity, low rainfall, low wind speed, suitable sunshine time and low evaporation are more likely to cause cotton pests and diseases. Based on these associations as well as historical weather and pest records, LSTM network is a good predictor for future pest and disease occurrences. Moreover, compared to the traditional machine learning models (i.e., SVM and Random Forest), the LSTM network performs the best.
Collapse
Affiliation(s)
- Qingxin Xiao
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, 230601, China
| | - Weilu Li
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, 230601, China
| | - Yuanzhong Kai
- School of Life Sciences, Anhui University, Hefei, 230601, China
| | - Peng Chen
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, 230601, China. .,School of Electrical and Information Engineering, Anhui University of Technology, Ma'anshan, 243032, China.
| | - Jun Zhang
- School of Electrical Engineering and Automation, Anhui University, Hefei, 230601, China
| | - Bing Wang
- School of Electrical and Information Engineering, Anhui University of Technology, Ma'anshan, 243032, China.
| |
Collapse
|
19
|
Karim F, Majumdar S, Darabi H, Harford S. Multivariate LSTM-FCNs for time series classification. Neural Netw 2019; 116:237-245. [PMID: 31121421 DOI: 10.1016/j.neunet.2019.04.014] [Citation(s) in RCA: 136] [Impact Index Per Article: 27.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Revised: 04/16/2019] [Accepted: 04/17/2019] [Indexed: 11/26/2022]
Abstract
Over the past decade, multivariate time series classification has received great attention. We propose transforming the existing univariate time series classification models, the Long Short Term Memory Fully Convolutional Network (LSTM-FCN) and Attention LSTM-FCN (ALSTM-FCN), into a multivariate time series classification model by augmenting the fully convolutional block with a squeeze-and-excitation block to further improve accuracy. Our proposed models outperform most state-of-the-art models while requiring minimum preprocessing. The proposed models work efficiently on various complex multivariate time series classification tasks such as activity recognition or action recognition. Furthermore, the proposed models are highly efficient at test time and small enough to deploy on memory constrained systems.
Collapse
Affiliation(s)
- Fazle Karim
- Mechanical and Industrial Engineering, University of Illinois at Chicago, 900 W. Taylor St., Chicago, IL, 60607, USA.
| | - Somshubra Majumdar
- Computer Science, University of Illinois at Chicago, 900 W. Taylor St., Chicago, IL, 60607, USA.
| | - Houshang Darabi
- Mechanical and Industrial Engineering, University of Illinois at Chicago, 900 W. Taylor St., Chicago, IL, 60607, USA.
| | - Samuel Harford
- Mechanical and Industrial Engineering, University of Illinois at Chicago, 900 W. Taylor St., Chicago, IL, 60607, USA.
| |
Collapse
|
20
|
Cai S, Palazoglu A, Zhang L, Hu J. Process alarm prediction using deep learning and word embedding methods. ISA Trans 2019; 85:274-283. [PMID: 30401489 DOI: 10.1016/j.isatra.2018.10.032] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Revised: 09/21/2018] [Accepted: 10/19/2018] [Indexed: 06/08/2023]
Abstract
Industrial alarm systems play an essential role for the safe management of process operations. With the increase in automation and instrumentation of modern process plants, the number of alarms that the operators manage has also increased significantly. The operators are expected to make critical decisions in the presence of flooding alarms, poorly configured and maintained alarms and many nuisance alarms. In this environment, if the incoming alarms can be correctly predicted before they actually occur, the operators may have a chance to address and possibly avoid abnormal behaviors by taking corrective actions in time. Inspired by the application of deep learning in natural language processing, this paper presents an alarm prediction method based on word embedding and recurrent neural networks to predict the next alarm in a process setting. This represents both a novel approach to alarm management as well as a novel application of natural language processing and deep learning techniques to this problem. The proposed method is applied to an actual case study to demonstrate its performance.
Collapse
Affiliation(s)
- Shuang Cai
- College of Safety and Ocean Engineering, State Key Laboratory of Petroleum Resources and Prospecting, China University of Petroleum, Beijing, China; Department of Chemical Engineering, University of California, Davis, CA 95616, USA
| | - Ahmet Palazoglu
- Department of Chemical Engineering, University of California, Davis, CA 95616, USA
| | - Laibin Zhang
- College of Safety and Ocean Engineering, State Key Laboratory of Petroleum Resources and Prospecting, China University of Petroleum, Beijing, China
| | - Jinqiu Hu
- College of Safety and Ocean Engineering, State Key Laboratory of Petroleum Resources and Prospecting, China University of Petroleum, Beijing, China.
| |
Collapse
|
21
|
Lamurias A, Sousa D, Clarke LA, Couto FM. BO-LSTM: classifying relations via long short-term memory networks along biomedical ontologies. BMC Bioinformatics 2019; 20:10. [PMID: 30616557 PMCID: PMC6323831 DOI: 10.1186/s12859-018-2584-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2018] [Accepted: 12/12/2018] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Recent studies have proposed deep learning techniques, namely recurrent neural networks, to improve biomedical text mining tasks. However, these techniques rarely take advantage of existing domain-specific resources, such as ontologies. In Life and Health Sciences there is a vast and valuable set of such resources publicly available, which are continuously being updated. Biomedical ontologies are nowadays a mainstream approach to formalize existing knowledge about entities, such as genes, chemicals, phenotypes, and disorders. These resources contain supplementary information that may not be yet encoded in training data, particularly in domains with limited labeled data. RESULTS We propose a new model to detect and classify relations in text, BO-LSTM, that takes advantage of domain-specific ontologies, by representing each entity as the sequence of its ancestors in the ontology. We implemented BO-LSTM as a recurrent neural network with long short-term memory units and using open biomedical ontologies, specifically Chemical Entities of Biological Interest (ChEBI), Human Phenotype, and Gene Ontology. We assessed the performance of BO-LSTM with drug-drug interactions mentioned in a publicly available corpus from an international challenge, composed of 792 drug descriptions and 233 scientific abstracts. By using the domain-specific ontology in addition to word embeddings and WordNet, BO-LSTM improved the F1-score of both the detection and classification of drug-drug interactions, particularly in a document set with a limited number of annotations. We adapted an existing DDI extraction model with our ontology-based method, obtaining a higher F1 score than the original model. Furthermore, we developed and made available a corpus of 228 abstracts annotated with relations between genes and phenotypes, and demonstrated how BO-LSTM can be applied to other types of relations. CONCLUSIONS Our findings demonstrate that besides the high performance of current deep learning techniques, domain-specific ontologies can still be useful to mitigate the lack of labeled data.
Collapse
Affiliation(s)
- Andre Lamurias
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, Lisboa, 1749 016 Portugal
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, C8 bdg, Lisboa, 1749 016 Portugal
| | - Diana Sousa
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, Lisboa, 1749 016 Portugal
| | - Luka A. Clarke
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, C8 bdg, Lisboa, 1749 016 Portugal
| | - Francisco M. Couto
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, Lisboa, 1749 016 Portugal
| |
Collapse
|
22
|
Abstract
BACKGROUND Drug-drug interaction extraction (DDI) needs assistance from automated methods to address the explosively increasing biomedical texts. In recent years, deep neural network based models have been developed to address such needs and they have made significant progress in relation identification. METHODS We propose a dependency-based deep neural network model for DDI extraction. By introducing the dependency-based technique to a bi-directional long short term memory network (Bi-LSTM), we build three channels, namely, Linear channel, DFS channel and BFS channel. All of these channels are constructed with three network layers, including embedding layer, LSTM layer and max pooling layer from bottom up. In the embedding layer, we extract two types of features, one is distance-based feature and another is dependency-based feature. In the LSTM layer, a Bi-LSTM is instituted in each channel to better capture relation information. Then max pooling is used to get optimal features from the entire encoding sequential data. At last, we concatenate the outputs of all channels and then link it to the softmax layer for relation identification. RESULTS To the best of our knowledge, our model achieves new state-of-the-art performance with the F-score of 72.0% on the DDIExtraction 2013 corpus. Moreover, our approach obtains much higher Recall value compared to the existing methods. CONCLUSIONS The dependency-based Bi-LSTM model can learn effective relation information with less feature engineering in the task of DDI extraction. Besides, the experimental results show that our model excels at balancing the Precision and Recall values.
Collapse
Affiliation(s)
- Wei Wang
- School of Computer Science, National University of Defense Technology, Changsha, 410073 China
| | - Xi Yang
- School of Computer Science, National University of Defense Technology, Changsha, 410073 China
| | - Canqun Yang
- School of Computer Science, National University of Defense Technology, Changsha, 410073 China
| | - Xiaowei Guo
- School of Computer Science, National University of Defense Technology, Changsha, 410073 China
| | - Xiang Zhang
- School of Computer Science, National University of Defense Technology, Changsha, 410073 China
| | - Chengkun Wu
- School of Computer Science, National University of Defense Technology, Changsha, 410073 China
| |
Collapse
|