1
|
Abd Wahab NH, Hasikin K, Wee Lai K, Xia K, Bei L, Huang K, Wu X. Systematic review of predictive maintenance and digital twin technologies challenges, opportunities, and best practices. PeerJ Comput Sci 2024; 10:e1943. [PMID: 38686003 PMCID: PMC11057655 DOI: 10.7717/peerj-cs.1943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 02/27/2024] [Indexed: 05/02/2024]
Abstract
Background Maintaining machines effectively continues to be a challenge for industrial organisations, which frequently employ reactive or premeditated methods. Recent research has begun to shift its attention towards the application of Predictive Maintenance (PdM) and Digital Twins (DT) principles in order to improve maintenance processes. PdM technologies have the capacity to significantly improve profitability, safety, and sustainability in various industries. Significantly, precise equipment estimation, enabled by robust supervised learning techniques, is critical to the efficacy of PdM in conjunction with DT development. This study underscores the application of PdM and DT, exploring its transformative potential across domains demanding real-time monitoring. Specifically, it delves into emerging fields in healthcare, utilities (smart water management), and agriculture (smart farm), aligning with the latest research frontiers in these areas. Methodology Employing the Preferred Reporting Items for Systematic Review and Meta-Analyses (PRISMA) criteria, this study highlights diverse modeling techniques shaping asset lifetime evaluation within the PdM context from 34 scholarly articles. Results The study revealed four important findings: various PdM and DT modelling techniques, their diverse approaches, predictive outcomes, and implementation of maintenance management. These findings align with the ongoing exploration of emerging applications in healthcare, utilities (smart water management), and agriculture (smart farm). In addition, it sheds light on the critical functions of PdM and DT, emphasising their extraordinary ability to drive revolutionary change in dynamic industrial challenges. The results highlight these methodologies' flexibility and application across many industries, providing vital insights into their potential to revolutionise asset management and maintenance practice for real-time monitoring. Conclusions Therefore, this systematic review provides a current and essential resource for academics, practitioners, and policymakers to refine PdM strategies and expand the applicability of DT in diverse industrial sectors.
Collapse
Affiliation(s)
- Nur Haninie Abd Wahab
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
- Engineering Services Division, Ministry of Health Malaysia, Putrajaya, Malaysia
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
- Center of Intelligent Systems for Emerging Technology, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Khin Wee Lai
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Kaijian Xia
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
- Affiliated Changshu Hospital, Soochow University Changshu, Jiangsu, China
| | - Lulu Bei
- School of Information Engineering, Xuzhou University of Technology, Xuzhou, China
| | - Kai Huang
- JiangSu XCMG HanYun Technologies Co., LTD., Xuzhou, China
| | - Xiang Wu
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
- School of Medical Information & Engineering, Xuzhou Medical University, Xuzhou, China
| |
Collapse
|
2
|
Khairuddin MZF, Sankaranarayanan S, Hasikin K, Abd Razak NA, Omar R. Contextualizing injury severity from occupational accident reports using an optimized deep learning prediction model. PeerJ Comput Sci 2024; 10:e1985. [PMID: 38660193 PMCID: PMC11042013 DOI: 10.7717/peerj-cs.1985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 03/21/2024] [Indexed: 04/26/2024]
Abstract
Background This study introduced a novel approach for predicting occupational injury severity by leveraging deep learning-based text classification techniques to analyze unstructured narratives. Unlike conventional methods that rely on structured data, our approach recognizes the richness of information within injury narrative descriptions with the aim of extracting valuable insights for improved occupational injury severity assessment. Methods Natural language processing (NLP) techniques were harnessed to preprocess the occupational injury narratives obtained from the US Occupational Safety and Health Administration (OSHA) from January 2015 to June 2023. The methodology involved meticulous preprocessing of textual narratives to standardize text and eliminate noise, followed by the innovative integration of Term Frequency-Inverse Document Frequency (TF-IDF) and Global Vector (GloVe) word embeddings for effective text representation. The proposed predictive model adopts a novel Bidirectional Long Short-Term Memory (Bi-LSTM) architecture and is further refined through model optimization, including random search hyperparameters and in-depth feature importance analysis. The optimized Bi-LSTM model has been compared and validated against other machine learning classifiers which are naïve Bayes, support vector machine, random forest, decision trees, and K-nearest neighbor. Results The proposed optimized Bi-LSTM models' superior predictability, boasted an accuracy of 0.95 for hospitalization and 0.98 for amputation cases with faster model processing times. Interestingly, the feature importance analysis revealed predictive keywords related to the causal factors of occupational injuries thereby providing valuable insights to enhance model interpretability. Conclusion Our proposed optimized Bi-LSTM model offers safety and health practitioners an effective tool to empower workplace safety proactive measures, thereby contributing to business productivity and sustainability. This study lays the foundation for further exploration of predictive analytics in the occupational safety and health domain.
Collapse
Affiliation(s)
| | - Suresh Sankaranarayanan
- Department of Computer Science, College of Computer Science and Information Technology, King Faisal University, Hofuf, Kingdom of Saudi Arabia
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Kuala Lumpur, Malaysia
| | - Nasrul Anuar Abd Razak
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Kuala Lumpur, Malaysia
| | - Rosidah Omar
- Occupational and Environmental Health Unit, Kedah State Health Department, Alor Setar, Kedah, Malaysia
| |
Collapse
|
3
|
Khairuddin MZF, Hasikin K, Razak NAA, Mohshim SA, Ibrahim SS. Harnessing the Multimodal Data Integration and Deep Learning for Occupational Injury Severity Prediction. IEEE ACCESS 2023; 11:85284-85302. [DOI: 10.1109/access.2023.3304328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Affiliation(s)
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, Faculty of Engineering, University Malaya, Kuala Lumpur, Malaysia
| | - Nasrul Anuar Abd Razak
- Department of Biomedical Engineering, Faculty of Engineering, University Malaya, Kuala Lumpur, Malaysia
| | - Siti Afifah Mohshim
- Medical Engineering Technology Section, British Malaysian Institute, Universiti Kuala Lumpur, Kuala Lumpur, Selangor, Malaysia
| | - Siti Salwa Ibrahim
- Negeri Sembilan State Health Department, Ministry of Health, Seremban, Negeri Sembilan, Malaysia
| |
Collapse
|
4
|
Khairuddin MZF, Hasikin K, Abd Razak NA, Lai KW, Osman MZ, Aslan MF, Sabanci K, Azizan MM, Satapathy SC, Wu X. Predicting occupational injury causal factors using text-based analytics: A systematic review. Front Public Health 2022; 10:984099. [PMID: 36187621 PMCID: PMC9521307 DOI: 10.3389/fpubh.2022.984099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 08/23/2022] [Indexed: 01/25/2023] Open
Abstract
Workplace accidents can cause a catastrophic loss to the company including human injuries and fatalities. Occupational injury reports may provide a detailed description of how the incidents occurred. Thus, the narrative is a useful information to extract, classify and analyze occupational injury. This study provides a systematic review of text mining and Natural Language Processing (NLP) applications to extract text narratives from occupational injury reports. A systematic search was conducted through multiple databases including Scopus, PubMed, and Science Direct. Only original studies that examined the application of machine and deep learning-based Natural Language Processing models for occupational injury analysis were incorporated in this study. A total of 27, out of 210 articles were reviewed in this study by adopting the Preferred Reporting Items for Systematic Review (PRISMA). This review highlighted that various machine and deep learning-based NLP models such as K-means, Naïve Bayes, Support Vector Machine, Decision Tree, and K-Nearest Neighbors were applied to predict occupational injury. On top of these models, deep neural networks are also included in classifying the type of accidents and identifying the causal factors. However, there is a paucity in using the deep learning models in extracting the occupational injury reports. This is due to these techniques are pretty much very recent and making inroads into decision-making in occupational safety and health as a whole. Despite that, this paper believed that there is a huge and promising potential to explore the application of NLP and text-based analytics in this occupational injury research field. Therefore, the improvement of data balancing techniques and the development of an automated decision-making support system for occupational injury by applying the deep learning-based NLP models are the recommendations given for future research.
Collapse
Affiliation(s)
- Mohamed Zul Fadhli Khairuddin
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia,Institute of Medical Science Technology, Universiti Kuala Lumpur, Selangor, Malaysia
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia,Centre of Intelligent Systems for Emerging Technology (CISET), Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia,*Correspondence: Khairunnisa Hasikin
| | - Nasrul Anuar Abd Razak
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Khin Wee Lai
- Department of Biomedical Engineering, Faculty of Engineering, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Mohd Zamri Osman
- Faculty of Computing, College of Computing and Applied Science, Universiti Malaysia Pahang, Gambang, Malaysia
| | - Muhammet Fatih Aslan
- Department of Electrical and Electronics Engineering, Karamanoglu Mehmetbey University, Karaman, Turkey
| | - Kadir Sabanci
- Department of Electrical and Electronics Engineering, Karamanoglu Mehmetbey University, Karaman, Turkey
| | - Muhammad Mokhzaini Azizan
- Department of Electrical and Electronic Engineering, Faculty of Engineering and Built Environment, Universiti Sains Islam Malaysia, Nilai, Negeri Sembilan, Malaysia
| | - Suresh Chandra Satapathy
- School of Computer Engineering, Kalinga Institute of Industrial Technology, Deemed to Be University, Bhubaneswar, India
| | - Xiang Wu
- School of Medical Information and Engineering, Xuzhou Medical University Xuzhou, Xuzhou, Jiangsu, China,Xiang Wu
| |
Collapse
|
5
|
Towards Automated Construction Quantity Take-Off: An Integrated Approach to Information Extraction from Work Descriptions. BUILDINGS 2022. [DOI: 10.3390/buildings12030354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Construction-oriented quantity take-off (QTO) refers to the process of determining the quantities for construction items or work packages in accordance with their descriptions. However, the current construction-oriented QTO practice relies on estimators’ manual interpretation of work descriptions and manual processes to look up proper building objects for quantity calculation. Hence, this research aims to develop natural language processing (NLP) and rule-based algorithms to automate the information extraction (IE) from work descriptions for QTO in building construction. Specifically, several named entity recognition (NER) models, including Hidden Markov Model (HMM), Conditional Random Field (CRF), Bidirectional-Long Short-Term Memory (Bi-LSTM), and Bi-LSTM+CRF, were developed to identify construction activities, material, building component, product features, measurement unit, and additional information (e.g., work scope) from work descriptions. Cost items in the RSMeans database are used to evaluate the developed models in terms of F1 scores. HMM was found to achieve a 5% higher F1 score in the NER than the other three algorithms. Then, labeling rules and active learning strategies were applied along with the HMM model, which improved F1 score by 3% and reduced the labeling efforts by 26%. The results showed that the proposed IE method successfully interprets the desired information from the work description for QTO. This research contributed to the body of knowledge by the NLP-based information extraction model integrating HMM and formalized labeling rules that automatically process work descriptions and lay a foundation for automated QTO and cost estimation.
Collapse
|
6
|
Ayat M, Kim B, Kang CW. A new data mining-based framework to predict the success of private participation in infrastructure projects. INTERNATIONAL JOURNAL OF CONSTRUCTION MANAGEMENT 2022. [DOI: 10.1080/15623599.2022.2045862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Muhammad Ayat
- Department of Industrial and Management Engineering, Hanyang University ERICA, Ansan-si, South Korea
| | - Byunghoon Kim
- Department of Industrial and Management Engineering, Hanyang University ERICA, Ansan-si, South Korea
| | - Chang Wook Kang
- Department of Industrial and Management Engineering, Hanyang University ERICA, Ansan-si, South Korea
| |
Collapse
|
7
|
Relational Graph Convolutional Network for Text-Mining-Based Accident Causal Classification. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12052482] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Accident investigation reports are text documents that systematically review and analyze the cause and process of accidents after accidents have occurred and have been widely used in the fields such as transportation, construction and aerospace. With the aid of accident investigation reports, the cause of the accident can be clearly identified, which provides an important basis for accident prevention and reliability assessment. However, since accident record reports are mostly composed of unstructured data such as text, the analysis of accident causes inevitably relies on a lot of expert experience and statistical analyses also require a lot of manual classification. Although, in recent years, with the development of natural language processing technology, there have been many efforts to automatically analyze and classify text. However, the existing methods either rely on large corpus and data preprocessing methods, which are cumbersome, or extract text information based on bidirectional encoder representation from transformers (BERT), but the computational cost is extremely high. These shortcomings make it still a great challenge to automatically analyze accident investigation reports and extract the information therein. To address the aforementioned problems, this study proposes a text-mining-based accident causal classification method based on a relational graph convolutional network (R-GCN) and pre-trained BERT. On the one hand, the proposed method avoids preprocessing such as stop word removal and word segmentation, which not only preserves the information of accident investigation reports to the greatest extent, but also avoids tedious operations. On the other hand, with the help of R-GCN to process the semantic features obtained by BERT representation, the dependence of BERT retraining on computing resources can be avoided.
Collapse
|
8
|
Abstract
Online news outlets have the power to influence public policy issues. To understand the opinions of the people, many government departments check online news outlets to manually detect events that interest people. This process is time-consuming. To promptly respond to public expectations, this research proposes a framework for detecting news events that may interest government departments. This article proposes a method for finding event trigger words used to represent an event. The news media can be a critical participant in ‘agenda-setting’, which means that more widely discussed news is more attractive and critical than news that is less discussed. However, few studies have considered the influence of news media publishers from the ‘agenda setting’ perspective. Therefore, this study proposes an ‘agenda setting’-based filter to establish a high-impact news event detection model. The proposed framework identifies trigger words and utilises word embedding to find news event–related words. After that, an event detection model is designed to determine the events that are attractive to government departments. The experimental results show that purity increases from 0.666 when no extraction method is used to 0.809 when the extraction method in this study is used. The overall improvement trend shows significant improvement in event detection performance.
Collapse
Affiliation(s)
- Chun Chieh Chen
- Institute of Information Management, National Cheng Kung University, Taiwan Department of Information Management, National Development Council, Taiwan
| | - Hei-Chia Wang
- Institute of Information Management, National Cheng Kung University, TaiwanCenter of Innovative Fintech Business Models, National Cheng Kung University, Tainan 701, Taiwan
| |
Collapse
|
9
|
MenuNER: Domain-Adapted BERT Based NER Approach for a Domain with Limited Dataset and Its Application to Food Menu Domain. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11136007] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Entity-based information extraction is one of the main applications of Natural Language Processing (NLP). Recently, deep transfer-learning utilizing contextualized word embedding from pre-trained language models has shown remarkable results for many NLP tasks, including Named-entity recognition (NER). BERT (Bidirectional Encoder Representations from Transformers) is gaining prominent attention among various contextualized word embedding models as a state-of-the-art pre-trained language model. It is quite expensive to train a BERT model from scratch for a new application domain since it needs a huge dataset and enormous computing time. In this paper, we focus on menu entity extraction from online user reviews for the restaurant and propose a simple but effective approach for NER task on a new domain where a large dataset is rarely available or difficult to prepare, such as food menu domain, based on domain adaptation technique for word embedding and fine-tuning the popular NER task network model ‘Bi-LSTM+CRF’ with extended feature vectors. The proposed NER approach (named as ‘MenuNER’) consists of two step-processes: (1) Domain adaptation for target domain; further pre-training of the off-the-shelf BERT language model (BERT-base) in semi-supervised fashion on a domain-specific dataset, and (2) Supervised fine-tuning the popular Bi-LSTM+CRF network for downstream task with extended feature vectors obtained by concatenating word embedding from the domain-adapted pre-trained BERT model from the first step, character embedding and POS tag feature information. Experimental results on handcrafted food menu corpus from customers’ review dataset show that our proposed approach for domain-specific NER task, that is: food menu named-entity recognition, performs significantly better than the one based on the baseline off-the-shelf BERT-base model. The proposed approach achieves 92.5% F1 score on the YELP dataset for the MenuNER task.
Collapse
|