1
|
Zhang T, Rabhi F, Chen X, Paik HY, MacIntyre CR. A machine learning-based universal outbreak risk prediction tool. Comput Biol Med 2024; 169:107876. [PMID: 38176209 DOI: 10.1016/j.compbiomed.2023.107876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 12/12/2023] [Accepted: 12/18/2023] [Indexed: 01/06/2024]
Abstract
In order to prevent and control the increasing number of serious epidemics, the ability to predict the risk caused by emerging outbreaks is essential. However, most current risk prediction tools, except EPIRISK, are limited by being designed for targeting only one specific disease and one country. Differences between countries and diseases (e.g., different economic conditions, different modes of transmission, etc.) pose challenges for building models with cross-country and cross-disease prediction capabilities. The limitation of universality affects domestic and international efforts to control and prevent pandemic outbreaks. To address this problem, we used outbreak data from 43 diseases in 206 countries to develop a universal risk prediction system that can be used across countries and diseases. This system used five machine learning models (including Neural Network XGBoost, Logistic Boost, Random Forest and Kernel SVM) to predict and vote together to make ensemble predictions. It can make predictions with around 80%-90 % accuracy from economic, cultural, social, and epidemiological factors. Three different datasets were designed to test the performance of ML models under different realistic situations. This prediction system has strong predictive ability, adaptability, and generality. It can give universal outbreak risk assessment that are not limited by border or disease type, facilitate rapid response to pandemic outbreaks, government decision-making and international cooperation.
Collapse
Affiliation(s)
- Tianyu Zhang
- FinanceIT Research Group, University of New South Wales, Sydney, NSW, Australia.
| | - Fethi Rabhi
- FinanceIT Research Group, University of New South Wales, Sydney, NSW, Australia
| | - Xin Chen
- Biosecurity Program, The Kirby Institute, University of New South Wales, Sydney, NSW, 2052, Australia
| | - Hye-Young Paik
- School of Computer Science and Engineering, Faulty of Engineering, University of New South Wales, Sydney, NSW, 2052, Australia
| | - Chandini Raina MacIntyre
- Biosecurity Program, The Kirby Institute, University of New South Wales, Sydney, NSW, 2052, Australia; College of Public Service & Community Solutions, Arizona State University, Tempe, AZ, 85004, United States
| |
Collapse
|
2
|
Gao Q, Wang S, Wang Q, Cao G, Fang C, Zhan B. Epidemiological characteristics and prediction model construction of hemorrhagic fever with renal syndrome in Quzhou City, China, 2005-2022. Front Public Health 2024; 11:1333178. [PMID: 38274546 PMCID: PMC10808376 DOI: 10.3389/fpubh.2023.1333178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Accepted: 12/29/2023] [Indexed: 01/27/2024] Open
Abstract
Background Hemorrhagic fever with renal syndrome (HFRS) is one of the 10 major infectious diseases that jeopardize human health and is distributed in more than 30 countries around the world. China is the country with the highest number of reported HFRS cases worldwide, accounting for 90% of global cases. The incidence level of HFRS in Quzhou is at the forefront of Zhejiang Province, and there is no specific treatment for it yet. Therefore, it is crucial to grasp the epidemiological characteristics of HFRS in Quzhou and establish a prediction model for HFRS to lay the foundation for early warning of HFRS. Methods Descriptive epidemiological methods were used to analyze the epidemic characteristics of HFRS, the incidence map was drawn by ArcGIS software, the Seasonal AutoRegressive Integrated Moving Average (SARIMA) and Prophet model were established by R software. Then, root mean square error (RMSE) and mean absolute error (MAE) were used to evaluate the fitting and prediction performances of the model. Results A total of 843 HFRS cases were reported in Quzhou City from 2005 to 2022, with the highest annual incidence rate in 2007 (3.93/100,000) and the lowest in 2022 (1.05/100,000) (P trend<0.001). The incidence is distributed in a seasonal double-peak distribution, with the first peak from October to January and the second peak from May to July. The incidence rate in males (2.87/100,000) was significantly higher than in females (1.32/100,000). Farmers had the highest number of cases, accounting for 79.95% of the total number of cases. The incidence is high in the northwest of Quzhou City, with cases concentrated on cultivated land and artificial land. The RMSE and MAE values of the Prophet model are smaller than those of the SARIMA (1,0,1) (2,1,0)12 model. Conclusion From 2005 to 2022, the incidence of HFRS in Quzhou City showed an overall downward trend, but the epidemic in high-incidence areas was still serious. In the future, the dynamics of HFRS outbreaks and host animal surveillance should be continuously strengthened in combination with the Prophet model. During the peak season, HFRS vaccination and health education are promoted with farmers as the key groups.
Collapse
Affiliation(s)
- Qing Gao
- School of Public Health, Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China
| | - Shuangqing Wang
- Quzhou Center for Disease Control and Prevention, Quzhou, Zhejiang, China
| | - Qi Wang
- School of Public Health, Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China
| | - Guoping Cao
- Quzhou Center for Disease Control and Prevention, Quzhou, Zhejiang, China
| | - Chunfu Fang
- Quzhou Center for Disease Control and Prevention, Quzhou, Zhejiang, China
| | - Bingdong Zhan
- School of Public Health, Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China
- Quzhou Center for Disease Control and Prevention, Quzhou, Zhejiang, China
| |
Collapse
|
3
|
Impact of Tree Cover Loss on Carbon Emission: A Learning-Based Analysis. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2023; 2023:8585839. [PMID: 36909970 PMCID: PMC9995202 DOI: 10.1155/2023/8585839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 08/09/2022] [Accepted: 08/11/2022] [Indexed: 03/05/2023]
Abstract
Describing the processes leading to deforestation is essential for the development and implementation of the forest policies. In this work, two different learning models were developed in order to identify the best possible model for the assessment of the deforestation causes and trends. We developed autoregressive integrated moving average (ARIMA) model and long short-term memory (LSTM) independently in order to see the trend between tree cover loss and carbon dioxide emission. This study includes the twenty-year data of Pakistan on tree cover loss and carbon emission from the Global Forest Watch (GFW) platform, a known platform to get numerical data. Minimum mean absolute error (MAE) for the prediction of tree cover loss and carbon emission obtained through ARIMA model is 0.89 and 0.95, respectively. The minimum MAE given by LSTM model is 0.33 and 0.43, respectively. There is no such kind of study conducted in order to identify the increase in carbon emission due to tree cover loss most specifically in Pakistan. The results endorsed that one of the main causes of increase in the pollution in the environment in terms of carbon emission is due to tree cover loss.
Collapse
|
4
|
Lou HR, Wang X, Gao Y, Zeng Q. Comparison of ARIMA model, DNN model and LSTM model in predicting disease burden of occupational pneumoconiosis in Tianjin, China. BMC Public Health 2022; 22:2167. [PMID: 36434563 PMCID: PMC9694549 DOI: 10.1186/s12889-022-14642-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 11/16/2022] [Indexed: 11/27/2022] Open
Abstract
BACKGROUND This study aims to explore appropriate model for predicting the disease burden of pneumoconiosis in Tianjin by comparing the prediction effects of Autoregressive Integrated Moving Average (ARIMA) model, Deep Neural Networks (DNN) model and multivariate Long Short-Term Memory Neural Network (LSTM) models. METHODS Disability adjusted life year (DALY) was used to evaluate the disease burden of occupational pneumoconiosis. ARIMA model, DNN model and multivariate LSTM model were used to establish prediction model. Three performance evaluation metrics including Root Mean Squared Error (RMSE), Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) were used to compare the prediction effects of the three models. RESULTS From 1990 to 2021, there were 10,694 cases of pneumoconiosis patients in Tianjin, resulting in a total of 112,725.52 person-years of DALY. During this period, the annual DALY showed a fluctuating trend, but it had a strong correlation with the number of pneumoconiosis patients, the average age of onset, the average age of receiving dust and the gross industrial product, and had a significant nonlinear relationship with them. The comparison of prediction results showed that the performance of multivariate LSTM model and DNN model is much better than that of traditional ARIMA model. Compared with the DNN model, the multivariate LSTM model performed better in the training set, showing lower RMES (42.30 vs. 380.96), MAE (29.53 vs. 231.20) and MAPE (1.63% vs. 2.93%), but performed less stable than the DNN on the test set, showing slightly higher RMSE (1309.14 vs. 656.44), MAE (886.98 vs. 594.47) and MAPE (36.86% vs. 22.43%). CONCLUSION The machine learning techniques of DNN and LSTM are an innovative method to accurately and efficiently predict the burden of pneumoconiosis with the simplest data. It has great application prospects in the monitoring and early warning system of occupational disease burden.
Collapse
Affiliation(s)
- He-Ren Lou
- grid.464467.3Tianjin Center for Disease Control and Prevention, Tianjin, 300011 China ,grid.265021.20000 0000 9792 1228School of Public Health, Tianjin Medical University, Tianjin, 300070 China
| | - Xin Wang
- grid.464467.3Tianjin Center for Disease Control and Prevention, Tianjin, 300011 China
| | - Ya Gao
- grid.464467.3Tianjin Center for Disease Control and Prevention, Tianjin, 300011 China
| | - Qiang Zeng
- grid.464467.3Tianjin Center for Disease Control and Prevention, Tianjin, 300011 China
| |
Collapse
|
5
|
Prediction of global omicron pandemic using ARIMA, MLR, and Prophet models. Sci Rep 2022; 12:18138. [PMID: 36307471 PMCID: PMC9614203 DOI: 10.1038/s41598-022-23154-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Accepted: 10/25/2022] [Indexed: 12/30/2022] Open
Abstract
Globally, since the outbreak of the Omicron variant in November 2021, the number of confirmed cases of COVID-19 has continued to increase, posing a tremendous challenge to the prevention and control of this infectious disease in many countries. The global daily confirmed cases of COVID-19 between November 1, 2021, and February 17, 2022, were used as a database for modeling, and the ARIMA, MLR, and Prophet models were developed and compared. The prediction performance was evaluated using mean absolute error (MAE), mean absolute percentage error (MAPE), and root mean square error (RMSE). The study showed that ARIMA (7, 1, 0) was the optimum model, and the MAE, MAPE, and RMSE values were lower than those of the MLR and Prophet models in terms of fitting performance and forecasting performance. The ARIMA model had superior prediction performance compared to the MLR and Prophet models. In real-world research, an appropriate prediction model should be selected based on the characteristics of the data and the sample size, which is essential for obtaining more accurate predictions of infectious disease incidence.
Collapse
|
6
|
Qu S, Zhou M, Jiao S, Zhang Z, Xue K, Long J, Zha F, Chen Y, Li J, Yang Q, Wang Y. Optimizing acute stroke outcome prediction models: Comparison of generalized regression neural networks and logistic regressions. PLoS One 2022; 17:e0267747. [PMID: 35544482 PMCID: PMC9094516 DOI: 10.1371/journal.pone.0267747] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 04/16/2022] [Indexed: 11/18/2022] Open
Abstract
Background Generalized regression neural network (GRNN) and logistic regression (LR) are extensively used in the medical field; however, the better model for predicting stroke outcome has not been established. The primary goal of this study was to compare the accuracies of GRNN and LR models to identify the most optimal model for the prediction of acute stroke outcome, as well as explore useful biomarkers for predicting the prognosis of acute stroke patients. Method In a single-center study, 216 (80% for the training set and 20% for the test set) acute stroke patients admitted to the Shenzhen Second People’s Hospital between December 2019 to June 2021 were retrospectively recruited. The functional outcomes of the patients were measured using Barthel Index (BI) on discharge. A training set was used to optimize the GRNN and LR models. The test set was utilized to validate and compare the performances of GRNN and LR in predicting acute stroke outcome based on the area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, and the Kappa value. Result The LR analysis showed that age, the National Institute Health Stroke Scale score, BI index, hemoglobin, and albumin were independently associated with stroke outcome. After validating in test set using these variables, we found that the GRNN model showed a better performance based on AUROC (0.931 vs 0.702), sensitivity (0.933 vs 0.700), specificity (0.889 vs 0.722), accuracy (0.896 vs 0.729), and the Kappa value (0.775 vs 0.416) than the LR model. Conclusion Overall, the GRNN model demonstrated superior performance to the LR model in predicting the prognosis of acute stroke patients. In addition to its advantage in not affected by implicit interactions and complex relationship in the data. Thus, we suggested that GRNN could be served as the optimal statistical model for acute stroke outcome prediction. Simultaneously, prospective validation based on more variables of the GRNN model for the prediction is required in future studies.
Collapse
Affiliation(s)
- Sheng Qu
- Department of Rehabilitation, Shenzhen Second People’s Hospital, The First Affiliated Hospital of Shenzhen University Health Science Centre, Shenzhen, China
| | - Mingchao Zhou
- Department of Rehabilitation, Shenzhen Second People’s Hospital, The First Affiliated Hospital of Shenzhen University Health Science Centre, Shenzhen, China
| | - Shengxiu Jiao
- Department of Radiation Oncology, Shandong Cancer Hospital and Institute, Shandong First Medical University and Shandong Academy of Medical Sciences, Shandong, China
| | - Zeyu Zhang
- School of Rehabilitation Sciences, The Shandong University of Traditional Chinese Medicine, Shandong, China
| | - Kaiwen Xue
- School of Rehabilitation Sciences, The Shandong University of Traditional Chinese Medicine, Shandong, China
| | - Jianjun Long
- Department of Rehabilitation, Shenzhen Second People’s Hospital, The First Affiliated Hospital of Shenzhen University Health Science Centre, Shenzhen, China
- School of Rehabilitation Sciences, The Shandong University of Traditional Chinese Medicine, Shandong, China
| | - Fubing Zha
- Department of Rehabilitation, Shenzhen Second People’s Hospital, The First Affiliated Hospital of Shenzhen University Health Science Centre, Shenzhen, China
| | - Yuan Chen
- Department of Rehabilitation, Shenzhen Second People’s Hospital, The First Affiliated Hospital of Shenzhen University Health Science Centre, Shenzhen, China
| | - Jiehui Li
- School of Rehabilitation Sciences, The Shandong University of Traditional Chinese Medicine, Shandong, China
| | - Qingqing Yang
- School of Rehabilitation Sciences, The Shandong University of Traditional Chinese Medicine, Shandong, China
| | - Yulong Wang
- Department of Rehabilitation, Shenzhen Second People’s Hospital, The First Affiliated Hospital of Shenzhen University Health Science Centre, Shenzhen, China
- School of Rehabilitation Sciences, The Shandong University of Traditional Chinese Medicine, Shandong, China
- * E-mail:
| |
Collapse
|
7
|
Zhang R, Song H, Chen Q, Wang Y, Wang S, Li Y. Comparison of ARIMA and LSTM for prediction of hemorrhagic fever at different time scales in China. PLoS One 2022; 17:e0262009. [PMID: 35030203 PMCID: PMC8759700 DOI: 10.1371/journal.pone.0262009] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 12/15/2021] [Indexed: 11/29/2022] Open
Abstract
Objectives This study intends to build and compare two kinds of forecasting models at different time scales for hemorrhagic fever incidence in China. Methods Autoregressive Integrated Moving Average (ARIMA) and Long Short-Term Memory Neural Network (LSTM) were adopted to fit monthly, weekly and daily incidence of hemorrhagic fever in China from 2013 to 2018. The two models, combined and uncombined with rolling forecasts, were used to predict the incidence in 2019 to examine their stability and applicability. Results ARIMA (2, 1, 1) (0, 1, 1)12, ARIMA (1, 1, 3) (1, 1, 1)52 and ARIMA (5, 0, 1) were selected as the best fitting ARIMA model for monthly, weekly and daily incidence series, respectively. The LSTM model with 64 neurons and Stochastic Gradient Descent (SGDM) for monthly incidence, 8 neurons and Adaptive Moment Estimation (Adam) for weekly incidence, and 64 neurons and Root Mean Square Prop (RMSprop) for daily incidence were selected as the best fitting LSTM models. The values of root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) of the models combined with rolling forecasts in 2019 were lower than those of the direct forecasting models for both ARIMA and LSTM. It was shown from the forecasting performance in 2019 that ARIMA was better than LSTM for monthly and weekly forecasting while the LSTM was better than ARIMA for daily forecasting in rolling forecasting models. Conclusions Both ARIMA and LSTM could be used to build a prediction model for the incidence of hemorrhagic fever. Different models might be more suitable for the incidence prediction at different time scales. The findings can provide a good reference for future selection of prediction models and establishments of early warning systems for hemorrhagic fever.
Collapse
Affiliation(s)
- Rui Zhang
- Chinese Center for Disease Control and Prevention, Beijing, China
| | - Hejia Song
- National Institute of Environmental Health, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Qiulan Chen
- Chinese Center for Disease Control and Prevention, Beijing, China
| | - Yu Wang
- National Institute of Environmental Health, Chinese Center for Disease Control and Prevention, Beijing, China
| | - Songwang Wang
- Chinese Center for Disease Control and Prevention, Beijing, China
- * E-mail: (SW); (YL)
| | - Yonghong Li
- National Institute of Environmental Health, Chinese Center for Disease Control and Prevention, Beijing, China
- * E-mail: (SW); (YL)
| |
Collapse
|
8
|
Comparison of ARIMA, ES, GRNN and ARIMA–GRNN hybrid models to forecast the second wave of COVID-19 in India and the United States. Epidemiol Infect 2021. [PMCID: PMC8632421 DOI: 10.1017/s0950268821002375] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
As acute infectious pneumonia, the coronavirus disease-2019 (COVID-19) has created unique challenges for each nation and region. Both India and the United States (US) have experienced a second outbreak, resulting in a severe disease burden. The study aimed to develop optimal models to predict the daily new cases, in order to help to develop public health strategies. The autoregressive integrated moving average (ARIMA) models, generalised regression neural network (GRNN) models, ARIMA–GRNN hybrid model and exponential smoothing (ES) model were used to fit the daily new cases. The performances were evaluated by minimum mean absolute per cent error (MAPE). The predictive value with ARIMA (3, 1, 3) (1, 1, 1)14 model was closest to the actual value in India, while the ARIMA–GRNN presented a better performance in the US. According to the models, the number of daily new COVID-19 cases in India continued to decrease after 27 May 2021. In conclusion, the ARIMA model presented to be the best-fit model in forecasting daily COVID-19 new cases in India, and the ARIMA–GRNN hybrid model had the best prediction performance in the US. The appropriate model should be selected for different regions in predicting daily new cases. The results can shed light on understanding the trends of the outbreak and giving ideas of the epidemiological stage of these regions.
Collapse
|
9
|
Wang Y, Xu C, Yao S, Wang L, Zhao Y, Ren J, Li Y. Estimating the COVID-19 prevalence and mortality using a novel data-driven hybrid model based on ensemble empirical mode decomposition. Sci Rep 2021; 11:21413. [PMID: 34725416 PMCID: PMC8560776 DOI: 10.1038/s41598-021-00948-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 10/20/2021] [Indexed: 12/23/2022] Open
Abstract
In this study, we proposed a new data-driven hybrid technique by integrating an ensemble empirical mode decomposition (EEMD), an autoregressive integrated moving average (ARIMA), with a nonlinear autoregressive artificial neural network (NARANN), called the EEMD-ARIMA-NARANN model, to perform time series modeling and forecasting based on the COVID-19 prevalence and mortality data from 28 February 2020 to 27 June 2020 in South Africa and Nigeria. By comparing the accuracy level of forecasting measurements with the basic ARIMA and NARANN models, it was shown that this novel data-driven hybrid model did a better job of capturing the dynamic changing trends of the target data than the others used in this work. Our proposed mixture technique can be deemed as a helpful policy-supportive tool to plan and provide medical supplies effectively. The overall confirmed cases and deaths were estimated to reach around 176,570 [95% uncertainty level (UL) 173,607 to 178,476] and 3454 (95% UL 3384 to 3487), respectively, in South Africa, along with 32,136 (95% UL 31,568 to 32,641) and 788 (95% UL 775 to 804) in Nigeria on 12 July 2020 using this data-driven EEMD-ARIMA-NARANN hybrid technique. The contributions of this study include three aspects. First, the proposed hybrid model can better capture the dynamic dependency characteristics compared with the individual models. Second, this new data-driven hybrid model is constructed in a more reasonable way relative to the traditional mixture model. Third, this proposed model may be generalized to estimate the epidemic patterns of COVID-19 in other regions.
Collapse
Affiliation(s)
- Yongbin Wang
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, No. 601 Jinsui Road, Hongqi District, Xinxiang City, 453003, Henan Province, People's Republic of China.
| | - Chunjie Xu
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, People's Republic of China
| | - Sanqiao Yao
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, No. 601 Jinsui Road, Hongqi District, Xinxiang City, 453003, Henan Province, People's Republic of China
| | - Lei Wang
- Center for Musculoskeletal Surgery, Charité-Universitätsmedizin Berlin, Freie Universität Berlin, Humboldt-Universität zu Berlin and Berlin Institute of Health, Berlin, Germany
| | - Yingzheng Zhao
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, No. 601 Jinsui Road, Hongqi District, Xinxiang City, 453003, Henan Province, People's Republic of China
| | - Jingchao Ren
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, No. 601 Jinsui Road, Hongqi District, Xinxiang City, 453003, Henan Province, People's Republic of China
| | - Yuchun Li
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, No. 601 Jinsui Road, Hongqi District, Xinxiang City, 453003, Henan Province, People's Republic of China
| |
Collapse
|
10
|
Hybrid Machine Learning Models for Forecasting Surgical Case Volumes at a Hospital. AI 2021. [DOI: 10.3390/ai2040032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Recent developments in machine learning and deep learning have led to the use of multiple algorithms to make better predictions. Surgical units in hospitals allocate their resources for day surgeries based on the number of elective patients, which is mostly disrupted by emergency surgeries. Sixteen different models were constructed for this comparative study, including four simple and twelve hybrid models for predicting the demand for endocrinology, gastroenterology, vascular, urology, and pediatric surgical units. The four simple models used were seasonal autoregressive integrated moving average (SARIMA), support vector regression (SVR), multilayer perceptron (MLP), and long short-term memory (LSTM). The twelve hybrid models used were a combination of any two of the above-mentioned simple models, namely, SARIMA–SVR, SVR–SARIMA, SARIMA–MLP, MLP–SARIMA, SARIMA–LSTM, LSTM–SARIMA, SVR–MLP, MLP–SVR, SVR–LSTM, LSTM–SVR, MLP–LSTM, and LSTM–MLP. Data from the period 2012–2018 were used to build and test the models for each surgical unit. The results indicated that, in some cases, the simple LSTM model outperformed the others while, in other cases, there was a need for hybrid models. This shows that surgical units are unique in nature and need separate models for predicting their corresponding surgical volumes.
Collapse
|
11
|
Lv CX, An SY, Qiao BJ, Wu W. Time series analysis of hemorrhagic fever with renal syndrome in mainland China by using an XGBoost forecasting model. BMC Infect Dis 2021; 21:839. [PMID: 34412581 PMCID: PMC8377883 DOI: 10.1186/s12879-021-06503-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 07/30/2021] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND Hemorrhagic fever with renal syndrome (HFRS) is still attracting public attention because of its outbreak in various cities in China. Predicting future outbreaks or epidemics disease based on past incidence data can help health departments take targeted measures to prevent diseases in advance. In this study, we propose a multistep prediction strategy based on extreme gradient boosting (XGBoost) for HFRS as an extension of the one-step prediction model. Moreover, the fitting and prediction accuracy of the XGBoost model will be compared with the autoregressive integrated moving average (ARIMA) model by different evaluation indicators. METHODS We collected HFRS incidence data from 2004 to 2018 of mainland China. The data from 2004 to 2017 were divided into training sets to establish the seasonal ARIMA model and XGBoost model, while the 2018 data were used to test the prediction performance. In the multistep XGBoost forecasting model, one-hot encoding was used to handle seasonal features. Furthermore, a series of evaluation indices were performed to evaluate the accuracy of the multistep forecast XGBoost model. RESULTS There were 200,237 HFRS cases in China from 2004 to 2018. A long-term downward trend and bimodal seasonality were identified in the original time series. According to the minimum corrected akaike information criterion (CAIC) value, the optimal ARIMA (3, 1, 0) × (1, 1, 0)12 model is selected. The index ME, RMSE, MAE, MPE, MAPE, and MASE indices of the XGBoost model were higher than those of the ARIMA model in the fitting part, whereas the RMSE of the XGBoost model was lower. The prediction performance evaluation indicators (MAE, MPE, MAPE, RMSE and MASE) of the one-step prediction and multistep prediction XGBoost model were all notably lower than those of the ARIMA model. CONCLUSIONS The multistep XGBoost prediction model showed a much better prediction accuracy and model stability than the multistep ARIMA prediction model. The XGBoost model performed better in predicting complicated and nonlinear data like HFRS. Additionally, Multistep prediction models are more practical than one-step prediction models in forecasting infectious diseases.
Collapse
Affiliation(s)
- Cai-Xia Lv
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning China
| | - Shu-Yi An
- Liaoning Provincial Center for Disease Control and Prevention, Shenyang, Liaoning China
| | - Bao-Jun Qiao
- Liaoning Provincial Center for Disease Control and Prevention, Shenyang, Liaoning China
| | - Wei Wu
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning China
| |
Collapse
|
12
|
Epidemiology of hemorrhagic fever with renal syndrome in Tai'an area. Sci Rep 2021; 11:11596. [PMID: 34226582 PMCID: PMC8257732 DOI: 10.1038/s41598-021-91029-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 05/07/2021] [Indexed: 11/08/2022] Open
Abstract
Hemorrhagic fever with renal syndrome (HFRS), a serious threat to human health, is mainly transmitted by rodents in Eurasia. The risk of disease differs according to sex, age, and occupation. Further, temperature and rainfall have some lagging effects on the occurrence of the disease. The quantitative data for these factors in the Tai'an region of China are still unknown. We used a forest map to calculate the risk of HFRS in different populations and used four different mathematical models to explain the relationship between time factors, meteorological factors, and the disease. The results showed that compared with the whole population, the relative risk in rural medical staff and farmers was 5.05 and 2.00, respectively (p < 0.05). Joinpoint models showed that the number of cases decreased by 33.32% per year from 2005 to 2008 (p < 0.05). The generalized additive model showed that air temperature was positively correlated with disease risk from January to June, and that relative humidity was negatively correlated with risk from July to December. From January to June, with an increase in temperature, after 15 lags, the cumulative risk of disease increased at low temperatures. From July to December, the cumulative risk decreased with an increase in the relative humidity. Rural medical staff, farmers, men, and middle-aged individuals were at a high risk of HFRS. Moreover, air temperature and relative humidity are important factors that affect disease occurrence. These associations show lagged and differing effects according to the season.
Collapse
|
13
|
Alfred R, Obit JH. The roles of machine learning methods in limiting the spread of deadly diseases: A systematic review. Heliyon 2021; 7:e07371. [PMID: 34179541 PMCID: PMC8219638 DOI: 10.1016/j.heliyon.2021.e07371] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 10/20/2020] [Accepted: 06/17/2021] [Indexed: 12/03/2022] Open
Abstract
Machine learning (ML) methods can be leveraged to prevent the spread of deadly infectious disease outbreak (e.g., COVID-19). This can be done by applying machine learning methods in predicting and detecting the deadly infectious disease. Most reviews did not discuss about the machine learning algorithms, datasets and performance measurements used for various applications in predicting and detecting the deadly infectious disease. In contrast, this paper outlines the literature review based on two major ways (e.g., prediction, detection) to limit the spread of deadly disease outbreaks. Hence, this study aims to investigate the state of the art, challenges and future works of leveraging ML methods to detect and predict deadly disease outbreaks according to two categories mentioned earlier. Specifically, this study provides a review on various approaches (e.g., individual and ensemble models), types of datasets, parameters or variables and performance measures used in the previous works. The literature review included all articles from journals and conference proceedings published from 2010 through 2020 in Scopus indexed databases using the search terms Predicting Disease Outbreaks and/or Detecting Disease using Machine Learning. The findings from this review focus on commonly used machine learning approaches, challenges and future works to limit the spread of deadly disease outbreaks through preventions and detections.
Collapse
Affiliation(s)
- Rayner Alfred
- Corresponding author. http://www.machineintelligencespace.com
| | | |
Collapse
|
14
|
Li J, Li Y, Ye M, Yao S, Yu C, Wang L, Wu W, Wang Y. Forecasting the Tuberculosis Incidence Using a Novel Ensemble Empirical Mode Decomposition-Based Data-Driven Hybrid Model in Tibet, China. Infect Drug Resist 2021; 14:1941-1955. [PMID: 34079304 PMCID: PMC8164697 DOI: 10.2147/idr.s299704] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Accepted: 04/14/2021] [Indexed: 12/13/2022] Open
Abstract
Objective The purpose of this study is to develop a novel data-driven hybrid model by fusing ensemble empirical mode decomposition (EEMD), seasonal autoregressive integrated moving average (SARIMA), with nonlinear autoregressive artificial neural network (NARNN), called EEMD-ARIMA-NARNN model, to assess and forecast the epidemic patterns of TB in Tibet. Methods The TB incidence from January 2006 to December 2017 was obtained, and then the time series was partitioned into training subsamples (from January 2006 to December 2016) and testing subsamples (from January to December 2017). Among them, the training set was used to develop the EEMD-SARIMA-NARNN combined model, whereas the testing set was used to validate the forecasting performance of the model. Whilst the forecasting accuracy level of this novel method was compared with the basic SARIMA model, basic NARNN model, error-trend-seasonal (ETS) model, and traditional SARIMA-NARNN mixture model. Results By comparing the accuracy level of the forecasting measurements including root-mean-square error, mean absolute deviation, mean error rate, mean absolute percentage error, and root-mean-square percentage error, it was shown that the EEMD-SARIMA-NARNN combined method produced lower error rates than the others. The descriptive statistics suggested that TB was a seasonal disease, peaking in late winter and early spring and a trough in autumn and early winter, and the TB epidemic indicated a drastic increase by a factor of 1.7 from 2006 to 2017 in Tibet, with average annual percentage change of 5.8 (95% confidence intervals: 3.5–8.1). Conclusion This novel data-driven hybrid method can better consider both linear and nonlinear components in the TB incidence than the others used in this study, which is of great help to estimate and forecast the future epidemic trends of TB in Tibet. Besides, under present trends, strict precautionary measures are required to reduce the spread of TB in Tibet.
Collapse
Affiliation(s)
- Jizhen Li
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan Province, People's Republic of China
| | - Yuhong Li
- National Center for Tuberculosis Control and Prevention, China Center for Disease Control and Prevention, Beijing, People's Republic of China
| | - Ming Ye
- Preventive Medicine Clinic, Xinxiang Center for Disease Control and Prevention, Xinxiang, Henan Province, People's Republic of China
| | - Sanqiao Yao
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan Province, People's Republic of China
| | - Chongchong Yu
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan Province, People's Republic of China
| | - Lei Wang
- Center for Musculoskeletal Surgery, Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt-Universität Zu Berlin and Berlin Institute of Health, Berlin, Germany
| | - Weidong Wu
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan Province, People's Republic of China
| | - Yongbin Wang
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan Province, People's Republic of China
| |
Collapse
|
15
|
Zhai M, Li W, Tie P, Wang X, Xie T, Ren H, Zhang Z, Song W, Quan D, Li M, Chen L, Qiu L. Research on the predictive effect of a combined model of ARIMA and neural networks on human brucellosis in Shanxi Province, China: a time series predictive analysis. BMC Infect Dis 2021; 21:280. [PMID: 33740904 PMCID: PMC7980350 DOI: 10.1186/s12879-021-05973-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Accepted: 03/09/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Brucellosis is a major public health problem that seriously affects developing countries and could cause significant economic losses to the livestock industry and great harm to human health. Reasonable prediction of the incidence is of great significance in controlling brucellosis and taking preventive measures. METHODS Our human brucellosis incidence data were extracted from Shanxi Provincial Center for Disease Control and Prevention. We used seasonal-trend decomposition using Loess (STL) and monthplot to analyse the seasonal characteristics of human brucellosis in Shanxi Province from 2007 to 2017. The autoregressive integrated moving average (ARIMA) model, a combined model of ARIMA and the back propagation neural network (ARIMA-BPNN), and a combined model of ARIMA and the Elman recurrent neural network (ARIMA-ERNN) were established separately to make predictions and identify the best model. Additionally, the mean squared error (MAE), mean absolute error (MSE) and mean absolute percentage error (MAPE) were used to evaluate the performance of the model. RESULTS We observed that the time series of human brucellosis in Shanxi Province increased from 2007 to 2014 but decreased from 2015 to 2017. It had obvious seasonal characteristics, with the peak lasting from March to July every year. The best fitting and prediction effect was the ARIMA-ERNN model. Compared with those of the ARIMA model, the MAE, MSE and MAPE of the ARIMA-ERNN model decreased by 18.65, 31.48 and 64.35%, respectively, in fitting performance; in terms of prediction performance, the MAE, MSE and MAPE decreased by 60.19, 75.30 and 64.35%, respectively. Second, compared with those of ARIMA-BPNN, the MAE, MSE and MAPE of ARIMA-ERNN decreased by 9.60, 15.73 and 11.58%, respectively, in fitting performance; in terms of prediction performance, the MAE, MSE and MAPE decreased by 31.63, 45.79 and 29.59%, respectively. CONCLUSIONS The time series of human brucellosis in Shanxi Province from 2007 to 2017 showed obvious seasonal characteristics. The fitting and prediction performances of the ARIMA-ERNN model were better than those of the ARIMA-BPNN and ARIMA models. This will provide some theoretical support for the prediction of infectious diseases and will be beneficial to public health decision making.
Collapse
Affiliation(s)
- Mengmeng Zhai
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Wenhan Li
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Ping Tie
- Endemic Disease Prevention and Control Section, Shanxi Center for Disease Control and Prevention, Taiyuan City, Shanxi Province China
| | - Xuchun Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Tao Xie
- Department of Mathematical Statistics, School of Statistics, Jiangxi University of Finance and Economics, Nanchang, Jiangxi Province China
| | - Hao Ren
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Zhuang Zhang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Weimei Song
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Dichen Quan
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Meichen Li
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| | - Limin Chen
- Shanxi Provincial Peoples Hospital, Taiyuan City, Shanxi Province China
| | - Lixia Qiu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan City, Shanxi Province China
| |
Collapse
|
16
|
Ye GH, Alim M, Guan P, Huang DS, Zhou BS, Wu W. Improving the precision of modeling the incidence of hemorrhagic fever with renal syndrome in mainland China with an ensemble machine learning approach. PLoS One 2021; 16:e0248597. [PMID: 33725011 PMCID: PMC7963064 DOI: 10.1371/journal.pone.0248597] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 03/02/2021] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE Hemorrhagic fever with renal syndrome (HFRS), one of the main public health concerns in mainland China, is a group of clinically similar diseases caused by hantaviruses. Statistical approaches have always been leveraged to forecast the future incidence rates of certain infectious diseases to effectively control their prevalence and outbreak potential. Compared to the use of one base model, model stacking can often produce better forecasting results. In this study, we fitted the monthly reported cases of HFRS in mainland China with a model stacking approach and compared its forecasting performance with those of five base models. METHOD We fitted the monthly reported cases of HFRS ranging from January 2004 to June 2019 in mainland China with an autoregressive integrated moving average (ARIMA) model; the Holt-Winter (HW) method, seasonal decomposition of the time series by LOESS (STL); a neural network autoregressive (NNAR) model; and an exponential smoothing state space model with a Box-Cox transformation; ARMA errors; and trend and seasonal components (TBATS), and we combined the forecasting results with the inverse rank approach. The forecasting performance was estimated based on several accuracy criteria for model prediction, including the mean absolute percentage error (MAPE), root-mean-squared error (RMSE) and mean absolute error (MAE). RESULT There was a slight downward trend and obvious seasonal periodicity inherent in the time series data for HFRS in mainland China. The model stacking method was selected as the best approach with the best performance in terms of both fitting (RMSE 128.19, MAE 85.63, MAPE 8.18) and prediction (RMSE 151.86, MAE 118.28, MAPE 13.16). CONCLUSION The results showed that model stacking by using the optimal mean forecasting weight of the five abovementioned models achieved the best performance in terms of predicting HFRS one year into the future. This study has corroborated the conclusion that model stacking is an easy way to enhance prediction accuracy when modeling HFRS.
Collapse
Affiliation(s)
- Guo-hua Ye
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning, China
| | - Mirxat Alim
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning, China
| | - Peng Guan
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning, China
| | - De-sheng Huang
- Department of Mathematics, School of Fundamental Sciences, China Medical University, Shenyang, Liaoning, China
| | - Bao-sen Zhou
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning, China
| | - Wei Wu
- Department of Epidemiology, School of Public Health, China Medical University, Shenyang, Liaoning, China
- * E-mail:
| |
Collapse
|
17
|
Yu G, Feng H, Feng S, Zhao J, Xu J. Forecasting hand-foot-and-mouth disease cases using wavelet-based SARIMA-NNAR hybrid model. PLoS One 2021; 16:e0246673. [PMID: 33544752 PMCID: PMC7864434 DOI: 10.1371/journal.pone.0246673] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 01/23/2021] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Hand-foot-and-mouth disease_(HFMD) is one of the most typical diseases in children that is associated with high morbidity. Reliable forecasting is crucial for prevention and control. Recently, hybrid models have become popular, and wavelet analysis has been widely performed. Better prediction accuracy may be achieved using wavelet-based hybrid models. Thus, our aim is to forecast number of HFMD cases with wavelet-based hybrid models. MATERIALS AND METHODS We fitted a wavelet-based seasonal autoregressive integrated moving average (SARIMA)-neural network nonlinear autoregressive (NNAR) hybrid model with HFMD weekly cases from 2009 to 2016 in Zhengzhou, China. Additionally, a single SARIMA model, simplex NNAR model, and pure SARIMA-NNAR hybrid model were established for comparison and estimation. RESULTS The wavelet-based SARIMA-NNAR hybrid model demonstrates excellent performance whether in fitting or forecasting compared with other models. Its fitted and forecasting time series are similar to the actual observed time series. CONCLUSIONS The wavelet-based SARIMA-NNAR hybrid model fitted in this study is suitable for forecasting the number of HFMD cases. Hence, it will facilitate the prevention and control of HFMD.
Collapse
Affiliation(s)
- Gongchao Yu
- Department of Gastroenterology, The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
| | - Huifen Feng
- Department of Gastroenterology, The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
- * E-mail:
| | - Shuang Feng
- Department of Gastroenterology, The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
| | - Jing Zhao
- Department of Gastroenterology, The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
| | - Jing Xu
- Department of Gastroenterology, The Fifth Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
| |
Collapse
|
18
|
Zheng Y, Zhang X, Wang X, Wang K, Cui Y. Predictive study of tuberculosis incidence by time series method and Elman neural network in Kashgar, China. BMJ Open 2021; 11:e041040. [PMID: 33478962 PMCID: PMC7825257 DOI: 10.1136/bmjopen-2020-041040] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
OBJECTIVES Kashgar, located in Xinjiang, China has a high incidence of tuberculosis (TB) making prevention and control extremely difficult. In addition, there have been very few prediction studies on TB incidence here. We; therefore, considered it a high priority to do prediction analysis of TB incidence in Kashgar, and so provide a scientific reference for eventual prevention and control. DESIGN Time series study. SETTING KASHGAR, CHINA Kashgar, China. METHODS We used a single Box-Jenkins method and a Box-Jenkins and Elman neural network (ElmanNN) hybrid method to do prediction analysis of TB incidence in Kashgar. Root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) were used to measure the prediction accuracy. RESULTS After careful analysis, the single autoregression (AR) (1, 2, 8) model and the AR (1, 2, 8)-ElmanNN (AR-Elman) hybrid model were established, and the optimal neurons value of the AR-Elman hybrid model is 6. In the fitting dataset, the RMSE, MAE and MAPE were 6.15, 4.33 and 0.2858, respectively, for the AR (1, 2, 8) model, and 3.78, 3.38 and 0.1837, respectively, for the AR-Elman hybrid model. In the forecasting dataset, the RMSE, MAE and MAPE were 10.88, 8.75 and 0.2029, respectively, for the AR (1, 2, 8) model, and 8.86, 7.29 and 0.2006, respectively, for the AR-Elman hybrid model. CONCLUSIONS Both the single AR (1, 2, 8) model and the AR-Elman model could be used to predict the TB incidence in Kashgar, but the modelling and validation scale-dependent measures (RMSE, MAE and MAPE) in the AR (1, 2, 8) model were inferior to those in the AR-Elman hybrid model, which indicated that the AR-Elman hybrid model was better than the AR (1, 2, 8) model. The Box-Jenkins and ElmanNN hybrid method therefore can be highlighted in predicting the temporal trends of TB incidence in Kashgar, which may act as the potential for far-reaching implications for prevention and control of TB.
Collapse
Affiliation(s)
- Yanling Zheng
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, China
| | - Xueliang Zhang
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, China
| | - Xijiang Wang
- Center for Disease Control and Prevention of Xinjiang Uygur Autonomous Region, Urumqi, China
| | - Kai Wang
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, China
| | - Yan Cui
- Center for Disease Control and Prevention of Xinjiang Uygur Autonomous Region, Urumqi, China
| |
Collapse
|
19
|
Zheng Y, Wang K, Zhang L, Wang L. Study on the relationship between the incidence of influenza and climate indicators and the prediction of influenza incidence. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:473-481. [PMID: 32815008 DOI: 10.1007/s11356-020-10523-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 08/13/2020] [Indexed: 06/11/2023]
Abstract
In recent 2 years, the incidence of influenza showed a slight upward trend in Guangxi; therefore, some joint actions should be done to help preventing and controlling this disease. The factors analysis of affecting influenza and early prediction of influenza incidence may help policy-making so as to take effective measures to prevent and control influenza. In this study, we used the cross correlation function (CCF) to analyze the effect of climate indicators on influenza incidence, ARIMA and ARIMAX (autoregressive integrated moving average model with exogenous input variables) model methods to do predictive analysis of influenza incidence. The results of CCF analysis showed that climate indicators (PM2.5, PM10, SO2, CO, NO2, O3, average temperature, maximum temperature, minimum temperature, average relative humidity, and sunshine duration) had significant effects on the incidence of influenza. People need to take good precautions in the days of severe air pollution and keep warm in cold weather to prevent influenza. We found that the ARIMAX (1,0,1)(0,0,1)12 with NO2 model has good predictive performance, which can be used to predict the influenza incidence in Guangxi, and the predicted incidence may be useful in developing early warning systems and providing important evidence for influenza control policy-making and public health intervention.
Collapse
Affiliation(s)
- Yanling Zheng
- College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, 830011, People's Republic of China.
| | - Kai Wang
- College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, 830011, People's Republic of China
| | - Liping Zhang
- College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, 830011, People's Republic of China
| | - Lei Wang
- College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, 830011, People's Republic of China
| |
Collapse
|
20
|
Zou LX, Sun L. Analysis of Hemorrhagic Fever With Renal Syndrome Using Wavelet Tools in Mainland China, 2004-2019. Front Public Health 2020; 8:571984. [PMID: 33335877 PMCID: PMC7736046 DOI: 10.3389/fpubh.2020.571984] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Accepted: 11/09/2020] [Indexed: 01/24/2023] Open
Abstract
Introduction : Hemorrhagic fever with renal syndrome (HFRS) is a life-threatening public health problem in China, accounting for ~90% of HFRS cases reported globally. Accurate analysis and prediction of the HFRS epidemic could help to establish effective preventive measures. Materials and Methods : In this study, the geographical information system (GIS) explored the spatiotemporal features of HFRS, the wavelet power spectrum (WPS) unfolded the cyclical fluctuation of HFRS, and the wavelet neural network (WNN) model predicted the trends of HFRS outbreaks in mainland China. Results : A total of 209,209 HFRS cases were reported in mainland China from 2004 to 2019, with the annual incidence ranged from 0 to 13.05 per 100,0000 persons at the province level. The WPS proved that the periodicity of HFRS could be half a year, 1 year, and roughly 7-year at different time intervals. The WNN structure of 12-6-1 was set up as the fittest forecasting model for the HFRS epidemic. Conclusions : This study provided several potential support tools for the control and risk-management of HFRS in China.
Collapse
Affiliation(s)
- Lu-Xi Zou
- School of Management, Zhejiang University, Hangzhou, China
| | - Ling Sun
- Department of Nephrology, Xuzhou Central Hospital, The Xuzhou School of Clinical Medicine of Nanjing Medical University, Xuzhou, China.,Xuzhou Clinical School of Xuzhou Medical University, Xuzhou, China
| |
Collapse
|
21
|
Li Y, Zhu L, Lu W, Chen C, Yang H. Seasonal variation in notified tuberculosis cases from 2014 to 2018 in eastern China. J Int Med Res 2020; 48:300060520949031. [PMID: 32840170 PMCID: PMC7450459 DOI: 10.1177/0300060520949031] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Objective Tuberculosis (TB) incidence shows a seasonal trend. The purpose of this study
was to explore seasonal trends in TB cases in Jiangsu Province. Methods TB case data were collected from the TB registration system from 2014 to
2018. The X12-ARIMA model was used to adjust the Jiangsu TB time series.
Analysis of variance was used to compare TB seasonal amplitude (SA) between
subgroups and identify factors responsible for seasonal variation. Results The TB incidence in Jiangsu showed a seasonal trend. Confirmed active TB
peaked in March and reached a minimum in February. The amplitude of the
peak-to-bottom difference was 38.15%. The SAs in individuals 7 to 17 years
old (80.00%) and students (71.80%) were significantly different than those
in other subgroups. Among bacterial culture positive individuals, the SAs
among female patients, individuals aged 7 to 17 years and students were
significantly different from those in the reference group. Among
culture-negative patients, the SA among individuals aged 7 to 17 years was
significantly different those in other subgroups. Conclusions The TB incidence in Jiangsu Province displayed a seasonal trend. Factors
related to seasonal variation were age and occupation. Our results highlight
the importance of controlling Mycobacterium tuberculosis
transmission during winter.
Collapse
Affiliation(s)
- Yishu Li
- Department of Epidemiology and Statistics, School of Public Health, Southeast University, Nanjing, Jiangsu Province, PR China
| | - Limei Zhu
- Department of Chronic Communicable Disease, Center for Disease Control and Prevention of Jiangsu Province, Nanjing, Jiangsu Province, PR China
| | - Wei Lu
- Department of Chronic Communicable Disease, Center for Disease Control and Prevention of Jiangsu Province, Nanjing, Jiangsu Province, PR China
| | - Cheng Chen
- Department of Chronic Communicable Disease, Center for Disease Control and Prevention of Jiangsu Province, Nanjing, Jiangsu Province, PR China
| | - Haitao Yang
- Department of Epidemiology and Statistics, School of Public Health, Southeast University, Nanjing, Jiangsu Province, PR China
| |
Collapse
|
22
|
Zheng Y, Zhang L, Zhu X, Guo G. A comparative study of two methods to predict the incidence of hepatitis B in Guangxi, China. PLoS One 2020; 15:e0234660. [PMID: 32579598 PMCID: PMC7314421 DOI: 10.1371/journal.pone.0234660] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 05/30/2020] [Indexed: 12/19/2022] Open
Abstract
In recent years, the incidence of hepatitis B (HB) in Guangxi is higher than that of the national level; it has been increasing, so it is urgent to do a good predictive research of HB incidence, which can help analyze the early warning of hepatitis B in Guangxi, China. In the study, the feasibility of predicting HB incidence in Guangxi by autoregressive integrated moving average (ARIMA) model method and Elman neural network (ElmanNN) method was discussed respectively, and the prediction accuracy of the two models was compared. Finally, we established the ARIMA (0, 1, 1) model and ElmanNN with 8 neurons. Both ARIMA (0, 1, 1) model and ElmanNN model had good performance, and their prediction accuracy were high. The fitting and prediction root-mean-square error (RMSE) and mean absolute error (MAE) of ElmanNN were smaller than those of ARIMA (0, 1, 1) model, which indicated that ElmanNN was superior to ARIMA (0, 1, 1) model in predicting the incidence of hepatitis B in Guangxi. Based on the ElmanNN, the HB incidence from September 2019 to December 2020 in Guangxi was predicted, the predicted results showed that the incidence of HB in 2020 was slightly higher than that in 2019 and the change trend was similar to that in 2019, for 2021 and beyond, the ElmanNN model could be used to continue the predictive analysis.
Collapse
Affiliation(s)
- Yanling Zheng
- College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, People’s Republic of China
- * E-mail: (YZ); (GG)
| | - Liping Zhang
- College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, People’s Republic of China
| | - XiXun Zhu
- School of Computer Engineering, Jingchu University of Technology, Jingmen, People’s Republic of China
| | - Gang Guo
- State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia, Clinical Medicine Institute, the First Affiliated Hospital of Xinjiang Medical University, Urumqi, People’s Republic of China
- * E-mail: (YZ); (GG)
| |
Collapse
|
23
|
Wang Y, Xu C, Ren J, Wu W, Zhao X, Chao L, Liang W, Yao S. Secular Seasonality and Trend Forecasting of Tuberculosis Incidence Rate in China Using the Advanced Error-Trend-Seasonal Framework. Infect Drug Resist 2020; 13:733-747. [PMID: 32184635 PMCID: PMC7062399 DOI: 10.2147/idr.s238225] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Accepted: 02/25/2020] [Indexed: 12/27/2022] Open
Abstract
Objective Tuberculosis (TB) is a major public health problem in China, and contriving a long-term forecast is a useful aid for better launching prevention initiatives. Regrettably, such a forecasting method with robust and accurate performance is still lacking. Here, we aim to investigate its potential of the error-trend-seasonal (ETS) framework through a series of comparative experiments to analyze and forecast its secular epidemic seasonality and trends of TB incidence in China. Methods We collected the TB incidence data from January 1997 to August 2019, and then partitioning the data into eight different training and testing subsamples. Thereafter, we constructed the ETS and seasonal autoregressive integrated moving average (SARIMA) models based on the training subsamples, and multiple performance indices including the mean absolute deviation, mean absolute percentage error, root-mean-squared error, and mean error rate were adopted to assess their simulation and projection effects. Results In the light of the above performance measures, the ETS models provided a pronounced improvement for the long-term seasonality and trend forecasting in TB incidence rate over the SARIMA models, be it in various training or testing subsets apart from the 48-step ahead forecasting. The descriptive results to the data revealed that TB incidence showed notable seasonal characteristics with predominant peaks of spring and early summer and began to be plunging at on average 3.722% per year since 2008. However, this rate reduced to 2.613% per year since 2015 and furthermore such a trend would be predicted to continue in years ahead. Conclusion The ETS framework has the ability to conduct long-term forecasting for TB incidence, which may be beneficial for the long-term planning of the TB prevention and control. Additionally, considering the predicted dropping rate of TB morbidity, more particular strategies should be formulated to dramatically accelerate progress towards the goals of the End TB Strategy.
Collapse
Affiliation(s)
- Yongbin Wang
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan, People's Republic of China
| | - Chunjie Xu
- Department of Occupational and Environmental Health, School of Public Health, Capital Medical University, Beijing, People's Republic of China
| | - Jingchao Ren
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan, People's Republic of China
| | - Weidong Wu
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan, People's Republic of China
| | - Xiangmei Zhao
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan, People's Republic of China
| | - Ling Chao
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan, People's Republic of China
| | - Wenjuan Liang
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan, People's Republic of China
| | - Sanqiao Yao
- Department of Epidemiology and Health Statistics, School of Public Health, Xinxiang Medical University, Xinxiang, Henan, People's Republic of China
| |
Collapse
|