1
|
Xu Y, Zheng X, Li Y, Ye X, Cheng H, Wang H, Lyu J. Exploring patient medication adherence and data mining methods in clinical big data: A contemporary review. J Evid Based Med 2023; 16:342-375. [PMID: 37718729 DOI: 10.1111/jebm.12548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 08/30/2023] [Indexed: 09/19/2023]
Abstract
BACKGROUND Increasingly, patient medication adherence data are being consolidated from claims databases and electronic health records (EHRs). Such databases offer an indirect avenue to gauge medication adherence in our data-rich healthcare milieu. The surge in data accessibility, coupled with the pressing need for its conversion to actionable insights, has spotlighted data mining, with machine learning (ML) emerging as a pivotal technique. Nonadherence poses heightened health risks and escalates medical costs. This paper elucidates the synergistic interaction between medical database mining for medication adherence and the role of ML in fostering knowledge discovery. METHODS We conducted a comprehensive review of EHR applications in the realm of medication adherence, leveraging ML techniques. We expounded on the evolution and structure of medical databases pertinent to medication adherence and harnessed both supervised and unsupervised ML paradigms to delve into adherence and its ramifications. RESULTS Our study underscores the applications of medical databases and ML, encompassing both supervised and unsupervised learning, for medication adherence in clinical big data. Databases like SEER and NHANES, often underutilized due to their intricacies, have gained prominence. Employing ML to excavate patient medication logs from these databases facilitates adherence analysis. Such findings are pivotal for clinical decision-making, risk stratification, and scholarly pursuits, aiming to elevate healthcare quality. CONCLUSION Advanced data mining in the era of big data has revolutionized medication adherence research, thereby enhancing patient care. Emphasizing bespoke interventions and research could herald transformative shifts in therapeutic modalities.
Collapse
Affiliation(s)
- Yixian Xu
- Department of Anesthesiology, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Xinkai Zheng
- Department of Dermatology, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Yuanjie Li
- Planning & Discipline Construction Office, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Xinmiao Ye
- Department of Anesthesiology, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Hongtao Cheng
- School of Nursing, Jinan University, Guangzhou, China
| | - Hao Wang
- Department of Anesthesiology, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Jun Lyu
- Department of Clinical Research, The First Affiliated Hospital of Jinan University, Guangzhou, China
- Guangdong Provincial Key Laboratory of Traditional Chinese Medicine Informatization, Guangzhou, China
| |
Collapse
|
2
|
Leiva V, Alcudia E, Montano J, Castro C. An Epidemiological Analysis for Assessing and Evaluating COVID-19 Based on Data Analytics in Latin American Countries. BIOLOGY 2023; 12:887. [PMID: 37372171 DOI: 10.3390/biology12060887] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 06/14/2023] [Accepted: 06/16/2023] [Indexed: 06/29/2023]
Abstract
This research provides a detailed analysis of the COVID-19 spread across 14 Latin American countries. Using time-series analysis and epidemic models, we identify diverse outbreak patterns, which seem not to be influenced by geographical location or country size, suggesting the influence of other determining factors. Our study uncovers significant discrepancies between the number recorded COVID-19 cases and the real epidemiological situation, emphasizing the crucial need for accurate data handling and continuous surveillance in managing epidemics. The absence of a clear correlation between the country size and the confirmed cases, as well as with the fatalities, further underscores the multifaceted influences on COVID-19 impact beyond population size. Despite the decreased real-time reproduction number indicating quarantine effectiveness in most countries, we note a resurgence in infection rates upon resumption of daily activities. These insights spotlight the challenge of balancing public health measures with economic and social activities. Our core findings provide novel insights, applicable to guiding epidemic control strategies and informing decision-making processes in combatting the pandemic.
Collapse
Affiliation(s)
- Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile
| | - Esdras Alcudia
- Faculty of Statistics and Informatics, Universidad Veracruzana, Xalapa 91140, Mexico
| | - Julia Montano
- Faculty of Statistics and Informatics, Universidad Veracruzana, Xalapa 91140, Mexico
| | - Cecilia Castro
- Centre of Mathematics, University of Minho, 4710-057 Braga, Portugal
| |
Collapse
|
3
|
Rahman MZ, Akbar MA, Leiva V, Tahir A, Riaz MT, Martin-Barreiro C. An intelligent health monitoring and diagnosis system based on the internet of things and fuzzy logic for cardiac arrhythmia COVID-19 patients. Comput Biol Med 2023; 154:106583. [PMID: 36716687 PMCID: PMC9883984 DOI: 10.1016/j.compbiomed.2023.106583] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 12/28/2022] [Accepted: 01/22/2023] [Indexed: 01/26/2023]
Abstract
BACKGROUND During the COVID-19 pandemic, there is a global demand for intelligent health surveillance and diagnosis systems for patients with critical conditions, particularly those with severe heart diseases. Sophisticated measurement tools are used in hospitals worldwide to identify serious heart conditions. However, these tools need the face-to-face involvement of healthcare experts to identify cardiac problems. OBJECTIVE To design and implement an intelligent health monitoring and diagnosis system for critical cardiac arrhythmia COVID-19 patients. METHODOLOGY We use artificial intelligence tools divided into two parts: (i) IoT-based health monitoring; and (ii) fuzzy logic-based medical diagnosis. The intelligent diagnosis of heart conditions and IoT-based health surveillance by doctors is offered to critical COVID-19 patients or isolated in remote locations. Sensors, cloud storage, as well as a global system for mobile texts and emails for communication with doctors in case of emergency are employed in our proposal. RESULTS Our implemented system favors remote areas and isolated critical patients. This system utilizes an intelligent algorithm that employs an ECG signal pre-processed by moving through six digital filters. Then, based on the processed results, features are computed and assessed. The intelligent fuzzy system can make an autonomous diagnosis and has enough information to avoid human intervention. The algorithm is trained using ECG data from the MIT-BIH database and achieves high accuracy. In real-time validation, the fuzzy algorithm obtained almost 100% accuracy for all experiments. CONCLUSION Our intelligent system can be helpful in many situations, but it is particularly beneficial for isolated COVID-19 patients who have critical heart arrhythmia and must receive intensive care.
Collapse
Affiliation(s)
- Muhammad Zia Rahman
- Department of Mechanical, Mechatronics and Manufacturing Engineering, University of Engineering and Technology Lahore, Faisalabad, Pakistan.
| | - Muhammad Azeem Akbar
- Department of Information Technology, Lappeenranta University of Technology, Lappeenranta, Finland.
| | - Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso, Chile.
| | - Abdullah Tahir
- Department of Mechanical, Mechatronics and Manufacturing Engineering, University of Engineering and Technology Lahore, Faisalabad, Pakistan
| | - Muhammad Tanveer Riaz
- Department of Mechanical, Mechatronics and Manufacturing Engineering, University of Engineering and Technology Lahore, Faisalabad, Pakistan; Department of Electrical and Information Engineering, Politecnico di Bari, Bari, Italy
| | - Carlos Martin-Barreiro
- Faculty of Natural Sciences and Mathematics, Escuela Superior Politécnica del Litoral ESPOL, Guayaquil, Ecuador; Faculty of Engineering, Universidad Espíritu Santo, Samborondón, Ecuador
| |
Collapse
|
4
|
Perdomo-Argüello FJ, Ortega-Gómez E, Galindo-Villardón P, Leiva V, Vicente-Galindo P. STATIS multivariate three-way method for evaluating quality of life after corneal surgery: Methodology and case study in Costa Rica. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:6110-6133. [PMID: 37161100 DOI: 10.3934/mbe.2023264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Vision-related quality of life (QoL) analyzes the visual function concerning individual well-being based on activity and social participation. Because QoL is a multivariate construct, a multivariate statistical method must be used to analyze this construct. In this paper, we present a methodology based on STATIS multivariate three-way methods to assess the real change in vision-related QoL for myopic patients by comparing their conditions before and after corneal surgery. We conduct a case study in Costa Rica to detect the outcomes of patients referred for myopia that underwent refractive surgery. We consider a descriptive, observational and prospective study. We utilize the NEI VFQ-25 instrument to measure the vision-related QoL in five different stages over three months. After applying this instrument/questionnaire, a statistically significant difference was detected between the perceived QoL levels. In addition, strong correlations were identified with highly similar structures ranging from 0.857 to 0.940. The application of the dual STATIS method found the non-existence of reconceptualization in myopic patients, but a statistically significant recalibration was identified. Furthermore, a real change was observed in all patients after surgery. This finding has not been stated previously due to the limitations of the existing statistical tools. We demonstrated that dual STATIS is a multivariate method capable of evaluating vision-related QoL data and detecting changes in recalibration and reconceptualization.
Collapse
Affiliation(s)
| | - Estelina Ortega-Gómez
- Departamento de Estadística, Universidad de Panamá, Panama City, Panama
- Departamento de Estadística, Universidad de Salamanca, Salamanca, Spain
| | - Purificación Galindo-Villardón
- Departamento de Estadística, Universidad de Salamanca, Salamanca, Spain
- Centro de Estudios e Investigaciones Estadísticas, ESPOL, Guayaquil, Ecuador
- Centro de Gestión de Estudios Estadísticos, Universidad Estatal de Milagro, Guayas, Ecuador
| | - Víctor Leiva
- Escuela de Ingeniería Industrial, Pontificia Universidad Católica de Valparaíso, Valparaíso, Chile
| | | |
Collapse
|
5
|
Sardar I, Akbar MA, Leiva V, Alsanad A, Mishra P. Machine learning and automatic ARIMA/Prophet models-based forecasting of COVID-19: methodology, evaluation, and case study in SAARC countries. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT : RESEARCH JOURNAL 2022; 37:345-359. [PMID: 36217358 PMCID: PMC9533996 DOI: 10.1007/s00477-022-02307-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 08/26/2022] [Indexed: 06/16/2023]
Abstract
Machine learning (ML) has proved to be a prominent study field while solving complex real-world problems. The whole globe has suffered and continues suffering from Coronavirus disease 2019 (COVID-19), and its projections need to be forecasted. In this article, we propose and derive an autoregressive modeling framework based on ML and statistical methods to predict confirmed cases of COVID-19 in the South Asian Association for Regional Cooperation (SAARC) countries. Automatic forecasting models based on autoregressive integrated moving average (ARIMA) and Prophet time series structures, as well as extreme gradient boosting, generalized linear model elastic net (GLMNet), and random forest ML techniques, are introduced and applied to COVID-19 data from the SAARC countries. Different forecasting models are compared by means of selection criteria. By using evaluation metrics, the best and suitable models are selected. Results prove that the ARIMA model is found to be suitable and ideal for forecasting confirmed infected cases of COVID-19 in these countries. For the confirmed cases in Afghanistan, Bangladesh, India, Maldives, and Sri Lanka, the ARIMA model is superior to the other models. In Bhutan, the Prophet time series model is appropriate for predicting such cases. The GLMNet model is more accurate than other time-series models for Nepal and Pakistan. The random forest model is excluded from forecasting because of its poor fit.
Collapse
Affiliation(s)
- Iqra Sardar
- Department of Mathematics and Statistics, International Islamic University Islamabad, Islamabad, Pakistan
| | | | - Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso, Chile
| | - Ahmed Alsanad
- STC’s Artificial Intelligence Chair, Department of Information Systems, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Pradeep Mishra
- Department of Statistics, College of Agriculture, Powarkheda, India
| |
Collapse
|
6
|
On a Novel Dynamics of SEIR Epidemic Models with a Potential Application to COVID-19. Symmetry (Basel) 2022. [DOI: 10.3390/sym14071436] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
In this paper, we study a type of disease that unknowingly spreads for a long time, but by default, spreads only to a minimal population. This disease is not usually fatal and often goes unnoticed. We propose and derive a novel epidemic mathematical model to describe such a disease, utilizing a fractional differential system under the Atangana–Baleanu–Caputo derivative. This model deals with the transmission between susceptible, exposed, infected, and recovered classes. After formulating the model, equilibrium points as well as stability and feasibility analyses are stated. Then, we present results concerning the existence of positivity in the solutions and a sensitivity analysis. Consequently, computational experiments are conducted and discussed via proper criteria. From our experimental results, we find that the loss and regain of immunity result in the gain and loss of infections. Epidemic models can be linked to symmetry and asymmetry from distinct points of view. By using our novel approach, much research may be expected in epidemiology and other areas, particularly concerning COVID-19, to state how immunity develops after being infected by this virus.
Collapse
|
7
|
Application of Data Science for Cluster Analysis of COVID-19 Mortality According to Sociodemographic Factors at Municipal Level in Mexico. MATHEMATICS 2022. [DOI: 10.3390/math10132167] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Mexico is among the five countries with the largest number of reported deaths from COVID-19 disease, and the mortality rates associated to infections are heterogeneous in the country due to structural factors concerning population. This study aims at the analysis of clusters related to mortality rate from COVID-19 at the municipal level in Mexico from the perspective of Data Science. In this sense, a new application is presented that uses a machine learning hybrid algorithm for generating clusters of municipalities with similar values of sociodemographic indicators and mortality rates. To provide a systematic framework, we applied an extension of the International Business Machines Corporation (IBM) methodology called Batch Foundation Methodology for Data Science (FMDS). For the study, 1,086,743 death certificates corresponding to the year 2020 were used, among other official data. As a result of the analysis, two key indicators related to mortality from COVID-19 at the municipal level were identified: one is population density and the other is percentage of population in poverty. Based on these indicators, 16 municipality clusters were determined. Among the main results of this research, it was found that clusters with high values of mortality rate had high values of population density and low poverty levels. In contrast, clusters with low density values and high poverty levels had low mortality rates. Finally, we think that the patterns found, expressed as municipality clusters with similar characteristics, can be useful for decision making by health authorities regarding disease prevention and control for reinforcing public health measures and optimizing resource distribution for reducing hospitalizations and mortality.
Collapse
|
8
|
Modern Multivariate Statistical Methods for Evaluating the Impact of WhatsApp on Academic Performance: Methodology and Case Study in India. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12126141] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Despite the increasing amount of research on social media, there are few studies on the use of WhatsApp to assess academic performance. Surprisingly, students use social media during lectures, causing a problem. According to the literature, students utilize WhatsApp throughout academic activities where it is prohibited. Researchers and policymakers must pay attention to this problem to understand its impact on academic achievement. In this paper, by using multivariate statistical methods, we investigate the impact of WhatsApp use on academic performance. We construct a questionnaire for this investigation and apply it to a case study based on a sample of 258 students of management from India. We determine the prevalence of WhatsApp employment among these students and note that many of them utilize the app for academic purposes. We found a positive association between time spent on WhatsApp and students’ grade point average (GPA) based on factor, principal component, correlation, and chi-square analyses. Answering questions on the app in class is related to the program’s ability to help students learn. More use of the app in class leads to a lower GPA. However, sometimes, using the app might help students learn and perform better. We identify that many students disseminate materials through the app for academic purposes, contributing to their academic performance. Furthermore, this app is utilized for communicating with their teachers. Computations were carried out with the R and SPSS software.
Collapse
|
9
|
Alkady W, ElBahnasy K, Leiva V, Gad W. Classifying COVID-19 based on amino acids encoding with machine learning algorithms. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS : AN INTERNATIONAL JOURNAL SPONSORED BY THE CHEMOMETRICS SOCIETY 2022; 224:104535. [PMID: 35308181 PMCID: PMC8923015 DOI: 10.1016/j.chemolab.2022.104535] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Revised: 02/27/2022] [Accepted: 03/05/2022] [Indexed: 05/12/2023]
Abstract
COVID-19 disease causes serious respiratory illnesses. Therefore, accurate identification of the viral infection cycle plays a key role in designing appropriate vaccines. The risk of this disease depends on proteins that interact with human receptors. In this paper, we formulate a novel model for COVID-19 named "amino acid encoding based prediction" (AAPred). This model is accurate, classifies the various coronavirus types, and distinguishes SARS-CoV-2 from other coronaviruses. With the AAPred model, we reduce the number of features to enhance its performance by selecting the most important ones employing statistical criteria. The protein sequence of SARS-CoV-2 for understanding the viral infection cycle is analyzed. Six machine learning classifiers related to decision trees, k-nearest neighbors, random forest, support vector machine, bagging ensemble, and gradient boosting are used to evaluate the model in terms of accuracy, precision, sensitivity, and specificity. We implement the obtained results computationally and apply them to real data from the National Genomics Data Center. The experimental results report that the AAPred model reduces the features to seven of them. The average accuracy of the 10-fold cross-validation is 98.69%, precision is 98.72%, sensitivity is 96.81%, and specificity is 97.72%. The features are selected utilizing information gain and classified with random forest. The proposed model predicts the type of Coronavirus and reduces the number of extracted features. We identify that SARS-CoV-2 has similar physicochemical characteristics in some regions of SARS-CoV. Also, we report that SARS-CoV-2 has similar infection cycles and sequences in some regions of SARS CoV indicating the affectedness of vaccines on SARS-CoV-2. A comparison with deep learning shows similar results with our method.
Collapse
Affiliation(s)
- Walaa Alkady
- Department of Bioinformatics, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt
| | - Khaled ElBahnasy
- Department of Information Systems, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt
| | - Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso, Chile
| | - Walaa Gad
- Department of Information Systems, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt
| |
Collapse
|
10
|
Multiscale Monitoring Using Machine Learning Methods: New Methodology and an Industrial Application to a Photovoltaic System. MATHEMATICS 2022. [DOI: 10.3390/math10060890] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
In this study, a multiscale monitoring method for nonlinear processes was developed. We introduced a machine learning tool for fault detection and isolation based on the kernel principal component analysis (PCA) and discrete wavelet transform. The principle of our proposal involved decomposing multivariate data into wavelet coefficients by employing the discrete wavelet transform. Then, the kernel PCA was applied on every matrix of coefficients to detect defects. Only those scales that manifest overruns of the squared prediction errors in control limits were considered in the data reconstruction phase. Thus, the kernel PCA was approached on the reconstructed matrix for detecting defects and isolation. This approach exploits the kernel PCA performance for nonlinear process monitoring in combination with multiscale analysis when processing time-frequency scales. The proposed method was validated on a photovoltaic system related to a complex industrial process. A data matrix was determined from the variables that characterize this process corresponding to motor current, angular speed, convertor output voltage, and power voltage system output. We tested the developed methodology on 1000 observations of photovoltaic variables. A comparison with monitoring methods based on neural PCA was established, proving the efficiency of the developed methodology.
Collapse
|
11
|
Sharbaf MAMJ, Abedini MJ. Implementation of supervised principal component analysis for global sensitivity analysis of models with correlated inputs. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT : RESEARCH JOURNAL 2022; 36:2789-2818. [PMID: 35095342 PMCID: PMC8787458 DOI: 10.1007/s00477-021-02158-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 12/12/2021] [Indexed: 06/14/2023]
Abstract
Global Sensitivity Analysis (GSA) plays a significant role in quantifying the tangible impact of model inputs on the uncertainty of response variable. As GSA results are strongly affected by correlated inputs, several studies have considered this issue, but most of them are computationally expensive, labor-intensive, and difficult to implement. Accordingly, this paper puts forward a novel regression-based strategy based on the Supervised Principal Component Analysis (SPCA), benefiting from the Reproducing Kernel Hilbert Space. Indeed, by conducting one kind of variance-based sensitivity analysis, a renowned method exclusively customized for models with orthogonal inputs, on SPCA regression, the impact of the correlation structure of input variables is considered. The ability of the suggested technique is evaluated with five test cases as well as three hydrologic and hydraulic models, and the results are compared with those obtained from the correlation ratio method; Taken as a benchmark solution, which is a robust but quite complicated approach in terms of programming. It is found that the proposed method satisfactorily identifies the sensitivity ordering of model inputs. Furthermore, it is proved in this study that the performance of the proposed approach is also supported by the total contribution index in the derived covariance decomposition equation. Moreover, the proposed method compared with the correlation ratio method, is found to be computationally efficient and easy to implement. Overall, the proposed scheme is appropriate for high dimensional, quite strong nonlinear or expensive models with correlated inputs, whose coefficient of determination between the original model and regression-based SPCA model is larger than 0.33.
Collapse
Affiliation(s)
- Mohammad Ali Mohammad Jafar Sharbaf
- Department of Civil and Environmental Engineering, School of Engineering, Shiraz University, Engineering Building #1, Zand Street, 7134851156 Shiraz, Fars Iran
| | - Mohammad Javad Abedini
- Department of Civil and Environmental Engineering, School of Engineering, Shiraz University, Engineering Building #1, Zand Street, 7134851156 Shiraz, Fars Iran
| |
Collapse
|
12
|
Saulo H, Souza R, Vila R, Leiva V, Aykroyd RG. Modeling Mortality Based on Pollution and Temperature Using a New Birnbaum-Saunders Autoregressive Moving Average Structure with Regressors and Related-Sensors Data. SENSORS 2021; 21:s21196518. [PMID: 34640834 PMCID: PMC8512640 DOI: 10.3390/s21196518] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 09/22/2021] [Accepted: 09/25/2021] [Indexed: 12/18/2022]
Abstract
Environmental agencies are interested in relating mortality to pollutants and possible environmental contributors such as temperature. The Gaussianity assumption is often violated when modeling this relationship due to asymmetry and then other regression models should be considered. The class of Birnbaum–Saunders models, especially their regression formulations, has received considerable attention in the statistical literature. These models have been applied successfully in different areas with an emphasis on engineering, environment, and medicine. A common simplification of these models is that statistical dependence is often not considered. In this paper, we propose and derive a time-dependent model based on a reparameterized Birnbaum–Saunders (RBS) asymmetric distribution that allows us to analyze data in terms of a time-varying conditional mean. In particular, it is a dynamic class of autoregressive moving average (ARMA) models with regressors and a conditional RBS distribution (RBSARMAX). By means of a Monte Carlo simulation study, the statistical performance of the new methodology is assessed, showing good results. The asymmetric RBSARMAX structure is applied to the modeling of mortality as a function of pollution and temperature over time with sensor-related data. This modeling provides strong evidence that the new ARMA formulation is a good alternative for dealing with temporal data, particularly related to mortality with regressors of environmental temperature and pollution.
Collapse
Affiliation(s)
- Helton Saulo
- Department of Statistics, Universidade de Brasília, Brasília 70910-90, Brazil; (H.S.); (R.S.); (R.V.)
| | - Rubens Souza
- Department of Statistics, Universidade de Brasília, Brasília 70910-90, Brazil; (H.S.); (R.S.); (R.V.)
| | - Roberto Vila
- Department of Statistics, Universidade de Brasília, Brasília 70910-90, Brazil; (H.S.); (R.S.); (R.V.)
| | - Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile
- Correspondence: or
| | - Robert G. Aykroyd
- Department of Statistics, University of Leeds, Leeds, West Yorkshire LS2 9JT, UK;
| |
Collapse
|
13
|
Mahdi E, Leiva V, Mara’Beh S, Martin-Barreiro C. A New Approach to Predicting Cryptocurrency Returns Based on the Gold Prices with Support Vector Machines during the COVID-19 Pandemic Using Sensor-Related Data. SENSORS (BASEL, SWITZERLAND) 2021; 21:6319. [PMID: 34577525 PMCID: PMC8469794 DOI: 10.3390/s21186319] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 09/16/2021] [Accepted: 09/17/2021] [Indexed: 02/07/2023]
Abstract
In a real-world situation produced under COVID-19 scenarios, predicting cryptocurrency returns accurately can be challenging. Such a prediction may be helpful to the daily economic and financial market. Unlike forecasting the cryptocurrency returns, we propose a new approach to predict whether the return classification would be in the first, second, third quartile, or any quantile of the gold price the next day. In this paper, we employ the support vector machine (SVM) algorithm for exploring the predictability of financial returns for the six major digital currencies selected from the list of top ten cryptocurrencies based on data collected through sensors. These currencies are Binance Coin, Bitcoin, Cardano, Dogecoin, Ethereum, and Ripple. Our study considers the pre-COVID-19 and ongoing COVID-19 periods. An algorithm that allows updated data analysis, based on the use of a sensor in the database, is also proposed. The results show strong evidence that the SVM is a robust technique for devising profitable trading strategies and can provide accurate results before and during the current pandemic. Our findings may be helpful for different stakeholders in understanding the cryptocurrency dynamics and in making better investment decisions, especially under adverse conditions and during times of uncertain environments such as in the COVID-19 pandemic.
Collapse
Affiliation(s)
- Esam Mahdi
- Department of Mathematics, Statistics and Physics, Qatar University, Doha 2713, Qatar;
| | - Víctor Leiva
- School of Industrial Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile;
| | - Saed Mara’Beh
- Department of Mathematics, Statistics and Physics, Qatar University, Doha 2713, Qatar;
| | - Carlos Martin-Barreiro
- Faculty of Natural Sciences and Mathematics, Universidad Politécnica ESPOL, Guayaquil 090902, Ecuador;
- Faculty of Engineering, Universidad Espíritu Santo, Samborondón 0901952, Ecuador
| |
Collapse
|
14
|
A New Algorithm for Computing Disjoint Orthogonal Components in the Parallel Factor Analysis Model with Simulations and Applications to Real-World Data. MATHEMATICS 2021. [DOI: 10.3390/math9172058] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
In this paper, we extend the use of disjoint orthogonal components to three-way table analysis with the parallel factor analysis model. Traditional methods, such as scaling, orthogonality constraints, non-negativity constraints, and sparse techniques, do not guarantee that interpretable loading matrices are obtained in this model. We propose a novel heuristic algorithm that allows simple structure loading matrices to be obtained by calculating disjoint orthogonal components. This algorithm is also an alternative approach for solving the well-known degeneracy problem. We carry out computational experiments by utilizing simulated and real-world data to illustrate the benefits of the proposed algorithm.
Collapse
|
15
|
A Two-Stage Location Problem with Order Solved Using a Lagrangian Algorithm and Stochastic Programming for a Potential Use in COVID-19 Vaccination Based on Sensor-Related Data. SENSORS 2021; 21:s21165352. [PMID: 34450794 PMCID: PMC8401739 DOI: 10.3390/s21165352] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 07/28/2021] [Accepted: 07/30/2021] [Indexed: 01/21/2023]
Abstract
Healthcare service centers must be sited in strategic locations that meet the immediate needs of patients. The current situation due to the COVID-19 pandemic makes this problem particularly relevant. Assume that each center corresponds to an assigned place for vaccination and that each center uses one or more vaccine brands/laboratories. Then, each patient could choose a center instead of another, because she/he may prefer the vaccine from a more reliable laboratory. This defines an order of preference that might depend on each patient who may not want to be vaccinated in a center where there are only her/his non-preferred vaccine brands. In countries where the vaccination process is considered successful, the order assigned by each patient to the vaccination centers is defined by incentives that local governments give to their population. These same incentives for foreign citizens are seen as a strategic decision to generate income from tourism. The simple plant/center location problem (SPLP) is a combinatorial approach that has been extensively studied. However, a less-known natural extension of it with order (SPLPO) has not been explored in the same depth. In this case, the size of the instances that can be solved is limited. The SPLPO considers an order of preference that patients have over a set of facilities to meet their demands. This order adds a new set of constraints in its formulation that increases the complexity of the problem to obtain an optimal solution. In this paper, we propose a new two-stage stochastic formulation for the SPLPO (2S-SPLPO) that mimics the mentioned pandemic situation, where the order of preference is treated as a random vector. We carry out computational experiments on simulated 2S-SPLPO instances to evaluate the performance of the new proposal. We apply an algorithm based on Lagrangian relaxation that has been shown to be efficient for large instances of the SPLPO. A potential application of this new algorithm to COVID-19 vaccination is discussed and explored based on sensor-related data. Two further algorithms are proposed to store the patient’s records in a data warehouse and generate 2S-SPLPO instances using sensors.
Collapse
|
16
|
Rojas F, Leiva V, Huerta M, Martin-Barreiro C. Lot-Size Models with Uncertain Demand Considering Its Skewness/Kurtosis and Stochastic Programming Applied to Hospital Pharmacy with Sensor-Related COVID-19 Data. SENSORS (BASEL, SWITZERLAND) 2021; 21:5198. [PMID: 34372434 PMCID: PMC8347410 DOI: 10.3390/s21155198] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 07/27/2021] [Accepted: 07/28/2021] [Indexed: 01/22/2023]
Abstract
Governments have been challenged to provide timely medical care to face the COVID-19 pandemic. Under this pandemic, the demand for pharmaceutical products has changed significantly. Some of these products are in high demand, while, for others, their demand falls sharply. These changes in the random demand patterns are connected with changes in the skewness (asymmetry) and kurtosis of their data distribution. Such changes are critical to determining optimal lots and inventory costs. The lot-size model helps to make decisions based on probabilistic demand when calculating the optimal costs of supply using two-stage stochastic programming. The objective of this study is to evaluate how the skewness and kurtosis of the distribution of demand data, collected through sensors, affect the modeling of inventories of hospital pharmacy products helpful to treat COVID-19. The use of stochastic programming allows us to obtain results under demand uncertainty that are closer to reality. We carry out a simulation study to evaluate the performance of our methodology under different demand scenarios with diverse degrees of skewness and kurtosis. A case study in the field of hospital pharmacy with sensor-related COVID-19 data is also provided. An algorithm that permits us to use sensors when submitting requests for supplying pharmaceutical products in the hospital treatment of COVID-19 is designed. We show that the coefficients of skewness and kurtosis impact the total costs of inventory that involve order, purchase, holding, and shortage. We conclude that the asymmetry and kurtosis of the demand statistical distribution do not seem to affect the first-stage lot-size decisions. However, demand patterns with high positive skewness are related to significant increases in expected inventories on hand and shortage, increasing the costs of second-stage decisions. Thus, demand distributions that are highly asymmetrical to the right and leptokurtic favor high total costs in probabilistic lot-size systems.
Collapse
Affiliation(s)
- Fernando Rojas
- School of Nutrition and Dietetics, Faculty of Pharmacy, Universidad de Valparaíso, Valparaíso 2360102, Chile;
- Center of Micro-Bioinnovation, Faculty of Pharmacy, Universidad de Valparaíso, Valparaíso 2360102, Chile
| | - Víctor Leiva
- School of Industrial Engineering, Faculty of Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile;
| | - Mauricio Huerta
- School of Industrial Engineering, Faculty of Engineering, Pontificia Universidad Católica de Valparaíso, Valparaíso 2362807, Chile;
| | - Carlos Martin-Barreiro
- Faculty of Natural Sciences and Mathematics, Universidad Politécnica ESPOL, Guayaquil 090902, Ecuador; or
- Facultad de Ingeniería, Universidad Espíritu Santo, Samborondón 0901952, Ecuador
| |
Collapse
|