Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

14
(from Reference Citation Analysis)

Article PDFs (3)

Cited by > 0 (7)

Searched Name

Concept drift

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Kagerbauer SM, Ulm B, Podtschaske AH, Andonov DI, Blobner M, Jungwirth B, Graessner M. Susceptibility of AutoML mortality prediction algorithms to model drift caused by the COVID pandemic. BMC Med Inform Decis Mak 2024;24:34. [PMID: 38308256 PMCID: PMC10837894 DOI: 10.1186/s12911-024-02428-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 01/16/2024] [Indexed: 02/04/2024] Open

Abstract

BACKGROUND

Concept drift and covariate shift lead to a degradation of machine learning (ML) models. The objective of our study was to characterize sudden data drift as caused by the COVID pandemic. Furthermore, we investigated the suitability of certain methods in model training to prevent model degradation caused by data drift.

METHODS

We trained different ML models with the H2O AutoML method on a dataset comprising 102,666 cases of surgical patients collected in the years 2014-2019 to predict postoperative mortality using preoperatively available data. Models applied were Generalized Linear Model with regularization, Default Random Forest, Gradient Boosting Machine, eXtreme Gradient Boosting, Deep Learning and Stacked Ensembles comprising all base models. Further, we modified the original models by applying three different methods when training on the original pre-pandemic dataset: (Rahmani K, et al, Int J Med Inform 173:104930, 2023) we weighted older data weaker, (Morger A, et al, Sci Rep 12:7244, 2022) used only the most recent data for model training and (Dilmegani C, 2023) performed a z-transformation of the numerical input parameters. Afterwards, we tested model performance on a pre-pandemic and an in-pandemic data set not used in the training process, and analysed common features.

RESULTS

The models produced showed excellent areas under receiver-operating characteristic and acceptable precision-recall curves when tested on a dataset from January-March 2020, but significant degradation when tested on a dataset collected in the first wave of the COVID pandemic from April-May 2020. When comparing the probability distributions of the input parameters, significant differences between pre-pandemic and in-pandemic data were found. The endpoint of our models, in-hospital mortality after surgery, did not differ significantly between pre- and in-pandemic data and was about 1% in each case. However, the models varied considerably in the composition of their input parameters. None of our applied modifications prevented a loss of performance, although very different models emerged from it, using a large variety of parameters.

CONCLUSIONS

Our results show that none of our tested easy-to-implement measures in model training can prevent deterioration in the case of sudden external events. Therefore, we conclude that, in the presence of concept drift and covariate shift, close monitoring and critical review of model predictions are necessary.

Collapse

Mehmood T, Latif S, Jamail NSM, Malik A, Latif R. LSTMDD: an optimized LSTM-based drift detector for concept drift in dynamic cloud computing. PeerJ Comput Sci 2024;10:e1827. [PMID: 38435622 PMCID: PMC10909158 DOI: 10.7717/peerj-cs.1827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 12/28/2023] [Indexed: 03/05/2024]

Zhou Q, Wang ZY, Huang L. ELM-KL-LSTM: a robust and general incremental learning method for efficient classification of time series data. PeerJ Comput Sci 2023;9:e1732. [PMID: 38192484 PMCID: PMC10773756 DOI: 10.7717/peerj-cs.1732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 11/10/2023] [Indexed: 01/10/2024]

Susnjak T, Maddigan P. Forecasting patient flows with pandemic induced concept drift using explainable machine learning. EPJ Data Sci 2023;12:11. [PMID: 37122585 PMCID: PMC10119825 DOI: 10.1140/epjds/s13688-023-00387-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Accepted: 04/06/2023] [Indexed: 05/03/2023]

Paldino GM, Lebichot B, Le Borgne YA, Siblini W, Oblé F, Boracchi G, Bontempi G. The role of diversity and ensemble learning in credit card fraud detection. ADV DATA ANAL CLASSI 2022:1-25. [PMID: 36188101 PMCID: PMC9516537 DOI: 10.1007/s11634-022-00515-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Revised: 07/18/2022] [Accepted: 08/08/2022] [Indexed: 10/24/2022]

Suryawanshi S, Goswami A, Patil P, Mishra V. Adaptive windowing based recurrent neural network for drift adaption in non-stationary environment. J Ambient Intell Humaniz Comput 2022;14:1-15. [PMID: 35789602 PMCID: PMC9243804 DOI: 10.1007/s12652-022-04116-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 06/06/2022] [Indexed: 06/15/2023]

Korycki Ł, Krawczyk B. Adversarial concept drift detection under poisoning attacks for robust data stream mining. Mach Learn 2022;112:1-36. [PMID: 35668720 PMCID: PMC9162121 DOI: 10.1007/s10994-022-06177-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2020] [Revised: 11/01/2021] [Accepted: 04/12/2022] [Indexed: 11/30/2022]

Wu Y, Di B, Luo Y, Grieneisen ML, Zeng W, Zhang S, Deng X, Tang Y, Shi G, Yang F, Zhan Y. A robust approach to deriving long-term daily surface NO₂ levels across China: Correction to substantial estimation bias in back-extrapolation. Environ Int 2021;154:106576. [PMID: 33901976 DOI: 10.1016/j.envint.2021.106576] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Revised: 04/09/2021] [Accepted: 04/09/2021] [Indexed: 06/12/2023]

Abstract

BACKGROUND

Long-term surface NO₂ data are essential for retrospective policy evaluation and chronic human exposure assessment. In the absence of NO₂ observations for Mainland China before 2013, training a model with 2013-2018 data to make predictions for 2005-2012 (back-extrapolation) could cause substantial estimation bias due to concept drift.

OBJECTIVE

This study aims to correct the estimation bias in order to reconstruct the spatiotemporal distribution of daily surface NO₂ levels across China during 2005-2018.

METHODS

On the basis of ground- and satellite-based data, we proposed the robust back-extrapolation with a random forest (RBE-RF) to simulate the surface NO₂ through intermediate modeling of the scaling factors. For comparison purposes, we also employed a random forest (Base-RF), as a representative of the commonly used approach, to directly model the surface NO₂ levels.

RESULTS

The validation against Taiwan's NO₂ observations during 2005-2012 showed that RBE-RF adequately corrected the substantial underestimation by Base-RF. The RMSE decreased from 10.1 to 8.2 µg/m³, 7.1 to 4.3 µg/m³, and 6.1 to 2.9 µg/m³ in predicting daily, monthly, and annual levels, respectively. For North China with the most severe pollution, the population-weighted NO₂ ([NO₂]_pw) during 2005-2012 was estimated as 40.2 and 50.9 µg/m³ by Base-RF and RBE-RF, respectively, i.e., 21.0% difference. While both models predicted that the national annual [NO₂]_pw increased during 2005-2011 and then decreased, the interannual trends were underestimated by >50.2% by Base-RF relative to RBE-RF. During 2005-2018, the nationwide population that lived in the areas with NO₂ > 40 µg/m³ were estimated as 259 and 460 million by Base-RF and RBE-RF, respectively.

CONCLUSION

With RBE-RF, we corrected the estimation bias in back-extrapolation and obtained a full-coverage dataset of daily surface NO₂ across China during 2005-2018, which is valuable for environmental management and epidemiological research.

Collapse

Affiliation(s)

Yangyang Wu Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China
Baofeng Di Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; Institute for Disaster Management and Reconstruction, Sichuan University, Chengdu, Sichuan 610200, China
Yuzhou Luo Department of Land, Air, and Water Resources, University of California, Davis, CA 95616, United States
Michael L Grieneisen Department of Land, Air, and Water Resources, University of California, Davis, CA 95616, United States
Wen Zeng Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China
Shifu Zhang Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China
Xunfei Deng Institute of Digital Agriculture, Zhejiang Academy of Agricultural Sciences, Hangzhou, Zhejiang 310021, China
Yulei Tang Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; Natural Resources Comprehensive Survey Command Center, China Geological Survey, Beijing 100055, China
Guangming Shi Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; National Engineering Research Center for Flue Gas Desulfurization, Chengdu, Sichuan 610065, China
Fumo Yang Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; National Engineering Research Center for Flue Gas Desulfurization, Chengdu, Sichuan 610065, China
Yu Zhan Department of Environmental Science and Engineering, Sichuan University, Chengdu, Sichuan 610065, China; National Engineering Research Center for Flue Gas Desulfurization, Chengdu, Sichuan 610065, China; Yibin Institute of Industrial Technology, Sichuan University Yibin Park, Yibin 644000, China.

Collapse

Kumar S, Singh R, Khan MZ, Noorwali A. Design of adaptive ensemble classifier for online sentiment analysis and opinion mining. PeerJ Comput Sci 2021;7:e660. [PMID: 34435102 PMCID: PMC8356659 DOI: 10.7717/peerj-cs.660] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 07/13/2021] [Indexed: 06/13/2023]

Jafarinejad F, Rahimi M, Mashayekhi H. Tracking and analysis of discourse dynamics and polarity during the early Corona pandemic in Iran. J Biomed Inform 2021;121:103862. [PMID: 34229062 PMCID: PMC9044732 DOI: 10.1016/j.jbi.2021.103862] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 05/24/2021] [Accepted: 06/30/2021] [Indexed: 11/26/2022]

Abstract

It has not been long since a new disease called COVID-19 has hit the international community. Unknown nature of the virus, evidence of its adaptability and survival in new conditions, its widespread prevalence and also lengthy recovery period, along with daily notifications of new infection and fatality statistics, have created a wave of fear and anxiety among the public community and authorities. These factors have led to extreme changes in the social discourse in a rather short period of time. The analysis of this discourse is important to reconcile the society and restore ordinary conditions of mental peace and health. Although much research has been done on the disease since its international pandemic, the sociological analysis of the recent public phenomenon, especially in developing countries, still needs attention. We propose a framework for analyzing social media data and news stories oriented around COVID-19 disease. Our research is based on an extensive Persian data set gathered from different social media networks and news agencies in the period of January 21-April 29, 2020. We use the Latent Dirichlet Allocation (LDA) model and dynamic topic modeling to understand and capture the change of discourse in terms of temporal subjects. We scrutinize the reasons of subject alternations by exploring the related events and adopted practices and policies. The social discourse can highly affect the community morale and polarization. Therefore, we further analyze the polarization in online social media posts, and detect points of concept drift in the stream. Based on the analyzed content, effective guidelines are extracted to shift polarization towards positive. The results show that the proposed framework is able to provide an effective practical approach for cause and effect analysis of the social discourse.

Collapse

Guo H, Zhang S, Wang W. Selective ensemble-based online adaptive deep neural networks for streaming data with concept drift. Neural Netw 2021;142:437-456. [PMID: 34273615 DOI: 10.1016/j.neunet.2021.06.027] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Revised: 05/27/2021] [Accepted: 06/24/2021] [Indexed: 11/16/2022]

Sarnovsky M, Kolarik M. Classification of the drifting data streams using heterogeneous diversified dynamic class-weighted ensemble. PeerJ Comput Sci 2021;7:e459. [PMID: 33834113 PMCID: PMC8022634 DOI: 10.7717/peerj-cs.459] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 03/05/2021] [Indexed: 06/12/2023]

Lobo JL, Del Ser J, Bifet A, Kasabov N. Spiking Neural Networks and online learning: An overview and perspectives. Neural Netw 2019;121:88-100. [PMID: 31536902 DOI: 10.1016/j.neunet.2019.09.004] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 07/18/2019] [Accepted: 09/02/2019] [Indexed: 11/29/2022]

Bian J, Abdelrahman S, Shi J, Del Fiol G. Automatic identification of recent high impact clinical articles in PubMed to support clinical decision making using time-agnostic features. J Biomed Inform 2019;89:1-10. [PMID: 30468912 PMCID: PMC6342626 DOI: 10.1016/j.jbi.2018.11.010] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 11/18/2018] [Accepted: 11/19/2018] [Indexed: 01/08/2023]