Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Luo G, Stone BL, Johnson MD, Tarczy-Hornoch P, Wilcox AB, Mooney SD, Sheng X, Haug PJ, Nkoy FL. Automating Construction of Machine Learning Models With Clinical Big Data: Proposal Rationale and Methods. JMIR Res Protoc 2017;6:e175. [PMID: 28851678 PMCID: PMC5596298 DOI: 10.2196/resprot.7757] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2017] [Revised: 07/14/2017] [Accepted: 07/15/2017] [Indexed: 12/14/2022] Open

For:	Luo G, Stone BL, Johnson MD, Tarczy-Hornoch P, Wilcox AB, Mooney SD, Sheng X, Haug PJ, Nkoy FL. Automating Construction of Machine Learning Models With Clinical Big Data: Proposal Rationale and Methods. JMIR Res Protoc 2017;6:e175. [PMID: 28851678 PMCID: PMC5596298 DOI: 10.2196/resprot.7757] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2017] [Revised: 07/14/2017] [Accepted: 07/15/2017] [Indexed: 12/14/2022] Open

Number

Cited by Other Article(s)

Beam K, Wang C, Beam A, Clark R, Tolia V, Ahmad K. National Needs Assessment of Utilization of Common Newborn Clinical Decision Support Tools. Am J Perinatol 2024;41:e1982-e1988. [PMID: 37207674 DOI: 10.1055/a-2096-2168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]

Imrie F, Cebere B, McKinney EF, van der Schaar M. AutoPrognosis 2.0: Democratizing diagnostic and prognostic modeling in healthcare with automated machine learning. PLOS DIGITAL HEALTH 2023;2:e0000276. [PMID: 37347752 DOI: 10.1371/journal.pdig.0000276] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 05/17/2023] [Indexed: 06/24/2023]

A machine learning analysis of correlates of mortality among patients hospitalized with COVID-19. Sci Rep 2023;13:4080. [PMID: 36906638 PMCID: PMC10007654 DOI: 10.1038/s41598-023-31251-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 03/08/2023] [Indexed: 03/13/2023] Open

Tschoellitsch T, Krummenacker S, Dünser MW, Stöger R, Meier J. The Value of the First Clinical Impression as Assessed by 18 Observations in Patients Presenting to the Emergency Department. J Clin Med 2023;12:jcm12020724. [PMID: 36675651 PMCID: PMC9862625 DOI: 10.3390/jcm12020724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 01/11/2023] [Accepted: 01/13/2023] [Indexed: 01/18/2023] Open

Randall JR, DuPai CD, Cole TJ, Davidson G, Groover KE, Slater SL, Mavridou DA, Wilke CO, Davies BW. Designing and identifying β-hairpin peptide macrocycles with antibiotic potential. SCIENCE ADVANCES 2023;9:eade0008. [PMID: 36630516 PMCID: PMC9833666 DOI: 10.1126/sciadv.ade0008] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 12/09/2022] [Indexed: 06/17/2023]

Vagliano I, Schut MC, Abu-Hanna A, Dongelmans DA, de Lange DW, Gommers D, Cremer OL, Bosman RJ, Rigter S, Wils EJ, Frenzel T, de Jong R, Peters MAA, Kamps MJA, Ramnarain D, Nowitzky R, Nooteboom FGCA, de Ruijter W, Urlings-Strop LC, Smit EGM, Mehagnoul-Schipper DJ, Dormans T, de Jager CPC, Hendriks SHA, Achterberg S, Oostdijk E, Reidinga AC, Festen-Spanjer B, Brunnekreef GB, Cornet AD, van den Tempel W, Boelens AD, Koetsier P, Lens J, Faber HJ, Karakus A, Entjes R, de Jong P, Rettig TCD, Reuland MC, Arbous S, Fleuren LM, Dam TA, Thoral PJ, Lalisang RCA, Tonutti M, de Bruin DP, Elbers PWG, de Keizer NF. Assess and validate predictive performance of models for in-hospital mortality in COVID-19 patients: A retrospective cohort study in the Netherlands comparing the value of registry data with high-granular electronic health records. Int J Med Inform 2022;167:104863. [PMID: 36162166 PMCID: PMC9492397 DOI: 10.1016/j.ijmedinf.2022.104863] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 08/19/2022] [Accepted: 09/03/2022] [Indexed: 11/17/2022]

Abstract

PURPOSE

To assess, validate and compare the predictive performance of models for in-hospital mortality of COVID-19 patients admitted to the intensive care unit (ICU) over two different waves of infections. Our models were built with high-granular Electronic Health Records (EHR) data versus less-granular registry data.

METHODS

Observational study of all COVID-19 patients admitted to 19 Dutch ICUs participating in both the national quality registry National Intensive Care Evaluation (NICE) and the EHR-based Dutch Data Warehouse (hereafter EHR). Multiple models were developed on data from the first 24 h of ICU admissions from February to June 2020 (first COVID-19 wave) and validated on prospective patients admitted to the same ICUs between July and December 2020 (second COVID-19 wave). We assessed model discrimination, calibration, and the degree of relatedness between development and validation population. Coefficients were used to identify relevant risk factors.

RESULTS

A total of 1533 patients from the EHR and 1563 from the registry were included. With high granular EHR data, the average AUROC was 0.69 (standard deviation of 0.05) for the internal validation, and the AUROC was 0.75 for the temporal validation. The registry model achieved an average AUROC of 0.76 (standard deviation of 0.05) in the internal validation and 0.77 in the temporal validation. In the EHR data, age, and respiratory-system related variables were the most important risk factors identified. In the NICE registry data, age and chronic respiratory insufficiency were the most important risk factors.

CONCLUSION

In our study, prognostic models built on less-granular but readily-available registry data had similar performance to models built on high-granular EHR data and showed similar transportability to a prospective COVID-19 population. Future research is needed to verify whether this finding can be confirmed for upcoming waves.

Collapse

Affiliation(s)

Iacopo Vagliano Department of Medical Informatics, Amsterdam UMC, University of Amsterdam, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands.
Martijn C Schut Department of Medical Informatics, Amsterdam UMC, University of Amsterdam, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
Ameen Abu-Hanna Department of Medical Informatics, Amsterdam UMC, University of Amsterdam, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
Dave A Dongelmans National Intensive Care Evaluation (NICE) foundation, Amsterdam, The Netherlands; Department of Intensive Care Medicine, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
Dylan W de Lange National Intensive Care Evaluation (NICE) foundation, Amsterdam, The Netherlands; Department of Intensive Care Medicine, University Medical Center Utrecht, University Utrecht, Utrecht, The Netherlands
Diederik Gommers Department of Intensive Care, Erasmus Medical Center, Rotterdam, The Netherlands
Olaf L Cremer Intensive Care, UMC Utrecht, Utrecht, The Netherlands
Rob J Bosman ICU, OLVG, Amsterdam, The Netherlands
Sander Rigter Department of Anesthesiology and Intensive Care, St. Antonius Hospital, Nieuwegein, The Netherlands
Evert-Jan Wils Department of Intensive Care, Franciscus Gasthuis & Vlietland, Rotterdam, The Netherlands
Tim Frenzel Department of Intensive Care Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
Remko de Jong Intensive Care, Bovenij Ziekenhuis, Amsterdam, The Netherlands
Marco A A Peters Intensive Care, Canisius Wilhelmina Ziekenhuis, Nijmegen, The Netherlands
Marlijn J A Kamps Intensive Care, Catharina Ziekenhuis Eindhoven, Eindhoven, The Netherlands
Dharmanand Ramnarain Department of Intensive Care, ETZ Tilburg, Tilburg, The Netherlands
Ralph Nowitzky Intensive Care, Haga Ziekenhuis, Den Haag, The Netherlands
Fleur G C A Nooteboom Intensive Care, Laurentius Ziekenhuis, Roermond, The Netherlands
Wouter de Ruijter Department of Intensive Care Medicine, Northwest Clinics, Alkmaar, The Netherlands
Louise C Urlings-Strop Intensive Care, Reinier de Graaf Gasthuis, Delft, The Netherlands
Ellen G M Smit Intensive Care, Spaarne Gasthuis, Haarlem en Hoofddorp, The Netherlands
D Jannet Mehagnoul-Schipper Intensive Care, VieCuri Medisch Centrum, Venlo, The Netherlands
Tom Dormans Intensive care, Zuyderland MC, Heerlen, The Netherlands
Cornelis P C de Jager Department of Intensive Care, Jeroen Bosch Ziekenhuis, Den Bosch, The Netherlands
Stefaan H A Hendriks Intensive Care, Albert Schweitzerziekenhuis, Dordrecht, The Netherlands
Sefanja Achterberg ICU, Haaglanden Medisch Centrum, Den Haag, The Netherlands
Evelien Oostdijk ICU, Maasstad Ziekenhuis Rotterdam, Rotterdam, The Netherlands
Auke C Reidinga ICU, SEH, BWC, Martiniziekenhuis, Groningen, The Netherlands
Barbara Festen-Spanjer Intensive Care, Ziekenhuis Gelderse Vallei, Ede, The Netherlands
Gert B Brunnekreef Department of Intensive Care, Ziekenhuisgroep Twente, Almelo, The Netherlands
Alexander D Cornet Department of Intensive Care, Medisch Spectrum Twente, Enschede, The Netherlands
Walter van den Tempel Department of Intensive Care, Ikazia Ziekenhuis Rotterdam, Rotterdam, The Netherlands
Age D Boelens Anesthesiology, Antonius Ziekenhuis Sneek, Sneek, The Netherlands
Peter Koetsier Intensive Care, Medisch Centrum Leeuwarden, Leeuwarden, The Netherlands
Judith Lens ICU, IJsselland Ziekenhuis, Capelle aan den IJssel, The Netherlands
Harald J Faber ICU, WZA, Assen, The Netherlands
A Karakus Department of Intensive Care, Diakonessenhuis Hospital, Utrecht, The Netherlands
Robert Entjes Department of Intensive Care, Adrz, Goes, The Netherlands
Paul de Jong Department of Anesthesia and Intensive Care, Slingeland Ziekenhuis, Doetinchem, The Netherlands
Thijs C D Rettig Department of Anesthesiology, Intensive Care and Pain Medicine, Amphia Ziekenhuis, Breda, The Netherlands
M C Reuland Department of Medical Informatics, Amsterdam UMC, University of Amsterdam, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
Sesmu Arbous Intensivist, LUMC, Leiden, The Netherlands
Lucas M Fleuren Department of Intensive Care Medicine, Laboratory for Critical Care Computational Intelligence, Amsterdam Medical Data Science, Amsterdam UMC, Vrije Universiteit, Amsterdam, The Netherlands
Tariq A Dam Department of Intensive Care Medicine, Laboratory for Critical Care Computational Intelligence, Amsterdam Medical Data Science, Amsterdam UMC, Vrije Universiteit, Amsterdam, The Netherlands
Patrick J Thoral Department of Intensive Care Medicine, Laboratory for Critical Care Computational Intelligence, Amsterdam Medical Data Science, Amsterdam UMC, Vrije Universiteit, Amsterdam, The Netherlands
Robbert C A Lalisang Pacmed, Amsterdam, The Netherlands
Michele Tonutti Pacmed, Amsterdam, The Netherlands
Daan P de Bruin Pacmed, Amsterdam, The Netherlands
Paul W G Elbers Department of Intensive Care Medicine, Laboratory for Critical Care Computational Intelligence, Amsterdam Medical Data Science, Amsterdam UMC, Vrije Universiteit, Amsterdam, The Netherlands
Nicolette F de Keizer Department of Medical Informatics, Amsterdam UMC, University of Amsterdam, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands; National Intensive Care Evaluation (NICE) foundation, Amsterdam, The Netherlands

Collapse

A Romero RA, Y Deypalan MN, Mehrotra S, Jungao JT, Sheils NE, Manduchi E, Moore JH. Benchmarking AutoML frameworks for disease prediction using medical claims. BioData Min 2022;15:15. [PMID: 35883154 PMCID: PMC9327416 DOI: 10.1186/s13040-022-00300-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 06/27/2022] [Indexed: 11/10/2022] Open

Abstract

Objectives

Ascertain and compare the performances of Automated Machine Learning (AutoML) tools on large, highly imbalanced healthcare datasets.

Materials and Methods

We generated a large dataset using historical de-identified administrative claims including demographic information and flags for disease codes in four different time windows prior to 2019. We then trained three AutoML tools on this dataset to predict six different disease outcomes in 2019 and evaluated model performances on several metrics.

Results

The AutoML tools showed improvement from the baseline random forest model but did not differ significantly from each other. All models recorded low area under the precision-recall curve and failed to predict true positives while keeping the true negative rate high. Model performance was not directly related to prevalence. We provide a specific use-case to illustrate how to select a threshold that gives the best balance between true and false positive rates, as this is an important consideration in medical applications.

Discussion

Healthcare datasets present several challenges for AutoML tools, including large sample size, high imbalance, and limitations in the available features. Improvements in scalability, combinations of imbalance-learning resampling and ensemble approaches, and curated feature selection are possible next steps to achieve better performance.

Conclusion

Among the three explored, no AutoML tool consistently outperforms the rest in terms of predictive performance. The performances of the models in this study suggest that there may be room for improvement in handling medical claims data. Finally, selection of the optimal prediction threshold should be guided by the specific practical application.

Supplementary Information

The online version contains supplementary material available at (10.1186/s13040-022-00300-2).

Collapse

No-Code Platform-Based Deep-Learning Models for Prediction of Colorectal Polyp Histology from White-Light Endoscopy Images: Development and Performance Verification. J Pers Med 2022;12:jpm12060963. [PMID: 35743748 PMCID: PMC9225479 DOI: 10.3390/jpm12060963] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 05/27/2022] [Accepted: 06/10/2022] [Indexed: 12/17/2022] Open

Abstract Background: The authors previously developed deep-learning models for the prediction of colorectal polyp histology (advanced colorectal cancer, early cancer/high-grade dysplasia, tubular adenoma with or without low-grade dysplasia, or non-neoplasm) from endoscopic images. While the model achieved 67.3% internal-test accuracy and 79.2% external-test accuracy, model development was labour-intensive and required specialised programming expertise. Moreover, the 240-image external-test dataset included only three advanced and eight early cancers, so it was difficult to generalise model performance. These limitations may be mitigated by deep-learning models developed using no-code platforms. Objective: To establish no-code platform-based deep-learning models for the prediction of colorectal polyp histology from white-light endoscopy images and compare their diagnostic performance with traditional models. Methods: The same 3828 endoscopic images used to establish previous models were used to establish new models based on no-code platforms Neuro-T, VLAD, and Create ML-Image Classifier. A prospective multicentre validation study was then conducted using 3818 novel images. The primary outcome was the accuracy of four-category prediction. Results: The model established using Neuro-T achieved the highest internal-test accuracy (75.3%, 95% confidence interval: 71.0–79.6%) and external-test accuracy (80.2%, 76.9–83.5%) but required the longest training time. In contrast, the model established using Create ML-Image Classifier required only 3 min for training and still achieved 72.7% (70.8–74.6%) external-test accuracy. Attention map analysis revealed that the imaging features used by the no-code deep-learning models were similar to those used by endoscopists during visual inspection. Conclusion: No-code deep-learning tools allow for the rapid development of models with high accuracy for predicting colorectal polyp histology. Collapse

Dong Q, Zhang X, Luo G. Improving the Accuracy of Progress Indication for Constructing Deep Learning Models. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2022;10:63754-63781. [PMID: 35873900 PMCID: PMC9302923 DOI: 10.1109/access.2022.3181493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Khashu M, Dame C, Lavoie PM, De Plaen IG, Garg PM, Sampath V, Malhotra A, Caplan MD, Kumar P, Agrawal PB, Buonocore G, Christensen RD, Maheshwari A. Current Understanding of Transfusion-associated Necrotizing Enterocolitis: Review of Clinical and Experimental Studies and a Call for More Definitive Evidence. NEWBORN 2022;1:201-208. [PMID: 35746957 PMCID: PMC9217573 DOI: 10.5005/jp-journals-11002-0005] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Luo G, Stone BL, Sheng X, He S, Koebnick C, Nkoy FL. Using Computational Methods to Improve Integrated Disease Management for Asthma and Chronic Obstructive Pulmonary Disease: Protocol for a Secondary Analysis. JMIR Res Protoc 2021;10:e27065. [PMID: 34003134 PMCID: PMC8170556 DOI: 10.2196/27065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Revised: 04/12/2021] [Accepted: 04/19/2021] [Indexed: 12/05/2022] Open

Abstract

Background

Asthma and chronic obstructive pulmonary disease (COPD) impose a heavy burden on health care. Approximately one-fourth of patients with asthma and patients with COPD are prone to exacerbations, which can be greatly reduced by preventive care via integrated disease management that has a limited service capacity. To do this well, a predictive model for proneness to exacerbation is required, but no such model exists. It would be suboptimal to build such models using the current model building approach for asthma and COPD, which has 2 gaps due to rarely factoring in temporal features showing early health changes and general directions. First, existing models for other asthma and COPD outcomes rarely use more advanced temporal features, such as the slope of the number of days to albuterol refill, and are inaccurate. Second, existing models seldom show the reason a patient is deemed high risk and the potential interventions to reduce the risk, making already occupied clinicians expend more time on chart review and overlook suitable interventions. Regular automatic explanation methods cannot deal with temporal data and address this issue well.

Objective

To enable more patients with asthma and patients with COPD to obtain suitable and timely care to avoid exacerbations, we aim to implement comprehensible computational methods to accurately predict proneness to exacerbation and recommend customized interventions.

Methods

We will use temporal features to accurately predict proneness to exacerbation, automatically find modifiable temporal risk factors for every high-risk patient, and assess the impact of actionable warnings on clinicians’ decisions to use integrated disease management to prevent proneness to exacerbation.

Results

We have obtained most of the clinical and administrative data of patients with asthma from 3 prominent American health care systems. We are retrieving other clinical and administrative data, mostly of patients with COPD, needed for the study. We intend to complete the study in 6 years.

Conclusions

Our results will help make asthma and COPD care more proactive, effective, and efficient, improving outcomes and saving resources.

International Registered Report Identifier (IRRID)

PRR1-10.2196/27065

Collapse

Bang CS, Lim H, Jeong HM, Hwang SH. Use of Endoscopic Images in the Prediction of Submucosal Invasion of Gastric Neoplasms: Automated Deep Learning Model Development and Usability Study. J Med Internet Res 2021;23:e25167. [PMID: 33856356 PMCID: PMC8085753 DOI: 10.2196/25167] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 12/09/2020] [Accepted: 03/16/2021] [Indexed: 12/12/2022] Open

Abstract

BACKGROUND

In a previous study, we examined the use of deep learning models to classify the invasion depth (mucosa-confined versus submucosa-invaded) of gastric neoplasms using endoscopic images. The external test accuracy reached 77.3%. However, model establishment is labor intense, requiring high performance. Automated deep learning (AutoDL) models, which enable fast searching of optimal neural architectures and hyperparameters without complex coding, have been developed.

OBJECTIVE

The objective of this study was to establish AutoDL models to classify the invasion depth of gastric neoplasms. Additionally, endoscopist-artificial intelligence interactions were explored.

METHODS

The same 2899 endoscopic images that were employed to establish the previous model were used. A prospective multicenter validation using 206 and 1597 novel images was conducted. The primary outcome was external test accuracy. Neuro-T, Create ML Image Classifier, and AutoML Vision were used in establishing the models. Three doctors with different levels of endoscopy expertise were asked to classify the invasion depth of gastric neoplasms for each image without AutoDL support, with faulty AutoDL support, and with best performance AutoDL support in sequence.

RESULTS

The Neuro-T-based model reached 89.3% (95% CI 85.1%-93.5%) external test accuracy. For the model establishment time, Create ML Image Classifier showed the fastest time of 13 minutes while reaching 82.0% (95% CI 76.8%-87.2%) external test accuracy. While the expert endoscopist's decisions were not influenced by AutoDL, the faulty AutoDL misled the endoscopy trainee and the general physician. However, this was corrected by the support of the best performance AutoDL model. The trainee gained the most benefit from the AutoDL support.

CONCLUSIONS

AutoDL is deemed useful for the on-site establishment of customized deep learning models. An inexperienced endoscopist with at least a certain level of expertise can benefit from AutoDL support.

Collapse

Automated Machine Learning for Healthcare and Clinical Notes Analysis. COMPUTERS 2021. [DOI: 10.3390/computers10020024] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Abstract Machine learning (ML) has been slowly entering every aspect of our lives and its positive impact has been astonishing. To accelerate embedding ML in more applications and incorporating it in real-world scenarios, automated machine learning (AutoML) is emerging. The main purpose of AutoML is to provide seamless integration of ML in various industries, which will facilitate better outcomes in everyday tasks. In healthcare, AutoML has been already applied to easier settings with structured data such as tabular lab data. However, there is still a need for applying AutoML for interpreting medical text, which is being generated at a tremendous rate. For this to happen, a promising method is AutoML for clinical notes analysis, which is an unexplored research area representing a gap in ML research. The main objective of this paper is to fill this gap and provide a comprehensive survey and analytical study towards AutoML for clinical notes. To that end, we first introduce the AutoML technology and review its various tools and techniques. We then survey the literature of AutoML in the healthcare industry and discuss the developments specific to clinical settings, as well as those using general AutoML tools for healthcare applications. With this background, we then discuss challenges of working with clinical notes and highlight the benefits of developing AutoML for medical notes processing. Next, we survey relevant ML research for clinical notes and analyze the literature and the field of AutoML in the healthcare industry. Furthermore, we propose future research directions and shed light on the challenges and opportunities this emerging field holds. With this, we aim to assist the community with the implementation of an AutoML platform for medical notes, which if realized can revolutionize patient outcomes. Collapse

Yang F, Elmer J, Zadorozhny VI. SmartPrognosis: Automatic ensemble classification for quantitative EEG analysis in patients resuscitated from cardiac arrest. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2020.106579] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]

A Unified Framework for Automatic Detection of Wound Infection with Artificial Intelligence. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10155353] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Dong Q, Luo G. Progress Indication for Deep Learning Model Training: A Feasibility Demonstration. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020;8:79811-79843. [PMID: 32483518 PMCID: PMC7263346 DOI: 10.1109/access.2020.2989684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Nelson CR, Ekberg J, Fridell K. Prostate Cancer Detection in Screening Using Magnetic Resonance Imaging and Artificial Intelligence. ACTA ACUST UNITED AC 2020. [DOI: 10.2174/1874061802006010001] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Abstract Background: Prostate cancer is a leading cause of death among men who do not participate in a screening programme. MRI forms a possible alternative for prostate analysis of a higher level of sensitivity than the PSA test or biopsy. Magnetic resonance is a non-invasive method and magnetic resonance tomography produces a large amount of data. If a screening programme were implemented, a dramatic increase in radiologist workload and patient waiting time will follow. Computer Aided-Diagnose (CAD) could assist radiologists to decrease reading times and cost, and increase diagnostic effectiveness. CAD mimics radiologist and imaging guidelines to detect prostate cancer. Aim: The purpose of this study was to analyse and describe current research in MRI prostate examination with the aid of CAD. The aim was to determine if CAD systems form a reliable method for use in prostate screening. Methods: This study was conducted as a systematic literature review of current scientific articles. Selection of articles was carried out using the “Preferred Reporting Items for Systematic Reviews and for Meta-Analysis” (PRISMA). Summaries were created from reviewed articles and were then categorised into relevant data for results. Results: CAD has shown that its capability concerning sensitivity or specificity is higher than a radiologist. A CAD system can reach a peak sensitivity of 100% and two CAD systems showed a specificity of 100%. CAD systems are highly specialised and chiefly focus on the peripheral zone, which could mean missing cancer in the transition zone. CAD systems can segment the prostate with the same effectiveness as a radiologist. Conclusion: When CAD analysed clinically-significant tumours with a Gleason score greater than 6, CAD outperformed radiologists. However, their focus on the peripheral zone would require the use of more than one CAD system to analyse the entire prostate. Collapse

Wang HL, Hsu WY, Lee MH, Weng HH, Chang SW, Yang JT, Tsai YH. Automatic Machine-Learning-Based Outcome Prediction in Patients With Primary Intracerebral Hemorrhage. Front Neurol 2019;10:910. [PMID: 31496988 PMCID: PMC6713018 DOI: 10.3389/fneur.2019.00910] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2019] [Accepted: 08/06/2019] [Indexed: 12/27/2022] Open

Luo G, Stone BL, Koebnick C, He S, Au DH, Sheng X, Murtaugh MA, Sward KA, Schatz M, Zeiger RS, Davidson GH, Nkoy FL. Using Temporal Features to Provide Data-Driven Clinical Early Warnings for Chronic Obstructive Pulmonary Disease and Asthma Care Management: Protocol for a Secondary Analysis. JMIR Res Protoc 2019;8:e13783. [PMID: 31199308 PMCID: PMC6592592 DOI: 10.2196/13783] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2019] [Revised: 05/13/2019] [Accepted: 05/14/2019] [Indexed: 01/19/2023] Open

Abstract

Background

Both chronic obstructive pulmonary disease (COPD) and asthma incur heavy health care burdens. To support tailored preventive care for these 2 diseases, predictive modeling is widely used to give warnings and to identify patients for care management. However, 3 gaps exist in current modeling methods owing to rarely factoring in temporal aspects showing trends and early health change: (1) existing models seldom use temporal features and often give late warnings, making care reactive. A health risk is often found at a relatively late stage of declining health, when the risk of a poor outcome is high and resolving the issue is difficult and costly. A typical model predicts patient outcomes in the next 12 months. This often does not warn early enough. If a patient will actually be hospitalized for COPD next week, intervening now could be too late to avoid the hospitalization. If temporal features were used, this patient could potentially be identified a few weeks earlier to institute preventive therapy; (2) existing models often miss many temporal features with high predictive power and have low accuracy. This makes care management enroll many patients not needing it and overlook over half of the patients needing it the most; (3) existing models often give no information on why a patient is at high risk nor about possible interventions to mitigate risk, causing busy care managers to spend more time reviewing charts and to miss suited interventions. Typical automatic explanation methods cannot handle longitudinal attributes and fully address these issues.

Objective

To fill these gaps so that more COPD and asthma patients will receive more appropriate and timely care, we will develop comprehensible data-driven methods to provide accurate early warnings of poor outcomes and to suggest tailored interventions, making care more proactive, efficient, and effective.

Methods

By conducting a secondary data analysis and surveys, the study will: (1) use temporal features to provide accurate early warnings of poor outcomes and assess the potential impact on prediction accuracy, risk warning timeliness, and outcomes; (2) automatically identify actionable temporal risk factors for each patient at high risk for future hospital use and assess the impact on prediction accuracy and outcomes; and (3) assess the impact of actionable information on clinicians’ acceptance of early warnings and on perceived care plan quality.

Results

We are obtaining clinical and administrative datasets from 3 leading health care systems’ enterprise data warehouses. We plan to start data analysis in 2020 and finish our study in 2025.

Conclusions

Techniques to be developed in this study can boost risk warning timeliness, model accuracy, and generalizability; improve patient finding for preventive care; help form tailored care plans; advance machine learning for many clinical applications; and be generalized for many other chronic diseases.

International Registered Report Identifier (IRRID)

PRR1-10.2196/13783

Collapse

Luo G. A roadmap for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling. GLOBAL TRANSITIONS 2019;1:61-82. [PMID: 31032483 PMCID: PMC6482973 DOI: 10.1016/j.glt.2018.11.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Luo G. Progress Indication for Machine Learning Model Building: A Feasibility Demonstration. SIGKDD EXPLORATIONS : NEWSLETTER OF THE SPECIAL INTEREST GROUP (SIG) ON KNOWLEDGE DISCOVERY & DATA MINING 2018;20:1-12. [PMID: 30854154 DOI: 10.1145/3299986.3299988] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Luo G, Tarczy-Hornoch P, Wilcox AB, Lee ES. Identifying Patients Who Are Likely to Receive Most of Their Care From a Specific Health Care System: Demonstration via Secondary Analysis. JMIR Med Inform 2018;6:e12241. [PMID: 30401670 PMCID: PMC6246965 DOI: 10.2196/12241] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 10/13/2018] [Accepted: 10/16/2018] [Indexed: 01/22/2023] Open

Abstract

Background

In the United States, health care is fragmented in numerous distinct health care systems including private, public, and federal organizations like private physician groups and academic medical centers. Many patients have their complete medical data scattered across these several health care systems, with no particular system having complete data on any of them. Several major data analysis tasks such as predictive modeling using historical data are considered impractical on incomplete data.

Objective

Our objective was to find a way to enable these analysis tasks for a health care system with incomplete data on many of its patients.

Methods

This study presents, to the best of our knowledge, the first method to use a geographic constraint to identify a reasonably large subset of patients who tend to receive most of their care from a given health care system. A data analysis task needing relatively complete data can be conducted on this subset of patients. We demonstrated our method using data from the University of Washington Medicine (UWM) and PreManage data covering the use of all hospitals in Washington State. We compared 10 candidate constraints to optimize the solution.

Results

For UWM, the best constraint is that the patient has a UWM primary care physician and lives within 5 miles of at least one UWM hospital. About 16.01% (55,707/348,054) of UWM patients satisfied this constraint. Around 69.38% (10,501/15,135) of their inpatient stays and emergency department visits occurred within UWM in the following 6 months, more than double the corresponding percentage for all UWM patients.

Conclusions

Our method can identify a reasonably large subset of patients who tend to receive most of their care from UWM. This enables several major analysis tasks on incomplete medical data that were previously deemed infeasible.

Collapse

Alaa AM, van der Schaar M. Prognostication and Risk Factors for Cystic Fibrosis via Automated Machine Learning. Sci Rep 2018;8:11242. [PMID: 30050169 PMCID: PMC6062529 DOI: 10.1038/s41598-018-29523-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 07/03/2018] [Indexed: 01/14/2023] Open

D'Argenio V. The High-Throughput Analyses Era: Are We Ready for the Data Struggle? High Throughput 2018;7:E8. [PMID: 29498666 PMCID: PMC5876534 DOI: 10.3390/ht7010008] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2017] [Revised: 02/16/2018] [Accepted: 02/27/2018] [Indexed: 12/23/2022] Open

Zeng X, Luo G. Progressive sampling-based Bayesian optimization for efficient and automatic machine learning model selection. Health Inf Sci Syst 2017;5:2. [PMID: 29038732 PMCID: PMC5617811 DOI: 10.1007/s13755-017-0023-z] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2017] [Accepted: 09/20/2017] [Indexed: 12/11/2022] Open

Luo G. Toward a Progress Indicator for Machine Learning Model Building and Data Mining Algorithm Execution: A Position Paper. ACTA ACUST UNITED AC 2017;19:13-24. [PMID: 29177022 DOI: 10.1145/3166054.3166057] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]