Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Maslove DM, Podchiyska T, Lowe HJ. Discretization of continuous features in clinical datasets. J Am Med Inform Assoc 2012;20:544-53. [PMID: 23059731 DOI: 10.1136/amiajnl-2012-000929] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open

For:	Maslove DM, Podchiyska T, Lowe HJ. Discretization of continuous features in clinical datasets. J Am Med Inform Assoc 2012;20:544-53. [PMID: 23059731 DOI: 10.1136/amiajnl-2012-000929] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open

Number

Cited by Other Article(s)

Zhao L, Vidwans A, Bearnot CJ, Rayner J, Lin T, Baird J, Suner S, Jay GD. Prediction of anemia in real-time using a smartphone camera processing conjunctival images. PLoS One 2024;19:e0302883. [PMID: 38739605 PMCID: PMC11090304 DOI: 10.1371/journal.pone.0302883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 04/15/2024] [Indexed: 05/16/2024] Open

Shen S, Yuan X, Wang J, Fan L, Zhao J, Tao J. Evaluation of a machine learning algorithms for predicting the dental age of adolescent based on different preprocessing methods. Front Public Health 2022;10:1068253. [PMID: 36530730 PMCID: PMC9751184 DOI: 10.3389/fpubh.2022.1068253] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 11/14/2022] [Indexed: 12/05/2022] Open

Affiliation(s)

Shihui Shen Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China,College of Stomatology, Shanghai Jiao Tong University, Shanghai, China,National Center for Stomatology, Shanghai, China,National Clinical Research Center for Oral Diseases, Shanghai, China,Shanghai Key Laboratory of Stomatology, Shanghai, China,Shanghai Research Institute of Stomatology, Shanghai, China
Xiaoyan Yuan Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China,College of Stomatology, Shanghai Jiao Tong University, Shanghai, China,National Center for Stomatology, Shanghai, China,National Clinical Research Center for Oral Diseases, Shanghai, China,Shanghai Key Laboratory of Stomatology, Shanghai, China,Shanghai Research Institute of Stomatology, Shanghai, China
Jian Wang Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China,College of Stomatology, Shanghai Jiao Tong University, Shanghai, China,National Center for Stomatology, Shanghai, China,National Clinical Research Center for Oral Diseases, Shanghai, China,Shanghai Key Laboratory of Stomatology, Shanghai, China,Shanghai Research Institute of Stomatology, Shanghai, China
Linfeng Fan College of Stomatology, Shanghai Jiao Tong University, Shanghai, China,National Center for Stomatology, Shanghai, China,National Clinical Research Center for Oral Diseases, Shanghai, China,Shanghai Key Laboratory of Stomatology, Shanghai, China,Shanghai Research Institute of Stomatology, Shanghai, China,Department of Radiology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Junjun Zhao Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China,College of Stomatology, Shanghai Jiao Tong University, Shanghai, China,National Center for Stomatology, Shanghai, China,National Clinical Research Center for Oral Diseases, Shanghai, China,Shanghai Key Laboratory of Stomatology, Shanghai, China,Shanghai Research Institute of Stomatology, Shanghai, China,Junjun Zhao
Jiang Tao Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China,College of Stomatology, Shanghai Jiao Tong University, Shanghai, China,National Center for Stomatology, Shanghai, China,National Clinical Research Center for Oral Diseases, Shanghai, China,Shanghai Key Laboratory of Stomatology, Shanghai, China,Shanghai Research Institute of Stomatology, Shanghai, China,*Correspondence: Jiang Tao

Collapse

Płuciennik A, Płaczek A, Wilk A, Student S, Oczko-Wojciechowska M, Fujarewicz K. Data Integration–Possibilities of Molecular and Clinical Data Fusion on the Example of Thyroid Cancer Diagnostics. Int J Mol Sci 2022;23:ijms231911880. [PMID: 36233181 PMCID: PMC9569592 DOI: 10.3390/ijms231911880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 09/24/2022] [Accepted: 09/28/2022] [Indexed: 11/23/2022] Open

Knio ZO, Morales FL, Shah KP, Ondigi OK, Selinski CE, Baldeo CM, Zhuo DX, Bilchick KC, Mehta NK, Kwon Y, Breathett K, Thiele RH, Hulse MC, Mazimba S. A systemic congestive index (systemic pulse pressure to central venous pressure ratio) predicts adverse outcomes in patients undergoing valvular heart surgery. J Card Surg 2022;37:3259-3266. [PMID: 35842813 PMCID: PMC9543661 DOI: 10.1111/jocs.16772] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 06/09/2022] [Accepted: 06/28/2022] [Indexed: 12/26/2022]

Affiliation(s)

Ziyad O Knio Department of Anesthesiology, University of Virginia Health System, Charlottesville, Virginia, USA
Frances L Morales University of Virginia School of Medicine, Charlottesville, Virginia, USA
Kajal P Shah Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA
Olivia K Ondigi Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA
Christian E Selinski Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA
Cherisse M Baldeo Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA
David X Zhuo Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA.,Division of Cardiology, Department of Medicine, University Hospitals Cleveland Medical Center, Case Western Reserve University, Cleveland, Ohio, USA
Kenneth C Bilchick Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA
Nishaki K Mehta Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA.,Division of Cardiovascular Medicine, Beaumont Hospital, Royal Oak, Michigan, USA
Younghoon Kwon Division of Cardiology, Department of Medicine, University of Washington, Seattle, Washington, USA
Khadijah Breathett Division of Cardiovascular Medicine, Indiana University, Indianapolis, Indiana, USA
Robert H Thiele Department of Anesthesiology, University of Virginia Health System, Charlottesville, Virginia, USA
Matthew C Hulse Department of Anesthesiology, University of Virginia Health System, Charlottesville, Virginia, USA
Sula Mazimba Division of Cardiovascular Medicine, Department of Medicine, University of Virginia Health System, Charlottesville, Virginia, USA

Collapse

Optimal Data Reduction of Training Data in Machine Learning-Based Modelling: A Multidimensional Bin Packing Approach. ENERGIES 2022. [DOI: 10.3390/en15093092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Abstract In these days, when complex, IT-controlled systems have found their way into many areas, models and the data on which they are based are playing an increasingly important role. Due to the constantly growing possibilities of collecting data through sensor technology, extensive data sets are created that need to be mastered. In concrete terms, this means extracting the information required for a specific problem from the data in a high quality. For example, in the field of condition monitoring, this includes relevant system states. Especially in the application field of machine learning, the quality of the data is of significant importance. Here, different methods already exist to reduce the size of data sets without reducing the information value. In this paper, the multidimensional binned reduction (MdBR) method is presented as an approach that has a much lower complexity in comparison on the one hand and deals with regression, instead of classification as most other approaches do, on the other. The approach merges discretization approaches with non-parametric numerosity reduction via histograms. MdBR has linear complexity and can be facilitated to reduce large multivariate data sets to smaller subsets, which could be used for model training. The evaluation, based on a dataset from the photovoltaic sector with approximately 92 million samples, aims to train a multilayer perceptron (MLP) model to estimate the output power of the system. The results show that using the approach, the number of samples for training could be reduced by more than 99%, while also increasing the model’s performance. It works best with large data sets of low-dimensional data. Although periodic data often include the most redundant samples and thus provide the best reduction capabilities, the presented approach can only handle time-invariant data and not sequences of samples, as often done in time series. Collapse

Knio ZO, Thiele RH, Wright WZ, Mazimba S, Naik BI, Hulse MC. A Novel Hemodynamic Index of Post-operative Right Heart Dysfunction Predicts Mortality in Cardiac Surgical Patients. Semin Cardiothorac Vasc Anesth 2022;26:200-208. [PMID: 35332827 DOI: 10.1177/10892532221080382] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract

INTRODUCTION

This study aimed to investigate whether mortality following cardiac surgery was associated with the pulmonary artery pulsatility index (PAPi): pulmonary artery pulse pressure divided by central venous pressure (CVP), and a novel index: mean pulmonary artery pressure (mPAP) minus CVP.

METHODS

This retrospective analysis investigated all cardiac surgery patients in the Society of Thoracic Surgeons registry at a single academic medical center from January 2017 through March 2020 (n = 1510). The primary and secondary outcomes were mortality at 1 year and serum creatinine increase during index surgical admission, respectively. CVP, mPAP, PAPi, mPAP-CVP gradient, mean arterial pressure (MAP), and cardiac index (CI) were sampled continually from invasive hemodynamic monitors post-operatively. Associations with mortality were tested with univariate and multivariate analyses. The relationship with serum creatinine was investigated with Pearson's correlation at alpha = .05.

RESULTS

One-year mortality was observed in 44/1200 patients (3.7%). On univariate analysis, mortality was associated with minimums for mPAP, MAP, and CI and maximums for CVP, mPAP, PAPi, mPAP-CVP gradient, and CI (all P < .10). Model selection revealed that the only independently predictive parameters were minimum MAP (AOR = .880 [.819-.944]), maximum mPAP-CVP gradient (AOR = 1.082 [1.031-1.133]), and maximum CI (AOR = 1.421 [.928-2.068]), with model c-statistic = .770. A maximum mPAP-CVP gradient >20.5 predicted mortality with 54.5% sensitivity and 79.30% specificity, maintaining significance on survival analysis (P < .001). Peak increase in serum creatinine from baseline demonstrated a weak association with all parameters (max |r| = .33).

CONCLUSIONS

Mortality was not predicted by the post-operative PAPi; rather, it was independently predicted by the mPAP-CVP gradient, MAP, and CI.

Collapse

Choi Y, Park JH, Hong KJ, Ro YS, Song KJ, Shin SD. Development and validation of a prehospital-stage prediction tool for traumatic brain injury: a multicentre retrospective cohort study in Korea. BMJ Open 2022;12:e055918. [PMID: 35022177 PMCID: PMC8756263 DOI: 10.1136/bmjopen-2021-055918] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open

Abstract

OBJECTIVES

Predicting diagnosis and prognosis of traumatic brain injury (TBI) at the prehospital stage is challenging; however, using comprehensive prehospital information and machine learning may improve the performance of the predictive model. We developed and tested predictive models for TBI that use machine learning algorithms using information that can be obtained in the prehospital stage.

DESIGN

This was a multicentre retrospective study.

SETTING AND PARTICIPANTS

This study was conducted at three tertiary academic emergency departments (EDs) located in an urban area of South Korea. The data from adult patients with severe trauma who were assessed by emergency medical service providers and transported to three participating hospitals between 2014 to 2018 were analysed.

RESULTS

We developed and tested five machine learning algorithms-logistic regression analyses, extreme gradient boosting, support vector machine, random forest and elastic net (EN)-to predict TBI, TBI with intracranial haemorrhage or injury (TBI-I), TBI with ED or admission result of admission or transferred (TBI with non-discharge (TBI-ND)) and TBI with ED or admission result of death (TBI-D). A total of 1169 patients were included in the final analysis, and the proportions of TBI, TBI-I, TBI-ND and TBI-D were 24.0%, 21.5%, 21.3% and 3.7%, respectively. The EN model yielded an area under receiver-operator curve of 0.799 for TBI, 0.844 for TBI-I, 0.811 for TBI-ND and 0.871 for TBI-D. The EN model also yielded the highest specificity and significant reclassification improvement. Variables related to loss of consciousness, Glasgow Coma Scale and light reflex were the three most important variables to predict all outcomes.

CONCLUSION

Our results inform the diagnosis and prognosis of TBI. Machine learning models resulted in significant performance improvement over that with logistic regression analyses, and the best performing model was EN.

Collapse

Fu K, Li Y, Lv H, Wu W, Song J, Xu J. Development of a Model Predicting the Outcome of In Vitro Fertilization Cycles by a Robust Decision Tree Method. Front Endocrinol (Lausanne) 2022;13:877518. [PMID: 36093079 PMCID: PMC9449728 DOI: 10.3389/fendo.2022.877518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 06/21/2022] [Indexed: 11/20/2022] Open

Different Data Mining Approaches Based Medical Text Data. JOURNAL OF HEALTHCARE ENGINEERING 2021;2021:1285167. [PMID: 34912530 PMCID: PMC8668297 DOI: 10.1155/2021/1285167] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Accepted: 11/18/2021] [Indexed: 12/15/2022]

Shen S, Liu Z, Wang J, Fan L, Ji F, Tao J. Machine learning assisted Cameriere method for dental age estimation. BMC Oral Health 2021;21:641. [PMID: 34911516 PMCID: PMC8672533 DOI: 10.1186/s12903-021-01996-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 11/24/2021] [Indexed: 11/23/2022] Open

Abstract

Background

Recently, the dental age estimation method developed by Cameriere has been widely recognized and accepted. Although machine learning (ML) methods can improve the accuracy of dental age estimation, no machine learning research exists on the use of the Cameriere dental age estimation method, making this research innovative and meaningful.

Aim

The purpose of this research is to use 7 lower left permanent teeth and three models [random forest (RF), support vector machine (SVM), and linear regression (LR)] based on the Cameriere method to predict children's dental age, and compare with the Cameriere age estimation.

Subjects and methods

This was a retrospective study that collected and analyzed orthopantomograms of 748 children (356 females and 392 males) aged 5–13 years. Data were randomly divided into training and test datasets in an 80–20% proportion for the ML algorithms. The procedure, starting with randomly creating new training and test datasets, was repeated 20 times. 7 permanent developing teeth on the left mandible (except wisdom teeth) were recorded using the Cameriere method. Then, the traditional Cameriere formula and three models (RF, SVM, and LR) were used to estimate the dental age. The age prediction accuracy was measured by five indicators: the coefficient of determination (R²), mean error (ME), root mean square error (RMSE), mean square error (MSE), and mean absolute error (MAE).

Results

The research showed that the ML models have better accuracy than the traditional Cameriere formula. The ME, MAE, MSE, and RMSE values of the SVM model (0.004, 0.489, 0.392, and 0.625, respectively) and the RF model (− 0.004, 0.495, 0.389, and 0.623, respectively) were lower with the highest accuracy. In contrast, the ME, MAE, MSE and RMSE of the European Cameriere formula were 0.592, 0.846, 0.755, and 0.869, respectively, and those of the Chinese Cameriere formula were 0.748, 0.812, 0.890 and 0.943, respectively.

Conclusions

Compared to the Cameriere formula, ML methods based on the Cameriere’s maturation stages were more accurate in estimating dental age. These results support the use of ML algorithms instead of the traditional Cameriere formula.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12903-021-01996-0.

Collapse

Affiliation(s)

Shihui Shen Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine; College of Stomatology, Shanghai Jiao Tong University; National Center for Stomatology; National Clinical Research Center for Oral Diseases, Shanghai Key Laboratory of Stomatology, Shanghai, People's Republic of China
Zihao Liu Department of Nuclear Medicine, Xin Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, People's Republic of China
Jian Wang Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine; College of Stomatology, Shanghai Jiao Tong University; National Center for Stomatology; National Clinical Research Center for Oral Diseases, Shanghai Key Laboratory of Stomatology, Shanghai, People's Republic of China
Linfeng Fan Department of Radiology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine; College of Stomatology, Shanghai Jiao Tong University; National Center for Stomatology; National Clinical Research Center for Oral Diseases, Shanghai Key Laboratory of Stomatology, Shanghai, People's Republic of China
Fang Ji Department of Orthodontics, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine; College of Stomatology, Shanghai Jiao Tong University; National Center for Stomatology; National Clinical Research Center for Oral Diseases, Shanghai Key Laboratory of Stomatology, Shanghai, People's Republic of China.
Jiang Tao Department of General Dentistry, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine; College of Stomatology, Shanghai Jiao Tong University; National Center for Stomatology; National Clinical Research Center for Oral Diseases, Shanghai Key Laboratory of Stomatology, Shanghai, People's Republic of China.

Collapse

Zamanzadeh DJ, Petousis P, Davis TA, Nicholas SB, Norris KC, Tuttle KR, Bui AAT, Sarrafzadeh M. Autopopulus: A Novel Framework for Autoencoder Imputation on Large Clinical Datasets. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021;2021:2303-2309. [PMID: 34891747 PMCID: PMC8862635 DOI: 10.1109/embc46164.2021.9630135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Alexandre L, Costa RS, Henriques R. DI2: prior-free and multi-item discretization of biological data and its applications. BMC Bioinformatics 2021;22:426. [PMID: 34496758 PMCID: PMC8425008 DOI: 10.1186/s12859-021-04329-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Accepted: 08/23/2021] [Indexed: 11/24/2022] Open

Bbosa FF, Nabukenya J, Nabende P, Wesonga R. On the goodness of fit of parametric and non-parametric data mining techniques: the case of malaria incidence thresholds in Uganda. HEALTH AND TECHNOLOGY 2021. [DOI: 10.1007/s12553-021-00551-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Evolutionary Algorithm for Improving Decision Tree with Global Discretization in Manufacturing. SENSORS 2021;21:s21082849. [PMID: 33919558 PMCID: PMC8074051 DOI: 10.3390/s21082849] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 04/13/2021] [Accepted: 04/15/2021] [Indexed: 11/17/2022]

Cost and Complications of Single-Level Lumbar Decompression in Those Over and Under 75: A Matched Comparison. Spine (Phila Pa 1976) 2021;46:29-34. [PMID: 32925688 DOI: 10.1097/brs.0000000000003686] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Abstract

STUDY DESIGN

Retrospective database analysis.

OBJECTIVE

This study aimed to compare costs and complication rates following single-level lumbar decompression in patients under age 75 versus patients aged 75 and older.

SUMMARY OF BACKGROUND DATA

Lumbar decompression is a common surgical treatment for lumbar pathology; however, its effectiveness can be debated in elderly patients because complication rates and costs by age group are not well-defined.

METHODS

The Medicare database was queried through the PearlDiver server for patients who underwent single-level lumbar decompression without fusion as an index procedure. The 90-day complication and reoperation rates were compared between age groups after matching for sex and comorbidity burden. Same day and 90-day costs are compared.

RESULTS

The matched cohort included 89,388 total patients (n = 44,694 for each study arm). Compared to the under 75 age group, the 75 and older age group had greater rates of deep venous thrombosis (odds ratio [OR] 1.443, P = 0.042) and dural tear (OR 1.560, P = 0.043), and a lower rate of seroma complicating the procedure (OR 0.419, P = 0.009). There was no difference in overall 90-day reoperation rate in patients under age 75 versus patients aged 75 and older (9.66% vs. 9.28%, P = 0.051), although the 75 and older age group had a greater rate of laminectomy without discectomy (CPT-63047; OR 1.175, P < 0.001), while having a lower rate of laminotomy with discectomy (CPT-63042 and CPT-63030; OR 0.727 and 0.867, respectively, P = 0.013 and <0.001, respectively). The 75 and older age group had greater same day ($3329.24 vs. $3138.05, P < 0.001) and 90-day ($5014.82 vs. $4749.44, P < 0.001) mean reimbursement.

CONCLUSION

Elderly patients experience greater rates of select perioperative complications, with mildly increased costs. There is no significant difference in overall 90-day reoperation rates.

LEVEL OF EVIDENCE

Collapse

Wang L, Tong L, Davis D, Arnold T, Esposito T. The application of unsupervised deep learning in predictive models using electronic health records. BMC Med Res Methodol 2020;20:37. [PMID: 32101147 PMCID: PMC7043035 DOI: 10.1186/s12874-020-00923-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2018] [Accepted: 02/12/2020] [Indexed: 11/18/2022] Open

Abstract

Background

The main goal of this study is to explore the use of features representing patient-level electronic health record (EHR) data, generated by the unsupervised deep learning algorithm autoencoder, in predictive modeling. Since autoencoder features are unsupervised, this paper focuses on their general lower-dimensional representation of EHR information in a wide variety of predictive tasks.

Methods

We compare the model with autoencoder features to traditional models: logistic model with least absolute shrinkage and selection operator (LASSO) and Random Forest algorithm. In addition, we include a predictive model using a small subset of response-specific variables (Simple Reg) and a model combining these variables with features from autoencoder (Enhanced Reg). We performed the study first on simulated data that mimics real world EHR data and then on actual EHR data from eight Advocate hospitals.

Results

On simulated data with incorrect categories and missing data, the precision for autoencoder is 24.16% when fixing recall at 0.7, which is higher than Random Forest (23.61%) and lower than LASSO (25.32%). The precision is 20.92% in Simple Reg and improves to 24.89% in Enhanced Reg. When using real EHR data to predict the 30-day readmission rate, the precision of autoencoder is 19.04%, which again is higher than Random Forest (18.48%) and lower than LASSO (19.70%). The precisions for Simple Reg and Enhanced Reg are 18.70 and 19.69% respectively. That is, Enhanced Reg can have competitive prediction performance compared to LASSO. In addition, results show that Enhanced Reg usually relies on fewer features under the setting of simulations of this paper.

Conclusions

We conclude that autoencoder can create useful features representing the entire space of EHR data and which are applicable to a wide array of predictive tasks. Together with important response-specific predictors, we can derive efficient and robust predictive models with less labor in data extraction and model training.

Collapse

Rodriguez-Morilla B, Estivill E, Estivill-Domènech C, Albares J, Segarra F, Correa A, Campos M, Rol MA, Madrid JA. Application of Machine Learning Methods to Ambulatory Circadian Monitoring (ACM) for Discriminating Sleep and Circadian Disorders. Front Neurosci 2019;13:1318. [PMID: 31920488 PMCID: PMC6916421 DOI: 10.3389/fnins.2019.01318] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 11/25/2019] [Indexed: 12/20/2022] Open

Prediction of good neurological recovery after out-of-hospital cardiac arrest: A machine learning analysis. Resuscitation 2019;142:127-135. [PMID: 31362082 DOI: 10.1016/j.resuscitation.2019.07.020] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 06/28/2019] [Accepted: 07/16/2019] [Indexed: 01/28/2023]

Unobtrusive Mattress-Based Identification of Hypertension by Integrating Classification and Association Rule Mining. SENSORS 2019;19:s19071489. [PMID: 30934719 PMCID: PMC6480150 DOI: 10.3390/s19071489] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 03/13/2019] [Accepted: 03/22/2019] [Indexed: 11/25/2022]

Nagasato D, Tabuchi H, Ohsugi H, Masumoto H, Enno H, Ishitobi N, Sonobe T, Kameoka M, Niki M, Mitamura Y. Deep-learning classifier with ultrawide-field fundus ophthalmoscopy for detecting branch retinal vein occlusion. Int J Ophthalmol 2019;12:94-99. [PMID: 30662847 DOI: 10.18240/ijo.2019.01.15] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2018] [Accepted: 12/06/2018] [Indexed: 12/27/2022] Open

Deep Neural Network-Based Method for Detecting Central Retinal Vein Occlusion Using Ultrawide-Field Fundus Ophthalmoscopy. J Ophthalmol 2018;2018:1875431. [PMID: 30515316 PMCID: PMC6236766 DOI: 10.1155/2018/1875431] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 10/17/2018] [Indexed: 11/17/2022] Open

The Impact of Risk Standardization on Variation in CT Use and Emergency Physician Profiling. AJR Am J Roentgenol 2018;211:392-399. [PMID: 29975119 DOI: 10.2214/ajr.17.19188] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

OBJECTIVE

The purpose of this study is to use detailed electronic health record data to profile the use of condition-specific, risk-standardized imaging by emergency physicians.

MATERIALS AND METHODS

CT utilization in four emergency departments in a single health care system was retrospectively analyzed. The primary outcome for analysis was indication-specific, risk-standardized CT utilization. We constructed seven clinical cohorts on the basis of the presence or absence of a traumatic indication for the most frequently performed CT studies. Risk standardization was performed using machine learning algorithms and hierarchic logistic regression models. Variation in CT utilization for each cohort was analyzed using coefficients of variation and box plots, the effect of risk standardization on physician profiling was determined using slope diagrams and kappa values, and within-physician correlation was assessed using correlation coefficients and matrices.

RESULTS

For the seven cohorts, the number of physicians ordering more than 25 CT studies for a particular indication ranged from 70 to 88, and the number of ED visits ranged from 17,458 to 117,489. The unadjusted variation was large for each indication (coefficient of variation, 30.2-57.9). Risk standardization resulted in reduced but persistent variation for all indications (coefficient of variation, 12.3-22.3). Among indication-specific models, risk standardization resulted in reclassification by two or more deciles for 14.0-39.1% of physicians. The R value for within-physician correlation varied from 0.02 to 0.80 and was highest between chest and abdominal imaging for trauma.

CONCLUSION

In this multisite study of CT utilization, risk standardization had a substantial impact on variation in CT utilization and emergency physician profiling. Administrators and payers should include risk standardization in future measures of physician imaging to ensure valid assessment of performance and achieve improvements in emergency care value.

Collapse

Taylor RA, Moore CL, Cheung KH, Brandt C. Predicting urinary tract infections in the emergency department with machine learning. PLoS One 2018. [PMID: 29513742 PMCID: PMC5841824 DOI: 10.1371/journal.pone.0194085] [Citation(s) in RCA: 100] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Abstract

BACKGROUND

Urinary tract infection (UTI) is a common emergency department (ED) diagnosis with reported high diagnostic error rates. Because a urine culture, part of the gold standard for diagnosis of UTI, is usually not available for 24-48 hours after an ED visit, diagnosis and treatment decisions are based on symptoms, physical findings, and other laboratory results, potentially leading to overutilization, antibiotic resistance, and delayed treatment. Previous research has demonstrated inadequate diagnostic performance for both individual laboratory tests and prediction tools.

OBJECTIVE

Our aim, was to train, validate, and compare machine-learning based predictive models for UTI in a large diverse set of ED patients.

METHODS

Single-center, multi-site, retrospective cohort analysis of 80,387 adult ED visits with urine culture results and UTI symptoms. We developed models for UTI prediction with six machine learning algorithms using demographic information, vitals, laboratory results, medications, past medical history, chief complaint, and structured historical and physical exam findings. Models were developed with both the full set of 211 variables and a reduced set of 10 variables. UTI predictions were compared between models and to proxies of provider judgment (documentation of UTI diagnosis and antibiotic administration).

RESULTS

The machine learning models had an area under the curve ranging from 0.826-0.904, with extreme gradient boosting (XGBoost) the top performing algorithm for both full and reduced models. The XGBoost full and reduced models demonstrated greatly improved specificity when compared to the provider judgment proxy of UTI diagnosis OR antibiotic administration with specificity differences of 33.3 (31.3-34.3) and 29.6 (28.5-30.6), while also demonstrating superior sensitivity when compared to documentation of UTI diagnosis with sensitivity differences of 38.7 (38.1-39.4) and 33.2 (32.5-33.9). In the admission and discharge cohorts using the full XGboost model, approximately 1 in 4 patients (4109/15855) would be re-categorized from a false positive to a true negative and approximately 1 in 11 patients (1372/15855) would be re-categorized from a false negative to a true positive.

CONCLUSION

The best performing machine learning algorithm, XGBoost, accurately diagnosed positive urine culture results, and outperformed previously developed models in the literature and several proxies for provider judgment. Future prospective validation is warranted.

Collapse

Edmunds K, Gíslason M, Sigurðsson S, Guðnason V, Harris T, Carraro U, Gargiulo P. Advanced quantitative methods in correlating sarcopenic muscle degeneration with lower extremity function biometrics and comorbidities. PLoS One 2018. [PMID: 29513690 PMCID: PMC5841751 DOI: 10.1371/journal.pone.0193241] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Abstract

Sarcopenic muscular degeneration has been consistently identified as an independent risk factor for mortality in aging populations. Recent investigations have realized the quantitative potential of computed tomography (CT) image analysis to describe skeletal muscle volume and composition; however, the optimum approach to assessing these data remains debated. Current literature reports average Hounsfield unit (HU) values and/or segmented soft tissue cross-sectional areas to investigate muscle quality. However, standardized methods for CT analyses and their utility as a comorbidity index remain undefined, and no existing studies compare these methods to the assessment of entire radiodensitometric distributions. The primary aim of this study was to present a comparison of nonlinear trimodal regression analysis (NTRA) parameters of entire radiodensitometric muscle distributions against extant CT metrics and their correlation with lower extremity function (LEF) biometrics (normal/fast gait speed, timed up-and-go, and isometric leg strength) and biochemical and nutritional parameters, such as total solubilized cholesterol (SCHOL) and body mass index (BMI). Data were obtained from 3,162 subjects, aged 66–96 years, from the population-based AGES-Reykjavik Study. 1-D k-means clustering was employed to discretize each biometric and comorbidity dataset into twelve subpopulations, in accordance with Sturges’ Formula for Class Selection. Dataset linear regressions were performed against eleven NTRA distribution parameters and standard CT analyses (fat/muscle cross-sectional area and average HU value). Parameters from NTRA and CT standards were analogously assembled by age and sex. Analysis of specific NTRA parameters with standard CT results showed linear correlation coefficients greater than 0.85, but multiple regression analysis of correlative NTRA parameters yielded a correlation coefficient of 0.99 (P<0.005). These results highlight the specificities of each muscle quality metric to LEF biometrics, SCHOL, and BMI, and particularly highlight the value of the connective tissue regime in this regard.

Collapse

Rajappan S, Rangasamy D. Estimation of incomplete values in heterogeneous attribute large datasets using discretized Bayesian max–min ant colony optimization. Knowl Inf Syst 2017. [DOI: 10.1007/s10115-017-1123-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Towards a Predictive Analytics-Based Intelligent Malaria Outbreak Warning System. APPLIED SCIENCES-BASEL 2017. [DOI: 10.3390/app7080836] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Casanova IJ, Campos M, Juarez JM, Fernandez-Fernandez-Arroyo A, Lorente JA. Impact of time series discretization on intensive care burn unit survival classification. PROGRESS IN ARTIFICIAL INTELLIGENCE 2017. [DOI: 10.1007/s13748-017-0130-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

Gómez I, Ribelles N, Franco L, Alba E, Jerez JM. Supervised discretization can discover risk groups in cancer survival analysis. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2016;136:11-19. [PMID: 27686699 DOI: 10.1016/j.cmpb.2016.08.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Revised: 07/07/2016] [Accepted: 08/12/2016] [Indexed: 06/06/2023]

Ni Y, Beck AF, Taylor R, Dyas J, Solti I, Grupp-Phelan J, Dexheimer JW. Will they participate? Predicting patients' response to clinical trial invitations in a pediatric emergency department. J Am Med Inform Assoc 2016;23:671-80. [PMID: 27121609 PMCID: PMC4926740 DOI: 10.1093/jamia/ocv216] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Accepted: 12/30/2015] [Indexed: 12/27/2022] Open

Abstract

Objective (1) To develop an automated algorithm to predict a patient’s response (ie, if the patient agrees or declines) before he/she is approached for a clinical trial invitation; (2) to assess the algorithm performance and the predictors on real-world patient recruitment data for a diverse set of clinical trials in a pediatric emergency department; and (3) to identify directions for future studies in predicting patients’ participation response.

Materials and Methods We collected 3345 patients’ response to trial invitations on 18 clinical trials at one center that were actively enrolling patients between January 1, 2010 and December 31, 2012. In parallel, we retrospectively extracted demographic, socioeconomic, and clinical predictors from multiple sources to represent the patients’ profiles. Leveraging machine learning methodology, the automated algorithms predicted participation response for individual patients and identified influential features associated with their decision-making. The performance was validated on the collection of actual patient response, where precision, recall, F-measure, and area under the ROC curve were assessed.

Results Compared to the random response predictor that simulated the current practice, the machine learning algorithms achieved significantly better performance (Precision/Recall/F-measure/area under the ROC curve: 70.82%/92.02%/80.04%/72.78% on 10-fold cross validation and 71.52%/92.68%/80.74%/75.74% on the test set). By analyzing the significant features output by the algorithms, the study confirmed several literature findings and identified challenges that could be mitigated to optimize recruitment.

Conclusion By exploiting predictive variables from multiple sources, we demonstrated that machine learning algorithms have great potential in improving the effectiveness of the recruitment process by automatically predicting patients’ participation response to trial invitations.

Collapse

Metting EI, in ’t Veen JC, Dekhuijzen PR, van Heijst E, Kocks JW, Muilwijk-Kroes JB, Chavannes NH, van der Molen T. Development of a diagnostic decision tree for obstructive pulmonary diseases based on real-life data. ERJ Open Res 2016;2:00077-2015. [PMID: 27730177 PMCID: PMC5005160 DOI: 10.1183/23120541.00077-2015] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 11/21/2015] [Indexed: 11/05/2022] Open

Cevik M, Ergun MA, Stout NK, Trentham-Dietz A, Craven M, Alagoz O. Using Active Learning for Speeding up Calibration in Simulation Models. Med Decis Making 2015;36:581-93. [PMID: 26471190 DOI: 10.1177/0272989x15611359] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Accepted: 07/17/2015] [Indexed: 01/08/2023]

Abstract

BACKGROUND

Most cancer simulation models include unobservable parameters that determine disease onset and tumor growth. These parameters play an important role in matching key outcomes such as cancer incidence and mortality, and their values are typically estimated via a lengthy calibration procedure, which involves evaluating a large number of combinations of parameter values via simulation. The objective of this study is to demonstrate how machine learning approaches can be used to accelerate the calibration process by reducing the number of parameter combinations that are actually evaluated.

METHODS

Active learning is a popular machine learning method that enables a learning algorithm such as artificial neural networks to interactively choose which parameter combinations to evaluate. We developed an active learning algorithm to expedite the calibration process. Our algorithm determines the parameter combinations that are more likely to produce desired outputs and therefore reduces the number of simulation runs performed during calibration. We demonstrate our method using the previously developed University of Wisconsin breast cancer simulation model (UWBCS).

RESULTS

In a recent study, calibration of the UWBCS required the evaluation of 378 000 input parameter combinations to build a race-specific model, and only 69 of these combinations produced results that closely matched observed data. By using the active learning algorithm in conjunction with standard calibration methods, we identify all 69 parameter combinations by evaluating only 5620 of the 378 000 combinations.

CONCLUSION

Machine learning methods hold potential in guiding model developers in the selection of more promising parameter combinations and hence speeding up the calibration process. Applying our machine learning algorithm to one model shows that evaluating only 1.49% of all parameter combinations would be sufficient for the calibration.

Collapse