Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Levin S, Toerper M, Hamrock E, Hinson JS, Barnes S, Gardner H, Dugas A, Linton B, Kirsch T, Kelen G. Machine-Learning-Based Electronic Triage More Accurately Differentiates Patients With Respect to Clinical Outcomes Compared With the Emergency Severity Index. Ann Emerg Med 2018;71:565-574.e2. [DOI: 10.1016/j.annemergmed.2017.08.005] [Citation(s) in RCA: 142] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Revised: 07/07/2017] [Accepted: 08/01/2017] [Indexed: 11/23/2022]

For:	Levin S, Toerper M, Hamrock E, Hinson JS, Barnes S, Gardner H, Dugas A, Linton B, Kirsch T, Kelen G. Machine-Learning-Based Electronic Triage More Accurately Differentiates Patients With Respect to Clinical Outcomes Compared With the Emergency Severity Index. Ann Emerg Med 2018;71:565-574.e2. [DOI: 10.1016/j.annemergmed.2017.08.005] [Citation(s) in RCA: 142] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Revised: 07/07/2017] [Accepted: 08/01/2017] [Indexed: 11/23/2022]

Number

Cited by Other Article(s)

151

Zhang X, Bellolio MF, Medrano-Gracia P, Werys K, Yang S, Mahajan P. Use of natural language processing to improve predictive models for imaging utilization in children presenting to the emergency department. BMC Med Inform Decis Mak 2019;19:287. [PMID: 31888609 PMCID: PMC6937987 DOI: 10.1186/s12911-019-1006-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 12/12/2019] [Indexed: 11/10/2022] Open

Abstract

OBJECTIVE

To examine the association between the medical imaging utilization and information related to patients' socioeconomic, demographic and clinical factors during the patients' ED visits; and to develop predictive models using these associated factors including natural language elements to predict the medical imaging utilization at pediatric ED.

METHODS

Pediatric patients' data from the 2012-2016 United States National Hospital Ambulatory Medical Care Survey was included to build the models to predict the use of imaging in children presenting to the ED. Multivariable logistic regression models were built with structured variables such as temperature, heart rate, age, and unstructured variables such as reason for visit, free text nursing notes and combined data available at triage. NLP techniques were used to extract information from the unstructured data.

RESULTS

Of the 27,665 pediatric ED visits included in the study, 8394 (30.3%) received medical imaging in the ED, including 6922 (25.0%) who had an X-ray and 1367 (4.9%) who had a computed tomography (CT) scan. In the predictive model including only structured variables, the c-statistic was 0.71 (95% CI: 0.70-0.71) for any imaging use, 0.69 (95% CI: 0.68-0.70) for X-ray, and 0.77 (95% CI: 0.76-0.78) for CT. Models including only unstructured information had c-statistics of 0.81 (95% CI: 0.81-0.82) for any imaging use, 0.82 (95% CI: 0.82-0.83) for X-ray, and 0.85 (95% CI: 0.83-0.86) for CT scans. When both structured variables and free text variables were included, the c-statistics reached 0.82 (95% CI: 0.82-0.83) for any imaging use, 0.83 (95% CI: 0.83-0.84) for X-ray, and 0.87 (95% CI: 0.86-0.88) for CT.

CONCLUSIONS

Both CT and X-rays are commonly used in the pediatric ED with one third of the visits receiving at least one. Patients' socioeconomic, demographic and clinical factors presented at ED triage period were associated with the medical imaging utilization. Predictive models combining structured and unstructured variables available at triage performed better than models using structured or unstructured variables alone, suggesting the potential for use of NLP in determining resource utilization.

Collapse

152

Spangler D, Hermansson T, Smekal D, Blomberg H. A validation of machine learning-based risk scores in the prehospital setting. PLoS One 2019;14:e0226518. [PMID: 31834920 PMCID: PMC6910679 DOI: 10.1371/journal.pone.0226518] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Accepted: 11/26/2019] [Indexed: 12/23/2022] Open

Abstract

BACKGROUND

The triage of patients in prehospital care is a difficult task, and improved risk assessment tools are needed both at the dispatch center and on the ambulance to differentiate between low- and high-risk patients. This study validates a machine learning-based approach to generating risk scores based on hospital outcomes using routinely collected prehospital data.

METHODS

Dispatch, ambulance, and hospital data were collected in one Swedish region from 2016-2017. Dispatch center and ambulance records were used to develop gradient boosting models predicting hospital admission, critical care (defined as admission to an intensive care unit or in-hospital mortality), and two-day mortality. Composite risk scores were generated based on the models and compared to National Early Warning Scores (NEWS) and actual dispatched priorities in a prospectively gathered dataset from 2018.

RESULTS

A total of 38203 patients were included from 2016-2018. Concordance indexes (or areas under the receiver operating characteristics curve) for dispatched priorities ranged from 0.51-0.66, while those for NEWS ranged from 0.66-0.85. Concordance ranged from 0.70-0.79 for risk scores based only on dispatch data, and 0.79-0.89 for risk scores including ambulance data. Dispatch data-based risk scores consistently outperformed dispatched priorities in predicting hospital outcomes, while models including ambulance data also consistently outperformed NEWS. Model performance in the prospective test dataset was similar to that found using cross-validation, and calibration was comparable to that of NEWS.

CONCLUSIONS

Machine learning-based risk scores outperformed a widely-used rule-based triage algorithm and human prioritization decisions in predicting hospital outcomes. Performance was robust in a prospectively gathered dataset, and scores demonstrated adequate calibration. Future research should explore the robustness of these methods when applied to other settings, establish appropriate outcome measures for use in determining the need for prehospital care, and investigate the clinical impact of interventions based on these methods.

Collapse

153

Choi SW, Ko T, Hong KJ, Kim KH. Machine Learning-Based Prediction of Korean Triage and Acuity Scale Level in Emergency Department Patients. Healthc Inform Res 2019;25:305-312. [PMID: 31777674 PMCID: PMC6859273 DOI: 10.4258/hir.2019.25.4.305] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 10/21/2019] [Accepted: 10/21/2019] [Indexed: 12/23/2022] Open

154

Ellahham S, Ellahham N, Simsekler MCE. Application of Artificial Intelligence in the Health Care Safety Context: Opportunities and Challenges. Am J Med Qual 2019;35:341-348. [DOI: 10.1177/1062860619878515] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

155

Jiang W, Siddiqui S, Barnes S, Barouch LA, Korley F, Martinez DA, Toerper M, Cabral S, Hamrock E, Levin S. Readmission Risk Trajectories for Patients With Heart Failure Using a Dynamic Prediction Approach: Retrospective Study. JMIR Med Inform 2019;7:e14756. [PMID: 31579025 PMCID: PMC6781727 DOI: 10.2196/14756] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2019] [Revised: 07/14/2019] [Accepted: 07/19/2019] [Indexed: 02/02/2023] Open

Abstract

Background

Patients hospitalized with heart failure suffer the highest rates of 30-day readmission among other clinically defined patient populations in the United States. Investigation into the predictability of 30-day readmissions can lead to clinical decision support tools and targeted interventions that can help care providers to improve individual patient care and reduce readmission risk.

Objective

This study aimed to develop a dynamic readmission risk prediction model that yields daily predictions for patients hospitalized with heart failure toward identifying risk trajectories over time and identifying clinical predictors associated with different patterns in readmission risk trajectories.

Methods

A two-stage predictive modeling approach combining logistic and beta regression was applied to electronic health record data accumulated daily to predict 30-day readmission for 534 hospital encounters of patients with heart failure over 2750 patient days. Unsupervised clustering was performed on predictions to uncover time-dependent trends in readmission risk over the patient’s hospital stay. We used data collected between September 1, 2013, and August 31, 2015, from a community hospital in Maryland (United States) for patients with a primary diagnosis of heart failure. Patients who died during the hospital stay or were transferred to other acute care hospitals or hospice care were excluded.

Results

Readmission occurred in 107 (107/534, 20.0%) encounters. The out-of-sample area under curve for the 2-stage predictive model was 0.73 (SD 0.08). Dynamic clinical predictors capturing laboratory results and vital signs had the highest predictive value compared with demographic, administrative, medical, and procedural data included. Unsupervised clustering identified four risk trajectory groups: decreasing risk (131/534, 24.5% encounters), high risk (113/534, 21.2%), moderate risk (177/534, 33.1%), and low risk (113/534, 21.2%). The decreasing risk group demonstrated change in average probability of readmission from admission (0.69) to discharge (0.30), whereas the high risk (0.75), moderate risk (0.61), and low risk (0.39) groups maintained consistency over the hospital course. A higher level of hemoglobin, larger decrease in potassium and diastolic blood pressure from admission to discharge, and smaller number of past hospitalizations are associated with decreasing readmission risk (P<.001).

Conclusions

Dynamically predicting readmission and quantifying trends over patients’ hospital stay illuminated differing risk trajectory groups. Identifying risk trajectory patterns and distinguishing predictors may shed new light on indicators of readmission and the isolated effects of the index hospitalization.

Collapse

156

Parakh A, Lee H, Lee JH, Eisner BH, Sahani DV, Do S. Urinary Stone Detection on CT Images Using Deep Convolutional Neural Networks: Evaluation of Model Performance and Generalization. Radiol Artif Intell 2019;1:e180066. [PMID: 33937795 PMCID: PMC8017404 DOI: 10.1148/ryai.2019180066] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 05/29/2019] [Accepted: 06/20/2019] [Indexed: 12/23/2022]

Abstract

PURPOSE

To investigate the diagnostic accuracy of cascading convolutional neural network (CNN) for urinary stone detection on unenhanced CT images and to evaluate the performance of pretrained models enriched with labeled CT images across different scanners.

MATERIALS AND METHODS

This HIPAA-compliant, institutional review board-approved, retrospective clinical study used unenhanced abdominopelvic CT scans from 535 adults suspected of having urolithiasis. The scans were obtained on two scanners (scanner 1 [hereafter S1] and scanner 2 [hereafter S2]). A radiologist reviewed clinical reports and labeled cases for determination of reference standard. Stones were present on 279 (S1, 131; S2, 148) and absent on 256 (S1, 158; S2, 98) scans. One hundred scans (50 from each scanner) were randomly reserved as the test dataset, and the rest were used for developing a cascade of two CNNs: The first CNN identified the extent of the urinary tract, and the second CNN detected presence of stone. Nine variations of models were developed through the combination of different training data sources (S1, S2, or both [hereafter SB]) with (ImageNet, GrayNet) and without (Random) pretrained CNNs. First, models were compared for generalizability at the section level. Second, models were assessed by using area under the receiver operating characteristic curve (AUC) and accuracy at the patient level with test dataset from both scanners (n = 100).

RESULTS

The GrayNet-pretrained model showed higher classifier exactness than did ImageNet-pretrained or Random-initialized models when tested by using data from the same or different scanners at section level. At the patient level, the AUC for stone detection was 0.92-0.95, depending on the model. Accuracy of GrayNet-SB (95%) was higher than that of ImageNet-SB (91%) and Random-SB (88%). For stones larger than 4 mm, all models showed similar performance (false-negative results: two of 34). For stones smaller than 4 mm, the number of false-negative results for GrayNet-SB, ImageNet-SB, and Random-SB were one of 16, three of 16, and five of 16, respectively. GrayNet-SB identified stones in all 22 test cases that had obstructive uropathy.

CONCLUSION

A cascading model of CNNs can detect urinary tract stones on unenhanced CT scans with a high accuracy (AUC, 0.954). Performance and generalization of CNNs across scanners can be enhanced by using transfer learning with datasets enriched with labeled medical images.© RSNA, 2019Supplemental material is available for this article. : An earlier incorrect version appeared online. This article was corrected on August 6, 2019.

Collapse

157

Prediction of emergency department patient disposition based on natural language processing of triage notes. Int J Med Inform 2019;129:184-188. [PMID: 31445253 DOI: 10.1016/j.ijmedinf.2019.06.008] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Revised: 05/21/2019] [Accepted: 06/10/2019] [Indexed: 12/23/2022]

158

Hodgson NR, Poterack KA, Mi L, Traub SJ. Association of Vital Signs and Process Outcomes in Emergency Department Patients. West J Emerg Med 2019;20:433-437. [PMID: 31123542 PMCID: PMC6526877 DOI: 10.5811/westjem.2019.1.41498] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Revised: 01/14/2019] [Accepted: 01/29/2019] [Indexed: 11/11/2022] Open

159

Zhang X, Kim J, Patzer RE, Pitts SR, Chokshi FH, Schrager JD. Advanced diagnostic imaging utilization during emergency department visits in the United States: A predictive modeling study for emergency department triage. PLoS One 2019;14:e0214905. [PMID: 30964899 PMCID: PMC6456195 DOI: 10.1371/journal.pone.0214905] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 03/21/2019] [Indexed: 11/18/2022] Open

Abstract

Background

Emergency department (ED) crowding is associated with negative health outcomes, patient dissatisfaction, and longer length of stay (LOS). The addition of advanced diagnostic imaging (ADI), namely CT, ultrasound (U/S), and MRI to ED encounter work up is a predictor of longer length of stay. Earlier and improved prediction of patients’ need for advanced imaging may improve overall ED efficiency. The aim of the study was to detect the association between ADI utilization and the structured and unstructured information immediately available during ED triage, and to develop and validate models to predict utilization of ADI during an ED encounter.

Methods

We used the United States National Hospital Ambulatory Medical Care Survey data from 2009 to 2014 to examine which sociodemographic and clinical factors immediately available at ED triage were associated with the utilization of CT, U/S, MRI, and multiple ADI during a patient’s ED stay. We used natural language processing (NLP) topic modeling to incorporate free-text reason for visit data available at time of ED triage in addition to other structured patient data to predict the use of ADI using multivariable logistic regression models.

Results

Among the 139,150 adult ED visits from a national probability sample of hospitals across the U.S, 21.9% resulted in ADI use, including 16.8% who had a CT, 3.6% who had an ultrasound, 0.4% who had an MRI, and 1.2% of the population who had multiple types of ADI. The c-statistic of the predictive models was greater than or equal to 0.78 for all imaging outcomes, and the addition of text-based reason for visit information improved the accuracy of all predictive models.

Conclusions

Patient information immediately available during ED triage can accurately predict the eventual use of advanced diagnostic imaging during an ED visit. Such models have the potential to be incorporated into the ED triage workflow in order to more rapidly identify patients who may require advanced imaging during their ED stay and assist with medical decision-making.

Collapse

160

Raita Y, Goto T, Faridi MK, Brown DFM, Camargo CA, Hasegawa K. Emergency department triage prediction of clinical outcomes using machine learning models. CRITICAL CARE : THE OFFICIAL JOURNAL OF THE CRITICAL CARE FORUM 2019;23:64. [PMID: 30795786 PMCID: PMC6387562 DOI: 10.1186/s13054-019-2351-7] [Citation(s) in RCA: 206] [Impact Index Per Article: 41.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Accepted: 02/10/2019] [Indexed: 12/18/2022]

Abstract

Background

Development of emergency department (ED) triage systems that accurately differentiate and prioritize critically ill from stable patients remains challenging. We used machine learning models to predict clinical outcomes, and then compared their performance with that of a conventional approach—the Emergency Severity Index (ESI).

Methods

Using National Hospital and Ambulatory Medical Care Survey (NHAMCS) ED data, from 2007 through 2015, we identified all adult patients (aged ≥ 18 years). In the randomly sampled training set (70%), using routinely available triage data as predictors (e.g., demographics, triage vital signs, chief complaints, comorbidities), we developed four machine learning models: Lasso regression, random forest, gradient boosted decision tree, and deep neural network. As the reference model, we constructed a logistic regression model using the five-level ESI data. The clinical outcomes were critical care (admission to intensive care unit or in-hospital death) and hospitalization (direct hospital admission or transfer). In the test set (the remaining 30%), we measured the predictive performance, including area under the receiver-operating-characteristics curve (AUC) and net benefit (decision curves) for each model.

Results

Of 135,470 eligible ED visits, 2.1% had critical care outcome and 16.2% had hospitalization outcome. In the critical care outcome prediction, all four machine learning models outperformed the reference model (e.g., AUC, 0.86 [95%CI 0.85–0.87] in the deep neural network vs 0.74 [95%CI 0.72–0.75] in the reference model), with less under-triaged patients in ESI triage levels 3 to 5 (urgent to non-urgent). Likewise, in the hospitalization outcome prediction, all machine learning models outperformed the reference model (e.g., AUC, 0.82 [95%CI 0.82–0.83] in the deep neural network vs 0.69 [95%CI 0.68–0.69] in the reference model) with less over-triages in ESI triage levels 1 to 3 (immediate to urgent). In the decision curve analysis, all machine learning models consistently achieved a greater net benefit—a larger number of appropriate triages considering a trade-off with over-triages—across the range of clinical thresholds.

Conclusions

Compared to the conventional approach, the machine learning models demonstrated a superior performance to predict critical care and hospitalization outcomes. The application of modern machine learning models may enhance clinicians’ triage decision making, thereby achieving better clinical care and optimal resource utilization.

Electronic supplementary material

The online version of this article (10.1186/s13054-019-2351-7) contains supplementary material, which is available to authorized users.

Collapse

161

Naqvi SA, Thompson GC, Joffe AR, Blackwood J, Martin DA, Brindle M, Barkema HW, Jenne CN. Cytokines and Chemokines in Pediatric Appendicitis: A Multiplex Analysis of Inflammatory Protein Mediators. Mediators Inflamm 2019;2019:2359681. [PMID: 30918467 PMCID: PMC6409077 DOI: 10.1155/2019/2359681] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Accepted: 01/15/2019] [Indexed: 12/29/2022] Open

Abstract

OBJECTIVES

We aimed to demonstrate the potential of precision medicine to describe the inflammatory landscape present in children with suspected appendicitis. Our primary objective was to determine levels of seven inflammatory protein mediators previously associated with intra-abdominal inflammation (C-reactive protein-CRP, procalcitonin-PCT, interleukin-6 (IL), IL-8, IL-10, monocyte chemoattractant protein-1-MCP-1, and serum amyloid A-SAA) in a cohort of children with suspected appendicitis. Subsequently, using a multiplex proteomics approach, we examined an expansive array of novel candidate cytokine and chemokines within this population.

METHODS

We performed a secondary analysis of targeted proteomics data from Alberta Sepsis Network studies. Plasma mediator levels, analyzed by Luminex multiplex assays, were evaluated in children aged 5-17 years with nonappendicitis abdominal pain (NAAP), acute appendicitis (AA), and nonappendicitis sepsis (NAS). We used multivariate regression analysis to evaluate the seven target proteins, followed by decision tree and heat mapping analyses for all proteins evaluated.

RESULTS

185 children were included: 83 with NAAP, 79 AA, and 23 NAS. Plasma levels of IL-6, CRP, MCP-1, PCT, and SAA were significantly different in children with AA compared to those with NAAP (p < 0.001). Expansive proteomic analysis demonstrated 6 patterns in inflammatory mediator profiles based on severity of illness. A decision tree incorporating the proteins CRP, ferritin, SAA, regulated on activation normal T-cell expressed and secreted (RANTES), monokine induced by gamma interferon (MIG), and PCT demonstrated excellent specificity (0.920) and negative predictive value (0.882) for children with appendicitis.

CONCLUSIONS

Multiplex proteomic analyses described the inflammatory landscape of children presenting to the ED with suspected appendicitis. We have demonstrated the feasibility of this approach to identify potential novel candidate cytokines/chemokine patterns associated with a specific illness (appendicitis) amongst those with a broad ED presentation (abdominal pain). This approach can be modelled for future research initiatives in pediatric emergency medicine.

Collapse

162

Lee S, Mohr NM, Street WN, Nadkarni P. Machine Learning in Relation to Emergency Medicine Clinical and Operational Scenarios: An Overview. West J Emerg Med 2019;20:219-227. [PMID: 30881539 PMCID: PMC6404711 DOI: 10.5811/westjem.2019.1.41244] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Revised: 12/21/2018] [Accepted: 01/01/2019] [Indexed: 12/13/2022] Open

163

Taboulet P, Vincent-Cassy C, Squara PA, Resche-Rigon M. Validité de la FRENCH, l’échelle de tri des urgences hospitalières élaborée par la Société française de médecine d’urgence. ANNALES FRANCAISES DE MEDECINE D URGENCE 2019. [DOI: 10.3166/afmu-2018-0099] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Abstract Introduction : La Société française de médecine d’urgence a créé en 2016 une échelle de tri pour les infirmier( ière)s d’accueil dans une structure d’urgences. Cette échelle appelée FRENCH (FRench Emergency Nurses Classification in Hospital) classe les tris de 5 à 1 (du moins urgent au plus urgent) en fonction du pronostic et de la complexité/ sévérité des motifs de recours aux soins. Le tri 3, groupe hétérogène dans les échelles de tri internationales, a été subdivisé en deux niveaux pour prioriser les patients qui ont une comorbidité en rapport avec le motif de recours aux soins ou qui sont adressés par un médecin et qui sont prioritaires (3A) par rapport aux autres patients (3B). Objectif : Évaluer la pertinence de la FRENCH. Méthode : Nous avons analysé les données démographiques, les paramètres vitaux, les examens complémentaires prescrits et les durées de prise en charge de tous les patients accueillis dans un hôpital universitaire sur une période de neuf mois consécutifs. Le critère de jugement était l’existence d’une relation croissante entre le niveau de complexité/ sévérité des patients — reflétée par le taux d’hospitalisation et la prescription d’examens complémentaires—et le niveau de priorité du tri. Résultats : L’étude a inclus 27 598 patients. La répartition des patients par niveaux de tri était : 0,4 (tri 1), 6,7 (tri 2), 13,3 (tri 3A), 29,4 (tri 3B), 43,1 (tri 4) et 7,1 % (tri 5). Le taux d’hospitalisation était croissant quand l’ordre de priorité augmentait. La relation entre les niveaux de tri et le taux d’hospitalisation mesurée par l’aire sous la courbe (0,83 : intervalle de confiance à 95 % : [0,82–0,83]) était bonne. La prescription des examens complémentaires était croissante quand l’ordre de priorité augmentait. La corrélation entre les niveaux de tri et un indice global d’examens complémentaires prescrits était modérée (K = 0,51). Conclusion : Cette étude monocentrique valide la pertinence de l’échelle de tri FRENCH à six niveaux par sa bonne capacité à classer les patients selon leur complexité/sévérité. De nouvelles évaluations sont nécessaires dans d’autres structures d’urgences pour confirmer sa performance et favoriser son évolution. Collapse

164

Goto T, Camargo CA, Faridi MK, Freishtat RJ, Hasegawa K. Machine Learning-Based Prediction of Clinical Outcomes for Children During Emergency Department Triage. JAMA Netw Open 2019;2:e186937. [PMID: 30646206 PMCID: PMC6484561 DOI: 10.1001/jamanetworkopen.2018.6937] [Citation(s) in RCA: 132] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open

Abstract

IMPORTANCE

While machine learning approaches may enhance prediction ability, little is known about their utility in emergency department (ED) triage.

OBJECTIVES

To examine the performance of machine learning approaches to predict clinical outcomes and disposition in children in the ED and to compare their performance with conventional triage approaches.

DESIGN, SETTING, AND PARTICIPANTS

Prognostic study of ED data from the National Hospital Ambulatory Medical Care Survey from January 1, 2007, through December 31, 2015. A nationally representative sample of 52 037 children aged 18 years or younger who presented to the ED were included. Data analysis was performed in August 2018.

MAIN OUTCOMES AND MEASURES

The outcomes were critical care (admission to an intensive care unit and/or in-hospital death) and hospitalization (direct hospital admission or transfer). In the training set (70% random sample), using routinely available triage data as predictors (eg, demographic characteristics and vital signs), we derived 4 machine learning-based models: lasso regression, random forest, gradient-boosted decision tree, and deep neural network. In the test set (the remaining 30% of the sample), we measured the models' prediction performance by computing C statistics, prospective prediction results, and decision curves. These machine learning models were built for each outcome and compared with the reference model using the conventional triage classification information.

RESULTS

Of 52 037 eligible ED visits by children (median [interquartile range] age, 6 [2-14] years; 24 929 [48.0%] female), 163 (0.3%) had the critical care outcome and 2352 (4.5%) had the hospitalization outcome. For the critical care prediction, all machine learning approaches had higher discriminative ability compared with the reference model, although the difference was not statistically significant (eg, C statistics of 0.85 [95% CI, 0.78-0.92] for the deep neural network vs 0.78 [95% CI, 0.71-0.85] for the reference; P = .16), and lower number of undertriaged critically ill children in the conventional triage levels 3 to 5 (urgent to nonurgent). For the hospitalization prediction, all machine learning approaches had significantly higher discrimination ability (eg, C statistic, 0.80 [95% CI, 0.78-0.81] for the deep neural network vs 0.73 [95% CI, 0.71-0.75] for the reference; P < .001) and fewer overtriaged children who did not require inpatient management in the conventional triage levels 1 to 3 (immediate to urgent). The decision curve analysis demonstrated a greater net benefit of machine learning models over ranges of clinical thresholds.

CONCLUSIONS AND RELEVANCE

Machine learning-based triage had better discrimination ability to predict clinical outcomes and disposition, with reduction in undertriaging critically ill children and overtriaging children who are less ill.

Collapse

165

Chatterjee S, Desai S, Manesh R, Sun J, Nundy S, Wright SM. Assessment of a Simulated Case-Based Measurement of Physician Diagnostic Performance. JAMA Netw Open 2019;2:e187006. [PMID: 30646211 PMCID: PMC6484555 DOI: 10.1001/jamanetworkopen.2018.7006] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Abstract

IMPORTANCE

Diagnostic acumen is a fundamental skill in the practice of medicine. Scalable, practical, and objective tools to assess diagnostic performance are lacking.

OBJECTIVE

To validate a new method of assessing diagnostic performance that uses automated techniques to assess physicians' diagnostic performance on brief, open-ended case simulations.

DESIGN, SETTING, AND PARTICIPANTS

Retrospective cohort study of 11 023 unique attempts to solve case simulations on an online software platform, The Human Diagnosis Project (Human Dx). A total of 1738 practicing physicians, residents (internal medicine, family medicine, and emergency medicine), and medical students throughout the United States voluntarily used Human Dx software between January 21, 2016, and January 15, 2017.

MAIN OUTCOMES AND MEASURES

Internal structure validity was assessed by 3 measures of diagnostic performance: accuracy, efficiency, and a combined score (Diagnostic Acumen Precision Performance [DAPP]). These were each analyzed by level of training. Association with other variables' validity evidence was evaluated by correlating diagnostic performance and affiliation with an institution ranked in the top 25 medical schools by US News and World Report.

RESULTS

Data were analyzed for 239 attending physicians, 926 resident physicians, 347 intern physicians, and 226 medical students. Attending physicians had higher mean accuracy scores than medical students (difference, 8.1; 95% CI, 4.2-12.0; P < .001), as did residents (difference, 8.0; 95% CI, 4.8-11.2; P < .001) and interns (difference, 5.9; 95% CI, 2.3-9.6; P < .001). Attending physicians had higher mean efficiency compared with residents (difference, 4.8; 95% CI, 1.8-7.8; P < .001), interns (difference, 5.0; 95% CI, 1.5-8.4; P = .001), and medical students (difference, 5.4; 95% CI, 1.4-9.3; P = .003). Attending physicians also had significantly higher mean DAPP scores than residents (difference, 2.6; 95% CI, 0.0-5.2; P = .05), interns (difference, 3.6; 95% CI, 0.6-6.6; P = .01), and medical students (difference, 6.7; 95% CI, 3.3-10.2; P < .001). Attending physicians affiliated with a US News and World Report-ranked institution had higher mean DAPP scores compared with nonaffiliated attending physicians (80 [95% CI, 77-83] vs 72 [95% CI, 70-74], respectively; P < .001). Resident physicians affiliated with an institution ranked in the top 25 medical schools by US News and World Report also had higher mean DAPP scores compared with nonaffiliated peers (75 [95% CI, 73-77] vs 71 [95% CI, 69-72], respectively; P < .001).

CONCLUSIONS AND RELEVANCE

The data suggest that diagnostic performance is higher in those with more training and that DAPP scores may be a valid measure to appraise diagnostic performance. This diagnostic assessment tool allows individuals to receive immediate feedback on performance through an openly accessible online platform.

Collapse

166

Tam HL, Chung SF, Lou CK. A review of triage accuracy and future direction. BMC Emerg Med 2018;18:58. [PMID: 30572841 PMCID: PMC6302512 DOI: 10.1186/s12873-018-0215-0] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2018] [Accepted: 12/10/2018] [Indexed: 11/10/2022] Open

167

Al Jalbout N, Troncoso R, Evans JD, Rothman RE, Hinson JS. Biomarkers and Molecular Diagnostics for Early Detection and Targeted Management of Sepsis and Septic Shock in the Emergency Department. J Appl Lab Med 2018;3:724-729. [PMID: 31639740 DOI: 10.1373/jalm.2018.027425] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 10/23/2018] [Indexed: 12/14/2022]

168

Hinson JS, Martinez DA, Cabral S, George K, Whalen M, Hansoti B, Levin S. Triage Performance in Emergency Medicine: A Systematic Review. Ann Emerg Med 2018;74:140-152. [PMID: 30470513 DOI: 10.1016/j.annemergmed.2018.09.022] [Citation(s) in RCA: 109] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 09/11/2018] [Accepted: 09/21/2018] [Indexed: 12/12/2022]

Abstract

STUDY OBJECTIVE

Rapid growth in emergency department (ED) triage literature has been accompanied by diversity in study design, methodology, and outcome assessment. We aim to synthesize existing ED triage literature by using a framework that enables performance comparisons and benchmarking across triage systems, with respect to clinical outcomes and reliability.

METHODS

PubMed, EMBASE, Scopus, and Web of Science were systematically searched for studies of adult ED triage systems through 2016. Studies evaluating triage systems with evidence of widespread adoption (Australian Triage Scale, Canadian Triage and Acuity Scale, Emergency Severity Index, Manchester Triage Scale, and South African Triage Scale) were cataloged and compared for performance in identifying patients at risk for mortality, critical illness and hospitalization, and interrater reliability. This study was performed and reported in adherence to Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines.

RESULTS

A total of 6,160 publications were identified, with 182 meeting eligibility criteria and 50 with sufficient data for inclusion in comparative analysis. The Canadian Triage and Acuity Scale (32 studies), Emergency Severity Index (43), and Manchester Triage Scale (38) were the most frequently studied triage scales, and all demonstrated similar performance. Most studies (6 of 8) reported high sensitivity (>90%) of triage scales for identifying patients with ED mortality as high acuity at triage. However, sensitivity was low (<80%) for identification of patients who had critical illness outcomes and those who died within days of the ED visit or during the index hospitalization. Sensitivity varied by critical illness and was lower for severe sepsis (36% to 74%), pulmonary embolism (54%), and non-ST-segment elevation myocardial infarction (44% to 85%) compared with ST-segment elevation myocardial infarction (56% to 92%) and general outcomes of ICU admission (58% to 100%) and lifesaving intervention (77% to 98%). Some proportion of hospitalized patients (3% to 45%) were triaged to low acuity (level 4 to 5) in all studies. Reliability measures (κ) were variable across evaluations, with only a minority (11 of 42) reporting κ above 0.8.

CONCLUSION

We found that a substantial proportion of ED patients who die postencounter or are critically ill are not designated as high acuity at triage. Opportunity to improve interrater reliability and triage performance in identifying patients at risk of adverse outcome exists.

Collapse

169

Kwon JM, Lee Y, Lee Y, Lee S, Park H, Park J. Validation of deep-learning-based triage and acuity score using a large national dataset. PLoS One 2018;13:e0205836. [PMID: 30321231 PMCID: PMC6188844 DOI: 10.1371/journal.pone.0205836] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 10/02/2018] [Indexed: 12/03/2022] Open

170

Hong WS, Haimovich AD, Taylor RA. Predicting hospital admission at emergency department triage using machine learning. PLoS One 2018;13:e0201016. [PMID: 30028888 PMCID: PMC6054406 DOI: 10.1371/journal.pone.0201016] [Citation(s) in RCA: 101] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2018] [Accepted: 07/06/2018] [Indexed: 01/09/2023] Open

Abstract

OBJECTIVE

To predict hospital admission at the time of ED triage using patient history in addition to information collected at triage.

METHODS

This retrospective study included all adult ED visits between March 2014 and July 2017 from one academic and two community emergency rooms that resulted in either admission or discharge. A total of 972 variables were extracted per patient visit. Samples were randomly partitioned into training (80%), validation (10%), and test (10%) sets. We trained a series of nine binary classifiers using logistic regression (LR), gradient boosting (XGBoost), and deep neural networks (DNN) on three dataset types: one using only triage information, one using only patient history, and one using the full set of variables. Next, we tested the potential benefit of additional training samples by training models on increasing fractions of our data. Lastly, variables of importance were identified using information gain as a metric to create a low-dimensional model.

RESULTS

A total of 560,486 patient visits were included in the study, with an overall admission risk of 29.7%. Models trained on triage information yielded a test AUC of 0.87 for LR (95% CI 0.86-0.87), 0.87 for XGBoost (95% CI 0.87-0.88) and 0.87 for DNN (95% CI 0.87-0.88). Models trained on patient history yielded an AUC of 0.86 for LR (95% CI 0.86-0.87), 0.87 for XGBoost (95% CI 0.87-0.87) and 0.87 for DNN (95% CI 0.87-0.88). Models trained on the full set of variables yielded an AUC of 0.91 for LR (95% CI 0.91-0.91), 0.92 for XGBoost (95% CI 0.92-0.93) and 0.92 for DNN (95% CI 0.92-0.92). All algorithms reached maximum performance at 50% of the training set or less. A low-dimensional XGBoost model built on ESI level, outpatient medication counts, demographics, and hospital usage statistics yielded an AUC of 0.91 (95% CI 0.91-0.91).

CONCLUSION

Machine learning can robustly predict hospital admission using triage information and patient history. The addition of historical information improves predictive performance significantly compared to using triage information alone, highlighting the need to incorporate these variables into prediction models.

Collapse

171

Stewart J, Sprivulis P, Dwivedi G. Artificial intelligence and machine learning in emergency medicine. Emerg Med Australas 2018;30:870-874. [PMID: 30014578 DOI: 10.1111/1742-6723.13145] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 06/21/2018] [Indexed: 01/01/2023]

172

Machine learning approaches for predicting disposition of asthma and COPD exacerbations in the ED. Am J Emerg Med 2018;36:1650-1654. [PMID: 29970272 DOI: 10.1016/j.ajem.2018.06.062] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2018] [Revised: 06/25/2018] [Accepted: 06/26/2018] [Indexed: 11/22/2022] Open

Abstract

OBJECTIVE

The prediction of emergency department (ED) disposition at triage remains challenging. Machine learning approaches may enhance prediction. We compared the performance of several machine learning approaches for predicting two clinical outcomes (critical care and hospitalization) among ED patients with asthma or COPD exacerbation.

METHODS

Using the 2007-2015 National Hospital and Ambulatory Medical Care Survey (NHAMCS) ED data, we identified adults with asthma or COPD exacerbation. In the training set (70% random sample), using routinely-available triage data as predictors (e.g., demographics, arrival mode, vital signs, chief complaint, comorbidities), we derived four machine learning-based models: Lasso regression, random forest, boosting, and deep neural network. In the test set (the remaining 30% of sample), we compared their prediction ability against traditional logistic regression with Emergency Severity Index (ESI, reference model).

RESULTS

Of 3206 eligible ED visits, corresponding to weighted estimates of 13.9 million visits, 4% had critical care outcome and 26% had hospitalization outcome. For the critical care prediction, the best performing approach- boosting - achieved the highest discriminative ability (C-statistics 0.80 vs. 0.68), reclassification improvement (net reclassification improvement [NRI] 53%, P = 0.002), and sensitivity (0.79 vs. 0.53) over the reference model. For the hospitalization prediction, random forest provided the highest discriminative ability (C-statistics 0.83 vs. 0.64) reclassification improvement (NRI 92%, P < 0.001), and sensitivity (0.75 vs. 0.33). Results were generally consistent across the asthma and COPD subgroups.

CONCLUSIONS

Based on nationally-representative ED data, machine learning approaches improved the ability to predict disposition of patients with asthma or COPD exacerbation.

Collapse

173

The Long and Winding Triage Road. Ann Emerg Med 2018;71:575-577. [DOI: 10.1016/j.annemergmed.2018.03.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

174

An Evolutionary Computation Approach for Optimizing Multilevel Data to Predict Patient Outcomes. JOURNAL OF HEALTHCARE ENGINEERING 2018;2018:7174803. [PMID: 29744026 PMCID: PMC5878885 DOI: 10.1155/2018/7174803] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 01/31/2018] [Indexed: 11/18/2022]