Jiang H, Mao H, Lu H, Lin P, Garry W, Lu H, Yang G, Rainer TH, Chen X. Machine learning-based models to support decision-making in emergency department triage for patients with suspected cardiovascular disease. Int J Med Inform 2020;145:104326. [PMID: 33197878 DOI: 10.1016/j.ijmedinf.2020.104326]
[Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Received: 08/23/2020] [Revised: 10/16/2020] [Accepted: 10/30/2020] [Indexed: 12/23/2022]
Abstract
BACKGROUND
Accurate differentiation and prioritization in emergency department (ED) triage is important for identifying high-risk patients and allocating finite resources efficiently. Using data available at ED triage from patients presenting with suspected cardiovascular disease, this study aimed to train four common machine learning models and compare their performance in assisting triage-level decisions.
METHODS
This cross-sectional study was conducted at the Second Affiliated Hospital of Guangzhou Medical University from August 2015 to December 2018 inclusive. Demographic information, vital signs, blood glucose, and other available triage scores were collected. Four machine learning models were compared: multinomial logistic regression (multinomial LR), eXtreme gradient boosting (XGBoost), random forest (RF) and gradient-boosted decision tree (GBDT). For each model, 80 % of the data set was used for training and 20 % for testing. The area under the receiver operating characteristic curve (AUC), accuracy and macro-F1 were calculated for each model.
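As a minimal sketch of the evaluation set-up described above (not the authors' code), the following standard-library Python shows an 80/20 split together with accuracy and macro-F1 for multi-level triage labels. The function names, the random seed, and the toy labels are hypothetical; the abstract does not state how the split was drawn or how the multi-class AUC was computed, so AUC is left out here.

```python
import random

def split_80_20(rows, seed=0):
    """Shuffle a copy of rows and return (train, test) with an 80/20 cut.
    The seed and the purely random shuffle are assumptions for illustration."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 8 // 10
    return shuffled[:cut], shuffled[cut:]

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true triage level."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all levels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Hypothetical triage levels (1-4) for eight test patients.
y_true = [1, 2, 2, 3, 3, 3, 3, 4]
y_pred = [2, 2, 3, 3, 3, 3, 3, 3]

train, test = split_80_20(range(10))
print(len(train), len(test))
print(accuracy(y_true, y_pred), macro_f1(y_true, y_pred))
```

Macro-F1 averages the per-level F1 scores without weighting by class size, so rare levels count as much as common ones.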
RESULTS
Among 17,661 patients presenting with suspected cardiovascular disease, 1.3 %, 18.6 %, 76.5 %, and 3.6 % were triaged to level 1, level 2, level 3 and level 4, respectively. The AUCs were: XGBoost (0.937), GBDT (0.921), RF (0.919) and multinomial LR (0.908). Based on feature importance generated by XGBoost, blood pressure, pulse rate, oxygen saturation, and age were the most important variables for triage decisions.
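To put the reported class distribution in context, the short Python sketch below works out what a hypothetical rule that always assigns the most common triage level would score. This baseline is illustrative arithmetic on the percentages above, not a result reported in the study, and the variable names are assumptions.

```python
# Reported share of patients at each triage level.
distribution = {1: 0.013, 2: 0.186, 3: 0.765, 4: 0.036}

# The most common level (level 3 in the reported data).
majority = max(distribution, key=distribution.get)

# Always predicting the majority level is correct exactly when the
# true level is the majority level.
baseline_accuracy = distribution[majority]

# For the majority level: recall is 1 and precision equals its share.
# Every other level is never predicted, so its F1 is 0.
precision, recall = distribution[majority], 1.0
f1_majority = 2 * precision * recall / (precision + recall)
baseline_macro_f1 = f1_majority / len(distribution)

print(majority, baseline_accuracy, round(baseline_macro_f1, 3))
```

Under these assumptions the baseline reaches about 0.765 accuracy but only about 0.217 macro-F1, which is one reason to read accuracy alongside macro-F1 when classes are this imbalanced.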
CONCLUSION
All four machine learning models showed good discriminative ability for triage level. XGBoost demonstrated a slight advantage over the other models. These models could be used to differentiate low-risk from high-risk patients at triage as a strategy to improve efficiency and the allocation of finite resources.