1
|
Gao J, Chen G, O'Rourke AP, Caskey J, Carey KA, Oguss M, Stey A, Dligach D, Miller T, Mayampurath A, Churpek MM, Afshar M. Automated stratification of trauma injury severity across multiple body regions using multi-modal, multi-class machine learning models. J Am Med Inform Assoc 2024:ocae071. [PMID: 38587875 DOI: 10.1093/jamia/ocae071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 02/29/2024] [Accepted: 03/21/2024] [Indexed: 04/09/2024] Open
Abstract
OBJECTIVE The timely stratification of trauma injury severity can enhance the quality of trauma care but it requires intense manual annotation from certified trauma coders. The objective of this study is to develop machine learning models for the stratification of trauma injury severity across various body regions using clinical text and structured electronic health records (EHRs) data. MATERIALS AND METHODS Our study utilized clinical documents and structured EHR variables linked with the trauma registry data to create 2 machine learning models with different approaches to representing text. The first one fuses concept unique identifiers (CUIs) extracted from free text with structured EHR variables, while the second one integrates free text with structured EHR variables. Temporal validation was undertaken to ensure the models' temporal generalizability. Additionally, analyses to assess the variable importance were conducted. RESULTS Both models demonstrated impressive performance in categorizing leg injuries, achieving high accuracy with macro-F1 scores of over 0.8. Additionally, they showed considerable accuracy, with macro-F1 scores exceeding or near 0.7, in assessing injuries in the areas of the chest and head. We showed in our variable importance analysis that the most important features in the model have strong face validity in determining clinically relevant trauma injuries. DISCUSSION The CUI-based model achieves comparable performance, if not higher, compared to the free-text-based model, with reduced complexity. Furthermore, integrating structured EHR data improves performance, particularly when the text modalities are insufficiently indicative. CONCLUSIONS Our multi-modal, multiclass models can provide accurate stratification of trauma injury severity and clinically relevant interpretations.
Collapse
Affiliation(s)
- Jifan Gao
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53726, United States
| | - Guanhua Chen
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53726, United States
| | - Ann P O'Rourke
- Department of Surgery, University of Wisconsin-Madison, Madison, WI 53792, United States
| | - John Caskey
- Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, United States
| | - Kyle A Carey
- Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, United States
| | - Madeline Oguss
- Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, United States
| | - Anne Stey
- Department of Surgery, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, United States
- Center of Health Services and Outcomes Research, Institute for Public Health and Medicine, Chicago, IL 60611, United States
| | - Dmitriy Dligach
- Department of Computer Science, Loyola University Chicago, Chicago, IL 60660, United States
| | - Timothy Miller
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA 02115, United States
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, United States
| | - Anoop Mayampurath
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53726, United States
- Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, United States
| | - Matthew M Churpek
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53726, United States
- Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, United States
| | - Majid Afshar
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53726, United States
- Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, United States
| |
Collapse
|