1
|
Cazzaniga G, Eccher A, Munari E, Marletta S, Bonoldi E, Della Mea V, Cadei M, Sbaraglia M, Guerriero A, Dei Tos AP, Pagni F, L’Imperio V. Natural Language Processing to extract SNOMED-CT codes from pathological reports. Pathologica 2023; 115:318-324. [PMID: 38180139 PMCID: PMC10767798 DOI: 10.32074/1591-951x-952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 11/17/2023] [Indexed: 01/06/2024] Open
Abstract
Objective The use of standardized structured reports (SSR) and suitable terminologies like SNOMED-CT can enhance data retrieval and analysis, fostering large-scale studies and collaboration. However, the still large prevalence of narrative reports in our laboratories warrants alternative and automated labeling approaches. In this project, natural language processing (NLP) methods were used to associate SNOMED-CT codes to structured and unstructured reports from an Italian Digital Pathology Department. Methods Two NLP-based automatic coding systems (support vector machine, SVM, and long-short term memory, LSTM) were trained and applied to a series of narrative reports. Results The 1163 cases were tested with both algorithms, showing good performances in terms of accuracy, precision, recall, and F1 score, with SVM showing slightly better performances as compared to LSTM (0.84, 0.87, 0.83, 0.82 vs 0.83, 0.85, 0.83, 0.82, respectively). The integration of an explainability allowed identification of terms and groups of words of importance, enabling fine-tuning, balancing semantic meaning and model performance. Conclusions AI tools allow the automatic SNOMED-CT labeling of the pathology archives, providing a retrospective fix to the large lack of organization of narrative reports.
Collapse
Affiliation(s)
- Giorgio Cazzaniga
- Department of Medicine and Surgery, Pathology, IRCCS Fondazione San Gerardo dei Tintori, University of Milano-Bicocca, Italy
| | - Albino Eccher
- Section of Pathology, Department of Medical and Surgical Sciences for Children and Adults, University of Modena and Reggio Emilia, University Hospital of Modena, Modena, Italy
| | - Enrico Munari
- Department of Diagnostic and Public Health, Section of Pathology, University of Verona, Verona, Italy
| | - Stefano Marletta
- Department of Diagnostic and Public Health, Section of Pathology, University of Verona, Verona, Italy
| | - Emanuela Bonoldi
- Unit of Surgical Pathology and Cytogenetics, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
| | - Vincenzo Della Mea
- Department of Mathematics, Computer Science and Physics, University of Udine, Udine, Italy
| | - Moris Cadei
- Pathology Unit, ASST Spedali Civili di Brescia, Brescia, Italy
| | - Marta Sbaraglia
- Surgical Pathology and Cytopathology Unit, Department of Medicine-DIMED, University of Padua School of Medicine, Padua, Italy
| | - Angela Guerriero
- Surgical Pathology and Cytopathology Unit, Department of Medicine-DIMED, University of Padua School of Medicine, Padua, Italy
| | - Angelo Paolo Dei Tos
- Surgical Pathology and Cytopathology Unit, Department of Medicine-DIMED, University of Padua School of Medicine, Padua, Italy
| | - Fabio Pagni
- Department of Medicine and Surgery, Pathology, IRCCS Fondazione San Gerardo dei Tintori, University of Milano-Bicocca, Italy
| | - Vincenzo L’Imperio
- Department of Medicine and Surgery, Pathology, IRCCS Fondazione San Gerardo dei Tintori, University of Milano-Bicocca, Italy
| |
Collapse
|