1
|
Chomutare T, Lamproudis A, Budrionis A, Svenning TO, Hind LI, Ngo PD, Mikalsen KØ, Dalianis H. Improving Quality of ICD-10 (International Statistical Classification of Diseases, Tenth Revision) Coding Using AI: Protocol for a Crossover Randomized Controlled Trial. JMIR Res Protoc 2024; 13:e54593. [PMID: 38470476 DOI: 10.2196/54593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/12/2024] [Accepted: 01/16/2024] [Indexed: 03/13/2024] Open
Abstract
BACKGROUND Computer-assisted clinical coding (CAC) tools are designed to help clinical coders assign standardized codes, such as the ICD-10 (International Statistical Classification of Diseases, Tenth Revision), to clinical texts, such as discharge summaries. Maintaining the integrity of these standardized codes is important both for the functioning of health systems and for ensuring data used for secondary purposes are of high quality. Clinical coding is an error-prone cumbersome task, and the complexity of modern classification systems such as the ICD-11 (International Classification of Diseases, Eleventh Revision) presents significant barriers to implementation. To date, there have only been a few user studies; therefore, our understanding is still limited regarding the role CAC systems can play in reducing the burden of coding and improving the overall quality of coding. OBJECTIVE The objective of the user study is to generate both qualitative and quantitative data for measuring the usefulness of a CAC system, Easy-ICD, that was developed for recommending ICD-10 codes. Specifically, our goal is to assess whether our tool can reduce the burden on clinical coders and also improve coding quality. METHODS The user study is based on a crossover randomized controlled trial study design, where we measure the performance of clinical coders when they use our CAC tool versus when they do not. Performance is measured by the time it takes them to assign codes to both simple and complex clinical texts as well as the coding quality, that is, the accuracy of code assignment. RESULTS We expect the study to provide us with a measurement of the effectiveness of the CAC system compared to manual coding processes, both in terms of time use and coding quality. Positive outcomes from this study will imply that CAC tools hold the potential to reduce the burden on health care staff and will have major implications for the adoption of artificial intelligence-based CAC innovations to improve coding practice. Expected results to be published summer 2024. CONCLUSIONS The planned user study promises a greater understanding of the impact CAC systems might have on clinical coding in real-life settings, especially with regard to coding time and quality. Further, the study may add new insights on how to meaningfully exploit current clinical text mining capabilities, with a view to reducing the burden on clinical coders, thus lowering the barriers and paving a more sustainable path to the adoption of modern coding systems, such as the new ICD-11. TRIAL REGISTRATION clinicaltrials.gov NCT06286865; https://clinicaltrials.gov/study/NCT06286865. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) DERR1-10.2196/54593.
Collapse
Affiliation(s)
- Taridzo Chomutare
- Health Data Analytics, Norwegian Centre for E-health Research, Tromsø, Norway
- Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway
| | | | - Andrius Budrionis
- Health Data Analytics, Norwegian Centre for E-health Research, Tromsø, Norway
- Department of Physics and Technology, UiT The Arctic University of Norway, Tromsø, Norway
| | | | - Lill Irene Hind
- Clinic for Surgery, Oncology and Women Health, University Hospital of North Norway, Tromsø, Norway
| | - Phuong Dinh Ngo
- Health Data Analytics, Norwegian Centre for E-health Research, Tromsø, Norway
- Department of Physics and Technology, UiT The Arctic University of Norway, Tromsø, Norway
| | - Karl Øyvind Mikalsen
- Department of Physics and Technology, UiT The Arctic University of Norway, Tromsø, Norway
- The Norwegian Centre for Clinical Artificial Intelligence, University Hospital of North Norway, Tromsø, Norway
| | - Hercules Dalianis
- Health Data Analytics, Norwegian Centre for E-health Research, Tromsø, Norway
- Department of Computer and Systems Sciences, Stockholm University, Kista, Sweden
| |
Collapse
|
2
|
Avram C, Gligor A, Roman D, Soylu A, Nyulas V, Avram L. Machine learning based assessment of preclinical health questionnaires. Int J Med Inform 2023; 180:105248. [PMID: 37866276 DOI: 10.1016/j.ijmedinf.2023.105248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 10/04/2023] [Accepted: 10/08/2023] [Indexed: 10/24/2023]
Abstract
BACKGROUND Within modern health systems, the possibility of accessing a large amount and a variety of data related to patients' health has increased significantly over the years. The source of this data could be mobile and wearable electronic systems used in everyday life, and specialized medical devices. In this study we aim to investigate the use of modern Machine Learning (ML) techniques for preclinical health assessment based on data collected from questionnaires filled out by patients. METHOD To identify the health conditions of pregnant women, we developed a questionnaire that was distributed in three maternity hospitals in the Mureș County, Romania. In this work we proposed and developed an ML model for pattern detection in common risk assessment based on data extracted from questionnaires. RESULTS Out of the 1278 women who answered the questionnaire, 381 smoked before pregnancy and only 216 quit smoking during the period in which they became pregnant. The performance of the model indicates the feasibility of the solution, with an accuracy of 98 % confirmed for the considered case study. CONCLUSION The proposed solution offers a simple and efficient way to digitize questionnaire data and to analyze the data through a reduced computational effort, both in terms of memory and computing power used.
Collapse
Affiliation(s)
- Calin Avram
- George Emil Palade University of Medicine, Pharmacy, Science and Technology of Targu Mures, Romania.
| | - Adrian Gligor
- George Emil Palade University of Medicine, Pharmacy, Science and Technology of Targu Mures, Romania.
| | - Dumitru Roman
- SINTEF AS, Norway; OsloMet - Oslo Metropolitan University, Norway.
| | - Ahmet Soylu
- OsloMet - Oslo Metropolitan University, Norway.
| | - Victoria Nyulas
- George Emil Palade University of Medicine, Pharmacy, Science and Technology of Targu Mures, Romania.
| | - Laura Avram
- "Dimitrie Cantemir" University of Târgu-Mureș, Romania.
| |
Collapse
|
3
|
Franzoi MA, Gillanders E, Vaz-Luis I. Unlocking digitally enabled research in oncology: the time is now. ESMO Open 2023; 8:101633. [PMID: 37660408 PMCID: PMC10482746 DOI: 10.1016/j.esmoop.2023.101633] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 08/07/2023] [Indexed: 09/05/2023] Open
Affiliation(s)
- M A Franzoi
- Cancer Survivorship Group, Inserm Unit 981, Gustave Roussy, Villejuif
| | - E Gillanders
- Cancer Survivorship Group, Inserm Unit 981, Gustave Roussy, Villejuif
| | - I Vaz-Luis
- Cancer Survivorship Group, Inserm Unit 981, Gustave Roussy, Villejuif; Department for the Organization of Patient Pathways, DIOPP, Gustave Roussy, Villejuif, France.
| |
Collapse
|
4
|
Rani S, Jain A. Optimizing healthcare system by amalgamation of text processing and deep learning: a systematic review. MULTIMEDIA TOOLS AND APPLICATIONS 2023:1-25. [PMID: 37362695 PMCID: PMC10183315 DOI: 10.1007/s11042-023-15539-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 05/18/2022] [Accepted: 04/19/2023] [Indexed: 06/28/2023]
Abstract
The explosion of clinical textual data has drawn the attention of researchers. Owing to the abundance of clinical data, it is becoming difficult for healthcare professionals to take real-time measures. The tools and methods are lacking when compared to the amount of clinical data generated every day. This review aims to survey the text processing pipeline with deep learning methods such as CNN, RNN, LSTM, and GRU in the healthcare domain and discuss various applications such as clinical concept detection and extraction, medically aware dialogue systems, sentiment analysis of drug reviews shared online, clinical trial matching, and pharmacovigilance. In addition, we highlighted the major challenges in deploying text processing with deep learning to clinical textual data and identified the scope of research in this domain. Furthermore, we have discussed various resources that can be used in the future to optimize the healthcare domain by amalgamating text processing and deep learning.
Collapse
Affiliation(s)
- Somiya Rani
- Department of Computer Science and Engineering, NSUT East Campus (erstwhile AIACTR), Affiliated to Guru Gobind Singh Indraprastha University, Delhi, India
| | - Amita Jain
- Department of Computer Science and Engineering, Netaji Subhas University of Technology, Delhi, India
| |
Collapse
|
5
|
Ladas N, Borchert F, Franz S, Rehberg A, Strauch N, Sommer KK, Marschollek M, Gietzelt M. Programming techniques for improving rule readability for rule-based information extraction natural language processing pipelines of unstructured and semi-structured medical texts. Health Informatics J 2023; 29:14604582231164696. [PMID: 37068028 DOI: 10.1177/14604582231164696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
BACKGROUND Extraction of medical terms and their corresponding values from semi-structured and unstructured texts of medical reports can be a time-consuming and error-prone process. Methods of natural language processing (NLP) can help define an extraction pipeline for accomplishing a structured format transformation strategy. OBJECTIVES In this paper, we build an NLP pipeline to extract values of the classification of malignant tumors (TNM) from unstructured and semi-structured pathology reports and import them further to a structured data source for a clinical study. Our research interest is not focused on standard performance metrics like precision, recall, and F-measure on the test and validation data. We discuss how with the help of software programming techniques the readability of rule-based (RB) information extraction (IE) pipelines can be improved, and therefore minimize the time to correct or update the rules, and efficiently import them to another programming language. METHODS The extract rules were manually programmed with training data of TNM classification and tested in two separate pipelines based on design specifications from domain experts and data curators. Firstly we implemented each rule directly in one line for each extraction item. Secondly, we reprogrammed them in a readable fashion through decomposition and intention-revealing names for the variable declaration. To measure the impact of both methods we measure the time for the fine-tuning and programming of the extractions through test data of semi-structured and unstructured texts. RESULTS We analyze the benefits of improving through readability of the writing of rules, through parallel programming with regular expressions (REGEX), and the Apache Uima Ruta language (AURL). The time for correcting the readable rules in AURL and REGEX was significantly reduced. Complicated rules in REGEX are decomposed and intention-revealing declarations were reprogrammed in AURL in 5 min. CONCLUSION We discuss the importance of factor readability and how can it be improved when programming RB text IE pipelines. Independent of the features of the programming language and the tools applied, a readable coding strategy can be proven beneficial for future maintenance and offer an interpretable solution for understanding the extraction and for transferring the rules to other domains and NLP pipelines.
Collapse
Affiliation(s)
- Nektarios Ladas
- Peter L. Reichertz Institute for Medical Informatics, TU Braunschweig and Hannover Medical School, Hannover, Germany
| | - Florian Borchert
- Hasso-Plattner-Institut Fur Digital Engineering gGmbH, Potsdam, Germany
| | - Stefan Franz
- Peter L. Reichertz Institute for Medical Informatics, TU Braunschweig and Hannover Medical School, Hannover, Germany
| | - Alina Rehberg
- Peter L. Reichertz Institute for Medical Informatics, TU Braunschweig and Hannover Medical School, Hannover, Germany
| | - Natalia Strauch
- Peter L. Reichertz Institute for Medical Informatics, TU Braunschweig and Hannover Medical School, Hannover, Germany
| | - Kim Katrin Sommer
- Peter L. Reichertz Institute for Medical Informatics, TU Braunschweig and Hannover Medical School, Hannover, Germany
| | - Michael Marschollek
- Peter L. Reichertz Institute for Medical Informatics, TU Braunschweig and Hannover Medical School, Hannover, Germany
| | - Matthias Gietzelt
- Peter L. Reichertz Institute for Medical Informatics, TU Braunschweig and Hannover Medical School, Hannover, Germany
| |
Collapse
|
6
|
Grabar N, Grouin C. Year 2020 (with COVID): Observation of Scientific Literature on Clinical Natural Language Processing. Yearb Med Inform 2021; 30:257-263. [PMID: 34479397 PMCID: PMC8416212 DOI: 10.1055/s-0041-1726528] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Objectives:
To analyze the content of publications within the medical NLP domain in 2020.
Methods:
Automatic and manual preselection of publications to be reviewed, and selection of the best NLP papers of the year. Analysis of the important issues.
Results:
Three best papers have been selected in 2020. We also propose an analysis of the content of the NLP publications in 2020, all topics included.
Conclusion:
The two main issues addressed in 2020 are related to the investigation of COVID-related questions and to the further adaptation and use of transformer models. Besides, the trends from the past years continue, such as diversification of languages processed and use of information from social networks
Collapse
Affiliation(s)
- Natalia Grabar
- Université Paris Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique, Orsay, France.,STL, CNRS, Université de Lille, Domaine du Pont-de-bois, Villeneuve-d'Ascq cedex, France
| | - Cyril Grouin
- Université Paris Saclay, CNRS, Laboratoire Interdisciplinaire des Sciences du Numérique, Orsay, France
| | | |
Collapse
|
7
|
Wulff A, Baier C, Ballout S, Tute E, Sommer KK, Kaase M, Sargeant A, Drenkhahn C, Schlüter D, Marschollek M, Scheithauer S. Transformation of microbiology data into a standardised data representation using OpenEHR. Sci Rep 2021; 11:10556. [PMID: 34006956 PMCID: PMC8131366 DOI: 10.1038/s41598-021-89796-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 04/29/2021] [Indexed: 12/22/2022] Open
Abstract
The spread of multidrug resistant organisms (MDRO) is a global healthcare challenge. Nosocomial outbreaks caused by MDRO are an important contributor to this threat. Computer-based applications facilitating outbreak detection can be essential to address this issue. To allow application reusability across institutions, the various heterogeneous microbiology data representations needs to be transformed into standardised, unambiguous data models. In this work, we present a multi-centric standardisation approach by using openEHR as modelling standard. Data models have been consented in a multicentre and international approach. Participating sites integrated microbiology reports from primary source systems into an openEHR-based data platform. For evaluation, we implemented a prototypical application, compared the transformed data with original reports and conducted automated data quality checks. We were able to develop standardised and interoperable microbiology data models. The publicly available data models can be used across institutions to transform real-life microbiology reports into standardised representations. The implementation of a proof-of-principle and quality control application demonstrated that the new formats as well as the integration processes are feasible. Holistic transformation of microbiological data into standardised openEHR based formats is feasible in a real-life multicentre setting and lays the foundation for developing cross-institutional, automated outbreak detection systems.
Collapse
Affiliation(s)
- Antje Wulff
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany.
| | - Claas Baier
- Institute for Medical Microbiology and Hospital Epidemiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany.
| | - Sarah Ballout
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Erik Tute
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Kim Katrin Sommer
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Martin Kaase
- Institute of Infection Control and Infectious Diseases, University Medical Center Göttingen (UMG), Georg-August University Göttingen, Robert-Koch-Str. 40, 37075, Göttingen, Germany
| | - Anneka Sargeant
- Institute of Medical Informatics, University Medical Center Göttingen (UMG), Georg-August University Göttingen, Robert-Koch-Str. 40, 37075, Göttingen, Germany
| | - Cora Drenkhahn
- IT Center for Clinical Research (ITCR-L) and Institute of Medical Informatics (IMI), University of Lübeck, Ratzeburger Allee 160, 23538, Lübeck, Germany
| | | | - Dirk Schlüter
- Institute for Medical Microbiology and Hospital Epidemiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Michael Marschollek
- Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Simone Scheithauer
- Institute of Infection Control and Infectious Diseases, University Medical Center Göttingen (UMG), Georg-August University Göttingen, Robert-Koch-Str. 40, 37075, Göttingen, Germany
| |
Collapse
|
8
|
Gomes DC, Abreu N, Sousa P, Moro C, Carvalho DR, Cubas MR. Representation of Diagnosis and Nursing Interventions in OpenEHR Archetypes. Appl Clin Inform 2021; 12:340-347. [PMID: 33853142 DOI: 10.1055/s-0041-1728706] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
Abstract
OBJECTIVE The study aimed to represent the content of nursing diagnosis and interventions in the openEHR standard. METHODS This is a developmental study with the models developed according to ISO 18104: 2014. The Ocean Archetype Editor tool from the openEHR Foundation was used. RESULTS Two archetypes were created; one to represent the nursing diagnosis concept and the other the nursing intervention concept. Existing archetypes available in the Clinical Knowledge Manager were reused in modeling. CONCLUSION The representation of nursing diagnosis and interventions based on the openEHR standard contributes to representing nursing care phenomena and needs in health information systems.
Collapse
Affiliation(s)
- Denilsen Carvalho Gomes
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| | - Nuno Abreu
- Department of Medicine, Centro Hospitalar Universitário do Porto, Porto, Portugal
| | - Paulino Sousa
- Center for Research in Health Technologies and Information Systems (CINTESIS), Faculty of Medicine, University of Porto, Porto, Portugal
| | - Claudia Moro
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| | - Deborah Ribeiro Carvalho
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| | - Marcia Regina Cubas
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| |
Collapse
|