1
|
Muizelaar H, Haas M, van Dortmont K, van der Putten P, Spruit M. Extracting patient lifestyle characteristics from Dutch clinical text with BERT models. BMC Med Inform Decis Mak 2024; 24:151. [PMID: 38831420 PMCID: PMC11149227 DOI: 10.1186/s12911-024-02557-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 05/28/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND BERT models have seen widespread use on unstructured text within the clinical domain. However, little to no research has been conducted into classifying unstructured clinical notes on the basis of patient lifestyle indicators, especially in Dutch. This article aims to test the feasibility of deep BERT models on the task of patient lifestyle classification, as well as introducing an experimental framework that is easily reproducible in future research. METHODS This study makes use of unstructured general patient text data from HagaZiekenhuis, a large hospital in The Netherlands. Over 148 000 notes were provided to us, which were each automatically labelled on the basis of the respective patients' smoking, alcohol usage and drug usage statuses. In this paper we test feasibility of automatically assigning labels, and justify it using hand-labelled input. Ultimately, we compare macro F1-scores of string matching, SGD and several BERT models on the task of classifying smoking, alcohol and drug usage. We test Dutch BERT models and English models with translated input. RESULTS We find that our further pre-trained MedRoBERTa.nl-HAGA model outperformed every other model on smoking (0.93) and drug usage (0.77). Interestingly, our ClinicalBERT model that was merely fine-tuned on translated text performed best on the alcohol task (0.80). In t-SNE visualisations, we show our MedRoBERTa.nl-HAGA model is the best model to differentiate between classes in the embedding space, explaining its superior classification performance. CONCLUSIONS We suggest MedRoBERTa.nl-HAGA to be used as a baseline in future research on Dutch free text patient lifestyle classification. We furthermore strongly suggest further exploring the application of translation to input text in non-English clinical BERT research, as we only translated a subset of the full set and yet achieved very promising results.
Collapse
Affiliation(s)
- Hielke Muizelaar
- LIACS, Leiden University, P.O. Box 9512, Leiden, 2300RA, The Netherlands.
- Department of Public Health and Primary Care, Leiden University Medical Center, Albinusdreef 2, Leiden, 2333ZA, The Netherlands.
| | - Marcel Haas
- Department of Public Health and Primary Care, Leiden University Medical Center, Albinusdreef 2, Leiden, 2333ZA, The Netherlands
| | - Koert van Dortmont
- Department of Business Intelligence, HagaZiekenhuis, Els Borst-Eilersplein 275, Den Haag, 2545AA, The Netherlands
| | | | - Marco Spruit
- LIACS, Leiden University, P.O. Box 9512, Leiden, 2300RA, The Netherlands
- Department of Public Health and Primary Care, Leiden University Medical Center, Albinusdreef 2, Leiden, 2333ZA, The Netherlands
| |
Collapse
|
2
|
Ebrahimi A, Henriksen MBH, Brasen CL, Hilberg O, Hansen TF, Jensen LH, Peimankar A, Wiil UK. Identification of patients' smoking status using an explainable AI approach: a Danish electronic health records case study. BMC Med Res Methodol 2024; 24:114. [PMID: 38760718 PMCID: PMC11100078 DOI: 10.1186/s12874-024-02231-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2023] [Accepted: 04/23/2024] [Indexed: 05/19/2024] Open
Abstract
BACKGROUND Smoking is a critical risk factor responsible for over eight million annual deaths worldwide. It is essential to obtain information on smoking habits to advance research and implement preventive measures such as screening of high-risk individuals. In most countries, including Denmark, smoking habits are not systematically recorded and at best documented within unstructured free-text segments of electronic health records (EHRs). This would require researchers and clinicians to manually navigate through extensive amounts of unstructured data, which is one of the main reasons that smoking habits are rarely integrated into larger studies. Our aim is to develop machine learning models to classify patients' smoking status from their EHRs. METHODS This study proposes an efficient natural language processing (NLP) pipeline capable of classifying patients' smoking status and providing explanations for the decisions. The proposed NLP pipeline comprises four distinct components, which are; (1) considering preprocessing techniques to address abbreviations, punctuation, and other textual irregularities, (2) four cutting-edge feature extraction techniques, i.e. Embedding, BERT, Word2Vec, and Count Vectorizer, employed to extract the optimal features, (3) utilization of a Stacking-based Ensemble (SE) model and a Convolutional Long Short-Term Memory Neural Network (CNN-LSTM) for the identification of smoking status, and (4) application of a local interpretable model-agnostic explanation to explain the decisions rendered by the detection models. The EHRs of 23,132 patients with suspected lung cancer were collected from the Region of Southern Denmark during the period 1/1/2009-31/12/2018. A medical professional annotated the data into 'Smoker' and 'Non-Smoker' with further classifications as 'Active-Smoker', 'Former-Smoker', and 'Never-Smoker'. Subsequently, the annotated dataset was used for the development of binary and multiclass classification models. An extensive comparison was conducted of the detection performance across various model architectures. RESULTS The results of experimental validation confirm the consistency among the models. However, for binary classification, BERT method with CNN-LSTM architecture outperformed other models by achieving precision, recall, and F1-scores between 97% and 99% for both Never-Smokers and Active-Smokers. In multiclass classification, the Embedding technique with CNN-LSTM architecture yielded the most favorable results in class-specific evaluations, with equal performance measures of 97% for Never-Smoker and measures in the range of 86 to 89% for Active-Smoker and 91-92% for Never-Smoker. CONCLUSION Our proposed NLP pipeline achieved a high level of classification performance. In addition, we presented the explanation of the decision made by the best performing detection model. Future work will expand the model's capabilities to analyze longer notes and a broader range of categories to maximize its utility in further research and screening applications.
Collapse
Affiliation(s)
- Ali Ebrahimi
- SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, 5230, Denmark.
| | | | - Claus Lohman Brasen
- Department of Biochemistry and Immunology, Lillebaelt Hospital, University Hospital of Southern Denmark, Vejle, 7100, Denmark
- Institute of Regional Health Research, University of Southern Denmark, Odense, Denmark
| | - Ole Hilberg
- Institute of Regional Health Research, University of Southern Denmark, Odense, Denmark
- Department of Internal Medicine, Lillebaelt Hospital, University Hospital of Southern Denmark, Vejle, 7100, Denmark
| | - Torben Frøstrup Hansen
- Department of Oncology, Lillebaelt Hospital, University Hospital of Southern Denmark, Vejle, 7100, Denmark
- Institute of Regional Health Research, University of Southern Denmark, Odense, Denmark
| | - Lars Henrik Jensen
- Department of Oncology, Lillebaelt Hospital, University Hospital of Southern Denmark, Vejle, 7100, Denmark
| | - Abdolrahman Peimankar
- SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, 5230, Denmark
| | - Uffe Kock Wiil
- SDU Health Informatics and Technology, The Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Odense, 5230, Denmark
| |
Collapse
|
3
|
Haque MA, Gedara MLB, Nickel N, Turgeon M, Lix LM. The validity of electronic health data for measuring smoking status: a systematic review and meta-analysis. BMC Med Inform Decis Mak 2024; 24:33. [PMID: 38308231 PMCID: PMC10836023 DOI: 10.1186/s12911-024-02416-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 01/03/2024] [Indexed: 02/04/2024] Open
Abstract
BACKGROUND Smoking is a risk factor for many chronic diseases. Multiple smoking status ascertainment algorithms have been developed for population-based electronic health databases such as administrative databases and electronic medical records (EMRs). Evidence syntheses of algorithm validation studies have often focused on chronic diseases rather than risk factors. We conducted a systematic review and meta-analysis of smoking status ascertainment algorithms to describe the characteristics and validity of these algorithms. METHODS The Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines were followed. We searched articles published from 1990 to 2022 in EMBASE, MEDLINE, Scopus, and Web of Science with key terms such as validity, administrative data, electronic health records, smoking, and tobacco use. The extracted information, including article characteristics, algorithm characteristics, and validity measures, was descriptively analyzed. Sources of heterogeneity in validity measures were estimated using a meta-regression model. Risk of bias (ROB) in the reviewed articles was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool. RESULTS The initial search yielded 2086 articles; 57 were selected for review and 116 algorithms were identified. Almost three-quarters (71.6%) of algorithms were based on EMR data. The algorithms were primarily constructed using diagnosis codes for smoking-related conditions, although prescription medication codes for smoking treatments were also adopted. About half of the algorithms were developed using machine-learning models. The pooled estimates of positive predictive value, sensitivity, and specificity were 0.843, 0.672, and 0.918 respectively. Algorithm sensitivity and specificity were highly variable and ranged from 3 to 100% and 36 to 100%, respectively. Model-based algorithms had significantly greater sensitivity (p = 0.006) than rule-based algorithms. Algorithms for EMR data had higher sensitivity than algorithms for administrative data (p = 0.001). The ROB was low in most of the articles (76.3%) that underwent the assessment. CONCLUSIONS Multiple algorithms using different data sources and methods have been proposed to ascertain smoking status in electronic health data. Many algorithms had low sensitivity and positive predictive value, but the data source influenced their validity. Algorithms based on machine-learning models for multiple linked data sources have improved validity.
Collapse
Affiliation(s)
- Md Ashiqul Haque
- Department of Community Health Sciences, University of Manitoba, Winnipeg, MB, Canada
| | | | - Nathan Nickel
- Department of Community Health Sciences, University of Manitoba, Winnipeg, MB, Canada
| | - Maxime Turgeon
- Department of Statistics, University of Manitoba, Winnipeg, MB, Canada
| | - Lisa M Lix
- Department of Community Health Sciences, University of Manitoba, Winnipeg, MB, Canada.
| |
Collapse
|
4
|
Matthews AK, Inwanna S, Oyaluade D, Akufo J, Jeremiah R, Kim SJ. Examination of provider knowledge, attitudes, and behaviors associated with lung cancer screening among Black men receiving care at a federally qualified health center. QUALITATIVE RESEARCH IN MEDICINE & HEALTHCARE 2023; 7:11546. [PMID: 38115824 PMCID: PMC10726993 DOI: 10.4081/qrmh.2023.11546] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 11/07/2023] [Indexed: 12/21/2023] Open
Abstract
The study's goal was to look at providers' knowledge, attitudes, and behaviors regarding lung cancer screening among Black male smokers served by a federally qualified healthcare center. Participants in the study were interviewed in depth. Participants completed a short (5-10 minute) survey that assessed demographics, training, and attitudes toward lung cancer screening. For quantitative data, descriptive statistics were used, and for qualitative data, deductive thematic analysis was used. This study included ten healthcare professionals, the majority of whom identified as Black (80%) and were trained as advanced practice providers (60%). The majority of providers (90%) have heard of LDCT lung cancer screening; however, participants reported only being "somewhat" familiar with the LDCT eligibility criteria (70%). Despite generally positive attitudes toward LDCT, patient referral rates for screening were low. Barriers included a lack of provider knowledge about screening eligibility, a lack of use of shared decision-making tools, and patient concerns about screening risks. The reasons for the low referral rates varied, but they included a preference to refer patients for smoking cessation rather than screening, low screening completion and follow-up rates among referred patients, and a lower likelihood that Black smokers will meet pack-year requirements for screening. Additionally, providers discussed patient-level factors such as a lack of information, mistrust, and transportation. The study findings add to the body of knowledge about lung cancer knowledge and screening practices among providers in FQHC settings. This data can be used to create health promotion interventions aimed at smoking cessation and lung cancer screening in Black males and other high-risk smokers.
Collapse
Affiliation(s)
| | - Suchanart Inwanna
- The University of Illinois Chicago, College of Nursing, Chicago, IL, United States
- Ramathibodi School of Nursing, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | - Dami Oyaluade
- The University of Illinois Hospital, Cancer Center, Chicago, IL, United States
| | - Jennifer Akufo
- The University of Illinois Chicago, College of Nursing, Chicago, IL, United States
| | - Rohan Jeremiah
- The University of Illinois Chicago, College of Nursing, Chicago, IL, United States
| | - Sage J. Kim
- The University of Illinois Chicago, School of Public Health, Chicago, IL, United States
| |
Collapse
|
5
|
Lewis AE, Weiskopf N, Abrams ZB, Foraker R, Lai AM, Payne PRO, Gupta A. Electronic health record data quality assessment and tools: a systematic review. J Am Med Inform Assoc 2023; 30:1730-1740. [PMID: 37390812 PMCID: PMC10531113 DOI: 10.1093/jamia/ocad120] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 05/16/2023] [Accepted: 06/23/2023] [Indexed: 07/02/2023] Open
Abstract
OBJECTIVE We extended a 2013 literature review on electronic health record (EHR) data quality assessment approaches and tools to determine recent improvements or changes in EHR data quality assessment methodologies. MATERIALS AND METHODS We completed a systematic review of PubMed articles from 2013 to April 2023 that discussed the quality assessment of EHR data. We screened and reviewed papers for the dimensions and methods defined in the original 2013 manuscript. We categorized papers as data quality outcomes of interest, tools, or opinion pieces. We abstracted and defined additional themes and methods though an iterative review process. RESULTS We included 103 papers in the review, of which 73 were data quality outcomes of interest papers, 22 were tools, and 8 were opinion pieces. The most common dimension of data quality assessed was completeness, followed by correctness, concordance, plausibility, and currency. We abstracted conformance and bias as 2 additional dimensions of data quality and structural agreement as an additional methodology. DISCUSSION There has been an increase in EHR data quality assessment publications since the original 2013 review. Consistent dimensions of EHR data quality continue to be assessed across applications. Despite consistent patterns of assessment, there still does not exist a standard approach for assessing EHR data quality. CONCLUSION Guidelines are needed for EHR data quality assessment to improve the efficiency, transparency, comparability, and interoperability of data quality assessment. These guidelines must be both scalable and flexible. Automation could be helpful in generalizing this process.
Collapse
Affiliation(s)
- Abigail E Lewis
- Division of Computational and Data Sciences, Washington University in St. Louis, St. Louis, Missouri, USA
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Nicole Weiskopf
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Zachary B Abrams
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Randi Foraker
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Albert M Lai
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Philip R O Payne
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| | - Aditi Gupta
- Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
| |
Collapse
|
6
|
Carroll NM, Burnett-Hartman AN, Rendle KA, Neslund-Dudas CM, Greenlee RT, Honda SA, Vachani A, Ritzwoller DP. Smoking status and the association between patient-level factors and survival among lung cancer patients. J Natl Cancer Inst 2023; 115:937-948. [PMID: 37228018 PMCID: PMC10407692 DOI: 10.1093/jnci/djad098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 05/16/2023] [Accepted: 05/19/2023] [Indexed: 05/27/2023] Open
Abstract
BACKGROUND Declines in the prevalence of cigarette smoking, advances in targeted therapies, and implementation of lung cancer screening have changed the clinical landscape for lung cancer. The proportion of lung cancer deaths is increasing in those who have never smoked cigarettes. To better understand contemporary patterns in survival among patients with lung cancer, a comprehensive evaluation of factors associated with survival, including differential associations by smoking status, is needed. METHODS Patients diagnosed with lung cancer between January 1, 2010, and September 30, 2019, were identified. We estimated all-cause and lung cancer-specific median, 5-year, and multivariable restricted mean survival time (RMST) to identify demographic, socioeconomic, and clinical factors associated with survival, overall and stratified by smoking status (never, former, and current). RESULTS Analyses included 6813 patients with lung cancer: 13.9% never smoked, 54.2% formerly smoked, and 31.9% currently smoked. All-cause RMST through 5 years for those who never, formerly, and currently smoked was 32.1, 25.9, and 23.3 months, respectively. Lung cancer-specific RMST was 36.3 months, 30.3 months, and 26.0 months, respectively. Across most models, female sex, younger age, higher socioeconomic measures, first-course surgery, histology, and body mass index were positively associated, and higher stage was inversely associated with survival. Relative to White patients, Black patients had increased survival among those who formerly smoked. CONCLUSIONS We identify actionable factors associated with survival between those who never, formerly, and currently smoked cigarettes. These findings illuminate opportunities to address underlying mechanisms driving lung cancer progression, including use of first-course treatment, and enhanced implementation of tailored smoking cessation interventions for individuals diagnosed with cancer.
Collapse
Affiliation(s)
- Nikki M Carroll
- Institute for Health Research, Kaiser Permanente Colorado, Denver, CO, USA
| | - Andrea N Burnett-Hartman
- Institute for Health Research, Kaiser Permanente Colorado, Denver, CO, USA
- Department of Health Systems Science, Kaiser Permanente Bernard J. Tyson School of Medicine, Pasadena, CA, USA
| | - Katharine A Rendle
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | | | | | - Stacey A Honda
- Hawaii Permanente Medical Group, Center for Integrated Healthcare Research, Kaiser Permanente Hawaii, Honolulu, HI, USA
| | - Anil Vachani
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Debra P Ritzwoller
- Institute for Health Research, Kaiser Permanente Colorado, Denver, CO, USA
| |
Collapse
|
7
|
Riskin D, Cady R, Shroff A, Hindiyeh NA, Smith T, Kymes S. Using artificial intelligence to identify patients with migraine and associated symptoms and conditions within electronic health records. BMC Med Inform Decis Mak 2023; 23:121. [PMID: 37452338 PMCID: PMC10349448 DOI: 10.1186/s12911-023-02190-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 05/04/2023] [Indexed: 07/18/2023] Open
Abstract
BACKGROUND Real-world evidence (RWE)-based on information obtained from sources such as electronic health records (EHRs), claims and billing databases, product and disease registries, and personal devices and health applications-is increasingly used to support healthcare decision making. There is variability in the collection of EHR data, which includes "structured data" in predefined fields (e.g., problem list, open claims, medication list, etc.) and "unstructured data" as free text or narrative. Healthcare providers are likely to provide more complete information as free text, but extracting meaning from these fields requires newer technologies and a rigorous methodology to generate higher-quality evidence. Herein, an approach to identify concepts associated with the presence and progression of migraine was developed and validated using the complete patient record in EHR data, including both the structured and unstructured portions. METHODS "Traditional RWE" approaches (i.e., capture from structured EHR fields and extraction using structured queries) and "Advanced RWE" approaches (i.e., capture from unstructured EHR data and processing by artificial intelligence [AI] technology, including natural language processing and AI-based inference) were evaluated against a manual chart abstraction reference standard for data collected from a tertiary care setting. The primary endpoint was recall; differences were compared using chi square. RESULTS Compared with manual chart abstraction, recall for migraine and headache were 66.6% and 29.6%, respectively, for Traditional RWE, and 96.8% and 92.9% for Advanced RWE; differences were statistically significant (absolute differences, 30.2% and 63.3%; P < 0.001). Recall of 6 migraine-associated symptoms favored Advanced RWE over Traditional RWE to a greater extent (absolute differences, 71.5-88.8%; P < 0.001). The difference between traditional and advanced techniques for recall of migraine medications was less pronounced, approximately 80% for Traditional RWE and ≥ 98% for Advanced RWE (P < 0.001). CONCLUSION Unstructured EHR data, processed using AI technologies, provides a more credible approach to enable RWE in migraine than using structured EHR and claims data alone. An algorithm was developed that could be used to further study and validate the use of RWE to support diagnosis and management of patients with migraine.
Collapse
Affiliation(s)
| | - Roger Cady
- RK Consults, Ozark, MO USA
- Missouri State University, Springfield, MO USA
- Axon Therapeutics, San Diego, CA USA
| | | | | | | | | |
Collapse
|
8
|
Syed R, Eden R, Makasi T, Chukwudi I, Mamudu A, Kamalpour M, Kapugama Geeganage D, Sadeghianasl S, Leemans SJJ, Goel K, Andrews R, Wynn MT, Ter Hofstede A, Myers T. Digital Health Data Quality Issues: Systematic Review. J Med Internet Res 2023; 25:e42615. [PMID: 37000497 PMCID: PMC10131725 DOI: 10.2196/42615] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 12/07/2022] [Accepted: 12/31/2022] [Indexed: 04/01/2023] Open
Abstract
BACKGROUND The promise of digital health is principally dependent on the ability to electronically capture data that can be analyzed to improve decision-making. However, the ability to effectively harness data has proven elusive, largely because of the quality of the data captured. Despite the importance of data quality (DQ), an agreed-upon DQ taxonomy evades literature. When consolidated frameworks are developed, the dimensions are often fragmented, without consideration of the interrelationships among the dimensions or their resultant impact. OBJECTIVE The aim of this study was to develop a consolidated digital health DQ dimension and outcome (DQ-DO) framework to provide insights into 3 research questions: What are the dimensions of digital health DQ? How are the dimensions of digital health DQ related? and What are the impacts of digital health DQ? METHODS Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, a developmental systematic literature review was conducted of peer-reviewed literature focusing on digital health DQ in predominately hospital settings. A total of 227 relevant articles were retrieved and inductively analyzed to identify digital health DQ dimensions and outcomes. The inductive analysis was performed through open coding, constant comparison, and card sorting with subject matter experts to identify digital health DQ dimensions and digital health DQ outcomes. Subsequently, a computer-assisted analysis was performed and verified by DQ experts to identify the interrelationships among the DQ dimensions and relationships between DQ dimensions and outcomes. The analysis resulted in the development of the DQ-DO framework. RESULTS The digital health DQ-DO framework consists of 6 dimensions of DQ, namely accessibility, accuracy, completeness, consistency, contextual validity, and currency; interrelationships among the dimensions of digital health DQ, with consistency being the most influential dimension impacting all other digital health DQ dimensions; 5 digital health DQ outcomes, namely clinical, clinician, research-related, business process, and organizational outcomes; and relationships between the digital health DQ dimensions and DQ outcomes, with the consistency and accessibility dimensions impacting all DQ outcomes. CONCLUSIONS The DQ-DO framework developed in this study demonstrates the complexity of digital health DQ and the necessity for reducing digital health DQ issues. The framework further provides health care executives with holistic insights into DQ issues and resultant outcomes, which can help them prioritize which DQ-related problems to tackle first.
Collapse
Affiliation(s)
- Rehan Syed
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Rebekah Eden
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Tendai Makasi
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Ignatius Chukwudi
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Azumah Mamudu
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Mostafa Kamalpour
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Dakshi Kapugama Geeganage
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Sareh Sadeghianasl
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Sander J J Leemans
- Rheinisch-Westfälische Technische Hochschule, Aachen University, Aachen, Germany
| | - Kanika Goel
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Robert Andrews
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Moe Thandar Wynn
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Arthur Ter Hofstede
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| | - Trina Myers
- School of Information Systems, Faculty of Science, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
9
|
Ardesch F, Meulendijk M, Kist J, Vos R, Vos H, Kiefte-de Jong J, Spruit M, Bruijnzeels M, Bussemaker M, Numans M, Struijs J. A data-driven population health management approach: The Extramural LUMC Academic Network data infrastructure. Health Policy 2023; 132:104769. [PMID: 37018883 DOI: 10.1016/j.healthpol.2023.104769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 02/27/2023] [Accepted: 03/09/2023] [Indexed: 03/17/2023]
Abstract
Improving population health and reducing inequalities through better integrated health and social care services is high up on the agenda of policymakers internationally. In recent years, regional cross-domain partnerships have emerged in several countries, which aim to achieve better population health, quality of care and a reduction in the per capita costs. These cross-domain partnerships aim to have a strong data foundation and are committed to continuous learning in which data plays an essential role. This paper describes our approach towards the development of the regional integrative population-based data infrastructure Extramural LUMC (Leiden University Medical Center) Academic Network (ELAN), in which we linked routinely collected medical, social and public health data at the patient level from the greater The Hague and Leiden area. Furthermore, we discuss the methodological issues of routine care data and the lessons learned about privacy, legislation and reciprocities. The initiative presented in this paper is relevant for international researchers and policy-makers because a unique data infrastructure has been set up that contains data across different domains, providing insights into societal issues and scientific questions that are important for data driven population health management approaches.
Collapse
|
10
|
Groenhof TKJ, Haitjema S, Lely AT, Grobbee DE, Asselbergs FW, Bots ML. Optimizing cardiovascular risk assessment and registration in a developing cardiovascular learning health care system: Women benefit most. PLOS DIGITAL HEALTH 2023; 2:e0000190. [PMID: 36812613 PMCID: PMC9931327 DOI: 10.1371/journal.pdig.0000190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 12/30/2022] [Indexed: 02/11/2023]
Abstract
Since 2015 we organized a uniform, structured collection of a fixed set of cardiovascular risk factors according the (inter)national guidelines on cardiovascular risk management. We evaluated the current state of a developing cardiovascular towards learning healthcare system-the Utrecht Cardiovascular Cohort Cardiovascular Risk Management (UCC-CVRM)-and its potential effect on guideline adherence in cardiovascular risk management. We conducted a before-after study comparing data from patients included in UCC-CVRM (2015-2018) and patients treated in our center before UCC-CVRM (2013-2015) who would have been eligible for UCC-CVRM using the Utrecht Patient Oriented Database (UPOD). Proportions of cardiovascular risk factor measurement before and after UCC-CVRM initiation were compared, as were proportions of patients that required (change of) blood pressure, lipid, or blood glucose lowering treatment. We estimated the likelihood to miss patients with hypertension, dyslipidemia, and elevated HbA1c before UCC-CVRM for the whole cohort and stratified for sex. In the present study, patients included up to October 2018 (n = 1904) were matched with 7195 UPOD patients with similar age, sex, department of referral and diagnose description. Completeness of risk factor measurement increased, ranging from 0% -77% before to 82%-94% after UCC-CVRM initiation. Before UCC-CVRM, we found more unmeasured risk factors in women compared to men. This sex-gap resolved in UCC-CVRM. The likelihood to miss hypertension, dyslipidemia, and elevated HbA1c was reduced by 67%, 75% and 90%, respectively, after UCC-CVRM initiation. A finding more pronounced in women compared to men. In conclusion, a systematic registration of the cardiovascular risk profile substantially improves guideline adherent assessment and decreases the risk of missing patients with elevated levels with an indication for treatment. The sex-gap disappeared after UCC-CVRM initiation. Thus, an LHS approach contributes to a more inclusive insight into quality of care and prevention of cardiovascular disease (progression).
Collapse
Affiliation(s)
- T. Katrien J. Groenhof
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Saskia Haitjema
- Laboratory of Clinical Chemistry and Haematology, University Medical Center Utrecht, Utrecht University, The Netherlands
| | - A. Titia Lely
- Wilhelmina Children’s Hospital Birth Centre, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Diederick E. Grobbee
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Folkert W. Asselbergs
- Department of Cardiology, Division Heart & Lungs, University Medical Center Utrecht, Utrecht University, The Netherlands,Institute of Cardiovascular Science, Faculty of Population Health Sciences, University College London, London, United Kingdom,Health Data Research UK, Institute of Health Informatics, University College London, London, United Kingdom
| | - Michiel L. Bots
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands,* E-mail:
| | | |
Collapse
|
11
|
de Boer AR, de Groot MCH, Groenhof TKJ, van Doorn S, Vaartjes I, Bots ML, Haitjema S. Data mining to retrieve smoking status from electronic health records in general practice . EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2022; 3:437-444. [PMID: 36712169 PMCID: PMC9707867 DOI: 10.1093/ehjdh/ztac031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 04/19/2022] [Indexed: 02/01/2023]
Abstract
Aims Optimize and assess the performance of an existing data mining algorithm for smoking status from hospital electronic health records (EHRs) in general practice EHRs. Methods and results We optimized an existing algorithm in a training set containing all clinical notes from 498 individuals (75 712 contact moments) from the Julius General Practitioners' Network (JGPN). Each moment was classified as either 'current smoker', 'former smoker', 'never smoker', or 'no information'. As a reference, we manually reviewed EHRs. Algorithm performance was assessed in an independent test set (n = 494, 78 129 moments) using precision, recall, and F1-score. Test set algorithm performance for 'current smoker' was precision 79.7%, recall 78.3%, and F1-score 0.79. For former smoker, it was precision 73.8%, recall 64.0%, and F1-score 0.69. For never smoker, it was precision 92.0%, recall 74.9%, and F1-score 0.83. On a patient level, performance for ever smoker (current and former smoker combined) was precision 87.9%, recall 94.7%, and F1-score 0.91. For never smoker, it was 98.0, 82.0, and 0.89%, respectively. We found a more narrative writing style in general practice than in hospital EHRs. Conclusion Data mining can successfully retrieve smoking status information from general practice clinical notes with a good performance for classifying ever and never smokers. Differences between general practice and hospital EHRs call for optimization of data mining algorithms when applied beyond a primary development setting.
Collapse
Affiliation(s)
| | - Mark C H de Groot
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, The Netherlands
| | - T Katrien J Groenhof
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Heidelberglaan 100, Utrecht 3584 CX, The Netherlands
| | - Sander van Doorn
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Heidelberglaan 100, Utrecht 3584 CX, The Netherlands
| | - Ilonca Vaartjes
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Heidelberglaan 100, Utrecht 3584 CX, The Netherlands,Dutch Heart Foundation, The Hague, The Netherlands
| | - Michiel L Bots
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Heidelberglaan 100, Utrecht 3584 CX, The Netherlands
| | - Saskia Haitjema
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, The Netherlands
| |
Collapse
|
12
|
Chintalapudi N, Angeloni U, Battineni G, di Canio M, Marotta C, Rezza G, Sagaro GG, Silenzi A, Amenta F. LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining. Bioengineering (Basel) 2022; 9:bioengineering9030124. [PMID: 35324813 PMCID: PMC8945331 DOI: 10.3390/bioengineering9030124] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/02/2022] [Accepted: 03/16/2022] [Indexed: 12/31/2022] Open
Abstract
Generally, seafarers face a higher risk of illnesses and accidents than land workers. In most cases, there are no medical professionals on board seagoing vessels, which makes disease diagnosis even more difficult. When this occurs, onshore doctors may be able to provide medical advice through telemedicine by receiving better symptomatic and clinical details in the health abstracts of seafarers. The adoption of text mining techniques can assist in extracting diagnostic information from clinical texts. We applied lexicon sentimental analysis to explore the automatic labeling of positive and negative healthcare terms to seafarers’ text healthcare documents. This was due to the lack of experimental evaluations using computational techniques. In order to classify diseases and their associated symptoms, the LASSO regression algorithm is applied to analyze these text documents. A visualization of symptomatic data frequency for each disease can be achieved by analyzing TF-IDF values. The proposed approach allows for the classification of text documents with 93.8% accuracy by using a machine learning model called LASSO regression. It is possible to classify text documents effectively with tidy text mining libraries. In addition to delivering health assistance, this method can be used to classify diseases and establish health observatories. Knowledge developed in the present work will be applied to establish an Epidemiological Observatory of Seafarers’ Pathologies and Injuries. This Observatory will be a collaborative initiative of the Italian Ministry of Health, University of Camerino, and International Radio Medical Centre (C.I.R.M.), the Italian TMAS.
Collapse
Affiliation(s)
- Nalini Chintalapudi
- Clinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, Italy; (G.B.); (M.d.C.); (G.G.S.); (F.A.)
- Correspondence: ; Tel.: +39-35-33776704
| | - Ulrico Angeloni
- General Directorate of Health Prevention, Ministry of Health, 00144 Rome, Italy; (U.A.); (C.M.); (G.R.); (A.S.)
| | - Gopi Battineni
- Clinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, Italy; (G.B.); (M.d.C.); (G.G.S.); (F.A.)
| | - Marzio di Canio
- Clinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, Italy; (G.B.); (M.d.C.); (G.G.S.); (F.A.)
- Research Department, International Radio Medical Centre (C.I.R.M.), 00144 Rome, Italy
| | - Claudia Marotta
- General Directorate of Health Prevention, Ministry of Health, 00144 Rome, Italy; (U.A.); (C.M.); (G.R.); (A.S.)
| | - Giovanni Rezza
- General Directorate of Health Prevention, Ministry of Health, 00144 Rome, Italy; (U.A.); (C.M.); (G.R.); (A.S.)
| | - Getu Gamo Sagaro
- Clinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, Italy; (G.B.); (M.d.C.); (G.G.S.); (F.A.)
| | - Andrea Silenzi
- General Directorate of Health Prevention, Ministry of Health, 00144 Rome, Italy; (U.A.); (C.M.); (G.R.); (A.S.)
| | - Francesco Amenta
- Clinical Research Centre, School of Medicinal and Health Products Sciences, University of Camerino, 62032 Camerino, Italy; (G.B.); (M.d.C.); (G.G.S.); (F.A.)
- Research Department, International Radio Medical Centre (C.I.R.M.), 00144 Rome, Italy
| |
Collapse
|
13
|
Overmars LM, van Es B, Groepenhoff F, De Groot MCH, Pasterkamp G, den Ruijter HM, van Solinge WW, Hoefer IE, Haitjema S. Preventing unnecessary imaging in patients suspect of coronary artery disease through machine learning of electronic health records. EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2021; 3:11-19. [PMID: 36713995 PMCID: PMC9707976 DOI: 10.1093/ehjdh/ztab103] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 11/22/2021] [Accepted: 12/02/2021] [Indexed: 02/01/2023]
Abstract
Aims With the ageing European population, the incidence of coronary artery disease (CAD) is expected to rise. This will likely result in an increased imaging use. Symptom recognition can be complicated, as symptoms caused by CAD can be atypical, particularly in women. Early CAD exclusion may help to optimize use of diagnostic resources and thus improve the sustainability of the healthcare system. To develop sex-stratified algorithms, trained on routinely available electronic health records (EHRs), raw electrocardiograms, and haematology data to exclude CAD in patients upfront. Methods and results We trained XGBoost algorithms on data from patients from the Utrecht Patient-Oriented Database, who underwent coronary computed tomography angiography (CCTA), and/or stress cardiac magnetic resonance (CMR) imaging, or stress single-photon emission computerized tomography (SPECT) in the UMC Utrecht. Outcomes were extracted from radiology reports. We aimed to maximize negative predictive value (NPV) to minimize the false negative risk with acceptable specificity. Of 6808 CCTA patients (31% female), 1029 females (48%) and 1908 males (45%) had no diagnosis of CAD. Of 3053 CMR/SPECT patients (45% female), 650 females (47%) and 881 males (48%) had no diagnosis of CAD. On the train and test set, the CCTA models achieved NPVs and specificities of 0.95 and 0.19 (females) and 0.96 and 0.09 (males). The CMR/SPECT models achieved NPVs and specificities of 0.75 and 0.041 (females) and 0.92 and 0.026 (males). Conclusion Coronary artery disease can be excluded from EHRs with high NPV. Our study demonstrates new possibilities to reduce unnecessary imaging in women and men suspected of CAD.
Collapse
Affiliation(s)
- L Malin Overmars
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, Heidelberglaan 100 3584 CX, the Netherlands
| | - Bram van Es
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, Heidelberglaan 100 3584 CX, the Netherlands
| | - Floor Groepenhoff
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, Heidelberglaan 100 3584 CX, the Netherlands,Laboratory of Experimental Cardiology, University Medical Center Utrecht, Heidelberglaan 100 3584 CX, Utrecht, the Netherlands
| | - Mark C H De Groot
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, Heidelberglaan 100 3584 CX, the Netherlands
| | - Gerard Pasterkamp
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, Heidelberglaan 100 3584 CX, the Netherlands
| | - Hester M den Ruijter
- Laboratory of Experimental Cardiology, University Medical Center Utrecht, Heidelberglaan 100 3584 CX, Utrecht, the Netherlands
| | - Wouter W van Solinge
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, Heidelberglaan 100 3584 CX, the Netherlands
| | - Imo E Hoefer
- Central Diagnostic Laboratory, University Medical Center Utrecht, Utrecht, Heidelberglaan 100 3584 CX, the Netherlands
| | | |
Collapse
|
14
|
Using Electronic Medical Records to Identify Potentially Eligible Study Subjects for Lung Cancer Screening with Biomarkers. Cancers (Basel) 2021; 13:cancers13215449. [PMID: 34771612 PMCID: PMC8582572 DOI: 10.3390/cancers13215449] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 10/18/2021] [Accepted: 10/19/2021] [Indexed: 11/18/2022] Open
Abstract
Simple Summary Recent cancer screening trials have found that using low-dose computed tomography (LDCT), compared to chest radiography, resulted in a significant reduction in lung cancer mortality. To effectively carry out this intervention, individuals at a high risk of developing lung cancer are targeted. However, accurately identifying and retaining these groups can be challenging. As electronic medical records (EMRs) contain important demographic and clinical information, they could be used to accurately identify subjects for screening. To determine whether EMRs can be used for this purpose, this paper examines the evidence around the use of EMRs in screening trials and the information contained in them that could be used to aid researchers in identifying eligible subjects. Abstract Lung cancer screening trials using low-dose computed tomography (LDCT) show reduced late-stage diagnosis and mortality rates. These trials have identified high-risk groups that would benefit from screening. However, these sub-populations can be difficult to access and retain in trials. Implementation of national screening programmes further suggests that there is poor uptake in eligible populations. A new approach to participant selection may be more effective. Electronic medical records (EMRs) are a viable alternative to population-based or health registries, as they contain detailed clinical and demographic information. Trials have identified that e-screening using EMRs has improved trial retention and eligible subject identification. As such, this paper argues for greater use of EMRs in trial recruitment and screening programmes. Moreover, this opinion paper explores the current issues in and approaches to lung cancer screening, whether records can be used to identify eligible subjects for screening and the challenges that researchers face when using EMR data.
Collapse
|
15
|
Abstract
Objective:
In this synopsis, we give an overview of recent research and propose a selection of best papers published in 2020 in the field of Clinical Information Systems (CIS).
Method:
As CIS section editors, we annually apply a systematic process to retrieve articles for the International Medical Informatics Association Yearbook of Medical Informatics. For seven years now, we use the same query to find relevant publications in the CIS field. Each year we retrieve more than 2,400 papers which we categorize in a multi-pass review to distill a preselection of 15 candidate papers. External reviewers and yearbook editors then assess the selected candidate papers. Based on the review results, the IMIA Yearbook editorial board chooses up to four best publications for the section at a selection meeting. To get an overview of the content of the retrieved articles, we use text mining and term co-occurrence mapping techniques.
Results:
We carried out the query in mid-January 2021 and retrieved a deduplicated result set of 2,787 articles from 1,135 different journals. We nominated 15 papers as candidates and finally selected four of them as the best papers in the CIS section. As in the previous years, the content analysis of the articles revealed the broad spectrum of topics covered by CIS research. Thus, this year we could observe a significant impact of COVID-19 on CIS research.
Conclusions:
The trends in CIS research, as seen in recent years, continue to be observable. What was very visible was the impact of the Corona Virus Disease 2019 (COVID-19) pandemic, which has affected not only our lives but also CIS.
Collapse
Affiliation(s)
- W O Hackl
- Institute of Medical Informatics, UMIT - Private University for Health Sciences, Medical Informatics and Technology, Hall in Tirol, Austria
| | - A Hoerbst
- Medical Technologies Department, MCI - The Entrepreneurial School, Innsbruck, Austria
| | | |
Collapse
|
16
|
Rankin NM, McWilliams A, Marshall HM. Lung cancer screening implementation: Complexities and priorities. Respirology 2021; 25 Suppl 2:5-23. [PMID: 33200529 DOI: 10.1111/resp.13963] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 10/06/2020] [Indexed: 12/17/2022]
Abstract
Lung cancer is the number one cause of cancer death worldwide. The benefits of lung cancer screening to reduce mortality and detect early-stage disease are no longer in any doubt based on the results of two landmark trials using LDCT. Lung cancer screening has been implemented in the US and South Korea and is under consideration by other communities. Successful translation of demonstrated research outcomes into the routine clinical setting requires careful implementation and co-ordinated input from multiple stakeholders. Implementation aspects may be specific to different healthcare settings. Important knowledge gaps remain, which must be addressed in order to optimize screening benefits and minimize screening harms. Lung cancer screening differs from all other cancer screening programmes as lung cancer risk is driven by smoking, a highly stigmatized behaviour. Stigma, along with other factors, can impact smokers' engagement with screening, meaning that smokers are generally 'hard to reach'. This review considers critical points along the patient journey. The first steps include selecting a risk threshold at which to screen, successfully engaging the target population and maximizing screening uptake. We review barriers to smoker engagement in lung and other cancer screening programmes. Recruitment strategies used in trials and real-world (clinical) programmes and associated screening uptake are reviewed. To aid cross-study comparisons, we propose a standardized nomenclature for recording and calculating recruitment outcomes. Once participants have engaged with the screening programme, we discuss programme components that are critical to maximize net benefit. A whole-of-programme approach is required including a standardized and multidisciplinary approach to pulmonary nodule management, incorporating probabilistic nodule risk assessment and longitudinal volumetric analysis, to reduce unnecessary downstream investigations and surgery; the integration of smoking cessation; and identification and intervention for other tobacco related diseases, such as coronary artery calcification and chronic obstructive pulmonary disease. National support, integrated with tobacco control programmes, and with appropriate funding, accreditation, data collection, quality assurance and reporting mechanisms will enhance lung cancer screening programme success and reduce the risks associated with opportunistic, ad hoc screening. Finally, implementation research must play a greater role in informing policy change about targeted LDCT screening programmes.
Collapse
Affiliation(s)
- Nicole M Rankin
- School of Public Health, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia
| | - Annette McWilliams
- Department of Respiratory Medicine, Fiona Stanley Hospital, Perth, WA, Australia.,Faculty of Health and Medical Sciences, University of Western Australia, Perth, WA, Australia.,Thoracic Tumour Collaborative of Western Australia, Western Australia Cancer and Palliative Care Network, Perth, WA, Australia
| | - Henry M Marshall
- Department of Thoracic Medicine, The Prince Charles Hospital, Brisbane, QLD, Australia.,The University of Queensland Thoracic Research Centre, Brisbane, QLD, Australia
| |
Collapse
|
17
|
Artificial Intelligence and the Medical Physicist: Welcome to the Machine. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11041691] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Artificial intelligence (AI) is a branch of computer science dedicated to giving machines or computers the ability to perform human-like cognitive functions, such as learning, problem-solving, and decision making. Since it is showing superior performance than well-trained human beings in many areas, such as image classification, object detection, speech recognition, and decision-making, AI is expected to change profoundly every area of science, including healthcare and the clinical application of physics to healthcare, referred to as medical physics. As a result, the Italian Association of Medical Physics (AIFM) has created the “AI for Medical Physics” (AI4MP) group with the aims of coordinating the efforts, facilitating the communication, and sharing of the knowledge on AI of the medical physicists (MPs) in Italy. The purpose of this review is to summarize the main applications of AI in medical physics, describe the skills of the MPs in research and clinical applications of AI, and define the major challenges of AI in healthcare.
Collapse
|
18
|
Vieira MDC, Vieira SAG, Skupien JA, Boeck CR. Nanoencapsulation of unsaturated omega-3 fatty acids as protection against oxidation: A systematic review and data-mining. Crit Rev Food Sci Nutr 2021; 62:4356-4370. [PMID: 33506691 DOI: 10.1080/10408398.2021.1874870] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
The chemical structure of unsaturated fatty acids makes them highly prone to oxidation, which decreases their nutritional properties. Nanocarriers have the ability to protect unstable nutraceuticals and take them to their specific targets. Thus, the aim is to determine the effectiveness of nanoencapsulation of omega-3 unsaturated fatty acids as protection against oxidation, as well as to apply data-mining approach to identify nanoencapsulation profiles. Three databases were used to search for studies focused on comparing omega-3 encapsulation to the active compound in its raw form. Studies without oxidation test or no use omega 3-rich oil as active ingredient in nanoformulations were excluded. Twenty-three studies were included in the systematic review. The qualitative analysis indicated that the main evaluated parameters were encapsulation efficiency (%), physical-chemical parameters and oxidation (analyzed at different storage temperatures), oil type, and whether the formulation was added to food. With regard to quantitative analysis, studies that did not perform oxidation tests focused on comparing free oil to the encapsulated one were excluded. Data-mining indicated that encapsulation efficiency and particle size were the main characteristic defining nanocarrier's effectiveness in protecting the oil against oxidation. Nevertheless, it is important to note the main characteristics associated with oil protection in nanocarriers.
Collapse
Affiliation(s)
- Maiana da Costa Vieira
- Programa de Pós-graduação em Nanociências, Universidade Franciscana, Santa Maria, Brazil
| | | | - Jovito Adiel Skupien
- Mestrado em Ciências da Saúde e da Vida, Universidade Franciscana, Santa Maria, Brazil
| | - Carina Rodrigues Boeck
- Programa de Pós-graduação em Nanociências, Universidade Franciscana, Santa Maria, Brazil.,Mestrado em Ciências da Saúde e da Vida, Universidade Franciscana, Santa Maria, Brazil
| |
Collapse
|
19
|
Brunekreef TE, Otten HG, van den Bosch SC, Hoefer IE, van Laar JM, Limper M, Haitjema S. Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus. ACR Open Rheumatol 2021; 3:65-71. [PMID: 33434395 PMCID: PMC7882527 DOI: 10.1002/acr2.11211] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 11/16/2020] [Indexed: 12/20/2022] Open
Abstract
Objective Electronic health records (EHR) are increasingly being recognized as a major source of data reusable for medical research and quality monitoring, although patient identification and assessment of symptoms (characterization) remain challenging, especially in complex diseases such as systemic lupus erythematosus (SLE). Current coding systems are unable to assess information recorded in the physician’s free‐text notes. This study shows that text mining can be used as a reliable alternative. Methods In a multidisciplinary research team of data scientists and medical experts, a text mining algorithm on 4607 patient records was developed to assess the diagnosis of 14 different immune‐mediated inflammatory diseases and the presence of 18 different symptoms in the EHR. The text mining algorithm included key words in the EHR, while mining the context for exclusion phrases. The accuracy of the text mining algorithm was assessed by manually checking the EHR of 100 random patients suspected of having SLE for diagnoses and symptoms and comparing the outcome with the outcome of the text mining algorithm. Results After evaluation of 100 patient records, the text mining algorithm had a sensitivity of 96.4% and a specificity of 93.3% in assessing the presence of SLE. The algorithm detected potentially life‐threatening symptoms (nephritis, pleuritis) with good sensitivity (80%‐82%) and high specificity (97%‐97%). Conclusion We present a text mining algorithm that can accurately identify and characterize patients with SLE using routinely collected data from the EHR. Our study shows that using text mining, data from the EHR can be reused in research and quality control.
Collapse
Affiliation(s)
- Tammo E Brunekreef
- University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Henny G Otten
- University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | | | - Imo E Hoefer
- University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Jacob M van Laar
- University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Maarten Limper
- University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Saskia Haitjema
- University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|