Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Groenhof TKJ, Koers LR, Blasse E, de Groot M, Grobbee DE, Bots ML, Asselbergs FW, Lely AT, Haitjema S, van Solinge W, Hoefer I, Haitjema S, de Groot M, Blasse E, Asselbergs FW, Nathoe HM, de Borst GJ, Bots ML, Geerlings MI, Emmelot MH, de Jong PA, Leiner T, Lely AT, van der Kaaij NP, Kappelle LJ, Ruigrok YM, Verhaar MC, Visseren FL, Westerink J. Data mining information from electronic health records produced high yield and accuracy for current smoking status. J Clin Epidemiol 2020;118:100-106. [DOI: 10.1016/j.jclinepi.2019.11.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2019] [Revised: 09/06/2019] [Accepted: 11/06/2019] [Indexed: 12/19/2022]

For:	Groenhof TKJ, Koers LR, Blasse E, de Groot M, Grobbee DE, Bots ML, Asselbergs FW, Lely AT, Haitjema S, van Solinge W, Hoefer I, Haitjema S, de Groot M, Blasse E, Asselbergs FW, Nathoe HM, de Borst GJ, Bots ML, Geerlings MI, Emmelot MH, de Jong PA, Leiner T, Lely AT, van der Kaaij NP, Kappelle LJ, Ruigrok YM, Verhaar MC, Visseren FL, Westerink J. Data mining information from electronic health records produced high yield and accuracy for current smoking status. J Clin Epidemiol 2020;118:100-106. [DOI: 10.1016/j.jclinepi.2019.11.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2019] [Revised: 09/06/2019] [Accepted: 11/06/2019] [Indexed: 12/19/2022]

Number

Cited by Other Article(s)

Muizelaar H, Haas M, van Dortmont K, van der Putten P, Spruit M. Extracting patient lifestyle characteristics from Dutch clinical text with BERT models. BMC Med Inform Decis Mak 2024;24:151. [PMID: 38831420 PMCID: PMC11149227 DOI: 10.1186/s12911-024-02557-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 05/28/2024] [Indexed: 06/05/2024] Open

Abstract

BACKGROUND

BERT models have seen widespread use on unstructured text within the clinical domain. However, little to no research has been conducted into classifying unstructured clinical notes on the basis of patient lifestyle indicators, especially in Dutch. This article aims to test the feasibility of deep BERT models on the task of patient lifestyle classification, as well as introducing an experimental framework that is easily reproducible in future research.

METHODS

This study makes use of unstructured general patient text data from HagaZiekenhuis, a large hospital in The Netherlands. Over 148 000 notes were provided to us, which were each automatically labelled on the basis of the respective patients' smoking, alcohol usage and drug usage statuses. In this paper we test feasibility of automatically assigning labels, and justify it using hand-labelled input. Ultimately, we compare macro F1-scores of string matching, SGD and several BERT models on the task of classifying smoking, alcohol and drug usage. We test Dutch BERT models and English models with translated input.

RESULTS

We find that our further pre-trained MedRoBERTa.nl-HAGA model outperformed every other model on smoking (0.93) and drug usage (0.77). Interestingly, our ClinicalBERT model that was merely fine-tuned on translated text performed best on the alcohol task (0.80). In t-SNE visualisations, we show our MedRoBERTa.nl-HAGA model is the best model to differentiate between classes in the embedding space, explaining its superior classification performance.

CONCLUSIONS

We suggest MedRoBERTa.nl-HAGA to be used as a baseline in future research on Dutch free text patient lifestyle classification. We furthermore strongly suggest further exploring the application of translation to input text in non-English clinical BERT research, as we only translated a subset of the full set and yet achieved very promising results.

Collapse

Ebrahimi A, Henriksen MBH, Brasen CL, Hilberg O, Hansen TF, Jensen LH, Peimankar A, Wiil UK. Identification of patients' smoking status using an explainable AI approach: a Danish electronic health records case study. BMC Med Res Methodol 2024;24:114. [PMID: 38760718 PMCID: PMC11100078 DOI: 10.1186/s12874-024-02231-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2023] [Accepted: 04/23/2024] [Indexed: 05/19/2024] Open

Abstract

BACKGROUND

Smoking is a critical risk factor responsible for over eight million annual deaths worldwide. It is essential to obtain information on smoking habits to advance research and implement preventive measures such as screening of high-risk individuals. In most countries, including Denmark, smoking habits are not systematically recorded and at best documented within unstructured free-text segments of electronic health records (EHRs). This would require researchers and clinicians to manually navigate through extensive amounts of unstructured data, which is one of the main reasons that smoking habits are rarely integrated into larger studies. Our aim is to develop machine learning models to classify patients' smoking status from their EHRs.

METHODS

This study proposes an efficient natural language processing (NLP) pipeline capable of classifying patients' smoking status and providing explanations for the decisions. The proposed NLP pipeline comprises four distinct components, which are; (1) considering preprocessing techniques to address abbreviations, punctuation, and other textual irregularities, (2) four cutting-edge feature extraction techniques, i.e. Embedding, BERT, Word2Vec, and Count Vectorizer, employed to extract the optimal features, (3) utilization of a Stacking-based Ensemble (SE) model and a Convolutional Long Short-Term Memory Neural Network (CNN-LSTM) for the identification of smoking status, and (4) application of a local interpretable model-agnostic explanation to explain the decisions rendered by the detection models. The EHRs of 23,132 patients with suspected lung cancer were collected from the Region of Southern Denmark during the period 1/1/2009-31/12/2018. A medical professional annotated the data into 'Smoker' and 'Non-Smoker' with further classifications as 'Active-Smoker', 'Former-Smoker', and 'Never-Smoker'. Subsequently, the annotated dataset was used for the development of binary and multiclass classification models. An extensive comparison was conducted of the detection performance across various model architectures.

RESULTS

The results of experimental validation confirm the consistency among the models. However, for binary classification, BERT method with CNN-LSTM architecture outperformed other models by achieving precision, recall, and F1-scores between 97% and 99% for both Never-Smokers and Active-Smokers. In multiclass classification, the Embedding technique with CNN-LSTM architecture yielded the most favorable results in class-specific evaluations, with equal performance measures of 97% for Never-Smoker and measures in the range of 86 to 89% for Active-Smoker and 91-92% for Never-Smoker.

CONCLUSION

Our proposed NLP pipeline achieved a high level of classification performance. In addition, we presented the explanation of the decision made by the best performing detection model. Future work will expand the model's capabilities to analyze longer notes and a broader range of categories to maximize its utility in further research and screening applications.

Collapse

Haque MA, Gedara MLB, Nickel N, Turgeon M, Lix LM. The validity of electronic health data for measuring smoking status: a systematic review and meta-analysis. BMC Med Inform Decis Mak 2024;24:33. [PMID: 38308231 PMCID: PMC10836023 DOI: 10.1186/s12911-024-02416-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 01/03/2024] [Indexed: 02/04/2024] Open

Abstract

BACKGROUND

Smoking is a risk factor for many chronic diseases. Multiple smoking status ascertainment algorithms have been developed for population-based electronic health databases such as administrative databases and electronic medical records (EMRs). Evidence syntheses of algorithm validation studies have often focused on chronic diseases rather than risk factors. We conducted a systematic review and meta-analysis of smoking status ascertainment algorithms to describe the characteristics and validity of these algorithms.

METHODS

The Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines were followed. We searched articles published from 1990 to 2022 in EMBASE, MEDLINE, Scopus, and Web of Science with key terms such as validity, administrative data, electronic health records, smoking, and tobacco use. The extracted information, including article characteristics, algorithm characteristics, and validity measures, was descriptively analyzed. Sources of heterogeneity in validity measures were estimated using a meta-regression model. Risk of bias (ROB) in the reviewed articles was assessed using the Quality Assessment of Diagnostic Accuracy Studies-2 tool.

RESULTS

The initial search yielded 2086 articles; 57 were selected for review and 116 algorithms were identified. Almost three-quarters (71.6%) of algorithms were based on EMR data. The algorithms were primarily constructed using diagnosis codes for smoking-related conditions, although prescription medication codes for smoking treatments were also adopted. About half of the algorithms were developed using machine-learning models. The pooled estimates of positive predictive value, sensitivity, and specificity were 0.843, 0.672, and 0.918 respectively. Algorithm sensitivity and specificity were highly variable and ranged from 3 to 100% and 36 to 100%, respectively. Model-based algorithms had significantly greater sensitivity (p = 0.006) than rule-based algorithms. Algorithms for EMR data had higher sensitivity than algorithms for administrative data (p = 0.001). The ROB was low in most of the articles (76.3%) that underwent the assessment.

CONCLUSIONS

Multiple algorithms using different data sources and methods have been proposed to ascertain smoking status in electronic health data. Many algorithms had low sensitivity and positive predictive value, but the data source influenced their validity. Algorithms based on machine-learning models for multiple linked data sources have improved validity.

Collapse

Matthews AK, Inwanna S, Oyaluade D, Akufo J, Jeremiah R, Kim SJ. Examination of provider knowledge, attitudes, and behaviors associated with lung cancer screening among Black men receiving care at a federally qualified health center. QUALITATIVE RESEARCH IN MEDICINE & HEALTHCARE 2023;7:11546. [PMID: 38115824 PMCID: PMC10726993 DOI: 10.4081/qrmh.2023.11546] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 11/07/2023] [Indexed: 12/21/2023] Open

Lewis AE, Weiskopf N, Abrams ZB, Foraker R, Lai AM, Payne PRO, Gupta A. Electronic health record data quality assessment and tools: a systematic review. J Am Med Inform Assoc 2023;30:1730-1740. [PMID: 37390812 PMCID: PMC10531113 DOI: 10.1093/jamia/ocad120] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 05/16/2023] [Accepted: 06/23/2023] [Indexed: 07/02/2023] Open

Carroll NM, Burnett-Hartman AN, Rendle KA, Neslund-Dudas CM, Greenlee RT, Honda SA, Vachani A, Ritzwoller DP. Smoking status and the association between patient-level factors and survival among lung cancer patients. J Natl Cancer Inst 2023;115:937-948. [PMID: 37228018 PMCID: PMC10407692 DOI: 10.1093/jnci/djad098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 05/16/2023] [Accepted: 05/19/2023] [Indexed: 05/27/2023] Open

Abstract

BACKGROUND

Declines in the prevalence of cigarette smoking, advances in targeted therapies, and implementation of lung cancer screening have changed the clinical landscape for lung cancer. The proportion of lung cancer deaths is increasing in those who have never smoked cigarettes. To better understand contemporary patterns in survival among patients with lung cancer, a comprehensive evaluation of factors associated with survival, including differential associations by smoking status, is needed.

METHODS

Patients diagnosed with lung cancer between January 1, 2010, and September 30, 2019, were identified. We estimated all-cause and lung cancer-specific median, 5-year, and multivariable restricted mean survival time (RMST) to identify demographic, socioeconomic, and clinical factors associated with survival, overall and stratified by smoking status (never, former, and current).

RESULTS

Analyses included 6813 patients with lung cancer: 13.9% never smoked, 54.2% formerly smoked, and 31.9% currently smoked. All-cause RMST through 5 years for those who never, formerly, and currently smoked was 32.1, 25.9, and 23.3 months, respectively. Lung cancer-specific RMST was 36.3 months, 30.3 months, and 26.0 months, respectively. Across most models, female sex, younger age, higher socioeconomic measures, first-course surgery, histology, and body mass index were positively associated, and higher stage was inversely associated with survival. Relative to White patients, Black patients had increased survival among those who formerly smoked.

CONCLUSIONS

We identify actionable factors associated with survival between those who never, formerly, and currently smoked cigarettes. These findings illuminate opportunities to address underlying mechanisms driving lung cancer progression, including use of first-course treatment, and enhanced implementation of tailored smoking cessation interventions for individuals diagnosed with cancer.

Collapse

Riskin D, Cady R, Shroff A, Hindiyeh NA, Smith T, Kymes S. Using artificial intelligence to identify patients with migraine and associated symptoms and conditions within electronic health records. BMC Med Inform Decis Mak 2023;23:121. [PMID: 37452338 PMCID: PMC10349448 DOI: 10.1186/s12911-023-02190-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 05/04/2023] [Indexed: 07/18/2023] Open

Abstract

BACKGROUND

Real-world evidence (RWE)-based on information obtained from sources such as electronic health records (EHRs), claims and billing databases, product and disease registries, and personal devices and health applications-is increasingly used to support healthcare decision making. There is variability in the collection of EHR data, which includes "structured data" in predefined fields (e.g., problem list, open claims, medication list, etc.) and "unstructured data" as free text or narrative. Healthcare providers are likely to provide more complete information as free text, but extracting meaning from these fields requires newer technologies and a rigorous methodology to generate higher-quality evidence. Herein, an approach to identify concepts associated with the presence and progression of migraine was developed and validated using the complete patient record in EHR data, including both the structured and unstructured portions.

METHODS

"Traditional RWE" approaches (i.e., capture from structured EHR fields and extraction using structured queries) and "Advanced RWE" approaches (i.e., capture from unstructured EHR data and processing by artificial intelligence [AI] technology, including natural language processing and AI-based inference) were evaluated against a manual chart abstraction reference standard for data collected from a tertiary care setting. The primary endpoint was recall; differences were compared using chi square.

RESULTS

Compared with manual chart abstraction, recall for migraine and headache were 66.6% and 29.6%, respectively, for Traditional RWE, and 96.8% and 92.9% for Advanced RWE; differences were statistically significant (absolute differences, 30.2% and 63.3%; P < 0.001). Recall of 6 migraine-associated symptoms favored Advanced RWE over Traditional RWE to a greater extent (absolute differences, 71.5-88.8%; P < 0.001). The difference between traditional and advanced techniques for recall of migraine medications was less pronounced, approximately 80% for Traditional RWE and ≥ 98% for Advanced RWE (P < 0.001).

CONCLUSION

Unstructured EHR data, processed using AI technologies, provides a more credible approach to enable RWE in migraine than using structured EHR and claims data alone. An algorithm was developed that could be used to further study and validate the use of RWE to support diagnosis and management of patients with migraine.

Collapse

Syed R, Eden R, Makasi T, Chukwudi I, Mamudu A, Kamalpour M, Kapugama Geeganage D, Sadeghianasl S, Leemans SJJ, Goel K, Andrews R, Wynn MT, Ter Hofstede A, Myers T. Digital Health Data Quality Issues: Systematic Review. J Med Internet Res 2023;25:e42615. [PMID: 37000497 PMCID: PMC10131725 DOI: 10.2196/42615] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 12/07/2022] [Accepted: 12/31/2022] [Indexed: 04/01/2023] Open

Abstract

BACKGROUND

The promise of digital health is principally dependent on the ability to electronically capture data that can be analyzed to improve decision-making. However, the ability to effectively harness data has proven elusive, largely because of the quality of the data captured. Despite the importance of data quality (DQ), an agreed-upon DQ taxonomy evades literature. When consolidated frameworks are developed, the dimensions are often fragmented, without consideration of the interrelationships among the dimensions or their resultant impact.

OBJECTIVE

The aim of this study was to develop a consolidated digital health DQ dimension and outcome (DQ-DO) framework to provide insights into 3 research questions: What are the dimensions of digital health DQ? How are the dimensions of digital health DQ related? and What are the impacts of digital health DQ?

METHODS

Following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, a developmental systematic literature review was conducted of peer-reviewed literature focusing on digital health DQ in predominately hospital settings. A total of 227 relevant articles were retrieved and inductively analyzed to identify digital health DQ dimensions and outcomes. The inductive analysis was performed through open coding, constant comparison, and card sorting with subject matter experts to identify digital health DQ dimensions and digital health DQ outcomes. Subsequently, a computer-assisted analysis was performed and verified by DQ experts to identify the interrelationships among the DQ dimensions and relationships between DQ dimensions and outcomes. The analysis resulted in the development of the DQ-DO framework.

RESULTS

The digital health DQ-DO framework consists of 6 dimensions of DQ, namely accessibility, accuracy, completeness, consistency, contextual validity, and currency; interrelationships among the dimensions of digital health DQ, with consistency being the most influential dimension impacting all other digital health DQ dimensions; 5 digital health DQ outcomes, namely clinical, clinician, research-related, business process, and organizational outcomes; and relationships between the digital health DQ dimensions and DQ outcomes, with the consistency and accessibility dimensions impacting all DQ outcomes.

CONCLUSIONS

The DQ-DO framework developed in this study demonstrates the complexity of digital health DQ and the necessity for reducing digital health DQ issues. The framework further provides health care executives with holistic insights into DQ issues and resultant outcomes, which can help them prioritize which DQ-related problems to tackle first.

Collapse

Ardesch F, Meulendijk M, Kist J, Vos R, Vos H, Kiefte-de Jong J, Spruit M, Bruijnzeels M, Bussemaker M, Numans M, Struijs J. A data-driven population health management approach: The Extramural LUMC Academic Network data infrastructure. Health Policy 2023;132:104769. [PMID: 37018883 DOI: 10.1016/j.healthpol.2023.104769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 02/27/2023] [Accepted: 03/09/2023] [Indexed: 03/17/2023]

Groenhof TKJ, Haitjema S, Lely AT, Grobbee DE, Asselbergs FW, Bots ML. Optimizing cardiovascular risk assessment and registration in a developing cardiovascular learning health care system: Women benefit most. PLOS DIGITAL HEALTH 2023;2:e0000190. [PMID: 36812613 PMCID: PMC9931327 DOI: 10.1371/journal.pdig.0000190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 12/30/2022] [Indexed: 02/11/2023]

Abstract

Since 2015 we organized a uniform, structured collection of a fixed set of cardiovascular risk factors according the (inter)national guidelines on cardiovascular risk management. We evaluated the current state of a developing cardiovascular towards learning healthcare system-the Utrecht Cardiovascular Cohort Cardiovascular Risk Management (UCC-CVRM)-and its potential effect on guideline adherence in cardiovascular risk management. We conducted a before-after study comparing data from patients included in UCC-CVRM (2015-2018) and patients treated in our center before UCC-CVRM (2013-2015) who would have been eligible for UCC-CVRM using the Utrecht Patient Oriented Database (UPOD). Proportions of cardiovascular risk factor measurement before and after UCC-CVRM initiation were compared, as were proportions of patients that required (change of) blood pressure, lipid, or blood glucose lowering treatment. We estimated the likelihood to miss patients with hypertension, dyslipidemia, and elevated HbA1c before UCC-CVRM for the whole cohort and stratified for sex. In the present study, patients included up to October 2018 (n = 1904) were matched with 7195 UPOD patients with similar age, sex, department of referral and diagnose description. Completeness of risk factor measurement increased, ranging from 0% -77% before to 82%-94% after UCC-CVRM initiation. Before UCC-CVRM, we found more unmeasured risk factors in women compared to men. This sex-gap resolved in UCC-CVRM. The likelihood to miss hypertension, dyslipidemia, and elevated HbA1c was reduced by 67%, 75% and 90%, respectively, after UCC-CVRM initiation. A finding more pronounced in women compared to men. In conclusion, a systematic registration of the cardiovascular risk profile substantially improves guideline adherent assessment and decreases the risk of missing patients with elevated levels with an indication for treatment. The sex-gap disappeared after UCC-CVRM initiation. Thus, an LHS approach contributes to a more inclusive insight into quality of care and prevention of cardiovascular disease (progression).

Collapse

de Boer AR, de Groot MCH, Groenhof TKJ, van Doorn S, Vaartjes I, Bots ML, Haitjema S. Data mining to retrieve smoking status from electronic health records in general practice^{. EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2022;3:437-444. [PMID: 36712169 PMCID: PMC9707867 DOI: 10.1093/ehjdh/ztac031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 04/19/2022] [Indexed: 02/01/2023]}

Chintalapudi N, Angeloni U, Battineni G, di Canio M, Marotta C, Rezza G, Sagaro GG, Silenzi A, Amenta F. LASSO Regression Modeling on Prediction of Medical Terms among Seafarers’ Health Documents Using Tidy Text Mining. Bioengineering (Basel) 2022;9:bioengineering9030124. [PMID: 35324813 PMCID: PMC8945331 DOI: 10.3390/bioengineering9030124] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/02/2022] [Accepted: 03/16/2022] [Indexed: 12/31/2022] Open

Overmars LM, van Es B, Groepenhoff F, De Groot MCH, Pasterkamp G, den Ruijter HM, van Solinge WW, Hoefer IE, Haitjema S. Preventing unnecessary imaging in patients suspect of coronary artery disease through machine learning of electronic health records. EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2021;3:11-19. [PMID: 36713995 PMCID: PMC9707976 DOI: 10.1093/ehjdh/ztab103] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 11/22/2021] [Accepted: 12/02/2021] [Indexed: 02/01/2023]

Using Electronic Medical Records to Identify Potentially Eligible Study Subjects for Lung Cancer Screening with Biomarkers. Cancers (Basel) 2021;13:cancers13215449. [PMID: 34771612 PMCID: PMC8582572 DOI: 10.3390/cancers13215449] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 10/18/2021] [Accepted: 10/19/2021] [Indexed: 11/18/2022] Open

Abstract

Simple Summary

Recent cancer screening trials have found that using low-dose computed tomography (LDCT), compared to chest radiography, resulted in a significant reduction in lung cancer mortality. To effectively carry out this intervention, individuals at a high risk of developing lung cancer are targeted. However, accurately identifying and retaining these groups can be challenging. As electronic medical records (EMRs) contain important demographic and clinical information, they could be used to accurately identify subjects for screening. To determine whether EMRs can be used for this purpose, this paper examines the evidence around the use of EMRs in screening trials and the information contained in them that could be used to aid researchers in identifying eligible subjects.

Abstract

Lung cancer screening trials using low-dose computed tomography (LDCT) show reduced late-stage diagnosis and mortality rates. These trials have identified high-risk groups that would benefit from screening. However, these sub-populations can be difficult to access and retain in trials. Implementation of national screening programmes further suggests that there is poor uptake in eligible populations. A new approach to participant selection may be more effective. Electronic medical records (EMRs) are a viable alternative to population-based or health registries, as they contain detailed clinical and demographic information. Trials have identified that e-screening using EMRs has improved trial retention and eligible subject identification. As such, this paper argues for greater use of EMRs in trial recruitment and screening programmes. Moreover, this opinion paper explores the current issues in and approaches to lung cancer screening, whether records can be used to identify eligible subjects for screening and the challenges that researchers face when using EMR data.

Collapse

Hackl WO, Hoerbst A. Clinical Information Systems Research in the Pandemic Year 2020. Yearb Med Inform 2021;30:134-140. [PMID: 34479385 PMCID: PMC8416225 DOI: 10.1055/s-0041-1726516] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open

Rankin NM, McWilliams A, Marshall HM. Lung cancer screening implementation: Complexities and priorities. Respirology 2021;25 Suppl 2:5-23. [PMID: 33200529 DOI: 10.1111/resp.13963] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 10/06/2020] [Indexed: 12/17/2022]

Abstract

Lung cancer is the number one cause of cancer death worldwide. The benefits of lung cancer screening to reduce mortality and detect early-stage disease are no longer in any doubt based on the results of two landmark trials using LDCT. Lung cancer screening has been implemented in the US and South Korea and is under consideration by other communities. Successful translation of demonstrated research outcomes into the routine clinical setting requires careful implementation and co-ordinated input from multiple stakeholders. Implementation aspects may be specific to different healthcare settings. Important knowledge gaps remain, which must be addressed in order to optimize screening benefits and minimize screening harms. Lung cancer screening differs from all other cancer screening programmes as lung cancer risk is driven by smoking, a highly stigmatized behaviour. Stigma, along with other factors, can impact smokers' engagement with screening, meaning that smokers are generally 'hard to reach'. This review considers critical points along the patient journey. The first steps include selecting a risk threshold at which to screen, successfully engaging the target population and maximizing screening uptake. We review barriers to smoker engagement in lung and other cancer screening programmes. Recruitment strategies used in trials and real-world (clinical) programmes and associated screening uptake are reviewed. To aid cross-study comparisons, we propose a standardized nomenclature for recording and calculating recruitment outcomes. Once participants have engaged with the screening programme, we discuss programme components that are critical to maximize net benefit. A whole-of-programme approach is required including a standardized and multidisciplinary approach to pulmonary nodule management, incorporating probabilistic nodule risk assessment and longitudinal volumetric analysis, to reduce unnecessary downstream investigations and surgery; the integration of smoking cessation; and identification and intervention for other tobacco related diseases, such as coronary artery calcification and chronic obstructive pulmonary disease. National support, integrated with tobacco control programmes, and with appropriate funding, accreditation, data collection, quality assurance and reporting mechanisms will enhance lung cancer screening programme success and reduce the risks associated with opportunistic, ad hoc screening. Finally, implementation research must play a greater role in informing policy change about targeted LDCT screening programmes.

Collapse

Artificial Intelligence and the Medical Physicist: Welcome to the Machine. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11041691] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Vieira MDC, Vieira SAG, Skupien JA, Boeck CR. Nanoencapsulation of unsaturated omega-3 fatty acids as protection against oxidation: A systematic review and data-mining. Crit Rev Food Sci Nutr 2021;62:4356-4370. [PMID: 33506691 DOI: 10.1080/10408398.2021.1874870] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Brunekreef TE, Otten HG, van den Bosch SC, Hoefer IE, van Laar JM, Limper M, Haitjema S. Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus. ACR Open Rheumatol 2021;3:65-71. [PMID: 33434395 PMCID: PMC7882527 DOI: 10.1002/acr2.11211] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 11/16/2020] [Indexed: 12/20/2022] Open