1
Panny A, Hegde H, Glurich I, Scannapieco FA, Vedre JG, VanWormer JJ, Miecznikowski J, Acharya A. A methodological approach to validate pneumonia encounters from radiology reports using Natural Language Processing (NLP). Methods Inf Med 2022; 61:38-45. [PMID: 35381617] [DOI: 10.1055/a-1817-7008]
Abstract
INTRODUCTION Pneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented, pneumonia-related features in radiology notes, which are captured in electronic health records in an unstructured format. OBJECTIVE The study objective was to develop a methodological approach for assessing the validity of a pneumonia diagnosis based on the presence or absence of key radiographic features in radiology reports, with subsequent rendering of diagnostic decisions into a structured format. METHODS A pneumonia-specific Natural Language Processing (NLP) pipeline built on cTAKES was developed to validate pneumonia diagnoses, following construction of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode to one of three categories, "positive", "negative", or "not classified: requires manual review", based on tagged concepts that support or refute the diagnostic codes. RESULTS A total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the episodes were documented by 225,893 chest x-ray reports. NLP classified 33% (26,800/81,707) of these episodes as "pneumonia-positive", 19% (15,401/81,707) as "pneumonia-negative", and 48% (39,209/81,707) as "episode classification pending further manual review". NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%). CONCLUSION The pneumonia-specific NLP pipeline exhibited good performance, comparable to other pneumonia-specific NLP systems developed to date.
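The three-way episode classification described above can be illustrated with a minimal rule-based sketch. This is a hypothetical illustration only: the concept names and decision logic below are assumptions, not the published cTAKES lexicon or classification rules.

```python
# Minimal sketch of a three-way classification rule over NLP-tagged
# radiology concepts, in the spirit of the pipeline described above.
# The concept sets and decision logic are illustrative assumptions.

SUPPORTING = {"consolidation", "infiltrate", "airspace opacity"}
REFUTING = {"no acute cardiopulmonary process", "clear lungs"}

def classify_episode(tagged_concepts):
    """Assign a pneumonia episode to one of three categories based on
    concepts tagged in its chest x-ray reports."""
    support = any(c in SUPPORTING for c in tagged_concepts)
    refute = any(c in REFUTING for c in tagged_concepts)
    if support and not refute:
        return "positive"
    if refute and not support:
        return "negative"
    # Conflicting or absent evidence falls through to human review.
    return "not classified: requires manual review"
```

Episodes with only supporting concepts are auto-validated, episodes with only refuting concepts are rejected, and anything ambiguous is routed to manual review, mirroring the three buckets reported in the abstract.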
Affiliation(s)
- Aloksagar Panny
- Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
- Harshad Hegde
- Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
- Ingrid Glurich
- Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
- Frank A Scannapieco
- Department of Oral Biology, School of Dental Medicine, State University of New York at Buffalo, Buffalo, United States
- Jayanth G Vedre
- Critical Care Medicine Department, Marshfield Clinic Health System, Marshfield, United States
- Jeffrey J VanWormer
- Center for Clinical Epidemiology and Population Health, Marshfield Clinic Research Institute, Marshfield, United States
- Jeffrey Miecznikowski
- Department of Biostatistics, School of Public Health and Health Professions, State University of New York at Buffalo, Buffalo, United States
- Amit Acharya
- Advocate Aurora Research Institute, Advocate Aurora Health Inc, Milwaukee, United States
- Center for Oral and Systemic Health, Marshfield Clinic Research Institute, Marshfield, United States
2
Barr J, Paulson SS, Kamdar B, Ervin JN, Lane-Fall M, Liu V, Kleinpell R. The Coming of Age of Implementation Science and Research in Critical Care Medicine. Crit Care Med 2021; 49:1254-1275. [PMID: 34261925] [PMCID: PMC8549627] [DOI: 10.1097/ccm.0000000000005131]
Affiliation(s)
- Juliana Barr
- Anesthesiology and Perioperative Care Service, VA Palo Alto Health Care System, Palo Alto, CA
- Department of Anesthesiology, Perioperative and Pain Medicine, Stanford University School of Medicine, Stanford, CA
- Shirley S Paulson
- Regional Adult Patient Care Services, Kaiser Permanente, Northern California, Oakland, CA
- Biren Kamdar
- Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, University of California, San Diego School of Medicine, La Jolla, CA
- Jennifer N Ervin
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI
- Meghan Lane-Fall
- Department of Anesthesiology and Critical Care, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
- Penn Implementation Science Center at the Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, PA
- Vincent Liu
- Division of Research, Kaiser Permanente Northern California, Santa Clara, CA
- Kaiser Permanente Medical Center, Santa Clara, CA
- Stanford University, Stanford, CA
- Hospital Advanced Analytics, Kaiser Permanente Northern California, Santa Clara, CA
- Vanderbilt University School of Nursing, Nashville, TN
3
Song J, Woo K, Shang J, Ojo M, Topaz M. Predictive Risk Models for Wound Infection-Related Hospitalization or ED Visits in Home Health Care Using Machine-Learning Algorithms. Adv Skin Wound Care 2021; 34:1-12. [PMID: 34260423] [DOI: 10.1097/01.asw.0000755928.30524.22]
Abstract
OBJECTIVE Wound infection is prevalent in home healthcare (HHC) and often leads to hospitalizations. However, no previous study of wounds in HHC has used data from clinical notes. The authors therefore created a more accurate description of a patient's condition by extracting risk factors from clinical notes to build predictive models that identify a patient's risk of wound infection in HHC. METHODS Structured data (eg, standardized assessments) and unstructured information (eg, narrative free-text charting) were retrospectively reviewed for HHC patients with wounds who were served by a large HHC agency in 2014. Wound infection risk factors were identified through bivariate analysis and stepwise variable selection. The risk prediction performance of three machine learning models (logistic regression, random forest, and artificial neural network) was compared. RESULTS A total of 754 of 54,316 patients (1.39%) had a hospitalization or emergency department (ED) visit related to wound infection. In the bivariate logistic regression, language describing wound type in the patient's clinical notes was strongly associated with risk (odds ratio, 9.94; P < .05). The areas under the curve were 0.82 for logistic regression, 0.75 for random forest, and 0.78 for artificial neural network. Risk prediction performance of the models improved (by up to 13.2%) after adding risk factors extracted from clinical notes. CONCLUSIONS Logistic regression showed the best performance in predicting wound infection-related hospitalization or ED visits in HHC. The use of data extracted from clinical notes can improve the performance of risk prediction models.
Affiliation(s)
- Jiyoun Song
- Jiyoun Song, PhD, RN, AGACNP-BC, is Postdoctoral Fellow, Columbia University School of Nursing, New York, NY. Kyungmi Woo, PhD, RN, is Assistant Professor, The Research Institute of Nursing Science, Seoul National University College of Nursing, Republic of Korea. Jingjing Shang, PhD, RN, is Associate Professor, Columbia University School of Nursing, New York, NY. Marietta Ojo, MPH, is Research Assistant, Columbia University Mailman School of Public Health, New York, NY. Maxim Topaz, PhD, RN, is Associate Professor, Columbia University School of Nursing, New York, NY. Acknowledgments: This study is funded by the Eugenie and Joseph Doyle Research Partnership Fund from Visiting Nurses Service of New York and the Intramural Pilot Grant from Columbia University School of Nursing. At the time of data analysis and manuscript development, Jiyoun Song was supported in part by the Agency for Healthcare Research and Quality (R01HS024915), Nursing Intensity of Patient Care Needs and Rates of Healthcare-Associated Infections, and The Jonas Center for Nursing and Veterans Healthcare. Kyungmi Woo was supported by the Comparative and Cost-Effectiveness Research (T32 NR014205) grant through the National Institute of Nursing Research. The authors have disclosed no other financial relationships related to this article. Submitted August 28, 2020; accepted in revised form December 8, 2020
4
Duggan GE, Reicher JJ, Liu Y, Tse D, Shetty S. Improving reference standards for validation of AI-based radiography. Br J Radiol 2021; 94:20210435. [PMID: 34142868] [PMCID: PMC8248225] [DOI: 10.1259/bjr.20210435]
Abstract
Objective: To demonstrate the importance of combining multiple readers' opinions, in a context-aware manner, when establishing the reference standard for validation of artificial intelligence (AI) applications, for example on chest radiographs. By comparing individual readers, majority vote of a panel, and panel-based discussion, we identify methods that maximize interobserver agreement and label reproducibility. Methods: 1100 frontal chest radiographs were evaluated for six findings: airspace opacity, cardiomegaly, pulmonary edema, fracture, nodules, and pneumothorax. Each image was reviewed by six radiologists, first individually and then via asynchronous adjudication (web-based discussion) in two panels of three readers to resolve disagreements within each panel. We quantified the reproducibility of each method by measuring interreader agreement. Results: Panel-based majority vote improved agreement relative to individual readers for all findings. Most disagreements were resolved with two rounds of adjudication, which further improved reproducibility for some findings, particularly by reducing misses. Improvements varied across finding categories, with adjudication improving agreement for cardiomegaly, fractures, and pneumothorax. Conclusion: The likelihood of interreader agreement, even within panels of US board-certified radiologists, must be considered before reads can be used as a reference standard for validation of proposed AI tools. Agreement and, by extension, reproducibility can be improved by applying majority vote, maximum sensitivity, or asynchronous adjudication for different findings, which supports the development of higher-quality clinical research. Advances in knowledge: A panel of three experts is a common technique for establishing reference standards when ground truth is not available for use in AI validation. The manner in which differing opinions are resolved is shown to be important and has not been previously explored.
Affiliation(s)
- Gavin E Duggan
- Google Health (G.E.D., Y.L., D.T., S.S.), Stanford Health Care and Palo Alto Veterans Affairs (J.J.R.), California, USA
- Joshua J Reicher
- Google Health (G.E.D., Y.L., D.T., S.S.), Stanford Health Care and Palo Alto Veterans Affairs (J.J.R.), California, USA
- Yun Liu
- Google Health (G.E.D., Y.L., D.T., S.S.), Stanford Health Care and Palo Alto Veterans Affairs (J.J.R.), California, USA
- Daniel Tse
- Google Health (G.E.D., Y.L., D.T., S.S.), Stanford Health Care and Palo Alto Veterans Affairs (J.J.R.), California, USA
- Shravya Shetty
- Google Health (G.E.D., Y.L., D.T., S.S.), Stanford Health Care and Palo Alto Veterans Affairs (J.J.R.), California, USA
5
Utility of radiographic keyword abstraction for identification of misdiagnosed pneumonia. Infect Control Hosp Epidemiol 2021; 42:1500-1502. [PMID: 33910668] [DOI: 10.1017/ice.2020.1417]
Abstract
Misdiagnosis of bacterial pneumonia is a leading cause of inappropriate antimicrobial use in hospitalized patients. We report a novel strategy of keyword abstraction from chest radiography transcripts that reliably identified patients with pneumonia misdiagnosis and opportunities for antibiotic discontinuation and/or de-escalation.
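A keyword-abstraction strategy of this kind can be illustrated with a short sketch that searches radiograph transcripts for pneumonia-related terms and skips crudely negated mentions. The keyword list and the sentence-bounded negation window below are illustrative assumptions, not the strategy validated in the study.

```python
import re

# Illustrative pneumonia-related keywords (not the published list).
KEYWORDS = ["consolidation", "infiltrate", "opacity", "pneumonia"]
# Crude negation: a negation cue followed by the keyword within the
# same sentence (no period in between).
NEG = r"\b(?:no|without|negative for|resolved)\b[^.]*\b{kw}\b"

def flag_report(text):
    """Return the keywords asserted (present and not negated) in a
    chest radiography transcript."""
    hits = []
    for kw in KEYWORDS:
        present = re.search(rf"\b{kw}\b", text, re.I)
        negated = re.search(NEG.format(kw=kw), text, re.I)
        if present and not negated:
            hits.append(kw)
    return hits
```

For example, `flag_report("No focal consolidation. Small right lower lobe opacity.")` keeps "opacity" but drops the negated "consolidation", which is the kind of signal that could flag a chart for misdiagnosis review.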
6
The Presentation, Pace, and Profile of Infection and Sepsis Patients Hospitalized Through the Emergency Department: An Exploratory Analysis. Crit Care Explor 2021; 3:e0344. [PMID: 33655214] [PMCID: PMC7909460] [DOI: 10.1097/cce.0000000000000344]
Abstract
To characterize the signs and symptoms of sepsis, compare them with those of simple infection and other emergent conditions, and evaluate their association with hospital outcomes. DESIGN, SETTING, PARTICIPANTS, AND INTERVENTIONS A multicenter, retrospective cohort study of 408,377 patients hospitalized through the emergency department from 2012 to 2017 with sepsis, suspected infection, heart failure, or stroke. Infected patients were identified based on Sepsis-3 criteria, whereas noninfected patients were identified through diagnosis codes. MEASUREMENTS AND MAIN RESULTS Signs and symptoms were identified within physician clinical documentation in the first 24 hours of hospitalization using natural language processing. The time of sign and symptom onset prior to presentation was quantified, and sign and symptom prevalence was assessed. Using multivariable logistic regression, the association of each sign and symptom with four outcomes was evaluated: sepsis versus suspected infection diagnosis, hospital mortality, ICU admission, and time of first antibiotics (> 3 vs ≤ 3 hr from presentation). A total of 10,825 signs and symptoms were identified in 6,148,348 clinical documentation fragments. The most common symptoms overall were as follows: dyspnea (35.2%), weakness (27.2%), altered mental status (24.3%), pain (23.9%), cough (19.7%), edema (17.8%), nausea (16.9%), hypertension (15.6%), fever (13.9%), and chest pain (12.1%). Compared with predominant signs and symptoms in heart failure and stroke, those present in infection were heterogeneous. Signs and symptoms indicative of neurologic dysfunction, significant respiratory conditions, and hypotension were strongly associated with sepsis diagnosis, hospital mortality, and intensive care. Fever, present in only a minority of patients, was associated with improved mortality (odds ratio, 0.67; 95% CI, 0.64-0.70; p < 0.001). For common symptoms, the peak time of symptom onset before sepsis was 2 days, except for altered mental status, which peaked at 1 day prior to presentation. CONCLUSIONS The clinical presentation of sepsis was heterogeneous and occurred with rapid onset prior to hospital presentation. These findings have important implications for improving public education, clinical treatment, and quality measures of sepsis care.
7
Smith JC, Spann A, McCoy AB, Johnson JA, Arnold DH, Williams DJ, Weitkamp AO. Natural Language Processing and Machine Learning to Enable Clinical Decision Support for Treatment of Pediatric Pneumonia. AMIA Annu Symp Proc 2021; 2020:1130-1139. [PMID: 33936489] [PMCID: PMC8075487]
Abstract
Pneumonia is the most frequent cause of infectious disease-related deaths in children worldwide. Clinical decision support (CDS) applications can guide appropriate treatment, but the system must first recognize the appropriate diagnosis. To enable CDS for pediatric pneumonia, we developed an algorithm integrating natural language processing (NLP) and random forest classifiers to identify potential pediatric pneumonia from radiology reports. We deployed the algorithm in the EHR of a large children's hospital using real-time NLP. We describe the development and deployment of the algorithm and evaluate our approach using 9 months of data gathered while the system was in use. Our model, trained on individual radiology reports, had an AUC of 0.954. The intervention, evaluated on patient encounters that could include multiple radiology reports, achieved a sensitivity, specificity, and positive predictive value of 0.899, 0.949, and 0.781, respectively.
Affiliation(s)
- Ashley Spann
- Vanderbilt University Medical Center, Nashville, TN
8
Medic G, Kosaner Kließ M, Atallah L, Weichert J, Panda S, Postma M, EL-Kerdi A. Evidence-based Clinical Decision Support Systems for the prediction and detection of three disease states in critical care: A systematic literature review. F1000Res 2019; 8:1728. [PMID: 31824670] [PMCID: PMC6894361] [DOI: 10.12688/f1000research.20498.1]
Abstract
Background: Clinical decision support (CDS) systems have emerged as tools providing intelligent decision making to address the challenges of critical care. CDS systems can be based on existing guidelines or best practices, and can also utilize machine learning to provide a diagnosis, recommendation, or therapy course. Methods: This research aimed to identify evidence-based study designs and outcome measures to determine the clinical effectiveness of clinical decision support systems in the detection and prediction of hemodynamic instability, respiratory distress, and infection within critical care settings. PubMed, ClinicalTrials.gov and the Cochrane Database of Systematic Reviews were systematically searched to identify primary research published in English between 2013 and 2018. Studies conducted in the USA, Canada, UK, Germany and France with more than 10 participants per arm were included. Results: In studies on hemodynamic instability, the prediction and management of septic shock were the most researched topics, followed by the early prediction of heart failure. For respiratory distress, the most popular topics were pneumonia detection and prediction, followed by pulmonary embolisms. Given the importance of imaging and clinical notes, this area combined machine learning with image analysis and natural language processing. In studies on infection, the most researched areas were the detection, prediction, and management of sepsis, surgical site infections, and acute kidney injury. Overall, a variety of machine learning algorithms were utilized frequently, particularly support vector machines, boosting techniques, random forest classifiers and neural networks. Sensitivity, specificity, and ROC AUC were the most frequently reported performance measures. Conclusion: This review showed an increasing use of machine learning for CDS in all three areas. Large datasets are required for training these algorithms, making it imperative to appropriately address challenges such as class imbalance, correct labelling of data, and missing data. Recommendations are formulated for the development and successful adoption of CDS systems.
Affiliation(s)
- Goran Medic
- Health Economics, Philips, Eindhoven, Noord-Brabant, 5621JG, The Netherlands
- Department of Pharmacy, Unit of PharmacoTherapy, -Epidemiology & -Economics, University of Groningen, Groningen, 9700 AB, The Netherlands
- Saswat Panda
- Global Market Access Solutions Sàrl, St-Prex, 1162, Switzerland
- Maarten Postma
- Department of Pharmacy, Unit of PharmacoTherapy, -Epidemiology & -Economics, University of Groningen, Groningen, 9700 AB, The Netherlands
- Department of Health Sciences, University Medical Centre Groningen, University of Groningen, Groningen, 9700 AB, The Netherlands
- Department of Economics, Econometrics & Finance, University of Groningen, Groningen, 9700 AB, The Netherlands
9
Medic G, Kosaner Kließ M, Atallah L, Weichert J, Panda S, Postma M, EL-Kerdi A. Evidence-based Clinical Decision Support Systems for the prediction and detection of three disease states in critical care: A systematic literature review. F1000Res 2019; 8:1728. [PMID: 31824670] [PMCID: PMC6894361] [DOI: 10.12688/f1000research.20498.2]
10
Liang CH, Liu YC, Wu MT, Garcia-Castro F, Alberich-Bayarri A, Wu FZ. Identifying pulmonary nodules or masses on chest radiography using deep learning: external validation and strategies to improve clinical practice. Clin Radiol 2019; 75:38-45. [PMID: 31521323] [DOI: 10.1016/j.crad.2019.08.005]
Abstract
AIM To test the diagnostic performance of a deep learning-based system for the detection of clinically significant pulmonary nodules/masses on chest radiographs. MATERIALS AND METHODS In a retrospective study of 100 patients (47 with clinically significant pulmonary nodules/masses and 53 control subjects without pulmonary nodules), two radiologists verified clinically significant pulmonary nodules/masses according to chest computed tomography (CT) findings. Computer-aided diagnosis (CAD) software using a deep-learning approach was used to detect pulmonary nodules/masses and to determine the diagnostic performance of four algorithms (heat map, abnormal probability, nodule probability, and mass probability). RESULTS A total of 100 cases were included in the analysis. Among the four algorithms, the mass probability algorithm achieved 76.6% sensitivity (36/47; 11 false negatives) and 88.68% specificity (47/53; six false positives) in the detection of pulmonary nodules/masses at the optimal probability score cut-off of 0.2884. At this cut-off, the mass probability algorithm also had the best predictive ability for pulmonary nodule/mass detection (AUC 0.916 for mass versus 0.682 for heat map, p<0.001; versus 0.810 for abnormal probability, p=0.002; versus 0.813 for nodule probability, p=0.014). CONCLUSION The deep learning-based computer-aided diagnosis system is likely to play a vital role in the early detection and diagnosis of pulmonary nodules/masses on chest radiographs. In future applications, these algorithms could support triage workflow via double reading to improve sensitivity and specificity during the diagnostic process.
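The sensitivity and specificity quoted above follow directly from the stated confusion-matrix counts, as a small helper shows:

```python
def sens_spec(tp, fn, tn, fp):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    return tp / (tp + fn), tn / (tn + fp)

# Counts reported for the mass probability algorithm at the 0.2884
# cut-off: 36 of 47 nodule/mass cases detected (11 false negatives);
# 47 of 53 controls correctly called negative (6 false positives).
sensitivity, specificity = sens_spec(tp=36, fn=11, tn=47, fp=6)
# sensitivity ≈ 0.766 (76.6%), specificity ≈ 0.8868 (88.68%)
```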
Affiliation(s)
- C-H Liang
- Department of Biomedical Imaging and Radiological Sciences, National Yang Ming University, Taipei, Taiwan; Faculty of Medicine, School of Medicine, National Yang Ming University, Taipei, Taiwan; Institute of Clinical Medicine, National Yang Ming University, Taipei, Taiwan
- Y-C Liu
- Department of Diagnostic Radiology, Xiamen Chang Gung Hospital, China
- M-T Wu
- Faculty of Medicine, School of Medicine, National Yang Ming University, Taipei, Taiwan; Institute of Clinical Medicine, National Yang Ming University, Taipei, Taiwan; Department of Radiology, Kaohsiung Veterans General Hospital, Kaohsiung, Taiwan
- F Garcia-Castro
- Radiology Department, Hospital Universitario y Politécnico La Fe and Biomedical Imaging Research Group (GIBI230), Valencia, Spain; QUIBIM SL, Valencia, Spain
- A Alberich-Bayarri
- Radiology Department, Hospital Universitario y Politécnico La Fe and Biomedical Imaging Research Group (GIBI230), Valencia, Spain; QUIBIM SL, Valencia, Spain
- F-Z Wu
- Faculty of Medicine, School of Medicine, National Yang Ming University, Taipei, Taiwan; Institute of Clinical Medicine, National Yang Ming University, Taipei, Taiwan; Department of Radiology, Kaohsiung Veterans General Hospital, Kaohsiung, Taiwan
11
Pavinkurve NP, Natarajan K, Perotte AJ. Deep Vision: Learning to Identify Renal Disease With Neural Networks. Kidney Int Rep 2019; 4:914-916. [PMID: 31317112] [PMCID: PMC6612041] [DOI: 10.1016/j.ekir.2019.04.023]
Affiliation(s)
- Karthik Natarajan
- Department of Biomedical Informatics, Columbia University, New York, New York, USA
- Adler J. Perotte
- Department of Biomedical Informatics, Columbia University, New York, New York, USA
12
Abstract
OBJECTIVES The aim of this study was to test the ability of a commercially available natural language processing (NLP) tool to accurately extract examination quality-related and large polyp information from colonoscopy reports with varying report formats. BACKGROUND Colonoscopy quality reporting often requires manual data abstraction. NLP is another option for extracting information; however, limited data exist on its ability to accurately extract examination quality and polyp findings from unstructured text in colonoscopy reports with different reporting formats. STUDY DESIGN NLP strategies were developed using 500 colonoscopy reports from Kaiser Permanente Northern California and then tested using 300 separate colonoscopy reports that underwent manual chart review. Using findings from manual review as the reference standard, we evaluated the NLP tool's sensitivity, specificity, positive predictive value (PPV), and accuracy for identifying colonoscopy examination indication, cecal intubation, bowel preparation adequacy, and polyps ≥10 mm. RESULTS The NLP tool was highly accurate in identifying examination quality-related variables from colonoscopy reports. Compared with manual review, sensitivity for screening indication was 100% (95% confidence interval: 95.3%-100%), PPV was 90.6% (82.3%-95.8%), and accuracy was 98.2% (97.0%-99.4%). For cecal intubation, sensitivity was 99.6% (98.0%-100%), PPV was 100% (98.5%-100%), and accuracy was 99.8% (99.5%-100%). For bowel preparation adequacy, sensitivity was 100% (98.5%-100%), PPV was 100% (98.5%-100%), and accuracy was 100% (100%-100%). For polyp(s) ≥10 mm, sensitivity was 90.5% (69.6%-98.8%), PPV was 100% (82.4%-100%), and accuracy was 95.2% (88.8%-100%). CONCLUSION NLP yielded a high degree of accuracy for identifying examination quality-related and large polyp information from diverse types of colonoscopy reports.
13
Abstract
PURPOSE Today, data surrounding most of our lives are collected and stored. Data scientists are beginning to explore applications that could harness this information and make sense of it. MATERIALS AND METHODS In this review, the topic of Big Data is explored, and applications in modern health care are considered. RESULTS Big Data is a concept that has evolved from the modern trend of "scientism." One of the primary goals of data scientists is to develop ways to discover new knowledge from the vast quantities of increasingly available information. CONCLUSIONS Current and future opportunities and challenges with respect to radiology are provided with emphasis on cardiothoracic imaging.
14
Can You Read Me Now? Unlocking Narrative Data with Natural Language Processing. Ann Am Thorac Soc 2018; 13:1443-5. [PMID: 27627470 DOI: 10.1513/annalsats.201606-498ed] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
15
Jones BE, South BR, Shao Y, Lu CC, Leng J, Sauer BC, Gundlapalli AV, Samore MH, Zeng Q. Development and Validation of a Natural Language Processing Tool to Identify Patients Treated for Pneumonia across VA Emergency Departments. Appl Clin Inform 2018; 9:122-128. [PMID: 29466818 DOI: 10.1055/s-0038-1626725] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
Abstract
BACKGROUND Identifying pneumonia using diagnosis codes alone may be insufficient for research on clinical decision making. Natural language processing (NLP) may enable the inclusion of cases missed by diagnosis codes. OBJECTIVES This article (1) develops an NLP tool that identifies the clinical assertion of pneumonia from physician emergency department (ED) notes, and (2) compares classification methods using diagnosis codes versus NLP against a gold standard of manual chart review to identify patients initially treated for pneumonia. METHODS Among a national population of ED visits occurring between 2006 and 2012 across the Veterans Affairs health system, we extracted 811 physician documents containing search terms for pneumonia for training, and 100 random documents for validation. Two reviewers annotated span- and document-level classifications of the clinical assertion of pneumonia. An NLP tool using a support vector machine was trained on the enriched documents. We extracted diagnosis codes assigned in the ED and upon hospital discharge and calculated performance characteristics for diagnosis codes, NLP, and NLP plus diagnosis codes against manual review in training and validation sets. RESULTS Among the training documents, 51% contained clinical assertions of pneumonia; in the validation set, 9% were classified with pneumonia, of which 100% contained pneumonia search terms. After enriching with search terms, the NLP system alone demonstrated a recall/sensitivity of 0.72 (training) and 0.55 (validation), and a precision/positive predictive value (PPV) of 0.89 (training) and 0.71 (validation). ED-assigned diagnostic codes demonstrated lower recall/sensitivity (0.48 and 0.44) but higher precision/PPV (0.95 in training, 1.0 in validation); the NLP system identified more "possible-treated" cases than diagnostic coding. An approach combining NLP and ED-assigned diagnostic coding classification achieved the best performance (sensitivity 0.89 and PPV 0.80).
CONCLUSION System-wide application of NLP to clinical text can increase capture of initial diagnostic hypotheses, an important inclusion when studying diagnosis and clinical decision-making under uncertainty.
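As an illustration of the document-level "clinical assertion" task described above, here is a minimal, NegEx-style sketch (this is not the authors' VA system; the trigger terms and negation cues are assumptions invented for the example):

```python
import re

# Hypothetical trigger terms and negation cues -- a real system would use a
# curated lexicon and a trained SVM, as the study did.
PNEUMONIA_TERMS = re.compile(r"\b(pneumonia|pna|consolidation)\b", re.IGNORECASE)
NEGATION_CUES = re.compile(r"\b(no|denies|without|negative for|ruled out)\b", re.IGNORECASE)

def asserts_pneumonia(note: str) -> bool:
    """Document-level label: True if any sentence mentions a pneumonia term
    that is not preceded in that sentence by a negation cue."""
    for sentence in re.split(r"[.!?]+", note):
        match = PNEUMONIA_TERMS.search(sentence)
        if match and not NEGATION_CUES.search(sentence[: match.start()]):
            return True
    return False
```

A trained classifier would learn these decisions from the annotated spans rather than rely on hand-written cues, which is what lets it generalize beyond an enumerated term list.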
16
Pham T, Rubenfeld GD. Fifty Years of Research in ARDS. The Epidemiology of Acute Respiratory Distress Syndrome. A 50th Birthday Review. Am J Respir Crit Care Med 2017; 195:860-870. [PMID: 28157386 DOI: 10.1164/rccm.201609-1773cp] [Citation(s) in RCA: 158] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Since its first description 50 years ago, no other intensive care syndrome has been as extensively studied as acute respiratory distress syndrome (ARDS). Despite this extensive body of research, many basic epidemiologic questions remain unsolved. The lack of gold standard tests jeopardizes accurate diagnosis and translational research. Wide variation in the population incidence has been reported, making even simple estimates of the burden of disease problematic. Despite these limitations, there has been an increase in the understanding of pathophysiology and important risk factors both for the development of ARDS and for important patient-centered outcomes like mortality. In this Critical Care Perspective, we discuss the historical context of ARDS description and attempts at its definition. We highlight the epidemiologic challenges of studying ARDS, as well as other intensive care syndromes, and propose solutions to address them. We update the current knowledge of ARDS trends in incidence and mortality, risk factors, and recently described endotypes.
Affiliation(s)
- Tài Pham
- Interdepartmental Division of Critical Care Medicine, University of Toronto, Toronto, Ontario, Canada; Keenan Research Centre, Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada
- Gordon D Rubenfeld
- Interdepartmental Division of Critical Care Medicine, University of Toronto, Toronto, Ontario, Canada; Program in Trauma, Emergency, and Critical Care Organization, Sunnybrook Health Sciences Center, Toronto, Ontario, Canada
17
Wang Y, Coiera E, Runciman W, Magrabi F. Using multiclass classification to automate the identification of patient safety incident reports by type and severity. BMC Med Inform Decis Mak 2017; 17:84. [PMID: 28606174 PMCID: PMC5468980 DOI: 10.1186/s12911-017-0483-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2016] [Accepted: 06/06/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately, our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volume of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals. METHODS Text-based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM), and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with "balanced" datasets (n_Type = 2860, n_SeverityLevel = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced "stratified" datasets (n_Type = 6000, n_SeverityLevel = 5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision, and recall. RESULTS The most effective combination was an OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3% and 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression, and blood products were identified with high recall and precision. "Documentation" was the hardest type to identify. For severity level, the F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3% and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8-84%) but precision was poor (6.8-11.2%). High-risk incidents (SAC2) were confused with medium-risk incidents (SAC3). CONCLUSIONS Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type.
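The one-versus-one ensemble named above trains one binary classifier per pair of classes and lets each pairwise classifier vote; the majority class wins. A sketch of just the voting logic (the keyword "classifiers" here are toy stand-ins for the study's trained SVMs, and the class names are abbreviated examples):

```python
from collections import Counter
from itertools import combinations

def ovso_predict(pairwise_classifiers, text):
    """One-vs-one prediction: every pairwise classifier casts one vote."""
    votes = Counter(clf(text) for clf in pairwise_classifiers.values())
    return votes.most_common(1)[0][0]

def make_pairwise(a, b):
    # Toy stand-in for a trained binary SVM on the (a, b) class pair:
    # vote for `a` if its keyword appears in the text, else vote for `b`.
    return lambda text: a if a in text else b

classes = ["fall", "medication", "documentation"]
clfs = {(a, b): make_pairwise(a, b) for a, b in combinations(classes, 2)}
```

With the study's 10 incident types this scheme yields 45 pairwise classifiers; ties between classes would need an explicit tie-breaking rule, which `Counter.most_common` does not guarantee on its own.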
Affiliation(s)
- Ying Wang
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, 2109, NSW, Australia
- Enrico Coiera
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, 2109, NSW, Australia
- William Runciman
- Centre for Population Health Research, Division of Health Sciences, University of South Australia, Adelaide, Australia; Australian Patient Safety Foundation, Adelaide, Australia
- Farah Magrabi
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, 2109, NSW, Australia
18
|
Meystre S, Gouripeddi R, Tieder J, Simmons J, Srivastava R, Shah S. Enhancing Comparative Effectiveness Research With Automated Pediatric Pneumonia Detection in a Multi-Institutional Clinical Repository: A PHIS+ Pilot Study. J Med Internet Res 2017; 19:e162. [PMID: 28506958 PMCID: PMC5447826 DOI: 10.2196/jmir.6887] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Revised: 01/26/2017] [Accepted: 03/06/2017] [Indexed: 11/27/2022] Open
Abstract
Background Community-acquired pneumonia is a leading cause of pediatric morbidity. Administrative data are often used to conduct comparative effectiveness research (CER) with sufficient sample sizes to enhance detection of important outcomes. However, such studies are prone to misclassification errors because of the variable accuracy of discharge diagnosis codes. Objective The aim of this study was to develop an automated, scalable, and accurate method to determine the presence or absence of pneumonia in children using chest imaging reports. Methods The multi-institutional PHIS+ clinical repository was developed to support pediatric CER by expanding an administrative database of children’s hospitals with detailed clinical data. To develop a scalable approach to find patients with bacterial pneumonia more accurately, we developed a Natural Language Processing (NLP) application to extract relevant information from chest diagnostic imaging reports. Domain experts established a reference standard by manually annotating 282 reports to train and then test the NLP application. Findings of pleural effusion, pulmonary infiltrate, and pneumonia were automatically extracted from the reports and then used to automatically classify whether a report was consistent with bacterial pneumonia. Results Compared with the annotated diagnostic imaging reports reference standard, the most accurate implementation of machine learning algorithms in our NLP application extracted relevant findings with a sensitivity of .939 and a positive predictive value of .925. It classified reports with a sensitivity of .71, a positive predictive value of .86, and a specificity of .962. When compared with each of the domain experts manually annotating these reports, the NLP application achieved significantly higher sensitivity (.71 vs .527) and similar positive predictive value and specificity.
Conclusions NLP-based pneumonia information extraction of pediatric diagnostic imaging reports performed better than domain experts in this pilot study. NLP is an efficient method to extract information from a large collection of imaging reports to facilitate CER.
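The two-stage design above (extract findings, then classify the report) can be sketched as follows; the combination rule is an assumption invented for the example, not the study's actual logic:

```python
def classify_report(findings: set[str]) -> bool:
    """Report-level label from extracted findings: flag as consistent with
    bacterial pneumonia if pneumonia is asserted outright, or if a pulmonary
    infiltrate co-occurs with a pleural effusion (hypothetical rule)."""
    return "pneumonia" in findings or (
        "pulmonary infiltrate" in findings and "pleural effusion" in findings
    )
```

Separating extraction from classification, as the study did, lets each stage be evaluated against its own reference standard (finding-level vs. report-level annotations).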
Affiliation(s)
- Stephane Meystre
- Medical University of South Carolina, Charleston, SC, United States
- Ramkiran Gouripeddi
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, United States
- Joel Tieder
- Seattle Children's Hospital and University of Washington, Seattle, WA, United States
- Jeffrey Simmons
- Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States
- Rajendu Srivastava
- Department of Pediatrics, University of Utah, Salt Lake City, UT, United States; Primary Children's Hospital, Salt Lake City, UT, United States
- Samir Shah
- Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States
19
Pons E, Braun LMM, Hunink MGM, Kors JA. Natural Language Processing in Radiology: A Systematic Review. Radiology 2016; 279:329-43. [PMID: 27089187 DOI: 10.1148/radiol.16142770] [Citation(s) in RCA: 285] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Radiological reporting has generated large quantities of digital content within the electronic health record, which is potentially a valuable source of information for improving clinical care and supporting research. Although radiology reports are stored for communication and documentation of diagnostic imaging, harnessing their potential requires efficient and automated information extraction: they exist mainly as free-text clinical narrative, from which it is a major challenge to obtain structured data. Natural language processing (NLP) provides techniques that aid the conversion of text into a structured representation, and thus enables computers to derive meaning from human (ie, natural language) input. Used on radiology reports, NLP techniques enable automatic identification and extraction of information. By exploring the various purposes for their use, this review examines how radiology benefits from NLP. A systematic literature search identified 67 relevant publications describing NLP methods that support practical applications in radiology. This review takes a close look at the individual studies in terms of tasks (ie, the extracted information), the NLP methodology and tools used, and their application purpose and performance results. Additionally, limitations, future challenges, and requirements for advancing NLP in radiology will be discussed.
Affiliation(s)
- Ewoud Pons
- From the Departments of Radiology (E.P., L.M.M.B., M.G.M.H.) and Medical Informatics (J.A.K.), Erasmus Medical Center, PO Box 2040, 3000 CA Rotterdam, the Netherlands
- Loes M M Braun
- From the Departments of Radiology (E.P., L.M.M.B., M.G.M.H.) and Medical Informatics (J.A.K.), Erasmus Medical Center, PO Box 2040, 3000 CA Rotterdam, the Netherlands
- M G Myriam Hunink
- From the Departments of Radiology (E.P., L.M.M.B., M.G.M.H.) and Medical Informatics (J.A.K.), Erasmus Medical Center, PO Box 2040, 3000 CA Rotterdam, the Netherlands
- Jan A Kors
- From the Departments of Radiology (E.P., L.M.M.B., M.G.M.H.) and Medical Informatics (J.A.K.), Erasmus Medical Center, PO Box 2040, 3000 CA Rotterdam, the Netherlands
20
Conway M, Khojoyan A, Fana F, Scuba W, Castine M, Mowery D, Chapman W, Jupp S. Developing a web-based SKOS editor. J Biomed Semantics 2016; 7:5. [PMID: 27047653 PMCID: PMC4819276 DOI: 10.1186/s13326-015-0043-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2015] [Accepted: 12/21/2015] [Indexed: 12/03/2022] Open
Abstract
Background The Simple Knowledge Organization System (SKOS) was introduced to the wider research community by a 2005 World Wide Web Consortium (W3C) working draft, and further developed and refined in a 2009 W3C recommendation. Since then, SKOS has become the de facto standard for representing and sharing thesauri, lexicons, vocabularies, taxonomies, and classification schemes. In this paper, we describe the development of a web-based, free, open-source SKOS editor built for the development, curation, and management of small to medium-sized lexicons for health-related Natural Language Processing (NLP). Results The web-based SKOS editor allows users to create, curate, version, manage, and visualise SKOS resources. We tested the system against five widely-used, publicly-available SKOS vocabularies of various sizes and found that the editor is suitable for the development and management of small to medium-size lexicons. Qualitative testing has focussed on using the editor to develop lexical resources to drive NLP applications in two domains. First, developing a lexicon to support an Electronic Health Record-based NLP system for the automatic identification of pneumonia symptoms. Second, creating a taxonomy of lexical cues associated with Diagnostic and Statistical Manual of Mental Disorders (DSM-5) diagnoses with the goal of facilitating the automatic identification of symptoms associated with depression from short, informal texts. Conclusions The SKOS editor we have developed is — to the best of our knowledge — the first free, open-source, web-based, SKOS editor capable of creating, curating, versioning, managing, and visualising SKOS lexicons. Electronic supplementary material The online version of this article (doi:10.1186/s13326-015-0043-z) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Mike Conway
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, 84108 UT United States
- Fariba Fana
- CALIT2, University of California San Diego, 9500 Gilman Drive, La Jolla, 92093 CA United States
- William Scuba
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, 84108 UT United States
- Melissa Castine
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, 84108 UT United States
- Danielle Mowery
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, 84108 UT United States
- Wendy Chapman
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, 84108 UT United States
- Simon Jupp
- European Bioinformatics Institute, Hinxton, CB10 1SD Cambridgeshire, United Kingdom
21
Cormack J, Nath C, Milward D, Raja K, Jonnalagadda SR. Agile text mining for the 2014 i2b2/UTHealth Cardiac risk factors challenge. J Biomed Inform 2015. [PMID: 26209007 DOI: 10.1016/j.jbi.2015.06.030] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
This paper describes the use of an agile text mining platform (Linguamatics' Interactive Information Extraction Platform, I2E) to extract document-level cardiac risk factors in patient records, as defined in the i2b2/UTHealth 2014 challenge. The approach uses a data-driven rule-based methodology with the addition of a simple supervised classifier. We demonstrate that agile text mining allows for rapid optimization of extraction strategies, while post-processing can leverage annotation guidelines, corpus statistics, and logic inferred from the gold standard data. We also show how data imbalance in a training set affects performance. Evaluation of this approach on the test data gave an F-score of 91.7%, one percentage point behind the top-performing system.
Affiliation(s)
- James Cormack
- Linguamatics Ltd., 324 Cambridge Science Park, Milton Road, Cambridge CB4 0WG, UK
- Chinmoy Nath
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 750 N. Lake Shore Drive, 11th Floor, Chicago, IL 60611, USA
- David Milward
- Linguamatics Ltd., 324 Cambridge Science Park, Milton Road, Cambridge CB4 0WG, UK
- Kalpana Raja
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 750 N. Lake Shore Drive, 11th Floor, Chicago, IL 60611, USA
- Siddhartha R Jonnalagadda
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 750 N. Lake Shore Drive, 11th Floor, Chicago, IL 60611, USA
22
Perotte A, Ranganath R, Hirsch JS, Blei D, Elhadad N. Risk prediction for chronic kidney disease progression using heterogeneous electronic health record data and time series analysis. J Am Med Inform Assoc 2015; 22:872-80. [PMID: 25896647 PMCID: PMC4482276 DOI: 10.1093/jamia/ocv024] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Accepted: 03/08/2015] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND As adoption of electronic health records continues to increase, there is an opportunity to incorporate clinical documentation as well as laboratory values and demographics into risk prediction modeling. OBJECTIVE The authors develop a risk prediction model for chronic kidney disease (CKD) progression from stage III to stage IV that includes longitudinal data and features drawn from clinical documentation. METHODS The study cohort consisted of 2908 primary-care clinic patients who had at least three visits prior to January 1, 2013 and developed CKD stage III during their documented history. Development and validation cohorts were randomly selected from this cohort and the study datasets included longitudinal inpatient and outpatient data from these populations. Time series analysis (Kalman filter) and survival analysis (Cox proportional hazards) were combined to produce a range of risk models. These models were evaluated using concordance, a discriminatory statistic. RESULTS A risk model incorporating longitudinal data on clinical documentation and laboratory test results (concordance 0.849) predicts progression from stage III CKD to stage IV CKD more accurately when compared to a similar model without laboratory test results (concordance 0.733, P < .001), a model that only considers the most recent laboratory test results (concordance 0.819, P < .031), and a model based on estimated glomerular filtration rate (concordance 0.779, P < .001). CONCLUSIONS A risk prediction model that takes longitudinal laboratory test results and clinical documentation into consideration can predict CKD progression from stage III to stage IV more accurately than three models that do not take all of these variables into consideration.
Affiliation(s)
- Adler Perotte
- Biomedical Informatics Department, Columbia University, New York, NY, USA
- Rajesh Ranganath
- Computer Science Department, Princeton University, Princeton, NJ, USA
- Jamie S Hirsch
- Biomedical Informatics Department, Columbia University, New York, NY, USA; Division of Nephrology, Columbia University, New York, NY, USA
- David Blei
- Statistics Department, Columbia University, New York, NY, USA
- Noémie Elhadad
- Biomedical Informatics Department, Columbia University, New York, NY, USA