1
|
Cagino KA, Wiley RL, Ghose I, Ciomperlik HN, Sibai BM, Mendez-Figueroa H, Chauhan SP. Risk of Postpartum Hemorrhage in Hypertensive Disorders of Pregnancy: Stratified by Severity. Am J Perinatol 2024; 41:2165-2174. [PMID: 38565195 DOI: 10.1055/a-2297-8790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
OBJECTIVE We aimed to determine the composite maternal hemorrhagic outcome (CMHO) among individuals with and without hypertensive disorders of pregnancy (HDP), stratified by disease severity. Additionally, we investigated the composite neonatal adverse outcome (CNAO) among individuals with HDP who had postpartum hemorrhage (PPH) versus did not have PPH. STUDY DESIGN Our retrospective cohort study included all singletons who delivered at a Level IV center over two consecutive years. The primary outcome was the rate of CMHO, defined as blood loss ≥1,000 mL, use of uterotonics, mechanical tamponade, surgical techniques for atony, transfusion, venous thromboembolism, intensive care unit admission, hysterectomy, or maternal death. A subgroup analysis was performed to investigate the primary outcome stratified by (1) chronic hypertension, (2) gestational hypertension and preeclampsia without severe features, and (3) preeclampsia with severe features. A multivariable regression analysis was performed to investigate the association of HDP with and without PPH on a CNAO which included APGAR <7 at 5 minutes, bronchopulmonary dysplasia, intraventricular hemorrhage, necrotizing enterocolitis, seizures, neonatal sepsis, meconium aspiration syndrome, ventilation >6 hours, hypoxic-ischemic encephalopathy, or neonatal death. RESULTS Of 8,357 singletons, 2,827 (34%) had HDP. Preterm delivery <37 weeks, induction of labor, prolonged oxytocin use, and magnesium sulfate usage were more common in those with versus without HDP (p < 0.001). CMHO was higher among individuals with HDP than those without HDP (26 vs. 19%; adjusted relative risk [aRR] = 1.11, 95% CI: 1.01-1.22). In the subgroup analysis, only individuals with preeclampsia with severe features were associated with higher CMHO (n = 802; aRR = 1.52, 95% CI: 1.32-1.75). There was a higher likelihood of CNAO in individuals with both HDP and PPH compared to those with HDP without PPH (aRR = 1.49, 95% CI: 1.06-2.09). CONCLUSION CMHO was higher among those with HDP. After stratification, only those with preeclampsia with severe features had an increased risk of CMHO. Among individuals with HDP, those who also had a PPH had worse neonatal outcomes than those without hemorrhage. KEY POINTS · Individuals with HDP had an 11% higher likelihood of CMHO.. · After stratification, increased CMHO was limited to those with preeclampsia with severe features.. · There was a higher likelihood of CNAO in those with both HDP and PPH compared to HDP without PPH..
Collapse
Affiliation(s)
- Kristen A Cagino
- Department of Obstetrics, Gynecology and Reproductive Sciences, McGovern Medical School at The University of Texas Health Science Center at Houston (UTHealth), Houston, Texas
| | - Rachel L Wiley
- Department of Obstetrics, Gynecology and Reproductive Sciences, McGovern Medical School at The University of Texas Health Science Center at Houston (UTHealth), Houston, Texas
| | - Ipsita Ghose
- Department of Obstetrics, Gynecology and Reproductive Sciences, McGovern Medical School at The University of Texas Health Science Center at Houston (UTHealth), Houston, Texas
| | - Hailie N Ciomperlik
- Department of Obstetrics, Gynecology and Reproductive Sciences, McGovern Medical School at The University of Texas Health Science Center at Houston (UTHealth), Houston, Texas
| | - Baha M Sibai
- Department of Obstetrics, Gynecology and Reproductive Sciences, McGovern Medical School at The University of Texas Health Science Center at Houston (UTHealth), Houston, Texas
| | - Hector Mendez-Figueroa
- Department of Obstetrics, Gynecology and Reproductive Sciences, McGovern Medical School at The University of Texas Health Science Center at Houston (UTHealth), Houston, Texas
| | - Suneet P Chauhan
- Department of Obstetrics, Gynecology and Reproductive Sciences, McGovern Medical School at The University of Texas Health Science Center at Houston (UTHealth), Houston, Texas
| |
Collapse
|
2
|
Covell MM, Gajjar AA, Sioutas GS, Burkhardt JK, Srinivasan VM. Improper National Inpatient Sample ICD-10 coding limits comparative value of impact of ARUBA trial on prevalence and rupture rates of arteriovenous malformations. J Neurointerv Surg 2024; 16:532-534. [PMID: 37898552 DOI: 10.1136/jnis-2023-021006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 09/20/2023] [Indexed: 10/30/2023]
Affiliation(s)
- Michael M Covell
- School of Medicine, Georgetown University, Washington, District of Columbia, USA
| | - Avi A Gajjar
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Georgios S Sioutas
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Jan-Karl Burkhardt
- Department of Neurosurgery, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Visish M Srinivasan
- Department of Neurosurgery, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
3
|
Arthur R, Mayberry RM, Odum S, Kempton LB. Can researchers trust ICD-10 coding of medical comorbidities in orthopaedic trauma patients? OTA Int 2024; 7:e307. [PMID: 38425488 PMCID: PMC10904096 DOI: 10.1097/oi9.0000000000000307] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 10/31/2023] [Accepted: 12/02/2023] [Indexed: 03/02/2024]
Abstract
Objectives The 10th revision of the International Classification of Diseases (ICD-10) coding system may prove useful to orthopaedic trauma researchers to identify and document populations based on comorbidities. However, its use for research first necessitates determination of its reliability. The purpose of this study was to assess the reliability of electronic medical record (EMR) ICD-10 coding of nonorthopaedic diagnoses in orthopaedic trauma patients relative to the gold standard of prospective data collection. Design Nonexperimental cross-sectional study. Setting Level 1 Trauma Center. Patients/Participants Two hundred sixty-three orthopaedic trauma patients from 2 prior prospective studies from September 2018 to April 2022. Intervention Prospectively collected data were compared with EMR ICD-10 code abstraction for components of the Charlson Comorbidity Index (CCI), obesity, alcohol abuse, and tobacco use (retrospective data). Main Outcome Measurements Percent agreement and Cohen's kappa reliability. Results Percent agreement ranged from 86.7% to 96.9% for all CCI diagnoses and was as low as 72.6% for the diagnosis "overweight." Only 2 diagnoses, diabetes without end-organ damage (kappa = 0.794) and AIDS (kappa = 0.798) demonstrated Cohen's kappa values to indicate substantial agreement. Conclusion EMR diagnostic coding for medical comorbidities in orthopaedic trauma patients demonstrated variable reliability. Researchers may be able to rely on EMR coding to identify patients with diabetes without complications or AIDS. Chart review may still be necessary to confirm diagnoses. Low prevalence of most comorbidities led to high percentage agreement with low reliability. Level of Evidence Level 1 diagnostic.
Collapse
Affiliation(s)
- Rodney Arthur
- University of North Carolina School of Medicine, Chapel Hill, NC
- Department of Orthopaedic Surgery, Atrium Health Musculoskeletal Institute, Carolinas Medical Center, Charlotte, NC
| | - R. Miles Mayberry
- Wake Forest School of Medicine, Winston-Salem, NC
- Department of Orthopaedic Surgery, Atrium Health Musculoskeletal Institute, Carolinas Medical Center, Charlotte, NC
| | - Susan Odum
- Department of Orthopaedic Surgery, Atrium Health Musculoskeletal Institute, Carolinas Medical Center, Charlotte, NC
| | - Laurence B. Kempton
- Department of Orthopaedic Surgery, Atrium Health Musculoskeletal Institute, Carolinas Medical Center, Charlotte, NC
| |
Collapse
|
4
|
Kim TJ, Lee HS, Kim SE, Park J, Kim JY, Lee J, Song JE, Hong JH, Lee J, Chung JH, Kim HC, Shin DH, Lee HY, Kim BJ, Seo WK, Park JM, Lee SJ, Jung KH, Kwon SU, Hong YC, Kim HS, Kang HJ, Lee J, Bae HJ. Developing a national surveillance system for stroke and acute myocardial infarction using claims data in the Republic of Korea: a retrospective study. Osong Public Health Res Perspect 2024; 15:18-32. [PMID: 38481047 PMCID: PMC10982659 DOI: 10.24171/j.phrp.2023.0248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2023] [Revised: 11/30/2023] [Accepted: 12/03/2023] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND Limited information is available concerning the epidemiology of stroke and acute myocardial infarction (AMI) in the Republic of Korea. This study aimed to develop a national surveillance system to monitor the incidence of stroke and AMI using national claims data. METHODS We developed and validated identification algorithms for stroke and AMI using claims data. This validation involved a 2-stage stratified sampling method with a review of medical records for sampled cases. The weighted positive predictive value (PPV) and negative predictive value (NPV) were calculated based on the sampling structure and the corresponding sampling rates. Incident cases and the incidence rates of stroke and AMI in the Republic of Korea were estimated by applying the algorithms and weighted PPV and NPV to the 2018 National Health Insurance Service claims data. RESULTS In total, 2,200 cases (1,086 stroke cases and 1,114 AMI cases) were sampled from the 2018 claims database. The sensitivity and specificity of the algorithms were 94.3% and 88.6% for stroke and 97.9% and 90.1% for AMI, respectively. The estimated number of cases, including recurrent events, was 150,837 for stroke and 40,529 for AMI in 2018. The age- and sex-standardized incidence rate for stroke and AMI was 180.2 and 46.1 cases per 100,000 person-years, respectively, in 2018. CONCLUSION This study demonstrates the feasibility of developing a national surveillance system based on claims data and identification algorithms for stroke and AMI to monitor their incidence rates.
Collapse
Affiliation(s)
- Tae Jung Kim
- Department of Critical Care Medicine, Seoul National University Hospital, Seoul, Republic of Korea
- Department of Neurology, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
| | | | - Seong-Eun Kim
- Department of Neurology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea
| | - Jinju Park
- Central Division of Cardio-cerebrovascular Disease Management, Seoul National University Hospital, Seoul, Republic of Korea
| | - Jun Yup Kim
- Department of Neurology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea
| | - Jiyoon Lee
- Department of Biostatistics, Korea University College of Medicine, Seoul, Republic of Korea
| | - Ji Eun Song
- Department of Biostatistics, Korea University College of Medicine, Seoul, Republic of Korea
| | - Jin-Hyuk Hong
- Central Division of Cardio-cerebrovascular Disease Management, Seoul National University Hospital, Seoul, Republic of Korea
| | - Joongyub Lee
- Department of Preventive Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Joong-Hwa Chung
- Department of Cardiology, Chosun University Hospital, Gwangju, Republic of Korea
| | - Hyeon Chang Kim
- Department of Preventive Medicine, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Dong-Ho Shin
- Department of Cardiology, Severance Cardiovascular Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Hae-Young Lee
- Department of Internal Medicine and Cardiovascular Center, Seoul National University Hospital, Seoul, Republic of Korea
| | - Bum Joon Kim
- Department of Neurology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Woo-Keun Seo
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
| | - Jong-Moo Park
- Department of Neurology, Uijeongbu Eulji Medical Center, Eulji University, Seoul, Republic of Korea
| | - Soo Joo Lee
- Department of Neurology, Daejeon Eulji Medical Center, Eulji University, Daejeon, Republic of Korea
| | - Keun-Hwa Jung
- Department of Neurology, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Sun U. Kwon
- Department of Neurology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Yun-Chul Hong
- Department of Preventive Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Hyo-Soo Kim
- Department of Internal Medicine and Cardiovascular Center, Seoul National University Hospital, Seoul, Republic of Korea
| | - Hyun-Jae Kang
- Department of Internal Medicine and Cardiovascular Center, Seoul National University Hospital, Seoul, Republic of Korea
| | - Juneyoung Lee
- Department of Biostatistics, Korea University College of Medicine, Seoul, Republic of Korea
| | - Hee-Joon Bae
- Department of Neurology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea
| |
Collapse
|
5
|
Tariq A, Goddard K, Elugunti P, Piorkowski K, Staal J, Viramontes A, Banerjee I, Patel BN. Contrastive diagnostic embedding (CDE) model for automated coding - A case study using emergency department encounters. Int J Med Inform 2023; 179:105212. [PMID: 37729838 DOI: 10.1016/j.ijmedinf.2023.105212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 08/07/2023] [Accepted: 09/01/2023] [Indexed: 09/22/2023]
Abstract
BACKGROUND Billing codes are utilized for medical reimbursement, clinical quality metric valuation and for epidemiologic purposes to report and follow disease trends and outcomes. The current paradigm of manual coding can be expensive, time-consuming, and subject to human error. Though automation of the billing codes has been widely reported in the literature via rule-based and supervised approaches, existing strategies lack generalizability and robustness towards large and constantly changing ICD hierarchical structure. METHOD We propose a weakly supervised training strategy by leveraging contrastive learning, contrastive diagnosis embedding (CDE) to capture the fine semantic variations between the diagnosis codes. The approach consists of a two-phase contrastive training for generating the semantic embedding space adapted to incorporate hierarchical information of ICD-10 vocabulary and a weakly supervised retrieval scheme. Core strength of the proposed method is that it puts no limit on the 70 K ICD-10 codes set and can handle all rare codes for coding the diagnosis. RESULTS Our CDE model outperformed string-based partial matching and ClinicalBERT embedding on three test cases (a retrospective testset, a prospective testset, and external testset) and produced an accurate prediction of rare and newly introduced diagnosis codes. A detailed ablation study showed the importance of each phase of the proposed multi-phase training. Each successive phase of training - ICD-10 group sensitive training (phase 1.1), ICD-10 subgroup sensitive training (phase 1.2), free-text diagnosis description-based training (phase 2) - improved performance beyond the previous phase of training. The model also outperformed existing supervised models like CAML and PLM-ICD and produced satisfactory performance on the rare codes. CONCLUSION Compared to the existing rule-based and supervised models, the proposed weakly supervised contrastive learning overcomes the limitations in terms of generalization capability and increases the robustness of the automated billing. Such a model will allow flexibility through accurate billing code automation for practice convergence and gains efficiencies in a value-based care payment environment.
Collapse
Affiliation(s)
- Amara Tariq
- Department of Administration, Mayo Clinic, AZ, USA.
| | - Kris Goddard
- Department of Administration, Mayo Clinic, AZ, USA
| | | | | | - Jared Staal
- Department of Administration, Mayo Clinic, AZ, USA
| | | | - Imon Banerjee
- Department of Radiology, Mayo Clinic, Phoenix, AZ, USA
| | | |
Collapse
|
6
|
Nogues IE, Wen J, Lin Y, Liu M, Tedeschi SK, Geva A, Cai T, Hong C. Weakly Semi-supervised phenotyping using Electronic Health records. J Biomed Inform 2022; 134:104175. [PMID: 36064111 PMCID: PMC10112494 DOI: 10.1016/j.jbi.2022.104175] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 04/23/2022] [Accepted: 08/15/2022] [Indexed: 01/07/2023]
Abstract
OBJECTIVE Electronic Health Record (EHR) based phenotyping is a crucial yet challenging problem in the biomedical field. Though clinicians typically determine patient-level diagnoses via manual chart review, the sheer volume and heterogeneity of EHR data renders such tasks challenging, time-consuming, and prohibitively expensive, thus leading to a scarcity of clinical annotations in EHRs. Weakly supervised learning algorithms have been successfully applied to various EHR phenotyping problems, due to their ability to leverage information from large quantities of unlabeled samples to better inform predictions based on a far smaller number of patients. However, most weakly supervised methods are subject to the challenge to choose the right cutoff value to generate an optimal classifier. Furthermore, since they only utilize the most informative features (i.e., main ICD and NLP counts) they may fail for episodic phenotypes that cannot be consistently detected via ICD and NLP data. In this paper, we propose a label-efficient, weakly semi-supervised deep learning algorithm for EHR phenotyping (WSS-DL), which overcomes the limitations above. MATERIALS AND METHODS WSS-DL classifies patient-level disease status through a series of learning stages: 1) generating silver standard labels, 2) deriving enhanced-silver-standard labels by fitting a weakly supervised deep learning model to data with silver standard labels as outcomes and high dimensional EHR features as input, and 3) obtaining the final prediction score and classifier by fitting a supervised learning model to data with a minimal number of gold standard labels as the outcome, and the enhanced-silver-standard labels and a minimal set of most informative EHR features as input. To assess the generalizability of WSS-DL across different phenotypes and medical institutions, we apply WSS-DL to classify a total of 17 diseases, including both acute and chronic conditions, using EHR data from three healthcare systems. Additionally, we determine the minimum quantity of training labels required by WSS-DL to outperform existing supervised and semi-supervised phenotyping methods. RESULTS The proposed method, in combining the strengths of deep learning and weakly semi-supervised learning, successfully leverages the crucial phenotyping information contained in EHR features from unlabeled samples. Indeed, the deep learning model's ability to handle high-dimensional EHR features allows it to generate strong phenotype status predictions from silver standard labels. These predictions, in turn, provide highly effective features in the final logistic regression stage, leading to high phenotyping accuracy in notably small subsets of labeled data (e.g. n = 40 labeled samples). CONCLUSION Our method's high performance in EHR datasets with very small numbers of labels indicates its potential value in aiding doctors to diagnose rare diseases as well as conditions susceptible to misdiagnosis.
Collapse
Affiliation(s)
| | - Jun Wen
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Yucong Lin
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Center for Statistical Science, Tsinghua University, Beijing, China
| | - Molei Liu
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Sara K Tedeschi
- Department of Medicine, Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Boston, MA, USA
| | - Alon Geva
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Department of Anesthesiology, Critical Care, and Pain Medicine, and Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA; Department of Anesthesia, Harvard Medical School, Boston, MA, USA
| | - Tianxi Cai
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chuan Hong
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
7
|
Jhaveri P, Bozkurt S, Moyal A, Belov A, Anderson S, Shan H, Whitaker B, Hernandez-Boussard T. Analyzing real world data of blood transfusion adverse events: Opportunities and challenges. Transfusion 2022; 62:1019-1026. [PMID: 35437749 DOI: 10.1111/trf.16880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 03/11/2022] [Accepted: 03/11/2022] [Indexed: 11/27/2022]
Abstract
BACKGROUND Blood transfusions are a vital component of modern healthcare, yet adverse reactions to blood product transfusions can cause morbidity, and rarely result in mortality. Therefore, accurate reporting of transfusion related adverse events (TRAEs) is paramount to improved transfusion practice. This study aims to investigate real-world data (RWD) on TRAEs by evaluating differences between ICD 9/10-based electronic health records (EHR) and blood bank-specific reporting. STUDY DESIGN AND METHODS TRAE data were retrospectively collected from a blood bank-specific database between Jan 2015 and June 2019 as the reference data source and compared it to ICD 9/10 diagnostic codes corresponding to various TRAEs. Seven reactions that have corresponding ICD 9/10 diagnostic codes were evaluated: Transfusion related circulatory overload (TACO), transfusion related acute lung injury (TRALI), febrile non-hemolytic reaction (FNHTR), transfusion-related anaphylactic reaction (TRA), acute hemolytic transfusion reaction (AHTR), delayed hemolytic transfusion reaction (DHTR), and delayed serologic reaction (DSTR). These accounted for 33% of the TRAEs at an academic institution during the study period. RESULTS Among 18637 adult blood transfusion recipients, there were 229 unique patients with 263 TRAE related ICD codes in the EHR, while there were 191 unique patients with 287 TRAEs identified in the blood bank database. None of the categories of reaction we investigated had perfect alignment between ICD 9/10 codes and blood bank specific diagnoses. DISCUSSION Multiple systemic challenges were identified that hinder effective reporting of TRAEs. Identifying factors causing inconsistent reporting between blood banks and EHRs is paramount to developing effective workability between these electronic systems, as well as across clinical and laboratory teams.
Collapse
Affiliation(s)
- Perrin Jhaveri
- School of Medicine, Stanford University, Stanford, California, USA.,Stanford Blood Center, Stanford, California, USA
| | - Selen Bozkurt
- Department of Medicine (Biomedical Informatics), Stanford University School of Medicine, Stanford, California, USA
| | - Axel Moyal
- Department of Medicine (Biomedical Informatics), Stanford University School of Medicine, Stanford, California, USA
| | - Artur Belov
- Center for Biologics Evaluation and Research, Office of Biostatistics and Epidemiology, US FDA, Silver Spring, Maryland, USA
| | - Steven Anderson
- Center for Biologics Evaluation and Research, Office of Biostatistics and Epidemiology, US FDA, Silver Spring, Maryland, USA
| | - Hua Shan
- School of Medicine, Stanford University, Stanford, California, USA.,Stanford Blood Center, Stanford, California, USA
| | - Barbee Whitaker
- Center for Biologics Evaluation and Research, Office of Biostatistics and Epidemiology, US FDA, Silver Spring, Maryland, USA
| | - Tina Hernandez-Boussard
- School of Medicine, Stanford University, Stanford, California, USA.,Department of Medicine (Biomedical Informatics), Stanford University School of Medicine, Stanford, California, USA
| |
Collapse
|
8
|
Sun X, Guo W, Shen J. Toward attention-based learning to predict the risk of brain degeneration with multimodal medical data. Front Neurosci 2022; 16:1043626. [PMID: 36741058 PMCID: PMC9889549 DOI: 10.3389/fnins.2022.1043626] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 12/12/2022] [Indexed: 01/20/2023] Open
Abstract
Introduction Brain degeneration is commonly caused by some chronic diseases, such as Alzheimer's disease (AD) and diabetes mellitus (DM). The risk prediction of brain degeneration aims to forecast the situation of disease progression of patients in the near future based on their historical health records. It is beneficial for patients to make an accurate clinical diagnosis and early prevention of disease. Current risk predictions of brain degeneration mainly rely on single-modality medical data, such as Electronic Health Records (EHR) or magnetic resonance imaging (MRI). However, only leveraging EHR or MRI data for the pertinent and accurate prediction is insufficient because of single-modality information (e.g., pixel or volume information of image data or clinical context information of non-image data). Methods Several deep learning-based methods have used multimodal data to predict the risks of specified diseases. However, most of them simply integrate different modalities in an early, intermediate, or late fusion structure and do not care about the intra-modal and intermodal dependencies. A lack of these dependencies would lead to sub-optimal prediction performance. Thus, we propose an encoder-decoder framework for better risk prediction of brain degeneration by using MRI and EHR. An encoder module is one of the key components and mainly focuses on feature extraction of input data. Specifically, we introduce an encoder module, which integrates intra-modal and inter-modal dependencies with the spatial-temporal attention and cross-attention mechanism. The corresponding decoder module is another key component and mainly parses the features from the encoder. In the decoder module, a disease-oriented module is used to extract the most relevant disease representation features. We take advantage of a multi-head attention module followed by a fully connected layer to produce the predicted results. Results As different types of AD and DM influence the nature and severity of brain degeneration, we evaluate the proposed method for three-class prediction of AD and three-class prediction of DM. Our results show that the proposed method with integrated MRI and EHR data achieves an accuracy of 0.859 and 0.899 for the risk prediction of AD and DM, respectively. Discussion The prediction performance is significantly better than the benchmarks, including MRI-only, EHR-only, and state-of-the-art multimodal fusion methods.
Collapse
Affiliation(s)
- Xiaofei Sun
- Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Weiwei Guo
- EchoX Technology Limited, Hong Kong, Hong Kong SAR, China
| | - Jing Shen
- Department of Radiology, Affiliated Zhongshan Hospital of Dalian University, Dalian, Liaoning, China
- *Correspondence: Jing Shen,
| |
Collapse
|
9
|
Song J, Lim Y, Ko I, Kim JY, Kim DK. Association between Air Pollutants and Initial Hospital Admission for Ischemic Stroke in Korea from 2002 to 2013. J Stroke Cerebrovasc Dis 2021; 30:106080. [PMID: 34496310 DOI: 10.1016/j.jstrokecerebrovasdis.2021.106080] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 08/16/2021] [Accepted: 08/19/2021] [Indexed: 11/16/2022] Open
Abstract
OBJECTIVES There is limited information regarding the association between air pollution exposure and stroke incidence. Therefore, this study aimed to assess the associations between short-term exposure to ambient air pollutants and initial hospital admission for ischemic stroke. MATERIALS AND METHODS From the Korea National Health Insurance Service-National Sample Cohort 2002-2013 database in South Korea, 55,852 first hospital admissions for ischemic stroke were identified. A generalized additive Poisson model was used to explore the association between air pollutants, including particulate matter, sulfur dioxide, nitrogen dioxide, and carbon monoxide and admissions for ischemic stroke. RESULTS All air pollutant models showed significant associations with ischemic stroke in the single lag model. In all air pollutant models excluding particulate matter 10 μm, a significant association was found between nitrogen dioxide exposure and initial admission for ischemic stroke after adjusting for other pollutants. An increment of 10 μg/m3 in nitrogen dioxide concentration at lag 0 and 14 days corresponded to a 0.259% (95% confidence interval, 0.231-0.287%) and 0.110% (95% confidence interval, 0.097-0.124) increase in initial admission for ischemic stroke, respectively. CONCLUSIONS The exposure-response relationship between nitrogen dioxide and initial admissions for ischemic stroke was approximately linear, with a sharper response at higher concentrations. Short-term exposure to nitrogen dioxide was positively associated with initial hospital admission for ischemic stroke.
Collapse
Affiliation(s)
- Jihye Song
- Department of Neurosurgery, Ajou College of Medicine, Ajou University Hospital, Suwon, Republic of Korea
| | - Yong Lim
- Department of Neurosurgery, Ajou College of Medicine, Ajou University Hospital, Suwon, Republic of Korea
| | - Inseok Ko
- Department of Biomedical Informatics, College of Medicine, Konyang University, Daejeon, Republic of Korea
| | - Jong-Yeup Kim
- Department of Biomedical Informatics, College of Medicine, Konyang University, Daejeon, Republic of Korea; Department of Otorhinolaryngology-Head and Neck Surgery, College of Medicine, Konyang University, Daejeon, Republic of Korea.
| | - Dong-Kyu Kim
- Department of Otorhinolaryngology-Head and Neck Surgery, Chuncheon Sacred Heart Hospital, Hallym University College of Medicine, Chuncheon, Republic of Korea; Institute of New Frontier Research, Hallym University College of Medicine, Chuncheon, Republic of Korea.
| |
Collapse
|
10
|
Khurshid S, Weng LC, Al-Alusi MA, Halford JL, Haimovich JS, Benjamin EJ, Trinquart L, Ellinor PT, McManus DD, Lubitz SA. Accelerometer-derived physical activity and risk of atrial fibrillation. Eur Heart J 2021; 42:2472-2483. [PMID: 34037209 DOI: 10.1093/eurheartj/ehab250] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Revised: 02/27/2021] [Accepted: 04/01/2021] [Indexed: 02/07/2023] Open
Abstract
AIMS Physical activity may be an important modifiable risk factor for atrial fibrillation (AF), but associations have been variable and generally based on self-reported activity. METHODS AND RESULTS We analysed 93 669 participants of the UK Biobank prospective cohort study without prevalent AF who wore a wrist-based accelerometer for 1 week. We categorized whether measured activity met the standard recommendations of the European Society of Cardiology, American Heart Association, and World Health Organization [moderate-to-vigorous physical activity (MVPA) ≥150 min/week]. We tested associations between guideline-adherent activity and incident AF (primary) and stroke (secondary) using Cox proportional hazards models adjusted for age, sex, and each component of the Cohorts for Heart and Aging Research in Genomic Epidemiology AF (CHARGE-AF) risk score. We also assessed correlation between accelerometer-derived and self-reported activity. The mean age was 62 ± 8 years and 57% were women. Over a median of 5.2 years, 2338 incident AF events occurred. In multivariable adjusted models, guideline-adherent activity was associated with lower risks of AF [hazard ratio (HR) 0.82, 95% confidence interval (CI) 0.75-0.89; incidence 3.5/1000 person-years, 95% CI 3.3-3.8 vs. 6.5/1000 person-years, 95% CI 6.1-6.8] and stroke (HR 0.76, 95% CI 0.64-0.90; incidence 1.0/1000 person-years, 95% CI 0.9-1.1 vs. 1.8/1000 person-years, 95% CI 1.6-2.0). Correlation between accelerometer-derived and self-reported MVPA was weak (Spearman r = 0.16, 95% CI 0.16-0.17). Self-reported activity was not associated with incident AF or stroke. CONCLUSIONS Greater accelerometer-derived physical activity is associated with lower risks of AF and stroke. Future preventive efforts to reduce AF risk may be most effective when targeting adherence to objective activity thresholds.
Collapse
Affiliation(s)
- Shaan Khurshid
- Division of Cardiology, Department of Medicine, Massachusetts General Hospital, 55 Fruit Street, GRB 8, Boston, MA, 02114, USA.,Cardiovascular Disease Initiative, Broad Institute of Harvard University and the Massachusetts Institute of Technology, Cambridge, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Simches Research Building, 185 Cambridge Street, Office 3.188, Boston, MA 02114, USA
| | - Lu-Chen Weng
- Cardiovascular Disease Initiative, Broad Institute of Harvard University and the Massachusetts Institute of Technology, Cambridge, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Simches Research Building, 185 Cambridge Street, Office 3.188, Boston, MA 02114, USA
| | - Mostafa A Al-Alusi
- Division of Cardiology, Department of Medicine, Massachusetts General Hospital, 55 Fruit Street, GRB 8, Boston, MA, 02114, USA.,Cardiovascular Disease Initiative, Broad Institute of Harvard University and the Massachusetts Institute of Technology, Cambridge, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Simches Research Building, 185 Cambridge Street, Office 3.188, Boston, MA 02114, USA
| | - Jennifer L Halford
- Cardiovascular Disease Initiative, Broad Institute of Harvard University and the Massachusetts Institute of Technology, Cambridge, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Simches Research Building, 185 Cambridge Street, Office 3.188, Boston, MA 02114, USA.,Department of Medicine, Massachusetts General Hospital, 55 Fruit Street, GRB 8, Boston, MA, 02114, USA
| | - Julian S Haimovich
- Cardiovascular Disease Initiative, Broad Institute of Harvard University and the Massachusetts Institute of Technology, Cambridge, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Simches Research Building, 185 Cambridge Street, Office 3.188, Boston, MA 02114, USA.,Department of Medicine, Massachusetts General Hospital, 55 Fruit Street, GRB 8, Boston, MA, 02114, USA
| | - Emelia J Benjamin
- Boston University and National Heart, Lung, and Blood Institute's Framingham Heart Study, 73 Mt Wayte Ave, Framingham, MA 01702, USA.,Sections of Preventive Medicine and Cardiovascular Medicine, Department of Medicine, Boston University School of Medicine, 715 Albany St. E-113 Boston, MA 02118, USA.,Department of Epidemiology, Boston University School of Public Heath, 801 Mass Ave, Boston, MA 02118, USA
| | - Ludovic Trinquart
- Boston University and National Heart, Lung, and Blood Institute's Framingham Heart Study, 73 Mt Wayte Ave, Framingham, MA 01702, USA.,Department of Biostatistics, Boston University School of Public Health, 801 Mass Ave, Boston, MA 02118, USA
| | - Patrick T Ellinor
- Cardiovascular Disease Initiative, Broad Institute of Harvard University and the Massachusetts Institute of Technology, Cambridge, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Simches Research Building, 185 Cambridge Street, Office 3.188, Boston, MA 02114, USA.,Cardiac Arrhythmia Service, Division of Cardiology, Department of Medicine, Massachusetts General Hospital, 55 Fruit Street, GRB 109, Boston, MA 02114, USA
| | - David D McManus
- Department of Medicine, University of Massachusetts Medical School, 55 N Lake Ave, Worcester, MA 01655, USA
| | - Steven A Lubitz
- Cardiovascular Disease Initiative, Broad Institute of Harvard University and the Massachusetts Institute of Technology, Cambridge, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Simches Research Building, 185 Cambridge Street, Office 3.188, Boston, MA 02114, USA.,Cardiac Arrhythmia Service, Division of Cardiology, Department of Medicine, Massachusetts General Hospital, 55 Fruit Street, GRB 109, Boston, MA 02114, USA
| |
Collapse
|
11
|
Lee SH, Aw KL, McVerry F, McCarron MO. Systematic Review and Meta-Analysis of Diagnostic Agreement in Suspected TIA. Neurol Clin Pract 2021; 11:57-63. [PMID: 33968473 DOI: 10.1212/cpj.0000000000000830] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Accepted: 01/07/2020] [Indexed: 12/13/2022]
Abstract
Objective To determine the interrater variability for TIA diagnostic agreement among expert clinicians (neurologists/stroke physicians), administrative data, and nonspecialists. Methods We performed a meta-analysis of studies from January 1984 to January 2019 using MEDLINE, EMBASE, and PubMed. Two reviewers independently screened for eligible studies and extracted interrater variability measurements using Cohen's kappa scores to assess diagnostic agreement. Results Nineteen original studies consisting of 19,421 patients were included. Expert clinicians demonstrate good agreement for TIA diagnosis (κ = 0.71, 95% confidence interval [CI] = 0.62-0.81). Interrater variability between clinicians' TIA diagnosis and administrative data also demonstrated good agreement (κ = 0.68, 95% CI = 0.62-0.74). There was moderate agreement (κ = 0.41, 95% CI = 0.22-0.61) between referring clinicians and clinicians at TIA clinics receiving the referrals. Sixty percent of 748 patient referrals to TIA clinics were TIA mimics. Conclusions Overall agreement between expert clinicians was good for TIA diagnosis, although variation still existed for a sizeable proportion of cases. Diagnostic agreement for TIA decreased among nonspecialists. The substantial number of patients being referred to TIA clinics with other (often neurologic) diagnoses was large, suggesting that clinicians, who are proficient in managing TIAs and their mimics, should run TIA clinics.
Collapse
Affiliation(s)
- Seong Hoon Lee
- School of Medicine, Dentistry and Biomedical Sciences (SHL, KLA), Queen's University Belfast, Belfast; and Department of Neurology (FM, MOM), Altnagelvin Hospital, Derry, United Kingdom
| | - Kah Long Aw
- School of Medicine, Dentistry and Biomedical Sciences (SHL, KLA), Queen's University Belfast, Belfast; and Department of Neurology (FM, MOM), Altnagelvin Hospital, Derry, United Kingdom
| | - Ferghal McVerry
- School of Medicine, Dentistry and Biomedical Sciences (SHL, KLA), Queen's University Belfast, Belfast; and Department of Neurology (FM, MOM), Altnagelvin Hospital, Derry, United Kingdom
| | - Mark O McCarron
- School of Medicine, Dentistry and Biomedical Sciences (SHL, KLA), Queen's University Belfast, Belfast; and Department of Neurology (FM, MOM), Altnagelvin Hospital, Derry, United Kingdom
| |
Collapse
|
12
|
St Sauver JL, Chamberlain AM, Bobo WV, Boyd CM, Finney Rutten LJ, Jacobson DJ, McGree ME, Grossardt BR, Rocca WA. Implementing the US Department of Health and Human Services definition of multimorbidity: a comparison between billing codes and medical record review in a population-based sample of persons 40 -84 years old. BMJ Open 2021; 11:e042870. [PMID: 33895712 PMCID: PMC8074567 DOI: 10.1136/bmjopen-2020-042870] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
OBJECTIVE To assess the validity of the US Department of Health and Human Services (DHHS) definition of multimorbidity using International Classification of Diseases, ninth edition (ICD-9) codes from administrative data. DESIGN Cross-sectional comparison of two ICD-9 billing code algorithms to data abstracted from medical records. SETTING Olmsted County, Minnesota, USA. PARTICIPANTS An age-stratified and sex-stratified random sample of 1509 persons ages 40-84 years old residing in Olmsted County on 31 December 2010. STUDY MEASURES Seventeen chronic conditions identified by the US DHHS as important in studies of multimorbidity were identified through medical record review of each participant between 2006 and 2010. ICD-9 administrative billing codes corresponding to the 17 conditions were extracted using the Rochester Epidemiology Project records-linkage system. Persons were classified as having each condition using two algorithms: at least one code or at least two codes separated by more than 30 days. We compared the ICD-9 code algorithms with the diagnoses obtained through medical record review to identify persons with multimorbidity (defined as ≥2, ≥3 or ≥4 chronic conditions). RESULTS Use of a single code to define each of the 17 chronic conditions resulted in sensitivity and positive predictive values (PPV) ≥70%, and in specificity and negative predictive values (NPV) ≥70% for identifying multimorbidity in the overall study population. PPV and sensitivity were highest in persons 65-84 years of age, whereas NPV and specificity were highest in persons 40-64 years. The results varied by condition, and by age and sex. The use of at least two codes reduced sensitivity, but increased specificity. CONCLUSIONS The use of a single code to identify each of the 17 chronic conditions may be a simple and valid method to identify persons who meet the DHHS definition of multimorbidity in populations with similar demographic, socioeconomic, and health care characteristics.
Collapse
Affiliation(s)
- Jennifer L St Sauver
- Division of Epidemiology, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
- The Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, Minnesota, USA
| | - Alanna M Chamberlain
- Division of Epidemiology, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
| | - William V Bobo
- Department of Psychiatry and Psychology, Mayo Clinic, Jacksonville, Florida, USA
| | - Cynthia M Boyd
- Division of Geriatric Medicine and Gerontology, Department of Medicine, Johns Hopkins University, Baltimore, Maryland, USA
| | - Lila J Finney Rutten
- Division of Epidemiology, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
- The Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, Minnesota, USA
| | - Debra J Jacobson
- Division of Clinical Trials and Biostatistics, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
| | - Michaela E McGree
- Division of Clinical Trials and Biostatistics, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
| | - Brandon R Grossardt
- Division of Clinical Trials and Biostatistics, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
| | - Walter A Rocca
- Division of Epidemiology, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
- Department of Neurology, Mayo Clinic, Rochester, Minnesota, USA
- Women's Health Research Center, Mayo Clinic, Rochester, MN, USA
| |
Collapse
|
13
|
Liao KP, Sun J, Cai TA, Link N, Hong C, Huang J, Huffman JE, Gronsbell J, Zhang Y, Ho YL, Castro V, Gainer V, Murphy SN, O'Donnell CJ, Gaziano JM, Cho K, Szolovits P, Kohane IS, Yu S, Cai T. High-throughput multimodal automated phenotyping (MAP) with application to PheWAS. J Am Med Inform Assoc 2021; 26:1255-1262. [PMID: 31613361 DOI: 10.1093/jamia/ocz066] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Revised: 04/08/2019] [Accepted: 04/26/2019] [Indexed: 01/01/2023] Open
Abstract
OBJECTIVE Electronic health records linked with biorepositories are a powerful platform for translational studies. A major bottleneck exists in the ability to phenotype patients accurately and efficiently. The objective of this study was to develop an automated high-throughput phenotyping method integrating International Classification of Diseases (ICD) codes and narrative data extracted using natural language processing (NLP). MATERIALS AND METHODS We developed a mapping method for automatically identifying relevant ICD and NLP concepts for a specific phenotype leveraging the Unified Medical Language System. Along with health care utilization, aggregated ICD and NLP counts were jointly analyzed by fitting an ensemble of latent mixture models. The multimodal automated phenotyping (MAP) algorithm yields a predicted probability of phenotype for each patient and a threshold for classifying participants with phenotype yes/no. The algorithm was validated using labeled data for 16 phenotypes from a biorepository and further tested in an independent cohort phenome-wide association studies (PheWAS) for 2 single nucleotide polymorphisms with known associations. RESULTS The MAP algorithm achieved higher or similar AUC and F-scores compared to the ICD code across all 16 phenotypes. The features assembled via the automated approach had comparable accuracy to those assembled via manual curation (AUCMAP 0.943, AUCmanual 0.941). The PheWAS results suggest that the MAP approach detected previously validated associations with higher power when compared to the standard PheWAS method based on ICD codes. CONCLUSION The MAP approach increased the accuracy of phenotype definition while maintaining scalability, thereby facilitating use in studies requiring large-scale phenotyping, such as PheWAS.
Collapse
Affiliation(s)
- Katherine P Liao
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
| | - Jiehuan Sun
- Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Tianrun A Cai
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
| | - Nicholas Link
- Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
| | - Chuan Hong
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Jie Huang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | | | | | - Yichi Zhang
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.,University of Rhode Island, Kingston, RI, USA
| | - Yuk-Lam Ho
- Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
| | | | | | - Shawn N Murphy
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Partners Healthcare Systems, Summerville, MA, USA.,Massachusetts General Hospital, Boston, MA, USA
| | - Christopher J O'Donnell
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA.,Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
| | - J Michael Gaziano
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
| | - Kelly Cho
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA
| | - Peter Szolovits
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Sheng Yu
- Center for Statistical Science, Tsinghua University, Beijing, China.,Department of Industrial Engineering, Tsinghua University, Beijing, China.,Institute for Data Science, Tsinghua University, Beijing, China
| | - Tianxi Cai
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Division of Data Sciences, VA Boston Healthcare System, Boston, MA, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| |
Collapse
|
14
|
Kelly GF, Makhoul T, Zammit CG, Jones CM, Acquisto NM. The accuracy of
ICD‐CM
codes to identify thromboembolic events for clinical outcomes research. JOURNAL OF THE AMERICAN COLLEGE OF CLINICAL PHARMACY 2021. [DOI: 10.1002/jac5.1353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Gregory F. Kelly
- Department of Pharmacy University of Rochester Medical Center Rochester New York USA
- Department of Pharmacy Penn Medicine, Hospital of the University of Pennsylvania Philadelphia Pennsylvania USA
| | - Therese Makhoul
- Department of Pharmacy University of Rochester Medical Center Rochester New York USA
- Department of Pharmacy Santa Rosa Memorial Hospital Santa Rosa California USA
| | - Christopher G. Zammit
- Department of Emergency Medicine University of Rochester School of Medicine and Dentistry Rochester New York USA
- Department of Critical Care Medicine TriHealth Cincinnati Ohio USA
| | - Courtney M.C. Jones
- Department of Emergency Medicine University of Rochester School of Medicine and Dentistry Rochester New York USA
| | - Nicole M. Acquisto
- Department of Pharmacy University of Rochester Medical Center Rochester New York USA
- Department of Emergency Medicine University of Rochester School of Medicine and Dentistry Rochester New York USA
| |
Collapse
|
15
|
Shweikeh F, Nuno M, Adamo M. Trends in endovascular interventions for pediatric ischemic stroke at the national level: data from 2000 to 2009. Childs Nerv Syst 2021; 37:161-166. [PMID: 32529548 DOI: 10.1007/s00381-020-04714-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 05/28/2020] [Indexed: 10/24/2022]
Abstract
PURPOSE Present knowledge is limited with regard to endovascular and interventional management of pediatric acute ischemic stroke (AIS). The current practice of neurointerventions in this population was analyzed via a national database. METHODS The Kids' Inpatient Database for years 2000, 2003, 2006, and 2009 was examined for patients aged < 18 years discharged with a primary diagnosis of AIS and identified according to ICD-9 codes. Descriptive statistics were tabulated on each of the subcohorts. RESULTS There were 3467 patients identified; 920 (26.5%) underwent angiograms, 51 (1.5%) angiogram + thrombolysis, and 18 (0.5%) received angiogram + endovascular recanalization. The angiogram only subcohort was significantly younger compared with thrombolysis and endovascular procedure subcohorts (9.8 vs. 12.2 vs. 14.9 years, P < 0.001). Mortality was 4.3%, significantly lower for angiogram only than for thrombolysis (1.1% vs. 18.2%, P < 0.0001). Thrombolysis also had significantly higher hospital charges ($149,045 vs. $64,826, P < 0.0001). While not many differences in outcomes between angiogram only versus endovascular procedures, the latter had higher financial burden ($122,482 vs. $64,826, P < 0.0001). CONCLUSIONS This national study suggests that children receiving neurointerventions tend to be older (> 12 years) and heart and valvular defects are their most likely comorbidities. There was a lower mortality and fewer complications with endovascular procedures when compared with intravenous/intraarterial thrombolysis alone. Thrombolysis was also associated with more non-routine discharges and lengthier stay.
Collapse
Affiliation(s)
- Faris Shweikeh
- Department of Neurosurgery, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
| | - Miriam Nuno
- Department of Neurosurgery, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Matthew Adamo
- Department of Neurosurgery, Albany Medical Center, Albany, NY, USA
| |
Collapse
|
16
|
Thangaraj PM, Kummer BR, Lorberbaum T, Elkind MSV, Tatonetti NP. Comparative analysis, applications, and interpretation of electronic health record-based stroke phenotyping methods. BioData Min 2020; 13:21. [PMID: 33372632 PMCID: PMC7720570 DOI: 10.1186/s13040-020-00230-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Accepted: 11/15/2020] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Accurate identification of acute ischemic stroke (AIS) patient cohorts is essential for a wide range of clinical investigations. Automated phenotyping methods that leverage electronic health records (EHRs) represent a fundamentally new approach cohort identification without current laborious and ungeneralizable generation of phenotyping algorithms. We systematically compared and evaluated the ability of machine learning algorithms and case-control combinations to phenotype acute ischemic stroke patients using data from an EHR. MATERIALS AND METHODS Using structured patient data from the EHR at a tertiary-care hospital system, we built and evaluated machine learning models to identify patients with AIS based on 75 different case-control and classifier combinations. We then estimated the prevalence of AIS patients across the EHR. Finally, we externally validated the ability of the models to detect AIS patients without AIS diagnosis codes using the UK Biobank. RESULTS Across all models, we found that the mean AUROC for detecting AIS was 0.963 ± 0.0520 and average precision score 0.790 ± 0.196 with minimal feature processing. Classifiers trained with cases with AIS diagnosis codes and controls with no cerebrovascular disease codes had the best average F1 score (0.832 ± 0.0383). In the external validation, we found that the top probabilities from a model-predicted AIS cohort were significantly enriched for AIS patients without AIS diagnosis codes (60-150 fold over expected). CONCLUSIONS Our findings support machine learning algorithms as a generalizable way to accurately identify AIS patients without using process-intensive manual feature curation. When a set of AIS patients is unavailable, diagnosis codes may be used to train classifier models.
Collapse
Affiliation(s)
- Phyllis M Thangaraj
- Department of Biomedical Informatics, Columbia University, 622 W 168th St., PH-20, New York, NY, 10032, USA
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Benjamin R Kummer
- Department of Neurology, Icahn School of Medicine at Mt. Sinai, New York, NY, USA
| | - Tal Lorberbaum
- Department of Biomedical Informatics, Columbia University, 622 W 168th St., PH-20, New York, NY, 10032, USA
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Mitchell S V Elkind
- Department of Neurology, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
- Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Nicholas P Tatonetti
- Department of Biomedical Informatics, Columbia University, 622 W 168th St., PH-20, New York, NY, 10032, USA.
- Department of Systems Biology, Columbia University, New York, NY, USA.
| |
Collapse
|
17
|
Sangal RB, Fodeh S, Taylor A, Rothenberg C, Finn EB, Sheth K, Matouk C, Ulrich A, Parwani V, Sather J, Venkatesh A. Identification of Patients with Nontraumatic Intracranial Hemorrhage Using Administrative Claims Data. J Stroke Cerebrovasc Dis 2020; 29:105306. [PMID: 33070110 DOI: 10.1016/j.jstrokecerebrovasdis.2020.105306] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Revised: 09/02/2020] [Accepted: 09/05/2020] [Indexed: 10/23/2022] Open
Abstract
INTRODUCTION Nontraumatic intracranial hemorrhage (ICH) is a neurological emergency of research interest; however, unlike ischemic stroke, has not been well studied in large datasets due to the lack of an established administrative claims-based definition. We aimed to evaluate both explicit diagnosis codes and machine learning methods to create a claims-based definition for this clinical phenotype. METHODS We examined all patients admitted to our tertiary medical center with a primary or secondary International Classification of Disease version 9 (ICD-9) or 10 (ICD-10) code for ICH in claims from any portion of the hospitalization in 2014-2015. As a gold standard, we defined the nontraumatic ICH phenotype based on manual chart review. We tested explicit definitions based on ICD-9 and ICD-10 that had been previously published in the literature as well as four machine learning classifiers including support vector machine (SVM), logistic regression with LASSO, random forest and xgboost. We report five standard measures of model performance for each approach. RESULTS A total of 1830 patients with 2145 unique ICD-10 codes were included in the initial dataset, of which 437 (24%) were true positive based on manual review. The explicit ICD-10 definition performed best (Sensitivity = 0.89 (95% CI 0.85-0.92), Specificity = 0.83 (0.81-0.85), F-score = 0.73 (0.69-0.77)) and improves on an explicit ICD-9 definition (Sensitivity = 0.87 (0.83-0.90), Specificity = 0.77 (0.74-0.79), F-score = 0.67 (0.63-0.71). Among machine learning classifiers, SVM performed best (Sensitivity = 0.78 (0.75-0.82), Specificity = 0.84 (0.81-0.87), AUC = 0.89 (0.87-0.92), F-score = 0.66 (0.62-0.69)). CONCLUSIONS An explicit ICD-10 definition can be used to accurately identify patients with a nontraumatic ICH phenotype with substantially better performance than ICD-9. An explicit ICD-10 based definition is easier to implement and quantitatively not appreciably improved with the additional application of machine learning classifiers. Future research utilizing large datasets should utilize this definition to address important research gaps.
Collapse
Affiliation(s)
- Rohit B Sangal
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - Samah Fodeh
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - Andrew Taylor
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - Craig Rothenberg
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - Emily B Finn
- Department of Internal Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - Kevin Sheth
- Department of Neurology, United States; Yale University School of Medicine, New Haven CT, United States
| | | | - Andrew Ulrich
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - Vivek Parwani
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - John Sather
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States
| | - Arjun Venkatesh
- Department of Emergency Medicine, United States; Yale University School of Medicine, New Haven CT, United States.
| |
Collapse
|
18
|
Reduction in Stroke After Transient Ischemic Attack in a Province-Wide Cohort Between 2003 and 2015. Can J Neurol Sci 2020; 48:335-343. [PMID: 32959741 DOI: 10.1017/cjn.2020.205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
BACKGROUND Improvements in management of transient ischemic attack (TIA) have decreased stroke and mortality post-TIA. Studies examining trends over time on a provincial level are limited. We analyzed whether efforts to improve management have decreased the rate of stroke and mortality after TIA from 2003 to 2015 across an entire province. METHODS Using administrative data from the Canadian Institute for Health Information's (CIHI) databases from 2003 to 2015, we identified a cohort of patients with a diagnosis of TIA upon discharge from the emergency department (ED). We examined stroke rates at Day 1, 2, 7, 30, 90, 180, and 365 post-TIA and 1-year mortality rates and compared trends over time between 2003 and 2015. RESULTS From 2003 to 2015 in Ontario, there were 61,710 patients with an ED diagnosis of TIA. Linear regressions of stroke after the index TIA showed a significant decline between 2003 and 2015, decreasing by 25% at Day 180 and 32% at 1 year (p < 0.01). The 1-year stroke rate decreased from 6.0% in 2003 to 3.4% in 2015. Early (within 48 h) stroke after TIA continued to represent approximately half of the 1-year event rates. The 1-year mortality rate after ED discharge following a TIA decreased from 1.3% in 2003 to 0.3% in 2015 (p < 0.001). INTERPRETATION At a province-wide level, 1-year rates of stroke and mortality after TIA have declined significantly between 2003 and 2015, suggesting that efforts to improve management may have contributed toward the decline in long-term risk of stroke and mortality. Continued efforts are needed to further reduce the immediate risk of stroke following a TIA.
Collapse
|
19
|
Raja A, Trivedi PD, Dhamoon MS. Discharge against medical advice among neurological patients: Characteristics and outcomes. Health Serv Res 2020; 55:681-689. [PMID: 32578887 DOI: 10.1111/1475-6773.13306] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
OBJECTIVE To study characteristics and outcomes of patients with stroke, traumatic brain injury (TBI), and epilepsy with discharge against medical advice (DAMA). DATA SOURCES/STUDY SETTING Retrospective analysis of the 2013 Nationwide Readmissions Database, a nationally representative inpatient administrative dataset. STUDY DESIGN Associations between predictors and DAMA at index admission were analyzed using adjusted logistic models. We examined 30-day all-cause readmissions. DATA COLLECTION METHODS Patients aged ≥18 years at index admission for International Classification of Diseases-9 diagnosis code of epilepsy, TBI, or stroke were included. PRINCIPAL FINDINGS Discharge against medical advice occurred in 1998/58278 patients (3.43 percent) in the epilepsy group, 1762/211 213 (0.83 percent) in the stroke group, and 1289/74 652 (1.73 percent) in the TBI group. Factors consistently associated with increased likelihood of DAMA included lower age, male sex, non-Medicare and nonprivate insurance, lower socioeconomic status, and behavioral risk factors (smoking history, alcohol history, and drug use). The crude 30-day all-cause readmission rate for those with DAMA from their index admission was 16.4 percent for the stroke cohort, 13.9 percent for epilepsy, and 13.4 percent for TBI. DAMA at index admission was significantly associated with increased risk of 30-day all-cause readmission among all groups (adjusted odds ratio 1.79, 95% CI: 1.65-1.94, P < .0001). CONCLUSIONS Age, sex, insurance status, socioeconomic status, and behavioral factors were associated with DAMA in neurological patients. Further research is needed to develop interventions to reduce DAMA in high-risk groups.
Collapse
Affiliation(s)
- Aishwarya Raja
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York
| | - Parth D Trivedi
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York
| | - Mandip S Dhamoon
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York
| |
Collapse
|
20
|
Venkataraman GR, Pineda AL, Bear Don’t Walk IV OJ, Zehnder AM, Ayyar S, Page RL, Bustamante CD, Rivas MA. FasTag: Automatic text classification of unstructured medical narratives. PLoS One 2020; 15:e0234647. [PMID: 32569327 PMCID: PMC7307763 DOI: 10.1371/journal.pone.0234647] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 05/30/2020] [Indexed: 02/07/2023] Open
Abstract
Unstructured clinical narratives are continuously being recorded as part of delivery of care in electronic health records, and dedicated tagging staff spend considerable effort manually assigning clinical codes for billing purposes. Despite these efforts, however, label availability and accuracy are both suboptimal. In this retrospective study, we aimed to automate the assignment of top-level International Classification of Diseases version 9 (ICD-9) codes to clinical records from human and veterinary data stores using minimal manual labor and feature curation. Automating top-level annotations could in turn enable rapid cohort identification, especially in a veterinary setting. To this end, we trained long short-term memory (LSTM) recurrent neural networks (RNNs) on 52,722 human and 89,591 veterinary records. We investigated the accuracy of both separate-domain and combined-domain models and probed model portability. We established relevant baseline classification performances by training Decision Trees (DT) and Random Forests (RF). We also investigated whether transforming the data using MetaMap Lite, a clinical natural language processing tool, affected classification performance. We showed that the LSTM-RNNs accurately classify veterinary and human text narratives into top-level categories with an average weighted macro F1 score of 0.74 and 0.68 respectively. In the "neoplasia" category, the model trained on veterinary data had a high validation accuracy in veterinary data and moderate accuracy in human data, with F1 scores of 0.91 and 0.70 respectively. Our LSTM method scored slightly higher than that of the DT and RF models. The use of LSTM-RNN models represents a scalable structure that could prove useful in cohort identification for comparative oncology studies. Digitization of human and veterinary health information will continue to be a reality, particularly in the form of unstructured narratives. Our approach is a step forward for these two domains to learn from and inform one another.
Collapse
Affiliation(s)
- Guhan Ram Venkataraman
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, United States of America
| | - Arturo Lopez Pineda
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, United States of America
| | - Oliver J. Bear Don’t Walk IV
- Department of Biomedical Informatics, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, United States of America
| | | | - Sandeep Ayyar
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, United States of America
| | - Rodney L. Page
- Department of Clinical Sciences, College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, United States of America
| | - Carlos D. Bustamante
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, United States of America
- Chan Zuckerberg Biohub, San Francisco, CA, United States of America
| | - Manuel A. Rivas
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, United States of America
| |
Collapse
|
21
|
Lee D, Jiang X, Yu H. Harmonized representation learning on dynamic EHR graphs. J Biomed Inform 2020; 106:103426. [DOI: 10.1016/j.jbi.2020.103426] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 02/18/2020] [Accepted: 04/19/2020] [Indexed: 11/29/2022]
|
22
|
Identification of patients with carotid stenosis using natural language processing. Eur Radiol 2020; 30:4125-4133. [DOI: 10.1007/s00330-020-06721-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Revised: 12/20/2019] [Accepted: 02/05/2020] [Indexed: 11/25/2022]
|
23
|
Faure M, Castilloux AM, Lillo-Le-Louet A, Bégaud B, Moride Y. Secondary Stroke Prevention: A Population-Based Cohort Study on Anticoagulation and Antiplatelet Treatments, and the Risk of Death or Recurrence. Clin Pharmacol Ther 2020; 107:443-451. [PMID: 31502245 DOI: 10.1002/cpt.1625] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 07/26/2019] [Indexed: 01/05/2023]
Abstract
Using claims databases of a public healthcare program (Quebec) for the years 2010-2013, we conducted a cohort study of patients with acute ischemic stroke (AIS) to describe secondary prevention treatments and determine how they stood against practice guidelines. We compared the risk of death or AIS recurrence over 1 year in patients treated with anticoagulants, antiplatelets, and/or other cardiovascular drugs. In the month after discharge, 44.3% of the patients did not receive the recommended treatment and > 20% did not have any treatment. Untreated patients were younger, had less comorbidities, and a more severe AIS. Anticoagulants and antiplatelets were associated with a reduced risk of death or recurrence (hazard ratio (HR) 0.27; 95% confidence interval (CI) 0.20-0.36 and HR 0.25; 95% CI 0.16-0.38, respectively) compared with the untreated group. Effect size was similar for the other treatments. Findings confirm treatment benefits shown in clinical trials and emphasize the importance of AIS secondary prevention.
Collapse
Affiliation(s)
- Mareva Faure
- Faculty of Pharmacy, Université de Montréal, Montréal, Québec, Canada
| | | | - Agnès Lillo-Le-Louet
- Centre Régional de Pharmacovigilance de Paris, Hôpital Européen Georges Pompidou, Paris, France
| | - Bernard Bégaud
- Département de Pharmacologie médicale, Université de Bordeaux, Bordeaux, France
| | - Yola Moride
- Faculty of Pharmacy, Université de Montréal, Montréal, Québec, Canada.,Center for Pharmacoepidemiology and Treatment Science, Rutgers, The State University of New Jersey, New Brunswick, New Jersey, USA
| |
Collapse
|
24
|
Reliability of International Classification of Disease-9 Versus International Classification of Disease-10 Coding for Proximal Femur Fractures at a Level 1 Trauma Center. J Am Acad Orthop Surg 2020; 28:29-36. [PMID: 30969187 DOI: 10.5435/jaaos-d-17-00874] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
INTRODUCTION The Centers for Medicare & Medicaid services proposed that transitioning from the 9th to the 10th revision of the International Classification of Disease (ICD) would provide better data for research. This study sought to determine the reliability of ICD-10 compared with ICD-9 for proximal femur fractures. METHODS Available imaging studies from 196 consecutively treated proximal femur fractures were retrospectively reviewed and assigned ICD codes by three physicians. Intercoder reliability (ICR) was calculated. Collectively, the physicians agreed on what should be the correct codes for each fracture, and this was compared with coding found in the medical and billing records. RESULTS No significant difference was observed in ICR for both ICD-9 and ICD-10 exact coding, which were both unreliable. Less specific coding improved ICR. ICD-9 general coding was better than ICD-10. Electronic medical record coding was unreliable. Billing codes were also unreliable, yet ICD-10 was better than ICD-9. DISCUSSION ICD-9 and ICD-10 lack reliability in coding proximal femur fractures. ICD-10 results in data that are no more reliable than those found with ICD-9. LEVEL OF EVIDENCE Level I diagnostic.
Collapse
|
25
|
Ahmed Z, Zeeshan S, Mendhe D, Dong X. Human gene and disease associations for clinical-genomics and precision medicine research. Clin Transl Med 2020; 10:297-318. [PMID: 32508008 PMCID: PMC7240856 DOI: 10.1002/ctm2.28] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Revised: 04/02/2020] [Accepted: 04/03/2020] [Indexed: 12/15/2022] Open
Abstract
We are entering the era of personalized medicine in which an individual's genetic makeup will eventually determine how a doctor can tailor his or her therapy. Therefore, it is becoming critical to understand the genetic basis of common diseases, for example, which genes predispose and rare genetic variants contribute to diseases, and so on. Our study focuses on helping researchers, medical practitioners, and pharmacists in having a broad view of genetic variants that may be implicated in the likelihood of developing certain diseases. Our focus here is to create a comprehensive database with mobile access to all available, authentic and actionable genes, SNPs, and classified diseases and drugs collected from different clinical and genomics databases worldwide, including Ensembl, GenCode, ClinVar, GeneCards, DISEASES, HGMD, OMIM, GTR, CNVD, Novoseek, Swiss-Prot, LncRNADisease, Orphanet, GWAS Catalog, SwissVar, COSMIC, WHO, and FDA. We present a new cutting-edge gene-SNP-disease-drug mobile database with a smart phone application, integrating information about classified diseases and related genes, germline and somatic mutations, and drugs. Its database includes over 59 000 protein-coding and noncoding genes; over 67 000 germline SNPs and over a million somatic mutations reported for over 19 000 protein-coding genes located in over 1000 regions, published with over 3000 articles in over 415 journals available at the PUBMED; over 80 000 ICDs; over 123 000 NDCs; and over 100 000 classified gene-SNP-disease associations. We present an application that can provide new insights into the information about genetic basis of human complex diseases and contribute to assimilating genomic with phenotypic data for the availability of gene-based designer drugs, precise targeting of molecular fingerprints for tumor, appropriate drug therapy, predicting individual susceptibility to disease, diagnosis, and treatment of rare illnesses are all a few of the many transformations expected in the decade to come.
Collapse
Affiliation(s)
- Zeeshan Ahmed
- Institute for Health, Health Care Policy and Aging Research, RutgersThe State University of New JerseyNew BrunswickNew JerseyUSA
- Department of Medicine, Rutgers Robert Wood Johnson Medical SchoolRutgers Biomedical and Health SciencesNew BrunswickNew JerseyUSA
| | - Saman Zeeshan
- Rutgers Cancer Institute of New Jersey, RutgersThe State University of New JerseyNew BrunswickNew JerseyUSA
| | - Dinesh Mendhe
- Institute for Health, Health Care Policy and Aging Research, RutgersThe State University of New JerseyNew BrunswickNew JerseyUSA
| | - XinQi Dong
- Institute for Health, Health Care Policy and Aging Research, RutgersThe State University of New JerseyNew BrunswickNew JerseyUSA
- Department of Medicine, Rutgers Robert Wood Johnson Medical SchoolRutgers Biomedical and Health SciencesNew BrunswickNew JerseyUSA
| |
Collapse
|
26
|
Tuttle KL, Wickner P. Capturing anaphylaxis through medical records: Are ICD and CPT codes sufficient? Ann Allergy Asthma Immunol 2019; 124:150-155. [PMID: 31785369 DOI: 10.1016/j.anai.2019.11.026] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Revised: 11/14/2019] [Accepted: 11/18/2019] [Indexed: 11/29/2022]
Abstract
OBJECTIVE The identification of anaphylaxis cases is imperative for optimal clinicalprovider knowledge deficiencies in diagnosis and treatment and the efficacy of reimbursement codes, such as International Classification of Diseases (ICD) and current procedural terminology (CPT) codes, in detecting anaphylaxis. DATA SOURCES Pubmed. STUDY SELECTIONS Recent and clinically relevant literature on anaphylaxis and provider knowledge, ICD, CPT, Healthcare Common Procedural Coding System (HCPCS), and E-codes were selected and reviewed. RESULTS Reimbursement codes are used to detect anaphylaxis in administrative claims databases. Inaccurate recognition of the diagnosis by providers, underreporting, and cause identification are challenges faced by health researchers using reimbursement codes for anaphylaxis case identification. Anaphylactic shock-specific ICD codes were noted to have a positive predictive value (PPV) of 52% to 53% of anaphylaxis events compared with physician chart review, which was improved to 63% to 67.3% when used in conjunction with anaphylaxis symptom-specific ICD, CPT, HCPCS, and E-codes 31, 34, and 35. CONCLUSION Education of providers to properly diagnose and treat anaphylaxis requires systematic and educational investments. The ICD codes specific to anaphylactic shock have suboptimal PPV to identify anaphylaxis in administrative claims databases. Use of algorithms incorporating other reimbursement codes improve the PPV, but they are limited by inaccurate diagnoses and underreporting of anaphylaxis. Future ICD-11 reclassification may improve anaphylaxis detection by reimbursement codes.
Collapse
Affiliation(s)
- Katherine L Tuttle
- Division of Allergy and Clinical Immunology, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts; Harvard Medical School, Boston, Massachusetts
| | - Paige Wickner
- Division of Allergy and Clinical Immunology, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts; Harvard Medical School, Boston, Massachusetts; Department of Quality and Safety, Brigham and Women's Hospital, Boston, Massachusetts.
| |
Collapse
|
27
|
George MP, Garrison GM, Merten Z, Heredia D, Gonzales C, Angstman KB. Impact of Personality Disorder Cluster on Depression Outcomes Within Collaborative Care Management Model of Care. J Prim Care Community Health 2019; 9:2150132718776877. [PMID: 29785866 PMCID: PMC5967151 DOI: 10.1177/2150132718776877] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND Previous studies have suggested that having a comorbid personality disorder (PD) along with major depression is associated with poorer depression outcomes relative to those without comorbid PD. However, few studies have examined the influence of specific PD cluster types. The purpose of the current study is to compare depression outcomes between cluster A, cluster B, and cluster C PD patients treated within a collaborative care management (CCM), relative to CCM patients without a PD diagnosis. The overarching goal was to identify cluster types that might confer a worse clinical prognosis. METHODS This retrospective chart review study examined 2826 adult patients with depression enrolled in CCM. The cohort was divided into 4 groups based on the presence of a comorbid PD diagnosis (cluster A/nonspecified, cluster B, cluster C, or no PD). Baseline clinical and demographic variables, along with 6-month follow-up Patient Health Questionnaire-9 (PHQ-9) scores were obtained for all groups. Depression remission was defined as a PHQ-9 score <5 at 6 months, and persistent depressive symptoms (PDS) was defined as a PHQ-9 score ≥10 at 6 months. Adjusted odds ratios (AORs) were determined for both remission and PDS using logistic regression modeling for the 6-month PHQ-9 outcome, while retaining all study variables. RESULTS A total of 59 patients (2.1%) had a cluster A or nonspecified PD diagnosis, 122 patients (4.3%) had a cluster B diagnosis, 35 patients (1.2%) had a cluster C diagnosis, and 2610 patients (92.4%) did not have any PD diagnosis. The presence of a cluster A/nonspecified PD diagnosis was associated with a 62% lower likelihood of remission at 6 months (AOR = 0.38; 95% CI 0.20-0.70). The presence of a cluster B PD diagnosis was associated with a 71% lower likelihood of remission at 6 months (AOR = 0.29; 95% CI 0.18-0.47). Conversely, having a cluster C diagnosis was not associated with a significantly lower likelihood of remission at 6 months (AOR = 0.83; 95% CI 0.42-1.65). Increased odds of having PDS at 6-month follow-up were seen with cluster A/nonspecified PD patients (AOR = 3.35; 95% CI 1.92-5.84) as well as cluster B patients (AOR = 3.66; 95% CI 2.45-5.47). However, cluster C patents did not have significantly increased odds of experiencing persistent depressive symptoms at 6-month follow-up (AOR = 0.95; 95% CI 0.45-2.00). CONCLUSIONS Out of the 3 clusters, the presence of a cluster B PD diagnosis was most significantly associated with poorer depression outcomes at 6-month follow-up, including reduced remission rates and increased risk for PDS. The cluster A/nonspecified PD group also showed poor outcomes; however, the heterogeneity of this subgroup with regard to PD features must be noted. The development of novel targeted interventions for at-risk clusters may be warranted in order to improve outcomes of these patients within the CCM model of care.
Collapse
|
28
|
Gentile C, Stein L, Dhamoon MS. Alcohol-Related Hospital Encounters Trigger Thrombotic and Hemorrhagic Vascular Events. J Stroke Cerebrovasc Dis 2019; 28:104395. [PMID: 31540781 DOI: 10.1016/j.jstrokecerebrovasdis.2019.104395] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2019] [Accepted: 09/04/2019] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND AND PURPOSE We investigated the associations between alcohol-related emergency department visits and hospitalizations and vascular events including acute ischemic stroke, intracerebral hemorrhage (ICH), and subarachnoid hemorrhage. METHODS The New York State Inpatient and Emergency Department Databases were examined (2006-2013). Validated International Classification of Diseases 9th edition definitions identified index vascular hospitalizations and alcohol abuse encounters. We used case cross-over analysis with conditional logistic regression to estimate odds ratios (OR) for the association between alcohol-related encounters during 6 case periods (7, 14, 30, 60, 90, and 120 days before index event) compared to control periods (1 year before). Multivariate logistic regression was used to examine the association between an alcohol-related encounter within 6 months before index admission and 30-day readmission after discharge. RESULTS An alcohol encounter before index admission was associated with acute ischemic stroke (OR = 1.765 within 60 days, 1.418 within 90 days, and 1.287 within 120 days) and subarachnoid hemorrhage (OR = 2.375 within 90 days), but not ICH. Alcohol-related encounters within 6 months before index vascular events increased the likelihood of 30-day readmission after index admission. CONCLUSION We found that a recent alcohol-related counter was associated with occurrence of vascular events, but not ICH, as well as worse outcomes after index admission.
Collapse
Affiliation(s)
- Caroline Gentile
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Laura Stein
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Mandip S Dhamoon
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, New York.
| |
Collapse
|
29
|
Zhou L, Siddiqui T, Seliger SL, Blumenthal JB, Kang Y, Doerfler R, Fink JC. Text preprocessing for improving hypoglycemia detection from clinical notes - A case study of patients with diabetes. Int J Med Inform 2019; 129:374-380. [PMID: 31445280 DOI: 10.1016/j.ijmedinf.2019.06.020] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 06/10/2019] [Accepted: 06/20/2019] [Indexed: 11/17/2022]
Abstract
BACKGROUND AND OBJECTIVE Hypoglycemia is a common safety event when attempting to optimize glycemic control in diabetes (DM). While electronic medical records provide a natural ground for detecting and analyzing hypoglycemia, ICD codes used in the databases may be invalid, insensitive or non-specific in detecting new hypoglycemic events. We developed text preprocessing methods to improve automatic detection of hypoglycemia from analysis of clinical encounter text notes. METHODS We set out to improve hypoglycemia detection from clinical notes by introducing three preprocessing methods: stop word filtering, medication signaling, and ICD narrative enrichment. To test the proposed methods, we selected clinical notes from VA Maryland Healthcare System, based on various combinations of three criteria that are suggestive of hypoglycemia, including ICD-9 code of diabetes and hypoglycemia, laboratory glucose values < 70 md/dL, and text reference to a proximate hypoglycemia event. In addition, we constructed one dataset of 395 clinical notes from year 2009 and another of 460 notes from year 2014 to test the generality of the proposed methods. For each of the datasets, two physician judges manually reviewed individual clinical notes to determine whether hypoglycemia was present or absent. A third physician judge served as a final adjudicator for disagreements. RESULTS Each of the proposed preprocessing methods contributed to the performance of hypoglycemia detection by significantly increasing the F1 score in the range of 5.3∼7.4% on one dataset (p < .01). Among the methods, stop word filtering contributed most to the performance improvement (7.4%). Combining all the preprocessing methods led to greater performance gain (p < .001) compared with using each method individually. Similar patterns were observed for the other dataset with the F1 score being increased in the range of 7.7%∼9.4% by individual methods (p < .001). Nevertheless, combining the three methods did not yield additional performance gain. CONCLUSION The proposed text preprocessing methods improved the performance of hypoglycemia detection from clinical text notes. Stop word filtering achieved the most performance improvement. ICD narrative enrichment boosted the recall of detection. Combining the three preprocessing methods led to additional performance gains.
Collapse
Affiliation(s)
- Lina Zhou
- University of North Carolina at Charlotte, Department of Business Information Systems and Operations Management, United States
| | - Tariq Siddiqui
- University of Maryland School of Medicine, Department of Medicine, United States
| | - Stephen L Seliger
- University of Maryland School of Medicine, Division of Nephrology, Department of Medicine, United States
| | - Jacob B Blumenthal
- University of Maryland School of Medicine, Division of Gerontology & Geriatric Medicine, Department of Medicine, Baltimore Geriatrics Research, Education and Clinical Center (GRECC), Baltimore Veterans Affairs and Medical Center, United States
| | - Yin Kang
- University of Maryland, Baltimore County, Department of Information Systems, United States
| | - Rebecca Doerfler
- University of Maryland School of Medicine, Department of Medicine, United States
| | - Jeffrey C Fink
- University of Maryland School of Medicine, Department of Medicine, United States.
| |
Collapse
|
30
|
Duff S, Hasbun R, Balada-Llasat JM, Zimmer L, Bozzette SA, Ginocchio CC. Economic analysis of rapid multiplex polymerase chain reaction testing for meningitis/encephalitis in adult patients. Infection 2019; 47:945-953. [PMID: 31111325 DOI: 10.1007/s15010-019-01320-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 05/11/2019] [Indexed: 01/14/2023]
Abstract
PURPOSE Many patients with suspected meningitis do not require hospitalization yet are admitted, often resulting in unnecessary care and additional cost. We assessed the possible economic impact of a rapid multiplex test for suspected adult community-acquired meningitis/encephalitis. METHODS A model simulated diagnosis, clinical decisions, resource use/costs of standard of care (SOC) and two cerebrospinal fluid (CSF) testing strategies using the FDA-cleared BioFire® FilmArray® System (FA) which provides results in approximately one hour. RESULTS Pathogens detected by FA caused approximately 74% of cases, 97% of which would be accurately diagnosed with FA. False positives and false negatives more often led to extended/unnecessary admission than inappropriate discharge/missed admission. Mean cost per case ranged from 16829 to 20791. A strategy of testing all suspected cases yielded greater savings (2213/case) than testing only those with abnormal CSF (812/case) and both were less expensive than SOC. CONCLUSION This economic analysis demonstrates that FA can inform more appropriate clinician decisions resulting in cost savings with greater economic benefits achievable with syndromic testing of all cases, rather than SOC or targeted syndromic testing.
Collapse
Affiliation(s)
- Steve Duff
- Veritas Health Economics Consulting, 8033 Corte Sasafras, Carlsbad, CA, 92009, USA.
| | | | | | | | - Samuel A Bozzette
- bioMérieux, Durham, NC, USA.,University of California, San Diego, La Jolla, CA, USA
| | - Christine C Ginocchio
- bioMérieux, Durham, NC, USA.,Zucker School of Medicine at Hofstra Northwell, Hempstead, NY, USA
| |
Collapse
|
31
|
Adelman EE, Burke JF. Can Electronic Health Records Make Quality Measurement Fast and Easy? Circ Cardiovasc Qual Outcomes 2019; 10:CIRCOUTCOMES.117.004180. [PMID: 28912203 DOI: 10.1161/circoutcomes.117.004180] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Eric E Adelman
- From the Department of Neurology, University of Wisconsin-Madison (E.E.A.); Stroke Program, University of Michigan, Ann Arbor (J.F.B.); and Department of Neurology, Veterans Affairs Health System, Ann Arbor, MI (J.F.B.)
| | - James F Burke
- From the Department of Neurology, University of Wisconsin-Madison (E.E.A.); Stroke Program, University of Michigan, Ann Arbor (J.F.B.); and Department of Neurology, Veterans Affairs Health System, Ann Arbor, MI (J.F.B.).
| |
Collapse
|
32
|
Hong C, Liao KP, Cai T. Semi‐supervised validation of multiple surrogate outcomes with application to electronic medical records phenotyping. Biometrics 2019; 75:78-89. [DOI: 10.1111/biom.12971] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2017] [Accepted: 09/14/2018] [Indexed: 01/29/2023]
Affiliation(s)
- Chuan Hong
- Department of BiostatisticsHarvard T.H. Chan School of Public HealthBostonMassachusetts
| | - Katherine P. Liao
- Division of RheumatologyBrigham and Womens HospitalBostonMassachusetts
| | - Tianxi Cai
- Department of BiostatisticsHarvard T.H. Chan School of Public HealthBostonMassachusetts
| |
Collapse
|
33
|
Ning W, Chan S, Beam A, Yu M, Geva A, Liao K, Mullen M, Mandl KD, Kohane I, Cai T, Yu S. Feature extraction for phenotyping from semantic and knowledge resources. J Biomed Inform 2019; 91:103122. [PMID: 30738949 PMCID: PMC6424621 DOI: 10.1016/j.jbi.2019.103122] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
OBJECTIVE Phenotyping algorithms can efficiently and accurately identify patients with a specific disease phenotype and construct electronic health records (EHR)-based cohorts for subsequent clinical or genomic studies. Previous studies have introduced unsupervised EHR-based feature selection methods that yielded algorithms with high accuracy. However, those selection methods still require expert intervention to tweak the parameter settings according to the EHR data distribution for each phenotype. To further accelerate the development of phenotyping algorithms, we propose a fully automated and robust unsupervised feature selection method that leverages only publicly available medical knowledge sources, instead of EHR data. METHODS SEmantics-Driven Feature Extraction (SEDFE) collects medical concepts from online knowledge sources as candidate features and gives them vector-form distributional semantic representations derived with neural word embedding and the Unified Medical Language System Metathesaurus. A number of features that are semantically closest and that sufficiently characterize the target phenotype are determined by a linear decomposition criterion and are selected for the final classification algorithm. RESULTS SEDFE was compared with the EHR-based SAFE algorithm and domain experts on feature selection for the classification of five phenotypes including coronary artery disease, rheumatoid arthritis, Crohn's disease, ulcerative colitis, and pediatric pulmonary arterial hypertension using both supervised and unsupervised approaches. Algorithms yielded by SEDFE achieved comparable accuracy to those yielded by SAFE and expert-curated features. SEDFE is also robust to the input semantic vectors. CONCLUSION SEDFE attains satisfying performance in unsupervised feature selection for EHR phenotyping. Both fully automated and EHR-independent, this method promises efficiency and accuracy in developing algorithms for high-throughput phenotyping.
Collapse
Affiliation(s)
- Wenxin Ning
- Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Stephanie Chan
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Andrew Beam
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Ming Yu
- Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Alon Geva
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA; Department of Anesthesiology, Critical Care, and Pain Medicine, Boston Children's Hospital, Boston, MA, USA; Department of Anesthesia, Harvard Medical School, Boston, MA, USA
| | - Katherine Liao
- Department of Medicine, Division of Rheumatology, Immunology and Allergy, Brigham and Women's Hospital, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Mary Mullen
- Department of Cardiology, Boston Children's Hospital, Boston, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Kenneth D Mandl
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Isaac Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Tianxi Cai
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Sheng Yu
- Center for Statistical Science, Tsinghua University, Beijing, China; Department of Industrial Engineering, Tsinghua University, Beijing, China; Institute for Data Science, Tsinghua University, Beijing, China.
| |
Collapse
|
34
|
Hernandez-Ibarburu G, Perez-Rey D, Alonso-Oset E, Alonso-Calvo R, Voets D, Mueller C, Claerhout B, Custodix NV. ICD-10-PCS extension with ICD-9 procedure codes to support integrated access to clinical legacy data. Int J Med Inform 2019; 122:70-79. [PMID: 30623787 DOI: 10.1016/j.ijmedinf.2018.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 09/26/2018] [Accepted: 11/05/2018] [Indexed: 10/27/2022]
Abstract
Since the creation of The International Classification of Diseases (ICD), new versions have been released to keep updated with the current medical knowledge. Migrations of Electronic Health Records (EHR) from ICD-9 to ICD-10-PCS as clinical procedure codification system, has been a significant challenge and involved large resources. In addition, it created new barriers for integrated access to legacy medical procedure data (frequently ICD-9 coded) with current data (frequently ICD-10-PCS coded). This work proposes a solution based on extending ICD-10-PCS with a subgroup of ICD-9-CM concepts to facilitate such integrated access. The General Equivalence Mappings (GEMs) has been used as foundation to set the terminology relations of these inserted concepts in ICD-10-PCS hierarchy, but due to the existence of 1-to-many mappings, advanced rules are required to seamlessly integrate both terminologies. With the generation of rules based on GEMs relationships, 2014 ICD-9 concepts were included within the ICD-10-PCS hierarchy. For the rest of the concepts, a new method is also proposed to increase 1-to-1 mappings. As results, with the suggested approach, the percentage of ICD-9-CM procedure concepts that can be mapped accurately (avoiding mappings to a large number of concepts) rise from 11.56% to 69.01% of ICD-9-Proc, through the extended ICD-10-PCS hierarchy.
Collapse
Affiliation(s)
- G Hernandez-Ibarburu
- Biomedical Informatics Group, Departamento de Inteligencia Artificial, ETSI Informaticos, Universidad Politecnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain.
| | - D Perez-Rey
- Biomedical Informatics Group, Departamento de Inteligencia Artificial, ETSI Informaticos, Universidad Politecnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain.
| | - E Alonso-Oset
- Biomedical Informatics Group, Departamento de Inteligencia Artificial, ETSI Informaticos, Universidad Politecnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain
| | - R Alonso-Calvo
- Biomedical Informatics Group, Departamento de Inteligencia Artificial, ETSI Informaticos, Universidad Politecnica de Madrid, 28660 Boadilla del Monte, Madrid, Spain
| | | | | | | | | |
Collapse
|
35
|
Handley JD, Emsley HC. Validation of ICD-10 codes shows intracranial venous thrombosis incidence to be higher than previously reported. Health Inf Manag 2018; 49:58-61. [PMID: 30563370 DOI: 10.1177/1833358318819105] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
BACKGROUND Intracranial venous thrombosis (ICVT) accounts for around 0.5% of all stroke cases. There have been no previously published studies of the International Classification of Diseases, Tenth Edition (ICD-10) validation for the identification of ICVT admissions in adults. OBJECTIVE The aims of this study were to validate and quantify the performance of the ICD-10 coding system for identifying cases of ICVT in adults and to derive an estimate of incidence. METHOD Administrative data were collected for all patients admitted to a regional neurosciences centre over a 5-year period. We searched for the following ICD-10 codes at any position: G08.X (intracranial and intraspinal phlebitis and thrombophlebitis), I67.6 (non-pyogenic thrombosis of intracranial venous system), I63.6 (cerebral infarction due to cerebral venous thrombosis, non-pyogenic), O22.5 (cerebral venous thrombosis in pregnancy) and O87.3 (cerebral venous thrombosis in the puerperium). RESULTS Sixty-five admissions were identified by at least one of the relevant ICD-10 codes. The overall positive predictive value (PPV) for confirmed ICVT from all of the admissions combined was 92.3% (60 out of 65) with the results for each code as follows: G08.X 91.5% (54 of 59), O22.5 100% (4 of 4), I67.6 100% (1 of 1), I63.6 100% (1 of 1) and O87.3 100% (1 of 1). There were 40 unique cases of ICVT over a 5-year period giving an annual incidence of ICVT of 5 per million. CONCLUSIONS All codes gave a high PPV. IMPLICATIONS FOR PRACTICE As demonstrated in previous studies, the incidence of ICVT may be higher than previously thought.
Collapse
|
36
|
Hasbun R, Balada-Llasat JM, Duff S. Letter to the Editor reply: economic model of the FilmArray Meningitis Encephalitis panel in children. Future Microbiol 2018; 13:1555-1556. [DOI: 10.2217/fmb-2018-0232] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Affiliation(s)
- Rodrigo Hasbun
- UTHealth McGovern Medical School; Houston, TX, 770302, USA
| | | | - Steve Duff
- Veritas Health Economics Consulting; Carlsbad, CA, 92009, USA
| |
Collapse
|
37
|
Catling F, Spithourakis GP, Riedel S. Towards automated clinical coding. Int J Med Inform 2018; 120:50-61. [PMID: 30409346 DOI: 10.1016/j.ijmedinf.2018.09.021] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Revised: 08/21/2018] [Accepted: 09/30/2018] [Indexed: 11/30/2022]
Abstract
BACKGROUND Patients' encounters with healthcare services must undergo clinical coding. These codes are typically derived from free-text notes. Manual clinical coding is expensive, time-consuming and prone to error. Automated clinical coding systems have great potential to save resources, and realtime availability of codes would improve oversight of patient care and accelerate research. Automated coding is made challenging by the idiosyncrasies of clinical text, the large number of disease codes and their unbalanced distribution. METHODS We explore methods for representing clinical text and the labels in hierarchical clinical coding ontologies. Text is represented as term frequency-inverse document frequency counts and then as word embeddings, which we use as input to recurrent neural networks. Labels are represented atomically, and then by learning representations of each node in a coding ontology and composing a representation for each label from its respective node path. We consider different strategies for initialisation of the node representations. We evaluate our methods using the publicly-available Medical Information Mart for Intensive Care III dataset: we extract the history of presenting illness section from each discharge summary in the dataset, then predicting the International Classification of Diseases, ninth revision, Clinical Modification codes associated with these. RESULTS Composing the label representations from the clinical-coding-ontology nodes increased weighted F1 for prediction of the 17,561 disease labels to 0.264-0.281 from 0.232-0.249 for atomic representations. Recurrent neural network text representation improved weighted F1 for prediction of the 19 disease-category labels to 0.682-0.701 from 0.662-0.682 using term frequency-inverse document frequency. However, term frequency-inverse document frequency outperformed recurrent neural networks for prediction of the 17,561 disease labels. CONCLUSIONS This study demonstrates that hierarchically-structured medical knowledge can be incorporated into statistical models, and produces improved performance during automated clinical coding. This performance improvement results primarily from improved representation of rarer diseases. We also show that recurrent neural networks improve representation of medical text in some settings. Learning good representations of the very rare diseases in clinical coding ontologies from data alone remains challenging, and alternative means of representing these diseases will form a major focus of future work on automated clinical coding.
Collapse
Affiliation(s)
- Finneas Catling
- University College London, Gower Street, London WC1E 6BT, UK.
| | | | | |
Collapse
|
38
|
Zhang A, Lu X, Gunter CA, Yao S, Tao F, Zhu R, Gui H, Fabbri D, Liebovitz D, Malin B. De facto diagnosis specialties: Recognition and discovery. Learn Health Syst 2018; 2:e10057. [PMID: 31245585 PMCID: PMC6508768 DOI: 10.1002/lrh2.10057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Revised: 04/10/2018] [Accepted: 04/11/2018] [Indexed: 11/17/2022] Open
Abstract
A medical specialty indicates the skills needed by health care providers to conduct key procedures or make critical judgments. However, documentation about specialties may be lacking or inaccurately specified in a health care institution. Thus, we propose to leverage diagnosis histories to recognize medical specialties that exist in practice. Such specialties that are highly recognizable through diagnosis histories are de facto diagnosis specialties. We aim to recognize de facto diagnosis specialties that are listed in the Health Care Provider Taxonomy Code Set (HPTCS) and discover those that are unlisted. First, to recognize the former, we use similarity and supervised learning models. Next, to discover de facto diagnosis specialties unlisted in the HPTCS, we introduce a general discovery-evaluation framework. In this framework, we use a semi-supervised learning model and an unsupervised learning model, from which the discovered specialties are subsequently evaluated by the similarity and supervised learning models used in recognition. To illustrate the potential for these approaches, we collect 2 data sets of 1 year of diagnosis histories from a large academic medical center: One is a subset of the other except for additional information useful for network analysis. The results indicate that 12 core de facto diagnosis specialties listed in the HPTCS are highly recognizable. Additionally, the semi-supervised learning model discovers a specialty for breast cancer on the smaller data set based on network analysis, while the unsupervised learning model confirms this discovery and suggests an additional specialty for Obesity on the larger data set. The potential correctness of these 2 specialties is reinforced by the evaluation results that they are highly recognizable by similarity and supervised learning models in comparison with 12 core de facto diagnosis specialties listed in the HPTCS.
Collapse
Affiliation(s)
- Aston Zhang
- Department of Computer ScienceUniversity of Illinois at Urbana‐ChampaignUrbanaIllinois
| | - Xun Lu
- Department of Computer ScienceUniversity of Illinois at Urbana‐ChampaignUrbanaIllinois
| | - Carl A. Gunter
- Department of Computer ScienceUniversity of Illinois at Urbana‐ChampaignUrbanaIllinois
| | - Shuochao Yao
- Department of Computer ScienceUniversity of Illinois at Urbana‐ChampaignUrbanaIllinois
| | - Fangbo Tao
- Department of Computer ScienceUniversity of Illinois at Urbana‐ChampaignUrbanaIllinois
| | - Rongda Zhu
- Department of Computer ScienceUniversity of Illinois at Urbana‐ChampaignUrbanaIllinois
| | - Huan Gui
- Department of Computer ScienceUniversity of Illinois at Urbana‐ChampaignUrbanaIllinois
| | - Daniel Fabbri
- Department of Biomedical InformaticsVanderbilt UniversityNashvilleTennessee
| | | | - Bradley Malin
- Department of Biomedical InformaticsVanderbilt UniversityNashvilleTennessee
| |
Collapse
|
39
|
Pavon JM, Sloane RJ, Pieper CF, Colón-Emeric CS, Cohen HJ, Gallagher D, Morey MC, McCarty M, Ortel TL, Hastings SN. Automated versus Manual Data Extraction of the Padua Prediction Score for Venous Thromboembolism Risk in Hospitalized Older Adults. Appl Clin Inform 2018; 9:743-751. [PMID: 30257260 PMCID: PMC6158031 DOI: 10.1055/s-0038-1670678] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Accepted: 08/09/2018] [Indexed: 10/28/2022] Open
Abstract
OBJECTIVE Venous thromboembolism (VTE) prophylaxis is an important consideration for hospitalized older adults, and the Padua Prediction Score (PPS) is a risk prediction tool used to prioritize patient selection. We developed an automated PPS (APPS) algorithm using electronic health record (EHR) data. This study examines the accuracy of APPS and its individual components versus manual data extraction. METHODS This is a retrospective cohort study of hospitalized general internal medicine patients, aged 70 and over. Fourteen clinical variables were collected to determine their PPS; APPS used EHR data exports from health system databases, and a trained abstractor performed manual chart abstractions. We calculated sensitivity and specificity of the APPS, using manual PPS as the gold standard for classifying risk category (low vs. high). We also examined performance characteristics of the APPS for individual variables. RESULTS PPS was calculated by both methods on 311 individuals. The mean PPS was 3.6 (standard deviation, 1.8) for manual abstraction and 2.8 (1.4) for APPS. In detecting patients at high risk for VTE, the sensitivity and specificity of the APPS algorithm were 46 and 94%, respectively. The sensitivity for APPS was poor (range: 6-34%) for detecting acute conditions (i.e., acute myocardial infarction), moderate (range: 52-74%) for chronic conditions (i.e., heart failure), and excellent (range: 94-98%) for conditions of obesity and restricted mobility. Specificity of the automated extraction method for each PPS variable was > 87%. CONCLUSION APPS as a stand-alone tool was suboptimal for classifying risk of VTE occurrence. The APPS accurately identified high risk patients (true positives), but lower scores were considered indeterminate.
Collapse
Affiliation(s)
- Juliessa M. Pavon
- Duke University, Durham, North Carolina, United States
- Geriatric Research Education Clinical Center, Durham Veteran Affairs Medical Center, Durham, North Carolina, United States
- Duke University Claude D. Pepper Center, Duke University, Durham, North Carolina, United States
| | - Richard J. Sloane
- Duke University, Durham, North Carolina, United States
- Geriatric Research Education Clinical Center, Durham Veteran Affairs Medical Center, Durham, North Carolina, United States
- Duke University Claude D. Pepper Center, Duke University, Durham, North Carolina, United States
| | - Carl F. Pieper
- Duke University, Durham, North Carolina, United States
- Geriatric Research Education Clinical Center, Durham Veteran Affairs Medical Center, Durham, North Carolina, United States
- Duke University Claude D. Pepper Center, Duke University, Durham, North Carolina, United States
| | - Cathleen S. Colón-Emeric
- Duke University, Durham, North Carolina, United States
- Geriatric Research Education Clinical Center, Durham Veteran Affairs Medical Center, Durham, North Carolina, United States
- Duke University Claude D. Pepper Center, Duke University, Durham, North Carolina, United States
| | - Harvey J. Cohen
- Duke University, Durham, North Carolina, United States
- Geriatric Research Education Clinical Center, Durham Veteran Affairs Medical Center, Durham, North Carolina, United States
- Duke University Claude D. Pepper Center, Duke University, Durham, North Carolina, United States
| | | | - Miriam C. Morey
- Duke University, Durham, North Carolina, United States
- Geriatric Research Education Clinical Center, Durham Veteran Affairs Medical Center, Durham, North Carolina, United States
- Duke University Claude D. Pepper Center, Duke University, Durham, North Carolina, United States
| | | | | | - Susan N. Hastings
- Duke University, Durham, North Carolina, United States
- Geriatric Research Education Clinical Center, Durham Veteran Affairs Medical Center, Durham, North Carolina, United States
- Duke University Claude D. Pepper Center, Duke University, Durham, North Carolina, United States
- Health Services Research and Development Center of Innovation, Durham Veterans Affairs Health Care System, Durham, North Carolina, United States
| |
Collapse
|
40
|
Baldereschi M, Balzi D, Di Fabrizio V, De Vito L, Ricci R, D’Onofrio P, Di Carlo A, Mechi MT, Bellomo F, Inzitari D. Administrative data underestimate acute ischemic stroke events and thrombolysis treatments: Data from a multicenter validation survey in Italy. PLoS One 2018. [PMID: 29534079 PMCID: PMC5849308 DOI: 10.1371/journal.pone.0193776] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Background Informing health systems and monitoring hospital performances using administrative data sets, mainly hospital discharge data coded according to International-Classification-Diseases-9edition-Clinical-Modifiers (ICD9-CM), is now commonplace in several countries, but the reliability of diagnostic coding of acute ischemic stroke in the routine practice is uncertain. This study aimed at estimating accuracy of ICD9-CM codes for the identification of acute ischemic stroke and the use of thrombolysis treatment comparing hospital discharge data with medical record review in all the six hospitals of the Florence Area, Italy, through 2015. Methods We reviewed the medical records of all the 3915 potential acute stroke events during 2015 across the six hospitals of the Florence Area, Italy. We then estimated sensitivity and Positive Predictive Value of ICD9-CM code-groups 433*1, 434*1 and thrombolysis code 99.10 against medical record review with clinical adjudication. For each false-positive case we obtained the actual diagnosis. For each false-negative case we obtained the primary and secondary ICD9-CM diagnoses. Results The medical record review identified 1273 acute ischemic stroke events. The hospital discharge records identified 898 among those (true-positive cases),but missed 375 events (false-negative cases), and identified 104 events that were not eventually confirmed as acute ischemic events (false-positive cases). Code-group specific Positive Predictive Value was 85.7% (95%CI,74.6–93.3) for 433*1 and 89.9% (95%CI, 87.8–91.7) for 434*1 codes. Thrombolysis treatment, as identified by ICD9-CM code 99.10, was only documented in 6.0% of acute ischemic stroke events, but was 13.6% in medical record review. Conclusions Hospital discharge data were found to be fairly specific but insensitive in the reporting of acute ischemic stroke and thrombolysis, providing misleading indications about both quantity and quality of acute ischemic stroke hospital care. Efforts to improve coding accuracy should precede the use of hospital discharge data to measure hospital performances in acute ischemic stroke care.
Collapse
Affiliation(s)
- Marzia Baldereschi
- Institute of Neuroscience, Italian National Research Council, Florence, Italy
- * E-mail:
| | | | | | - Lucia De Vito
- Emergency Medical Services, Regione Toscana, Florence, Italy
| | | | | | - Antonio Di Carlo
- Institute of Neuroscience, Italian National Research Council, Florence, Italy
| | | | | | - Domenico Inzitari
- Department of Neurology, Pharmacology and Pediatrics Department (Neurofarba), University of Florence, Florence, Italy
| |
Collapse
|
41
|
Dong YH, Chang CH, Wu LC, Hwang JS, Toh S. Comparative cardiovascular safety of nonsteroidal anti-inflammatory drugs in patients with hypertension: a population-based cohort study. Br J Clin Pharmacol 2018; 84:1045-1056. [PMID: 29468706 DOI: 10.1111/bcp.13537] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2017] [Revised: 01/04/2018] [Accepted: 01/10/2018] [Indexed: 12/24/2022] Open
Abstract
AIMS Previous studies have suggested that nonsteroidal anti-inflammatory drugs (NSAIDs) may be associated with higher cardiovascular risks. However, few have been active comparison studies that directly assessed the potential differential cardiovascular risk between NSAID classes or across individual NSAIDs. We compared the risk of major cardiovascular events between cyclooxygenase 2 (COX-2)-selective and nonselective NSAIDs in patients with hypertension. METHODS We conducted a cohort study of patients with hypertension who initiated COX-2-selective or nonselective NSAIDs in a population-based Taiwanese database. The outcomes included hospitalization for the following major cardiovascular events: ischaemic stroke, acute myocardial infarction, congestive heart failure, transient ischaemic attack, unstable angina or coronary revascularization. We followed patients for up to 4 weeks, based on the as-treated principle. We used inverse probability weighting to control for baseline and time-varying covariates, and estimated the on-treatment hazard ratios (HRs) and 95% conservative confidence interval (CIs). RESULTS We identified 2749 eligible COX-2-selective NSAID users and 52 880 eligible nonselective NSAID users. The HR of major cardiovascular events comparing COX-2-selective with nonselective NSAIDs after adjusting for baseline and time-varying covariates was 1.07 (95% CI 0.65, 1.74). We did not observe a differential risk when comparing celecoxib to diclofenac (HR 1.17; 95% CI 0.61, 2.25), ibuprofen (HR 1.36; 95% CI 0.58, 3.18) or naproxen (HR 0.75; 95% CI 0.23, 2.44). There was an increased risk with COX-2-selective NSAIDs, however, when comparing COX-2-selective NSAIDs with mefenamic acid (HR 2.11; 95% CI 1.09, 4.09). CONCLUSIONS Our results provide important information about the comparative cardiovascular safety of NSAIDs in patients with hypertension.
Collapse
Affiliation(s)
- Yaa-Hui Dong
- Faculty of Pharmacy, School of Pharmaceutical Science, National Yang-Ming University, Taipei, 112, Taiwan
| | - Chia-Hsuin Chang
- Department of Medicine, College of Medicine, National Taiwan University Hospital, Taipei, 100, Taiwan.,Graduate Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, 100, Taiwan
| | - Li-Chiu Wu
- Department of Medicine, College of Medicine, National Taiwan University Hospital, Taipei, 100, Taiwan
| | - Jing-Shiang Hwang
- Institute of Statistical Science, Academia Sinica, Taipei, 115, Taiwan
| | - Sengwee Toh
- Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA, 02215, USA
| |
Collapse
|
42
|
Furlan L, Solbiati M, Pacetti V, Dipaola F, Meda M, Bonzi M, Fiorelli E, Cernuschi G, Alberio D, Casazza G, Montano N, Furlan R, Costantino G. Diagnostic accuracy of ICD-9 code 780.2 for the identification of patients with syncope in the emergency department. Clin Auton Res 2018; 28:577-582. [PMID: 29435866 DOI: 10.1007/s10286-018-0509-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Accepted: 01/31/2018] [Indexed: 11/29/2022]
Abstract
PURPOSE Syncope is a common condition that affects individuals of all ages and is responsible for 1-3% of all emergency department (ED) visits. Prospective studies on syncope are often limited by the exiguous number of subjects enrolled. A possible alternative approach would be to use of hospital discharge diagnoses from administrative databases to identify syncope subjects in epidemiological observational studies. We assessed the accuracy of the International Classification of Diseases, Ninth Revision (ICD-9) code 780.2 "syncope and collapse" to identify patients with syncope. METHODS Patients in two teaching hospitals in Milan, Italy with a triage assessment for ED access that was possibly related to syncope were recruited in this study. We considered the index test to be the attribution of the ICD-9 code 780.2 at ED discharge and the reference standard to be the diagnosis of syncope by the ED physician. RESULTS The sensitivity, specificity, positive and negative predictive values of the ICD-9 code 780.2 to identify patients with syncope were 0.63 (95% confidence interval [CI] 0.58-0.67), 0.98 (95% CI 0.98-0.99), 0.83 (95% CI 0.79-0.87) and 0.95 (95% CI 0.94-0.95), respectively. CONCLUSIONS The moderate sensitivity of ICD-9 code 780.2 should be considered when the code is used to identify patients with syncope through administrative databases.
Collapse
Affiliation(s)
- Ludovico Furlan
- Dipartimento di Medicina Interna, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122, Milan, Italy. .,Dipartimento di Scienze Cliniche e di Comunità, Università degli Studi di Milano, Milan, Italy.
| | - Monica Solbiati
- Dipartimento di Medicina Interna, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122, Milan, Italy.,Dipartimento di Scienze Cliniche e di Comunità, Università degli Studi di Milano, Milan, Italy
| | - Veronica Pacetti
- Department of Biomedical Sciences, Humanitas University-Humanitas Research Hospital, Rozzano, Italy
| | - Franca Dipaola
- Department of Biomedical Sciences, Humanitas University-Humanitas Research Hospital, Rozzano, Italy
| | - Martino Meda
- Unità Operativa di Cardiologia, Istituto Scientifico Ospedale San Luca, Milan, Italy.,Università degli Studi di Milano-Bicocca, Milan, Italy
| | - Mattia Bonzi
- Dipartimento di Medicina Interna, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122, Milan, Italy.,Dipartimento di Scienze Cliniche e di Comunità, Università degli Studi di Milano, Milan, Italy
| | - Elisa Fiorelli
- Dipartimento di Medicina Interna, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122, Milan, Italy.,Dipartimento di Scienze Cliniche e di Comunità, Università degli Studi di Milano, Milan, Italy
| | - Giulia Cernuschi
- Dipartimento di Medicina Interna, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122, Milan, Italy.,Dipartimento di Scienze Cliniche e di Comunità, Università degli Studi di Milano, Milan, Italy
| | - Daniele Alberio
- Health Information Management, Humanitas Research Hospital, Rozzano, Italy
| | - Giovanni Casazza
- Dipartimento di Scienze Biomediche e Cliniche "L. Sacco", Università degli Studi di Milano, Milan, Italy
| | - Nicola Montano
- Dipartimento di Medicina Interna, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122, Milan, Italy.,Dipartimento di Scienze Cliniche e di Comunità, Università degli Studi di Milano, Milan, Italy
| | - Raffaello Furlan
- Department of Biomedical Sciences, Humanitas University-Humanitas Research Hospital, Rozzano, Italy
| | - Giorgio Costantino
- Dipartimento di Medicina Interna, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Via Francesco Sforza 35, 20122, Milan, Italy
| |
Collapse
|
43
|
Yu S, Chakrabortty A, Liao KP, Cai T, Ananthakrishnan AN, Gainer VS, Churchill SE, Szolovits P, Murphy SN, Kohane IS, Cai T. Surrogate-assisted feature extraction for high-throughput phenotyping. J Am Med Inform Assoc 2018; 24:e143-e149. [PMID: 27632993 DOI: 10.1093/jamia/ocw135] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2016] [Accepted: 08/17/2016] [Indexed: 12/18/2022] Open
Abstract
Objective Phenotyping algorithms are capable of accurately identifying patients with specific phenotypes from within electronic medical records systems. However, developing phenotyping algorithms in a scalable way remains a challenge due to the extensive human resources required. This paper introduces a high-throughput unsupervised feature selection method, which improves the robustness and scalability of electronic medical record phenotyping without compromising its accuracy. Methods The proposed Surrogate-Assisted Feature Extraction (SAFE) method selects candidate features from a pool of comprehensive medical concepts found in publicly available knowledge sources. The target phenotype's International Classification of Diseases, Ninth Revision and natural language processing counts, acting as noisy surrogates to the gold-standard labels, are used to create silver-standard labels. Candidate features highly predictive of the silver-standard labels are selected as the final features. Results Algorithms were trained to identify patients with coronary artery disease, rheumatoid arthritis, Crohn's disease, and ulcerative colitis using various numbers of labels to compare the performance of features selected by SAFE, a previously published automated feature extraction for phenotyping procedure, and domain experts. The out-of-sample area under the receiver operating characteristic curve and F -score from SAFE algorithms were remarkably higher than those from the other two, especially at small label sizes. Conclusion SAFE advances high-throughput phenotyping methods by automatically selecting a succinct set of informative features for algorithm training, which in turn reduces overfitting and the needed number of gold-standard labels. SAFE also potentially identifies important features missed by automated feature extraction for phenotyping or experts.
Collapse
Affiliation(s)
- Sheng Yu
- Center for Statistical Science, Tsinghua University, Beijing, China.,Department of Industrial Engineering, Tsinghua University, Beijing, China
| | - Abhishek Chakrabortty
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Katherine P Liao
- Division of Rheumatology, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Tianrun Cai
- Department of Radiology, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | | | - Vivian S Gainer
- Research IS and Computing, Partners HealthCare, Charlestown, Massachusetts, USA
| | - Susanne E Churchill
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Peter Szolovits
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Shawn N Murphy
- Research IS and Computing, Partners HealthCare, Charlestown, Massachusetts, USA.,Department of Neurology, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Tianxi Cai
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| |
Collapse
|
44
|
Chang VW, Langa KM, Weir D, Iwashyna TJ. The obesity paradox and incident cardiovascular disease: A population-based study. PLoS One 2017; 12:e0188636. [PMID: 29216243 PMCID: PMC5720539 DOI: 10.1371/journal.pone.0188636] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2017] [Accepted: 11/11/2017] [Indexed: 12/01/2022] Open
Abstract
Background Prior work suggests that obesity may confer a survival advantage among persons with cardiovascular disease (CVD). This obesity “paradox” is frequently studied in the context of prevalent disease, a stage in the disease process when confounding from illness-related weight loss and selective survival are especially problematic. Our objective was to examine the association of obesity with mortality among persons with incident CVD, where biases are potentially reduced, and to compare these findings with those based on prevalent disease. Methods We used data from the Health and Retirement Study, an ongoing, nationally representative longitudinal survey of U.S. adults age 50 years and older initiated in 1992 and linked to Medicare claims. Cox proportional hazard models were used to estimate the association between weight status and mortality among persons with specific CVD diagnoses. CVD diagnoses were established by self-reported survey data as well as Medicare claims. Prevalent disease models used concurrent weight status, and incident disease models used pre-diagnosis weight status. Results We examined myocardial infarction, congestive heart failure, stroke, and ischemic heart disease. A strong and significant obesity paradox was consistently observed in prevalent disease models (hazard of death 18–36% lower for obese class I relative to normal weight), replicating prior findings. However, in incident disease models of the same conditions in the same dataset, there was no evidence of this survival benefit. Findings from models using survey- vs. claims-based diagnoses were largely consistent. Conclusion We observed an obesity paradox in prevalent CVD, replicating prior findings in a population-based sample with longer-term follow-up. In incident CVD, however, we did not find evidence of a survival advantage for obesity. Our findings do not offer support for reevaluating clinical and public health guidelines in pursuit of a potential obesity paradox.
Collapse
Affiliation(s)
- Virginia W. Chang
- Department of Social and Behavioral Sciences, College of Global Public Health, New York University, New York, New York, United States of America
- Department of Population Health, School of Medicine, New York University, New York, New York, United States of America
- * E-mail:
| | - Kenneth M. Langa
- Department of Medicine, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
- Veterans Affairs Center for Clinical Management Research, Ann Arbor, Michigan, United States of America
- Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor, Michigan, United States of America
| | - David Weir
- Institute for Social Research, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Theodore J. Iwashyna
- Department of Medicine, University of Michigan Medical School, Ann Arbor, Michigan, United States of America
- Veterans Affairs Center for Clinical Management Research, Ann Arbor, Michigan, United States of America
- Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
45
|
Jordan LC, Bhattacharya PD. Stroke after trauma in children and young adults: Some opportunity for primary prevention. Neurology 2017; 89:2306-2307. [PMID: 29117958 DOI: 10.1212/wnl.0000000000004723] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Affiliation(s)
- Lori C Jordan
- From the Department of Pediatrics, Division of Pediatric Neurology (L.C.J.), Vanderbilt University Medical Center, Nashville, TN; and Department of Neurology (P.D.B.), Saint Joseph Mercy Oakland, Pontiac, MI.
| | - Pratik D Bhattacharya
- From the Department of Pediatrics, Division of Pediatric Neurology (L.C.J.), Vanderbilt University Medical Center, Nashville, TN; and Department of Neurology (P.D.B.), Saint Joseph Mercy Oakland, Pontiac, MI
| |
Collapse
|
46
|
Fox CK, Hills NK, Vinson DR, Numis AL, Dicker RA, Sidney S, Fullerton HJ. Population-based study of ischemic stroke risk after trauma in children and young adults. Neurology 2017; 89:2310-2316. [PMID: 29117963 PMCID: PMC5719927 DOI: 10.1212/wnl.0000000000004708] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Accepted: 08/16/2017] [Indexed: 12/27/2022] Open
Abstract
OBJECTIVE To quantify the incidence, timing, and risk of ischemic stroke after trauma in a population-based young cohort. METHODS We electronically identified trauma patients (<50 years old) from a population enrolled in a Northern Californian integrated health care delivery system (1997-2011). Within this cohort, we identified cases of arterial ischemic stroke within 4 weeks of trauma and 3 controls per case. A physician panel reviewed medical records, confirmed cases, and adjudicated whether the stroke was related to trauma. We calculated the 4-week stroke incidence and estimated stroke odds ratios (OR) by injury location using logistic regression. RESULTS From 1,308,009 trauma encounters, we confirmed 52 trauma-related ischemic strokes. The 4-week stroke incidence was 4.0 per 100,000 encounters (95% confidence interval [CI] 3.0-5.2). Trauma was multisystem in 26 (50%). In 19 (37%), the stroke occurred on the day of trauma, and all occurred within 15 days. In 7/28 cases with cerebrovascular angiography at the time of trauma, no abnormalities were detected. In unadjusted analyses, head, neck, chest, back, and abdominal injuries increased stroke risk. Only head (OR 4.1, CI 1.1-14.9) and neck (OR 5.6, CI 1.03-30.9) injuries remained associated with stroke after adjusting for demographics and trauma severity markers (multisystem trauma, motor vehicle collision, arrival by ambulance, intubation). CONCLUSIONS Stroke risk is elevated for 2 weeks after trauma. Onset is frequently delayed, providing an opportunity for stroke prevention during this period. However, in one-quarter of stroke cases with cerebrovascular angiography at the time of trauma, no vascular abnormality was detected.
Collapse
Affiliation(s)
- Christine K Fox
- From the Departments of Neurology (C.K.F., A.L.N., H.J.F.), Pediatrics (C.K.F., A.L.N., H.J.F.), Epidemiology and Biostatistics (N.K.H.), and Surgery (R.A.D.), University of California, San Francisco; the Division of Research (D.R.V., S.S.), Kaiser Permanente Northern California, Oakland; and the Department of Emergency Medicine (D.R.V.), Kaiser Permanente Sacramento Medical Center, Sacramento, CA.
| | - Nancy K Hills
- From the Departments of Neurology (C.K.F., A.L.N., H.J.F.), Pediatrics (C.K.F., A.L.N., H.J.F.), Epidemiology and Biostatistics (N.K.H.), and Surgery (R.A.D.), University of California, San Francisco; the Division of Research (D.R.V., S.S.), Kaiser Permanente Northern California, Oakland; and the Department of Emergency Medicine (D.R.V.), Kaiser Permanente Sacramento Medical Center, Sacramento, CA
| | - David R Vinson
- From the Departments of Neurology (C.K.F., A.L.N., H.J.F.), Pediatrics (C.K.F., A.L.N., H.J.F.), Epidemiology and Biostatistics (N.K.H.), and Surgery (R.A.D.), University of California, San Francisco; the Division of Research (D.R.V., S.S.), Kaiser Permanente Northern California, Oakland; and the Department of Emergency Medicine (D.R.V.), Kaiser Permanente Sacramento Medical Center, Sacramento, CA
| | - Adam L Numis
- From the Departments of Neurology (C.K.F., A.L.N., H.J.F.), Pediatrics (C.K.F., A.L.N., H.J.F.), Epidemiology and Biostatistics (N.K.H.), and Surgery (R.A.D.), University of California, San Francisco; the Division of Research (D.R.V., S.S.), Kaiser Permanente Northern California, Oakland; and the Department of Emergency Medicine (D.R.V.), Kaiser Permanente Sacramento Medical Center, Sacramento, CA
| | - Rochelle A Dicker
- From the Departments of Neurology (C.K.F., A.L.N., H.J.F.), Pediatrics (C.K.F., A.L.N., H.J.F.), Epidemiology and Biostatistics (N.K.H.), and Surgery (R.A.D.), University of California, San Francisco; the Division of Research (D.R.V., S.S.), Kaiser Permanente Northern California, Oakland; and the Department of Emergency Medicine (D.R.V.), Kaiser Permanente Sacramento Medical Center, Sacramento, CA
| | - Stephen Sidney
- From the Departments of Neurology (C.K.F., A.L.N., H.J.F.), Pediatrics (C.K.F., A.L.N., H.J.F.), Epidemiology and Biostatistics (N.K.H.), and Surgery (R.A.D.), University of California, San Francisco; the Division of Research (D.R.V., S.S.), Kaiser Permanente Northern California, Oakland; and the Department of Emergency Medicine (D.R.V.), Kaiser Permanente Sacramento Medical Center, Sacramento, CA
| | - Heather J Fullerton
- From the Departments of Neurology (C.K.F., A.L.N., H.J.F.), Pediatrics (C.K.F., A.L.N., H.J.F.), Epidemiology and Biostatistics (N.K.H.), and Surgery (R.A.D.), University of California, San Francisco; the Division of Research (D.R.V., S.S.), Kaiser Permanente Northern California, Oakland; and the Department of Emergency Medicine (D.R.V.), Kaiser Permanente Sacramento Medical Center, Sacramento, CA
| |
Collapse
|
47
|
Ammann EM, Leira EC, Winiecki SK, Nagaraja N, Dandapat S, Carnahan RM, Schweizer ML, Torner JC, Fuller CC, Leonard CE, Garcia C, Pimentel M, Chrischilles EA. Chart validation of inpatient ICD-9-CM administrative diagnosis codes for ischemic stroke among IGIV users in the Sentinel Distributed Database. Medicine (Baltimore) 2017; 96:e9440. [PMID: 29384925 PMCID: PMC6392785 DOI: 10.1097/md.0000000000009440] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Revised: 11/08/2017] [Accepted: 12/01/2017] [Indexed: 01/25/2023] Open
Abstract
The Sentinel Distributed Database (SDD) is a large database of patient-level medical and prescription records, primarily derived from insurance claims and electronic health records, and is sponsored by the U.S. Food and Drug Administration for drug safety assessments. In this chart validation study, we report on the positive predictive value (PPV) of inpatient ICD-9-CM acute ischemic stroke (AIS) administrative diagnosis codes (433.x1, 434.xx, and 436) in the SDD.As part of an assessment of the risk of thromboembolic adverse events following treatment with intravenous immune globulin (IGIV), charts were obtained for 131 potential post-IGIV AIS cases. Charts were abstracted by trained nurses and then adjudicated by stroke experts using pre-specified diagnostic criteria.Case status could be determined for 128 potential AIS cases, of which 34 were confirmed. The PPVs for the inpatient AIS diagnoses recorded in the SDD were 27% overall [95% confidence interval (95% CI): 19-35], 60% (95% CI: 32-84) for principal-position diagnoses, 42% (95% CI: 28-57) for secondary diagnoses, and 6% (95% CI: 2-15) for position-unspecified diagnoses (which in the SDD generally originate from separate physician claims associated with an inpatient stay).Position-unspecified diagnoses were unlikely to represent true AIS cases. PPVs for principal and secondary inpatient diagnosis codes were higher, but still meaningfully lower than estimates from prior chart validation studies. The low PPVs may be specific to the IGIV user study population. Additional research is needed to assess the validity of AIS administrative diagnosis codes in other study populations within the SDD.
Collapse
Affiliation(s)
- Eric M. Ammann
- College of Public Health, University of Iowa, Iowa City, IA
| | - Enrique C. Leira
- College of Public Health, University of Iowa, Iowa City, IA
- Carver College of Medicine, University of Iowa, Iowa City, IA
| | - Scott K. Winiecki
- Center for Drug Evaluation and Research, Food and Drug Administration, Silver Spring, MD
| | | | | | | | - Marin L. Schweizer
- Carver College of Medicine, University of Iowa, Iowa City, IA
- Iowa City VA Health Care System, Iowa City, IA
| | | | - Candace C. Fuller
- Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA
| | - Charles E. Leonard
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Crystal Garcia
- Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA
| | - Madelyn Pimentel
- Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA
| | | |
Collapse
|
48
|
Graiser M, Moore SG, Victor R, Hilliard A, Hill L, Keehan MS, Flowers CR. Development of Query Strategies to Identify a Histologic Lymphoma Subtype in a Large Linked Database System. Cancer Inform 2017. [DOI: 10.1177/117693510700300017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Background Large linked databases (LLDB) represent a novel resource for cancer outcomes research. However, accurate means of identifying a patient population of interest within these LLDBs can be challenging. Our research group developed a fully integrated platform that provides a means of combining independent legacy databases into a single cancer-focused LLDB system. We compared the sensitivity and specificity of several SQL-based query strategies for identifying a histologic lymphoma subtype in this LLDB to determine the most accurate legacy data source for identifying a specific cancer patient population. Methods Query strategies were developed to identify patients with follicular lymphoma from a LLDB of cancer registry data, electronic medical records (EMR), laboratory, administrative, pharmacy, and other clinical data. Queries were performed using common diagnostic codes (ICD-9), cancer registry histology codes (ICD-O), and text searches of EMRs. We reviewed medical records and pathology reports to confirm each diagnosis and calculated the sensitivity and specificity for each query strategy. Results Together the queries identified 1538 potential cases of follicular lymphoma. Review of pathology and other medical reports confirmed 415 cases of follicular lymphoma, 300 pathology-verified and 115 verified from other medical reports. The query using ICD-O codes was highly specific (96%). Queries using text strings varied in sensitivity (range 7–92%) and specificity (range 86–99%). Queries using ICD-9 codes were both less sensitive (34–44%) and specific (35–87%). Conclusions Queries of linked-cancer databases that include cancer registry data should utilize ICD-O codes or employ structured free-text searches to identify patient populations with a precise histologic diagnosis.
Collapse
Affiliation(s)
- Michael Graiser
- Emory University School of Medicine, Winship Cancer Institute, Oncology Informatics, 1365 Clifton Road, N.E., Atlanta, GA, U.S.A
| | - Susan G. Moore
- Emory University School of Medicine, Winship Cancer Institute, Oncology Informatics, 1365 Clifton Road, N.E., Atlanta, GA, U.S.A
| | - Rochelle Victor
- Emory University School of Medicine, Winship Cancer Institute, Oncology Informatics, 1365 Clifton Road, N.E., Atlanta, GA, U.S.A
| | - Ashley Hilliard
- Emory University School of Medicine, Winship Cancer Institute, Oncology Informatics, 1365 Clifton Road, N.E., Atlanta, GA, U.S.A
| | - Leroy Hill
- Emory University School of Medicine, Winship Cancer Institute, Oncology Informatics, 1365 Clifton Road, N.E., Atlanta, GA, U.S.A
| | | | - Christopher R. Flowers
- Emory University School of Medicine, Winship Cancer Institute, Oncology Informatics, 1365 Clifton Road, N.E., Atlanta, GA, U.S.A
| |
Collapse
|
49
|
Jin P, Matos Diaz I, Stein L, Thaler A, Tuhrim S, Dhamoon MS. Intermediate risk of cardiac events and recurrent stroke after stroke admission in young adults. Int J Stroke 2017; 13:576-584. [DOI: 10.1177/1747493017733929] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Background In older adults with stroke, there is an increased risk of cardiovascular events in the intermediate period, up to one year after stroke. The risk of cardiovascular events in this period in young adults after stroke has not been studied. We hypothesized that in the intermediate risk period, young adults with ischemic stroke have an increased risk of recurrent stroke and a smaller increase of cardiac events. Methods Using the National Readmissions Database during the year 2013, we identified ischemic stroke admissions among those aged 18–45 years using International Classification of Disease, Ninth Revision, Clinical Modification codes to identify index vascular events and risk factors. Primary outcomes were readmission for cardiac events and stroke. Multivariable Cox proportional hazard models and Kaplan–Meier analysis were used to estimate risk of primary outcomes. Results We identified 12,392 young adults with index stroke. The readmission rate due to recurrent stroke was higher than for cardiac events (2913.3.1 vs. 1132.4 per 100,000 index hospitalizations at 90 days). There was a higher cumulative risk of both cardiac events and recurrent stroke in the presence of baseline diabetes and hypercholesterolemia. Conclusion In a large, nationally representative database, the intermediate risk of recurrent stroke after index stroke in young adults was higher than the risk of cardiac events. The presence of vascular risk factors augmented this risk but did not entirely account for it. The aggressive control of hypercholesterolemia and diabetes may play an important role in secondary prevention in young adults with stroke.
Collapse
Affiliation(s)
- Peter Jin
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Ivan Matos Diaz
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Laura Stein
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Alison Thaler
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Stanley Tuhrim
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, USA
| | - Mandip S Dhamoon
- Department of Neurology, Icahn School of Medicine at Mount Sinai, New York, USA
| |
Collapse
|
50
|
Yu AYX, Quan H, McRae A, Wagner GO, Hill MD, Coutts SB. Moderate sensitivity and high specificity of emergency department administrative data for transient ischemic attacks. BMC Health Serv Res 2017; 17:666. [PMID: 28923103 PMCID: PMC5604304 DOI: 10.1186/s12913-017-2612-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Accepted: 09/11/2017] [Indexed: 11/11/2022] Open
Abstract
Background Validation of administrative data case definitions is key for accurate passive surveillance of disease. Transient ischemic attack (TIA) is a condition primarily managed in the emergency department. However, prior validation studies have focused on data after inpatient hospitalization. We aimed to determine the validity of the Canadian 10th International Classification of Diseases (ICD-10-CA) codes for TIA in the national ambulatory administrative database. Methods We performed a diagnostic accuracy study of four ICD-10-CA case definition algorithms for TIA in the emergency department setting. The study population was obtained from two ongoing studies on the diagnosis of TIA and minor stroke versus stroke mimic using serum biomarkers and neuroimaging. Two reference standards were used 1) the emergency department clinical diagnosis determined by chart abstractors and 2) the 90-day final diagnosis, both obtained by stroke neurologists, to calculate the sensitivity, specificity, positive and negative predictive values (PPV and NPV) of the ICD-10-CA algorithms for TIA. Results Among 417 patients, emergency department adjudication showed 163 (39.1%) TIA, 155 (37.2%) ischemic strokes, and 99 (23.7%) stroke mimics. The most restrictive algorithm, defined as a TIA code in the main position had the lowest sensitivity (36.8%), but highest specificity (92.5%) and PPV (76.0%). The most inclusive algorithm, defined as a TIA code in any position with and without query prefix had the highest sensitivity (63.8%), but lowest specificity (81.5%) and PPV (68.9%). Sensitivity, specificity, PPV, and NPV were overall lower when using the 90-day diagnosis as reference standard. Conclusions Emergency department administrative data reflect diagnosis of suspected TIA with high specificity, but underestimate the burden of disease. Future studies are necessary to understand the reasons for the low to moderate sensitivity. Electronic supplementary material The online version of this article (10.1186/s12913-017-2612-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Amy Y X Yu
- Department of Clinical Neurosciences, Community Health Sciences, Cumming School of Medicine, University of Calgary, Health Sciences Centre, Office 2935-B, 3300 Hospital Drive NW, Calgary, AB, T2N 4N1, Canada.
| | - Hude Quan
- Department of Community Health Sciences, O'Brien Institute for Public Health, Cumming School of Medicine, University of Calgary, Heritage Medical Research Building 3330 Hospital Drive NW, Calgary, AB, T2N 4N1, Canada
| | - Andrew McRae
- Department of Emergency Medicine, Community Health Sciences, Cumming School of Medicine, University of Calgary, Foothills Campus, 3330 Hospital Drive NW, Calgary, AB, T2N 4N1, Canada
| | - Gabrielle O Wagner
- Department of Clinical Neurosciences, Community Health Sciences, Cumming School of Medicine, University of Calgary, Health Sciences Centre, Office 2935-B, 3300 Hospital Drive NW, Calgary, AB, T2N 4N1, Canada
| | - Michael D Hill
- Departments of Clinical Neurosciences, Community Health Sciences, Medicine, Radiology, and Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Health Sciences Centre, Office 2939, 3300 Hospital Drive NW, Calgary, AB, T2N 4N1, Canada
| | - Shelagh B Coutts
- Department of Clinical Neurosciences, Radiology, Community Health Sciences, Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, C1242A, Foothills Medical Centre, 1403 29th St NW, Calgary, AB, T2N 2T9, Canada
| |
Collapse
|