Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Uno H, Ritzwoller DP, Cronin AM, Carroll NM, Hornbrook MC, Hassett MJ. Determining the Time of Cancer Recurrence Using Claims or Electronic Medical Record Data. JCO Clin Cancer Inform 2019;2:1-10. [PMID: 30652573 DOI: 10.1200/cci.17.00163] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

For:	Uno H, Ritzwoller DP, Cronin AM, Carroll NM, Hornbrook MC, Hassett MJ. Determining the Time of Cancer Recurrence Using Claims or Electronic Medical Record Data. JCO Clin Cancer Inform 2019;2:1-10. [PMID: 30652573 DOI: 10.1200/cci.17.00163] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Number

Cited by Other Article(s)

Wen J, Hou J, Bonzel CL, Zhao Y, Castro VM, Gainer VS, Weisenfeld D, Cai T, Ho YL, Panickan VA, Costa L, Hong C, Gaziano JM, Liao KP, Lu J, Cho K, Cai T. LATTE: Label-efficient incident phenotyping from longitudinal electronic health records. PATTERNS (NEW YORK, N.Y.) 2024;5:100906. [PMID: 38264714 PMCID: PMC10801250 DOI: 10.1016/j.patter.2023.100906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 09/06/2023] [Accepted: 12/01/2023] [Indexed: 01/25/2024]

Hong C, Liang L, Yuan Q, Cho K, Liao KP, Pencina MJ, Christiani DC, Cai T. Semi-supervised calibration of noisy event risk (SCANER) with electronic health records. J Biomed Inform 2023;144:104425. [PMID: 37331495 PMCID: PMC10478159 DOI: 10.1016/j.jbi.2023.104425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Revised: 05/05/2023] [Accepted: 05/19/2023] [Indexed: 06/20/2023]

Abstract

OBJECTIVE

Electronic health records (EHR), containing detailed longitudinal clinical information on a large number of patients and covering broad patient populations, open opportunities for comprehensive predictive modeling of disease progression and treatment response. However, since EHRs were originally constructed for administrative purposes not for research, in the EHR-linked studies, it is often not feasible to capture reliable information for analytical variables, especially in the survival setting, when both accurate event status and event times are needed for model building. For example, progression-free survival (PFS), a commonly used survival outcome for cancer patients, often involves complex information embedded in free-text clinical notes and cannot be extracted reliably. Proxies of PFS time such as time to the first mention of progression in the notes are at best good approximations to the true event time. This leads to difficulty in efficiently estimating event rates for an EHR patient cohort. Estimating survival rates based on error-prone outcome definitions can lead to biased results and hamper the power in the downstream analysis. On the other hand, extracting accurate event time information via manual annotation is time and resource intensive. The objective of this study is to develop a calibrated survival rate estimator using noisy outcomes from EHR data.

MATERIALS AND METHODS

In this paper, we propose a two-stage semi-supervised calibration of noisy event rate (SCANER) estimator that can effectively overcome censoring induced dependency and attains more robust performance (i.e., not sensitive to misspecification of the imputation model) by fully utilizing both a small-labeled set of gold-standard survival outcomes annotated via manual chart review and a set of proxy features automatically captured via EHR in the unlabeled set. We validate the SCANER estimator by estimating the PFS rates for a virtual cohort of lung cancer patients from one large tertiary care center and the ICU-free survival rates for COVID patients from two large tertiary care centers.

RESULTS

In terms of survival rate estimates, the SCANER had very similar point estimates compared to the complete-case Kaplan Meier estimator. On the other hand, other benchmark methods for comparison, which fail to account for the induced dependency between event time and the censoring time conditioning on surrogate outcomes, produced biased results across all three case studies. In terms of standard errors, the SCANER estimator was more efficient than the KM estimator, with up to 50% efficiency gain.

CONCLUSION

The SCANER estimator achieves more efficient, robust, and accurate survival rate estimates compared to existing approaches. This promising new approach can also improve the resolution (i.e., granularity of event time) by using labels conditioning on multiple surrogates, particularly among less common or poorly coded conditions.

Collapse

Ahuja Y, Liang L, Zhou D, Huang S, Cai T. Semisupervised Calibration of Risk with Noisy Event Times (SCORNET) using electronic health record data. Biostatistics 2023;24:760-775. [PMID: 35166342 PMCID: PMC10544799 DOI: 10.1093/biostatistics/kxac003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Revised: 01/18/2022] [Accepted: 01/24/2022] [Indexed: 01/19/2023] Open

Hou J, Chan SF, Wang X, Cai T. Risk prediction with imperfect survival outcome information from electronic health records. Biometrics 2023;79:190-202. [PMID: 34747010 PMCID: PMC9741856 DOI: 10.1111/biom.13599] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 10/28/2021] [Accepted: 10/29/2021] [Indexed: 12/14/2022]

Beyrer J, Nelson DR, Sheffield KM, Huang YJ, Lau YK, Hincapie AL. Development and Validation of Coding Algorithms to Identify Patients with Incident Non-Small Cell Lung Cancer in United States Healthcare Claims Data. Clin Epidemiol 2023;15:73-89. [PMID: 36659903 PMCID: PMC9842515 DOI: 10.2147/clep.s389824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 12/23/2022] [Indexed: 01/13/2023] Open

Rasmussen LA, Christensen NL, Winther-Larsen A, Dalton SO, Virgilsen LF, Jensen H, Vedsted P. A Validated Register-Based Algorithm to Identify Patients Diagnosed with Recurrence of Surgically Treated Stage I Lung Cancer in Denmark. Clin Epidemiol 2023;15:251-261. [PMID: 36890800 PMCID: PMC9986467 DOI: 10.2147/clep.s396738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 02/15/2023] [Indexed: 03/04/2023] Open

Ahuja Y, Wen J, Hong C, Xia Z, Huang S, Cai T. A semi-supervised adaptive Markov Gaussian embedding process (SAMGEP) for prediction of phenotype event times using the electronic health record. Sci Rep 2022;12:17737. [PMID: 36273240 PMCID: PMC9588081 DOI: 10.1038/s41598-022-22585-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Accepted: 10/17/2022] [Indexed: 01/18/2023] Open

Liang L, Hou J, Uno H, Cho K, Ma Y, Cai T. Semi-supervised approach to event time annotation using longitudinal electronic health records. LIFETIME DATA ANALYSIS 2022;28:428-491. [PMID: 35753014 PMCID: PMC10044535 DOI: 10.1007/s10985-022-09557-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 05/13/2022] [Indexed: 06/15/2023]

Khair S, Dort JC, Quan ML, Cheung WY, Sauro KM, Nakoneshny SC, Popowich BL, Liu P, Wu G, Xu Y. Validated algorithms for identifying timing of second event of oropharyngeal squamous cell carcinoma using real-world data. Head Neck 2022;44:1909-1917. [PMID: 35653151 DOI: 10.1002/hed.27109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 04/29/2022] [Accepted: 05/18/2022] [Indexed: 11/07/2022] Open

Affiliation(s)

Shahreen Khair Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
Joseph C Dort Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada
May Lynn Quan Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada.,Department of Oncology, Cumming School of Medicine, University of Calgary, Tom Baker, Cancer Centre, Calgary, Alberta, Canada
Winson Y Cheung Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada
Khara M Sauro Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada.,Department of Oncology, Cumming School of Medicine, University of Calgary, Tom Baker, Cancer Centre, Calgary, Alberta, Canada
Steven C Nakoneshny The Ohlson Research Initiative, Arnie Charbonneau Cancer Institute, University of Calgary, Calgary, Alberta, Canada
Brittany Lynn Popowich Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Teaching Research and Wellness (TRW), Calgary, Alberta, Canada
Ping Liu Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
Guosong Wu Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Teaching Research and Wellness (TRW), Calgary, Alberta, Canada
Yuan Xu Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.,Department of Surgery, Cumming School of Medicine, University of Calgary, North Tower, Foothills Medical Centre, Calgary, Alberta, Canada.,Department of Oncology, Cumming School of Medicine, University of Calgary, Tom Baker, Cancer Centre, Calgary, Alberta, Canada.,Centre for Health Informatics, Cumming School of Medicine, University of Calgary, Teaching Research and Wellness (TRW), Calgary, Alberta, Canada

Collapse

Ritzwoller DP, Hassett MJ, Uno H. Regarding the Utility of Unstructured Data and Natural Language Processing for Identification of Breast Cancer Recurrence. JCO Clin Cancer Inform 2021;5:1024-1025. [PMID: 34637320 PMCID: PMC9848577 DOI: 10.1200/cci.21.00091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 08/20/2021] [Indexed: 01/23/2023] Open

Caswell-Jin JL, Callahan A, Purington N, Han SS, Itakura H, John EM, Blayney DW, Sledge GW, Shah NH, Kurian AW. Treatment and Monitoring Variability in US Metastatic Breast Cancer Care. JCO Clin Cancer Inform 2021;5:600-614. [PMID: 34043432 DOI: 10.1200/cci.21.00031] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

PURPOSE

Treatment and monitoring options for patients with metastatic breast cancer (MBC) are increasing, but little is known about variability in care. We sought to improve understanding of MBC care and its correlates by analyzing real-world claims data using a search engine with a novel query language to enable temporal electronic phenotyping.

METHODS

Using the Advanced Cohort Engine, we identified 6,180 women who met criteria for having estrogen receptor-positive, human epidermal growth factor receptor 2-negative MBC from IBM MarketScan US insurance claims (2007-2014). We characterized treatment, monitoring, and hospice usage, along with clinical and nonclinical factors affecting care.

RESULTS

We observed wide variability in treatment modality and monitoring across patients and geography. Most women received first-recorded therapy with endocrine (67%) versus chemotherapy, underwent more computed tomography (CT) (76%) than positron emission tomography-CT, and were monitored using tumor markers (58%). Nearly half (46%) met criteria for aggressive disease, which were associated with receiving chemotherapy first, monitoring primarily with CT, and more frequent imaging. Older age was associated with endocrine therapy first, less frequent imaging, and less use of tumor markers. After controlling for clinical factors, care strategies varied significantly by nonclinical factors (median regional income with first-recorded therapy and imaging type, geographic region with these and with imaging frequency and use of tumor markers; P < .0001).

CONCLUSION

Variability in US MBC care is explained by patient and disease factors and by nonclinical factors such as geographic region, suggesting that treatment decisions are influenced by local practice patterns and/or resources. A search engine designed to express complex electronic phenotypes from longitudinal patient records enables the identification of variability in patient care, helping to define disparities and areas for improvement.

Collapse

Izci H, Tambuyzer T, Tuand K, Depoorter V, Laenen A, Wildiers H, Vergote I, Van Eycken L, De Schutter H, Verdoodt F, Neven P. A Systematic Review of Estimating Breast Cancer Recurrence at the Population Level With Administrative Data. J Natl Cancer Inst 2021;112:979-988. [PMID: 32259259 DOI: 10.1093/jnci/djaa050] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Revised: 03/20/2020] [Accepted: 03/31/2020] [Indexed: 12/18/2022] Open

Abstract

BACKGROUND

Exact numbers of breast cancer recurrences are currently unknown at the population level, because they are challenging to actively collect. Previously, real-world data such as administrative claims have been used within expert- or data-driven (machine learning) algorithms for estimating cancer recurrence. We present the first systematic review and meta-analysis, to our knowledge, of publications estimating breast cancer recurrence at the population level using algorithms based on administrative data.

METHODS

The systematic literature search followed Preferred Reporting Items for Systematic Reviews and Meta-Analysis guidelines. We evaluated and compared sensitivity, specificity, positive predictive value, negative predictive value, and overall accuracy of algorithms. A random-effects meta-analysis was performed using a generalized linear mixed model to obtain a pooled estimate of accuracy.

RESULTS

Seventeen articles met the inclusion criteria. Most articles used information from medical files as the gold standard, defined as any recurrence. Two studies included bone metastases only in the definition of recurrence. Fewer studies used a model-based approach (decision trees or logistic regression) (41.2%) compared with studies using detection rules without specified model (58.8%). The generalized linear mixed model for all recurrence types reported an accuracy of 92.2% (95% confidence interval = 88.4% to 94.8%).

CONCLUSIONS

Publications reporting algorithms for detecting breast cancer recurrence are limited in number and heterogeneous. A thorough analysis of the existing algorithms demonstrated the need for more standardization and validation. The meta-analysis reported a high accuracy overall, which indicates algorithms as promising tools to identify breast cancer recurrence at the population level. The rule-based approach combined with emerging machine learning algorithms could be interesting to explore in the future.

Collapse

Grabner M, Molife C, Wang L, Winfree KB, Cui ZL, Cuyun Carter G, Hess LM. Data Integration to Improve Real-world Health Outcomes Research for Non-Small Cell Lung Cancer in the United States: Descriptive and Qualitative Exploration. JMIR Cancer 2021;7:e23161. [PMID: 33843600 PMCID: PMC8076987 DOI: 10.2196/23161] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 01/29/2021] [Accepted: 02/01/2021] [Indexed: 12/20/2022] Open

Abstract

Background

The integration of data from disparate sources could help alleviate data insufficiency in real-world studies and compensate for the inadequacies of single data sources and short-duration, small sample size studies while improving the utility of data for research.

Objective

This study aims to describe and evaluate a process of integrating data from several complementary sources to conduct health outcomes research in patients with non–small cell lung cancer (NSCLC). The integrated data set is also used to describe patient demographics, clinical characteristics, treatment patterns, and mortality rates.

Methods

This retrospective cohort study integrated data from 4 sources: administrative claims from the HealthCore Integrated Research Database, clinical data from a Cancer Care Quality Program (CCQP), clinical data from abstracted medical records (MRs), and mortality data from the US Social Security Administration. Patients with lung cancer who initiated second-line (2L) therapy between November 01, 2015, and April 13, 2018, were identified in the claims and CCQP data. Eligible patients were 18 years or older and received atezolizumab, docetaxel, erlotinib, nivolumab, pembrolizumab, pemetrexed, or ramucirumab in the 2L setting. The main analysis cohort included patients with claims data and data from at least one additional data source (CCQP or MR). Patients without integrated data (claims only) were reported separately. Descriptive and univariate statistics were reported.

Results

Data integration resulted in a main analysis cohort of 2195 patients with NSCLC; 2106 patients had CCQP and 407 patients had MR data. The claims-only cohort included 931 eligible patients. For the main analysis cohort, the mean age was 62.1 (SD 9.27) years, 48.56% (1066/2195) were female, the median length of follow-up was 6.8 months, and for 37.77% (829/2195), death was observed. For the claims-only cohort, the mean age was 66.6 (SD 12.69) years, 52.1% (485/931) were female, the median length of follow-up was 8.6 months, and for 29.3% (273/931), death was observed. The most frequent 2L treatment was immunotherapy (1094/2195, 49.84%), followed by platinum-based regimens (472/2195, 21.50%) and single-agent chemotherapy (441/2195, 20.09%); mean duration of 2L therapy was 5.6 (SD 4.9, median 4) months. We describe challenges and learnings from the data integration process, and the benefits of the integrated data set, which includes a richer set of clinical and outcome data to supplement the utilization metrics available in administrative claims.

Conclusions

The management of patients with NSCLC requires care from a multidisciplinary team, leading to a lack of a single aggregated data source in real-world settings. The availability of integrated clinical data from MRs, health plan claims, and other sources of clinical care may improve the ability to assess emerging treatments.

Collapse

Rasmussen LA, Jensen H, Virgilsen LF, Jeppesen MM, Blaakaer J, Hansen DG, Jensen PT, Mogensen O, Vedsted P. Identification of endometrial cancer recurrence - a validated algorithm based on nationwide Danish registries. Acta Oncol 2021;60:452-458. [PMID: 33306454 DOI: 10.1080/0284186x.2020.1859133] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Abstract

INTRODUCTION

Recurrence of endometrial cancer is not routinely registered in the Danish national health registers. The aim of this study was to develop and validate a register-based algorithm to identify women diagnosed with endometrial cancer recurrence in Denmark to facilitate register-based research in this field.

MATERIAL AND METHODS

We conducted a cohort study based on data from Danish health registers. The algorithm was designed to identify women with recurrence and estimate the accompanying diagnosis date, which was based on information from the Danish National Patient Registry and the Danish National Pathology Registry. Indicators of recurrence were pathology registrations and procedure or diagnosis codes suggesting recurrence and related treatment. The gold standard for endometrial cancer recurrence originated from a Danish nationwide study of 2612 women diagnosed with endometrial cancer, FIGO stage I-II during 2005-2009. Recurrence was suspected in 308 women based on pathology reports, and recurrence suspicion was confirmed or rejected in the 308 women based on reviews of the medical records. The algorithm was validated by comparing the recurrence status identified by the algorithm and the recurrence status in the gold standard.

RESULTS

After relevant exclusions, the final study population consisted of 268 women, hereof 160 (60%) with recurrence according to the gold standard. The algorithm displayed a sensitivity of 91.3% (95% confidence interval (CI): 85.8-95.1), a specificity of 91.7% (95% CI: 84.8-96.1) and a positive predictive value of 94.2% (95% CI: 89.3-97.3). The algorithm estimated the recurrence date within 30 days of the gold standard in 86% and within 60 days of the gold standard in 94% of the identified patients.

DISCUSSION

The algorithm demonstrated good performance; it could be a valuable tool for future research in endometrial cancer recurrence and may facilitate studies with potential impact on clinical practice.

Collapse

Rasmussen LA, Jensen H, Virgilsen LF, Hölmich LR, Vedsted P. A Validated Register-Based Algorithm to Identify Patients Diagnosed with Recurrence of Malignant Melanoma in Denmark. Clin Epidemiol 2021;13:207-214. [PMID: 33758549 PMCID: PMC7979354 DOI: 10.2147/clep.s295844] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 02/18/2021] [Indexed: 11/23/2022] Open

Kohane IS, Aronow BJ, Avillach P, Beaulieu-Jones BK, Bellazzi R, Bradford RL, Brat GA, Cannataro M, Cimino JJ, García-Barrio N, Gehlenborg N, Ghassemi M, Gutiérrez-Sacristán A, Hanauer DA, Holmes JH, Hong C, Klann JG, Loh NHW, Luo Y, Mandl KD, Daniar M, Moore JH, Murphy SN, Neuraz A, Ngiam KY, Omenn GS, Palmer N, Patel LP, Pedrera-Jiménez M, Sliz P, South AM, Tan ALM, Taylor DM, Taylor BW, Torti C, Vallejos AK, Wagholikar KB, Weber GM, Cai T. What Every Reader Should Know About Studies Using Electronic Health Record Data but May Be Afraid to Ask. J Med Internet Res 2021;23:e22219. [PMID: 33600347 PMCID: PMC7927948 DOI: 10.2196/22219] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 09/14/2020] [Accepted: 01/10/2021] [Indexed: 12/13/2022] Open

Affiliation(s)

Isaac S Kohane Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Bruce J Aronow Biomedical Informatics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH, United States
Paul Avillach Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Brett K Beaulieu-Jones Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Riccardo Bellazzi Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.,ICS Maugeri, Pavia, Italy
Robert L Bradford North Carolina Translational and Clinical Sciences Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
Gabriel A Brat Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Mario Cannataro Data Analytics Research Center, University Magna Graecia of Catanzaro, Catanzaro, Italy.,Department of Medical and Surgical Sciences, University Magna Graecia of Catanzaro, Catanzaro, Italy
James J Cimino Informatics Institute, University of Alabama at Birmingham, Birmingham, AL, United States
Noelia García-Barrio Department of Informatics, 12 de Octubre University Hospital, Madrid, Spain
Nils Gehlenborg Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Marzyeh Ghassemi Department of Computer Science and Medicine, University of Toronto, Toronto, ON, Canada
Alba Gutiérrez-Sacristán Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
David A Hanauer Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, MI, United States
John H Holmes Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
Chuan Hong Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Jeffrey G Klann Department of Medicine, Harvard Medical School, Boston, MA, United States.,Laboratory of Computer Science, Massachusetts General Hospital, Boston, MA, United States
Ne Hooi Will Loh National University Health Systems, Singapore, Singapore
Yuan Luo Department of Preventive Medicine, Northwestern University, Chicago, IL, United States
Kenneth D Mandl Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
Mohamad Daniar Clinical Research Informatics, Boston Children's Hospital, Boston, MA, United States
Jason H Moore Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, United States
Shawn N Murphy Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States.,Department of Neurology, Massachusetts General Hospital, Boston, MA, United States
Antoine Neuraz Department of Biomedical Informatics, Necker-Enfant Malades Hospital, Assistance Publique - Hôpitaux de Paris, Paris, France.,Centre de Recherche des Cordeliers, INSERM UMRS 1138 Team 22, Université de Paris, Paris, France
Kee Yuan Ngiam National University Health Systems, Singapore, Singapore
Gilbert S Omenn Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, United States
Nathan Palmer Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Lav P Patel Department of Internal Medicine, Division of Medical Informatics, University of Kansas Medical Center, Kansas City, KS, United States
Miguel Pedrera-Jiménez Department of Informatics, 12 de Octubre University Hospital, Madrid, Spain
Piotr Sliz Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States
Andrew M South Section of Nephrology, Department of Pediatrics, Brenner Children's Hospital, Wake Forest School of Medicine, Winston Salem, NC, United States
Amelia Li Min Tan Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States.,Department of Biomedical Informatics, National University of Singapore, Singapore, Singapore
Deanne M Taylor Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia, Philadelphia, PA, United States.,Department of Pediatrics, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, United States
Bradley W Taylor Clinical and Translational Science Institute, Medical College of Wisconsin, Milwaukee, WI, United States
Carlo Torti Department of Medical and Surgical Sciences, University Magna Graecia of Catanzaro, Catanzaro, Italy
Andrew K Vallejos Clinical and Translational Science Institute, Medical College of Wisconsin, Milwaukee, WI, United States
Kavishwar B Wagholikar Department of Medicine, Harvard Medical School, Boston, MA, United States.,Laboratory of Computer Science, Massachusetts General Hospital, Boston, MA, United States
See Acknowledgments,
Griffin M Weber Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States
Tianxi Cai Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States

Collapse

Beyrer J, Nelson DR, Sheffield KM, Huang YJ, Ellington T, Hincapie AL. Development and validation of coding algorithms to identify patients with incident lung cancer in United States healthcare claims data. Pharmacoepidemiol Drug Saf 2020;29:1465-1479. [PMID: 33012044 DOI: 10.1002/pds.5137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 09/01/2020] [Accepted: 09/09/2020] [Indexed: 11/11/2022]

Carroll NM, Ritzwoller DP, Banegas MP, O'Keeffe-Rosetti M, Cronin AM, Uno H, Hornbrook MC, Hassett MJ. Performance of Cancer Recurrence Algorithms After Coding Scheme Switch From International Classification of Diseases 9th Revision to International Classification of Diseases 10th Revision. JCO Clin Cancer Inform 2020;3:1-9. [PMID: 30869998 DOI: 10.1200/cci.18.00113] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

PURPOSE

We previously developed and validated informatic algorithms that used International Classification of Diseases 9th revision (ICD9)-based diagnostic and procedure codes to detect the presence and timing of cancer recurrence (the RECUR Algorithms). In 2015, ICD10 replaced ICD9 as the worldwide coding standard. To understand the impact of this transition, we evaluated the performance of the RECUR Algorithms after incorporating ICD10 codes.

METHODS

Using publicly available translation tables along with clinician and other expertise, we updated the algorithms to include ICD10 codes as additional input variables. We evaluated the performance of the algorithms using gold standard recurrence measures associated with a contemporary cohort of patients with stage I to III breast, colorectal, and lung (excluding IIIB) cancer and derived performance measures, including the area under the receiver operating curve, average absolute prediction error, and correct classification rate. These values were compared with the performance measures derived from the validation of the original algorithms.

RESULTS

A total of 659 colorectal, 280 lung, and 2,053 breast cancer cases were identified. Area under the receiver operating curve derived from the updated algorithms was 89.0% (95% CI, 82.3% to 95.7%), 88.9% (95% CI, 79.3% to 98.2%), and 80.5% (95% CI, 72.8% to 88.2%) for the colorectal, lung, and breast cancer algorithms, respectively. Average absolute prediction errors for recurrence timing were 2.7 (SE, 11.3%), 2.4 (SE, 10.4%), and 5.6 months (SE, 21.8%), respectively, and timing estimates were within 6 months of actual recurrence for more than 80% of colorectal, more than 90% of lung, and more than 50% of breast cancer cases using the updated algorithm.

CONCLUSION

Performance measures derived from the updated and original algorithms had overlapping confidence intervals, suggesting that the ICD9 to ICD10 transition did not affect the RECUR Algorithm performance.

Collapse

Kehl KL, Hassett MJ, Schrag D. Patterns of care for older patients with stage IV non-small cell lung cancer in the immunotherapy era. Cancer Med 2020;9:2019-2029. [PMID: 31989786 PMCID: PMC7064091 DOI: 10.1002/cam4.2854] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 12/19/2019] [Accepted: 01/05/2020] [Indexed: 12/26/2022] Open

Abstract

BACKGROUND

Historically, older patients with advanced lung cancer have often received no systemic treatment. Immunotherapy has improved outcomes in clinical trials, but its dissemination and implementation at the population level is not well-understood.

METHODS

A retrospective cohort study of patients with stage IV non-small cell lung cancer (NSCLC) diagnosed age 66 or older from 2012 to 2015 was conducted using SEER-Medicare. Treatment patterns within one year of diagnosis were ascertained. Outcomes included delivery of (a) any systemic therapy; (b) any second-line infusional therapy, following first-line infusional therapy; and (c) any second-line immunotherapy, following first-line infusional therapy. Trends in care patterns associated with second-line immunotherapy approvals in 2015 were assessed using generalized additive models. Sociodemographic and clinical predictors of treatment were explored using logistic regression.

RESULTS

Among 10 303 patients, 5173 (50.2%) received first-line systemic therapy, with little change between the years 2012 (47.5%) and 2015 (50.3%). Among 3943 patients completing first-line infusional therapy, the proportion starting second-line infusional treatment remained stable from 2012 (30.5%) through 2014 (32.9%), before increasing in 2015 (42.4%) concurrent with second-line immunotherapy approvals. Factors associated with decreased utilization of any therapy included age, black race, Medicaid eligibility, residence in a high-poverty area, nonadenocarcinoma histology, and comorbidity; factors associated with increased utilization of any therapy included Asian race and Hispanic ethnicity. Among patients who received first-line infusional therapy, factors associated with decreased utilization of second-line infusional therapy included age, Medicaid eligibility, nonadenocarcinoma histology, and comorbidity; Asian race was associated with increased utilization of second-line infusional therapy.

CONCLUSION

United States Food and Drug Administration (FDA) approvals of immunotherapy for the second-line treatment of advanced NSCLC in 2015 were associated with increased rates of any second-line treatment, but disparities based on social determinants of health persisted.

Collapse