Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kehl KL, Elmarakeby H, Nishino M, Van Allen EM, Lepisto EM, Hassett MJ, Johnson BE, Schrag D. Assessment of Deep Natural Language Processing in Ascertaining Oncologic Outcomes From Radiology Reports. JAMA Oncol 2019;5:1421-1429. [PMID: 31343664 DOI: 10.1001/jamaoncol.2019.1800] [Citation(s) in RCA: 73] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

For:	Kehl KL, Elmarakeby H, Nishino M, Van Allen EM, Lepisto EM, Hassett MJ, Johnson BE, Schrag D. Assessment of Deep Natural Language Processing in Ascertaining Oncologic Outcomes From Radiology Reports. JAMA Oncol 2019;5:1421-1429. [PMID: 31343664 DOI: 10.1001/jamaoncol.2019.1800] [Citation(s) in RCA: 73] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Number

Cited by Other Article(s)

Murnan AW, Tscholl JJ, Ganta R, Duah HO, Qasem I, Sezgin E. Identification of Child Survivors of Sex Trafficking From Electronic Health Records: An Artificial Intelligence Guided Approach. CHILD MALTREATMENT 2024;29:601-611. [PMID: 37545138 PMCID: PMC11000265 DOI: 10.1177/10775595231194599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]

Shah AT, Blanchard I, Padda SK, Wakelee HA, Neal JW. Molecular Characteristics and Pretreatment Neutrophil-to-Lymphocyte Ratio as Predictors of Durable Clinical Benefit from Immune Checkpoint Inhibition in Non-Small Cell Lung Cancer. Clin Lung Cancer 2024;25:550-559. [PMID: 38987048 PMCID: PMC11365775 DOI: 10.1016/j.cllc.2024.06.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 06/06/2024] [Accepted: 06/15/2024] [Indexed: 07/12/2024]

Riaz IB, Khan MA, Haddad TC. Potential application of artificial intelligence in cancer therapy. Curr Opin Oncol 2024;36:437-448. [PMID: 39007164 DOI: 10.1097/cco.0000000000001068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]

Loor-Torres R, Wu Y, Esteban Cabezas, Borras-Osorio M, Toro-Tobon D, Duran M, Al Zahidy M, Mateo Chavez M, Soto Jacome C, Fan JW, Singh Ospina NM, Wu Y, Brito JP. Use of Natural Language Processing to Extract and Classify Papillary Thyroid Cancer Features From Surgical Pathology Reports. Endocr Pract 2024:S1530-891X(24)00657-8. [PMID: 39197747 DOI: 10.1016/j.eprac.2024.08.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 08/13/2024] [Accepted: 08/20/2024] [Indexed: 09/01/2024]

Abstract

BACKGROUND

We aim to use Natural Language Processing to automate the extraction and classification of thyroid cancer risk factors from pathology reports.

METHODS

We analyzed 1410 surgical pathology reports from adult papillary thyroid cancer patients from 2010 to 2019. Structured and nonstructured reports were used to create a consensus-based ground truth dictionary and categorized them into modified recurrence risk levels. Nonstructured reports were narrative, while structured reports followed standardized formats. We developed ThyroPath, a rule-based Natural Language Processing pipeline, to extract and classify thyroid cancer features into risk categories. Training involved 225 reports (150 structured, 75 unstructured), with testing on 170 reports (120 structured, 50 unstructured) for evaluation. The pipeline's performance was assessed using both strict and lenient criteria for accuracy, precision, recall, and F1-score; a metric that combines precision and recall evaluation.

RESULTS

In extraction tasks, ThyroPath achieved overall strict F-1 scores of 93% for structured reports and 90% for unstructured reports, covering 18 thyroid cancer pathology features. In classification tasks, ThyroPath-extracted information demonstrated an overall accuracy of 93% in categorizing reports based on their corresponding guideline-based risk of recurrence: 76.9% for high-risk, 86.8% for intermediate risk, and 100% for both low and very low-risk cases. However, ThyroPath achieved 100% accuracy across all risk categories with human extracted pathology information.

CONCLUSIONS

ThyroPath shows promise in automating the extraction and risk recurrence classification of thyroid pathology reports at large scale. It offers a solution to laborious manual reviews and advancing virtual registries. However, it requires further validation before implementation.

Collapse

Affiliation(s)

Ricardo Loor-Torres Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota
Yuqi Wu Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, Minnesota
Esteban Cabezas Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota
Mariana Borras-Osorio Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota
David Toro-Tobon Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Mayo Clinic, Rochester, Minnesota
Mayra Duran Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota
Misk Al Zahidy Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota
Maria Mateo Chavez Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota
Cristian Soto Jacome Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota
Jungwei W Fan Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, Minnesota
Naykky M Singh Ospina Division of Endocrinology, Department of Medicine, University of Florida, Gainesville, Florida
Yonghui Wu Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida
Juan P Brito Knowledge and Evaluation Research Unit, Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Department of Medicine, Mayo Clinic, Rochester, Minnesota; Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Mayo Clinic, Rochester, Minnesota

Collapse

Kanemaru N, Yasaka K, Fujita N, Kanzawa J, Abe O. The Fine-Tuned Large Language Model for Extracting the Progressive Bone Metastasis from Unstructured Radiology Reports. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024:10.1007/s10278-024-01242-3. [PMID: 39187702 DOI: 10.1007/s10278-024-01242-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 08/03/2024] [Accepted: 08/19/2024] [Indexed: 08/28/2024]

Chen JC, Luo C, Li Y, Tan DH. Knowledge domain and emerging trends in the rupture risk of intracranial aneurysms research from 2004 to 2023. World J Clin Cases 2024;12:5382-5403. [PMID: 39156083 PMCID: PMC11238678 DOI: 10.12998/wjcc.v12.i23.5382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2024] [Revised: 06/20/2024] [Accepted: 06/26/2024] [Indexed: 07/05/2024] Open

Abstract

BACKGROUND

Intracranial aneurysms (IAs) pose significant health risks, attributable to their potential for sudden rupture, which can result in severe outcomes such as stroke and death. Despite extensive research, the variability of aneurysm behavior, with some remaining stable for years while others rupture unexpectedly, remains poorly understood.

AIM

To employ bibliometric analysis to map the research landscape concerning risk factors associated with IAs rupture.

METHODS

A systematic literature review of publications from 2004 to 2023 was conducted, analyzing 3804 documents from the Web of Science Core Collection database, with a focus on full-text articles and reviews in English. The analysis encompassed citation and co-citation networks, keyword bursts, and temporal trends to delineate the evolution of research themes and collaboration patterns. Advanced software tools, CiteSpace and VOSviewer, were utilized for comprehensive data visualization and trend analysis.

RESULTS

Analysis uncovered a total of 3804 publications on IA rupture risk factors between 2006 and 2023. Research interest surged after 2013, peaking in 2023. The United States led with 28.97% of publications, garnering 37706 citations. Notable United States-China collaborations were observed. Capital Medical University produced 184 publications, while Utrecht University boasted a citation average of 69.62 per publication. "World Neurosurgery" published the most papers, contrasting with "Stroke", the most cited journal. The PHASES score from "Lancet Neurology" emerged as a vital rupture risk prediction tool. Early research favored endovascular therapy, transitioning to magnetic resonance imaging and flow diverters. "Subarachnoid hemorrhage" stood out as a recurrent keyword.

CONCLUSION

This study assesses global IA research trends and highlights crucial gaps, guiding future investigations to improve preventive and therapeutic approaches.

Collapse

Kanzawa J, Yasaka K, Fujita N, Fujiwara S, Abe O. Automated classification of brain MRI reports using fine-tuned large language models. Neuroradiology 2024:10.1007/s00234-024-03427-7. [PMID: 38995393 DOI: 10.1007/s00234-024-03427-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 07/05/2024] [Indexed: 07/13/2024]

Swaminathan A, Ren AL, Wu JY, Bhargava-Shah A, Lopez I, Srivastava U, Alexopoulos V, Pizzitola R, Bui B, Alkhani L, Lee S, Mohit N, Seo N, Macedo N, Cheng W, Wang W, Tran E, Thomas R, Gevaert O. Extraction of Unstructured Electronic Health Records to Evaluate Glioblastoma Treatment Patterns. JCO Clin Cancer Inform 2024;8:e2300091. [PMID: 38857465 PMCID: PMC11371099 DOI: 10.1200/cci.23.00091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 11/15/2023] [Accepted: 03/12/2024] [Indexed: 06/12/2024] Open

Sastry RA, Setty A, Liu DD, Zheng B, Ali R, Weil RJ, Roye GD, Doberstein CE, Oyelese AA, Niu T, Gokaslan ZL, Telfeian AE. Natural language processing augments comorbidity documentation in neurosurgical inpatient admissions. PLoS One 2024;19:e0303519. [PMID: 38723044 PMCID: PMC11081267 DOI: 10.1371/journal.pone.0303519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 04/04/2024] [Indexed: 05/13/2024] Open

Affiliation(s)

Rahul A. Sastry Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Aayush Setty Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America Department of Computer Science, Brown University, Providence, RI, United States of America
David D. Liu Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Bryan Zheng Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Rohaid Ali Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Robert J. Weil Department of Neurosurgery, Brain & Spine, Southcoast Health, Dartmouth, MA, United States of America
G. Dean Roye Department of Surgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Curtis E. Doberstein Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Adetokunbo A. Oyelese Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Tianyi Niu Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Ziya L. Gokaslan Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America
Albert E. Telfeian Department of Neurosurgery, Warren Alpert Medical School, Rhode Island Hospital, Brown University, Providence, RI, United States of America

Collapse

Tay SB, Low GH, Wong GJE, Tey HJ, Leong FL, Li C, Chua MLK, Tan DSW, Thng CH, Tan IBH, Tan RSYC. Use of Natural Language Processing to Infer Sites of Metastatic Disease From Radiology Reports at Scale. JCO Clin Cancer Inform 2024;8:e2300122. [PMID: 38788166 PMCID: PMC11371090 DOI: 10.1200/cci.23.00122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 03/02/2024] [Accepted: 04/01/2024] [Indexed: 05/26/2024] Open

Abstract

PURPOSE

To evaluate natural language processing (NLP) methods to infer metastatic sites from radiology reports.

METHODS

A set of 4,522 computed tomography (CT) reports of 550 patients with 14 types of cancer was used to fine-tune four clinical large language models (LLMs) for multilabel classification of metastatic sites. We also developed an NLP information extraction (IE) system (on the basis of named entity recognition, assertion status detection, and relation extraction) for comparison. Model performances were measured by F1 scores on test and three external validation sets. The best model was used to facilitate analysis of metastatic frequencies in a cohort study of 6,555 patients with 53,838 CT reports.

RESULTS

The RadBERT, BioBERT, GatorTron-base, and GatorTron-medium LLMs achieved F1 scores of 0.84, 0.87, 0.89, and 0.91, respectively, on the test set. The IE system performed best, achieving an F1 score of 0.93. F1 scores of the IE system by individual cancer type ranged from 0.89 to 0.96. The IE system attained F1 scores of 0.89, 0.83, and 0.81, respectively, on external validation sets including additional cancer types, positron emission tomography-CT ,and magnetic resonance imaging scans, respectively. In our cohort study, we found that for colorectal cancer, liver-only metastases were higher in de novo stage IV versus recurrent patients (29.7% v 12.2%; P < .001). Conversely, lung-only metastases were more frequent in recurrent versus de novo stage IV patients (17.2% v 7.3%; P < .001).

CONCLUSION

We developed an IE system that accurately infers metastatic sites in multiple primary cancers from radiology reports. It has explainable methods and performs better than some clinical LLMs. The inferred metastatic phenotypes could enhance cancer research databases and clinical trial matching, and identify potential patients for oligometastatic interventions.

Collapse

Affiliation(s)

See Boon Tay Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore NUS Yong Loo Lin School of Medicine, Singapore, Singapore
Guat Hwa Low Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore
Gillian Jing En Wong NUS Yong Loo Lin School of Medicine, Singapore, Singapore
Han Jieh Tey Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore
Fun Loon Leong Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore
Constance Li Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore
Melvin Lee Kiang Chua Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore Singapore Duke-NUS Medical School, Singapore, Singapore Division of Radiation Oncology, National Cancer Centre Singapore, Singapore, Singapore
Daniel Shao Weng Tan Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore Singapore Duke-NUS Medical School, Singapore, Singapore Division of Clinical Trials and Epidemiological Sciences, National Cancer Centre Singapore, Singapore, Singapore
Choon Hua Thng Singapore Duke-NUS Medical School, Singapore, Singapore Division of Oncologic Imaging, National Cancer Centre Singapore, Singapore, Singapore
Iain Bee Huat Tan Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore Singapore Duke-NUS Medical School, Singapore, Singapore
Ryan Shea Ying Cong Tan Division of Medical Oncology, National Cancer Centre Singapore, Singapore, Singapore Data and Computational Science Core, National Cancer Centre Singapore, Singapore, Singapore Singapore Duke-NUS Medical School, Singapore, Singapore Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY

Collapse

Xu W, Gu B, Lotter WE, Kehl KL. Extraction and Imputation of Eastern Cooperative Oncology Group Performance Status From Unstructured Oncology Notes Using Language Models. JCO Clin Cancer Inform 2024;8:e2300269. [PMID: 38810206 DOI: 10.1200/cci.23.00269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 02/08/2024] [Accepted: 04/11/2024] [Indexed: 05/31/2024] Open

Altahawi F, Owens A, Caruso CH, Wetzel JR, Strnad GJ, Chiunda AB, Spindler KP, Subhas N. Development and Operationalization of an Automated Workflow for Correlation of Knee MRI and Arthroscopy Findings. J Am Coll Radiol 2024;21:609-616. [PMID: 37302680 DOI: 10.1016/j.jacr.2023.04.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 03/23/2023] [Accepted: 04/06/2023] [Indexed: 06/13/2023]

Kehl KL, Lavery JA, Brown S, Fuchs H, Riely G, Schrag D, Newcomb A, Nichols C, Micheel CM, Bedard PL, Sweeney SM, Fiandalo M, Panageas KS. Biomarker Inference and the Timing of Next-Generation Sequencing in a Multi-Institutional, Cross-Cancer Clinicogenomic Data Set. JCO Precis Oncol 2024;8:e2300489. [PMID: 38484212 PMCID: PMC10954072 DOI: 10.1200/po.23.00489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Revised: 12/03/2023] [Accepted: 01/03/2024] [Indexed: 03/19/2024] Open

Abstract

PURPOSE

Observational clinicogenomic data sets, consisting of tumor next-generation sequencing (NGS) data linked to clinical records, are commonly used for cancer research. However, in real-world practice, oncologists frequently request NGS in search of treatment options for progressive cancer. The extent and impact of this dynamic on analysis of clinicogenomic research data are not well understood.

METHODS

We analyzed clinicogenomic data for patients with non-small cell lung, colorectal, breast, prostate, pancreatic, or urothelial cancers in the American Association for Cancer Research Biopharmaceutical Consortium cohort. Associations between baseline and time-varying clinical characteristics and time from diagnosis to NGS were measured. To explore the impact of informative cohort entry on biomarker inference, statistical interactions between selected biomarkers and time to NGS with respect to overall survival were calculated.

RESULTS

Among 7,182 patients, time from diagnosis to NGS varied significantly by clinical factors, including cancer type, calendar year of sequencing, institution, and age and stage at diagnosis. NGS rates also varied significantly by dynamic clinical status variables; in an adjusted model, compared with patients with stable disease at any given time after diagnosis, patients with progressive disease by imaging or oncologist assessment had higher NGS rates (hazard ratio for NGS, 1.61 [95% CI, 1.45 to 1.78] and 2.32 [95% CI, 2.01 to 2.67], respectively). Statistical interactions between selected biomarkers and time to NGS with respect to survival, potentially indicating biased biomarker inference results, were explored.

CONCLUSION

To evaluate the appropriateness of a data set for a particular research question, it is crucial to measure associations between dynamic cancer status and the timing of NGS, as well as to evaluate interactions involving biomarkers of interest and NGS timing with respect to survival outcomes.

Collapse

Si S, Shou L, Gao Q, Qin W, Zhao D. Worldwide productivity and research trend of publications concerning intestinal polyps: A bibliometric study. Medicine (Baltimore) 2024;103:e36507. [PMID: 38215143 PMCID: PMC10783372 DOI: 10.1097/md.0000000000036507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Accepted: 11/16/2023] [Indexed: 01/14/2024] Open

Abstract

There is a significant relationship between intestinal polyps and colorectal cancer, and in recent years, research on intestinal polyps has been rapidly developing around the world. However, there is still a lack of adequate quantification and analysis of publications in this field. The aim of this study was to perform a comprehensive bibliometric analysis of publications related to intestinal polyps over the past 20 years. To enhance the understanding of current research hotspots and potential trends, and to point out the direction of future research. Publications related to intestinal polyps were retrieved from the Science Citation Index Expanded in Web of Science Core Collection. the Bibliometric online analysis platform (https://bibliometric.com/app), the Bibliometrix Package, and the CiteSpace are used for bibliometric analysis and visualization, including the overall range of annual output and annual citations, country-region analysis, author and institution analysis, core journal analysis, reference and keyword analysis. Prior to 2017, the amount of research on intestinal polyps was slow to grow, but it picked up speed after that year. In 1019 journals, 4280 papers on intestinal polyps were published in English. The journal with the highest productivity was Gastrointestinal Endoscopy (189, 4.42%). United States (1124, 26.26%), which is also the hub of collaboration in this subject, was the most productive nation. Mayo Clinic (n = 70, 1.64%) is the most productive institution. Intestinal microbiota, endoscopic mucosal resection, gut microbiota, deep learning, tea polyphenol, insulin resistance and artificial intelligence were current hot subjects in the field. Studies of intestinal polyps increased significantly after 2017. The United States contributed the largest number of publications. Countries and institutions were actively cooperating with one another. artificial intelligence is currently an emerging topic.

Collapse

Singh H, Keller RB, Kapner KS, Dilly J, Raghavan S, Yuan C, Cohen EF, Tolstorukov M, Andrews E, Brais LK, Da Silva A, Perez K, Rubinson DA, Surana R, Giannakis M, Ng K, Clancy TE, Yurgelun MB, Schletchter B, Clark JW, Shapiro GI, Rosenthal MH, Hornick JL, Nardi V, Li YY, Gupta H, Cherniack AD, Meyerson M, Cleary JM, Nowak JA, Wolpin BM, Aguirre AJ. Oncogenic Drivers and Therapeutic Vulnerabilities in KRAS Wild-Type Pancreatic Cancer. Clin Cancer Res 2023;29:4627-4643. [PMID: 37463056 PMCID: PMC10795103 DOI: 10.1158/1078-0432.ccr-22-3930] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 05/17/2023] [Accepted: 07/14/2023] [Indexed: 07/20/2023]

Affiliation(s)

Harshabad Singh Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Rachel B. Keller Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
Kevin S. Kapner Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
Julien Dilly Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Biological and biomedical sciences program, Harvard Medical School, Boston, MA The Broad Institute of Harvard and MIT, Cambridge, MA
Srivatsan Raghavan Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA The Broad Institute of Harvard and MIT, Cambridge, MA
Chen Yuan Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
Elizabeth F. Cohen Department of Informatics and Analytics, Dana-Farber Cancer Institute, Boston, MA
Michael Tolstorukov Department of Informatics and Analytics, Dana-Farber Cancer Institute, Boston, MA
Elizabeth Andrews Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
Lauren K. Brais Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
Annacarolina Da Silva Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Pathology, Weill Cornell Medical College, New York, NY
Kimberly Perez Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Douglas A. Rubinson Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Rishi Surana Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Marios Giannakis Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Kimmie Ng Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Thomas E. Clancy Harvard Medical School, Boston, MA Division of Surgical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Surgery, Brigham and Women’s Hospital, Boston, MA
Matthew B. Yurgelun Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Benjamin Schletchter Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Jeffrey W. Clark Harvard Medical School, Boston, MA Massachusetts General Hospital Cancer Center, Boston, MA
Geoffrey I. Shapiro Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Michael H. Rosenthal Department of Radiology, Dana-Farber Cancer Institute, Boston, MA Department of Radiology, Brigham and Women’s Hospital, Boston, MA
Jason L. Hornick Department of Pathology, Brigham and Women’s Hospital, Boston, MA
Valentina Nardi Department of Pathology, Massachusetts General Hospital, Boston, MA
Yvonne Y. Li Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Harvard Medical School, Boston, MA The Broad Institute of Harvard and MIT, Cambridge, MA
Hersh Gupta Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Harvard Medical School, Boston, MA The Broad Institute of Harvard and MIT, Cambridge, MA
Andrew D. Cherniack Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Harvard Medical School, Boston, MA The Broad Institute of Harvard and MIT, Cambridge, MA
Matthew Meyerson Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Harvard Medical School, Boston, MA The Broad Institute of Harvard and MIT, Cambridge, MA
James M. Cleary Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Jonathan A. Nowak Harvard Medical School, Boston, MA Department of Radiology, Brigham and Women’s Hospital, Boston, MA
Brian M. Wolpin Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA
Andrew J. Aguirre Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA Department of Medicine, Brigham and Women’s Hospital, Boston, MA Harvard Medical School, Boston, MA The Broad Institute of Harvard and MIT, Cambridge, MA

Collapse

Jiang Y, Wang C, Zhou S. Artificial intelligence-based risk stratification, accurate diagnosis and treatment prediction in gynecologic oncology. Semin Cancer Biol 2023;96:82-99. [PMID: 37783319 DOI: 10.1016/j.semcancer.2023.09.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 08/27/2023] [Accepted: 09/25/2023] [Indexed: 10/04/2023]

Amorrortu R, Garcia M, Zhao Y, El Naqa I, Balagurunathan Y, Chen DT, Thieu T, Schabath MB, Rollison DE. Overview of approaches to estimate real-world disease progression in lung cancer. JNCI Cancer Spectr 2023;7:pkad074. [PMID: 37738580 PMCID: PMC10637832 DOI: 10.1093/jncics/pkad074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 08/28/2023] [Accepted: 09/18/2023] [Indexed: 09/24/2023] Open

Abstract

BACKGROUND

Randomized clinical trials of novel treatments for solid tumors normally measure disease progression using the Response Evaluation Criteria in Solid Tumors. However, novel, scalable approaches to estimate disease progression using real-world data are needed to advance cancer outcomes research. The purpose of this narrative review is to summarize examples from the existing literature on approaches to estimate real-world disease progression and their relative strengths and limitations, using lung cancer as a case study.

METHODS

A narrative literature review was conducted in PubMed to identify articles that used approaches to estimate real-world disease progression in lung cancer patients. Data abstracted included data source, approach used to estimate real-world progression, and comparison to a selected gold standard (if applicable).

RESULTS

A total of 40 articles were identified from 2008 to 2022. Five approaches to estimate real-world disease progression were identified including manual abstraction of medical records, natural language processing of clinical notes and/or radiology reports, treatment-based algorithms, changes in tumor volume, and delta radiomics-based approaches. The accuracy of these progression approaches were assessed using different methods, including correlations between real-world endpoints and overall survival for manual abstraction (Spearman rank ρ = 0.61-0.84) and area under the curve for natural language processing approaches (area under the curve = 0.86-0.96).

CONCLUSIONS

Real-world disease progression has been measured in several observational studies of lung cancer. However, comparing the accuracy of methods across studies is challenging, in part, because of the lack of a gold standard and the different methods used to evaluate accuracy. Concerted efforts are needed to define a gold standard and quality metrics for real-world data.

Collapse

Shi Y, Zhang C, Pan S, Chen Y, Miao X, He G, Wu Y, Ye H, Weng C, Zhang H, Zhou W, Yang X, Liang C, Chen D, Hong L, Su F. The diagnosis of tuberculous meningitis: advancements in new technologies and machine learning algorithms. Front Microbiol 2023;14:1290746. [PMID: 37942080 PMCID: PMC10628659 DOI: 10.3389/fmicb.2023.1290746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 10/09/2023] [Indexed: 11/10/2023] Open

Affiliation(s)

Yi Shi Department of Infectious Diseases, Wenzhou Central Hospital, Wenzhou, China The First School of Medicine, Wenzhou Medical University, Wenzhou, China
Chengxi Zhang School of Materials Science and Engineering, Shandong Jianzhu University, Jinan, China
Shuo Pan The First School of Medicine, Wenzhou Medical University, Wenzhou, China
Yi Chen The First School of Medicine, Wenzhou Medical University, Wenzhou, China
Xingguo Miao Department of Infectious Diseases, Wenzhou Central Hospital, Wenzhou, China Department of Infectious Diseases, Wenzhou Sixth People’s Hospital, Wenzhou, China Wenzhou Key Laboratory of Diagnosis and Treatment of Emerging and Recurrent Infectious Diseases, Wenzhou, China
Guoqiang He Postgraduate Training Base Alliance of Wenzhou Medical University, Wenzhou, China Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, China
Yanchan Wu School of Electrical and Information Engineering, Quzhou University, Quzhou, China
Hui Ye Department of Infectious Diseases, Wenzhou Central Hospital, Wenzhou, China Department of Infectious Diseases, Wenzhou Sixth People’s Hospital, Wenzhou, China Wenzhou Key Laboratory of Diagnosis and Treatment of Emerging and Recurrent Infectious Diseases, Wenzhou, China
Chujun Weng The Fourth Affiliated Hospital Zhejiang University School of Medicine, Yiwu, China
Huanhuan Zhang School and Hospital of Stomatology, Wenzhou Medical University, Wenzhou, China
Wenya Zhou School and Hospital of Stomatology, Wenzhou Medical University, Wenzhou, China
Xiaojie Yang Wenzhou Medical University Renji College, Wenzhou, China
Chenglong Liang The First School of Medicine, Wenzhou Medical University, Wenzhou, China
Dong Chen Wenzhou Key Laboratory of Diagnosis and Treatment of Emerging and Recurrent Infectious Diseases, Wenzhou, China Wenzhou Central Blood Station, Wenzhou, China
Liang Hong Department of Infectious Diseases, The Third Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
Feifei Su Department of Infectious Diseases, Wenzhou Central Hospital, Wenzhou, China Department of Infectious Diseases, Wenzhou Sixth People’s Hospital, Wenzhou, China Wenzhou Key Laboratory of Diagnosis and Treatment of Emerging and Recurrent Infectious Diseases, Wenzhou, China

Collapse

Elbatarny L, Do RKG, Gangai N, Ahmed F, Chhabra S, Simpson AL. Applying Natural Language Processing to Single-Report Prediction of Metastatic Disease Response Using the OR-RADS Lexicon. Cancers (Basel) 2023;15:4909. [PMID: 37894276 PMCID: PMC10605614 DOI: 10.3390/cancers15204909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 09/25/2023] [Accepted: 09/26/2023] [Indexed: 10/29/2023] Open

Tan RSYC, Lin Q, Low GH, Lin R, Goh TC, Chang CCE, Lee FF, Chan WY, Tan WC, Tey HJ, Leong FL, Tan HQ, Nei WL, Chay WY, Tai DWM, Lai GGY, Cheng LTE, Wong FY, Chua MCH, Chua MLK, Tan DSW, Thng CH, Tan IBH, Ng HT. Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting. J Am Med Inform Assoc 2023;30:1657-1664. [PMID: 37451682 PMCID: PMC10531105 DOI: 10.1093/jamia/ocad133] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2023] [Revised: 06/27/2023] [Accepted: 07/04/2023] [Indexed: 07/18/2023] Open

Affiliation(s)

Ryan Shea Ying Cong Tan Division of Medical Oncology, National Cancer Centre Singapore, Singapore Duke-NUS Medical School, Singapore
Qian Lin Department of Computer Science, National University of Singapore, Singapore
Guat Hwa Low Division of Medical Oncology, National Cancer Centre Singapore, Singapore
Ruixi Lin Department of Computer Science, National University of Singapore, Singapore
Tzer Chew Goh Institute of Systems Science, National University of Singapore, Singapore
Christopher Chu En Chang Institute of Systems Science, National University of Singapore, Singapore
Fung Fung Lee Institute of Systems Science, National University of Singapore, Singapore
Wei Yin Chan Institute of Systems Science, National University of Singapore, Singapore
Wei Chong Tan Division of Medical Oncology, National Cancer Centre Singapore, Singapore Duke-NUS Medical School, Singapore
Han Jieh Tey Division of Medical Oncology, National Cancer Centre Singapore, Singapore
Fun Loon Leong Division of Medical Oncology, National Cancer Centre Singapore, Singapore
Hong Qi Tan Division of Radiation Oncology, National Cancer Centre Singapore, Singapore
Wen Long Nei Division of Radiation Oncology, National Cancer Centre Singapore, Singapore
Wen Yee Chay Division of Medical Oncology, National Cancer Centre Singapore, Singapore Duke-NUS Medical School, Singapore
David Wai Meng Tai Division of Medical Oncology, National Cancer Centre Singapore, Singapore Duke-NUS Medical School, Singapore
Gillianne Geet Yi Lai Division of Medical Oncology, National Cancer Centre Singapore, Singapore Duke-NUS Medical School, Singapore
Lionel Tim-Ee Cheng Duke-NUS Medical School, Singapore Department of Diagnostic Radiology, Singapore General Hospital, Singapore
Fuh Yong Wong Division of Radiation Oncology, National Cancer Centre Singapore, Singapore
Matthew Chin Heng Chua Yong Loo Lin School of Medicine, National University of Singapore, Singapore
Melvin Lee Kiang Chua Duke-NUS Medical School, Singapore Division of Radiation Oncology, National Cancer Centre Singapore, Singapore Data and Computational Science Core, National Cancer Centre Singapore, Singapore
Daniel Shao Weng Tan Division of Medical Oncology, National Cancer Centre Singapore, Singapore Division of Clinical Trials and Epidemiological Sciences, National Cancer Centre Singapore, Singapore
Choon Hua Thng Duke-NUS Medical School, Singapore Division of Oncologic Imaging, National Cancer Centre Singapore, Singapore
Iain Bee Huat Tan Division of Medical Oncology, National Cancer Centre Singapore, Singapore Duke-NUS Medical School, Singapore Data and Computational Science Core, National Cancer Centre Singapore, Singapore
Hwee Tou Ng Department of Computer Science, National University of Singapore, Singapore

Collapse

Elmarakeby HA, Trukhanov PS, Arroyo VM, Riaz IB, Schrag D, Van Allen EM, Kehl KL. Empirical evaluation of language modeling to ascertain cancer outcomes from clinical text reports. BMC Bioinformatics 2023;24:328. [PMID: 37658330 PMCID: PMC10474750 DOI: 10.1186/s12859-023-05439-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 08/07/2023] [Indexed: 09/03/2023] Open

Abstract

BACKGROUND

Longitudinal data on key cancer outcomes for clinical research, such as response to treatment and disease progression, are not captured in standard cancer registry reporting. Manual extraction of such outcomes from unstructured electronic health records is a slow, resource-intensive process. Natural language processing (NLP) methods can accelerate outcome annotation, but they require substantial labeled data. Transfer learning based on language modeling, particularly using the Transformer architecture, has achieved improvements in NLP performance. However, there has been no systematic evaluation of NLP model training strategies on the extraction of cancer outcomes from unstructured text.

RESULTS

We evaluated the performance of nine NLP models at the two tasks of identifying cancer response and cancer progression within imaging reports at a single academic center among patients with non-small cell lung cancer. We trained the classification models under different conditions, including training sample size, classification architecture, and language model pre-training. The training involved a labeled dataset of 14,218 imaging reports for 1112 patients with lung cancer. A subset of models was based on a pre-trained language model, DFCI-ImagingBERT, created by further pre-training a BERT-based model using an unlabeled dataset of 662,579 reports from 27,483 patients with cancer from our center. A classifier based on our DFCI-ImagingBERT, trained on more than 200 patients, achieved the best results in most experiments; however, these results were marginally better than simpler "bag of words" or convolutional neural network models.

CONCLUSION

When developing AI models to extract outcomes from imaging reports for clinical cancer research, if computational resources are plentiful but labeled training data are limited, large language models can be used for zero- or few-shot learning to achieve reasonable performance. When computational resources are more limited but labeled training data are readily available, even simple machine learning architectures can achieve good performance for such tasks.

Collapse

Yang E, Li MD, Raghavan S, Deng F, Lang M, Succi MD, Huang AJ, Kalpathy-Cramer J. Transformer versus traditional natural language processing: how much data is enough for automated radiology report classification? Br J Radiol 2023;96:20220769. [PMID: 37162253 PMCID: PMC10461267 DOI: 10.1259/bjr.20220769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 04/21/2023] [Accepted: 04/26/2023] [Indexed: 05/11/2023] Open

Abstract

OBJECTIVES

Current state-of-the-art natural language processing (NLP) techniques use transformer deep-learning architectures, which depend on large training datasets. We hypothesized that traditional NLP techniques may outperform transformers for smaller radiology report datasets.

METHODS

We compared the performance of BioBERT, a deep-learning-based transformer model pre-trained on biomedical text, and three traditional machine-learning models (gradient boosted tree, random forest, and logistic regression) on seven classification tasks given free-text radiology reports. Tasks included detection of appendicitis, diverticulitis, bowel obstruction, and enteritis/colitis on abdomen/pelvis CT reports, ischemic infarct on brain CT/MRI reports, and medial and lateral meniscus tears on knee MRI reports (7,204 total annotated reports). The performance of NLP models on held-out test sets was compared after training using the full training set, and 2.5%, 10%, 25%, 50%, and 75% random subsets of the training data.

RESULTS

In all tested classification tasks, BioBERT performed poorly at smaller training sample sizes compared to non-deep-learning NLP models. Specifically, BioBERT required training on approximately 1,000 reports to perform similarly or better than non-deep-learning models. At around 1,250 to 1,500 training samples, the testing performance for all models began to plateau, where additional training data yielded minimal performance gain.

CONCLUSIONS

With larger sample sizes, transformer NLP models achieved superior performance in radiology report binary classification tasks. However, with smaller sizes (<1000) and more imbalanced training data, traditional NLP techniques performed better.

ADVANCES IN KNOWLEDGE

Our benchmarks can help guide clinical NLP researchers in selecting machine-learning models according to their dataset characteristics.

Collapse

Bitterman DS, Gensheimer MF, Jaffray D, Pryma DA, Jiang SB, Morin O, Ginart JB, Upadhaya T, Vallis KA, Buatti JM, Deasy J, Hsiao HT, Chung C, Fuller CD, Greenspan E, Cloyd-Warwick K, Courdy S, Mao A, Barnholtz-Sloan J, Topaloglu U, Hands I, Maurer I, Terry M, Curran WJ, Le QT, Nadaf S, Kibbe W. Cancer Informatics for Cancer Centers: Sharing Ideas on How to Build an Artificial Intelligence-Ready Informatics Ecosystem for Radiation Oncology. JCO Clin Cancer Inform 2023;7:e2300136. [PMID: 38055914 PMCID: PMC10703125 DOI: 10.1200/cci.23.00136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 08/15/2023] [Accepted: 10/16/2023] [Indexed: 12/08/2023] Open

Affiliation(s)

Danielle S. Bitterman Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA
Michael F. Gensheimer Department of Radiation Oncology, Stanford University School of Medicine, Stanford, CA
David Jaffray Department of Radiation Physics, M.D. Anderson Cancer Center, Houston, TX
Daniel A. Pryma Abramson Cancer Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
Steve B. Jiang Medical Artificial Intelligence and Automation Laboratory and Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, TX
Olivier Morin Department of Radiation Oncology, MEDomics Laboratory, University of California San Francisco, San Francisco, CA
Jorge Barrios Ginart Department of Radiation Oncology, MEDomics Laboratory, University of California San Francisco, San Francisco, CA
Taman Upadhaya Department of Radiation Oncology, MEDomics Laboratory, University of California San Francisco, San Francisco, CA
Katherine A. Vallis Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA
John M. Buatti Department of Oncology, University of Oxford, Oxford, United Kingdom
Joseph Deasy Department of Radiation Oncology, University of Iowa Carver College of Medicine, Iowa City, IA
H. Timothy Hsiao Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, NY
Caroline Chung Department of Scientific Affairs, American Society for Radiation Oncology, Arlington, VA
Clifton D. Fuller Department of Scientific Affairs, American Society for Radiation Oncology, Arlington, VA
Emily Greenspan Department of Radiation Oncology, M.D. Anderson Cancer Center, Houston, TX
Kristy Cloyd-Warwick Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD
Samir Courdy DNAnexus, Mountain View, CA
Allen Mao DNAnexus, Mountain View, CA
Jill Barnholtz-Sloan Department of Radiation Oncology, M.D. Anderson Cancer Center, Houston, TX Center for Informatics, Digital Vertical, City of Hope National Comprehensive Cancer Center, Los Angeles, CA
Umit Topaloglu Department of Radiation Oncology, M.D. Anderson Cancer Center, Houston, TX
Isaac Hands Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD Cancer Research Informatics Shared Resource Facility, University of Kentucky Markey Cancer Center, Lexington, NY
Ian Maurer Kentucky Cancer Registry, Lexington, NY
May Terry GenomOncology, Cleveland, OH
Walter J. Curran . MITRE Corporation, Bedford, MA GenesisCare, Fort Myers, FL
Quynh-Thu Le Department of Radiation Oncology, Stanford University School of Medicine, Stanford, CA
Sorena Nadaf Department of Radiation Oncology, Emory University, Atlanta, GA
Warren Kibbe Cancer Center Informatics Society, Los Angeles, CA

Collapse

Amin K, Khosla P, Doshi R, Chheang S, Forman HP. Artificial Intelligence to Improve Patient Understanding of Radiology Reports. THE YALE JOURNAL OF BIOLOGY AND MEDICINE 2023;96:407-417. [PMID: 37780992 PMCID: PMC10524809 DOI: 10.59249/nkoy5498] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]

Doi K, Takegawa H, Yui M, Anetai Y, Koike Y, Nakamura S, Tanigawa N, Koziumi M, Nishio T. Deep learning-based detection of patients with bone metastasis from Japanese radiology reports. Jpn J Radiol 2023;41:900-908. [PMID: 36988827 DOI: 10.1007/s11604-023-01413-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 03/07/2023] [Indexed: 03/30/2023]

Affiliation(s)

Kentaro Doi Department of Medical Physics and Engineering, Osaka University Graduate School of Medicine, 1-7 Yamadaoka, Suita-Shi, Osaka, Japan Department of Radiology, Kansai Medical University Graduate School of Medicine, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan
Hideki Takegawa Department of Medical Physics and Engineering, Osaka University Graduate School of Medicine, 1-7 Yamadaoka, Suita-Shi, Osaka, Japan. Department of Radiology, Kansai Medical University Graduate School of Medicine, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan. Department of Radiation Oncology, Kansai Medical University Hospital, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan.
Midori Yui Department of Radiology, Kansai Medical University Graduate School of Medicine, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan Department of Radiation Oncology, Kansai Medical University Hospital, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan
Yusuke Anetai Department of Radiology, Kansai Medical University Graduate School of Medicine, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan Department of Radiation Oncology, Kansai Medical University Hospital, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan
Yuhei Koike Department of Radiology, Kansai Medical University Graduate School of Medicine, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan Department of Radiation Oncology, Kansai Medical University Hospital, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan
Satoaki Nakamura Department of Radiology, Kansai Medical University Graduate School of Medicine, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan Department of Radiation Oncology, Kansai Medical University Hospital, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan
Noboru Tanigawa Department of Radiology, Kansai Medical University Graduate School of Medicine, 2-5-1 Shinmachi, Hirakata-Shi, Osaka, Japan
Masahiko Koziumi Department of Medical Physics and Engineering, Osaka University Graduate School of Medicine, 1-7 Yamadaoka, Suita-Shi, Osaka, Japan
Teiji Nishio Department of Medical Physics and Engineering, Osaka University Graduate School of Medicine, 1-7 Yamadaoka, Suita-Shi, Osaka, Japan

Collapse

Thummalapalli R, Bernstein E, Herzberg B, Li BT, Iqbal A, Preeshagul I, Santini FC, Eng J, Ladanyi M, Yang SR, Shen R, Lito P, Riely GJ, Sabari JK, Arbour KC. Clinical and Genomic Features of Response and Toxicity to Sotorasib in a Real-World Cohort of Patients With Advanced KRAS G12C-Mutant Non-Small Cell Lung Cancer. JCO Precis Oncol 2023;7:e2300030. [PMID: 37384866 PMCID: PMC10581626 DOI: 10.1200/po.23.00030] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 04/03/2023] [Accepted: 05/23/2023] [Indexed: 07/01/2023] Open

Abstract

PURPOSE

With the recent approval of the KRAS G12C inhibitor sotorasib for patients with advanced KRAS G12C-mutant non-small cell lung cancer (NSCLC), there is a new need to identify factors associated with activity and toxicity among patients treated in routine practice.

MATERIALS AND METHODS

We conducted a multicenter retrospective study of patients treated with sotorasib outside of clinical trials to identify factors associated with real-world progression free survival (rwPFS), overall survival (OS), and toxicity.

RESULTS

Among 105 patients with advanced KRAS G12C-mutant NSCLC treated with sotorasib, treatment led to a 5.3-month median rwPFS, 12.6-month median OS, and 28% real-world response rate. KEAP1 comutations were associated with shorter rwPFS and OS (rwPFS hazard ratio [HR], 3.19; P = .004; OS HR, 4.10; P = .003); no significant differences in rwPFS or OS were observed across TP53 (rwPFS HR, 1.10; P = .731; OS HR, 1.19; P = .631) or STK11 (rwPFS HR, 1.66; P = .098; OS HR, 1.73; P = .168) comutation status. Notably, almost all patients who developed grade 3 or higher treatment-related adverse events (G3+ TRAEs) had previously been treated with anti-PD-(L)1 therapy. Among these patients, anti-PD-(L)1 therapy exposure within 12 weeks of sotorasib was strongly associated with G3+ TRAEs (P < .001) and TRAE-related sotorasib discontinuation (P = .014). Twenty-eight percent of patients with recent anti-PD-(L)1 therapy exposure experienced G3+ TRAEs, most commonly hepatotoxicity.

CONCLUSION

Among patients treated with sotorasib in routine practice, KEAP1 comutations were associated with resistance and recent anti-PD-(L)1 therapy exposure was associated with toxicity. These observations may help guide use of sotorasib in the clinic and may help inform the next generation of KRAS G12C-targeted clinical trials.

Collapse

Khan MS, Usman MS, Talha KM, Van Spall HGC, Greene SJ, Vaduganathan M, Khan SS, Mills NL, Ali ZA, Mentz RJ, Fonarow GC, Rao SV, Spertus JA, Roe MT, Anker SD, James SK, Butler J, McGuire DK. Leveraging electronic health records to streamline the conduct of cardiovascular clinical trials. Eur Heart J 2023;44:1890-1909. [PMID: 37098746 DOI: 10.1093/eurheartj/ehad171] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 02/05/2023] [Accepted: 03/07/2023] [Indexed: 04/27/2023] Open

Abstract

Conventional randomized controlled trials (RCTs) can be expensive, time intensive, and complex to conduct. Trial recruitment, participation, and data collection can burden participants and research personnel. In the past two decades, there have been rapid technological advances and an exponential growth in digitized healthcare data. Embedding RCTs, including cardiovascular outcome trials, into electronic health record systems or registries may streamline screening, consent, randomization, follow-up visits, and outcome adjudication. Moreover, wearable sensors (i.e. health and fitness trackers) provide an opportunity to collect data on cardiovascular health and risk factors in unprecedented detail and scale, while growing internet connectivity supports the collection of patient-reported outcomes. There is a pressing need to develop robust mechanisms that facilitate data capture from diverse databases and guidance to standardize data definitions. Importantly, the data collection infrastructure should be reusable to support multiple cardiovascular RCTs over time. Systems, processes, and policies will need to have sufficient flexibility to allow interoperability between different sources of data acquisition. Clinical research guidelines, ethics oversight, and regulatory requirements also need to evolve. This review highlights recent progress towards the use of routinely generated data to conduct RCTs and discusses potential solutions for ongoing barriers. There is a particular focus on methods to utilize routinely generated data for trials while complying with regional data protection laws. The discussion is supported with examples of cardiovascular outcome trials that have successfully leveraged the electronic health record, web-enabled devices or administrative databases to conduct randomized trials.

Collapse

Affiliation(s)

Muhammad Shahzeb Khan Division of Cardiology, Duke University School of Medicine, 2301 Erwin Rd., Durham, NC 27705, USA
Muhammad Shariq Usman Department of Medicine, University of Mississippi Medical Center, 2500 N State St, Jackson, MS 39216, USA
Khawaja M Talha Department of Medicine, University of Mississippi Medical Center, 2500 N State St, Jackson, MS 39216, USA
Harriette G C Van Spall Department of Medicine, McMaster University, Hamilton, ON, Canada Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada Population Health Research Institute, Hamilton, ON, Canada
Stephen J Greene Division of Cardiology, Duke University School of Medicine, 2301 Erwin Rd., Durham, NC 27705, USA Duke Clinical Research Institute, Durham, NC, USA
Muthiah Vaduganathan Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
Sadiya S Khan Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Nicholas L Mills BHF Centre for Cardiovascular Science, University of Edinburgh, Chancellors Building, Royal Infirmary of Edinburgh, Edinburgh, Scotland, UK Usher Institute, University of Edinburgh, Edinburgh, Scotland, UK
Ziad A Ali DeMatteis Cardiovascular Institute, St Francis Hospital and Heart Center, Roslyn, NY, USA
Robert J Mentz Division of Cardiology, Duke University School of Medicine, 2301 Erwin Rd., Durham, NC 27705, USA Duke Clinical Research Institute, Durham, NC, USA
Gregg C Fonarow Division of Cardiology, University of California Los Angeles, Los Angeles, CA, USA
Sunil V Rao Division of Cardiology, New York University Langone Health System, New York, NY, USA
John A Spertus Department of Cardiology, Saint Luke's Mid America Heart Institute, Kansas City, MO, USA Kansas City's Healthcare Institute for Innovations in Quality, University of Missouri, Kansas, MO, USA
Matthew T Roe Division of Cardiology, Duke University School of Medicine, 2301 Erwin Rd., Durham, NC 27705, USA Duke Clinical Research Institute, Durham, NC, USA
Stefan D Anker Department of Cardiology (CVK), Berlin Institute of Health Center for Regenerative Therapies (BCRT), and German Centre for Cardiovascular Research (DZHK) Partner Site Berlin, Charité Universitätsmedizin, Berlin, Germany
Stefan K James Department of Medical Sciences, Scientific Director UCR, Uppsala University, Uppsala, Uppland, Sweden
Javed Butler Department of Medicine, University of Mississippi Medical Center, 2500 N State St, Jackson, MS 39216, USA Baylor Scott & White Research Institute, Dallas, TX, USA
Darren K McGuire Division of Cardiology, Department of Internal Medicine, UT Southwestern Medical Center and Parkland Health and Hospital System, Dallas, TX, USA

Collapse

Preston S, Wei M, Rao R, Tinn R, Usuyama N, Lucas M, Gu Y, Weerasinghe R, Lee S, Piening B, Tittel P, Valluri N, Naumann T, Bifulco C, Poon H. Toward structuring real-world data: Deep learning for extracting oncology information from clinical text with patient-level supervision. PATTERNS (NEW YORK, N.Y.) 2023;4:100726. [PMID: 37123439 PMCID: PMC10140604 DOI: 10.1016/j.patter.2023.100726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 11/11/2022] [Accepted: 03/14/2023] [Indexed: 05/02/2023]

Araki K, Matsumoto N, Togo K, Yonemoto N, Ohki E, Xu L, Hasegawa Y, Satoh D, Takemoto R, Miyazaki T. Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records. Adv Ther 2023;40:934-950. [PMID: 36547809 PMCID: PMC9988800 DOI: 10.1007/s12325-022-02397-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 12/01/2022] [Indexed: 12/24/2022]

Araki K, Matsumoto N, Togo K, Yonemoto N, Ohki E, Xu L, Hasegawa Y, Inoue H, Yamashita S, Miyazaki T. Real-world treatment response in Japanese patients with cancer using unstructured data from electronic health records. HEALTH AND TECHNOLOGY 2023. [DOI: 10.1007/s12553-023-00739-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]

Abstract Abstract Purpose We generated methods for evaluating clinical outcomes including treatment response in oncology using the unstructured data from electronic health records (EHR) in Japanese language. Methods This retrospective analysis used medical record database and administrative data of University of Miyazaki Hospital in Japan of patients with lung/breast cancer. Treatment response (objective response [OR], stable disease [SD] or progressive disease [PD]) was adjudicated by two evaluators using clinicians’ progress notes, radiology reports and pathological reports of 15 patients with lung cancer (training data set). For assessing key terms to describe treatment response, natural language processing (NLP) rules were created from the texts identified by the evaluators and broken down by morphological analysis. The NLP rules were applied for assessing data of other 70 lung cancer and 30 breast cancer patients, who were not adjudicated, to examine if any difference in using key terms exist between these patients. Results A total of 2,039 records in progress notes, 131 in radiology reports and 60 in pathological reports of 15 patients, were adjudicated. Progress notes were the most common primary source data for treatment assessment (60.7%), wherein, the most common key terms with high sensitivity and specificity to describe OR were “reduction/shrink”, for SD were “(no) remarkable change/(no) aggravation)” and for PD were “(limited) effect” and “enlargement/grow”. These key terms were also found in other larger cohorts of 70 patients with lung cancer and 30 patients with breast cancer. Conclusion This study demonstrated that assessing response to anticancer therapy using Japanese EHRs is feasible by interpreting progress notes, radiology reports and Japanese key terms using NLP. Collapse

IKAR: An Interdisciplinary Knowledge-Based Automatic Retrieval Method from Chinese Electronic Medical Record. INFORMATION 2023. [DOI: 10.3390/info14010049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

Amiri P, Montazeri M, Ghasemian F, Asadi F, Niksaz S, Sarafzadeh F, Khajouei R. Prediction of mortality risk and duration of hospitalization of COVID-19 patients with chronic comorbidities based on machine learning algorithms. Digit Health 2023;9:20552076231170493. [PMID: 37312960 PMCID: PMC10259141 DOI: 10.1177/20552076231170493] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 03/31/2023] [Indexed: 06/15/2023] Open

Abstract

Background

The severity of coronavirus (COVID-19) in patients with chronic comorbidities is much higher than in other patients, which can lead to their death. Machine learning (ML) algorithms as a potential solution for rapid and early clinical evaluation of the severity of the disease can help in allocating and prioritizing resources to reduce mortality.

Objective

The objective of this study was to predict the mortality risk and length of stay (LoS) of patients with COVID-19 and history of chronic comorbidities using ML algorithms.

Methods

This retrospective study was conducted by reviewing the medical records of COVID-19 patients with a history of chronic comorbidities from March 2020 to January 2021 in Afzalipour Hospital in Kerman, Iran. The outcome of patients, hospitalization was recorded as discharge or death. The filtering technique used to score the features and well-known ML algorithms were applied to predict the risk of mortality and LoS of patients. Ensemble Learning methods is also used. To evaluate the performance of the models, different measures including F1, precision, recall, and accuracy were calculated. The TRIPOD guideline assessed transparent reporting.

Results

This study was performed on 1291 patients, including 900 alive and 391 dead patients. Shortness of breath (53.6%), fever (30.1%), and cough (25.3%) were the three most common symptoms in patients. Diabetes mellitus(DM) (31.3%), hypertension (HTN) (27.3%), and ischemic heart disease (IHD) (14.2%) were the three most common chronic comorbidities of patients. Twenty-six important factors were extracted from each patient's record. Gradient boosting model with 84.15% accuracy was the best model for predicting mortality risk and multilayer perceptron (MLP) with rectified linear unit function (MSE = 38.96) was the best model for predicting the LoS. The most common chronic comorbidities among these patients were DM (31.3%), HTN (27.3%), and IHD (14.2%). The most important factors in predicting the risk of mortality were hyperlipidemia, diabetes, asthma, and cancer, and in predicting LoS was shortness of breath.

Conclusion

The results of this study showed that the use of ML algorithms can be a good tool to predict the risk of mortality and LoS of patients with COVID-19 and chronic comorbidities based on physiological conditions, symptoms, and demographic information of patients. The Gradient boosting and MLP algorithms can quickly identify patients at risk of death or long-term hospitalization and notify physicians to do appropriate interventions.

Collapse

Keller RB, Mazor T, Sholl L, Aguirre AJ, Singh H, Sethi N, Bass A, Nagaraja AK, Brais LK, Hill E, Hennessey C, Cusick M, Del Vecchio Fitz C, Zwiesler Z, Siegel E, Ovalle A, Trukhanov P, Hansel J, Shapiro GI, Abrams TA, Biller LH, Chan JA, Cleary JM, Corsello SM, Enzinger AC, Enzinger PC, Mayer RJ, McCleary NJ, Meyerhardt JA, Ng K, Patel AK, Perez KJ, Rahma OE, Rubinson DA, Wisch JS, Yurgelun MB, Hassett MJ, MacConaill L, Schrag D, Cerami E, Wolpin BM, Nowak JA, Giannakis M. Programmatic Precision Oncology Decision Support for Patients With Gastrointestinal Cancer. JCO Precis Oncol 2023;7:e2200342. [PMID: 36634297 PMCID: PMC9929103 DOI: 10.1200/po.22.00342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Revised: 08/30/2022] [Accepted: 11/22/2022] [Indexed: 01/13/2023] Open

Abstract

PURPOSE

With the growing number of available targeted therapeutics and molecular biomarkers, the optimal care of patients with cancer now depends on a comprehensive understanding of the rapidly evolving landscape of precision oncology, which can be challenging for oncologists to navigate alone.

METHODS

We developed and implemented a precision oncology decision support system, GI TARGET, (Gastrointestinal Treatment Assistance Regarding Genomic Evaluation of Tumors) within the Gastrointestinal Cancer Center at the Dana-Farber Cancer Institute. With a multidisciplinary team, we systematically reviewed tumor molecular profiling for GI tumors and provided molecularly informed clinical recommendations, which included identifying appropriate clinical trials aided by the computational matching platform MatchMiner, suggesting targeted therapy options on or off the US Food and Drug Administration-approved label, and consideration of additional or orthogonal molecular testing.

RESULTS

We reviewed genomic data and provided clinical recommendations for 506 patients with GI cancer who underwent tumor molecular profiling between January and June 2019 and determined follow-up using the electronic health record. Summary reports were provided to 19 medical oncologists for patients with colorectal (n = 198, 39%), pancreatic (n = 124, 24%), esophagogastric (n = 67, 13%), biliary (n = 40, 8%), and other GI cancers. We recommended ≥ 1 precision medicine clinical trial for 80% (406 of 506) of patients, leading to 24 enrollments. We recommended on-label and off-label targeted therapies for 6% (28 of 506) and 25% (125 of 506) of patients, respectively. Recommendations for additional or orthogonal testing were made for 42% (211 of 506) of patients.

CONCLUSION

The integration of precision medicine in routine cancer care through a dedicated multidisciplinary molecular tumor board is scalable and sustainable, and implementation of precision oncology recommendations has clinical utility for patients with cancer.

Collapse

Affiliation(s)

Rachel B. Keller Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Tali Mazor Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Lynette Sholl Center for Advanced Molecular Diagnostics, Brigham & Women's Hospital & Harvard Medical School, Boston, MA
Andrew J. Aguirre Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA Broad Institute of Harvard and MIT, Cambridge, MA
Harshabad Singh Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Nilay Sethi Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Adam Bass Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Ankur K. Nagaraja Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Lauren K. Brais Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Emma Hill Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Connor Hennessey Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Margaret Cusick Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Catherine Del Vecchio Fitz Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Zachary Zwiesler Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Ethan Siegel Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Andrea Ovalle Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Pavel Trukhanov Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Jason Hansel Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Geoffrey I. Shapiro Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Thomas A. Abrams Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Leah H. Biller Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Jennifer A. Chan Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
James M. Cleary Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Steven M. Corsello Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Andrea C. Enzinger Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Peter C. Enzinger Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Robert J. Mayer Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Nadine J. McCleary Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Jeffrey A. Meyerhardt Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Kimmie Ng Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Anuj K. Patel Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Kimberley J. Perez Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Osama E. Rahma Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Douglas A. Rubinson Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Jeffrey S. Wisch Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Matthew B. Yurgelun Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Michael J. Hassett Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Laura MacConaill Center for Advanced Molecular Diagnostics, Brigham & Women's Hospital & Harvard Medical School, Boston, MA
Deborah Schrag Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Ethan Cerami Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
Brian M. Wolpin Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA
Jonathan A. Nowak Center for Advanced Molecular Diagnostics, Brigham & Women's Hospital & Harvard Medical School, Boston, MA
Marios Giannakis Department of Medical Oncology, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA Broad Institute of Harvard and MIT, Cambridge, MA

Collapse

Mayta-Tovalino F, Munive-Degregori A, Luza S, Cárdenas-Mariño F, Guerrero M, Barja-Ore J. Applications and perspectives of artificial intelligence, machine learning and “dentronics” in dentistry: A literature review. J Int Soc Prev Community Dent 2023;13:1-8. [PMID: 37153930 PMCID: PMC10155874 DOI: 10.4103/jispcd.jispcd_35_22] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 06/08/2022] [Accepted: 06/28/2022] [Indexed: 03/11/2023] Open

Lee EB, Heo GE, Choi CM, Song M. MLM-based typographical error correction of unstructured medical texts for named entity recognition. BMC Bioinformatics 2022;23:486. [PMID: 36384464 PMCID: PMC9670595 DOI: 10.1186/s12859-022-05035-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 11/04/2022] [Indexed: 11/18/2022] Open

Abstract

Background

Unstructured text in medical records, such as Electronic Health Records, contain an enormous amount of valuable information for research; however, it is difficult to extract and structure important information because of frequent typographical errors. Therefore, improving the quality of data with errors for text analysis is an essential task. To date, few prior studies have been conducted addressing this. Here, we propose a new methodology for extracting important information from unstructured medical texts by overcoming the typographical problem in surgical pathology records related to lung cancer.

Methods

We propose a typo correction model that considers context, based on the Masked Language Model, to solve the problem of typographical errors in real-world medical data. In addition, a word dictionary was used for the typo correction model based on PubMed abstracts. After refining the data through typo correction, fine tuning was performed on pre-trained BERT model. Next, deep learning-based Named Entity Recognition (NER) was performed. By solving the quality problem of medical data, we sought to improve the accuracy of information extraction in unstructured text data.

Results

We compared the performance of the proposed typo correction model based on contextual information with an existing SymSpell model. We confirmed that our proposed model outperformed the existing model in a typographical correction task. The F1-score of the model improved by approximately 5% and 9% when compared with the model without contextual information in the NCBI-disease and surgical pathology record datasets, respectively. In addition, the F1-score of NER after typo correction increased by 2% in the NCBI-disease dataset. There was a significant performance difference of approximately 25% between the before and after typo correction in the Surgical pathology record dataset. This confirmed that typos influenced the information extraction of the unstructured text.

Conclusion

We verified that typographical errors in unstructured text negatively affect the performance of natural language processing tasks. The proposed method of a typo correction model outperformed the existing SymSpell model. This study shows that the proposed model is robust and can be applied in real-world environments by focusing on the typos that cause difficulties in analyzing unstructured medical text.

Collapse

Elgarten CW, Thompson JC, Angiolillo A, Chen Z, Conway S, Devidas M, Gupta S, Kairalla JA, McNeer JL, O’Brien MM, Rabin KR, Rau RE, Rheingold SR, Wang C, Wood C, Raetz EA, Loh ML, Alexander S, Miller TP. Improving infectious adverse event reporting for children and adolescents enrolled in clinical trials for acute lymphoblastic leukemia: A report from the Children's Oncology Group. Pediatr Blood Cancer 2022;69:e29937. [PMID: 36083863 PMCID: PMC9529813 DOI: 10.1002/pbc.29937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 07/28/2022] [Accepted: 07/30/2022] [Indexed: 11/08/2022]

Affiliation(s)

Caitlin W. Elgarten Children’s Hospital of Philadelphia, Department of Pediatrics, Division of Oncology, Philadelphia, PA
Joel C. Thompson Children’s Mercy Hospital, Department of Pediatrics, Division of Hematology/Oncology/Bone Marrow Transplant, University of Missouri-Kansas City, Kansas City, MO
Anne Angiolillo Children’s National Medical Center, Center for Cancer and Blood Disorders, Washington DC
Zhiguo Chen University of Florida, Department of Biostatistics, Gainesville, FL
Susan Conway University of Florida, Department of Biostatistics, Gainesville, FL
Meenakshi Devidas St. Jude Children’s Research Hospital, Memphis, TN
Sumit Gupta Department of Hematology/Oncology, Hospital for Sick Children, Toronto, ON
John A. Kairalla University of Florida, Department of Biostatistics, Gainesville, FL
Jennifer L. McNeer University of Chicago Comer Children’s Hospital, Chicago, IL
Maureen M. O’Brien University of Cincinnati College of Medicine, Cincinnati Children’s Hospital Medical Center, Pediatric Hematology/Oncology, Cincinnati, OH
Karen R. Rabin Baylor College of Medicine, Pediatric Hematology/Oncology, Houston, TX
Rachel E. Rau Baylor College of Medicine, Pediatric Hematology/Oncology, Houston, TX
Susan R. Rheingold Children’s Hospital of Philadelphia, Department of Pediatrics, Division of Oncology, Philadelphia, PA
Cindy Wang University of Florida, Department of Biostatistics, Gainesville, FL
Charlotte Wood University of Florida, Department of Biostatistics, Gainesville, FL
Elizabeth A. Raetz Laura and Isaac Perlmutter Cancer Center at NYU Langone, New York, NY
Mignon L. Loh Division of Hematology, Oncology, Bone Marrow Transplant, and Cellular Therapies, Seattle Children’s Hospital and the Ben Towne Center for Childhood Cancer Research, University of Washington, Seattle, WA
Sarah Alexander Department of Hematology/Oncology, Hospital for Sick Children, Toronto, ON
Tamara P. Miller Children’s Healthcare of Atlanta – Egleston, Pediatric Hematology/Oncology, Atlanta, GA

Collapse

McKee M, Wouters OJ. The Challenges of Regulating Artificial Intelligence in Healthcare Comment on "Clinical Decision Support and New Regulatory Frameworks for Medical Devices: Are We Ready for It? - A Viewpoint Paper". Int J Health Policy Manag 2022;12:7261. [PMID: 36243948 PMCID: PMC10125205 DOI: 10.34172/ijhpm.2022.7261] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 09/07/2022] [Indexed: 11/07/2022] Open

Causa Andrieu P, Golia Pernicka JS, Yaeger R, Lupton K, Batch K, Zulkernine F, Simpson AL, Taya M, Gazit L, Nguyen H, Nicholas K, Gangai N, Sevilimedu V, Dickinson S, Paroder V, Bates DD, Do R. Natural Language Processing of Computed Tomography Reports to Label Metastatic Phenotypes With Prognostic Significance in Patients With Colorectal Cancer. JCO Clin Cancer Inform 2022;6:e2200014. [PMID: 36103642 PMCID: PMC9848599 DOI: 10.1200/cci.22.00014] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 06/04/2022] [Accepted: 08/04/2022] [Indexed: 01/21/2023] Open

Abstract

PURPOSE

Natural language processing (NLP) applied to radiology reports can help identify clinically relevant M1 subcategories of patients with colorectal cancer (CRC). The primary purpose was to compare the overall survival (OS) of CRC according to American Joint Committee on Cancer TNM staging and explore an alternative classification. The secondary objective was to estimate the frequency of metastasis for each organ.

METHODS

Retrospective study of CRC who underwent computed tomography (CT) chest, abdomen, and pelvis between July 1, 2009, and March 26, 2019, at a tertiary cancer center, previously labeled for the presence or absence of metastasis by an NLP prediction model. Patients were classified in M0, M1a, M1b, and M1c (American Joint Committee on Cancer), or an alternative classification on the basis of the metastasis organ number: M1, single; M2, two; M3, three or more organs. Cox regression models were used to estimate hazard ratios; Kaplan-Meier curves were used to visualize survival curves using the two M1 subclassifications.

RESULTS

Nine thousand nine hundred twenty-eight patients with a total of 48,408 CT chest, abdomen, and pelvis reports were included. On the basis of NLP prediction, the median OS of M1a, M1b, and M1c was 4.47, 1.72, and 1.52 years, respectively. The median OS of M1, M2, and M3 was 4.24, 2.05, and 1.04 years, respectively. Metastases occurred most often in liver (35.8%), abdominopelvic lymph nodes (32.9%), lungs (29.3%), peritoneum (22.0%), thoracic nodes (19.9%), bones (9.2%), and pelvic organs (7.5%). Spleen and adrenal metastases occurred in < 5%.

CONCLUSION

NLP applied to a large radiology report database can identify clinically relevant metastatic phenotypes and be used to investigate new M1 substaging for CRC. Patients with three or more metastatic disease organs have the worst prognosis, with an OS of 1 year.

Collapse

Fink MA, Kades K, Bischoff A, Moll M, Schnell M, Küchler M, Köhler G, Sellner J, Heussel CP, Kauczor HU, Schlemmer HP, Maier-Hein K, Weber TF, Kleesiek J. Deep Learning-based Assessment of Oncologic Outcomes from Natural Language Processing of Structured Radiology Reports. Radiol Artif Intell 2022;4:e220055. [PMID: 36204531 PMCID: PMC9530771 DOI: 10.1148/ryai.220055] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 06/20/2022] [Accepted: 07/07/2022] [Indexed: 06/16/2023]

Abstract

PURPOSE

To train a deep natural language processing (NLP) model, using data mined structured oncology reports (SOR), for rapid tumor response category (TRC) classification from free-text oncology reports (FTOR) and to compare its performance with human readers and conventional NLP algorithms.

MATERIALS AND METHODS

In this retrospective study, databases of three independent radiology departments were queried for SOR and FTOR dated from March 2018 to August 2021. An automated data mining and curation pipeline was developed to extract Response Evaluation Criteria in Solid Tumors-related TRCs for SOR for ground truth definition. The deep NLP bidirectional encoder representations from transformers (BERT) model and three feature-rich algorithms were trained on SOR to predict TRCs in FTOR. Models' F1 scores were compared against scores of radiologists, medical students, and radiology technologist students. Lexical and semantic analyses were conducted to investigate human and model performance on FTOR.

RESULTS

Oncologic findings and TRCs were accurately mined from 9653 of 12 833 (75.2%) queried SOR, yielding oncology reports from 10 455 patients (mean age, 60 years ± 14 [SD]; 5303 women) who met inclusion criteria. On 802 FTOR in the test set, BERT achieved better TRC classification results (F1, 0.70; 95% CI: 0.68, 0.73) than the best-performing reference linear support vector classifier (F1, 0.63; 95% CI: 0.61, 0.66) and technologist students (F1, 0.65; 95% CI: 0.63, 0.67), had similar performance to medical students (F1, 0.73; 95% CI: 0.72, 0.75), but was inferior to radiologists (F1, 0.79; 95% CI: 0.78, 0.81). Lexical complexity and semantic ambiguities in FTOR influenced human and model performance, revealing maximum F1 score drops of -0.17 and -0.19, respectively.

CONCLUSION

The developed deep NLP model reached the performance level of medical students but not radiologists in curating oncologic outcomes from radiology FTOR.Keywords: Neural Networks, Computer Applications-Detection/Diagnosis, Oncology, Research Design, Staging, Tumor Response, Comparative Studies, Decision Analysis, Experimental Investigations, Observer Performance, Outcomes Analysis Supplemental material is available for this article. © RSNA, 2022.

Collapse

Natural Language Processing in Radiology: Update on Clinical Applications. J Am Coll Radiol 2022;19:1271-1285. [PMID: 36029890 DOI: 10.1016/j.jacr.2022.06.016] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 05/25/2022] [Accepted: 06/03/2022] [Indexed: 11/24/2022]

Development and validation of an abnormality-derived deep-learning diagnostic system for major respiratory diseases. NPJ Digit Med 2022;5:124. [PMID: 35999467 PMCID: PMC9395860 DOI: 10.1038/s41746-022-00648-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 07/04/2022] [Indexed: 01/05/2023] Open

Abstract

Respiratory diseases impose a tremendous global health burden on large patient populations. In this study, we aimed to develop DeepMRD^TR, a deep learning-based medical image interpretation system for the diagnosis of major respiratory diseases based on the automated identification of a wide range of radiological abnormalities through computed tomography (CT) and chest X-ray (CXR) from real-world, large-scale datasets. DeepMRD^TR comprises four networks (two CT-Nets and two CXR-Nets) that exploit contrastive learning to generate pre-training parameters that are fine-tuned on the retrospective dataset collected from a single institution. The performance of DeepMRD^TR was evaluated for abnormality identification and disease diagnosis on data from two different institutions: one was an internal testing dataset from the same institution as the training data and the second was collected from an external institution to evaluate the model generalizability and robustness to an unrelated population dataset. In such a difficult multi-class diagnosis task, our system achieved the average area under the receiver operating characteristic curve (AUC) of 0.856 (95% confidence interval (CI):0.843–0.868) and 0.841 (95%CI:0.832–0.887) for abnormality identification, and 0.900 (95%CI:0.872–0.958) and 0.866 (95%CI:0.832–0.887) for major respiratory diseases’ diagnosis on CT and CXR datasets, respectively. Furthermore, to achieve a clinically actionable diagnosis, we deployed a preliminary version of DeepMRD^TR into the clinical workflow, which was performed on par with senior experts in disease diagnosis, with an AUC of 0.890 and a Cohen’s k of 0.746–0.877 at a reasonable timescale; these findings demonstrate the potential to accelerate the medical workflow to facilitate early diagnosis as a triage tool for respiratory diseases which supports improved clinical diagnoses and decision-making.

Collapse

An Artificial Intelligence-Based Tool for Data Analysis and Prognosis in Cancer Patients: Results from the Clarify Study. Cancers (Basel) 2022;14:cancers14164041. [PMID: 36011034 PMCID: PMC9406336 DOI: 10.3390/cancers14164041] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 08/18/2022] [Accepted: 08/19/2022] [Indexed: 11/16/2022] Open

Abstract

Simple Summary

Cancer is associated with significant morbimortality worldwide. Although significant advances have been made in the last few decades in terms of early detection and treatment, providing personalized care remains a challenge. Artificial intelligence (AI) has emerged as a means of improving cancer care with the use of computer science. Identification of risk factors for poor prognosis and patient profiling with AI techniques and tools is feasible and has potential application in clinical settings, including surveillance management. The goal of this study is to present an AI-based solution tool for cancer patients data analysis and improve their management by identifying clinical factors associated with relapse and survival, developing a prognostic model that identifies features associated with poor prognosis, and stratifying patients by risk.

Abstract

Background: Artificial intelligence (AI) has contributed substantially in recent years to the resolution of different biomedical problems, including cancer. However, AI tools with significant and widespread impact in oncology remain scarce. The goal of this study is to present an AI-based solution tool for cancer patients data analysis that assists clinicians in identifying the clinical factors associated with poor prognosis, relapse and survival, and to develop a prognostic model that stratifies patients by risk. Materials and Methods: We used clinical data from 5275 patients diagnosed with non-small cell lung cancer, breast cancer, and non-Hodgkin lymphoma at Hospital Universitario Puerta de Hierro-Majadahonda. Accessible clinical parameters measured with a wearable device and quality of life questionnaires data were also collected. Results: Using an AI-tool, data from 5275 cancer patients were analyzed, integrating clinical data, questionnaires data, and data collected from wearable devices. Descriptive analyses were performed in order to explore the patients’ characteristics, survival probabilities were calculated, and a prognostic model identified low and high-risk profile patients. Conclusion: Overall, the reconstruction of the population’s risk profile for the cancer-specific predictive model was achieved and proved useful in clinical practice using artificial intelligence. It has potential application in clinical settings to improve risk stratification, early detection, and surveillance management of cancer patients.

Collapse

Lindvall C, Deng CY, Agaronnik ND, Kwok A, Samineni S, Umeton R, Mackie-Jenkins W, Kehl KL, Tulsky JA, Enzinger AC. Deep Learning for Cancer Symptoms Monitoring on the Basis of Electronic Health Record Unstructured Clinical Notes. JCO Clin Cancer Inform 2022;6:e2100136. [PMID: 35714301 PMCID: PMC9232368 DOI: 10.1200/cci.21.00136] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

PURPOSE

Symptoms are vital outcomes for cancer clinical trials, observational research, and population-level surveillance. Patient-reported outcomes (PROs) are valuable for monitoring symptoms, yet there are many challenges to collecting PROs at scale. We sought to develop, test, and externally validate a deep learning model to extract symptoms from unstructured clinical notes in the electronic health record.

METHODS

We randomly selected 1,225 outpatient progress notes from among patients treated at the Dana-Farber Cancer Institute between January 2016 and December 2019 and used 1,125 notes as our training/validation data set and 100 notes as our test data set. We evaluated the performance of 10 deep learning models for detecting 80 symptoms included in the National Cancer Institute's Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE) framework. Model performance as compared with manual chart abstraction was assessed using standard metrics, and the highest performer was externally validated on a sample of 100 physician notes from a different clinical context.

RESULTS

In our training and test data sets, 75 of the 80 candidate symptoms were identified. The ELECTRA-small model had the highest performance for symptom identification at the token level (ie, at the individual symptom level), with an F1 of 0.87 and a processing time of 3.95 seconds per note. For the 10 most common symptoms in the test data set, the F1 score ranged from 0.98 for anxious to 0.86 for fatigue. For external validation of the same symptoms, the note-level performance ranged from F1 = 0.97 for diarrhea and dizziness to F1 = 0.73 for swelling.

CONCLUSION

Training a deep learning model to identify a wide range of electronic health record-documented symptoms relevant to cancer care is feasible. This approach could be used at the health system scale to complement to electronic PROs.

Collapse

Ma X, Bellomo L, Hooley I, Williams T, Samant M, Tan K, Segal B, Bourla AB. Concordance of Clinician-Documented and Imaging Response in Patients With Stage IV Non-Small Cell Lung Cancer Treated With First-Line Therapy. JAMA Netw Open 2022;5:e229655. [PMID: 35552726 PMCID: PMC9099424 DOI: 10.1001/jamanetworkopen.2022.9655] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Abstract

IMPORTANCE

In observational oncology studies of solid tumors, response to treatment can be evaluated based on electronic health record (EHR) documentation (clinician-assessed response [CAR]), an approach different from standardized radiologist-measured response (Response Evaluation Criteria in Solid Tumours [RECIST] 1.1).

OBJECTIVE

To evaluate the feasibility of an imaging response based on RECIST (IRb-RECIST) and the concordance between CAR and imaging response based on RECIST assessments, and investigate discordance causes.

DESIGN, SETTING, AND PARTICIPANTS

This cohort study used an EHR-derived, deidentified database that included patients with stage IV non-small cell lung cancer (NSCLC) diagnosed between January 1, 2011, to June 30, 2019, selected from 3 study sites. Data analysis was conducted in August, 2020.

EXPOSURES

Undergoing first-line therapy and imaging assessments of response to treatment.

MAIN OUTCOMES AND MEASURES

In this study, CAR assessments (referred to in prior publications as "real-world response" [rwR]) were defined as clinician-documented changes in disease burden at radiologic evaluation time points; they were abstracted manually and assigned to response categories. The RECIST-based assessments accommodated routine practice patterns by using a modified version of RECIST 1.1 (IRb-RECIST), with independent radiology reads. Concordance was calculated as the percent agreement across all response categories and across a dichotomous stratification (response [complete or partial] vs no response), unconfirmed or confirmed.

RESULTS

This study found that, in 100 patients evaluated for concordance, agreement between CAR and IRb-RECIST was 71% (95% CI, 61%-80%), and 74% (95% CI, 64%-82%) for confirmed and unconfirmed response, respectively. There were more responders using CAR than IRb-RECIST (40 vs 29 with confirmation; 64 vs 43 without confirmation). The main sources of discordance were the different use of thresholds for tumor size changes by RECIST vs routine care, and unavailable baseline or follow-up scans resulting in inconsistent anatomic coverage over time.

CONCLUSIONS AND RELEVANCE

In this cohort study of patients with stage IV NSCLC, we collected routine-care imaging, showing the feasibility of response evaluation using IRb-RECIST criteria with independent centralized review. Concordance between CAR and centralized IRb-RECIST was moderate. Future work is needed to evaluate the generalizability of these results to broader populations, and investigate concordance in other clinical settings.

Collapse

Eresen A. Diagnosis of meniscal tears through automated interpretation of medical reports via machine learning. Acad Radiol 2022;29:488-489. [PMID: 34996688 DOI: 10.1016/j.acra.2021.12.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 12/04/2021] [Accepted: 12/06/2021] [Indexed: 11/26/2022]

Linna N, Kahn CE. Applications of Natural Language Processing in Radiology: A Systematic Review. Int J Med Inform 2022;163:104779. [DOI: 10.1016/j.ijmedinf.2022.104779] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/28/2022] [Accepted: 04/21/2022] [Indexed: 12/27/2022]

Automated Radiology-Arthroscopy Correlation of Knee Meniscal Tears Using Natural Language Processing Algorithms. Acad Radiol 2022;29:479-487. [PMID: 33583713 DOI: 10.1016/j.acra.2021.01.017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 01/19/2021] [Accepted: 01/21/2021] [Indexed: 12/29/2022]

Abstract

RATIONALE AND OBJECTIVES

Train and apply natural language processing (NLP) algorithms for automated radiology-arthroscopy correlation of meniscal tears.

MATERIALS AND METHODS

In this retrospective single-institution study, we trained supervised machine learning models (logistic regression, support vector machine, and random forest) to detect medial or lateral meniscus tears on free-text MRI reports. We trained and evaluated model performances with cross-validation using 3593 manually annotated knee MRI reports. To assess radiology-arthroscopy correlation, we then randomly partitioned this dataset 80:20 for training and testing, where 108 test set MRIs were followed by knee arthroscopy within 1 year. These free-text arthroscopy reports were also manually annotated. The NLP algorithms trained on the knee MRI training dataset were then evaluated on the MRI and arthroscopy report test datasets. We assessed radiology-arthroscopy agreement using the ensembled NLP-extracted findings versus manually annotated findings.

RESULTS

The NLP models showed high cross-validation performance for meniscal tear detection on knee MRI reports (medial meniscus F1 scores 0.93-0.94, lateral meniscus F1 scores 0.86-0.88). When these algorithms were evaluated on arthroscopy reports, despite never training on arthroscopy reports, performance was similar, though higher with model ensembling (medial meniscus F1 score 0.97, lateral meniscus F1 score 0.99). However, ensembling did not improve performance on knee MRI reports. In the radiology-arthroscopy test set, the ensembled NLP models were able to detect mismatches between MRI and arthroscopy reports with sensitivity 79% and specificity 87%.

CONCLUSION

Radiology-arthroscopy correlation can be automated for knee meniscal tears using NLP algorithms, which shows promise for education and quality improvement.

Collapse

Batch KE, Yue J, Darcovich A, Lupton K, Liu CC, Woodlock DP, El Amine MAK, Causa-Andrieu PI, Gazit L, Nguyen GH, Zulkernine F, Do RKG, Simpson AL. Developing a Cancer Digital Twin: Supervised Metastases Detection From Consecutive Structured Radiology Reports. Front Artif Intell 2022;5:826402. [PMID: 35310959 PMCID: PMC8924403 DOI: 10.3389/frai.2022.826402] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 01/27/2022] [Indexed: 11/13/2022] Open

Abstract

The development of digital cancer twins relies on the capture of high-resolution representations of individual cancer patients throughout the course of their treatment. Our research aims to improve the detection of metastatic disease over time from structured radiology reports by exposing prediction models to historical information. We demonstrate that Natural language processing (NLP) can generate better weak labels for semi-supervised classification of computed tomography (CT) reports when it is exposed to consecutive reports through a patient's treatment history. Around 714,454 structured radiology reports from Memorial Sloan Kettering Cancer Center adhering to a standardized departmental structured template were used for model development with a subset of the reports included for validation. To develop the models, a subset of the reports was curated for ground-truth: 7,732 total reports in the lung metastases dataset from 867 individual patients; 2,777 reports in the liver metastases dataset from 315 patients; and 4,107 reports in the adrenal metastases dataset from 404 patients. We use NLP to extract and encode important features from the structured text reports, which are then used to develop, train, and validate models. Three models—a simple convolutional neural network (CNN), a CNN augmented with an attention layer, and a recurrent neural network (RNN)—were developed to classify the type of metastatic disease and validated against the ground truth labels. The models use features from consecutive structured text radiology reports of a patient to predict the presence of metastatic disease in the reports. A single-report model, previously developed to analyze one report instead of multiple past reports, is included and the results from all four models are compared based on accuracy, precision, recall, and F1-score. The best model is used to label all 714,454 reports to generate metastases maps. Our results suggest that NLP models can extract cancer progression patterns from multiple consecutive reports and predict the presence of metastatic disease in multiple organs with higher performance when compared with a single-report-based prediction. It demonstrates a promising automated approach to label large numbers of radiology reports without involving human experts in a time- and cost-effective manner and enables tracking of cancer progression over time.

Collapse

Lavery JA, Lepisto EM, Brown S, Rizvi H, McCarthy C, LeNoue-Newton M, Yu C, Lee J, Guo X, Yu T, Rudolph J, Sweeney S, Park BH, Warner JL, Bedard PL, Riely G, Schrag D, Panageas KS. A Scalable Quality Assurance Process for Curating Oncology Electronic Health Records: The Project GENIE Biopharma Collaborative Approach. JCO Clin Cancer Inform 2022;6:e2100105. [PMID: 35192403 PMCID: PMC8863125 DOI: 10.1200/cci.21.00105] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Boehm KM, Khosravi P, Vanguri R, Gao J, Shah SP. Harnessing multimodal data integration to advance precision oncology. Nat Rev Cancer 2022;22:114-126. [PMID: 34663944 PMCID: PMC8810682 DOI: 10.1038/s41568-021-00408-3] [Citation(s) in RCA: 158] [Impact Index Per Article: 79.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/08/2021] [Indexed: 02/07/2023]