Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Banerjee I, Gensheimer MF, Wood DJ, Henry S, Aggarwal S, Chang DT, Rubin DL. Probabilistic Prognostic Estimates of Survival in Metastatic Cancer Patients (PPES-Met) Utilizing Free-Text Clinical Narratives. Sci Rep 2018;8:10037. [PMID: 29968730 PMCID: PMC6030075 DOI: 10.1038/s41598-018-27946-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 06/12/2018] [Indexed: 02/07/2023] Open

For:	Banerjee I, Gensheimer MF, Wood DJ, Henry S, Aggarwal S, Chang DT, Rubin DL. Probabilistic Prognostic Estimates of Survival in Metastatic Cancer Patients (PPES-Met) Utilizing Free-Text Clinical Narratives. Sci Rep 2018;8:10037. [PMID: 29968730 PMCID: PMC6030075 DOI: 10.1038/s41598-018-27946-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2017] [Accepted: 06/12/2018] [Indexed: 02/07/2023] Open

Number

Cited by Other Article(s)

Lin H, Ni L, Phuong C, Hong JC. Natural Language Processing for Radiation Oncology: Personalizing Treatment Pathways. Pharmgenomics Pers Med 2024;17:65-76. [PMID: 38370334 PMCID: PMC10874185 DOI: 10.2147/pgpm.s396971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 01/29/2024] [Indexed: 02/20/2024] Open

Saha A, Burns L, Kulkarni AM. A scoping review of natural language processing of radiology reports in breast cancer. Front Oncol 2023;13:1160167. [PMID: 37124523 PMCID: PMC10130381 DOI: 10.3389/fonc.2023.1160167] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 03/28/2023] [Indexed: 05/02/2023] Open

DRUG REPOSITIONING FOR CANCER IN THE ERA OF BIG OMICS AND REAL-WORLD DATA. Crit Rev Oncol Hematol 2022;175:103730. [DOI: 10.1016/j.critrevonc.2022.103730] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 05/25/2022] [Accepted: 05/27/2022] [Indexed: 11/15/2022] Open

Zeng J, Gensheimer MF, Rubin DL, Athey S, Shachter RD. Uncovering interpretable potential confounders in electronic medical records. Nat Commun 2022;13:1014. [PMID: 35197467 PMCID: PMC8866497 DOI: 10.1038/s41467-022-28546-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 01/28/2022] [Indexed: 12/25/2022] Open

Wang S, Tseng B, Hernandez-Boussard T. Development and evaluation of novel ophthalmology domain-specific neural word embeddings to predict visual prognosis. Int J Med Inform 2021;150:104464. [PMID: 33892445 PMCID: PMC8183292 DOI: 10.1016/j.ijmedinf.2021.104464] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 03/20/2021] [Accepted: 04/11/2021] [Indexed: 01/17/2023]

Eyuboglu S, Angus G, Patel BN, Pareek A, Davidzon G, Long J, Dunnmon J, Lungren MP. Multi-task weak supervision enables anatomically-resolved abnormality detection in whole-body FDG-PET/CT. Nat Commun 2021;12:1880. [PMID: 33767174 PMCID: PMC7994797 DOI: 10.1038/s41467-021-22018-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 02/16/2021] [Indexed: 11/09/2022] Open

Banerjee I, de Sisternes L, Hallak JA, Leng T, Osborne A, Rosenfeld PJ, Gregori G, Durbin M, Rubin D. Prediction of age-related macular degeneration disease using a sequential deep learning approach on longitudinal SD-OCT imaging biomarkers. Sci Rep 2020;10:15434. [PMID: 32963300 PMCID: PMC7508843 DOI: 10.1038/s41598-020-72359-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 08/23/2020] [Indexed: 01/28/2023] Open

Banerjee I, Bozkurt S, Caswell-Jin JL, Kurian AW, Rubin DL. Natural Language Processing Approaches to Detect the Timeline of Metastatic Recurrence of Breast Cancer. JCO Clin Cancer Inform 2020;3:1-12. [PMID: 31584836 DOI: 10.1200/cci.19.00034] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open

Abstract

PURPOSE

Electronic medical records (EMRs) and population-based cancer registries contain information on cancer outcomes and treatment, yet rarely capture information on the timing of metastatic cancer recurrence, which is essential to understand cancer survival outcomes. We developed a natural language processing (NLP) system to identify patient-specific timelines of metastatic breast cancer recurrence.

PATIENTS AND METHODS

We used the OncoSHARE database, which includes merged data from the California Cancer Registry and EMRs of 8,956 women diagnosed with breast cancer in 2000 to 2018. We curated a comprehensive vocabulary by interviewing expert clinicians and processing radiology and pathology reports and progress notes. We developed and evaluated the following two distinct NLP approaches to analyze free-text notes: a traditional rule-based model, using rules for metastatic detection from the literature and curated by domain experts; and a contemporary neural network model. For each 3-month period (quarter) from 2000 to 2018, we applied both models to infer recurrence status for that quarter. We trained the NLP models using 894 randomly selected patient records that were manually reviewed by clinical experts and evaluated model performance using 179 hold-out patients (20%) as a test set.

RESULTS

The median follow-up time was 19 quarters (5 years) for the training set and 15 quarters (4 years) for the test set. The neural network model predicted the timing of distant metastatic recurrence with a sensitivity of 0.83 and specificity of 0.73, outperforming the rule-based model, which had a specificity of 0.35 and sensitivity of 0.88 (P < .001).

CONCLUSION

We developed an NLP method that enables identification of the occurrence and timing of metastatic breast cancer recurrence from EMRs. This approach may be adaptable to other cancer sites and could help to unlock the potential of EMRs for research on real-world cancer outcomes.

Collapse

Spasic I, Nenadic G. Clinical Text Data in Machine Learning: Systematic Review. JMIR Med Inform 2020;8:e17984. [PMID: 32229465 PMCID: PMC7157505 DOI: 10.2196/17984] [Citation(s) in RCA: 108] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 02/24/2020] [Accepted: 02/24/2020] [Indexed: 12/22/2022] Open

Abstract

Background

Clinical narratives represent the main form of communication within health care, providing a personalized account of patient history and assessments, and offering rich information for clinical decision making. Natural language processing (NLP) has repeatedly demonstrated its feasibility to unlock evidence buried in clinical narratives. Machine learning can facilitate rapid development of NLP tools by leveraging large amounts of text data.

Objective

The main aim of this study was to provide systematic evidence on the properties of text data used to train machine learning approaches to clinical NLP. We also investigated the types of NLP tasks that have been supported by machine learning and how they can be applied in clinical practice.

Methods

Our methodology was based on the guidelines for performing systematic reviews. In August 2018, we used PubMed, a multifaceted interface, to perform a literature search against MEDLINE. We identified 110 relevant studies and extracted information about text data used to support machine learning, NLP tasks supported, and their clinical applications. The data properties considered included their size, provenance, collection methods, annotation, and any relevant statistics.

Results

The majority of datasets used to train machine learning models included only hundreds or thousands of documents. Only 10 studies used tens of thousands of documents, with a handful of studies utilizing more. Relatively small datasets were utilized for training even when much larger datasets were available. The main reason for such poor data utilization is the annotation bottleneck faced by supervised machine learning algorithms. Active learning was explored to iteratively sample a subset of data for manual annotation as a strategy for minimizing the annotation effort while maximizing the predictive performance of the model. Supervised learning was successfully used where clinical codes integrated with free-text notes into electronic health records were utilized as class labels. Similarly, distant supervision was used to utilize an existing knowledge base to automatically annotate raw text. Where manual annotation was unavoidable, crowdsourcing was explored, but it remains unsuitable because of the sensitive nature of data considered. Besides the small volume, training data were typically sourced from a small number of institutions, thus offering no hard evidence about the transferability of machine learning models. The majority of studies focused on text classification. Most commonly, the classification results were used to support phenotyping, prognosis, care improvement, resource management, and surveillance.

Conclusions

We identified the data annotation bottleneck as one of the key obstacles to machine learning approaches in clinical NLP. Active learning and distant supervision were explored as a way of saving the annotation efforts. Future research in this field would benefit from alternatives such as data augmentation and transfer learning, or unsupervised learning, which do not require data annotation.

Collapse

Deep learning-based interpretation of basal/acetazolamide brain perfusion SPECT leveraging unstructured reading reports. Eur J Nucl Med Mol Imaging 2020;47:2186-2196. [PMID: 31912255 DOI: 10.1007/s00259-019-04670-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 12/23/2019] [Indexed: 12/27/2022]

Abstract

PURPOSE

Basal/acetazolamide brain perfusion single-photon emission computed tomography (SPECT) has been used to evaluate functional hemodynamics in patients with carotid artery stenosis. We aimed to develop a deep learning model as a support system for interpreting brain perfusion SPECT leveraging unstructured text reports.

METHODS

In total, 7345 basal/acetazolamide brain perfusion SPECT images and their text reports were retrospectively collected. A long short-term memory (LSTM) network was trained using 500 randomly selected text reports to predict manually labeled structured information, including abnormalities of basal perfusion and vascular reserve for each vascular territory. Using this trained LSTM model, we extracted structured information from the remaining 6845 text reports to develop a deep learning model for interpreting SPECT images. The model was based on a 3D convolutional neural network (CNN), and the performance was tested on the other 500 cases by measuring the area under the receiver-operating characteristic curve (AUC). We then applied the model to patients who underwent revascularization (n = 33) to compare the estimated output of the CNN model for pre- and post-revascularization SPECT and clinical outcomes.

RESULTS

The AUC of the LSTM model for extracting structured labels was 1.00 for basal perfusion and 0.99 for vascular reserve for all 9 brain regions. The AUC of the CNN model designed to identify abnormal perfusion was 0.83 for basal perfusion and 0.89 for vascular reserve. The output of the CNN model was significantly improved according to the revascularization in the target vascular territory, and its changes in brain territories were concordant with clinical outcomes.

CONCLUSION

We developed a deep learning model to support the interpretation of brain perfusion SPECT by converting unstructured text reports into structured labels. This model can be used as a support system not only to identify perfusion abnormalities but also to provide quantitative scores of abnormalities, particularly for patients who require revascularization.

Collapse

Koutkias V, Bouaud J. Contributions on Clinical Decision Support from the 2018 Literature. Yearb Med Inform 2019;28:135-137. [PMID: 31419825 PMCID: PMC6697519 DOI: 10.1055/s-0039-1677929] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Abstract

Objectives : To summarize recent research and select the best papers published in 2018 in the field of computerized clinical decision support for the Decision Support section of the International Medical Informatics Association (IMIA) yearbook.

Methods : A literature review was performed by searching two bibliographic databases for papers referring to clinical decision support systems (CDSSs). The aim was to identify a list of candidate best papers from the retrieved bibliographic records, which were then peer-reviewed by external reviewers. A consensus meeting of the IMIA editorial team finally selected the best papers on the basis of all reviews and the section editors' evaluation.

Results : Among 1,148 retrieved articles, 15 best paper candidates were selected, the review of which resulted in the selection of four best papers. The first paper introduces a deep learning model for estimating short-term life expectancy (>3 months) of metastatic cancer patients by analyzing free-text clinical notes in electronic medical records, while maintaining the temporal visit sequence. The second paper takes note that CDSSs become routinely integrated in health information systems and compares statistical anomaly detection models to identify CDSS malfunctions which, if remain unnoticed, may have a negative impact on care delivery. The third paper fairly reports on lessons learnt from the development of an oncology CDSS using artificial intelligence techniques and from its assessment in a large US cancer center. The fourth paper implements a preference learning methodology for detecting inconsistencies in clinical practice guidelines and illustrates the applicability of the proposed methodology to antibiotherapy.

Conclusions : Three of the four best papers rely on data-driven methods, and one builds on a knowledge-based approach. While there is currently a trend for data-driven decision support, the promising results of such approaches still need to be confirmed by the adoption of these systems and their routine use.

Collapse

Gensheimer MF, Henry AS, Wood DJ, Hastie TJ, Aggarwal S, Dudley SA, Pradhan P, Banerjee I, Cho E, Ramchandran K, Pollom E, Koong AC, Rubin DL, Chang DT. Automated Survival Prediction in Metastatic Cancer Patients Using High-Dimensional Electronic Medical Record Data. J Natl Cancer Inst 2019;111:568-574. [PMID: 30346554 PMCID: PMC6579743 DOI: 10.1093/jnci/djy178] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Revised: 06/28/2018] [Accepted: 09/05/2018] [Indexed: 12/19/2022] Open

Fathiamini S, Johnson AM, Zeng J, Holla V, Sanchez NS, Meric-Bernstam F, Bernstam EV, Cohen T. Rapamycin - mTOR + BRAF = ? Using relational similarity to find therapeutically relevant drug-gene relationships in unstructured text. J Biomed Inform 2019;90:103094. [PMID: 30615938 PMCID: PMC6386529 DOI: 10.1016/j.jbi.2019.103094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Revised: 11/30/2018] [Accepted: 12/27/2018] [Indexed: 11/17/2022]