Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Delespierre T, Denormandie P, Bar-Hen A, Josseran L. Empirical advances with text mining of electronic health records. BMC Med Inform Decis Mak 2017;17:127. [PMID: 28830417 PMCID: PMC5568397 DOI: 10.1186/s12911-017-0519-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2016] [Accepted: 08/04/2017] [Indexed: 11/20/2022] Open

For:	Delespierre T, Denormandie P, Bar-Hen A, Josseran L. Empirical advances with text mining of electronic health records. BMC Med Inform Decis Mak 2017;17:127. [PMID: 28830417 PMCID: PMC5568397 DOI: 10.1186/s12911-017-0519-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2016] [Accepted: 08/04/2017] [Indexed: 11/20/2022] Open

Number

Cited by Other Article(s)

Bazoge A, Morin E, Daille B, Gourraud PA. Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review. JMIR Med Inform 2023;11:e42477. [PMID: 38100200 PMCID: PMC10757232 DOI: 10.2196/42477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 01/16/2023] [Accepted: 09/07/2023] [Indexed: 12/17/2023] Open

Abstract

BACKGROUND

In recent years, health data collected during the clinical care process have been often repurposed for secondary use through clinical data warehouses (CDWs), which interconnect disparate data from different sources. A large amount of information of high clinical value is stored in unstructured text format. Natural language processing (NLP), which implements algorithms that can operate on massive unstructured textual data, has the potential to structure the data and make clinical information more accessible.

OBJECTIVE

The aim of this review was to provide an overview of studies applying NLP to textual data from CDWs. It focuses on identifying the (1) NLP tasks applied to data from CDWs and (2) NLP methods used to tackle these tasks.

METHODS

This review was performed according to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. We searched for relevant articles in 3 bibliographic databases: PubMed, Google Scholar, and ACL Anthology. We reviewed the titles and abstracts and included articles according to the following inclusion criteria: (1) focus on NLP applied to textual data from CDWs, (2) articles published between 1995 and 2021, and (3) written in English.

RESULTS

We identified 1353 articles, of which 194 (14.34%) met the inclusion criteria. Among all identified NLP tasks in the included papers, information extraction from clinical text (112/194, 57.7%) and the identification of patients (51/194, 26.3%) were the most frequent tasks. To address the various tasks, symbolic methods were the most common NLP methods (124/232, 53.4%), showing that some tasks can be partially achieved with classical NLP techniques, such as regular expressions or pattern matching that exploit specialized lexica, such as drug lists and terminologies. Machine learning (70/232, 30.2%) and deep learning (38/232, 16.4%) have been increasingly used in recent years, including the most recent approaches based on transformers. NLP methods were mostly applied to English language data (153/194, 78.9%).

CONCLUSIONS

CDWs are central to the secondary use of clinical texts for research purposes. Although the use of NLP on data from CDWs is growing, there remain challenges in this field, especially with regard to languages other than English. Clinical NLP is an effective strategy for accessing, extracting, and transforming data from CDWs. Information retrieved with NLP can assist in clinical research and have an impact on clinical practice.

Collapse

Hjaltelin JX, Novitski SI, Jørgensen IF, Siggaard T, Vulpius SA, Westergaard D, Johansen JS, Chen IM, Juhl Jensen L, Brunak S. Pancreatic cancer symptom trajectories from Danish registry data and free text in electronic health records. eLife 2023;12:e84919. [PMID: 37988407 PMCID: PMC10662947 DOI: 10.7554/elife.84919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 10/19/2023] [Indexed: 11/23/2023] Open

Hacking C, Verbeek H, Hamers JPH, Aarts S. Comparing text mining and manual coding methods: Analysing interview data on quality of care in long-term care for older adults. PLoS One 2023;18:e0292578. [PMID: 37939098 PMCID: PMC10631650 DOI: 10.1371/journal.pone.0292578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 09/24/2023] [Indexed: 11/10/2023] Open

Hacking C, Verbeek H, Hamers JPH, Aarts S. The development of an automatic speech recognition model using interview data from long-term care for older adults. J Am Med Inform Assoc 2022;30:411-417. [PMID: 36495570 PMCID: PMC9933064 DOI: 10.1093/jamia/ocac241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 11/08/2022] [Accepted: 12/07/2022] [Indexed: 12/14/2022] Open

Kakoti BB, Bezbaruah R, Ahmed N. Therapeutic drug repositioning with special emphasis on neurodegenerative diseases: Threats and issues. Front Pharmacol 2022;13:1007315. [PMID: 36263141 PMCID: PMC9574100 DOI: 10.3389/fphar.2022.1007315] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 09/12/2022] [Indexed: 11/21/2022] Open

Baty F, Hegermann J, Locatelli T, Rüegg C, Gysin C, Rassouli F, Brutsche M. Text mining-based measurement of precision of polysomnographic reports as basis for intervention. J Biomed Semantics 2022;13:5. [PMID: 35101128 PMCID: PMC8805265 DOI: 10.1186/s13326-022-00259-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Accepted: 01/06/2022] [Indexed: 11/10/2022] Open

van Laar SA, Gombert-Handoko KB, Wassenaar S, Kroep JR, Guchelaar HJ, Zwaveling J. Real-world evaluation of supportive care using an electronic health record text-mining tool: G-CSF use in breast cancer patients. Support Care Cancer 2022;30:9181-9189. [PMID: 36044088 PMCID: PMC9633501 DOI: 10.1007/s00520-022-07343-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 08/24/2022] [Indexed: 01/05/2023]

Park S, Kim-Knauss Y, Sim JA. Leveraging Text Mining Approach to Identify What People Want to Know About Mental Disorders From Online Inquiry Platforms. Front Public Health 2021;9:759802. [PMID: 34712643 PMCID: PMC8546111 DOI: 10.3389/fpubh.2021.759802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 09/13/2021] [Indexed: 11/13/2022] Open

Du YQ, Zhu GD, Cao J, Huang JY. Research supporting malaria control and elimination in China over four decades: a bibliometric analysis of academic articles published in chinese from 1980 to 2019. Malar J 2021;20:158. [PMID: 33743712 PMCID: PMC7980574 DOI: 10.1186/s12936-021-03698-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 03/12/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

China has accumulated considerable experience in malaria control and elimination over the past decades. Many research papers have been published in Chinese journals. This study intends to describe the development and experience of malaria control and elimination in China by quantitatively analysing relevant research using a bibliometric analysis.

METHODS

A long-term, multistage bibliometric analysis was performed. Research articles published in Chinese journals from 1980 to 2019 were retrieved from the Wanfang and China National Knowledge Infrastructure (CNKI) databases. Year of publication, journal name and keywords were extracted by the Bibliographic Items Co-occurrence Matrix Builder (BICOMB). The K/A ratio (the frequency of a keyword among the total number of articles within a certain period) was considered an indicator of the popularity of a keyword in different decades. VOSviewer software was used to construct keyword co-occurrence network maps.

RESULTS

A total of 16,290 articles were included. The overall number of articles continually increased. However, the number of articles published in the last three years decreased. There were two kinds of keyword frequency trends among the different decades. The K/A ratio of the keyword 'Plasmodium falciparum' decreased (17.05 in the 1980s, 13.04% in the 1990s, 9.86 in the 2000s, 5.28 in the 2010s), but those of 'imported case' and 'surveillance' increased. Drug resistance has been a continuous concern. The keyword co-occurrence network maps showed that the themes of malaria research diversified, and the degree of multidisciplinary cooperation gradually increased.

CONCLUSIONS

This bibliometric analysis revealed the trends in malaria research in China over the past 40 years. The results suggest emphasis on investigation, multidisciplinary participation and drug resistance by researchers and policymakers in malaria epidemic areas. The results also provide domestic experts with qualitative evidence of China's experience in malaria control and elimination.

Collapse

Derington CG, Mueller SR, Glanz JM, Binswanger IA. Identifying naloxone administrations in electronic health record data using a text-mining tool. Subst Abus 2020;42:806-812. [PMID: 33320803 PMCID: PMC8203755 DOI: 10.1080/08897077.2020.1856288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Lee HJ, Chung YJ, Jang S, Seo DW, Lee HK, Yoon D, Lim D, Lee SH. Genome-wide identification of major genes and genomic prediction using high-density and text-mined gene-based SNP panels in Hanwoo (Korean cattle). PLoS One 2020;15:e0241848. [PMID: 33264312 PMCID: PMC7710051 DOI: 10.1371/journal.pone.0241848] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 10/21/2020] [Indexed: 11/24/2022] Open

Application of Text Mining to Nursing Texts. Comput Inform Nurs 2020;38:475-482. [DOI: 10.1097/cin.0000000000000681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Gonzalez-Garcia J, Telleria-Orriols C, Estupinan-Romero F, Bernal-Delgado E. Construction of Empirical Care Pathways Process Models From Multiple Real-World Datasets. IEEE J Biomed Health Inform 2020;24:2671-2680. [PMID: 32092019 DOI: 10.1109/jbhi.2020.2971146] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Paranjpe MD, Taubes A, Sirota M. Insights into Computational Drug Repurposing for Neurodegenerative Disease. Trends Pharmacol Sci 2019;40:565-576. [PMID: 31326236 PMCID: PMC6771436 DOI: 10.1016/j.tips.2019.06.003] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Revised: 04/26/2019] [Accepted: 06/12/2019] [Indexed: 12/14/2022]

Delespierre T, Josseran L. Issues in Building a Nursing Home Syndromic Surveillance System with Textmining: Longitudinal Observational Study. JMIR Public Health Surveill 2018;4:e69. [PMID: 30545816 PMCID: PMC6315244 DOI: 10.2196/publichealth.9022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Revised: 01/23/2018] [Accepted: 07/23/2018] [Indexed: 11/17/2022] Open

Abstract

Background

New nursing homes (NH) data warehouses fed from residents’ medical records allow monitoring the health of elderly population on a daily basis. Elsewhere, syndromic surveillance has already shown that professional data can be used for public health (PH) surveillance but not during a long-term follow-up of the same cohort.

Objective

This study aimed to build and assess a national ecological NH PH surveillance system (SS).

Methods

Using a national network of 126 NH, we built a residents’ cohort, extracted medical and personal data from their electronic health records, and transmitted them through the internet to a national server almost in real time. After recording sociodemographic, autonomic and syndromic information, a set of 26 syndromes was defined using pattern matching with the standard query language-LIKE operator and a Delphi-like technique, between November 2010 and June 2016. We used early aberration reporting system (EARS) and Bayes surveillance algorithms of the R surveillance package (Höhle) to assess our influenza and acute gastroenteritis (AGE) syndromic data against the Sentinelles network data, French epidemics gold standard, following Centers for Disease Control and Prevention surveillance system assessment guidelines.

Results

By extracting all sociodemographic residents’ data, a cohort of 41,061 senior citizens was built. EARS_C3 algorithm on NH influenza and AGE syndromic data gave sensitivities of 0.482 and 0.539 and specificities of 0.844 and 0.952, respectively, over a 6-year period, forecasting the last influenza outbreak by catching early flu signals. In addition, assessment of influenza and AGE syndromic data quality showed precisions of 0.98 and 0.96 during last season epidemic weeks’ peaks (weeks 03-2017 and 01-2017) and precisions of 0.95 and 0.92 during last summer epidemic weeks’ low (week 33-2016).

Conclusions

This study confirmed that using syndromic information gives a good opportunity to develop a genuine French national PH SS dedicated to senior citizens. Access to senior citizens’ free-text validated health data on influenza and AGE responds to a PH issue for the surveillance of this fragile population. This database will also make possible new ecological research on other subjects that will improve prevention, care, and rapid response when facing health threats.

Collapse