Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dara J, Dowling JN, Travers D, Cooper GF, Chapman WW. Evaluation of preprocessing techniques for chief complaint classification. J Biomed Inform 2007;41:613-23. [PMID: 18166502 DOI: 10.1016/j.jbi.2007.11.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Received: 06/27/2007] [Revised: 11/08/2007] [Accepted: 11/19/2007] [Indexed: 11/28/2022]

Cited by Other Article(s)

Koleck TA, Dreisbach C, Bourne PE, Bakken S. Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review. J Am Med Inform Assoc 2020;26:364-379. [PMID: 30726935 DOI: 10.1093/jamia/ocy173] [Citation(s) in RCA: 182] [Impact Index Per Article: 45.5] [Received: 06/29/2018] [Revised: 11/20/2018] [Accepted: 11/27/2018] [Indexed: 12/26/2022] Open

Abstract

OBJECTIVE

Natural language processing (NLP) of symptoms from electronic health records (EHRs) could contribute to the advancement of symptom science. We aim to synthesize the literature on the use of NLP to process or analyze symptom information documented in EHR free-text narratives.

MATERIALS AND METHODS

Our search of 1964 records from PubMed and EMBASE was narrowed to 27 eligible articles. Data related to the purpose, free-text corpus, patients, symptoms, NLP methodology, evaluation metrics, and quality indicators were extracted for each study.

RESULTS

Symptom-related information was presented as a primary outcome in 14 studies. EHR narratives represented various inpatient and outpatient clinical specialties, with general, cardiology, and mental health occurring most frequently. Studies encompassed a wide variety of symptoms, including shortness of breath, pain, nausea, dizziness, disturbed sleep, constipation, and depressed mood. NLP approaches included previously developed NLP tools, classification methods, and manually curated rule-based processing. Only one-third (n = 9) of studies reported patient demographic characteristics.

DISCUSSION

NLP is used to extract information from EHR free-text narratives written by a variety of healthcare providers on an expansive range of symptoms across diverse clinical specialties. The current focus of this field is on the development of methods to extract symptom information and the use of symptom information for disease classification tasks rather than the examination of symptoms themselves.

CONCLUSION

Future NLP studies should concentrate on the investigation of symptoms and symptom documentation in EHR free-text narratives. Efforts should be undertaken to examine patient characteristics and make symptom-related NLP algorithms or pipelines and vocabularies openly available.

Horng S, Greenbaum NR, Nathanson LA, McClay JC, Goss FR, Nielson JA. Consensus Development of a Modern Ontology of Emergency Department Presenting Problems-The Hierarchical Presenting Problem Ontology (HaPPy). Appl Clin Inform 2019;10:409-420. [PMID: 31189204 DOI: 10.1055/s-0039-1691842] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Indexed: 12/11/2022] Open

Santhi B, Brindha G. Multinomial Naïve Bayes using similarity based conditional probability. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-181009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 11/15/2022]

Prognosis Essay Scoring and Article Relevancy Using Multi-Text Features and Machine Learning. Symmetry (Basel) 2017. [DOI: 10.3390/sym9010011] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Indexed: 11/16/2022] Open

Zimmerman PA, Mason M, Elder E. A healthy degree of suspicion: A discussion of the implementation of transmission based precautions in the emergency department. Australas Emerg Nurs J 2016;19:149-52. [PMID: 27133874 PMCID: PMC7128487 DOI: 10.1016/j.aenj.2016.03.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Received: 12/17/2015] [Revised: 03/10/2016] [Accepted: 03/29/2016] [Indexed: 02/01/2023]

Hatakeyama Y, Miyano I, Kataoka H, Nakajima N, Watabe T, Yasuda N, Okuhara Y. Use of a Latent Topic Model for Characteristic Extraction from Health Checkup Questionnaire Data. Methods Inf Med 2015;54:515-21. [PMID: 26063536 DOI: 10.3414/me15-01-0023] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 02/08/2015] [Accepted: 05/29/2015] [Indexed: 12/19/2022]

Abstract

OBJECTIVES

When patients complete questionnaires during health checkups, many of their responses are subjective, making topic extraction difficult. Therefore, the purpose of this study was to develop a model capable of extracting appropriate topics from subjective data in questionnaires conducted during health checkups.

METHODS

We employed a latent topic model to group the lifestyle habits of the study participants and represented their responses to items on health checkup questionnaires as a probability model. For the probability model, we used latent Dirichlet allocation to extract 30 topics from the questionnaires. According to the model parameters, a total of 4381 study participants were then divided into groups based on these topics. Results from laboratory tests, including blood glucose level, triglycerides, and estimated glomerular filtration rate, were compared between each group, and these results were then compared with those obtained by hierarchical clustering.

RESULTS

If a significant (p < 0.05) difference was observed in any of the laboratory measurements between groups, it was considered to indicate a questionnaire response pattern corresponding to the value of the test result. A comparison between the latent topic model and hierarchical clustering grouping revealed that, in the latent topic model method, a small group of participants who reported having subjective signs of urinary disorder were allocated to a single group.

CONCLUSIONS

The latent topic model is useful for extracting characteristics from a small number of groups from questionnaires with a large number of items. These results show that, in addition to chief complaints and history of past illness, questionnaire data obtained during medical checkups can serve as useful judgment criteria for assessing the conditions of patients.

Emergency Medical Text Classifier: New system improves processing and classification of triage notes. Online J Public Health Inform 2014;6:e178. [PMID: 25379126 PMCID: PMC4221085 DOI: 10.5210/ojphi.v6i2.5469] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Indexed: 11/24/2022] Open

Zheng H, Gaff H, Smith G, DeLisle S. Epidemic surveillance using an electronic medical record: an empiric approach to performance improvement. PLoS One 2014;9:e100845. [PMID: 25006878 PMCID: PMC4090236 DOI: 10.1371/journal.pone.0100845] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Received: 12/13/2013] [Accepted: 05/30/2014] [Indexed: 01/19/2023] Open

Gerbier-Colomban S, Gicquel Q, Millet AL, Riou C, Grando J, Darmoni S, Potinet-Pagliaroli V, Metzger MH. Evaluation of syndromic algorithms for detecting patients with potentially transmissible infectious diseases based on computerised emergency-department data. BMC Med Inform Decis Mak 2013;13:101. [PMID: 24004720 PMCID: PMC3766242 DOI: 10.1186/1472-6947-13-101] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Received: 12/17/2012] [Accepted: 08/30/2013] [Indexed: 11/17/2022] Open

Yan W, Palm L, Lu X, Nie S, Xu B, Zhao Q, Tao T, Cheng L, Tan L, Dong H, Diwan VK. ISS--an electronic syndromic surveillance system for infectious disease in rural China. PLoS One 2013;8:e62749. [PMID: 23626853 PMCID: PMC3633833 DOI: 10.1371/journal.pone.0062749] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Received: 06/27/2012] [Accepted: 03/29/2013] [Indexed: 12/04/2022] Open

Abstract

Background

Syndromic surveillance systems have great advantages in promoting the early detection of epidemics and reducing the need for disease confirmation, and they are especially effective for surveillance in resource-poor settings. However, most current syndromic surveillance systems are established in developed countries, and there are very few reports on the development of an electronic syndromic surveillance system in resource-constrained settings.

Objective

This study describes the design and pilot implementation of an electronic surveillance system (ISS) for the early detection of infectious disease epidemics in rural China, complementing the conventional case report surveillance system.

Methods

ISS was developed based on an existing platform, ‘Crisis Information Sharing Platform’ (CRISP), combined with modern communication and GIS technology. ISS has four interconnected functions: 1) work group and communication group; 2) data source and collection; 3) data visualization; and 4) outbreak detection and alerting.

Results

As of Jan. 31st, 2012, ISS has been installed and pilot tested for six months in four counties in rural China. 95 health facilities, 14 pharmacies and 24 primary schools participated in the pilot study, entering respectively 74256, 79701, and 2330 daily records into the central database. More than 90% of surveillance units at the study sites are able to send daily information into the system. In the paper, we also presented the pilot data from health facilities in the two counties, which showed the ISS system had the potential to identify the change of disease patterns at the community level.

Conclusions

The ISS platform may facilitate the early detection of infectious disease epidemics as it provides near real-time syndromic data collection, interactive visualization, and automated aberration detection. However, several constraints and challenges were encountered during the pilot implementation of ISS in rural China.

Conway M, Dowling JN, Chapman WW. Using chief complaints for syndromic surveillance: a review of chief complaint based classifiers in North America. J Biomed Inform 2013;46:734-43. [PMID: 23602781 DOI: 10.1016/j.jbi.2013.04.003] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Received: 02/03/2012] [Revised: 08/30/2012] [Accepted: 04/03/2013] [Indexed: 11/27/2022]

Dórea FC, Muckle CA, Kelton D, McClure JT, McEwen BJ, McNab WB, Sanchez J, Revie CW. Exploratory analysis of methods for automated classification of laboratory test orders into syndromic groups in veterinary medicine. PLoS One 2013;8:e57334. [PMID: 23505427 PMCID: PMC3591392 DOI: 10.1371/journal.pone.0057334] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Received: 08/18/2012] [Accepted: 01/21/2013] [Indexed: 12/02/2022] Open

Abstract

BACKGROUND

Recent focus on earlier detection of pathogen introduction in human and animal populations has led to the development of surveillance systems based on automated monitoring of health data. Real- or near real-time monitoring of pre-diagnostic data requires automated classification of records into syndromes--syndromic surveillance--using algorithms that incorporate medical knowledge in a reliable and efficient way, while remaining comprehensible to end users.

METHODS

This paper describes the application of two machine learning methods (Naïve Bayes and Decision Trees) and rule-based methods to extract syndromic information from laboratory test requests submitted to a veterinary diagnostic laboratory.

RESULTS

High performance (F1-macro = 0.9995) was achieved through the use of a rule-based syndrome classifier, based on rule induction followed by manual modification during the construction phase, which also resulted in clear interpretability of the resulting classification process. An unmodified rule induction algorithm achieved an F1-micro score of 0.979, though this fell to 0.677 when performance for individual classes was averaged in an unweighted manner (F1-macro), because the algorithm failed to learn 3 of the 16 classes from the training set. Decision Trees showed equal interpretability to the rule-based approaches, but achieved an F1-micro score of 0.923 (falling to 0.311 when classes are given equal weight). A Naïve Bayes classifier learned all classes and achieved high performance (F1-micro = 0.994 and F1-macro = 0.955); however, the classification process is not transparent to the domain experts.
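The gap between micro- and macro-averaged F1 reported above can be illustrated with a small sketch. This is not the authors' evaluation code: the function name `f1_scores` and the two class labels are hypothetical, and the toy data only mimics a classifier that never predicts a rare class.

```python
from collections import Counter

def f1_scores(y_true, y_pred):
    """Return (micro F1, macro F1, per-class F1) for single-label predictions."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, guess in zip(y_true, y_pred):
        if truth == guess:
            tp[truth] += 1
        else:
            fp[guess] += 1   # the predicted class gains a false positive
            fn[truth] += 1   # the true class gains a false negative

    def f1(tp_, fp_, fn_):
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    per_class = {c: f1(tp[c], fp[c], fn[c]) for c in labels}
    # Macro: unweighted mean of per-class scores, so each class counts equally.
    macro = sum(per_class.values()) / len(labels)
    # Micro: pool the counts over all classes before computing F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    return micro, macro, per_class

# Hypothetical data: nine records of a common class, one of a rare class,
# and a classifier that predicts the common class every time.
y_true = ["common"] * 9 + ["rare"]
y_pred = ["common"] * 10
micro, macro, per_class = f1_scores(y_true, y_pred)
print(round(micro, 3), round(macro, 3), per_class["rare"])
```

Because the rare class scores zero, the macro average drops well below the micro average, which is the pattern the abstract attributes to classes the algorithm failed to learn.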

CONCLUSION

The use of a manually customised rule set allowed for the development of a system for classification of laboratory tests into syndromic groups with very high performance, and high interpretability by the domain experts. Further research is required to develop internal validation rules in order to establish automated methods to update model rules without user input.

Alemi F, Torii M, Atherton MJ, Pattie DC, Cox KL. Bayesian Processing of Context-Dependent Text. Med Decis Making 2012;32:E1-9. [DOI: 10.1177/0272989x12439753] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Indexed: 11/15/2022]

Smith CA. Consumer language, patient language, and thesauri: a review of the literature. J Med Libr Assoc 2011;99:135-44. [PMID: 21464851 PMCID: PMC3066584 DOI: 10.3163/1536-5050.99.2.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Indexed: 11/03/2022] Open

Burkom HS. Comments on 'some methodological issues in biosurveillance'. Stat Med 2011;30:426-9; discussion 434-41. [PMID: 21312212 DOI: 10.1002/sim.3986] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 11/08/2022]

DeLisle S, South B, Anthony JA, Kalp E, Gundlapallli A, Curriero FC, Glass GE, Samore M, Perl TM. Combining free text and structured electronic medical record entries to detect acute respiratory infections. PLoS One 2010;5:e13377. [PMID: 20976281 PMCID: PMC2954790 DOI: 10.1371/journal.pone.0013377] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Received: 05/21/2010] [Accepted: 08/30/2010] [Indexed: 11/25/2022] Open

Abstract

BACKGROUND

The electronic medical record (EMR) contains a rich source of information that could be harnessed for epidemic surveillance. We asked if structured EMR data could be coupled with computerized processing of free-text clinical entries to enhance detection of acute respiratory infections (ARI).

METHODOLOGY

A manual review of EMR records related to 15,377 outpatient visits uncovered 280 reference cases of ARI. We used logistic regression with backward elimination to determine which among candidate structured EMR parameters (diagnostic codes, vital signs and orders for tests, imaging and medications) contributed to the detection of those reference cases. We also developed a computerized free-text search to identify clinical notes documenting at least two non-negated ARI symptoms. We then used heuristics to build case-detection algorithms that best combined the retained structured EMR parameters with the results of the text analysis.
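The free-text rule described above (at least two non-negated symptoms in a note) can be sketched as follows. This is a simplified illustration, not the study's text-processing method: the symptom list, the negation cues, and the function names are all hypothetical, and negation is approximated by looking for a cue earlier in the same sentence.

```python
import re

# Hypothetical vocabularies chosen only for illustration.
SYMPTOMS = ["cough", "fever", "sore throat", "runny nose", "chills"]
NEGATION_CUES = ["no", "denies", "without", "negative for"]

def non_negated_symptoms(note):
    """Return the set of symptom terms that appear without a preceding negation cue."""
    found = set()
    # Split into rough sentences so a cue only affects its own sentence.
    for sentence in re.split(r"[.;\n]+", note.lower()):
        for symptom in SYMPTOMS:
            for match in re.finditer(r"\b" + re.escape(symptom) + r"\b", sentence):
                preceding = sentence[:match.start()]
                negated = any(
                    re.search(r"\b" + re.escape(cue) + r"\b", preceding)
                    for cue in NEGATION_CUES
                )
                if not negated:
                    found.add(symptom)
    return found

def flags_ari(note, minimum=2):
    """True when the note documents at least `minimum` non-negated symptoms."""
    return len(non_negated_symptoms(note)) >= minimum

print(flags_ari("Patient reports cough and fever. Denies sore throat."))
print(flags_ari("Cough for 3 days. No fever."))
```

A production system would need a curated vocabulary and a more careful negation scope; the sketch only shows the shape of the counting rule.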

PRINCIPAL FINDINGS

An adjusted grouping of diagnostic codes identified reference ARI patients with a sensitivity of 79%, a specificity of 96% and a positive predictive value (PPV) of 32%. Of the 21 additional structured clinical parameters considered, two contributed significantly to ARI detection: new prescriptions for cough remedies and elevations in body temperature to at least 38°C. Together with the diagnostic codes, these parameters increased detection sensitivity to 87%, but specificity and PPV declined to 95% and 25%, respectively. Adding text analysis increased sensitivity to 99%, but PPV dropped further to 14%. Algorithms that required satisfying both a query of structured EMR parameters as well as text analysis disclosed PPVs of 52-68% and retained sensitivities of 69-73%.
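The trade-off between sensitivity and PPV reported above follows from how the three measures are defined. The sketch below uses hypothetical confusion-matrix counts, and the function names are illustrative. The final call plugs in the rounded rates and the reference-case prevalence (280 of 15,377 visits) from the abstract, so its result only approximates the published PPV.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity and PPV from confusion-matrix counts (all denominators assumed non-zero)."""
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases that were detected
        "specificity": tn / (tn + fp),  # share of non-cases correctly left unflagged
        "ppv": tp / (tp + fp),          # share of flagged visits that were true cases
    }

def ppv_from_rates(sensitivity, specificity, prevalence):
    """PPV implied by sensitivity, specificity and case prevalence (Bayes' rule)."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)

# Hypothetical counts: 100 cases among 1,100 visits.
metrics = diagnostic_metrics(tp=80, fp=40, fn=20, tn=960)
print(metrics)

# Same rates, but with the much lower prevalence reported in the abstract:
# PPV falls even though sensitivity and specificity are unchanged.
print(ppv_from_rates(0.79, 0.96, 280 / 15377))
```

With rare conditions, even a small false-positive rate among the many non-cases can outnumber the true positives, which is why PPV stays low at high specificity.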

CONCLUSION

Structured EMR parameters and free-text analyses can be combined into algorithms that can detect ARI cases with new levels of sensitivity or precision. These results highlight potential paths by which repurposed EMR information could facilitate the discovery of epidemics before they cause mass casualties.

Chen H, Zeng D, Yan P. RODS. INTEGRATED SERIES IN INFORMATION SYSTEMS 2010. [PMCID: PMC7498900 DOI: 10.1007/978-1-4419-1278-7_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]