1
|
Cui Z, Yu K, Yuan Z, Dong X, Luo W. Language inference-based learning for Low-Resource Chinese clinical named entity recognition using language model. J Biomed Inform 2024; 149:104559. [PMID: 38056702 DOI: 10.1016/j.jbi.2023.104559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 10/24/2023] [Accepted: 11/30/2023] [Indexed: 12/08/2023]
Abstract
Electronic health records (EHRs) have been widely used and are gradually replacing paper records. Therefore, extracting valuable information from EHRs has become the focus and hotspot of current research. Clinical named entity recognition (CNER) is an important task in information extraction. Most current research methods used standard supervised learning approaches to fine-tune pre-trained language models (PLMs), which require a large amount of annotated data for model training. However, in realistic medical scenarios, annotated data are scarce, especially in the healthcare field. The process of annotating data in real clinical settings is time-consuming and labour-intensive. In this paper, a language inference-based learning method (LANGIL) is proposed to study clinical NER tasks with limited annotated samples, i.e., in low-resource clinical scenarios. A method based on prompt learning is designed to reformulate the entity recognition task into a language inference-based task. Differing from the standard fine-tuning method, the approach introduced in this paper does not design the additional network layers that train from scratch. This alleviates the gap between pre-training tasks and downstream tasks, allowing the comprehension capabilities of PLMs to be leveraged under the condition of limited training samples. The experiments on four Chinese clinical named entity recognition datasets showed that LANGIL achieves significant improvements in F1-score compared to the former method.
Collapse
Affiliation(s)
- Zhaojian Cui
- Schoolof Information Science and Technology, Hangzhou Normal University, Hangzhou, 311121 China
| | - Kai Yu
- Schoolof Information Science and Technology, Hangzhou Normal University, Hangzhou, 311121 China.
| | - Zhenming Yuan
- Schoolof Information Science and Technology, Hangzhou Normal University, Hangzhou, 311121 China
| | - Xiaofeng Dong
- Schoolof Information Science and Technology, Hangzhou Normal University, Hangzhou, 311121 China
| | - Weibin Luo
- Schoolof Information Science and Technology, Hangzhou Normal University, Hangzhou, 311121 China
| |
Collapse
|
2
|
Liu H, Dai H, Chen J, Xu J, Tao Y, Lin H. Interactive similar patient retrieval for visual summary of patient outcomes. J Vis (Tokyo) 2022. [DOI: 10.1007/s12650-022-00898-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
3
|
Li C, Liu J, Qian G, Wang Z, Han J. Double chain system for online and offline medical data sharing via private and consortium blockchain: A system design study. Front Public Health 2022; 10:1012202. [PMID: 36304235 PMCID: PMC9595571 DOI: 10.3389/fpubh.2022.1012202] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 09/26/2022] [Indexed: 01/27/2023] Open
Abstract
With the informatization development and digital construction in the healthcare industry, electronic medical records and Internet medicine facilitate people's medical treatment. However, the current data storage method has the risk of data loss, leakage, and tampering, and can't support extensive and secure sharing of medical data. To realize effective and secure medical data storage and sharing among offline medical institutions and Internet medicine platforms, this study used a combined private blockchain and consortium blockchain to design a medical blockchain double-chain system (MBDS). This system can store encrypted medical data in distributed storage mode and systematically integrate the medical data of patients in offline medical institutions and Internet medicine platforms, to achieve equality, credibility, and data sharing among participating nodes. The MBDS system constructed in this study incorporated Internet medicine care services into the current healthcare system and provided new solutions and practical guidance for the future development of collaborative medical care. This study helped to solve the problems of medical data interconnection and resource sharing, improve the efficiency and effect of disease diagnosis, alleviate the contradiction between doctors and patients, and facilitate personal health management. This study has substantial theoretical and practical implications for the research and application of medical data storage and sharing.
Collapse
Affiliation(s)
- Chaoran Li
- School of Economics and Management, Shanghai University of Sport, Shanghai, China
| | - Jusheng Liu
- School of Economics and Management, Shanghai University of Political Science and Law, Shanghai, China,*Correspondence: Jusheng Liu
| | - Guanyu Qian
- Business School, Hunan University, Changsha, China
| | - Ziyi Wang
- School of Humanities, Shanghai University of Finance and Economics, Shanghai, China
| | - Jingti Han
- School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai, China
| |
Collapse
|
4
|
Navaz AN, T. El-Kassabi H, Serhani MA, Oulhaj A, Khalil K. A Novel Patient Similarity Network (PSN) Framework Based on Multi-Model Deep Learning for Precision Medicine. J Pers Med 2022; 12:768. [PMID: 35629190 PMCID: PMC9144142 DOI: 10.3390/jpm12050768] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Accepted: 05/02/2022] [Indexed: 02/05/2023] Open
Abstract
Precision medicine can be defined as the comparison of a new patient with existing patients that have similar characteristics and can be referred to as patient similarity. Several deep learning models have been used to build and apply patient similarity networks (PSNs). However, the challenges related to data heterogeneity and dimensionality make it difficult to use a single model to reduce data dimensionality and capture the features of diverse data types. In this paper, we propose a multi-model PSN that considers heterogeneous static and dynamic data. The combination of deep learning models and PSN allows ample clinical evidence and information extraction against which similar patients can be compared. We use the bidirectional encoder representations from transformers (BERT) to analyze the contextual data and generate word embedding, where semantic features are captured using a convolutional neural network (CNN). Dynamic data are analyzed using a long-short-term-memory (LSTM)-based autoencoder, which reduces data dimensionality and preserves the temporal features of the data. We propose a data fusion approach combining temporal and clinical narrative data to estimate patient similarity. The experiments we conducted proved that our model provides a higher classification accuracy in determining various patient health outcomes when compared with other traditional classification algorithms.
Collapse
Affiliation(s)
- Alramzana Nujum Navaz
- Department of Information Systems and Security, College of Information Technology, UAE University, Al Ain P.O. Box 15551, United Arab Emirates;
| | - Hadeel T. El-Kassabi
- Department of Computer Science and Software Engineering, Concordia University, Montreal, QC H3G 1M8, Canada;
| | - Mohamed Adel Serhani
- Department of Information Systems and Security, College of Information Technology, UAE University, Al Ain P.O. Box 15551, United Arab Emirates;
| | - Abderrahim Oulhaj
- Department of Epidemiology and Public Health, College of Medicine and Health Sciences, Khalifa University, Abu Dhabi P.O. Box 17666, United Arab Emirates;
- Institute of Public Health, College of Medicine and Health Sciences, UAE University, Al Ain P.O. Box 15551, United Arab Emirates
| | - Khaled Khalil
- Faculty of Applied Science and Engineering, University of Toronto, Toronto, ON M5S 1A4, Canada;
| |
Collapse
|
5
|
Pinevich Y, Clark KJ, Harrison AM, Pickering BW, Herasevich V. Interaction Time with Electronic Health Records: A Systematic Review. Appl Clin Inform 2021; 12:788-799. [PMID: 34433218 DOI: 10.1055/s-0041-1733909] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
Abstract
BACKGROUND The amount of time that health care clinicians (physicians and nurses) spend interacting with the electronic health record is not well understood. OBJECTIVE This study aimed to evaluate the time that health care providers spend interacting with electronic health records (EHR). METHODS Data are retrieved from Ovid MEDLINE(R) and Epub Ahead of Print, In-Process and Other Non-Indexed Citations and Daily, (Ovid) Embase, CINAHL, and SCOPUS. STUDY ELIGIBILITY CRITERIA Peer-reviewed studies that describe the use of EHR and include measurement of time either in hours, minutes, or in the percentage of a clinician's workday. Papers were written in English and published between 1990 and 2021. PARTICIPANTS All physicians and nurses involved in inpatient and outpatient settings. STUDY APPRAISAL AND SYNTHESIS METHODS A narrative synthesis of the results, providing summaries of interaction time with EHR. The studies were rated according to Quality Assessment Tool for Studies with Diverse Designs. RESULTS Out of 5,133 de-duplicated references identified through database searching, 18 met inclusion criteria. Most were time-motion studies (50%) that followed by logged-based analysis (44%). Most were conducted in the United States (94%) and examined a clinician workflow in the inpatient settings (83%). The average time was nearly 37% of time of their workday by physicians in both inpatient and outpatient settings and 22% of the workday by nurses in inpatient settings. The studies showed methodological heterogeneity. CONCLUSION This systematic review evaluates the time that health care providers spend interacting with EHR. Interaction time with EHR varies depending on clinicians' roles and clinical settings, computer systems, and users' experience. The average time spent by physicians on EHR exceeded one-third of their workday. The finding is a possible indicator that the EHR has room for usability, functionality improvement, and workflow optimization.
Collapse
Affiliation(s)
- Yuliya Pinevich
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, Minnesota, United States
| | - Kathryn J Clark
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, Minnesota, United States
| | - Andrew M Harrison
- Division of Pulmonary and Critical Care Medicine, Mayo Clinic, Rochester, Minnesota, United States
| | - Brian W Pickering
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, Minnesota, United States
| | - Vitaly Herasevich
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, Minnesota, United States
| |
Collapse
|
6
|
Geo-SPS: bipartite graph representation for GeoSpatial prenatal survey data. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06371-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
7
|
Huang Y, Wang N, Zhang Z, Liu H, Fei X, Wei L, Chen H. Patient Representation From Structured Electronic Medical Records Based on Embedding Technique: Development and Validation Study. JMIR Med Inform 2021; 9:e19905. [PMID: 34297000 PMCID: PMC8367145 DOI: 10.2196/19905] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 12/18/2020] [Accepted: 06/05/2021] [Indexed: 01/22/2023] Open
Abstract
Background The secondary use of structured electronic medical record (sEMR) data has become a challenge due to the diversity, sparsity, and high dimensionality of the data representation. Constructing an effective representation for sEMR data is becoming more and more crucial for subsequent data applications. Objective We aimed to apply the embedding technique used in the natural language processing domain for the sEMR data representation and to explore the feasibility and superiority of the embedding-based feature and patient representations in clinical application. Methods The entire training corpus consisted of records of 104,752 hospitalized patients with 13,757 medical concepts of disease diagnoses, physical examinations and procedures, laboratory tests, medications, etc. Each medical concept was embedded into a 200-dimensional real number vector using the Skip-gram algorithm with some adaptive changes from shuffling the medical concepts in a record 20 times. The average of vectors for all medical concepts in a patient record represented the patient. For embedding-based feature representation evaluation, we used the cosine similarities among the medical concept vectors to capture the latent clinical associations among the medical concepts. We further conducted a clustering analysis on stroke patients to evaluate and compare the embedding-based patient representations. The Hopkins statistic, Silhouette index (SI), and Davies-Bouldin index were used for the unsupervised evaluation, and the precision, recall, and F1 score were used for the supervised evaluation. Results The dimension of patient representation was reduced from 13,757 to 200 using the embedding-based representation. The average cosine similarity of the selected disease (subarachnoid hemorrhage) and its 15 clinically relevant medical concepts was 0.973. Stroke patients were clustered into two clusters with the highest SI (0.852). Clustering analyses conducted on patients with the embedding representations showed higher applicability (Hopkins statistic 0.931), higher aggregation (SI 0.862), and lower dispersion (Davies-Bouldin index 0.551) than those conducted on patients with reference representation methods. The clustering solutions for patients with the embedding-based representation achieved the highest F1 scores of 0.944 and 0.717 for two clusters. Conclusions The feature-level embedding-based representations can reflect the potential clinical associations among medical concepts effectively. The patient-level embedding-based representation is easy to use as continuous input to standard machine learning algorithms and can bring performance improvements. It is expected that the embedding-based representation will be helpful in a wide range of secondary uses of sEMR data.
Collapse
Affiliation(s)
- Yanqun Huang
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Ni Wang
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Zhiqiang Zhang
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Honglei Liu
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Xiaolu Fei
- Information Center, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Lan Wei
- Information Center, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Hui Chen
- School of Biomedical Engineering, Capital Medical University, Beijing, China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| |
Collapse
|
8
|
Oei RW, Fang HSA, Tan WY, Hsu W, Lee ML, Tan NC. Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics. J Pers Med 2021; 11:jpm11080699. [PMID: 34442343 PMCID: PMC8398126 DOI: 10.3390/jpm11080699] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 07/15/2021] [Accepted: 07/21/2021] [Indexed: 12/23/2022] Open
Abstract
Patient similarity analytics has emerged as an essential tool to identify cohorts of patients who have similar clinical characteristics to some specific patient of interest. In this study, we propose a patient similarity measure called D3K that incorporates domain knowledge and data-driven insights. Using the electronic health records (EHRs) of 169,434 patients with either diabetes, hypertension or dyslipidaemia (DHL), we construct patient feature vectors containing demographics, vital signs, laboratory test results, and prescribed medications. We discretize the variables of interest into various bins based on domain knowledge and make the patient similarity computation to be aligned with clinical guidelines. Key findings from this study are: (1) D3K outperforms baseline approaches in all seven sub-cohorts; (2) our domain knowledge-based binning strategy outperformed the traditional percentile-based binning in all seven sub-cohorts; (3) there is substantial agreement between D3K and physicians (κ = 0.746), indicating that D3K can be applied to facilitate shared decision making. This is the first study to use patient similarity analytics on a cardiometabolic syndrome-related dataset sourced from medical institutions in Singapore. We consider patient similarity among patient cohorts with the same medical conditions to develop localized models for personalized decision support to improve the outcomes of a target patient.
Collapse
Affiliation(s)
- Ronald Wihal Oei
- Institute of Data Science, National University of Singapore, Singapore 117602, Singapore; (W.-Y.T.); (W.H.); (M.-L.L.)
- Correspondence:
| | - Hao Sen Andrew Fang
- SingHealth Polyclinics, SingHealth, Singapore 150167, Singapore; (H.S.A.F.); (N.-C.T.)
| | - Wei-Ying Tan
- Institute of Data Science, National University of Singapore, Singapore 117602, Singapore; (W.-Y.T.); (W.H.); (M.-L.L.)
| | - Wynne Hsu
- Institute of Data Science, National University of Singapore, Singapore 117602, Singapore; (W.-Y.T.); (W.H.); (M.-L.L.)
- School of Computing, National University of Singapore, Singapore 117417, Singapore
| | - Mong-Li Lee
- Institute of Data Science, National University of Singapore, Singapore 117602, Singapore; (W.-Y.T.); (W.H.); (M.-L.L.)
- School of Computing, National University of Singapore, Singapore 117417, Singapore
| | - Ngiap-Chuan Tan
- SingHealth Polyclinics, SingHealth, Singapore 150167, Singapore; (H.S.A.F.); (N.-C.T.)
| |
Collapse
|
9
|
Mehrtak M, SeyedAlinaghi S, MohsseniPour M, Noori T, Karimi A, Shamsabadi A, Heydari M, Barzegary A, Mirzapour P, Soleymanzadeh M, Vahedi F, Mehraeen E, Dadras O. Security challenges and solutions using healthcare cloud computing. J Med Life 2021; 14:448-461. [PMID: 34621367 PMCID: PMC8485370 DOI: 10.25122/jml-2021-0100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Accepted: 07/22/2021] [Indexed: 02/05/2023] Open
Abstract
Cloud computing is among the most beneficial solutions to digital problems. Security is one of the focal issues in cloud computing technology, and this study aims at investigating security issues of cloud computing and their probable solutions. A systematic review was performed using Scopus, Pubmed, Science Direct, and Web of Science databases. Once the title and abstract were evaluated, the quality of studies was assessed in order to choose the most relevant according to exclusion and inclusion criteria. Then, the full texts of studies selected were read thoroughly to extract the necessary results. According to the review, data security, availability, and integrity, as well as information confidentiality and network security, were the major challenges in cloud security. Further, data encryption, authentication, and classification, besides application programming interfaces (API), were security solutions to cloud infrastructure. Data encryption could be applied to store and retrieve data from the cloud in order to provide secure communication. Besides, several central challenges, which make the cloud security engineering process problematic, have been considered in this study.
Collapse
Affiliation(s)
- Mohammad Mehrtak
- School of Medicine and Allied Medical Sciences, Ardabil University of Medical Sciences, Ardabil, Iran
| | - SeyedAhmad SeyedAlinaghi
- Iranian Research Center for HIV/AIDS, Iranian Institute for Reduction of High Risk Behaviors, Tehran University of Medical Sciences, Tehran, Iran
| | - Mehrzad MohsseniPour
- Iranian Research Center for HIV/AIDS, Iranian Institute for Reduction of High Risk Behaviors, Tehran University of Medical Sciences, Tehran, Iran
| | - Tayebeh Noori
- Department of Health Information Technology, Zabol University of Medical Sciences, Zabol, Iran
| | - Amirali Karimi
- School of medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Ahmadreza Shamsabadi
- Department of Health Information Technology, Esfarayen Faculty of Medical Sciences, Esfarayen, Iran
| | - Mohammad Heydari
- Department of Health Information Technology, Khalkhal University of Medical Sciences, Khalkhal, Iran
| | | | - Pegah Mirzapour
- Iranian Research Center for HIV/AIDS, Iranian Institute for Reduction of High Risk Behaviors, Tehran University of Medical Sciences, Tehran, Iran
| | - Mahdi Soleymanzadeh
- Farabi Hospital, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Farzin Vahedi
- School of medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Esmaeil Mehraeen
- Department of Health Information Technology, Khalkhal University of Medical Sciences, Khalkhal, Iran
| | - Omid Dadras
- Department of Global Health and Socioepidemiology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| |
Collapse
|
10
|
Instance-Based Learning Following Physician Reasoning for Assistance during Medical Consultation. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11135886] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
This article presents an automatic system for modeling clinical knowledge to follow a physician’s reasoning in medical consultation. Instance-based learning is applied to provide suggestions when recording electronic medical records. The system was validated on a real case study involving advanced medical students. The proposed system is accurate and efficient: 2.5× more efficient than a baseline empirical tool for suggestions and two orders of magnitude faster than a Bayesian learning method, when processing a testbed of 250 clinical case types. The research provides a framework to implement a real-time system to assist physicians during medical consultations.
Collapse
|
11
|
Abstract
Infectious diseases are caused by microorganisms belonging to the class of bacteria, viruses, fungi, or parasites. These pathogens are transmitted, directly or indirectly, and can lead to epidemics or even pandemics. The resulting infection may lead to mild-to-severe symptoms such as life-threatening fever or diarrhea. Infectious diseases may be asymptomatic in some individuals but may lead to disastrous effects in others. Despite the advances in medicine, infectious diseases are a leading cause of death worldwide, especially in low-income countries. With the advent of mathematical tools, scientists are now able to better predict epidemics, understand the specificity of each pathogen, and identify potential targets for drug development. Artificial intelligence and its components have been widely publicized for their ability to better diagnose certain types of cancer from imaging data. This chapter aims at identifying potential applications of machine learning in the field of infectious diseases. We are deliberately focusing on key aspects of infection: diagnosis, transmission, response to treatment, and resistance. We are proposing the use of extreme values as an avenue of interest for future developments in the field of infectious diseases. This chapter covers a series of applications selectively chosen to showcase how artificial intelligence is moving the field of infectious disease further and how it helps institutions to better tackles them, especially in low-income countries.
Collapse
Affiliation(s)
- Said Agrebi
- Yobitrust, Technopark El Gazala, Ariana, Tunisia
| | - Anis Larbi
- Singapore Immunology Network, Agency for Science, Technology and Research, Singapore, Singapore,Department of Microbiology & Immunology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| |
Collapse
|
12
|
Huang M, Shah ND, Yao L. Evaluating global and local sequence alignment methods for comparing patient medical records. BMC Med Inform Decis Mak 2019; 19:263. [PMID: 31856819 PMCID: PMC6921442 DOI: 10.1186/s12911-019-0965-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Background Sequence alignment is a way of arranging sequences (e.g., DNA, RNA, protein, natural language, financial data, or medical events) to identify the relatedness between two or more sequences and regions of similarity. For Electronic Health Records (EHR) data, sequence alignment helps to identify patients of similar disease trajectory for more relevant and precise prognosis, diagnosis and treatment of patients. Methods We tested two cutting-edge global sequence alignment methods, namely dynamic time warping (DTW) and Needleman-Wunsch algorithm (NWA), together with their local modifications, DTW for Local alignment (DTWL) and Smith-Waterman algorithm (SWA), for aligning patient medical records. We also used 4 sets of synthetic patient medical records generated from a large real-world EHR database as gold standard data, to objectively evaluate these sequence alignment algorithms. Results For global sequence alignments, 47 out of 80 DTW alignments and 11 out of 80 NWA alignments had superior similarity scores than reference alignments while the rest 33 DTW alignments and 69 NWA alignments had the same similarity scores as reference alignments. Forty-six out of 80 DTW alignments had better similarity scores than NWA alignments with the rest 34 cases having the equal similarity scores from both algorithms. For local sequence alignments, 70 out of 80 DTWL alignments and 68 out of 80 SWA alignments had larger coverage and higher similarity scores than reference alignments while the rest DTWL alignments and SWA alignments received the same coverage and similarity scores as reference alignments. Six out of 80 DTWL alignments showed larger coverage and higher similarity scores than SWA alignments. Thirty DTWL alignments had the equal coverage but better similarity scores than SWA. DTWL and SWA received the equal coverage and similarity scores for the rest 44 cases. Conclusions DTW, NWA, DTWL and SWA outperformed the reference alignments. DTW (or DTWL) seems to align better than NWA (or SWA) by inserting new daily events and identifying more similarities between patient medical records. The evaluation results could provide valuable information on the strengths and weakness of these sequence alignment methods for future development of sequence alignment methods and patient similarity-based studies.
Collapse
Affiliation(s)
- Ming Huang
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Nilay D Shah
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Lixia Yao
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
13
|
Sung YS, Dravenstott RW, Darer JD, Devapriya PD, Kumara S. SuperOrder: Provider order recommendation system for outpatient clinics. Health Informatics J 2019; 26:999-1016. [PMID: 31266390 DOI: 10.1177/1460458219857383] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
This study aims at developing SuperOrder, an order recommendation system for outpatient clinics. Using the electronic health record data available at midnight, SuperOrder predicts the order contents for each upcoming appointment on a daily basis. A two-level prediction framework is proposed. At the base-level, the predictions are produced by aggregating three machine learning methods. The meta-level predictions are generated by integrating the base-level predictions with the order co-occurrence network. We used the retrospective data between 1 April 2014 and 31 March 2015 in pulmonary clinics from five hospital sites within a large rural health care facility in Pennsylvania to test the feasibility. With a decrease of 6 per cent in the precision, the improvement of the recall at the meta-level is approximately 20 per cent from the base-level. This demonstrates that the proposed order co-occurrence network helps in increasing the performance of order predictions. The implementation will bring a more effective and efficient way to place outpatient orders.
Collapse
Affiliation(s)
- Yi-Shan Sung
- University of Arkansas for Medical Sciences, USA
| | | | | | | | | |
Collapse
|
14
|
Chen Y, Ding S, Xu Z, Zheng H, Yang S. Blockchain-Based Medical Records Secure Storage and Medical Service Framework. J Med Syst 2018; 43:5. [PMID: 30467604 DOI: 10.1007/s10916-018-1121-4] [Citation(s) in RCA: 70] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2018] [Accepted: 11/08/2018] [Indexed: 10/27/2022]
Abstract
Accurate and complete medical data are one valuable asset for patients. Privacy protection and the secure storage of medical data are crucial issues during medical services. Secure storage and making full use of personal medical records has always been a concern for the general population. The emergence of blockchain technology brings a new idea to solve this problem. As a hash chain with the characteristics of decentralization, verifiability and immutability, blockchain technology can be used to securely store personal medical data. In this paper, we design a storage scheme to manage personal medical data based on blockchain and cloud storage. Furthermore, a service framework for sharing medical records is described. In addition, the characteristics of the medical blockchain are presented and analyzed through a comparison with traditional systems. The proposed storage and sharing scheme does not depend on any third-party and no single party has absolute power to affect the processing.
Collapse
Affiliation(s)
- Yi Chen
- School of Management, Hefei University of Technology, Hefei, 23009, Anhui, China.,Key Laboratory of Process Optimization and Intelligent Decision-Making (Ministry of Education), Hefei University of Technology, Hefei, 23009, Anhui, China
| | - Shuai Ding
- School of Management, Hefei University of Technology, Hefei, 23009, Anhui, China. .,Key Laboratory of Process Optimization and Intelligent Decision-Making (Ministry of Education), Hefei University of Technology, Hefei, 23009, Anhui, China.
| | - Zheng Xu
- The Third Research Institute of the Ministry of Public Security, Shanghai, 201142, China.
| | - Handong Zheng
- School of Management, Hefei University of Technology, Hefei, 23009, Anhui, China.,Key Laboratory of Process Optimization and Intelligent Decision-Making (Ministry of Education), Hefei University of Technology, Hefei, 23009, Anhui, China
| | - Shanlin Yang
- School of Management, Hefei University of Technology, Hefei, 23009, Anhui, China.,Key Laboratory of Process Optimization and Intelligent Decision-Making (Ministry of Education), Hefei University of Technology, Hefei, 23009, Anhui, China
| |
Collapse
|
15
|
Patani N, MacAskill F, Eshelby S, Omar A, Kaura A, Contractor K, Thiruchelvam P, Curtis S, Main J, Cunningham D, Hogben K, Al-Mufti R, Hadjiminas DJ, Leff DR. Best-practice care pathway for improving management of mastitis and breast abscess. Br J Surg 2018; 105:1615-1622. [PMID: 29993125 DOI: 10.1002/bjs.10919] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Revised: 03/13/2018] [Accepted: 05/07/2018] [Indexed: 11/09/2022]
Abstract
BACKGROUND Surgical subspecialization has resulted in mastitis and breast abscesses being managed with unnecessary admission to hospital, prolonged inpatient stay, variable antibiotic prescribing, incision and drainage rather than percutaneous aspiration, and loss to specialist follow-up. The objective was to evaluate a best-practice algorithm with the aim of improving management of mastitis and breast abscesses across a multisite NHS Trust. The focus was on uniformity of antibiotic prescribing, ultrasound assessment, admission rates, length of hospital stay, intervention by aspiration or incision and drainage, and specialist follow-up. METHODS Management was initially evaluated in a retrospective cohort (phase I) and subsequently compared with that in two prospective cohorts after introduction of a breast abscess and mastitis pathway. One prospective cohort was analysed immediately after introduction of the pathway (phase II), and the second was used to assess the sustainability of the quality improvements (phase III). The overall impact of the pathway was assessed by comparing data from phase I with combined data from phases II and III; results from phases II and III were compared to judge sustainability. RESULTS Fifty-three patients were included in phase I, 61 in phase II and 80 in phase III. The management pathway and referral pro forma improved compliance with antibiotic guidelines from 34 per cent to 58·2 per cent overall (phases II and III) after implementation (P = 0·003). The improvement was maintained between phases II and III (54 and 61 per cent respectively; P = 0·684). Ultrasound assessment increased from 38 to 77·3 per cent overall (P < 0·001), in a sustained manner (75 and 79 per cent in phases II and III respectively; P = 0·894). Reductions in rates of incision and drainage (from 8 to 0·7 per cent overall; P = 0·007) were maintained (0 per cent in phase II versus 1 per cent in phase III; P = 0·381). Specialist follow-up improved consistently from 43 to 95·7 per cent overall (P < 0·001), 92 per cent in phase II and 99 per cent in phase III (P = 0·120). Rates of hospital admission and median length of stay were not significantly reduced after implementation of the pathway. CONCLUSION A standardized approach to mastitis and breast abscess reduced undesirable practice variation, with sustained improvements in process and patient outcomes.
Collapse
Affiliation(s)
- N Patani
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - F MacAskill
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - S Eshelby
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - A Omar
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - A Kaura
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - K Contractor
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - P Thiruchelvam
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK.,Department of Surgery and Cancer, Imperial College, London, UK
| | - S Curtis
- Department of Microbiology, Imperial College Healthcare NHS Trust, London, UK
| | - J Main
- Department of Infectious Diseases, Imperial College Healthcare NHS Trust, London, UK
| | - D Cunningham
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - K Hogben
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - R Al-Mufti
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK
| | - D J Hadjiminas
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK.,Department of Surgery and Cancer, Imperial College, London, UK
| | - D R Leff
- Breast Unit, Imperial College Healthcare NHS Trust, London, UK.,Department of Surgery and Cancer, Imperial College, London, UK
| |
Collapse
|
16
|
Patient representation learning and interpretable evaluation using clinical notes. J Biomed Inform 2018; 84:103-113. [PMID: 29966746 DOI: 10.1016/j.jbi.2018.06.016] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Revised: 06/07/2018] [Accepted: 06/28/2018] [Indexed: 11/22/2022]
Abstract
We have three contributions in this work: 1. We explore the utility of a stacked denoising autoencoder and a paragraph vector model to learn task-independent dense patient representations directly from clinical notes. To analyze if these representations are transferable across tasks, we evaluate them in multiple supervised setups to predict patient mortality, primary diagnostic and procedural category, and gender. We compare their performance with sparse representations obtained from a bag-of-words model. We observe that the learned generalized representations significantly outperform the sparse representations when we have few positive instances to learn from, and there is an absence of strong lexical features. 2. We compare the model performance of the feature set constructed from a bag of words to that obtained from medical concepts. In the latter case, concepts represent problems, treatments, and tests. We find that concept identification does not improve the classification performance. 3. We propose novel techniques to facilitate model interpretability. To understand and interpret the representations, we explore the best encoded features within the patient representations obtained from the autoencoder model. Further, we calculate feature sensitivity across two networks to identify the most significant input features for different classification tasks when we use these pretrained representations as the supervised input. We successfully extract the most influential features for the pipeline using this technique.
Collapse
|
17
|
Zhang YF, Gou L, Zhou TS, Lin DN, Zheng J, Li Y, Li JS. An ontology-based approach to patient follow-up assessment for continuous and personalized chronic disease management. J Biomed Inform 2017; 72:45-59. [PMID: 28676255 DOI: 10.1016/j.jbi.2017.06.021] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2017] [Revised: 06/23/2017] [Accepted: 06/30/2017] [Indexed: 01/22/2023]
Abstract
OBJECTIVE Chronic diseases are complex and persistent clinical conditions that require close collaboration among patients and health care providers in the implementation of long-term and integrated care programs. However, current solutions focus partially on intensive interventions at hospitals rather than on continuous and personalized chronic disease management. This study aims to fill this gap by providing computerized clinical decision support during follow-up assessments of chronically ill patients at home. METHODS We proposed an ontology-based framework to integrate patient data, medical domain knowledge, and patient assessment criteria for chronic disease patient follow-up assessments. A clinical decision support system was developed to implement this framework for automatic selection and adaptation of standard assessment protocols to suit patient personal conditions. We evaluated our method in the case study of type 2 diabetic patient follow-up assessments. RESULTS The proposed framework was instantiated using real data from 115,477 follow-up assessment records of 36,162 type 2 diabetic patients. Standard evaluation criteria were automatically selected and adapted to the particularities of each patient. Assessment results were generated as a general typing of patient overall condition and detailed scoring for each criterion, providing important indicators to the case manager about possible inappropriate judgments, in addition to raising patient awareness of their disease control outcomes. Using historical data as the gold standard, our system achieved a rate of accuracy of 99.93% and completeness of 95.00%. CONCLUSIONS This study contributes to improving the accessibility, efficiency and quality of current patient follow-up services. It also provides a generic approach to knowledge sharing and reuse for patient-centered chronic disease management.
Collapse
Affiliation(s)
- Yi-Fan Zhang
- Engineering Research Center of EMR and Intelligent Expert System, Ministry of Education, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
| | - Ling Gou
- Engineering Research Center of EMR and Intelligent Expert System, Ministry of Education, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
| | - Tian-Shu Zhou
- Engineering Research Center of EMR and Intelligent Expert System, Ministry of Education, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
| | - De-Nan Lin
- Health Information Center, Shenzhen, China
| | - Jing Zheng
- Health Information Center, Shenzhen, China
| | - Ye Li
- Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Jing-Song Li
- Engineering Research Center of EMR and Intelligent Expert System, Ministry of Education, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China.
| |
Collapse
|
18
|
Brown SA. Patient Similarity: Emerging Concepts in Systems and Precision Medicine. Front Physiol 2016; 7:561. [PMID: 27932992 PMCID: PMC5121278 DOI: 10.3389/fphys.2016.00561] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 11/07/2016] [Indexed: 12/19/2022] Open
Affiliation(s)
- Sherry-Ann Brown
- Department of Cardiovascular Diseases, Mayo Clinic Rochester, MN, USA
| |
Collapse
|
19
|
Ramanan SV, Radhakrishna K, Waghmare A, Raj T, Nathan SP, Sreerama SM, Sampath S. Dense Annotation of Free-Text Critical Care Discharge Summaries from an Indian Hospital and Associated Performance of a Clinical NLP Annotator. J Med Syst 2016; 40:187. [PMID: 27342107 DOI: 10.1007/s10916-016-0541-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2015] [Accepted: 06/08/2016] [Indexed: 10/21/2022]
Abstract
Electronic Health Record (EHR) use in India is generally poor, and structured clinical information is mostly lacking. This work is the first attempt aimed at evaluating unstructured text mining for extracting relevant clinical information from Indian clinical records. We annotated a corpus of 250 discharge summaries from an Intensive Care Unit (ICU) in India, with markups for diseases, procedures, and lab parameters, their attributes, as well as key demographic information and administrative variables such as patient outcomes. In this process, we have constructed guidelines for an annotation scheme useful to clinicians in the Indian context. We evaluated the performance of an NLP engine, Cocoa, on a cohort of these Indian clinical records. We have produced an annotated corpus of roughly 90 thousand words, which to our knowledge is the first tagged clinical corpus from India. Cocoa was evaluated on a test corpus of 50 documents. The overlap F-scores across the major categories, namely disease/symptoms, procedures, laboratory parameters and outcomes, are 0.856, 0.834, 0.961 and 0.872 respectively. These results are competitive with results from recent shared tasks based on US records. The annotated corpus and associated results from the Cocoa engine indicate that unstructured text mining is a viable method for cohort analysis in the Indian clinical context, where structured EHR records are largely absent.
Collapse
Affiliation(s)
- S V Ramanan
- RelAgent Technologies (P) Limited, IIT Madras Research Park, #14, 1st Floor, Taramani, Chennai, 600113, India.
| | - Kedar Radhakrishna
- Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India.
| | - Abijeet Waghmare
- Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India
| | - Tony Raj
- Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India
| | - Senthil P Nathan
- RelAgent Technologies (P) Limited, IIT Madras Research Park, #14, 1st Floor, Taramani, Chennai, 600113, India
| | - Sai Madhukar Sreerama
- Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India
| | - Sriram Sampath
- Department of Critical Care Medicine, St. John's Medical College, Bangalore, India
| |
Collapse
|