Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sharafoddini A, Dubin JA, Lee J. Patient Similarity in Prediction Models Based on Health Data: A Scoping Review. JMIR Med Inform 2017;5:e7. [PMID: 28258046 PMCID: PMC5357318 DOI: 10.2196/medinform.6730] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Revised: 11/29/2016] [Accepted: 02/04/2017] [Indexed: 12/22/2022] Open

For:	Sharafoddini A, Dubin JA, Lee J. Patient Similarity in Prediction Models Based on Health Data: A Scoping Review. JMIR Med Inform 2017;5:e7. [PMID: 28258046 PMCID: PMC5357318 DOI: 10.2196/medinform.6730] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Revised: 11/29/2016] [Accepted: 02/04/2017] [Indexed: 12/22/2022] Open

Number

Cited by Other Article(s)

Pinho X, Meijer W, de Graaf A. Deriving Treatment Decision Support From Dutch Electronic Health Records by Exploring the Applicability of a Precision Cohort-Based Procedure for Patients With Type 2 Diabetes Mellitus: Precision Cohort Study. Online J Public Health Inform 2024;16:e51092. [PMID: 38691393 PMCID: PMC11097050 DOI: 10.2196/51092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 02/28/2024] [Accepted: 03/15/2024] [Indexed: 05/03/2024] Open

Abstract

BACKGROUND

The rapidly increasing availability of medical data in electronic health records (EHRs) may contribute to the concept of learning health systems, allowing for better personalized care. Type 2 diabetes mellitus was chosen as the use case in this study.

OBJECTIVE

This study aims to explore the applicability of a recently developed patient similarity-based analytics approach based on EHRs as a candidate data analytical decision support tool.

METHODS

A previously published precision cohort analytics workflow was adapted for the Dutch primary care setting using EHR data from the Nivel Primary Care Database. The workflow consisted of extracting patient data from the Nivel Primary Care Database to retrospectively generate decision points for treatment change, training a similarity model, generating a precision cohort of the most similar patients, and analyzing treatment options. This analysis showed the treatment options that led to a better outcome for the precision cohort in terms of clinical readouts for glycemic control.

RESULTS

Data from 11,490 registered patients diagnosed with type 2 diabetes mellitus were extracted from the database. Treatment-specific filter cohorts of patient groups were generated, and the effect of past treatment choices in these cohorts was assessed separately for glycated hemoglobin and fasting glucose as clinical outcome variables. Precision cohorts were generated for several individual patients from the filter cohorts. Treatment options and outcome analyses were technically well feasible but in general had a lack of statistical power to demonstrate statistical significance for treatment options with better outcomes.

CONCLUSIONS

The precision cohort analytics workflow was successfully adapted for the Dutch primary care setting, proving its potential for use as a learning health system component. Although the approach proved technically well feasible, data size limitations need to be overcome before application for clinical decision support becomes realistically possible.

Collapse

Ahmed MS, Hasan T, Islam S, Ahmed N. Investigating Rhythmicity in App Usage to Predict Depressive Symptoms: Protocol for Personalized Framework Development and Validation Through a Countrywide Study. JMIR Res Protoc 2024;13:e51540. [PMID: 38657238 PMCID: PMC11079771 DOI: 10.2196/51540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 12/27/2023] [Accepted: 01/11/2024] [Indexed: 04/26/2024] Open

Abstract

BACKGROUND

Understanding a student's depressive symptoms could facilitate significantly more precise diagnosis and treatment. However, few studies have focused on depressive symptom prediction through unobtrusive systems, and these studies are limited by small sample sizes, low performance, and the requirement for higher resources. In addition, research has not explored whether statistically significant rhythms based on different app usage behavioral markers (eg, app usage sessions) exist that could be useful in finding subtle differences to predict with higher accuracy like the models based on rhythms of physiological data.

OBJECTIVE

The main objective of this study is to explore whether there exist statistically significant rhythms in resource-insensitive app usage behavioral markers and predict depressive symptoms through these marker-based rhythmic features. Another objective of this study is to understand whether there is a potential link between rhythmic features and depressive symptoms.

METHODS

Through a countrywide study, we collected 2952 students' raw app usage behavioral data and responses to the 9 depressive symptoms in the 9-item Patient Health Questionnaire (PHQ-9). The behavioral data were retrieved through our developed app, which was previously used in our pilot studies in Bangladesh on different research problems. To explore whether there is a rhythm based on app usage data, we will conduct a zero-amplitude test. In addition, we will develop a cosinor model for each participant to extract rhythmic parameters (eg, acrophase). In addition, to obtain a comprehensive picture of the rhythms, we will explore nonparametric rhythmic features (eg, interdaily stability). Furthermore, we will conduct regression analysis to understand the association of rhythmic features with depressive symptoms. Finally, we will develop a personalized multitask learning (MTL) framework to predict symptoms through rhythmic features.

RESULTS

After applying inclusion criteria (eg, having app usage data of at least 2 days to explore rhythmicity), we kept the data of 2902 (98.31%) students for analysis, with 24.48 million app usage events, and 7 days' app usage of 2849 (98.17%) students. The students are from all 8 divisions of Bangladesh, both public and private universities (19 different universities and 52 different departments). We are analyzing the data and will publish the findings in a peer-reviewed publication.

CONCLUSIONS

Having an in-depth understanding of app usage rhythms and their connection with depressive symptoms through a countrywide study can significantly help health care professionals and researchers better understand depressed students and may create possibilities for using app usage-based rhythms for intervention. In addition, the MTL framework based on app usage rhythmic features may more accurately predict depressive symptoms due to the rhythms' capability to find subtle differences.

INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID)

DERR1-10.2196/51540.

Collapse

Seki T, Kawazoe Y, Ohe K. Clinical Feature Vector Generation using Unsupervised Graph Representation Learning from Heterogeneous Medical Records. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2024;2023:618-623. [PMID: 38222342 PMCID: PMC10785854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]

Rigdon J, Ostasiewski B, Woelfel K, Wiseman KD, Hetherington T, Downs S, Kowalkowski M. Automated generation of comparator patients in the electronic medical record. Learn Health Syst 2024;8:e10362. [PMID: 38249842 PMCID: PMC10797581 DOI: 10.1002/lrh2.10362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 02/17/2023] [Accepted: 02/18/2023] [Indexed: 03/30/2023] Open

Ma M, Sun P, Li Y, Huo W. Predicting the risk of mortality in ICU patients based on dynamic graph attention network of patient similarity. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023;20:15326-15344. [PMID: 37679182 DOI: 10.3934/mbe.2023685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]

Pikoula M, Kallis C, Madjiheurem S, Quint JK, Bafadhel M, Denaxas S. Evaluation of data processing pipelines on real-world electronic health records data for the purpose of measuring patient similarity. PLoS One 2023;18:e0287264. [PMID: 37319288 PMCID: PMC10270623 DOI: 10.1371/journal.pone.0287264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 06/01/2023] [Indexed: 06/17/2023] Open

Abstract

BACKGROUND

The ever-growing size, breadth, and availability of patient data allows for a wide variety of clinical features to serve as inputs for phenotype discovery using cluster analysis. Data of mixed types in particular are not straightforward to combine into a single feature vector, and techniques used to address this can be biased towards certain data types in ways that are not immediately obvious or intended. In this context, the process of constructing clinically meaningful patient representations from complex datasets has not been systematically evaluated.

AIMS

Our aim was to a) outline and b) implement an analytical framework to evaluate distinct methods of constructing patient representations from routine electronic health record data for the purpose of measuring patient similarity. We applied the analysis on a patient cohort diagnosed with chronic obstructive pulmonary disease.

METHODS

Using data from the CALIBER data resource, we extracted clinically relevant features for a cohort of patients diagnosed with chronic obstructive pulmonary disease. We used four different data processing pipelines to construct lower dimensional patient representations from which we calculated patient similarity scores. We described the resulting representations, ranked the influence of each individual feature on patient similarity and evaluated the effect of different pipelines on clustering outcomes. Experts evaluated the resulting representations by rating the clinical relevance of similar patient suggestions with regard to a reference patient.

RESULTS

Each of the four pipelines resulted in similarity scores primarily driven by a unique set of features. It was demonstrated that data transformations according to each pipeline prior to clustering can result in a variation of clustering results of over 40%. The most appropriate pipeline was selected on the basis of feature ranking and clinical expertise. There was moderate agreement between clinicians as measured by Cohen's kappa coefficient.

CONCLUSIONS

Data transformation has downstream and unforeseen consequences in cluster analysis. Rather than viewing this process as a black box, we have shown ways to quantitatively and qualitatively evaluate and select the appropriate preprocessing pipeline.

Collapse

Liu Q, Ostinelli EG, De Crescenzo F, Li Z, Tomlinson A, Salanti G, Cipriani A, Efthimiou O. Predicting outcomes at the individual patient level: what is the best method? BMJ MENTAL HEALTH 2023;26:e300701. [PMID: 37316257 PMCID: PMC10277128 DOI: 10.1136/bmjment-2023-300701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 04/26/2023] [Indexed: 06/16/2023]

Abstract

OBJECTIVE

When developing prediction models, researchers commonly employ a single model which uses all the available data (end-to-end approach). Alternatively, a similarity-based approach has been previously proposed, in which patients with similar clinical characteristics are first grouped into clusters, then prediction models are developed within each cluster. The potential advantage of the similarity-based approach is that it may better address heterogeneity in patient characteristics. However, it remains unclear whether it improves the overall predictive performance. We illustrate the similarity-based approach using data from people with depression and empirically compare its performance with the end-to-end approach.

METHODS

We used primary care data collected in general practices in the UK. Using 31 predefined baseline variables, we aimed to predict the severity of depressive symptoms, measured by Patient Health Questionnaire-9, 60 days after initiation of antidepressant treatment. Following the similarity-based approach, we used k-means to cluster patients based on their baseline characteristics. We derived the optimal number of clusters using the Silhouette coefficient. We used ridge regression to build prediction models in both approaches. To compare the models' performance, we calculated the mean absolute error (MAE) and the coefficient of determination (R2) using bootstrapping.

RESULTS

We analysed data from 16 384 patients. The end-to-end approach resulted in an MAE of 4.64 and R2 of 0.20. The best-performing similarity-based model was for four clusters, with MAE of 4.65 and R2 of 0.19.

CONCLUSIONS

The end-to-end and the similarity-based model yielded comparable performance. Due to its simplicity, the end-to-end approach can be favoured when using demographic and clinical data to build prediction models on pharmacological treatments for depression.

Collapse

Jo H, Jun CH. A personalized classification model using similarity learning via supervised autoencoder. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Omar N, Nazirun NN, Vijayam B, Wahab AA, Bahuri HA. Diabetes subtypes classification for personalized health care: A review. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10202-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Gliozzo J, Mesiti M, Notaro M, Petrini A, Patak A, Puertas-Gallardo A, Paccanaro A, Valentini G, Casiraghi E. Heterogeneous data integration methods for patient similarity networks. Brief Bioinform 2022;23:6604996. [PMID: 35679533 PMCID: PMC9294435 DOI: 10.1093/bib/bbac207] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 04/14/2022] [Accepted: 05/04/2022] [Indexed: 12/29/2022] Open

Gim JA. A Genomic Information Management System for Maintaining Healthy Genomic States and Application of Genomic Big Data in Clinical Research. Int J Mol Sci 2022;23:5963. [PMID: 35682641 PMCID: PMC9180925 DOI: 10.3390/ijms23115963] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 05/22/2022] [Accepted: 05/25/2022] [Indexed: 01/19/2023] Open

Personalised Outcomes Forecasts of Supervised Exercise Therapy in Intermittent Claudication: An Application of Neighbours Based Prediction Methods with Routinely Collected Clinical Data. Eur J Vasc Endovasc Surg 2022;63:594-601. [PMID: 35210160 DOI: 10.1016/j.ejvs.2021.12.040] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 12/08/2021] [Accepted: 12/29/2021] [Indexed: 11/21/2022]

Wang N, Wang M, Zhou Y, Liu H, Wei L, Fei X, Chen H. Sequential Data-Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development. J Med Internet Res 2022;24:e30720. [PMID: 34989682 PMCID: PMC8778569 DOI: 10.2196/30720] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Revised: 10/08/2021] [Accepted: 11/08/2021] [Indexed: 12/23/2022] Open

Abstract

BACKGROUND

Sequential information in electronic medical records is valuable and helpful for patient outcome prediction but is rarely used for patient similarity measurement because of its unevenness, irregularity, and heterogeneity.

OBJECTIVE

We aimed to develop a patient similarity framework for patient outcome prediction that makes use of sequential and cross-sectional information in electronic medical record systems.

METHODS

Sequence similarity was calculated from timestamped event sequences using edit distance, and trend similarity was calculated from time series using dynamic time warping and Haar decomposition. We also extracted cross-sectional information, namely, demographic, laboratory test, and radiological report data, for additional similarity calculations. We validated the effectiveness of the framework by constructing k-nearest neighbors classifiers to predict mortality and readmission for acute myocardial infarction patients, using data from (1) a public data set and (2) a private data set, at 3 time points-at admission, on Day 7, and at discharge-to provide early warning patient outcomes. We also constructed state-of-the-art Euclidean-distance k-nearest neighbor, logistic regression, random forest, long short-term memory network, and recurrent neural network models, which were used for comparison.

RESULTS

With all available information during a hospitalization episode, predictive models using the similarity model outperformed baseline models based on both public and private data sets. For mortality predictions, all models except for the logistic regression model showed improved performances over time. There were no such increasing trends in predictive performances for readmission predictions. The random forest and logistic regression models performed best for mortality and readmission predictions, respectively, when using information from the first week after admission.

CONCLUSIONS

For patient outcome predictions, the patient similarity framework facilitated sequential similarity calculations for uneven electronic medical record data and helped improve predictive performance.

Collapse

Oh SH, Back S, Park J. Measuring Patient Similarity on Multiple Diseases by Joint Learning via a Convolutional Neural Network. SENSORS (BASEL, SWITZERLAND) 2021;22:131. [PMID: 35009673 PMCID: PMC8749530 DOI: 10.3390/s22010131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 12/22/2021] [Accepted: 12/23/2021] [Indexed: 06/14/2023]

Huang HZ, Lu XD, Guo W, Jiang XB, Yan ZM, Wang SP. Heterogeneous Information Network-Based Patient Similarity Search. Front Cell Dev Biol 2021;9:735687. [PMID: 34568345 PMCID: PMC8456037 DOI: 10.3389/fcell.2021.735687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2021] [Accepted: 07/30/2021] [Indexed: 11/13/2022] Open

Abstract

Patient similarity search is a fundamental and important task in artificial intelligence-assisted medicine service, which is beneficial to medical diagnosis, such as making accurate predictions for similar diseases and recommending personalized treatment plans. Existing patient similarity search methods retrieve medical events associated with patients from Electronic Health Record (EHR) data and map them to vectors. The similarity between patients is expressed by calculating the similarity or dissimilarity between the corresponding vectors of medical events, thereby completing the patient similarity measurement. However, the obtained vectors tend to be high dimensional and sparse, which makes it hard to calculate patient similarity accurately. In addition, most of existing methods cannot capture the time information in the EHR, which is not conducive to analyzing the influence of time factors on patient similarity search. To solve these problems, we propose a patient similarity search method based on a heterogeneous information network. On the one hand, the proposed method uses a heterogeneous information network to connect patients, diseases, and drugs, which solves the problem of vector representation of mixed information related to patients, diseases, and drugs. Meanwhile, our method measures the similarity between patients by calculating the similarity between nodes in the heterogeneous information network. In this way, the challenges caused by high-dimensional and sparse vectors can be addressed. On the other hand, the proposed method solves the problem of inaccurate patient similarity search caused by the lack of use of time information in the patient similarity measurement process by encoding time information into an annotated heterogeneous information network. Experiments show that our method is better than the compared baseline methods.

Collapse

Wang N, Huang Y, Liu H, Zhang Z, Wei L, Fei X, Chen H. Study on the semi-supervised learning-based patient similarity from heterogeneous electronic medical records. BMC Med Inform Decis Mak 2021;21:58. [PMID: 34330261 PMCID: PMC8323210 DOI: 10.1186/s12911-021-01432-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 02/09/2021] [Indexed: 12/24/2022] Open

Abstract

Background

A new learning-based patient similarity measurement was proposed to measure patients’ similarity for heterogeneous electronic medical records (EMRs) data.

Methods

We first calculated feature-level similarities according to the features’ attributes. A domain expert provided patient similarity scores of 30 randomly selected patients. These similarity scores and feature-level similarities for 30 patients comprised the labeled sample set, which was used for the semi-supervised learning algorithm to learn the patient-level similarities for all patients. Then we used the k-nearest neighbor (kNN) classifier to predict four liver conditions. The predictive performances were compared in four different situations. We also compared the performances between personalized kNN models and other machine learning models. We assessed the predictive performances by the area under the receiver operating characteristic curve (AUC), F1-score, and cross-entropy (CE) loss.

Results

As the size of the random training samples increased, the kNN models using the learned patient similarity to select near neighbors consistently outperformed those using the Euclidean distance to select near neighbors (all P values < 0.001). The kNN models using the learned patient similarity to identify the top k nearest neighbors from the random training samples also had a higher best-performance (AUC: 0.95 vs. 0.89, F1-score: 0.84 vs. 0.67, and CE loss: 1.22 vs. 1.82) than those using the Euclidean distance. As the size of the similar training samples increased, which composed the most similar samples determined by the learned patient similarity, the performance of kNN models using the simple Euclidean distance to select the near neighbors degraded gradually. When exchanging the role of the Euclidean distance, and the learned patient similarity in selecting the near neighbors and similar training samples, the performance of the kNN models gradually increased. These two kinds of kNN models had the same best-performance of AUC 0.95, F1-score 0.84, and CE loss 1.22. Among the four reference models, the highest AUC and F1-score were 0.94 and 0.80, separately, which were both lower than those for the simple and similarity-based kNN models.

Conclusions

This learning-based method opened an opportunity for similarity measurement based on heterogeneous EMR data and supported the secondary use of EMR data.

Collapse

Affiliation(s)

Ni Wang School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing, 100069, People's Republic of China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, 100069, People's Republic of China
Yanqun Huang School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing, 100069, People's Republic of China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, 100069, People's Republic of China
Honglei Liu School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing, 100069, People's Republic of China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, 100069, People's Republic of China
Zhiqiang Zhang School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing, 100069, People's Republic of China.,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, 100069, People's Republic of China
Lan Wei Information Center, Xuanwu Hospital, Capital Medical University, Beijing, 100053, People's Republic of China
Xiaolu Fei Information Center, Xuanwu Hospital, Capital Medical University, Beijing, 100053, People's Republic of China
Hui Chen School of Biomedical Engineering, Capital Medical University, No.10, Xitoutiao, You An Men, Fengtai District, Beijing, 100069, People's Republic of China. .,Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, 100069, People's Republic of China.

Collapse

Oei RW, Fang HSA, Tan WY, Hsu W, Lee ML, Tan NC. Using Domain Knowledge and Data-Driven Insights for Patient Similarity Analytics. J Pers Med 2021;11:jpm11080699. [PMID: 34442343 PMCID: PMC8398126 DOI: 10.3390/jpm11080699] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 07/15/2021] [Accepted: 07/21/2021] [Indexed: 12/23/2022] Open

Xu D, Sheng JQ, Hu PJH, Huang TS, Hsu CC. A Deep Learning-Based Unsupervised Method to Impute Missing Values in Patient Records for Improved Management of Cardiovascular Patients. IEEE J Biomed Health Inform 2021;25:2260-2272. [PMID: 33095720 DOI: 10.1109/jbhi.2020.3033323] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Yong Z, Luo L, Gu Y, Li C. Implication of excessive length of stay of asthma patient with heterogenous status attributed to air pollution. JOURNAL OF ENVIRONMENTAL HEALTH SCIENCE & ENGINEERING 2021;19:95-106. [PMID: 34150221 PMCID: PMC8172679 DOI: 10.1007/s40201-020-00584-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Accepted: 11/05/2020] [Indexed: 02/08/2023]

Abstract

OBJECTIVE

Air pollution has potential risk on asthma patients, further prolongs the length of stay. However, it is unclear that the impact of air pollution on excessive length of stay (ELoS) of heterogeneous asthma patients. In this study, we proposed a K-Nearest Neighbor (KNN) embedded approach incorporating with patient status to analyze the impact of short-term air pollution on the ELoS of asthma patients.

METHODS

The KNN embedded approach includes two stages. Firstly, the KNN algorithm was employed to search for the most similar patient community and approximate kernel proxy of each index patient by Euclidean distance. Then, we built the differential fixed-effect linear model to estimate the risk of air pollution to the ELoS.

RESULTS

We analyzed 6563 asthma patients' medical insurance records in a large city of China from January to December in 2014. It was found that when the duration of exposure to air pollution (i.e., PM2.5, PM10, SO2, NO2, and CO) reaches around 4-5 days, the risk of increasing the ELoS becomes the largest. But only O3 shows the opposite effect. What's more, CO is the dominant risk to increase the ELoS. With a 1 mg/m3 increment of CO average concentration in 5 days, the ELoS will go up by 0.8157 day (95%CI:0.72,0.9114). Based on the kernel proxy in the top 1% similar patient community, the additional financial burden posed on each patient increases by RMB 488.6002 (95%CI:430.1962,547.0043) due to the ELoS.

CONCLUSIONS

The KNN embedded approach is an innovative method that takes into account the heterogeneous patient status, and effectively estimates the impact of air pollution on the ELoS. It is concluded that air pollution poses adverse effects and additional financial burdens on asthma patients. Heterogeneous patients should adopt different strategies in health management to reduce the risk of increasing the ELoS due to air pollution, and improve the efficiency of medical resource utilization.

SUPPLEMENTARY INFORMATION

The online version contains supplementary material available at 10.1007/s40201-020-00584-8.

Collapse

Seligson ND, Warner JL, Dalton WS, Martin D, Miller RS, Patt D, Kehl KL, Palchuk MB, Alterovitz G, Wiley LK, Huang M, Shen F, Wang Y, Nguyen KA, Wong AF, Meric-Bernstam F, Bernstam EV, Chen JL. Recommendations for patient similarity classes: results of the AMIA 2019 workshop on defining patient similarity. J Am Med Inform Assoc 2021;27:1808-1812. [PMID: 32885823 PMCID: PMC7671612 DOI: 10.1093/jamia/ocaa159] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Revised: 06/19/2020] [Accepted: 07/24/2020] [Indexed: 12/14/2022] Open

Sisk R, Lin L, Sperrin M, Barrett JK, Tom B, Diaz-Ordaz K, Peek N, Martin GP. Informative presence and observation in routine health data: A review of methodology for clinical risk prediction. J Am Med Inform Assoc 2021;28:155-166. [PMID: 33164082 PMCID: PMC7810439 DOI: 10.1093/jamia/ocaa242] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Accepted: 09/17/2020] [Indexed: 12/20/2022] Open

Abstract

Objective

Informative presence (IP) is the phenomenon whereby the presence or absence of patient data is potentially informative with respect to their health condition, with informative observation (IO) being the longitudinal equivalent. These phenomena predominantly exist within routinely collected healthcare data, in which data collection is driven by the clinical requirements of patients and clinicians. The extent to which IP and IO are considered when using such data to develop clinical prediction models (CPMs) is unknown, as is the existing methodology aiming at handling these issues. This review aims to synthesize such existing methodology, thereby helping identify an agenda for future methodological work.

Materials and Methods

A systematic literature search was conducted by 2 independent reviewers using prespecified keywords.

Results

Thirty-six articles were included. We categorized the methods presented within as derived predictors (including some representation of the measurement process as a predictor in the model), modeling under IP, and latent structures. Including missing indicators or summary measures as predictors is the most commonly presented approach amongst the included studies (24 of 36 articles).

Discussion

This is the first review to collate the literature in this area under a prediction framework. A considerable body relevant of literature exists, and we present ways in which the described methods could be developed further. Guidance is required for specifying the conditions under which each method should be used to enable applied prediction modelers to use these methods.

Conclusions

A growing recognition of IP and IO exists within the literature, and methodology is increasingly becoming available to leverage these phenomena for prediction purposes. IP and IO should be approached differently in a prediction context than when the primary goal is explanation. The work included in this review has demonstrated theoretical and empirical benefits of incorporating IP and IO, and therefore we recommend that applied health researchers consider incorporating these methods in their work.

Collapse

Personalized treatment options for chronic diseases using precision cohort analytics. Sci Rep 2021;11:1139. [PMID: 33441956 PMCID: PMC7806725 DOI: 10.1038/s41598-021-80967-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 12/31/2020] [Indexed: 12/15/2022] Open

Feng Y, Wang Y, Zeng C, Mao H. Artificial Intelligence and Machine Learning in Chronic Airway Diseases: Focus on Asthma and Chronic Obstructive Pulmonary Disease. Int J Med Sci 2021;18:2871-2889. [PMID: 34220314 PMCID: PMC8241767 DOI: 10.7150/ijms.58191] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 05/20/2021] [Indexed: 02/05/2023] Open

Lopez Pineda A, Pourshafeie A, Ioannidis A, Leibold CM, Chan AL, Bustamante CD, Frankovich J, Wojcik GL. Discovering prescription patterns in pediatric acute-onset neuropsychiatric syndrome patients. J Biomed Inform 2020;113:103664. [PMID: 33359113 DOI: 10.1016/j.jbi.2020.103664] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 10/28/2020] [Accepted: 12/10/2020] [Indexed: 11/28/2022]

Sharafoddini A, Dubin JA, Lee J. Identifying subpopulations of septic patients: A temporal data-driven approach. Comput Biol Med 2020;130:104182. [PMID: 33370712 DOI: 10.1016/j.compbiomed.2020.104182] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 12/14/2020] [Accepted: 12/14/2020] [Indexed: 01/31/2023]

Saad M, Lee IH. Leveraging hybrid biomarkers in clinical endpoint prediction. BMC Med Inform Decis Mak 2020;20:255. [PMID: 33028301 PMCID: PMC7538849 DOI: 10.1186/s12911-020-01262-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 09/15/2020] [Indexed: 11/20/2022] Open

Abstract

Background

Clinical endpoint prediction remains challenging for health providers. Although predictors such as age, gender, and disease staging are of considerable predictive value, the accuracy often ranges between 60 and 80%. An accurate prognosis assessment is required for making effective clinical decisions.

Methods

We proposed an extended prognostic model based on clinical covariates with adjustment for additional variables that were radio-graphically induced, termed imaging biomarkers. Eight imaging biomarkers were introduced and investigated in a cohort of 68 non-small cell lung cancer subjects with tumor internal characteristic. The subjects comprised of 40 males and 28 females with mean age at 68.7 years. The imaging biomarkers used to quantify the solid component and non-solid component of a tumor. The extended model comprises of additional frameworks that correlate these markers to the survival ends through uni- and multi-variable analysis to determine the most informative predictors, before combining them with existing clinical predictors. Performance was compared between traditional and extended approaches using Receiver Operating Characteristic (ROC) curves, Area under the ROC curves (AUC), Kaplan-Meier (KM) curves, Cox Proportional Hazard, and log-rank tests (p-value).

Results

The proposed hybrid model exhibited an impressive boosting pattern over the traditional approach of prognostic modelling in the survival prediction (AUC ranging from 77 to 97%). Four developed imaging markers were found to be significant in distinguishing between subjects having more and less dense components: (P = 0.002–0.006). The correlation to survival analysis revealed that patients with denser composition of tumor (solid dominant) lived 1.6–2.2 years longer (mean survival) and 0.5–2.0 years longer (median survival), than those with less dense composition (non-solid dominant).

Conclusion

The present study provides crucial evidence that there is an added value for incorporating additional image-based predictors while predicting clinical endpoints. Though the hypotheses were confirmed in a customized case study, we believe the proposed model is easily adapted to various clinical cases, such as predictions of complications, treatment response, and disease evolution.

Collapse

Hendrickx JO, van Gastel J, Leysen H, Martin B, Maudsley S. High-dimensionality Data Analysis of Pharmacological Systems Associated with Complex Diseases. Pharmacol Rev 2020;72:191-217. [PMID: 31843941 DOI: 10.1124/pr.119.017921] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open

Abstract

It is widely accepted that molecular reductionist views of highly complex human physiologic activity, e.g., the aging process, as well as therapeutic drug efficacy are largely oversimplifications. Currently some of the most effective appreciation of biologic disease and drug response complexity is achieved using high-dimensionality (H-D) data streams from transcriptomic, proteomic, metabolomics, or epigenomic pipelines. Multiple H-D data sets are now common and freely accessible for complex diseases such as metabolic syndrome, cardiovascular disease, and neurodegenerative conditions such as Alzheimer's disease. Over the last decade our ability to interrogate these high-dimensionality data streams has been profoundly enhanced through the development and implementation of highly effective bioinformatic platforms. Employing these computational approaches to understand the complexity of age-related diseases provides a facile mechanism to then synergize this pathologic appreciation with a similar level of understanding of therapeutic-mediated signaling. For informative pathology and drug-based analytics that are able to generate meaningful therapeutic insight across diverse data streams, novel informatics processes such as latent semantic indexing and topological data analyses will likely be important. Elucidation of H-D molecular disease signatures from diverse data streams will likely generate and refine new therapeutic strategies that will be designed with a cognizance of a realistic appreciation of the complexity of human age-related disease and drug effects. We contend that informatic platforms should be synergistic with more advanced chemical/drug and phenotypic cellular/tissue-based analytical predictive models to assist in either de novo drug prioritization or effective repurposing for the intervention of aging-related diseases. SIGNIFICANCE STATEMENT: All diseases, as well as pharmacological mechanisms, are far more complex than previously thought a decade ago. With the advent of commonplace access to technologies that produce large volumes of high-dimensionality data (e.g., transcriptomics, proteomics, metabolomics), it is now imperative that effective tools to appreciate this highly nuanced data are developed. Being able to appreciate the subtleties of high-dimensionality data will allow molecular pharmacologists to develop the most effective multidimensional therapeutics with effectively engineered efficacy profiles.

Collapse

Hier DB, Kopel J, Brint SU, Wunsch DC, Olbricht GR, Azizi S, Allen B. Evaluation of standard and semantically-augmented distance metrics for neurology patients. BMC Med Inform Decis Mak 2020;20:203. [PMID: 32843023 PMCID: PMC7448345 DOI: 10.1186/s12911-020-01217-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 08/12/2020] [Indexed: 12/23/2022] Open

Tokodi M, Shrestha S, Bianco C, Kagiyama N, Casaclang-Verzosa G, Narula J, Sengupta PP. Interpatient Similarities in Cardiac Function: A Platform for Personalized Cardiovascular Medicine. JACC Cardiovasc Imaging 2020;13:1119-1132. [PMID: 32199835 PMCID: PMC7556337 DOI: 10.1016/j.jcmg.2019.12.018] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Revised: 10/31/2019] [Accepted: 12/19/2019] [Indexed: 12/20/2022]

Abstract

OBJECTIVES

The authors applied unsupervised machine-learning techniques for integrating echocardiographic features of left ventricular (LV) structure and function into a patient similarity network that predicted major adverse cardiac event(s) (MACE) in an individual patient.

BACKGROUND

Patient similarity analysis is an evolving paradigm for precision medicine in which patients are clustered or classified based on their similarities in several clinical features.

METHODS

A retrospective cohort of 866 patients was used to develop a network architecture using 9 echocardiographic features of LV structure and function. The data for 468 patients from 2 prospective cohort registries were then added to test the model's generalizability.

RESULTS

The map of cross-sectional data in the retrospective cohort resulted in a looped patient network that persisted even after the addition of data from the prospective cohort registries. After subdividing the loop into 4 regions, patients in each region showed unique differences in LV function, with Kaplan-Meier curves demonstrating significant differences in MACE-related rehospitalization and death (both p < 0.001). Addition of network information to clinical risk predictors resulted in significant improvements in net reclassification, integrated discrimination, and median risk scores for predicting MACE (p < 0.05 for all). Furthermore, the network predicted the cardiac disease cycle in each of the 96 patients who had second echocardiographic evaluations. An improvement or remaining in low-risk regions was associated with lower MACE-related rehospitalization rates than worsening or remaining in high-risk regions (3% vs. 37%; p < 0.001).

CONCLUSIONS

Patient similarity analysis integrates multiple features of cardiac function to develop a phenotypic network in which patients can be mapped to specific locations associated with specific disease stage and clinical outcomes. The use of patient similarity analysis may have relevance for automated staging of cardiac disease severity, personalized prediction of prognosis, and monitoring progression or response to therapies.

Collapse

Cho JS, Shrestha S, Kagiyama N, Hu L, Ghaffar YA, Casaclang-Verzosa G, Zeb I, Sengupta PP. A Network-Based "Phenomics" Approach for Discovering Patient Subtypes From High-Throughput Cardiac Imaging Data. JACC Cardiovasc Imaging 2020;13:1655-1670. [PMID: 32762883 DOI: 10.1016/j.jcmg.2020.02.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Revised: 02/19/2020] [Accepted: 02/20/2020] [Indexed: 12/16/2022]

Abstract

OBJECTIVES

The authors present a method that focuses on cohort matching algorithms for performing patient-to-patient comparisons along multiple echocardiographic parameters for predicting meaningful patient subgroups.

BACKGROUND

Recent efforts in collecting multiomics data open numerous opportunities for comprehensive integration of highly heterogenous data to classify a patient's cardiovascular state, eventually leading to tailored therapies.

METHODS

A total of 42 echocardiography features, including 2-dimensional and Doppler measurements, left ventricular (LV) and atrial speckle-tracking, and vector flow mapping data, were obtained in 297 patients. A similarity network was developed to delineate distinct patient phenotypes, and then neural network models were trained for discriminating the phenotypic presentations.

RESULTS

The patient similarity model identified 4 clusters (I to IV), with patients in each cluster showed distinctive clinical presentations based on American College of Cardiology/American Heart Association heart failure stage and the occurrence of short-term major adverse cardiac and cerebrovascular events. Compared with other clusters, cluster IV had a higher prevalence of stage C or D heart failure (78%; p < 0.001), New York Heart Association functional classes III or IV (61%; p < 0.001), and a higher incidence of major adverse cardiac and cerebrovascular events (p < 0.001). The neural network model showed robust prediction of patient clusters, with area under the receiver-operating characteristic curve ranging from 0.82 to 0.99 for the independent hold-out validation set.

CONCLUSIONS

Automated computational methods for phenotyping can be an effective strategy to fuse multidimensional parameters of LV structure and function. It can identify distinct cardiac phenogroups in terms of clinical characteristics, cardiac structure and function, hemodynamics, and outcomes.

Collapse

Wentzel A, Hanula P, Luciani T, Elgohari B, Elhalawani H, Canahuate G, Vock D, Fuller CD, Marai GE. Cohort-based T-SSIM Visual Computing for Radiation Therapy Prediction and Exploration. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2020;26:949-959. [PMID: 31442988 PMCID: PMC7253296 DOI: 10.1109/tvcg.2019.2934546] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Huang M, Shah ND, Yao L. Evaluating global and local sequence alignment methods for comparing patient medical records. BMC Med Inform Decis Mak 2019;19:263. [PMID: 31856819 PMCID: PMC6921442 DOI: 10.1186/s12911-019-0965-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Abstract

Background

Sequence alignment is a way of arranging sequences (e.g., DNA, RNA, protein, natural language, financial data, or medical events) to identify the relatedness between two or more sequences and regions of similarity. For Electronic Health Records (EHR) data, sequence alignment helps to identify patients of similar disease trajectory for more relevant and precise prognosis, diagnosis and treatment of patients.

Methods

We tested two cutting-edge global sequence alignment methods, namely dynamic time warping (DTW) and Needleman-Wunsch algorithm (NWA), together with their local modifications, DTW for Local alignment (DTWL) and Smith-Waterman algorithm (SWA), for aligning patient medical records. We also used 4 sets of synthetic patient medical records generated from a large real-world EHR database as gold standard data, to objectively evaluate these sequence alignment algorithms.

Results

For global sequence alignments, 47 out of 80 DTW alignments and 11 out of 80 NWA alignments had superior similarity scores than reference alignments while the rest 33 DTW alignments and 69 NWA alignments had the same similarity scores as reference alignments. Forty-six out of 80 DTW alignments had better similarity scores than NWA alignments with the rest 34 cases having the equal similarity scores from both algorithms. For local sequence alignments, 70 out of 80 DTWL alignments and 68 out of 80 SWA alignments had larger coverage and higher similarity scores than reference alignments while the rest DTWL alignments and SWA alignments received the same coverage and similarity scores as reference alignments. Six out of 80 DTWL alignments showed larger coverage and higher similarity scores than SWA alignments. Thirty DTWL alignments had the equal coverage but better similarity scores than SWA. DTWL and SWA received the equal coverage and similarity scores for the rest 44 cases.

Conclusions

DTW, NWA, DTWL and SWA outperformed the reference alignments. DTW (or DTWL) seems to align better than NWA (or SWA) by inserting new daily events and identifying more similarities between patient medical records. The evaluation results could provide valuable information on the strengths and weakness of these sequence alignment methods for future development of sequence alignment methods and patient similarity-based studies.

Collapse

Ruan T, Lei L, Zhou Y, Zhai J, Zhang L, He P, Gao J. Representation learning for clinical time series prediction tasks in electronic health records. BMC Med Inform Decis Mak 2019;19:259. [PMID: 31842854 PMCID: PMC6916209 DOI: 10.1186/s12911-019-0985-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Chen X, Garcelon N, Neuraz A, Billot K, Lelarge M, Bonald T, Garcia H, Martin Y, Benoit V, Vincent M, Faour H, Douillet M, Lyonnet S, Saunier S, Burgun A. Phenotypic similarity for rare disease: Ciliopathy diagnoses and subtyping. J Biomed Inform 2019;100:103308. [DOI: 10.1016/j.jbi.2019.103308] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 09/05/2019] [Accepted: 10/11/2019] [Indexed: 01/29/2023]

Wang N, Huang Y, Liu H, Fei X, Wei L, Zhao X, Chen H. Measurement and application of patient similarity in personalized predictive modeling based on electronic medical records. Biomed Eng Online 2019;18:98. [PMID: 31601207 PMCID: PMC6788002 DOI: 10.1186/s12938-019-0718-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Accepted: 10/01/2019] [Indexed: 12/24/2022] Open

Xu D, Sheng JQ, Hu PJH, Huang TS, Lee WC. Predicting hepatocellular carcinoma recurrences: A data-driven multiclass classification method incorporating latent variables. J Biomed Inform 2019;96:103237. [PMID: 31238108 DOI: 10.1016/j.jbi.2019.103237] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Revised: 03/30/2019] [Accepted: 06/18/2019] [Indexed: 12/12/2022]

Haas K, Morton S, Gupta S, Mahoui M. Using Similarity Metrics on Real World Data and Patient Treatment Pathways to Recommend the Next Treatment. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2019;2019:398-406. [PMID: 31258993 PMCID: PMC6568112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Sharafoddini A, Dubin JA, Maslove DM, Lee J. A New Insight Into Missing Data in Intensive Care Unit Patient Profiles: Observational Study. JMIR Med Inform 2019;7:e11605. [PMID: 30622091 PMCID: PMC6329436 DOI: 10.2196/11605] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Revised: 10/30/2018] [Accepted: 10/30/2018] [Indexed: 01/08/2023] Open

Abstract

Background

The data missing from patient profiles in intensive care units (ICUs) are substantial and unavoidable. However, this incompleteness is not always random or because of imperfections in the data collection process.

Objective

This study aimed to investigate the potential hidden information in data missing from electronic health records (EHRs) in an ICU and examine whether the presence or missingness of a variable itself can convey information about the patient health status.

Methods

Daily retrieval of laboratory test (LT) measurements from the Medical Information Mart for Intensive Care III database was set as our reference for defining complete patient profiles. Missingness indicators were introduced as a way of representing presence or absence of the LTs in a patient profile. Thereafter, various feature selection methods (filter and embedded feature selection methods) were used to examine the predictive power of missingness indicators. Finally, a set of well-known prediction models (logistic regression [LR], decision tree, and random forest) were used to evaluate whether the absence status itself of a variable recording can provide predictive power. We also examined the utility of missingness indicators in improving predictive performance when used with observed laboratory measurements as model input. The outcome of interest was in-hospital mortality and mortality at 30 days after ICU discharge.

Results

Regardless of mortality type or ICU day, more than 40% of the predictors selected by feature selection methods were missingness indicators. Notably, employing missingness indicators as the only predictors achieved reasonable mortality prediction on all days and for all mortality types (for instance, in 30-day mortality prediction with LR, we achieved area under the curve of the receiver operating characteristic [AUROC] of 0.6836±0.012). Including indicators with observed measurements in the prediction models also improved the AUROC; the maximum improvement was 0.0426. Indicators also improved the AUROC for Simplified Acute Physiology Score II model—a well-known ICU severity of illness score—confirming the additive information of the indicators (AUROC of 0.8045±0.0109 for 30-day mortality prediction for LR).

Conclusions

Our study demonstrated that the presence or absence of LT measurements is informative and can be considered a potential predictor of in-hospital and 30-day mortality. The comparative analysis of prediction models also showed statistically significant prediction improvement when indicators were included. Moreover, missing data might reflect the opinions of examining clinicians. Therefore, the absence of measurements can be informative in ICUs and has predictive power beyond the measured data themselves. This initial case study shows promise for more in-depth analysis of missing data and its informativeness in ICUs. Future studies are needed to generalize these results.

Collapse

Tsaneva-Atanasova K, Diaz-Zuccarini V. Editorial: Mathematics for Healthcare as Part of Computational Medicine. Front Physiol 2018;9:985. [PMID: 30087624 PMCID: PMC6066689 DOI: 10.3389/fphys.2018.00985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2018] [Accepted: 07/04/2018] [Indexed: 11/13/2022] Open

Zhang H, Zhu F, Dodge HH, Higgins GA, Omenn GS, Guan Y. A similarity-based approach to leverage multi-cohort medical data on the diagnosis and prognosis of Alzheimer's disease. Gigascience 2018;7:5052206. [PMID: 30010762 PMCID: PMC6054197 DOI: 10.1093/gigascience/giy085] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Revised: 04/15/2018] [Accepted: 06/28/2018] [Indexed: 01/17/2023] Open

Affiliation(s)

Hongjiu Zhang Department of Computational Medicine and Bioinformatics, University of Michigan, 2017G Palmer Commons, 100 Washtenaw Avenue, Ann Arbor, MI, USA 48109
Fan Zhu Department of Computational Medicine and Bioinformatics, University of Michigan, 2017G Palmer Commons, 100 Washtenaw Avenue, Ann Arbor, MI, USA 48109 Chongqing Key Laboratory of Big Data and Intelligent Computing, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, 266 Fangzheng Avenue, Shuitu Hi-tech Industrial Park, Shuitu Town, Beibei District, Chongqing, China 400714
Hiroko H Dodge Michigan Alzheimer's Disease Center, University of Michigan, 2101 Commonwealth Blvd, Ann Arbor, MI, USA 48105 Department of Neurology, University of Michigan, 1500 E. Medical Center Dr., 1914 Taubman Center SPC 5316, Ann Arbor, MI, USA 48109 Layton Aging and Alzheimer's Disease Center and Department of Neurology, Oregon Health & Science University, 3181 S.W. Sam Jackson Park Road, L226, Portland, OR, USA 97239
Gerald A Higgins Department of Computational Medicine and Bioinformatics, University of Michigan, 2017G Palmer Commons, 100 Washtenaw Avenue, Ann Arbor, MI, USA 48109
Gilbert S Omenn Department of Computational Medicine and Bioinformatics, University of Michigan, 2017G Palmer Commons, 100 Washtenaw Avenue, Ann Arbor, MI, USA 48109 Department of Internal Medicine, University of Michigan, 3110 Taubman Center, SPC 5368, 1500 East Medical Center Drive, Ann Arbor, MI, USA 48109 Department of Human Genetics, University of Michigan, 4909 Buhl Building, 1241 E. Catherine St., Ann Arbor, MI, USA 48109 School of Public Health, University of Michigan, 1415 Washington Heights, Ann Arbor, MI, USA 48109
Yuanfang Guan Department of Computational Medicine and Bioinformatics, University of Michigan, 2017G Palmer Commons, 100 Washtenaw Avenue, Ann Arbor, MI, USA 48109 Department of Internal Medicine, University of Michigan, 3110 Taubman Center, SPC 5368, 1500 East Medical Center Drive, Ann Arbor, MI, USA 48109 Department of Electronic Engineering and Computer Science, Bob and Betty Beyster Building, 2260 Hayward Street, University of Michigan, Ann Arbor, MI, USA 48109
the Alzheimer's Disease Neuroimaging Initiative

Collapse

Suo Q, Ma F, Yuan Y, Huai M, Zhong W, Gao J, Zhang A. Deep Patient Similarity Learning for Personalized Healthcare. IEEE Trans Nanobioscience 2018;17:219-227. [DOI: 10.1109/tnb.2018.2837622] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Tényi Á, Vela E, Cano I, Cleries M, Monterde D, Gomez-Cabrero D, Roca J. Risk and temporal order of disease diagnosis of comorbidities in patients with COPD: a population health perspective. BMJ Open Respir Res 2018;5:e000302. [PMID: 29955364 PMCID: PMC6018856 DOI: 10.1136/bmjresp-2018-000302] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2018] [Revised: 05/22/2018] [Indexed: 02/06/2023] Open

Parimbelli E, Marini S, Sacchi L, Bellazzi R. Patient similarity for precision medicine: A systematic review. J Biomed Inform 2018;83:87-96. [PMID: 29864490 DOI: 10.1016/j.jbi.2018.06.001] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Revised: 05/16/2018] [Accepted: 06/01/2018] [Indexed: 12/19/2022]

Balikuddembe MS, Tumwesigye NM, Wakholi PK, Tylleskär T. Computerized Childbirth Monitoring Tools for Health Care Providers Managing Labor: A Scoping Review. JMIR Med Inform 2017;5:e14. [PMID: 28619702 PMCID: PMC5491898 DOI: 10.2196/medinform.6959] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 02/11/2017] [Accepted: 04/11/2017] [Indexed: 11/24/2022] Open

Abstract

Background

Proper monitoring of labor and childbirth prevents many pregnancy-related complications. However, monitoring is still poor in many places partly due to the usability concerns of support tools such as the partograph. In 2011, the World Health Organization (WHO) called for the development and evaluation of context-adaptable electronic health solutions to health challenges. Computerized tools have penetrated many areas of health care, but their influence in supporting health staff with childbirth seems limited.

Objective

The objective of this scoping review was to determine the scope and trends of research on computerized labor monitoring tools that could be used by health care providers in childbirth management.

Methods

We used key terms to search the Web for eligible peer-reviewed and gray literature. Eligibility criteria were a computerized labor monitoring tool for maternity service providers and dated 2006 to mid-2016. Retrieved papers were screened to eliminate ineligible papers, and consensus was reached on the papers included in the final analysis.

Results

We started with about 380,000 papers, of which 14 papers qualified for the final analysis. Most tools were at the design and implementation stages of development. Three papers addressed post-implementation evaluations of two tools. No documentation on clinical outcome studies was retrieved. The parameters targeted with the tools varied, but they included fetal heart (10 of 11 tools), labor progress (8 of 11), and maternal status (7 of 11). Most tools were designed for use in personal computers in low-resource settings and could be customized for different user needs.

Conclusions

Research on computerized labor monitoring tools is inadequate. Compared with other labor parameters, there was preponderance to fetal heart monitoring and hardly any summative evaluation of the available tools. More research, including clinical outcomes evaluation of computerized childbirth monitoring tools, is needed.

Collapse

Cano I, Tenyi A, Vela E, Miralles F, Roca J. Perspectives on Big Data applications of health information. ACTA ACUST UNITED AC 2017. [DOI: 10.1016/j.coisb.2017.04.012] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Kim BY, Lee J. Smart Devices for Older Adults Managing Chronic Disease: A Scoping Review. JMIR Mhealth Uhealth 2017;5:e69. [PMID: 28536089 PMCID: PMC5461419 DOI: 10.2196/mhealth.7141] [Citation(s) in RCA: 132] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2016] [Revised: 03/30/2017] [Accepted: 04/18/2017] [Indexed: 12/17/2022] Open

Abstract

BACKGROUND

The emergence of smartphones and tablets featuring vastly advancing functionalities (eg, sensors, computing power, interactivity) has transformed the way mHealth interventions support chronic disease management for older adults. Baby boomers have begun to widely adopt smart devices and have expressed their desire to incorporate technologies into their chronic care. Although smart devices are actively used in research, little is known about the extent, characteristics, and range of smart device-based interventions.

OBJECTIVE

We conducted a scoping review to (1) understand the nature, extent, and range of smart device-based research activities, (2) identify the limitations of the current research and knowledge gap, and (3) recommend future research directions.

METHODS

We used the Arksey and O'Malley framework to conduct a scoping review. We identified relevant studies from MEDLINE, Embase, CINAHL, and Web of Science databases using search terms related to mobile health, chronic disease, and older adults. Selected studies used smart devices, sampled older adults, and were published in 2010 or after. The exclusion criteria were sole reliance on text messaging (short message service, SMS) or interactive voice response, validation of an electronic version of a questionnaire, postoperative monitoring, and evaluation of usability. We reviewed references. We charted quantitative data and analyzed qualitative studies using thematic synthesis. To collate and summarize the data, we used the chronic care model.

RESULTS

A total of 51 articles met the eligibility criteria. Research activity increased steeply in 2014 (17/51, 33%) and preexperimental design predominated (16/50, 32%). Diabetes (16/46, 35%) and heart failure management (9/46, 20%) were most frequently studied. We identified diversity and heterogeneity in the collection of biometrics and patient-reported outcome measures within and between chronic diseases. Across studies, we found 8 self-management supporting strategies and 4 distinct communication channels for supporting the decision-making process. In particular, self-monitoring (38/40, 95%), automated feedback (15/40, 38%), and patient education (13/40, 38%) were commonly used as self-management support strategies. Of the 23 studies that implemented decision support strategies, clinical decision making was delegated to patients in 10 studies (43%). The impact on patient outcomes was consistent with studies that used cellular phones. Patients with heart failure and asthma reported improved quality of life. Qualitative analysis yielded 2 themes of facilitating technology adoption for older adults and 3 themes of barriers.

CONCLUSIONS

Limitations of current research included a lack of gerontological focus, dominance of preexperimental design, narrow research scope, inadequate support for participants, and insufficient evidence for clinical outcome. Recommendations for future research include generating evidence for smart device-based programs, using patient-generated data for advanced data mining techniques, validating patient decision support systems, and expanding mHealth practice through innovative technologies.

Collapse