1
|
Zhang R, Liu Z, Zhu C, Cai H, Yin K, Zhong F, Liu L. Constructing a Clinical Patient Similarity Network of Gastric Cancer. Bioengineering (Basel) 2024; 11:808. [PMID: 39199766 PMCID: PMC11351872 DOI: 10.3390/bioengineering11080808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Accepted: 08/05/2024] [Indexed: 09/01/2024] Open
Abstract
OBJECTIVES Clinical molecular genetic testing and molecular imaging dramatically increase the quantity of clinical data. Combined with the extensive application of electronic health records, a medical data ecosystem is forming, which calls for big-data-based medicine models. We tried to use big data analytics to search for similar patients in a cancer cohort, showing how to apply artificial intelligence (AI) algorithms to clinical data processing to obtain clinically significant results, with the ultimate goal of improving healthcare management. METHODS In order to overcome the weaknesses of most data processing algorithms that rely on expert labeling and annotation, we uniformly adopted one-hot encoding for all types of clinical data, calculating the Euclidean distance to measure patient similarity and subgrouping via an unsupervised learning model. Overall survival (OS) was investigated to assess the clinical validity and clinical relevance of the model. RESULTS We took gastric cancers (GCs) as an example to build a high-dimensional clinical patient similarity network (cPSN). When performing the survival analysis, we found that Cluster_2 had the longest survival rates, while Cluster_5 had the worst prognosis among all the subgroups. As patients in the same subgroup share some clinical characteristics, the clinical feature analysis found that Cluster_2 harbored more lower distal GCs than upper proximal GCs, shedding light on the debates. CONCLUSION Overall, we constructed a cancer-specific cPSN with excellent interpretability and clinical significance, which would recapitulate patient similarity in the real-world. The constructed cPSN model is scalable, generalizable, and performs well for various data types.
Collapse
Affiliation(s)
- Rukui Zhang
- Institute of Biomedical Sciences, Fudan University, 131 Dongan Road, Shanghai 200032, China
| | - Zhaorui Liu
- Department of Gastrointestinal Surgery, Changhai Hospital, Naval Military Medical University, 168 Changhai Road, Shanghai 200433, China
| | - Chaoyu Zhu
- Intelligent Medicine Institute, Fudan University, 131 Dongan Road, Shanghai 200032, China
| | - Hui Cai
- Department of Gastrointestinal Surgery, Changhai Hospital, Naval Military Medical University, 168 Changhai Road, Shanghai 200433, China
| | - Kai Yin
- Department of Gastrointestinal Surgery, Changhai Hospital, Naval Military Medical University, 168 Changhai Road, Shanghai 200433, China
| | - Fan Zhong
- Intelligent Medicine Institute, Fudan University, 131 Dongan Road, Shanghai 200032, China
| | - Lei Liu
- Institute of Biomedical Sciences, Fudan University, 131 Dongan Road, Shanghai 200032, China
- Intelligent Medicine Institute, Fudan University, 131 Dongan Road, Shanghai 200032, China
| |
Collapse
|
2
|
Beason-Held LL, Kerley CI, Chaganti S, Moghekar A, Thambisetty M, Ferrucci L, Resnick SM, Landman BA. Health Conditions Associated with Alzheimer's Disease and Vascular Dementia. Ann Neurol 2023; 93:805-818. [PMID: 36571386 DOI: 10.1002/ana.26584] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 12/15/2022] [Accepted: 12/17/2022] [Indexed: 12/27/2022]
Abstract
OBJECTIVE We examined medical records to determine health conditions associated with dementia at varied intervals prior to dementia diagnosis in participants from the Baltimore Longitudinal Study of Aging (BLSA). METHODS Data were available for 347 Alzheimer's disease (AD), 76 vascular dementia (VaD), and 811 control participants without dementia. Logistic regressions were performed associating International Classification of Diseases, 9th Revision (ICD-9) health codes with dementia status across all time points, at 5 and 1 year(s) prior to dementia diagnosis, and at the year of diagnosis, controlling for age, sex, and follow-up length of the medical record. RESULTS In AD, the earliest and most consistent associations across all time points included depression, erectile dysfunction, gait abnormalities, hearing loss, and nervous and musculoskeletal symptoms. Cardiomegaly, urinary incontinence, non-epithelial skin cancer, and pneumonia were not significant until 1 year before dementia diagnosis. In VaD, the earliest and most consistent associations across all time points included abnormal electrocardiogram (EKG), cardiac dysrhythmias, cerebrovascular disease, non-epithelial skin cancer, depression, and hearing loss. Atrial fibrillation, occlusion of cerebral arteries, essential tremor, and abnormal reflexes were not significant until 1 year before dementia diagnosis. INTERPRETATION These findings suggest that some health conditions are associated with future dementia beginning at least 5 years before dementia diagnosis and are consistently seen over time, while others only reach significance closer to the date of diagnosis. These results also show that there are both shared and distinctive health conditions associated with AD and VaD. These results reinforce the need for medical intervention and treatment to lessen the impact of health comorbidities in the aging population. ANN NEUROL 2023;93:805-818.
Collapse
Affiliation(s)
- Lori L Beason-Held
- National Institute on Aging Intramural Research Program, Baltimore, Maryland, USA
| | - Cailey I Kerley
- Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, Tennessee, USA
| | - Shikha Chaganti
- Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, Tennessee, USA
| | - Abhay Moghekar
- Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Madhav Thambisetty
- National Institute on Aging Intramural Research Program, Baltimore, Maryland, USA
| | - Luigi Ferrucci
- National Institute on Aging Intramural Research Program, Baltimore, Maryland, USA
| | - Susan M Resnick
- National Institute on Aging Intramural Research Program, Baltimore, Maryland, USA
| | - Bennett A Landman
- Department of Electrical and Computer Engineering, Vanderbilt University, Nashville, Tennessee, USA
| |
Collapse
|
3
|
Tan WY, Gao Q, Oei RW, Hsu W, Lee ML, Tan NC. Diabetes medication recommendation system using patient similarity analytics. Sci Rep 2022; 12:20910. [PMID: 36463296 PMCID: PMC9719534 DOI: 10.1038/s41598-022-24494-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 11/16/2022] [Indexed: 12/07/2022] Open
Abstract
Type-2 diabetes mellitus (T2DM) is a medical condition in which oral medications avail to patients to curb their hyperglycaemia after failed dietary therapy. However, individual responses to the prescribed pharmacotherapy may differ due to their clinical profiles, comorbidities, lifestyles and medical adherence. One approach is to identify similar patients within the same community to predict their likely response to the prescribed diabetes medications. This study aims to present an evidence-based diabetes medication recommendation system (DMRS) underpinned by patient similarity analytics. The DMRS was developed using 10-year electronic health records of 54,933 adult patients with T2DM from six primary care clinics in Singapore. Multiple clinical variables including patient demographics, comorbidities, laboratory test results, existing medications, and trajectory patterns of haemoglobin A1c (HbA1c) were used to identify similar patients. The DMRS was evaluated on four groups of patients with comorbidities such as hyperlipidaemia (HLD) and hypertension (HTN). Recommendations were assessed using hit ratio which represents the percentage of patients with at least one recommended sets of medication matches exactly the diabetes prescriptions in both the type and dosage. Recall, precision, and mean reciprocal ranking of the recommendation against the diabetes prescriptions in the EHR records were also computed. Evaluation against the EHR prescriptions revealed that the DMRS recommendations can achieve hit ratio of 81% for diabetes patients with no comorbidity, 84% for those with HLD, 78% for those with HTN, and 75% for those with both HLD and HTN. By considering patients' clinical profiles and their trajectory patterns of HbA1c, the DMRS can provide an individualized recommendation that resembles the actual prescribed medication and dosage. Such a system is useful as a shared decision-making tool to assist clinicians in selecting the appropriate medications for patients with T2DM.
Collapse
Affiliation(s)
- Wei Ying Tan
- Institute of Data Science, National University of Singapore, 3 Research Link, #04-06, Singapore, 117602, Singapore.
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore.
| | - Qiao Gao
- Institute of Data Science, National University of Singapore, 3 Research Link, #04-06, Singapore, 117602, Singapore
| | - Ronald Wihal Oei
- Institute of Data Science, National University of Singapore, 3 Research Link, #04-06, Singapore, 117602, Singapore
| | - Wynne Hsu
- Institute of Data Science, National University of Singapore, 3 Research Link, #04-06, Singapore, 117602, Singapore
- School of Computing, National University of Singapore, Singapore, Singapore
| | - Mong Li Lee
- Institute of Data Science, National University of Singapore, 3 Research Link, #04-06, Singapore, 117602, Singapore
- School of Computing, National University of Singapore, Singapore, Singapore
| | - Ngiap Chuan Tan
- SingHealth Polyclinics, SingHealth, Singapore, Singapore
- Family Medicine Academic Clinical Programme, SingHealth-Duke NUS Academic Medical Centre, Singapore, Singapore
| |
Collapse
|
4
|
Oei RW, Hsu W, Lee ML, Tan NC. Using similar patients to predict complication in patients with diabetes, hypertension, and lipid disorder: a domain knowledge-infused convolutional neural network approach. J Am Med Inform Assoc 2022; 30:273-281. [PMID: 36343096 PMCID: PMC9846687 DOI: 10.1093/jamia/ocac212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 09/27/2022] [Accepted: 10/21/2022] [Indexed: 11/09/2022] Open
Abstract
OBJECTIVE This study aims to develop a convolutional neural network-based learning framework called domain knowledge-infused convolutional neural network (DK-CNN) for retrieving clinically similar patient and to personalize the prediction of macrovascular complication using the retrieved patients. MATERIALS AND METHODS We use the electronic health records of 169 434 patients with diabetes, hypertension, and/or lipid disorder. Patients are partitioned into 7 subcohorts based on their comorbidities. DK-CNN integrates both domain knowledge and disease trajectory of patients over multiple visits to retrieve similar patients. We use normalized discounted cumulative gain (nDCG) and macrovascular complication prediction performance to evaluate the effectiveness of DK-CNN compared to state-of-the-art models. Ablation studies are conducted to compare DK-CNN with reduced models that do not use domain knowledge as well as models that do not consider short-term, medium-term, and long-term trajectory over multiple visits. RESULTS Key findings from this study are: (1) DK-CNN is able to retrieve clinically similar patients and achieves the highest nDCG values in all 7 subcohorts; (2) DK-CNN outperforms other state-of-the-art approaches in terms of complication prediction performance in all 7 subcohorts; and (3) the ablation studies show that the full model achieves the highest nDCG compared with other 2 reduced models. DISCUSSION AND CONCLUSIONS DK-CNN is a deep learning-based approach which incorporates domain knowledge and patient trajectory data to retrieve clinically similar patients. It can be used to assist physicians who may refer to the outcomes and past treatments of similar patients as a guide for choosing an effective treatment for patients.
Collapse
Affiliation(s)
- Ronald Wihal Oei
- Corresponding Author: Ronald Wihal Oei, MBBS, Institute of Data Science, National University of Singapore, Innovation 4.0, #04-06, 3 Research Link, 117602 Singapore;
| | - Wynne Hsu
- Institute of Data Science, National University of Singapore, Singapore,School of Computing, National University of Singapore, Singapore
| | - Mong Li Lee
- Institute of Data Science, National University of Singapore, Singapore,School of Computing, National University of Singapore, Singapore
| | | |
Collapse
|
5
|
Marzano L, Darwich AS, Tendler S, Dan A, Lewensohn R, De Petris L, Raghothama J, Meijer S. A novel analytical framework for risk stratification of real-world data using machine learning: A small cell lung cancer study. Clin Transl Sci 2022; 15:2437-2447. [PMID: 35856401 PMCID: PMC9579402 DOI: 10.1111/cts.13371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 06/26/2022] [Accepted: 07/08/2022] [Indexed: 01/25/2023] Open
Abstract
In recent studies, small cell lung cancer (SCLC) treatment guidelines based on Veterans' Administration Lung Study Group limited/extensive disease staging and resulted in broad and inseparable prognostic subgroups. Evidence suggests that the eight versions of tumor, node, and metastasis (TNM) staging can play an important role to address this issue. The aim of the present study was to improve the detection of prognostic subgroups from a real-word data (RWD) cohort of patients and analyze their patterns using a development pipeline with thoracic oncologists and machine learning methods. The method detected subgroups of patients informing unsupervised learning (partition around medoids) including the impact of covariates on prognosis (Cox regression and random survival forest). An analysis was carried out using patients with SCLC (n = 636) with stage IIIA-IVB according to TNM classification. The analysis yielded k = 7 compacted and well-separated clusters of patients. Performance status (Eastern Cooperative Oncology Group-Performance Status), lactate dehydrogenase, spreading of metastasis, cancer stage, and CRP were the baselines that characterized the subgroups. The selected clustering method outperformed standard clustering techniques, which were not capable of detecting meaningful subgroups. From the analysis of cluster treatment decisions, we showed the potential of future RWD applications to understand disease, develop individualized therapies, and improve healthcare decision making.
Collapse
Affiliation(s)
- Luca Marzano
- Division of Health Informatics and LogisticsSchool of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), KTH Royal Institute of TechnologyHuddingeSweden
| | - Adam S. Darwich
- Division of Health Informatics and LogisticsSchool of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), KTH Royal Institute of TechnologyHuddingeSweden
| | - Salomon Tendler
- Department of Oncology‐PathologyKarolinska Institutet and the Thoracic Oncology Center, Karolinska University HospitalStockholmSweden
| | - Asaf Dan
- Department of Oncology‐PathologyKarolinska Institutet and the Thoracic Oncology Center, Karolinska University HospitalStockholmSweden
| | - Rolf Lewensohn
- Department of Oncology‐PathologyKarolinska Institutet and the Thoracic Oncology Center, Karolinska University HospitalStockholmSweden
| | - Luigi De Petris
- Department of Oncology‐PathologyKarolinska Institutet and the Thoracic Oncology Center, Karolinska University HospitalStockholmSweden
| | - Jayanth Raghothama
- Division of Health Informatics and LogisticsSchool of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), KTH Royal Institute of TechnologyHuddingeSweden
| | - Sebastiaan Meijer
- Division of Health Informatics and LogisticsSchool of Engineering Sciences in Chemistry, Biotechnology and Health (CBH), KTH Royal Institute of TechnologyHuddingeSweden
| |
Collapse
|