1
|
Asatryan MN, Shmyr IS, Timofeev BI, Shcherbinin DN, Agasaryan VG, Timofeeva TA, Ershov IF, Gerasimuk ER, Nozdracheva AV, Semenenko TA, Logunov DY, Gintsburg AL. Development, study, and comparison of models of cross-immunity to the influenza virus using statistical methods and machine learning. Vopr Virusol 2024; 69:349-362. [PMID: 39361928 DOI: 10.36233/0507-4088-250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Indexed: 10/05/2024]
Abstract
INTRODUCTION The World Health Organization considers the values of antibody titers in the hemagglutination inhibition assay as one of the most important criteria for assessing successful vaccination. Mathematical modeling of cross-immunity allows for identification on a real-time basis of new antigenic variants, which is of paramount importance for human health. MATERIALS AND METHODS This study uses statistical methods and machine learning techniques from simple to complex: logistic regression model, random forest method, and gradient boosting. The calculations used the AAindex matrices in parallel to the Hamming distance. The calculations were carried out with different types and values of antigenic escape thresholds, on four data sets. The results were compared using common binary classification metrics. RESULTS Significant differentiation is shown depending on the data sets used. The best results were demonstrated by all three models for the forecast autumn season of 2022, which were preliminary trained on the February season of the same year (Auroc 0.934; 0.958; 0.956, respectively). The lowest results were obtained for the entire forecast year 2023, they were set up on data from two seasons of 2022 (Aucroc 0.614; 0.658; 0.775). The dependence of the results on the types of thresholds used and their values turned out to be insignificant. The additional use of AAindex matrices did not significantly improve the results of the models without introducing significant deterioration. CONCLUSION More complex models show better results. When developing cross-immunity models, testing on a variety of data sets is important to make strong claims about their prognostic robustness.
Collapse
Affiliation(s)
- M N Asatryan
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - I S Shmyr
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - B I Timofeev
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - D N Shcherbinin
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - V G Agasaryan
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - T A Timofeeva
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - I F Ershov
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - E R Gerasimuk
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
- State University «Dubna»
| | - A V Nozdracheva
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - T A Semenenko
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - D Y Logunov
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| | - A L Gintsburg
- National Research Center for Epidemiology and Microbiology named after Honorary Academician N.F. Gamaleya
| |
Collapse
|
2
|
Essay P, Rajasekharan A. Robust diagnosis recommendation system for Primary Care Telemedicine using long short-term memory multi-class sequence classification. Heliyon 2024; 10:e26770. [PMID: 38510056 PMCID: PMC10950495 DOI: 10.1016/j.heliyon.2024.e26770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 02/12/2024] [Accepted: 02/20/2024] [Indexed: 03/22/2024] Open
Abstract
Background Telemedicine offers opportunity for robust diagnoses recommendations to support healthcare providers intra-consultation in a way that does not limit providers ability to explore diagnostic codes and make the most appropriate selection for each consultation. Objective The objective of this work was to develop a recommendation system for ICD-10 coding using multiclass sequence classification and deep learning. The recommendations are intended to support telemedicine clinicians in making timely and appropriate diagnosis selections. The recommendations allow clinicians to find and select the best diagnosis code much quicker and without leaving the telemedicine platform to search codes and code descriptions. Methods We developed an LSTM model for multi-class text sequence classification to make diagnosis recommendations. The LSTM recommender used text-based symptoms, complaints, and consultation request reasons as model inputs. Data were extracted from a live telemedicine platform which spans general medicine, dermatology, and mental health clinical specialties. A popularity-based model was used for baseline comparison. Results Using over 2.8 MM telemedicine consultations during 2021 and 2022, our LSTM recommender average accuracy was 31.7%. LSTM recommender average coverage in the top 20 recommended diagnoses was 85.8% with an average personalization score of 0.87. Conclusions LSTM multi-class sequence classification recommends diagnoses specific to individual consultations, is retrainable on regular intervals, and could improve diagnoses recommendations such that providers require less time and resources searching for diagnosis codes. In addition, the LSTM recommender is robust enough to make recommendations across clinical specialties such as general medicine, dermatology, and mental health.
Collapse
Affiliation(s)
- Patrick Essay
- Teladoc Health, Inc, 1875 Lawrence St, Denver, CO, 80202, USA
| | | |
Collapse
|
3
|
Slade E, Rennick-Egglestone S, Ng F, Kotera Y, Llewellyn-Beardsley J, Newby C, Glover T, Keppens J, Slade M. The Implementation of Recommender Systems for Mental Health Recovery Narratives: Evaluation of Use and Performance. JMIR Ment Health 2024; 11:e45754. [PMID: 38551630 PMCID: PMC11015364 DOI: 10.2196/45754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 09/11/2023] [Accepted: 02/15/2024] [Indexed: 04/02/2024] Open
Abstract
BACKGROUND Recommender systems help narrow down a large range of items to a smaller, personalized set. NarraGive is a first-in-field hybrid recommender system for mental health recovery narratives, recommending narratives based on their content and narrator characteristics (using content-based filtering) and on narratives beneficially impacting other similar users (using collaborative filtering). NarraGive is integrated into the Narrative Experiences Online (NEON) intervention, a web application providing access to the NEON Collection of recovery narratives. OBJECTIVE This study aims to analyze the 3 recommender system algorithms used in NarraGive to inform future interventions using recommender systems for lived experience narratives. METHODS Using a recently published framework for evaluating recommender systems to structure the analysis, we compared the content-based filtering algorithm and collaborative filtering algorithms by evaluating the accuracy (how close the predicted ratings are to the true ratings), precision (the proportion of the recommended narratives that are relevant), diversity (how diverse the recommended narratives are), coverage (the proportion of all available narratives that can be recommended), and unfairness (whether the algorithms produce less accurate predictions for disadvantaged participants) across gender and ethnicity. We used data from all participants in 2 parallel-group, waitlist control clinical trials of the NEON intervention (NEON trial: N=739; NEON for other [eg, nonpsychosis] mental health problems [NEON-O] trial: N=1023). Both trials included people with self-reported mental health problems who had and had not used statutory mental health services. In addition, NEON trial participants had experienced self-reported psychosis in the previous 5 years. Our evaluation used a database of Likert-scale narrative ratings provided by trial participants in response to validated narrative feedback questions. RESULTS Participants from the NEON and NEON-O trials provided 2288 and 1896 narrative ratings, respectively. Each rated narrative had a median of 3 ratings and 2 ratings, respectively. For the NEON trial, the content-based filtering algorithm performed better for coverage; the collaborative filtering algorithms performed better for accuracy, diversity, and unfairness across both gender and ethnicity; and neither algorithm performed better for precision. For the NEON-O trial, the content-based filtering algorithm did not perform better on any metric; the collaborative filtering algorithms performed better on accuracy and unfairness across both gender and ethnicity; and neither algorithm performed better for precision, diversity, or coverage. CONCLUSIONS Clinical population may be associated with recommender system performance. Recommender systems are susceptible to a wide range of undesirable biases. Approaches to mitigating these include providing enough initial data for the recommender system (to prevent overfitting), ensuring that items can be accessed outside the recommender system (to prevent a feedback loop between accessed items and recommended items), and encouraging participants to provide feedback on every narrative they interact with (to prevent participants from only providing feedback when they have strong opinions).
Collapse
Affiliation(s)
- Emily Slade
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
- Department of Computer Science, University of Oxford, Oxford, United Kingdom
| | - Stefan Rennick-Egglestone
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
| | - Fiona Ng
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
| | - Yasuhiro Kotera
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
- Center for Infectious Disease Education and Research, Osaka University, Osaka, Japan
| | - Joy Llewellyn-Beardsley
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
| | - Chris Newby
- School of Medicine, University of Nottingham, Nottingham, United Kingdom
| | - Tony Glover
- DRT Software Ltd., Nottingham, United Kingdom
| | - Jeroen Keppens
- Department of Informatics, King's College London, London, United Kingdom
| | - Mike Slade
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
- Nord University, Faculty of Nursing and Health Sciences, Namsos, Norway
| |
Collapse
|
4
|
Chinnasamy P, Wong WK, Raja AA, Khalaf OI, Kiran A, Babu JC. Health Recommendation System using Deep Learning-based Collaborative Filtering. Heliyon 2023; 9:e22844. [PMID: 38144343 PMCID: PMC10746410 DOI: 10.1016/j.heliyon.2023.e22844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 11/16/2023] [Accepted: 11/21/2023] [Indexed: 12/26/2023] Open
Abstract
The crucial aspect of the medical sector is healthcare in today's modern society. To analyze a massive quantity of medical information, a medical system is necessary to gain additional perspectives and facilitate prediction and diagnosis. This device should be intelligent enough to analyze a patient's state of health through social activities, individual health information, and behavior analysis. The Health Recommendation System (HRS) has become an essential mechanism for medical care. In this sense, efficient healthcare networks are critical for medical decision-making processes. The fundamental purpose is to maintain that sensitive information can be shared only at the right moment while guaranteeing the effectiveness of data, authenticity, security, and legal concerns. As some people use social media to recognize their medical problems, healthcare recommendation systems need to generate findings like diagnosis recommendations, medical insurance, medical passageway-based care strategies, and homeopathic remedies associated with a patient's health status. New studies aimed at the use of vast numbers of health information by integrating multidisciplinary data from various sources are addressed, which also decreases the burden and health care costs. This article presents a recommended intelligent HRS using the deep learning system of the Restricted Boltzmann Machine (RBM)-Coevolutionary Neural Network (CNN) that provides insights on how data mining techniques could be used to introduce an efficient and effective health recommendation systems engine and highlights the pharmaceutical industry's ability to translate from either a conventional scenario towards a more personalized. We developed our proposed system using TensorFlow and Python. We evaluate the suggested method's performance using distinct error quantities compared to alternative methods using the health care dataset. Furthermore, the suggested approach's accuracy, precision, recall, and F-measure were compared with the current methods.
Collapse
Affiliation(s)
- P. Chinnasamy
- Department of Computer Science and Engineering, MLR Institute of Technology, Hyderabad, India
| | | | - A. Ambeth Raja
- PG Department of Computer Science, Thiruthangal Nadar College, Chennai, 600051, India
| | - Osamah Ibrahim Khalaf
- Department of Solar, Al-Nahrain Research Center for Renewable Energy, Al-Nahrain University, Jadriya, Baghdad, Iraq
| | - Ajmeera Kiran
- Department of Computer Science and Engineering, MLR Institute of Technology, Hyderabad, Telangana, 500043, India
| | - J. Chinna Babu
- Department of Electronics and Communication Engineering, Annamacharya Institute of Technology and Sciences, Rajampet, AP, India
| |
Collapse
|
5
|
Wu Y, Zhang L, Bhatti UA, Huang M. Interpretable Machine Learning for Personalized Medical Recommendations: A LIME-Based Approach. Diagnostics (Basel) 2023; 13:2681. [PMID: 37627940 PMCID: PMC10453635 DOI: 10.3390/diagnostics13162681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/10/2023] [Accepted: 08/10/2023] [Indexed: 08/27/2023] Open
Abstract
Chronic diseases are increasingly major threats to older persons, seriously affecting their physical health and well-being. Hospitals have accumulated a wealth of health-related data, including patients' test reports, treatment histories, and diagnostic records, to better understand patients' health, safety, and disease progression. Extracting relevant information from this data enables physicians to provide personalized patient-treatment recommendations. While collaborative filtering techniques and classical algorithms such as naive Bayes, logistic regression, and decision trees have had notable success in health-recommendation systems, most current systems primarily inform users of their likely preferences without providing explanations. This paper proposes an approach of deep learning with a local interpretable model-agnostic explanations (LIME)-based interpretable recommendation system to solve this problem. Specifically, we apply the proposed approach to two chronic diseases common in older adults: heart disease and diabetes. After data preprocessing, we use six deep-learning algorithms to form interpretations. In the heart-disease data set, the actual model recommendation of multi-layer perceptron and gradient-boosting algorithm differs from the local model's recommendation of LIME, which can be used as its approximate prediction. From the feature importance of these two algorithms, it can be seen that the CholCheck, GenHith, and HighBP features are the most important for predicting heart disease. In the diabetes data set, the actual model predictions of the multi-layer perceptron and logistic-regression algorithm were little different from the local model's prediction of LIME, which can be used as its approximate recommendation. Moreover, from the feature importance of the two algorithms, it can be seen that the three features of glucose, BMI, and age were the most important for predicting heart disease. Next, LIME is used to determine the importance of each feature that affected the results of the calculated model. Subsequently, we present the contribution coefficients of these features to the final recommendation. By analyzing the impact of different patient characteristics on the recommendations, our proposed system elucidates the underlying reasons behind these recommendations and enhances patient trust. This approach has important implications for medical recommendation systems and encourages informed decision-making in healthcare.
Collapse
Affiliation(s)
| | | | - Uzair Aslam Bhatti
- School of Information and Communication Engineering, Hainan University, Haikou 570100, China; (Y.W.); (L.Z.)
| | - Mengxing Huang
- School of Information and Communication Engineering, Hainan University, Haikou 570100, China; (Y.W.); (L.Z.)
| |
Collapse
|
6
|
Ling D, Liu A, Sun J, Wang Y, Wang L, Song X, Zhao X. Integration of IDPC Clustering Analysis and Interpretable Machine Learning for Survival Risk Prediction of Patients with ESCC. Interdiscip Sci 2023:10.1007/s12539-023-00569-9. [PMID: 37248421 DOI: 10.1007/s12539-023-00569-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Revised: 04/26/2023] [Accepted: 04/26/2023] [Indexed: 05/31/2023]
Abstract
Precise forecasting of survival risk plays a pivotal role in comprehending and predicting the prognosis of patients afflicted with esophageal squamous cell carcinoma (ESCC). The existing methods have the problems of insufficient fitting ability and poor interpretability. To address this issue, this work proposes a novel interpretable survival risk prediction method for ESCC patients based on extreme gradient boosting improved by whale optimization algorithm (WOA-XGBoost) and shapley additive explanations (SHAP). Given the imbalanced nature of the data set, the adaptive synthetic sampling (ADASYN) is first used to generate the samples with high survival risk. Then, an improved clustering by fast search and find of density peaks (IDPC) algorithm based on cosine distance and K nearest neighbors is used to cluster the patients. Next, the prediction model for each cluster is obtained by WOA-XGBoost and the constructed model is visualized with SHAP to uncover the factors hidden in the structured model and improve the interpretability of the black-box model. Finally, the effectiveness of the proposed scheme is demonstrated by analyzing the data collected from the First Affiliated Hospital of Zhengzhou University. The results of the analysis reveal that the proposed methodology exhibits superior performance, as indicated by the area under the receiver operating characteristic curve (AUROC) of 0.918 and accuracy of 0.881.
Collapse
Affiliation(s)
- Dan Ling
- Henan Key Lab of Information-Based Electrical Appliances, Zhengzhou University of Light Industry, Zhengzhou, 450002, China
| | - Anhao Liu
- Henan Key Lab of Information-Based Electrical Appliances, Zhengzhou University of Light Industry, Zhengzhou, 450002, China
| | - Junwei Sun
- Henan Key Lab of Information-Based Electrical Appliances, Zhengzhou University of Light Industry, Zhengzhou, 450002, China
| | - Yanfeng Wang
- Henan Key Lab of Information-Based Electrical Appliances, Zhengzhou University of Light Industry, Zhengzhou, 450002, China.
| | - Lidong Wang
- State Key Laboratory of Esophageal Cancer Prevention and Treatment and Henan Key Laboratory for Esophageal Cancer Research of The First Affiliated Hospital, Zhengzhou University, Zhengzhou, 450052, China
| | - Xin Song
- State Key Laboratory of Esophageal Cancer Prevention and Treatment and Henan Key Laboratory for Esophageal Cancer Research of The First Affiliated Hospital, Zhengzhou University, Zhengzhou, 450052, China
| | - Xueke Zhao
- State Key Laboratory of Esophageal Cancer Prevention and Treatment and Henan Key Laboratory for Esophageal Cancer Research of The First Affiliated Hospital, Zhengzhou University, Zhengzhou, 450052, China
| |
Collapse
|
7
|
Ali MH, Jaber MM, Abd SK, Alkhayyat A, Jasim AD. Artificial Neural Network-Based Medical Diagnostics and Therapeutics. INT J PATTERN RECOGN 2022; 36. [DOI: 10.1142/s0218001422400079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
The advancement of healthcare technology is impossible without machine learning (ML). There have been numerous advances in ML to analyze, predict, and diagnose medical data. Integrating a centralized scheme and therapy for classifying and diagnosing illnesses and disorders is a major obstacle in modern healthcare. To standardize all medical data into a single repository, researchers have proposed using ML using the centralized artificial neural network model (ML-CANNM). Random tree, support vector machine, and gradient booster are just a few proposed ML classifiers. Artificial neural networks (ANNs) have been trained using a variety of medical datasets to predict and analyze outcomes. ML-CANNM collects patient data from various studies and uses ML and ANNs to determine the results. Three layers make up an ANN. ML is used to classify the given patients’ data in the input layer. In the hidden layer, classification data are compared to a training dataset. The output layer’s job is to identify, classify, and diagnose diseases. As a result, disease diagnosis and detection are integrated into a single healthcare database. The proposed framework has proven that ML-CANNM works with more accuracy and lesser execution time. Thus, the numerical outcome suggested ML-CANNM increased accuracy ratio of 99.2% and a prediction ratio of 97.5%. The findings further show that the execution time is enhanced by less than 2[Formula: see text]h, decision table using ML and results in an efficiency ratio of 97.5%.
Collapse
Affiliation(s)
- Mohammed Hasan Ali
- Computer Techniques Engineering Department, Faculty of Information Technology, Imam Ja’afar Al-Sadiq University, Najaf 10023, Iraq
| | - Mustafa Musa Jaber
- Department of Computer Science, Al-Turath University College, Baghdad, Iraq
- Department of Medical Instruments Engineering Techniques, Al-Farahidi University, Baghdad, Iraq
| | - Sura Khalil Abd
- Department of Computer Science, Dijlah University College, Baghdad 10021, Iraq
| | - Ahmed Alkhayyat
- Department of Computer Engineering Techniques, College of Technical Engineering, The Islamic University, Najaf, Iraq
| | - Abdali Dakhil Jasim
- English Language Department, Al-Mustaqbal University College, Hillah 51001, Iraq
| |
Collapse
|
8
|
Hai T, Zhou J, Srividhya SR, Jain SK, Young P, Agrawal S. BVFLEMR: an integrated federated learning and blockchain technology for cloud-based medical records recommendation system. JOURNAL OF CLOUD COMPUTING: ADVANCES, SYSTEMS AND APPLICATIONS 2022. [DOI: 10.1186/s13677-022-00294-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
AbstractBlockchain is the latest boon in the world which handles mainly banking and finance. The blockchain is also used in the healthcare management system for effective maintenance of electronic health and medical records. The technology ensures security, privacy, and immutability. Federated Learning is a revolutionary learning technique in deep learning, which supports learning from the distributed environment. This work proposes a framework by integrating the blockchain and Federated Deep Learning in order to provide a tailored recommendation system. The work focuses on two modules of blockchain-based storage for electronic health records, where the blockchain uses a Hyperledger fabric and is capable of continuously monitoring and tracking the updates in the Electronic Health Records in the cloud server. In the second module, LightGBM and N-Gram models are used in the collaborative learning module to recommend a tailored treatment for the patient’s cloud-based database after analyzing the EHR. The work shows good accuracy. Several metrics like precision, recall, and F1 scores are measured showing its effective utilization in the cloud database security.
Collapse
|
9
|
Cai Y, Yu F, Kumar M, Gladney R, Mostafa J. Health Recommender Systems Development, Usage, and Evaluation from 2010 to 2022: A Scoping Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:15115. [PMID: 36429832 PMCID: PMC9690602 DOI: 10.3390/ijerph192215115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 11/10/2022] [Accepted: 11/13/2022] [Indexed: 06/16/2023]
Abstract
A health recommender system (HRS) provides a user with personalized medical information based on the user's health profile. This scoping review aims to identify and summarize the HRS development in the most recent decade by focusing on five key aspects: health domain, user, recommended item, recommendation technology, and system evaluation. We searched PubMed, ACM Digital Library, IEEE Xplore, Web of Science, and Scopus databases for English literature published between 2010 and 2022. Our study selection and data extraction followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews. The following are the primary results: sixty-three studies met the eligibility criteria and were included in the data analysis. These studies involved twenty-four health domains, with both patients and the general public as target users and ten major recommended items. The most adopted algorithm of recommendation technologies was the knowledge-based approach. In addition, fifty-nine studies reported system evaluations, in which two types of evaluation methods and three categories of metrics were applied. However, despite existing research progress on HRSs, the health domains, recommended items, and sample size of system evaluation have been limited. In the future, HRS research shall focus on dynamic user modelling, utilizing open-source knowledge bases, and evaluating the efficacy of HRSs using a large sample size. In conclusion, this study summarized the research activities and evidence pertinent to HRSs in the most recent ten years and identified gaps in the existing research landscape. Further work shall address the gaps and continue improving the performance of HRSs to empower users in terms of healthcare decision making and self-management.
Collapse
Affiliation(s)
- Yao Cai
- School of Information and Library Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- School of Economics and Management, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Fei Yu
- School of Information and Library Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Carolina Health Informatics Program, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Manish Kumar
- Public Health Leadership Program, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Roderick Gladney
- Carolina Health Informatics Program, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Javed Mostafa
- School of Information and Library Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Carolina Health Informatics Program, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
10
|
Cao J, Chen J, Zhang X, Peng Y. Diabetic retinopathy classification based on dense connectivity and asymmetric convolutional neural network. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07952-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
11
|
Knowledge distillation for multi-depth-model-fusion recommendation algorithm. PLoS One 2022; 17:e0275955. [PMID: 36282818 PMCID: PMC9595540 DOI: 10.1371/journal.pone.0275955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 09/27/2022] [Indexed: 11/06/2022] Open
Abstract
Recommendation algorithms save a lot of valuable time for people to get the information they are interested in. However, the feature calculation and extraction process of each machine learning or deep learning recommendation algorithm are different, so how to obtain various features with different dimensions, i.e., how to integrate the advantages of each model and improve the model inference efficiency, becomes the focus of this paper. In this paper, a better deep learning model is obtained by integrating several cutting-edge deep learning models. Meanwhile, to make the integrated learning model converge better and faster, the parameters of the integrated module are initialized, constraints are imposed, and a new activation function is designed for better integration of the sub-models. Finally, the integrated large model is distilled for knowledge distillation, which greatly reduces the number of model parameters and improves the model inference efficiency.
Collapse
|
12
|
Detection of Subarachnoid Hemorrhage in Computed Tomography Using Association Rules Mining. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1133819. [PMID: 36093508 PMCID: PMC9451997 DOI: 10.1155/2022/1133819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 07/13/2022] [Accepted: 07/22/2022] [Indexed: 11/17/2022]
Abstract
Subarachnoid hemorrhage (SAH) is one of the serious strokes of cerebrovascular accidents. There is an approx. 15% probability of spontaneous subarachnoid hemorrhage in all acute cerebrovascular accidents (CVAs). Most spontaneous subarachnoid hemorrhages are caused by ruptures of intracranial aneurysms, accounting for about 85% of all occurrences. About 15% of acute cerebrovascular disorders are caused by spontaneous subarachnoid hemorrhage. This illness is mostly caused by brain/spinal arteriovenous malformations, extracranial aneurysms, and hypertension. Computed tomography (CT) scan is the common diagnostic modality to evaluate SAH, but it is very difficult to identify the abnormality. Thus, automatic detection of SAH is required to recognize the early signs and symptoms of SAH and to provide appropriate therapeutic intervention and treatment. In this article, the gray-level cooccurrence matrix (GLCM) is used to extract useful features from CT images. Then, the New Association Classification Frequent Pattern (NCFP-growth) algorithm is applied, which is based on association rules. Then, it is compared with FP-growth methods with association rules and FP-growth methods without association rules. The experimental results indicate that the suggested approach outperforms in terms of classification accuracy. The proposed approach equates to a 95.2% accuracy rate compared to the conventional data mining algorithm.
Collapse
|
13
|
Min X, Li W, Yang J, Xie W, Zhao D. Dual-level diagnostic feature learning with recurrent neural networks for treatment sequence recommendation. J Biomed Inform 2022; 134:104165. [PMID: 36038067 DOI: 10.1016/j.jbi.2022.104165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Revised: 07/22/2022] [Accepted: 08/11/2022] [Indexed: 10/15/2022]
Abstract
In recent years, the massive electronic medical records (EMRs) have supported the development of intelligent medical services such as treatment recommendations. However, existing treatment recommendations usually follow the traditional sequential recommendation strategies, ignoring the partial temporality of the practical process and the patient's diagnostic features. To this end, in this paper, we propose a new Dual-level Diagnostic Feature Learning with Recurrent Neural Network for treatment sequence recommendation (DDFL-RNN), where the dual-level diagnostic features including patients' historical medical records and current treatment results. Firstly, we divide the dataset into several sequential sets of treatment item based on the patient's treatment days. Secondly, we propose two kinds of attention mechanisms to learn diagnostic features, including the elemental attention mechanism and the sequential attention mechanism. Finally, the dual-level learned diagnostic features are brought into the recurrent neural network for encoding and recommendation. Extensive experiments on a breast cancer dataset from a first-rate hospital have shown that our model achieves significantly better performance than several classical and state-of-the-art baseline methods.
Collapse
Affiliation(s)
- Xin Min
- School of Computer Science and Engineering, Northeastern University, Shenyang, 110000, China.
| | - Wei Li
- School of Computer Science and Engineering, Northeastern University, Shenyang, 110000, China; Key Laboratory of Intelligent Computing in Medical Image (MIIC), Northeastern University, Ministry of Education, Shenyang, 110000, China.
| | - Jinzhao Yang
- School of Computer Science and Engineering, Northeastern University, Shenyang, 110000, China.
| | - Weidong Xie
- School of Computer Science and Engineering, Northeastern University, Shenyang, 110000, China.
| | - Dazhe Zhao
- School of Computer Science and Engineering, Northeastern University, Shenyang, 110000, China; Key Laboratory of Intelligent Computing in Medical Image (MIIC), Northeastern University, Ministry of Education, Shenyang, 110000, China.
| |
Collapse
|
14
|
Jha R, Bhattacharjee V, Mustafi A, Sahana SK. Improved disease diagnosis system for COVID-19 with data refactoring and handling methods. Front Psychol 2022; 13:951027. [PMID: 36033018 PMCID: PMC9416861 DOI: 10.3389/fpsyg.2022.951027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 07/19/2022] [Indexed: 12/15/2022] Open
Abstract
The novel coronavirus illness (COVID-19) outbreak, which began in a seafood market in Wuhan, Hubei Province, China, in mid-December 2019, has spread to almost all countries, territories, and places throughout the world. And since the fault in diagnosis of a disease causes a psychological impact, this was very much visible in the spread of COVID-19. This research aims to address this issue by providing a better solution for diagnosis of the COVID-19 disease. The paper also addresses a very important issue of having less data for disease prediction models by elaborating on data handling techniques. Thus, special focus has been given on data processing and handling, with an aim to develop an improved machine learning model for diagnosis of COVID-19. Random Forest (RF), Decision tree (DT), K-Nearest Neighbor (KNN), Logistic Regression (LR), Support vector machine, and Deep Neural network (DNN) models are developed using the Hospital Israelita Albert Einstein (in São Paulo, Brazil) dataset to diagnose COVID-19. The dataset is pre-processed and distributed DT is applied to rank the features. Data augmentation has been applied to generate datasets for improving classification accuracy. The DNN model dominates overall techniques giving the highest accuracy of 96.99%, recall of 96.98%, and precision of 96.94%, which is better than or comparable to other research work. All the algorithms are implemented in a distributed environment on the Spark platform.
Collapse
Affiliation(s)
| | | | | | - Sudip Kumar Sahana
- Department of Computer Science and Engineering, Birla Institute of Technology, Mesra, Ranchi, India
| |
Collapse
|
15
|
|
16
|
Huang S, Wan X, Qiu H, Li L, Yu H. Constrained optimization for stratified treatment rules with multiple responses of survival data. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.02.047] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
17
|
A Fair and Safe Usage Drug Recommendation System in Medical Emergencies by a Stacked ANN. ALGORITHMS 2022. [DOI: 10.3390/a15060186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
The importance of online recommender systems for drugs, medical professionals, and hospitals is growing. Today, the majority of people use online consultations for drug recommendations for all types of health issues. Emergencies such as pandemics, floods, or cyclones can be helped by the medical recommender system. In the era of machine learning (ML), recommender systems produce more accurate, quick, and reliable clinical predictions with minimal costs. As a result, these systems maintain better performance, integrity, and privacy of patient data in the decision-making process and provide precise information at any time. Therefore, we present drug recommender systems with a stacked artificial neural network (ANN) model to improve the fairness and safety of treatment for infectious diseases. To reduce side effects, drugs are recommended based on a patient’s previous health profile, lifestyle, and habits. The proposed system produced results with 97.5% accuracy. A system such as this could be useful in recommending safe medicines to patients, especially during health emergencies.
Collapse
|
18
|
Lu W, Zhai Y. Self-Adaptive Telemedicine Specialist Recommendation Considering Specialist Activity and Patient Feedback. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19095594. [PMID: 35564988 PMCID: PMC9101090 DOI: 10.3390/ijerph19095594] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 04/24/2022] [Accepted: 04/27/2022] [Indexed: 12/10/2022]
Abstract
Purpose: With the rapid development of medical informatization, information overload and asymmetry have become major obstacles that limit patients' ability to find appropriate telemedicine specialists. Although doctor recommendation methods have been proposed, they fail to address data sparsity and cold-start issues, and electronic medical records (EMRs), patient preferences, potential interest of service providers and the changes over time are largely under-explored. Therefore, this study develops a self-adaptive telemedicine specialist recommendation method that incorporates specialist activity and patient utility feedback from the perspective of privacy protection to fill the research gaps. Methods: First, text vectorization, view similarity and probabilistic topic model are used to construct the patient and specialist feature models based on patients' EMRs and specialists' long- and short-term knowledge backgrounds, respectively. Second, the recommended specialist candidate set and recommendation index are obtained based on the similarity between patient features. Then, the specialist long-term knowledge feature model is used to update the newly registered specialist recommendation index and the recommended specialist candidate set to overcome the data sparsity and cold-start issues, and the specialist short-term knowledge feature model is adopted to extend the recommended specialist candidate set at the semantic level. Finally, we introduce the specialists' activity and patients' perceived utility feedback mechanism to construct a closed-loop adjusted and optimized specialist recommendation method. Results: An empirical study was conducted integrating EMRs of telemedicine patients from the National Telemedicine Center of China and specialists' profiles and ratings from an online healthcare platform. The proposed method successfully recommended relevant and active telemedicine specialists to the target patient, and increased the recommended opportunities for newly registered specialists to some extent. Conclusions: The proposed method emphasizes the adaptability and acceptability of the recommended results while ensuring their accuracy and relevance. Specialists' activity and patients' perceived utility jointly contribute to the acceptability of recommended results, and the recommendation strategy achieves the organic fusion of the two. Several comparative experiments demonstrate the effectiveness and operability of the hybrid recommendation strategy under the premise of data sparsity and privacy protection, enabling effective matching of patients' demand and service providers' capabilities, and providing beneficial insights for data-driven telemedicine services.
Collapse
Affiliation(s)
- Wei Lu
- School of Management Engineering, Zhengzhou University, Zhengzhou 450001, China;
| | - Yunkai Zhai
- School of Management Engineering, Zhengzhou University, Zhengzhou 450001, China;
- National Engineering Laboratory for Internet Medical Systems and Applications, Zhengzhou 450052, China
- Correspondence:
| |
Collapse
|
19
|
Bhatia M, Manocha A, Ahanger TA, Alqahtani A. Artificial intelligence-inspired comprehensive framework for Covid-19 outbreak control. Artif Intell Med 2022; 127:102288. [PMID: 35430039 PMCID: PMC8956352 DOI: 10.1016/j.artmed.2022.102288] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 03/19/2022] [Accepted: 03/22/2022] [Indexed: 12/18/2022]
Abstract
COVID-19 is a life-threatening contagious virus that has spread across the globe rapidly. To reduce the outbreak impact of COVID-19 virus illness, continual identification and remote surveillance of patients are essential. Medical service delivery based on the Internet of Things (IoT) technology backed up by the fog-cloud paradigm is an efficient and time-sensitive solution for remote patient surveillance. Conspicuously, a comprehensive framework based on Radio Frequency Identification Device (RFID) and body-wearable sensor technologies supported by the fog-cloud platform is proposed for the identification and management of COVID-19 patients. The J48 decision tree is used to assess the infection degree of the user based on corresponding symptoms. RFID is used to detect Temporal Proximity Interactions (TPI) among users. Using TPI quantification, Temporal Network Analysis is used to analyze and track the current stage of the COVID-19 spread. The statistical performance and accuracy of the framework are assessed by utilizing synthetically-generated data for 250,000 users. Based on the comparative analysis, the proposed framework acquired an enhanced measure of classification accuracy, and sensitivity of 96.68% and 94.65% respectively. Moreover, significant improvement has been registered for proposed fog-cloud-based data analysis in terms of Temporal Delay efficacy, Precision, and F-measure.
Collapse
Affiliation(s)
- Munish Bhatia
- Department of Computer Science and Engineering, Lovely Professional University, India.
| | - Ankush Manocha
- Department of Computer Applications, Lovely Professional University, India
| | - Tariq Ahamed Ahanger
- College of Computer Engineering and Science, Prince Sattam Bin ABdulaziz University, Al-Kharj, Saudi Arabia.
| | - Abdullah Alqahtani
- College of Computer Engineering and Science, Prince Sattam Bin ABdulaziz University, Al-Kharj, Saudi Arabia.
| |
Collapse
|
20
|
Chen X, Jin W, Wu Q, Zhang W, Liang H. A hybrid cost-sensitive machine learning approach for the classification of intelligent disease diagnosis. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-213486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Automatic risk classification of diseases is one of the most significant health problems in medical and healthcare domain. However, the related studies are relative scarce. In this paper, we design an intelligent diagnosis model based on optimal machine learning algorithms with rich clinical data. First, the disease risk classification problem based on machine learning is defined. Then, the K-means clustering algorithm is used to validate the class label of given data, thereby removing misclassified instances from the original dataset. Furthermore, naive Bayesian algorithm is applied to build the final classifier by using 10-fold cross-validation method. In addition, a novel class-specific attribute weighted approach is adopted to alleviate the conditional independence assumption of naive Bayes, which means we assign each disease attribute a specific weight for each class. Last but not least, a hybrid cost-sensitive disease risk classification model is formulated, and a practical example from the University of California Irvine (UCI) machine learning database is used to illustrate the potential of the proposed method. Experimental results demonstrate that the approach is competitive with the state-of-the-art classifiers.
Collapse
Affiliation(s)
- Xi Chen
- School of Economics & Management, Xidian University, Xi’an, China
| | - Wenquan Jin
- School of Economics & Management, Xidian University, Xi’an, China
| | - Qirui Wu
- School of Foreign Languages, Xidian University, China
| | - Wenbo Zhang
- School of Economics & Management, Xidian University, Xi’an, China
| | | |
Collapse
|
21
|
Lee S, Lee E, Park SS, Park MS, Jung J, Min GJ, Park S, Lee SE, Cho BS, Eom KS, Kim YJ, Lee S, Kim HJ, Min CK, Cho SG, Lee JW, Hwang HJ, Yoon JH. Prediction and recommendation by machine learning through repetitive internal validation for hepatic veno-occlusive disease/sinusoidal obstruction syndrome and early death after allogeneic hematopoietic cell transplantation. Bone Marrow Transplant 2022; 57:538-546. [PMID: 35075247 DOI: 10.1038/s41409-022-01583-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 01/12/2022] [Accepted: 01/13/2022] [Indexed: 12/23/2022]
Abstract
Using traditional statistical methods, we previously analyzed the risk factors and treatment outcomes of veno-occlusive disease/sinusoidal obstruction syndrome (VOD/SOS) after allogeneic hematopoietic cell transplantation. Within the same cohort, we applied machine learning to create prediction and recommendation models. We analyzed 2572 transplants using eXtreme Gradient Boosting (XGBoost) to predict post-transplant VOD/SOS and early death. Using the XGBoost and SHapley Additive exPlanations (SHAP), we found influential factors and devised recommendation models, which were internally verified by repetitive ten-fold cross-validation. SHAP values suggested that gender, busulfan dosage, age, forced expiratory volume, and Disease Risk Index were significant factors for VOD/SOS. The areas under the receiver operating characteristic curves and the areas under the precision-recall curve of the models were 0.740, 0.144 for all VOD/SOS, 0.793, 0.793 for severe to very severe VOD/SOS, and 0.746, 0.304 for early death. According to our single feature recommendation, following the busulfan dosage was the most effective for preventing VOD/SOS. The recommendation method for six adjustable feature sets was also validated, and a subgroup corresponding to five to six features showed significant preventive power for VOD/SOS and early death. Our personalized treatment set recommendation showed reproducibility in repetitive internal validation, but large external cohorts should prospectively validate our model.
Collapse
Affiliation(s)
| | - Eunsaem Lee
- Department of Mathematics, Pohang University of Science and Technology (POSTECH), Pohang, Gyeongbuk, Korea
| | - Sung-Soo Park
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Min Sue Park
- Department of Mathematics, Pohang University of Science and Technology (POSTECH), Pohang, Gyeongbuk, Korea
| | | | - Gi June Min
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Silvia Park
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Sung-Eun Lee
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Byung-Sik Cho
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Ki-Seong Eom
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Yoo-Jin Kim
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Seok Lee
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Hee-Je Kim
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Chang-Ki Min
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Seok-Goo Cho
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Jong Wook Lee
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
| | - Hyung Ju Hwang
- AMSquare Corp., Pohang, Gyeongbuk, Korea.
- Department of Mathematics, Pohang University of Science and Technology (POSTECH), Pohang, Gyeongbuk, Korea.
| | - Jae-Ho Yoon
- Department of Hematology, Catholic Hematology Hospital and Leukemia Research Institute, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea.
| |
Collapse
|
22
|
A Survey of Recommendation Systems: Recommendation Models, Techniques, and Application Fields. ELECTRONICS 2022. [DOI: 10.3390/electronics11010141] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This paper reviews the research trends that link the advanced technical aspects of recommendation systems that are used in various service areas and the business aspects of these services. First, for a reliable analysis of recommendation models for recommendation systems, data mining technology, and related research by application service, more than 135 top-ranking articles and top-tier conferences published in Google Scholar between 2010 and 2021 were collected and reviewed. Based on this, studies on recommendation system models and the technology used in recommendation systems were systematized, and research trends by year were analyzed. In addition, the application service fields where recommendation systems were used were classified, and research on the recommendation system model and recommendation technique used in each field was analyzed. Furthermore, vast amounts of application service-related data used by recommendation systems were collected from 2010 to 2021 without taking the journal ranking into consideration and reviewed along with various recommendation system studies, as well as applied service field industry data. As a result of this study, it was found that the flow and quantitative growth of various detailed studies of recommendation systems interact with the business growth of the actual applied service field. While providing a comprehensive summary of recommendation systems, this study provides insight to many researchers interested in recommendation systems through the analysis of its various technologies and trends in the service field to which recommendation systems are applied.
Collapse
|
23
|
Efficacy prediction based on attribute and multi-source data collaborative for auxiliary medical system in developing countries. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06713-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
24
|
Paliwal S, Kumar Mishra A, Krishn Mishra R, Nawaz N, Senthilkumar M. XGBRS Framework Integrated with Word2Vec Sentiment Analysis for Augmented Drug Recommendation. COMPUTERS, MATERIALS & CONTINUA 2022; 72:5345-5362. [DOI: 10.32604/cmc.2022.025858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 02/16/2022] [Indexed: 09/15/2023]
|
25
|
Wang J, Zhang G, Wang W, Zhang K, Sheng Y. Cloud-based intelligent self-diagnosis and department recommendation service using Chinese medical BERT. JOURNAL OF CLOUD COMPUTING: ADVANCES, SYSTEMS AND APPLICATIONS 2021. [DOI: 10.1186/s13677-020-00218-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
AbstractWith the rapid development of hospital informatization and Internet medical service in recent years, most hospitals have launched online hospital appointment registration systems to remove patient queues and improve the efficiency of medical services. However, most of the patients lack professional medical knowledge and have no idea of how to choose department when registering. To instruct the patients to seek medical care and register effectively, we proposed CIDRS, an intelligent self-diagnosis and department recommendation framework based on Chinese medical Bidirectional Encoder Representations from Transformers (BERT) in the cloud computing environment. We also established a Chinese BERT model (CHMBERT) trained on a large-scale Chinese medical text corpus. This model was used to optimize self-diagnosis and department recommendation tasks. To solve the limited computing power of terminals, we deployed the proposed framework in a cloud computing environment based on container and micro-service technologies. Real-world medical datasets from hospitals were used in the experiments, and results showed that the proposed model was superior to the traditional deep learning models and other pre-trained language models in terms of performance.
Collapse
|
26
|
|
27
|
|
28
|
Ahmed SA, Nath B. Identification of adverse disease agents and risk analysis using frequent pattern mining. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.07.061] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
29
|
Cha S, Kim SS. Comorbidity Patterns of Mood Disorders in Adult Inpatients: Applying Association Rule Mining. Healthcare (Basel) 2021; 9:1155. [PMID: 34574929 PMCID: PMC8470302 DOI: 10.3390/healthcare9091155] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/20/2021] [Accepted: 08/30/2021] [Indexed: 11/26/2022] Open
Abstract
This study explored physical and psychiatric comorbidities of mood disorders using association rule mining. There were 7709 subjects who were patients (≥19 years old) diagnosed with mood disorders and included in the data collected by the Korean National Hospital Discharge In-depth Injury Survey (KNHDS) between 2006 and 2018. Physical comorbidities (46.17%) were higher than that of psychiatric comorbidities (27.28%). The frequent comorbidities of mood disorders (F30-F39) were hypertensive diseases (I10-I15), neurotic, stress-related and somatoform disorders (F40-F48), diabetes mellitus (E10-E14), and diseases of esophagus, stomach, and duodenum (K20-K31). The bidirectional association path of mood disorders (F30-F39) with hypertensive diseases (I10-I15) and diabetes mellitus (E10-E14) were the strongest. Depressive episodes (F32) and recurrent depressive disorders (F33) revealed strong bidirectional association paths with other degenerative diseases of the nervous system (G30-G32) and organic, including symptomatic and mental disorders (F00-F09). Bipolar affective disorders (F31) revealed strong bidirectional association paths with diabetes mellitus (E10-E14) and hypertensive diseases (I10-I15). It was found that different physical and psychiatric disorders are comorbid according to the sub-classification of mood disorders. Understanding the comorbidity patterns of major comorbidities for each mood disorder can assist mental health providers in treating and managing patients with mood disorders.
Collapse
Affiliation(s)
- Sunkyung Cha
- Department of Nursing Science, Sunmoon University, Asan 31460, Korea;
| | - Sung-Soo Kim
- Department of Health Administration & Healthcare, Cheongju University, Cheongju 28503, Korea
| |
Collapse
|
30
|
Xu Z, Shen D, Nie T, Kou Y, Yin N, Han X. A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.02.056] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
31
|
Zheng G, Xu Y. Efficient face detection and tracking in video sequences based on deep learning. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.03.027] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
32
|
|
33
|
Xu Z, Shen D, Kou Y, Nie T. A hybrid feature selection algorithm combining ReliefF and Particle swarm optimization for high-dimensional medical data. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-202948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Due to high-dimensional feature and strong correlation of features, the classification accuracy of medical data is not as good enough as expected. feature selection is a common algorithm to solve this problem, and selects effective features by reducing the dimensionality of high-dimensional data. However, traditional feature selection algorithms have the blindness of threshold setting and the search algorithms are liable to fall into a local optimal solution. Based on it, this paper proposes a hybrid feature selection algorithm combining ReliefF and Particle swarm optimization. The algorithm is mainly divided into three parts: Firstly, the ReliefF is used to calculate the feature weight, and the features are ranked by the weight. Then ranking feature is grouped according to the density equalization, where the density of features in each group is the same. Finally, the Particle Swarm Optimization algorithm is used to search the ranking feature groups, and the feature selection is performed according to a new fitness function. Experimental results show that the random forest has the highest classification accuracy on the features selected. More importantly, it has the least number of features. In addition, experimental results on 2 medical datasets show that the average accuracy of random forest reaches 90.20%, which proves that the hybrid algorithm has a certain application value.
Collapse
Affiliation(s)
- Zhaozhao Xu
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| | - Derong Shen
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| | - Yue Kou
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| | - Tiezheng Nie
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| |
Collapse
|
34
|
Shao M, Qi D, Xue H. Big data outlier detection model based on improved density peak algorithm. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-189456] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Outlier detection is an important branch of data mining. This paper proposes an advanced fast density peak outlier detection algorithm based on the characteristics of big data. The algorithm is an outlier detection method based on the improved density peak clustering algorithm. This paper improves the original algorithm. From the perspective of outlier detection, although it is a clustering idea, it avoids the clustering process, reduces the time complexity of the cluster-based outlier detection algorithm, and absorbs. The outlier detection based on neighbors is not sensitive to data dimensions and other advantages. In the power industry, outlier detection can be used in areas such as grid fault detection, equipment fault detection, and power abnormality detection. The simulation experiment of outlier detection based on the daily load curve of single and multiple transformers in a certain province shows that the improved algorithm can effectively detect outliers in the data.
Collapse
Affiliation(s)
- Mengliang Shao
- Department of Computer Science, South China Institute of Software Engineering, Guangzhou University, Guangzhou, China
- Research Institute of Computer Systems, South China University of Technology, Guangzhou, Guangdong, China
| | - Deyu Qi
- Research Institute of Computer Systems, South China University of Technology, Guangzhou, Guangdong, China
| | - Huili Xue
- School of Information Engineering, Guangzhou Nanyang Polytechnic College, Guangzhou, China
| |
Collapse
|
35
|
Scaldelai D, Matioli LC, Santos SR, Kleina M. MulticlusterKDE: a new algorithm for clustering based on multivariate kernel density estimation. J Appl Stat 2020; 49:98-121. [PMID: 35707794 PMCID: PMC9041763 DOI: 10.1080/02664763.2020.1799958] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Accepted: 07/19/2020] [Indexed: 10/23/2022]
Abstract
In this paper, we propose the MulticlusterKDE algorithm applied to classify elements of a database into categories based on their similarity. MulticlusterKDE is centered on the multiple optimization of the kernel density estimator function with multivariate Gaussian kernel. One of the main features of the proposed algorithm is that the number of clusters is an optional input parameter. Furthermore, it is very simple, easy to implement, well defined and stops at a finite number of steps and it always converges regardless of the data set. We illustrate our findings by implementing the algorithm in R software. The results indicate that the MulticlusterKDE algorithm is competitive when compared to K-means, K-medoids, CLARA, DBSCAN and PdfCluster algorithms. Features such as simplicity and efficiency make the proposed algorithm an attractive and promising research field that can be used as basis for its improvement and also for the development of new density-based clustering algorithms.
Collapse
Affiliation(s)
- D. Scaldelai
- Colegiado de Matemática, Universidade Estadual do Paraná – Unespar, Campo Mourão, Brazil
| | - L. C. Matioli
- Departamento de Matemática, Universidade Federal do Paraná – UFPR, Curitiba, Brazil
| | - S. R. Santos
- Colegiado de Matemática, Universidade Estadual do Paraná – Unespar, Campo Mourão, Brazil
| | - M. Kleina
- Departamento de Engenharia de Produção, Universidade Federal do Paraná – UFPR, Curitiba, Brazil
| |
Collapse
|
36
|
Tahmasebi N, Boulanger P, Yun J, Fallone G, Noga M, Punithakumar K. Real-Time Lung Tumor Tracking Using a CUDA Enabled Nonrigid Registration Algorithm for MRI. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE-JTEHM 2020; 8:4300308. [PMID: 32411543 PMCID: PMC7217296 DOI: 10.1109/jtehm.2020.2989124] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 11/30/2019] [Accepted: 12/30/2019] [Indexed: 11/09/2022]
Abstract
Objective: This study intends to develop an accurate, real-time tumor tracking algorithm for the automated radiation therapy for cancer treatment using Graphics Processing Unit (GPU) computing. Although a previous moving mesh based tumor tracking approach has been shown to be successful in delineating the tumor regions from a sequence of magnetic resonance image, the algorithm is computationally intensive and its computation time on standard Central Processing Unit (CPU) processors is too slow to be used clinically especially for automated radiation therapy system. Method: A re-implementation of the algorithm on a low-cost parallel GPU-based computing platform is utilized to accelerate this computation at a speed that is amicable to clinical usages. Several components in the registration algorithm such as the computation of similarity metric are inherently parallel which fits well with the GPU parallel processing capabilities. Solving a partial differential equation numerically to generate the mesh deformation is one of the computationally intensive components which has been accelerated by utilizing a much faster shared memory on the GPU. Results: Implemented on an NVIDIA Tesla K40c GPU, the proposed approach yielded a computational acceleration improvement of over 5 times its implementation on a CPU. The proposed approach yielded an average Dice score of 0.87 evaluated over 600 images acquired from six patients. Conclusion: This study demonstrated that the GPU computing approach can be used to accelerate tumor tracking for automated radiation therapy for mobile lung tumors. Clinical Impact: Accurately tracking mobile tumor boundaries in real-time is important to automate radiation therapy and the proposed study offers an excellent option for fast tumor region tracking for cancer treatment.
Collapse
Affiliation(s)
- Nazanin Tahmasebi
- 1Department of Radiology and Diagnostic ImagingUniversity of AlbertaEdmontonABT6G 2R3Canada.,2Servier Virtual Cardiac CentreMazankowski Alberta Heart InstituteEdmontonABT6G 2B7Canada.,3Department of Computing ScienceUniversity of AlbertaEdmontonABT6G 2R3Canada
| | - Pierre Boulanger
- 1Department of Radiology and Diagnostic ImagingUniversity of AlbertaEdmontonABT6G 2R3Canada.,2Servier Virtual Cardiac CentreMazankowski Alberta Heart InstituteEdmontonABT6G 2B7Canada.,3Department of Computing ScienceUniversity of AlbertaEdmontonABT6G 2R3Canada
| | - Jihyun Yun
- 4Medical Physics DivisionDepartment of OncologyUniversity of AlbertaEdmontonABT6G 2R3Canada
| | - Gino Fallone
- 4Medical Physics DivisionDepartment of OncologyUniversity of AlbertaEdmontonABT6G 2R3Canada
| | - Michelle Noga
- 1Department of Radiology and Diagnostic ImagingUniversity of AlbertaEdmontonABT6G 2R3Canada.,2Servier Virtual Cardiac CentreMazankowski Alberta Heart InstituteEdmontonABT6G 2B7Canada
| | - Kumaradevan Punithakumar
- 1Department of Radiology and Diagnostic ImagingUniversity of AlbertaEdmontonABT6G 2R3Canada.,2Servier Virtual Cardiac CentreMazankowski Alberta Heart InstituteEdmontonABT6G 2B7Canada.,3Department of Computing ScienceUniversity of AlbertaEdmontonABT6G 2R3Canada
| |
Collapse
|
37
|
Chen J, Sun L, Guo C, Xie Y. A fusion framework to extract typical treatment patterns from electronic medical records. Artif Intell Med 2020; 103:101782. [DOI: 10.1016/j.artmed.2019.101782] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 11/18/2019] [Accepted: 12/26/2019] [Indexed: 12/26/2022]
|
38
|
Lagunes-García G, Rodríguez-González A, Prieto-Santamaría L, García Del Valle EP, Zanin M, Menasalvas-Ruiz E. DISNET: a framework for extracting phenotypic disease information from public sources. PeerJ 2020; 8:e8580. [PMID: 32110491 PMCID: PMC7032061 DOI: 10.7717/peerj.8580] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2019] [Accepted: 01/16/2020] [Indexed: 12/25/2022] Open
Abstract
Background Within the global endeavour of improving population health, one major challenge is the identification and integration of medical knowledge spread through several information sources. The creation of a comprehensive dataset of diseases and their clinical manifestations based on information from public sources is an interesting approach that allows one not only to complement and merge medical knowledge but also to increase it and thereby to interconnect existing data and analyse and relate diseases to each other. In this paper, we present DISNET (http://disnet.ctb.upm.es/), a web-based system designed to periodically extract the knowledge from signs and symptoms retrieved from medical databases, and to enable the creation of customisable disease networks. Methods We here present the main features of the DISNET system. We describe how information on diseases and their phenotypic manifestations is extracted from Wikipedia and PubMed websites; specifically, texts from these sources are processed through a combination of text mining and natural language processing techniques. Results We further present the validation of our system on Wikipedia and PubMed texts, obtaining the relevant accuracy. The final output includes the creation of a comprehensive symptoms-disease dataset, shared (free access) through the system's API. We finally describe, with some simple use cases, how a user can interact with it and extract information that could be used for subsequent analyses. Discussion DISNET allows retrieving knowledge about the signs, symptoms and diagnostic tests associated with a disease. It is not limited to a specific category (all the categories that the selected sources of information offer us) and clinical diagnosis terms. It further allows to track the evolution of those terms through time, being thus an opportunity to analyse and observe the progress of human knowledge on diseases. We further discussed the validation of the system, suggesting that it is good enough to be used to extract diseases and diagnostically-relevant terms. At the same time, the evaluation also revealed that improvements could be introduced to enhance the system's reliability.
Collapse
Affiliation(s)
- Gerardo Lagunes-García
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain
| | - Alejandro Rodríguez-González
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain.,Escuela Técnica Superior de Ingenieros Informáticos, Universidad Politécnica de Madrid, Boadilla del Monte, Madrid, Spain
| | - Lucía Prieto-Santamaría
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain
| | | | - Massimiliano Zanin
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain
| | - Ernestina Menasalvas-Ruiz
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain
| |
Collapse
|
39
|
Huang J, Zhuo Y, Tian X, Zhu D, Mustafa R. Personalized disease treatment plan suggestion system based on big data and knowledge base. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2020. [DOI: 10.3233/jifs-179425] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Jun Huang
- The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Yumin Zhuo
- The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Xuemei Tian
- School of Life Sciences, South China Normal University, Guangzhou, China
| | - Dingju Zhu
- School of Computer Science, South China Normal University, Guangzhou, China
| | - Rashed Mustafa
- Department of Computer Science and EngineeringUniversity of Chittagong, Chittagong, Bangladesh
| |
Collapse
|
40
|
|
41
|
Hu Y, Li J, He L. A reformed task scheduling algorithm for heterogeneous distributed systems with energy consumption constraints. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04415-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
42
|
|
43
|
Modeling Side Information in Preference Relation based Restricted Boltzmann Machine for recommender systems. Inf Sci (N Y) 2019. [DOI: 10.1016/j.ins.2019.03.064] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
44
|
Song Y, Vivian Hu Q, He L. Let terms choose their own kernels: An intelligent approach to kernel selection for healthcare search. Inf Sci (N Y) 2019. [DOI: 10.1016/j.ins.2019.02.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
45
|
DeepReco: Deep Learning Based Health Recommender System Using Collaborative Filtering. COMPUTATION 2019. [DOI: 10.3390/computation7020025] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In today’s digital world healthcare is one core area of the medical domain. A healthcare system is required to analyze a large amount of patient data which helps to derive insights and assist the prediction of diseases. This system should be intelligent in order to predict a health condition by analyzing a patient’s lifestyle, physical health records and social activities. The health recommender system (HRS) is becoming an important platform for healthcare services. In this context, health intelligent systems have become indispensable tools in decision making processes in the healthcare sector. Their main objective is to ensure the availability of the valuable information at the right time by ensuring information quality, trustworthiness, authentication and privacy concerns. As people use social networks to understand their health condition, so the health recommender system is very important to derive outcomes such as recommending diagnoses, health insurance, clinical pathway-based treatment methods and alternative medicines based on the patient’s health profile. Recent research which targets the utilization of large volumes of medical data while combining multimodal data from disparate sources is discussed which reduces the workload and cost in health care. In the healthcare sector, big data analytics using recommender systems have an important role in terms of decision-making processes with respect to a patient’s health. This paper gives a proposed intelligent HRS using Restricted Boltzmann Machine (RBM)-Convolutional Neural Network (CNN) deep learning method, which provides an insight into how big data analytics can be used for the implementation of an effective health recommender engine, and illustrates an opportunity for the health care industry to transition from a traditional scenario to a more personalized paradigm in a tele-health environment. By considering Root Square Mean Error (RSME) and Mean Absolute Error (MAE) values, the proposed deep learning method (RBM-CNN) presents fewer errors compared to other approaches.
Collapse
|
46
|
|
47
|
|
48
|
Prediction of mechanical properties of micro-alloyed steels via neural networks learned by water wave optimization. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04149-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
49
|
Improving the Accuracy of Feature Selection in Big Data Mining Using Accelerated Flower Pollination (AFP) Algorithm. J Med Syst 2019; 43:96. [PMID: 30852692 DOI: 10.1007/s10916-019-1200-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Accepted: 02/11/2019] [Indexed: 01/13/2023]
Abstract
In recent times, the main problem associated with big data analytics is its high dimensional data over the search space. Such data gathers continuously in search space making traditional algorithms infeasible for data mining in real time environment. Hence, feature selection is an important method to lighten the load during processing while inducing a model for mining. However, mining over such high dimensional data leads to formulation of optimal feature subset, which grows exponentially and leads to intractable computational demand. In this paper, a novel lightweight mechanism is used as a feature selection method, which solves the after effects arising with optimal feature selection. The feature selection in big data mining is done using accelerated flower pollination (AFP) algorithm. This method improves the accuracy of feature selection with reduced processing time. The proposed method is tested under larger set of data with high dimensionality to test the performance of proposed method.
Collapse
|
50
|
Wang P, Zhu W, Liao B, Cai L, Peng L, Yang J. Predicting Influenza Antigenicity by Matrix Completion With Antigen and Antiserum Similarity. Front Microbiol 2018; 9:2500. [PMID: 30405563 PMCID: PMC6206390 DOI: 10.3389/fmicb.2018.02500] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 10/01/2018] [Indexed: 12/20/2022] Open
Abstract
The rapid mutation of influenza viruses especially on the two surface proteins hemagglutinin (HA) and neuraminidase (NA) has made them capable to escape from population immunity, which has become a key challenge for influenza vaccine design. Thus, it is crucial to predict influenza antigenic evolution and identify new antigenic variants in a timely manner. However, traditional experimental methods like hemagglutination inhibition (HI) assay to select vaccine strains are time and labor-intensive, while popular computational methods are less sensitive, which presents the need for more accurate algorithms. In this study, we have proposed a novel low-rank matrix completion model MCAAS to infer antigenic distances between antigens and antisera based on partially revealed antigenic distances, virus similarity based on HA protein sequences, and vaccine similarity based on vaccine strains. The model exploits the correlations of viruses and vaccines in serological tests as well as the ability of HAs from viruses and vaccine strains in inferring influenza antigenicity. We also compared the effects of comprehensive 65 amino acids substitution matrices in predicting influenza antigenicity. As a result, we applied MCAAS into H3N2 seasonal influenza virus data. Our model achieved a 10-fold cross validation root-mean-squared error (RMSE) of 0.5982, significantly outperformed existing computational methods like antigenic cartography, AntigenMap and BMCSI. We also constructed the antigenic map and studied the association between genetic and antigenic evolution of H3N2 influenza viruses. Finally, our analyses showed that homologous structure derived amino acid substitution matrix (HSDM) is most powerful in predicting influenza antigenicity, which is consistent with previous studies.
Collapse
Affiliation(s)
- Peng Wang
- College of Information Science and Engineering, Hunan University, Changsha, Changsha, China
| | - Wen Zhu
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Bo Liao
- College of Information Science and Engineering, Hunan University, Changsha, Changsha, China.,School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Lijun Cai
- College of Information Science and Engineering, Hunan University, Changsha, Changsha, China
| | - Lihong Peng
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| | - Jialiang Yang
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China.,Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine At Mount Sinai, New York, NY, United States
| |
Collapse
|