1
|
Wu A, Luo L, Zeng Q, Wu C, Shu X, Huang P, Wang Z, Hu T, Feng Z, Tu Y, Zhu Y, Cao Y, Li Z. Comparative assessment of the capability of machine learning-based radiomic models for predicting omental metastasis in locally advanced gastric cancer. Sci Rep 2024; 14:16208. [PMID: 39003337 PMCID: PMC11246510 DOI: 10.1038/s41598-024-66979-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Accepted: 07/06/2024] [Indexed: 07/15/2024] Open
Abstract
The study aims to investigate the predictive capability of machine learning algorithms for omental metastasis in locally advanced gastric cancer (LAGC) and to compare the performance metrics of various machine learning predictive models. A retrospective collection of 478 pathologically confirmed LAGC patients was undertaken, encompassing both clinical features and arterial phase computed tomography images. Radiomic features were extracted using 3D Slicer software. Clinical and radiomic features were further filtered through lasso regression. Selected clinical and radiomic features were used to construct omental metastasis predictive models using support vector machine (SVM), decision tree (DT), random forest (RF), K-nearest neighbors (KNN), and logistic regression (LR). The models' performance metrics included accuracy, area under the curve (AUC) of the receiver operating characteristic curve, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). In the training cohort, the RF predictive model surpassed LR, SVM, DT, and KNN in terms of accuracy, AUC, sensitivity, specificity, PPV, and NPV. Compared to the other four predictive models, the RF model significantly improved PPV. In the test cohort, all five machine learning predictive models exhibited lower PPVs. The DT model demonstrated the most significant variation in performance metrics relative to the other models, with a sensitivity of 0.231 and specificity of 0.990. The LR-based predictive model had the lowest PPV at 0.210, compared to the other four models. In the external validation cohort, the performance metrics of the predictive models were generally consistent with those in the test cohort. The LR-based model for predicting omental metastasis exhibited a lower PPV. Among the machine learning algorithms, the RF predictive model demonstrated higher accuracy and improved PPV relative to LR, SVM, KNN, and DT models.
Collapse
Affiliation(s)
- Ahao Wu
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, China
| | - Lianghua Luo
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
- General Surgery Department of Jiangxi Provincial People's Hospital, Nanchang, 330006, Jiangxi Province, China
| | - Qingwen Zeng
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Changlei Wu
- Department of Gastrointestinal Surgery, The Second Affiliated Hospital, Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Xufeng Shu
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Pang Huang
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Zhonghao Wang
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Tengcheng Hu
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Zongfeng Feng
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Yi Tu
- Department of Pathology, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Yanyan Zhu
- Department of Radiology, The First Affiliated Hospital, Nanchang University, Nanchang, 330006, Jiangxi Province, China
| | - Yi Cao
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China.
| | - Zhengrong Li
- Department of Digestive Surgery, Digestive Disease Hospital, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China.
- Department of Digestive Surgery, Digestive Disease Hospital, The Third Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi Province, China.
| |
Collapse
|
2
|
Artificial Intelligence Models for Zoonotic Pathogens: A Survey. Microorganisms 2022; 10:microorganisms10101911. [PMID: 36296187 PMCID: PMC9607465 DOI: 10.3390/microorganisms10101911] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 09/19/2022] [Accepted: 09/22/2022] [Indexed: 11/22/2022] Open
Abstract
Zoonotic diseases or zoonoses are infections due to the natural transmission of pathogens between species (animals and humans). More than 70% of emerging infectious diseases are attributed to animal origin. Artificial Intelligence (AI) models have been used for studying zoonotic pathogens and the factors that contribute to their spread. The aim of this literature survey is to synthesize and analyze machine learning, and deep learning approaches applied to study zoonotic diseases to understand predictive models to help researchers identify the risk factors, and develop mitigation strategies. Based on our survey findings, machine learning and deep learning are commonly used for the prediction of both foodborne and zoonotic pathogens as well as the factors associated with the presence of the pathogens.
Collapse
|
3
|
Exploring the predictive capability of machine learning models in identifying foot and mouth disease outbreak occurrences in cattle farms in an endemic setting of Thailand. Prev Vet Med 2022; 207:105706. [DOI: 10.1016/j.prevetmed.2022.105706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 06/09/2022] [Accepted: 07/01/2022] [Indexed: 11/20/2022]
|
4
|
Kumar Y, Koul A, Singla R, Ijaz MF. Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING 2022; 14:8459-8486. [PMID: 35039756 PMCID: PMC8754556 DOI: 10.1007/s12652-021-03612-z] [Citation(s) in RCA: 117] [Impact Index Per Article: 58.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 11/18/2021] [Indexed: 05/03/2023]
Abstract
Artificial intelligence can assist providers in a variety of patient care and intelligent health systems. Artificial intelligence techniques ranging from machine learning to deep learning are prevalent in healthcare for disease diagnosis, drug discovery, and patient risk identification. Numerous medical data sources are required to perfectly diagnose diseases using artificial intelligence techniques, such as ultrasound, magnetic resonance imaging, mammography, genomics, computed tomography scan, etc. Furthermore, artificial intelligence primarily enhanced the infirmary experience and sped up preparing patients to continue their rehabilitation at home. This article covers the comprehensive survey based on artificial intelligence techniques to diagnose numerous diseases such as Alzheimer, cancer, diabetes, chronic heart disease, tuberculosis, stroke and cerebrovascular, hypertension, skin, and liver disease. We conducted an extensive survey including the used medical imaging dataset and their feature extraction and classification process for predictions. Preferred reporting items for systematic reviews and Meta-Analysis guidelines are used to select the articles published up to October 2020 on the Web of Science, Scopus, Google Scholar, PubMed, Excerpta Medical Database, and Psychology Information for early prediction of distinct kinds of diseases using artificial intelligence-based techniques. Based on the study of different articles on disease diagnosis, the results are also compared using various quality parameters such as prediction rate, accuracy, sensitivity, specificity, the area under curve precision, recall, and F1-score.
Collapse
Affiliation(s)
- Yogesh Kumar
- Department of Computer Engineering, Indus Institute of Technology and Engineering, Indus University, Ahmedabad, 382115 India
| | | | - Ruchi Singla
- Department of Research, Innovations, Sponsored Projects and Entrepreneurship, CGC Landran, Mohali, India
| | - Muhammad Fazal Ijaz
- Department of Intelligent Mechatronics Engineering, Sejong University, Seoul, 05006 South Korea
| |
Collapse
|
5
|
Romero MP, Chang YM, Brunton LA, Parry J, Prosser A, Upton P, Drewe JA. Machine learning classification methods informing the management of inconclusive reactors at bovine tuberculosis surveillance tests in England. Prev Vet Med 2021; 199:105565. [PMID: 34954421 DOI: 10.1016/j.prevetmed.2021.105565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Revised: 12/15/2021] [Accepted: 12/17/2021] [Indexed: 10/19/2022]
Abstract
Bovine tuberculosis (bTB) remains one of the most complex, challenging, and costly animal health problems in England. Identifying and promptly removing all infected cattle from affected herds is key to its eradication strategy; the imperfect sensitivity of the diagnostic testing regime remaining a serious obstacle. The main diagnostic test for bTB in cattle in England, the Single Intradermal Comparative Cervical Tuberculin Test (SICCT: also known as the skin test), can produce inconclusive results below the reactor threshold. The immediate isolation of inconclusive reactor (IR) animals followed by a 60-day retest may not prevent lateral spread within the herd (if it is substandard, allowing transmission) or transmission to wildlife. Over half of IR-only herds that went on to have a positive skin test result (a bTB herd 'incident') in 2020, had it triggered by at least one IR not clearing their 60-day retest, instead of by another test within the previous 15 months. Machine learning classification algorithms (classification tree analysis and random forest), applied to England's 2012-2020 IR-only surveillance herd tests, identified at-risk tests for an incident at the IRs' 60-day retest. In this period, 4 739 out of 22 946 (21 %) IR-only surveillance tests disclosing 6 296 out of 42 685 total IRs, had an incident at retest (2 716 IRs became reactors and 3 580 IRs became two-time IRs). Both models showed an AUC above 80 % in the 2012-2019 dataset. Classification tree analysis was preferred due to its easy-to-interpret outputs, 70 % sensitivity, and 93 % specificity in the 20 % of 2019-2020 testing dataset. The paper aimed to identify IR-only surveillance tests at-risk of an incident at the 60-day retest to target them with appropriate measures to mitigate the IRs' risk. Sixteen percent (341 out of 2 177) of IR-only herd tests were identified as high-risk in the 2020 dataset, with 265 (78 %) of these having at least one reactor or IR at retest. Severe-level reinterpretation of the high-risk IR-only disclosing tests identified in this dataset would turn 68 out of the 590 (12 %) IRs into reactors, generating 23 incidents, the majority (19 or 83 %) part of the 265 incidents that would have been declared at the retest. Classification tree analysis used to identify IR-only high-risk tests in herds eligible for severe interpretation would enhance the sensitivity of the test-and-slaughter regime, cornerstone of the bTB eradication programme in England, further mitigating the risk of disease spread posed by IRs.
Collapse
Affiliation(s)
- M Pilar Romero
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom; Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom.
| | - Yu-Mei Chang
- Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom
| | - Lucy A Brunton
- Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom
| | - Jessica Parry
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Alison Prosser
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Paul Upton
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Julian A Drewe
- Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom
| |
Collapse
|
6
|
Yu S, Li X, Wang H, Zhang X, Chen S. C_CART: An instance confidence-based decision tree algorithm for classification. INTELL DATA ANAL 2021. [DOI: 10.3233/ida-205361] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
In classification, a decision tree is a common model due to its simple structure and easy understanding. Most of decision tree algorithms assume all instances in a dataset have the same degree of confidence, so they use the same generation and pruning strategies for all training instances. In fact, the instances with greater degree of confidence are more useful than the ones with lower degree of confidence in the same dataset. Therefore, the instances should be treated discriminately according to their corresponding confidence degrees when training classifiers. In this paper, we investigate the impact and significance of degree of confidence of instances on the classification performance of decision tree algorithms, taking the classification and regression tree (CART) algorithm as an example. First, the degree of confidence of instances is quantified from a statistical perspective. Then, a developed CART algorithm named C_CART is proposed by introducing the confidence of instances into the generation and pruning processes of CART algorithm. Finally, we conduct experiments to evaluate the performance of C_CART algorithm. The experimental results show that our C_CART algorithm can significantly improve the generalization performance as well as avoiding the over-fitting problem to a certain extend.
Collapse
Affiliation(s)
- Shuang Yu
- Key Laboratory of Symbolic Computation and Knowledge Engineer, Ministry of Education, Changchun, Jilin, China
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China
- CSIRO Data61, Sydney, NSW, Australia
| | - Xiongfei Li
- Key Laboratory of Symbolic Computation and Knowledge Engineer, Ministry of Education, Changchun, Jilin, China
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China
| | - Hancheng Wang
- State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu, China
| | - Xiaoli Zhang
- Key Laboratory of Symbolic Computation and Knowledge Engineer, Ministry of Education, Changchun, Jilin, China
- College of Computer Science and Technology, Jilin University, Changchun, Jilin, China
| | | |
Collapse
|
7
|
Zhang M, Zhang K, Yu D, Xie Q, Liu B, Chen D, Xv D, Li Z, Liu C. Computerized assisted evaluation system for canine cardiomegaly via key points detection with deep learning. Prev Vet Med 2021; 193:105399. [PMID: 34118647 DOI: 10.1016/j.prevetmed.2021.105399] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Revised: 05/21/2021] [Accepted: 06/01/2021] [Indexed: 10/21/2022]
Abstract
Cardiomegaly is the main imaging finding for canine heart diseases. There are many advances in the field of medical diagnosing based on imaging with deep learning for human being. However there are also increasing realization of the potential of using deep learning in veterinary medicine. We reported a clinically applicable assisted platform for diagnosing the canine cardiomegaly with deep learning. VHS (vertebral heart score) is a measuring method used for the heart size of a dog. The concrete value of VHS is calculated with the relative position of 16 key points detected by the system, and this result is then combined with VHS reference range of all dog breeds to assist in the evaluation of the canine cardiomegaly. We adopted HRNet (high resolution network) to detect 16 key points (12 and four key points located on vertebra and heart respectively) in 2274 lateral X-ray images (training and validation datasets) of dogs, the model was then used to detect the key points in external testing dataset (396 images), the AP (average performance) for key point detection reach 86.4 %. Then we applied an additional post processing procedure to correct the output of HRNets so that the AP reaches 90.9 %. This result signifies that this system can effectively assist the evaluation of canine cardiomegaly in a real clinical scenario.
Collapse
Affiliation(s)
- Mengni Zhang
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China
| | - Kai Zhang
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China.
| | - Deying Yu
- Hospital University Sains Malaysia, Kota Bharu, 16150, Kelantan, Malaysia
| | - Qianru Xie
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China
| | - Binlong Liu
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China
| | - Dacan Chen
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China
| | - Dongxing Xv
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China
| | - Zhiwei Li
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China
| | - Chaofei Liu
- New Ruipeng Pet Healthcare Group Co. LTD., Beijing, 100010, China
| |
Collapse
|
8
|
Romero MP, Chang YM, Brunton LA, Prosser A, Upton P, Rees E, Tearne O, Arnold M, Stevens K, Drewe JA. A comparison of the value of two machine learning predictive models to support bovine tuberculosis disease control in England. Prev Vet Med 2021; 188:105264. [PMID: 33556783 DOI: 10.1016/j.prevetmed.2021.105264] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 12/03/2020] [Accepted: 01/07/2021] [Indexed: 11/16/2022]
Abstract
Nearly a decade into Defra's current eradication strategy, bovine tuberculosis (bTB) remains a serious animal health problem in England, with c.30,000 cattle slaughtered annually in the fight against this insidious disease. There is an urgent need to improve our understanding of bTB risk in order to enhance the current disease control policy. Machine learning approaches applied to big datasets offer a potential way to do this. Regularized regression and random forest machine learning methodologies were implemented using 2016 herd-level data to generate the best possible predictive models for a bTB incident in England and its three surveillance risk areas (High-risk area [HRA], Edge area [EA] and Low-risk area [LRA]). Their predictive performance was compared and the best models in each area were used to characterize herds according to risk. While all models provided excellent discrimination, random forest models achieved the highest balanced accuracy (i.e. average of sensitivity and specificity) in England, HRA and LRA, whereas the regularized regression LASSO model did so in the EA. The time since the last confirmed incident was resolved was the only variable in the top-ten ranking in all areas according to both types of models, which highlights the importance of bTB history as a predictor of a new incident. Risk categorisation based on Receiver Operating Characteristic (ROC) analysis was carried out using the best predictive models in each area setting a 99 % threshold value for sensitivity and specificity (97 % in the LRA). Thirteen percent of herds in the whole of England as well as in its HRA, 14 % in its EA and 31 % in its LRA were classified as high-risk. These could be selected for the deployment of additional disease control measures at national or area level. In this way, low-risk herds within the area considered would not be penalised unnecessarily by blanket control measures and limited resources be used more efficiently. The methodology presented in this paper demonstrates a way to accurately identify high-risk farms to inform a targeted disease control and prevention strategy in England that supplements existing population strategies.
Collapse
Affiliation(s)
- M Pilar Romero
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom; Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom.
| | - Yu-Mei Chang
- Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom
| | - Lucy A Brunton
- Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom
| | - Alison Prosser
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Paul Upton
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Eleanor Rees
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Oliver Tearne
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Mark Arnold
- Animal and Plant Health Agency, Woodham Lane, Addlestone, Surrey, KT15 3NB, United Kingdom
| | - Kim Stevens
- Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom
| | - Julian A Drewe
- Royal Veterinary College, Hawkshead Lane, North Mymms, Hatfield, Hertfordshire, AL9 7TA, United Kingdom
| |
Collapse
|