1
|
Chen J, Jiang Y, Li Z, Zhang M, Liu L, Li A, Lu H. Predictive machine learning models for anticipating loss to follow-up in tuberculosis patients throughout anti-TB treatment journey. Sci Rep 2024; 14:24685. [PMID: 39433802 PMCID: PMC11494039 DOI: 10.1038/s41598-024-74942-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 09/30/2024] [Indexed: 10/23/2024] Open
Abstract
Loss to follow-up (LTFU) in tuberculosis (TB) management increases morbidity and mortality, challenging effective control strategies. This study aims to develop and evaluate machine learning models to predict loss to follow-up in TB patients, improving treatment adherence and outcomes. Retrospective data encompassing tuberculosis patients who underwent treatment or registration at the National Center for Clinical Medical Research on Infectious Diseases from January 2017 to December 2021 were compiled. Employing machine learning techniques, namely SVM, RF, XGBoost, and logistic regression, the study aimed to prognosticate LTFU. A comprehensive cohort of 24,265 tuberculosis patients underwent scrutiny, revealing a LTFU prevalence of 12.51% (n = 3036). Education level, history of hospitalization, alcohol consumption, outpatient admission, and prior tuberculosis history emerged as precursors for pre-treatment LTFU. Employment status, outpatient admission, presence of chronic hepatitis/cirrhosis, drug adverse reactions, alternative contact availability, and health insurance coverage exerted substantial influence on treatment-phase LTFU. XGBoost consistently surpassed alternative models, boasting superior discriminative ability with an average AUC of 0.921 for pre-treatment LTFU and 0.825 for in-treatment LTFU. Our study demonstrates that the XGBoost model provides superior predictive performance in identifying LTFU risk among tuberculosis patients. The identification of key risk factors highlights the importance of targeted interventions, which could lead to significant improvements in treatment adherence and patient outcomes.
Collapse
Affiliation(s)
- Jingfang Chen
- The Third People's Hospital of Shenzhen, Shenzhen, 518112, China
- National Clinical Research Center for Infectious Diseases, Shenzhen, 518112, China
- Second Hospital Affiliated With Southern University of Science and Technology, Shenzhen, 518112, China
- Faculty of Medicine, Macau University of Science and Technology, Macau, 999078, China
| | - Youli Jiang
- Nursing Department, The People's Hospital of Longhua, Shenzhen, 518109, China
| | - Zhihuan Li
- Faculty of Medicine, Macau University of Science and Technology, Macau, 999078, China
| | - Mingshu Zhang
- Faculty of Medicine, Macau University of Science and Technology, Macau, 999078, China
| | - Linlin Liu
- Hengyang Medical School, School of Nursing, University of South China, Hengyang, 421001, China
| | - Ao Li
- Clinical Nursing Teaching and Research Section, The Second Xiangya Hospital of Central South University, Changsha, China
| | - Hongzhou Lu
- The Third People's Hospital of Shenzhen, Shenzhen, 518112, China.
- National Clinical Research Center for Infectious Diseases, Shenzhen, 518112, China.
- Second Hospital Affiliated With Southern University of Science and Technology, Shenzhen, 518112, China.
| |
Collapse
|
2
|
Ahmed IA, Senan EM, Shatnawi HSA, Alkhraisha ZM, Al-Azzam MMA. Multi-Techniques for Analyzing X-ray Images for Early Detection and Differentiation of Pneumonia and Tuberculosis Based on Hybrid Features. Diagnostics (Basel) 2023; 13:diagnostics13040814. [PMID: 36832302 PMCID: PMC9955018 DOI: 10.3390/diagnostics13040814] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 02/16/2023] [Accepted: 02/19/2023] [Indexed: 02/23/2023] Open
Abstract
An infectious disease called tuberculosis (TB) exhibits pneumonia-like symptoms and traits. One of the most important methods for identifying and diagnosing pneumonia and tuberculosis is X-ray imaging. However, early discrimination is difficult for radiologists and doctors because of the similarities between pneumonia and tuberculosis. As a result, patients do not receive the proper care, which in turn does not prevent the disease from spreading. The goal of this study is to extract hybrid features using a variety of techniques in order to achieve promising results in differentiating between pneumonia and tuberculosis. In this study, several approaches for early identification and distinguishing tuberculosis from pneumonia were suggested. The first proposed system for differentiating between pneumonia and tuberculosis uses hybrid techniques, VGG16 + support vector machine (SVM) and ResNet18 + SVM. The second proposed system for distinguishing between pneumonia and tuberculosis uses an artificial neural network (ANN) based on integrating features of VGG16 and ResNet18, before and after reducing the high dimensions using the principal component analysis (PCA) method. The third proposed system for distinguishing between pneumonia and tuberculosis uses ANN based on integrating features of VGG16 and ResNet18 separately with handcrafted features extracted by local binary pattern (LBP), discrete wavelet transform (DWT) and gray level co-occurrence matrix (GLCM) algorithms. All the proposed systems have achieved superior results in the early differentiation between pneumonia and tuberculosis. An ANN based on the features of VGG16 with LBP, DWT and GLCM (LDG) reached an accuracy of 99.6%, sensitivity of 99.17%, specificity of 99.42%, precision of 99.63%, and an AUC of 99.58%.
Collapse
Affiliation(s)
- Ibrahim Abdulrab Ahmed
- Computer Department, Applied College, Najran University, Najran 66462, Saudi Arabia
- Correspondence: author: (I.A.A.); (E.M.S.)
| | - Ebrahim Mohammed Senan
- Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Alrazi University, Sana’a, Yemen
- Correspondence: author: (I.A.A.); (E.M.S.)
| | | | | | | |
Collapse
|
3
|
Ali MH, Khan DM, Jamal K, Ahmad Z, Manzoor S, Khan Z. Prediction of Multidrug-Resistant Tuberculosis Using Machine Learning Algorithms in SWAT, Pakistan. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:2567080. [PMID: 34512933 PMCID: PMC8426057 DOI: 10.1155/2021/2567080] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 08/18/2021] [Indexed: 11/20/2022]
Abstract
In this paper, we have focused on machine learning (ML) feature selection (FS) algorithms for identifying and diagnosing multidrug-resistant (MDR) tuberculosis (TB). MDR-TB is a universal public health problem, and its early detection has been one of the burning issues. The present study has been conducted in the Malakand Division of Khyber Pakhtunkhwa, Pakistan, to further add to the knowledge on the disease and to deal with the issues of identification and early detection of MDR-TB by ML algorithms. These models also identify the most important factors causing MDR-TB infection whose study gives additional insights into the matter. ML algorithms such as random forest, k-nearest neighbors, support vector machine, logistic regression, leaset absolute shrinkage and selection operator (LASSO), artificial neural networks (ANNs), and decision trees are applied to analyse the case-control dataset. This study reveals that close contacts of MDR-TB patients, smoking, depression, previous TB history, improper treatment, and interruption in first-line TB treatment have a great impact on the status of MDR. Accordingly, weight loss, chest pain, hemoptysis, and fatigue are important symptoms. Based on accuracy, sensitivity, and specificity, SVM and RF are the suggested models to be used for patients' classifications.
Collapse
Affiliation(s)
- Mian Haider Ali
- Department of Statistics, Abdul Wali Khan University, Mardan, Pakistan
- Programmatic Management of Drug-Resistant Tuberculosis, Saidu Teaching Hospital, Swat, Pakistan
| | | | - Khalid Jamal
- Programmatic Management of Drug-Resistant Tuberculosis, Saidu Teaching Hospital, Swat, Pakistan
| | - Zubair Ahmad
- Department of Statistics, Yazd University, P.O. Box 89175-741, Yazd, Iran
| | - Sadaf Manzoor
- Department of Statistics, Islamia College Peshawar, Peshawar, Pakistan
| | - Zardad Khan
- Department of Statistics, Abdul Wali Khan University, Mardan, Pakistan
| |
Collapse
|
4
|
Herman B, Sirichokchatchawan W, Pongpanich S, Nantasenamat C. Development and performance of CUHAS-ROBUST application for pulmonary rifampicin-resistance tuberculosis screening in Indonesia. PLoS One 2021; 16:e0249243. [PMID: 33765092 PMCID: PMC7993842 DOI: 10.1371/journal.pone.0249243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Accepted: 03/13/2021] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND AND OBJECTIVES Diagnosis of Pulmonary Rifampicin Resistant Tuberculosis (RR-TB) with the Drug-Susceptibility Test (DST) is costly and time-consuming. Furthermore, GeneXpert for rapid diagnosis is not widely available in Indonesia. This study aims to develop and evaluate the CUHAS-ROBUST model performance, an artificial-intelligence-based RR-TB screening tool. METHODS A cross-sectional study involved suspected all type of RR-TB patients with complete sputum Lowenstein Jensen DST (reference) and 19 clinical, laboratory, and radiology parameter results, retrieved from medical records in hospitals under the Faculty of Medicine, Hasanuddin University Indonesia, from January 2015-December 2019. The Artificial Neural Network (ANN) models were built along with other classifiers. The model was tested on participants recruited from January 2020-October 2020 and deployed into CUHAS-ROBUST (index test) application. Sensitivity, specificity, and accuracy were obtained for assessment. RESULTS A total of 487 participants (32 Multidrug-Resistant/MDR 57 RR-TB, 398 drug-sensitive) were recruited for model building and 157 participants (23 MDR and 21 RR) in prospective testing. The ANN full model yields the highest values of accuracy (88% (95% CI 85-91)), and sensitivity (84% (95% CI 76-89)) compare to other models that show sensitivity below 80% (Logistic Regression 32%, Decision Tree 44%, Random Forest 25%, Extreme Gradient Boost 25%). However, this ANN has lower specificity among other models (90% (95% CI 86-93)) where Logistic Regression demonstrates the highest (99% (95% CI 97-99)). This ANN model was selected for the CUHAS-ROBUST application, although still lower than the sensitivity of global GeneXpert results (87.5%). CONCLUSION The ANN-CUHAS ROBUST outperforms other AI classifiers model in detecting all type of RR-TB, and by deploying into the application, the health staff can utilize the tool for screening purposes particularly at the primary care level where the GeneXpert examination is not available. TRIAL REGISTRATION NCT04208789.
Collapse
Affiliation(s)
- Bumi Herman
- College of Public Health Science, Chulalongkorn University, Bangkok, Thailand
- * E-mail: (SP); , (BH)
| | | | - Sathirakorn Pongpanich
- College of Public Health Science, Chulalongkorn University, Bangkok, Thailand
- * E-mail: (SP); , (BH)
| | - Chanin Nantasenamat
- Faculty of Medical Technology, Mahidol University, Salaya, Nakhon Pathom, Thailand
| |
Collapse
|
5
|
Ul Abideen Z, Ghafoor M, Munir K, Saqib M, Ullah A, Zia T, Tariq SA, Ahmed G, Zahra A. Uncertainty Assisted Robust Tuberculosis Identification With Bayesian Convolutional Neural Networks. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:22812-22825. [PMID: 32391238 PMCID: PMC7176037 DOI: 10.1109/access.2020.2970023] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2020] [Accepted: 01/21/2020] [Indexed: 05/07/2023]
Abstract
Tuberculosis (TB) is an infectious disease that can lead towards death if left untreated. TB detection involves extraction of complex TB manifestation features such as lung cavity, air space consolidation, endobronchial spread, and pleural effusions from chest x-rays (CXRs). Deep learning based approach named convolutional neural network (CNN) has the ability to learn complex features from CXR images. The main problem is that CNN does not consider uncertainty to classify CXRs using softmax layer. It lacks in presenting the true probability of CXRs by differentiating confusing cases during TB detection. This paper presents the solution for TB identification by using Bayesian-based convolutional neural network (B-CNN). It deals with the uncertain cases that have low discernibility among the TB and non-TB manifested CXRs. The proposed TB identification methodology based on B-CNN is evaluated on two TB benchmark datasets, i.e., Montgomery and Shenzhen. For training and testing of proposed scheme we have utilized Google Colab platform which provides NVidia Tesla K80 with 12 GB of VRAM, single core of 2.3 GHz Xeon Processor, 12 GB RAM and 320 GB of disk. B-CNN achieves 96.42% and 86.46% accuracy on both dataset, respectively as compared to the state-of-the-art machine learning and CNN approaches. Moreover, B-CNN validates its results by filtering the CXRs as confusion cases where the variance of B-CNN predicted outputs is more than a certain threshold. Results prove the supremacy of B-CNN for the identification of TB and non-TB sample CXRs as compared to counterparts in terms of accuracy, variance in the predicted probabilities and model uncertainty.
Collapse
Affiliation(s)
- Zain Ul Abideen
- 1Department of Computer ScienceCOMSATS University Islamabad (CUI)Islamabad44000Pakistan
| | - Mubeen Ghafoor
- 1Department of Computer ScienceCOMSATS University Islamabad (CUI)Islamabad44000Pakistan
- 2FET - Computer Science and Creative TechnologiesUniversity of the West of EnglandBristolBS16 1QYU.K
| | - Kamran Munir
- 2FET - Computer Science and Creative TechnologiesUniversity of the West of EnglandBristolBS16 1QYU.K
| | - Madeeha Saqib
- 3Department of Computer Information SystemsCollege of Computer Science and Information TechnologyImam Abdulrahman Bin Faisal UniversityDammam34212Saudi Arabia
| | - Ata Ullah
- 4Department of Computer ScienceNational University of Modern Languages (NUML)Islamabad44000Pakistan
| | - Tehseen Zia
- 1Department of Computer ScienceCOMSATS University Islamabad (CUI)Islamabad44000Pakistan
| | - Syed Ali Tariq
- 1Department of Computer ScienceCOMSATS University Islamabad (CUI)Islamabad44000Pakistan
| | - Ghufran Ahmed
- 5Department of Computer ScienceNational University of Computer and Emerging Sciences (NUCES)Karachi54700Pakistan
| | - Asma Zahra
- 1Department of Computer ScienceCOMSATS University Islamabad (CUI)Islamabad44000Pakistan
| |
Collapse
|
6
|
Wang H, Li C, Zhang J, Wang J, Ma Y, Lian Y. A new LSTM-based gene expression prediction model: L-GEPM. J Bioinform Comput Biol 2019; 17:1950022. [PMID: 31617459 DOI: 10.1142/s0219720019500227] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Molecular biology combined with in silico machine learning and deep learning has facilitated the broad application of gene expression profiles for gene function prediction, optimal crop breeding, disease-related gene discovery, and drug screening. Although the acquisition cost of genome-wide expression profiles has been steadily declining, the requirement generates a compendium of expression profiles using thousands of samples remains high. The Library of Integrated Network-Based Cellular Signatures (LINCS) program used approximately 1000 landmark genes to predict the expression of the remaining target genes by linear regression; however, this approach ignored the nonlinear features influencing gene expression relationships, limiting the accuracy of the experimental results. We herein propose a gene expression prediction model, L-GEPM, based on long short-term memory (LSTM) neural networks, which captures the nonlinear features affecting gene expression and uses learned features to predict the target genes. By comparing and analyzing experimental errors and fitting the effects of different prediction models, the LSTM neural network-based model, L-GEPM, can achieve low error and a superior fitting effect.
Collapse
Affiliation(s)
- Huiqing Wang
- College of Information and Computer, Taiyuan University of Technology, P. R. China
| | - Chun Li
- College of Information and Computer, Taiyuan University of Technology, P. R. China
| | - Jianhui Zhang
- College of Information and Computer, Taiyuan University of Technology, P. R. China
| | - Jingjing Wang
- College of Information and Computer, Taiyuan University of Technology, P. R. China
| | - Yue Ma
- College of Information and Computer, Taiyuan University of Technology, P. R. China
| | - Yuanyuan Lian
- College of Information and Computer, Taiyuan University of Technology, P. R. China
| |
Collapse
|
7
|
CBN: Constructing a clinical Bayesian network based on data from the electronic medical record. J Biomed Inform 2018; 88:1-10. [DOI: 10.1016/j.jbi.2018.10.007] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 09/17/2018] [Accepted: 10/28/2018] [Indexed: 11/24/2022]
|
8
|
Duarte R, Silva DR, Rendon A, Alves TG, Rabahi MF, Centis R, Kritski A, Migliori GB. Eliminating tuberculosis in Latin America: making it the point. J Bras Pneumol 2018; 44:73-76. [PMID: 29791551 PMCID: PMC6044666 DOI: 10.1590/s1806-37562017000000449] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Affiliation(s)
- Raquel Duarte
- Serviço de Pneumologia, Centro Hospitalar de Vila Nova de Gaia-Espinho, Porto, Portugal
| | - Denise Rossato Silva
- Faculdade de Medicina, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Adrian Rendon
- Centro de Investigación, Prevención y Tratamiento de Infecciones Respiratorias, Hospital Universitario, Universidad Autonoma de Nuevo Leon, Monterrey, México
| | - Tatiana Galvẫo Alves
- Hospital Especializado Octávio Mangabeira, Secretaria de Saúde do Estado da Bahia, Salvador, BA, Brasil
| | | | - Rosella Centis
- WHO Collaborating Centre for TB and Lung Diseases, Fondazione Salvatore Maugeri, Istituto di Ricovero e Cura a Carattere Scientifico, Tradate, Italia
| | - Afrânio Kritski
- Instituto de Doenças do Tórax, Faculdade de Medicina, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Giovanni Battista Migliori
- WHO Collaborating Centre for TB and Lung Diseases, Fondazione Salvatore Maugeri, Istituto di Ricovero e Cura a Carattere Scientifico, Tradate, Italia
| |
Collapse
|
9
|
Disease Diagnosis in Smart Healthcare: Innovation, Technologies and Applications. SUSTAINABILITY 2017. [DOI: 10.3390/su9122309] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
10
|
Caliskan A, Yuksel ME. Classification of coronary artery disease data sets by using a deep neural network. EUROBIOTECH JOURNAL 2017. [DOI: 10.24190/issn2564-615x/2017/04.03] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Abstract
In this study, a deep neural network classifier is proposed for the classification of coronary artery disease medical data sets. The proposed classifier is tested on reference CAD data sets from the literature and also compared with popular representative classification methods regarding its classification performance. Experimental results show that the deep neural network classifier offers much better accuracy, sensitivity and specificity rates when compared with other methods. The proposed method presents itself as an easily accessible and cost-effective alternative to currently existing methods used for the diagnosis of CAD and it can be applied for easily checking whether a given subject under examination has at least one occluded coronary artery or not.
Collapse
Affiliation(s)
- Abdullah Caliskan
- Department of Biomedical Engineering, Erciyes University, Kayseri , Turkey
| | - Mehmet Emin Yuksel
- Department of Biomedical Engineering, Erciyes University, Kayseri , Turkey
| |
Collapse
|