1
|
Krishnan P, Bobak CA, Hill JE. Sex-specific blood-derived RNA biomarkers for childhood tuberculosis. Sci Rep 2024; 14:16859. [PMID: 39039071 PMCID: PMC11263679 DOI: 10.1038/s41598-024-66946-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Accepted: 07/05/2024] [Indexed: 07/24/2024] Open
Abstract
Confirmatory diagnosis of childhood tuberculosis (TB) remains a challenge mainly due to its dependence on sputum samples and the paucibacillary nature of the disease. Thus, only ~ 30% of suspected cases in children are diagnosed and the need for minimally invasive, non-sputum-based biomarkers remains unmet. Understanding host molecular changes by measuring blood-based transcriptomic markers has shown promise as a diagnostic tool for TB. However, the implication of sex contributing to disease heterogeneity and therefore diagnosis remains to be understood. Using publicly available gene expression data (GSE39939, GSE39940; n = 370), we report a sex-specific RNA biomarker signature that could improve the diagnosis of TB disease in children. We found four gene biomarker signatures for male (SLAMF8, GBP2, WARS, and FCGR1C) and female pediatric patients (GBP6, CELSR3, ALDH1A1, and GBP4) from Kenya, South Africa, and Malawi. Both signatures achieved a sensitivity of 85% and a specificity of 70%, which approaches the WHO-recommended target product profile for a triage test. Our gene signatures outperform most other gene signatures reported previously for childhood TB diagnosis.
Collapse
Affiliation(s)
- Preethi Krishnan
- Department of Chemical and Biological Engineering, University of British Columbia, Vancouver, V6T 1Z3, Canada
| | - Carly A Bobak
- Department of Biomedical Data Science, Dartmouth College, Hanover, NH, 03755, USA
| | - Jane E Hill
- Department of Chemical and Biological Engineering, University of British Columbia, Vancouver, V6T 1Z3, Canada.
| |
Collapse
|
2
|
Balakrishnan V, Kehrabi Y, Ramanathan G, Paul SA, Tiong CK. Machine learning approaches in diagnosing tuberculosis through biomarkers - A systematic review. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2023; 179:16-25. [PMID: 36931609 DOI: 10.1016/j.pbiomolbio.2023.03.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 02/25/2023] [Accepted: 03/09/2023] [Indexed: 03/17/2023]
Abstract
Biomarker-based tests may facilitate Tuberculosis (TB) diagnosis, accelerate treatment initiation, and thus improve outcomes. This review synthesizes the literature on biomarker-based detection for TB diagnosis using machine learning. The systematic review approach follows the PRISMA guideline. Articles were sought using relevant keywords from Web of Science, PubMed, and Scopus, resulting in 19 eligible studies after a meticulous screening. All the studies were found to have focused on the supervised learning approach, with Support Vector Machine (SVM) and Random Forest emerging as the top two algorithms, with the highest accuracy, sensitivity and specificity reported to be 97.0%, 99.2%, and 98.0%, respectively. Further, protein-based biomarkers were widely explored, followed by gene-based such as RNA sequence and, Spoligotypes. Publicly available datasets were observed to be popularly used by the studies reviewed whilst studies targeting specific cohorts such as HIV patients or children gathering their own data from healthcare facilities, leading to smaller datasets. Of these, most studies used the leave one out cross validation technique to mitigate overfitting. The review shows that machine learning is increasingly assessed in research to improve TB diagnosis through biomarkers, as promising results were shown in terms of model's detection performance. This provides insights on the possible application of machine learning approaches to diagnose TB using biomarkers as opposed to the traditional methods that can be time consuming. Low-middle income settings, where access to basic biomarkers could be provided as compared to sputum-based tests that are not always available, could be a major application of such models.
Collapse
Affiliation(s)
- Vimala Balakrishnan
- Faculty of Computer Science and Information Technology, Universiti Malaya, 50603, Kuala Lumpur, Malaysia.
| | - Yousra Kehrabi
- Department of Infectious Diseases, Hôpital Bichat-Claude Bernard, Assistance Publique Hôpitaux de Paris, Paris, France
| | - Ghayathri Ramanathan
- Faculty of Computer Science and Information Technology, Universiti Malaya, 50603, Kuala Lumpur, Malaysia
| | - Scott Arjay Paul
- School of Biosciences, Faculty of Health and Medical Sciences, Taylor's University, Subang Jaya, Malaysia
| | - Chiong Kian Tiong
- Faculty of Medicine, Universiti Malaya, 50603, Kuala Lumpur, Malaysia
| |
Collapse
|
3
|
Koul A, Bawa RK, Kumar Y. Artificial Intelligence Techniques to Predict the Airway Disorders Illness: A Systematic Review. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING : STATE OF THE ART REVIEWS 2022; 30:831-864. [PMID: 36189431 PMCID: PMC9516534 DOI: 10.1007/s11831-022-09818-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 09/04/2022] [Indexed: 06/16/2023]
Abstract
Airway disease is a major healthcare issue that causes at least 3 million fatalities every year. It is also considered one of the foremost causes of death all around the globe by 2030. Numerous studies have been undertaken to demonstrate the latest advances in artificial intelligence algorithms to assist in identifying and classifying these diseases. This comprehensive review aims to summarise the state-of-the-art machine and deep learning-based systems for detecting airway disorders, envisage the trends of the recent work in this domain, and analyze the difficulties and potential future paths. This systematic literature review includes the study of one hundred fifty-five articles on airway diseases such as cystic fibrosis, emphysema, lung cancer, Mesothelioma, covid-19, pneumoconiosis, asthma, pulmonary edema, tuberculosis, pulmonary embolism as well as highlights the automated learning techniques to predict them. The study concludes with a discussion and challenges about expanding the efficiency and machine and deep learning-assisted airway disease detection applications.
Collapse
Affiliation(s)
- Apeksha Koul
- Department of Computer Science and Engineering, Punjabi University, Patiala, Punjab India
| | - Rajesh K. Bawa
- Department of Computer Science, Punjabi University, Patiala, Punjab India
| | - Yogesh Kumar
- Department of Computer Science and Engineering, School of Technology, Pandit Deendayal Energy University, Gandhinagar, Gujarat India
| |
Collapse
|
4
|
Orjuela-Cañón AD, Jutinico AL, Awad C, Vergara E, Palencia A. Machine learning in the loop for tuberculosis diagnosis support. Front Public Health 2022; 10:876949. [PMID: 35958865 PMCID: PMC9362992 DOI: 10.3389/fpubh.2022.876949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 06/30/2022] [Indexed: 11/13/2022] Open
Abstract
The use of machine learning (ML) for diagnosis support has advanced in the field of health. In the present paper, the results of studying ML techniques in a tuberculosis diagnosis loop in a scenario of limited resources are presented. Data are analyzed using a tuberculosis (TB) therapy program at a health institution in a main city of a developing country using five ML models. Logistic regression, classification trees, random forest, support vector machines, and artificial neural networks are trained under physician supervision following physicians' typical daily work. The models are trained on seven main variables collected when patients arrive at the facility. Additionally, the variables applied to train the models are analyzed, and the models' advantages and limitations are discussed in the context of the automated ML techniques. The results show that artificial neural networks obtain the best results in terms of accuracy, sensitivity, and area under the receiver operating curve. These results represent an improvement over smear microscopy, which is commonly used techniques to detect TB for special cases. Findings demonstrate that ML in the TB diagnosis loop can be reinforced with available data to serve as an alternative diagnosis tool based on data processing in places where the health infrastructure is limited.
Collapse
Affiliation(s)
| | | | - Carlos Awad
- Subred Integrada de Servicios de Salud Centro Oriente E.S.E, Bogotá, Colombia
| | - Erika Vergara
- Biomedical Engineering, Universidad Antonio Nariño, Bogotá, Colombia
| | - Angélica Palencia
- Subred Integrada de Servicios de Salud Centro Oriente E.S.E, Bogotá, Colombia
| |
Collapse
|
5
|
Salem A, Khanfar E, Nagy S, Széchenyi A. Cocrystals of tuberculosis antibiotics: Challenges and missed opportunities. Int J Pharm 2022; 623:121924. [PMID: 35738333 DOI: 10.1016/j.ijpharm.2022.121924] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 05/26/2022] [Accepted: 06/13/2022] [Indexed: 01/10/2023]
Abstract
Cocrystals have been extensively used to improve the physicochemical properties and bioavailability of active pharmaceutical ingredients. Cocrystals of anti-tuberculosis medications are among those commonly reported. This review provides a summary of the tuberculosis antibiotic cocrystals reported in the literature, providing the main results on current tuberculosis medications utilized in cocrystals. Moreover, anti-tuberculosis cocrystals limitations and advantages are described, including evidence for enhanced solubility, stability and effect. Opportunities to enhance anti-tuberculosis medications and fixed dose combinations using cocrystals are given. Several cocrystal pairs are suggested to enhance the effectiveness of anti-tuberculosis drugs.
Collapse
Affiliation(s)
- Ala' Salem
- Institute of Pharmaceutical Technology and Biopharmacy, Faculty of Pharmacy, University of Pécs, Pécs, Hungary.
| | - Esam Khanfar
- Department of Immunology and Biotechnology, Medical School, University of Pécs, Pécs, Hungary
| | - Sándor Nagy
- Institute of Pharmaceutical Technology and Biopharmacy, Faculty of Pharmacy, University of Pécs, Pécs, Hungary
| | - Aleksandar Széchenyi
- Institute of Pharmaceutical Technology and Biopharmacy, Faculty of Pharmacy, University of Pécs, Pécs, Hungary; Department of Chemistry, Josip Juraj Strossmayer University of Osijek, Osijek, Croatia
| |
Collapse
|
6
|
AIM in Medical Informatics. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_32] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
7
|
|
8
|
Zhang Y, Patil P, Johnson WE, Parmigiani G. Robustifying genomic classifiers to batch effects via ensemble learning. Bioinformatics 2021; 37:1521-1527. [PMID: 33245114 DOI: 10.1093/bioinformatics/btaa986] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2020] [Revised: 10/20/2020] [Accepted: 11/13/2020] [Indexed: 01/08/2023] Open
Abstract
MOTIVATION Genomic data are often produced in batches due to practical restrictions, which may lead to unwanted variation in data caused by discrepancies across batches. Such 'batch effects' often have negative impact on downstream biological analysis and need careful consideration. In practice, batch effects are usually addressed by specifically designed software, which merge the data from different batches, then estimate batch effects and remove them from the data. Here, we focus on classification and prediction problems, and propose a different strategy based on ensemble learning. We first develop prediction models within each batch, then integrate them through ensemble weighting methods. RESULTS We provide a systematic comparison between these two strategies using studies targeting diverse populations infected with tuberculosis. In one study, we simulated increasing levels of heterogeneity across random subsets of the study, which we treat as simulated batches. We then use the two methods to develop a genomic classifier for the binary indicator of disease status. We evaluate the accuracy of prediction in another independent study targeting a different population cohort. We observed that in independent validation, while merging followed by batch adjustment provides better discrimination at low level of heterogeneity, our ensemble learning strategy achieves more robust performance, especially at high severity of batch effects. These observations provide practical guidelines for handling batch effects in the development and evaluation of genomic classifiers. AVAILABILITY AND IMPLEMENTATION The data underlying this article are available in the article and in its online supplementary material. Processed data is available in the Github repository with implementation code, at https://github.com/zhangyuqing/bea_ensemble. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yuqing Zhang
- Clinical Bioinformatics, Gilead Sciences, Inc., Foster City, CA 94404, USA
| | - Prasad Patil
- Department of Biostatistics, Boston University School of Public Health, Boston, MA 02118, USA
| | - W Evan Johnson
- Department of Biostatistics, Boston University School of Public Health, Boston, MA 02118, USA.,Division of Computational Biomedicine, Boston University School of Medicine, Boston, MA 02118, USA
| | - Giovanni Parmigiani
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA 02215, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| |
Collapse
|
9
|
Synthesis and Characterization of Nano-Sized 4-Aminosalicylic Acid-Sulfamethazine Cocrystals. Pharmaceutics 2021; 13:pharmaceutics13020277. [PMID: 33669489 PMCID: PMC7923100 DOI: 10.3390/pharmaceutics13020277] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 02/09/2021] [Accepted: 02/16/2021] [Indexed: 11/26/2022] Open
Abstract
Drug–drug cocrystals are formulated to produce combined medication, not just to modulate active pharmaceutical ingredient (API) properties. Nano-crystals adjust the pharmacokinetic properties and enhance the dissolution of APIs. Nano-cocrystals seem to enhance API properties by combining the benefits of both technologies. Despite the promising opportunities of nano-sized cocrystals, the research at the interface of nano-technology and cocrystals has, however, been described to be in its infancy. In this study, high-pressure homogenization (HPH) and high-power ultrasound were used to prepare nano-sized cocrystals of 4-aminosalysilic acid and sulfamethazine in order to establish differences between the two methods in terms of cocrystal size, morphology, polymorphic form, and dissolution rate enhancement. It was found that both methods resulted in the formation of form I cocrystals with a high degree of crystallinity. HPH yielded nano-sized cocrystals, while those prepared by high-power ultrasound were in the micro-size range. Furthermore, HPH produced smaller-size cocrystals with a narrow size distribution when a higher pressure was used. Cocrystals appeared to be needle-like when prepared by HPH compared to those prepared by high-power ultrasound, which had a different morphology. The highest dissolution enhancement was observed in cocrystals prepared by HPH; however, both micro- and nano-sized cocrystals enhanced the dissolution of sulfamethazine.
Collapse
|
10
|
Bruno P, Calimeri F, Greco G. AIM in Medical Informatics. Artif Intell Med 2021. [DOI: 10.1007/978-3-030-58080-3_32-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
11
|
López-Martínez F, Núñez-Valdez ER, Crespo RG, García-Díaz V. An artificial neural network approach for predicting hypertension using NHANES data. Sci Rep 2020; 10:10620. [PMID: 32606434 PMCID: PMC7327031 DOI: 10.1038/s41598-020-67640-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2019] [Accepted: 06/09/2020] [Indexed: 02/07/2023] Open
Abstract
This paper focus on a neural network classification model to estimate the association among gender, race, BMI, age, smoking, kidney disease and diabetes in hypertensive patients. It also shows that artificial neural network techniques applied to large clinical data sets may provide a meaningful data-driven approach to categorize patients for population health management, and support in the control and detection of hypertensive patients, which is part of the critical factors for diseases of the heart. Data was obtained from the National Health and Nutrition Examination Survey from 2007 to 2016. This paper utilized an imbalanced data set of 24,434 with (69.71%) non-hypertensive patients, and (30.29%) hypertensive patients. The results indicate a sensitivity of 40%, a specificity of 87%, precision of 57.8% and a measured AUC of 0.77 (95% CI [75.01-79.01]). This paper showed results that are to some degree more effectively than a previous study performed by the authors using a statistical model with similar input features that presents a calculated AUC of 0.73. This classification model can be used as an inference agent to assist the professionals in diseases of the heart field, and can be implemented in applications to assist population health management programs in identifying patients with high risk of developing hypertension.
Collapse
Affiliation(s)
- Fernando López-Martínez
- Department of Computer Science, Oviedo University, C/ Federico Garca Lorca, 33007, Oviedo, Spain
- Sanitas, 8400 NW 33rd St, Doral, FL, 33122, USA
| | | | - Rubén González Crespo
- Department of Computer Science and Technology, Universidad Internacional de La Rioja, Av. de la Paz, 137, 26006, Logroño, La Rioja, Spain.
| | - Vicente García-Díaz
- Department of Computer Science, Oviedo University, C/ Federico Garca Lorca, 33007, Oviedo, Spain
| |
Collapse
|