1
|
Hao Y, Lu R, Guo Y, Bao P. Specific association of MTHFD1 expressions with small cell lung cancer development and chemoradiotherapy outcome. Saudi Med J 2024; 45:783-790. [PMID: 39074897 PMCID: PMC11288495 DOI: 10.15537/smj.2024.45.8.20230990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Accepted: 07/04/2024] [Indexed: 07/31/2024] Open
Abstract
OBJECTIVES To identify biomarkers that can discriminated small cell lung cancer (SCLC) from non-SCLC (NSCLC), and explore their association with the prognosis of SCLC under chemoradiotherapy. METHODS The GSE40275 dataset was used to identify potential targets in SCLC. There were 196 patients of lung cancer (LC) in cohort 1 of this study. MTHFD1 levels in tissues were determined by immunohistochemistry assay in cohort 1. Lung cancer patients who were all underwent local chemoradiotherapy (CRT) were included in cohort 2, and the association of MTHFD1 levels with CRT treatment outcome were determined in cohort 2. Cell experiments were used to determine the function of MTHFD1 on the radio-sensitivity of SCLC and NSCLC cells. RESULTS The MTHFD1 levels in LC tissues were increased, and could discriminate SCLC from both lung squamous cell carcinoma (LUSC) and lung adenocarcinoma (LUAD). Small cell lung cancer patients with MTHFD1 high phenotype had a poorer prognosis after CRT treatment, whereas no significant correlation was found between MTHFD1 levels and prognosis in LUSC and LUAD group. Cell experiments demonstrated that overexpression of MTHFD1 increases radio-resistance in both SCLC and NSCLC in vitro. CONCLUSION MTHFD1 expressions might be a novel specifically prognostic biomarker for SCLC and the CRT treatment outcome.
Collapse
Affiliation(s)
- Yujia Hao
- From the Department of Respiratory Medicine Second Ward (Hao), JinanThird People’s Hospital, Jinan, from the Department of Cadre Healthcare/Geriatrics (Lu), Qingdao Hospital, University of Health and Rehabilitation Sciences (Qingdao Municipal Hospital), Qingdao, from the Hebei North University (Guo), Zhangjiakou, and from the Department of Pulmonary and Critical Care Medicine (Bao), The Eighth Medical Center of Chinese PLA General Hospital, Beijing, China.
| | - Ruichun Lu
- From the Department of Respiratory Medicine Second Ward (Hao), JinanThird People’s Hospital, Jinan, from the Department of Cadre Healthcare/Geriatrics (Lu), Qingdao Hospital, University of Health and Rehabilitation Sciences (Qingdao Municipal Hospital), Qingdao, from the Hebei North University (Guo), Zhangjiakou, and from the Department of Pulmonary and Critical Care Medicine (Bao), The Eighth Medical Center of Chinese PLA General Hospital, Beijing, China.
| | - Ying Guo
- From the Department of Respiratory Medicine Second Ward (Hao), JinanThird People’s Hospital, Jinan, from the Department of Cadre Healthcare/Geriatrics (Lu), Qingdao Hospital, University of Health and Rehabilitation Sciences (Qingdao Municipal Hospital), Qingdao, from the Hebei North University (Guo), Zhangjiakou, and from the Department of Pulmonary and Critical Care Medicine (Bao), The Eighth Medical Center of Chinese PLA General Hospital, Beijing, China.
| | - Pengtao Bao
- From the Department of Respiratory Medicine Second Ward (Hao), JinanThird People’s Hospital, Jinan, from the Department of Cadre Healthcare/Geriatrics (Lu), Qingdao Hospital, University of Health and Rehabilitation Sciences (Qingdao Municipal Hospital), Qingdao, from the Hebei North University (Guo), Zhangjiakou, and from the Department of Pulmonary and Critical Care Medicine (Bao), The Eighth Medical Center of Chinese PLA General Hospital, Beijing, China.
| |
Collapse
|
2
|
Krishnamoorthy L, Lakshmanan VR. Groundwater quality assessment using machine learning models: a comprehensive study on the industrial corridor of a semi-arid region. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024:10.1007/s11356-024-34119-7. [PMID: 38963621 DOI: 10.1007/s11356-024-34119-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 06/21/2024] [Indexed: 07/05/2024]
Abstract
Water plays a significant role in sustaining the lives of humans and other living organisms. Groundwater quality analysis has become inevitable, because of increased contamination of water resources and global warming. This study used machine learning (ML) models to predict the water quality index (WQI) and water quality classification (WQC). Forty groundwater samples were collected near the Ranipet industrial corridor, and the hydrogeochemistry and heavy metal contamination were analyzed. WQC prediction employed random forest (RF), gradient boosting (GB), decision tree (DT), and K-nearest neighbor (KNN) models, and WQI prediction used extreme gradient boosting (XGBoost), support vector regressor (SVR), RF, and multi-layer perceptron (MLP) models. The grid search method is used to evaluate the ML model by F1 score, accuracy, recall, precision, and Matthews correlation coefficient (MCC) for WQC and the coefficient of determination (R2), mean absolute error (MAE), mean square error (MSE), and median absolute percentage error (MAPE) for WQI. The WQI results indicate that the groundwater quality of the study area is very poor and unsuitable for drinking or irrigation purposes. The performance metrics of the RF model excelled in predicting both WQC (accuracy = 97%) and WQI (R2 = 91.0%), outperforming other models and emphasizing ML's superiority in groundwater quality assessment. The findings suggest that ML models perform well and yield better accuracy than conventional techniques used in groundwater quality assessment studies.
Collapse
|
3
|
Mendapara K. Development and evaluation of a chronic kidney disease risk prediction model using random forest. Front Genet 2024; 15:1409755. [PMID: 38993480 PMCID: PMC11236722 DOI: 10.3389/fgene.2024.1409755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 05/29/2024] [Indexed: 07/13/2024] Open
Abstract
This research aims to advance the detection of Chronic Kidney Disease (CKD) through a novel gene-based predictive model, leveraging recent breakthroughs in gene sequencing. We sourced and merged gene expression profiles of CKD-affected renal tissues from the Gene Expression Omnibus (GEO) database, classifying them into two sets for training and validation in a 7:3 ratio. The training set included 141 CKD and 33 non-CKD specimens, while the validation set had 60 and 14, respectively. The disease risk prediction model was constructed using the training dataset, while the validation dataset confirmed the model's identification capabilities. The development of our predictive model began with evaluating differentially expressed genes (DEGs) between the two groups. We isolated six genes using Lasso and random forest (RF) methods-DUSP1, GADD45B, IFI44L, IFI30, ATF3, and LYZ-which are critical in differentiating CKD from non-CKD tissues. We refined our random forest (RF) model through 10-fold cross-validation, repeated five times, to optimize the mtry parameter. The performance of our model was robust, with an average AUC of 0.979 across the folds, translating to a 91.18% accuracy. Validation tests further confirmed its efficacy, with a 94.59% accuracy and an AUC of 0.990. External validation using dataset GSE180394 yielded an AUC of 0.913, 89.83% accuracy, and a sensitivity rate of 0.889, underscoring the model's reliability. In summary, the study identified critical genetic biomarkers and successfully developed a novel disease risk prediction model for CKD. This model can serve as a valuable tool for CKD disease risk assessment and contribute significantly to CKD identification.
Collapse
Affiliation(s)
- Krish Mendapara
- Faculty of Health Sciences, Queen's University, Kingston, ON, Canada
| |
Collapse
|
4
|
Ma L, Gao Y, Huo Y, Tian T, Hong G, Li H. Integrated analysis of diverse cancer types reveals a breast cancer-specific serum miRNA biomarker through relative expression orderings analysis. Breast Cancer Res Treat 2024; 204:475-484. [PMID: 38191685 PMCID: PMC10959809 DOI: 10.1007/s10549-023-07208-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 11/29/2023] [Indexed: 01/10/2024]
Abstract
PURPOSE Serum microRNA (miRNA) holds great potential as a non-invasive biomarker for diagnosing breast cancer (BrC). However, most diagnostic models rely on the absolute expression levels of miRNAs, which are susceptible to batch effects and challenging for clinical transformation. Furthermore, current studies on liquid biopsy diagnostic biomarkers for BrC mainly focus on distinguishing BrC patients from healthy controls, needing more specificity assessment. METHODS We collected a large number of miRNA expression data involving 8465 samples from GEO, including 13 different cancer types and non-cancer controls. Based on the relative expression orderings (REOs) of miRNAs within each sample, we applied the greedy, LASSO multiple linear regression, and random forest algorithms to identify a qualitative biomarker specific to BrC by comparing BrC samples to samples of other cancers as controls. RESULTS We developed a BrC-specific biomarker called 7-miRPairs, consisting of seven miRNA pairs. It demonstrated comparable classification performance in our analyzed machine learning algorithms while requiring fewer miRNA pairs, accurately distinguishing BrC from 12 other cancer types. The diagnostic performance of 7-miRPairs was favorable in the training set (accuracy = 98.47%, specificity = 98.14%, sensitivity = 99.25%), and similar results were obtained in the test set (accuracy = 97.22%, specificity = 96.87%, sensitivity = 98.02%). KEGG pathway enrichment analysis of the 11 miRNAs within the 7-miRPairs revealed significant enrichment of target mRNAs in pathways associated with BrC. CONCLUSION Our study provides evidence that utilizing serum miRNA pairs can offer significant advantages for BrC-specific diagnosis in clinical practice by directly comparing serum samples with BrC to other cancer types.
Collapse
Affiliation(s)
- Liyuan Ma
- School of Public Health and Health Management, Gannan Medical University, Ganzhou, 341000, China
| | - Yaru Gao
- School of Public Health and Health Management, Gannan Medical University, Ganzhou, 341000, China
| | - Yue Huo
- School of Public Health and Health Management, Gannan Medical University, Ganzhou, 341000, China
| | - Tian Tian
- School of Medical Information Engineering, Gannan Medical University, Ganzhou, 341000, China
| | - Guini Hong
- School of Medical Information Engineering, Gannan Medical University, Ganzhou, 341000, China.
| | - Hongdong Li
- School of Medical Information Engineering, Gannan Medical University, Ganzhou, 341000, China.
| |
Collapse
|
5
|
Chakraborty N, Lawrence A, Campbell R, Yang R, Hammamieh R. Biomarker discovery process at binomial decision point (2BDP): Analytical pipeline to construct biomarker panel. Comput Struct Biotechnol J 2023; 21:4729-4742. [PMID: 37822559 PMCID: PMC10562676 DOI: 10.1016/j.csbj.2023.09.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 09/21/2023] [Accepted: 09/21/2023] [Indexed: 10/13/2023] Open
Abstract
A clinical incident is typically manifested by several molecular events; therefore, it seems logical that a successful diagnosis, prognosis, or stratification of a clinical landmark require multiple biomarkers. In this report, we presented a machine learning pipeline, namely "Biomarker discovery process at binomial decision point" (2BDP) that took an integrative approach in systematically curating independent variables (e.g., multiple molecular markers) to explain an output variable (e.g., clinical landmark) of binary in nature. In a logical sequence, 2BDP includes feature selection, unsupervised model development and cross validation. In the present work, the efficiency of 2BDP was demonstrated by finding three biomarker panels that independently explained three stages of Alzheimer's disease (AD) marked as Braak stages I, II and III, respectively. We designed three assortments from the entire cohort based on these Braak stages; subsequently, each assortment was split into two populations at Braak score I, II or III. 2BDP systematically integrated random forest and logistic regression fitting model to find biomarker panels with minimum features that explained these three assortments, e.g., significantly differentiated two populations segregated by Braak stage I, II or III, respectively. Thereafter, the efficacies of these panels were measured by the area under the curve (AUC) values of the receiver operating characteristic (ROC) plot. The AUC-ROC was calculated by two cross-validation methods. Final set of gene markers was a mix of novel and a priori established AD signatures. These markers were weighted by unique coefficients and linearly connected in a group of 2-10 to explain Braak stage I, II or III by AUC ≥ 0.8. Small sample size and a lack of distinctly recruited Training and Test sets were the limitations of the present undertaking; yet 2BDP demonstrated its capability to curate a panel of optimum numbers of biomarkers to describe the outcome variable with high efficacy.
Collapse
Affiliation(s)
- Nabarun Chakraborty
- Medical Readiness Systems Biology, Center for Military Psychiatry and Neuroscience (CMPN), Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Alexander Lawrence
- Medical Readiness Systems Biology, Center for Military Psychiatry and Neuroscience (CMPN), Walter Reed Army Institute of Research, Silver Spring, MD, USA
- ORISE, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Ross Campbell
- Medical Readiness Systems Biology, Center for Military Psychiatry and Neuroscience (CMPN), Walter Reed Army Institute of Research, Silver Spring, MD, USA
- Geneva Foundation, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Ruoting Yang
- Medical Readiness Systems Biology, Center for Military Psychiatry and Neuroscience (CMPN), Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Rasha Hammamieh
- Medical Readiness Systems Biology, Center for Military Psychiatry and Neuroscience (CMPN), Walter Reed Army Institute of Research, Silver Spring, MD, USA
| |
Collapse
|
6
|
Restrepo JC, Dueñas D, Corredor Z, Liscano Y. Advances in Genomic Data and Biomarkers: Revolutionizing NSCLC Diagnosis and Treatment. Cancers (Basel) 2023; 15:3474. [PMID: 37444584 DOI: 10.3390/cancers15133474] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 06/23/2023] [Accepted: 06/27/2023] [Indexed: 07/15/2023] Open
Abstract
Non-small cell lung cancer (NSCLC) is a significant public health concern with high mortality rates. Recent advancements in genomic data, bioinformatics tools, and the utilization of biomarkers have improved the possibilities for early diagnosis, effective treatment, and follow-up in NSCLC. Biomarkers play a crucial role in precision medicine by providing measurable indicators of disease characteristics, enabling tailored treatment strategies. The integration of big data and artificial intelligence (AI) further enhances the potential for personalized medicine through advanced biomarker analysis. However, challenges remain in the impact of new biomarkers on mortality and treatment efficacy due to limited evidence. Data analysis, interpretation, and the adoption of precision medicine approaches in clinical practice pose additional challenges and emphasize the integration of biomarkers with advanced technologies such as genomic data analysis and artificial intelligence (AI), which enhance the potential of precision medicine in NSCLC. Despite these obstacles, the integration of biomarkers into precision medicine has shown promising results in NSCLC, improving patient outcomes and enabling targeted therapies. Continued research and advancements in biomarker discovery, utilization, and evidence generation are necessary to overcome these challenges and further enhance the efficacy of precision medicine. Addressing these obstacles will contribute to the continued improvement of patient outcomes in non-small cell lung cancer.
Collapse
Affiliation(s)
- Juan Carlos Restrepo
- Grupo de Investigación en Salud Integral (GISI), Departamento Facultad de Salud, Universidad Santiago de Cali, Cali 760035, Colombia
| | - Diana Dueñas
- Grupo de Investigación en Salud Integral (GISI), Departamento Facultad de Salud, Universidad Santiago de Cali, Cali 760035, Colombia
| | - Zuray Corredor
- Grupo de Investigaciones en Odontología (GIOD), Facultad de Odontología, Universidad Cooperativa de Colombia, Pasto 520002, Colombia
- Facultad de Salud, Departamento de Ciencias Básicas, Universidad Libre, Cali 760026, Colombia
| | - Yamil Liscano
- Grupo de Investigación en Salud Integral (GISI), Departamento Facultad de Salud, Universidad Santiago de Cali, Cali 760035, Colombia
| |
Collapse
|