Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Azadifar S, Rostami M, Berahmand K, Moradi P, Oussalah M. Graph-based relevancy-redundancy gene selection method for cancer diagnosis. Comput Biol Med 2022;147:105766. [PMID: 35779479 DOI: 10.1016/j.compbiomed.2022.105766] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 04/10/2022] [Revised: 06/12/2022] [Accepted: 06/18/2022] [Indexed: 11/26/2022]

Cited by Other Article(s)

Roy S, Singh J, Ray SS. Weighted Combination of Łukasiewicz implication and Fuzzy Jaccard similarity in Hybrid Ensemble Framework (WCLFJHEF) for Gene Selection. Comput Biol Med 2024;170:107981. [PMID: 38262204 DOI: 10.1016/j.compbiomed.2024.107981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/25/2023] [Revised: 01/02/2024] [Accepted: 01/12/2024] [Indexed: 01/25/2024]

Abstract

A framework is developed for gene expression analysis by introducing fuzzy Jaccard similarity (FJS) and combining Łukasiewicz implication with it through weights in a hybrid ensemble framework for gene selection in cancer. The method is called weighted combination of Łukasiewicz implication and fuzzy Jaccard similarity in hybrid ensemble framework (WCLFJHEF). While the fuzziness in Jaccard similarity is incorporated by using the existing Gödel fuzzy logic, the weights are obtained by maximizing the average F-score of selected genes in classifying the cancer patients. The patients are first divided into different clusters, based on the number of patient groups, using average linkage agglomerative clustering and a new score, called WCLFJ (weighted combination of Łukasiewicz implication and fuzzy Jaccard similarity). The genes are then selected from each cluster separately using filter-based Relief-F and wrapper-based SVMRFE (Support Vector Machine with Recursive Feature Elimination). A gene (feature) pool is created by considering the union of selected features for all the clusters. A set of informative genes is selected from the pool using the sequential backward floating search (SBFS) algorithm. Patients are then classified using Naïve Bayes (NB) and Support Vector Machine (SVM) separately, using the selected genes, and the related F-scores are calculated. The weights in WCLFJ are then updated iteratively to maximize the average F-score obtained from the results of the classifier. The effectiveness of WCLFJHEF is demonstrated on six gene expression datasets. The average values of accuracy, F-score, recall, precision and MCC over all the datasets are 95%, 94%, 94%, 94%, and 90%, respectively. The explainability of the selected genes is shown using SHapley Additive exPlanations (SHAP) values, and this information is further used to rank them. The relevance of the selected gene set is biologically validated using the KEGG Pathway, Gene Ontology (GO), and existing literature. It is seen that the genes that are selected by WCLFJHEF are candidates for genomic alterations in the various cancer types. The source code of WCLFJHEF is available at http://www.isical.ac.in/~shubhra/WCLFJHEF.html.
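
The WCLFJ score described in the abstract above can be illustrated with a minimal sketch. It assumes expression profiles already scaled to [0, 1], uses Gödel min/max as the fuzzy intersection and union, and averages a Łukasiewicz bi-implication over genes. The function names, the averaging step, and the single weight `w` are assumptions made for illustration and are not taken from the authors' published definitions or source code.

```python
from typing import Sequence


def fuzzy_jaccard(x: Sequence[float], y: Sequence[float]) -> float:
    """Fuzzy Jaccard similarity: Goedel min as intersection, max as union."""
    intersection = sum(min(a, b) for a, b in zip(x, y))
    union = sum(max(a, b) for a, b in zip(x, y))
    # Convention chosen here: two all-zero profiles count as identical.
    return 1.0 if union == 0 else intersection / union


def lukasiewicz_implication(a: float, b: float) -> float:
    """Lukasiewicz implication I(a, b) = min(1, 1 - a + b) on [0, 1]."""
    return min(1.0, 1.0 - a + b)


def lukasiewicz_similarity(x: Sequence[float], y: Sequence[float]) -> float:
    """Mean of min(I(a, b), I(b, a)) over genes (assumed aggregation)."""
    total = sum(
        min(lukasiewicz_implication(a, b), lukasiewicz_implication(b, a))
        for a, b in zip(x, y)
    )
    return total / len(x)


def wclfj_score(x: Sequence[float], y: Sequence[float], w: float) -> float:
    """Weighted combination of the two similarities, with w in [0, 1]."""
    return w * lukasiewicz_similarity(x, y) + (1.0 - w) * fuzzy_jaccard(x, y)
```

In the abstract the weights are tuned iteratively to maximize the average F-score of the downstream classifiers; in this sketch `w` is simply passed in by the caller.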

Yang J, Shu L, Han M, Pan J, Chen L, Yuan T, Tan L, Shu Q, Duan H, Li H. RDmaster: A novel phenotype-oriented dialogue system supporting differential diagnosis of rare disease. Comput Biol Med 2024;169:107924. [PMID: 38181610 DOI: 10.1016/j.compbiomed.2024.107924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/05/2023] [Revised: 12/18/2023] [Accepted: 01/01/2024] [Indexed: 01/07/2024]

Osama S, Ali M, Ali AA, Shaban H. Gene selection and tumor identification based on a hybrid of the multi-filter embedded recursive mountain gazelle algorithm. Comput Biol Med 2023;167:107674. [PMID: 37976816 DOI: 10.1016/j.compbiomed.2023.107674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/02/2023] [Revised: 10/09/2023] [Accepted: 11/06/2023] [Indexed: 11/19/2023]

Abstract

Microarray gene expression data are useful for identifying gene expression patterns associated with cancer outcomes; however, their high dimensionality makes it difficult to extract meaningful information and accurately classify tumors. Hence, developing effective methods for reducing dimensionality while preserving relevant information is a crucial task. Hybrid-based gene selection methods are widely proposed in the gene expression analysis domain and can still be enhanced in terms of efficiency and reliability. This study proposes a new hybrid-based gene selection method, called multi-filter embedded mountain gazelle optimizer (MUL-MGO), which utilizes two filters and an embedded method to remove irrelevant genes, followed by selecting the most relevant genes using the recently developed MGO algorithm. To the best of our knowledge, this is the first work to exploit MGO as a gene or feature selection method. A new version of MGO, called recursive mountain gazelle optimizer (RMGO), which applies the MGO algorithm recursively to avoid local optima, minimize search space, and obtain minimum gene count without decreasing the classifier's performance, is developed. The proposed RMGO is used to develop a new hybrid gene selection method employing similar filters and embedded methods as MUL-MGO, but with a recursive MGO algorithm version. The resulting method is called multi-filter embedded recursive mountain gazelle optimizer (MUL-RMGO). Several classifiers are used for cancer classification. Accordingly, several experimental studies are performed on eight microarray gene expression datasets to demonstrate the proficiencies of the MUL-MGO and MUL-RMGO methods. The experimental findings indicate the efficiency and productivity of the suggested MUL-MGO and MUL-RMGO methods for gene selection. The methods outperform cutting-edge methods in the literature, with MUL-RMGO exceeding MUL-MGO in terms of accuracy and selected gene count.

Moslemi A, Ahmadian A. Dual regularized subspace learning using adaptive graph learning and rank constraint: Unsupervised feature selection on gene expression microarray datasets. Comput Biol Med 2023;167:107659. [PMID: 37950946 DOI: 10.1016/j.compbiomed.2023.107659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/05/2023] [Revised: 10/13/2023] [Accepted: 10/31/2023] [Indexed: 11/13/2023]

Yang J, Hussein Kadir D. Data mining techniques in breast cancer diagnosis at the cellular-molecular level. J Cancer Res Clin Oncol 2023;149:12605-12620. [PMID: 37442866 DOI: 10.1007/s00432-023-05090-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/20/2023] [Accepted: 06/30/2023] [Indexed: 07/15/2023]

Schürmeyer L, Schorning K, Rahnenführer J. Designs for the simultaneous inference of concentration-response curves. BMC Bioinformatics 2023;24:393. [PMID: 37858091 PMCID: PMC10588042 DOI: 10.1186/s12859-023-05526-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/06/2023] [Accepted: 10/09/2023] [Indexed: 10/21/2023] Open

Abstract

BACKGROUND

An important problem in toxicology in the context of gene expression data is the simultaneous inference of a large number of concentration-response relationships. The quality of the inference substantially depends on the choice of design of the experiments, in particular, on the set of different concentrations, at which observations are taken for the different genes under consideration. As this set has to be the same for all genes, the efficient planning of such experiments is very challenging. We address this problem by determining efficient designs for the simultaneous inference of a large number of concentration-response models. For that purpose, we both construct a D-optimality criterion for simultaneous inference and a K-means procedure which clusters the support points of the locally D-optimal designs of the individual models.

RESULTS

We show that a planning of experiments that addresses the simultaneous inference of a large number of concentration-response relationships yields a substantially more accurate statistical analysis. In particular, we compare the performance of the constructed designs to the ones of other commonly used designs in terms of D-efficiencies and in terms of the quality of the resulting model fits using a real data example dealing with valproic acid. For the quality comparison we perform an extensive simulation study.

CONCLUSIONS

The design maximizing the D-optimality criterion for simultaneous inference improves the inference of the different concentration-response relationships substantially. The design based on the K-means procedure also performs well, whereas a log-equidistant design, which was also included in the analysis, performs poorly in terms of the quality of the simultaneous inference. Based on our findings, the D-optimal design for simultaneous inference should be used for upcoming analyses dealing with high-dimensional gene expression data.
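
The D-efficiency comparison mentioned in this abstract can be sketched on a toy case. The example below assumes a straight-line model f(x) = (1, x) on the interval [0, 1] as a stand-in for the nonlinear concentration-response models of the study; the function names are hypothetical. For this toy model the D-optimal design puts equal weight on the two endpoints, which serves as the reference design.

```python
from typing import Sequence


def information_matrix_det(points: Sequence[float], weights: Sequence[float]) -> float:
    """Determinant of M = sum_i w_i f(x_i) f(x_i)^T for f(x) = (1, x)."""
    m00 = sum(weights)
    m01 = sum(w * x for x, w in zip(points, weights))
    m11 = sum(w * x * x for x, w in zip(points, weights))
    return m00 * m11 - m01 * m01


def d_efficiency(det_design: float, det_reference: float, n_params: int = 2) -> float:
    """D-efficiency: (det M(design) / det M(reference)) ** (1 / p)."""
    return (det_design / det_reference) ** (1.0 / n_params)


# Reference: equal weight on the endpoints (D-optimal for the toy model).
det_optimal = information_matrix_det([0.0, 1.0], [0.5, 0.5])

# Competitor: three equally weighted, equally spaced points.
det_equidistant = information_matrix_det([0.0, 0.5, 1.0], [1 / 3, 1 / 3, 1 / 3])

efficiency = d_efficiency(det_equidistant, det_optimal)
```

An efficiency below 1 means the competing design needs proportionally more observations to reach the precision of the reference design, which is the sense in which the abstract compares designs.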

Wang J, Zhu X, Chen K, Hao L, Liu Y. HAHNet: a convolutional neural network for HER2 status classification of breast cancer. BMC Bioinformatics 2023;24:353. [PMID: 37730567 PMCID: PMC10512620 DOI: 10.1186/s12859-023-05474-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/20/2023] [Accepted: 09/12/2023] [Indexed: 09/22/2023] Open

Angaitkar P, Aljrees T, Kumar Pandey S, Kumar A, Janghel RR, Sahu TP, Singh KU, Singh T. Inferring linear-B cell epitopes using 2-step metaheuristic variant-feature selection using genetic algorithm. Sci Rep 2023;13:14593. [PMID: 37670007 PMCID: PMC10480427 DOI: 10.1038/s41598-023-41179-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/26/2023] [Accepted: 08/23/2023] [Indexed: 09/07/2023] Open

Yao N, Pan J, Chen X, Li P, Li Y, Wang Z, Yao T, Qian L, Yi D, Wu Y. Discovery of potential biomarkers for lung cancer classification based on human proteome microarrays using Stochastic Gradient Boosting approach. J Cancer Res Clin Oncol 2023;149:6803-6812. [PMID: 36807761 DOI: 10.1007/s00432-023-04643-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2022] [Accepted: 02/08/2023] [Indexed: 02/21/2023]

Wang X, Han Y, Wang B. A Two-Phase Feature Selection Method for Identifying Influential Spreaders of Disease Epidemics in Complex Networks. Entropy (Basel) 2023;25:1068. [PMID: 37510015 PMCID: PMC10378310 DOI: 10.3390/e25071068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/23/2023] [Revised: 06/28/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023]

Park J, Lee JW, Park M. Comparison of cancer subtype identification methods combined with feature selection methods in omics data analysis. BioData Min 2023;16:18. [PMID: 37420304 DOI: 10.1186/s13040-023-00334-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/11/2022] [Accepted: 06/30/2023] [Indexed: 07/09/2023] Open

Abstract

BACKGROUND

Cancer subtype identification is important for the early diagnosis of cancer and the provision of adequate treatment. Prior to identifying the subtype of cancer in a patient, feature selection is also crucial for reducing the dimensionality of the data by detecting genes that contain important information about the cancer subtype. Numerous cancer subtyping methods have been developed, and their performance has been compared. However, combinations of feature selection and subtype identification methods have rarely been considered. This study aimed to identify the best combination of variable selection and subtype identification methods in single omics data analysis.

RESULTS

Combinations of six filter-based methods and six unsupervised subtype identification methods were investigated using The Cancer Genome Atlas (TCGA) datasets for four cancers. The number of features selected varied, and several evaluation metrics were used. Although no single combination was found to have a distinctively good performance, Consensus Clustering (CC) and Neighborhood-Based Multi-omics Clustering (NEMO) used with variance-based feature selection had a tendency to show lower p-values, and nonnegative matrix factorization (NMF) stably showed good performance in many cases unless the Dip test was used for feature selection. In terms of accuracy, the combination of NMF and similarity network fusion (SNF) with Monte Carlo Feature Selection (MCFS) and Minimum-Redundancy Maximum Relevance (mRMR) showed good overall performance. NMF always showed among the worst performances without feature selection in all datasets, but performed much better when used with various feature selection methods. iClusterBayes (ICB) had decent performance when used without feature selection.

CONCLUSIONS

Rather than a single method clearly emerging as optimal, the best methodology was different depending on the data used, the number of features selected, and the evaluation method. A guideline for choosing the best combination method under various situations is provided.
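
Variance-based feature selection, one of the filter methods named in the Results above, can be sketched in a few lines. The function name and the toy matrix are hypothetical, and the sketch ranks by population variance only; it is not the pipeline used in the study.

```python
from statistics import pvariance
from typing import List, Sequence


def top_variance_features(rows: Sequence[Sequence[float]], k: int) -> List[int]:
    """Return the column indices of the k features with the largest variance."""
    columns = list(zip(*rows))  # transpose: one tuple per feature
    variances = [pvariance(column) for column in columns]
    ranked = sorted(range(len(columns)), key=lambda j: variances[j], reverse=True)
    return sorted(ranked[:k])


# Toy expression matrix: 3 samples (rows) by 3 genes (columns).
expression = [
    [1.0, 5.0, 0.0],
    [1.1, 9.0, 0.0],
    [0.9, 1.0, 0.0],
]
selected = top_variance_features(expression, k=2)
```

The reduced matrix would then be handed to a subtype identification method; as the abstract notes, which combination works best depends on the data and on the number of features kept.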

Fu Q, Li Q, Li X. An improved multi-objective marine predator algorithm for gene selection in classification of cancer microarray data. Comput Biol Med 2023;160:107020. [PMID: 37196457 DOI: 10.1016/j.compbiomed.2023.107020] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 01/15/2023] [Revised: 04/09/2023] [Accepted: 05/05/2023] [Indexed: 05/19/2023]

Gurmani SH, Zhang Z, Zulqarnain RM, Askar S. An interaction and feedback mechanism-based group decision-making for emergency medical supplies supplier selection using T-spherical fuzzy information. Sci Rep 2023;13:8726. [PMID: 37253823 DOI: 10.1038/s41598-023-35909-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2023] [Accepted: 05/25/2023] [Indexed: 06/01/2023] Open

Semi-supervised segmentation of coronary DSA using mixed networks and multi-strategies. Comput Biol Med 2023;156:106493. [PMID: 36893708 DOI: 10.1016/j.compbiomed.2022.106493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/19/2022] [Revised: 12/11/2022] [Accepted: 12/27/2022] [Indexed: 12/31/2022]

Abstract

The coronary arteries, which originate from the root of the aorta and branch mainly into the left and right, supply blood to the myocardium. X-ray digital subtraction angiography (DSA) is a technique for evaluating coronary artery plaques and narrowing that is widely used because of its time efficiency and cost-effectiveness. However, automated coronary vessel classification and segmentation remains challenging with little data. Therefore, the purpose of this study is twofold: one is to propose a more robust method for vessel segmentation, the other is to provide a solution that is feasible with a small amount of labeled data. Currently, there are three main types of vessel segmentation methods, i.e., graphical- and statistical-based, clustering theory-based, and deep learning-based methods for pixel-by-pixel probabilistic prediction, among which the last is the mainstream with high accuracy and automation. Under this trend, an Inception-SwinUnet (ISUnet) network combining the convolutional neural network and Transformer basic module was proposed in this paper. Considering that data-driven fully supervised learning (FSL) segmentation methods require a large set of paired data with high-quality pixel-level annotation, which is expertise-demanding and time-consuming, we proposed a Semi-supervised Learning (SSL) method to achieve better performance with a small amount of labeled and unlabeled data. Different from the classical SSL method, i.e., Mean-Teacher, our method used two different networks for cross-teaching as the backbone. Meanwhile, inspired by deep supervision and confidence learning (CL), two effective strategies for SSL were adopted, which were denominated Pyramid-consistency Learning (PL) and Confidence Learning (CL), respectively. Both were designed to filter the noise and improve the credibility of pseudo labels generated from unlabeled data. Compared with existing FSL and SSL methods trained with the same small number of labels, ours achieved superior segmentation performance. Code is available at https://github.com/Allenem/SSL4DSA.

Deng S, Wang L, Guan S, Li M, Wang L. Non-parametric Nearest Neighbor Classification Based on Global Variance Difference. Int J Comput Intell Syst 2023. [DOI: 10.1007/s44196-023-00200-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/06/2023] Open

Liu X, Teng L, Zuo W, Zhong S, Xu Y, Sun J. Deafness gene screening based on a multilevel cascaded BPNN model. BMC Bioinformatics 2023;24:56. [PMID: 36803022 PMCID: PMC9942297 DOI: 10.1186/s12859-023-05182-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 12/01/2022] [Accepted: 02/11/2023] [Indexed: 02/22/2023] Open

Kang IA, Njimbouom SN, Kim JD. Optimal Feature Selection-Based Dental Caries Prediction Model Using Machine Learning for Decision Support System. Bioengineering (Basel) 2023;10:245. [PMID: 36829739 PMCID: PMC9952690 DOI: 10.3390/bioengineering10020245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/10/2023] [Revised: 02/07/2023] [Accepted: 02/08/2023] [Indexed: 02/16/2023] Open

Alharbi F, Vakanski A. Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review. Bioengineering (Basel) 2023;10:173. [PMID: 36829667 PMCID: PMC9952758 DOI: 10.3390/bioengineering10020173] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Received: 12/22/2022] [Revised: 01/24/2023] [Accepted: 01/26/2023] [Indexed: 01/31/2023] Open

MultiScale-CNN-4mCPred: a multi-scale CNN and adaptive embedding-based method for mouse genome DNA N4-methylcytosine prediction. BMC Bioinformatics 2023;24:21. [PMID: 36653789 PMCID: PMC9847203 DOI: 10.1186/s12859-023-05135-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 09/15/2022] [Accepted: 01/04/2023] [Indexed: 01/19/2023] Open

Liu C, Wu S, Lai L, Liu J, Guo Z, Ye Z, Chen X. Comprehensive analysis of cuproptosis-related lncRNAs in immune infiltration and prognosis in hepatocellular carcinoma. BMC Bioinformatics 2023;24:4. [PMID: 36597032 PMCID: PMC9811804 DOI: 10.1186/s12859-022-05091-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 08/31/2022] [Accepted: 12/01/2022] [Indexed: 01/05/2023] Open

Abstract

BACKGROUND

Hepatocellular carcinoma (HCC) is among the most common malignancies worldwide and is the third leading cause of cancer mortality. The regulation of cell death is the most crucial step in tumor progression and has become a crucial target for nearly all therapeutic options. Cuproptosis, a copper-induced cell death, was recently reported in Science. However, its primary function in carcinogenesis is still unclear.

METHODS

Cuproptosis-related lncRNAs significantly associated with overall survival (OS) were screened by stepwise univariate Cox regression. The signature of cuproptosis-related lncRNAs for HCC prognosis was constructed by the LASSO algorithm and multivariate Cox regression. Further Kaplan-Meier analysis, proportional hazards model, and ROC analysis were performed. Functional annotation was performed using gene set enrichment analysis (GSEA). The relationship between prognostic cuproptosis-related lncRNAs and HCC prognosis was further explored using the GEPIA (http://gepia.cancer-pku.cn/) online analysis tool. Finally, we used the ESTIMATE and XCELL algorithms to estimate stromal and immune cells in tumor tissue and cast each sample to infer the underlying mechanism of cuproptosis-related lncRNAs in the tumor immune microenvironment (TIME) of HCC patients.

RESULTS

Four cuproptosis-related lncRNAs were used to construct a prognostic lncRNA signature, which was an independent factor in predicting OS in HCC patients. Kaplan-Meier curves showed significant differences in survival rates between risk subgroups (p = 0.002). At the same time, we found that the expression levels of most immune checkpoint genes increased with increasing risk scores. Tumorigenesis and immunological-related pathways were primarily enhanced in the high-risk group, as determined by GSEA. The results of drug sensitivity analysis showed that compared with patients in the high-risk group, the IC50 values of erlotinib and lapatinib were lower in patients in the low-risk group, while the opposite was true for sunitinib, paclitaxel, gemcitabine, and imatinib. We also found that elevated AL133243.2 expression was significantly associated with worse OS and disease-free survival (DFS), more advanced T stage and higher tumor grade, and reduced immune cell infiltration, suggesting that HCC patients with low AL133243.2 expression in tumor tissues may have a better response to immunotherapy.

CONCLUSION

Collectively, the cuproptosis-associated lncRNA signature can serve as an independent predictor to guide individual treatment strategies. Furthermore, AL133243.2 is a promising marker for predicting immunotherapy response in HCC patients. These data may facilitate further exploration of more effective immunotherapy strategies for HCC.
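
The Kaplan-Meier analysis named in the Methods of this abstract compares survival between risk groups. A minimal sketch of the product-limit estimator for a single group follows; the function name and the toy follow-up data are hypothetical, and the log-rank comparison between groups is not shown.

```python
from typing import List, Sequence, Tuple


def kaplan_meier(times: Sequence[float], events: Sequence[int]) -> List[Tuple[float, float]]:
    """Return (time, survival probability) at each distinct event time.

    events[i] is 1 if the event (death) was observed at times[i],
    and 0 if the subject was censored at that time.
    """
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(set(times)):
        deaths = sum(1 for ti, ei in zip(times, events) if ti == t and ei)
        leaving = sum(1 for ti in times if ti == t)  # deaths plus censored at t
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Toy follow-up data: five patients, two of them censored.
curve = kaplan_meier(times=[1, 2, 2, 3, 4], events=[1, 1, 0, 1, 0])
```

Subjects censored at a given time are still counted as at risk at that time, which is the usual convention for this estimator.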

Sheikhpour R. A local spline regression-based framework for semi-supervised sparse feature selection. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/09/2023]

Xiang J, Wang X, Wang X, Zhang J, Yang S, Yang W, Han X, Liu Y. Automatic diagnosis and grading of Prostate Cancer with weakly supervised learning on whole slide images. Comput Biol Med 2023;152:106340. [PMID: 36481762 DOI: 10.1016/j.compbiomed.2022.106340] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Received: 07/02/2022] [Revised: 11/02/2022] [Accepted: 11/16/2022] [Indexed: 11/23/2022]

Abstract

BACKGROUND

The workflow of prostate cancer diagnosis and grading is cumbersome and the results suffer from substantial inter-observer variability. Recent trials have shown potential in using machine learning to develop automated systems to address this challenge. Most automated deep learning systems for prostate cancer Gleason grading focused on supervised learning requiring demanding fine-grained pixel-level annotations.

METHODS

A weakly-supervised deep learning model with slide-level labels is presented in this study for the diagnosis and grading of prostate cancer with whole slide image (WSI). WSIs are first cropped into small patches and then processed with a deep learning model to extract patch-level features. A graph convolution network (GCN) is used to aggregate the features for classifications. Throughout the training process, the noisy labels are progressively filtered out to reduce inter-observer variations in clinical reports. Finally, multi-center independent test cohorts with 6,174 slides are collected to evaluate the prostate cancer diagnosis and grading performance of our model.

RESULTS

The cancer diagnosis (2-level classification) results on two external test sets (n = 4,675 and n = 844) show an area under the receiver operating characteristic curve (AUC) of 0.985 and 0.986, respectively. The Gleason grading (6-level classification) results reach 0.931 quadratic weighted kappa on the internal test set (n = 531). The model generalizes well on the external test dataset (n = 844), with 0.801 quadratic weighted kappa against the reference standard set independently. The model enables pathologically meaningful interpretability by visualizing the most attended lesions, which are highly consistent with expert annotations.

CONCLUSION

The proposed model incorporates a graph network in weakly supervised learning with only slide-level reports. A robust learning strategy is also employed to correct the label noise. It is highly accurate (>0.985 AUC for diagnosis) and also interpretable with intuitive heatmap visualization. It can be unified with a digital pathology pipeline to deliver prostate cancer metrics for a pathology report.
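
The Gleason grading results in this abstract are reported as quadratic weighted kappa. A minimal sketch of that agreement statistic is given below, assuming grades are encoded as integers from 0 to n_classes - 1 and that the two sets of grades are not both constant (otherwise the denominator is zero); the function name is hypothetical.

```python
from typing import Sequence


def quadratic_weighted_kappa(rater_a: Sequence[int], rater_b: Sequence[int], n_classes: int) -> float:
    """Cohen's kappa with quadratic disagreement weights (i - j)^2 / (k - 1)^2."""
    n = len(rater_a)
    # Observed confusion matrix between the two sets of grades.
    observed = [[0] * n_classes for _ in range(n_classes)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1
    hist_a = [sum(row) for row in observed]
    hist_b = [sum(observed[i][j] for i in range(n_classes)) for j in range(n_classes)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(n_classes):
        for j in range(n_classes):
            weight = (i - j) ** 2 / (n_classes - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count under independence of the two raters.
            weighted_expected += weight * hist_a[i] * hist_b[j] / n
    return 1.0 - weighted_observed / weighted_expected
```

A value of 1 indicates perfect agreement and 0 indicates agreement no better than chance; because the weights grow with the squared distance between grades, large grading errors are penalized more than adjacent-grade errors.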

Buckler AJ, Marlevi D, Skenteris NT, Lengquist M, Kronqvist M, Matic L, Hedin U. In silico model of atherosclerosis with individual patient calibration to enable precision medicine for cardiovascular disease. Comput Biol Med 2023;152:106364. [PMID: 36525832 DOI: 10.1016/j.compbiomed.2022.106364] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 09/28/2022] [Revised: 11/01/2022] [Accepted: 11/25/2022] [Indexed: 12/03/2022]

Abstract

OBJECTIVE

Guidance for preventing myocardial infarction and ischemic stroke by tailoring treatment for individual patients with atherosclerosis is an unmet need. Such development may be possible with computational modeling. Given the multifactorial biology of atherosclerosis, modeling must be based on complete biological networks that capture protein-protein interactions estimated to drive disease progression. Here, we aimed to develop a clinically relevant scale model of atherosclerosis, calibrate it with individual patient data, and use it to simulate optimized pharmacotherapy for individual patients.

APPROACH AND RESULTS

The study used a uniquely constituted plaque proteomic dataset to create a comprehensive systems biology disease model for simulating individualized responses to pharmacotherapy. Plaque tissue was collected from 18 patients with 6735 proteins at two locations per patient. 113 pathways were identified and included in the systems biology model of endothelial cells, vascular smooth muscle cells, macrophages, lymphocytes, and the integrated intima, altogether spanning 4411 proteins, demonstrating a range of 39-96% plaque instability. After calibrating the systems biology models for individual patients, we simulated intensive lipid-lowering, anti-inflammatory, and anti-diabetic drugs. We also simulated a combination therapy. Drug response was evaluated as the degree of change in plaque stability, where an improvement was defined as a reduction of plaque instability. In patients with initially unstable lesions, simulated responses varied from high (20%, on combination therapy) to marginal improvement, whereas patients with initially stable plaques showed generally less improvement.

CONCLUSION

In this pilot study, proteomics-based systems biology modeling was shown to simulate drug response based on atherosclerotic plaque instability with a power of 90%, providing a potential strategy for improved personalized management of patients with cardiovascular disease.

Liang JX, Chen Q, Gao W, Chen D, Qian XY, Bi JQ, Lin XC, Han BB, Liu JS. A novel glycosylation-related gene signature predicts survival in patients with lung adenocarcinoma. BMC Bioinformatics 2022;23:562. [PMID: 36575396 PMCID: PMC9793550 DOI: 10.1186/s12859-022-05109-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2022] [Accepted: 12/12/2022] [Indexed: 12/28/2022] Open

Abstract

BACKGROUND

Lung adenocarcinoma (LUAD) is the most common malignant tumor that seriously affects human health. Previous studies have indicated that abnormal levels of glycosylation promote progression and poor prognosis of lung cancer. Thus, the present study aimed to explore the prognostic signature related to glycosyltransferases (GTs) for LUAD.

METHODS

The gene expression profiles were obtained from The Cancer Genome Atlas (TCGA) database, and GTs were obtained from the GlycomeDB database. Differentially expressed GTs-related genes (DGTs) were identified using edge package and Venn diagram. Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and ingenuity pathway analysis (IPA) methods were used to investigate the biological processes of DGTs. Subsequently, Cox and Least Absolute Shrinkage and Selection Operator (LASSO) regression analyses were performed to construct a prognostic model for LUAD. Kaplan-Meier (K-M) analysis was adopted to explore the overall survival (OS) of LUAD patients. The accuracy and specificity of the prognostic model were evaluated by receiver operating characteristic analysis (ROC). In addition, single-sample gene set enrichment analysis (ssGSEA) algorithm was used to analyze the infiltrating immune cells in the tumor environment.

RESULTS

A total of 48 DGTs were mainly enriched in the processes of glycosylation, glycoprotein biosynthetic process, glycosphingolipid biosynthesis-lacto and neolacto series, and cell-mediated immune response. Furthermore, B3GNT3, MFNG, GYLTL1B, ALG3, and GALNT13 were screened as prognostic genes to construct a risk model for LUAD, and the LUAD patients were divided into high- and low-risk groups. The K-M curve suggested that patients with a high-risk score had shorter OS than those with a low-risk score. The ROC analysis demonstrated that the risk model efficiently diagnoses LUAD. Additionally, the proportion of infiltrating aDCs (p < 0.05) and Tgds (p < 0.01) was higher in the high-risk group than in the low-risk group. Spearman's correlation analysis showed that the prognostic genes (MFNG and ALG3) were significantly correlated with infiltrating immune cells.

CONCLUSION

In summary, this study established a novel GTs-related risk model for the prognosis of LUAD patients, providing new therapeutic targets for LUAD. However, the biological role of glycosylation-related genes in LUAD needs to be explored further.

Affiliation(s)

Jin-Xiao Liang: Department of Oncological Surgery, Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), No. 1 of Banshan East Road, Hangzhou, 310022, Zhejiang Province, Republic of China; Institute of Cancer and Basic Medicine (IBMC), Chinese Academy of Sciences, Hangzhou, People's Republic of China
Qian Chen: Department of Oncological Surgery, Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), No. 1 of Banshan East Road, Hangzhou, 310022, Zhejiang Province, Republic of China; Institute of Cancer and Basic Medicine (IBMC), Chinese Academy of Sciences, Hangzhou, People's Republic of China
Wei Gao: School of Medicine, Zhejiang University City College, Hangzhou, People's Republic of China
Da Chen: Department of Oncological Surgery, Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), No. 1 of Banshan East Road, Hangzhou, 310022, Zhejiang Province, Republic of China; Institute of Cancer and Basic Medicine (IBMC), Chinese Academy of Sciences, Hangzhou, People's Republic of China
Xin-Yu Qian: School of Medicine, Zhejiang University City College, Hangzhou, People's Republic of China
Jin-Qiao Bi: School of Medicine, Zhejiang University City College, Hangzhou, People's Republic of China
Xing-Chen Lin: School of Medicine, Zhejiang University City College, Hangzhou, People's Republic of China
Bing-Bing Han: School of Medicine, Zhejiang University City College, Hangzhou, People's Republic of China
Jin-Shi Liu: Department of Oncological Surgery, Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), No. 1 of Banshan East Road, Hangzhou, 310022, Zhejiang Province, Republic of China; Institute of Cancer and Basic Medicine (IBMC), Chinese Academy of Sciences, Hangzhou, People's Republic of China

Zhu J, Jiang Z, Feng L. Improved neural network with least square support vector machine for wastewater treatment process. CHEMOSPHERE 2022;308:136116. [PMID: 36037940 DOI: 10.1016/j.chemosphere.2022.136116] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 07/22/2022] [Accepted: 08/16/2022] [Indexed: 06/15/2023]

Wang C. Efficient customer segmentation in digital marketing using deep learning with swarm intelligence approach. Inf Process Manag 2022. [DOI: 10.1016/j.ipm.2022.103085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Dual Regularized Unsupervised Feature Selection Based on Matrix Factorization and Minimum Redundancy with application in gene selection. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Alsaleem MN, Islam MS, Al-Ahmadi S, Soudani A. Multiscale Encoding of Electrocardiogram Signals with a Residual Network for the Detection of Atrial Fibrillation. BIOENGINEERING (BASEL, SWITZERLAND) 2022;9:bioengineering9090480. [PMID: 36135025 PMCID: PMC9495512 DOI: 10.3390/bioengineering9090480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 09/09/2022] [Accepted: 09/14/2022] [Indexed: 11/16/2022]