1
|
Zhou X, Liang B, Lin W, Zha L. Identification of MACC1 as a potential biomarker for pulmonary arterial hypertension based on bioinformatics and machine learning. Comput Biol Med 2024; 173:108372. [PMID: 38552277 DOI: 10.1016/j.compbiomed.2024.108372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 03/13/2024] [Accepted: 03/24/2024] [Indexed: 04/17/2024]
Abstract
BACKGROUND Pulmonary arterial hypertension (PAH) is a life-threatening disease characterized by abnormal early activation of pulmonary arterial smooth muscle cells (PASMCs), yet the underlying mechanisms remain to be elucidated. METHODS Normal and PAH gene expression profiles were obtained from the Gene Expression Omnibus (GEO) database and analyzed using gene set enrichment analysis (GSEA) to uncover the underlying mechanisms. Weighted gene co-expression network analysis (WGCNA) and machine learning methods were deployed to further filter hub genes. A number of immune infiltration analysis methods were applied to explore the immune landscape of PAH. Enzyme-linked immunosorbent assay (ELISA) was employed to compare MACC1 levels between PAH and normal subjects. The important role of MACC1 in the progression of PAH was verified through Western blot and real-time qPCR, among others. RESULTS 39 up-regulated and 7 down-regulated genes were identified by 'limma' and 'RRA' packages. WGCNA and machine learning further narrowed down the list to 4 hub genes, with MACC1 showing strong diagnostic capacity. In vivo and in vitro experiments revealed that MACC1 was highsly associated with malignant features of PASMCs in PAH. CONCLUSIONS These findings suggest that targeting MACC1 may offer a promising therapeutic strategy for treating PAH, and further clinical studies are warranted to evaluate its efficacy.
Collapse
Affiliation(s)
- Xinyi Zhou
- Department of Cardiology, Xiangya Hospital, Central South University, Changsha, Hunan, China
| | - Benhui Liang
- Department of Cardiology, Xiangya Hospital, Central South University, Changsha, Hunan, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, 410008, China
| | - Wenchao Lin
- Department of Nephrology, Xiangya Hospital, Central South University, Changsha, Hunan, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, 410008, China
| | - Lihuang Zha
- Department of Cardiology, Xiangya Hospital, Central South University, Changsha, Hunan, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, 410008, China.
| |
Collapse
|
2
|
Zhang W, Mou M, Hu W, Lu M, Zhang H, Zhang H, Luo Y, Xu H, Tao L, Dai H, Gao J, Zhu F. MOINER: A Novel Multiomics Early Integration Framework for Biomedical Classification and Biomarker Discovery. J Chem Inf Model 2024; 64:2720-2732. [PMID: 38373720 DOI: 10.1021/acs.jcim.4c00013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
In the context of precision medicine, multiomics data integration provides a comprehensive understanding of underlying biological processes and is critical for disease diagnosis and biomarker discovery. One commonly used integration method is early integration through concatenation of multiple dimensionally reduced omics matrices due to its simplicity and ease of implementation. However, this approach is seriously limited by information loss and lack of latent feature interaction. Herein, a novel multiomics early integration framework (MOINER) based on information enhancement and image representation learning is thus presented to address the challenges. MOINER employs the self-attention mechanism to capture the intrinsic correlations of omics-features, which make it significantly outperform the existing state-of-the-art methods for multiomics data integration. Moreover, visualizing the attention embedding and identifying potential biomarkers offer interpretable insights into the prediction results. All source codes and model for MOINER are freely available https://github.com/idrblab/MOINER.
Collapse
Affiliation(s)
- Wei Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Wei Hu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Mingkun Lu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Hanyu Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Hongning Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yongchao Luo
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Hongquan Xu
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Lin Tao
- Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Haibin Dai
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Jianqing Gao
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| |
Collapse
|
3
|
Ma Y, Zhang B, Liu Z, Liu Y, Wang J, Li X, Feng F, Ni Y, Li S. IAS-FET: An intelligent assistant system and an online platform for enhancing successful rate of in-vitro fertilization embryo transfer technology based on clinical features. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 245:108050. [PMID: 38301430 DOI: 10.1016/j.cmpb.2024.108050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/20/2024] [Accepted: 01/23/2024] [Indexed: 02/03/2024]
Abstract
BACKGROUND Among all of the assisted reproductive technology (ART) methods, in vitro fertilization-embryo transfer (IVF-ET) holds a prominent position as a key solution for overcoming infertility. However, its success rate hovers at a modest 30% to 70%. Adding to the challenge is the absence of effective models and clinical tools capable of predicting the outcome of IVF-ET before embryo formation. Our study is dedicated to filling this critical gap by aiming to predict IVF-ET outcomes and ultimately enhance the success rate of this transformative procedure. METHODS In this retrospective study, infertile patients who received artificial assisted pregnancy treatment at Gansu Provincial Maternity and Child-care Hospital in China were enrolled from 2016 to 2020. Individual's clinical information were studied by cascade XGBoost method to build an intelligent assisted system for predicting the outcome of IVF-ET, called IAS-FET. The cascade XGBoost model was trained using clinical information from 2292 couples and externally tested using clinical information from 573 couples. In addition, several schemes which will be of help for patients to adjust their physical condition to improve their success rate on ART were suggested by IAS-FET. RESULTS The outcome of IVF-ET can be predicted by the built IAS-FET method with the area under curve (AUC) value of 0.8759 on the external test set. Besides, this IAS-FET method can provide several schemes to improve the successful rate of IVF-ET outcomes. The built tool for IAS-FET is addressed as a free platform online at http://www.cppdd.cn/ART for the convenient usage of users. CONCLUSIONS It suggested the significant influence of personal clinical features for the success of ART. The proposed system IAS-FET based on the top 27 factors could be a promising tool to predict the outcome of ART and propose a plan for the patient's physical adjustment. With the help of IAS-FET, patients can take informed steps towards increasing their chances of a successful outcome on their journey to parenthood.
Collapse
Affiliation(s)
- Ying Ma
- Gansu Provincial Maternity and Child-care Hospital, Lanzhou, Gansu 730030, China
| | - Bowen Zhang
- School of Medical Information and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu 221004, China; School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, Hubei 430073, China
| | - Zhaoqing Liu
- School of Medical Information and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu 221004, China
| | - Yujie Liu
- School of Medical Information and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu 221004, China
| | - Jiarui Wang
- School of Medical Information and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu 221004, China
| | - Xingxuan Li
- School of Chemistry and Chemical Engineering, Lanzhou University, Lanzhou, Gansu 730030, China
| | - Fan Feng
- Gansu Provincial Maternity and Child-care Hospital, Lanzhou, Gansu 730030, China
| | - Yali Ni
- Gansu Provincial Maternity and Child-care Hospital, Lanzhou, Gansu 730030, China
| | - Shuyan Li
- School of Medical Information and Engineering, Xuzhou Medical University, Xuzhou, Jiangsu 221004, China.
| |
Collapse
|
4
|
Chen Y, Lu P, Wu S, Yang J, Liu W, Zhang Z, Xu Q. CD163-Mediated Small-Vessel Injury in Alzheimer's Disease: An Exploration from Neuroimaging to Transcriptomics. Int J Mol Sci 2024; 25:2293. [PMID: 38396970 PMCID: PMC10888773 DOI: 10.3390/ijms25042293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 02/10/2024] [Accepted: 02/11/2024] [Indexed: 02/25/2024] Open
Abstract
Patients with Alzheimer's disease (AD) often present with imaging features indicative of small-vessel injury, among which, white-matter hyperintensities (WMHs) are the most prevalent. However, the underlying mechanism of the association between AD and small-vessel injury is still obscure. The aim of this study is to investigate the mechanism of small-vessel injury in AD. Differential gene expression analyses were conducted to identify the genes related to WMHs separately in mild cognitive impairment (MCI) and cognitively normal (CN) subjects from the ADNI database. The WMH-related genes identified in patients with MCI were considered to be associated with small-vessel injury in early AD. Functional enrichment analyses and a protein-protein interaction (PPI) network were performed to explore the pathway and hub genes related to the mechanism of small-vessel injury in MCI. Subsequently, the Boruta algorithm and support vector machine recursive feature elimination (SVM-RFE) algorithm were performed to identify feature-selection genes. Finally, the mechanism of small-vessel injury was analyzed in MCI from the immunological perspectives; the relationship of feature-selection genes with various immune cells and neuroimaging indices were also explored. Furthermore, 5×FAD mice were used to demonstrate the genes related to small-vessel injury. The results of the logistic regression analyses suggested that WMHs significantly contributed to MCI, the early stage of AD. A total of 276 genes were determined as WMH-related genes in patients with MCI, while 203 WMH-related genes were obtained in CN patients. Among them, only 15 genes overlapped and were thus identified as the crosstalk genes. By employing the Boruta and SVM-RFE algorithms, CD163, ALDH3B1, MIR22HG, DTX2, FOLR2, ALDH2, and ZNF23 were recognized as the feature-selection genes linked to small-vessel injury in MCI. After considering the results from the PPI network, CD163 was finally determined as the critical WMH-related gene in MCI. The expression of CD163 was correlated with fractional anisotropy (FA) values in regions that are vulnerable to small-vessel injury in AD. The immunostaining and RT-qPCR results from the verifying experiments demonstrated that the indicators of small-vessel injury presented in the cortical tissue of 5×FAD mice and related to the upregulation of CD163 expression. CD163 may be the most pivotal candidates related to small-vessel injury in early AD.
Collapse
Affiliation(s)
- Yuewei Chen
- Health Management Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.C.); (P.L.); (W.L.)
- Department of Neurology, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China
- Renji-UNSW CHeBA (Centre for Healthy Brain Ageing of University of New South Wales) Neurocognitive Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Peiwen Lu
- Health Management Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.C.); (P.L.); (W.L.)
- Renji-UNSW CHeBA (Centre for Healthy Brain Ageing of University of New South Wales) Neurocognitive Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Shengju Wu
- School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Jie Yang
- Health Management Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.C.); (P.L.); (W.L.)
- Department of Neurology, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China
- Renji-UNSW CHeBA (Centre for Healthy Brain Ageing of University of New South Wales) Neurocognitive Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Wanwan Liu
- Health Management Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.C.); (P.L.); (W.L.)
| | - Zhijun Zhang
- School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Qun Xu
- Health Management Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.C.); (P.L.); (W.L.)
- Department of Neurology, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China
- Renji-UNSW CHeBA (Centre for Healthy Brain Ageing of University of New South Wales) Neurocognitive Center, Renji Hospital of Medical School, Shanghai Jiao Tong University, Shanghai 200127, China
| |
Collapse
|
5
|
Wang J, Zhang L, Sun J, Yang X, Wu W, Chen W, Zhao Q. Predicting drug-induced liver injury using graph attention mechanism and molecular fingerprints. Methods 2024; 221:18-26. [PMID: 38040204 DOI: 10.1016/j.ymeth.2023.11.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 11/14/2023] [Accepted: 11/25/2023] [Indexed: 12/03/2023] Open
Abstract
Drug-induced liver injury (DILI) is a significant issue in drug development and clinical treatment due to its potential to cause liver dysfunction or damage, which, in severe cases, can lead to liver failure or even fatality. DILI has numerous pathogenic factors, many of which remain incompletely understood. Consequently, it is imperative to devise methodologies and tools for anticipatory assessment of DILI risk in the initial phases of drug development. In this study, we present DMFPGA, a novel deep learning predictive model designed to predict DILI. To provide a comprehensive description of molecular properties, we employ a multi-head graph attention mechanism to extract features from the molecular graphs, representing characteristics at the level of compound nodes. Additionally, we combine multiple fingerprints of molecules to capture features at the molecular level of compounds. The fusion of molecular fingerprints and graph features can more fully express the properties of compounds. Subsequently, we employ a fully connected neural network to classify compounds as either DILI-positive or DILI-negative. To rigorously evaluate DMFPGA's performance, we conduct a 5-fold cross-validation experiment. The obtained results demonstrate the superiority of our method over four existing state-of-the-art computational approaches, exhibiting an average AUC of 0.935 and an average ACC of 0.934. We believe that DMFPGA is helpful for early-stage DILI prediction and assessment in drug development.
Collapse
Affiliation(s)
- Jifeng Wang
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China
| | - Li Zhang
- School of Life Science, Liaoning University, Shenyang 110036, China
| | - Jianqiang Sun
- School of Information Science and Engineering, Linyi University, Linyi 276000, China
| | - Xin Yang
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China
| | - Wei Wu
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China
| | - Wei Chen
- Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China.
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China.
| |
Collapse
|
6
|
Li X, Qin X, Huang C, Lu Y, Cheng J, Wang L, Liu O, Shuai J, Yuan CA. SUnet: A multi-organ segmentation network based on multiple attention. Comput Biol Med 2023; 167:107596. [PMID: 37890423 DOI: 10.1016/j.compbiomed.2023.107596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 09/13/2023] [Accepted: 10/17/2023] [Indexed: 10/29/2023]
Abstract
Organ segmentation in abdominal or thoracic computed tomography (CT) images plays a crucial role in medical diagnosis as it enables doctors to locate and evaluate organ abnormalities quickly, thereby guiding surgical planning, and aiding treatment decision-making. This paper proposes a novel and efficient medical image segmentation method called SUnet for multi-organ segmentation in the abdomen and thorax. SUnet is a fully attention-based neural network. Firstly, an efficient spatial reduction attention (ESRA) module is introduced not only to extract image features better, but also to reduce overall model parameters, and to alleviate overfitting. Secondly, SUnet's multiple attention-based feature fusion module enables effective cross-scale feature integration. Additionally, an enhanced attention gate (EAG) module is considered by using grouped convolution and residual connections, providing richer semantic features. We evaluate the performance of the proposed model on synapse multiple organ segmentation dataset and automated cardiac diagnostic challenge dataset. SUnet achieves an average Dice of 84.29% and 92.25% on these two datasets, respectively, outperforming other models of similar complexity and size, and achieving state-of-the-art results.
Collapse
Affiliation(s)
- Xiaosen Li
- School of Artificial Intelligence, Guangxi Minzu University, Nanning, 530006, China; Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325105, China
| | - Xiao Qin
- Guangxi Key Lab of Human-machine Interaction and Intelligent Decision, Nanning Normal University, Nanning, 530023, China
| | - Chengliang Huang
- Academy of Artificial Intelligence, Zhejiang Dongfang Polytechnic, Wenzhou, 325025, China
| | - Yuer Lu
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325105, China
| | - Jinyan Cheng
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325105, China
| | - Liansheng Wang
- Department of Computer Science, Xiamen University, Xiamen, 361005, China
| | - Ou Liu
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325105, China
| | - Jianwei Shuai
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325105, China.
| | - Chang-An Yuan
- Guangxi Key Lab of Human-machine Interaction and Intelligent Decision, Nanning Normal University, Nanning, 530023, China; Guangxi Academy of Science, Nanning, 530007, China.
| |
Collapse
|
7
|
Zhao Q, Chen Y, Huang W, Zhou H, Zhang W. Drug-microbiota interactions: an emerging priority for precision medicine. Signal Transduct Target Ther 2023; 8:386. [PMID: 37806986 PMCID: PMC10560686 DOI: 10.1038/s41392-023-01619-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 07/20/2023] [Accepted: 08/24/2023] [Indexed: 10/10/2023] Open
Abstract
Individual variability in drug response (IVDR) can be a major cause of adverse drug reactions (ADRs) and prolonged therapy, resulting in a substantial health and economic burden. Despite extensive research in pharmacogenomics regarding the impact of individual genetic background on pharmacokinetics (PK) and pharmacodynamics (PD), genetic diversity explains only a limited proportion of IVDR. The role of gut microbiota, also known as the second genome, and its metabolites in modulating therapeutic outcomes in human diseases have been highlighted by recent studies. Consequently, the burgeoning field of pharmacomicrobiomics aims to explore the correlation between microbiota variation and IVDR or ADRs. This review presents an up-to-date overview of the intricate interactions between gut microbiota and classical therapeutic agents for human systemic diseases, including cancer, cardiovascular diseases (CVDs), endocrine diseases, and others. We summarise how microbiota, directly and indirectly, modify the absorption, distribution, metabolism, and excretion (ADME) of drugs. Conversely, drugs can also modulate the composition and function of gut microbiota, leading to changes in microbial metabolism and immune response. We also discuss the practical challenges, strategies, and opportunities in this field, emphasizing the critical need to develop an innovative approach to multi-omics, integrate various data types, including human and microbiota genomic data, as well as translate lab data into clinical practice. To sum up, pharmacomicrobiomics represents a promising avenue to address IVDR and improve patient outcomes, and further research in this field is imperative to unlock its full potential for precision medicine.
Collapse
Affiliation(s)
- Qing Zhao
- Department of Clinical Pharmacology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, PR China
- Institute of Clinical Pharmacology, Central South University, Hunan Key Laboratory of Pharmacogenetics, 110 Xiangya Road, Changsha, 410078, PR China
- Engineering Research Center of Applied Technology of Pharmacogenomics, Ministry of Education, 110 Xiangya Road, Changsha, 410078, PR China
- National Clinical Research Center for Geriatric Disorders, 87 Xiangya Road, Changsha, 410008, PR China
| | - Yao Chen
- Department of Clinical Pharmacology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, PR China
- Institute of Clinical Pharmacology, Central South University, Hunan Key Laboratory of Pharmacogenetics, 110 Xiangya Road, Changsha, 410078, PR China
- Engineering Research Center of Applied Technology of Pharmacogenomics, Ministry of Education, 110 Xiangya Road, Changsha, 410078, PR China
- National Clinical Research Center for Geriatric Disorders, 87 Xiangya Road, Changsha, 410008, PR China
| | - Weihua Huang
- Department of Clinical Pharmacology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, PR China
- Institute of Clinical Pharmacology, Central South University, Hunan Key Laboratory of Pharmacogenetics, 110 Xiangya Road, Changsha, 410078, PR China
- Engineering Research Center of Applied Technology of Pharmacogenomics, Ministry of Education, 110 Xiangya Road, Changsha, 410078, PR China
- National Clinical Research Center for Geriatric Disorders, 87 Xiangya Road, Changsha, 410008, PR China
| | - Honghao Zhou
- Department of Clinical Pharmacology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, PR China
- Institute of Clinical Pharmacology, Central South University, Hunan Key Laboratory of Pharmacogenetics, 110 Xiangya Road, Changsha, 410078, PR China
- Engineering Research Center of Applied Technology of Pharmacogenomics, Ministry of Education, 110 Xiangya Road, Changsha, 410078, PR China
- National Clinical Research Center for Geriatric Disorders, 87 Xiangya Road, Changsha, 410008, PR China
| | - Wei Zhang
- Department of Clinical Pharmacology, Xiangya Hospital, Central South University, 87 Xiangya Road, Changsha, 410008, PR China.
- The First Affiliated Hospital of Shantou University Medical College, Shantou, 515041, PR China.
- The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, 510080, PR China.
- Central Laboratory of Hunan Cancer Hospital, Central South University, 283 Tongzipo Road, Changsha, 410013, PR China.
| |
Collapse
|
8
|
Ma M, Pan XF, Pan A, Jiang L. Effects of Sample Dilution on Nuclear Magnetic Resonance-Derived Metabolic Profiles of Human Urine. Anal Chem 2023; 95:13769-13778. [PMID: 37681715 DOI: 10.1021/acs.analchem.3c00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
Traditionally, a relatively big urine volume (e.g., 500 μL) is used in nuclear magnetic resonance (NMR)-based human metabolomics, which is not feasible for studies with limited/precious samples. Although urine may be diluted before conventional high-throughput metabolomics analysis, the comprehensive effect of urine dilution on metabolic profiles is unknown. Here, for the first time, we systematically investigated the effect of urine dilution on 1H NMR metabolic profiles, by evaluating signal detectability, integration, signal-to-noise ratio (SNR), chemical shift (δ) and its variation, and signal overlapping of 47 metabolites in 10 volunteers. We observed significant linear changes along with increased dilution, including decreased integration and SNR, altered δ, decreased intersample variation of δ, and increased separation between overlapped signals, e.g., lactate and threonine, β-d-glucose and an unassigned signal, and histidine and 3-methylhistidine. We further tested the 40% dilution level (i.e., employing 300 μL urine) in an epidemiological study containing 1018 pregnant women from the Tongji-Shuangliu Birth Cohort, showing acceptable detectability and chemical shift variability for most of the 47 metabolites profiled. It indicated that mild (e.g., 40%) dilution of human urine can largely preserve the high-abundance metabolites profiled, reduce intersample chemical shift variations, and increase separations of overlapped signals, which is an improvement of routine sample preparation methods in NMR-based metabolomics and is applicable for studies with limited urine volumes, including large-scale epidemiological studies.
Collapse
Affiliation(s)
- Mengnan Ma
- Department of Epidemiology and Biostatistics, Ministry of Education Key Laboratory of Environment and Health, State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, 13 Hangkong Road, Wuhan 430030, China
| | - Xiong-Fei Pan
- Section of Epidemiology and Population Health, Ministry of Education Key Laboratory of Birth Defects and Related Diseases of Women and Children, West China Second University Hospital & West China Biomedical Big Data Center, West China Hospital, Sichuan University; Shuangliu Institute of Women's and Children's Health, Shuangliu Maternal and Child Health Hospital, Chengdu, Sichuan 610041, China
| | - An Pan
- Department of Epidemiology and Biostatistics, Ministry of Education Key Laboratory of Environment and Health, State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, 13 Hangkong Road, Wuhan 430030, China
| | - Limiao Jiang
- Department of Epidemiology and Biostatistics, Ministry of Education Key Laboratory of Environment and Health, State Key Laboratory of Environmental Health (Incubating), School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, 13 Hangkong Road, Wuhan 430030, China
| |
Collapse
|
9
|
Liang S, Zhao Y, Jin J, Qiao J, Wang D, Wang Y, Wei L. Rm-LR: A long-range-based deep learning model for predicting multiple types of RNA modifications. Comput Biol Med 2023; 164:107238. [PMID: 37515874 DOI: 10.1016/j.compbiomed.2023.107238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 06/16/2023] [Accepted: 07/07/2023] [Indexed: 07/31/2023]
Abstract
Recent research has highlighted the pivotal role of RNA post-transcriptional modifications in the regulation of RNA expression and function. Accurate identification of RNA modification sites is important for understanding RNA function. In this study, we propose a novel RNA modification prediction method, namely Rm-LR, which leverages a long-range-based deep learning approach to accurately predict multiple types of RNA modifications using RNA sequences only. Rm-LR incorporates two large-scale RNA language pre-trained models to capture discriminative sequential information and learn local important features, which are subsequently integrated through a bilinear attention network. Rm-LR supports a total of ten RNA modification types (m6A, m1A, m5C, m5U, m6Am, Ψ, Am, Cm, Gm, and Um) and significantly outperforms the state-of-the-art methods in terms of predictive capability on benchmark datasets. Experimental results show the effectiveness and superiority of Rm-LR in prediction of various RNA modifications, demonstrating the strong adaptability and robustness of our proposed model. We demonstrate that RNA language pretrained models enable to learn dense biological sequential representations from large-scale long-range RNA corpus, and meanwhile enhance the interpretability of the models. This work contributes to the development of accurate and reliable computational models for RNA modification prediction, providing insights into the complex landscape of RNA modifications.
Collapse
Affiliation(s)
- Sirui Liang
- School of Software, Shandong University, Jinan, 250101, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, 250101, China
| | - Yanxi Zhao
- School of Software, Shandong University, Jinan, 250101, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, 250101, China
| | - Junru Jin
- School of Software, Shandong University, Jinan, 250101, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, 250101, China
| | - Jianbo Qiao
- School of Software, Shandong University, Jinan, 250101, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, 250101, China
| | - Ding Wang
- School of Software, Shandong University, Jinan, 250101, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, 250101, China
| | - Yu Wang
- School of Software, Shandong University, Jinan, 250101, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, 250101, China
| | - Leyi Wei
- School of Software, Shandong University, Jinan, 250101, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, 250101, China.
| |
Collapse
|
10
|
Vahabzadeh V, Moattar MH. Robust microarray data feature selection using a correntropy based distance metric learning approach. Comput Biol Med 2023; 161:107056. [PMID: 37235945 DOI: 10.1016/j.compbiomed.2023.107056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 04/18/2023] [Accepted: 05/20/2023] [Indexed: 05/28/2023]
Abstract
Classification of high-dimensional microarray data is a challenge in bioinformatics and genetic data processing. One of the challenging issues of feature selection is the presence of outliers. The Euclidean distance metric is sensitive to outliers. In this study, a distance metric learning based feature selection approach that uses the correntropy function as the discrimination metric is proposed. For this purpose, the metric learning problem is formulated as an optimization problem and solved using the Lagrange method. The output of the approach signifies the most important and robust features. After feature selection, different classification methods such as SVM, decision trees, and NN classifiers are used to investigate the classification accuracy of the proposed method as well as precision, recall, and F-measure. Experiments are carried out on 13 high-dimensional datasets and show that the proposed method outperforms the previous models in terms of accuracy and robustness.
Collapse
Affiliation(s)
- Venus Vahabzadeh
- Department of Software Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran.
| | | |
Collapse
|
11
|
Fajarda O, Almeida JR, Duarte-Pereira S, Silva RM, Oliveira JL. Methodology to identify a gene expression signature by merging microarray datasets. Comput Biol Med 2023; 159:106867. [PMID: 37060770 DOI: 10.1016/j.compbiomed.2023.106867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 03/01/2023] [Accepted: 03/30/2023] [Indexed: 04/17/2023]
Abstract
A vast number of microarray datasets have been produced as a way to identify differentially expressed genes and gene expression signatures. A better understanding of these biological processes can help in the diagnosis and prognosis of diseases, as well as in the therapeutic response to drugs. However, most of the available datasets are composed of a reduced number of samples, leading to low statistical, predictive and generalization power. One way to overcome this problem is by merging several microarray datasets into a single dataset, which is typically a challenging task. Statistical methods or supervised machine learning algorithms are usually used to determine gene expression signatures. Nevertheless, statistical methods require an arbitrary threshold to be defined, and supervised machine learning methods can be ineffective when applied to high-dimensional datasets like microarrays. We propose a methodology to identify gene expression signatures by merging microarray datasets. This methodology uses statistical methods to obtain several sets of differentially expressed genes and uses supervised machine learning algorithms to select the gene expression signature. This methodology was validated using two distinct research applications: one using heart failure and the other using autism spectrum disorder microarray datasets. For the first, we obtained a gene expression signature composed of 117 genes, with a classification accuracy of approximately 98%. For the second use case, we obtained a gene expression signature composed of 79 genes, with a classification accuracy of approximately 82%. This methodology was implemented in R language and is available, under the MIT licence, at https://github.com/bioinformatics-ua/MicroGES.
Collapse
Affiliation(s)
- Olga Fajarda
- DETI/IEETA, LASI, University of Aveiro, Aveiro, Portugal.
| | - João Rafael Almeida
- DETI/IEETA, LASI, University of Aveiro, Aveiro, Portugal; Department of Computation, University of A Coruña, A Coruña, Spain.
| | - Sara Duarte-Pereira
- DETI/IEETA, LASI, University of Aveiro, Aveiro, Portugal; Department of Medical Sciences and iBiMED-Institute of Biomedicine, University of Aveiro, Aveiro, Portugal.
| | - Raquel M Silva
- Universidade Católica Portuguesa, Faculty of Dental Medicine (FMD), Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal.
| | | |
Collapse
|
12
|
Mortezapour M, Tapak L, Bahreini F, Najafi R, Afshar S. Identification of key genes in colorectal cancer diagnosis by co-expression analysis weighted gene co-expression network analysis. Comput Biol Med 2023; 157:106779. [PMID: 36931200 DOI: 10.1016/j.compbiomed.2023.106779] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 01/10/2023] [Accepted: 03/09/2023] [Indexed: 03/17/2023]
Abstract
BACKGROUND The purpose of this study was using bioinformatics tools to identify biomarkers and molecular factors involved in the diagnosis of colorectal cancer, which are effective for the diagnosis and treatment of the disease. METHODS We determined differentially expressed genes (DEGs) related to colorectal cancer (CRC) using the data series retrieved from GEO database. Then the weighted gene co-expression network analysis (WGCNA) was conducted to explore co-expression modules related to CRC diagnosis. Next, the relationship between the integrated modules with clinical features such as the stage of CRC was evaluated. Other downstream analyses were performed on selected module genes. RESULTS In this study, after performing the WGCNA method, a module named blue module which was more significantly associated with the CRC stage was selected for further evaluation. Afterward, the Protein-protein interaction network through sting software for 154 genes of the blue module was constructed and eight hub genes were identified through the evaluation of constructed network with Cytoscape. Among these eight hub genes, upregulation of MMP9, SERPINH1, COL1A2, COL5A2, COL1A1, SPARC, and COL5A1 in CRC was validated in other microarray and TCGA data. Based on the results of the mRNA-miRNA interaction network, SERPINH1 was found as a target gene of miR-940. Finally, results of the DGIDB database indicated that Andecaliximab, Carboxylated glucosamine, Marimastat, Tozuleristide, S-3304, Incyclinide, Curcumin, Prinomastat, Demethylwedelolactone, and Bevacizumab, could be used as a therapeutic agent for targeting the MMP9. Furthermore, Ocriplasmin and Collagenase clostridium histolyticum could target COL1A1, COL1A2, COL5A1, and COL5A2. CONCLUSION Taken together, the results of the current study indicated that seven hub genes including COL1A2, COL5A1, COL5A2, SERPINH1, MMP9, SPARC, and COL1A1 which were upregulated in CRC could be used as a diagnostic and progression biomarker of CRC. On the other hand, miR-940 which targets SERPINH1 could be used as a potential biomarker of CRC. More ever, Andecaliximab, Carboxylated glucosamine, Marimastat, Tozuleristide, S-3304, Incyclinide, Curcumin, Prinomastat, Demethylwedelolactone, Bevacizumab, Ocriplasmin , and Collagenase clostridium histolyticum were introduced as therapeutic agents for CRC which their therapeutic potential should be evaluated experimentally.
Collapse
Affiliation(s)
- Mahdie Mortezapour
- Department of Molecular Medicine and Genetics, School of Medicine, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Leili Tapak
- Department of Biostatistics, School of Public Health and Modeling of Noncommunicable Diseases Research Center, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Fatemeh Bahreini
- Department of Molecular Medicine and Genetics, School of Medicine, Hamadan University of Medical Sciences, Hamadan, Iran; Research Center for Molecular Medicine, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Rezvan Najafi
- Department of Molecular Medicine and Genetics, School of Medicine, Hamadan University of Medical Sciences, Hamadan, Iran; Research Center for Molecular Medicine, Hamadan University of Medical Sciences, Hamadan, Iran
| | - Saeid Afshar
- Department of Molecular Medicine and Genetics, School of Medicine, Hamadan University of Medical Sciences, Hamadan, Iran; Research Center for Molecular Medicine, Hamadan University of Medical Sciences, Hamadan, Iran.
| |
Collapse
|
13
|
Nalepa J, Kotowski K, Machura B, Adamski S, Bozek O, Eksner B, Kokoszka B, Pekala T, Radom M, Strzelczak M, Zarudzki L, Krason A, Arcadu F, Tessier J. Deep learning automates bidimensional and volumetric tumor burden measurement from MRI in pre- and post-operative glioblastoma patients. Comput Biol Med 2023; 154:106603. [PMID: 36738710 DOI: 10.1016/j.compbiomed.2023.106603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 01/11/2023] [Accepted: 01/22/2023] [Indexed: 02/05/2023]
Abstract
Tumor burden assessment by magnetic resonance imaging (MRI) is central to the evaluation of treatment response for glioblastoma. This assessment is, however, complex to perform and associated with high variability due to the high heterogeneity and complexity of the disease. In this work, we tackle this issue and propose a deep learning pipeline for the fully automated end-to-end analysis of glioblastoma patients. Our approach simultaneously identifies tumor sub-regions, including the enhancing tumor, peritumoral edema and surgical cavity in the first step, and then calculates the volumetric and bidimensional measurements that follow the current Response Assessment in Neuro-Oncology (RANO) criteria. Also, we introduce a rigorous manual annotation process which was followed to delineate the tumor sub-regions by the human experts, and to capture their segmentation confidences that are later used while training deep learning models. The results of our extensive experimental study performed over 760 pre-operative and 504 post-operative adult patients with glioma obtained from the public database (acquired at 19 sites in years 2021-2020) and from a clinical treatment trial (47 and 69 sites for pre-/post-operative patients, 2009-2011) and backed up with thorough quantitative, qualitative and statistical analysis revealed that our pipeline performs accurate segmentation of pre- and post-operative MRIs in a fraction of the manual delineation time (up to 20 times faster than humans). Volumetric measurements were in strong agreement with experts with the Intraclass Correlation Coefficient (ICC): 0.959, 0.703, 0.960 for ET, ED, and cavity. Similarly, automated RANO compared favorably with experienced readers (ICC: 0.681 and 0.866) producing consistent and accurate results. Additionally, we showed that RANO measurements are not always sufficient to quantify tumor burden. The high performance of the automated tumor burden measurement highlights the potential of the tool for considerably improving and simplifying radiological evaluation of glioblastoma in clinical trials and clinical practice.
Collapse
Affiliation(s)
- Jakub Nalepa
- Graylight Imaging, Gliwice, Poland; Department of Algorithmics and Software, Silesian University of Technology, Gliwice, Poland.
| | | | | | | | - Oskar Bozek
- Department of Radiodiagnostics and Invasive Radiology, School of Medicine in Katowice, Medical University of Silesia in Katowice, Katowice, Poland
| | - Bartosz Eksner
- Department of Radiology and Nuclear Medicine, ZSM Chorzów, Chorzów, Poland
| | - Bartosz Kokoszka
- Department of Radiodiagnostics, Interventional Radiology and Nuclear Medicine, University Clinical Centre, Katowice, Poland
| | - Tomasz Pekala
- Department of Radiodiagnostics, Interventional Radiology and Nuclear Medicine, University Clinical Centre, Katowice, Poland
| | - Mateusz Radom
- Department of Radiology and Diagnostic Imaging, Maria Skłodowska-Curie National Research Institute of Oncology, Gliwice Branch, Gliwice, Poland
| | - Marek Strzelczak
- Department of Radiology and Diagnostic Imaging, Maria Skłodowska-Curie National Research Institute of Oncology, Gliwice Branch, Gliwice, Poland
| | - Lukasz Zarudzki
- Department of Radiology and Diagnostic Imaging, Maria Skłodowska-Curie National Research Institute of Oncology, Gliwice Branch, Gliwice, Poland
| | - Agata Krason
- Roche Pharmaceutical Research & Early Development, Early Clinical Development Oncology, Roche Innovation Center Basel, Basel, Switzerland
| | - Filippo Arcadu
- Roche Pharmaceutical Research & Early Development, Early Clinical Development Informatics, Roche Innovation Center Basel, Basel, Switzerland
| | - Jean Tessier
- Roche Pharmaceutical Research & Early Development, Early Clinical Development Oncology, Roche Innovation Center Basel, Basel, Switzerland
| |
Collapse
|
14
|
Baran Y, Doğan B. scMAGS: Marker gene selection from scRNA-seq data for spatial transcriptomics studies. Comput Biol Med 2023; 155:106634. [PMID: 36774895 DOI: 10.1016/j.compbiomed.2023.106634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 01/28/2023] [Accepted: 02/04/2023] [Indexed: 02/11/2023]
Abstract
Single-Cell RNA sequencing (scRNA-seq) has provided unprecedented opportunities for exploring gene expression and thus uncovering regulatory relationships between genes at the single-cell level. However, scRNA-seq relies on isolating cells from tissues. Therefore, the spatial context of the regulatory processes is lost. A recent technological innovation, spatial transcriptomics, allows for the measurement of gene expression while preserving spatial information. An initial step in the spatial transcriptomic analysis is to identify the cell type, which requires a careful selection of cell-specific marker genes. For this purpose, currently, scRNA-seq data is used to select a limited number of marker genes from among all genes that distinguish cell types from each other. This study proposes scMAGS (single-cell MArker Gene Selection), a novel method for marker gene selection from scRNA-seq data for spatial transcriptomics studies. scMAGS uses a filtering step in which the candidate genes are identified before the marker gene selection step. For the selection of marker genes, cluster validity indices, the Silhouette index, or the Calinski-Harabasz index (for large datasets) are utilized. Experimental results showed that, in comparison to the existing methods, scMAGS is scalable, fast, and accurate. Even for large datasets with millions of cells, scMAGS could find the required number of marker genes in a reasonable amount of time with fewer memory requirements. scMAGS is made freely available at https://github.com/doganlab/scmags and can be downloaded from the Python Package Directory (PyPI) software repository with the command pip install scmags.
Collapse
Affiliation(s)
- Yusuf Baran
- Department of Biomedical Engineering, Inonu University, Malatya, Turkey
| | - Berat Doğan
- Department of Biomedical Engineering, Inonu University, Malatya, Turkey.
| |
Collapse
|
15
|
Yagin FH, Cicek İB, Alkhateeb A, Yagin B, Colak C, Azzeh M, Akbulut S. Explainable artificial intelligence model for identifying COVID-19 gene biomarkers. Comput Biol Med 2023; 154:106619. [PMID: 36738712 PMCID: PMC9889119 DOI: 10.1016/j.compbiomed.2023.106619] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/11/2023] [Accepted: 01/28/2023] [Indexed: 02/04/2023]
Abstract
AIM COVID-19 has revealed the need for fast and reliable methods to assist clinicians in diagnosing the disease. This article presents a model that applies explainable artificial intelligence (XAI) methods based on machine learning techniques on COVID-19 metagenomic next-generation sequencing (mNGS) samples. METHODS In the data set used in the study, there are 15,979 gene expressions of 234 patients with COVID-19 negative 141 (60.3%) and COVID-19 positive 93 (39.7%). The least absolute shrinkage and selection operator (LASSO) method was applied to select genes associated with COVID-19. Support Vector Machine - Synthetic Minority Oversampling Technique (SVM-SMOTE) method was used to handle the class imbalance problem. Logistics regression (LR), SVM, random forest (RF), and extreme gradient boosting (XGBoost) methods were constructed to predict COVID-19. An explainable approach based on local interpretable model-agnostic explanations (LIME) and SHAPley Additive exPlanations (SHAP) methods was applied to determine COVID-19- associated biomarker candidate genes and improve the final model's interpretability. RESULTS For the diagnosis of COVID-19, the XGBoost (accuracy: 0.930) model outperformed the RF (accuracy: 0.912), SVM (accuracy: 0.877), and LR (accuracy: 0.912) models. As a result of the SHAP, the three most important genes associated with COVID-19 were IFI27, LGR6, and FAM83A. The results of LIME showed that especially the high level of IFI27 gene expression contributed to increasing the probability of positive class. CONCLUSIONS The proposed model (XGBoost) was able to predict COVID-19 successfully. The results show that machine learning combined with LIME and SHAP can explain the biomarker prediction for COVID-19 and provide clinicians with an intuitive understanding and interpretability of the impact of risk factors in the model.
Collapse
Affiliation(s)
- Fatma Hilal Yagin
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, 44280, Malatya, Turkey.
| | - İpek Balikci Cicek
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, 44280, Malatya, Turkey.
| | - Abedalrhman Alkhateeb
- Software Engineering Department, King Hussein School for Computing Sciences, Amman, Jordan.
| | - Burak Yagin
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, 44280, Malatya, Turkey.
| | - Cemil Colak
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, 44280, Malatya, Turkey.
| | - Mohammad Azzeh
- Data Science Department, King Hussein School for Computing Sciences, Amman, Jordan.
| | - Sami Akbulut
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, 44280, Malatya, Turkey; Inonu University, Faculty of Medicine, Department of Surgery, 44280, Malatya, Turkey; Inonu University, Faculty of Medicine, Department of Public Health, 44280, Malatya, Turkey.
| |
Collapse
|
16
|
Moon S, Kim HJ, Lee Y, Lee YJ, Jung S, Lee JS, Hahn SH, Kim K, Roh JY, Nam S. Oncogenic signaling pathways and hallmarks of cancer in Korean patients with acral melanoma. Comput Biol Med 2023; 154:106602. [PMID: 36716688 DOI: 10.1016/j.compbiomed.2023.106602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 12/23/2022] [Accepted: 01/22/2023] [Indexed: 01/25/2023]
Abstract
Acral melanoma (AM), a rare subtype of cutaneous melanoma, shows higher incidence in Asians, including Koreans, than in Caucasians. However, the genetic modification associated with AM in Koreans is not well known and has not been comprehensively investigated in terms of oncogenic signaling, and hallmarks of cancer. We performed whole-exome and RNA sequencing for Korean patients with AM and acquired the genetic alterations and gene expression profiles. KIT alterations (previously known to be recurrent alterations in AM) and CDK4/CCND1 copy number amplifications were identified in the patients. Genetic and transcriptomic alterations in patients with AM were functionally converge to the hallmarks of cancer and oncogenic pathways, including 'proliferative signal persistence', 'apoptotic resistance', and 'activation of invasion and metastasis', despite the heterogeneous somatic mutation profiles of Korean patients with AM. This study may provide a molecular understanding for therapeutic strategy for AM.
Collapse
Affiliation(s)
- SeongRyeol Moon
- Department of Health Sciences and Technology, Gachon Advanced Institute for Health Sciences and Technology, Gachon University, Incheon, 21999, South Korea
| | - Hee Joo Kim
- Department of Dermatology, Gachon University Gil Medical Center, Gachon University College of Medicine, Incheon, 21565, South Korea
| | - Yeeun Lee
- Department of Health Sciences and Technology, Gachon Advanced Institute for Health Sciences and Technology, Gachon University, Incheon, 21999, South Korea
| | - Yu Joo Lee
- Department of Genome Medicine and Science, Gachon Institute of Genome Medicine and Science, Gachon University Gil Medical Center, Gachon University College of Medicine, Incheon, 21565, South Korea
| | - Sungwon Jung
- Department of Genome Medicine and Science, Gachon Institute of Genome Medicine and Science, Gachon University Gil Medical Center, Gachon University College of Medicine, Incheon, 21565, South Korea
| | - Jin Sook Lee
- Department of Genome Medicine and Science, Gachon Institute of Genome Medicine and Science, Gachon University Gil Medical Center, Gachon University College of Medicine, Incheon, 21565, South Korea; Department of Pediatrics, Seoul National University Hospital Child Cancer and Rare Disease Administration, Seoul National University Children's Hospital, Seoul, 03080, South Korea
| | - Si Houn Hahn
- Division of Genetic Medicine, Department of Pediatrics, University of Washington School of Medicine, Seattle Children's Hospital, Seattle, WA, 98105, USA
| | | | - Joo Young Roh
- Department of Dermatology, Ewha Womans University College of Medicine, Seoul Hospital, Seoul, 07804, South Korea.
| | - Seungyoon Nam
- Department of Health Sciences and Technology, Gachon Advanced Institute for Health Sciences and Technology, Gachon University, Incheon, 21999, South Korea; Department of Genome Medicine and Science, Gachon Institute of Genome Medicine and Science, Gachon University Gil Medical Center, Gachon University College of Medicine, Incheon, 21565, South Korea; AI Convergence Center for Medical Science, Gachon University College of Medicine, Incheon, 21565, South Korea.
| |
Collapse
|
17
|
Zafari N, Bathaei P, Velayati M, Khojasteh-Leylakoohi F, Khazaei M, Fiuji H, Nassiri M, Hassanian SM, Ferns GA, Nazari E, Avan A. Integrated analysis of multi-omics data for the discovery of biomarkers and therapeutic targets for colorectal cancer. Comput Biol Med 2023; 155:106639. [PMID: 36805214 DOI: 10.1016/j.compbiomed.2023.106639] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 01/14/2023] [Accepted: 02/05/2023] [Indexed: 02/12/2023]
Abstract
The considerable burden of colorectal cancer and the rising trend in young adults emphasize the necessity of understanding its underlying mechanisms, providing new diagnostic and prognostic markers, and improving therapeutic approaches. Precision medicine is a new trend all over the world and identification of novel biomarkers and therapeutic targets is a step forward towards this trend. In this context, multi-omics data and integrated analysis are being investigated to develop personalized medicine in the management of colorectal cancer. Given the large amount of data from multi-omics approach, data integration and analysis is a great challenge. In this Review, we summarize how statistical and machine learning techniques are applied to analyze multi-omics data and how it contributes to the discovery of useful diagnostic and prognostic biomarkers and therapeutic targets. Moreover, we discuss the importance of these biomarkers and therapeutic targets in the clinical management of colorectal cancer in the future. Taken together, integrated analysis of multi-omics data has great potential for finding novel diagnostic and prognostic biomarkers and therapeutic targets, however, there are still challenges to overcome in future studies.
Collapse
Affiliation(s)
- Nima Zafari
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Parsa Bathaei
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Mahla Velayati
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Fatemeh Khojasteh-Leylakoohi
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Basic Sciences Research Institute, Mashhad University of Medical Sciences, Mashhad, Iran; Medical Genetics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Majid Khazaei
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Basic Sciences Research Institute, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Hamid Fiuji
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Mohammadreza Nassiri
- Recombinant Proteins Research Group, The Research Institute of Biotechnology, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Seyed Mahdi Hassanian
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Basic Sciences Research Institute, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Gordon A Ferns
- Brighton & Sussex Medical School, Division of Medical Education, Falmer, Brighton, Sussex, BN1 9PH, UK
| | - Elham Nazari
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Basic Sciences Research Institute, Mashhad University of Medical Sciences, Mashhad, Iran.
| | - Amir Avan
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Basic Sciences Research Institute, Mashhad University of Medical Sciences, Mashhad, Iran; Medical Genetics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran.
| |
Collapse
|
18
|
He H, Duo H, Hao Y, Zhang X, Zhou X, Zeng Y, Li Y, Li B. Computational drug repurposing by exploiting large-scale gene expression data: Strategy, methods and applications. Comput Biol Med 2023; 155:106671. [PMID: 36805225 DOI: 10.1016/j.compbiomed.2023.106671] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Revised: 02/05/2023] [Accepted: 02/10/2023] [Indexed: 02/18/2023]
Abstract
De novo drug development is an extremely complex, time-consuming and costly task. Urgent needs for therapies of various diseases have greatly accelerated searches for more effective drug development methods. Luckily, drug repurposing provides a new and effective perspective on disease treatment. Rapidly increased large-scale transcriptome data paints a detailed prospect of gene expression during disease onset and thus has received wide attention in the field of computational drug repurposing. However, how to efficiently mine transcriptome data and identify new indications for old drugs remains a critical challenge. This review discussed the irreplaceable role of transcriptome data in computational drug repurposing and summarized some representative databases, tools and strategies. More importantly, it proposed a practical guideline through establishing the correspondence between three gene expression data types and five strategies, which would facilitate researchers to adopt appropriate strategies to deeply mine large-scale transcriptome data and discover more effective therapies.
Collapse
Affiliation(s)
- Hao He
- College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China; State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, Fudan University, Shanghai, 200032, PR China
| | - Hongrui Duo
- College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China
| | - Youjin Hao
- College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China
| | - Xiaoxi Zhang
- College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China
| | - Xinyi Zhou
- College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China
| | - Yujie Zeng
- College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China
| | - Yinghong Li
- The Key Laboratory on Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, 400065, PR China
| | - Bo Li
- College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China.
| |
Collapse
|
19
|
Wang T, Sun J, Zhao Q. Investigating cardiotoxicity related with hERG channel blockers using molecular fingerprints and graph attention mechanism. Comput Biol Med 2023; 153:106464. [PMID: 36584603 DOI: 10.1016/j.compbiomed.2022.106464] [Citation(s) in RCA: 108] [Impact Index Per Article: 108.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 12/12/2022] [Accepted: 12/19/2022] [Indexed: 12/24/2022]
Abstract
Human ether-a-go-go-related gene (hERG) channel blockade by small molecules is a big concern during drug development in the pharmaceutical industry. Failure or inhibition of hERG channel activity caused by drug molecules can lead to prolonging QT interval, which will result in serious cardiotoxicity. Thus, evaluating the hERG blocking activity of all these small molecular compounds is technically challenging, and the relevant procedures are expensive and time-consuming. In this study, we develop a novel deep learning predictive model named DMFGAM for predicting hERG blockers. In order to characterize the molecule more comprehensively, we first consider the fusion of multiple molecular fingerprint features to characterize its final molecular fingerprint features. Then, we use the multi-head attention mechanism to extract the molecular graph features. Both molecular fingerprint features and molecular graph features are fused as the final features of the compounds to make the feature expression of compounds more comprehensive. Finally, the molecules are classified into hERG blockers or hERG non-blockers through the fully connected neural network. We conduct 5-fold cross-validation experiment to evaluate the performance of DMFGAM, and verify the robustness of DMFGAM on external validation datasets. We believe DMFGAM can serve as a powerful tool to predict hERG channel blockers in the early stages of drug discovery and development.
Collapse
Affiliation(s)
- Tianyi Wang
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China
| | - Jianqiang Sun
- School of Automation and Electrical Engineering, Linyi University, Linyi, 276000, China
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
| |
Collapse
|
20
|
Zhang J, Jiang H, Shi T. ASE-Net: A tumor segmentation method based on image pseudo enhancement and adaptive-scale attention supervision module. Comput Biol Med 2023; 152:106363. [PMID: 36516579 DOI: 10.1016/j.compbiomed.2022.106363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 11/08/2022] [Accepted: 11/25/2022] [Indexed: 11/29/2022]
Abstract
Fluorine 18(18F) fluorodeoxyglucose positron emission tomography and Computed Tomography (PET/CT) is the preferred imaging method of choice for the diagnosis and treatment of many cancers. However, factors such as low-contrast organ and tissue images, and the original scale of tumors pose huge obstacles to the accurate segmentation of tumors. In this work, we propose a novel model ASE-Net which is used for multimodality tumor segmentation. Firstly, we propose a pseudo-enhanced CT image generation method based on metabolic intensity to generate pseudo-enhanced CT images as additional input, which reduces the learning of the network in the spatial position of PET/CT and increases the discriminability of the corresponding structural positions of the high and low metabolic region. Second, unlike previous networks that directly segment tumors of all scales, we propose an Adaptive-Scale Attention Supervision Module at the skip connections, after combining the results of all paths, tumors of different scales will be given different receptive fields. Finally, Dual Path Block is used as the backbone of our network to leverage the ability of residual learning for feature reuse and dense connection for exploring new features. Our experimental results on two clinical PET/CT datasets demonstrate the effectiveness of our proposed network and achieve 78.56% and 72.57% in Dice Similarity Coefficient, respectively, which has better performance compared to state-of-the-art network models, whether for large or small tumors. The proposed model will help pathologists formulate more accurate diagnoses by providing reference opinions during diagnosis, consequently improving patient survival rate.
Collapse
Affiliation(s)
- Junzhi Zhang
- Software College, Northeastern University, No. 195, Chuangxin Road, Hunnan District, Shenyang, 110169, Liaoning, China
| | - Huiyan Jiang
- Software College, Northeastern University, No. 195, Chuangxin Road, Hunnan District, Shenyang, 110169, Liaoning, China; Key Laboratory of Intelligent Computing in Biomedical Image, Ministry of Education, Northeastern University, No. 195, Chuangxin Road, Hunnan District, Shenyang, 110169, Liaoning, China.
| | - Tianyu Shi
- Software College, Northeastern University, No. 195, Chuangxin Road, Hunnan District, Shenyang, 110169, Liaoning, China
| |
Collapse
|
21
|
Huang P, Yan L, Li Z, Zhao S, Feng Y, Zeng J, Chen L, Huang A, Chen Y, Lei S, Huang X, Deng Y, Xie D, Guan H, Peng W, Yu L, Chen B. Potential shared gene signatures and molecular mechanisms between atherosclerosis and depression: Evidence from transcriptome data. Comput Biol Med 2023; 152:106450. [PMID: 36565484 DOI: 10.1016/j.compbiomed.2022.106450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 12/09/2022] [Accepted: 12/19/2022] [Indexed: 12/24/2022]
Abstract
BACKGROUND Atherosclerosis and depression contribute to each other; however, mechanisms linking them at the genetic level remain unexplored. This study aimed to identify shared gene signatures and related pathways between these comorbidities. METHODS Atherosclerosis-related datasets were downloaded from the Gene Expression Omnibus database. Differential and weighted gene co-expression network analyses were employed to identify atherosclerosis-related genes. Depression-related genes were downloaded from the DisGeNET database, and the overlaps between atherosclerosis-related genes and depression-related genes were characterized as crosstalk genes. The functional enrichment analysis and protein-protein interaction network were performed in these gene sets. Subsequently, the Boruta algorithm and Recursive Feature Elimination algorithm were performed to identify feature-selection genes. A support vector machine was constructed to measure the accuracy of calculations, and two external validation sets were included to verify the results. RESULTS Based on two atherosclerosis-related datasets (GSE28829 and GSE43292), 165 genes were determined as atherosclerosis-related genes. Meanwhile, 1478 depression-related genes were obtained. After intersecting, 24 crosstalk genes were identified, and two pathways, "lipid and atherosclerosis" and "tryptophan metabolism," were revealed as mutual pathways according to the enrichment analysis results. Through the protein-protein interaction network, Molecular Complex Detection plugin, and cytoHubba plugin, PTPRC and MMP9 were identified as the hub gene. Moreover, SLC22A3, CASP1, AMPD3, and PIK3CG were recognized as feature-selection genes. Based on two external validation sets, CASP1 and MMP9 were finally determined as the critical crosstalk genes. CONCLUSIONS "Lipid and atherosclerosis" and "tryptophan metabolism" were possibly the pathways of atherosclerosis secondary to depression and depression due to atherosclerosis, respectively. CASP1 and MMP9 were revealed as the most pivotal candidates linking atherosclerosis and depression by mediating these two pathways. Further experimentation is needed to confirm these conclusions.
Collapse
Affiliation(s)
- Peiying Huang
- The Second Clinical Medical School of Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Li Yan
- Department of Neurosurgery of Shenyang Second Hospital of Traditional Chinese Medicine, Shenyang, China
| | - Zhishang Li
- Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Shuai Zhao
- Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Yuchao Feng
- Guangdong Provincial Key Laboratory of Research on Emergency in Traditional Chinese Medicine, Clinical Research Team of Prevention and Treatment of Cardiac Emergencies with Traditional Chinese Medicine, Guangzhou, China
| | - Jing Zeng
- Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Li Chen
- Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Afang Huang
- Departments of Laboratory Medicine of Foshan Forth People's Hospital, Foshan, China
| | - Yan Chen
- Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Sisi Lei
- The Second Clinical Medical School of Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Xiaoyan Huang
- Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Yi Deng
- Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Dan Xie
- The Second Clinical Medical School of Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Hansu Guan
- The Third Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Weihang Peng
- The Second Clinical Medical School of Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Liyuan Yu
- The Second Clinical Medical School of Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Bojun Chen
- The Second Clinical Medical School of Guangzhou University of Chinese Medicine, Guangzhou, China; Emergency Department of Guangdong Provincial Hospital of Traditional Chinese Medicine, Guangzhou, China; Guangdong Provincial Key Laboratory of Research on Emergency in Traditional Chinese Medicine, Clinical Research Team of Prevention and Treatment of Cardiac Emergencies with Traditional Chinese Medicine, Guangzhou, China.
| |
Collapse
|
22
|
Xiang J, Wang X, Wang X, Zhang J, Yang S, Yang W, Han X, Liu Y. Automatic diagnosis and grading of Prostate Cancer with weakly supervised learning on whole slide images. Comput Biol Med 2023; 152:106340. [PMID: 36481762 DOI: 10.1016/j.compbiomed.2022.106340] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 11/02/2022] [Accepted: 11/16/2022] [Indexed: 11/23/2022]
Abstract
BACKGROUND The workflow of prostate cancer diagnosis and grading is cumbersome and the results suffer from substantial inter-observer variability. Recent trials have shown potential in using machine learning to develop automated systems to address this challenge. Most automated deep learning systems for prostate cancer Gleason grading focused on supervised learning requiring demanding fine-grained pixel-level annotations. METHODS A weakly-supervised deep learning model with slide-level labels is presented in this study for the diagnosis and grading of prostate cancer with whole slide image (WSI). WSIs are first cropped into small patches and then processed with a deep learning model to extract patch-level features. A graph convolution network (GCN) is used to aggregate the features for classifications. Throughout the training process, the noisy labels are progressively filtered out to reduce inter-observer variations in clinical reports. Finally, multi-center independent test cohorts with 6,174 slides are collected to evaluate the prostate cancer diagnosis and grading performance of our model. RESULTS The cancer diagnosis (2-level classification) results on two external test sets (n= 4,675, n= 844) show an area under the receiver operating characteristic curve (AUC) of 0.985 and 0.986. The Gleason grading (6-level classification) results reach 0.931 quadratic weighted kappa on the internal test set (n= 531). It generalizes well on the external test dataset (n= 844) with 0.801 quadratic weighted kappa with the reference standard set independently. The model enables pathological meaningful interpretability by visualizing the most attended lesions which are highly consistent with expert annotations. CONCLUSION The proposed model incorporates a graph network in weakly supervised learning with only slide-level reports. A robust learning strategy is also employed to correct the label noise. It is highly accurate (>0.985 AUC for diagnosis) and also interpretable with intuitive heatmap visualization. It can be unified with a digital pathology pipeline to deliver prostate cancer metrics for a pathology report.
Collapse
Affiliation(s)
| | - Xiyue Wang
- College of Computer Science, Sichuan University, Chengdu, China
| | - Xinran Wang
- Department of Pathology, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | | | - Sen Yang
- AI Lab, Tencent, Shenzhen, China
| | - Wei Yang
- AI Lab, Tencent, Shenzhen, China
| | - Xiao Han
- AI Lab, Tencent, Shenzhen, China
| | - Yueping Liu
- Department of Pathology, The Fourth Hospital of Hebei Medical University, Shijiazhuang, China.
| |
Collapse
|
23
|
Falchetti M, Delgobo M, Zancanaro H, Almeida K, das Neves RN, Dos Santos B, Stefanes NM, Bishop A, Santos-Silva MC, Zanotto-Filho A. Omics-based identification of an NRF2-related auranofin resistance signature in cancer: Insights into drug repurposing. Comput Biol Med 2023; 152:106347. [PMID: 36493734 DOI: 10.1016/j.compbiomed.2022.106347] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 10/04/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]
Abstract
Auranofin is a thioredoxin reductase-1 inhibitor originally approved for the treatment of rheumatoid arthritis. Recently, auranofin has been repurposed as an anticancer drug, with pharmacological activity reported in multiple cancer types. In this study, we characterized transcriptional and genetic alterations associated with auranofin response in cancer. By integrating data from an auranofin cytotoxicity screen with transcriptome profiling of lung cancer cell lines, we identified an auranofin resistance signature comprising 29 genes, most of which are classical targets of the transcription factor NRF2, such as genes involved in glutathione metabolism (GCLC, GSR, SLC7A11) and thioredoxin system (TXN, TXNRD1). Pan-cancer analysis revealed that mutations in NRF2 pathway genes, namely KEAP1 and NFE2L2, are strongly associated with overexpression of the auranofin resistance gene set. By clustering cancer types based on auranofin resistance signature expression, hepatocellular carcinoma, and a subset of non-small cell lung cancer, head-neck squamous cell carcinoma, and esophageal cancer carrying NFE2L2/KEAP1 mutations were predicted resistant, whereas leukemia, lymphoma, and multiple myeloma were predicted sensitive to auranofin. Cell viability assays in a panel of 20 cancer cell lines confirmed the augmented sensitivity of hematological cancers to auranofin; an effect associated with dependence upon glutathione and decreased expression of NRF2 target genes involved in GSH synthesis and recycling (GCLC, GCLM and GSR) in these cancer types. In summary, the omics-based identification of sensitive/resistant cancers and genetic alterations associated with these phenotypes may guide an appropriate repurposing of auranofin in cancer therapy.
Collapse
Affiliation(s)
- Marcelo Falchetti
- Laboratório de Farmacologia e Bioquímica do Câncer (LabCancer), Departamento de Farmacologia, Centro de Ciências Biológicas (CCB), Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil
| | - Marina Delgobo
- Laboratório de Farmacologia e Bioquímica do Câncer (LabCancer), Departamento de Farmacologia, Centro de Ciências Biológicas (CCB), Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil
| | - Helena Zancanaro
- Laboratório de Farmacologia e Bioquímica do Câncer (LabCancer), Departamento de Farmacologia, Centro de Ciências Biológicas (CCB), Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil
| | - Karoline Almeida
- Laboratório de Farmacologia e Bioquímica do Câncer (LabCancer), Departamento de Farmacologia, Centro de Ciências Biológicas (CCB), Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil
| | - Raquel Nascimento das Neves
- Laboratório de Farmacologia e Bioquímica do Câncer (LabCancer), Departamento de Farmacologia, Centro de Ciências Biológicas (CCB), Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil; Greehey Children's Cancer Research Institute, University of Texas Health at San Antonio, San Antonio, TX, 78229, USA
| | - Barbara Dos Santos
- Laboratório de Farmacologia e Bioquímica do Câncer (LabCancer), Departamento de Farmacologia, Centro de Ciências Biológicas (CCB), Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil
| | - Natália Marcéli Stefanes
- Laboratório de Oncologia Experimental e Hemopatias (LOEH), Departamento de Análises Clínicas, Centro de Ciências da Saúde, Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil
| | - Alexander Bishop
- Greehey Children's Cancer Research Institute, University of Texas Health at San Antonio, San Antonio, TX, 78229, USA; Department of Cell Systems and Anatomy, University of Texas Health at San Antonio, San Antonio, TX, 78229, USA
| | - Maria Cláudia Santos-Silva
- Laboratório de Oncologia Experimental e Hemopatias (LOEH), Departamento de Análises Clínicas, Centro de Ciências da Saúde, Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil
| | - Alfeu Zanotto-Filho
- Laboratório de Farmacologia e Bioquímica do Câncer (LabCancer), Departamento de Farmacologia, Centro de Ciências Biológicas (CCB), Universidade Federal de Santa Catarina (UFSC), Florianópolis, Santa Catarina, 88040-900, Brazil. https://labcancer.paginas.ufsc.br
| |
Collapse
|
24
|
Yue ZX, Yan TC, Xu HQ, Liu YH, Hong YF, Chen GX, Xie T, Tao L. A systematic review on the state-of-the-art strategies for protein representation. Comput Biol Med 2023; 152:106440. [PMID: 36543002 DOI: 10.1016/j.compbiomed.2022.106440] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 12/08/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022]
Abstract
The study of drug-target protein interaction is a key step in drug research. In recent years, machine learning techniques have become attractive for research, including drug research, due to their automated nature, predictive power, and expected efficiency. Protein representation is a key step in the study of drug-target protein interaction by machine learning, which plays a fundamental role in the ultimate accomplishment of accurate research. With the progress of machine learning, protein representation methods have gradually attracted attention and have consequently developed rapidly. Therefore, in this review, we systematically classify current protein representation methods, comprehensively review them, and discuss the latest advances of interest. According to the information extraction methods and information sources, these representation methods are generally divided into structure and sequence-based representation methods. Each primary class can be further divided into specific subcategories. As for the particular representation methods involve both traditional and the latest approaches. This review contains a comprehensive assessment of the various methods which researchers can use as a reference for their specific protein-related research requirements, including drug research.
Collapse
Affiliation(s)
- Zi-Xuan Yue
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China
| | - Tian-Ci Yan
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China
| | - Hong-Quan Xu
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China
| | - Yu-Hong Liu
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China
| | - Yan-Feng Hong
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China
| | - Gong-Xing Chen
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China
| | - Tian Xie
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China.
| | - Lin Tao
- Key Laboratory of Elemene Class Anti-cancer Chinese Medicines, School of Pharmacy, Hangzhou Normal University, Hangzhou, 311121, China.
| |
Collapse
|
25
|
Liu C, Zhou Y, Zhou Y, Tang X, Tang L, Wang J. Identification of crucial genes for predicting the risk of atherosclerosis with system lupus erythematosus based on comprehensive bioinformatics analysis and machine learning. Comput Biol Med 2023; 152:106388. [PMID: 36470144 DOI: 10.1016/j.compbiomed.2022.106388] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 11/22/2022] [Accepted: 11/28/2022] [Indexed: 12/02/2022]
Abstract
BACKGROUND Systemic lupus erythematosus (SLE) has become a major public health problem over the years, and atherosclerosis (AS) is one of the main complications of SLE associated with serious cardiovascular consequences in this patient population. The present study aimed to identify potential biomarkers for SLE patients with AS. METHODS Five microarray datasets (GSE50772, GSE81622, GSE100927, GSE28829, GSE37356) were downloaded from the NCBI Gene Expression Omnibus database. The Limma package was used to identify differentially expressed genes (DEGs) in AS. Weighted gene coexpression network analysis (WGCNA) was used to identify significant module genes associated with SLE. Functional enrichment analysis, protein-protein interaction (PPI) network construction, and machine learning algorithms (least absolute shrinkage and selection operator (Lasso, Support Vector Machine-Recursive Feature Elimination (SVM-RFE), and random forest) were applied to identify hub genes. Subsequently, we generated a nomogram and receiver operating characteristic curve (ROC) for predicting the risk of AS in SLE patients. Finally, immune cell infiltrations were analyzed, and Consensus Cluster Analysis was conducted based on Single Sample Gene Set Enrichment Analysis (ssGSEA) scores. RESULTS Five hub genes (SPI1, MMP9, C1QA, CX3CR1, and MNDA) were identified and used to establish a nomogram that yielded a high predictive performance (area under the curve 0.900-0.981). Dysregulated immune cell infiltrations were found in AS, with positive correlations with the five hub genes. Consensus clustering showed that the optimal number of subtypes was 3. Compared to subtypes A and B, subtype C presented higher expression of the five hub genes, immune cell infiltration levels and immune checkpoint expression. CONCLUSION Our study systematically identified five candidate hub genes (SPI1, MMP9, C1QA, CX3CR1, MNDA) and established a nomogram that could predict the risk of AS with SLE using various bioinformatic analyses and machine learning algorithms. Our findings provide the foothold for future studies on potential crucial genes for AS in SLE patients. Additionally, the dysregulated immune cell proportions and immune checkpoint expressions in AS with SLE were identified.
Collapse
Affiliation(s)
- Chunjiang Liu
- Department of General Surgery, Division of Vascular Surgery, Shaoxing People's Hospital (Shaoxing Hospital of Zhejiang University), Shaoxing, 312000, China
| | - Yufei Zhou
- Department of Cardiology, Shanghai Institute of Cardiovascular Diseases, Zhongshan Hospital and Institutes of Biomedical Sciences, Fudan University, Shanghai, 200032, China
| | - Yue Zhou
- Department of General Surgery, Division of Vascular Surgery, Shaoxing People's Hospital (Shaoxing Hospital of Zhejiang University), Shaoxing, 312000, China
| | - Xiaoqi Tang
- Department of General Surgery, Division of Vascular Surgery, Shaoxing People's Hospital (Shaoxing Hospital of Zhejiang University), Shaoxing, 312000, China
| | - Liming Tang
- Department of General Surgery, Division of Vascular Surgery, Shaoxing People's Hospital (Shaoxing Hospital of Zhejiang University), Shaoxing, 312000, China.
| | - Jiajia Wang
- Department of Rheumatology, Shaoxing People's Hospital (Shaoxing Hospital of Zhejiang University), Shaoxing, 312000, China.
| |
Collapse
|
26
|
Li Q, Wu J, Zhu M, Tang Y, Jin L, Chen Y, Jin M, Peng Z. A novel risk signature based on autophagy-related genes to evaluate tumor immune microenvironment and predict prognosis in hepatocellular carcinoma. Comput Biol Med 2022. [DOI: 10.1016/j.compbiomed.2022.106437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
27
|
Tang H, Sun L, Huang J, Yang Z, Li C, Zhou X. The mechanism and biomarker function of Cavin-2 in lung ischemia-reperfusion injury. Comput Biol Med 2022; 151:106234. [PMID: 36335812 DOI: 10.1016/j.compbiomed.2022.106234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 10/01/2022] [Accepted: 10/22/2022] [Indexed: 12/27/2022]
Abstract
BACKGROUND Lung Ischemia Reperfusion injury(LIRI) is one of the most predominant complications of ischemic lung disease. Cavin-2 emerged as a regulator of a variety of cellular processes, including endocytosis, lipid homeostasis, signal transduction and tumorigenesis, but the function of Cavin-2 in LIRI is unknown. The purpose of this study was to determine the predictive potential of Cavin-2 in protecting lung ischemia-reperfusion injury and its corresponding mechanisms. METHODS We found the strong relationship between Cavin-2 and multiple immune-related genes by deep learning method. To reveal the mechanism of Cavin-2 in LIRI, the LIRI SD rat model was constructed to detect the expression of Cavin-2 in the lung tissue of SD rats after LIRI, and the expression of Cavin-2 in lung cell lines was also detected. The expression of IL-6, IL-10 and MDA in cells after Cavin-2 over-expression or knockdown was examined under hypoxic conditions. The expression levels of p-AKT, p-STAT3 and p-ERK1/2 were measured in over-expressing Cavin-2 cells under hypoxic-ischemia conditions, and then the corresponding blockers of AKT, STAT3 and ERK1/2 were given to verify, whether they play a protective role in LIRI. RESULTS After hypoxia, the expression of Cavin-2 in rat lung tissues was significantly increased, and the cellular activity and IL-10 in Cavin-2 over-expressing cells were significantly higher than that of the control group, while IL-6 and MDA were significantly lower than that of the control group, while the above results were reversed in Cavin-2 knockdown cells; Meanwhile, the phosphorylation levels of AKT, STAT3, and ERK1/2 were significantly increased in Cavin-2 over-expression cells after hypoxia. When AKT, STAT3, and ERK1/2 specific blockers were given, they lost their protective effect against LIRI. CONCLUSIONS Cavin-2 shows biomarker potential in protecting lung from ischemia-reperfusion injury through the survivor activating factor enhancement (SAFE) and reperfusion injury salvage kinase (RISK) pathway.
Collapse
Affiliation(s)
- Hexiao Tang
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China
| | - Linao Sun
- Tianjin Medical University, Tianjin, China
| | - Jingyu Huang
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China
| | - Zetian Yang
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China
| | - Changsheng Li
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China.
| | - Xuefeng Zhou
- Department of Thoracic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China.
| |
Collapse
|
28
|
Yang Q, Li B, Wang P, Xie J, Feng Y, Liu Z, Zhu F. LargeMetabo: an out-of-the-box tool for processing and analyzing large-scale metabolomic data. Brief Bioinform 2022; 23:6768054. [PMID: 36274234 DOI: 10.1093/bib/bbac455] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 09/06/2022] [Accepted: 09/24/2022] [Indexed: 12/14/2022] Open
Abstract
Large-scale metabolomics is a powerful technique that has attracted widespread attention in biomedical studies focused on identifying biomarkers and interpreting the mechanisms of complex diseases. Despite a rapid increase in the number of large-scale metabolomic studies, the analysis of metabolomic data remains a key challenge. Specifically, diverse unwanted variations and batch effects in processing many samples have a substantial impact on identifying true biological markers, and it is a daunting challenge to annotate a plethora of peaks as metabolites in untargeted mass spectrometry-based metabolomics. Therefore, the development of an out-of-the-box tool is urgently needed to realize data integration and to accurately annotate metabolites with enhanced functions. In this study, the LargeMetabo package based on R code was developed for processing and analyzing large-scale metabolomic data. This package is unique because it is capable of (1) integrating multiple analytical experiments to effectively boost the power of statistical analysis; (2) selecting the appropriate biomarker identification method by intelligent assessment for large-scale metabolic data and (3) providing metabolite annotation and enrichment analysis based on an enhanced metabolite database. The LargeMetabo package can facilitate flexibility and reproducibility in large-scale metabolomics. The package is freely available from https://github.com/LargeMetabo/LargeMetabo.
Collapse
Affiliation(s)
- Qingxia Yang
- Department of Bioinformatics, Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.,College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Bo Li
- College of Life Sciences, Chongqing Normal University, Chongqing, Chongqing 401331, China
| | - Panpan Wang
- College of Chemistry and Pharmaceutical Engineering, Huanghuai University, Zhumadian 463000, China
| | - Jicheng Xie
- Department of Bioinformatics, Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China
| | - Yuhao Feng
- Department of Bioinformatics, Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China
| | - Ziqiang Liu
- Department of Bioinformatics, Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| |
Collapse
|
29
|
Rong Z, Liu Z, Song J, Cao L, Yu Y, Qiu M, Hou Y. MCluster-VAEs: An end-to-end variational deep learning-based clustering method for subtype discovery using multi-omics data. Comput Biol Med 2022; 150:106085. [PMID: 36162197 DOI: 10.1016/j.compbiomed.2022.106085] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 07/30/2022] [Accepted: 09/03/2022] [Indexed: 11/03/2022]
Abstract
The discovery of cancer subtypes based on unsupervised clustering helps in providing a precise diagnosis, guide treatment, and improve patients' prognoses. Instead of single-omics data, multi-omics data can improve the clustering performance because it obtains a comprehensive landscape for understanding biological systems and mechanisms. However, heterogeneous data from multiple sources raises high complexity and different kinds of noise, which are detrimental to the extraction of clustering information. We propose an end-to-end deep learning based method, called Multi-omics Clustering Variational Autoencoders (MCluster-VAEs), that can extract cluster-friendly representations on multi-omics data. First, a unified network architecture with an attention mechanism was developed for accurately modeling multi-omics data. Then, using a novel objective function built from the Variational Bayes technique, the model was trained to effectively obtain the posterior estimation of the clustering assignments. Compared with 12 other state-of-the-art multi-omics clustering methods, MCluster-VAEs achieved an outstanding performance on benchmark datasets from the TCGA database. On the Pan Cancer dataset, MCluster-VAEs achieved an adjusted Rand index of approximately 0.78 for cancer category recognition, an increase of more than 18% compared with other methods. Furthermore, a survival analysis and clinical parameter enrichment tests conducted on 10 cancer datasets demonstrated that MCluster-VAEs provides comparable and even better results than many common integrative approaches. These results demonstrate that MCluster-VAEs are a powerful new tool for dissecting complex multi-omics relationships and providing new insights for cancer subtype discovery.
Collapse
Affiliation(s)
- Zhiwei Rong
- Department of Biostatistics Beijing, Peking University School of Public Health, No. 38 Xueyuan Road, Haidian District, Beijing, 100000, China
| | - Zhilin Liu
- Department of Biostatistics Beijing, Peking University School of Public Health, No. 38 Xueyuan Road, Haidian District, Beijing, 100000, China
| | - Jiali Song
- Department of Biostatistics Beijing, Peking University School of Public Health, No. 38 Xueyuan Road, Haidian District, Beijing, 100000, China
| | - Lei Cao
- Department of Epidemiology and Biostatistics Harbin, Harbin Medical University School of Public Health, Harbin, 150000, Heilongjiang, China
| | - Yipe Yu
- Department of Biostatistics Beijing, Peking University School of Public Health, No. 38 Xueyuan Road, Haidian District, Beijing, 100000, China
| | - Mantang Qiu
- Department of Thoracic Surgery Beijing, Peking University People's Hospital, Beijing, 100000, China.
| | - Yan Hou
- Department of Biostatistics Beijing, Peking University School of Public Health, No. 38 Xueyuan Road, Haidian District, Beijing, 100000, China; Peking University Clinical Research Center, No. 38 Xueyuan Road, Haidian District, Beijing, 100000, China.
| |
Collapse
|
30
|
Zhou K, Cai C, He Y, Chen Z. Potential prognostic biomarkers of sudden cardiac death discovered by machine learning. Comput Biol Med 2022; 150:106154. [PMID: 36208596 DOI: 10.1016/j.compbiomed.2022.106154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 09/16/2022] [Accepted: 09/24/2022] [Indexed: 11/24/2022]
Abstract
OBJECTIVE Sudden cardiac death (SCD) is a serious public health burden. This study aims to find prognostic biomarkers of SCD using machine learning. METHODS The myocardial samples from 21 accidental death and 82 sudden death donors were compared to seek for differential genes. Enriched active genes were found according to the PPI interaction network. GSEA analyzed differences in function and pathway between control and experimental groups. Related diseases caused by active genes are mainly exhibited through DO enrichment. Prognostic biomarkers for SCD are identified via two machine learning algorithms. The CIBERSORT method was used to compare the immune microenvironment changes in patients with SCD. RESULTS SCD was mainly associated with heart and kidney diseases caused by atherosclerosis. DEFA1B, BGN, SERPINE1, CCL2 and HBB are considered to be prognostic biomarkers for SCD after machine learning. And immune infiltration plays an important role in the process of SCD. CONCLUSION We discovered 5 prognostic biomarkers for SCD. And immune microenvironment changes was also found in SCD. Moreover, atherosclerosis might be an important risk factor for SCD.
Collapse
Affiliation(s)
- Kena Zhou
- Gastroenterology Department of Ningbo No.9 Hospital, Ningbo, Zhejiang, 315000, China
| | - Congbo Cai
- Emergency Department of Yinzhou No.2 Hospital, Ningbo, Zhejiang, 315000, China
| | - Yi He
- Gastroenterology Department of Ningbo No.9 Hospital, Ningbo, Zhejiang, 315000, China
| | - Zhihua Chen
- Emergency Department of Ningbo No.1 Hospital, Ningbo, Zhejiang, 315000, China.
| |
Collapse
|
31
|
Sun X, Zhang Y, Li H, Zhou Y, Shi S, Chen Z, He X, Zhang H, Li F, Yin J, Mou M, Wang Y, Qiu Y, Zhu F. DRESIS: the first comprehensive landscape of drug resistance information. Nucleic Acids Res 2022; 51:D1263-D1275. [PMID: 36243960 PMCID: PMC9825618 DOI: 10.1093/nar/gkac812] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 08/22/2022] [Accepted: 10/11/2022] [Indexed: 01/30/2023] Open
Abstract
Widespread drug resistance has become the key issue in global healthcare. Extensive efforts have been made to reveal not only diverse diseases experiencing drug resistance, but also the six distinct types of molecular mechanisms underlying this resistance. A database that describes a comprehensive list of diseases with drug resistance (not just cancers/infections) and all types of resistance mechanisms is now urgently needed. However, no such database has been available to date. In this study, a comprehensive database describing drug resistance information named 'DRESIS' was therefore developed. It was introduced to (i) systematically provide, for the first time, all existing types of molecular mechanisms underlying drug resistance, (ii) extensively cover the widest range of diseases among all existing databases and (iii) explicitly describe the clinically/experimentally verified resistance data for the largest number of drugs. Since drug resistance has become an ever-increasing clinical issue, DRESIS is expected to have great implications for future new drug discovery and clinical treatment optimization. It is now publicly accessible without any login requirement at: https://idrblab.org/dresis/.
Collapse
Affiliation(s)
| | | | | | | | - Shuiyang Shi
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Zhen Chen
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Xin He
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China,Zhejiang University–University of Edinburgh Institute, Zhejiang University, Haining 314499, China
| | - Hanyu Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Jiayi Yin
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yunzhu Wang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yunqing Qiu
- The First Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Feng Zhu
- To whom correspondence should be addressed.
| |
Collapse
|
32
|
Li F, Yin J, Lu M, Mou M, Li Z, Zeng Z, Tan Y, Wang S, Chu X, Dai H, Hou T, Zeng S, Chen Y, Zhu F. DrugMAP: molecular atlas and pharma-information of all drugs. Nucleic Acids Res 2022; 51:D1288-D1299. [PMID: 36243961 PMCID: PMC9825453 DOI: 10.1093/nar/gkac813] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 08/30/2022] [Accepted: 10/12/2022] [Indexed: 02/06/2023] Open
Abstract
The efficacy and safety of drugs are widely known to be determined by their interactions with multiple molecules of pharmacological importance, and it is therefore essential to systematically depict the molecular atlas and pharma-information of studied drugs. However, our understanding of such information is neither comprehensive nor precise, which necessitates the construction of a new database providing a network containing a large number of drugs and their interacting molecules. Here, a new database describing the molecular atlas and pharma-information of drugs (DrugMAP) was therefore constructed. It provides a comprehensive list of interacting molecules for >30 000 drugs/drug candidates, gives the differential expression patterns for >5000 interacting molecules among different disease sites, ADME (absorption, distribution, metabolism and excretion)-relevant organs and physiological tissues, and weaves a comprehensive and precise network containing >200 000 interactions among drugs and molecules. With the great efforts made to clarify the complex mechanism underlying drug pharmacokinetics and pharmacodynamics and rapidly emerging interests in artificial intelligence (AI)-based network analyses, DrugMAP is expected to become an indispensable supplement to existing databases to facilitate drug discovery. It is now fully and freely accessible at: https://idrblab.org/drugmap/.
Collapse
Affiliation(s)
| | | | - Mingkun Lu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba–Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Zhenyu Zeng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba–Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Ying Tan
- State Key Laboratory of Chemical Oncogenomics, Key Laboratory of Chemical Biology, Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China
| | - Shanshan Wang
- Qian Xuesen Collaborative Research Center of Astrochemistry and Space Life Sciences, Institute of Drug Discovery Technology, Ningbo University, Ningbo 315211, China
| | - Xinyi Chu
- Qian Xuesen Collaborative Research Center of Astrochemistry and Space Life Sciences, Institute of Drug Discovery Technology, Ningbo University, Ningbo 315211, China
| | - Haibin Dai
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Tingjun Hou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Su Zeng
- Correspondence may also be addressed to Su Zeng.
| | - Yuzong Chen
- Correspondence may also be addressed to Yuzong Chen.
| | - Feng Zhu
- To whom correspondence should be addressed.
| |
Collapse
|
33
|
Beura S, Kundu P, Das AK, Ghosh A. Metagenome-scale community metabolic modelling for understanding the role of gut microbiota in human health. Comput Biol Med 2022; 149:105997. [DOI: 10.1016/j.compbiomed.2022.105997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 07/03/2022] [Accepted: 08/14/2022] [Indexed: 11/03/2022]
|
34
|
Fu Z, Liu W, Huang C, Mei T. A Review of Performance Prediction Based on Machine Learning in Materials Science. NANOMATERIALS (BASEL, SWITZERLAND) 2022; 12:nano12172957. [PMID: 36079994 PMCID: PMC9457802 DOI: 10.3390/nano12172957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 08/07/2022] [Accepted: 08/24/2022] [Indexed: 05/11/2023]
Abstract
With increasing demand in many areas, materials are constantly evolving. However, they still have numerous practical constraints. The rational design and discovery of new materials can create a huge technological and social impact. However, such rational design and discovery require a holistic, multi-stage design process, including the design of the material composition, material structure, material properties as well as process design and engineering. Such a complex exploration using traditional scientific methods is not only blind but also a huge waste of time and resources. Machine learning (ML), which is used across data to find correlations in material properties and understand the chemical properties of materials, is being considered a new way to explore the materials field. This paper reviews some of the major recent advances and applications of ML in the field of properties prediction of materials and discusses the key challenges and opportunities in this cross-cutting area.
Collapse
Affiliation(s)
- Ziyang Fu
- School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China
- Hubei Software Engineering Technology Research Center, Wuhan 430062, China
- Hubei Engineering Research Center for Smart Government and Artificial Intelligence Application, Wuhan 430062, China
| | - Weiyi Liu
- School of Materials Science and Engineering, Hubei University, Wuhan 430062, China
| | - Chen Huang
- School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China
- Hubei Software Engineering Technology Research Center, Wuhan 430062, China
- Hubei Engineering Research Center for Smart Government and Artificial Intelligence Application, Wuhan 430062, China
- Correspondence: (C.H.); (T.M.)
| | - Tao Mei
- School of Materials Science and Engineering, Hubei University, Wuhan 430062, China
- Hubei Collaborative Innovation Center for Advanced Organic Chemical Materials, Wuhan 430062, China
- Key Laboratory for the Green Preparation and Application of Functional Materials, Wuhan 430062, China
- Correspondence: (C.H.); (T.M.)
| |
Collapse
|
35
|
Zhang S, Sun X, Mou M, Amahong K, Sun H, Zhang W, Shi S, Li Z, Gao J, Zhu F. REGLIV: Molecular regulation data of diverse living systems facilitating current multiomics research. Comput Biol Med 2022; 148:105825. [PMID: 35872412 DOI: 10.1016/j.compbiomed.2022.105825] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 06/29/2022] [Accepted: 07/03/2022] [Indexed: 12/24/2022]
Abstract
Multiomics is a powerful technique in molecular biology that facilitates the identification of new associations among different molecules (genes, proteins & metabolites). It has attracted tremendous research interest from the scientists worldwide and has led to an explosive number of published studies. Most of these studies are based on the regulation data provided in available databases. Therefore, it is essential to have molecular regulation data that are strictly validated in the living systems of various cell lines and in vivo models. However, no database has been developed yet to provide comprehensive molecular regulation information validated by living systems. Herein, a new database, Molecular Regulation Data of Living System Facilitating Multiomics Study (REGLIV) is introduced to describe various types of molecular regulation tested by the living systems. (1) A total of 2996 regulations describe the changes in 1109 metabolites triggered by alterations in 284 genes or proteins, and (2) 1179 regulations describe the variations in 926 proteins induced by 125 endogenous metabolites. Overall, REGLIV is unique in (a) providing the molecular regulation of a clearly defined regulatory direction other than simple correlation, (b) focusing on molecular regulations that are validated in a living system not simply in an in vitro test, and (c) describing the disease/tissue/species specific property underlying each regulation. Therefore, REGLIV has important implications for the future practice of not only multiomics, but also other fields relevant to molecular regulation. REGLIV is freely accessible at: https://idrblab.org/regliv/.
Collapse
Affiliation(s)
- Song Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Xiuna Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Kuerbannisha Amahong
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Wei Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Shuiyang Shi
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Jianqing Gao
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China.
| |
Collapse
|
36
|
An ensemble framework for microarray data classification based on feature subspace partitioning. Comput Biol Med 2022; 148:105820. [PMID: 35872409 DOI: 10.1016/j.compbiomed.2022.105820] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Revised: 06/05/2022] [Accepted: 07/03/2022] [Indexed: 12/14/2022]
Abstract
Feature selection is exposed to the curse of dimensionality risk, and it is even more exacerbated with high-dimensional data such as microarrays. Moreover, the low-instance/high-feature (LIHF) property of microarray data needs considerable processing time to do some calculations and comparisons among features to choose the best subset of them, which has led to many efforts to subdue the LIHF property of such genomic medicine data. Due to the promising results of the ensemble models in machine learning problems, this paper presents a novel framework, named feature-level aggregation-based ensemble based on overlapped feature subspace partitioning (FLAE-OFSP) for microarray data classification. The proposed ensemble has three main steps: after generating several subsets by the proposed partitioning approach, a feature selection algorithm (i.e., a feature ranker) is applied on each subset, and finally, their results are combined into a single ranked list using six defined aggregation functions. Evaluation of the presented framework based on seven microarray datasets and using four measures, including stability, classification accuracy, runtime, and Modscore shows substantial runtime improvement and also quality results in other evaluated measures compared to individual methods.
Collapse
|
37
|
Diagnosis and prognosis of COVID-19 employing analysis of patients' plasma and serum via LC-MS and machine learning. Comput Biol Med 2022; 146:105659. [PMID: 35751188 PMCID: PMC9123826 DOI: 10.1016/j.compbiomed.2022.105659] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 05/11/2022] [Accepted: 05/18/2022] [Indexed: 01/11/2023]
Abstract
OBJECTIVE To implement and evaluate machine learning (ML) algorithms for the prediction of COVID-19 diagnosis, severity, and fatality and to assess biomarkers potentially associated with these outcomes. MATERIAL AND METHODS Serum (n = 96) and plasma (n = 96) samples from patients with COVID-19 (acute, severe and fatal illness) from two independent hospitals in China were analyzed by LC-MS. Samples from healthy volunteers and from patients with pneumonia caused by other viruses (i.e. negative RT-PCR for COVID-19) were used as controls. Seven different ML-based models were built: PLS-DA, ANNDA, XGBoostDA, SIMCA, SVM, LREG and KNN. RESULTS The PLS-DA model presented the best performance for both datasets, with accuracy rates to predict the diagnosis, severity and fatality of COVID-19 of 93%, 94% and 97%, respectively. Low levels of the metabolites ribothymidine, 4-hydroxyphenylacetoylcarnitine and uridine were associated with COVID-19 positivity, whereas high levels of N-acetyl-glucosamine-1-phosphate, cysteinylglycine, methyl isobutyrate, l-ornithine and 5,6-dihydro-5-methyluracil were significantly related to greater severity and fatality from COVID-19. CONCLUSION The PLS-DA model can help to predict SARS-CoV-2 diagnosis, severity and fatality in daily practice. Some biomarkers typically increased in COVID-19 patients' serum or plasma (i.e. ribothymidine, N-acetyl-glucosamine-1-phosphate, l-ornithine, 5,6-dihydro-5-methyluracil) should be further evaluated as prognostic indicators of the disease.
Collapse
|
38
|
Li F, Yin J, Lu M, Yang Q, Zeng Z, Zhang B, Li Z, Qiu Y, Dai H, Chen Y, Zhu F. ConSIG: consistent discovery of molecular signature from OMIC data. Brief Bioinform 2022; 23:6618243. [PMID: 35758241 DOI: 10.1093/bib/bbac253] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 05/09/2022] [Accepted: 05/31/2022] [Indexed: 12/12/2022] Open
Abstract
The discovery of proper molecular signature from OMIC data is indispensable for determining biological state, physiological condition, disease etiology, and therapeutic response. However, the identified signature is reported to be highly inconsistent, and there is little overlap among the signatures identified from different biological datasets. Such inconsistency raises doubts about the reliability of reported signatures and significantly hampers its biological and clinical applications. Herein, an online tool, ConSIG, was constructed to realize consistent discovery of gene/protein signature from any uploaded transcriptomic/proteomic data. This tool is unique in a) integrating a novel strategy capable of significantly enhancing the consistency of signature discovery, b) determining the optimal signature by collective assessment, and c) confirming the biological relevance by enriching the disease/gene ontology. With the increasingly accumulated concerns about signature consistency and biological relevance, this online tool is expected to be used as an essential complement to other existing tools for OMIC-based signature discovery. ConSIG is freely accessible to all users without login requirement at https://idrblab.org/consig/.
Collapse
Affiliation(s)
- Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Jiayi Yin
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Mingkun Lu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Qingxia Yang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Zhenyu Zeng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Bing Zhang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Yunqing Qiu
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, 79 QingChun Road, Hangzhou, Zhejiang 310000, China
| | - Haibin Dai
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yuzong Chen
- State Key Laboratory of Chemical Oncogenomics, Key Laboratory of Chemical Biology, The Graduate School at Shenzhen, Tsinghua University, Shenzhen 518055, China.,Qian Xuesen Collaborative Research Center of Astrochemistry and Space Life Sciences, Institute of Drug Discovery Technology, Ningbo University, Ningbo 315211, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| |
Collapse
|
39
|
Pharmacokinetic-Pharmacometabolomic Approach in Early-Phase Clinical Trials: A Way Forward for Targeted Therapy in Type 2 Diabetes. Pharmaceutics 2022; 14:pharmaceutics14061268. [PMID: 35745841 PMCID: PMC9231303 DOI: 10.3390/pharmaceutics14061268] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 06/09/2022] [Accepted: 06/10/2022] [Indexed: 12/20/2022] Open
Abstract
Pharmacometabolomics in early phase clinical trials demonstrate the metabolic profiles of a subject responding to a drug treatment in a controlled environment, whereas pharmacokinetics measure the drug plasma concentration in human circulation. Application of the personalized peak plasma concentration from pharmacokinetics in pharmacometabolomic studies provides insights into drugs’ pharmacological effects through dysregulation of metabolic pathways or pharmacodynamic biomarkers. This proof-of-concept study integrates personalized pharmacokinetic and pharmacometabolomic approaches to determine the predictive pharmacodynamic response of human metabolic pathways for type 2 diabetes. In this study, we use metformin as a model drug. Metformin is a first-line glucose-lowering agent; however, the variation of metabolites that potentially affect the efficacy and safety profile remains inconclusive. Seventeen healthy subjects were given a single dose of 1000 mg of metformin under fasting conditions. Fifteen sampling time-points were collected and analyzed using the validated bioanalytical LCMS method for metformin quantification in plasma. The individualized peak-concentration plasma samples determined from the pharmacokinetic parameters calculated using Matlab Simbiology were further analyzed with pre-dose plasma samples using an untargeted metabolomic approach. Pharmacometabolomic data processing and statistical analysis were performed using MetaboAnalyst with a functional meta-analysis peaks-to-pathway approach to identify dysregulated human metabolic pathways. The validated metformin calibration ranged from 80.4 to 2010 ng/mL for accuracy, precision, stability and others. The median and IQR for Cmax was 1248 (849–1391) ng/mL; AUC0-infinity was 9510 (7314–10,411) ng·h/mL, and Tmax was 2.5 (2.5–3.0) h. The individualized Cmax pharmacokinetics guided the untargeted pharmacometabolomics of metformin, suggesting a series of provisional predictive human metabolic pathways, which include arginine and proline metabolism, branched-chain amino acid (BCAA) metabolism, glutathione metabolism and others that are associated with metformin’s pharmacological effects of increasing insulin sensitivity and lipid metabolism. Integration of pharmacokinetic and pharmacometabolomic approaches in early-phase clinical trials may pave a pathway for developing targeted therapy. This could further reduce variability in a controlled trial environment and aid in identifying surrogates for drug response pathways, increasing the prediction of responders for dose selection in phase II clinical trials.
Collapse
|
40
|
Zhang C, Mou M, Zhou Y, Zhang W, Lian X, Shi S, Lu M, Sun H, Li F, Wang Y, Zeng Z, Li Z, Zhang B, Qiu Y, Zhu F, Gao J. Biological activities of drug inactive ingredients. Brief Bioinform 2022; 23:6582006. [PMID: 35524477 DOI: 10.1093/bib/bbac160] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/01/2022] [Accepted: 04/09/2022] [Indexed: 02/06/2023] Open
Abstract
In a drug formulation (DFM), the major components by mass are not Active Pharmaceutical Ingredient (API) but rather Drug Inactive Ingredients (DIGs). DIGs can reach much higher concentrations than that achieved by API, which raises great concerns about their clinical toxicities. Therefore, the biological activities of DIG on physiologically relevant target are widely demanded by both clinical investigation and pharmaceutical industry. However, such activity data are not available in any existing pharmaceutical knowledge base, and their potentials in predicting the DIG-target interaction have not been evaluated yet. In this study, the comprehensive assessment and analysis on the biological activities of DIGs were therefore conducted. First, the largest number of DIGs and DFMs were systematically curated and confirmed based on all drugs approved by US Food and Drug Administration. Second, comprehensive activities for both DIGs and DFMs were provided for the first time to pharmaceutical community. Third, the biological targets of each DIG and formulation were fully referenced to available databases that described their pharmaceutical/biological characteristics. Finally, a variety of popular artificial intelligence techniques were used to assess the predictive potential of DIGs' activity data, which was the first evaluation on the possibility to predict DIG's activity. As the activities of DIGs are critical for current pharmaceutical studies, this work is expected to have significant implications for the future practice of drug discovery and precision medicine.
Collapse
Affiliation(s)
- Chenyang Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Ying Zhou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, 79 QingChun Road, Hangzhou, Zhejiang 310000, China
| | - Wei Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Xichen Lian
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Shuiyang Shi
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Mingkun Lu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yunxia Wang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Zhenyu Zeng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Bing Zhang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Yunqing Qiu
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, 79 QingChun Road, Hangzhou, Zhejiang 310000, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Jianqing Gao
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| |
Collapse
|
41
|
Chen Y, Wang Y, Ding Y, Su X, Wang C. RGCNCDA: Relational graph convolutional network improves circRNA-disease association prediction by incorporating microRNAs. Comput Biol Med 2022; 143:105322. [PMID: 35217342 DOI: 10.1016/j.compbiomed.2022.105322] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 02/11/2022] [Accepted: 02/13/2022] [Indexed: 12/21/2022]
Abstract
Recently, a large number of studies have indicated that circRNAs with covalently closed loops play important roles in biological processes and have potential as diagnostic biomarkers. Therefore, research on the circRNA-disease relationship is helpful in disease diagnosis and treatment. However, traditional biological verification methods require considerable labor and time costs. In this paper, we propose a new computational method (RGCNCDA) to predict circRNA-disease associations based on relational graph convolutional networks (R-GCNs). The method first integrates the circRNA similarity network, miRNA similarity network, disease similarity network and association networks among them to construct a global heterogeneous network. Then, it employs the random walk with restart (RWR) and principal component analysis (PCA) models to learn low-dimensional and high-order information from the global heterogeneous network as the topological features. Finally, a prediction model based on an R-GCN encoder and a DistMult decoder is built to predict the potential disease-associated circRNA. The predicted results demonstrate that RGCNCDA performs significantly better than the other six state-of-the-art methods in a 5-fold cross validation. Furthermore, the case study illustrates that RGCNCDA can effectively discover potential circRNA-disease associations.
Collapse
Affiliation(s)
- Yaojia Chen
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Yanpeng Wang
- Beidahuang Industry Group General Hospital, Harbin, China
| | - Yijie Ding
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Xi Su
- Foshan Maternity & Child Healthcare Hospital, Southern Medical University, Foshan, China.
| | - Chunyu Wang
- Faculty of Computing, Harbin Institute of Technology, Harbin, China.
| |
Collapse
|
42
|
Xia W, Zheng L, Fang J, Li F, Zhou Y, Zeng Z, Zhang B, Li Z, Li H, Zhu F. PFmulDL: a novel strategy enabling multi-class and multi-label protein function annotation by integrating diverse deep learning methods. Comput Biol Med 2022; 145:105465. [PMID: 35366467 DOI: 10.1016/j.compbiomed.2022.105465] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 03/22/2022] [Accepted: 03/25/2022] [Indexed: 02/06/2023]
Abstract
Bioinformatic annotation of protein function is essential but extremely sophisticated, which asks for extensive efforts to develop effective prediction method. However, the existing methods tend to amplify the representativeness of the families with large number of proteins by misclassifying the proteins in the families with small number of proteins. That is to say, the ability of the existing methods to annotate proteins in the 'rare classes' remains limited. Herein, a new protein function annotation strategy, PFmulDL, integrating multiple deep learning methods, was thus constructed. First, the recurrent neural network was integrated, for the first time, with the convolutional neural network to facilitate the function annotation. Second, a transfer learning method was introduced to the model construction for further improving the prediction performances. Third, based on the latest data of Gene Ontology, the newly constructed model could annotate the largest number of protein families comparing with the existing methods. Finally, this newly constructed model was found capable of significantly elevating the prediction performance for the 'rare classes' without sacrificing that for the 'major classes'. All in all, due to the emerging requirements on improving the prediction performance for the proteins in 'rare classes', this new strategy would become an essential complement to the existing methods for protein function prediction. All the models and source codes are freely available and open to all users at: https://github.com/idrblab/PFmulDL.
Collapse
Affiliation(s)
- Weiqi Xia
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Lingyan Zheng
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Jiebin Fang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Ying Zhou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Zhenyu Zeng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Bing Zhang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Honglin Li
- School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou, 310058, China; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China.
| |
Collapse
|
43
|
Wang Y, Zhu C, Wang Y, Sun J, Ling D, Wang L. Survival risk prediction model for ESCC based on relief feature selection and CNN. Comput Biol Med 2022; 145:105460. [PMID: 35364307 DOI: 10.1016/j.compbiomed.2022.105460] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 03/23/2022] [Accepted: 03/24/2022] [Indexed: 01/10/2023]
Abstract
Esophageal squamous cell carcinoma (ESCC) is a common malignant tumor of the digestive system with poor prognosis and high mortality. It is of great significance to predict the prognosis risk of patients with cancer by using medical pathology information. To take full advantage of the clinic pathological information of ESCC patients and improve the accuracy of postoperative survival risk prediction, this paper proposes an ESCC survival risk prediction model based on Relief feature selection and convolutional neural network (CNN). Firstly, statistical analysis methods and relief feature selection algorithm are used to extract the important risk factors related to the survival risk of patients. Then, One-dimensional convolutional neural network (1D-CNN) is used to establish the survival risk prediction model of patients with esophageal cancer. Finally, the data of patients with esophageal cancer provided by the First Affiliated Hospital of Zhengzhou University is used to assess the performance of the model. The results show that the model proposed in this paper has a high accuracy rate, which can effectively predict the postoperative survival risk of the patient through the clinical phenotypic index of the patient.
Collapse
Affiliation(s)
- Yanfeng Wang
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, 45002, China
| | - Chuanqian Zhu
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, 45002, China
| | - Yan Wang
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, 45002, China.
| | - Junwei Sun
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, 45002, China
| | - Dan Ling
- School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, 45002, China
| | - Lidong Wang
- State Key Laboratory of Esophageal Cancer Prevention, Treatment and Henan Key Laboratory for Esophageal Cancer Research of The First Affiliated Hospital, Zhengzhou University, Zhengzhou, 450052, China
| |
Collapse
|
44
|
A novel mTOR-associated gene signature for predicting prognosis and evaluating tumor immune microenvironment in lung adenocarcinoma. Comput Biol Med 2022; 145:105394. [PMID: 35325730 DOI: 10.1016/j.compbiomed.2022.105394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 03/08/2022] [Accepted: 03/08/2022] [Indexed: 11/24/2022]
Abstract
BACKGROUND The mechanistic target of rapamycin (mTOR) was proven to have great impact on apoptosis, cell proliferation, autophagy, and many other fundamental cellular processes; moreover, it closely correlates with tumor occurrence and development. However, few studies have constructed signatures based on mTOR-associated genes to assess multiple indicators of prognosis in lung adenocarcinoma (LUAD) patients. METHODS mTOR-associated gene sets, whole mRNA expression matrices, and clinical information of LUAD patients in training and validation cohorts were obtained from multiple public databases. Multiple methods were used to screen candidate genes, construct signatures, validate internally and externally, and conduct further studies: differentially expressed gene analysis, LASSO Cox regression analysis, Cox regression analysis, risk factor analysis, nomogram analysis, functional enrichment analysis, analyses in tumor immune microenvironment, and therapy. RESULTS A prognostic signature containing 8 genes (LDHA, SLA, WNT7A, PLK1, CCT6A, BTG2, TXNRD1, and DDIT4) was constructed. It performed well in both internal and external validation. Subsequent analysis found that the prognostic signature was of great significance in evaluating the tumor immune microenvironment and could guide the treatment of patients with LUAD to a certain extent. CONCLUSION The constructed mTOR-associated gene signature accurately predicted the prognostic pattern of patients with LUAD and is expected to be extremely useful in guiding LUAD therapy.
Collapse
|
45
|
Li F, Zhou Y, Zhang Y, Yin J, Qiu Y, Gao J, Zhu F. POSREG: proteomic signature discovered by simultaneously optimizing its reproducibility and generalizability. Brief Bioinform 2022; 23:6532538. [PMID: 35183059 DOI: 10.1093/bib/bbac040] [Citation(s) in RCA: 69] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 01/21/2022] [Accepted: 01/27/2022] [Indexed: 12/17/2022] Open
Abstract
Mass spectrometry-based proteomic technique has become indispensable in current exploration of complex and dynamic biological processes. Instrument development has largely ensured the effective production of proteomic data, which necessitates commensurate advances in statistical framework to discover the optimal proteomic signature. Current framework mainly emphasizes the generalizability of the identified signature in predicting the independent data but neglects the reproducibility among signatures identified from independently repeated trials on different sub-dataset. These problems seriously restricted the wide application of the proteomic technique in molecular biology and other related directions. Thus, it is crucial to enable the generalizable and reproducible discovery of the proteomic signature with the subsequent indication of phenotype association. However, no such tool has been developed and available yet. Herein, an online tool, POSREG, was therefore constructed to identify the optimal signature for a set of proteomic data. It works by (i) identifying the proteomic signature of good reproducibility and aggregating them to ensemble feature ranking by ensemble learning, (ii) assessing the generalizability of ensemble feature ranking to acquire the optimal signature and (iii) indicating the phenotype association of discovered signature. POSREG is unique in its capacity of discovering the proteomic signature by simultaneously optimizing its reproducibility and generalizability. It is now accessible free of charge without any registration or login requirement at https://idrblab.org/posreg/.
Collapse
Affiliation(s)
- Fengcheng Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ying Zhou
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou, Zhejiang 310000, China
| | - Ying Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Jiayi Yin
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yunqing Qiu
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou, Zhejiang 310000, China
| | - Jianqing Gao
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
46
|
Xue W, Fu T, Deng S, Yang F, Yang J, Zhu F. Molecular Mechanism for the Allosteric Inhibition of the Human Serotonin Transporter by Antidepressant Escitalopram. ACS Chem Neurosci 2022; 13:340-351. [PMID: 35041375 DOI: 10.1021/acschemneuro.1c00694] [Citation(s) in RCA: 67] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Human serotine transporter (hSERT) is one of the most influential drug targets, and its allosteric modulators (e.g., escitalopram) have emerged to be the next-generation medication for psychiatric disorders. However, the molecular mechanism underlying the allosteric modulation of hSERT is still elusive. Here, the simulation strategies of conventional (cMD) and steered (SMD) molecular dynamics were applied to investigate this molecular mechanism from distinct perspectives. First, cMD simulations revealed that escitalopram's binding to hSERT's allosteric site simultaneously enhanced its binding to the orthosteric site. Then, SMD simulation identified that the occupation of hSERT's allosteric site by escitalopram could also block its dissociation from the orthosteric site. Finally, by comparing the simulated structures of two hSERT-escitalopram complexes with and without allosteric modulation, a new conformational coupling between an extracellular (Arg104-Glu494) and an intracellular (Lys490-Glu494) salt bridge was identified. In summary, this study explored the mechanism underlying the allosteric modulation of hSERT by collectively applying two MD simulation strategies, which could facilitate our understanding of the allosteric modulations of not only hSERT but also other clinically important therapeutic targets.
Collapse
Affiliation(s)
- Weiwei Xue
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, Innovative Drug Research Center, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
- Central Nervous System Drug Key Laboratory of Sichuan Province, Luzhou 646000, China
| | - Tingting Fu
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, Innovative Drug Research Center, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Shengzhe Deng
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, Innovative Drug Research Center, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Fengyuan Yang
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, Innovative Drug Research Center, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Jingyi Yang
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, Innovative Drug Research Center, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Feng Zhu
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, Innovative Drug Research Center, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| |
Collapse
|