1
|
Tao Y, Li M, Liu C. Inference and analysis of cell-cell communication of non-myeloid circulating cells in late sepsis based on single-cell RNA-seq. IET Syst Biol 2024; 18:218-226. [PMID: 39578684 DOI: 10.1049/syb2.12109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2024] [Revised: 10/08/2024] [Accepted: 10/27/2024] [Indexed: 11/24/2024] Open
Abstract
Sepsis is a severe systemic inflammatory syndrome triggered by infection and is a leading cause of morbidity and mortality in intensive care units (ICUs). Immune dysfunction is a hallmark of sepsis. In this study, the authors investigated cell-cell communication among lymphoid-derived leucocytes using single-cell RNA sequencing (scRNA-seq) to gain a deeper understanding of the underlying mechanisms in late-stage sepsis. The authors' findings revealed that both the number and strength of cellular interactions were elevated in septic patients compared to healthy individuals, with several pathways showing significant alterations, particularly in conventional dendritic cells (cDCs) and plasmacytoid dendritic cells (pDCs). Notably, pathways such as CD6-ALCAM were more activated in sepsis, potentially due to T cell suppression. This study offers new insights into the mechanisms of immunosuppression and provides potential avenues for clinical intervention in sepsis.
Collapse
Affiliation(s)
- Yanyan Tao
- Department of Emergency Medicine, The First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
- Institute of Critical Care Medicine, The First Affiliated Hospital of Bengbu Medical College, Bengbu, Anhui, China
| | - Miaomiao Li
- Department of Critical Care Medicine, The First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
| | - Cheng Liu
- Institute of Critical Care Medicine, The First Affiliated Hospital of Bengbu Medical College, Bengbu, Anhui, China
- Department of Critical Care Medicine, The First Affiliated Hospital of Bengbu Medical University, Bengbu, Anhui, China
| |
Collapse
|
2
|
Ren J, Guo Z, Qi Y, Zhang Z, Liu L. Prediction of YY1 loop anchor based on multi-omics features. Methods 2024; 232:96-106. [PMID: 39521361 DOI: 10.1016/j.ymeth.2024.11.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 10/22/2024] [Accepted: 11/06/2024] [Indexed: 11/16/2024] Open
Abstract
The three-dimensional structure of chromatin is crucial for the regulation of gene expression. YY1 promotes enhancer-promoter interactions in a manner analogous to CTCF-mediated chromatin interactions. However, little is known about which YY1 binding sites can form loop anchors. In this study, the LightGBM model was used to predict YY1-loop anchors by integrating multi-omics data. Due to the large imbalance in the number of positive and negative samples, we use AUPRC to reflect the quality of the classifier. The results show that the LightGBM model exhibits strong predictive performance (AUPRC≥0.93). To verify the robustness of the model, the dataset was divided into training and test sets at a 4:1 ratio. The results show that the model performs well for YY1-loop anchor prediction on both the training and independent test sets. Additionally, we ranked the importance of the features and found that the formation of YY1-loop anchors is primarily influenced by the co-binding of transcription factors CTCF, SMC3, and RAD21, as well as histone modifications and sequence context.
Collapse
Affiliation(s)
- Jun Ren
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China; School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Zhiling Guo
- Beidahuang Industry Group General Hospital, Harbin, China
| | - Yixuan Qi
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China; School of Mathematics and Statistics, Hainan Normal University, Haikou, China; School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zheng Zhang
- Computer Science and Information Systems, Murray State University, Murray, USA
| | - Li Liu
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.
| |
Collapse
|
3
|
Fekry B, Ugartemendia L, Esnaola NF, Goetzl L. Extracellular Vesicles, Circadian Rhythms, and Cancer: A Comprehensive Review with Emphasis on Hepatocellular Carcinoma. Cancers (Basel) 2024; 16:2552. [PMID: 39061191 PMCID: PMC11274441 DOI: 10.3390/cancers16142552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 07/12/2024] [Accepted: 07/14/2024] [Indexed: 07/28/2024] Open
Abstract
This review comprehensively explores the complex interplay between extracellular vesicles (ECVs)/exosomes and circadian rhythms, with a focus on the role of this interaction in hepatocellular carcinoma (HCC). Exosomes are nanovesicles derived from cells that facilitate intercellular communication by transporting bioactive molecules such as proteins, lipids, and RNA/DNA species. ECVs are implicated in a range of diseases, where they play crucial roles in signaling between cells and their surrounding environment. In the setting of cancer, ECVs are known to influence cancer initiation and progression. The scope of this review extends to all cancer types, synthesizing existing knowledge on the various roles of ECVs. A unique aspect of this review is the emphasis on the circadian-controlled release and composition of exosomes, highlighting their potential as biomarkers for early cancer detection and monitoring metastasis. We also discuss how circadian rhythms affect multiple cancer-related pathways, proposing that disruptions in the circadian clock can alter tumor development and treatment response. Additionally, this review delves into the influence of circadian clock components on ECV biogenesis and their impact on reshaping the tumor microenvironment, a key component driving HCC progression. Finally, we address the potential clinical applications of ECVs, particularly their use as diagnostic tools and drug delivery vehicles, while considering the challenges associated with clinical implementation.
Collapse
Affiliation(s)
- Baharan Fekry
- McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA; (L.U.); (L.G.)
| | - Lierni Ugartemendia
- McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA; (L.U.); (L.G.)
| | - Nestor F. Esnaola
- Division of Surgical Oncology and Gastrointestinal Surgery, Department of Surgery, Houston Methodist Hospital, Houston, TX 77030, USA;
| | - Laura Goetzl
- McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA; (L.U.); (L.G.)
| |
Collapse
|
4
|
Yin KL, Sun T, Duan YX, Ye WT, Ming Li, Liao R. Nomograms incorporating hsa_circ_0029325 highly expressed in exosomes of hepatocellular carcinoma predict the postoperative outcomes. Discov Oncol 2024; 15:212. [PMID: 38836972 PMCID: PMC11153441 DOI: 10.1007/s12672-024-01060-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 05/27/2024] [Indexed: 06/06/2024] Open
Abstract
BACKGROUND Liquid biopsies, for example, exosomal circular RNA (circRNA) can be used to assess potential predictive markers for hepatocellular carcinoma (HCC) in patients after curative resection. This study aimed to search for effective prognostic biomarkers for HCC in patients after surgical resection based on exosomal circRNA expression profiles. We developed two nomograms incorporating circRNAs to predict the postoperative recurrence-free survival (RFS) and overall survival (OS) of HCC patients. METHOD Plasma exosomes isolated from HCC patients and healthy individuals were used for circRNA microarray analysis to explore differentially expressed circRNAs. Pearson correlation analysis was used to evaluate the correlation between circRNAs and clinicopathological features. Cox regression analysis was used to explore the correlation between circRNA and postoperative survival time as well as recurrence time. A nomogram based on circRNA and clinicopathological characteristics was established and further evaluated to predict prognosis and recurrence. RESULT Among 60 significantly upregulated circRNAs and 25 downregulated circRNAs, hsa_circ_0029325 was selected to verify its power for predicting HCC outcomes. The high expression level of exosomal hsa_circ_0029325 was significantly correlated with OS (P = 0.001, HR = 2.04, 95% CI 1.41-3.32) and RFS (P = 0.009, HR = 1.62, 95% CI 1.14-2.30). Among 273 HCC patients, multivariate regression analysis showed that hsa_circ_0029325 (HR = 1.96, 95% CI 1.21-3.18), tumor size (HR = 2.11, 95% CI 1.33-3.32), clinical staging (HR = 2.31, 95% CI 1.54-3.48), and tumor thrombus (HR = 1.74, 95% CI 1.12-2.7) were independent risk factors for poor prognosis in HCC patients after radical resection. These independent predictors of prognosis were incorporated into the two nomograms. The AUCs under the 1-year, 3-year, and 5-year survival and recurrence curves of the OS and RFS nomograms were 0.755, 0.749, and 0.742 and 0.702, 0.685, and 0.642, respectively. The C-index, calibration curves, and clinical decision curves showed that the two prediction models had good predictive performance. These results were verified in the validation cohort with 90 HCC patients. CONCLUSION Our study established two reliable nomograms for predicting recurrence and prognosis in HCC patients. We also show that it is feasible to screen potential predictive markers for HCC after curative resection through exosomal circRNA expression profile analysis.
Collapse
Affiliation(s)
- Kun-Li Yin
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Chongqing Medical University, No. 1 Youyi Rd, Chongqing, 400016, China
| | - Taiwei Sun
- Department of Radiation Oncology, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Yu-Xin Duan
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Chongqing Medical University, No. 1 Youyi Rd, Chongqing, 400016, China
| | - Wen-Tao Ye
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Chongqing Medical University, No. 1 Youyi Rd, Chongqing, 400016, China
| | - Ming Li
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Chongqing Medical University, No. 1 Youyi Rd, Chongqing, 400016, China.
| | - Rui Liao
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Chongqing Medical University, No. 1 Youyi Rd, Chongqing, 400016, China.
| |
Collapse
|
5
|
He T, Gao Z, Lin L, Zhang X, Zou Q. Prognostic signature analysis and survival prediction of esophageal cancer based on N6-methyladenosine associated lncRNAs. Brief Funct Genomics 2024; 23:239-248. [PMID: 37465899 DOI: 10.1093/bfgp/elad028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 06/27/2023] [Accepted: 07/04/2023] [Indexed: 07/20/2023] Open
Abstract
Esophageal cancer (ESCA) has a bad prognosis. Long non-coding RNA (lncRNA) impacts on cell proliferation. However, the prognosis function of N6-methyladenosine (m6A)-associated lncRNAs (m6A-lncRNAs) in ESCA remains unknown. Univariate Cox analysis was applied to investigate prognosis related m6A-lncRNAs, based on which the samples were clustered. Wilcoxon rank and Chi-square tests were adopted to compare the clinical traits, survival, pathway activity and immune infiltration in different clusters where overall survival, clinical traits (N stage), tumor-invasive immune cells and pathway activity were found significantly different. Through least absolute shrinkage and selection operator and proportional hazard (Lasso-Cox) model, five m6A-lncRNAs were selected to construct the prognostic signature (m6A-lncSig) and risk score. To investigate the link between risk score and clinical traits or immunological microenvironments, Chi-square test and Spearman correlation analysis were utilized. Risk score was found connected with N stage, tumor stage, different clusters, macrophages M2, B cells naive and T cells CD4 memory resting. Risk score and tumor stage were found as independent prognostic variables. And the constructed nomogram model had high accuracy in predicting prognosis. The obtained m6A-lncSig could be taken as potential prognostic biomarker for ESCA patients. This study offers a theoretical foundation for clinical diagnosis and prognosis of ESCA.
Collapse
Affiliation(s)
- Ting He
- School of Mathematics and Statistics, Southwest University, Chongqing 400715, China
| | - Zhipeng Gao
- Beidahuang Industry Group General Hospital, Harbin 150000, China
| | - Ling Lin
- Yucai School Attached to Sichuan Chengdu No. 7 High School, Chengdu 610503, China
| | - Xu Zhang
- School of Mathematics and Statistics, Southwest University, Chongqing 400715, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 611730, China
- School of Mathematics and Statistics, Southwest University, Chongqing 400715, China
| |
Collapse
|
6
|
Bao Y, Xu S, Zhou J, Zhao C, Dai S, Zhang Y, Rao M. Exosomal miR-93 derived from hepatocellular carcinoma cell promotes the sorafenib resistance of hepatocellular carcinoma through PTEN/PI3K/Akt pathway. J Biochem Mol Toxicol 2024; 38:e23666. [PMID: 38375688 DOI: 10.1002/jbt.23666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 01/26/2024] [Accepted: 01/31/2024] [Indexed: 02/21/2024]
Abstract
Exosomal microRNAs (miRNAs) derived from cancer cell is an important regulatory molecule that mediates the formation of tumor drug resistance, but function and mechanisms of exosomal miRNA in sorafenib resistance of hepatocellular carcinoma (HCC) have not been studied. We detected the level and prognosis of miR-93 in HCC by using TCGA HCC database. For confirming the extracted exosome, transmission electron microscopy was used. Cy3-labeled miR-93 and quantitative reverse transcription-polymerase chain reaction were used to prove that exosomal miR-93 derived from HCC cell can be transferred to sensitive HCC cells. CCK8, EdU, and flow cytometer assay were used to confirm the function of exosomal miR-93 in sorafenib resistance of HCC. Bioinformatics software and luciferase reporter assay was used to confirm the direct targeting relationship between PTEN and miR-93. Western blot was used to validate downstream pathways. We found that miR-93 is overexpressed and a prognostic risk factor for the HCC patients. miR-93 was overexpressed in sorafenib resistant HCC cells compared with sensitive cells, and miR-93 contributed to sorafenib resistance of HCC cells through targeting PTEN. miR-93 was enriched in exosomes that secreted from sorafenib resistant cells, and these exosomal miR-93 promote the spread of sorafenib resistant through targeting PTEN to reactivate PI3K/AKT pathway. Therefore, miR-93 can act as a potential therapeutic target for advanced patients with acquired sorafenib resistance.
Collapse
Affiliation(s)
- Yuanpeng Bao
- Department of Hepatobiliary Surgery, Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu, China
| | - Song Xu
- Department of Hepatobiliary Surgery, Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu, China
| | - Junjing Zhou
- Department of Hepatobiliary Surgery, Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu, China
| | - Chongyong Zhao
- Department of Hepatobiliary Surgery, Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu, China
| | - Saimin Dai
- Department of Hepatobiliary Surgery, Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu, China
| | - Yong Zhang
- Department of Hepatobiliary Surgery, Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu, China
| | - Min Rao
- Department of Hepatobiliary Surgery, Affiliated Hospital of Jiangnan University, Wuxi, Jiangsu, China
| |
Collapse
|
7
|
Ferro A, Saccu G, Mattivi S, Gaido A, Herrera Sanchez MB, Haque S, Silengo L, Altruda F, Durazzo M, Fagoonee S. Extracellular Vesicles as Delivery Vehicles for Non-Coding RNAs: Potential Biomarkers for Chronic Liver Diseases. Biomolecules 2024; 14:277. [PMID: 38540698 PMCID: PMC10967855 DOI: 10.3390/biom14030277] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/15/2024] [Accepted: 02/21/2024] [Indexed: 01/03/2025] Open
Abstract
In recent years, EVs have emerged as promising vehicles for coding and non-coding RNAs (ncRNAs), which have demonstrated remarkable potential as biomarkers for various diseases, including chronic liver diseases (CLDs). EVs are small, membrane-bound particles released by cells, carrying an arsenal of ncRNAs, including microRNAs (miRNAs), long non-coding RNAs (lncRNAs), and other ncRNA species, such as piRNAs, circRNAs, and tsRNAs. These ncRNAs act as key regulators of gene expression, splicing, and translation, providing a comprehensive molecular snapshot of the cells of origin. The non-invasive nature of EV sampling, typically via blood or serum collection, makes them highly attractive candidates for clinical biomarker applications. Moreover, EV-encapsulated ncRNAs offer unique advantages over traditional cell-free ncRNAs due to their enhanced stability within the EVs, hence allowing for their detection in circulation for extended periods and enabling more sensitive and reliable biomarker measurements. Numerous studies have investigated the potential of EV-enclosed ncRNAs as biomarkers for CLD. MiRNAs, in particular, have gained significant attention due to their ability to rapidly respond to changes in cellular stress and inflammation, hallmarks of CLD pathogenesis. Elevated levels of specific miRNAs have been consistently associated with various CLD subtypes, including metabolic dysfunction-associated steatotic liver disease (MASLD), metabolic dysfunction-associated steatohepatitis (MASH), and chronic hepatitis B and C. LncRNAs have also emerged as promising biomarkers for CLD. These transcripts are involved in a wide range of cellular processes, including liver regeneration, fibrosis, and cancer progression. Studies have shown that lncRNA expression profiles can distinguish between different CLD subtypes, providing valuable insights into disease progression and therapeutic response. Promising EV-enclosed ncRNA biomarkers for CLD included miR-122 (elevated levels of miR-122 are associated with MASLD progression and liver fibrosis), miR-21 (increased expression of miR-21 is linked to liver inflammation and fibrosis in CLD patients), miR-192 (elevated levels of miR-192 are associated with more advanced stages of CLD, including cirrhosis and HCC), LncRNA HOTAIR (increased HOTAIR expression is associated with MASLD progression and MASH development), and LncRNA H19 (dysregulation of H19 expression is linked to liver fibrosis and HCC progression). In the present review, we focus on the EV-enclosed ncRNAs as promising tools for the diagnosis and monitoring of CLD of various etiologies.
Collapse
Affiliation(s)
- Arianna Ferro
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy; (A.F.); (G.S.); (S.M.); (A.G.); (M.D.)
| | - Gabriele Saccu
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy; (A.F.); (G.S.); (S.M.); (A.G.); (M.D.)
| | - Simone Mattivi
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy; (A.F.); (G.S.); (S.M.); (A.G.); (M.D.)
| | - Andrea Gaido
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy; (A.F.); (G.S.); (S.M.); (A.G.); (M.D.)
| | - Maria Beatriz Herrera Sanchez
- 2i3T, Società per la Gestione Dell’incubatore di Imprese e per il Trasferimento Tecnologico, University of Torino, 10126 Turin, Italy;
- Molecular Biotechnology Centre “Guido Tarone”, 10126 Turin, Italy; (L.S.); (F.A.)
| | - Shafiul Haque
- Research and Scientific Studies Unit, College of Nursing and Health Sciences, Jazan University, Jazan 45142, Saudi Arabia;
- Centre of Medical and Bio-Allied Health Sciences Research, Ajman University, Ajman 13306, United Arab Emirates
- Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Beirut 1102 2801, Lebanon
| | - Lorenzo Silengo
- Molecular Biotechnology Centre “Guido Tarone”, 10126 Turin, Italy; (L.S.); (F.A.)
| | - Fiorella Altruda
- Molecular Biotechnology Centre “Guido Tarone”, 10126 Turin, Italy; (L.S.); (F.A.)
| | - Marilena Durazzo
- Department of Medical Sciences, University of Turin, 10126 Turin, Italy; (A.F.); (G.S.); (S.M.); (A.G.); (M.D.)
| | - Sharmila Fagoonee
- Institute for Biostructure and Bioimaging, National Research Council (CNR), Molecular Biotechnology Centre “Guido Tarone”, 10126 Turin, Italy
| |
Collapse
|
8
|
Jia J, Wu G, Li M. iGly-IDN: Identifying Lysine Glycation Sites in Proteins Based on Improved DenseNet. J Comput Biol 2024; 31:161-174. [PMID: 38016151 DOI: 10.1089/cmb.2023.0112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2023] Open
Abstract
Lysine glycation is one of the most significant protein post-translational modifications, which changes the properties of the proteins and causes them to be dysfunctional. Accurately identifying glycation sites helps to understand the biological function and potential mechanism of glycation in disease treatments. Nonetheless, the experimental methods are ordinarily inefficient and costly, so effective computational methods need to be developed. In this study, we proposed the new model called iGly-IDN based on the improved densely connected convolutional networks (DenseNet). First, one hot encoding was adopted to obtain the original feature maps. Afterward, the improved DenseNet was adopted to capture feature information with the importance degrees during the feature learning. According to the experimental results, Acc reaches 66%, and Mathews correlation coefficient reaches 0.33 on the independent testing data set, which indicates that the iGly-IDN can provide more effective glycation site identification than the current predictors.
Collapse
Affiliation(s)
- Jianhua Jia
- School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen, China
| | - Genqiang Wu
- School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen, China
- College of Modern Economics and Management, Jiangxi University of Finance and Economics, Nanchang, China
| | - Meifang Li
- School of Computer Information Engineering, Nanchang Institute of Technology, Nanchang, China
| |
Collapse
|
9
|
Zhou M, He X, Mei C, Ou C. Exosome derived from tumor-associated macrophages: biogenesis, functions, and therapeutic implications in human cancers. Biomark Res 2023; 11:100. [PMID: 37981718 PMCID: PMC10658727 DOI: 10.1186/s40364-023-00538-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Accepted: 11/05/2023] [Indexed: 11/21/2023] Open
Abstract
Tumor-associated macrophages (TAMs), one of the most abundant immune cell types in the tumor microenvironment (TME), account for approximately 50% of the local hematopoietic cells. TAMs play an important role in tumorigenesis and tumor development through crosstalk between various immune cells and cytokines in the TME. Exosomes are small extracellular vesicles with a diameter of 50-150 nm, that can transfer biological information (e.g., proteins, nucleic acids, and lipids) from secretory cells to recipient cells through the circulatory system, thereby influencing the progression of various human diseases, including cancer. Recent studies have suggested that TAMs-derived exosomes play crucial roles in malignant cell proliferation, invasion, metastasis, angiogenesis, immune responses, drug resistance, and tumor metabolic reprogramming. TAMs-derived exosomes have the potential to be targeted for tumor therapy. In addition, the abnormal expression of non-coding RNAs and proteins in TAMs-derived exosomes is closely related to the clinicopathological features of patients with cancer, and these exosomes are expected to become new liquid biopsy markers for the early diagnosis, prognosis, and monitoring of tumors. In this review, we explored the role of TAMs-derived exosomes in tumorigenesis to provide new diagnostic biomarkers and therapeutic targets for cancer prevention.
Collapse
Affiliation(s)
- Manli Zhou
- Department of Pathology, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
| | - Xiaoyun He
- Departments of Ultrasound Imaging, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China
| | - Cheng Mei
- Department of Blood Transfusion, Xiangya Hospital, Clinical Transfusion Research Center, Central South University, Changsha, 410008, Hunan, China.
| | - Chunlin Ou
- Department of Pathology, Xiangya Hospital, Central South University, Changsha, 410008, Hunan, China.
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China.
| |
Collapse
|
10
|
Bai T, Liu B. ncRNALocate-EL: a multi-label ncRNA subcellular locality prediction model based on ensemble learning. Brief Funct Genomics 2023; 22:442-452. [PMID: 37122147 DOI: 10.1093/bfgp/elad007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 12/31/2022] [Accepted: 01/31/2023] [Indexed: 05/02/2023] Open
Abstract
Subcellular localizations of ncRNAs are associated with specific functions. Currently, an increasing number of biological researchers are focusing on computational approaches to identify subcellular localizations of ncRNAs. However, the performance of the existing computational methods is low and needs to be further studied. First, most prediction models are trained with outdated databases. Second, only a few predictors can identify multiple subcellular localizations simultaneously. In this work, we establish three human ncRNA subcellular datasets based on the latest RNALocate, including lncRNA, miRNA and snoRNA, and then we propose a novel multi-label classification model based on ensemble learning called ncRNALocate-EL to identify multi-label subcellular localizations of three ncRNAs. The results show that the ncRNALocate-EL outperforms previous methods. Our method achieved an average precision of 0.709,0.977 and 0.730 on three human ncRNA datasets. The web server of ncRNALocate-EL has been established, which can be accessed at https://bliulab.net/ncRNALocate-EL.
Collapse
|
11
|
Zeng Y, Hu S, Luo Y, He K. Exosome Cargos as Biomarkers for Diagnosis and Prognosis of Hepatocellular Carcinoma. Pharmaceutics 2023; 15:2365. [PMID: 37765333 PMCID: PMC10537613 DOI: 10.3390/pharmaceutics15092365] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 09/14/2023] [Accepted: 09/18/2023] [Indexed: 09/29/2023] Open
Abstract
Hepatocellular carcinoma (HCC) is one of the most common cancers worldwide. Due to the insidiousness of HCC onset and the lack of specific early-stage markers, the early diagnosis and treatment of HCC are still unsatisfactory, leading to a poor prognosis. Exosomes are a type of extracellular vesicle containing various components, which play an essential part in the development, progression, and metastasis of HCC. A large number of studies have demonstrated that exosomes could serve as novel biomarkers for the diagnosis of HCC. These diagnostic components mainly include proteins, microRNAs, long noncoding RNAs, and circular RNAs. The exosome biomarkers showed high sensitivity and high specificity in distinguishing HCC from health controls and other liver diseases, such as chronic HBV and liver cirrhosis. The expression of these biomarkers also exhibits correlations with various clinical factors such as tumor size, TMN stage, overall survival, and recurrence rate. In this review, we summarize the function of exosomes in the development of HCC and highlight their application as HCC biomarkers for diagnosis and prognosis prediction.
Collapse
Affiliation(s)
- Yulai Zeng
- Department of Liver Surgery, School of Medicine, Renji Hospital, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.Z.); (S.H.)
- Shanghai Engineering Research Center of Transplantation and Immunology, Shanghai 200127, China
- Shanghai Institute of Transplantation, Shanghai 200127, China
| | - Shuyu Hu
- Department of Liver Surgery, School of Medicine, Renji Hospital, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.Z.); (S.H.)
- Shanghai Engineering Research Center of Transplantation and Immunology, Shanghai 200127, China
- Shanghai Institute of Transplantation, Shanghai 200127, China
| | - Yi Luo
- Department of Liver Surgery, School of Medicine, Renji Hospital, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.Z.); (S.H.)
- Shanghai Engineering Research Center of Transplantation and Immunology, Shanghai 200127, China
- Shanghai Institute of Transplantation, Shanghai 200127, China
| | - Kang He
- Department of Liver Surgery, School of Medicine, Renji Hospital, Shanghai Jiao Tong University, Shanghai 200127, China; (Y.Z.); (S.H.)
- Shanghai Engineering Research Center of Transplantation and Immunology, Shanghai 200127, China
- Shanghai Institute of Transplantation, Shanghai 200127, China
| |
Collapse
|
12
|
Momanyi BM, Zulfiqar H, Grace-Mercure BK, Ahmed Z, Ding H, Gao H, Liu F. CFNCM: Collaborative filtering neighborhood-based model for predicting miRNA-disease associations. Comput Biol Med 2023; 163:107165. [PMID: 37315383 DOI: 10.1016/j.compbiomed.2023.107165] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 05/31/2023] [Accepted: 06/08/2023] [Indexed: 06/16/2023]
Abstract
MicroRNAs have a significant role in the emergence of various human disorders. Consequently, it is essential to understand the existing interactions between miRNAs and diseases, as this will help scientists better study and comprehend the diseases' biological mechanisms. Findings can be employed as biomarkers or drug targets to advance the detection, diagnosis, and treatment of complex human disorders by foretelling possible disease-related miRNAs. This study proposed a computational model for predicting potential miRNA-disease associations called the Collaborative Filtering Neighborhood-based Classification Model (CFNCM), in light of the shortcomings of conventional and biological experiments, which are expensive and time-consuming. The model generated integrated miRNA and disease similarity matrices using the validated associations and miRNA and disease similarity information and used them as the input features for CFNCM. To produce class labels, we first determined the association scores for brand-new pairs using user-based collaborative filtering. With zero as the threshold, the associations with scores >0 were labelled 1, indicating a potential positive association, otherwise, it is marked as 0. Then, we developed classification models using various machine-learning algorithms. By comparison, we discovered that the support vector machine (SVM) produced the best AUC of 0.96 with 10-fold cross-validation through the GridSearchCV technique for identifying optimal parameter values. In addition, the models were evaluated and verified by analyzing the top 50 breast and lung neoplasms-related miRNAs, of which 46 and 47 associations were verified in two authoritative databases, dbDEMC and miR2Disease.
Collapse
Affiliation(s)
- Biffon Manyura Momanyi
- School of Computer Science and Engineering, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hasan Zulfiqar
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China; Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, Zhejiang, 313001, China
| | - Bakanina Kissanga Grace-Mercure
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Zahoor Ahmed
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China; Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, Zhejiang, 313001, China
| | - Hui Ding
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China.
| | - Hui Gao
- School of Computer Science and Engineering, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
| | - Fen Liu
- Department of Radiation Oncology, Peking University Cancer Hospital (Inner Mongolia Campus), Affiliated Cancer Hospital of Inner Mongolia Medical University, Inner Mongolia Cancer Hospital, Hohhot, China.
| |
Collapse
|
13
|
Zhu W, Yuan SS, Li J, Huang CB, Lin H, Liao B. A First Computational Frame for Recognizing Heparin-Binding Protein. Diagnostics (Basel) 2023; 13:2465. [PMID: 37510209 PMCID: PMC10377868 DOI: 10.3390/diagnostics13142465] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 07/13/2023] [Accepted: 07/21/2023] [Indexed: 07/30/2023] Open
Abstract
Heparin-binding protein (HBP) is a cationic antibacterial protein derived from multinuclear neutrophils and an important biomarker of infectious diseases. The correct identification of HBP is of great significance to the study of infectious diseases. This work provides the first HBP recognition framework based on machine learning to accurately identify HBP. By using four sequence descriptors, HBP and non-HBP samples were represented by discrete numbers. By inputting these features into a support vector machine (SVM) and random forest (RF) algorithm and comparing the prediction performances of these methods on training data and independent test data, it is found that the SVM-based classifier has the greatest potential to identify HBP. The model could produce an auROC of 0.981 ± 0.028 on training data using 10-fold cross-validation and an overall accuracy of 95.0% on independent test data. As the first model for HBP recognition, it will provide some help for infectious diseases and stimulate further research in related fields.
Collapse
Affiliation(s)
- Wen Zhu
- Key Laboratory of Computational Science and Application of Hainan Province, Haikou 571158, China
- Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou 571158, China
- School of Mathematics and Statistics, Hainan Normal University, Haikou 571158, China
| | - Shi-Shi Yuan
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Jian Li
- School of Basic Medical Sciences, Chengdu University, Chengdu 610106, China
| | - Cheng-Bing Huang
- School of Computer Science and Technology, ABa Teachers University, Chengdu 623002, China
| | - Hao Lin
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Bo Liao
- Key Laboratory of Computational Science and Application of Hainan Province, Haikou 571158, China
- Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou 571158, China
- School of Mathematics and Statistics, Hainan Normal University, Haikou 571158, China
| |
Collapse
|
14
|
Phan LT, Oh C, He T, Manavalan B. A comprehensive revisit of the machine-learning tools developed for the identification of enhancers in the human genome. Proteomics 2023; 23:e2200409. [PMID: 37021401 DOI: 10.1002/pmic.202200409] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 03/18/2023] [Accepted: 03/27/2023] [Indexed: 04/07/2023]
Abstract
Enhancers are non-coding DNA elements that play a crucial role in enhancing the transcription rate of a specific gene in the genome. Experiments for identifying enhancers can be restricted by their conditions and involve complicated, time-consuming, laborious, and costly steps. To overcome these challenges, computational platforms have been developed to complement experimental methods that enable high-throughput identification of enhancers. Over the last few years, the development of various enhancer computational tools has resulted in significant progress in predicting putative enhancers. Thus, researchers are now able to use a variety of strategies to enhance and advance enhancer study. In this review, an overview of machine learning (ML)-based prediction methods for enhancer identification and related databases has been provided. The existing enhancer-prediction methods have also been reviewed regarding their algorithms, feature selection processes, validation techniques, and software utility. In addition, the advantages and drawbacks of these ML approaches and guidelines for developing bioinformatic tools have been highlighted for a more efficient enhancer prediction. This review will serve as a useful resource for experimentalists in selecting the appropriate ML tool for their study, and for bioinformaticians in developing more accurate and advanced ML-based predictors.
Collapse
Affiliation(s)
- Le Thi Phan
- Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, Gyeonggi-do, South Korea
| | - Changmin Oh
- Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, Gyeonggi-do, South Korea
| | - Tao He
- Beidahuang Industry Group General Hospital, Harbin, China
| | - Balachandran Manavalan
- Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, Gyeonggi-do, South Korea
| |
Collapse
|
15
|
Ali F, Alghamdi W, Almagrabi AO, Alghushairy O, Banjar A, Khalid M. Deep-AGP: Prediction of angiogenic protein by integrating two-dimensional convolutional neural network with discrete cosine transform. Int J Biol Macromol 2023; 243:125296. [PMID: 37301349 DOI: 10.1016/j.ijbiomac.2023.125296] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 06/05/2023] [Accepted: 06/07/2023] [Indexed: 06/12/2023]
Abstract
Angiogenic proteins (AGPs) play a primary role in the formation of new blood vessels from pre-existing ones. AGPs have diverse applications in cancer, including serving as biomarkers, guiding anti-angiogenic therapies, and aiding in tumor imaging. Understanding the role of AGPs in cardiovascular and neurodegenerative diseases is vital for developing new diagnostic tools and therapeutic approaches. Considering the significance of AGPs, in this research, we first time established a computational model using deep learning for identifying AGPs. First, we constructed a sequence-based dataset. Second, we explored features by designing a novel feature encoder, called position-specific scoring matrix-decomposition-discrete cosine transform (PSSM-DC-DCT) and existing descriptors including Dipeptide Deviation from Expected Mean (DDE) and bigram-position-specific scoring matrix (Bi-PSSM). Third, each feature set is fed into two-dimensional convolutional neural network (2D-CNN) and machine learning classifiers. Finally, the performance of each learning model is validated by 10-fold cross-validation (CV). The experimental results demonstrate that 2D-CNN with proposed novel feature descriptor achieved the highest success rate on both training and testing datasets. In addition to being an accurate predictor for identification of angiogenic proteins, our proposed method (Deep-AGP) might be fruitful in understanding cancer, cardiovascular, and neurodegenerative diseases, development of their novel therapeutic methods and drug designing.
Collapse
Affiliation(s)
- Farman Ali
- Sarhad University of Science and Information Technology Peshawar, Mardan Campus, Pakistan.
| | - Wajdi Alghamdi
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Alaa Omran Almagrabi
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| | - Omar Alghushairy
- Department of Information Systems and Technology, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
| | - Ameen Banjar
- Department of Information Systems and Technology, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
| | - Majdi Khalid
- Department of Computer Science, College of Computers and Information Systems, Umm Al-Qura University, Makkah 21955, Saudi Arabia
| |
Collapse
|
16
|
Su W, Qian X, Yang K, Ding H, Huang C, Zhang Z. Recognition of outer membrane proteins using multiple feature fusion. Front Genet 2023; 14:1211020. [PMID: 37351347 PMCID: PMC10284346 DOI: 10.3389/fgene.2023.1211020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 05/24/2023] [Indexed: 06/24/2023] Open
Abstract
Introduction: Outer membrane proteins are crucial in maintaining the structural stability and permeability of the outer membrane. Outer membrane proteins exhibit several functions such as antigenicity and strong immunogenicity, which have potential applications in clinical diagnosis and disease prevention. However, wet experiments for studying OMPs are time and capital-intensive, thereby necessitating the use of computational methods for their identification. Methods: In this study, we developed a computational model to predict outer membrane proteins. The non-redundant dataset consists of a positive set of 208 outer membrane proteins and a negative set of 876 non-outer membrane proteins. In this study, we employed the pseudo amino acid composition method to extract feature vectors and subsequently utilized the support vector machine for prediction. Results and Discussion: In the Jackknife cross-validation, the overall accuracy and the area under receiver operating characteristic curve were observed to be 93.19% and 0.966, respectively. These results demonstrate that our model can produce accurate predictions, and could serve as a valuable guide for experimental research on outer membrane proteins.
Collapse
Affiliation(s)
- Wenxia Su
- College of Science, Inner Mongolia Agriculture University, Hohhot, China
| | - Xiaojun Qian
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Keli Yang
- Nonlinear Research Institute, Baoji University of Arts and Sciences, Baoji, China
| | - Hui Ding
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Chengbing Huang
- School of Computer Science and Technology, Aba Teachers University, Aba, China
| | - Zhaoyue Zhang
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| |
Collapse
|
17
|
Lin Y, Sun M, Zhang J, Li M, Yang K, Wu C, Zulfiqar H, Lai H. Computational identification of promoters in Klebsiella aerogenes by using support vector machine. Front Microbiol 2023; 14:1200678. [PMID: 37250059 PMCID: PMC10215528 DOI: 10.3389/fmicb.2023.1200678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 04/18/2023] [Indexed: 05/31/2023] Open
Abstract
Promoters are the basic functional cis-elements to which RNA polymerase binds to initiate the process of gene transcription. Comprehensive understanding gene expression and regulation depends on the precise identification of promoters, as they are the most important component of gene expression. This study aimed to develop a machine learning-based model to predict promoters in Klebsiella aerogenes (K. aerogenes). In the prediction model, the promoter sequences in K. aerogenes genome were encoded by pseudo k-tuple nucleotide composition (PseKNC) and position-correlation scoring function (PCSF). Numerical features were obtained and then optimized using mRMR by combining with support vector machine (SVM) and 5-fold cross-validation (CV). Subsequently, these optimized features were inputted into SVM-based classifier to discriminate promoter sequences from non-promoter sequences in K. aerogenes. Results of 10-fold CV showed that the model could yield the overall accuracy of 96.0% and the area under the ROC curve (AUC) of 0.990. We hope that this model will provide help for the study of promoter and gene regulation in K. aerogenes.
Collapse
Affiliation(s)
- Yan Lin
- Key Laboratory for Animal Disease-Resistance Nutrition of the Ministry of Agriculture, Animal Nutrition Institute, Sichuan Agricultural University, Chengdu, China
| | - Meili Sun
- Beidahuang Industry Group General Hospital, Harbin, China
| | - Junjie Zhang
- Key Laboratory for Animal Disease-Resistance Nutrition of the Ministry of Agriculture, Animal Nutrition Institute, Sichuan Agricultural University, Chengdu, China
| | - Mingyan Li
- Chifeng Product Quality Inspection and Testing Centre, Chifeng, China
| | - Keli Yang
- Nonlinear Research Institute, Baoji University of Arts and Sciences, Baoji, China
| | - Chengyan Wu
- Baotou Teacher’s College, Inner Mongolia University of Science and Technology, Baotou, China
| | - Hasan Zulfiqar
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, Zhejiang, China
| | - Hongyan Lai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, China
| |
Collapse
|
18
|
Su D, Xiong Y, Wei H, Wang S, Ke J, Liang P, Zhang H, Yu Y, Zuo Y, Yang L. Integrated analysis of ovarian cancer patients from prospective transcription factor activity reveals subtypes of prognostic significance. Heliyon 2023; 9:e16147. [PMID: 37215759 PMCID: PMC10199194 DOI: 10.1016/j.heliyon.2023.e16147] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 05/04/2023] [Accepted: 05/07/2023] [Indexed: 05/24/2023] Open
Abstract
Transcription factors are protein molecules that act as regulators of gene expression. Aberrant protein activity of transcription factors can have a significant impact on tumor progression and metastasis in tumor patients. In this study, 868 immune-related transcription factors were identified from the transcription factor activity profile of 1823 ovarian cancer patients. The prognosis-related transcription factors were identified through univariate Cox analysis and random survival tree analysis, and two distinct clustering subtypes were subsequently derived based on these transcription factors. We assessed the clinical significance and genomics landscape of the two clustering subtypes and found statistically significant differences in prognosis, response to immunotherapy, and chemotherapy among ovarian cancer patients with different subtypes. Multi-scale Embedded Gene Co-expression Network Analysis was used to identify differential gene modules between the two clustering subtypes, which allowed us to conduct further analysis of biological pathways that exhibited significant differences between them. Finally, a ceRNA network was constructed to analyze lncRNA-miRNA-mRNA regulatory pairs with differential expression levels between two clustering subtypes. We expected that our study may provide some useful references for stratifying and treating patients with ovarian cancer.
Collapse
Affiliation(s)
- Dongqing Su
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yuqiang Xiong
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Haodong Wei
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Shiyuan Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Jiawei Ke
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Pengfei Liang
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
| | - Haoxin Zhang
- Department of Gastrointestinal Oncology, Harbin Medical University Cancer Hospital, Harbin 150081, China
| | - Yao Yu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yongchun Zuo
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China
- Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd., Hohhot, 010010, China
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| |
Collapse
|
19
|
Kalita A, Sikora-Skrabaka M, Nowakowska-Zajdel E. Role of Some microRNA/ADAM Proteins Axes in Gastrointestinal Cancers as a Novel Biomarkers and Potential Therapeutic Targets—A Review. Curr Issues Mol Biol 2023; 45:2917-2936. [PMID: 37185715 PMCID: PMC10136553 DOI: 10.3390/cimb45040191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 03/16/2023] [Accepted: 03/29/2023] [Indexed: 04/05/2023] Open
Abstract
Gastrointestinal (GI) cancers are some of the most common cancers in the world and their number is increasing. Their etiology and pathogenesis are still unclear. ADAM proteins are a family of transmembrane and secreted metalloproteinases that play a role in cancerogenesis, metastasis and neoangiogenesis. MicroRNAs are small single-stranded non-coding RNAs that take part in the post-transcriptional regulation of gene expression. Some ADAM proteins can be targets for microRNAs. In this review, we analyze the impact of microRNA/ADAM protein axes in GI cancers.
Collapse
Affiliation(s)
- Agnieszka Kalita
- Department of Nutrition-Related Disease Prevention, Department of Metabolic Disease Prevention, Faculty of Health Sciences in Bytom, Medical University of Silesia in Katowice, 40-055 Katowice, Poland
- Department of Clinical Oncology, No. 4 Provincial Specialist Hospital, 41-902 Bytom, Poland
| | - Magdalena Sikora-Skrabaka
- Department of Nutrition-Related Disease Prevention, Department of Metabolic Disease Prevention, Faculty of Health Sciences in Bytom, Medical University of Silesia in Katowice, 40-055 Katowice, Poland
- Department of Clinical Oncology, No. 4 Provincial Specialist Hospital, 41-902 Bytom, Poland
| | - Ewa Nowakowska-Zajdel
- Department of Nutrition-Related Disease Prevention, Department of Metabolic Disease Prevention, Faculty of Health Sciences in Bytom, Medical University of Silesia in Katowice, 40-055 Katowice, Poland
- Department of Clinical Oncology, No. 4 Provincial Specialist Hospital, 41-902 Bytom, Poland
| |
Collapse
|
20
|
Zhang X, Zhu W, Sun H, Ding Y, Liu L. Prediction of CTCF loop anchor based on machine learning. Front Genet 2023; 14:1181956. [PMID: 37077544 PMCID: PMC10106609 DOI: 10.3389/fgene.2023.1181956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 03/24/2023] [Indexed: 04/05/2023] Open
Abstract
Introduction: Various activities in biological cells are affected by three-dimensional genome structure. The insulators play an important role in the organization of higher-order structure. CTCF is a representative of mammalian insulators, which can produce barriers to prevent the continuous extrusion of chromatin loop. As a multifunctional protein, CTCF has tens of thousands of binding sites in the genome, but only a portion of them can be used as anchors of chromatin loops. It is still unclear how cells select the anchor in the process of chromatin looping.Methods: In this paper, a comparative analysis is performed to investigate the sequence preference and binding strength of anchor and non-anchor CTCF binding sites. Furthermore, a machine learning model based on the CTCF binding intensity and DNA sequence is proposed to predict which CTCF sites can form chromatin loop anchors.Results: The accuracy of the machine learning model that we constructed for predicting the anchor of the chromatin loop mediated by CTCF reached 0.8646. And we find that the formation of loop anchor is mainly influenced by the CTCF binding strength and binding pattern (which can be interpreted as the binding of different zinc fingers).Discussion: In conclusion, our results suggest that The CTCF core motif and it’s flanking sequence may be responsible for the binding specificity. This work contributes to understanding the mechanism of loop anchor selection and provides a reference for the prediction of CTCF-mediated chromatin loops.
Collapse
Affiliation(s)
- Xiao Zhang
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
- Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China
| | - Wen Zhu
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
- Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China
- *Correspondence: Wen Zhu,
| | - Huimin Sun
- School of Physical Science and Technology, Inner Mongolia University, Hohhot, China
| | - Yijie Ding
- Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China
| | - Li Liu
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| |
Collapse
|
21
|
Yang YH, Ma CY, Gao D, Liu XW, Yuan SS, Ding H. i2OM: Toward a better prediction of 2'-O-methylation in human RNA. Int J Biol Macromol 2023; 239:124247. [PMID: 37003392 DOI: 10.1016/j.ijbiomac.2023.124247] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 03/06/2023] [Accepted: 03/22/2023] [Indexed: 04/03/2023]
Abstract
2'-O-methylation (2OM) is an omnipresent post-transcriptional modification in RNAs. It is important for the regulation of RNA stability, mRNA splicing and translation, as well as innate immunity. With the increase in publicly available 2OM data, several computational tools have been developed for the identification of 2OM sites in human RNA. Unfortunately, these tools suffer from the low discriminative power of redundant features, unreasonable dataset construction or overfitting. To address those issues, based on four types of 2OM (2OM-adenine (A), cytosine (C), guanine (G), and uracil (U)) data, we developed a two-step feature selection model to identify 2OM. For each type, the one-way analysis of variance (ANOVA) combined with mutual information (MI) was proposed to rank sequence features for obtaining the optimal feature subset. Subsequently, four predictors based on eXtreme Gradient Boosting (XGBoost) or support vector machine (SVM) were presented to identify the four types of 2OM sites. Finally, the proposed model could produce an overall accuracy of 84.3 % on the independent set. To provide a convenience for users, an online tool called i2OM was constructed and can be freely access at i2om.lin-group.cn. The predictor may provide a reference for the study of the 2OM.
Collapse
Affiliation(s)
- Yu-He Yang
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Cai-Yi Ma
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Dong Gao
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Xiao-Wei Liu
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Shi-Shi Yuan
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Hui Ding
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| |
Collapse
|
22
|
Liu M, Zhou J, Xi Q, Liang Y, Li H, Liang P, Guo Y, Liu M, Temuqile T, Yang L, Zuo Y. A computational framework of routine test data for the cost-effective chronic disease prediction. Brief Bioinform 2023; 24:7034465. [PMID: 36772998 DOI: 10.1093/bib/bbad054] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 01/04/2023] [Accepted: 01/26/2023] [Indexed: 02/12/2023] Open
Abstract
Chronic diseases, because of insidious onset and long latent period, have become the major global disease burden. However, the current chronic disease diagnosis methods based on genetic markers or imaging analysis are challenging to promote completely due to high costs and cannot reach universality and popularization. This study analyzed massive data from routine blood and biochemical test of 32 448 patients and developed a novel framework for cost-effective chronic disease prediction with high accuracy (AUC 87.32%). Based on the best-performing XGBoost algorithm, 20 classification models were further constructed for 17 types of chronic diseases, including 9 types of cancers, 5 types of cardiovascular diseases and 3 types of mental illness. The highest accuracy of the model was 90.13% for cardia cancer, and the lowest was 76.38% for rectal cancer. The model interpretation with the SHAP algorithm showed that CREA, R-CV, GLU and NEUT% might be important indices to identify the most chronic diseases. PDW and R-CV are also discovered to be crucial indices in classifying the three types of chronic diseases (cardiovascular disease, cancer and mental illness). In addition, R-CV has a higher specificity for cancer, ALP for cardiovascular disease and GLU for mental illness. The association between chronic diseases was further revealed. At last, we build a user-friendly explainable machine-learning-based clinical decision support system (DisPioneer: http://bioinfor.imu.edu.cn/dispioneer) to assist in predicting, classifying and treating chronic diseases. This cost-effective work with simple blood tests will benefit more people and motivate clinical implementation and further investigation of chronic diseases prevention and surveillance program.
Collapse
Affiliation(s)
- Mingzhu Liu
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
- Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd., Hohhot 010010, China
- Inner Mongolia International Mongolian Hospital, Hohhot 010065, China
| | - Jian Zhou
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
- Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd., Hohhot 010010, China
| | - Qilemuge Xi
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
| | - Yuchao Liang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
- Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd., Hohhot 010010, China
| | - Haicheng Li
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
- Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd., Hohhot 010010, China
| | - Pengfei Liang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
| | - Yuting Guo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
| | - Ming Liu
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
| | - Temuqile Temuqile
- Inner Mongolia International Mongolian Hospital, Hohhot 010065, China
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
- Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd., Hohhot 010010, China
- Inner Mongolia International Mongolian Hospital, Hohhot 010065, China
| |
Collapse
|
23
|
Constructing discriminative feature space for LncRNA-protein interaction based on deep autoencoder and marginal fisher analysis. Comput Biol Med 2023; 157:106711. [PMID: 36924738 DOI: 10.1016/j.compbiomed.2023.106711] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 01/26/2023] [Accepted: 02/26/2023] [Indexed: 03/04/2023]
Abstract
Long non-coding RNAs (lncRNAs) play important roles by regulating proteins in many biological processes and life activities. To uncover molecular mechanisms of lncRNA, it is very necessary to identify interactions of lncRNA with proteins. Recently, some machine learning methods were proposed to detect lncRNA-protein interactions according to the distribution of known interactions. The performances of these methods were largely dependent upon: (1) how exactly the distribution of known interactions was characterized by feature space; (2) how discriminative the feature space was for distinguishing lncRNA-protein interactions. Because the known interactions may be multiple and complex model, it remains a challenge to construct discriminative feature space for lncRNA-protein interactions. To resolve this problem, a novel method named DFRPI was developed based on deep autoencoder and marginal fisher analysis in this paper. Firstly, some initial features of lncRNA-protein interactions were extracted from the primary sequences and secondary structures of lncRNA and protein. Secondly, a deep autoencoder was exploited to learn encode parameters of the initial features to describe the known interactions precisely. Next, the marginal fisher analysis was employed to optimize the encode parameters of features to characterize a discriminative feature space of the lncRNA-protein interactions. Finally, a random forest-based predictor was trained on the discriminative feature space to detect lncRNA-protein interactions. Verified by a series of experiments, the results showed that our predictor achieved the precision of 0.920, recall of 0.916, accuracy of 0.918, MCC of 0.836, specificity of 0.920, sensitivity of 0.916 and AUC of 0.906 respectively, which outperforms the concerned methods for predicting lncRNA-protein interaction. It may be suggested that the proposed method can generate a reasonable and effective feature space for distinguishing lncRNA-protein interactions accurately. The code and data are available on https://github.com/D0ub1e-D/DFRPI.
Collapse
|
24
|
Zhang H, Chi M, Su D, Xiong Y, Wei H, Yu Y, Zuo Y, Yang L. A random forest-based metabolic risk model to assess the prognosis and metabolism-related drug targets in ovarian cancer. Comput Biol Med 2023; 153:106432. [PMID: 36608460 DOI: 10.1016/j.compbiomed.2022.106432] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 11/13/2022] [Accepted: 12/13/2022] [Indexed: 12/23/2022]
Abstract
As one of the most common gynecologic malignant tumors, ovarian cancer is usually diagnosed at an advanced and incurable stage because of its early asymptomatic onset. Increasing research into tumor biology has demonstrated that abnormal cellular metabolism precedes tumorigenesis, therefore it has become an area of active research in academia. Cellular metabolism is of great significance in cancer diagnostic and prognostic studies. In this study, we integrated The Cancer Genome Atlas dataset with multiple Gene Expression Omnibus ovarian cancer datasets, identified 17 metabolic pathways with prognostic values using the random forest algorithm, constructed a metabolic risk scoring model based on metabolic pathway enrichment scores, and classified patients with ovarian cancer into two subtypes. Then, we systematically investigated the differences between different subtypes in terms of prognosis, differential gene expression, immune signature enrichment, Hallmark signature enrichment, and somatic mutations. As well, we successfully predicted differences in sensitivity to immunotherapy and chemotherapy drugs in patients with different metabolic risk subtypes. Moreover, we identified 5 drug targets associated with high metabolic risk and low metabolic risk ovarian cancer phenotypes through the weighted correlation network analysis and investigated their roles in the genesis of ovarian cancer. Finally, we developed an XGBoost classifier for predicting metabolic risk types in patients with ovarian cancer, producing a good predictive effect. In light of the above study, the research findings will provide valuable information for prognostic prediction and personalized medical treatment of patients with ovarian cancer.
Collapse
Affiliation(s)
- Haoxin Zhang
- Department of Gastrointestinal Oncology, Harbin Medical University Cancer Hospital, Harbin, 150081, China
| | - Meng Chi
- Department of Anesthesiology, Harbin Medical University Cancer Hospital, Harbin, 150081, China
| | - Dongqing Su
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yuqiang Xiong
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Haodong Wei
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yao Yu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China
| | - Yongchun Zuo
- The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, 010070, China; Digital College, Inner Mongolia Intelligent Union Big Data Academy, Inner Mongolia Wesure Date Technology Co., Ltd, Hohhot, 010010, China.
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, China.
| |
Collapse
|
25
|
Zhang YF, Wang YH, Gu ZF, Pan XR, Li J, Ding H, Zhang Y, Deng KJ. Bitter-RF: A random forest machine model for recognizing bitter peptides. Front Med (Lausanne) 2023; 10:1052923. [PMID: 36778738 PMCID: PMC9909039 DOI: 10.3389/fmed.2023.1052923] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Accepted: 01/05/2023] [Indexed: 01/27/2023] Open
Abstract
Introduction Bitter peptides are short peptides with potential medical applications. The huge potential behind its bitter taste remains to be tapped. To better explore the value of bitter peptides in practice, we need a more effective classification method for identifying bitter peptides. Methods In this study, we developed a Random forest (RF)-based model, called Bitter-RF, using sequence information of the bitter peptide. Bitter-RF covers more comprehensive and extensive information by integrating 10 features extracted from the bitter peptides and achieves better results than the latest generation model on independent validation set. Results The proposed model can improve the accurate classification of bitter peptides (AUROC = 0.98 on independent set test) and enrich the practical application of RF method in protein classification tasks which has not been used to build a prediction model for bitter peptides. Discussion We hope the Bitter-RF could provide more conveniences to scholars for bitter peptide research.
Collapse
Affiliation(s)
- Yu-Fei Zhang
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Yu-Hao Wang
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhi-Feng Gu
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Xian-Run Pan
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Jian Li
- School of Basic Medical Sciences, Chengdu University, Chengdu, China
| | - Hui Ding
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Ke-Jun Deng
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
26
|
Zhang L, Li H, Zhang Z, Wang J, Chen G, Chen D, Shi W, Jia G, Liu M. Hybrid gMLP model for interaction prediction of MHC-peptide and TCR. Front Genet 2023; 13:1092822. [PMID: 36685858 PMCID: PMC9845249 DOI: 10.3389/fgene.2022.1092822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 12/01/2022] [Indexed: 01/05/2023] Open
Abstract
Understanding the interaction of T-cell receptor (TCR) with major histocompatibility-peptide (MHC-peptide) complex is extremely important in human immunotherapy and vaccine development. However, due to the limited available data, the performance of existing models for predicting the interaction of T-cell receptors (TCR) with major histocompatibility-peptide complexes is still unsatisfactory. Deep learning models have been applied to prediction tasks in various fields and have achieved better results compared with other traditional models. In this study, we leverage the gMLP model combined with attention mechanism to predict the interaction of MHC-peptide and TCR. Experiments show that our model can predict TCR-peptide interactions accurately and can handle the problems caused by different TCR lengths. Moreover, we demonstrate that the models trained with paired CDR3β-chain and CDR3α-chain data are better than those trained with only CDR3β-chain or with CDR3α-chain data. We also demonstrate that the hybrid model has greater potential than the traditional convolutional neural network.
Collapse
Affiliation(s)
- Lichao Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Haojin Li
- School of Software, Shandong University, Jinan, China
| | - Zhenjiu Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Jinjin Wang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | | | | | - Wentao Shi
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Gaozhi Jia
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Mingjun Liu
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| |
Collapse
|
27
|
Wan H, Liu Q, Ju Y. Utilize a few features to classify presynaptic and postsynaptic neurotoxins. Comput Biol Med 2023; 152:106380. [PMID: 36473343 DOI: 10.1016/j.compbiomed.2022.106380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Revised: 10/21/2022] [Accepted: 11/28/2022] [Indexed: 12/02/2022]
Abstract
Neurotoxins are a class of proteins that have a significant damaging effect on nerve tissue. Neurotoxins are classified into presynaptic neurotoxins and postsynaptic neurotoxins, and accurate identification of neurotoxins plays a key role in drug development. In this study, 90 presynaptic neurotoxins and 165 postsynaptic neurotoxins were classified. The features of the presynaptic and postsynaptic neurotoxin sequences were extracted using the AutoProp feature extraction method and feature selection was performed using the maximum relevance maximum distance (MRMD) program, Finally, only two features were retained to achieve 84.7% classification accuracy. Moreover, it was found that the two retained features were present in the conserved sites and motifs of presynaptic neurotoxins and could represent the critical structures of presynaptic neurotoxins. This method demonstrates that using a few key features to classify proteins can effectively identify critical protein structures.
Collapse
Affiliation(s)
- Hao Wan
- Institute of Advanced Cross-field Science, College of Life Science, Qingdao University, Qingdao, China
| | - Qing Liu
- Department of Anesthesiology, Hospital (T.C.M) Affiliated to Southwest Medical University, Luzhou, China.
| | - Ying Ju
- School of Informatics, Xiamen University, Xiamen, China.
| |
Collapse
|
28
|
Su W, Deng S, Gu Z, Yang K, Ding H, Chen H, Zhang Z. Prediction of apoptosis protein subcellular location based on amphiphilic pseudo amino acid composition. Front Genet 2023; 14:1157021. [PMID: 36926588 PMCID: PMC10011625 DOI: 10.3389/fgene.2023.1157021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 02/20/2023] [Indexed: 03/08/2023] Open
Abstract
Introduction: Apoptosis proteins play an important role in the process of cell apoptosis, which makes the rate of cell proliferation and death reach a relative balance. The function of apoptosis protein is closely related to its subcellular location, it is of great significance to study the subcellular locations of apoptosis proteins. Many efforts in bioinformatics research have been aimed at predicting their subcellular location. However, the subcellular localization of apoptotic proteins needs to be carefully studied. Methods: In this paper, based on amphiphilic pseudo amino acid composition and support vector machine algorithm, a new method was proposed for the prediction of apoptosis proteins\x{2019} subcellular location. Results and Discussion: The method achieved good performance on three data sets. The Jackknife test accuracy of the three data sets reached 90.5%, 93.9% and 84.0%, respectively. Compared with previous methods, the prediction accuracies of APACC_SVM were improved.
Collapse
Affiliation(s)
- Wenxia Su
- College of Science, Inner Mongolia Agriculture University, Hohhot, China
| | - Shuyi Deng
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhifeng Gu
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Keli Yang
- Nonlinear Research Institute, Baoji University of Arts and Sciences, Baoji, China
| | - Hui Ding
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hui Chen
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Zhaoyue Zhang
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China.,School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| |
Collapse
|
29
|
Shi H, Li Y, Chen Y, Qin Y, Tang Y, Zhou X, Zhang Y, Wu Y. ToxMVA: An end-to-end multi-view deep autoencoder method for protein toxicity prediction. Comput Biol Med 2022; 151:106322. [PMID: 36435057 DOI: 10.1016/j.compbiomed.2022.106322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 11/03/2022] [Accepted: 11/14/2022] [Indexed: 11/18/2022]
Abstract
Effectively predicting protein toxicity plays an essential step in the early stage of protein-based drug discovery, which is of great help to speed up novel drug screening and reduce costs. Recently, several relevant datasets have been designed, and then machine learning-based methods have been proposed to predict the toxicity of the protein and have shown satisfactory performance. However, previous studies generally directly concatenate different protein features, which may introduce irrelevant information and decrease model performance. In this study, we present a novel end-to-end deep learning-based method called ToxMVA, to predict protein toxicity. To be specific, we first build comprehensive feature profiles of proteins based on primary sequences, including sequential, physicochemical, and contextual semantic information. Next, an autoencoder network is introduced to integrate the multi-view information for obtaining a more concise and accurate feature representation. Extensive experimental results on three datasets demonstrate that ToxMVA has superior performance for protein toxicity prediction and shows better robustness among three different datasets.
Collapse
Affiliation(s)
- Hua Shi
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China
| | - Yan Li
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China
| | - Yi Chen
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China
| | - Yuming Qin
- Anesthesiology Department, The Affiliated Hospital of Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Yifan Tang
- Anesthesiology Department, The Affiliated Hospital of Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Xun Zhou
- Beidahuang Industry Group General Hospital, Harbin, China.
| | - Ying Zhang
- Anesthesiology Department, The Affiliated Hospital of Southwest Medical University, Luzhou, 646000, Sichuan, China.
| | - Yun Wu
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China.
| |
Collapse
|
30
|
Dong B, Li M, Jiang B, Gao B, Li D, Zhang T. Antimicrobial Peptides Prediction method based on sequence multidimensional feature embedding. Front Genet 2022; 13:1069558. [PMID: 36468005 PMCID: PMC9714691 DOI: 10.3389/fgene.2022.1069558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 11/02/2022] [Indexed: 09/10/2024] Open
Abstract
Antimicrobial peptides (AMPs) are alkaline substances with efficient bactericidal activity produced in living organisms. As the best substitute for antibiotics, they have been paid more and more attention in scientific research and clinical application. AMPs can be produced from almost all organisms and are capable of killing a wide variety of pathogenic microorganisms. In addition to being antibacterial, natural AMPs have many other therapeutically important activities, such as wound healing, antioxidant and immunomodulatory effects. To discover new AMPs, the use of wet experimental methods is expensive and difficult, and bioinformatics technology can effectively solve this problem. Recently, some deep learning methods have been applied to the prediction of AMPs and achieved good results. To further improve the prediction accuracy of AMPs, this paper designs a new deep learning method based on sequence multidimensional representation. By encoding and embedding sequence features, and then inputting the model to identify AMPs, high-precision classification of AMPs and Non-AMPs with lengths of 10-200 is achieved. The results show that our method improved accuracy by 1.05% compared to the most advanced model in independent data validation without decreasing other indicators.
Collapse
Affiliation(s)
- Benzhi Dong
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Mengna Li
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Bei Jiang
- Tianjin Second People's Hospital, Tianjin Institute of Hepatology, Tianjin, China
| | - Bo Gao
- Department of Radiology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Dan Li
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Tianjiao Zhang
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| |
Collapse
|
31
|
Matsuzaka Y, Yashiro R. Regulation of Extracellular Vesicle-Mediated Immune Responses against Antigen-Specific Presentation. Vaccines (Basel) 2022; 10:1691. [PMID: 36298556 PMCID: PMC9607341 DOI: 10.3390/vaccines10101691] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 09/28/2022] [Accepted: 10/04/2022] [Indexed: 11/24/2022] Open
Abstract
Extracellular vesicles (EVs) produced by various immune cells, including B and T cells, macrophages, dendritic cells (DCs), natural killer (NK) cells, and mast cells, mediate intercellular communication and have attracted much attention owing to the novel delivery system of molecules in vivo. DCs are among the most active exosome-secreting cells of the immune system. EVs produced by cancer cells contain cancer antigens; therefore, the development of vaccine therapy that does not require the identification of cancer antigens using cancer-cell-derived EVs may have significant clinical implications. In this review, we summarise the molecular mechanisms underlying EV-based immune responses and their therapeutic effects on tumour vaccination.
Collapse
Affiliation(s)
- Yasunari Matsuzaka
- Division of Molecular and Medical Genetics, Center for Gene and Cell Therapy, The Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo 108-8639, Japan
- Administrative Section of Radiation Protection, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Kodaira 187-8551, Tokyo, Japan
| | - Ryu Yashiro
- Administrative Section of Radiation Protection, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Kodaira 187-8551, Tokyo, Japan
- Department of Infectious Diseases, Kyorin University School of Medicine, 6-20-2 Shinkawa, Mitaka-shi 181-8611, Tokyo, Japan
| |
Collapse
|
32
|
Yuan SS, Gao D, Xie XQ, Ma CY, Su W, Zhang ZY, Zheng Y, Ding H. IBPred: A sequence-based predictor for identifying ion binding protein in phage. Comput Struct Biotechnol J 2022; 20:4942-4951. [PMID: 36147670 PMCID: PMC9474292 DOI: 10.1016/j.csbj.2022.08.053] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 08/23/2022] [Accepted: 08/24/2022] [Indexed: 11/16/2022] Open
Abstract
Ion binding proteins (IBPs) can selectively and non-covalently interact with ions. IBPs in phages also play an important role in biological processes. Therefore, accurate identification of IBPs is necessary for understanding their biological functions and molecular mechanisms that involve binding to ions. Since molecular biology experimental methods are still labor-intensive and cost-ineffective in identifying IBPs, it is helpful to develop computational methods to identify IBPs quickly and efficiently. In this work, a random forest (RF)-based model was constructed to quickly identify IBPs. Based on the protein sequence information and residues' physicochemical properties, the dipeptide composition combined with the physicochemical correlation between two residues were proposed for the extraction of features. A feature selection technique called analysis of variance (ANOVA) was used to exclude redundant information. By comparing with other classified methods, we demonstrated that our method could identify IBPs accurately. Based on the model, a Python package named IBPred was built with the source code which can be accessed at https://github.com/ShishiYuan/IBPred.
Collapse
Affiliation(s)
- Shi-Shi Yuan
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Dong Gao
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Xue-Qin Xie
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Cai-Yi Ma
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Wei Su
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Zhao-Yue Zhang
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Yan Zheng
- Baotou Medical College, Baotou 014040, China
| | - Hui Ding
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|