Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ren J, Zhang Y, Guo W, Feng K, Yuan Y, Huang T, Cai YD. Identification of Genes Associated with the Impairment of Olfactory and Gustatory Functions in COVID-19 via Machine-Learning Methods. Life (Basel) 2023;13:798. [PMID: 36983953 PMCID: PMC10051382 DOI: 10.3390/life13030798] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 03/10/2023] [Accepted: 03/13/2023] [Indexed: 03/17/2023] Open

Number

Cited by Other Article(s)

Ren J, Gao Q, Zhou X, Chen L, Guo W, Feng K, Huang T, Cai YD. Identification of key gene expression associated with quality of life after recovery from COVID-19. Med Biol Eng Comput 2024;62:1031-1048. [PMID: 38123886 DOI: 10.1007/s11517-023-02988-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 11/30/2023] [Indexed: 12/23/2023]

Ma Q, Chen L, Feng K, Guo W, Huang T, Cai YD. Exploring Prognostic Gene Factors in Breast Cancer via Machine Learning. Biochem Genet 2024:10.1007/s10528-024-10712-w. [PMID: 38383836 DOI: 10.1007/s10528-024-10712-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 01/21/2024] [Indexed: 02/23/2024]

Chen L, Xu J, Zhou Y. PDATC-NCPMKL: Predicting drug's Anatomical Therapeutic Chemical (ATC) codes based on network consistency projection and multiple kernel learning. Comput Biol Med 2024;169:107862. [PMID: 38150886 DOI: 10.1016/j.compbiomed.2023.107862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 11/19/2023] [Accepted: 12/17/2023] [Indexed: 12/29/2023]

Ren J, Zhou X, Huang K, Chen L, Guo W, Feng K, Huang T, Cai YD. Identification of key genes associated with persistent immune changes and secondary immune activation responses induced by influenza vaccination after COVID-19 recovery by machine learning methods. Comput Biol Med 2024;169:107883. [PMID: 38157776 DOI: 10.1016/j.compbiomed.2023.107883] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 11/27/2023] [Accepted: 12/18/2023] [Indexed: 01/03/2024]

Chen L, Zhang C, Xu J. PredictEFC: a fast and efficient multi-label classifier for predicting enzyme family classes. BMC Bioinformatics 2024;25:50. [PMID: 38291384 PMCID: PMC10829269 DOI: 10.1186/s12859-024-05665-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 01/22/2024] [Indexed: 02/01/2024] Open

Abstract

BACKGROUND

Enzymes play an irreplaceable role in maintaining the lives of living organisms. The Enzyme Commission (EC) number of an enzyme indicates its essential functions. Correct identification of the first digit (family class) of the EC number for a given enzyme has been a hot topic over the past twenty years. Several previous methods adopted functional domain composition to represent enzymes. However, this representation leads to the dimension disaster, thereby reducing the efficiency of the methods. Moreover, most previous methods can only deal with enzymes belonging to one family class, whereas several enzymes in fact belong to two or more family classes.

RESULTS

In this study, a fast and efficient multi-label classifier, named PredictEFC, was designed. To construct this classifier, a novel feature extraction scheme was designed for processing functional domain information of enzymes, which counts the distribution of each functional domain entry across seven family classes in the training dataset. Based on this scheme, each training or test enzyme was encoded into a 7-dimensional vector by fusing its functional domain information and the above statistical results. Random k-labelsets (RAKEL) was adopted to build the classifier, where random forest was selected as the base classification algorithm. The two tenfold cross-validation results on the training dataset showed that the accuracy of PredictEFC can reach 0.8493 and 0.8370. The independent tests on two datasets yielded accuracy values of 0.9118 and 0.8777.

CONCLUSION

The performance of PredictEFC was slightly lower than that of the classifier directly using functional domain composition. However, its efficiency was sharply improved: the running time was less than one-tenth of that of the classifier directly using functional domain composition. In addition, the utility of PredictEFC was superior to the classifiers using traditional dimensionality reduction methods and some previous methods, and this classifier can be transplanted for predicting enzyme family classes of other species. Finally, a web-server available at http://124.221.158.221/ was set up for easy usage.
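The feature extraction scheme summarized in RESULTS can be illustrated with a minimal sketch. The abstract does not say how the per-domain statistics are fused into the 7-dimensional vector, so the averaging step below is an assumption, and the domain identifiers (`D1`, `D2`, `D3`), function names, and toy training set are all hypothetical.

```python
from collections import defaultdict

NUM_CLASSES = 7  # seven EC family classes, as stated in the abstract


def domain_class_distribution(training_set):
    """Count how often each domain entry occurs in enzymes of each family class.

    training_set: list of (domains, classes) pairs, where `domains` is a set of
    domain identifiers and `classes` is a set of class indices in 0..6.
    Returns a dict mapping domain -> list of 7 relative frequencies.
    """
    counts = defaultdict(lambda: [0] * NUM_CLASSES)
    for domains, classes in training_set:
        for d in domains:
            for c in classes:
                counts[d][c] += 1
    dist = {}
    for d, row in counts.items():
        total = sum(row)
        dist[d] = [x / total for x in row]
    return dist


def encode_enzyme(domains, dist):
    """Encode one enzyme as a 7-dimensional vector.

    Assumption (not from the abstract): average the class distributions of the
    enzyme's domains; domains unseen in training are ignored, and an enzyme
    with no known domain gets the all-zero vector.
    """
    known = [dist[d] for d in domains if d in dist]
    if not known:
        return [0.0] * NUM_CLASSES
    return [sum(col) / len(known) for col in zip(*known)]


# Toy multi-label training set: the second enzyme belongs to two classes.
training = [
    ({"D1", "D2"}, {0}),
    ({"D2"}, {0, 2}),
    ({"D3"}, {5}),
]
dist = domain_class_distribution(training)
vector = encode_enzyme({"D2", "D3"}, dist)
print(vector)
```

Whatever the number of distinct domain entries, the encoded vector stays 7-dimensional, which is the source of the efficiency gain the abstract reports relative to using the full domain composition directly.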

Zhou B, Ran B, Chen L. A GraphSAGE-based model with fingerprints only to predict drug-drug interactions. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024;21:2922-2942. [PMID: 38454713 DOI: 10.3934/mbe.2024130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]

Abstract

Drugs are an effective way to treat various diseases. Some diseases are so complicated that the effect of a single drug for such diseases is limited, which has led to the emergence of combination drug therapy. The use of multiple drugs to treat these diseases can improve the drug efficacy, but it can also bring adverse effects. Thus, it is essential to determine drug-drug interactions (DDIs). Recently, deep learning algorithms have become popular for designing DDI prediction models. However, most deep learning-based models need several types of drug properties, which limits their application to drugs without these properties. In this study, a new deep learning-based model was designed to predict DDIs. For wide applications, drugs were first represented by commonly used properties, referred to as fingerprint features. Then, these features were fused with the drug interaction network by a type of graph convolutional network method, GraphSAGE, yielding high-level drug features. The inner product was adopted to score the strength of drug pairs. The model was evaluated by 10-fold cross-validation, resulting in an AUROC of 0.9704 and AUPR of 0.9727. Such performance was better than that of the previous model which directly used drug fingerprint features and was competitive with some other previous models that used more drug properties. Furthermore, the ablation tests indicated the importance of the main parts of the model, and we analyzed the strengths and limitations of the model for drugs with different degrees in the network. This model identified some novel DDIs that may bring expected benefits, such as the combination of PEA and cannabinol that may produce better effects. DDIs that may cause unexpected side effects have also been discovered, such as the combined use of WIN 55,212-2 and cannabinol. These DDIs can provide novel insights for treating complex diseases or avoiding adverse drug events.
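As a rough illustration of the pipeline this abstract describes (fingerprint features, neighborhood aggregation over the interaction network, inner-product scoring), the following is a minimal sketch of one GraphSAGE-style layer. The abstract does not specify the aggregator, layer count, or dimensions, so the mean aggregator, the toy fingerprints, the drug names, and the fixed weight matrix are all assumptions made for illustration; a real model would learn the weights.

```python
def mean_aggregate(features, neighbors, node):
    """Average the feature vectors of a node's neighbors (zero vector if none)."""
    dim = len(features[node])
    nbrs = neighbors.get(node, [])
    if not nbrs:
        return [0.0] * dim
    return [sum(features[u][i] for u in nbrs) / len(nbrs) for i in range(dim)]


def sage_layer(features, neighbors, weight):
    """One GraphSAGE-style layer: ReLU(W applied to [h_v ; mean of neighbor h_u])."""
    out = {}
    for v, h in features.items():
        concat = h + mean_aggregate(features, neighbors, v)
        out[v] = [max(0.0, sum(w * x for w, x in zip(row, concat))) for row in weight]
    return out


def score_pair(embeddings, a, b):
    """Inner product of two drug embeddings, used as the interaction score."""
    return sum(x * y for x, y in zip(embeddings[a], embeddings[b]))


# Toy 2-bit "fingerprints" and a toy interaction network (all hypothetical).
features = {"drugA": [1.0, 0.0], "drugB": [0.0, 1.0], "drugC": [1.0, 1.0]}
neighbors = {"drugA": ["drugB"], "drugB": ["drugA", "drugC"], "drugC": ["drugB"]}
# Arbitrary fixed weights (2 outputs x 4 inputs), standing in for learned ones.
weight = [[1.0, 0.0, 0.5, 0.0],
          [0.0, 1.0, 0.0, 0.5]]

embeddings = sage_layer(features, neighbors, weight)
score_ab = score_pair(embeddings, "drugA", "drugB")
print(embeddings, score_ab)
```

Because the score is an inner product, it is symmetric in the two drugs, which matches the undirected nature of a drug pair.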

Ding S, Liao H, Huang F, Chen L, Guo W, Feng K, Huang T, Cai YD. Analyzing domain features of small proteins using a machine-learning method. Proteomics 2024:e2300302. [PMID: 38258387 DOI: 10.1002/pmic.202300302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 01/14/2024] [Accepted: 01/15/2024] [Indexed: 01/24/2024]

Chen L, Qu R, Liu X. Improved multi-label classifiers for predicting protein subcellular localization. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024;21:214-236. [PMID: 38303420 DOI: 10.3934/mbe.2024010] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]

Lin X, Ma Q, Chen L, Guo W, Huang Z, Huang T, Cai YD. Identifying genes associated with resistance to KRAS G12C inhibitors via machine learning methods. Biochim Biophys Acta Gen Subj 2023;1867:130484. [PMID: 37805078 DOI: 10.1016/j.bbagen.2023.130484] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 10/02/2023] [Accepted: 10/04/2023] [Indexed: 10/09/2023]

Chen L, Zhao X. PCDA-HNMP: Predicting circRNA-disease association using heterogeneous network and meta-path. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023;20:20553-20575. [PMID: 38124565 DOI: 10.3934/mbe.2023909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]

Abstract

An increasing number of experimental studies have shown that circular RNAs (circRNAs) play important regulatory roles in human diseases through interactions with related microRNAs (miRNAs). CircRNAs have become new potential disease biomarkers and therapeutic targets. Predicting circRNA-disease association (CDA) is of great significance for exploring the pathogenesis of complex diseases, which can improve the diagnosis level of diseases and promote the targeted therapy of diseases. However, determination of CDAs through traditional clinical trials is usually time-consuming and expensive. Computational methods are now alternative ways to predict CDAs. In this study, a new computational method, named PCDA-HNMP, was designed. For obtaining informative features of circRNAs and diseases, a heterogeneous network was first constructed, which defined circRNAs, mRNAs, miRNAs and diseases as nodes and associations between them as edges. Then, a deep analysis was conducted on the heterogeneous network by extracting meta-paths connecting to circRNAs (diseases), thereby mining hidden associations between various circRNAs (diseases). These associations constituted the meta-path-induced networks for circRNAs and diseases. The features of circRNAs and diseases were derived from the aforementioned networks via mashup. On the other hand, miRNA-disease associations (mDAs) were employed to improve the model's performance. miRNA features were yielded from the meta-path-induced networks on miRNAs and circRNAs, which were constructed from the meta-paths connecting miRNAs and circRNAs in the heterogeneous network. A concatenation operation was adopted to build the features of CDAs and mDAs. Such representations of CDAs and mDAs were fed into XGBoost to set up the model. The five-fold cross-validation yielded an area under the curve (AUC) of 0.9846, which was better than those of some existing state-of-the-art methods. The employment of mDAs enhanced the model's performance, and the importance analysis on meta-path-induced networks showed that networks produced by the meta-paths containing validated CDAs provided the most important contributions.

Yang Y, Zhang Y, Ren J, Feng K, Li Z, Huang T, Cai Y. Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods. Life (Basel) 2023;13:1876. [PMID: 37763280 PMCID: PMC10532943 DOI: 10.3390/life13091876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 08/24/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open

Ren JX, Gao Q, Zhou XC, Chen L, Guo W, Feng KY, Lu L, Huang T, Cai YD. Identification of Gene Markers Associated with COVID-19 Severity and Recovery in Different Immune Cell Subtypes. BIOLOGY 2023;12:947. [PMID: 37508378 PMCID: PMC10376631 DOI: 10.3390/biology12070947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 06/20/2023] [Accepted: 06/29/2023] [Indexed: 07/30/2023]

Chen L, Chen K, Zhou B. Inferring drug-disease associations by a deep analysis on drug and disease networks. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023;20:14136-14157. [PMID: 37679129 DOI: 10.3934/mbe.2023632] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]

Ma QL, Huang FM, Guo W, Feng KY, Huang T, Cai YD. Machine Learning Classification of Time since BNT162b2 COVID-19 Vaccination Based on Array-Measured Antibody Activity. Life (Basel) 2023;13:1304. [PMID: 37374086 DOI: 10.3390/life13061304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 05/26/2023] [Accepted: 05/29/2023] [Indexed: 06/29/2023] Open

Abstract

Vaccines trigger an immunological response that includes B and T cells, with B cells producing antibodies. SARS-CoV-2 immunity weakens over time after vaccination. Discovering key changes in antigen-reactive antibodies over time after vaccination could help improve vaccine efficiency. In this study, we collected data on blood antibody levels in a cohort of healthcare workers vaccinated for COVID-19 and obtained 73 antigens in samples from four groups according to the duration after vaccination, including 104 unvaccinated healthcare workers, 534 healthcare workers within 60 days after vaccination, 594 healthcare workers between 60 and 180 days after vaccination, and 141 healthcare workers over 180 days after vaccination. Our work was a reanalysis of the data originally collected at Irvine University. This data was obtained in Orange County, California, USA, with the collection process commencing in December 2020. British variant (B.1.1.7), South African variant (B.1.351), and Brazilian/Japanese variant (P.1) were the most prevalent strains during the sampling period. An efficient machine learning based framework containing four feature selection methods (least absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, and maximum relevance minimum redundancy) and four classification algorithms (decision tree, k-nearest neighbor, random forest, and support vector machine) was designed to select essential antibodies against specific antigens. Several efficient classifiers with a weighted F1 value around 0.75 were constructed. The antigen microarray used for identifying antibody levels in the coronavirus features ten distinct SARS-CoV-2 antigens, comprising various segments of both nucleocapsid protein (NP) and spike protein (S). This study revealed that S1 + S2, S1.mFcTag, S1.HisTag, S1, S2, Spike.RBD.His.Bac, Spike.RBD.rFc, and S1.RBD.mFc were most highly ranked among all features, where S1 and S2 are the subunits of Spike, and the suffixes represent the tagging information of different recombinant proteins. Meanwhile, the classification rules were obtained from the optimal decision tree to explain quantitatively the roles of antigens in the classification. This study identified antibodies associated with decreased clinical immunity based on populations with different time spans after vaccination. These antibodies have important implications for maintaining long-term immunity to SARS-CoV-2.

Xu Y, Ma Q, Ren J, Chen L, Guo W, Feng K, Zeng Z, Huang T, Cai Y. Using Machine Learning Methods in Identifying Genes Associated with COVID-19 in Cardiomyocytes and Cardiac Vascular Endothelial Cells. Life (Basel) 2023;13:1011. [PMID: 37109540 PMCID: PMC10146712 DOI: 10.3390/life13041011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 04/02/2023] [Accepted: 04/08/2023] [Indexed: 04/29/2023] Open