1
|
Huckvale ED, Moseley HNB. Predicting the Association of Metabolites with Both Pathway Categories and Individual Pathways. Metabolites 2024; 14:510. [PMID: 39330517 PMCID: PMC11433779 DOI: 10.3390/metabo14090510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Revised: 09/04/2024] [Accepted: 09/20/2024] [Indexed: 09/28/2024] Open
Abstract
Metabolism is a network of chemical reactions that sustain cellular life. Parts of this metabolic network are defined as metabolic pathways containing specific biochemical reactions. Products and reactants of these reactions are called metabolites, which are associated with certain human-defined metabolic pathways. Metabolic knowledgebases, such as the Kyoto Encyclopedia of Gene and Genomes (KEGG) contain metabolites, reactions, and pathway annotations; however, such resources are incomplete due to current limits of metabolic knowledge. To fill in missing metabolite pathway annotations, past machine learning models showed some success at predicting the KEGG Level 2 pathway category involvement of metabolites based on their chemical structure. Here, we present the first machine learning model to predict metabolite association to more granular KEGG Level 3 metabolic pathways. We used a feature and dataset engineering approach to generate over one million metabolite-pathway entries in the dataset used to train a single binary classifier. This approach produced a mean Matthews correlation coefficient (MCC) of 0.806 ± 0.017 SD across 100 cross-validation iterations. The 172 Level 3 pathways were predicted with an overall MCC of 0.726. Moreover, metabolite association with the 12 Level 2 pathway categories was predicted with an overall MCC of 0.891, representing significant transfer learning from the Level 3 pathway entries. These are the best metabolite pathway prediction results published so far in the field.
Collapse
Affiliation(s)
- Erik D Huckvale
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N B Moseley
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40536, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40536, USA
| |
Collapse
|
2
|
Huckvale ED, Moseley HN. Predicting the Pathway Involvement of Metabolites in Both Pathway Categories and Individual Pathways. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.07.607025. [PMID: 39149299 PMCID: PMC11326255 DOI: 10.1101/2024.08.07.607025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Metabolism is the network of chemical reactions that sustain cellular life. Parts of this metabolic network are defined as metabolic pathways containing specific biochemical reactions. Products and reactants of these reactions are called metabolites, which are associated with certain human-defined metabolic pathways. Metabolic knowledgebases, such as the Kyoto Encyclopedia of Gene and Genomes (KEGG) contain metabolites, reactions, and pathway annotations; however, such resources are incomplete due to current limits of metabolic knowledge. To fill in missing metabolite pathway annotations, past machine learning models showed some success at predicting KEGG Level 2 pathway category involvement of metabolites based on their chemical structure. Here, we present the first machine learning model to predict metabolite association to more granular KEGG Level 3 metabolic pathways. We used a feature and dataset engineering approach to generate over one million metabolite-pathway entries in the dataset used to train a single binary classifier. This approach produced a mean Matthews correlation coefficient (MCC) of 0.806 ± 0.017 SD across 100 cross-validations iterations. The 172 Level 3 pathways were predicted with an overall MCC of 0.726. Moreover, metabolite association with the 12 Level 2 pathway categories were predicted with an overall MCC of 0.891, representing significant transfer learning from the Level 3 pathway entries. These are the best metabolite-pathway prediction results published so far in the field.
Collapse
Affiliation(s)
- Erik D. Huckvale
- Markey Cancer Center, University of Kentucky, Lexington, KY, USA
- Superfund Research Center, University of Kentucky, Lexington, KY, USA
| | - Hunter N.B. Moseley
- Markey Cancer Center, University of Kentucky, Lexington, KY, USA
- Superfund Research Center, University of Kentucky, Lexington, KY, USA
- Department of Computer Science (Data Science Program), University of Kentucky, Lexington, KY, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY, USA
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY, USA
| |
Collapse
|
3
|
Wang C, Wang Y, Ding P, Li S, Yu X, Yu B. ML-FGAT: Identification of multi-label protein subcellular localization by interpretable graph attention networks and feature-generative adversarial networks. Comput Biol Med 2024; 170:107944. [PMID: 38215617 DOI: 10.1016/j.compbiomed.2024.107944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 12/08/2023] [Accepted: 01/01/2024] [Indexed: 01/14/2024]
Abstract
The prediction of multi-label protein subcellular localization (SCL) is a pivotal area in bioinformatics research. Recent advancements in protein structure research have facilitated the application of graph neural networks. This paper introduces a novel approach termed ML-FGAT. The approach begins by extracting node information of proteins from sequence data, physical-chemical properties, evolutionary insights, and structural details. Subsequently, various evolutionary techniques are integrated to consolidate multi-view information. A linear discriminant analysis framework, grounded on entropy weight, is then employed to reduce the dimensionality of the merged features. To enhance the robustness of the model, the training dataset is augmented using feature-generative adversarial networks. For the primary prediction step, graph attention networks are employed to determine multi-label protein SCL, leveraging both node and neighboring information. The interpretability is enhanced by analyzing the attention weight parameters. The training is based on the Gram-positive bacteria dataset, while validation employs newly constructed datasets: human, virus, Gram-negative bacteria, plant, and SARS-CoV-2. Following a leave-one-out cross-validation procedure, ML-FGAT demonstrates noteworthy superiority in this domain.
Collapse
Affiliation(s)
- Congjing Wang
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, 266061, China; School of Data Science, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Yifei Wang
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao, 266061, China; School of Data Science, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Pengju Ding
- College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao, 266061, China
| | - Shan Li
- School of Mathematics and Statistics, Central South University, Changsha, 410083, China
| | - Xu Yu
- Qingdao Institute of Software, College of Computer Science and Technology, China University of Petroleum, Qingdao, 266580, China
| | - Bin Yu
- School of Data Science, Qingdao University of Science and Technology, Qingdao, 266061, China; School of Data Science, University of Science and Technology of China, Hefei, 230027, China.
| |
Collapse
|
4
|
Moseley H. In the AI science boom, beware: your results are only as good as your data. Nature 2024:10.1038/d41586-024-00306-2. [PMID: 38302705 DOI: 10.1038/d41586-024-00306-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]
|
5
|
Liu Y, Jiang Y, Zhang F, Yang Y. A Novel Multi-Scale Graph Neural Network for Metabolic Pathway Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:178-187. [PMID: 38127612 DOI: 10.1109/tcbb.2023.3345647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Predicting the metabolic pathway classes of compounds in the human body is an important problem in drug research and development. For this purpose, we propose a Multi-Scale Graph Neural Network framework, named MSGNN. The framework includes a subgraph encoder, a feature encoder and a global feature processor, and a graph augmentation strategy is adopted. The subgraph encoder is responsible for extracting the local structural features of the compound, the feature encoder learns the characteristics of the atoms, and the global feature processor processes the information from the pre-training model and the two molecular fingerprints, while the graph augmentation strategy is to expand the train set through a scientific and reasonable method. The experiment result illustrates that the accuracy, precision, recall and F1 metrics of MSGNN reach 98.17%, 94.18%, 94.43% and 94.30%, respectively, which is superior to the similar models we have known. In addition, the ablation experiment demonstrates the indispensability of MSGNN modules.
Collapse
|
6
|
Huckvale ED, Powell CD, Jin H, Moseley HNB. Benchmark Dataset for Training Machine Learning Models to Predict the Pathway Involvement of Metabolites. Metabolites 2023; 13:1120. [PMID: 37999216 PMCID: PMC10673125 DOI: 10.3390/metabo13111120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 10/25/2023] [Accepted: 10/30/2023] [Indexed: 11/25/2023] Open
Abstract
Metabolic pathways are a human-defined grouping of life sustaining biochemical reactions, metabolites being both the reactants and products of these reactions. But many public datasets include identified metabolites whose pathway involvement is unknown, hindering metabolic interpretation. To address these shortcomings, various machine learning models, including those trained on data from the Kyoto Encyclopedia of Genes and Genomes (KEGG), have been developed to predict the pathway involvement of metabolites based on their chemical descriptions; however, these prior models are based on old metabolite KEGG-based datasets, including one benchmark dataset that is invalid due to the presence of over 1500 duplicate entries. Therefore, we have developed a new benchmark dataset derived from the KEGG following optimal standards of scientific computational reproducibility and including all source code needed to update the benchmark dataset as KEGG changes. We have used this new benchmark dataset with our atom coloring methodology to develop and compare the performance of Random Forest, XGBoost, and multilayer perceptron with autoencoder models generated from our new benchmark dataset. Best overall weighted average performance across 1000 unique folds was an F1 score of 0.8180 and a Matthews correlation coefficient of 0.7933, which was provided by XGBoost binary classification models for 11 KEGG-defined pathway categories.
Collapse
Affiliation(s)
- Erik D. Huckvale
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
| | - Christian D. Powell
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Department of Computer Science (Data Science Program), University of Kentucky, Lexington, KY 40506, USA
| | - Huan Jin
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N. B. Moseley
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY 40506, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40506, USA
| |
Collapse
|
7
|
Huckvale ED, Powell CD, Jin H, Moseley HN. Benchmark dataset for training machine learning models to predict the pathway involvement of metabolites. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.03.560715. [PMID: 37873272 PMCID: PMC10592640 DOI: 10.1101/2023.10.03.560715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Metabolic pathways are a human-defined grouping of life sustaining biochemical reactions, metabolites being both the reactants and products of these reactions. But many public datasets include identified metabolites whose pathway involvement is unknown, hindering metabolic interpretation. To address these shortcomings, various machine learning models, including those trained on data from the Kyoto Encyclopedia of Genes and Genomes (KEGG), have been developed to predict the pathway involvement of metabolites based on their chemical descriptions; however, these prior models are based on old metabolite KEGG-based datasets, including one benchmark dataset that is invalid due to the presence of over 1500 duplicate entries. Therefore, we have developed a new benchmark dataset derived from the KEGG following optimal standards of scientific computational reproducibility and including all source code needed to update the benchmark dataset as KEGG changes. We have used this new benchmark dataset with our atom coloring methodology to develop and compare the performance of Random Forest, XGBoost, and multilayer perceptron with autoencoder models generated from our new benchmark dataset. Best overall weighted average performance across 1000 unique folds was an F1-score of 0.8180 and Matthews correlation coefficient of 0.7933, which was provided by XGBoost binary classification models for 11 KEGG-defined pathway categories.
Collapse
Affiliation(s)
- Erik D. Huckvale
- Department of Computer Science (Data Science Program), University of Kentucky, Lexington, KY 40506, USA
| | - Christian D. Powell
- Department of Computer Science (Data Science Program), University of Kentucky, Lexington, KY 40506, USA
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
| | - Huan Jin
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N.B. Moseley
- Markey Cancer Center, University of Kentucky, Lexington, KY 40506, USA
- Superfund Research Center, University of Kentucky, Lexington, KY 40506, USA
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY 40506, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40506, USA
| |
Collapse
|
8
|
Liu X, Yang H, Ai C, Ding Y, Guo F, Tang J. MVML-MPI: Multi-View Multi-Label Learning for Metabolic Pathway Inference. Brief Bioinform 2023; 24:bbad393. [PMID: 37930024 DOI: 10.1093/bib/bbad393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 09/20/2023] [Accepted: 10/11/2023] [Indexed: 11/07/2023] Open
Abstract
Development of robust and effective strategies for synthesizing new compounds, drug targeting and constructing GEnome-scale Metabolic models (GEMs) requires a deep understanding of the underlying biological processes. A critical step in achieving this goal is accurately identifying the categories of pathways in which a compound participated. However, current machine learning-based methods often overlook the multifaceted nature of compounds, resulting in inaccurate pathway predictions. Therefore, we present a novel framework on Multi-View Multi-Label Learning for Metabolic Pathway Inference, hereby named MVML-MPI. First, MVML-MPI learns the distinct compound representations in parallel with corresponding compound encoders to fully extract features. Subsequently, we propose an attention-based mechanism that offers a fusion module to complement these multi-view representations. As a result, MVML-MPI accurately represents and effectively captures the complex relationship between compounds and metabolic pathways and distinguishes itself from current machine learning-based methods. In experiments conducted on the Kyoto Encyclopedia of Genes and Genomes pathways dataset, MVML-MPI outperformed state-of-the-art methods, demonstrating the superiority of MVML-MPI and its potential to utilize the field of metabolic pathway design, which can aid in optimizing drug-like compounds and facilitating the development of GEMs. The code and data underlying this article are freely available at https://github.com/guofei-tju/MVML-MPI. Contact: jtang@cse.sc.edu, guofei@csu.edu.com or wuxi_dyj@csj.uestc.edu.cn.
Collapse
Affiliation(s)
- Xiaoyi Liu
- Computer Science and Engineering, University of South Carolina, Columbia 29208, USA
| | - Hongpeng Yang
- Computer Science and Engineering, University of South Carolina, Columbia 29208, USA
| | - Chengwei Ai
- Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Yijie Ding
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324000, China
| | - Fei Guo
- Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Jijun Tang
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Nanshan 518055, China
| |
Collapse
|
9
|
Bao H, Zhao J, Zhao X, Zhao C, Lu X, Xu G. Prediction of plant secondary metabolic pathways using deep transfer learning. BMC Bioinformatics 2023; 24:348. [PMID: 37726702 PMCID: PMC10507959 DOI: 10.1186/s12859-023-05485-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 09/14/2023] [Indexed: 09/21/2023] Open
Abstract
BACKGROUND Plant secondary metabolites are highly valued for their applications in pharmaceuticals, nutrition, flavors, and aesthetics. It is of great importance to elucidate plant secondary metabolic pathways due to their crucial roles in biological processes during plant growth and development. However, understanding plant biosynthesis and degradation pathways remains a challenge due to the lack of sufficient information in current databases. To address this issue, we proposed a transfer learning approach using a pre-trained hybrid deep learning architecture that combines Graph Transformer and convolutional neural network (GTC) to predict plant metabolic pathways. RESULTS GTC provides comprehensive molecular representation by extracting both structural features from the molecular graph and textual information from the SMILES string. GTC is pre-trained on the KEGG datasets to acquire general features, followed by fine-tuning on plant-derived datasets. Four metrics were chosen for model performance evaluation. The results show that GTC outperforms six other models, including three previously reported machine learning models, on the KEGG dataset. GTC yields an accuracy of 96.75%, precision of 85.14%, recall of 83.03%, and F1_score of 84.06%. Furthermore, an ablation study confirms the indispensability of all the components of the hybrid GTC model. Transfer learning is then employed to leverage the shared knowledge acquired from the KEGG metabolic pathways. As a result, the transferred GTC exhibits outstanding accuracy in predicting plant secondary metabolic pathways with an average accuracy of 98.30% in fivefold cross-validation and 97.82% on the final test. In addition, GTC is employed to classify natural products. It achieves a perfect accuracy score of 100.00% for alkaloids, while the lowest accuracy score of 98.42% for shikimates and phenylpropanoids. CONCLUSIONS The proposed GTC effectively captures molecular features, and achieves high performance in classifying KEGG metabolic pathways and predicting plant secondary metabolic pathways via transfer learning. Furthermore, GTC demonstrates its generalization ability by accurately classifying natural products. A user-friendly executable program has been developed, which only requires the input of the SMILES string of the query compound in a graphical interface.
Collapse
Affiliation(s)
- Han Bao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, People's Republic of China
- University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China
- Liaoning Province Key Laboratory of Metabolomics, Dalian, 116023, People's Republic of China
| | - Jinhui Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, People's Republic of China
- University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China
- Liaoning Province Key Laboratory of Metabolomics, Dalian, 116023, People's Republic of China
| | - Xinjie Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, People's Republic of China
- University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China
- Liaoning Province Key Laboratory of Metabolomics, Dalian, 116023, People's Republic of China
| | - Chunxia Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, People's Republic of China
- University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China
- Liaoning Province Key Laboratory of Metabolomics, Dalian, 116023, People's Republic of China
| | - Xin Lu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, People's Republic of China.
- University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China.
- Liaoning Province Key Laboratory of Metabolomics, Dalian, 116023, People's Republic of China.
| | - Guowang Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, People's Republic of China.
- University of Chinese Academy of Sciences, Beijing, 100049, People's Republic of China.
- Liaoning Province Key Laboratory of Metabolomics, Dalian, 116023, People's Republic of China.
| |
Collapse
|
10
|
Du BX, Long Y, Li X, Wu M, Shi JY. CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning. Bioinformatics 2023; 39:btad503. [PMID: 37572298 PMCID: PMC10457661 DOI: 10.1093/bioinformatics/btad503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 07/26/2023] [Accepted: 08/11/2023] [Indexed: 08/14/2023] Open
Abstract
MOTIVATION Metabolic stability plays a crucial role in the early stages of drug discovery and development. Accurately modeling and predicting molecular metabolic stability has great potential for the efficient screening of drug candidates as well as the optimization of lead compounds. Considering wet-lab experiment is time-consuming, laborious, and expensive, in silico prediction of metabolic stability is an alternative choice. However, few computational methods have been developed to address this task. In addition, it remains a significant challenge to explain key functional groups determining metabolic stability. RESULTS To address these issues, we develop a novel cross-modality graph contrastive learning model named CMMS-GCL for predicting the metabolic stability of drug candidates. In our framework, we design deep learning methods to extract features for molecules from two modality data, i.e. SMILES sequence and molecule graph. In particular, for the sequence data, we design a multihead attention BiGRU-based encoder to preserve the context of symbols to learn sequence representations of molecules. For the graph data, we propose a graph contrastive learning-based encoder to learn structure representations by effectively capturing the consistencies between local and global structures. We further exploit fully connected neural networks to combine the sequence and structure representations for model training. Extensive experimental results on two datasets demonstrate that our CMMS-GCL consistently outperforms seven state-of-the-art methods. Furthermore, a collection of case studies on sequence data and statistical analyses of the graph structure module strengthens the validation of the interpretability of crucial functional groups recognized by CMMS-GCL. Overall, CMMS-GCL can serve as an effective and interpretable tool for predicting metabolic stability, identifying critical functional groups, and thus facilitating the drug discovery process and lead compound optimization. AVAILABILITY AND IMPLEMENTATION The code and data underlying this article are freely available at https://github.com/dubingxue/CMMS-GCL.
Collapse
Affiliation(s)
- Bing-Xue Du
- School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, China
- Institute for Infocomm Research (IR), Agency for Science, Technology and Research (A*STAR), Singapore 138632, Singapore
| | - Yahui Long
- Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR), Singapore 138648, Singapore
| | - Xiaoli Li
- Institute for Infocomm Research (IR), Agency for Science, Technology and Research (A*STAR), Singapore 138632, Singapore
| | - Min Wu
- Institute for Infocomm Research (IR), Agency for Science, Technology and Research (A*STAR), Singapore 138632, Singapore
| | - Jian-Yu Shi
- School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, China
| |
Collapse
|