1
|
Huang J, Wang X, Xia R, Yang D, Liu J, Lv Q, Yu X, Meng J, Chen K, Song B, Wang Y. Domain-knowledge enabled ensemble learning of 5-formylcytosine (f5C) modification sites. Comput Struct Biotechnol J 2024; 23:3175-3185. [PMID: 39253057 PMCID: PMC11381828 DOI: 10.1016/j.csbj.2024.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Revised: 08/07/2024] [Accepted: 08/07/2024] [Indexed: 09/11/2024] Open
Abstract
5-formylcytidine (f5C) is a unique post-transcriptional RNA modification found in mRNA and tRNA at the wobble site, playing a crucial role in mitochondrial protein synthesis and potentially contributing to the regulation of translation. Recent studies have unveiled that the f5C modifications may drive mitochondrial mRNA translation to power cancer metastasis. Accurate identification of f5C sites is essential for further unraveling their molecular functions and regulatory mechanisms, but there are currently no computational methods available for predicting their locations. In this study, we introduce an innovative ensemble approach, successfully enabling the computational recognition of Saccharomyces cerevisiae f5C. We conducted a comprehensive model selection process that involved multiple basic machine learning and deep learning algorithms such as recurrent neural networks, convolutional neural networks and Transformer-based models. Initially trained only on sequence information, these individual models achieved an AUROC ranging from 0.7104 to 0.7492. Through the integration of 32 novel domain-derived genomic features, the performance of individual models has significantly improved to an AUROC between 0.7309 and 0.8076. To further enhance accuracy and robustness, we then constructed the ensembles of these individual models with different combinations. The best performance attained by our ensemble models reached an AUROC of 0.8391. Shapley additive explanations were conducted to explain the significant contributions of genomic features, providing insights into the putative distribution of f5C across various topological regions and potentially paving the way for revealing their functional relevance within distinct genomic contexts. A freely accessible web server that allows real-time analysis of user-uploaded sites can be accessed at: www.rnamd.org/Resf5C-Pred.
Collapse
Affiliation(s)
- Jiaming Huang
- Jiangsu Key Laboratory for Functional Substance of Chinese Medicine, School of Pharmacy, Nanjing University of Chinese Medicine, Nanjing 210023, China
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Xuan Wang
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Rong Xia
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
- School of AI and Advanced Computing, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Dongqing Yang
- Department of Public Health, School of Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Jian Liu
- Jiangsu Key Laboratory for Functional Substance of Chinese Medicine, School of Pharmacy, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Qi Lv
- Jiangsu Key Laboratory for Functional Substance of Chinese Medicine, School of Pharmacy, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Xiaoxuan Yu
- Department of Pharmacology, School of Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Jia Meng
- Department of Biological Sciences, School of Science, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
- AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L7 8TX, United Kingdom
| | - Kunqi Chen
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, China
| | - Bowen Song
- Department of Public Health, School of Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Yue Wang
- Jiangsu Key Laboratory for Functional Substance of Chinese Medicine, School of Pharmacy, Nanjing University of Chinese Medicine, Nanjing 210023, China
| |
Collapse
|
2
|
Harigua-Souiai E, Masmoudi O, Makni S, Oualha R, Abdelkrim YZ, Hamdi S, Souiai O, Guizani I. cidalsDB: an AI-empowered platform for anti-pathogen therapeutics research. J Cheminform 2024; 16:134. [PMID: 39609715 PMCID: PMC11605991 DOI: 10.1186/s13321-024-00929-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Accepted: 11/11/2024] [Indexed: 11/30/2024] Open
Abstract
Computer-aided drug discovery (CADD) is nurtured by late advances in big data analytics and Artificial Intelligence (AI) towards enhanced drug discovery (DD) outcomes. In this context, reliable datasets are of utmost importance. We herein present CidalsDB a novel web server for AI-assisted DD against infectious pathogens, namely Leishmania parasites and Coronaviruses. We performed a literature search on molecules with validated anti-pathogen effects. Then, we consolidated these data with bioassays from PubChem. Finally, we constructed a database to store these datasets and make them accessible and ready-to-use for the scientific community through CidalsDB, a web-based interface. In a second step, we implemented and optimized four machine learning (ML) and three deep learning (DL) algorithms that optimally predicted the biological activity of molecules. Random Forests (RF), Multi-Layer Perceptron (MLP) and ChemBERTa were the best classifiers of anti-Leishmania molecules, while Gradient Boosting (GB), Graph-Convolutional Network (GCN) and ChemBERTa achieved the best performances on the Coronaviruses dataset. All six models were optimized and deployed through CidalsDB as anti-pathogen activity prediction models.Scientific contributionCidalsDB is an open access web-based tool that allows browsing and access to ready-to-use datasets of anti-pathogen molecules, alongside best performing AI models for biological activity prediction. It offers a democratized no-code platform for AI-based CADD, which shall foster innovation and collaboration within the DD community. CidalsDB is accessible through https://cidalsdb.streamlit.app/ .
Collapse
Affiliation(s)
- Emna Harigua-Souiai
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR16IPT04, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia.
| | - Ons Masmoudi
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR16IPT04, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia
| | - Samer Makni
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR16IPT04, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia
| | - Rafeh Oualha
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR16IPT04, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia
| | - Yosser Z Abdelkrim
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR16IPT04, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia
| | - Sara Hamdi
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR16IPT04, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia
| | - Oussama Souiai
- Laboratory of BioInformatics, BioMathematics and BioStatistics - LR20IPT09, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia
- Institut Supérieur des technologies médicales de Tunis, ISTMT, Université de Tunis El Manar, 9, Rue Docteur Zouheïr Safi, 1006, Tunis, Tunisia
| | - Ikram Guizani
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR16IPT04, Institut Pasteur de Tunis, Université de Tunis El Manar, 13, Place Pasteur, 1002, Tunis, Tunisia
| |
Collapse
|
3
|
Song Z, Chen G, Chen CYC. AI empowering traditional Chinese medicine? Chem Sci 2024; 15:d4sc04107k. [PMID: 39355231 PMCID: PMC11440359 DOI: 10.1039/d4sc04107k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 09/22/2024] [Indexed: 10/03/2024] Open
Abstract
For centuries, Traditional Chinese Medicine (TCM) has been a prominent treatment method in China, incorporating acupuncture, herbal remedies, massage, and dietary therapy to promote holistic health and healing. TCM has played a major role in drug discovery, with over 60% of small-molecule drugs approved by the FDA from 1981 to 2019 being derived from natural products. However, TCM modernization faces challenges such as data standardization and the complexity of TCM formulations. The establishment of comprehensive TCM databases has significantly improved the efficiency and accuracy of TCM research, enabling easier access to information on TCM ingredients and encouraging interdisciplinary collaborations. These databases have revolutionized TCM research, facilitating advancements in TCM modernization and patient care. In addition, advancements in AI algorithms and database data quality have accelerated progress in AI for TCM. The application of AI in TCM encompasses a wide range of areas, including herbal screening and new drug discovery, diagnostic and treatment principles, pharmacological mechanisms, network pharmacology, and the incorporation of innovative AI technologies. AI also has the potential to enable personalized medicine by identifying patterns and correlations in patient data, leading to more accurate diagnoses and tailored treatments. The potential benefits of AI for TCM are vast and diverse, promising continued progress and innovation in the field.
Collapse
Affiliation(s)
- Zhilin Song
- State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School Shenzhen Guangdong 518055 China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School Shenzhen Guangdong 518055 China
| | - Guanxing Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University Shenzhen Guangdong 518107 China
| | - Calvin Yu-Chian Chen
- State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School Shenzhen Guangdong 518055 China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School Shenzhen Guangdong 518055 China
- Department of Medical Research, China Medical University Hospital Taichung 40447 Taiwan
- Department of Bioinformatics and Medical Engineering, Asia University Taichung 41354 Taiwan
- Guangdong L-Med Biotechnology Co., Ltd Meizhou Guangdong 514699 China
| |
Collapse
|
4
|
Wei G, Wu N, Zhao K, Yang S, Wang L, Liu Y. DeepCheck: multitask learning aids in assessing microbial genome quality. Brief Bioinform 2024; 25:bbae539. [PMID: 39438078 PMCID: PMC11495869 DOI: 10.1093/bib/bbae539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/26/2024] [Accepted: 10/09/2024] [Indexed: 10/25/2024] Open
Abstract
Metagenomic analyses facilitate the exploration of the microbial world, advancing our understanding of microbial roles in ecological and biological processes. A pivotal aspect of metagenomic analysis involves assessing the quality of metagenome-assembled genomes (MAGs), crucial for accurate biological insights. Current machine learning-based methods often treat completeness and contamination prediction as separate tasks, overlooking their inherent relationship and limiting models' generalization. In this study, we present DeepCheck, a multitasking deep learning framework for simultaneous prediction of MAG completeness and contamination. DeepCheck consistently outperforms existing tools in accuracy across various experimental settings and demonstrates comparable speed while maintaining high predictive accuracy even for new lineages. Additionally, we employ interpretable machine learning techniques to identify specific genes and pathways that drive the model's predictions, enabling independent investigation and assessment of these biological elements for deeper insights.
Collapse
Affiliation(s)
- Guo Wei
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Nannan Wu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Kunyang Zhao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, 159 Panlong road, Xuanwu District, Nanjing 210000, China
| | - Long Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Yan Liu
- Department of Computer Science, Yangzhou University, 196 Huaxi Road, Hanjiang District, Yangzhou 225100, China
| |
Collapse
|
5
|
Ahmed W, Zaman S, Asif E, Ali K, Mahmoud EE, Asheboss MA. Exploring the role of topological descriptors to predict physicochemical properties of anti-HIV drugs by using supervised machine learning algorithms. BMC Chem 2024; 18:167. [PMID: 39267184 PMCID: PMC11395299 DOI: 10.1186/s13065-024-01266-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 08/12/2024] [Indexed: 09/14/2024] Open
Abstract
In order to explore the role of topological indices for predicting physio-chemical properties of anti-HIV drugs, this research uses python program-based algorithms to compute topological indices as well as machine learning algorithms. Degree-based topological indices are calculated using Python algorithm, providing important information about the structural behavior of drugs that are essential to their anti-HIV effectiveness. Furthermore, machine learning algorithms analyze the physio-chemical properties that correspond to anti-HIV activities, making use of their ability to identify complex trends in large, convoluted datasets. In addition to improving our comprehension of the links between molecular structure and effectiveness, the collaboration between machine learning and QSPR research further highlights the potential of computational approaches in drug discovery. This work reveals the mechanisms underlying anti-HIV effectiveness, which paves the way for the development of more potent anti-HIV drugs. This work reveals the mechanisms underlying anti-HIV efficiency, which paves the way for the development of more potent anti-HIV drugs which demonstrates the invaluable advantages of machine learning in assessing drug properties by clarifying the biological processes underlying anti-HIV behavior, which paves the way for the design and development of more effective anti-HIV drugs.
Collapse
Affiliation(s)
- Wakeel Ahmed
- Department of Mathematics, University of Sialkot, Sialkot, 51310, Pakistan.
- Department of Mathematics, COMSATS University, Islamabad Lahore Campus, Lahore, 51000, Pakistan.
| | - Shahid Zaman
- Department of Mathematics, University of Sialkot, Sialkot, 51310, Pakistan
- Department of Mathematical and Physical Sciences, University of Nizwa, Nizwa, Oman
| | - Eizzah Asif
- Department of Mathematics, University of Sialkot, Sialkot, 51310, Pakistan
| | - Kashif Ali
- Department of Mathematics, COMSATS University, Islamabad Lahore Campus, Lahore, 51000, Pakistan
| | - Emad E Mahmoud
- Department of Mathematics and Statistics, Collage of Science, Taif University, P.O. Box 11099, 21944, Taif, Saudi Arabia
| | | |
Collapse
|
6
|
Li XL, Zhang JQ, Shen XJ, Zhang Y, Guo DA. Overview and limitations of database in global traditional medicines: A narrative review. Acta Pharmacol Sin 2024:10.1038/s41401-024-01353-1. [PMID: 39095509 DOI: 10.1038/s41401-024-01353-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 07/02/2024] [Indexed: 08/04/2024] Open
Abstract
The study of traditional medicine has garnered significant interest, resulting in various research areas including chemical composition analysis, pharmacological research, clinical application, and quality control. The abundance of available data has made databases increasingly essential for researchers to manage the vast amount of information and explore new drugs. In this article we provide a comprehensive overview and summary of 182 databases that are relevant to traditional medicine research, including 73 databases for chemical component analysis, 70 for pharmacology research, and 39 for clinical application and quality control from published literature (2000-2023). The review categorizes the databases by functionality, offering detailed information on websites and capacities to facilitate easier access. Moreover, this article outlines the primary function of each database, supplemented by case studies to aid in database selection. A practical test was conducted on 68 frequently used databases using keywords and functionalities, resulting in the identification of highlighted databases. This review serves as a reference for traditional medicine researchers to choose appropriate databases and also provides insights and considerations for the function and content design of future databases.
Collapse
Affiliation(s)
- Xiao-Lan Li
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jian-Qing Zhang
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
| | - Xuan-Jing Shen
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yu Zhang
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - De-An Guo
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
7
|
Oniani D, Hilsman J, Zang C, Wang J, Cai L, Zawala J, Wang Y. Emerging opportunities of using large language models for translation between drug molecules and indications. Sci Rep 2024; 14:10738. [PMID: 38730226 PMCID: PMC11087469 DOI: 10.1038/s41598-024-61124-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 05/02/2024] [Indexed: 05/12/2024] Open
Abstract
A drug molecule is a substance that changes an organism's mental or physical state. Every approved drug has an indication, which refers to the therapeutic use of that drug for treating a particular medical condition. While the Large Language Model (LLM), a generative Artificial Intelligence (AI) technique, has recently demonstrated effectiveness in translating between molecules and their textual descriptions, there remains a gap in research regarding their application in facilitating the translation between drug molecules and indications (which describes the disease, condition or symptoms for which the drug is used), or vice versa. Addressing this challenge could greatly benefit the drug discovery process. The capability of generating a drug from a given indication would allow for the discovery of drugs targeting specific diseases or targets and ultimately provide patients with better treatments. In this paper, we first propose a new task, the translation between drug molecules and corresponding indications, and then test existing LLMs on this new task. Specifically, we consider nine variations of the T5 LLM and evaluate them on two public datasets obtained from ChEMBL and DrugBank. Our experiments show the early results of using LLMs for this task and provide a perspective on the state-of-the-art. We also emphasize the current limitations and discuss future work that has the potential to improve the performance on this task. The creation of molecules from indications, or vice versa, will allow for more efficient targeting of diseases and significantly reduce the cost of drug discovery, with the potential to revolutionize the field of drug discovery in the era of generative AI.
Collapse
Affiliation(s)
- David Oniani
- Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA, USA
| | - Jordan Hilsman
- Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA, USA
| | - Chengxi Zang
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, New York, NY, USA
| | - Junmei Wang
- Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Lianjin Cai
- Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Jan Zawala
- Jerzy Haber Institute of Catalysis and Surface Chemistry, Polish Academy of Sciences, Kraków, Poland
| | - Yanshan Wang
- Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA, USA.
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA.
- Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA.
- Clinical and Translational Science Institute, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
8
|
Li Z, Zhao H, Zhu G, Du J, Wu Z, Jiang Z, Li Y. Classification method of traditional Chinese medicine compound decoction duration based on multi-dimensional feature weighted fusion. Comput Methods Biomech Biomed Engin 2024:1-15. [PMID: 38193238 DOI: 10.1080/10255842.2024.2302225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 12/10/2023] [Indexed: 01/10/2024]
Abstract
This paper extends a text classification method utilizing natural language processing (NLP) into the field of traditional Chinese medicine (TCM) compound decoction to effectively and scientifically extend the TCM compound decoction duration. Specifically, a TCM compound decoction duration classification named TCM-TextCNN is proposed to fuse multi-dimensional herb features and improve TextCNN. Indeed, first, we utilize word vector technology to construct feature vectors of herb names and medicinal parts, aiming to describe the herb characteristics comprehensively. Second, considering the impact of different herb features on the decoction duration, we use an improved Term Frequency-Inverse Word Frequency (TF-IWF) algorithm to weigh the feature vectors of herb names and medicinal parts. These weighted feature vectors are then concatenated to obtain a multi-dimensional herb feature vector, allowing for a more comprehensive representation. Finally, the feature vector is input into the improved TextCNN, which uses k-max pooling to reduce information loss rather than max pooling. Three fully connected layers are added to generate higher-level feature representations, followed by softmax to obtain the final results. Experimental results on a dataset of TCM compound decoction duration demonstrate that TCM-TextCNN improves accuracy, recall, and F1 score by 5.31%, 5.63%, and 5.22%, respectively, compared to methods solely rely on herb name features, thereby confirming our method's effectiveness in classifying TCM compound decoction duration.
Collapse
Affiliation(s)
- Zhibiao Li
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
- Ganjiang New Area Zhiyao Shanhe Technology Co., Ltd, Nanchang, Jiangxi, China
- Key Laboratory of Artificial Intelligence in Chinese Medicine, Nanchang, Jiangxi, China
| | - Huayong Zhao
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Genhua Zhu
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Jianqiang Du
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Zhenfeng Wu
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
- Ganjiang New Area Zhiyao Shanhe Technology Co., Ltd, Nanchang, Jiangxi, China
| | - Zhicheng Jiang
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Yiwen Li
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| |
Collapse
|
9
|
Li M, Zhang J. A focus on harnessing big data and artificial intelligence: revolutionizing drug discovery from traditional Chinese medicine sources. Chem Sci 2023; 14:10628-10630. [PMID: 37829008 PMCID: PMC10566454 DOI: 10.1039/d3sc90185h] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2023] Open
Abstract
The advent of big data-driven artificial intelligence (AI) modeling has profoundly impacted the realm of drug discovery. Chen et al. (Q. Lv et al., Chem. Sci., 2023, https://doi.org/10.1039/D3SC02139D) have paved a way for modern drug discovery from traditional Chinese medicine (TCM) sources through their efforts over the past decade. They achieved this by creating TCMBank, the most extensive systematic central resource for TCM, which integrates standardized TCM-related big data and streamlines the AI-based drug discovery process.
Collapse
Affiliation(s)
- Mingyu Li
- State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine Shanghai China
| | - Jian Zhang
- State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine Shanghai China
- Medicinal Chemistry and Bioinformatics Center, Shanghai Jiao Tong University School of Medicine Shanghai 200025 China
| |
Collapse
|