1
|
Ren Z, Ren Y, Li Z, Xu H. TCMM: A unified database for traditional Chinese medicine modernization and therapeutic innovations. Comput Struct Biotechnol J 2024; 23:1619-1630. [PMID: 38680873 PMCID: PMC11047297 DOI: 10.1016/j.csbj.2024.04.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 03/30/2024] [Accepted: 04/09/2024] [Indexed: 05/01/2024] Open
Abstract
Mining the potential of traditional Chinese medicine (TCM) in treating modern diseases requires a profound understanding of its action mechanism and a comprehensive knowledge system that seamlessly bridges modern medical insights with traditional theories. However, existing databases for modernizing TCM are plagued by varying degrees of information loss, which impede the multidimensional dissection of pharmacological effects. To address this challenge, we introduce traditional Chinese medicine modernization (TCMM), the currently largest modernized TCM database that integrates pioneering intelligent pipelines. By aligning high-quality TCM and modern medicine data, TCMM boasts the most extensive TCM modernization knowledge, including 20 types of modernized TCM concepts such as prescription, ingredient, target and 46 biological relations among them, totaling 3,447,023 records. We demonstrate the efficacy and reliability of TCMM with two features, prescription generation and knowledge discovery, the outcomes show consistency with biological experimental results. A publicly available web interface is at https://www.tcmm.net.cn/.
Collapse
Affiliation(s)
- Zhixiang Ren
- Peng Cheng Laboratory, Shenzhen, 518055, Guangdong Province, China
| | - Yiming Ren
- Peng Cheng Laboratory, Shenzhen, 518055, Guangdong Province, China
| | - Zeting Li
- Peng Cheng Laboratory, Shenzhen, 518055, Guangdong Province, China
| | - Huan Xu
- School of Public Health, Anhui University of Science and Technology, Hefei, 231131, Anhui Province, China
| |
Collapse
|
2
|
Tao Q, Liao J, Zhang E, Li L. A Dual Robust Graph Neural Network Against Graph Adversarial Attacks. Neural Netw 2024; 175:106276. [PMID: 38599138 DOI: 10.1016/j.neunet.2024.106276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 12/18/2023] [Accepted: 03/25/2024] [Indexed: 04/12/2024]
Abstract
Graph Neural Networks (GNNs) have gained widespread usage and achieved remarkable success in various real-world applications. Nevertheless, recent studies reveal the vulnerability of GNNs to graph adversarial attacks that fool them by modifying graph structure. This vulnerability undermines the robustness of GNNs and poses significant security and privacy risks across various applications. Hence, it is crucial to develop robust GNN models that can effectively defend against such attacks. One simple approach is to remodel the graph. However, most existing methods cannot fully preserve the similarity relationship among the original nodes while learning the node representation required for reweighting the edges. Furthermore, they lack supervision information regarding adversarial perturbations, hampering their ability to recognize adversarial edges. To address these limitations, we propose a novel Dual Robust Graph Neural Network (DualRGNN) against graph adversarial attacks. DualRGNN first incorporates a node-similarity-preserving graph refining (SPGR) module to prune and refine the graph based on the learned node representations, which contain the original nodes' similarity relationships, weakening the poisoning of graph adversarial attacks on graph data. DualRGNN then employs an adversarial-supervised graph attention (ASGAT) network to enhance the model's capability in identifying adversarial edges by treating these edges as supervised signals. Through extensive experiments conducted on four benchmark datasets, DualRGNN has demonstrated remarkable robustness against various graph adversarial attacks.
Collapse
Affiliation(s)
- Qian Tao
- School of Software, South China University of Technology, Guangzhou, Guangdong, 510006, China; Pazhou Lab, Guangzhou, Guangdong, 510006, China.
| | - Jianpeng Liao
- School of Software, South China University of Technology, Guangzhou, Guangdong, 510006, China.
| | - Enze Zhang
- School of Software, South China University of Technology, Guangzhou, Guangdong, 510006, China.
| | - Lusi Li
- Department of Computer Science, Old Dominion University, Norfolk, VA, 23529, USA.
| |
Collapse
|
3
|
Dong X, Zhao C, Song X, Zhang L, Liu Y, Wu J, Xu Y, Xu N, Liu J, Yu H, Yang K, Zhou X. PresRecST: a novel herbal prescription recommendation algorithm for real-world patients with integration of syndrome differentiation and treatment planning. J Am Med Inform Assoc 2024; 31:1268-1279. [PMID: 38598532 PMCID: PMC11105127 DOI: 10.1093/jamia/ocae066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/21/2024] [Accepted: 03/13/2024] [Indexed: 04/12/2024] Open
Abstract
OBJECTIVES Herbal prescription recommendation (HPR) is a hot topic and challenging issue in field of clinical decision support of traditional Chinese medicine (TCM). However, almost all previous HPR methods have not adhered to the clinical principles of syndrome differentiation and treatment planning of TCM, which has resulted in suboptimal performance and difficulties in application to real-world clinical scenarios. MATERIALS AND METHODS We emphasize the synergy among diagnosis and treatment procedure in real-world TCM clinical settings to propose the PresRecST model, which effectively combines the key components of symptom collection, syndrome differentiation, treatment method determination, and herb recommendation. This model integrates a self-curated TCM knowledge graph to learn the high-quality representations of TCM biomedical entities and performs 3 stages of clinical predictions to meet the principle of systematic sequential procedure of TCM decision making. RESULTS To address the limitations of previous datasets, we constructed the TCM-Lung dataset, which is suitable for the simultaneous training of the syndrome differentiation, treatment method determination, and herb recommendation. Overall experimental results on 2 datasets demonstrate that the proposed PresRecST outperforms the state-of-the-art algorithm by significant improvements (eg, improvements of P@5 by 4.70%, P@10 by 5.37%, P@20 by 3.08% compared with the best baseline). DISCUSSION The workflow of PresRecST effectively integrates the embedding vectors of the knowledge graph for progressive recommendation tasks, and it closely aligns with the actual diagnostic and treatment procedures followed by TCM doctors. A series of ablation experiments and case study show the availability and interpretability of PresRecST, indicating the proposed PresRecST can be beneficial for assisting the diagnosis and treatment in real-world TCM clinical settings. CONCLUSION Our technology can be applied in a progressive recommendation scenario, providing recommendations for related items in a progressive manner, which can assist in providing more reliable diagnoses and herbal therapies for TCM clinical task.
Collapse
Affiliation(s)
- Xin Dong
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Chenxi Zhao
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Xinpeng Song
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Lei Zhang
- National Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Yu Liu
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Jun Wu
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Yiran Xu
- Department of Computer Science, Cornell University, New York, NY 14853, United States
| | - Ning Xu
- National Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Jialing Liu
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Haibin Yu
- The First Affiliated Hospital, Henan University of Chinese Medicine, Zhengzhou 450000, China
- Collaborative Innovation Center for Chinese Medicine and Respiratory Diseases Co-Constructed by Henan Province & Education Ministry of P.R. China, Henan University of Chinese Medicine, Zhengzhou 450046, China
| | - Kuo Yang
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| | - Xuezhong Zhou
- Beijing Key Lab of Traffic Data Analysis and Mining, Institute of Medical Intelligence, School of Computer Science & Technology, Beijing Jiaotong University, Beijing 100044, China
| |
Collapse
|
4
|
Pan D, Guo Y, Fan Y, Wan H. Development and Application of Traditional Chinese Medicine Using AI Machine Learning and Deep Learning Strategies. THE AMERICAN JOURNAL OF CHINESE MEDICINE 2024; 52:605-623. [PMID: 38715181 DOI: 10.1142/s0192415x24500265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
Traditional Chinese medicine (TCM) has been used for thousands of years and has been proven to be effective at treating many complicated illnesses with minimal side effects. The application and advancement of TCM are, however, constrained by the absence of objective measuring standards due to its relatively abstract diagnostic methods and syndrome differentiation theories. Ongoing developments in machine learning (ML) and deep learning (DL), specifically in computer vision (CV) and natural language processing (NLP), offer novel opportunities to modernize TCM by exploring the profound connotations of its theory. This review begins with an overview of the ML and DL methods employed in TCM; this is followed by practical instances of these applications. Furthermore, extensive discussions emphasize the mature integration of ML and DL in TCM, such as tongue diagnosis, pulse diagnosis, and syndrome differentiation treatment, highlighting their early successful application in the TCM field. Finally, this study validates the accomplishments and addresses the problems and challenges posed by the application and development of TCM powered by ML and DL. As ML and DL techniques continue to evolve, modern technology will spark new advances in TCM.
Collapse
Affiliation(s)
- Danping Pan
- School of Basic Medicine Sciences, Zhejiang Chinese Medical University, Hangzhou 310053, P. R. China
| | - Yilei Guo
- School of Basic Medicine Sciences, Zhejiang Chinese Medical University, Hangzhou 310053, P. R. China
| | - Yongfu Fan
- School of Basic Medicine Sciences, Zhejiang Chinese Medical University, Hangzhou 310053, P. R. China
| | - Haitong Wan
- School of Basic Medicine Sciences, Zhejiang Chinese Medical University, Hangzhou 310053, P. R. China
| |
Collapse
|
5
|
Zhou Y, Zhu C, Zhu W, Li H. SCMEA: A stacked co-enhanced model for entity alignment based on multi-aspect information fusion and bidirectional contrastive learning. Neural Netw 2024; 173:106178. [PMID: 38367354 DOI: 10.1016/j.neunet.2024.106178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 10/31/2023] [Accepted: 02/13/2024] [Indexed: 02/19/2024]
Abstract
Entity alignment refers to discovering the entity pairs with the same realistic meaning in different knowledge graphs. This technology is of great significance for completing and fusing knowledge graphs. Recently, methods based on knowledge representation learning have achieved remarkable achievements in entity alignment. However, most existing approaches do not mine hidden information in the knowledge graph as much as possible. This paper suggests SCMEA, a novel cross-lingual entity alignment framework based on multi-aspect information fusion and bidirectional contrastive learning. SCMEA initially adopts diverse representation learning models to embed multi-aspect information of entities and integrates them into a unified embedding space with an adaptive weighted mechanism to overcome the missing information and the problem of different-aspect information are not uniform. Then, we propose a stacked relation-entity co-enhanced model to further improve the representations of entities, wherein relation representation is modeled using an Entity Collector with Global Entity Attention. Finally, a combined loss function based on improved bidirectional contrastive learning is introduced to optimize model parameters and entity representation, effectively mitigating the hubness problem and accelerating model convergence. We conduct extensive experiments to evaluate the alignment performance of SCMEA. The overall experimental results, ablation studies, and analysis performed on five cross-lingual datasets demonstrate that our model achieves varying degrees of performance improvement and verifies the effectiveness and robustness of the model.
Collapse
Affiliation(s)
- Yunfeng Zhou
- Faculty of Information Technology, Beijing University of Technology, Beijing, 100020, China.
| | - Cui Zhu
- Faculty of Information Technology, Beijing University of Technology, Beijing, 100020, China.
| | - Wenjun Zhu
- Faculty of Information Technology, Beijing University of Technology, Beijing, 100020, China.
| | - Hongyang Li
- Faculty of Information Technology, Beijing University of Technology, Beijing, 100020, China.
| |
Collapse
|
6
|
Zeng X, Liu Y, Zhang J, Guo Y. Medical object detector jointly driven by knowledge and data. Neural Netw 2024; 172:106084. [PMID: 38183830 DOI: 10.1016/j.neunet.2023.12.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 11/15/2023] [Accepted: 12/22/2023] [Indexed: 01/08/2024]
Abstract
Most of the existing object detection algorithms are trained on medical datasets and then used for prediction. When the features of an object are not obvious in an image, these models are prone to mislocalize and misclassify it. In this paper, we propose a medical Object Detection algorithm jointly driven by Knowledge and Data (ODKD). It enables medical semantic knowledge provided by specialized physicians to be effective and helpful when traditional models have difficulty in correctly detecting objects relying on features alone. Our model consists of a base object detector together with a fusion module: the base object detector is trained based on medical datasets to obtain data-driven results; then we use a graph to represent external semantic knowledge and map the data-driven results to the nodes embedding of this graph structure. In the fusion module, a graph convolution network is used to fuse the data-driven results with the external semantic knowledge to output category adjustment coefficients. Finally, the adjustment coefficients are used to adjust the data-driven results to obtain results jointly driven by knowledge and data. Experiments show that professional medical semantic knowledge can effectively correct the erroneous results of the base detector, and the effect of our model outperforms Faster Rcnn, YOLOv5, YOLOv7, etc. on three medical datasets, Camus, Synapse, and AMOS.
Collapse
Affiliation(s)
- Xianhua Zeng
- School of Computer Science and Technology/School of Artificial Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| | - Yuhang Liu
- School of Computer Science and Technology/School of Artificial Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| | - Jian Zhang
- School of Computer Science and Technology/School of Artificial Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| | - Yongli Guo
- School of Computer Science and Technology/School of Artificial Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| |
Collapse
|
7
|
Jin Y, Ji W, Shi Y, Wang X, Yang X. Meta-path guided graph attention network for explainable herb recommendation. Health Inf Sci Syst 2023; 11:5. [PMID: 36660407 PMCID: PMC9847457 DOI: 10.1007/s13755-022-00207-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 12/16/2022] [Indexed: 01/20/2023] Open
Abstract
Traditional Chinese Medicine (TCM) has been widely adopted in clinical practice by Eastern Asia people for thousands of years. Nowadays, TCM still plays a critical role in Chinese society and receives increasing attention worldwide. The existing herb recommenders learn the complex relations between symptoms and herbs by mining the TCM prescriptions. Given a set of symptoms, they will provide a set of herbs and explanations from the TCM theory. However, the foundation of TCM is Yinyangism (i.e. the combination of Five Phases theory with Yin-yang theory), which is very different from modern medicine philosophy. Only recommending herbs from the TCM theory aspect largely prevents TCM from modern medical treatment. As TCM and modern medicine share a common view at the molecular level, it is necessary to integrate the ancient practice of TCM and standards of modern medicine. In this paper, we explore the underlying action mechanisms of herbs from both TCM and modern medicine, and propose a Meta-path guided Graph Attention Network (MGAT) to provide the explainable herb recommendations. Technically, to translate TCM from an experience-based medicine to an evidence-based medicine system, we incorporate the pharmacology knowledge of modern Chinese medicine with the TCM knowledge. We design a meta-path guided information propagation scheme based on the extended knowledge graph, which combines information propagation and decision process. This scheme adopts meta-paths (predefined relation sequences) to guide neighbor selection in the propagation process. Furthermore, the attention mechanism is utilized in aggregation to help distinguish the salience of different paths connecting a symptom with a herb. In this way, our model can distill the long-range semantics along meta-paths and generate fine-grained explanations. We conduct extensive experiments on a public TCM dataset, demonstrating comparable performance to the state-of-the-art herb recommendation models and the strong explainability.
Collapse
Affiliation(s)
- Yuanyuan Jin
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, China
- Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai, China
| | - Wendi Ji
- Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai, China
| | - Yao Shi
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, China
| | - Xiaoling Wang
- Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai, China
- Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai, China
| | - Xiaochun Yang
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| |
Collapse
|
8
|
Zhang P, Zhang D, Zhou W, Wang L, Wang B, Zhang T, Li S. Network pharmacology: towards the artificial intelligence-based precision traditional Chinese medicine. Brief Bioinform 2023; 25:bbad518. [PMID: 38197310 PMCID: PMC10777171 DOI: 10.1093/bib/bbad518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Revised: 11/03/2023] [Accepted: 11/30/2023] [Indexed: 01/11/2024] Open
Abstract
Network pharmacology (NP) provides a new methodological perspective for understanding traditional medicine from a holistic perspective, giving rise to frontiers such as traditional Chinese medicine network pharmacology (TCM-NP). With the development of artificial intelligence (AI) technology, it is key for NP to develop network-based AI methods to reveal the treatment mechanism of complex diseases from massive omics data. In this review, focusing on the TCM-NP, we summarize involved AI methods into three categories: network relationship mining, network target positioning and network target navigating, and present the typical application of TCM-NP in uncovering biological basis and clinical value of Cold/Hot syndromes. Collectively, our review provides researchers with an innovative overview of the methodological progress of NP and its application in TCM from the AI perspective.
Collapse
Affiliation(s)
- Peng Zhang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Dingfan Zhang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Wuai Zhou
- China Mobile Information System Integration Co., Ltd, Beijing 100032, China
| | - Lan Wang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Boyang Wang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Tingyu Zhang
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| | - Shao Li
- Institute for TCM-X, MOE Key Laboratory of Bioinformatics/Bioinformatics Division, BNRIST, Department of Automation, Tsinghua University, Beijing 100084, China
| |
Collapse
|
9
|
Ding J, Qiao Y, Zhang L. Plant disease prescription recommendation based on electronic medical records and sentence embedding retrieval. PLANT METHODS 2023; 19:91. [PMID: 37633904 PMCID: PMC10463767 DOI: 10.1186/s13007-023-01070-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 08/11/2023] [Indexed: 08/28/2023]
Abstract
BACKGROUND In the era of Agri 4.0 and the popularity of Plantwise systems, the availability of Plant Electronic Medical Records has provided opportunities to extract valuable disease information and treatment knowledge. However, developing an effective prescription recommendation method based on these records presents unique challenges, such as inadequate labeling data, lack of structural and linguistic specifications, incorporation of new prescriptions, and consideration of multiple factors in practical situations. RESULTS This study proposes a plant disease prescription recommendation method called PRSER, which is based on sentence embedding retrieval. The semantic matching model is created using a pre-trained language model and a sentence embedding method with contrast learning ideas, and the constructed prescription reference database is retrieved for optimal prescription recommendations. A multi-vegetable disease dataset and a multi-fruit disease dataset are constructed to compare three pre-trained language models, four pooling types, and two loss functions. The PRSER model achieves the best semantic matching performance by combining MacBERT, CoSENT, and CLS pooling, resulting in a Pearson coefficient of 86.34% and a Spearman coefficient of 77.67%. The prescription recommendation capability of the model is also verified. PRSER performs well in closed-set testing with Top-1/Top-3/Top-5 accuracy of 88.20%/96.07%/97.70%; and slightly worse in open-set testing with Top-1/Top-3/Top-5 accuracy of 82.04%/91.50%/94.90%. Finally, a plant disease prescription recommendation system for mobile terminals is constructed and its generalization ability with incomplete inputs is verified. When only symptom information is available without environment and plant information, our model shows slightly lower accuracy with Top-1/Top-3/Top-5 accuracy of 75.24%/88.35%/91.99% in closed-set testing and Top-1/Top-3/Top-5 accuracy of 75.08%/87.54%/89.84% in open-set testing. CONCLUSIONS The experiments validate the effectiveness and generalization ability of the proposed approach for recommending plant disease prescriptions. This research has significant potential to facilitate the implementation of artificial intelligence in plant disease treatment, addressing the needs of farmers and advancing scientific plant disease management.
Collapse
Affiliation(s)
- Junqi Ding
- China Agricultural University, Beijing, 100083, China
| | - Yan Qiao
- Beijing Plant Protection Station, Beijing, 100029, China
| | - Lingxian Zhang
- China Agricultural University, Beijing, 100083, China.
- Key Laboratory of Agricultural Informationization Standardization, Ministry of Agriculture and Rural Affairs, Beijing, China.
- College of Information and Electrical Engineering, China Agricultural University, 209# No.17 Qinghua Donglu, Haidian District, Beijing, 100083, China.
| |
Collapse
|
10
|
An effective knowledge graph entity alignment model based on multiple information. Neural Netw 2023; 162:83-98. [PMID: 36893693 DOI: 10.1016/j.neunet.2023.02.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 01/09/2023] [Accepted: 02/20/2023] [Indexed: 03/09/2023]
Abstract
Entity alignment refers to matching entities with the same realistic meaning in different knowledge graphs. The structure of a knowledge graph provides the global signal for entity alignment. But in the real world, a knowledge graph provides insufficient structural information in general. Moreover, the problem of knowledge graph heterogeneity is common. The semantic and string information can alleviate the problems caused by the sparse and heterogeneous nature of knowledge graphs, yet both of them have not been fully utilized by most existing work. Therefore, we propose an entity alignment model based on multiple information (EAMI), which employs structural, semantic and string information. EAMI learns the structural representation of a knowledge graph by using multi-layer graph convolutional networks. To acquire more accurate entity vector representation, we incorporate the attribute semantic representation into the structural representation. In addition, to further improve entity alignment, we study the entity name string information. There is no training required to calculate the similarity of entity names. Our model is tested on publicly available cross-lingual datasets and cross-resource datasets, and the experimental results demonstrate the effectiveness of our model.
Collapse
|
11
|
TCM Prescription Generation via Knowledge Source Guidance Network Combined with Herbal Candidate Mechanism. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2023; 2023:3301605. [PMID: 36643583 PMCID: PMC9836810 DOI: 10.1155/2023/3301605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 11/27/2022] [Accepted: 12/24/2022] [Indexed: 01/07/2023]
Abstract
Traditional Chinese medicine (TCM) prescriptions have made great contributions to the treatment of diseases and health preservation. To alleviate the shortage of TCM resources and improve the professionalism of automatically generated prescriptions, this paper deeply explores the connection between symptoms and herbs through deep learning technology, and realizes the automatic generation of TCM prescriptions. Particularly, this paper considers the significance of referring to similar underlying prescriptions as herbal candidates in the TCM prescribing process. Moreover, this paper incorporates the idea of referring to the potential guidance information of corresponding prescriptions when model extracts symptoms representations. To provide a reference for inexperienced young TCM doctors when they prescribe, this paper proposes a dual-branch guidance strategy combined with candidate attention model (DGSCAM) to automatically generate TCM prescriptions based on symptoms text. The format of the data used this paper is the "symptoms-prescription" data pair. The specific method is as follows. First, DGSCAM realizes the extraction of key information of prescription-guided symptoms through a dual-branch network. Then, herbal candidates in the form of prescriptions that can treat symptoms are proposed in view of the compatibility knowledge of TCM prescriptions. To our knowledge, this is the first attempt to use prescriptions as herbal candidates in the prescription generation process. We conduct extensive experiments on a mixed public and clinical dataset, achieving 37.39% precision, 25.04% recall, and 29.99% F1 score, with an average doctor score of 7.7 out of 10. The experimental results show that our proposed model is valid and can generate more specialized TCM prescriptions than the baseline models. The DGSCAM developed by us has broad application scenarios and greatly promotes the research on intelligent TCM prescribing.
Collapse
|
12
|
Yang Y, Sun Y, Ju F, Wang S, Gao J, Yin B. Multi-graph Fusion Graph Convolutional Networks with pseudo-label supervision. Neural Netw 2023; 158:305-317. [PMID: 36493533 DOI: 10.1016/j.neunet.2022.11.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 09/13/2022] [Accepted: 11/21/2022] [Indexed: 11/29/2022]
Abstract
Graph convolutional networks (GCNs) have become a popular tool for learning unstructured graph data due to their powerful learning ability. Many researchers have been interested in fusing topological structures and node features to extract the correlation information for classification tasks. However, it is inadequate to integrate the embedding from topology and feature spaces to gain the most correlated information. At the same time, most GCN-based methods assume that the topology graph or feature graph is compatible with the properties of GCNs, but this is usually not satisfied since meaningless, missing, or even unreal edges are very common in actual graphs. To obtain a more robust and accurate graph structure, we intend to construct an adaptive graph with topology and feature graphs. We propose Multi-graph Fusion Graph Convolutional Networks with pseudo-label supervision (MFGCN), which learn a connected embedding by fusing the multi-graphs and node features. We can obtain the final node embedding for semi-supervised node classification by propagating node features over multi-graphs. Furthermore, to alleviate the problem of labels missing in semi-supervised classification, a pseudo-label generation mechanism is proposed to generate more reliable pseudo-labels based on the similarity of node features. Extensive experiments on six benchmark datasets demonstrate the superiority of MFGCN over state-of-the-art classification methods.
Collapse
Affiliation(s)
- Yachao Yang
- Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
| | - Yanfeng Sun
- Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China.
| | - Fujiao Ju
- Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
| | - Shaofan Wang
- Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
| | - Junbin Gao
- Discipline of Business Analytics, The University of Sydney Business School, The University of Sydney, NSW 2006, Australia
| | - Baocai Yin
- Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
| |
Collapse
|
13
|
YAN J, WEN Z, ZOU B. Heterogeneous graph construction and node representation learning method of Treatise on Febrile Diseases based on graph convolutional network. DIGITAL CHINESE MEDICINE 2022. [DOI: 10.1016/j.dcmed.2022.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023] Open
|
14
|
Liu J, Huang Q, Yang X, Ding C. HPE-GCN: predicting efficacy of tonic formulae via graph convolutional networks integrating traditionally defined herbal properties. Methods 2022; 204:101-109. [PMID: 35597515 DOI: 10.1016/j.ymeth.2022.05.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 05/04/2022] [Accepted: 05/16/2022] [Indexed: 11/29/2022] Open
Abstract
Chinese herbal formulae are the heritage of traditional Chinese medicine (TCM) in treating diseases through thousands of years. The formula function is not just a simple herbal efficacy addition, but produces complex and nonlinear relationships between different herbs and their overall efficacy, which brings challenges to the formula efficacy analysis. In our study, we proposed a model called HPE-GCN that combines graph convolutional networks (GCN) with TCM-defined herbal properties (TCM-HPs) to predict formulae efficacy. In addition, to process the unstructured natural language in the formula text, we proposed a weighting calculation method related to herb frequency and the number of herbs in a formula called Formula-Herb dependence degree (FHDD), to assess the dependency degree of a formula with its herbs. In our research, 214 classic tonic formulae from ancient TCM books such as Synopsis of the Golden Chamber, Jingyue's Complete Works and the Golden Mirror of Medicin were collected as datasets. The performance of HPE-GCN on multi-classification of tonic formulae reached the best result compared with classic machine learning models, such as support vector machine, naive Bayes, logistic regression, gradient boosting decision tree, and K-nearest neighbors. The evaluated index Macro-Precision, Macro-Recall, Macro-F1 of HPE-GCN on the test set were 87.70%, 84.08% and 83.51% respectively, increased by 7.27%, 7.41% and 7.30% respectively from second best compared models. GCN has the advantage of low-dimensional feature expression for herbs and formulae, and is an effective analysis tool for TCM research. HPE-GCN integrates TCM-HPs and fits the complex nonlinear mapping relationship between TCM-HPs and formulae efficacy, which provides new ideas for related research.
Collapse
Affiliation(s)
- Jiajun Liu
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China
| | - Qunfu Huang
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China
| | - Xiaoyan Yang
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China
| | - Changsong Ding
- School of Informatics, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China; Big Data Analysis Laboratory of Traditional Chinese Medicine, Hunan University of Chinese Medicine, Changsha, Hunan, 410208, China.
| |
Collapse
|