1
|
Yang Y, Zheng Z, Xu Y, Wei H, Yan W. BioGSF: a graph-driven semantic feature integration framework for biomedical relation extraction. Brief Bioinform 2024; 26:bbaf025. [PMID: 39853110 PMCID: PMC11759886 DOI: 10.1093/bib/bbaf025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2024] [Revised: 12/24/2024] [Accepted: 01/09/2025] [Indexed: 01/26/2025] Open
Abstract
The automatic and accurate extraction of diverse biomedical relations from literature constitutes the core elements of medical knowledge graphs, which are indispensable for healthcare artificial intelligence. Currently, fine-tuning through stacking various neural networks on pre-trained language models (PLMs) represents a common framework for end-to-end resolution of the biomedical relation extraction (RE) problem. Nevertheless, sequence-based PLMs, to a certain extent, fail to fully exploit the connections between semantics and the topological features formed by these connections. In this study, we presented a graph-driven framework named BioGSF for RE from the literature by integrating shortest dependency paths (SDP) with entity-pair graph through the employment of the graph neural network model. Initially, we leveraged dependency relationships to obtain the SDP between entities and incorporated this information into the entity-pair graph. Subsequently, the graph attention network was utilized to acquire the topological information of the entity-pair graph. Ultimately, the obtained topological information was combined with the semantic features of the contextual information for relation classification. Our method was evaluated on two distinct datasets, namely S4 and BioRED. The outcomes reveal that BioGSF not only attains the superior performance among previous models with a micro-F1 score of 96.68% (S4) and 96.03% (BioRED), but also demands the shortest running times. BioGSF emerges as an efficient framework for biomedical RE.
Collapse
Affiliation(s)
- Yang Yang
- Computing Science and Artificial Intelligence College, Suzhou City University, No. 1188 Wuzhong Avenue, Wuzhong District Suzhou, Suzhou 215004, China
- Suzhou Key Lab of Multi-modal Data Fusion and Intelligent Healthcare, No. 1188 Wuzhong Avenue, Wuzhong District Suzhou, Suzhou 215004, China
- School of Computer Science & Technology, Soochow University, No. 1 Shizi Street, Suzhou 215000, China
| | - Zixuan Zheng
- School of Computer Science & Technology, Soochow University, No. 1 Shizi Street, Suzhou 215000, China
| | - Yuyang Xu
- School of Computer Science & Technology, Soochow University, No. 1 Shizi Street, Suzhou 215000, China
| | - Huifang Wei
- School of Basic Medical Sciences, Suzhou Medical College of Soochow University, No. 199 Renai Road, SIP, Suzhou 215123, China
| | - Wenying Yan
- Suzhou Key Lab of Multi-modal Data Fusion and Intelligent Healthcare, No. 1188 Wuzhong Avenue, Wuzhong District Suzhou, Suzhou 215004, China
- School of Basic Medical Sciences, Suzhou Medical College of Soochow University, No. 199 Renai Road, SIP, Suzhou 215123, China
- Jiangsu Province Engineering Research Center of Precision Diagnostics and Therapeutics Development, Soochow University, No. 199 Renai Road, SIP, Suzhou 215123, China
| |
Collapse
|