1
|
Yang S, Zhang P, Che C, Zhong Z. B-LBConA: a medical entity disambiguation model based on Bio-LinkBERT and context-aware mechanism. BMC Bioinformatics 2023; 24:97. [PMID: 36927359 PMCID: PMC10021986 DOI: 10.1186/s12859-023-05209-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 02/24/2023] [Indexed: 03/18/2023] Open
Abstract
BACKGROUND The main task of medical entity disambiguation is to link mentions, such as diseases, drugs, or complications, to standard entities in the target knowledge base. To our knowledge, models based on Bidirectional Encoder Representations from Transformers (BERT) have achieved good results in this task. Unfortunately, these models only consider text in the current document, fail to capture dependencies with other documents, and lack sufficient mining of hidden information in contextual texts. RESULTS We propose B-LBConA, which is based on Bio-LinkBERT and context-aware mechanism. Specifically, B-LBConA first utilizes Bio-LinkBERT, which is capable of learning cross-document dependencies, to obtain embedding representations of mentions and candidate entities. Then, cross-attention is used to capture the interaction information of mention-to-entity and entity-to-mention. Finally, B-LBConA incorporates disambiguation clues about the relevance between the mention context and candidate entities via the context-aware mechanism. CONCLUSIONS Experiment results on three publicly available datasets, NCBI, ADR and ShARe/CLEF, show that B-LBConA achieves a signifcantly more accurate performance compared with existing models.
Collapse
Affiliation(s)
- Siyu Yang
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, Dalian University, 116622, Dalian, China
| | - Peiliang Zhang
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, 430070, Wuhan, China
| | - Chao Che
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, Dalian University, 116622, Dalian, China
| | - Zhaoqian Zhong
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, Dalian University, 116622, Dalian, China.
| |
Collapse
|
2
|
Liu CM, Ta VD, Le NQK, Tadesse DA, Shi C. Deep Neural Network Framework Based on Word Embedding for Protein Glutarylation Sites Prediction. Life (Basel) 2022; 12:life12081213. [PMID: 36013392 PMCID: PMC9410500 DOI: 10.3390/life12081213] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 08/03/2022] [Accepted: 08/05/2022] [Indexed: 04/08/2023] Open
Abstract
In recent years, much research has found that dysregulation of glutarylation is associated with many human diseases, such as diabetes, cancer, and glutaric aciduria type I. Therefore, glutarylation identification and characterization are essential tasks for determining modification-specific proteomics. This study aims to propose a novel deep neural network framework based on word embedding techniques for glutarylation sites prediction. Multiple deep neural network models are implemented to evaluate the performance of glutarylation sites prediction. Furthermore, an extensive experimental comparison of word embedding techniques is conducted to utilize the most efficient method for improving protein sequence data representation. The results suggest that the proposed deep neural networks not only improve protein sequence representation but also work effectively in glutarylation sites prediction by obtaining a higher accuracy and confidence rate compared to the previous work. Moreover, embedding techniques were proven to be more productive than the pre-trained word embedding techniques for glutarylation sequence representation. Our proposed method has significantly outperformed all traditional performance metrics compared to the advanced integrated vector support, with accuracy, specificity, sensitivity, and correlation coefficient of 0.79, 0.89, 0.59, and 0.51, respectively. It shows the potential to detect new glutarylation sites and uncover the relationships between glutarylation and well-known lysine modification.
Collapse
Affiliation(s)
- Chuan-Ming Liu
- Department of Computer Science and Information Engineering, National Taipei University of Technology (Taipei Tech), Taipei City 106, Taiwan
- Correspondence: (C.-M.L.); (C.S.); Tel.: +886-2-2771-2171 (ext. 4251) (C.-M.L.)
| | - Van-Dai Ta
- Samsung Display Vietnam (SDV), Yen Phong Industrial Park, Bac Ninh 16000, Vietnam
| | - Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei City 106, Taiwan
| | | | - Chongyang Shi
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 102488, China
- Correspondence: (C.-M.L.); (C.S.); Tel.: +886-2-2771-2171 (ext. 4251) (C.-M.L.)
| |
Collapse
|
3
|
Koutsomitropoulos DA, Andriopoulos AD. Thesaurus-based word embeddings for automated biomedical literature classification. Neural Comput Appl 2021; 34:937-950. [PMID: 33994670 PMCID: PMC8111057 DOI: 10.1007/s00521-021-06053-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Accepted: 04/15/2021] [Indexed: 11/29/2022]
Abstract
The special nature, volume and broadness of biomedical literature pose barriers for automated classification methods. On the other hand, manually indexing is time-consuming, costly and error prone. We argue that current word embedding algorithms can be efficiently used to support the task of biomedical text classification even in a multilabel setting, with many distinct labels. The ontology representation of Medical Subject Headings provides machine-readable labels and specifies the dimensionality of the problem space. Both deep- and shallow network approaches are implemented. Predictions are determined by the similarity between extracted features from contextualized representations of abstracts and headings. The addition of a separate classifier for transfer learning is also proposed and evaluated. Large datasets of biomedical citations are harvested for their metadata and used for training and testing. These automated approaches are still far from entirely substituting human experts, yet they can be useful as a mechanism for validation and recommendation. Dataset balancing, distributed processing and training parallelization in GPUs, all play an important part regarding the effectiveness and performance of proposed methods.
Collapse
Affiliation(s)
| | - Andreas D Andriopoulos
- Department of Computer Engineering and Informatics, School of Engineering, University of Patras, Patras, Greece
| |
Collapse
|
4
|
Abstract
The existing joint embedding Visual Question Answering models use different combinations of image characterization, text characterization and feature fusion method, but all the existing models use static word vectors for text characterization. However, in the real language environment, the same word may represent different meanings in different contexts, and may also be used as different grammatical components. These differences cannot be effectively expressed by static word vectors, so there may be semantic and grammatical deviations. In order to solve this problem, our article constructs a joint embedding model based on dynamic word vector-none KB-Specific network (N-KBSN) model which is different from commonly used Visual Question Answering models based on static word vectors. The N-KBSN model consists of three main parts: question text and image feature extraction module, self attention and guided attention module, feature fusion and classifier module. Among them, the key parts of N-KBSN model are: image characterization based on Faster R-CNN, text characterization based on ELMo and feature enhancement based on multi-head attention mechanism. The experimental results show that the N-KBSN constructed in our experiment is better than the other 2017-winner (glove) model and 2019-winner (glove) model. The introduction of dynamic word vector improves the accuracy of the overall results.
Collapse
Affiliation(s)
- Zhiyang Ma
- School of Automation, University of Electronic Science and Technology of China, Chengdu, P. R. China
| | - Wenfeng Zheng
- School of Automation, University of Electronic Science and Technology of China, Chengdu, P. R. China
| | - Xiaobing Chen
- School of Automation, University of Electronic Science and Technology of China, Chengdu, P. R. China
| | - Lirong Yin
- Department of Geography and Anthropology, Louisiana State University, LA, USA
| |
Collapse
|
5
|
Li Y, Wang X, Hui L, Zou L, Li H, Xu L, Liu W. Chinese Clinical Named Entity Recognition in Electronic Medical Records: Development of a Lattice Long Short-Term Memory Model With Contextualized Character Representations. JMIR Med Inform 2020; 8:e19848. [PMID: 32885786 PMCID: PMC7501578 DOI: 10.2196/19848] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 07/22/2020] [Accepted: 08/03/2020] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Clinical named entity recognition (CNER), whose goal is to automatically identify clinical entities in electronic medical records (EMRs), is an important research direction of clinical text data mining and information extraction. The promotion of CNER can provide support for clinical decision making and medical knowledge base construction, which could then improve overall medical quality. Compared with English CNER, and due to the complexity of Chinese word segmentation and grammar, Chinese CNER was implemented later and is more challenging. OBJECTIVE With the development of distributed representation and deep learning, a series of models have been applied in Chinese CNER. Different from the English version, Chinese CNER is mainly divided into character-based and word-based methods that cannot make comprehensive use of EMR information and cannot solve the problem of ambiguity in word representation. METHODS In this paper, we propose a lattice long short-term memory (LSTM) model combined with a variant contextualized character representation and a conditional random field (CRF) layer for Chinese CNER: the Embeddings from Language Models (ELMo)-lattice-LSTM-CRF model. The lattice LSTM model can effectively utilize the information from characters and words in Chinese EMRs; in addition, the variant ELMo model uses Chinese characters as input instead of the character-encoding layer of the ELMo model, so as to learn domain-specific contextualized character embeddings. RESULTS We evaluated our method using two Chinese CNER datasets from the China Conference on Knowledge Graph and Semantic Computing (CCKS): the CCKS-2017 CNER dataset and the CCKS-2019 CNER dataset. We obtained F1 scores of 90.13% and 85.02% on the test sets of these two datasets, respectively. CONCLUSIONS Our results show that our proposed method is effective in Chinese CNER. In addition, the results of our experiments show that variant contextualized character representations can significantly improve the performance of the model.
Collapse
Affiliation(s)
- Yongbin Li
- School of Medical Information Engineering, Zunyi Medical University, Zunyi, China
| | - Xiaohua Wang
- School of Medical Information Engineering, Zunyi Medical University, Zunyi, China
| | - Linhu Hui
- School of Medical Information Engineering, Zunyi Medical University, Zunyi, China
| | - Liping Zou
- School of Medical Information Engineering, Zunyi Medical University, Zunyi, China
| | - Hongjin Li
- School of Medical Information Engineering, Zunyi Medical University, Zunyi, China
| | - Luo Xu
- School of Medical Information Engineering, Zunyi Medical University, Zunyi, China
| | - Weihai Liu
- Radiology Department, Beilun District People's Hospital, Ningbo, China
| |
Collapse
|
6
|
Zhang L, Fan H, Peng C, Rao G, Cong Q. Sentiment Analysis Methods for HPV VaccinesRelated Tweets Based on Transfer Learning. Healthcare (Basel) 2020; 8:E307. [PMID: 32872330 PMCID: PMC7551482 DOI: 10.3390/healthcare8030307] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2020] [Revised: 08/24/2020] [Accepted: 08/24/2020] [Indexed: 01/08/2023] Open
Abstract
The widespread use of social media provides a large amount of data for public sentimentanalysis. Based on social media data, researchers can study public opinions on humanpapillomavirus (HPV) vaccines on social media using machine learning-based approaches that willhelp us understand the reasons behind the low vaccine coverage. However, social media data isusually unannotated, and data annotation is costly. The lack of an abundant annotated dataset limitsthe application of deep learning methods in effectively training models. To tackle this problem, wepropose three transfer learning approaches to analyze the public sentiment on HPV vaccines onTwitter. One was transferring static embeddings and embeddings from language models (ELMo)and then processing by bidirectional gated recurrent unit with attention (BiGRU-Att), called DWEBiGRU-Att. The others were fine-tuning pre-trained models with limited annotated data, called finetuninggenerative pre-training (GPT) and fine-tuning bidirectional encoder representations fromtransformers (BERT). The fine-tuned GPT model was built on the pre-trained generative pretraining(GPT) model. The fine-tuned BERT model was constructed with BERT model. Theexperimental results on the HPV dataset demonstrated the efficacy of the three methods in thesentiment analysis of the HPV vaccination task. The experimental results on the HPV datasetdemonstrated the efficacy of the methods in the sentiment analysis of the HPV vaccination task. Thefine-tuned BERT model outperforms all other methods. It can help to find strategies to improvevaccine uptake.
Collapse
Affiliation(s)
- Li Zhang
- School of Economics and Management, Tianjin University of Science and Technology, Tianjin 300457, China; (L.Z.); (H.F.)
| | - Haimeng Fan
- School of Economics and Management, Tianjin University of Science and Technology, Tianjin 300457, China; (L.Z.); (H.F.)
| | - Chengxia Peng
- College of Intelligence and Computing, Tianjin University, Tianjin 300350, China; (C.P.); (Q.C.)
| | - Guozheng Rao
- College of Intelligence and Computing, Tianjin University, Tianjin 300350, China; (C.P.); (Q.C.)
- Tianjin Key Laboratory of Cognitive Computing and Applications, Tianjin University, Tianjin 300350, China
- School of New Media and Communication, Tianjin University, Tianjin 300072, China
| | - Qing Cong
- College of Intelligence and Computing, Tianjin University, Tianjin 300350, China; (C.P.); (Q.C.)
| |
Collapse
|
7
|
Richter-Pechanski P, Amr A, Katus HA, Dieterich C. Deep Learning Approaches Outperform Conventional Strategies in De-Identification of German Medical Reports. Stud Health Technol Inform 2019; 267:101-109. [PMID: 31483261 DOI: 10.3233/shti190813] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
One of the major obstacles for research on German medical reports is the lack of de-identified medical corpora. Previous de-identification tasks focused on non-German medical texts, which raised the demand for an in-depth evaluation of de-identification methods on German medical texts. Because of remarkable advancements in natural language processing using supervised machine learning methods on limited training data, we evaluated them for the first time on German medical reports using our annotated data set consisting of 113 medical reports from the cardiology domain. We applied state-of-the-art deep learning methods using pre-trained models as input to a bidirectional LSTM network and well-established conditional random fields for de-identification of German medical reports. We performed an extensive evaluation for de-identification and multiclass named entity recognition. Using rule based and out of domain machine learning methods as a baseline, the conditional random field improved F2-score from 70 to 93% for de-identification, the neural approach reached 96% in F2-score while keeping balanced precision and recall rates. These results show, that state-of-the-art machine learning methods can play a crucial role in de-identification of German medical reports.
Collapse
Affiliation(s)
- Phillip Richter-Pechanski
- Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg.,Department of Internal Medicine III, University Hospital Heidelberg.,German Center for Cardiovascular Research (DZHK) - Partner site Heidelberg/Mannheim
| | - Ali Amr
- Department of Internal Medicine III, University Hospital Heidelberg.,German Center for Cardiovascular Research (DZHK) - Partner site Heidelberg/Mannheim
| | - Hugo A Katus
- Department of Internal Medicine III, University Hospital Heidelberg.,German Center for Cardiovascular Research (DZHK) - Partner site Heidelberg/Mannheim
| | - Christoph Dieterich
- Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg.,Department of Internal Medicine III, University Hospital Heidelberg.,German Center for Cardiovascular Research (DZHK) - Partner site Heidelberg/Mannheim
| |
Collapse
|