1
|
Jia Y, Wang H, Yuan Z, Zhu L, Xiang ZL. Biomedical relation extraction method based on ensemble learning and attention mechanism. BMC Bioinformatics 2024; 25:333. [PMID: 39425010 PMCID: PMC11488084 DOI: 10.1186/s12859-024-05951-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2024] [Accepted: 10/07/2024] [Indexed: 10/21/2024] Open
Abstract
BACKGROUND Relation extraction (RE) plays a crucial role in biomedical research as it is essential for uncovering complex semantic relationships between entities in textual data. Given the significance of RE in biomedical informatics and the increasing volume of literature, there is an urgent need for advanced computational models capable of accurately and efficiently extracting these relationships on a large scale. RESULTS This paper proposes a novel approach, SARE, combining ensemble learning Stacking and attention mechanisms to enhance the performance of biomedical relation extraction. By leveraging multiple pre-trained models, SARE demonstrates improved adaptability and robustness across diverse domains. The attention mechanisms enable the model to capture and utilize key information in the text more accurately. SARE achieved performance improvements of 4.8, 8.7, and 0.8 percentage points on the PPI, DDI, and ChemProt datasets, respectively, compared to the original BERT variant and the domain-specific PubMedBERT model. CONCLUSIONS SARE offers a promising solution for improving the accuracy and efficiency of relation extraction tasks in biomedical research, facilitating advancements in biomedical informatics. The results suggest that combining ensemble learning with attention mechanisms is effective for extracting complex relationships from biomedical texts. Our code and data are publicly available at: https://github.com/GS233/Biomedical .
Collapse
Affiliation(s)
- Yaxun Jia
- Department of Radiation Oncology, Shanghai East Hospital, Tongji University School of Medicine, Shanghai, China
| | - Haoyang Wang
- Department of Computer College, Beijing Information Science and Technology University, Beijing, China
| | - Zhu Yuan
- Department of Information Management, The National Police University for Criminal Justice, Baoding, China
| | - Lian Zhu
- Department of Radiation Oncology, Shanghai East Hospital Ji'an hospital, Jian, China
| | - Zuo-Lin Xiang
- Department of Radiation Oncology, Shanghai East Hospital, Tongji University School of Medicine, Shanghai, China.
- Department of Radiation Oncology, Shanghai East Hospital Ji'an hospital, Jian, China.
| |
Collapse
|
2
|
Dara ON, Ibrahim AA, Mohammed TA. Advancing medical imaging: detecting polypharmacy and adverse drug effects with Graph Convolutional Networks (GCN). BMC Med Imaging 2024; 24:174. [PMID: 39009978 PMCID: PMC11247854 DOI: 10.1186/s12880-024-01349-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Accepted: 06/28/2024] [Indexed: 07/17/2024] Open
Abstract
Polypharmacy involves an individual using many medications at the same time and is a frequent healthcare technique used to treat complex medical disorders. Nevertheless, it also presents substantial risks of negative medication responses and interactions. Identifying and addressing adverse effects caused by polypharmacy is crucial to ensure patient safety and improve healthcare results. This paper introduces a new method using Graph Convolutional Networks (GCN) to identify polypharmacy side effects. Our strategy involves developing a medicine interaction graph in which edges signify drug-drug intuitive predicated on pharmacological properties and hubs symbolize drugs. GCN is a well-suited profound learning procedure for graph-based representations of social information. It can be used to anticipate the probability of medicate unfavorable impacts and to memorize important representations of sedate intuitive. Tests were conducted on a huge dataset of patients' pharmaceutical records commented on with watched medicate unfavorable impacts in arrange to approve our strategy. Execution of the GCN show, which was prepared on a subset of this dataset, was evaluated through a disarray framework. The perplexity network shows the precision with which the show categories occasions. Our discoveries demonstrate empowering advance within the recognizable proof of antagonistic responses related with polypharmaceuticals. For cardiovascular system target drugs, GCN technique achieved an accuracy of 94.12%, precision of 86.56%, F1-Score of 88.56%, AUC of 89.74% and recall of 87.92%. For respiratory system target drugs, GCN technique achieved an accuracy of 93.38%, precision of 85.64%, F1-Score of 89.79%, AUC of 91.85% and recall of 86.35%. And for nervous system target drugs, GCN technique achieved an accuracy of 95.27%, precision of 88.36%, F1-Score of 86.49%, AUC of 88.83% and recall of 84.73%. This research provides a significant contribution to pharmacovigilance by proposing a data-driven method to detect and reduce polypharmacy side effects, thereby increasing patient safety and healthcare decision-making.
Collapse
Affiliation(s)
- Omer Nabeel Dara
- Collage of Engineering, Department of Electrical and Computer Engineering, Altinbas University, Istanbul, Turkey.
| | - Abdullahi Abdu Ibrahim
- Collage of Engineering, Department of Electrical and Computer Engineering, Altinbas University, Istanbul, Turkey
| | - Tareq Abed Mohammed
- College of Computer Science and Information Technology, Department of Information Technology, University of Kirkuk, Kirkuk, Iraq
| |
Collapse
|
3
|
He SH, Yun L, Yi HC. Accurate prediction of drug combination risk levels based on relational graph convolutional network and multi-head attention. J Transl Med 2024; 22:572. [PMID: 38880914 PMCID: PMC11180398 DOI: 10.1186/s12967-024-05372-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 06/02/2024] [Indexed: 06/18/2024] Open
Abstract
BACKGROUND Accurately identifying the risk level of drug combinations is of great significance in investigating the mechanisms of combination medication and adverse reactions. Most existing methods can only predict whether there is an interaction between two drugs, but cannot directly determine their accurate risk level. METHODS In this study, we propose a multi-class drug combination risk prediction model named AERGCN-DDI, utilizing a relational graph convolutional network with a multi-head attention mechanism. Drug-drug interaction events with varying risk levels are modeled as a heterogeneous information graph. Attribute features of drug nodes and links are learned based on compound chemical structure information. Finally, the AERGCN-DDI model is proposed to predict drug combination risk level based on heterogenous graph neural network and multi-head attention modules. RESULTS To evaluate the effectiveness of the proposed method, five-fold cross-validation and ablation study were conducted. Furthermore, we compared its predictive performance with baseline models and other state-of-the-art methods on two benchmark datasets. Empirical studies demonstrated the superior performances of AERGCN-DDI. CONCLUSIONS AERGCN-DDI emerges as a valuable tool for predicting the risk levels of drug combinations, thereby aiding in clinical medication decision-making, mitigating severe drug side effects, and enhancing patient clinical prognosis.
Collapse
Affiliation(s)
- Shi-Hui He
- School of Information Science and Technology, Yunnan Normal University, Kunming, 650500, China
- Engineering Research Center of Computer Vision and Intelligent Control Technology, Department of Education, Kunming, 650500, China
| | - Lijun Yun
- School of Information Science and Technology, Yunnan Normal University, Kunming, 650500, China.
- Engineering Research Center of Computer Vision and Intelligent Control Technology, Department of Education, Kunming, 650500, China.
| | - Hai-Cheng Yi
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129, China.
| |
Collapse
|
4
|
Pham T, Ghafoor M, Grañana-Castillo S, Marzolini C, Gibbons S, Khoo S, Chiong J, Wang D, Siccardi M. DeepARV: ensemble deep learning to predict drug-drug interaction of clinical relevance with antiretroviral therapy. NPJ Syst Biol Appl 2024; 10:48. [PMID: 38710671 PMCID: PMC11074332 DOI: 10.1038/s41540-024-00374-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 04/17/2024] [Indexed: 05/08/2024] Open
Abstract
Drug-drug interaction (DDI) may result in clinical toxicity or treatment failure of antiretroviral therapy (ARV) or comedications. Despite the high number of possible drug combinations, only a limited number of clinical DDI studies are conducted. Computational prediction of DDIs could provide key evidence for the rational management of complex therapies. Our study aimed to assess the potential of deep learning approaches to predict DDIs of clinical relevance between ARVs and comedications. DDI severity grading between 30,142 drug pairs was extracted from the Liverpool HIV Drug Interaction database. Two feature construction techniques were employed: 1) drug similarity profiles by comparing Morgan fingerprints, and 2) embeddings from SMILES of each drug via ChemBERTa, a transformer-based model. We developed DeepARV-Sim and DeepARV-ChemBERTa to predict four categories of DDI: i) Red: drugs should not be co-administered, ii) Amber: interaction of potential clinical relevance manageable by monitoring/dose adjustment, iii) Yellow: interaction of weak relevance and iv) Green: no expected interaction. The imbalance in the distribution of DDI severity grades was addressed by undersampling and applying ensemble learning. DeepARV-Sim and DeepARV-ChemBERTa predicted clinically relevant DDI between ARVs and comedications with a weighted mean balanced accuracy of 0.729 ± 0.012 and 0.776 ± 0.011, respectively. DeepARV-Sim and DeepARV-ChemBERTa have the potential to leverage molecular structures associated with DDI risks and reduce DDI class imbalance, effectively increasing the predictive ability on clinically relevant DDIs. This approach could be developed for identifying high-risk pairing of drugs, enhancing the screening process, and targeting DDIs to study in clinical drug development.
Collapse
Affiliation(s)
- Thao Pham
- Institute of Systems, Molecular & Integrative Biology, Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
| | - Mohamed Ghafoor
- Department of Computer Science, University of Liverpool, Liverpool, UK
| | - Sandra Grañana-Castillo
- Institute of Systems, Molecular & Integrative Biology, Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
| | - Catia Marzolini
- Institute of Systems, Molecular & Integrative Biology, Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
- Department of Infectious Diseases and Hospital Epidemiology, Departments of Medicine and Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland
| | - Sara Gibbons
- Institute of Systems, Molecular & Integrative Biology, Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
| | - Saye Khoo
- Institute of Systems, Molecular & Integrative Biology, Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
| | - Justin Chiong
- Institute of Systems, Molecular & Integrative Biology, Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
| | - Dennis Wang
- National Heart and Lung Institute, Imperial College London, London, UK.
- Singapore Institute for Clinical Sciences, Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore.
| | - Marco Siccardi
- Institute of Systems, Molecular & Integrative Biology, Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
| |
Collapse
|
5
|
Zhu J, Liu Y, Zhang Y, Chen Z, Wu X. Multi-Attribute Discriminative Representation Learning for Prediction of Adverse Drug-Drug Interaction. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:10129-10144. [PMID: 34914581 DOI: 10.1109/tpami.2021.3135841] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Adverse drug-drug interaction (ADDI) is a significant life-threatening issue, posing a leading cause of hospitalizations and deaths in healthcare systems. This paper proposes a unified Multi-Attribute Discriminative Representation Learning (MADRL) model for ADDI prediction. Unlike the existing works that equally treat features of each attribute without discrimination and do not consider the underlying relationship among drugs, we first develop a regularized optimization problem based on CUR matrix decomposition for joint representative drug and discriminative feature selection such that the selected drugs and features can well approximate the original feature spaces and the critical factors discriminative to ADDIs can be properly explored. Different from the existing models that ignore the consistent and unique properties among attributes, a Generative Adversarial Network (GAN) framework is then designed to capture the inter-attribute shared and intra-attribute specific representations of adverse drug pairs for exploiting their consensus and complementary information in ADDI prediction. Meanwhile, MADRL is compatible with any kind of attributes and capable of exploring their respective effects on ADDI prediction. An iterative algorithm based on the alternating direction method of multipliers is developed for optimization. Experiments on publicly available dataset demonstrate the effectiveness of MADRL when compared with eleven baselines and its six variants.
Collapse
|
6
|
Duan B, Peng J, Zhang Y. IMSE: interaction information attention and molecular structure based drug drug interaction extraction. BMC Bioinformatics 2022; 23:338. [PMID: 35965308 PMCID: PMC9375903 DOI: 10.1186/s12859-022-04876-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 08/03/2022] [Indexed: 11/10/2022] Open
Abstract
Background Extraction of drug drug interactions from biomedical literature and other textual data is an important component to monitor drug-safety and this has attracted attention of many researchers in healthcare. Existing works are more pivoted around relation extraction using bidirectional long short-term memory networks (BiLSTM) and BERT model which does not attain the best feature representations. Results Our proposed DDI (drug drug interaction) prediction model provides multiple advantages: (1) The newly proposed attention vector is added to better deal with the problem of overlapping relations, (2) The molecular structure information of drugs is integrated into the model to better express the functional group structure of drugs, (3) We also added text features that combined the T-distribution and chi-square distribution to make the model more focused on drug entities and (4) it achieves similar or better prediction performance (F-scores up to 85.16%) compared to state-of-the-art DDI models when tested on benchmark datasets. Conclusions Our model that leverages state of the art transformer architecture in conjunction with multiple features can bolster the performances of drug drug interation tasks in the biomedical domain. In particular, we believe our research would be helpful in identification of potential adverse drug reactions.
Collapse
|
7
|
Yadav S, Ramesh S, Saha S, Ekbal A. Relation Extraction From Biomedical and Clinical Text: Unified Multitask Learning Framework. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1105-1116. [PMID: 32853152 DOI: 10.1109/tcbb.2020.3020016] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
MOTIVATION To minimize the accelerating amount of time invested on the biomedical literature search, numerous approaches for automated knowledge extraction have been proposed. Relation extraction is one such task where semantic relations between the entities are identified from the free text. In the biomedical domain, extraction of regulatory pathways, metabolic processes, adverse drug reaction or disease models necessitates knowledge from the individual relations, for example, physical or regulatory interactions between genes, proteins, drugs, chemical, disease or phenotype. RESULTS In this paper, we study the relation extraction task from three major biomedical and clinical tasks, namely drug-drug interaction, protein-protein interaction, and medical concept relation extraction. Towards this, we model the relation extraction problem in a multi-task learning (MTL)framework, and introduce for the first time the concept of structured self-attentive network complemented with the adversarial learning approach for the prediction of relationships from the biomedical and clinical text. The fundamental notion of MTL is to simultaneously learn multiple problems together by utilizing the concepts of the shared representation. Additionally, we also generate the highly efficient single task model which exploits the shortest dependency path embedding learned over the attentive gated recurrent unit to compare our proposed MTL models. The framework we propose significantly improves over all the baselines (deep learning techniques)and single-task models for predicting the relationships, without compromising on the performance of all the tasks.
Collapse
|
8
|
Fei H, Zhang Y, Ren Y, Ji D. A span-graph neural model for overlapping entity relation extraction in biomedical texts. Bioinformatics 2021; 37:1581-1589. [PMID: 33245108 DOI: 10.1093/bioinformatics/btaa993] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Revised: 10/25/2020] [Accepted: 11/17/2020] [Indexed: 01/14/2023] Open
Abstract
MOTIVATION Entity relation extraction is one of the fundamental tasks in biomedical text mining, which is usually solved by the models from natural language processing. Compared with traditional pipeline methods, joint methods can avoid the error propagation from entity to relation, giving better performances. However, the existing joint models are built upon sequential scheme, and fail to detect overlapping entity and relation, which are ubiquitous in biomedical texts. The main reason is that sequential models have relatively weaker power in capturing long-range dependencies, which results in lower performance in encoding longer sentences. In this article, we propose a novel span-graph neural model for jointly extracting overlapping entity relation in biomedical texts. Our model treats the task as relation triplets prediction, and builds the entity-graph by enumerating possible candidate entity spans. The proposed model captures the relationship between the correlated entities via a span scorer and a relation scorer, respectively, and finally outputs all valid relational triplets. RESULTS Experimental results on two biomedical entity relation extraction tasks, including drug-drug interaction detection and protein-protein interaction detection, show that the proposed method outperforms previous models by a substantial margin, demonstrating the effectiveness of span-graph-based method for overlapping relation extraction in biomedical texts. Further in-depth analysis proves that our model is more effective in capturing the long-range dependencies for relation extraction compared with the sequential models. AVAILABILITY AND IMPLEMENTATION Related codes are made publicly available at http://github.com/Baxelyne/SpanBioER.
Collapse
Affiliation(s)
- Hao Fei
- School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, China
| | - Yue Zhang
- School of Engineering, Westlake University, Hangzhou 310024, China
| | - Yafeng Ren
- Laboratory of Language and Artificial Intelligence, Guangdong University of Foreign Studies, Guangzhou 510420, China
| | - Donghong Ji
- School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, China
| |
Collapse
|
9
|
Wu H, Xing Y, Ge W, Liu X, Zou J, Zhou C, Liao J. Drug-drug interaction extraction via hybrid neural networks on biomedical literature. J Biomed Inform 2020; 106:103432. [PMID: 32335223 DOI: 10.1016/j.jbi.2020.103432] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 04/15/2020] [Accepted: 04/20/2020] [Indexed: 01/16/2023]
Abstract
Adverse events caused by drug-drug interaction (DDI) not only pose a serious threat to health, but also increase additional medical care expenditure. However, despite the emergence of many excellent text mining-based DDI classification methods, achieving a balance between using simpler method and better model performance is still unsatisfactory. In this article, we present a deep learning method of stacked bidirectional Gated Recurrent Unit (GRU)- convolutional neural network (SGRU-CNN) model which apply stacked bidirectional GRU (BiGRU) network and convolutional neural network (CNN) on lexical information and entity position information respectively to conduct DDIs extraction task. Furthermore, SGRU-CNN model assigns the weights of each word feature to improve performance with one attentive pooling layer. On the condition that other values are not inferior to other algorithms, experimental results on the DDI Extraction 2013 corpus show that our model achieves a 1.54% improvement in recall value. And the proposed SGRU-CNN model reaches great performance (F1-score: 0.75) with the fewest features, indicating an excellent balance between avoiding redundant preprocessing task and higher accuracy in relation extraction on biomedical literature using our method.
Collapse
Affiliation(s)
- Hong Wu
- School of science, China Pharmaceutical University, Nanjing, China
| | - Yan Xing
- School of science, China Pharmaceutical University, Nanjing, China
| | - Weihong Ge
- Department of Pharmacy, Nanjing Drum Tower Hospital, Nanjing, China; School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
| | - Xiaoquan Liu
- School of Pharmacy, China Pharmaceutical University, Nanjing, China
| | - Jianjun Zou
- School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China; Department of Clinical Pharmacology, Nanjing First Hospital, Nanjing Medical University, Nanjing, China
| | - Changjiang Zhou
- School of science, China Pharmaceutical University, Nanjing, China
| | - Jun Liao
- School of science, China Pharmaceutical University, Nanjing, China.
| |
Collapse
|
10
|
Zhang T, Leng J, Liu Y. Deep learning for drug–drug interaction extraction from the literature: a review. Brief Bioinform 2019; 21:1609-1627. [DOI: 10.1093/bib/bbz087] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 06/20/2019] [Accepted: 06/21/2019] [Indexed: 01/07/2023] Open
Abstract
Abstract
Drug–drug interactions (DDIs) are crucial for drug research and pharmacovigilance. These interactions may cause adverse drug effects that threaten public health and patient safety. Therefore, the DDIs extraction from biomedical literature has been widely studied and emphasized in modern biomedical research. The previous rules-based and machine learning approaches rely on tedious feature engineering, which is labourious, time-consuming and unsatisfactory. With the development of deep learning technologies, this problem is alleviated by learning feature representations automatically. Here, we review the recent deep learning methods that have been applied to the extraction of DDIs from biomedical literature. We describe each method briefly and compare its performance in the DDI corpus systematically. Next, we summarize the advantages and disadvantages of these deep learning models for this task. Furthermore, we discuss some challenges and future perspectives of DDI extraction via deep learning methods. This review aims to serve as a useful guide for interested researchers to further advance bioinformatics algorithms for DDIs extraction from the literature.
Collapse
Affiliation(s)
- Tianlin Zhang
- School of Computer Science and Technology, University of Chinese Academy of Sciences, China
| | - Jiaxu Leng
- School of Computer Science and Technology, University of Chinese Academy of Sciences, China
| | - Ying Liu
- University of Chinese Academy of Sciences, Key Lab of Big Data Mining and Knowledge Management
| |
Collapse
|
11
|
Randriatsitohaina T, Hamon T. Extracting Food-Drug Interactions from Scientific Literature: Tackling Unspecified Relation. Artif Intell Med 2019. [DOI: 10.1007/978-3-030-21642-9_34] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
12
|
Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, Ferrero E, Agapow PM, Zietz M, Hoffman MM, Xie W, Rosen GL, Lengerich BJ, Israeli J, Lanchantin J, Woloszynek S, Carpenter AE, Shrikumar A, Xu J, Cofer EM, Lavender CA, Turaga SC, Alexandari AM, Lu Z, Harris DJ, DeCaprio D, Qi Y, Kundaje A, Peng Y, Wiley LK, Segler MHS, Boca SM, Swamidass SJ, Huang A, Gitter A, Greene CS. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface 2018; 15:20170387. [PMID: 29618526 PMCID: PMC5938574 DOI: 10.1098/rsif.2017.0387] [Citation(s) in RCA: 834] [Impact Index Per Article: 139.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Accepted: 03/07/2018] [Indexed: 11/12/2022] Open
Abstract
Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms have recently shown impressive results across a variety of domains. Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well suited to solve problems of these fields. We examine applications of deep learning to a variety of biomedical problems-patient classification, fundamental biological processes and treatment of patients-and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges. Following from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art. Even though improvements over previous baselines have been modest in general, the recent progress indicates that deep learning methods will provide valuable means for speeding up or aiding human investigation. Though progress has been made linking a specific neural network's prediction to input features, understanding how users should interpret these models to make testable hypotheses about the system under study remains an open challenge. Furthermore, the limited amount of labelled data for training presents problems in some domains, as do legal and privacy constraints on work with sensitive health records. Nonetheless, we foresee deep learning enabling changes at both bench and bedside with the potential to transform several areas of biology and medicine.
Collapse
Affiliation(s)
- Travers Ching
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI, USA
| | - Daniel S Himmelstein
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Brett K Beaulieu-Jones
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Alexandr A Kalinin
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - Gregory P Way
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Enrico Ferrero
- Computational Biology and Stats, Target Sciences, GlaxoSmithKline, Stevenage, UK
| | | | - Michael Zietz
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Wei Xie
- Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA
| | - Gail L Rosen
- Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Benjamin J Lengerich
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Johnny Israeli
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Jack Lanchantin
- Department of Computer Science, University of Virginia, Charlottesville, VA, USA
| | - Stephen Woloszynek
- Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Anne E Carpenter
- Imaging Platform, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Avanti Shrikumar
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Jinbo Xu
- Toyota Technological Institute at Chicago, Chicago, IL, USA
| | - Evan M Cofer
- Department of Computer Science, Trinity University, San Antonio, TX, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Christopher A Lavender
- Integrative Bioinformatics, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, USA
| | - Srinivas C Turaga
- Howard Hughes Medical Institute, Janelia Research Campus, Ashburn, VA, USA
| | - Amr M Alexandari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - David J Harris
- Department of Wildlife Ecology and Conservation, University of Florida, Gainesville, FL, USA
| | | | - Yanjun Qi
- Department of Computer Science, University of Virginia, Charlottesville, VA, USA
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Yifan Peng
- National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Laura K Wiley
- Division of Biomedical Informatics and Personalized Medicine, University of Colorado School of Medicine, Aurora, CO, USA
| | - Marwin H S Segler
- Institute of Organic Chemistry, Westfälische Wilhelms-Universität Münster, Münster, Germany
| | - Simina M Boca
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
| | - S Joshua Swamidass
- Department of Pathology and Immunology, Washington University in Saint Louis, St Louis, MO, USA
| | - Austin Huang
- Department of Medicine, Brown University, Providence, RI, USA
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
- Morgridge Institute for Research, Madison, WI, USA
| | - Casey S Greene
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|