1
|
Ghandikota SK, Jegga AG. Application of artificial intelligence and machine learning in drug repurposing. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2024; 205:171-211. [PMID: 38789178 DOI: 10.1016/bs.pmbts.2024.03.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2024]
Abstract
The purpose of drug repurposing is to leverage previously approved drugs for a particular disease indication and apply them to another disease. It can be seen as a faster and more cost-effective approach to drug discovery and a powerful tool for achieving precision medicine. In addition, drug repurposing can be used to identify therapeutic candidates for rare diseases and phenotypic conditions with limited information on disease biology. Machine learning and artificial intelligence (AI) methodologies have enabled the construction of effective, data-driven repurposing pipelines by integrating and analyzing large-scale biomedical data. Recent technological advances, especially in heterogeneous network mining and natural language processing, have opened up exciting new opportunities and analytical strategies for drug repurposing. In this review, we first introduce the challenges in repurposing approaches and highlight some success stories, including those during the COVID-19 pandemic. Next, we review some existing computational frameworks in the literature, organized on the basis of the type of biomedical input data analyzed and the computational algorithms involved. In conclusion, we outline some exciting new directions that drug repurposing research may take, as pioneered by the generative AI revolution.
Collapse
Affiliation(s)
- Sudhir K Ghandikota
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States
| | - Anil G Jegga
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States; Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, United States.
| |
Collapse
|
2
|
Chen H, King FJ, Zhou B, Wang Y, Canedy CJ, Hayashi J, Zhong Y, Chang MW, Pache L, Wong JL, Jia Y, Joslin J, Jiang T, Benner C, Chanda SK, Zhou Y. Drug target prediction through deep learning functional representation of gene signatures. Nat Commun 2024; 15:1853. [PMID: 38424040 PMCID: PMC10904399 DOI: 10.1038/s41467-024-46089-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 02/14/2024] [Indexed: 03/02/2024] Open
Abstract
Many machine learning applications in bioinformatics currently rely on matching gene identities when analyzing input gene signatures and fail to take advantage of preexisting knowledge about gene functions. To further enable comparative analysis of OMICS datasets, including target deconvolution and mechanism of action studies, we develop an approach that represents gene signatures projected onto their biological functions, instead of their identities, similar to how the word2vec technique works in natural language processing. We develop the Functional Representation of Gene Signatures (FRoGS) approach by training a deep learning model and demonstrate that its application to the Broad Institute's L1000 datasets results in more effective compound-target predictions than models based on gene identities alone. By integrating additional pharmacological activity data sources, FRoGS significantly increases the number of high-quality compound-target predictions relative to existing approaches, many of which are supported by in silico and/or experimental evidence. These results underscore the general utility of FRoGS in machine learning-based bioinformatics applications. Prediction networks pre-equipped with the knowledge of gene functions may help uncover new relationships among gene signatures acquired by large-scale OMICs studies on compounds, cell types, disease models, and patient cohorts.
Collapse
Affiliation(s)
- Hao Chen
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
- Department of Computer Science and Engineering, University of California, Riverside, 900 University Avenue, Riverside, CA, 92521, USA.
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213, USA.
| | - Frederick J King
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Bin Zhou
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Yu Wang
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Carter J Canedy
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Joel Hayashi
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Yang Zhong
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Max W Chang
- Department of Medicine, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
| | - Lars Pache
- NCI Designated Cancer Center, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Julian L Wong
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Yong Jia
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - John Joslin
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Tao Jiang
- Department of Computer Science and Engineering, University of California, Riverside, 900 University Avenue, Riverside, CA, 92521, USA
| | - Christopher Benner
- Department of Medicine, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
| | - Sumit K Chanda
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
| | - Yingyao Zhou
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
| |
Collapse
|
3
|
Chivukula N, Ramesh K, Subbaroyan A, Sahoo AK, Dhanakoti GB, Ravichandran J, Samal A. ViCEKb: Vitiligo-linked Chemical Exposome Knowledgebase. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 913:169711. [PMID: 38160837 DOI: 10.1016/j.scitotenv.2023.169711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 12/23/2023] [Accepted: 12/25/2023] [Indexed: 01/03/2024]
Abstract
Vitiligo is a complex disease wherein the environmental factors, in conjunction with the underlying genetic predispositions, trigger the autoimmune destruction of melanocytes, ultimately leading to depigmented patches on the skin. While genetic factors have been extensively studied, the knowledge on environmental triggers remains sparse and less understood. To address this knowledge gap, we present the first comprehensive knowledgebase of vitiligo-triggering chemicals namely, Vitiligo-linked Chemical Exposome Knowledgebase (ViCEKb). ViCEKb involves an extensive and systematic manual effort in curation of published literature and subsequent compilation of 113 unique chemical triggers of vitiligo. ViCEKb standardizes various chemical information, and categorizes the chemicals based on their evidences and sources of exposure. Importantly, ViCEKb contains a wide range of metrics necessary for different toxicological evaluations. Notably, we observed that ViCEKb chemicals are present in a variety of consumer products. For instance, Propyl gallate is present as a fragrance substance in various household products, and Flutamide is used in medication to treat prostate cancer. These two chemicals have the highest level of evidence in ViCEKb, but are not regulated for their skin sensitizing effects. Furthermore, an extensive cheminformatics-based investigation revealed that ViCEKb chemical space is structurally diverse and comprises unique chemical scaffolds in comparison with skin specific regulatory lists. For example, Neomycin and 2,3,5-Triglycidyl-4-aminophenol have unique chemical scaffolds and the highest level of evidence in ViCEKb, but are not regulated for their skin sensitizing effects. Finally, a transcriptomics-based analysis of ViCEKb chemical perturbations in skin cell samples highlighted the commonality in their linked biological processes. Overall, we present the first comprehensive effort in compilation and exploration of various chemical triggers of vitiligo. We believe such a resource will enable in deciphering the complex etiology of vitiligo and aid in the characterization of human chemical exposome. ViCEKb is freely available for academic research at: https://cb.imsc.res.in/vicekb.
Collapse
Affiliation(s)
- Nikhil Chivukula
- The Institute of Mathematical Sciences (IMSc), Chennai, India; Homi Bhabha National Institute (HBNI), Mumbai, India
| | | | - Ajay Subbaroyan
- The Institute of Mathematical Sciences (IMSc), Chennai, India; Homi Bhabha National Institute (HBNI), Mumbai, India
| | - Ajaya Kumar Sahoo
- The Institute of Mathematical Sciences (IMSc), Chennai, India; Homi Bhabha National Institute (HBNI), Mumbai, India
| | | | - Janani Ravichandran
- The Institute of Mathematical Sciences (IMSc), Chennai, India; Homi Bhabha National Institute (HBNI), Mumbai, India
| | - Areejit Samal
- The Institute of Mathematical Sciences (IMSc), Chennai, India; Homi Bhabha National Institute (HBNI), Mumbai, India.
| |
Collapse
|
4
|
Ghorbanali Z, Zare-Mirakabad F, Salehi N, Akbari M, Masoudi-Nejad A. DrugRep-HeSiaGraph: when heterogenous siamese neural network meets knowledge graphs for drug repurposing. BMC Bioinformatics 2023; 24:374. [PMID: 37789314 PMCID: PMC10548718 DOI: 10.1186/s12859-023-05479-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 09/12/2023] [Indexed: 10/05/2023] Open
Abstract
BACKGROUND Drug repurposing is an approach that holds promise for identifying new therapeutic uses for existing drugs. Recently, knowledge graphs have emerged as significant tools for addressing the challenges of drug repurposing. However, there are still major issues with constructing and embedding knowledge graphs. RESULTS This study proposes a two-step method called DrugRep-HeSiaGraph to address these challenges. The method integrates the drug-disease knowledge graph with the application of a heterogeneous siamese neural network. In the first step, a drug-disease knowledge graph named DDKG-V1 is constructed by defining new relationship types, and then numerical vector representations for the nodes are created using the distributional learning method. In the second step, a heterogeneous siamese neural network called HeSiaNet is applied to enrich the embedding of drugs and diseases by bringing them closer in a new unified latent space. Then, it predicts potential drug candidates for diseases. DrugRep-HeSiaGraph achieves impressive performance metrics, including an AUC-ROC of 91.16%, an AUC-PR of 90.32%, an accuracy of 84.63%, a BS of 0.119, and an MCC of 69.31%. CONCLUSION We demonstrate the effectiveness of the proposed method in identifying potential drugs for COVID-19 as a case study. In addition, this study shows the role of dipeptidyl peptidase 4 (DPP-4) as a potential receptor for SARS-CoV-2 and the effectiveness of DPP-4 inhibitors in facing COVID-19. This highlights the practical application of the model in addressing real-world challenges in the field of drug repurposing. The code and data for DrugRep-HeSiaGraph are publicly available at https://github.com/CBRC-lab/DrugRep-HeSiaGraph .
Collapse
Affiliation(s)
- Zahra Ghorbanali
- Computational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran
| | - Fatemeh Zare-Mirakabad
- Computational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran.
| | - Najmeh Salehi
- School of Biological Science, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
| | - Mohammad Akbari
- Computational Biology Research Center (CBRC), Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran
| | - Ali Masoudi-Nejad
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| |
Collapse
|
5
|
Zaeri N. Artificial intelligence and machine learning responses to COVID-19 related inquiries. J Med Eng Technol 2023; 47:301-320. [PMID: 38625639 DOI: 10.1080/03091902.2024.2321846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Accepted: 02/18/2024] [Indexed: 04/17/2024]
Abstract
Researchers and scientists can use computational-based models to turn linked data into useful information, aiding in disease diagnosis, examination, and viral containment due to recent artificial intelligence and machine learning breakthroughs. In this paper, we extensively study the role of artificial intelligence and machine learning in delivering efficient responses to the COVID-19 pandemic almost four years after its start. In this regard, we examine a large number of critical studies conducted by various academic and research communities from multiple disciplines, as well as practical implementations of artificial intelligence algorithms that suggest potential solutions in investigating different COVID-19 decision-making scenarios. We identify numerous areas where artificial intelligence and machine learning can impact this context, including diagnosis (using chest X-ray imaging and CT imaging), severity, tracking, treatment, and the drug industry. Furthermore, we analyse the dilemma's limits, restrictions, and hazards.
Collapse
Affiliation(s)
- Naser Zaeri
- Faculty of Computer Studies, Arab Open University, Kuwait
| |
Collapse
|
6
|
Novakovsky G, Sasaki S, Fornes O, Omur ME, Huang H, Bayly CL, Zhang D, Lim N, Cherkasov A, Pavlidis P, Mostafavi S, Lynn FC, Wasserman WW. In silico discovery of small molecules for efficient stem cell differentiation into definitive endoderm. Stem Cell Reports 2023; 18:765-781. [PMID: 36801003 PMCID: PMC10031281 DOI: 10.1016/j.stemcr.2023.01.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 01/18/2023] [Accepted: 01/19/2023] [Indexed: 02/18/2023] Open
Abstract
Improving methods for human embryonic stem cell differentiation represents a challenge in modern regenerative medicine research. Using drug repurposing approaches, we discover small molecules that regulate the formation of definitive endoderm. Among them are inhibitors of known processes involved in endoderm differentiation (mTOR, PI3K, and JNK pathways) and a new compound, with an unknown mechanism of action, capable of inducing endoderm formation in the absence of growth factors in the media. Optimization of the classical protocol by inclusion of this compound achieves the same differentiation efficiency with a 90% cost reduction. The presented in silico procedure for candidate molecule selection has broad potential for improving stem cell differentiation protocols.
Collapse
Affiliation(s)
- Gherman Novakovsky
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada; Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Shugo Sasaki
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Department of Surgery, University of British Columbia, Vancouver, BC, Canada; School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| | - Oriol Fornes
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Meltem E Omur
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada; Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Helen Huang
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Department of Surgery, University of British Columbia, Vancouver, BC, Canada; School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada
| | - Carmen L Bayly
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Department of Surgery, University of British Columbia, Vancouver, BC, Canada
| | - Dahai Zhang
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada
| | - Nathaniel Lim
- Bioinformatics Graduate Program, University of British Columbia, Vancouver, BC, Canada; Department of Psychiatry, Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada
| | - Artem Cherkasov
- Department of Urological Sciences, Vancouver Prostate Centre, University of British Columbia, Vancouver, BC, Canada
| | - Paul Pavlidis
- Department of Psychiatry, Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada
| | - Sara Mostafavi
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada; Department of Statistics, University of British Columbia, Vancouver, BC, Canada; Department of Computer Science, University of Washington, Seattle, WA, USA
| | - Francis C Lynn
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Department of Surgery, University of British Columbia, Vancouver, BC, Canada; School of Biomedical Engineering, University of British Columbia, Vancouver, BC, Canada.
| | - Wyeth W Wasserman
- BC Children's Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada; Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
7
|
Mun S, Han K, Hyun JK. The Time Sequence of Gene Expression Changes after Spinal Cord Injury. Cells 2022; 11:cells11142236. [PMID: 35883679 PMCID: PMC9324287 DOI: 10.3390/cells11142236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 07/15/2022] [Accepted: 07/17/2022] [Indexed: 02/01/2023] Open
Abstract
Gene expression changes following spinal cord injury (SCI) are time-dependent, and an accurate understanding of these changes can be crucial in determining time-based treatment options in a clinical setting. We performed RNA sequencing of the contused spinal cord of rats at five different time points from the very acute to chronic stages (1 hour, 1 day, 1 week, 1 month, and 3 months) following SCI. We identified differentially expressed genes (DEGs) and Gene Ontology (GO) terms at each time point, and 14,257 genes were commonly expressed at all time points. The biological process of the inflammatory response was increased at 1 hour and 1 day, and the cellular component of the integral component of the synaptic membrane was increased at 1 day. DEGs associated with cell activation and the innate immune response were highly enriched at 1 week and 1 month, respectively. A total of 2841 DEGs were differentially expressed at any of the five time points, and 18 genes (17 upregulated and 1 downregulated) showed common expression differences at all time points. We found that interleukin signaling, neutrophil degranulation, eukaryotic translation, collagen degradation, LGI–ADAM interactions, GABA receptor, and L1CAM-ankyrin interactions were prominent after SCI depending on the time post injury. We also performed gene–drug network analysis and found several potential antagonists and agonists which can be used to treat SCI. We expect to discover effective treatments in the clinical field through further studies revealing the efficacy and safety of potential drugs.
Collapse
Affiliation(s)
- Seyoung Mun
- Department of Nanobiomedical Science & BK21 NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 31116, Korea;
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea;
| | - Kyudong Han
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea;
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan 31116, Korea
| | - Jung Keun Hyun
- Department of Nanobiomedical Science & BK21 NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 31116, Korea;
- Department of Rehabilitation Medicine, College of Medicine, Dankook University, Cheonan 31116, Korea
- Institute of Tissue Regeneration Engineering (ITREN), Dankook University, Cheonan 31116, Korea
- Correspondence: ; Tel.: +82-10-2293-3415
| |
Collapse
|
8
|
Suresh N, Chinnakonda Ashok Kumar N, Subramanian S, Srinivasa G. Memory augmented recurrent neural networks for de-novo drug design. PLoS One 2022; 17:e0269461. [PMID: 35737661 PMCID: PMC9223405 DOI: 10.1371/journal.pone.0269461] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Accepted: 05/22/2022] [Indexed: 12/01/2022] Open
Abstract
A recurrent neural network (RNN) is a machine learning model that learns the relationship between elements of an input series, in addition to inferring a relationship between the data input to the model and target output. Memory augmentation allows the RNN to learn the interrelationships between elements of the input over a protracted length of the input series. Inspired by the success of stack augmented RNN (StackRNN) to generate strings for various applications, we present two memory augmented RNN-based architectures: the Neural Turing Machine (NTM) and the Differentiable Neural Computer (DNC) for the de-novo generation of small molecules. We trained a character-level convolutional neural network (CNN) to predict the properties of a generated string and compute a reward or loss in a deep reinforcement learning setup to bias the Generator to produce molecules with the desired property. Further, we compare the performance of these architectures to gain insight to their relative merits in terms of the validity and novelty of the generated molecules and the degree of property bias towards the computational generation of de-novo drugs. We also compare the performance of these architectures with simpler recurrent neural networks (Vanilla RNN, LSTM, and GRU) without an external memory component to explore the impact of augmented memory in the task of de-novo generation of small molecules.
Collapse
Affiliation(s)
- Naveen Suresh
- PES Center for Pattern Recognition and Department of Computer Science and Engineering, PES University, Bengaluru, Karnataka, India
| | - Neelesh Chinnakonda Ashok Kumar
- PES Center for Pattern Recognition and Department of Computer Science and Engineering, PES University, Bengaluru, Karnataka, India
| | - Srikumar Subramanian
- PES Center for Pattern Recognition and Department of Computer Science and Engineering, PES University, Bengaluru, Karnataka, India
| | - Gowri Srinivasa
- PES Center for Pattern Recognition and Department of Computer Science and Engineering, PES University, Bengaluru, Karnataka, India
- * E-mail:
| |
Collapse
|
9
|
Ahmed F, Lee JW, Samantasinghar A, Kim YS, Kim KH, Kang IS, Memon FH, Lim JH, Choi KH. SperoPredictor: An Integrated Machine Learning and Molecular Docking-Based Drug Repurposing Framework With Use Case of COVID-19. Front Public Health 2022; 10:902123. [PMID: 35784208 PMCID: PMC9244710 DOI: 10.3389/fpubh.2022.902123] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 05/02/2022] [Indexed: 12/13/2022] Open
Abstract
The global spread of the SARS coronavirus 2 (SARS-CoV-2), its manifestation in human hosts as a contagious disease, and its variants have induced a pandemic resulting in the deaths of over 6,000,000 people. Extensive efforts have been devoted to drug research to cure and refrain the spread of COVID-19, but only one drug has received FDA approval yet. Traditional drug discovery is inefficient, costly, and unable to react to pandemic threats. Drug repurposing represents an effective strategy for drug discovery and reduces the time and cost compared to de novo drug discovery. In this study, a generic drug repurposing framework (SperoPredictor) has been developed which systematically integrates the various types of drugs and disease data and takes the advantage of machine learning (Random Forest, Tree Ensemble, and Gradient Boosted Trees) to repurpose potential drug candidates against any disease of interest. Drug and disease data for FDA-approved drugs (n = 2,865), containing four drug features and three disease features, were collected from chemical and biological databases and integrated with the form of drug-disease association tables. The resulting dataset was split into 70% for training, 15% for testing, and the remaining 15% for validation. The testing and validation accuracies of the models were 99.3% for Random Forest and 99.03% for Tree Ensemble. In practice, SperoPredictor identified 25 potential drug candidates against 6 human host-target proteomes identified from a systematic review of journals. Literature-based validation indicated 12 of 25 predicted drugs (48%) have been already used for COVID-19 followed by molecular docking and re-docking which indicated 4 of 13 drugs (30%) as potential candidates against COVID-19 to be pre-clinically and clinically validated. Finally, SperoPredictor results illustrated the ability of the platform to be rapidly deployed to repurpose the drugs as a rapid response to emergent situations (like COVID-19 and other pandemics).
Collapse
Affiliation(s)
- Faheem Ahmed
- Department of Mechatronics Engineering, Jeju National University, Jeju, South Korea
| | - Jae Wook Lee
- Department of Mechatronics Engineering, Jeju National University, Jeju, South Korea
- BioSpero, Inc., Jeju, South Korea
| | | | | | - Kyung Hwan Kim
- Department of Mechatronics Engineering, Jeju National University, Jeju, South Korea
| | - In Suk Kang
- Department of Mechatronics Engineering, Jeju National University, Jeju, South Korea
| | - Fida Hussain Memon
- Department of Mechatronics Engineering, Jeju National University, Jeju, South Korea
| | - Jong Hwan Lim
- Department of Mechatronics Engineering, Jeju National University, Jeju, South Korea
| | - Kyung Hyun Choi
- Department of Mechatronics Engineering, Jeju National University, Jeju, South Korea
- BioSpero, Inc., Jeju, South Korea
| |
Collapse
|
10
|
Alqahtani A. Application of Artificial Intelligence in Discovery and Development of Anticancer and Antidiabetic Therapeutic Agents. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE : ECAM 2022; 2022:6201067. [PMID: 35509623 PMCID: PMC9060979 DOI: 10.1155/2022/6201067] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 03/17/2022] [Accepted: 04/05/2022] [Indexed: 11/18/2022]
Abstract
Spectacular developments in molecular and cellular biology have led to important discoveries in cancer research. Despite cancer is one of the major causes of morbidity and mortality globally, diabetes is one of the most leading sources of group of disorders. Artificial intelligence (AI) has been considered the fourth industrial revolution machine. The most major hurdles in drug discovery and development are the time and expenditures required to sustain the drug research pipeline. Large amounts of data can be explored and generated by AI, which can then be converted into useful knowledge. Because of this, the world's largest drug companies have already begun to use AI in their drug development research. In the present era, AI has a huge amount of potential for the rapid discovery and development of new anticancer drugs. Clinical studies, electronic medical records, high-resolution medical imaging, and genomic assessments are just a few of the tools that could aid drug development. Large data sets are available to researchers in the pharmaceutical and medical fields, which can be analyzed by advanced AI systems. This review looked at how computational biology and AI technologies may be utilized in cancer precision drug development by combining knowledge of cancer medicines, drug resistance, and structural biology. This review also highlighted a realistic assessment of the potential for AI in understanding and managing diabetes.
Collapse
Affiliation(s)
- Amal Alqahtani
- College of Medicine, Imam Abdulrahman Bin Faisal University, Dammam, 31541, Saudi Arabia
- Department of Basic Sciences, Deanship of Preparatory Year and Supporting Studies, Imam Abdulrahman Bin Faisal University, P.O. Box 1982, Dammam 34212, Saudi Arabia
| |
Collapse
|
11
|
Gao S, Han L, Luo D, Xiao Z, Liu G, Zhang Y, Zhou W. Deep Learning Applications for the accurate identification of low-transcriptional activity drugs and their mechanism of actions. Pharmacol Res 2022; 180:106225. [PMID: 35452801 DOI: 10.1016/j.phrs.2022.106225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 04/07/2022] [Accepted: 04/15/2022] [Indexed: 10/18/2022]
Abstract
Analysis of drug-induced expression profiles facilitated comprehensive understanding of drug properties. However, many compounds exhibit weak transcription responses though they mostly possess definite pharmacological effects. Actually, as a representative example, over 66.4% of 312,438 molecular signatures in the Library of Integrated Cellular Signatures (LINCS) database exhibit low-transcriptional activities (i.e. TAS-low signatures). When computing the association between TAS-low signatures with shared mechanism of actions (MOAs), commonly used algorithms showed inadequate performance with an average area under receiver operating characteristic curve (AUROC) of 0.55, but the computation accuracy of the same task can be improved by our developed tool Genetic profile activity relationship (GPAR) with an average AUROC of 0.68. Up to 36 out of 74 TAS-low MOAs were well trained with AUROC≥0.7 by GPAR, higher than those by other approaches. Further studies showed that GPAR benefited from the size of training samples more significantly than other approaches. Lastly, in biological validation of the MOA prediction for a TAS-low drug Tropisetron, we found an unreported mechanism that Tropisetron can bind to the glucocorticoid receptor. This study indicated that GPAR can serve as an effective approach for the accurate identification of low-transcriptional activity drugs and their MOAs, thus providing a good tool for drug repurposing with both TAS-low and TAS-high signatures.
Collapse
Affiliation(s)
- Shengqiao Gao
- Beijing Institute of Pharmacology and Toxicology, Beijing 100850, China; State Key Laboratory of Toxicology and Medical Countermeasures, Beijing 100850, China
| | - Lu Han
- Beijing Institute of Pharmacology and Toxicology, Beijing 100850, China; State Key Laboratory of Toxicology and Medical Countermeasures, Beijing 100850, China
| | - Dan Luo
- Beijing Institute of Pharmacology and Toxicology, Beijing 100850, China; State Key Laboratory of Toxicology and Medical Countermeasures, Beijing 100850, China
| | - Zhiyong Xiao
- Beijing Institute of Pharmacology and Toxicology, Beijing 100850, China; State Key Laboratory of Toxicology and Medical Countermeasures, Beijing 100850, China
| | - Gang Liu
- Beijing Institute of Pharmacology and Toxicology, Beijing 100850, China; State Key Laboratory of Toxicology and Medical Countermeasures, Beijing 100850, China
| | - Yongxiang Zhang
- Beijing Institute of Pharmacology and Toxicology, Beijing 100850, China; State Key Laboratory of Toxicology and Medical Countermeasures, Beijing 100850, China.
| | - Wenxia Zhou
- Beijing Institute of Pharmacology and Toxicology, Beijing 100850, China; State Key Laboratory of Toxicology and Medical Countermeasures, Beijing 100850, China.
| |
Collapse
|
12
|
Bundy JL, Judson R, Williams AJ, Grulke C, Shah I, Everett LJ. Predicting molecular initiating events using chemical target annotations and gene expression. BioData Min 2022; 15:7. [PMID: 35246223 PMCID: PMC8895536 DOI: 10.1186/s13040-022-00292-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 02/10/2022] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND The advent of high-throughput transcriptomic screening technologies has resulted in a wealth of publicly available gene expression data associated with chemical treatments. From a regulatory perspective, data sets that cover a large chemical space and contain reference chemicals offer utility for the prediction of molecular initiating events associated with chemical exposure. Here, we integrate data from a large compendium of transcriptomic responses to chemical exposure with a comprehensive database of chemical-protein associations to train binary classifiers that predict mechanism(s) of action from transcriptomic responses. First, we linked reference chemicals present in the LINCS L1000 gene expression data collection to chemical identifiers in RefChemDB, a database of chemical-protein interactions. Next, we trained binary classifiers on MCF7 human breast cancer cell line derived gene expression profiles and chemical-protein labels using six classification algorithms to identify optimal analysis parameters. To validate classifier accuracy, we used holdout data sets, training-excluded reference chemicals, and empirical significance testing of null models derived from permuted chemical-protein associations. To identify classifiers that have variable predicting performance across training data derived from different cellular contexts, we trained a separate set of binary classifiers on the PC3 human prostate cancer cell line. RESULTS We trained classifiers using expression data associated with chemical treatments linked to 51 molecular initiating events. This analysis identified and validated 9 high-performing classifiers with empirical p-values lower than 0.05 and internal accuracies ranging from 0.73 to 0.94 and holdout accuracies of 0.68 to 0.92. High-ranking predictions for training-excluded reference chemicals demonstrating that predictive accuracy extends beyond the set of chemicals used in classifier training. To explore differences in classifier performance as a function of training data cellular context, MCF7-trained classifier accuracies were compared to classifiers trained on the PC3 gene expression data for the same molecular initiating events. CONCLUSIONS This methodology can offer insight in prioritizing candidate perturbagens of interest for targeted screens. This approach can also help guide the selection of relevant cellular contexts for screening classes of candidate perturbagens using cell line specific model performance.
Collapse
Affiliation(s)
- Joseph L Bundy
- Biomolecular and Computational Toxicology Division, Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, 109 T.W. Alexander Drive, Durham, NC, 27709, USA
| | - Richard Judson
- Biomolecular and Computational Toxicology Division, Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, 109 T.W. Alexander Drive, Durham, NC, 27709, USA
| | - Antony J Williams
- Chemical Characterization and Exposure Division, Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, 109 T.W. Alexander Drive, Durham, NC, 27709, USA
| | - Chris Grulke
- Chemical Characterization and Exposure Division, Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, 109 T.W. Alexander Drive, Durham, NC, 27709, USA
| | - Imran Shah
- Biomolecular and Computational Toxicology Division, Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, 109 T.W. Alexander Drive, Durham, NC, 27709, USA
| | - Logan J Everett
- Biomolecular and Computational Toxicology Division, Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, 109 T.W. Alexander Drive, Durham, NC, 27709, USA.
| |
Collapse
|
13
|
Schuler J, Falls Z, Mangione W, Hudson ML, Bruggemann L, Samudrala R. Evaluating the performance of drug-repurposing technologies. Drug Discov Today 2022; 27:49-64. [PMID: 34400352 PMCID: PMC10014214 DOI: 10.1016/j.drudis.2021.08.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Revised: 06/20/2021] [Accepted: 08/08/2021] [Indexed: 01/22/2023]
Abstract
Drug-repurposing technologies are growing in number and maturing. However, comparisons to each other and to reality are hindered because of a lack of consensus with respect to performance evaluation. Such comparability is necessary to determine scientific merit and to ensure that only meaningful predictions from repurposing technologies carry through to further validation and eventual patient use. Here, we review and compare performance evaluation measures for these technologies using version 2 of our shotgun repurposing Computational Analysis of Novel Drug Opportunities (CANDO) platform to illustrate their benefits, drawbacks, and limitations. Understanding and using different performance evaluation metrics ensures robust cross-platform comparability, enabling us to continue to strive toward optimal repurposing by decreasing the time and cost of drug discovery and development.
Collapse
Affiliation(s)
- James Schuler
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, USA.
| | - Zackary Falls
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, USA
| | - William Mangione
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, USA
| | - Matthew L Hudson
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, USA
| | - Liana Bruggemann
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, USA
| | - Ram Samudrala
- Department of Biomedical Informatics, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY, USA.
| |
Collapse
|
14
|
Badkas A, De Landtsheer S, Sauter T. Topological network measures for drug repositioning. Brief Bioinform 2021; 22:bbaa357. [PMID: 33348366 PMCID: PMC8294518 DOI: 10.1093/bib/bbaa357] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 11/03/2020] [Accepted: 11/05/2020] [Indexed: 12/15/2022] Open
Abstract
Drug repositioning has received increased attention since the past decade as several blockbuster drugs have come out of repositioning. Computational approaches are significantly contributing to these efforts, of which, network-based methods play a key role. Various structural (topological) network measures have thereby contributed to uncovering unintuitive functional relationships and repositioning candidates in drug-disease and other networks. This review gives a broad overview of the topic, and offers perspectives on the application of topological measures for network analysis. It also discusses unexplored measures, and draws attention to a wider scope of application efforts, especially in drug repositioning.
Collapse
|
15
|
Luo Q, Mo S, Xue Y, Zhang X, Gu Y, Wu L, Zhang J, Sun L, Liu M, Hu Y. Novel deep learning-based transcriptome data analysis for drug-drug interaction prediction with an application in diabetes. BMC Bioinformatics 2021; 22:318. [PMID: 34116627 PMCID: PMC8194123 DOI: 10.1186/s12859-021-04241-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2021] [Accepted: 06/03/2021] [Indexed: 11/12/2022] Open
Abstract
Background Drug-drug interaction (DDI) is a serious public health issue. The L1000 database of the LINCS project has collected millions of genome-wide expressions induced by 20,000 small molecular compounds on 72 cell lines. Whether this unified and comprehensive transcriptome data resource can be used to build a better DDI prediction model is still unclear. Therefore, we developed and validated a novel deep learning model for predicting DDI using 89,970 known DDIs extracted from the DrugBank database (version 5.1.4). Results The proposed model consists of a graph convolutional autoencoder network (GCAN) for embedding drug-induced transcriptome data from the L1000 database of the LINCS project; and a long short-term memory (LSTM) for DDI prediction. Comparative evaluation of various machine learning methods demonstrated the superior performance of our proposed model for DDI prediction. Many of our predicted DDIs were revealed in the latest DrugBank database (version 5.1.7). In the case study, we predicted drugs interacting with sulfonylureas to cause hypoglycemia and drugs interacting with metformin to cause lactic acidosis, and showed both to induce effects on the proteins involved in the metabolic mechanism in vivo. Conclusions The proposed deep learning model can accelerate the discovery of new DDIs. It can support future clinical research for safer and more effective drug co-prescription. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04241-1.
Collapse
Affiliation(s)
- Qichao Luo
- Big Data Decision Institute, Jinan University, Guangzhou, 510632, China.,School of Management, Jinan University, Guangzhou, 510632, China
| | - Shenglong Mo
- Big Data Decision Institute, Jinan University, Guangzhou, 510632, China
| | - Yunfei Xue
- Big Data Decision Institute, Jinan University, Guangzhou, 510632, China
| | - Xiangzhou Zhang
- Big Data Decision Institute, Jinan University, Guangzhou, 510632, China
| | - Yuliang Gu
- Big Data Decision Institute, Jinan University, Guangzhou, 510632, China
| | - Lijuan Wu
- Big Data Decision Institute, Jinan University, Guangzhou, 510632, China
| | - Jia Zhang
- Department of Geriatrics, The First Affiliated Hospital of Chongqing Medical University, Chongqing, 400016, China
| | - Linyan Sun
- Xi'an Hospital of Traditional Chinese Medicine, Xi'an, 710021, China
| | - Mei Liu
- Division of Medical Informatics, Department of Internal Medicine, Medical Center, University of Kansas, Kansas City, KS, 66160, USA.
| | - Yong Hu
- Big Data Decision Institute, Jinan University, Guangzhou, 510632, China.
| |
Collapse
|
16
|
Zanin M, Aitya NA, Basilio J, Baumbach J, Benis A, Behera CK, Bucholc M, Castiglione F, Chouvarda I, Comte B, Dao TT, Ding X, Pujos-Guillot E, Filipovic N, Finn DP, Glass DH, Harel N, Iesmantas T, Ivanoska I, Joshi A, Boudjeltia KZ, Kaoui B, Kaur D, Maguire LP, McClean PL, McCombe N, de Miranda JL, Moisescu MA, Pappalardo F, Polster A, Prasad G, Rozman D, Sacala I, Sanchez-Bornot JM, Schmid JA, Sharp T, Solé-Casals J, Spiwok V, Spyrou GM, Stalidzans E, Stres B, Sustersic T, Symeonidis I, Tieri P, Todd S, Van Steen K, Veneva M, Wang DH, Wang H, Wang H, Watterson S, Wong-Lin K, Yang S, Zou X, Schmidt HH. An Early Stage Researcher's Primer on Systems Medicine Terminology. NETWORK AND SYSTEMS MEDICINE 2021; 4:2-50. [PMID: 33659919 PMCID: PMC7919422 DOI: 10.1089/nsm.2020.0003] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/27/2020] [Indexed: 12/19/2022] Open
Abstract
Background: Systems Medicine is a novel approach to medicine, that is, an interdisciplinary field that considers the human body as a system, composed of multiple parts and of complex relationships at multiple levels, and further integrated into an environment. Exploring Systems Medicine implies understanding and combining concepts coming from diametral different fields, including medicine, biology, statistics, modeling and simulation, and data science. Such heterogeneity leads to semantic issues, which may slow down implementation and fruitful interaction between these highly diverse fields. Methods: In this review, we collect and explain more than100 terms related to Systems Medicine. These include both modeling and data science terms and basic systems medicine terms, along with some synthetic definitions, examples of applications, and lists of relevant references. Results: This glossary aims at being a first aid kit for the Systems Medicine researcher facing an unfamiliar term, where he/she can get a first understanding of them, and, more importantly, examples and references for digging into the topic.
Collapse
Affiliation(s)
- Massimiliano Zanin
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Madrid, Spain
| | - Nadim A.A. Aitya
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - José Basilio
- Center for Physiology and Pharmacology, Institute of Vascular Biology and Thrombosis Research, Medical University of Vienna, Vienna, Austria
| | - Jan Baumbach
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Arriel Benis
- Faculty of Technology Management, Holon Institute of Technology (HIT), Holon, Israel
| | - Chandan K. Behera
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Magda Bucholc
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Filippo Castiglione
- CNR National Research Council, IAC Institute for Applied Computing, Rome, Italy
| | - Ioanna Chouvarda
- Lab of Computing, Medical Informatics, and Biomedical Imaging Technologies, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Blandine Comte
- Université Clermont Auvergne, INRAE, UNH, Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Clermont-Ferrand, France
| | - Tien-Tuan Dao
- Biomechanics and Bioengineering Laboratory (UMR CNRS 7338), Université de Technologie de Compiègne, Compiègne, France
- Labex MS2T “Control of Technological Systems-of-Systems,” CNRS and Université de Technologie de Compiègne, Compiègne, France
| | - Xuemei Ding
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Estelle Pujos-Guillot
- Université Clermont Auvergne, INRAE, UNH, Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Clermont-Ferrand, France
| | - Nenad Filipovic
- Faculty of Engineering, University of Kragujevac, Kragujevac, Serbia
- Bioengineering Research and Development Center (BioIRC), Kragujevac, Serbia
- Steinbeis Advanced Risk Technologies Institute doo Kragujevac, Kragujevac, Serbia
| | - David P. Finn
- Pharmacology and Therapeutics, School of Medicine, Galway Neuroscience Centre, National University of Ireland, Galway, Republic of Ireland
| | - David H. Glass
- School of Computing, Ulster University, Ulster, United Kingdom
| | - Nissim Harel
- Faculty of Sciences, Holon Institute of Technology (HIT), Holon, Israel
| | - Tomas Iesmantas
- Department of Mathematics and Natural Sciences, Kaunas University of Technology, Kaunas, Lithuania
| | - Ilinka Ivanoska
- Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, Macedonia
| | - Alok Joshi
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Karim Zouaoui Boudjeltia
- Laboratory of Experimental Medicine (ULB 222), Medicine Faculty, Université libre de Bruxelles, CHU de Charleroi, Charleroi, Belgium
| | - Badr Kaoui
- Biomechanics and Bioengineering Laboratory (UMR CNRS 7338), Université de Technologie de Compiègne, Compiègne, France
- Labex MS2T “Control of Technological Systems-of-Systems,” CNRS and Université de Technologie de Compiègne, Compiègne, France
| | - Daman Kaur
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Ulster, United Kingdom
| | - Liam P. Maguire
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Paula L. McClean
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Ulster, United Kingdom
| | - Niamh McCombe
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - João Luís de Miranda
- Escola Superior de Tecnologia e Gestão, Instituto Politécnico de Portalegre, Portalegre, Portugal
- Centro de Recursos Naturais e Ambiente (CERENA), Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal
| | | | | | - Annikka Polster
- Centre for Molecular Medicine Norway (NCMM), Forskningparken, Oslo, Norway
| | - Girijesh Prasad
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Damjana Rozman
- Centre for Functional Genomics and Bio-Chips, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Ioan Sacala
- Faculty of Automatic Control and Computers, University Politehnica of Bucharest, Bucharest, Romania
| | - Jose M. Sanchez-Bornot
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Johannes A. Schmid
- Center for Physiology and Pharmacology, Institute of Vascular Biology and Thrombosis Research, Medical University of Vienna, Vienna, Austria
| | - Trevor Sharp
- Department of Pharmacology, University of Oxford, Oxford, United Kingdom
| | - Jordi Solé-Casals
- Data and Signal Processing Research Group, University of Vic–Central University of Catalonia, Vic, Spain
- Department of Psychiatry, University of Cambridge, Cambridge, United Kingdom
- College of Artificial Intelligence, Nankai University, Tianjin, China
| | - Vojtěch Spiwok
- Department of Biochemistry and Microbiology, University of Chemistry and Technology, Prague, Czech Republic
| | - George M. Spyrou
- The Cyprus School of Molecular Medicine, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Egils Stalidzans
- Computational Systems Biology Group, Institute of Microbiology and Biotechnology, University of Latvia, Riga, Latvia
| | - Blaž Stres
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
- Faculty of Civil and Geodetic Engineering, University of Ljubljana, Ljubljana, Slovenia
- Department of Automation, Biocybernetics and Robotics, Jozef Stefan Institute, Ljubljana, Slovenia
| | - Tijana Sustersic
- Faculty of Engineering, University of Kragujevac, Kragujevac, Serbia
- Bioengineering Research and Development Center (BioIRC), Kragujevac, Serbia
- Steinbeis Advanced Risk Technologies Institute doo Kragujevac, Kragujevac, Serbia
| | - Ioannis Symeonidis
- Center for Research and Technology Hellas, Hellenic Institute of Transport, Thessaloniki, Greece
| | - Paolo Tieri
- CNR National Research Council, IAC Institute for Applied Computing, Rome, Italy
| | - Stephen Todd
- Altnagelvin Area Hospital, Western Health and Social Care Trust, Altnagelvin, United Kingdom
| | - Kristel Van Steen
- BIO3-Systems Genetics, GIGA-R, University of Liege, Liege, Belgium
- BIO3-Systems Medicine, Department of Human Genetics, KU Leuven, Leuven, Belgium
| | | | - Da-Hui Wang
- State Key Laboratory of Cognitive Neuroscience and Learning, and School of Systems Science, Beijing Normal University, Beijing, China
| | - Haiying Wang
- School of Computing, Ulster University, Ulster, United Kingdom
| | - Hui Wang
- School of Computing, Ulster University, Ulster, United Kingdom
| | - Steven Watterson
- Northern Ireland Centre for Stratified Medicine, Ulster University, Londonderry, United Kingdom
| | - KongFatt Wong-Lin
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Su Yang
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Ulster, United Kingdom
| | - Xin Zou
- Shanghai Centre for Systems Biomedicine, Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
| | - Harald H.H.W. Schmidt
- Faculty of Health, Medicine & Life Science, Maastricht University, Maastricht, The Netherlands
| |
Collapse
|
17
|
Tworowski D, Gorohovski A, Mukherjee S, Carmi G, Levy E, Detroja R, Mukherjee SB, Frenkel-Morgenstern M. COVID19 Drug Repository: text-mining the literature in search of putative COVID19 therapeutics. Nucleic Acids Res 2021; 49:D1113-D1121. [PMID: 33166390 PMCID: PMC7778969 DOI: 10.1093/nar/gkaa969] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 10/07/2020] [Accepted: 11/04/2020] [Indexed: 12/12/2022] Open
Abstract
The recent outbreak of COVID-19 has generated an enormous amount of Big Data. To date, the COVID-19 Open Research Dataset (CORD-19), lists ∼130,000 articles from the WHO COVID-19 database, PubMed Central, medRxiv, and bioRxiv, as collected by Semantic Scholar. According to LitCovid (11 August 2020), ∼40,300 COVID19-related articles are currently listed in PubMed. It has been shown in clinical settings that the analysis of past research results and the mining of available data can provide novel opportunities for the successful application of currently approved therapeutics and their combinations for the treatment of conditions caused by a novel SARS-CoV-2 infection. As such, effective responses to the pandemic require the development of efficient applications, methods and algorithms for data navigation, text-mining, clustering, classification, analysis, and reasoning. Thus, our COVID19 Drug Repository represents a modular platform for drug data navigation and analysis, with an emphasis on COVID-19-related information currently being reported. The COVID19 Drug Repository enables users to focus on different levels of complexity, starting from general information about (FDA-) approved drugs, PubMed references, clinical trials, recipes as well as the descriptions of molecular mechanisms of drugs' action. Our COVID19 drug repository provide a most updated world-wide collection of drugs that has been repurposed for COVID19 treatments around the world.
Collapse
Affiliation(s)
- Dmitry Tworowski
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| | - Alessandro Gorohovski
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| | - Sumit Mukherjee
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| | - Gon Carmi
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| | - Eliad Levy
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| | - Rajesh Detroja
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| | - Sunanda Biswas Mukherjee
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| | - Milana Frenkel-Morgenstern
- Laboratory of Cancer Genomics and Biocomputing of Complex Diseases, Azrieli Faculty of Medicine, Bar-Ilan University, Henrietta Szold 8, Safed 13195, Israel
| |
Collapse
|
18
|
Kim H, Kim E, Lee I, Bae B, Park M, Nam H. Artificial Intelligence in Drug Discovery: A Comprehensive Review of Data-driven and Machine Learning Approaches. BIOTECHNOL BIOPROC E 2021; 25:895-930. [PMID: 33437151 PMCID: PMC7790479 DOI: 10.1007/s12257-020-0049-y] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 05/27/2020] [Accepted: 06/03/2020] [Indexed: 02/07/2023]
Abstract
As expenditure on drug development increases exponentially, the overall drug discovery process requires a sustainable revolution. Since artificial intelligence (AI) is leading the fourth industrial revolution, AI can be considered as a viable solution for unstable drug research and development. Generally, AI is applied to fields with sufficient data such as computer vision and natural language processing, but there are many efforts to revolutionize the existing drug discovery process by applying AI. This review provides a comprehensive, organized summary of the recent research trends in AI-guided drug discovery process including target identification, hit identification, ADMET prediction, lead optimization, and drug repositioning. The main data sources in each field are also summarized in this review. In addition, an in-depth analysis of the remaining challenges and limitations will be provided, and proposals for promising future directions in each of the aforementioned areas.
Collapse
Affiliation(s)
- Hyunho Kim
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Eunyoung Kim
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Ingoo Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Bongsung Bae
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Minsu Park
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Hojung Nam
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| |
Collapse
|
19
|
Gao S, Han L, Luo D, Liu G, Xiao Z, Shan G, Zhang Y, Zhou W. Modeling drug mechanism of action with large scale gene-expression profiles using GPAR, an artificial intelligence platform. BMC Bioinformatics 2021; 22:17. [PMID: 33413089 PMCID: PMC7788535 DOI: 10.1186/s12859-020-03915-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 11/30/2020] [Indexed: 01/03/2023] Open
Abstract
Background Querying drug-induced gene expression profiles with machine learning method is an effective way for revealing drug mechanism of actions (MOAs), which is strongly supported by the growth of large scale and high-throughput gene expression databases. However, due to the lack of code-free and user friendly applications, it is not easy for biologists and pharmacologists to model MOAs with state-of-art deep learning approach. Results In this work, a newly developed online collaborative tool, Genetic profile-activity relationship (GPAR) was built to help modeling and predicting MOAs easily via deep learning. The users can use GPAR to customize their training sets to train self-defined MOA prediction models, to evaluate the model performances and to make further predictions automatically. Cross-validation tests show GPAR outperforms Gene set enrichment analysis in predicting MOAs. Conclusion GPAR can serve as a better approach in MOAs prediction, which may facilitate researchers to generate more reliable MOA hypothesis.
Collapse
Affiliation(s)
- Shengqiao Gao
- Beijing Institute of Pharmacology and Toxicology, State Key Laboratory of Toxicology and Medical Countermeasures, Beijing, 100850, China
| | - Lu Han
- Beijing Institute of Pharmacology and Toxicology, State Key Laboratory of Toxicology and Medical Countermeasures, Beijing, 100850, China
| | - Dan Luo
- Beijing Institute of Pharmacology and Toxicology, State Key Laboratory of Toxicology and Medical Countermeasures, Beijing, 100850, China
| | - Gang Liu
- Beijing Institute of Pharmacology and Toxicology, State Key Laboratory of Toxicology and Medical Countermeasures, Beijing, 100850, China
| | - Zhiyong Xiao
- Beijing Institute of Pharmacology and Toxicology, State Key Laboratory of Toxicology and Medical Countermeasures, Beijing, 100850, China
| | - Guangcun Shan
- School of Instrumentation Science and Opto-Electronics Engineering and Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, Beihang University, Beijing, 100083, China
| | - Yongxiang Zhang
- Beijing Institute of Pharmacology and Toxicology, State Key Laboratory of Toxicology and Medical Countermeasures, Beijing, 100850, China.
| | - Wenxia Zhou
- Beijing Institute of Pharmacology and Toxicology, State Key Laboratory of Toxicology and Medical Countermeasures, Beijing, 100850, China.
| |
Collapse
|
20
|
Hudson IL. Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology. Methods Mol Biol 2021; 2190:167-184. [PMID: 32804365 DOI: 10.1007/978-1-0716-0826-5_7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
While the term artificial intelligence and the concept of deep learning are not new, recent advances in high-performance computing, the availability of large annotated data sets required for training, and novel frameworks for implementing deep neural networks have led to an unprecedented acceleration of the field of molecular (network) biology and pharmacogenomics. The need to align biological data to innovative machine learning has stimulated developments in both data integration (fusion) and knowledge representation, in the form of heterogeneous, multiplex, and biological networks or graphs. In this chapter we briefly introduce several popular neural network architectures used in deep learning, namely, the fully connected deep neural network, recurrent neural network, convolutional neural network, and the autoencoder. Deep learning predictors, classifiers, and generators utilized in modern feature extraction may well assist interpretability and thus imbue AI tools with increased explication, potentially adding insights and advancements in novel chemistry and biology discovery.The capability of learning representations from structures directly without using any predefined structure descriptor is an important feature distinguishing deep learning from other machine learning methods and makes the traditional feature selection and reduction procedures unnecessary. In this chapter we briefly show how these technologies are applied for data integration (fusion) and analysis in drug discovery research covering these areas: (1) application of convolutional neural networks to predict ligand-protein interactions; (2) application of deep learning in compound property and activity prediction; (3) de novo design through deep learning. We also: (1) discuss some aspects of future development of deep learning in drug discovery/chemistry; (2) provide references to published information; (3) provide recently advocated recommendations on using artificial intelligence and deep learning in -omics research and drug discovery.
Collapse
Affiliation(s)
- Irene Lena Hudson
- Mathematical Sciences, School of Science, RMIT University, Melbourne, VIC, Australia.
| |
Collapse
|
21
|
Chiu YC, Chen HIH, Gorthi A, Mostavi M, Zheng S, Huang Y, Chen Y. Deep learning of pharmacogenomics resources: moving towards precision oncology. Brief Bioinform 2020; 21:2066-2083. [PMID: 31813953 PMCID: PMC7711267 DOI: 10.1093/bib/bbz144] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Revised: 08/22/2019] [Accepted: 10/18/2019] [Indexed: 12/13/2022] Open
Abstract
The recent accumulation of cancer genomic data provides an opportunity to understand how a tumor's genomic characteristics can affect its responses to drugs. This field, called pharmacogenomics, is a key area in the development of precision oncology. Deep learning (DL) methodology has emerged as a powerful technique to characterize and learn from rapidly accumulating pharmacogenomics data. We introduce the fundamentals and typical model architectures of DL. We review the use of DL in classification of cancers and cancer subtypes (diagnosis and treatment stratification of patients), prediction of drug response and drug synergy for individual tumors (treatment prioritization for a patient), drug repositioning and discovery and the study of mechanism/mode of action of treatments. For each topic, we summarize current genomics and pharmacogenomics data resources such as pan-cancer genomics data for cancer cell lines (CCLs) and tumors, and systematic pharmacologic screens of CCLs. By revisiting the published literature, including our in-house analyses, we demonstrate the unprecedented capability of DL enabled by rapid accumulation of data resources to decipher complex drug response patterns, thus potentially improving cancer medicine. Overall, this review provides an in-depth summary of state-of-the-art DL methods and up-to-date pharmacogenomics resources and future opportunities and challenges to realize the goal of precision oncology.
Collapse
Affiliation(s)
- Yu-Chiao Chiu
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Hung-I Harry Chen
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Aparna Gorthi
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Milad Mostavi
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX 78249, USA
| | - Siyuan Zheng
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
- Department of Population Health Sciences, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Yufei Huang
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX 78249, USA
- Department of Population Health Sciences, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| | - Yidong Chen
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX 78229, USA
- Department of Population Health Sciences, University of Texas Health San Antonio, San Antonio, TX 78229, USA
| |
Collapse
|
22
|
Xue Y, Ding MQ, Lu X. Learning to encode cellular responses to systematic perturbations with deep generative models. NPJ Syst Biol Appl 2020; 6:35. [PMID: 33159077 PMCID: PMC7648057 DOI: 10.1038/s41540-020-00158-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 10/07/2020] [Indexed: 11/09/2022] Open
Abstract
Cellular signaling systems play a vital role in maintaining homeostasis when a cell is exposed to different perturbations. Components of the systems are organized as hierarchical networks, and perturbing different components often leads to transcriptomic profiles that exhibit compositional statistical patterns. Mining such patterns to investigate how cellular signals are encoded is an important problem in systems biology, where artificial intelligence techniques can be of great assistance. Here, we investigated the capability of deep generative models (DGMs) to modeling signaling systems and learn representations of cellular states underlying transcriptomic responses to diverse perturbations. Specifically, we show that the variational autoencoder and the supervised vector-quantized variational autoencoder can accurately regenerate gene expression data in response to perturbagen treatments. The models can learn representations that reveal the relationships between different classes of perturbagens and enable mappings between drugs and their target genes. In summary, DGMs can adequately learn and depict how cellular signals are encoded. The resulting representations have broad applications, demonstrating the power of artificial intelligence in systems biology and precision medicine.
Collapse
Affiliation(s)
- Yifan Xue
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15206, USA
| | - Michael Q Ding
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15206, USA
| | - Xinghua Lu
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15206, USA.
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA, 15206, USA.
| |
Collapse
|
23
|
Zhao L, Ciallella HL, Aleksunes LM, Zhu H. Advancing computer-aided drug discovery (CADD) by big data and data-driven machine learning modeling. Drug Discov Today 2020; 25:1624-1638. [PMID: 32663517 PMCID: PMC7572559 DOI: 10.1016/j.drudis.2020.07.005] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 06/26/2020] [Accepted: 07/06/2020] [Indexed: 02/06/2023]
Abstract
Advancing a new drug to market requires substantial investments in time as well as financial resources. Crucial bioactivities for drug candidates, including their efficacy, pharmacokinetics (PK), and adverse effects, need to be investigated during drug development. With advancements in chemical synthesis and biological screening technologies over the past decade, a large amount of biological data points for millions of small molecules have been generated and are stored in various databases. These accumulated data, combined with new machine learning (ML) approaches, such as deep learning, have shown great potential to provide insights into relevant chemical structures to predict in vitro, in vivo, and clinical outcomes, thereby advancing drug discovery and development in the big data era.
Collapse
Affiliation(s)
- Linlin Zhao
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA
| | - Heather L Ciallella
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA
| | - Lauren M Aleksunes
- Department of Pharmacology and Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ 08854, USA
| | - Hao Zhu
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA; Department of Chemistry, Rutgers University, Camden, NJ 08102, USA.
| |
Collapse
|
24
|
Bhat R, Kaushik R, Singh A, DasGupta D, Jayaraj A, Soni A, Shandilya A, Shekhar V, Shekhar S, Jayaram B. A comprehensive automated computer-aided discovery pipeline from genomes to hit molecules. Chem Eng Sci 2020. [DOI: 10.1016/j.ces.2020.115711] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
25
|
Zhu Y, Che C, Jin B, Zhang N, Su C, Wang F. Knowledge-driven drug repurposing using a comprehensive drug knowledge graph. Health Informatics J 2020; 26:2737-2750. [DOI: 10.1177/1460458220937101] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Due to the huge costs associated with new drug discovery and development, drug repurposing has become an important complement to the traditional de novo approach. With the increasing number of public databases and the rapid development of analytical methodologies, computational approaches have gained great momentum in the field of drug repurposing. In this study, we introduce an approach to knowledge-driven drug repurposing based on a comprehensive drug knowledge graph. We design and develop a drug knowledge graph by systematically integrating multiple drug knowledge bases. We describe path- and embedding-based data representation methods of transforming information in the drug knowledge graph into valuable inputs to allow machine learning models to predict drug repurposing candidates. The evaluation demonstrates that the knowledge-driven approach can produce high predictive results for known diabetes mellitus treatments by only using treatment information on other diseases. In addition, this approach supports exploratory investigation through the review of meta paths that connect drugs with diseases. This knowledge-driven approach is an effective drug repurposing strategy supporting large-scale prediction and the investigation of case studies.
Collapse
Affiliation(s)
| | | | - Bo Jin
- Dalian University of Technology, China
| | | | | | | |
Collapse
|
26
|
Liu Z, Chen Q, Lan W, Liang J, Chen YPP, Chen B. A Survey of Network Embedding for Drug Analysis and Prediction. Curr Protein Pept Sci 2020; 22:CPPS-EPUB-107859. [PMID: 32614745 DOI: 10.2174/1389203721666200702145701] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2020] [Revised: 04/05/2020] [Accepted: 05/21/2020] [Indexed: 11/22/2022]
Abstract
Traditional network-based computational methods have shown good results in drug analysis and prediction. However, these methods are time consuming and lack universality, and it is difficult to exploit the auxiliary information of nodes and edges. Network embedding provides a promising way for alleviating the above problems by transforming network into a low-dimensional space while preserving network structure and auxiliary information. This thus facilitates the application of machine learning algorithms for subsequent processing. Network embedding has been introduced into drug analysis and prediction in the last few years, and has shown superior performance over traditional methods. However, there is no systematic review of this issue. This article offers a comprehensive survey of the primary network embedding methods and their applications in drug analysis and prediction. The network embedding technologies applied in homogeneous network and heterogeneous network are investigated and compared, including matrix decomposition, random walk, and deep learning. Especially, the Graph neural network (GNN) methods in deep learning are highlighted. Further, the applications of network embedding in drug similarity estimation, drug-target interaction prediction, adverse drug reactions prediction, protein function and therapeutic peptides prediction are discussed. Several future potential research directions are also discussed.
Collapse
Affiliation(s)
- Zhixian Liu
- School of Medical, Guangxi University, Nanning. China
| | - Qingfeng Chen
- School of Computer, Electronic and Information, Guangxi University, Nanning. China
| | - Wei Lan
- School of Computer, Electronic and Information, Guangxi University, Nanning. China
| | - Jiahai Liang
- School of Electronics and Information Engineering, Beibu Gulf University, Qinzhou. China
| | - Yi-Ping Phoebe Chen
- Department of Computer Science and Information Technology, La Trobe University, Melbourne. Australia
| | - Baoshan Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning. China
| |
Collapse
|
27
|
Qi X, Shen M, Fan P, Guo X, Wang T, Feng N, Zhang M, Sweet RA, Kirisci L, Wang L. The Performance of Gene Expression Signature-Guided Drug-Disease Association in Different Categories of Drugs and Diseases. Molecules 2020; 25:molecules25122776. [PMID: 32560162 PMCID: PMC7357095 DOI: 10.3390/molecules25122776] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 06/05/2020] [Accepted: 06/05/2020] [Indexed: 12/27/2022] Open
Abstract
A gene expression signature (GES) is a group of genes that shows a unique expression profile as a result of perturbations by drugs, genetic modification or diseases on the transcriptional machinery. The comparisons between GES profiles have been used to investigate the relationships between drugs, their targets and diseases with quite a few successful cases reported. Especially in the study of GES-guided drugs–disease associations, researchers believe that if a GES induced by a drug is opposite to a GES induced by a disease, the drug may have potential as a treatment of that disease. In this study, we data-mined the crowd extracted expression of differential signatures (CREEDS) database to evaluate the similarity between GES profiles from drugs and their indicated diseases. Our study aims to explore the application domains of GES-guided drug–disease associations through the analysis of the similarity of GES profiles on known pairs of drug–disease associations, thereby identifying subgroups of drugs/diseases that are suitable for GES-guided drug repositioning approaches. Our results supported our hypothesis that the GES-guided drug–disease association method is better suited for some subgroups or pathways such as drugs and diseases associated with the immune system, diseases of the nervous system, non-chemotherapy drugs or the mTOR signaling pathway.
Collapse
Affiliation(s)
- Xiguang Qi
- Department of Pharmaceutical Sciences, Computational Chemical Genomics Screening Center, University of Pittsburgh School of Pharmacy, 3501 Terrace St Pittsburgh, PA 15261, USA; (X.Q.); (M.S.); (P.F.); (X.G.)
| | - Mingzhe Shen
- Department of Pharmaceutical Sciences, Computational Chemical Genomics Screening Center, University of Pittsburgh School of Pharmacy, 3501 Terrace St Pittsburgh, PA 15261, USA; (X.Q.); (M.S.); (P.F.); (X.G.)
| | - Peihao Fan
- Department of Pharmaceutical Sciences, Computational Chemical Genomics Screening Center, University of Pittsburgh School of Pharmacy, 3501 Terrace St Pittsburgh, PA 15261, USA; (X.Q.); (M.S.); (P.F.); (X.G.)
| | - Xiaojiang Guo
- Department of Pharmaceutical Sciences, Computational Chemical Genomics Screening Center, University of Pittsburgh School of Pharmacy, 3501 Terrace St Pittsburgh, PA 15261, USA; (X.Q.); (M.S.); (P.F.); (X.G.)
| | - Tianqi Wang
- Department of Biological Sciences, University of Pittsburgh School of Arts & Sciences, Pittsburgh, PA 15260, USA;
| | - Ning Feng
- Division of Cardiology, Vascular Medicine Institute, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; (N.F.); (M.Z.)
| | - Manling Zhang
- Division of Cardiology, Vascular Medicine Institute, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA; (N.F.); (M.Z.)
| | - Robert A. Sweet
- Department of Neurology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
- Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
- Correspondence: (R.A.S.); (L.K.); (L.W.); Tel.: +1 412-624-8118 (L.K.); +1 412-383-6089 (R.A.S.)
| | - Levent Kirisci
- Department of Pharmaceutical Sciences, Computational Chemical Genomics Screening Center, University of Pittsburgh School of Pharmacy, 3501 Terrace St Pittsburgh, PA 15261, USA; (X.Q.); (M.S.); (P.F.); (X.G.)
- Correspondence: (R.A.S.); (L.K.); (L.W.); Tel.: +1 412-624-8118 (L.K.); +1 412-383-6089 (R.A.S.)
| | - Lirong Wang
- Department of Pharmaceutical Sciences, Computational Chemical Genomics Screening Center, University of Pittsburgh School of Pharmacy, 3501 Terrace St Pittsburgh, PA 15261, USA; (X.Q.); (M.S.); (P.F.); (X.G.)
- Correspondence: (R.A.S.); (L.K.); (L.W.); Tel.: +1 412-624-8118 (L.K.); +1 412-383-6089 (R.A.S.)
| |
Collapse
|
28
|
Qin Z, Xu Q, Hu H, Yu L, Zeng S. Extracellular Vesicles in Renal Cell Carcinoma: Multifaceted Roles and Potential Applications Identified by Experimental and Computational Methods. Front Oncol 2020; 10:724. [PMID: 32457844 PMCID: PMC7221139 DOI: 10.3389/fonc.2020.00724] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 04/16/2020] [Indexed: 12/24/2022] Open
Abstract
Renal cell carcinoma (RCC) is the most common type of kidney cancer. Increasingly evidences indicate that extracellular vesicles (EVs) orchestrate multiple processes in tumorigenesis, metastasis, immune evasion, and drug response of RCC. EVs are lipid membrane-bound vesicles in nanometer size and secreted by almost all cell types into the extracellular milieu. A myriad of bioactive molecules such as RNA, DNA, protein, and lipid are able to be delivered via EVs for the intercellular communication. Hence, the abundant content of EVs is appealing reservoir for biomarker identification through computational analysis and experimental validation. EVs with excellent biocompatibility and biodistribution are natural platforms that can be engineered to offer achievable drug delivery strategies for RCC therapies. Moreover, the multifaceted roles of EVs in RCC progression also provide substantial targets and facilitate EVs-based drug discovery, which will be accelerated by using artificial intelligence approaches. In this review, we summarized the vital roles of EVs in occurrence, metastasis, immune evasion, and drug resistance of RCC. Furthermore, we also recapitulated and prospected the EVs-based potential applications in RCC, including biomarker identification, drug vehicle development as well as drug target discovery.
Collapse
Affiliation(s)
| | | | | | | | - Su Zeng
- College of Pharmaceutical Sciences, Institute of Drug Metabolism and Pharmaceutical Analysis, Zhejiang University, Hangzhou, China
| |
Collapse
|
29
|
Baptista D, Ferreira PG, Rocha M. Deep learning for drug response prediction in cancer. Brief Bioinform 2020; 22:360-379. [PMID: 31950132 DOI: 10.1093/bib/bbz171] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 11/04/2019] [Indexed: 01/15/2023] Open
Abstract
Predicting the sensitivity of tumors to specific anti-cancer treatments is a challenge of paramount importance for precision medicine. Machine learning(ML) algorithms can be trained on high-throughput screening data to develop models that are able to predict the response of cancer cell lines and patients to novel drugs or drug combinations. Deep learning (DL) refers to a distinct class of ML algorithms that have achieved top-level performance in a variety of fields, including drug discovery. These types of models have unique characteristics that may make them more suitable for the complex task of modeling drug response based on both biological and chemical data, but the application of DL to drug response prediction has been unexplored until very recently. The few studies that have been published have shown promising results, and the use of DL for drug response prediction is beginning to attract greater interest from researchers in the field. In this article, we critically review recently published studies that have employed DL methods to predict drug response in cancer cell lines. We also provide a brief description of DL and the main types of architectures that have been used in these studies. Additionally, we present a selection of publicly available drug screening data resources that can be used to develop drug response prediction models. Finally, we also address the limitations of these approaches and provide a discussion on possible paths for further improvement. Contact: mrocha@di.uminho.pt.
Collapse
Affiliation(s)
| | | | - Miguel Rocha
- Department of Informatics and a Senior Researcher of the Centre of Biological Engineering at the University of Minho
| |
Collapse
|
30
|
Machine learning for target discovery in drug development. Curr Opin Chem Biol 2019; 56:16-22. [PMID: 31734566 DOI: 10.1016/j.cbpa.2019.10.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2019] [Revised: 10/01/2019] [Accepted: 10/03/2019] [Indexed: 12/15/2022]
Abstract
The discovery of macromolecular targets for bioactive agents is currently a bottleneck for the informed design of chemical probes and drug leads. Typically, activity profiling against genetically manipulated cell lines or chemical proteomics is pursued to shed light on their biology and deconvolute drug-target networks. By taking advantage of the ever-growing wealth of publicly available bioactivity data, learning algorithms now provide an attractive means to generate statistically motivated research hypotheses and thereby prioritize biochemical screens. Here, we highlight recent successes in machine intelligence for target identification and discuss challenges and opportunities for drug discovery.
Collapse
|
31
|
Moridi M, Ghadirinia M, Sharifi-Zarchi A, Zare-Mirakabad F. The assessment of efficient representation of drug features using deep learning for drug repositioning. BMC Bioinformatics 2019; 20:577. [PMID: 31726977 PMCID: PMC6854697 DOI: 10.1186/s12859-019-3165-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Accepted: 10/21/2019] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND De novo drug discovery is a time-consuming and expensive process. Nowadays, drug repositioning is utilized as a common strategy to discover a new drug indication for existing drugs. This strategy is mostly used in cases with a limited number of candidate pairs of drugs and diseases. In other words, they are not scalable to a large number of drugs and diseases. Most of the in-silico methods mainly focus on linear approaches while non-linear models are still scarce for new indication predictions. Therefore, applying non-linear computational approaches can offer an opportunity to predict possible drug repositioning candidates. RESULTS In this study, we present a non-linear method for drug repositioning. We extract four drug features and two disease features to find the semantic relations between drugs and diseases. We utilize deep learning to extract an efficient representation for each feature. These representations reduce the dimension and heterogeneity of biological data. Then, we assess the performance of different combinations of drug features to introduce a pipeline for drug repositioning. In the available database, there are different numbers of known drug-disease associations corresponding to each combination of drug features. Our assessment shows that as the numbers of drug features increase, the numbers of available drugs decrease. Thus, the proposed method with large numbers of drug features is as accurate as small numbers. CONCLUSION Our pipeline predicts new indications for existing drugs systematically, in a more cost-effective way and shorter timeline. We assess the pipeline to discover the potential drug-disease associations based on cross-validation experiments and some clinical trial studies.
Collapse
Affiliation(s)
- Mahroo Moridi
- Department of Mathematics and Computer Science, Amirkabir University of Technology, (Tehran Polytechnic), Tehran, Iran
| | - Marzieh Ghadirinia
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| | - Ali Sharifi-Zarchi
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| | - Fatemeh Zare-Mirakabad
- Department of Mathematics and Computer Science, Amirkabir University of Technology, (Tehran Polytechnic), Tehran, Iran.
| |
Collapse
|
32
|
Zhavoronkov A. Artificial Intelligence for Drug Discovery, Biomarker Development, and Generation of Novel Chemistry. Mol Pharm 2019; 15:4311-4313. [PMID: 30269508 DOI: 10.1021/acs.molpharmaceut.8b00930] [Citation(s) in RCA: 73] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Alex Zhavoronkov
- JHU , Insilico Medicine, Inc. , 9601 Medical Center Dr, Suite 127 , Rockville , Maryland 20850 , United States
| |
Collapse
|
33
|
Gilvary C, Madhukar N, Elkhader J, Elemento O. The Missing Pieces of Artificial Intelligence in Medicine. Trends Pharmacol Sci 2019; 40:555-564. [DOI: 10.1016/j.tips.2019.06.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Revised: 06/03/2019] [Accepted: 06/04/2019] [Indexed: 12/22/2022]
|
34
|
Masoudi-Sobhanzadeh Y, Omidi Y, Amanlou M, Masoudi-Nejad A. DrugR+: A comprehensive relational database for drug repurposing, combination therapy, and replacement therapy. Comput Biol Med 2019; 109:254-262. [PMID: 31096089 DOI: 10.1016/j.compbiomed.2019.05.006] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Revised: 04/26/2019] [Accepted: 05/06/2019] [Indexed: 12/14/2022]
Abstract
Drug repurposing or repositioning, which introduces new applications of the existing drugs, is an emerging field in drug discovery scope. To enhance the success rate of the research and development (R&D) process in a cost- and time-effective manner, a number of pharmaceutical companies worldwide have made tremendous investments. Besides, many researchers have proposed various methods and databases for the repurposing of various drugs. However, there is not a proper and well-organized database available. To this end, for the first time, we developed a new database based on DrugBank and KEGG data, which is named "DrugR+". Our developed database provides some advantages relative to the DrugBank, and its interface supplies new capabilities for both single and synthetic repositioning of drugs. Moreover, it includes four new datasets which can be used for predicting drug-target interactions using supervised machine learning methods. As a case study, we introduced novel applications of some drugs and discussed the obtained results. A comparison of several machine learning methods on the generated datasets has also been reported in the Supplementary File. Having included several normalized tables, DrugR + has been organized to provide key information on data structures for the repurposing and combining applications of drugs. It provides the SQL query capability for professional users and an appropriate method with different options for unprofessional users. Additionally, DrugR + consists of repurposing service that accepts a drug and proposes a list of potential drugs for some usages. Taken all, DrugR+ is a free web-based database and accessible using (http://www.drugr.ir), which can be updated through a map-reduce parallel processing method to provide the most relevant information.
Collapse
Affiliation(s)
- Yosef Masoudi-Sobhanzadeh
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Yadollah Omidi
- Research Center for Pharmaceutical Nanotechnology and Department of Pharmaceutics, Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Massoud Amanlou
- Drug Design and Development Research Center, The Institute of Pharmaceutical Sciences (TIPS), Tehran University of Medical Sciences, Tehran, 14176-53955, Iran
| | - Ali Masoudi-Nejad
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran. http://LBB.ut.ac.ir
| |
Collapse
|
35
|
Sun J, Yang J, Chi J, Ding X, Lv N. Identification of drug repurposing candidates based on a miRNA-mediated drug and pathway network for cardiac hypertrophy and acute myocardial infarction. Hum Genomics 2018; 12:52. [PMID: 30514363 PMCID: PMC6280539 DOI: 10.1186/s40246-018-0184-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Accepted: 11/25/2018] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Cardiac hypertrophy and acute myocardial infarction (AMI) are two common heart diseases worldwide. However, research is needed into the exact pathogenesis and effective treatment strategies for these diseases. Recently, microRNAs (miRNAs) have been suggested to regulate the pathological pathways of heart disease, indicating a potential role in novel treatments. RESULTS In our study, we constructed a miRNA-gene-drug network and analyzed its topological features. We also identified some significantly dysregulated miRNA-gene-drug triplets (MGDTs) in cardiac hypertrophy and AMI using a computational method. Then, we characterized the activity score profile features for MGDTs in cardiac hypertrophy and AMI. The functional analyses suggested that the genes in the network held special functions. We extracted an insulin-like growth factor 1 receptor-related subnetwork in cardiac hypertrophy and a vascular endothelial growth factor A-related subnetwork in AMI. Finally, we considered insulin-like growth factor 1 receptor and vascular endothelial growth factor A as two candidate drug targets by utilizing the cardiac hypertrophy and AMI pathways. CONCLUSION These results provide novel insights into the mechanisms and treatment of cardiac hypertrophy and AMI.
Collapse
Affiliation(s)
- Jiantao Sun
- Department of Cardiology, the First Affiliated Hospital, Harbin Medical University, Harbin, Heilongjiang People’s Republic of China
| | - Jiemei Yang
- Department of Cardiology, the First Affiliated Hospital, Harbin Medical University, Harbin, Heilongjiang People’s Republic of China
| | - Jing Chi
- Department of Cardiology, the First Affiliated Hospital, Harbin Medical University, Harbin, Heilongjiang People’s Republic of China
| | - Xue Ding
- Department of Cardiology, the First Affiliated Hospital, Harbin Medical University, Harbin, Heilongjiang People’s Republic of China
| | - Nan Lv
- Department of Obstetrics, the Second Affiliated Hospital, Harbin Medical University, 246 XueFu Road, Harbin, 150086 Heilongjiang People’s Republic of China
| |
Collapse
|