1
|
Anderson C, Bucholc M, McClean PL, Zhang SD. The Potential of a Stratified Approach to Drug Repurposing in Alzheimer's Disease. Biomolecules 2023; 14:11. [PMID: 38275752 PMCID: PMC10813465 DOI: 10.3390/biom14010011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 12/13/2023] [Accepted: 12/15/2023] [Indexed: 01/27/2024] Open
Abstract
Alzheimer's disease (AD) is a complex neurodegenerative condition that is characterized by the build-up of amyloid-beta plaques and neurofibrillary tangles. While multiple theories explaining the aetiology of the disease have been suggested, the underlying cause of the disease is still unknown. Despite this, several modifiable and non-modifiable factors that increase the risk of developing AD have been identified. To date, only eight AD drugs have ever gained regulatory approval, including six symptomatic and two disease-modifying drugs. However, not all are available in all countries and high costs associated with new disease-modifying biologics prevent large proportions of the patient population from accessing them. With the current patient population expected to triple by 2050, it is imperative that new, effective, and affordable drugs become available to patients. Traditional drug development strategies have a 99% failure rate in AD, which is far higher than in other disease areas. Even when a drug does reach the market, additional barriers such as high cost and lack of accessibility prevent patients from benefiting from them. In this review, we discuss how a stratified medicine drug repurposing approach may address some of the limitations and barriers that traditional strategies face in relation to drug development in AD. We believe that novel, stratified drug repurposing studies may expedite the discovery of alternative, effective, and more affordable treatment options for a rapidly expanding patient population in comparison with traditional drug development methods.
Collapse
Affiliation(s)
- Chloe Anderson
- Personalised Medicine Centre, School of Medicine, Altnagelvin Hospital Campus, Ulster University, Glenshane Road, Derry/Londonderry BT47 6SB, UK;
| | - Magda Bucholc
- School of Computing, Engineering and Intelligent Systems, Magee Campus, Ulster University, Northland Road, Derry/Londonderry BT48 7JL, UK
| | - Paula L. McClean
- Personalised Medicine Centre, School of Medicine, Altnagelvin Hospital Campus, Ulster University, Glenshane Road, Derry/Londonderry BT47 6SB, UK;
| | - Shu-Dong Zhang
- Personalised Medicine Centre, School of Medicine, Altnagelvin Hospital Campus, Ulster University, Glenshane Road, Derry/Londonderry BT47 6SB, UK;
| |
Collapse
|
2
|
Martínez-García M, Hernández-Lemus E. Data Integration Challenges for Machine Learning in Precision Medicine. Front Med (Lausanne) 2022; 8:784455. [PMID: 35145977 PMCID: PMC8821900 DOI: 10.3389/fmed.2021.784455] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 12/28/2021] [Indexed: 12/19/2022] Open
Abstract
A main goal of Precision Medicine is that of incorporating and integrating the vast corpora on different databases about the molecular and environmental origins of disease, into analytic frameworks, allowing the development of individualized, context-dependent diagnostics, and therapeutic approaches. In this regard, artificial intelligence and machine learning approaches can be used to build analytical models of complex disease aimed at prediction of personalized health conditions and outcomes. Such models must handle the wide heterogeneity of individuals in both their genetic predisposition and their social and environmental determinants. Computational approaches to medicine need to be able to efficiently manage, visualize and integrate, large datasets combining structure, and unstructured formats. This needs to be done while constrained by different levels of confidentiality, ideally doing so within a unified analytical architecture. Efficient data integration and management is key to the successful application of computational intelligence approaches to medicine. A number of challenges arise in the design of successful designs to medical data analytics under currently demanding conditions of performance in personalized medicine, while also subject to time, computational power, and bioethical constraints. Here, we will review some of these constraints and discuss possible avenues to overcome current challenges.
Collapse
Affiliation(s)
- Mireya Martínez-García
- Clinical Research Division, National Institute of Cardiology ‘Ignacio Chávez’, Mexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine (INMEGEN), Mexico City, Mexico
- Center for Complexity Sciences, Universidad Nacional Autnoma de Mexico, Mexico City, Mexico
| |
Collapse
|
3
|
Scaling computational genomics to millions of individuals with GPUs. Genome Biol 2019; 20:228. [PMID: 31675989 PMCID: PMC6823959 DOI: 10.1186/s13059-019-1836-7] [Citation(s) in RCA: 90] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Accepted: 09/25/2019] [Indexed: 12/26/2022] Open
Abstract
Current genomics methods are designed to handle tens to thousands of samples but will need to scale to millions to match the pace of data and hypothesis generation in biomedical science. Here, we show that high efficiency at low cost can be achieved by leveraging general-purpose libraries for computing using graphics processing units (GPUs), such as PyTorch and TensorFlow. We demonstrate > 200-fold decreases in runtime and ~ 5–10-fold reductions in cost relative to CPUs. We anticipate that the accessibility of these libraries will lead to a widespread adoption of GPUs in computational genomics.
Collapse
|
4
|
Keenan AB, Wojciechowicz ML, Wang Z, Jagodnik KM, Jenkins SL, Lachmann A, Ma'ayan A. Connectivity Mapping: Methods and Applications. Annu Rev Biomed Data Sci 2019. [DOI: 10.1146/annurev-biodatasci-072018-021211] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Connectivity mapping resources consist of signatures representing changes in cellular state following systematic small-molecule, disease, gene, or other form of perturbations. Such resources enable the characterization of signatures from novel perturbations based on similarity; provide a global view of the space of many themed perturbations; and allow the ability to predict cellular, tissue, and organismal phenotypes for perturbagens. A signature search engine enables hypothesis generation by finding connections between query signatures and the database of signatures. This framework has been used to identify connections between small molecules and their targets, to discover cell-specific responses to perturbations and ways to reverse disease expression states with small molecules, and to predict small-molecule mimickers for existing drugs. This review provides a historical perspective and the current state of connectivity mapping resources with a focus on both methodology and community implementations.
Collapse
Affiliation(s)
- Alexandra B. Keenan
- Department of Pharmacological Sciences and Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Megan L. Wojciechowicz
- Department of Pharmacological Sciences and Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Zichen Wang
- Department of Pharmacological Sciences and Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Kathleen M. Jagodnik
- Department of Pharmacological Sciences and Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Sherry L. Jenkins
- Department of Pharmacological Sciences and Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Alexander Lachmann
- Department of Pharmacological Sciences and Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Avi Ma'ayan
- Department of Pharmacological Sciences and Mount Sinai Center for Bioinformatics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| |
Collapse
|
5
|
Banegas-Luna AJ, Imbernón B, Llanes Castro A, Pérez-Garrido A, Cerón-Carrasco JP, Gesing S, Merelli I, D'Agostino D, Pérez-Sánchez H. Advances in distributed computing with modern drug discovery. Expert Opin Drug Discov 2018; 14:9-22. [PMID: 30484337 DOI: 10.1080/17460441.2019.1552936] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
INTRODUCTION Computational chemistry dramatically accelerates the drug discovery process and high-performance computing (HPC) can be used to speed up the most expensive calculations. Supporting a local HPC infrastructure is both costly and time-consuming, and, therefore, many research groups are moving from in-house solutions to remote-distributed computing platforms. Areas covered: The authors focus on the use of distributed technologies, solutions, and infrastructures to gain access to HPC capabilities, software tools, and datasets to run the complex simulations required in computational drug discovery (CDD). Expert opinion: The use of computational tools can decrease the time to market of new drugs. HPC has a crucial role in handling the complex algorithms and large volumes of data required to achieve specificity and avoid undesirable side-effects. Distributed computing environments have clear advantages over in-house solutions in terms of cost and sustainability. The use of infrastructures relying on virtualization reduces set-up costs. Distributed computing resources can be difficult to access, although web-based solutions are becoming increasingly available. There is a trade-off between cost-effectiveness and accessibility in using on-demand computing resources rather than free/academic resources. Graphics processing unit computing, with its outstanding parallel computing power, is becoming increasingly important.
Collapse
Affiliation(s)
- Antonio Jesús Banegas-Luna
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Baldomero Imbernón
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Antonio Llanes Castro
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Alfonso Pérez-Garrido
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - José Pedro Cerón-Carrasco
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Sandra Gesing
- b Center for Research Computing , University of Notre Dame , Notre Dame , IN , USA
| | - Ivan Merelli
- c Institute for Biomedical Technologies , National Research Council of Italy , Segrate (Milan) , Italy
| | - Daniele D'Agostino
- d Institute for Applied Mathematics and Information Technologies "E. Magenes" , National Research Council of Italy , Genoa , Italy
| | - Horacio Pérez-Sánchez
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| |
Collapse
|
6
|
Musa A, Ghoraie LS, Zhang SD, Glazko G, Yli-Harja O, Dehmer M, Haibe-Kains B, Emmert-Streib F. A review of connectivity map and computational approaches in pharmacogenomics. Brief Bioinform 2018; 19:506-523. [PMID: 28069634 PMCID: PMC5952941 DOI: 10.1093/bib/bbw112] [Citation(s) in RCA: 93] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Large-scale perturbation databases, such as Connectivity Map (CMap) or Library of Integrated Network-based Cellular Signatures (LINCS), provide enormous opportunities for computational pharmacogenomics and drug design. A reason for this is that in contrast to classical pharmacology focusing at one target at a time, the transcriptomics profiles provided by CMap and LINCS open the door for systems biology approaches on the pathway and network level. In this article, we provide a review of recent developments in computational pharmacogenomics with respect to CMap and LINCS and related applications.
Collapse
Affiliation(s)
- Aliyu Musa
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Laleh Soltan Ghoraie
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
| | - Shu-Dong Zhang
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, University of Ulster, C-TRIC Building, Altnagelvin Area Hospital, Glenshane Road, Derry/Londonderry, Northern Ireland, UK
| | - Galina Glazko
- University of Rochester Department of Biostatistics and Computational Biology, Rochester, New York, USA
| | - Olli Yli-Harja
- Computational Systems Biology, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Matthias Dehmer
- Institute for Bioinformatics and Translational Research, UMIT- The Health and Life Sciences University, Eduard Wallnoefer Zentrum 1, Hall in Tyrol, Austria
| | - Benjamin Haibe-Kains
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Ontario Institute of Cancer Research, Toronto, ON, Canada
| | - Frank Emmert-Streib
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| |
Collapse
|
7
|
Musa A, Ghoraie LS, Zhang SD, Glazko G, Yli-Harja O, Dehmer M, Haibe-Kains B, Emmert-Streib F. A review of connectivity map and computational approaches in pharmacogenomics. Brief Bioinform 2018. [PMID: 28069634 DOI: 10.1093/bib] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2023] Open
Abstract
Large-scale perturbation databases, such as Connectivity Map (CMap) or Library of Integrated Network-based Cellular Signatures (LINCS), provide enormous opportunities for computational pharmacogenomics and drug design. A reason for this is that in contrast to classical pharmacology focusing at one target at a time, the transcriptomics profiles provided by CMap and LINCS open the door for systems biology approaches on the pathway and network level. In this article, we provide a review of recent developments in computational pharmacogenomics with respect to CMap and LINCS and related applications.
Collapse
Affiliation(s)
- Aliyu Musa
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Laleh Soltan Ghoraie
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
| | - Shu-Dong Zhang
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, University of Ulster, C-TRIC Building, Altnagelvin Area Hospital, Glenshane Road, Derry/Londonderry BT47 6SB, Northern Ireland, UK
| | - Galina Glazko
- University of Rochester Department of Biostatistics and Computational Biology, Rochester, New York 14642, USA
| | - Olli Yli-Harja
- Computational Systems Biology, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Matthias Dehmer
- Institute for Bioinformatics and Translational Research, UMIT- The Health and Life Sciences University, Eduard Wallnoefer Zentrum 1, 6060 Hall in Tyrol, Austria
| | - Benjamin Haibe-Kains
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Ontario Institute of Cancer Research, Toronto, ON, Canada
| | - Frank Emmert-Streib
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| |
Collapse
|
8
|
Wen Q, Dunne PD, O’Reilly PG, Li G, Bjourson AJ, McArt DG, Hamilton PW, Zhang SD. KRAS mutant colorectal cancer gene signatures identified angiotensin II receptor blockers as potential therapies. Oncotarget 2017; 8:3206-3225. [PMID: 27965461 PMCID: PMC5356876 DOI: 10.18632/oncotarget.13884] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Accepted: 11/30/2016] [Indexed: 01/13/2023] Open
Abstract
Colorectal cancer (CRC) is a life-threatening disease with high prevalence and mortality worldwide. The KRAS oncogene is mutated in approximately 40% of CRCs. While antibody based EGFR inhibitors (cetuximab and panitumumab) represent a major treatment strategy for advanced KRAS wild type (KRAS-WT) CRCs, there still remains no effective therapeutic course for advanced KRAS mutant (KRAS-MT) CRC patients.In this study, we employed a novel and comprehensive approach of gene expression connectivity mapping (GECM) to identify candidate compounds to target KRAS-MT tumors. We first created a combined KRAS-MT gene signature with 248 ranked significant genes using 677 CRC clinical samples. A series of 248 sub-signatures was then created containing an increasing number of the top ranked genes. As an input to GECM analysis, each sub-signature was translated into a statistically significant therapeutic drugs list, which was finally combined to obtain a single list of significant drugs.We identify four antihypertensive angiotensin II receptor blockers (ARBs) within the top 30 significant drugs indicating that these drugs have a mechanism of action that can alter the KRAS-MT CRC oncogenic signaling. A hypergeometric test (p-value = 6.57 × 10-6) confirmed that ARBs are significantly enriched in our results. These findings support the hypothesis that ARB antihypertensive drugs may directly block KRAS signaling resulting in improvement in patient outcome or, through a reversion to a KRAS wild-type phenotype, improve the response to anti-EGFR treatment. Antihypertensive angiotensin II receptor blockers are therefore worth further investigation as potential therapeutic candidates in this difficult category of advanced colorectal cancers.
Collapse
Affiliation(s)
- Qing Wen
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, UK
| | - Philip D. Dunne
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, UK
| | - Paul G. O’Reilly
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, UK
| | - Gerald Li
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, UK
| | - Anthony J. Bjourson
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, C-TRIC, Londonderry, UK
| | - Darragh G. McArt
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, UK
| | - Peter W. Hamilton
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, UK
| | - Shu-Dong Zhang
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, UK
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, C-TRIC, Londonderry, UK
| |
Collapse
|
9
|
Kashyap H, Ahmed HA, Hoque N, Roy S, Bhattacharyya DK. Big data analytics in bioinformatics: architectures, techniques, tools and issues. ACTA ACUST UNITED AC 2016. [DOI: 10.1007/s13721-016-0135-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
10
|
O'Reilly PG, Wen Q, Bankhead P, Dunne PD, McArt DG, McPherson S, Hamilton PW, Mills KI, Zhang SD. QUADrATiC: scalable gene expression connectivity mapping for repurposing FDA-approved therapeutics. BMC Bioinformatics 2016; 17:198. [PMID: 27143038 PMCID: PMC4855472 DOI: 10.1186/s12859-016-1062-1] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2016] [Accepted: 04/22/2016] [Indexed: 12/19/2022] Open
Abstract
Background Gene expression connectivity mapping has proven to be a powerful and flexible tool for research. Its application has been shown in a broad range of research topics, most commonly as a means of identifying potential small molecule compounds, which may be further investigated as candidates for repurposing to treat diseases. The public release of voluminous data from the Library of Integrated Cellular Signatures (LINCS) programme further enhanced the utilities and potentials of gene expression connectivity mapping in biomedicine. Results We describe QUADrATiC (http://go.qub.ac.uk/QUADrATiC), a user-friendly tool for the exploration of gene expression connectivity on the subset of the LINCS data set corresponding to FDA-approved small molecule compounds. It enables the identification of compounds for repurposing therapeutic potentials. The software is designed to cope with the increased volume of data over existing tools, by taking advantage of multicore computing architectures to provide a scalable solution, which may be installed and operated on a range of computers, from laptops to servers. This scalability is provided by the use of the modern concurrent programming paradigm provided by the Akka framework. The QUADrATiC Graphical User Interface (GUI) has been developed using advanced Javascript frameworks, providing novel visualization capabilities for further analysis of connections. There is also a web services interface, allowing integration with other programs or scripts. Conclusions QUADrATiC has been shown to provide an improvement over existing connectivity map software, in terms of scope (based on the LINCS data set), applicability (using FDA-approved compounds), usability and speed. It offers potential to biological researchers to analyze transcriptional data and generate potential therapeutics for focussed study in the lab. QUADrATiC represents a step change in the process of investigating gene expression connectivity and provides more biologically-relevant results than previous alternative solutions. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1062-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Paul G O'Reilly
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK
| | - Qing Wen
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK
| | - Peter Bankhead
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK
| | - Philip D Dunne
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK
| | - Darragh G McArt
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK
| | - Suzanne McPherson
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK
| | - Peter W Hamilton
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK
| | - Ken I Mills
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK.
| | - Shu-Dong Zhang
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, BT9 7AE, UK. .,Northern Ireland Centre for Stratified Medicine, University of Ulster, C-TRIC Building, Altnagelvin Hospital campus, Glenshane Road, Derry/Londonderry, BT47 6SB, UK.
| |
Collapse
|
11
|
Connectivity mapping using a combined gene signature from multiple colorectal cancer datasets identified candidate drugs including existing chemotherapies. BMC SYSTEMS BIOLOGY 2015; 9 Suppl 5:S4. [PMID: 26356760 PMCID: PMC4565135 DOI: 10.1186/1752-0509-9-s5-s4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
BACKGROUND While the discovery of new drugs is a complex, lengthy and costly process, identifying new uses for existing drugs is a cost-effective approach to therapeutic discovery. Connectivity mapping integrates gene expression profiling with advanced algorithms to connect genes, diseases and small molecule compounds and has been applied in a large number of studies to identify potential drugs, particularly to facilitate drug repurposing. Colorectal cancer (CRC) is a commonly diagnosed cancer with high mortality rates, presenting a worldwide health problem. With the advancement of high throughput omics technologies, a number of large scale gene expression profiling studies have been conducted on CRCs, providing multiple datasets in gene expression data repositories. In this work, we systematically apply gene expression connectivity mapping to multiple CRC datasets to identify candidate therapeutics to this disease. RESULTS We developed a robust method to compile a combined gene signature for colorectal cancer across multiple datasets. Connectivity mapping analysis with this signature of 148 genes identified 10 candidate compounds, including irinotecan and etoposide, which are chemotherapy drugs currently used to treat CRCs. These results indicate that we have discovered high quality connections between the CRC disease state and the candidate compounds, and that the gene signature we created may be used as a potential therapeutic target in treating the disease. The method we proposed is highly effective in generating quality gene signature through multiple datasets; the publication of the combined CRC gene signature and the list of candidate compounds from this work will benefit both cancer and systems biology research communities for further development and investigations.
Collapse
|
12
|
Alyass A, Turcotte M, Meyre D. From big data analysis to personalized medicine for all: challenges and opportunities. BMC Med Genomics 2015; 8:33. [PMID: 26112054 PMCID: PMC4482045 DOI: 10.1186/s12920-015-0108-y] [Citation(s) in RCA: 228] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 06/15/2015] [Indexed: 02/07/2023] Open
Abstract
Recent advances in high-throughput technologies have led to the emergence of systems biology as a holistic science to achieve more precise modeling of complex diseases. Many predict the emergence of personalized medicine in the near future. We are, however, moving from two-tiered health systems to a two-tiered personalized medicine. Omics facilities are restricted to affluent regions, and personalized medicine is likely to widen the growing gap in health systems between high and low-income countries. This is mirrored by an increasing lag between our ability to generate and analyze big data. Several bottlenecks slow-down the transition from conventional to personalized medicine: generation of cost-effective high-throughput data; hybrid education and multidisciplinary teams; data storage and processing; data integration and interpretation; and individual and global economic relevance. This review provides an update of important developments in the analysis of big data and forward strategies to accelerate the global transition to personalized medicine.
Collapse
Affiliation(s)
- Akram Alyass
- Department of Clinical Epidemiology and Biostatistics, McMaster University, 1280 Main Street West, Hamilton, ON, Canada.
| | - Michelle Turcotte
- Department of Clinical Epidemiology and Biostatistics, McMaster University, 1280 Main Street West, Hamilton, ON, Canada.
| | - David Meyre
- Department of Clinical Epidemiology and Biostatistics, McMaster University, 1280 Main Street West, Hamilton, ON, Canada.
- Department of Pathology and Molecular Medicine, McMaster University, 1280 Main Street West, Hamilton, ON, Canada.
| |
Collapse
|
13
|
Manzotti G, Parenti S, Ferrari-Amorotti G, Soliera AR, Cattelani S, Montanari M, Cavalli D, Ertel A, Grande A, Calabretta B. Monocyte-macrophage differentiation of acute myeloid leukemia cell lines by small molecules identified through interrogation of the Connectivity Map database. Cell Cycle 2015; 14:2578-89. [PMID: 26102293 DOI: 10.1080/15384101.2015.1033591] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
The transcription factor C/EBPα is required for granulocytic differentiation of normal myeloid progenitors and is frequently inactivated in acute myeloid leukemia (AML) cells. Ectopic expression of C/EBPα in AML cells suppresses proliferation and induces differentiation suggesting that restoring C/EBPα expression/activity in AML cells could be therapeutically useful. Unfortunately, current approaches of gene or protein delivery in leukemic cells are unsatisfactory. However, "drug repurposing" is becoming a very attractive strategy to identify potential new uses for existing drugs. In this study, we assessed the biological effects of candidate C/EBPα-mimetics identified by interrogation of the Connectivity Map database. We found that amantadine, an antiviral and anti-Parkinson agent, induced a monocyte-macrophage-like differentiation of HL60, U937, Kasumi-1 myeloid leukemia cell lines, as indicated by morphology and differentiation antigen expression, when used in combination with suboptimal concentration of all trans retinoic acid (ATRA) or Vit D3. The effect of amantadine depends, in part, on increased activity of the vitamin D receptor (VDR), since it induced VDR expression and amantadine-dependent monocyte-macrophage differentiation of HL60 cells was blocked by expression of dominant-negative VDR. These results reveal a new function for amantadine and support the concept that screening of the Connectivity Map database can identify small molecules that mimic the effect of transcription factors required for myelo-monocytic differentiation.
Collapse
Affiliation(s)
- Gloria Manzotti
- a Department of Diagnostic and Clinical Medicine and Public Health ; University of Modena and R. Emilia ; Modena , Italy
| | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Mejía-Roa E, Tabas-Madrid D, Setoain J, García C, Tirado F, Pascual-Montano A. NMF-mGPU: non-negative matrix factorization on multi-GPU systems. BMC Bioinformatics 2015; 16:43. [PMID: 25887585 PMCID: PMC4339678 DOI: 10.1186/s12859-015-0485-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 01/30/2015] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND In the last few years, the Non-negative Matrix Factorization ( NMF ) technique has gained a great interest among the Bioinformatics community, since it is able to extract interpretable parts from high-dimensional datasets. However, the computing time required to process large data matrices may become impractical, even for a parallel application running on a multiprocessors cluster. In this paper, we present NMF-mGPU, an efficient and easy-to-use implementation of the NMF algorithm that takes advantage of the high computing performance delivered by Graphics-Processing Units ( GPUs ). Driven by the ever-growing demands from the video-games industry, graphics cards usually provided in PCs and laptops have evolved from simple graphics-drawing platforms into high-performance programmable systems that can be used as coprocessors for linear-algebra operations. However, these devices may have a limited amount of on-board memory, which is not considered by other NMF implementations on GPU. RESULTS NMF-mGPU is based on CUDA ( Compute Unified Device Architecture ), the NVIDIA's framework for GPU computing. On devices with low memory available, large input matrices are blockwise transferred from the system's main memory to the GPU's memory, and processed accordingly. In addition, NMF-mGPU has been explicitly optimized for the different CUDA architectures. Finally, platforms with multiple GPUs can be synchronized through MPI ( Message Passing Interface ). In a four-GPU system, this implementation is about 120 times faster than a single conventional processor, and more than four times faster than a single GPU device (i.e., a super-linear speedup). CONCLUSIONS Applications of GPUs in Bioinformatics are getting more and more attention due to their outstanding performance when compared to traditional processors. In addition, their relatively low price represents a highly cost-effective alternative to conventional clusters. In life sciences, this results in an excellent opportunity to facilitate the daily work of bioinformaticians that are trying to extract biological meaning out of hundreds of gigabytes of experimental information. NMF-mGPU can be used "out of the box" by researchers with little or no expertise in GPU programming in a variety of platforms, such as PCs, laptops, or high-end GPU clusters. NMF-mGPU is freely available at https://github.com/bioinfo-cnb/bionmf-gpu .
Collapse
Affiliation(s)
- Edgardo Mejía-Roa
- ArTeCS Group, Department of Computer Architecture, Complutense University of Madrid (UCM), Madrid, 28040, Spain.
| | - Daniel Tabas-Madrid
- Functional Bioinformatics Group, Biocomputing Unit, National Center for Biotechnology-CSIC, UAM, Madrid, 28049, Spain.
| | - Javier Setoain
- Functional Bioinformatics Group, Biocomputing Unit, National Center for Biotechnology-CSIC, UAM, Madrid, 28049, Spain.
| | - Carlos García
- ArTeCS Group, Department of Computer Architecture, Complutense University of Madrid (UCM), Madrid, 28040, Spain.
| | - Francisco Tirado
- ArTeCS Group, Department of Computer Architecture, Complutense University of Madrid (UCM), Madrid, 28040, Spain.
| | - Alberto Pascual-Montano
- Functional Bioinformatics Group, Biocomputing Unit, National Center for Biotechnology-CSIC, UAM, Madrid, 28049, Spain.
| |
Collapse
|
15
|
Liang M, Zhang F, Jin G, Zhu J. FastGCN: a GPU accelerated tool for fast gene co-expression networks. PLoS One 2015; 10:e0116776. [PMID: 25602758 PMCID: PMC4300192 DOI: 10.1371/journal.pone.0116776] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Accepted: 12/08/2014] [Indexed: 01/31/2023] Open
Abstract
Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.
Collapse
Affiliation(s)
- Meimei Liang
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, RP China, 310058
| | - Futao Zhang
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, RP China, 310058
| | - Gulei Jin
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, RP China, 310058
| | - Jun Zhu
- Institute of Bioinformatics, Zhejiang University, Hangzhou, Zhejiang, RP China, 310058
- * E-mail:
| |
Collapse
|