251
|
Hua GJ, Hung CL, Lin CY, Wu FC, Chan YW, Tang CY. MGUPGMA: A Fast UPGMA Algorithm With Multiple Graphics Processing Units Using NCCL. Evol Bioinform Online 2017; 13:1176934317734220. [PMID: 29051701 PMCID: PMC5637958 DOI: 10.1177/1176934317734220] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 09/06/2017] [Indexed: 11/15/2022] Open
Abstract
A phylogenetic tree is a visual diagram of the relationship between a set of biological species. The scientists usually use it to analyze many characteristics of the species. The distance-matrix methods, such as Unweighted Pair Group Method with Arithmetic Mean and Neighbor Joining, construct a phylogenetic tree by calculating pairwise genetic distances between taxa. These methods have the computational performance issue. Although several new methods with high-performance hardware and frameworks have been proposed, the issue still exists. In this work, a novel parallel Unweighted Pair Group Method with Arithmetic Mean approach on multiple Graphics Processing Units is proposed to construct a phylogenetic tree from extremely large set of sequences. The experimental results present that the proposed approach on a DGX-1 server with 8 NVIDIA P100 graphic cards achieves approximately 3-fold to 7-fold speedup over the implementation of Unweighted Pair Group Method with Arithmetic Mean on a modern CPU and a single GPU, respectively.
Collapse
Affiliation(s)
- Guan-Jie Hua
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan
| | - Che-Lun Hung
- Big Data Research Center, Department of Computer Science and Communication Engineering, Providence University, Taichung, Taiwan
| | - Chun-Yuan Lin
- Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan, Taiwan
| | - Fu-Che Wu
- Department of Computer Science and Communication Engineering, Providence University, Taichung, Taiwan
| | - Yu-Wei Chan
- College of Computing and Informatics, Providence University, Taichung, Taiwan
| | - Chuan Yi Tang
- Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan.,Department of Computer Science and Information Engineering, Providence University, Taichung, Taiwan
| |
Collapse
|
252
|
|
253
|
Srinivasan B, Tonddast-Navaei S, Skolnick J. Pocket detection and interaction-weighted ligand-similarity search yields novel high-affinity binders for Myocilin-OLF, a protein implicated in glaucoma. Bioorg Med Chem Lett 2017; 27:4133-4139. [PMID: 28739043 DOI: 10.1016/j.bmcl.2017.07.035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Revised: 07/10/2017] [Accepted: 07/11/2017] [Indexed: 10/19/2022]
Abstract
Traditional structure and ligand based virtual screening approaches rely on the availability of structural and ligand binding information. To overcome this limitation, hybrid approaches were developed that relied on extraction of ligand binding information from proteins sharing similar folds and hence, evolutionarily relationship. However, they cannot target a chosen pocket in a protein. To address this, a pocket centric virtual ligand screening approach is required. Here, we employ a new, iterative implementation of a pocket and ligand-similarity based approach to virtual ligand screening to predict small molecule binders for the olfactomedin domain of human myocilin implicated in glaucoma. Small-molecule binders of the protein might prevent the aggregation of the protein, commonly seen during glaucoma. First round experimental assessment of the predictions using differential scanning fluorimetry with myoc-OLF yielded 7 hits with a success rate of 12.7%; the best hit had an apparent dissociation constant of 99nM. By matching to the key functional groups of the best ligand that were likely involved in binding, the affinity of the best hit was improved by almost 10,000 fold from the high nanomolar to the low picomolar range. Thus, this study provides preliminary validation of the methodology on a medically important glaucoma associated protein.
Collapse
Affiliation(s)
- Bharath Srinivasan
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 950, Atlantic Drive, Atlanta, GA 30332, United States
| | - Sam Tonddast-Navaei
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 950, Atlantic Drive, Atlanta, GA 30332, United States
| | - Jeffrey Skolnick
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 950, Atlantic Drive, Atlanta, GA 30332, United States.
| |
Collapse
|
254
|
Ziemska J, Solecka J, Jarończyk M. QSAR, docking studies and toxicology prediction of isoquinoline derivatives as leucine aminopeptidase inhibitors. CHEMICAL PAPERS 2017. [DOI: 10.1007/s11696-017-0251-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
255
|
Wei JD, Cheng HJ, Lin CY, Ye J, Yeh KY. Embedded-Based Graphics Processing Unit Cluster Platform for Multiple Sequence Alignments. Evol Bioinform Online 2017; 13:1176934317724764. [PMID: 28835734 PMCID: PMC5555494 DOI: 10.1177/1176934317724764] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 07/12/2017] [Indexed: 11/20/2022] Open
Abstract
High-end graphics processing units (GPUs), such as NVIDIA Tesla/Fermi/Kepler series cards with thousands of cores per chip, are widely applied to high-performance computing fields in a decade. These desktop GPU cards should be installed in personal computers/servers with desktop CPUs, and the cost and power consumption of constructing a GPU cluster platform are very high. In recent years, NVIDIA releases an embedded board, called Jetson Tegra K1 (TK1), which contains 4 ARM Cortex-A15 CPUs and 192 Compute Unified Device Architecture cores (belong to Kepler GPUs). Jetson Tegra K1 has several advantages, such as the low cost, low power consumption, and high applicability, and it has been applied into several specific applications. In our previous work, a bioinformatics platform with a single TK1 (STK platform) was constructed, and this previous work is also used to prove that the Web and mobile services can be implemented in the STK platform with a good cost-performance ratio by comparing a STK platform with the desktop CPU and GPU. In this work, an embedded-based GPU cluster platform will be constructed with multiple TK1s (MTK platform). Complex system installation and setup are necessary procedures at first. Then, 2 job assignment modes are designed for the MTK platform to provide services for users. Finally, ClustalW v2.0.11 and ClustalWtk will be ported to the MTK platform. The experimental results showed that the speedup ratios achieved 5.5 and 4.8 times for ClustalW v2.0.11 and ClustalWtk, respectively, by comparing 6 TK1s with a single TK1. The MTK platform is proven to be useful for multiple sequence alignments.
Collapse
Affiliation(s)
- Jyh-Da Wei
- Department of Computer Science and Information Engineering, School of Electrical and Computer Engineering, College of Engineering, Chang Gung University, Taoyuan, Taiwan.,Department of Ophthalmology, Chang Gung Memorial Hospital, Keelung, Taiwan
| | - Hui-Jun Cheng
- Department of Computer Science and Information Engineering, School of Electrical and Computer Engineering, College of Engineering, Chang Gung University, Taoyuan, Taiwan
| | - Chun-Yuan Lin
- Department of Computer Science and Information Engineering, School of Electrical and Computer Engineering, College of Engineering, Chang Gung University, Taoyuan, Taiwan
| | - Jin Ye
- Department of Computer Science and Information Engineering, School of Electrical and Computer Engineering, College of Engineering, Chang Gung University, Taoyuan, Taiwan
| | - Kuan-Yu Yeh
- Department of Computer Science and Information Engineering, School of Electrical and Computer Engineering, College of Engineering, Chang Gung University, Taoyuan, Taiwan
| |
Collapse
|
256
|
Yin Z, Lan H, Tan G, Lu M, Vasilakos AV, Liu W. Computing Platforms for Big Biological Data Analytics: Perspectives and Challenges. Comput Struct Biotechnol J 2017; 15:403-411. [PMID: 28883909 PMCID: PMC5581845 DOI: 10.1016/j.csbj.2017.07.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2017] [Revised: 06/30/2017] [Accepted: 07/28/2017] [Indexed: 12/25/2022] Open
Abstract
The last decade has witnessed an explosion in the amount of available biological sequence data, due to the rapid progress of high-throughput sequencing projects. However, the biological data amount is becoming so great that traditional data analysis platforms and methods can no longer meet the need to rapidly perform data analysis tasks in life sciences. As a result, both biologists and computer scientists are facing the challenge of gaining a profound insight into the deepest biological functions from big biological data. This in turn requires massive computational resources. Therefore, high performance computing (HPC) platforms are highly needed as well as efficient and scalable algorithms that can take advantage of these platforms. In this paper, we survey the state-of-the-art HPC platforms for big biological data analytics. We first list the characteristics of big biological data and popular computing platforms. Then we provide a taxonomy of different biological data analysis applications and a survey of the way they have been mapped onto various computing platforms. After that, we present a case study to compare the efficiency of different computing platforms for handling the classical biological sequence alignment problem. At last we discuss the open issues in big biological data analytics.
Collapse
Affiliation(s)
- Zekun Yin
- Shandong University, Jinan, Shandong, China
| | | | - Guangming Tan
- Institute of Computing Technology, Chinese Academy of Sciences, China
| | - Mian Lu
- Huawei Singapore Research Centre, Singapore
| | - Athanasios V Vasilakos
- Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, Skellefteå SE-931 87, Sweden
| | - Weiguo Liu
- Shandong University, Jinan, Shandong, China
| |
Collapse
|
257
|
|
258
|
Koulayev S, Simeonova E, Skipper N. Can Physicians Affect Patient Adherence With Medication? HEALTH ECONOMICS 2017; 26:779-794. [PMID: 27311330 DOI: 10.1002/hec.3357] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2015] [Revised: 02/08/2016] [Accepted: 04/05/2016] [Indexed: 06/06/2023]
Abstract
Non-compliance with medication therapy remains an unsolved and expensive problem for healthcare systems around the world, yet we know little about the factors that affect a patient's decision to follow treatment recommendations. In particular, there is little evidence on the extent to which doctors can influence patient adherence behavior. This study uses a unique panel dataset comprising all prescription drug users, physicians, and all prescription drug sales in Denmark over 7 years to analyze the contributions of doctor-specific, patient-specific, and drug-specific factors to the adherence decision. We find that physicians exert substantial influence on patient compliance. Further, the quality of the match between a doctor and a patient accounts for a substantial portion of the variation in adherence outcomes. This suggests that the sorting of patients across doctors is an important mechanism that affects patient adherence beyond the effects of individual patient-specific and physician-specific factors. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
| | | | - Niels Skipper
- Department of Economics and Business Economics, Aarhus University, Aarhus V, Denmark
| |
Collapse
|
259
|
Jiang X, Zhang H, Quan X, Liu Z, Yin Y. Disease-related gene module detection based on a multi-label propagation clustering algorithm. PLoS One 2017; 12:e0178006. [PMID: 28542379 PMCID: PMC5438150 DOI: 10.1371/journal.pone.0178006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2016] [Accepted: 05/06/2017] [Indexed: 01/11/2023] Open
Abstract
Detecting disease-related gene modules by analyzing gene expression data is of great significance. It is helpful for exploratory analysis of the interaction mechanisms of genes under complex disease phenotypes. The multi-label propagation algorithm (MLPA) has been widely used in module detection for its fast and easy implementation. The accuracy of MLPA greatly depends on the connections between nodes, and most existing research focuses on measuring the similarity between nodes. However, MLPA does not perform well with loose connections between disease-related genes. Moreover, the biological significance of modules obtained by MLPA has not been demonstrated. To solve these problems, we designed a double label propagation clustering algorithm (DLPCA) based on MLPA to study Huntington's disease. In DLPCA, in addition to category labels, we introduced pathogenic labels to supervise the process of multi-label propagation clustering. The pathogenic labels contain pathogenic information about disease genes and the hierarchical structure of gene expression data. Experimental results demonstrated the superior performance of DLPCA compared with other conventional gene-clustering algorithms.
Collapse
Affiliation(s)
- Xue Jiang
- College of Computer and Control Engineering, Nankai University, Tianjin 300350, China
- Tianjin Key Laboratory of Intelligent Robotics, Nankai University, Tianjin 300350, China
| | - Han Zhang
- College of Computer and Control Engineering, Nankai University, Tianjin 300350, China
- Tianjin Key Laboratory of Intelligent Robotics, Nankai University, Tianjin 300350, China
| | - Xiongwen Quan
- College of Computer and Control Engineering, Nankai University, Tianjin 300350, China
- Tianjin Key Laboratory of Intelligent Robotics, Nankai University, Tianjin 300350, China
| | - Zhandong Liu
- Department of Pediatrics-Neurology, Baylor College of Medicine, Houston, TX 77030, United States of America
| | - Yanbin Yin
- Department of Biological Sciences, Northern Illinois University, DeKalb, IL 60115, United States of America
| |
Collapse
|
260
|
A General-Purpose Graphics Processing Unit (GPGPU)-Accelerated Robotic Controller Using a Low Power Mobile Platform. JOURNAL OF LOW POWER ELECTRONICS AND APPLICATIONS 2017. [DOI: 10.3390/jlpea7020010] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
261
|
FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array. Interdiscip Sci 2017; 10:176-188. [PMID: 28432608 DOI: 10.1007/s12539-017-0225-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Revised: 02/21/2017] [Accepted: 03/09/2017] [Indexed: 10/19/2022]
Abstract
The Smith-Waterman (SW) algorithm based on dynamic programming is a well-known classical method for high precision sequence matching and has become the gold standard to evaluate sequence alignment software. In this paper, we propose fine-grained parallelized SW algorithms using affine gap penalty and implement a parallel computing structures to accelerating the SW with backtracking on FPGA platform. We analysis the dynamic parallel computing features of anti-diagonal elements and storage expansion problem resulting from backtracking stage, and propose a series of optimization strategies to eliminate data dependency, reduce storage requirements, and overlap memory access latency. Our implementation is capable of supporting multi-type, large-scale biological sequence alignment applications. We obtain a speedup between 3.6 and 25.2 over the typical SW algorithm running on a general-purpose computer configured with an Intel Core i5 3.2 GHz CPU. Moreover, our work is superior to other FPGA implementations in both array size and clock frequency, and the experiment results show that it can get a performance closed to that of the latest GPU implementation, but the power consumption is only about 26% of that of the GPU platforms.
Collapse
|
262
|
Chu H, Sun P, Yin J, Liu G, Wang Y, Zhao P, Zhu Y, Yang X, Zheng T, Zhou X, Jin W, Sun C. Integrated network analysis reveals potentially novel molecular mechanisms and therapeutic targets of refractory epilepsies. PLoS One 2017; 12:e0174964. [PMID: 28388656 PMCID: PMC5384674 DOI: 10.1371/journal.pone.0174964] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2016] [Accepted: 03/19/2017] [Indexed: 01/03/2023] Open
Abstract
Epilepsy is a complex neurological disorder and a significant health problem. The pathogenesis of epilepsy remains obscure in a significant number of patients and the current treatment options are not adequate in about a third of individuals which were known as refractory epilepsies (RE). Network medicine provides an effective approach for studying the molecular mechanisms underlying complex diseases. Here we integrated 1876 disease-gene associations of RE and located those genes to human protein-protein interaction (PPI) network to obtain 42 significant RE-associated disease modules. The functional analysis of these disease modules showed novel molecular pathological mechanisms of RE, such as the novel enriched pathways (e.g., "presynaptic nicotinic acetylcholine receptors", "signaling by insulin receptor"). Further analysis on the relationships between current drug targets and the RE-related disease genes showed the rational mechanisms of most antiepileptic drugs. In addition, we detected ten potential novel drug targets (e.g., KCNA1, KCNA4-6, KCNC3, KCND2, KCNMA1, CAMK2G, CACNB4 and GRM1) located in three RE related disease modules, which might provide novel insights into the new drug discovery for RE therapy.
Collapse
Affiliation(s)
- Hongwei Chu
- Department of Biomedical Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Pin Sun
- Shanghai Medical College, Fudan University, Shanghai, China
| | - Jiahui Yin
- College of Electronics and Information Engineering, Tongji University, Shanghai, China
| | - Guangming Liu
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| | - Yiwei Wang
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Pengyao Zhao
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| | - Yizhun Zhu
- Department of Pharmacology, School of Pharmacy, Fudan University, Shanghai, China
| | - Xiaohan Yang
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Tiezheng Zheng
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
| | - Xuezhong Zhou
- School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| | - Weilin Jin
- Institute of Nano Biomedicine and Engineering, Department of Instrument Science and Engineering, Key Lab. for Thin Film and Microfabrication Technology of Ministry of Education, School of Electronic Information and Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Changkai Sun
- Department of Biomedical Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China
- Liaoning Provincial Key Laboratory of Cerebral Diseases, Institute for Brain Disorders, Dalian Medical University, Dalian, China
- Research Center for the Control Engineering of Translational Precision Medicine, Dalian University of Technology, Dalian, China
- State Key Laboratory of Fine Chemicals, Dalian R&D Center for Stem Cell and Tissue Engineering, Dalian University of Technology, Dalian, China
| |
Collapse
|
263
|
Hung CL, Lin CY, Wu PC. An Efficient GPU-Based Multiple Pattern Matching Algorithm for Packet Filtering. JOURNAL OF SIGNAL PROCESSING SYSTEMS 2017; 86:347-358. [DOI: 10.1007/s11265-016-1139-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2025]
|
264
|
Kečo D, Subasi A, Kevric J. Cloud computing-based parallel genetic algorithm for gene selection in cancer classification. Neural Comput Appl 2016. [DOI: 10.1007/s00521-016-2780-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
265
|
Genomic analyses of multidrug resistant Pseudomonas aeruginosa PA1 resequenced by single-molecule real-time sequencing. Biosci Rep 2016; 36:BSR20160282. [PMID: 27765811 PMCID: PMC5293553 DOI: 10.1042/bsr20160282] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 10/17/2016] [Accepted: 10/20/2016] [Indexed: 11/17/2022] Open
Abstract
As a third-generation sequencing (TGS) method, single-molecule real-time (SMRT) technology provides long read length, and it is well suited for resequencing projects and de novo assembly. In the present study, Pseudomonas aeruginosa PA1 was characterized and resequenced using SMRT technology. PA1 was also subjected to genomic, comparative and pan-genomic analyses. The multidrug resistant strain PA1 possesses a 6,498,072 bp genome and a sequence type of ST-782. The genome of PA1 was also visualized, and the results revealed the details of general genome annotations, virulence factors, regulatory proteins (RPs), secretion system proteins, type II toxin–antitoxin (T–A) pairs and genomic islands. Whole genome comparison analysis suggested that PA1 exhibits similarity to other P. aeruginosa strains but differs in terms of horizontal gene transfer (HGT) regions, such as prophages and genomic islands. Phylogenetic analyses based on 16S rRNA sequences demonstrated that PA1 is closely related to PAO1, and P. aeruginosa strains can be divided into two main groups. The pan-genome of P. aeruginosa consists of a core genome of approximately 4,000 genes and an accessory genome of at least 6,600 genes. The present study presented a detailed, visualized and comparative analysis of the PA1 genome, to enhance our understanding of this notorious pathogen.
Collapse
|
266
|
Hung C, Lin C, Ou C, Tseng Y, Hung P, Li S, Fu C. Efficient bit‐parallel subcircuit extraction using CUDA. CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE 2016; 28:4326-4338. [DOI: 10.1002/cpe.3732] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2025]
Abstract
SummaryWafer processing technology has been improving rapidly. Moore's law has been exceeded as the number of transistors in a dense integrated circuit, now increases threefold or more, approximately every year. The integrated circuit has gone from very large scale to giga large scale. The extraction of subcircuits has therefore become computation‐intensive. In this paper, we propose an efficient bit‐parallel subcircuit extraction algorithm using graphic processing units. We conducted experimental trials and demonstrated that the proposed algorithm can achieve high throughput, suggesting practical applications in the extraction of subcircuits. Copyright © 2015 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Che‐Lun Hung
- Department of Computer Science and Communication Engineering Providence University No. 200, Sec. 7, Taiwan Boulevard, Shalu District Taichung City 43301 Taiwan
| | - Chun‐Yuan Lin
- Department of Computer Science and Information Engineering Chang Gung University No. 259, Sanmin Rd., Guishan Township Taoyuan City 33302 Taiwan
| | - Chia‐Shin Ou
- Department of Computer Science National Tsing Hua University Hsinchu Taiwan
| | - Yuan‐Hong Tseng
- Department of Computer Science and Communication Engineering Providence University No. 200, Sec. 7, Taiwan Boulevard, Shalu District Taichung City 43301 Taiwan
| | - Po‐Yen Hung
- Department of Computer Science and Communication Engineering Providence University No. 200, Sec. 7, Taiwan Boulevard, Shalu District Taichung City 43301 Taiwan
| | - Ship‐Peng Li
- Department of Computer Science and Communication Engineering Providence University No. 200, Sec. 7, Taiwan Boulevard, Shalu District Taichung City 43301 Taiwan
| | - Chun‐Ting Fu
- Department of Computer Science & Information Management Providence University No. 200, Sec. 7, Taiwan Boulevard, Shalu District Taichung City 43301 Taiwan
| |
Collapse
|
267
|
Molecular modelling studies of 3,5-dipyridyl-1,2,4-triazole derivatives as xanthine oxidoreductase inhibitors using 3D-QSAR, Topomer CoMFA, molecular docking and molecular dynamic simulations. J Taiwan Inst Chem Eng 2016. [DOI: 10.1016/j.jtice.2016.09.018] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
268
|
Magoulès F, Hung C, Jiang H, Hu J. Embedded multicore computing and applications. CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE 2016; 28:4211-4214. [DOI: 10.1002/cpe.3910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2025]
Affiliation(s)
| | | | - Hai Jiang
- Arkansas State University Jonesboro USA
| | - Jia Hu
- Liverpool Hope University Liverpool UK
| |
Collapse
|
269
|
Sneha P, Doss CGP. Gliptins in managing diabetes - Reviewing computational strategy. Life Sci 2016; 166:108-120. [PMID: 27744054 DOI: 10.1016/j.lfs.2016.10.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Revised: 10/05/2016] [Accepted: 10/11/2016] [Indexed: 12/12/2022]
Abstract
The pace of anti-diabetic drug discovery is very slow in spite of increasing rate of prevalence of Type 2 Diabetes which remains a major public health concern. Though extensive research steps are taken in the past decade, yet craves for better new treatment strategies to overcome the current scenario. One such general finding is the evolution of gliptins which discriminately inhibits DPP4 (Dipeptidyl peptidase-4) enzyme. Although the mechanism of action of gliptin is highly target oriented and accurate, still its long-term use stands unknown. This step calls for a fast, flexible, and cost-effective strategies to meet the demands of producing arrays of high-content lead compounds with improved efficiency for better clinical success. The present review highlights the available gliptins in the market and also other naturally occurring DPP4 enzyme inhibitors. Along with describing the known inhibitors and their origin in this review, we attempted to identify a probable new lead compounds using advanced computational techniques. In this context, computational methods that integrate the knowledge of proteins and drug responses were utilized in prioritizing targets and designing drugs towards clinical trials with better efficacy. The compounds obtained as a result of virtual screening were compared with the commercially available gliptin in the market to have better efficiency in the identification and validation of the potential DPP4 inhibitors. The combinatorial computational methods used in the present study identified Compound 1: 25022354 as promising inhibitor.
Collapse
Affiliation(s)
- P Sneha
- School of Biosciences and Technology, VIT University, Vellore, Tamil Nadu 632014, India
| | - C George Priya Doss
- School of Biosciences and Technology, VIT University, Vellore, Tamil Nadu 632014, India.
| |
Collapse
|
270
|
Shen M, Le S, Jin X, Li G, Tan Y, Li M, Zhao X, Shen W, Yang Y, Wang J, Zhu H, Li S, Rao X, Hu F, Lu S. Characterization and Comparative Genomic Analyses of Pseudomonas aeruginosa Phage PaoP5: New Members Assigned to PAK_P1-like Viruses. Sci Rep 2016; 6:34067. [PMID: 27659070 PMCID: PMC5034324 DOI: 10.1038/srep34067] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 09/07/2016] [Indexed: 11/10/2022] Open
Abstract
As a potential alternative to antibiotics, phages can be used to treat multi-drug resistant bacteria. As such, the biological characteristics of phages should be investigated to utilize them as effective antimicrobial agents. In this study, phage PaoP5, a lytic virus that infects Pseudomonas aeruginosa PAO1, was isolated and genomically characterized. PaoP5 comprises an icosahedral head with an apex diameter of 69 nm and a contractile tail with a length of 120 nm. The PaoP5 genome is a linear dsDNA molecule containing 93,464 base pairs (bp) with 49.51% G + C content of 11 tRNA genes and a 1,200 bp terminal redundancy. A total of 176 protein-coding genes were predicted in the PaoP5 genome. Nine PaoP5 structural proteins were identified. Three hypothetical proteins were determined as structural. Comparative genomic analyses revealed that seven new Pseudomonas phages, namely, PaoP5, K8, C11, vB_PaeM_C2-10_Ab02, vB_PaeM_C2-10_Ab08, vB_PaeM_C2-10_Ab10, and vB_PaeM_C2-10_Ab15, were similar to PAK_P1-like viruses. Phylogenetic and pan-genome analyses suggested that the new phages should be assigned to PAK_P1-like viruses, which possess approximately 100 core genes and 150 accessory genes. This work presents a detailed and comparative analysis of PaoP5 to enhance our understanding of phage biology.
Collapse
Affiliation(s)
- Mengyu Shen
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Shuai Le
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Xiaolin Jin
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Gang Li
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Yinling Tan
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Ming Li
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Xia Zhao
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Wei Shen
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Yuhui Yang
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Jing Wang
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Hongbin Zhu
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Shu Li
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Xiancai Rao
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Fuquan Hu
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| | - Shuguang Lu
- Department of Microbiology, College of Basic Medical Science, Third Military Medical University, Chongqing, 400038, China
| |
Collapse
|
271
|
Suzuki S, Kakuta M, Ishida T, Akiyama Y. GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering. PLoS One 2016; 11:e0157338. [PMID: 27482905 PMCID: PMC4970815 DOI: 10.1371/journal.pone.0157338] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Accepted: 05/27/2016] [Indexed: 11/30/2022] Open
Abstract
Sequence homology searches are used in various fields and require large amounts of computation time, especially for metagenomic analysis, owing to the large number of queries and the database size. To accelerate computing analyses, graphics processing units (GPUs) are widely used as a low-cost, high-performance computing platform. Therefore, we mapped the time-consuming steps involved in GHOSTZ, which is a state-of-the-art homology search algorithm for protein sequences, onto a GPU and implemented it as GHOSTZ-GPU. In addition, we optimized memory access for GPU calculations and for communication between the CPU and GPU. As per results of the evaluation test involving metagenomic data, GHOSTZ-GPU with 12 CPU threads and 1 GPU was approximately 3.0- to 4.1-fold faster than GHOSTZ with 12 CPU threads. Moreover, GHOSTZ-GPU with 12 CPU threads and 3 GPUs was approximately 5.8- to 7.7-fold faster than GHOSTZ with 12 CPU threads.
Collapse
Affiliation(s)
- Shuji Suzuki
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- Education Academy of Computational Life Sciences (ACLS), Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
| | - Masanori Kakuta
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
| | - Takashi Ishida
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- Education Academy of Computational Life Sciences (ACLS), Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
| | - Yutaka Akiyama
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- Education Academy of Computational Life Sciences (ACLS), Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- * E-mail:
| |
Collapse
|
272
|
Scalable metagenomics alignment research tool (SMART): a scalable, rapid, and complete search heuristic for the classification of metagenomic sequences from complex sequence populations. BMC Bioinformatics 2016; 17:292. [PMID: 27465705 PMCID: PMC4963998 DOI: 10.1186/s12859-016-1159-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2015] [Accepted: 07/21/2016] [Indexed: 11/24/2022] Open
Abstract
Background Next generation sequencing technology has enabled characterization of metagenomics through massively parallel genomic DNA sequencing. The complexity and diversity of environmental samples such as the human gut microflora, combined with the sustained exponential growth in sequencing capacity, has led to the challenge of identifying microbial organisms by DNA sequence. We sought to validate a Scalable Metagenomics Alignment Research Tool (SMART), a novel searching heuristic for shotgun metagenomics sequencing results. Results After retrieving all genomic DNA sequences from the NCBI GenBank, over 1 × 1011 base pairs of 3.3 × 106 sequences from 9.25 × 105 species were indexed using 4 base pair hashtable shards. A MapReduce searching strategy was used to distribute the search workload in a computing cluster environment. In addition, a one base pair permutation algorithm was used to account for single nucleotide polymorphisms and sequencing errors. Simulated datasets used to evaluate Kraken, a similar metagenomics classification tool, were used to measure and compare precision and accuracy. Finally using a same set of training sequences we compared Kraken, CLARK, and SMART within the same computing environment. Utilizing 12 computational nodes, we completed the classification of all datasets in under 10 min each using exact matching with an average throughput of over 1.95 × 106 reads classified per minute. With permutation matching, we achieved sensitivity greater than 83 % and precision greater than 94 % with simulated datasets at the species classification level. We demonstrated the application of this technique applied to conjunctival and gut microbiome metagenomics sequencing results. In our head to head comparison, SMART and CLARK had similar accuracy gains over Kraken at the species classification level, but SMART required approximately half the amount of RAM of CLARK. Conclusions SMART is the first scalable, efficient, and rapid metagenomics classification algorithm capable of matching against all the species and sequences present in the NCBI GenBank and allows for a single step classification of microorganisms as well as large plant, mammalian, or invertebrate genomes from which the metagenomic sample may have been derived.
Collapse
|
273
|
Guryanov I, Fiorucci S, Tennikova T. Receptor-ligand interactions: Advanced biomedical applications. MATERIALS SCIENCE & ENGINEERING. C, MATERIALS FOR BIOLOGICAL APPLICATIONS 2016; 68:890-903. [PMID: 27524092 DOI: 10.1016/j.msec.2016.07.072] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Revised: 07/11/2016] [Accepted: 07/26/2016] [Indexed: 12/24/2022]
Abstract
Receptor-ligand interactions (RLIs) are at the base of all biological events occurring in living cells. The understanding of interactions between complementary macromolecules in biological systems represents a high-priority research area in bionanotechnology to design the artificial systems mimicking natural processes. This review summarizes and analyzes RLIs in some cutting-edge biomedical fields, in particular, for the preparation of novel stationary phases to separate complex biological mixtures in medical diagnostics, for the design of ultrasensitive biosensors for identification of biomarkers of various diseases at early stages, as well as in the development of innovative biomaterials and approaches for regenerative medicine. All these biotechnological fields are closely related, because their success depends on a proper choice, combination and spatial disposition of the single components of ligand-receptor pairs on the surface of appropriately designed support.
Collapse
Affiliation(s)
- Ivan Guryanov
- Institute of Chemistry, St. Petersburg State University, 198504 St. Petersburg, Russia.
| | - Stefano Fiorucci
- Department of Clinical and Experimental Medicine, University of Perugia, 06122 Perugia, Italy.
| | - Tatiana Tennikova
- Institute of Chemistry, St. Petersburg State University, 198504 St. Petersburg, Russia.
| |
Collapse
|
274
|
Using Genome-Wide SNP Discovery and Genotyping to Reveal the Main Source of Population Differentiation in Nothofagus dombeyi (Mirb.) Oerst. in Chile. Int J Genomics 2016; 2016:3654093. [PMID: 27446942 PMCID: PMC4944027 DOI: 10.1155/2016/3654093] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 05/23/2016] [Indexed: 11/17/2022] Open
Abstract
Within a woody plant species, environmental heterogeneity has the potential to influence the distribution of genetic variation among populations through several evolutionary processes. In some species, a relationship between environmental characteristics and the distribution of genotypes can be detected, showing the importance of natural selection as the main source of differentiation. Nothofagus dombeyi (Mirb.) Oerst. (Nothofagaceae) is an endemic tree species occurring both in Chile and in Argentina temperate forests. Postglacial history has been studied with chloroplast DNA and evolutionary forces shaping genetic variation patterns have been analysed with isozymes but fine-scale genetic diversity studies are needed. The study of demographic and selection histories in Nothofagus dombeyi requires more informative markers such as single nucleotide polymorphisms (SNP). Genotyping-by-Sequencing tools now allow studying thousands of SNP markers at reasonable prices in nonmodel species. We investigated more than 10 K SNP loci for signatures of local adaptation and showed that interrogation of genomic resources can identify shifts in genetic diversity and putative adaptive signals in this nonmodel woody species.
Collapse
|
275
|
Lan H, Chan Y, Xu K, Schmidt B, Peng S, Liu W. Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters. BMC Bioinformatics 2016; 17 Suppl 9:267. [PMID: 27455061 PMCID: PMC4959381 DOI: 10.1186/s12859-016-1128-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Computing alignments between two or more sequences are common operations frequently performed in computational molecular biology. The continuing growth of biological sequence databases establishes the need for their efficient parallel implementation on modern accelerators. RESULTS This paper presents new approaches to high performance biological sequence database scanning with the Smith-Waterman algorithm and the first stage of progressive multiple sequence alignment based on the ClustalW heuristic on a Xeon Phi-based compute cluster. Our approach uses a three-level parallelization scheme to take full advantage of the compute power available on this type of architecture; i.e. cluster-level data parallelism, thread-level coarse-grained parallelism, and vector-level fine-grained parallelism. Furthermore, we re-organize the sequence datasets and use Xeon Phi shuffle operations to improve I/O efficiency. CONCLUSIONS Evaluations show that our method achieves a peak overall performance up to 220 GCUPS for scanning real protein sequence databanks on a single node consisting of two Intel E5-2620 CPUs and two Intel Xeon Phi 7110P cards. It also exhibits good scalability in terms of sequence length and size, and number of compute nodes for both database scanning and multiple sequence alignment. Furthermore, the achieved performance is highly competitive in comparison to optimized Xeon Phi and GPU implementations. Our implementation is available at https://github.com/turbo0628/LSDBS-mpi .
Collapse
Affiliation(s)
- Haidong Lan
- School of Computer Science and Technology, Shandong University, Shunhua Road 1500, Jinan, Shandong, China
| | - Yuandong Chan
- School of Computer Science and Technology, Shandong University, Shunhua Road 1500, Jinan, Shandong, China
| | - Kai Xu
- School of Computer Science and Technology, Shandong University, Shunhua Road 1500, Jinan, Shandong, China
| | | | - Shaoliang Peng
- School of Computer Science, National University of Defense Technology, Changsha, Hunan, China
| | - Weiguo Liu
- School of Computer Science and Technology, Shandong University, Shunhua Road 1500, Jinan, Shandong, China.
| |
Collapse
|
276
|
HDInsight4PSi: Boosting performance of 3D protein structure similarity searching with HDInsight clusters in Microsoft Azure cloud. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.02.029] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
277
|
Jiang H, Ganesan N. CUDAMPF: a multi-tiered parallel framework for accelerating protein sequence search in HMMER on CUDA-enabled GPU. BMC Bioinformatics 2016; 17:106. [PMID: 26920848 PMCID: PMC4769571 DOI: 10.1186/s12859-016-0946-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 02/15/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND HMMER software suite is widely used for analysis of homologous protein and nucleotide sequences with high sensitivity. The latest version of hmmsearch in HMMER 3.x, utilizes heuristic-pipeline which consists of MSV/SSV (Multiple/Single ungapped Segment Viterbi) stage, P7Viterbi stage and the Forward scoring stage to accelerate homology detection. Since the latest version is highly optimized for performance on modern multi-core CPUs with SSE capabilities, only a few acceleration attempts report speedup. However, the most compute intensive tasks within the pipeline (viz., MSV/SSV and P7Viterbi stages) still stand to benefit from the computational capabilities of massively parallel processors. RESULTS A Multi-Tiered Parallel Framework (CUDAMPF) implemented on CUDA-enabled GPUs presented here, offers a finer-grained parallelism for MSV/SSV and Viterbi algorithms. We couple SIMT (Single Instruction Multiple Threads) mechanism with SIMD (Single Instructions Multiple Data) video instructions with warp-synchronism to achieve high-throughput processing and eliminate thread idling. We also propose a hardware-aware optimal allocation scheme of scarce resources like on-chip memory and caches in order to boost performance and scalability of CUDAMPF. In addition, runtime compilation via NVRTC available with CUDA 7.0 is incorporated into the presented framework that not only helps unroll innermost loop to yield upto 2 to 3-fold speedup than static compilation but also enables dynamic loading and switching of kernels depending on the query model size, in order to achieve optimal performance. CONCLUSIONS CUDAMPF is designed as a hardware-aware parallel framework for accelerating computational hotspots within the hmmsearch pipeline as well as other sequence alignment applications. It achieves significant speedup by exploiting hierarchical parallelism on single GPU and takes full advantage of limited resources based on their own performance features. In addition to exceeding performance of other acceleration attempts, comprehensive evaluations against high-end CPUs (Intel i5, i7 and Xeon) shows that CUDAMPF yields upto 440 GCUPS for SSV, 277 GCUPS for MSV and 14.3 GCUPS for P7Viterbi all with 100 % accuracy, which translates to a maximum speedup of 37.5, 23.1 and 11.6-fold for MSV, SSV and P7Viterbi respectively. The source code is available at https://github.com/Super-Hippo/CUDAMPF.
Collapse
Affiliation(s)
- Hanyu Jiang
- Department of Elec. and Comp. Engg, Stevens Institute of Technology, Hoboken, NJ, 07030, USA.
| | - Narayan Ganesan
- Department of Elec. and Comp. Engg, Stevens Institute of Technology, Hoboken, NJ, 07030, USA.
| |
Collapse
|
278
|
Lin YL, Chen CH, Wu HY, Tsai NM, Jian TY, Chang YC, Lin CH, Wu CH, Hsu FT, Leung TK, Liao KW. Inhibition of breast cancer with transdermal tamoxifen-encapsulated lipoplex. J Nanobiotechnology 2016; 14:11. [PMID: 26892504 PMCID: PMC4759757 DOI: 10.1186/s12951-016-0163-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Accepted: 02/08/2016] [Indexed: 12/21/2022] Open
Abstract
Background Tamoxifen is currently used for the treatment of both early and advanced estrogen receptor (ER) positive breast cancer in pre- and post-menopausal women. However, using tamoxifen routinely to inhibit endogenous or exogenous estrogen effects is occasionally difficult because of its potential side effects. Objectives The aim of this study is to design a local drug delivery system to encapsulate tamoxifen for observing their efficacy of skin penetration, drug accumulation and cancer therapy. Methods A cationic liposome-PEG-PEI complex (LPPC) was used as a carrier for the encapsulation of tamoxifen and forming ‘LPPC/TAM’ for transdermal release. The cytotoxicity of LPPC/TAM was analyzed by MTT. The skin penetration, tumor growth inhibition and organ damages were measured in xenograft mice following transdermal treatment. Results LPPC/TAM had an average size less than 270 nm and a zeta-potential of approximately 40 mV. LPPC/TAM displayed dramatically increased the cytotoxic activity in all breast cancer cells, especially in ER-positive breast cancer cells. In vivo, LPPC drug delivery helped the fluorescent dye penetrating across the skim and accumulating rapidly in tumor area.
Administration of LPPC/TAM by transdermal route inhibited about 86 % of tumor growth in mice bearing BT474 tumors. This local treatment of LPPC/TAM did not injury skin and any organs. Conclusion LPPC-delivery system provided a better skin penetration and drug accumulation and therapeutic efficacy. Therefore, LPPC/TAM drug delivery maybe a useful transdermal tool of drugs utilization for breast cancer therapy.
Collapse
Affiliation(s)
- Yu-Ling Lin
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, ROC. .,Center for Bioinformatics Research, National Chiao Tung University, Hsinchu, Taiwan, ROC.
| | - Chia-Hung Chen
- Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, Taiwan, ROC.
| | - Hsin-Yi Wu
- Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, Taiwan, ROC.
| | - Nu-Man Tsai
- School of Medical Laboratory and Biotechnology, Chung Shan Medical University, Taichung, Taiwan, ROC. .,Clinical Laboratory, Chung Shan Medical University Hospital, Taichung, Taiwan, ROC.
| | - Ting-Yan Jian
- Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, Taiwan, ROC.
| | - Yuan-Ching Chang
- Department of Surgery, MacKay Memorial Hospital, Taipei, Taiwan, ROC.
| | - Chi-Hsin Lin
- Department of Medical Research, MacKay Memorial Hospital, New Taipei City, Taiwan, ROC.
| | - Chih-Hsiung Wu
- Department of Surgery, En Chu Kong Hospital, New Taipei City, Taiwan, ROC.
| | - Fei-Ting Hsu
- Department of Medical Imaging, Taipei Medical University Hospital, Taipei, Taiwan, ROC. .,Translational Imaging Research Center, Taipei Medical University, Taipei, Taiwan, ROC.
| | - Ting Kai Leung
- Department of Diagnostic Radiology, Taipei Medical University Hospital, Taipei, Taiwan, ROC. .,Graduate Institute of Biomedical Materials and Tissue Engineering, College of Biomedical Engineering, Taipei Medical University, Taipei, Taiwan, ROC. .,Department of Diagnostic Radiology, Taipei Hospital, Ministry of Health and Welfare, Taipei, Taiwan, ROC. .,College of Science and Engineering, Fu Jen Catholic University, Hsinchuang, Taiwan, ROC.
| | - Kuang-Wen Liao
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan, ROC. .,Institute of Molecular Medicine and Bioengineering, National Chiao Tung University, Hsinchu, Taiwan, ROC. .,Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan, ROC.
| |
Collapse
|
279
|
McCarthy M, Prakash P, Gorfe AA. Computational allosteric ligand binding site identification on Ras proteins. Acta Biochim Biophys Sin (Shanghai) 2016; 48:3-10. [PMID: 26487442 DOI: 10.1093/abbs/gmv100] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 08/16/2015] [Indexed: 12/19/2022] Open
Abstract
A number of computational techniques have been proposed to expedite the process of allosteric ligand binding site identification in inherently flexible and hence challenging drug targets. Some of these techniques have been instrumental in the discovery of allosteric ligand binding sites on Ras proteins, a group of elusive anticancer drug targets. This review provides an overview of these techniques and their application to Ras proteins. A summary of molecular docking and binding site identification is provided first, followed by a more detailed discussion of two specific techniques for binding site identification in ensembles of Ras conformations generated by molecular simulations.
Collapse
Affiliation(s)
- Michael McCarthy
- Department of Integrative Biology and Pharmacology, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Priyanka Prakash
- Department of Integrative Biology and Pharmacology, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Alemayehu A Gorfe
- Department of Integrative Biology and Pharmacology, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| |
Collapse
|
280
|
Horlacher O, Lisacek F, Müller M. Mining Large Scale Tandem Mass Spectrometry Data for Protein Modifications Using Spectral Libraries. J Proteome Res 2015; 15:721-31. [PMID: 26653734 DOI: 10.1021/acs.jproteome.5b00877] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Experimental improvements in post-translational modification (PTM) detection by tandem mass spectrometry (MS/MS) has allowed the identification of vast numbers of PTMs. Open modification searches (OMSs) of MS/MS data, which do not require prior knowledge of the modifications present in the sample, further increased the diversity of detected PTMs. Despite much effort, there is still a lack of functional annotation of PTMs. One possibility to narrow the annotation gap is to mine MS/MS data deposited in public repositories and to correlate the PTM presence with biological meta-information attached to the data. Since the data volume can be quite substantial and contain tens of millions of MS/MS spectra, the data mining tools must be able to cope with big data. Here, we present two tools, Liberator and MzMod, which are built using the MzJava class library and the Apache Spark large scale computing framework. Liberator builds large MS/MS spectrum libraries, and MzMod searches them in an OMS mode. We applied these tools to a recently published set of 25 million spectra from 30 human tissues and present tissue specific PTMs. We also compared the results to the ones obtained with the OMS tool MODa and the search engine X!Tandem.
Collapse
Affiliation(s)
- Oliver Horlacher
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva 1211, Switzerland.,Centre Universitaire de Bioinformatique, University of Geneva , Geneva 1211, Switzerland
| | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva 1211, Switzerland.,Centre Universitaire de Bioinformatique, University of Geneva , Geneva 1211, Switzerland
| | - Markus Müller
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , Geneva 1211, Switzerland.,Centre Universitaire de Bioinformatique, University of Geneva , Geneva 1211, Switzerland
| |
Collapse
|
281
|
G2LC: Resources Autoscaling for Real Time Bioinformatics Applications in IaaS. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2015; 2015:549026. [PMID: 26504488 PMCID: PMC4609414 DOI: 10.1155/2015/549026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2015] [Revised: 06/22/2015] [Accepted: 06/23/2015] [Indexed: 11/17/2022]
Abstract
Cloud computing has started to change the way how bioinformatics research is being carried out. Researchers who have taken advantage of this technology can process larger amounts of data and speed up scientific discovery. The variability in data volume results in variable computing requirements. Therefore, bioinformatics researchers are pursuing more reliable and efficient methods for conducting sequencing analyses. This paper proposes an automated resource provisioning method, G2LC, for bioinformatics applications in IaaS. It enables application to output the results in a real time manner. Its main purpose is to guarantee applications performance, while improving resource utilization. Real sequence searching data of BLAST is used to evaluate the effectiveness of G2LC. Experimental results show that G2LC guarantees the application performance, while resource is saved up to 20.14%.
Collapse
|
282
|
Liu Y, Hong Y, Lin CY, Hung CL. Accelerating Smith-Waterman Alignment for Protein Database Search Using Frequency Distance Filtration Scheme Based on CPU-GPU Collaborative System. Int J Genomics 2015; 2015:761063. [PMID: 26568953 PMCID: PMC4629039 DOI: 10.1155/2015/761063] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Revised: 08/18/2015] [Accepted: 08/26/2015] [Indexed: 11/30/2022] Open
Abstract
The Smith-Waterman (SW) algorithm has been widely utilized for searching biological sequence databases in bioinformatics. Recently, several works have adopted the graphic card with Graphic Processing Units (GPUs) and their associated CUDA model to enhance the performance of SW computations. However, these works mainly focused on the protein database search by using the intertask parallelization technique, and only using the GPU capability to do the SW computations one by one. Hence, in this paper, we will propose an efficient SW alignment method, called CUDA-SWfr, for the protein database search by using the intratask parallelization technique based on a CPU-GPU collaborative system. Before doing the SW computations on GPU, a procedure is applied on CPU by using the frequency distance filtration scheme (FDFS) to eliminate the unnecessary alignments. The experimental results indicate that CUDA-SWfr runs 9.6 times and 96 times faster than the CPU-based SW method without and with FDFS, respectively.
Collapse
Affiliation(s)
- Yu Liu
- School of Electronic Information Engineering, Tianjin University, Tianjin 300072, China
| | - Yang Hong
- School of Electronic Information Engineering, Tianjin University, Tianjin 300072, China
| | - Chun-Yuan Lin
- Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan 33302, Taiwan
| | - Che-Lun Hung
- Department of Computer Science and Communication Engineering, Providence University, Taichung 43301, Taiwan
| |
Collapse
|
283
|
Evaluating the Cassandra NoSQL Database Approach for Genomic Data Persistency. Int J Genomics 2015; 2015:502795. [PMID: 26558254 PMCID: PMC4629038 DOI: 10.1155/2015/502795] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Accepted: 05/14/2015] [Indexed: 12/02/2022] Open
Abstract
Rapid advances in high-throughput sequencing techniques have created interesting computational challenges in bioinformatics. One of them refers to management of massive amounts of data generated by automatic sequencers. We need to deal with the persistency of genomic data, particularly storing and analyzing these large-scale processed data. To find an alternative to the frequently considered relational database model becomes a compelling task. Other data models may be more effective when dealing with a very large amount of nonconventional data, especially for writing and retrieving operations. In this paper, we discuss the Cassandra NoSQL database approach for storing genomic data. We perform an analysis of persistency and I/O operations with real data, using the Cassandra database system. We also compare the results obtained with a classical relational database system and another NoSQL database approach, MongoDB.
Collapse
|
284
|
Hung CL, Lin YS, Lin CY, Chung YC, Chung YF. CUDA ClustalW: An efficient parallel algorithm for progressive multiple sequence alignment on Multi-GPUs. Comput Biol Chem 2015; 58:62-68. [PMID: 26052076 DOI: 10.1016/j.compbiolchem.2015.05.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2014] [Revised: 05/14/2015] [Accepted: 05/14/2015] [Indexed: 10/23/2022]
Abstract
For biological applications, sequence alignment is an important strategy to analyze DNA and protein sequences. Multiple sequence alignment is an essential methodology to study biological data, such as homology modeling, phylogenetic reconstruction and etc. However, multiple sequence alignment is a NP-hard problem. In the past decades, progressive approach has been proposed to successfully align multiple sequences by adopting iterative pairwise alignments. Due to rapid growth of the next generation sequencing technologies, a large number of sequences can be produced in a short period of time. When the problem instance is large, progressive alignment will be time consuming. Parallel computing is a suitable solution for such applications, and GPU is one of the important architectures for contemporary parallel computing researches. Therefore, we proposed a GPU version of ClustalW v2.0.11, called CUDA ClustalW v1.0, in this work. From the experiment results, it can be seen that the CUDA ClustalW v1.0 can achieve more than 33× speedups for overall execution time by comparing to ClustalW v2.0.11.
Collapse
Affiliation(s)
- Che-Lun Hung
- Department of Computer Science and Communication Engineering, Providence University, 200, Sec. 7, Taiwan Boulevard, Shalu Dist., Taichung City 43301, Taiwan.
| | - Yu-Shiang Lin
- Department of Computer Science, National Tsing Hua University, 101, Sec. 2, Kuang-Fu Road, Hsinchu City 30013, Taiwan
| | - Chun-Yuan Lin
- Department of Computer Science and Information Engineering, Chang Gung University, 259, Wen-Hwa 1st Road, Kwei-Shan, Taoyuan City 33302, Taiwan.
| | - Yeh-Ching Chung
- Department of Computer Science, National Tsing Hua University, 101, Sec. 2, Kuang-Fu Road, Hsinchu City 30013, Taiwan
| | - Yi-Fang Chung
- Department of Computer Science and Information Engineering, Chang Gung University, 259, Wen-Hwa 1st Road, Kwei-Shan, Taoyuan City 33302, Taiwan
| |
Collapse
|
285
|
Lin Y, Lin C, Hung C, Chung Y, Lee K. GPU‐UPGMA: high‐performance computing for UPGMA algorithm based on graphics processing units. CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE 2015; 27:3403-3414. [DOI: 10.1002/cpe.3355] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2025]
Abstract
SummaryConstructing phylogenetic trees is of priority concern in computational biology, especially for developing biological taxonomies. As a conventional means of constructing phylogenetic trees, unweighted pair group method with arithmetic (UPGMA) is also an extensively adopted heuristic algorithm for constructing ultrametric trees (UT). Although the UT constructed by UPGMA is often not a true tree unless the molecular clock assumption holds, UT is still useful for the clocklike data. Moreover, UT has been successfully adopted in other problems, including orthologous‐domain classification and multiple sequence alignment. However, previous implementations of the UPGMA method have a limited ability to handle large taxa sets efficiently. This work describes a novel graphics processing unit (GPU)‐UPGMA approach, capable of providing rapid construction of extremely large datasets for biologists. Experimental results indicate that the proposed GPU‐UPGMA approach achieves an approximately 95× speedup ratio on NVIDIA Tesla C2050 GPU over the implementation with 2.13 GHz CPU. The developed techniques in GPU‐UPGMA also can be applied to solve the classification problem for large data set with more than tens of thousands items in the future.Copyright © 2014 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Yu‐Shiang Lin
- Department of Computer Science National Tsing Hua University No. 101, Section 2, Kuang‐Fu Road Hsinchu City 30013 Taiwan
| | - Chun‐Yuan Lin
- Department of Computer Science and Information Engineering Chang Gung University No. 259, Sanmin Rd., Guishan Township Taoyuan City 33302 Taiwan
| | - Che‐Lun Hung
- Department of Computer Science and Communication Engineering Providence University No. 200, Sec. 7, Taiwan Boulevard, Shalu Dist. Taichung City 43301 Taiwan
| | - Yeh‐Ching Chung
- Department of Computer Science National Tsing Hua University No. 101, Section 2, Kuang‐Fu Road Hsinchu City 30013 Taiwan
| | - Kual‐Zheng Lee
- Information and Communications Research Laboratories Industrial Technology Rearch Institute No. 195, Sec. 4, Chung Hsing Rd., Chutung Hsinchu City 31040 Taiwan
| |
Collapse
|
286
|
Roy A, Srinivasan B, Skolnick J. PoLi: A Virtual Screening Pipeline Based on Template Pocket and Ligand Similarity. J Chem Inf Model 2015. [PMID: 26225536 DOI: 10.1021/acs.jcim.5b00232] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Often in pharmaceutical research the goal is to identify small molecules that can interact with and appropriately modify the biological behavior of a new protein target. Unfortunately, most proteins lack both known structures and small molecule binders, prerequisites of many virtual screening, VS, approaches. For such proteins, ligand homology modeling, LHM, that copies ligands from homologous and perhaps evolutionarily distant template proteins, has been shown to be a powerful VS approach to identify possible binding ligands. However, if we want to target a specific pocket for which there is no homologous holo template protein structure, then LHM will not work. To address this issue, in a new pocket-based approach, PoLi, we generalize LHM by exploiting the fact that the number of distinct small molecule ligand-binding pockets in proteins is small. PoLi identifies similar ligand-binding pockets in a holo template protein library, selectively copies relevant parts of template ligands, and uses them for VS. In practice, PoLi is a hybrid structure and ligand-based VS algorithm that integrates 2D fingerprint-based and 3D shape-based similarity metrics for improved virtual screening performance. On standard DUD and DUD-E benchmark databases, using modeled receptor structures, PoLi achieves an average enrichment factor of 13.4 and 9.6, respectively, in the top 1% of the screened library. In contrast, traditional docking-based VS using AutoDock Vina and homology-based VS using FINDSITE(filt) have an average enrichment of 1.6 (3.0) and 9.0 (7.9) on the DUD (DUD-E) sets, respectively. Experimental validation of PoLi predictions on dihydrofolate reductase, DHFR, using differential scanning fluorimetry, DSF, identifies multiple ligands with diverse molecular scaffolds, thus demonstrating the advantage of PoLi over current state-of-the-art VS methods.
Collapse
Affiliation(s)
- Ambrish Roy
- Center for the Study of Systems Biology, Georgia Institute of Technology , 250 14th Street NW, Atlanta, Georgia 30318, United States
| | - Bharath Srinivasan
- Center for the Study of Systems Biology, Georgia Institute of Technology , 250 14th Street NW, Atlanta, Georgia 30318, United States
| | - Jeffrey Skolnick
- Center for the Study of Systems Biology, Georgia Institute of Technology , 250 14th Street NW, Atlanta, Georgia 30318, United States
| |
Collapse
|
287
|
Abstract
Drug discovery utilizes chemical biology and computational drug design approaches for the efficient identification and optimization of lead compounds. Chemical biology is mostly involved in the elucidation of the biological function of a target and the mechanism of action of a chemical modulator. On the other hand, computer-aided drug design makes use of the structural knowledge of either the target (structure-based) or known ligands with bioactivity (ligand-based) to facilitate the determination of promising candidate drugs. Various virtual screening techniques are now being used by both pharmaceutical companies and academic research groups to reduce the cost and time required for the discovery of a potent drug. Despite the rapid advances in these methods, continuous improvements are critical for future drug discovery tools. Advantages presented by structure-based and ligand-based drug design suggest that their complementary use, as well as their integration with experimental routines, has a powerful impact on rational drug design. In this article, we give an overview of the current computational drug design and their application in integrated rational drug development to aid in the progress of drug discovery research.
Collapse
Affiliation(s)
- Stephani Joy Y Macalino
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea
| | - Vijayakumar Gosu
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea
| | - Sunhye Hong
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea
| | - Sun Choi
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea.
| |
Collapse
|
288
|
Horlacher O, Nikitin F, Alocci D, Mariethoz J, Müller M, Lisacek F. MzJava: An open source library for mass spectrometry data processing. J Proteomics 2015; 129:63-70. [PMID: 26141507 DOI: 10.1016/j.jprot.2015.06.013] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Revised: 06/17/2015] [Accepted: 06/22/2015] [Indexed: 10/23/2022]
Abstract
Mass spectrometry (MS) is a widely used and evolving technique for the high-throughput identification of molecules in biological samples. The need for sharing and reuse of code among bioinformaticians working with MS data prompted the design and implementation of MzJava, an open-source Java Application Programming Interface (API) for MS related data processing. MzJava provides data structures and algorithms for representing and processing mass spectra and their associated biological molecules, such as metabolites, glycans and peptides. MzJava includes functionality to perform mass calculation, peak processing (e.g. centroiding, filtering, transforming), spectrum alignment and clustering, protein digestion, fragmentation of peptides and glycans as well as scoring functions for spectrum-spectrum and peptide/glycan-spectrum matches. For data import and export MzJava implements readers and writers for commonly used data formats. For many classes support for the Hadoop MapReduce (hadoop.apache.org) and Apache Spark (spark.apache.org) frameworks for cluster computing was implemented. The library has been developed applying best practices of software engineering. To ensure that MzJava contains code that is correct and easy to use the library's API was carefully designed and thoroughly tested. MzJava is an open-source project distributed under the AGPL v3.0 licence. MzJava requires Java 1.7 or higher. Binaries, source code and documentation can be downloaded from http://mzjava.expasy.org and https://bitbucket.org/sib-pig/mzjava. This article is part of a Special Issue entitled: Computational Proteomics.
Collapse
Affiliation(s)
- Oliver Horlacher
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva 1211, Switzerland; Centre Universitaire de Bioinformatique, University of Geneva, Geneva 1211, Switzerland
| | - Frederic Nikitin
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva 1211, Switzerland
| | - Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva 1211, Switzerland; Centre Universitaire de Bioinformatique, University of Geneva, Geneva 1211, Switzerland
| | - Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva 1211, Switzerland
| | - Markus Müller
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva 1211, Switzerland; Centre Universitaire de Bioinformatique, University of Geneva, Geneva 1211, Switzerland.
| | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva 1211, Switzerland; Centre Universitaire de Bioinformatique, University of Geneva, Geneva 1211, Switzerland.
| |
Collapse
|
289
|
Thareja S. Steroidal 5α-Reductase Inhibitors: A Comparative 3D-QSAR Study Review. Chem Rev 2015; 115:2883-94. [DOI: 10.1021/cr5005953] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Suresh Thareja
- School
of Pharmaceutical
Sciences, Guru Ghasidas Central University, Bilaspur, Chhattisgarh 495 009, India
| |
Collapse
|
290
|
Klein EY, Makowsky M, Orlando M, Hatna E, Braykov NP, Laxminarayan R. Influence of provider and urgent care density across different socioeconomic strata on outpatient antibiotic prescribing in the USA. J Antimicrob Chemother 2015; 70:1580-7. [PMID: 25604743 DOI: 10.1093/jac/dku563] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2014] [Accepted: 12/13/2014] [Indexed: 12/15/2022] Open
Abstract
OBJECTIVES Despite a strong link between antibiotic use and resistance, and highly variable antibiotic consumption rates across the USA, drivers of differences in consumption rates are not fully understood. The objective of this study was to examine how provider density affects antibiotic prescribing rates across socioeconomic groups in the USA. METHODS We aggregated data on all outpatient antibiotic prescriptions filled in retail pharmacies in the USA in 2000 and 2010 from IMS Health into 3436 geographically distinct hospital service areas and combined this with socioeconomic and structural factors that affect antibiotic prescribing from the US Census. We then used fixed-effect models to estimate the interaction between poverty and the number of physician offices per capita (i.e. physician density) and the presence of urgent care and retail clinics on antibiotic prescribing rates. RESULTS We found large geographical variation in prescribing, driven in part by the number of physician offices per capita. For an increase of one standard deviation in the number of physician offices per capita there was a 25.9% increase in prescriptions per capita. However, the determinants of the prescription rate were dependent on socioeconomic conditions. In poorer areas, clinics substitute for traditional physician offices, reducing the impact of physician density. In wealthier areas, clinics increase the effect of physician density on the prescribing rate. CONCLUSIONS In areas with higher poverty rates, access to providers drives the prescribing rate. However, in wealthier areas, where access is less of a problem, a higher density of providers and clinics increases the prescribing rate, potentially due to competition.
Collapse
Affiliation(s)
- Eili Y Klein
- Department of Emergency Medicine, Johns Hopkins University, Baltimore, MD, USA Center for Disease Dynamics, Economics & Policy, Washington, DC, USA
| | - Michael Makowsky
- Department of Emergency Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Megan Orlando
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
| | - Erez Hatna
- Department of Emergency Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Nikolay P Braykov
- Center for Disease Dynamics, Economics & Policy, Washington, DC, USA
| | - Ramanan Laxminarayan
- Center for Disease Dynamics, Economics & Policy, Washington, DC, USA Princeton Environmental Institute, Princeton University, Princeton, NJ, USA
| |
Collapse
|
291
|
Hung CL, Chen WP, Hua GJ, Zheng H, Tsai SJJ, Lin YL. Cloud computing-based TagSNP selection algorithm for human genome data. Int J Mol Sci 2015; 16:1096-110. [PMID: 25569088 PMCID: PMC4307292 DOI: 10.3390/ijms16011096] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 12/04/2014] [Indexed: 12/31/2022] Open
Abstract
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.
Collapse
Affiliation(s)
- Che-Lun Hung
- Department of Computer Science and Communication Engineering, Providence University, Taichung 43301, Taiwan.
| | - Wen-Pei Chen
- Department of Applied Chemistry, Providence University, Taiwan 43301, Taiwan.
| | - Guan-Jie Hua
- Department of Computer Science, National Tsing Hua University, Hsinchu 30013, Taiwan.
| | - Huiru Zheng
- School of Computing and Mathematics, University of Ulster, Newtownabbey BT37 0QB, UK.
| | - Suh-Jen Jane Tsai
- Department of Applied Chemistry, Providence University, Taiwan 43301, Taiwan.
| | - Yaw-Ling Lin
- Department of Applied Chemistry, Providence University, Taiwan 43301, Taiwan.
| |
Collapse
|
292
|
Seralathan MV, Sivanesan S, Bafana A, Kashyap SM, Patrizio A, Krishnamurthi K, Chakrabarti T. Cytochrome P450 BM3 of Bacillus megaterium - a possible endosulfan biotransforming gene. J Environ Sci (China) 2014; 26:2307-2314. [PMID: 25458686 DOI: 10.1016/j.jes.2014.09.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Revised: 01/23/2014] [Accepted: 04/03/2014] [Indexed: 06/04/2023]
Abstract
Computing chemistry was applied to understand biotransformation mechanism of an organochlorine pesticide, endosulfan. The stereo specific metabolic activity of human CYP-2B6 (cytochrome P450) on endosulfan has been well demonstrated. Sequence and structural similarity search revealed that the bacterium Bacillus megaterium encodes CYP-BM3, which is similar to CYP-2B6. The functional similarity was studied at organism level by batch-scale studies and it was proved that B. megaterium could metabolize endosulfan to endosulfan sulfate, as CYP-2B6 does in human system. The gene expression analyses also confirmed the possible role of CYP-BM3 in endosulfan metabolism. Thus, our results show that the protein structure based in-silico approach can help us to understand and identify microbes for remediation strategy development. To the best of our knowledge this is the first report which has extrapolated the bacterial gene for endosulfan biotransformation through in silico prediction approach for metabolic gene identification.
Collapse
Affiliation(s)
| | | | - Amit Bafana
- Environmental Health Division, CSIR-NEERI, Nagpur 440020, India
| | | | | | | | | |
Collapse
|
293
|
Mohammed EA, Far BH, Naugler C. Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends. BioData Min 2014; 7:22. [PMID: 25383096 PMCID: PMC4224309 DOI: 10.1186/1756-0381-7-22] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 10/18/2014] [Indexed: 12/23/2022] Open
Abstract
The emergence of massive datasets in a clinical setting presents both challenges and opportunities in data storage and analysis. This so called "big data" challenges traditional analytic tools and will increasingly require novel solutions adapted from other fields. Advances in information and communication technology present the most viable solutions to big data analysis in terms of efficiency and scalability. It is vital those big data solutions are multithreaded and that data access approaches be precisely tailored to large volumes of semi-structured/unstructured data. THE MAPREDUCE PROGRAMMING FRAMEWORK USES TWO TASKS COMMON IN FUNCTIONAL PROGRAMMING: Map and Reduce. MapReduce is a new parallel processing framework and Hadoop is its open-source implementation on a single computing node or on clusters. Compared with existing parallel processing paradigms (e.g. grid computing and graphical processing unit (GPU)), MapReduce and Hadoop have two advantages: 1) fault-tolerant storage resulting in reliable data processing by replicating the computing tasks, and cloning the data chunks on different computing nodes across the computing cluster; 2) high-throughput data processing via a batch processing framework and the Hadoop distributed file system (HDFS). Data are stored in the HDFS and made available to the slave nodes for computation. In this paper, we review the existing applications of the MapReduce programming framework and its implementation platform Hadoop in clinical big data and related medical health informatics fields. The usage of MapReduce and Hadoop on a distributed system represents a significant advance in clinical big data processing and utilization, and opens up new opportunities in the emerging era of big data analytics. The objective of this paper is to summarize the state-of-the-art efforts in clinical big data analytics and highlight what might be needed to enhance the outcomes of clinical big data analytics tools. This paper is concluded by summarizing the potential usage of the MapReduce programming framework and Hadoop platform to process huge volumes of clinical data in medical health informatics related fields.
Collapse
Affiliation(s)
- Emad A Mohammed
- Department of Electrical and Computer Engineering, Schulich School of Engineering, University of Calgary, Calgary, AB, Canada
| | - Behrouz H Far
- Department of Electrical and Computer Engineering, Schulich School of Engineering, University of Calgary, Calgary, AB, Canada
| | - Christopher Naugler
- Department of Pathology and Laboratory Medicine, University of Calgary and Calgary Laboratory Services, Calgary, AB, Canada
| |
Collapse
|
294
|
Wu Z, Huang C, Zhou T, Lin J, Zhang K, Li W, Zheng J, Chen B, Wang B, Zhang X, Xing J. Association of polymorphisms in AGTR1 and AGTR2 genes with primary aldosteronism in the Chinese Han population. J Renin Angiotensin Aldosterone Syst 2014; 16:880-7. [PMID: 25172908 DOI: 10.1177/1470320314534511] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open
Abstract
HYPOTHESIS Polymorphisms in angiotensin II type-1/2 receptor genes (AGTR1/AGTR2) may be involved in the pathogenesis of primary aldosteronism. The present study aims to reveal some loci susceptible to the disease on the genes in a group of Chinese Han nationality. MATERIALS AND METHODS A case-control study was conducted in 202 patients and 188 controls. Ten tagging SNPs on AGTR1/AGTR2 were genotyped for all subjects via the method of multiplex PCR-ligase detection reaction. Statistical analysis was performed with chi-square test and logistic regression analysis. RESULTS rs3772616 on the AGTR1 gene was a factor for susceptibility to primary aldosteronism (p<0.001), and the TT genotype significantly decreased the risk of primary aldosteronism compared with the CC homozygote (p=0.008, adjusted OR=0.13; 95%CI: 0.03-0.59). The rs3772616 polymorphism was associated with primary aldosteronism under the additive and dominant models. The female carriers of the G allele in rs5193 showed a significant difference compared with the T allele. CONCLUSIONS The AGTR1 rs3772616 polymorphism can be considered as a hereditary marker for primary aldosteronism, and in the Chinese Han population the rs5193 G allele seems to predispose to it only in women.
Collapse
Affiliation(s)
- Zhun Wu
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| | - Chao Huang
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| | - Tingting Zhou
- Department of Urology, Chengdu Military General Hospital, Chengdu, Sichuan, China
| | - Jinglai Lin
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| | - Kaiyan Zhang
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| | - Wei Li
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| | - Jiaxin Zheng
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| | - Bin Chen
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| | - Baojun Wang
- Department of Urology, Chinese PLA General Hospital, Beijing, China
| | - Xu Zhang
- Department of Urology, Chinese PLA General Hospital, Beijing, China
| | - Jinchun Xing
- Department of Urology, The First Affiliated Hospital of Xiamen University, Xiamen, Fujian, China
| |
Collapse
|
295
|
Liyanage H, de Lusignan S, Liaw ST, Kuziemsky CE, Mold F, Krause P, Fleming D, Jones S. Big Data Usage Patterns in the Health Care Domain: A Use Case Driven Approach Applied to the Assessment of Vaccination Benefits and Risks. Contribution of the IMIA Primary Healthcare Working Group. Yearb Med Inform 2014; 9:27-35. [PMID: 25123718 PMCID: PMC4287086 DOI: 10.15265/iy-2014-0016] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Generally benefits and risks of vaccines can be determined from studies carried out as part of regulatory compliance, followed by surveillance of routine data; however there are some rarer and more long term events that require new methods. Big data generated by increasingly affordable personalised computing, and from pervasive computing devices is rapidly growing and low cost, high volume, cloud computing makes the processing of these data inexpensive. OBJECTIVE To describe how big data and related analytical methods might be applied to assess the benefits and risks of vaccines. METHOD We reviewed the literature on the use of big data to improve health, applied to generic vaccine use cases, that illustrate benefits and risks of vaccination. We defined a use case as the interaction between a user and an information system to achieve a goal. We used flu vaccination and pre-school childhood immunisation as exemplars. RESULTS We reviewed three big data use cases relevant to assessing vaccine benefits and risks: (i) Big data processing using crowdsourcing, distributed big data processing, and predictive analytics, (ii) Data integration from heterogeneous big data sources, e.g. the increasing range of devices in the "internet of things", and (iii) Real-time monitoring for the direct monitoring of epidemics as well as vaccine effects via social media and other data sources. CONCLUSIONS Big data raises new ethical dilemmas, though its analysis methods can bring complementary real-time capabilities for monitoring epidemics and assessing vaccine benefit-risk balance.
Collapse
Affiliation(s)
| | - S de Lusignan
- Simon de Lusignan, Clinical Informatics & Health Outcomes research group, Department of Health Care Policy and Management, University of Surrey, GUILDFORD, Surrey GU2 7XH, UK, E-mail:
| | | | | | | | | | | | | |
Collapse
|
296
|
Mrozek D, Małysiak-Mrozek B, Kłapciński A. Cloud4Psi: cloud computing for 3D protein structure similarity searching. Bioinformatics 2014; 30:2822-5. [PMID: 24930141 PMCID: PMC4173022 DOI: 10.1093/bioinformatics/btu389] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Summary: Popular methods for 3D protein structure similarity searching, especially those that generate high-quality alignments such as Combinatorial Extension (CE) and Flexible structure Alignment by Chaining Aligned fragment pairs allowing Twists (FATCAT) are still time consuming. As a consequence, performing similarity searching against large repositories of structural data requires increased computational resources that are not always available. Cloud computing provides huge amounts of computational power that can be provisioned on a pay-as-you-go basis. We have developed the cloud-based system that allows scaling of the similarity searching process vertically and horizontally. Cloud4Psi (Cloud for Protein Similarity) was tested in the Microsoft Azure cloud environment and provided good, almost linearly proportional acceleration when scaled out onto many computational units. Availability and implementation: Cloud4Psi is available as Software as a Service for testing purposes at: http://cloud4psi.cloudapp.net/. For source code and software availability, please visit the Cloud4Psi project home page at http://zti.polsl.pl/dmrozek/science/cloud4psi.htm. Contact:dariusz.mrozek@polsl.pl
Collapse
Affiliation(s)
- Dariusz Mrozek
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
| | - Bożena Małysiak-Mrozek
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
| | - Artur Kłapciński
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
| |
Collapse
|
297
|
Hung CL, Hua GJ. Local alignment tool based on Hadoop framework and GPU architecture. BIOMED RESEARCH INTERNATIONAL 2014; 2014:541490. [PMID: 24955362 PMCID: PMC4052794 DOI: 10.1155/2014/541490] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2014] [Accepted: 04/14/2014] [Indexed: 11/17/2022]
Abstract
With the rapid growth of next generation sequencing technologies, such as Slex, more and more data have been discovered and published. To analyze such huge data the computational performance is an important issue. Recently, many tools, such as SOAP, have been implemented on Hadoop and GPU parallel computing architectures. BLASTP is an important tool, implemented on GPU architectures, for biologists to compare protein sequences. To deal with the big biology data, it is hard to rely on single GPU. Therefore, we implement a distributed BLASTP by combining Hadoop and multi-GPUs. The experimental results present that the proposed method can improve the performance of BLASTP on single GPU, and also it can achieve high availability and fault tolerance.
Collapse
Affiliation(s)
- Che-Lun Hung
- Department of Computer Science and Communication Engineering, Providence University, No. 200, Section 7, Taiwan Boulevard, Shalu District, Taichung 43301, Taiwan
| | - Guan-Jie Hua
- Department of Computer Science and Information Engineering, Providence University, No. 200, Section 7, Taiwan Boulevard, Shalu District, Taichung 43301, Taiwan
| |
Collapse
|
298
|
Degli Esposti M, Chouaia B, Comandatore F, Crotti E, Sassera D, Lievens PMJ, Daffonchio D, Bandi C. Evolution of mitochondria reconstructed from the energy metabolism of living bacteria. PLoS One 2014; 9:e96566. [PMID: 24804722 PMCID: PMC4013037 DOI: 10.1371/journal.pone.0096566] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2014] [Accepted: 04/07/2014] [Indexed: 11/26/2022] Open
Abstract
The ancestors of mitochondria, or proto-mitochondria, played a crucial role in the evolution of eukaryotic cells and derived from symbiotic α-proteobacteria which merged with other microorganisms - the basis of the widely accepted endosymbiotic theory. However, the identity and relatives of proto-mitochondria remain elusive. Here we show that methylotrophic α-proteobacteria could be the closest living models for mitochondrial ancestors. We reached this conclusion after reconstructing the possible evolutionary pathways of the bioenergy systems of proto-mitochondria with a genomic survey of extant α-proteobacteria. Results obtained with complementary molecular and genetic analyses of diverse bioenergetic proteins converge in indicating the pathway stemming from methylotrophic bacteria as the most probable route of mitochondrial evolution. Contrary to other α-proteobacteria, methylotrophs show transition forms for the bioenergetic systems analysed. Our approach of focusing on these bioenergetic systems overcomes the phylogenetic impasse that has previously complicated the search for mitochondrial ancestors. Moreover, our results provide a new perspective for experimentally re-evolving mitochondria from extant bacteria and in the future produce synthetic mitochondria.
Collapse
Affiliation(s)
| | - Bessem Chouaia
- Department of Food, Environmental and Evolutionary Sciences, University of Milan, Milan, Italy
| | - Francesco Comandatore
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, University of Milan, Milan, Italy
| | - Elena Crotti
- Department of Food, Environmental and Evolutionary Sciences, University of Milan, Milan, Italy
| | - Davide Sassera
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, University of Milan, Milan, Italy
| | | | - Daniele Daffonchio
- Department of Food, Environmental and Evolutionary Sciences, University of Milan, Milan, Italy
| | - Claudio Bandi
- Dipartimento di Scienze Veterinarie e Sanità Pubblica, University of Milan, Milan, Italy
| |
Collapse
|
299
|
Chang TH, Wu SL, Wang WJ, Horng JT, Chang CW. A novel approach for discovering condition-specific correlations of gene expressions within biological pathways by using cloud computing technology. BIOMED RESEARCH INTERNATIONAL 2014; 2014:763237. [PMID: 24579087 PMCID: PMC3919110 DOI: 10.1155/2014/763237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2013] [Revised: 11/18/2013] [Accepted: 12/15/2013] [Indexed: 11/18/2022]
Abstract
Microarrays are widely used to assess gene expressions. Most microarray studies focus primarily on identifying differential gene expressions between conditions (e.g., cancer versus normal cells), for discovering the major factors that cause diseases. Because previous studies have not identified the correlations of differential gene expression between conditions, crucial but abnormal regulations that cause diseases might have been disregarded. This paper proposes an approach for discovering the condition-specific correlations of gene expressions within biological pathways. Because analyzing gene expression correlations is time consuming, an Apache Hadoop cloud computing platform was implemented. Three microarray data sets of breast cancer were collected from the Gene Expression Omnibus, and pathway information from the Kyoto Encyclopedia of Genes and Genomes was applied for discovering meaningful biological correlations. The results showed that adopting the Hadoop platform considerably decreased the computation time. Several correlations of differential gene expressions were discovered between the relapse and nonrelapse breast cancer samples, and most of them were involved in cancer regulation and cancer-related pathways. The results showed that breast cancer recurrence might be highly associated with the abnormal regulations of these gene pairs, rather than with their individual expression levels. The proposed method was computationally efficient and reliable, and stable results were obtained when different data sets were used. The proposed method is effective in identifying meaningful biological regulation patterns between conditions.
Collapse
Affiliation(s)
- Tzu-Hao Chang
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei 110, Taiwan
| | - Shih-Lin Wu
- Department of Computer Science and Information Engineering, College of Engineering, Chang Gung University, Taoyuan 333, Taiwan
| | - Wei-Jen Wang
- Department of Computer Science and Information Engineering, National Central University, Taoyuan 320, Taiwan
| | - Jorng-Tzong Horng
- Department of Computer Science and Information Engineering, National Central University, Taoyuan 320, Taiwan
- Department of Biomedical Informatics, Asia University, Taichung 413, Taiwan
| | - Cheng-Wei Chang
- Department of Information Management, Hsing Wu University, New Taipei City 244, Taiwan
| |
Collapse
|
300
|
Hung CL, Hua GJ. Cloud computing for protein-ligand binding site comparison. BIOMED RESEARCH INTERNATIONAL 2013; 2013:170356. [PMID: 23762824 PMCID: PMC3671236 DOI: 10.1155/2013/170356] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Accepted: 03/28/2013] [Indexed: 12/30/2022]
Abstract
The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery.
Collapse
Affiliation(s)
- Che-Lun Hung
- Department of Computer Science and Communication Engineering, Providence University, Taiwan Boulevard, Shalu District, Taichung 43301, Taiwan.
| | | |
Collapse
|