1
|
Schmidt B, Hildebrandt A. From GPUs to AI and quantum: three waves of acceleration in bioinformatics. Drug Discov Today 2024; 29:103990. [PMID: 38663581 DOI: 10.1016/j.drudis.2024.103990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 04/05/2024] [Accepted: 04/17/2024] [Indexed: 05/01/2024]
Abstract
The enormous growth in the amount of data generated by the life sciences is continuously shifting the field from model-driven science towards data-driven science. The need for efficient processing has led to the adoption of massively parallel accelerators such as graphics processing units (GPUs). Consequently, the development of bioinformatics methods nowadays often heavily depends on the effective use of these powerful technologies. Furthermore, progress in computational techniques and architectures continues to be highly dynamic, involving novel deep neural network models and artificial intelligence (AI) accelerators, and potentially quantum processing units in the future. These are expected to be disruptive for the life sciences as a whole and for drug discovery in particular. Here, we identify three waves of acceleration and their applications in a bioinformatics context: (i) GPU computing, (ii) AI and (iii) next-generation quantum computers.
Collapse
Affiliation(s)
- Bertil Schmidt
- Institut für Informatik, Johannes Gutenberg University, Mainz, Germany.
| | | |
Collapse
|
2
|
Zhao X, Sun C, Jin M, Chen J, Xing L, Yan J, Wang H, Liu Z, Chen WH. Enrichment Culture but Not Metagenomic Sequencing Identified a Highly Prevalent Phage Infecting Lactiplantibacillus plantarum in Human Feces. Microbiol Spectr 2023; 11:e0434022. [PMID: 36995238 PMCID: PMC10269749 DOI: 10.1128/spectrum.04340-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 03/07/2023] [Indexed: 03/31/2023] Open
Abstract
Lactiplantibacillus plantarum (previously known as Lactobacillus plantarum) is increasingly used as a probiotic to treat human diseases, but its phages in the human gut remain unexplored. Here, we report its first gut phage, Gut-P1, which we systematically screened using metagenomic sequencing, virus-like particle (VLP) sequencing, and enrichment culture from 35 fecal samples. Gut-P1 is virulent, belongs to the Douglaswolinvirus genus, and is highly prevalent in the gut (~11% prevalence); it has a genome of 79,928 bp consisting of 125 protein coding genes and displaying low sequence similarities to public L. plantarum phages. Physiochemical characterization shows that it has a short latent period and adapts to broad ranges of temperatures and pHs. Furthermore, Gut-P1 strongly inhibits the growth of L. plantarum strains at a multiplicity of infection (MOI) of 1e-6. Together, these results indicate that Gut-P1 can greatly impede the application of L. plantarum in humans. Strikingly, Gut-P1 was identified only in the enrichment culture, not in our metagenomic or VLP sequencing data nor in any public human phage databases, indicating the inefficiency of bulk sequencing in recovering low-abundance but highly prevalent phages and pointing to the unexplored hidden diversity of the human gut virome despite recent large-scale sequencing and bioinformatics efforts. IMPORTANCE As Lactiplantibacillus plantarum (previously known as Lactobacillus plantarum) is increasingly used as a probiotic to treat human gut-related diseases, its bacteriophages may pose a certain threat to their further application and should be identified and characterized more often from the human intestine. Here, we isolated and identified the first gut L. plantarum phage that is prevalent in a Chinese population. This phage, Gut-P1, is virulent and can strongly inhibit the growth of multiple L. plantarum strains at low MOIs. Our results also show that bulk sequencing is inefficient at recovering low-abundance but highly prevalent phages such as Gut-P1, suggesting that the hidden diversity of human enteroviruses has not yet been explored. Our results call for innovative approaches to isolate and identify intestinal phages from the human gut and to rethink our current understanding of the enterovirus, particularly its underestimated diversity and overestimated individual specificity.
Collapse
Affiliation(s)
- Xueyang Zhao
- College of Life Science, Henan Normal University, Xinxiang, Henan, China
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Chuqing Sun
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Menglu Jin
- College of Life Science, Henan Normal University, Xinxiang, Henan, China
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Jingchao Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Lulu Xing
- College of Life Science, Henan Normal University, Xinxiang, Henan, China
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Jin Yan
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Hailei Wang
- College of Life Science, Henan Normal University, Xinxiang, Henan, China
| | - Zhi Liu
- Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
- Institution of Medical Artificial Intelligence, Binzhou Medical University, Yantai, China
| |
Collapse
|
3
|
Cheng T, Chin PJ, Cha K, Petrick N, Mikailov M. Profiling the BLAST bioinformatics application for load balancing on high-performance computing clusters. BMC Bioinformatics 2022; 23:544. [PMID: 36526957 PMCID: PMC9758941 DOI: 10.1186/s12859-022-05029-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 10/31/2022] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The Basic Local Alignment Search Tool (BLAST) is a suite of commonly used algorithms for identifying matches between biological sequences. The user supplies a database file and query file of sequences for BLAST to find identical sequences between the two. The typical millions of database and query sequences make BLAST computationally challenging but also well suited for parallelization on high-performance computing clusters. The efficacy of parallelization depends on the data partitioning, where the optimal data partitioning relies on an accurate performance model. In previous studies, a BLAST job was sped up by 27 times by partitioning the database and query among thousands of processor nodes. However, the optimality of the partitioning method was not studied. Unlike BLAST performance models proposed in the literature that usually have problem size and hardware configuration as the only variables, the execution time of a BLAST job is a function of database size, query size, and hardware capability. In this work, the nucleotide BLAST application BLASTN was profiled using three methods: shell-level profiling with the Unix "time" command, code-level profiling with the built-in "profiler" module, and system-level profiling with the Unix "gprof" program. The runtimes were measured for six node types, using six different database files and 15 query files, on a heterogeneous HPC cluster with 500+ nodes. The empirical measurement data were fitted with quadratic functions to develop performance models that were used to guide the data parallelization for BLASTN jobs. RESULTS Profiling results showed that BLASTN contains more than 34,500 different functions, but a single function, RunMTBySplitDB, takes 99.12% of the total runtime. Among its 53 child functions, five core functions were identified to make up 92.12% of the overall BLASTN runtime. Based on the performance models, static load balancing algorithms can be applied to the BLASTN input data to minimize the runtime of the longest job on an HPC cluster. Four test cases being run on homogeneous and heterogeneous clusters were tested. Experiment results showed that the runtime can be reduced by 81% on a homogeneous cluster and by 20% on a heterogeneous cluster by re-distributing the workload. DISCUSSION Optimal data partitioning can improve BLASTN's overall runtime 5.4-fold in comparison with dividing the database and query into the same number of fragments. The proposed methodology can be used in the other applications in the BLAST+ suite or any other application as long as source code is available.
Collapse
Affiliation(s)
- Trinity Cheng
- grid.417587.80000 0001 2243 3366Center for Devices and Radiological Health, U.S. Food and Drug Administration, Silver Spring, MD 20993 USA ,grid.21107.350000 0001 2171 9311Department of Biomedical Engineering, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD 21218 USA
| | - Pei-Ju Chin
- grid.290496.00000 0001 1945 2072Center for Biologics Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD 20993 USA
| | - Kenny Cha
- grid.417587.80000 0001 2243 3366Center for Devices and Radiological Health, U.S. Food and Drug Administration, Silver Spring, MD 20993 USA
| | - Nicholas Petrick
- grid.417587.80000 0001 2243 3366Center for Devices and Radiological Health, U.S. Food and Drug Administration, Silver Spring, MD 20993 USA
| | - Mike Mikailov
- grid.417587.80000 0001 2243 3366Center for Devices and Radiological Health, U.S. Food and Drug Administration, Silver Spring, MD 20993 USA
| |
Collapse
|
4
|
Xiao C, Sun T, Yang Z, Zou L, Deng J, Yang X. Whole transcriptome RNA Sequencing Reveals the Global Molecular Responses and circRNA/lncRNA-miRNA-mRNA ceRNA Regulatory Network in Chicken Fat Deposition. Poult Sci 2022; 101:102121. [PMID: 36116349 PMCID: PMC9485216 DOI: 10.1016/j.psj.2022.102121] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 03/21/2022] [Accepted: 08/03/2022] [Indexed: 11/29/2022] Open
Abstract
Fat deposition is a vital factor affecting the economics of poultry production. Numerous studies on fat deposition have been done. However, the molecular regulatory mechanism is still unclear. In the present study, the whole-transcriptome RNA sequencing in abdominal fat, back skin, and liver both high- and low-abdominal fat groups was used to uncover the competitive endogenous RNA (ceRNA) regulation network related to chicken fat deposition. The results showed that differentially expressed (DE) genes in abdominal fat, back skin, liver were 1207(784 mRNAs, 330 lncRNAs, 41 circRNAs, 52 miRNAs), 860 (607 mRNAs, 166 lncRNAs, 26 circRNAs, 61 miRNAs), and 923 (501 mRNAs, 262 lncRNAs, 15 circRNAs, 145 miRNAs), respectively. The ceRNA regulatory network analysis indicated that the fatty acid metabolic process, monocarboxylic acid metabolic process, carboxylic acid metabolic process, glycerolipid metabolism, fatty acid metabolism, and peroxisome proliferator-activated receptor (PPAR) signaling pathway took part in chicken fat deposition. Meanwhile, we scan the important genes, FADS2, HSD17B12, ELOVL5, AKR1E2, DGKQ, GPAM, PLIN2, which were regulated by gga-miR-460b-5p, gga-miR-199-5p, gga-miR-7470-3p, gga-miR-6595-5p, gga-miR-101-2-5p. While these miRNAs were competitive combined by lncRNAs including MSTRG.18043, MSTRG.7738, MSTRG.21310, MSTRG.19577, and circRNAs including novel_circ_PTPN2, novel_circ_CTNNA1, novel_circ_PTPRD. This finding provides new insights into the regulatory mechanism of mRNA, miRNA, lncRNA, and circRNA in chicken fat deposition.
Collapse
Affiliation(s)
- Cong Xiao
- College of Animal Science and Technology, Guangxi University, Nanning 530004, China
| | - Tiantian Sun
- College of Animal Science and Technology, Guangxi University, Nanning 530004, China
| | - Zhuliang Yang
- College of Animal Science and Technology, Guangxi University, Nanning 530004, China
| | - Leqin Zou
- College of Animal Science and Technology, Guangxi University, Nanning 530004, China
| | - Jixian Deng
- College of Animal Science and Technology, Guangxi University, Nanning 530004, China
| | - Xiurong Yang
- College of Animal Science and Technology, Guangxi University, Nanning 530004, China.
| |
Collapse
|
5
|
GPrimer: a fast GPU-based pipeline for primer design for qPCR experiments. BMC Bioinformatics 2021; 22:220. [PMID: 33926379 PMCID: PMC8082839 DOI: 10.1186/s12859-021-04133-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 04/14/2021] [Indexed: 11/10/2022] Open
Abstract
Background Design of valid high-quality primers is essential for qPCR experiments. MRPrimer is a powerful pipeline based on MapReduce that combines both primer design for target sequences and homology tests on off-target sequences. It takes an entire sequence DB as input and returns all feasible and valid primer pairs existing in the DB. Due to the effectiveness of primers designed by MRPrimer in qPCR analysis, it has been widely used for developing many online design tools and building primer databases. However, the computational speed of MRPrimer is too slow to deal with the sizes of sequence DBs growing exponentially and thus must be improved. Results We develop a fast GPU-based pipeline for primer design (GPrimer) that takes the same input and returns the same output with MRPrimer. MRPrimer consists of a total of seven MapReduce steps, among which two steps are very time-consuming. GPrimer significantly improves the speed of those two steps by exploiting the computational power of GPUs. In particular, it designs data structures for coalesced memory access in GPU and workload balancing among GPU threads and copies the data structures between main memory and GPU memory in a streaming fashion. For human RefSeq DB, GPrimer achieves a speedup of 57 times for the entire steps and a speedup of 557 times for the most time-consuming step using a single machine of 4 GPUs, compared with MRPrimer running on a cluster of six machines. Conclusions We propose a GPU-based pipeline for primer design that takes an entire sequence DB as input and returns all feasible and valid primer pairs existing in the DB at once without an additional step using BLAST-like tools. The software is available at https://github.com/qhtjrmin/GPrimer.git.
Collapse
|
6
|
Wang Y, Zhao B, Lu Z, Shi Y, Li J. The complete chloroplast genome provides insight into the polymorphism and adaptive evolution of Garcinia paucinervis. BIOTECHNOL BIOTEC EQ 2021. [DOI: 10.1080/13102818.2021.1879676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
Affiliation(s)
- Yifei Wang
- Department of Pharmacognosy, College of Pharmacy, Guilin Medical University, Guilin, China
| | - Bo Zhao
- Department of Pharmacognosy, College of Pharmacy, Guilin Medical University, Guilin, China
| | - Zhaocen Lu
- Department of Characteristic Economic Plant Research Center, Guangxi Institute of Botany, The Chinese Academy of Sciences, Guilin, China
| | - Yancai Shi
- Department of Characteristic Economic Plant Research Center, Guangxi Institute of Botany, The Chinese Academy of Sciences, Guilin, China
| | - Jingjian Li
- Department of Pharmacognosy, College of Pharmacy, Guilin Medical University, Guilin, China
| |
Collapse
|
7
|
Carroll TC, Ojiaku JT, Wong PWH. Semiglobal Sequence Alignment with Gaps Using GPU. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:2086-2097. [PMID: 31056513 DOI: 10.1109/tcbb.2019.2914105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this paper, we consider the pair-wise semiglobal sequence alignment problem with gaps, which is motivated by the re-sequencing problem that requires to assemble short reads sequences into a genome sequence by referring to a reference sequence. The problem has been studied before for single gap and bounded number of gaps. For single gap, there is a GPU-based algorithm proposed (Barton et al., 2015). In our work, we propose a GPU-based algorithm for the bounded number of gaps case, called GPUGapsMis. We implement the algorithm and compare the performance with the CPU-based algorithm, called CPUGapsMis. The algorithm has two distinct stages: the alignment phase, and the backtrack phase. We investigate several different approaches, in order to determine the most favorable for this problem, by means of a Hybrid model or a wholly-GPU based model, as well as the alignment of single text sequences or multiple text sequences on the GPU at a time. We show that the alignment phase of the algorithm is a good candidate for parallelization, with peak speedup of 11 times. We show that although the backtracking phase is sequential, it is more beneficial to perform it on the GPU, as opposed to returning to the CPU and performing there. When performing both phases on the GPU, GPUGapsMis achieves a peak speedup of 10.4 times against CPUGapsMis. Our data parallel GPU algorithm achieves results which are an improvement on those of an existing GPU data parallel implementation (Ojiaku, 2014).
Collapse
|
8
|
Sha SP, Suryavanshi MV, Jani K, Sharma A, Shouche Y, Tamang JP. Diversity of Yeasts and Molds by Culture-Dependent and Culture-Independent Methods for Mycobiome Surveillance of Traditionally Prepared Dried Starters for the Production of Indian Alcoholic Beverages. Front Microbiol 2018; 9:2237. [PMID: 30319566 PMCID: PMC6169615 DOI: 10.3389/fmicb.2018.02237] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Accepted: 09/03/2018] [Indexed: 12/25/2022] Open
Abstract
Marcha, thiat, dawdim, hamei, humao, khekhrii, chowan, and phut are traditionally prepared dried starters used for production of various ethnic alcoholic beverages in North East states of India. The surveillance of mycobiome associated with these starters have been revealed by culture-dependent methods using phenotypic and molecular tools. We identified Wickerhamomyces anomalus, Pichia anomala, Saccharomycopsis fibuligera, Pichia terricola, Pichia kudriavzevii, and Candida glabrata by ITS-PCR. The diversity of yeasts and molds in all 40 samples was also investigated by culture-independent method using PCR-DGGE analysis. The average distributions of yeasts showed Saccharomyces cerevisiae (16.5%), Saccharomycopsis fibuligera (15.3%), Wickerhamomyces anomalus (11.3%), S. malanga (11.7%), Kluyveromyces marxianus (5.3%), Meyerozyma sp. (2.7%), Candida glabrata (2.7%), and many strains below 2%. About 12 strains of molds were also identified based on PCR-DGGE analysis which included Aspergillus penicillioides (5.0%), Rhizopus oryzae (3.3%), and sub-phylum: Mucoromycotina (2.1%). Different techniques used in this paper revealed the diversity and differences of mycobiome species in starter cultures of India which may be referred as baseline data for further research.
Collapse
Affiliation(s)
- Shankar Prasad Sha
- DAICENTRE (DBT-AIST International Centre for Translational and Environmental Research) and Bioinformatics Centre, Department of Microbiology, School of Life Sciences, Sikkim University, Gangtok, India
| | - Mangesh Vasant Suryavanshi
- DAICENTRE (DBT-AIST International Centre for Translational and Environmental Research) and Bioinformatics Centre, Department of Microbiology, School of Life Sciences, Sikkim University, Gangtok, India.,National Centre for Microbial Resource, National Centre for Cell Science, Pune, India
| | - Kunal Jani
- National Centre for Microbial Resource, National Centre for Cell Science, Pune, India
| | - Avinash Sharma
- National Centre for Microbial Resource, National Centre for Cell Science, Pune, India
| | - Yogesh Shouche
- National Centre for Microbial Resource, National Centre for Cell Science, Pune, India
| | - Jyoti Prakash Tamang
- DAICENTRE (DBT-AIST International Centre for Translational and Environmental Research) and Bioinformatics Centre, Department of Microbiology, School of Life Sciences, Sikkim University, Gangtok, India
| |
Collapse
|
9
|
Wang X, Cheng F, Rohlsen D, Bi C, Wang C, Xu Y, Wei S, Ye Q, Yin T, Ye N. Organellar genome assembly methods and comparative analysis of horticultural plants. HORTICULTURE RESEARCH 2018; 5:3. [PMID: 29423233 PMCID: PMC5798811 DOI: 10.1038/s41438-017-0002-1] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Revised: 11/20/2017] [Accepted: 11/26/2017] [Indexed: 05/31/2023]
Abstract
Although organellar genomes (including chloroplast and mitochondrial genomes) are smaller than nuclear genomes in size and gene number, organellar genomes are very important for the investigation of plant evolution and molecular ecology mechanisms. Few studies have focused on the organellar genomes of horticultural plants. Approximately 1193 chloroplast genomes and 199 mitochondrial genomes of land plants are available in the National Center for Biotechnology Information (NCBI), of which only 39 are from horticultural plants. In this paper, we report an innovative and efficient method for high-quality horticultural organellar genome assembly from next-generation sequencing (NGS) data. Sequencing reads were first assembled by Newbler, Amos, and Minimus software with default parameters. The remaining gaps were then filled through BLASTN search and PCR. The complete DNA sequence was corrected based on Illumina sequencing data using BWA (Burrows-Wheeler Alignment tool) software. The advantage of this approach is that there is no need to isolate organellar DNA from total DNA during sample preparation. Using this procedure, the complete mitochondrial and chloroplast genomes of an ornamental plant, Salix suchowensis, and a fruit tree, Ziziphus jujuba, were identified. This study shows that horticultural plants have similar mitochondrial and chloroplast sequence organization to other seed plants. Most horticultural plants demonstrate a slight bias toward A+T rich features in the mitochondrial genome. In addition, a phylogenetic analysis of 39 horticultural plants based on 15 protein-coding genes showed that some mitochondrial genes are horizontally transferred from chloroplast DNA. Our study will provide an important reference for organellar genome assembly in other horticultural plants. Furthermore, phylogenetic analysis of the organellar genomes of horticultural plants could accurately clarify the unanticipated relationships among these plants.
Collapse
Affiliation(s)
- Xuelin Wang
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, Jiangsu China
| | - Feng Cheng
- Department of Pharmaceutical Science, College of Pharmacy, University of South Florida, Tampa, FL 33612 USA
| | - Dekai Rohlsen
- Department of Pharmaceutical Science, College of Pharmacy, University of South Florida, Tampa, FL 33612 USA
| | - Changwei Bi
- School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu China
| | - Chunyan Wang
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, Jiangsu China
| | - Yiqing Xu
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, Jiangsu China
| | - Suyun Wei
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, Jiangsu China
| | - Qiaolin Ye
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, Jiangsu China
| | - Tongming Yin
- College of Forestry, Nanjing Forestry University, Nanjing, Jiangsu China
| | - Ning Ye
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, Jiangsu China
| |
Collapse
|
10
|
Nobile MS, Cazzaniga P, Tangherloni A, Besozzi D. Graphics processing units in bioinformatics, computational biology and systems biology. Brief Bioinform 2017; 18:870-885. [PMID: 27402792 PMCID: PMC5862309 DOI: 10.1093/bib/bbw058] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2016] [Indexed: 01/18/2023] Open
Abstract
Several studies in Bioinformatics, Computational Biology and Systems Biology rely on the definition of physico-chemical or mathematical models of biological systems at different scales and levels of complexity, ranging from the interaction of atoms in single molecules up to genome-wide interaction networks. Traditional computational methods and software tools developed in these research fields share a common trait: they can be computationally demanding on Central Processing Units (CPUs), therefore limiting their applicability in many circumstances. To overcome this issue, general-purpose Graphics Processing Units (GPUs) are gaining an increasing attention by the scientific community, as they can considerably reduce the running time required by standard CPU-based software, and allow more intensive investigations of biological systems. In this review, we present a collection of GPU tools recently developed to perform computational analyses in life science disciplines, emphasizing the advantages and the drawbacks in the use of these parallel architectures. The complete list of GPU-powered tools here reviewed is available at http://bit.ly/gputools.
Collapse
Affiliation(s)
- Marco S Nobile
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milano, Italy
- SYSBIO.IT Centre of Systems Biology, Milano, Italy
| | - Paolo Cazzaniga
- Department of Human and Social Sciences, University of Bergamo, Bergamo, Italy
- SYSBIO.IT Centre of Systems Biology, Milano, Italy
| | - Andrea Tangherloni
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milano, Italy
| | - Daniela Besozzi
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milano, Italy
- SYSBIO.IT Centre of Systems Biology, Milano, Italy
- Corresponding author. Daniela Besozzi, Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milano, Italy and SYSBIO.IT Centre of Systems Biology, Milano, Italy. Tel.: +39 02 6448 7874. E-mail:
| |
Collapse
|
11
|
Accelerating Wright-Fisher Forward Simulations on the Graphics Processing Unit. G3-GENES GENOMES GENETICS 2017; 7:3229-3236. [PMID: 28768689 PMCID: PMC5592947 DOI: 10.1534/g3.117.300103] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Forward Wright–Fisher simulations are powerful in their ability to model complex demography and selection scenarios, but suffer from slow execution on the Central Processor Unit (CPU), thus limiting their usefulness. However, the single-locus Wright–Fisher forward algorithm is exceedingly parallelizable, with many steps that are so-called “embarrassingly parallel,” consisting of a vast number of individual computations that are all independent of each other and thus capable of being performed concurrently. The rise of modern Graphics Processing Units (GPUs) and programming languages designed to leverage the inherent parallel nature of these processors have allowed researchers to dramatically speed up many programs that have such high arithmetic intensity and intrinsic concurrency. The presented GPU Optimized Wright–Fisher simulation, or “GO Fish” for short, can be used to simulate arbitrary selection and demographic scenarios while running over 250-fold faster than its serial counterpart on the CPU. Even modest GPU hardware can achieve an impressive speedup of over two orders of magnitude. With simulations so accelerated, one can not only do quick parametric bootstrapping of previously estimated parameters, but also use simulated results to calculate the likelihoods and summary statistics of demographic and selection models against real polymorphism data, all without restricting the demographic and selection scenarios that can be modeled or requiring approximations to the single-locus forward algorithm for efficiency. Further, as many of the parallel programming techniques used in this simulation can be applied to other computationally intensive algorithms important in population genetics, GO Fish serves as an exciting template for future research into accelerating computation in evolution. GO Fish is part of the Parallel PopGen Package available at: http://dl42.github.io/ParallelPopGen/.
Collapse
|
12
|
Zhang J, Wang H, Feng WC. cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on CPU+GPU. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:830-843. [PMID: 26469393 DOI: 10.1109/tcbb.2015.2489662] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
BLAST, short for Basic Local Alignment Search Tool, is a ubiquitous tool used in the life sciences for pairwise sequence search. However, with the advent of next-generation sequencing (NGS), whether at the outset or downstream from NGS, the exponential growth of sequence databases is outstripping our ability to analyze the data. While recent studies have utilized the graphics processing unit (GPU) to speedup the BLAST algorithm for searching protein sequences (i.e., BLASTP), these studies use coarse-grained parallelism, where one sequence alignment is mapped to only one thread. Such an approach does not efficiently utilize the capabilities of a GPU, particularly due to the irregularity of BLASTP in both execution paths and memory-access patterns. To address the above shortcomings, we present a fine-grained approach to parallelize BLASTP, where each individual phase of sequence search is mapped to many threads on a GPU. This approach, which we refer to as cuBLASTP, reorders data-access patterns and reduces divergent branches of the most time-consuming phases (i.e., hit detection and ungapped extension). In addition, cuBLASTP optimizes the remaining phases (i.e., gapped extension and alignment with trace back) on a multicore CPU and overlaps their execution with the phases running on the GPU.
Collapse
|
13
|
Ye W, Chen Y, Zhang Y, Xu Y. H-BLAST: a fast protein sequence alignment toolkit on heterogeneous computers with GPUs. Bioinformatics 2017; 33:1130-1138. [PMID: 28087515 DOI: 10.1093/bioinformatics/btw769] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2016] [Accepted: 12/12/2016] [Indexed: 11/15/2022] Open
Abstract
Motivation The sequence alignment is a fundamental problem in bioinformatics. BLAST is a routinely used tool for this purpose with over 118 000 citations in the past two decades. As the size of bio-sequence databases grows exponentially, the computational speed of alignment softwares must be improved. Results We develop the heterogeneous BLAST (H-BLAST), a fast parallel search tool for a heterogeneous computer that couples CPUs and GPUs, to accelerate BLASTX and BLASTP-basic tools of NCBI-BLAST. H-BLAST employs a locally decoupled seed-extension algorithm for better performance on GPUs, and offers a performance tuning mechanism for better efficiency among various CPUs and GPUs combinations. H-BLAST produces identical alignment results as NCBI-BLAST and its computational speed is much faster than that of NCBI-BLAST. Speedups achieved by H-BLAST over sequential NCBI-BLASTP (resp. NCBI-BLASTX) range mostly from 4 to 10 (resp. 5 to 7.2). With 2 CPU threads and 2 GPUs, H-BLAST can be faster than 16-threaded NCBI-BLASTX. Furthermore, H-BLAST is 1.5-4 times faster than GPU-BLAST. Availability and Implementation https://github.com/Yeyke/H-BLAST.git. Contact yux06@syr.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Weicai Ye
- School of Data and Computer Science, and Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, People's Republic of China
| | - Ying Chen
- School of Data and Computer Science, and Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, People's Republic of China
| | - Yongdong Zhang
- School of Data and Computer Science, and Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, People's Republic of China
| | - Yuesheng Xu
- School of Data and Computer Science, and Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, People's Republic of China.,Professor Emeritus of Department of Mathematics, Syracuse University, Syracuse, NY 13244, USA
| |
Collapse
|
14
|
Manconi A, Moscatelli M, Armano G, Gnocchi M, Orro A, Milanesi L. Removing duplicate reads using graphics processing units. BMC Bioinformatics 2016; 17:346. [PMID: 28185553 PMCID: PMC5123249 DOI: 10.1186/s12859-016-1192-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Background During library construction polymerase chain reaction is used to enrich the DNA before sequencing. Typically, this process generates duplicate read sequences. Removal of these artifacts is mandatory, as they can affect the correct interpretation of data in several analyses. Ideally, duplicate reads should be characterized by identical nucleotide sequences. However, due to sequencing errors, duplicates may also be nearly-identical. Removing nearly-identical duplicates can result in a notable computational effort. To deal with this challenge, we recently proposed a GPU method aimed at removing identical and nearly-identical duplicates generated with an Illumina platform. The method implements an approach based on prefix-suffix comparison. Read sequences with identical prefix are considered potential duplicates. Then, their suffixes are compared to identify and remove those that are actually duplicated. Although the method can be efficiently used to remove duplicates, there are some limitations that need to be overcome. In particular, it cannot to detect potential duplicates in the event that prefixes are longer than 27 bases, and it does not provide support for paired-end read libraries. Moreover, large clusters of potential duplicates are split into smaller with the aim to guarantees a reasonable computing time. This heuristic may affect the accuracy of the analysis. Results In this work we propose GPU-DupRemoval, a new implementation of our method able to (i) cluster reads without constraints on the maximum length of the prefixes, (ii) support both single- and paired-end read libraries, and (iii) analyze large clusters of potential duplicates. Conclusions Due to the massive parallelization obtained by exploiting graphics cards, GPU-DupRemoval removes duplicate reads faster than other cutting-edge solutions, while outperforming most of them in terms of amount of duplicates reads.
Collapse
Affiliation(s)
- Andrea Manconi
- Institute for Biomedical Technologies, National Research Council, Via Fratelli Cervi, 93, Segrate (Mi), 20090, Italy.
| | - Marco Moscatelli
- Institute for Biomedical Technologies, National Research Council, Via Fratelli Cervi, 93, Segrate (Mi), 20090, Italy
| | - Giuliano Armano
- Department of Electrical and Electronic Engineering, University of Cagliari, P.zza D'Armi, Cagliari (CA), 09123, Italy
| | - Matteo Gnocchi
- Institute for Biomedical Technologies, National Research Council, Via Fratelli Cervi, 93, Segrate (Mi), 20090, Italy
| | - Alessandro Orro
- Institute for Biomedical Technologies, National Research Council, Via Fratelli Cervi, 93, Segrate (Mi), 20090, Italy
| | - Luciano Milanesi
- Institute for Biomedical Technologies, National Research Council, Via Fratelli Cervi, 93, Segrate (Mi), 20090, Italy
| |
Collapse
|
15
|
Zhang J, Misra S, Wang H, Feng WC. muBLASTP: database-indexed protein sequence search on multicore CPUs. BMC Bioinformatics 2016; 17:443. [PMID: 27809763 PMCID: PMC5096327 DOI: 10.1186/s12859-016-1302-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2016] [Accepted: 10/21/2016] [Indexed: 11/16/2022] Open
Abstract
Background The Basic Local Alignment Search Tool (BLAST) is a fundamental program in the life sciences that searches databases for sequences that are most similar to a query sequence. Currently, the BLAST algorithm utilizes a query-indexed approach. Although many approaches suggest that sequence search with a database index can achieve much higher throughput (e.g., BLAT, SSAHA, and CAFE), they cannot deliver the same level of sensitivity as the query-indexed BLAST, i.e., NCBI BLAST, or they can only support nucleotide sequence search, e.g., MegaBLAST. Due to different challenges and characteristics between query indexing and database indexing, the existing techniques for query-indexed search cannot be used into database indexed search. Results muBLASTP, a novel database-indexed BLAST for protein sequence search, delivers identical hits returned to NCBI BLAST. On Intel Haswell multicore CPUs, for a single query, the single-threaded muBLASTP achieves up to a 4.41-fold speedup for alignment stages, and up to a 1.75-fold end-to-end speedup over single-threaded NCBI BLAST. For a batch of queries, the multithreaded muBLASTP achieves up to a 5.7-fold speedups for alignment stages, and up to a 4.56-fold end-to-end speedup over multithreaded NCBI BLAST. Conclusions With a newly designed index structure for protein database and associated optimizations in BLASTP algorithm, we re-factored BLASTP algorithm for modern multicore processors that achieves much higher throughput with acceptable memory footprint for the database index. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1302-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jing Zhang
- Department of Computer Science, Virginia Tech, 225 Stanger Street, Blacksburg, 24060, VA, USA.
| | - Sanchit Misra
- Parallel Computing Lab, Intel Corporation, Bengaluru, Karnataka, 560102, India
| | - Hao Wang
- Department of Computer Science, Virginia Tech, 225 Stanger Street, Blacksburg, 24060, VA, USA
| | - Wu-Chun Feng
- Department of Computer Science, Virginia Tech, 225 Stanger Street, Blacksburg, 24060, VA, USA
| |
Collapse
|
16
|
Suzuki S, Kakuta M, Ishida T, Akiyama Y. GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering. PLoS One 2016; 11:e0157338. [PMID: 27482905 PMCID: PMC4970815 DOI: 10.1371/journal.pone.0157338] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Accepted: 05/27/2016] [Indexed: 11/30/2022] Open
Abstract
Sequence homology searches are used in various fields and require large amounts of computation time, especially for metagenomic analysis, owing to the large number of queries and the database size. To accelerate computing analyses, graphics processing units (GPUs) are widely used as a low-cost, high-performance computing platform. Therefore, we mapped the time-consuming steps involved in GHOSTZ, which is a state-of-the-art homology search algorithm for protein sequences, onto a GPU and implemented it as GHOSTZ-GPU. In addition, we optimized memory access for GPU calculations and for communication between the CPU and GPU. As per results of the evaluation test involving metagenomic data, GHOSTZ-GPU with 12 CPU threads and 1 GPU was approximately 3.0- to 4.1-fold faster than GHOSTZ with 12 CPU threads. Moreover, GHOSTZ-GPU with 12 CPU threads and 3 GPUs was approximately 5.8- to 7.7-fold faster than GHOSTZ with 12 CPU threads.
Collapse
Affiliation(s)
- Shuji Suzuki
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- Education Academy of Computational Life Sciences (ACLS), Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
| | - Masanori Kakuta
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
| | - Takashi Ishida
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- Education Academy of Computational Life Sciences (ACLS), Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
| | - Yutaka Akiyama
- Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- Education Academy of Computational Life Sciences (ACLS), Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan
- * E-mail:
| |
Collapse
|
17
|
Bonnici V, Busato F, Micale G, Bombieri N, Pulvirenti A, Giugno R. APPAGATO: an APproximate PArallel and stochastic GrAph querying TOol for biological networks. Bioinformatics 2016; 32:2159-66. [DOI: 10.1093/bioinformatics/btw223] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Accepted: 04/10/2016] [Indexed: 02/02/2023] Open
Affiliation(s)
- Vincenzo Bonnici
- Department of Computer Science, University of Verona, Strada Le Grazie, Verona
| | - Federico Busato
- Department of Computer Science, University of Verona, Strada Le Grazie, Verona
| | - Giovanni Micale
- Department of Math and Computer Science, University of Catania, Viale a. Doria, Catania
| | - Nicola Bombieri
- Department of Computer Science, University of Verona, Strada Le Grazie, Verona
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, University of Catania, via Palermo, Catania
| | - Rosalba Giugno
- Department of Computer Science, University of Verona, Strada Le Grazie, Verona
- Department of Clinical and Experimental Medicine, University of Catania, via Palermo, Catania
| |
Collapse
|
18
|
Langenkämper D, Jakobi T, Feld D, Jelonek L, Goesmann A, Nattkemper TW. Comparison of Acceleration Techniques for Selected Low-Level Bioinformatics Operations. Front Genet 2016; 7:5. [PMID: 26904094 PMCID: PMC4748744 DOI: 10.3389/fgene.2016.00005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 01/17/2016] [Indexed: 12/27/2022] Open
Abstract
Within the recent years clock rates of modern processors stagnated while the demand for computing power continued to grow. This applied particularly for the fields of life sciences and bioinformatics, where new technologies keep on creating rapidly growing piles of raw data with increasing speed. The number of cores per processor increased in an attempt to compensate for slight increments of clock rates. This technological shift demands changes in software development, especially in the field of high performance computing where parallelization techniques are gaining in importance due to the pressing issue of large sized datasets generated by e.g., modern genomics. This paper presents an overview of state-of-the-art manual and automatic acceleration techniques and lists some applications employing these in different areas of sequence informatics. Furthermore, we provide examples for automatic acceleration of two use cases to show typical problems and gains of transforming a serial application to a parallel one. The paper should aid the reader in deciding for a certain techniques for the problem at hand. We compare four different state-of-the-art automatic acceleration approaches (OpenMP, PluTo-SICA, PPCG, and OpenACC). Their performance as well as their applicability for selected use cases is discussed. While optimizations targeting the CPU worked better in the complex k-mer use case, optimizers for Graphics Processing Units (GPUs) performed better in the matrix multiplication example. But performance is only superior at a certain problem size due to data migration overhead. We show that automatic code parallelization is feasible with current compiler software and yields significant increases in execution speed. Automatic optimizers for CPU are mature and usually no additional manual adjustment is required. In contrast, some automatic parallelizers targeting GPUs still lack maturity and are limited to simple statements and structures.
Collapse
Affiliation(s)
- Daniel Langenkämper
- Biodata Mining Group, Faculty of Technology, Bielefeld University Bielefeld, Germany
| | - Tobias Jakobi
- Sektion für Bioinformatik und Systemkardiologie, Universitätsklinikum Heidelberg Heidelberg, Germany
| | | | - Lukas Jelonek
- Bioinformatik und Systembiologie, Justus Liebig University Gießen, Germany
| | - Alexander Goesmann
- Bioinformatik und Systembiologie, Justus Liebig University Gießen, Germany
| | - Tim W Nattkemper
- Biodata Mining Group, Faculty of Technology, Bielefeld University Bielefeld, Germany
| |
Collapse
|
19
|
Zhang K. Gctf: Real-time CTF determination and correction. J Struct Biol 2015; 193:1-12. [PMID: 26592709 PMCID: PMC4711343 DOI: 10.1016/j.jsb.2015.11.003] [Citation(s) in RCA: 2541] [Impact Index Per Article: 282.3] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Revised: 11/08/2015] [Accepted: 11/11/2015] [Indexed: 11/26/2022]
Abstract
Accurate estimation of the contrast transfer function (CTF) is critical for a near-atomic resolution cryo electron microscopy (cryoEM) reconstruction. Here, a GPU-accelerated computer program, Gctf, for accurate and robust, real-time CTF determination is presented. The main target of Gctf is to maximize the cross-correlation of a simulated CTF with the logarithmic amplitude spectra (LAS) of observed micrographs after background subtraction. Novel approaches in Gctf improve both speed and accuracy. In addition to GPU acceleration (e.g. 10–50×), a fast ‘1-dimensional search plus 2-dimensional refinement (1S2R)’ procedure further speeds up Gctf. Based on the global CTF determination, the local defocus for each particle and for single frames of movies is accurately refined, which improves CTF parameters of all particles for subsequent image processing. Novel diagnosis method using equiphase averaging (EPA) and self-consistency verification procedures have also been implemented in the program for practical use, especially for aims of near-atomic reconstruction. Gctf is an independent program and the outputs can be easily imported into other cryoEM software such as Relion (Scheres, 2012) and Frealign (Grigorieff, 2007). The results from several representative datasets are shown and discussed in this paper.
Collapse
Affiliation(s)
- Kai Zhang
- Medical Research Council Laboratory of Molecular Biology, Division of Structural Studies, Francis Crick Avenue, Cambridge CB2 0QH, UK.
| |
Collapse
|
20
|
Chen Y, Ye W, Zhang Y, Xu Y. High speed BLASTN: an accelerated MegaBLAST search tool. Nucleic Acids Res 2015; 43:7762-8. [PMID: 26250111 PMCID: PMC4652774 DOI: 10.1093/nar/gkv784] [Citation(s) in RCA: 272] [Impact Index Per Article: 30.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Accepted: 07/22/2015] [Indexed: 11/14/2022] Open
Abstract
Sequence alignment is a long standing problem in bioinformatics. The Basic Local Alignment Search Tool (BLAST) is one of the most popular and fundamental alignment tools. The explosive growth of biological sequences calls for speedup of sequence alignment tools such as BLAST. To this end, we develop high speed BLASTN (HS-BLASTN), a parallel and fast nucleotide database search tool that accelerates MegaBLAST—the default module of NCBI-BLASTN. HS-BLASTN builds a new lookup table using the FMD-index of the database and employs an accurate and effective seeding method to find short stretches of identities (called seeds) between the query and the database. HS-BLASTN produces the same alignment results as MegaBLAST and its computational speed is much faster than MegaBLAST. Specifically, our experiments conducted on a 12-core server show that HS-BLASTN can be 22 times faster than MegaBLAST and exhibits better parallel performance than MegaBLAST. HS-BLASTN is written in C++ and the related source code is available at https://github.com/chenying2016/queries under the GPLv3 license.
Collapse
Affiliation(s)
- Ying Chen
- Guangdong Province Key Laboratory of Computational Science, School of Mathematics and Computational Science, Sun Yat-sen University, Guangzhou 510275, P. R. China
| | - Weicai Ye
- Guangdong Province Key Laboratory of Computational Science, School of Mathematics and Computational Science, Sun Yat-sen University, Guangzhou 510275, P. R. China
| | - Yongdong Zhang
- Guangdong Province Key Laboratory of Computational Science, School of Mathematics and Computational Science, Sun Yat-sen University, Guangzhou 510275, P. R. China
| | - Yuesheng Xu
- Guangdong Province Key Laboratory of Computational Science, School of Mathematics and Computational Science, Sun Yat-sen University, Guangzhou 510275, P. R. China Department of Mathematics, Syracuse University, Syracuse, NY 13244, USA
| |
Collapse
|
21
|
Metabolic and metagenomic outcomes from early-life pulsed antibiotic treatment. Nat Commun 2015; 6:7486. [PMID: 26123276 PMCID: PMC4491183 DOI: 10.1038/ncomms8486] [Citation(s) in RCA: 257] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 05/13/2015] [Indexed: 12/12/2022] Open
Abstract
Mammalian species have co-evolved with intestinal microbial communities that can shape development and adapt to environmental changes, including antibiotic perturbation or nutrient flux. In humans, especially children, microbiota disruption is common, yet the dynamic microbiome recovery from early-life antibiotics is still uncharacterized. Here we use a mouse model mimicking paediatric antibiotic use and find that therapeutic-dose pulsed antibiotic treatment (PAT) with a beta-lactam or macrolide alters both host and microbiota development. Early-life PAT accelerates total mass and bone growth, and causes progressive changes in gut microbiome diversity, population structure and metagenomic content, with microbiome effects dependent on the number of courses and class of antibiotic. Whereas control microbiota rapidly adapts to a change in diet, PAT slows the ecological progression, with delays lasting several months with previous macrolide exposure. This study identifies key markers of disturbance and recovery, which may help provide therapeutic targets for microbiota restoration following antibiotic treatment. The potential recovery of the human gut microbiota after an antibiotic treatment, and its effects on our health, are poorly understood. Here, the authors use a mouse model mimicking paediatric antibiotic use to shed new light into these processes.
Collapse
|
22
|
Manconi A, Manca E, Moscatelli M, Gnocchi M, Orro A, Armano G, Milanesi L. G-CNV: A GPU-Based Tool for Preparing Data to Detect CNVs with Read-Depth Methods. Front Bioeng Biotechnol 2015; 3:28. [PMID: 25806367 PMCID: PMC4354384 DOI: 10.3389/fbioe.2015.00028] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2014] [Accepted: 02/19/2015] [Indexed: 11/23/2022] Open
Abstract
Copy number variations (CNVs) are the most prevalent types of structural variations (SVs) in the human genome and are involved in a wide range of common human diseases. Different computational methods have been devised to detect this type of SVs and to study how they are implicated in human diseases. Recently, computational methods based on high-throughput sequencing (HTS) are increasingly used. The majority of these methods focus on mapping short-read sequences generated from a donor against a reference genome to detect signatures distinctive of CNVs. In particular, read-depth based methods detect CNVs by analyzing genomic regions with significantly different read-depth from the other ones. The pipeline analysis of these methods consists of four main stages: (i) data preparation, (ii) data normalization, (iii) CNV regions identification, and (iv) copy number estimation. However, available tools do not support most of the operations required at the first two stages of this pipeline. Typically, they start the analysis by building the read-depth signal from pre-processed alignments. Therefore, third-party tools must be used to perform most of the preliminary operations required to build the read-depth signal. These data-intensive operations can be efficiently parallelized on graphics processing units (GPUs). In this article, we present G-CNV, a GPU-based tool devised to perform the common operations required at the first two stages of the analysis pipeline. G-CNV is able to filter low-quality read sequences, to mask low-quality nucleotides, to remove adapter sequences, to remove duplicated read sequences, to map the short-reads, to resolve multiple mapping ambiguities, to build the read-depth signal, and to normalize it. G-CNV can be efficiently used as a third-party tool able to prepare data for the subsequent read-depth signal generation and analysis. Moreover, it can also be integrated in CNV detection tools to generate read-depth signals.
Collapse
Affiliation(s)
- Andrea Manconi
- Institute for Biomedical Technologies, National Research Council , Milan , Italy
| | - Emanuele Manca
- Department of Electrical and Electronic Engineering, University of Cagliari , Cagliari , Italy
| | - Marco Moscatelli
- Institute for Biomedical Technologies, National Research Council , Milan , Italy
| | - Matteo Gnocchi
- Institute for Biomedical Technologies, National Research Council , Milan , Italy
| | - Alessandro Orro
- Institute for Biomedical Technologies, National Research Council , Milan , Italy
| | - Giuliano Armano
- Department of Electrical and Electronic Engineering, University of Cagliari , Cagliari , Italy
| | - Luciano Milanesi
- Institute for Biomedical Technologies, National Research Council , Milan , Italy
| |
Collapse
|
23
|
CLAST: CUDA implemented large-scale alignment search tool. BMC Bioinformatics 2014; 15:406. [PMID: 25495907 PMCID: PMC4271471 DOI: 10.1186/s12859-014-0406-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 12/02/2014] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Metagenomics is a powerful methodology to study microbial communities, but it is highly dependent on nucleotide sequence similarity searching against sequence databases. Metagenomic analyses with next-generation sequencing technologies produce enormous numbers of reads from microbial communities, and many reads are derived from microbes whose genomes have not yet been sequenced, limiting the usefulness of existing sequence similarity search tools. Therefore, there is a clear need for a sequence similarity search tool that can rapidly detect weak similarity in large datasets. RESULTS We developed a tool, which we named CLAST (CUDA implemented large-scale alignment search tool), that enables analyses of millions of reads and thousands of reference genome sequences, and runs on NVIDIA Fermi architecture graphics processing units. CLAST has four main advantages over existing alignment tools. First, CLAST was capable of identifying sequence similarities ~80.8 times faster than BLAST and 9.6 times faster than BLAT. Second, CLAST executes global alignment as the default (local alignment is also an option), enabling CLAST to assign reads to taxonomic and functional groups based on evolutionarily distant nucleotide sequences with high accuracy. Third, CLAST does not need a preprocessed sequence database like Burrows-Wheeler Transform-based tools, and this enables CLAST to incorporate large, frequently updated sequence databases. Fourth, CLAST requires <2 GB of main memory, making it possible to run CLAST on a standard desktop computer or server node. CONCLUSIONS CLAST achieved very high speed (similar to the Burrows-Wheeler Transform-based Bowtie 2 for long reads) and sensitivity (equal to BLAST, BLAT, and FR-HIT) without the need for extensive database preprocessing or a specialized computing platform. Our results demonstrate that CLAST has the potential to be one of the most powerful and realistic approaches to analyze the massive amount of sequence data from next-generation sequencing technologies.
Collapse
|
24
|
Fürstberger A, Maucher M, Kestler HA. Extended pairwise local alignment of wild card DNA/RNA sequences using dynamic programming. J STAT COMPUT SIM 2014. [DOI: 10.1080/00949655.2014.928294] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
25
|
Li Y, Chi H, Xia L, Chu X. Accelerating the scoring module of mass spectrometry-based peptide identification using GPUs. BMC Bioinformatics 2014; 15:121. [PMID: 24773593 PMCID: PMC4049470 DOI: 10.1186/1471-2105-15-121] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2012] [Accepted: 04/23/2014] [Indexed: 11/10/2022] Open
Abstract
Background Tandem mass spectrometry-based database searching is currently the main method for protein identification in shotgun proteomics. The explosive growth of protein and peptide databases, which is a result of genome translations, enzymatic digestions, and post-translational modifications (PTMs), is making computational efficiency in database searching a serious challenge. Profile analysis shows that most search engines spend 50%-90% of their total time on the scoring module, and that the spectrum dot product (SDP) based scoring module is the most widely used. As a general purpose and high performance parallel hardware, graphics processing units (GPUs) are promising platforms for speeding up database searches in the protein identification process. Results We designed and implemented a parallel SDP-based scoring module on GPUs that exploits the efficient use of GPU registers, constant memory and shared memory. Compared with the CPU-based version, we achieved a 30 to 60 times speedup using a single GPU. We also implemented our algorithm on a GPU cluster and achieved an approximately favorable speedup. Conclusions Our GPU-based SDP algorithm can significantly improve the speed of the scoring module in mass spectrometry-based protein identification. The algorithm can be easily implemented in many database search engines such as X!Tandem, SEQUEST, and pFind. A software tool implementing this algorithm is available at http://www.comp.hkbu.edu.hk/~youli/ProteinByGPU.html
Collapse
Affiliation(s)
| | | | | | - Xiaowen Chu
- Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Hong Kong.
| |
Collapse
|