1
|
Adetiba E, Olugbara OO, Taiwo TB, Adebiyi MO, Badejo JA, Akanle MB, Matthews VO. Alignment-Free Z-Curve Genomic Cepstral Coefficients and Machine Learning for Classification of Viruses. BIOINFORMATICS AND BIOMEDICAL ENGINEERING 2018. [PMCID: PMC7120486 DOI: 10.1007/978-3-319-78723-7_25] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Accurate detection of pathogenic viruses has become highly imperative. This is because viral diseases constitute a huge threat to human health and wellbeing on a global scale. However, both traditional and recent techniques for viral detection suffer from various setbacks. In codicil, some of the existing alignment-free methods are also limited with respect to viral detection accuracy. In this paper, we present the development of an alignment-free, digital signal processing based method for pathogenic viral detection named Z-Curve Genomic Cesptral Coefficients (ZCGCC). To evaluate the method, ZCGCC were computed from twenty six pathogenic viral strains extracted from the ViPR corpus. Naïve Bayesian classifier, which is a popular machine learning method was experimentally trained and validated using the extracted ZCGCC and other alignment-free methods in the literature. Comparative results show that the proposed ZCGCC gives good accuracy (93.0385%) and improved performance to existing alignment-free methods.
Collapse
|
2
|
Su Z, Zhu J, Xu Z, Xiao R, Zhou R, Li L, Chen H. A Transcriptome Map of Actinobacillus pleuropneumoniae at Single-Nucleotide Resolution Using Deep RNA-Seq. PLoS One 2016; 11:e0152363. [PMID: 27018591 PMCID: PMC4809551 DOI: 10.1371/journal.pone.0152363] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 03/13/2016] [Indexed: 12/21/2022] Open
Abstract
Actinobacillus pleuropneumoniae is the pathogen of porcine contagious pleuropneumoniae, a highly contagious respiratory disease of swine. Although the genome of A. pleuropneumoniae was sequenced several years ago, limited information is available on the genome-wide transcriptional analysis to accurately annotate the gene structures and regulatory elements. High-throughput RNA sequencing (RNA-seq) has been applied to study the transcriptional landscape of bacteria, which can efficiently and accurately identify gene expression regions and unknown transcriptional units, especially small non-coding RNAs (sRNAs), UTRs and regulatory regions. The aim of this study is to comprehensively analyze the transcriptome of A. pleuropneumoniae by RNA-seq in order to improve the existing genome annotation and promote our understanding of A. pleuropneumoniae gene structures and RNA-based regulation. In this study, we utilized RNA-seq to construct a single nucleotide resolution transcriptome map of A. pleuropneumoniae. More than 3.8 million high-quality reads (average length ~90 bp) from a cDNA library were generated and aligned to the reference genome. We identified 32 open reading frames encoding novel proteins that were mis-annotated in the previous genome annotations. The start sites for 35 genes based on the current genome annotation were corrected. Furthermore, 51 sRNAs in the A. pleuropneumoniae genome were discovered, of which 40 sRNAs were never reported in previous studies. The transcriptome map also enabled visualization of 5'- and 3'-UTR regions, in which contained 11 sRNAs. In addition, 351 operons covering 1230 genes throughout the whole genome were identified. The RNA-Seq based transcriptome map validated annotated genes and corrected annotations of open reading frames in the genome, and led to the identification of many functional elements (e.g. regions encoding novel proteins, non-coding sRNAs and operon structures). The transcriptional units described in this study provide a foundation for future studies concerning the gene functions and the transcriptional regulatory architectures of this pathogen.
Collapse
Affiliation(s)
- Zhipeng Su
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Jiawen Zhu
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Zhuofei Xu
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Ran Xiao
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Rui Zhou
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Cooperative Innovation Center of Sustainable Pig Production, Wuhan 430070, China
| | - Lu Li
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Cooperative Innovation Center of Sustainable Pig Production, Wuhan 430070, China
- * E-mail: (HC); (LL)
| | - Huanchun Chen
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Cooperative Innovation Center of Sustainable Pig Production, Wuhan 430070, China
- * E-mail: (HC); (LL)
| |
Collapse
|
3
|
Hua ZG, Lin Y, Yuan YZ, Yang DC, Wei W, Guo FB. ZCURVE 3.0: identify prokaryotic genes with higher accuracy as well as automatically and accurately select essential genes. Nucleic Acids Res 2015; 43:W85-90. [PMID: 25977299 PMCID: PMC4489317 DOI: 10.1093/nar/gkv491] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Accepted: 05/02/2015] [Indexed: 01/09/2023] Open
Abstract
In 2003, we developed an ab initio program, ZCURVE 1.0, to find genes in bacterial and archaeal genomes. In this work, we present the updated version (i.e. ZCURVE 3.0). Using 422 prokaryotic genomes, the average accuracy was 93.7% with the updated version, compared with 88.7% with the original version. Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it. In fact, the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions. As the exclusive function, ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (generally >90%). We hope ZCURVE 3.0 will receive wide use with the web-based running mode. The updated ZCURVE can be freely accessed from http://cefg.uestc.edu.cn/zcurve/ or http://tubic.tju.edu.cn/zcurveb/ without any restrictions.
Collapse
Affiliation(s)
- Zhi-Gang Hua
- Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China Center for Information in BioMedicine, University of Electronic Science and Technology of China, Chengdu 610054, China Health Big Data Science Research Center, Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 610054, China Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Yan Lin
- Department of Physics, Tianjin University, Tianjin 300072, China Key Laboratory of Systems Bioengineering, Ministry of Education, Tianjin 300072, China Collaborative Innovation Center of Chemical Science and Engineering, Tianjin 300072, China
| | - Ya-Zhou Yuan
- Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China Center for Information in BioMedicine, University of Electronic Science and Technology of China, Chengdu 610054, China Health Big Data Science Research Center, Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 610054, China Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - De-Chang Yang
- Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China Center for Information in BioMedicine, University of Electronic Science and Technology of China, Chengdu 610054, China Health Big Data Science Research Center, Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 610054, China Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Wen Wei
- Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China Center for Information in BioMedicine, University of Electronic Science and Technology of China, Chengdu 610054, China Health Big Data Science Research Center, Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 610054, China Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Feng-Biao Guo
- Center of Bioinformatics, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China Center for Information in BioMedicine, University of Electronic Science and Technology of China, Chengdu 610054, China Health Big Data Science Research Center, Big Data Research Center, University of Electronic Science and Technology of China, Chengdu 610054, China Key Laboratory for NeuroInformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu 610054, China
| |
Collapse
|