Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Reid I, O’Toole N, Zabaneh O, Nourzadeh R, Dahdouli M, Abdellateef M, Gordon PMK, Soh J, Butler G, Sensen CW, Tsang A. SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models. BMC Bioinformatics 2014;15:229. [PMID: 24980894 PMCID: PMC4084796 DOI: 10.1186/1471-2105-15-229] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Accepted: 06/17/2014] [Indexed: 12/02/2022] Open

For:	Reid I, O’Toole N, Zabaneh O, Nourzadeh R, Dahdouli M, Abdellateef M, Gordon PMK, Soh J, Butler G, Sensen CW, Tsang A. SnowyOwl: accurate prediction of fungal genes by using RNA-Seq and homology information to select among ab initio models. BMC Bioinformatics 2014;15:229. [PMID: 24980894 PMCID: PMC4084796 DOI: 10.1186/1471-2105-15-229] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Accepted: 06/17/2014] [Indexed: 12/02/2022] Open

Number

Cited by Other Article(s)

Abdullah-Zawawi MR, Govender N, Harun S, Muhammad NAN, Zainal Z, Mohamed-Hussein ZA. Multi-Omics Approaches and Resources for Systems-Level Gene Function Prediction in the Plant Kingdom. PLANTS (BASEL, SWITZERLAND) 2022;11:2614. [PMID: 36235479 PMCID: PMC9573505 DOI: 10.3390/plants11192614] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 09/05/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]

Wiltschi B, Cernava T, Dennig A, Galindo Casas M, Geier M, Gruber S, Haberbauer M, Heidinger P, Herrero Acero E, Kratzer R, Luley-Goedl C, Müller CA, Pitzer J, Ribitsch D, Sauer M, Schmölzer K, Schnitzhofer W, Sensen CW, Soh J, Steiner K, Winkler CK, Winkler M, Wriessnegger T. Enzymes revolutionize the bioproduction of value-added compounds: From enzyme discovery to special applications. Biotechnol Adv 2020;40:107520. [DOI: 10.1016/j.biotechadv.2020.107520] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Revised: 10/18/2019] [Accepted: 01/13/2020] [Indexed: 12/11/2022]

Scalzitti N, Jeannin-Girardon A, Collet P, Poch O, Thompson JD. A benchmark study of ab initio gene prediction methods in diverse eukaryotic organisms. BMC Genomics 2020;21:293. [PMID: 32272892 PMCID: PMC7147072 DOI: 10.1186/s12864-020-6707-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Accepted: 03/30/2020] [Indexed: 02/02/2023] Open

Cook DE, Valle-Inclan JE, Pajoro A, Rovenich H, Thomma BP, Faino L. Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing. PLANT PHYSIOLOGY 2019;179:38-54. [PMID: 30401722 PMCID: PMC6324239 DOI: 10.1104/pp.18.00848] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 10/19/2018] [Indexed: 05/16/2023]

Park SG, Ryu D, Lee H, Ryu H, Ahn YJ, Yoo SI, Ko J, Hong CP. TaF: a web platform for taxonomic profile-based fungal gene prediction. Genes Genomics 2018;41:337-342. [PMID: 30456524 DOI: 10.1007/s13258-018-0766-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Accepted: 11/13/2018] [Indexed: 10/27/2022]

Laothanachareon T, Tamayo-Ramos JA, Nijsse B, Schaap PJ. Forward Genetics by Genome Sequencing Uncovers the Central Role of the Aspergillus niger goxB Locus in Hydrogen Peroxide Induced Glucose Oxidase Expression. Front Microbiol 2018;9:2269. [PMID: 30319579 PMCID: PMC6165874 DOI: 10.3389/fmicb.2018.02269] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Accepted: 09/05/2018] [Indexed: 01/09/2023] Open

Reid I. Evaluating Programs for Predicting Genes and Transcripts with RNA-Seq Support in Fungal Genomes. Methods Mol Biol 2018;1775:209-227. [PMID: 29876820 DOI: 10.1007/978-1-4939-7804-5_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

McDonnell E, Strasser K, Tsang A. Manual Gene Curation and Functional Annotation. Methods Mol Biol 2018;1775:185-208. [PMID: 29876819 DOI: 10.1007/978-1-4939-7804-5_16] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Swart V, Crampton BG, Ridenour JB, Bluhm BH, Olivier NA, Meyer JJM, Berger DK. Complementation of CTB7 in the Maize Pathogen Cercospora zeina Overcomes the Lack of In Vitro Cercosporin Production. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2017;30:710-724. [PMID: 28535078 DOI: 10.1094/mpmi-03-17-0054-r] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

Chan KL, Rosli R, Tatarinova TV, Hogan M, Firdaus-Raih M, Low ETL. Seqping: gene prediction pipeline for plant genomes using self-training gene models and transcriptomic data. BMC Bioinformatics 2017;18:1426. [PMID: 28466793 PMCID: PMC5333190 DOI: 10.1186/s12859-016-1426-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Gene prediction is one of the most important steps in the genome annotation process. A large number of software tools and pipelines developed by various computing techniques are available for gene prediction. However, these systems have yet to accurately predict all or even most of the protein-coding regions. Furthermore, none of the currently available gene-finders has a universal Hidden Markov Model (HMM) that can perform gene prediction for all organisms equally well in an automatic fashion.

RESULTS

We present an automated gene prediction pipeline, Seqping that uses self-training HMM models and transcriptomic data. The pipeline processes the genome and transcriptome sequences of the target species using GlimmerHMM, SNAP, and AUGUSTUS pipelines, followed by MAKER2 program to combine predictions from the three tools in association with the transcriptomic evidence. Seqping generates species-specific HMMs that are able to offer unbiased gene predictions. The pipeline was evaluated using the Oryza sativa and Arabidopsis thaliana genomes. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed that the pipeline was able to identify at least 95% of BUSCO's plantae dataset. Our evaluation shows that Seqping was able to generate better gene predictions compared to three HMM-based programs (MAKER2, GlimmerHMM and AUGUSTUS) using their respective available HMMs. Seqping had the highest accuracy in rice (0.5648 for CDS, 0.4468 for exon, and 0.6695 nucleotide structure) and A. thaliana (0.5808 for CDS, 0.5955 for exon, and 0.8839 nucleotide structure).

CONCLUSIONS

Seqping provides researchers a seamless pipeline to train species-specific HMMs and predict genes in newly sequenced or less-studied genomes. We conclude that the Seqping pipeline predictions are more accurate than gene predictions using the other three approaches with the default or available HMMs.

Collapse

Magnan C, Yu J, Chang I, Jahn E, Kanomata Y, Wu J, Zeller M, Oakes M, Baldi P, Sandmeyer S. Sequence Assembly of Yarrowia lipolytica Strain W29/CLIB89 Shows Transposable Element Diversity. PLoS One 2016;11:e0162363. [PMID: 27603307 PMCID: PMC5014426 DOI: 10.1371/journal.pone.0162363] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2016] [Accepted: 08/22/2016] [Indexed: 12/27/2022] Open

Affiliation(s)

Christophe Magnan Department of Computer Science, School of Computer Sciences, University of California Irvine, Irvine, California, United States of America Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America
James Yu Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
Ivan Chang Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
Ethan Jahn Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
Yuzo Kanomata Department of Computer Science, School of Computer Sciences, University of California Irvine, Irvine, California, United States of America Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America
Jenny Wu Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
Michael Zeller Department of Computer Science, School of Computer Sciences, University of California Irvine, Irvine, California, United States of America
Melanie Oakes Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
Pierre Baldi Department of Computer Science, School of Computer Sciences, University of California Irvine, Irvine, California, United States of America Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America
Suzanne Sandmeyer Institute for Genomics and Bioinformatics, University of California Irvine, Irvine, California, United States of America Department of Biological Chemistry, School of Medicine, University of California Irvine, Irvine, California, United States of America * E-mail:

Collapse

Testa AC, Oliver RP, Hane JK. OcculterCut: A Comprehensive Survey of AT-Rich Regions in Fungal Genomes. Genome Biol Evol 2016;8:2044-64. [PMID: 27289099 PMCID: PMC4943192 DOI: 10.1093/gbe/evw121] [Citation(s) in RCA: 83] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/14/2016] [Indexed: 12/03/2022] Open

Hoff KJ, Lange S, Lomsadze A, Borodovsky M, Stanke M. BRAKER1: Unsupervised RNA-Seq-Based Genome Annotation with GeneMark-ET and AUGUSTUS. Bioinformatics 2016;32:767-9. [PMID: 26559507 PMCID: PMC6078167 DOI: 10.1093/bioinformatics/btv661] [Citation(s) in RCA: 636] [Impact Index Per Article: 79.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2015] [Revised: 10/02/2015] [Accepted: 10/26/2015] [Indexed: 11/12/2022] Open

Wibberg D, Rupp O, Blom J, Jelonek L, Kröber M, Verwaaijen B, Goesmann A, Albaum S, Grosch R, Pühler A, Schlüter A. Development of a Rhizoctonia solani AG1-IB Specific Gene Model Enables Comparative Genome Analyses between Phytopathogenic R. solani AG1-IA, AG1-IB, AG3 and AG8 Isolates. PLoS One 2015;10:e0144769. [PMID: 26690577 PMCID: PMC4686921 DOI: 10.1371/journal.pone.0144769] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Accepted: 11/23/2015] [Indexed: 12/22/2022] Open

Abstract

Rhizoctonia solani, a soil-born plant pathogenic basidiomycetous fungus, affects various economically important agricultural and horticultural crops. The draft genome sequence for the R. solani AG1-IB isolate 7/3/14 as well as a corresponding transcriptome dataset (Expressed Sequence Tags—ESTs) were established previously. Development of a specific R. solani AG1-IB gene model based on GMAP transcript mapping within the eukaryotic gene prediction platform AUGUSTUS allowed detection of new genes and provided insights into the gene structure of this fungus. In total, 12,616 genes were recognized in the genome of the AG1-IB isolate. Analysis of predicted genes by means of different bioinformatics tools revealed new genes whose products potentially are involved in degradation of plant cell wall components, melanin formation and synthesis of secondary metabolites. Comparative genome analyses between members of different R. solani anastomosis groups, namely AG1-IA, AG3 and AG8 and the newly annotated R. solani AG1-IB genome were performed within the comparative genomics platform EDGAR. It appeared that only 21 to 28% of all genes encoded in the draft genomes of the different strains were identified as core genes. Based on Average Nucleotide Identity (ANI) and Average Amino-acid Identity (AAI) analyses, considerable sequence differences between isolates representing different anastomosis groups were identified. However, R. solani isolates form a distinct cluster in relation to other fungi of the phylum Basidiomycota. The isolate representing AG1-IB encodes significant more genes featuring predictable functions in secondary metabolite production compared to other completely sequenced R. solani strains. The newly established R. solani AG1-IB 7/3/14 gene layout now provides a reliable basis for post-genomics studies.

Collapse

Cairns TC, Studholme DJ, Talbot NJ, Haynes K. New and Improved Techniques for the Study of Pathogenic Fungi. Trends Microbiol 2015;24:35-50. [PMID: 26549580 DOI: 10.1016/j.tim.2015.09.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2015] [Revised: 09/29/2015] [Accepted: 09/30/2015] [Indexed: 02/05/2023]

Testa AC, Hane JK, Ellwood SR, Oliver RP. CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts. BMC Genomics 2015;16:170. [PMID: 25887563 PMCID: PMC4363200 DOI: 10.1186/s12864-015-1344-4] [Citation(s) in RCA: 116] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 02/13/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The impact of gene annotation quality on functional and comparative genomics makes gene prediction an important process, particularly in non-model species, including many fungi. Sets of homologous protein sequences are rarely complete with respect to the fungal species of interest and are often small or unreliable, especially when closely related species have not been sequenced or annotated in detail. In these cases, protein homology-based evidence fails to correctly annotate many genes, or significantly improve ab initio predictions. Generalised hidden Markov models (GHMM) have proven to be invaluable tools in gene annotation and, recently, RNA-seq has emerged as a cost-effective means to significantly improve the quality of automated gene annotation. As these methods do not require sets of homologous proteins, improving gene prediction from these resources is of benefit to fungal researchers. While many pipelines now incorporate RNA-seq data in training GHMMs, there has been relatively little investigation into additionally combining RNA-seq data at the point of prediction, and room for improvement in this area motivates this study.

RESULTS

CodingQuarry is a highly accurate, self-training GHMM fungal gene predictor designed to work with assembled, aligned RNA-seq transcripts. RNA-seq data informs annotations both during gene-model training and in prediction. Our approach capitalises on the high quality of fungal transcript assemblies by incorporating predictions made directly from transcript sequences. Correct predictions are made despite transcript assembly problems, including those caused by overlap between the transcripts of adjacent gene loci. Stringent benchmarking against high-confidence annotation subsets showed CodingQuarry predicted 91.3% of Schizosaccharomyces pombe genes and 90.4% of Saccharomyces cerevisiae genes perfectly. These results are 4-5% better than those of AUGUSTUS, the next best performing RNA-seq driven gene predictor tested. Comparisons against whole genome Sc. pombe and S. cerevisiae annotations further substantiate a 4-5% improvement in the number of correctly predicted genes.

CONCLUSIONS

We demonstrate the success of a novel method of incorporating RNA-seq data into GHMM fungal gene prediction. This shows that a high quality annotation can be achieved without relying on protein homology or a training set of genes. CodingQuarry is freely available ( https://sourceforge.net/projects/codingquarry/ ), and suitable for incorporation into genome annotation pipelines.

Collapse

Hoff KJ, Stanke M. Current methods for automated annotation of protein-coding genes. CURRENT OPINION IN INSECT SCIENCE 2015;7:8-14. [PMID: 32846689 DOI: 10.1016/j.cois.2015.02.008] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2014] [Revised: 12/08/2014] [Accepted: 02/18/2015] [Indexed: 06/11/2023]

Sperschneider J, Williams AH, Hane JK, Singh KB, Taylor JM. Evaluation of Secretion Prediction Highlights Differing Approaches Needed for Oomycete and Fungal Effectors. FRONTIERS IN PLANT SCIENCE 2015;6:1168. [PMID: 26779196 PMCID: PMC4688413 DOI: 10.3389/fpls.2015.01168] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 12/07/2015] [Indexed: 05/03/2023]

Abstract

The steadily increasing number of sequenced fungal and oomycete genomes has enabled detailed studies of how these eukaryotic microbes infect plants and cause devastating losses in food crops. During infection, fungal and oomycete pathogens secrete effector molecules which manipulate host plant cell processes to the pathogen's advantage. Proteinaceous effectors are synthesized intracellularly and must be externalized to interact with host cells. Computational prediction of secreted proteins from genomic sequences is an important technique to narrow down the candidate effector repertoire for subsequent experimental validation. In this study, we benchmark secretion prediction tools on experimentally validated fungal and oomycete effectors. We observe that for a set of fungal SwissProt protein sequences, SignalP 4 and the neural network predictors of SignalP 3 (D-score) and SignalP 2 perform best. For effector prediction in particular, the use of a sensitive method can be desirable to obtain the most complete candidate effector set. We show that the neural network predictors of SignalP 2 and 3, as well as TargetP were the most sensitive tools for fungal effector secretion prediction, whereas the hidden Markov model predictors of SignalP 2 and 3 were the most sensitive tools for oomycete effectors. Thus, previous versions of SignalP retain value for oomycete effector prediction, as the current version, SignalP 4, was unable to reliably predict the signal peptide of the oomycete Crinkler effectors in the test set. Our assessment of subcellular localization predictors shows that cytoplasmic effectors are often predicted as not extracellular. This limits the reliability of secretion predictions that depend on these tools. We present our assessment with a view to informing future pathogenomics studies and suggest revised pipelines for secretion prediction to obtain optimal effector predictions in fungi and oomycetes.

Collapse

Tsang A. Fungal genomics. Brief Funct Genomics 2014;13:421-3. [PMID: 25411199 DOI: 10.1093/bfgp/elu041] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open